Multiverse Computing launches three ultra-compact LittleLamb open-source AI models

12 articles · Updated · Markets Insider · Apr 28
  • The LittleLamb 0.3B, Tool-Calling, and Mobile models, each with a ~0.3B-parameter footprint, are now freely available on Hugging Face and support both English and Spanish (see the loading sketch after this list).
  • Built from Qwen3-0.6B and compressed with CompactifAI, the models outperform similarly sized competitors on HLE testing and Mobile Action tasks while enabling efficient deployment on edge and mobile devices, including fully offline.
  • Multiverse’s proprietary compression reduces model size by up to 95% with minimal precision loss, expanding practical AI access for developers facing privacy, latency, or compute constraints.
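Because the models ship as ordinary Hugging Face checkpoints, they can be loaded with the standard transformers API. The sketch below is illustrative only: the repo id is a placeholder, since the coverage does not give the exact Hugging Face paths, and at ~0.3B parameters in bf16 the weights come to roughly 600 MB, small enough for CPU-only inference.

```python
# Minimal sketch: running a ~0.3B-parameter checkpoint locally with
# Hugging Face transformers. The repo id below is hypothetical; the
# article does not specify the exact LittleLamb paths on Hugging Face.
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "multiverse-computing/LittleLamb-0.3B"  # placeholder repo id

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID)  # ~600 MB in bf16

prompt = "Summarize the benefits of on-device AI in one sentence."
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Once the weights are cached locally, nothing in this loop requires a network connection, which is the property the offline and privacy claims rest on.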
  • Can Multiverse’s CompactifAI technology be applied to other popular LLM architectures beyond Qwen3-0.6B?
  • In what ways could LittleLamb’s dual-mode inference reshape mobile and edge AI applications compared to existing solutions?
  • What trade-offs exist between LittleLamb’s extreme compression and its ability to update outdated knowledge or support new tasks?
  • What are the practical hardware requirements and limitations for deploying LittleLamb fully offline on standard consumer devices?
  • With the rapid growth of model storage needs, could approaches like CompactifAI fundamentally change the economics of AI deployment?
  • How does LittleLamb’s quantum-inspired compression actually protect against reverse engineering and data leakage in real-world scenarios?