Google DeepMind releases Gemma 4, its latest open-weights model
Google DeepMind released Gemma 4 today, a new family of open-weight models built on technology from Gemini 3. The weights are free to download, fine-tune, and deploy commercially under an Apache 2.0 license.
The family comes in four sizes, split across two use cases.
The larger models (26B and 31B) are built for workstations and consumer GPUs. The 26B uses a Mixture-of-Experts architecture: only about 4 billion parameters are active per token, which keeps memory and compute requirements well below those of a dense model of similar capability. The 31B is a fully dense model aimed at tasks that need deeper reasoning.
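To see why an MoE model touches far fewer weights per token, here is a minimal sketch of top-k expert routing. The expert count, dimensions, and top-2 routing are illustrative placeholders, not Gemma 4's actual configuration:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy MoE layer: 32 experts, router selects the top 2 per token.
# Only the chosen experts' weights are read, so the "active"
# parameter count is a small fraction of the total.
n_experts, top_k, d_model, d_ff = 32, 2, 64, 256

router_w = rng.standard_normal((d_model, n_experts))
experts = [rng.standard_normal((d_model, d_ff)) for _ in range(n_experts)]

def moe_forward(x):
    """Route token vector x to its top-k experts and mix their outputs."""
    logits = x @ router_w
    top = np.argsort(logits)[-top_k:]                         # winning experts
    gates = np.exp(logits[top]) / np.exp(logits[top]).sum()   # softmax over winners
    return sum(g * (x @ experts[i]) for g, i in zip(gates, top))

x = rng.standard_normal(d_model)
y = moe_forward(x)

total_params = n_experts * d_model * d_ff
active_params = top_k * d_model * d_ff
print(active_params / total_params)  # fraction of expert weights used per token
```

With these toy numbers, each token activates 2 of 32 experts, i.e. about 6% of the expert weights; the real 26B-total/4B-active ratio works the same way.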
The edge models (E2B and E4B) target mobile and IoT devices. Both support multimodal inputs (text, image, and audio) and are designed to run fully offline on hardware like phones, the Raspberry Pi, and the Jetson Nano.
At a glance:
- Context window: up to 256K tokens, double what Gemma 3 supported
- Languages: 140+
- Key capabilities: Function calling, structured JSON output, agentic workflows, multimodal reasoning
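Function calling with structured JSON output usually follows the same pattern regardless of model: the model emits a JSON tool call, and your code parses it and dispatches to a real function. The payload schema and the `get_weather` tool below are hypothetical, shown for the generic pattern rather than Gemma 4's exact format:

```python
import json

# Hypothetical tool-call payload in the generic style most
# function-calling models emit; Gemma 4's exact schema may differ.
model_output = '{"name": "get_weather", "arguments": {"city": "Berlin", "unit": "celsius"}}'

def get_weather(city, unit):
    # Stub standing in for a real weather API call.
    return f"22 degrees {unit} in {city}"

# Registry mapping tool names the model may call to Python callables.
TOOLS = {"get_weather": get_weather}

def dispatch(raw):
    """Parse a JSON tool call and invoke the matching registered function."""
    call = json.loads(raw)
    fn = TOOLS[call["name"]]
    return fn(**call["arguments"])

result = dispatch(model_output)
print(result)  # 22 degrees celsius in Berlin
```

The result string would then be fed back to the model as the tool response, which is the core loop of the agentic workflows mentioned above.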
Where to get it:
- Try it: Google AI Studio
- Download weights: Hugging Face, Ollama
The instruction-tuned variants of the 26B and 31B models are live now.
If you need on-demand GPUs for training, fine-tuning, inference, or running open-source models, give RunPod a try.
- Available hardware: H100, H200, A100, L40S, RTX 4090, RTX 5090, and 30+ more
- Cost: significantly cheaper than AWS or GCP, billed per second, no contracts
- Setup: spins up in under a minute, 30+ regions worldwide
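Per-second billing matters most for short jobs. A quick sketch of the difference versus full-hour rounding; the $2/hr rate is made up for illustration, not RunPod's actual pricing:

```python
import math

RATE_PER_HOUR = 2.00  # hypothetical $/hr for one GPU; check real pricing

def cost_per_second(seconds):
    """Bill exactly the seconds used, as per-second providers do."""
    return seconds * RATE_PER_HOUR / 3600

def cost_hourly_rounded(seconds):
    """Bill in full-hour increments, rounding any partial hour up."""
    return math.ceil(seconds / 3600) * RATE_PER_HOUR

job = 90 * 60  # a 90-minute fine-tuning run
print(cost_per_second(job))      # 3.0
print(cost_hourly_rounded(job))  # 4.0
```

At these illustrative rates, the 90-minute run costs $3.00 billed per second versus $4.00 billed in hourly increments; the gap grows for bursty workloads with many short runs.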

Get the core business tech news delivered straight to your inbox. We track AI, automation, SaaS, and cybersecurity so you don't have to. Read what matters, skip the rest.