Meta’s TRIBE v2 can predict how your brain responds to videos and sounds

Meta built a model that predicts how your brain responds to what you see and hear. They also own the world’s largest advertising platform.


Meta AI released TRIBE v2, an updated brain encoder model that predicts how human brains respond to video, audio, and text. When you give it a piece of content, it tells you which brain regions activate and how strongly.

The original TRIBE won a major neuroscience competition last year, placing first out of 262 teams. v2 is significantly more capable.

  • Training data: 500 hours of fMRI recordings from over 700 participants.
  • Zero-shot generalization: Predicts brain responses for people it has never seen, across new languages and new tasks.
  • Performance: 2 to 3 times better than prior methods on unseen subjects.
  • Accuracy: Captures up to 54% of the explainable variance in brain activity in sensory areas.

The model processes three modalities at once. Video goes through Meta’s V-JEPA 2, audio through Wav2Vec2-BERT, and text through Llama 3.2. Together they produce predictions across roughly 1,000 brain regions.
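For readers who want a concrete picture of the architecture, the sketch below shows one way a tri-modal brain encoder can be wired together in PyTorch. It is a minimal illustration, not Meta's actual implementation: the class name, feature dimensions, and the sum-then-transformer fusion are assumptions, and the real model draws its per-modality features from the pretrained encoders named above.

```python
import torch
import torch.nn as nn

class TriModalBrainEncoder(nn.Module):
    """Hypothetical sketch of a TRIBE-style encoder: fuse features from
    three modalities and regress fMRI activity across ~1,000 brain
    regions. Dimensions and fusion strategy are illustrative only."""

    def __init__(self, video_dim=1024, audio_dim=1024, text_dim=2048,
                 hidden_dim=512, n_regions=1000):
        super().__init__()
        # Project each modality's pretrained features into a shared space.
        self.video_proj = nn.Linear(video_dim, hidden_dim)
        self.audio_proj = nn.Linear(audio_dim, hidden_dim)
        self.text_proj = nn.Linear(text_dim, hidden_dim)
        # A small transformer fuses the three streams over time.
        layer = nn.TransformerEncoderLayer(d_model=hidden_dim, nhead=8,
                                           batch_first=True)
        self.fusion = nn.TransformerEncoder(layer, num_layers=2)
        # One regression output per brain region, at every timestep.
        self.head = nn.Linear(hidden_dim, n_regions)

    def forward(self, video_feats, audio_feats, text_feats):
        # Inputs: (batch, time, dim) features from frozen pretrained
        # encoders (e.g., V-JEPA 2, Wav2Vec2-BERT, Llama 3.2).
        x = (self.video_proj(video_feats)
             + self.audio_proj(audio_feats)
             + self.text_proj(text_feats))
        x = self.fusion(x)
        return self.head(x)  # (batch, time, n_regions) predicted signal

# Usage with random stand-in features, time-aligned across modalities:
model = TriModalBrainEncoder()
v = torch.randn(1, 100, 1024)   # 100 timesteps of video features
a = torch.randn(1, 100, 1024)   # audio features
t = torch.randn(1, 100, 2048)   # text features
pred = model(v, a, t)           # -> torch.Size([1, 100, 1000])
```

The design point this sketch tries to capture is that the heavy lifting happens in frozen, pretrained unimodal encoders; the trainable part only has to learn how their features map onto fMRI time series.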

Meta is releasing the model, codebase, and a live demo openly. The stated applications are neuroscience research, brain-inspired AI architectures, and neurological disease simulation without constant patient scanning.

Here is the part worth thinking about.

Meta owns Facebook, Instagram, and WhatsApp. Their business is built on understanding what content drives engagement. When you have a model that predicts neural responses to combinations of video, audio, and text, you also have a tool for designing content that triggers the strongest possible biological response in viewers.

  • What Meta says: The research team frames this entirely as neuroscience. No commercial application is suggested.
  • What the researchers acknowledge: The dual-use concern is real. They flag it themselves in the paper.
  • The context: A model trained to predict which content activates which brain regions, built by a company whose revenue depends on maximizing time spent on screens, deserves scrutiny regardless of intent.

Bottom line: A model that predicts brain responses to content, released by the company that sells ads against that content, does not need to be sinister to be worth watching.

Source: Meta

RunPod

If you need on-demand GPUs for training, fine-tuning, inference, or running open-source models, give RunPod a try.

  • Available hardware: H100, H200, A100, L40S, RTX 4090, RTX 5090, and 30+ more
  • Cost: Significantly cheaper than AWS or GCP, billed per second, no contracts
  • Setup: Spins up in under a minute, 30+ regions worldwide
Try RunPod →
Affiliate disclosure: We may earn a commission if you sign up via our link, at no extra cost to you.
Efficienist Newsletter

Get the core business tech news delivered straight to your inbox. We track AI, automation, SaaS, and cybersecurity so you don't have to.

Just read what you want, and be done with it.
