Home · Blog · Research

Decoding Visual Semantics from fMRI: Brain2Text Model Maps Neural Representations

Decoding Visual Semantics from fMRI: Brain2Text Model Maps Neural Representations

Researchers have developed a deep learning model that can decode fMRI brain scans into written descriptions of what a person is seeing, pinpointing the brain regions responsible for understanding visual meaning. The model, called Brain2Text, was trained without any visual information and achieves state-of-the-art performance in generating accurate captions for complex natural images.

What the Study Found

Led by Feihan Feng and Jingxin Nie, the team used fMRI data from participants viewing natural images and trained a neural network to output text descriptions. The model produced captions that captured the core semantic content, such as "a dog running on grass" or "a group of people at a beach." Importantly, the model was trained only on fMRI-text pairs, not on actual images, forcing it to learn the relationship between neural activity and meaning.

Neuroanatomical analysis revealed that higher-level visual areas—including the MT+ complex (motion processing), ventral stream visual cortex (object recognition), and inferior parietal cortex (spatial attention)—play critical roles in visual semantic processing. Category-specific analysis showed nuanced neural representations for semantic dimensions like animacy (living vs. non-living) and motion (static vs. moving), suggesting that the brain distributes these features across a network of regions.

Why It Matters

This research bridges AI and neuroscience by providing an interpretable framework to probe how the brain constructs meaning from visual input. For the average person, understanding that different brain regions specialize in different semantic aspects can inform cognitive training strategies. For example, tasks that engage motion processing (like watching sports) or object recognition (like art appreciation) may strengthen the corresponding neural networks. The findings also hold promise for developing brain-computer interfaces that restore communication for people with paralysis or locked-in syndrome.

What You Can Do

To enhance your visual semantic processing, engage in activities that challenge your brain to interpret scenes: describe paintings in detail, play "I Spy" with complex images, or practice summarizing what you see in a photograph in one sentence. Regular mental visualization exercises can also strengthen the neural pathways involved.

Source: arXiv q-bio.NC

Curious about your own brain? Take our free adaptive IQ test or try 306 brain training levels.

Curious about your own IQ?

Take our free, scientifically designed adaptive test across 7 cognitive domains. No signup required.

Take the free test