AI Language Understanding: Breakthrough Discovery
Neural Networks experience a ‘Phase Transition’ in Learning, Shifting from Syntax to Meaning
Neural networks, the engines powering today’s AI like ChatGPT, Gemini, and Claude, don’t initially learn language the way humans do. They begin by focusing on where words appear in a sentence – their position – before transitioning to understanding what the words actually mean. A new study reveals this shift isn’t gradual, but a surprisingly abrupt “phase transition,” akin to water boiling into steam, once a critical amount of data is processed. This finding offers crucial insights into how these complex models learn and could pave the way for more efficient and reliable AI.
Just as a child learning to read first grasps sentence structure, neural networks initially decipher meaning based on word order. Identifying subjects, verbs, and objects relies on their typical positions within a sentence. For example, in English, we expect the subject to come before the verb (“Mary eats the apple”). Though,as the network is exposed to more and more training data – essentially,”going to school” for longer – a essential change occurs: the meaning of words becomes the dominant factor.
This phenomenon was observed in a simplified model of the “self-attention mechanism,” a core component of transformer language models. Transformers are neural network architectures specifically designed to process sequential data like text, and they underpin many of the AI systems we interact with daily. They excel at understanding relationships between words and utilize self-attention to determine the importance of each word relative to others.
“To assess relationships between words, the network can use two strategies, one of which is to exploit the positions of words,” explains Hugo Cui, a postdoctoral researcher at Harvard University and lead author of the study. “In a language like English,the subject typically precedes the verb,which in turn precedes the object.”
The study found that positional understanding is the first strategy a network spontaneously develops during training. Though, the researchers were surprised to discover that continued training and increased data exposure led to a sudden and complete shift. “If training continues and the network receives enough data, at a certain point – once a threshold is crossed – the strategy abruptly shifts: the network starts relying on meaning instead,” Cui says.
Below this data threshold, the network relied exclusively on position; above it, only on meaning.
Cui describes this change as a “phase transition,” drawing a parallel to concepts in physics. Statistical physics studies systems with vast numbers of interacting particles (atoms, molecules) by analyzing their collective behavior. Similarly, neural networks are composed of numerous interconnected “nodes” (neurons) that perform simple calculations.The network’s intelligence emerges from these interactions, a process that can be modeled using statistical methods.This analogy explains why the shift in network behavior can be described as a phase transition – much like water changing from liquid to gas under specific temperature and pressure conditions.
“Understanding from a theoretical viewpoint that the strategy shift happens in this manner is vital,” Cui emphasizes. “Our networks are simplified compared to the complex models people interact with daily, but they can give us hints to begin to understand the conditions that cause a model to stabilize on one strategy or another. This theoretical knowledge could hopefully be used in the future to make the use of neural networks more efficient, and safer.”
The research, titled ”A Phase Transition between Positional and Semantic Learning in a Solvable Model of Dot-Product Attention,” was published in JSTAT as part of the Machine Learning 2025 special issue and presented at the NeurIPS 2024 conference. The study was conducted by Hugo Cui, Freya Behrens, Florent Krzakala, and Lenka Zdeborová.
