Skip to main content
News Directory 3
  • Business
  • Entertainment
  • Health
  • News
  • Sports
  • Tech
  • World
Menu
  • Business
  • Entertainment
  • Health
  • News
  • Sports
  • Tech
  • World

AI Language Understanding: Breakthrough Discovery

July 8, 2025 Lisa Park Tech
News Context
At a glance
Original source: sciencedaily.com

Neural Networks experience a ‘Phase Transition’ in Learning, Shifting from Syntax to Meaning

Neural networks, the engines powering today’s‍ AI like ChatGPT, Gemini, and Claude, don’t initially learn language the way‍ humans do.‍ They begin by focusing on where words appear in a sentence – their position – before transitioning ⁢to understanding what the words‍ actually mean.⁣ A new study reveals this shift ⁢isn’t gradual, but a surprisingly ⁤abrupt “phase transition,” akin to water boiling into steam, ⁤once a critical amount of data is processed. This ⁣finding offers crucial insights into how these complex models learn and could pave the way for more efficient and reliable AI.

Just as a child learning to read first grasps sentence structure, neural networks initially decipher meaning based on word order. Identifying subjects, verbs, and objects relies on their typical positions within a sentence. For example, in English, we expect the subject to come before the verb (“Mary eats the apple”). Though,as the network is exposed to more ⁢and more training data – essentially,”going to school” for longer – a essential change occurs: the meaning of words becomes the dominant factor.

This phenomenon was observed in a simplified model of the “self-attention mechanism,” a core component ⁢of transformer language models. Transformers ⁣are neural network architectures specifically designed to process sequential data like text, and they underpin⁣ many of the AI systems we interact with daily. They excel at understanding relationships between words and utilize self-attention to determine the importance of each word⁣ relative to others.

“To assess relationships between words, the network can use two strategies, one of which⁣ is to exploit the positions of words,” explains Hugo Cui, a postdoctoral researcher at Harvard University and lead author of the study. “In a language like English,the⁤ subject typically precedes the verb,which in turn precedes the object.”

The study found that positional understanding is the first strategy a network spontaneously develops during training. Though, the researchers were surprised to discover that continued training and increased data exposure ⁢led to a sudden⁣ and complete shift. “If training continues and the⁤ network receives enough data, at a⁣ certain point – once a threshold is crossed – the strategy abruptly shifts: the network starts relying on meaning instead,” Cui says.

Below this‍ data ⁣threshold, the network relied exclusively on‍ position; ⁣above it, only on meaning.

Cui describes this change as a “phase transition,” drawing a parallel to concepts in physics. ⁢Statistical physics studies systems with vast numbers of interacting⁣ particles (atoms, molecules) by analyzing their collective ⁤behavior. Similarly, neural networks are composed of numerous interconnected “nodes” (neurons) that perform simple ⁣calculations.The network’s intelligence emerges from these interactions, a process ⁣that can be modeled using statistical methods.This analogy‍ explains why the shift in network behavior can be described as a phase transition – much like water changing from liquid to ⁣gas under specific temperature and pressure conditions.

“Understanding from a theoretical viewpoint that the strategy shift happens in this manner is ⁢vital,” Cui emphasizes. “Our networks are simplified compared to the ⁢complex models people interact with daily, but they can ⁢give us hints to begin‍ to understand the conditions that cause ⁣a model to stabilize on one strategy or another. This theoretical knowledge could hopefully be ‍used in the future ⁣to make the use of neural networks more efficient, and safer.”

The research, titled ⁢”A Phase Transition between Positional and Semantic Learning in a⁢ Solvable Model of Dot-Product Attention,” was published in JSTAT as part of the Machine Learning 2025 special issue and presented at the NeurIPS 2024 conference. The study was conducted by Hugo Cui, Freya Behrens, Florent Krzakala, and Lenka⁣ Zdeborová.

Share this:

  • Share on Facebook (Opens in new window) Facebook
  • Share on X (Opens in new window) X

Related

Neural Interfaces; Statistics; Computers and Internet; Math Puzzles; Computer Modeling; Computer Programming; Artificial Intelligence; Mathematical Modeling

Search:

News Directory 3

News Directory 3 catalogs US newspapers, news services, newsstands and digital news outlets across all 50 states. Browse local publishers by city, state, or topic, and follow current headlines linked back to their original sources.

Quick Links

  • Disclaimer
  • Terms and Conditions
  • About Us
  • Advertising Policy
  • Contact Us
  • Cookie Policy
  • Editorial Guidelines
  • Privacy Policy

Browse by State

  • Alabama
  • Alaska
  • Arizona
  • Arkansas
  • California
  • Colorado

© 2026 News Directory 3. All rights reserved.