Skip to main content
News Directory 3
  • Business
  • Entertainment
  • Health
  • News
  • Sports
  • Tech
  • World
Menu
  • Business
  • Entertainment
  • Health
  • News
  • Sports
  • Tech
  • World
Multimodal Reasoning: New Error Tracking Metric - News Directory 3

Multimodal Reasoning: New Error Tracking Metric

June 15, 2025 Catherine Williams Tech
News Context
At a glance
  • Computer ⁣scientists have developed elegant machine learning models ⁢capable of high performance across varied tasks.
  • Models ‍such as OpenAI's GPT4 with Vision (GPT-4V), DeepSeek-R1, and Google Gemini⁤ are widely used to create multimodal‍ content, including images and tailored texts.
  • Researchers are assessing⁣ the reasoning abilities of these⁤ models, especially how they handle visual inputs.
Original source: techxplore.com

Uncover the critical findings of a new study that scrutinizes the reliability of multimodal reasoning models. This research introduces a new metric, RH-Bench, designed to track adn assess how these advanced models, including widely-used ones ⁣like GPT-4V and Gemini, generate inaccurate outputs—or hallucinations—during reasoning tasks. ⁣the study emphasizes that reasoning⁢ models often⁣ amplify these errors, a key insight⁤ for improving AI accuracy.Discover how researchers are tackling this critical issue and what it means for the future of AI. Read more on News Directory 3 for detailed insights into this groundbreaking research. Discover what’s⁢ next …


Multimodal Reasoning Models: New Hallucination Metric Assessed










key ⁢Points

  • MLLMs process texts, images and videos.
  • Models like GPT-4V and ⁢Gemini create multimodal content.
  • Study assesses hallucination amplification in reasoning.
  • RH-Bench dataset evaluates reasoning and perception.
  • Reasoning models show more hallucination then non-reasoning ones.

Benchmarking Hallucinations: New Metric Tracks Multimodal Reasoning⁣ models

Updated June 15, 2025

outputs from ⁢reasoning and non-reasoning models on ⁢a perception task, ‍highlighting visual hallucination.

⁣ (a) ⁣Outputs from reasoning and⁤ non-reasoning models on a perception task, highlighting visual hallucination. Multimodal reasoning models amplify hallucinations. (b) Model performance on reasoning and perception tasks in the RH-Bench dataset.
⁢ Credit: Liu et al.

Computer ⁣scientists have developed elegant machine learning models ⁢capable of high performance across varied tasks. Multimodal large language‍ models (MLLMs) can process and generate different data types, including texts, images, and videos.

Models ‍such as OpenAI’s GPT4 with Vision (GPT-4V), DeepSeek-R1, and Google Gemini⁤ are widely used to create multimodal‍ content, including images and tailored texts.

Researchers are assessing⁣ the reasoning abilities of these⁤ models, especially how they handle visual inputs. A study by Liu et al., available on arXiv, investigates how reasoning processes can amplify hallucinations in MLLMs. The research introduces a new metric and dataset, RH-Bench, to evaluate these models.

The study, “More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models,” highlights that while MLLMs excel in many areas, they can‍ also generate outputs that contain inaccuracies or fabrications, known as hallucinations. the researchers found that reasoning models⁢ are more prone to amplifying these hallucinations compared to non-reasoning models.

The ‍RH-Bench ‍dataset includes tasks designed to test both reasoning and perception. The results indicate that models with strong reasoning capabilities frequently enough‍ exhibit more hallucinations. baseline non-reasoning models typically show ⁢weaker reasoning but fewer hallucinations.

What’s next

The findings suggest that ‍future ⁣research should focus on reducing‍ hallucinations in multimodal reasoning models to improve their reliability and accuracy in real-world applications.

Further reading

  • More Thinking, Less Seeing? Assessing Amplified Hallucination⁣ in Multimodal Reasoning Models

Share this:

  • Share on Facebook (Opens in new window) Facebook
  • Share on X (Opens in new window) X

Related

computer news, hi-tech news, hitech, information technology, innovation, inventions

Search:

News Directory 3

News Directory 3 catalogs US newspapers, news services, newsstands and digital news outlets across all 50 states. Browse local publishers by city, state, or topic, and follow current headlines linked back to their original sources.

Quick Links

  • Disclaimer
  • Terms and Conditions
  • About Us
  • Advertising Policy
  • Contact Us
  • Cookie Policy
  • Editorial Guidelines
  • Privacy Policy

Browse by State

  • Alabama
  • Alaska
  • Arizona
  • Arkansas
  • California
  • Colorado

© 2026 News Directory 3. All rights reserved.