AI models confidently describe images they never saw, and benchmarks fail to catch it
AI models misrepresent visual competence in evaluations.

A recent Stanford study reveals that multimodal AI models, including GPT-5 and Gemini 3 Pro, can generate detailed descriptions and diagnoses without any actual image input, achieving 70-80% of their benchmark scores based on text alone. This phenomenon, termed the 'mirage effect,' raises concerns about the reliability of these models in critical applications, particularly in healthcare, where they may fabricate severe medical diagnoses without visual evidence.
Key Takeaways
1. Multimodal AI models like GPT-5 and Gemini 3 Pro achieve 70-80% of their benchmark scores without any image input.
2. In medical evaluations, fabricated diagnoses often skew toward severe conditions such as ST-elevation myocardial infarction.
3. A text-only model outperformed multimodal models and human radiologists in medical image analysis.
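The core measurement behind the first takeaway is a text-only ablation: run the same benchmark questions with and without the image, then compare scores. A minimal sketch of that comparison is below; the function name and the example scores are illustrative assumptions, not figures from the study.

```python
# Hypothetical text-only ablation check for a multimodal benchmark.
# All names and numbers here are illustrative, not from the Stanford study.

def blind_score_fraction(score_with_images: float, score_text_only: float) -> float:
    """Fraction of the full multimodal score a model retains
    when the image input is withheld entirely."""
    if score_with_images <= 0:
        raise ValueError("full-input score must be positive")
    return score_text_only / score_with_images

# Example: a model scoring 82% with images and 60% on the same
# questions with images removed retains ~73% of its score blind,
# inside the 70-80% range the study reports.
fraction = blind_score_fraction(0.82, 0.60)
print(f"{fraction:.0%}")  # → 73%
```

A blind-retention fraction near 1.0 suggests the benchmark's questions can largely be answered from text priors alone, so a high overall score says little about visual competence.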