OpenAI’s New GPT-4.5 Still Hallucinates in Over a Third of Responses

Sarakhon Desk
  • Update Time: Sunday, March 2, 2025

OpenAI has admitted that its latest large language model, GPT-4.5, still struggles with accuracy, hallucinating 37% of the time according to SimpleQA, the company’s own in-house factuality benchmark. While this is an improvement over previous models such as GPT-4o (which hallucinates 61.8% of the time) and o3-mini (which reaches a staggering 80.3%), the rate remains a critical flaw in AI reliability.

Hallucination, in AI terms, refers to a model generating false or misleading information while presenting it as fact. Despite OpenAI’s efforts, researchers studying the problem estimate that even the best AI systems produce entirely accurate responses only about 35% of the time.
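For a concrete sense of how a figure like 37% is produced, here is a minimal sketch of a SimpleQA-style scoring step in Python. The grading labels and the toy data are assumptions for illustration only; the article does not describe OpenAI’s actual grading pipeline.

# Minimal sketch of a SimpleQA-style hallucination metric.
# Assumption: each model answer has already been graded as "correct",
# "incorrect", or "not_attempted" (the model declined to answer).

def hallucination_rate(grades):
    """Share of all answers that were attempted but factually wrong."""
    if not grades:
        return 0.0
    incorrect = sum(1 for g in grades if g == "incorrect")
    return incorrect / len(grades)

# Toy grades for illustration; not real benchmark data.
grades = ["correct", "incorrect", "not_attempted", "incorrect", "correct"]
print(f"hallucination rate: {hallucination_rate(grades):.0%}")  # prints 40%

Under this reading, a model can lower its hallucination rate either by answering correctly more often or by declining to answer when unsure, which is why benchmarks typically report abstentions separately from errors.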

DeepSeek’s Approach to AI Accuracy

While OpenAI faces ongoing challenges with hallucinations, competitors are taking different approaches. DeepSeek, a Chinese AI company, has been gaining attention for its emphasis on knowledge retrieval and structured reasoning. Unlike OpenAI’s GPT models, which rely heavily on probabilistic generation, DeepSeek prioritizes verifiable sources and claims to reduce hallucinations by integrating search-based retrieval into how it generates answers. Whether DeepSeek can maintain this accuracy at scale remains an open question, however, especially as demand for more powerful AI models grows.
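The article does not describe DeepSeek’s internal pipeline, but the general idea of grounding generation in retrieved sources can be sketched as follows. Everything here, including the toy corpus and the answer_with_sources helper, is a hypothetical illustration of retrieval-grounded prompting, not DeepSeek’s actual method.

# Hypothetical sketch of retrieval-grounded prompting: find the most
# relevant passage for a question, then constrain the model to answer
# only from that passage. Generic illustration, not DeepSeek's pipeline.

CORPUS = [
    "OpenAI reported a 37% incorrect-answer rate for GPT-4.5 on SimpleQA.",
    "OpenAI reported an 80.3% incorrect-answer rate for o3-mini on SimpleQA.",
]

def retrieve(question: str) -> str:
    """Naive keyword retrieval: return the passage sharing the most words."""
    words = set(question.lower().split())
    return max(CORPUS, key=lambda p: len(words & set(p.lower().split())))

def answer_with_sources(question: str) -> str:
    """Build a prompt that restricts the model to the retrieved passage."""
    passage = retrieve(question)
    return (
        "Answer using only this source; say 'unknown' if it is not covered.\n"
        f"Source: {passage}\n"
        f"Question: {question}"
    )

print(answer_with_sources("How often does GPT-4.5 hallucinate?"))

The constrained prompt narrows the model’s room to invent facts, though it only helps when retrieval actually surfaces the right passage, which is one reason accuracy at scale remains the open question the article raises.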

As OpenAI and its competitors refine their models, it is becoming clear that incremental updates are not enough to eliminate hallucinations. While AI continues to evolve, its struggle with truthfulness raises the question of whether genuine breakthroughs in reliability are on the horizon, or whether AI companies are simply managing expectations while keeping investors interested.
