Whisper’s Hallucinations Messing Up AI Medical Transcription Records


Whisper, an AI speech-to-text tool developed by OpenAI and used in hospitals for medical transcription, is being criticized for occasionally “hallucinating”, producing incorrect information in the medical transcripts it generates. 

The tool has been widely adopted by Nabla, a French health tech company that has reportedly used it to transcribe more than seven million medical conversations, according to ABC News. 

Researchers caution that Whisper’s tendency to fabricate inaccurate or even violent content makes it far from reliable, particularly for critical tasks such as producing an AI-generated doctor’s note. 

Whisper’s Offensive Words 

In the study “Careless Whisper: Speech-to-Text Hallucination Harms”, a team of researchers from several universities, including the University of Washington and Cornell University, found that medical transcription AI software such as Whisper hallucinated in roughly 1% of the recordings examined.  

Among the errors found in AI medical transcription were invented phrases and entire sentences that could be misleading in a medical context.  

In some cases, the AI transcription tool generated text during moments of silence, introducing offensive or irrelevant words that had nothing to do with the actual conversation.  

Such incidents occurred particularly in recordings of individuals with aphasia, a language impairment frequently marked by prolonged pauses, which increases the likelihood of these mistakes. 

To further illustrate the issue with AI medical transcription, Dr. Allison Koenecke, a member of the research team from Cornell University, posted examples on her Threads account showing how Whisper sometimes inserts out-of-context sentences such as “Thank you for watching!”.  

Solutions on the Way 

Researchers suggest that Whisper’s mistakes may stem from its training on vast amounts of transcription data, including YouTube videos, which can lead it to generate irrelevant or erratic content. 

In response to the criticism, Nabla, whose AI medical scribe relies on the tool, acknowledges that it has hallucination issues and says it is working to fix the problem. 

As for OpenAI, company spokesperson Taya Christianson told The Verge in a statement that it is aware of these concerns and is working to address them. 

“We take this issue seriously and are continually working to improve, including reducing hallucinations. For Whisper use on our API platform, our usage policies prohibit use in certain high-stakes decision-making contexts, and our model card for open-source use includes recommendations against use in high-risk domains. We thank researchers for sharing their findings,” Christianson said. 

The findings have ignited discussion about the risks of over-relying on AI in healthcare, where accuracy is a pillar. Even though Whisper has been widely praised for fast speech transcription, its tendency to generate “hallucinations” casts doubt on its reliability in medical applications. 

The researchers presented their findings on AI medical transcription in June at the Association for Computing Machinery’s FAccT conference in Brazil, though it is unclear whether the study has been peer-reviewed. 
