CriticGPT Is Here to Improve AI 

OpenAI has unveiled a new AI model called CriticGPT that aids human trainers in detecting code errors generated by its language model GPT.

CriticGPT was designed to improve the flagship model. On Thursday, the Microsoft-backed company released a paper, “LLM Critics Help Catch LLM Bugs,” that explains how the new model works. 

An Assistant for Human Trainers  

Generative AI models such as GPT-4 are first trained on vast amounts of data. They then go through a process known as Reinforcement Learning From Human Feedback (RLHF). During this phase, workers, often recruited through crowdsourcing platforms, interact with the models and evaluate the answers they provide. 

The main goal of RLHF is to teach the models to choose better answers and so improve their performance. However, RLHF becomes less effective as the models grow more advanced: human trainers find it increasingly difficult to spot mistakes, particularly when the models' outputs start to surpass what the trainers themselves could produce. 
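As a rough illustration, the comparisons collected during RLHF are typically turned into a pairwise training signal for a reward model. The sketch below is a minimal, hypothetical example of such a preference objective in Python; the function and variable names are illustrative and are not taken from OpenAI's paper.

```python
import math

# Minimal, hypothetical sketch of the pairwise preference objective commonly
# used to train a reward model from RLHF comparisons (names are illustrative,
# not taken from OpenAI's paper).
def preference_loss(score_preferred: float, score_rejected: float) -> float:
    # Bradley-Terry style objective: the loss shrinks as the score of the
    # answer the trainer preferred rises above the score of the rejected one.
    return -math.log(1.0 / (1.0 + math.exp(-(score_preferred - score_rejected))))

# Example: three comparisons collected from human trainers.
comparisons = [(1.2, 0.3), (0.4, 0.9), (2.0, 1.1)]
losses = [preference_loss(p, r) for p, r in comparisons]
print(sum(losses) / len(losses))  # average loss the reward model would minimise
```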

Enhancing the Review Process 

To address this challenge, OpenAI developed CriticGPT, a model based on GPT-4 that critiques the chatbot's code output and identifies errors in it.  
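For context, the kind of mistake CriticGPT is meant to surface is often a subtle logic error rather than a syntax error. The snippet below is a purely hypothetical example, not drawn from the paper, of a bug a critic model would be expected to flag, with the sort of critique it might produce shown as a comment.

```python
# Hypothetical example of ChatGPT-style code containing a subtle bug that a
# critic model would be expected to catch (illustrative only, not from the paper).
def average(values: list[float]) -> float:
    total = 0.0
    for v in values:
        total += v
    return total / len(values)  # Bug: average([]) raises ZeroDivisionError.

# A CriticGPT-style critique might read:
# "The function divides by len(values) without handling the empty-list case,
#  so calling average([]) will crash with a ZeroDivisionError."
```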

“We found that when people get help from CriticGPT to review ChatGPT code they outperform those without help 60 percent of the time,” the company stated in a blog post. 

The purpose of CriticGPT, however, is not to create an automated feedback loop between models, but to sharpen the skills of the people who carry out the RLHF process. 

Between Accuracy and Mistakes 

The paper also showed that CriticGPT can outperform humans at detecting inserted bugs, with its critiques preferred by human reviewers more than 80 percent of the time. Working with CriticGPT is not always an advantage, however: although the model reduces hallucinations, it still flags errors that do not exist more often than human reviewers working on their own.  

OpenAI acknowledges this trade-off, noting that it has yet to determine the right balance between reducing hallucinations and detecting bugs. 

