NVIDIA CEO MIGHT BE RIGHT ABOUT CODING BEING DEAD BECAUSE OF AI — OPENAI'S NEW CRITICGPT MODEL IDENTIFIES CHATGPT'S PROGRAMMING MISTAKES BETTER THAN AI TRAINERS

What you need to know

OpenAI recently launched CriticGPT to help identify errors in code generated using ChatGPT.
The tool helps AI trainers identify errors faster and more easily than they would without the help of AI.
The ChatGPT maker admits the tool isn't 100% accurate and faces several challenges, including an inability to handle highly complex tasks and occasional hallucinations.

OpenAI recently launched CriticGPT, a model powered by GPT-4. As the name suggests, the model "writes critiques of ChatGPT responses to help human trainers spot mistakes" in ChatGPT's code output.

According to the ChatGPT maker:

"We found that when people get help from CriticGPT to review ChatGPT code, they outperform those without help 60% of the time. We are beginning the work to integrate CriticGPT-like models into our RLHF labeling pipeline, providing our trainers with explicit AI assistance."

OpenAI uses Reinforcement Learning from Human Feedback (RLHF) to make ChatGPT more "helpful and interactive." An integral part of this process is collecting comparisons, in which AI trainers rate different ChatGPT responses against each other.
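To make the idea of comparisons concrete, here is a minimal sketch of how pairwise ratings could be tallied into a per-response win rate. It is an illustration only, not OpenAI's pipeline: the function name `win_rates`, the response labels, and the sample data are all hypothetical, and in RLHF such comparisons are typically used to train a reward model rather than simply counted.

```python
from collections import defaultdict


def win_rates(comparisons):
    """Tally pairwise trainer ratings into a win rate per response.

    `comparisons` is an iterable of (winner_id, loser_id) pairs, one per
    trainer judgment. This data shape is an assumption for illustration.
    """
    wins = defaultdict(int)
    total = defaultdict(int)
    for winner, loser in comparisons:
        wins[winner] += 1
        total[winner] += 1
        total[loser] += 1
    # Fraction of comparisons each response won, out of those it appeared in.
    return {rid: wins[rid] / total[rid] for rid in total}


# Hypothetical ratings of three responses to the same prompt.
comparisons = [("A", "B"), ("A", "C"), ("B", "C"), ("C", "A")]
rates = win_rates(comparisons)
for rid in sorted(rates):
    print(rid, round(rates[rid], 2))
```

In this made-up data, response A wins two of its three comparisons, so it ranks highest.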

CriticGPT will help improve ChatGPT's reasoning capabilities, ultimately reducing hallucinations and the generation of incorrect responses and misinformation. As it happens, it is becoming increasingly hard for AI trainers to identify mistakes as ChatGPT advances.

The tool is primarily trained to identify and write critiques highlighting inaccuracies in ChatGPT answers. OpenAI admits the tool isn't always 100% accurate, but it helps AI trainers identify errors faster and more easily than they would without AI.

CriticGPT will reportedly augment trainers' skills, ultimately equipping people with more comprehensive critique techniques. While AI trainers and CriticGPT can each get the job done separately, a Human+CriticGPT combination appears to provide the most accurate and detailed critiques.

According to OpenAI's findings:

"We find that CriticGPT critiques are preferred by trainers over ChatGPT critiques in 63% of cases on naturally occurring bugs, in part because the new critic produces fewer "nitpicks" (small complaints that are unhelpful) and hallucinates problems less often."

CriticGPT is still a work in progress

While impressive, CriticGPT still needs a lot of work. OpenAI has highlighted the model's shortcomings as listed below:

We trained CriticGPT on ChatGPT answers that are quite short. To supervise the agents of the future, we will need to develop methods that can help trainers to understand long and complex tasks.
Models still hallucinate and sometimes trainers make labeling mistakes after seeing those hallucinations.
Sometimes real-world mistakes can be spread across many parts of an answer. Our work focuses on errors that can be pointed out in one place, but in the future we need to tackle dispersed errors as well.
CriticGPT can only help so much: if a task or response is extremely complex even an expert with model help may not be able to correctly evaluate it.

In the future, OpenAI intends to scale CriticGPT further and use it to improve the RLHF data for GPT-4 training. In a separate report, Oxford researchers leveraged semantic entropy to assess the meaning of generated outputs, gauge the quality of responses, and spot traces of hallucination.
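For readers curious about what semantic entropy measures, the rough idea is to sample several answers to the same question, group answers that mean the same thing, and compute the entropy over those groups; high entropy suggests the model is unsure and may be hallucinating. The sketch below is a simplified illustration under stated assumptions, not the researchers' implementation: `meaning_key` is a hypothetical stand-in that groups answers by normalized text, whereas a real system would need a model to judge whether two answers share a meaning.

```python
import math
import string
from collections import Counter


def meaning_key(answer):
    """Hypothetical stand-in for meaning: lowercase text without punctuation."""
    stripped = answer.lower().translate(str.maketrans("", "", string.punctuation))
    return " ".join(stripped.split())


def semantic_entropy(answers, key=meaning_key):
    """Entropy (in nats) over groups of answers that share a meaning key."""
    clusters = Counter(key(a) for a in answers)
    n = len(answers)
    return sum(-(c / n) * math.log(c / n) for c in clusters.values())


# Made-up samples: consistent answers give low entropy, scattered ones high.
consistent = ["Paris", "paris.", "Paris"]
scattered = ["Paris", "Lyon", "Rome", "Nice"]
print(semantic_entropy(consistent))
print(semantic_entropy(scattered))
```

Here the consistent samples collapse into a single group, while the scattered samples form four groups and yield a much higher entropy.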

AI models are becoming more advanced and sophisticated, allowing them to handle complex tasks better. NVIDIA CEO Jensen Huang argues coding might be dead in the water as a career option for the next generation. Instead, he recommends seeking alternative career options in biology, education, manufacturing, or farming. Huang might not be entirely wrong if the coding capabilities of OpenAI's GPT-4o are anything to go by.

2024-07-02T19:01:55Z
