Machine Learning Times

7 months ago
ChatGPT’s Performance Is Slipping, New Study Says

 
Originally published in Decrypt, July 19, 2023.

UC Berkeley researchers found that ChatGPT has not improved over time and, in fact, may have gotten worse.

ChatGPT exploded onto the scene late last year, dazzling people with its human-like conversational abilities, and the release of its latest version prompted a crypto rally and calls for a pause in development. But according to a new study, the leading AI bot’s skills may actually be on the decline.

Researchers at Stanford and UC Berkeley systematically analyzed different versions of ChatGPT from March and June 2023. They developed rigorous benchmarks to evaluate the model’s competency in math, coding, and visual reasoning tasks. The results did not show improvement over time.

The tests revealed a startling drop-off in performance between versions. On a math challenge of determining whether numbers are prime, ChatGPT answered 488 out of 500 questions correctly in March, an accuracy of 97.6%. In June, however, ChatGPT managed to get only 12 questions right, plunging to 2.4% accuracy.
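The accuracy figures follow directly from the question counts reported above. As a minimal sketch, the arithmetic can be checked in a few lines of Python; the function name `accuracy_percent` is hypothetical and chosen for illustration, not taken from the study.

```python
def accuracy_percent(correct: int, total: int) -> float:
    """Return the share of correct answers as a percentage."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * correct / total


# Counts as reported in the article: 500 prime-number questions per version.
march = accuracy_percent(488, 500)
june = accuracy_percent(12, 500)

print(f"March: {march:.1f}%")
print(f"June: {june:.1f}%")
print(f"Drop: {march - june:.1f} percentage points")
```

The gap between the two versions is best read in percentage points, since the June score is a small fraction of the March score rather than a modest relative decline.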

