Machine Learning Times
Machine Learning Times
EXCLUSIVE HIGHLIGHTS
Panic Over DeepSeek Exposes AI’s Weak Foundation On Hype
 Originally published in Forbes The story about DeepSeek has disrupted...
AI Drives Alphabet’s Moonshot To Save The World’s Electrical Grid
 Originally published in Forbes Note: Ravi Jain, Chief Technology Officer...
Why Alphabet’s Clean Energy Moonshot Depends On AI
 Originally published in Forbes Note: Ravi Jain, Chief Technology Officer...
Predictive AI Only Works If Stakeholders Tune This Dial
 Originally published in Forbes I’ll break it to you gently:...
SHARE THIS:

2 years ago
ChatGPT’s Performance Is Slipping, New Study Says

 
Originally published in Decrypt, July 19, 2023.

UC Berkeley researchers found that ChatGPT has not improved over time, and in fact, may have gotten worse.

ChatGPT exploded onto the scene late last year, dazzling people with its human-like conversational abilities, and the release of latest version prompted a  crypto rally and calls for a pause in development. But according to a new study, the leading AI bot’s skills may actually be on the decline.

Researchers at Stanford and UC Berkeley systematically analyzed different versions of ChatGPT from March and June 2022. They developed rigorous benchmarks to evaluate the model’s competency in math, coding, and visual reasoning tasks. The results of ChatGPT’s performance over time were not good.

The tests revealed a startling drop-off in performance between versions. On a math challenge of determining prime numbers, ChatGPT solved 488 out of 500 questions correctly in March, an accuracy of 97.6%. However, in June, ChatGPT only managed to get 12 questions right, plunging to 2.4% accuracy.

To continue reading this article, click here.

6 thoughts on “ChatGPT’s Performance Is Slipping, New Study Says

  1. As the backrooms game continue to capture the imagination of online communities, game developers, and players alike, the digital exploration of these mysterious spaces serves as a testament to the enduring power of internet-driven storytelling and shared experiences.

     

Leave a Reply