By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
MadisonyMadisony
Notification Show More
Font ResizerAa
  • Home
  • National & World
  • Politics
  • Investigative Reports
  • Education
  • Health
  • Entertainment
  • Technology
  • Sports
  • Money
  • Pets & Animals
Reading: OpenAI’s GPT-5 rollout just isn’t going easily
Share
Font ResizerAa
MadisonyMadisony
Search
  • Home
  • National & World
  • Politics
  • Investigative Reports
  • Education
  • Health
  • Entertainment
  • Technology
  • Sports
  • Money
  • Pets & Animals
Have an existing account? Sign In
Follow US
2025 © Madisony.com. All Rights Reserved.
Technology

OpenAI’s GPT-5 rollout just isn’t going easily

Madisony
Last updated: August 9, 2025 6:43 pm
Madisony
Share
OpenAI’s GPT-5 rollout just isn’t going easily
SHARE

Need smarter insights in your inbox? Join our weekly newsletters to get solely what issues to enterprise AI, knowledge, and safety leaders. Subscribe Now


Up to date Friday August 8, 5:21 pm ET: shortly after this publish’s publication, OpenAI co-founder and CEO Sam Altman introduced the corporate would restore entry to GPT-4o and different previous fashions for chosen customers, admitting the GPT-5 launch was “extra bumpy than we hoped for.”

The launch of OpenAI’s lengthy anticipated new mannequin, GPT-5, is off to a rocky begin to say the least.

Even forgiving errors in charts and voice demos throughout yesterday’s livestreamed presentation of the brand new mannequin (really 4 separate fashions, and a ‘Considering’ mode that may be engaged for 3 of them), a variety of person reviews have emerged since GPT-5’s launch displaying it erring badly when fixing comparatively easy issues that previous OpenAI fashions — and rivals from competing AI labs — reply accurately.

For instance, knowledge scientist Colin Fraser posted screenshots displaying GPT-5 getting a math proof fallacious (whether or not 8.888 repeating is the same as 9 — it’s after all, not).


AI Scaling Hits Its Limits

Energy caps, rising token prices, and inference delays are reshaping enterprise AI. Be a part of our unique salon to find how high groups are:

  • Turning power right into a strategic benefit
  • Architecting environment friendly inference for actual throughput good points
  • Unlocking aggressive ROI with sustainable AI methods

Safe your spot to remain forward: https://bit.ly/4mwGngO


It additionally failed on a easy algebra arithmetic downside that elementary schoolers may most likely nail, 5.9 = x + 5.11.

Utilizing GPT-5 to evaluate OpenAI’s personal misguided presentation charts additionally didn’t yield useful or appropriate responses.

It additionally failed on this trickier math phrase downside under (which, to be truthful, stumped this human at first…although Elon Musk’s Grok 4 AI answered it accurately. For a touch, consider the truth that flagstones on this case can’t be divided into smaller parts. They need to stay in tact as 80 separate models, so no halves or quarters).

The older 4o mannequin carried out higher for me on no less than one in all these math issues. Sadly, OpenAI is slowly deprecating these older fashions — together with the previous default GPT-4o and the highly effective reasoning mannequin o3 — for customers of ChatGPT, although they’ll proceed to be obtainable within the software programming interface (API) for builders for the foreseeable future.

Not nearly as good at coding as benchmarks point out

Regardless that OpenAI’s inner benchmarks and a few third-party exterior ones have proven GPT-5 to outperform all different fashions at coding, it seems that in actual world utilization, Anthropic’s lately up to date Claude Opus 4.1 appears to do a greater job at “one-shotting” sure duties, that’s, finishing the person’s desired software or software program construct to their specs. See an instance under from developer Justin Solar posted to X :

Opus 4.1’s one-shot try at “create a 3d capybara petting zoo” – 8 minutes whole

This was actually fairly insane, not solely are the capybaras manner cuter and transferring, there are particular person pet affinity ranges, a day/evening switcher, feeding, and even a screenshot function pic.twitter.com/FiKTO3FKK4

— justin (@justinsunyt) August 7, 2025

As well as, a report from safety agency SPLX discovered that OpenAI’s inner security layer left main gaps in areas like enterprise alignment and vulnerability to immediate injection and obfuscated logic assaults. 

Whereas anecdotal, the checking the temperature on how the mannequin is faring with early AI adopters appears to point a cold reception.

AI influencer and former Googler Bilawal Sidhu posted a ballot on X asking for a “vibe verify” from his followers and the broader userbase, and to date, with 172 votes in, the overwhelming response is “Kinda mid.”

Alright, GPT-5 vibe verify

— Bilawal Sidhu (@bilawalsidhu) August 7, 2025

And because the pseudonymous AI Leaks and Information account wrote, “The overwhelming consensus on GPT-5 from each X and the Reddit AMA are overwhelmingly unfavorable.”

The overwhelming consensus on GPT-5 from each X and the Reddit AMA are overwhelmingly unfavorable

Most customers are disgruntled in regards to the damaged mannequin picker and non-pro customers not getting access to legacy fashions

What are your preliminary ideas on GPT-5?

— AI Leaks and Information (@AILeaksAndNews) August 8, 2025

Tibor Blaho, lead engineer at AIPRM and a preferred AI leaks and information poster on X, summarized the various issues with the ChatGPT-5 rollout in a superb publish, highlighting that one of many new marquee options — an automated “router” in ChatGPT that chooses a considering or non-thinking mode for the underlying GPT-5 mannequin relying on the issue of the question — has turn out to be one of many chief complaints, given the mannequin appeared to default to non-thinking mode for a lot of customers.

A bit unhappy how the GPT-5 launch goes to date, particularly after the lengthy wait and excessive expectations

– The automated switching between fashions (the router) appears partly damaged/unreliable

– It is unclear precisely which mannequin you are really interacting with (normal or mini,…

— Tibor Blaho (@btibor91) August 8, 2025

Competitors ready within the wings

Thus, the sentiment towards ChatGPT-5 is way from universally constructive, highlighting a serious downside for OpenAI because it faces rising competitors from main U.S. rivals like Google and Anthropic, and a rising checklist of free, open supply and highly effective Chinese language LLMs providing options that many U.S. fashions lack.

Take the Alibaba Qwen Crew of AI researchers, who simply at present up to date their extremely performant Qwen 3 mannequin to have 1 million token context — giving customers the power to alternate almost 4x as a lot info with the mannequin in a single again/forth interplay as GPT-5 presents.

Given OpenAI’s different large launch this week — that of new open supply gpt-oss fashions — additionally obtained a combined reception from early customers, issues are usually not trying up for the primary devoted AI firm by customers proper now (700 million weekly energetic customers of ChatGPT as of this month).

Certainly, that is additionally exemplified by customers of the betting market Polymarket overwhelmingly deciding following the discharge of GPT-5 that Google would doubtless have the very best AI mannequin by the tip of this month, August 2025.

Different energy customers like Otherside AI co-founder and CEO Matt Shumer, who obtained early entry to GPT-5 and blogged about it favorably in a overview right here, opined that views would shift as extra folks discovered the very best methods to make use of the brand new mannequin and adjusted their integration approaches:

Lots of people who’re having a foul expertise are utilizing GPT-5 in agent harnesses that are not but optimized for it.

For each new mannequin launch, there is a time lag between launch + when firms that combine the mannequin have it really working nicely.

Agent firms rush to…

— Matt Shumer (@mattshumer_) August 8, 2025

Whereas it’s nonetheless early days for GPT-5 — and the sentiment may change dramatically as extra customers get their fingers on it and check out it for various duties — the early indications are usually not trying like this can be a “dwelling run” launch for OpenAI in the identical manner that prior releases corresponding to GPT-4, and even the newer 4o and o3, have been. And that’s a regarding indicator for an organization that simply raised one more funding spherical, but stays unprofitable resulting from its excessive prices of analysis and growth.

Every day insights on enterprise use instances with VB Every day

If you wish to impress your boss, VB Every day has you coated. We provide the inside scoop on what firms are doing with generative AI, from regulatory shifts to sensible deployments, so you’ll be able to share insights for optimum ROI.

Learn our Privateness Coverage

Thanks for subscribing. Try extra VB newsletters right here.

An error occured.


Subscribe to Our Newsletter
Subscribe to our newsletter to get our newest articles instantly!
[mc4wp_form]
Share This Article
Email Copy Link Print
Previous Article The Kim Iversen Present The Kim Iversen Present
Next Article Creator of the ‘4% rule’ for retirement withdrawals has contemporary recommendation for right this moment’s retirees Creator of the ‘4% rule’ for retirement withdrawals has contemporary recommendation for right this moment’s retirees
Leave a Comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

POPULAR

Foster Poultry Farms recollects almost 4 million kilos of hen corn canine because of wooden in batter
Politics

Foster Poultry Farms recollects almost 4 million kilos of hen corn canine because of wooden in batter

Dangerous Bunny celebrates Latin American illustration in response to Tremendous Bowl backlash
Investigative Reports

Dangerous Bunny celebrates Latin American illustration in response to Tremendous Bowl backlash

Firefly strengthens portfolio with 5 million deal for nationwide safety tech agency SciTec
Money

Firefly strengthens portfolio with $855 million deal for nationwide safety tech agency SciTec

Ravens undergo ‘full disappointment’ after 31-point loss, the second-worst defeat of John Harbaugh period
Sports

Ravens undergo ‘full disappointment’ after 31-point loss, the second-worst defeat of John Harbaugh period

English nurse secretly married Manchester synagogue terrorist Jihad Al-Shamie after changing to Islam: report
National & World

English nurse secretly married Manchester synagogue terrorist Jihad Al-Shamie after changing to Islam: report

Full transcript of “Face the Nation with Margaret Brennan,” Oct. 5, 2025
Politics

Full transcript of “Face the Nation with Margaret Brennan,” Oct. 5, 2025

Finest iPad to Purchase (and Some to Keep away from) in 2025
Technology

Finest iPad to Purchase (and Some to Keep away from) in 2025

You Might Also Like

Logitech Signature Photo voltaic Slim+ K980 Keyboard Evaluate: See the Mild
Technology

Logitech Signature Photo voltaic Slim+ K980 Keyboard Evaluate: See the Mild

This seemingly received’t be a difficulty for anybody who needs a sensible, easy keyboard, particularly should you do not care…

3 Min Read
Mac Mini Sale: Get Into MacOS for Much less Than 0 Right now
Technology

Mac Mini Sale: Get Into MacOS for Much less Than $500 Right now

Available in the market for a brand new MacOS-based desktop, however do not have loads of house to spare? Amazon…

3 Min Read
Greatest Natural Mattresses (2025): Birch, Avocado, Naturepedic, Extra
Technology

Greatest Natural Mattresses (2025): Birch, Avocado, Naturepedic, Extra

Natural bedding model Coyuchi not too long ago launched its personal natural mattress, combining cotton, wool, and Dunlop latex atop…

3 Min Read
Cindy Cohn Is Leaving the EFF, however Not the Combat for Digital Rights
Technology

Cindy Cohn Is Leaving the EFF, however Not the Combat for Digital Rights

After 1 / 4 century defending digital rights, Cindy Cohn introduced on Tuesday that she is stepping down as government…

4 Min Read
Madisony

We cover the stories that shape the world, from breaking global headlines to the insights behind them. Our mission is simple: deliver news you can rely on, fast and fact-checked.

Recent News

Foster Poultry Farms recollects almost 4 million kilos of hen corn canine because of wooden in batter
Foster Poultry Farms recollects almost 4 million kilos of hen corn canine because of wooden in batter
October 5, 2025
Dangerous Bunny celebrates Latin American illustration in response to Tremendous Bowl backlash
Dangerous Bunny celebrates Latin American illustration in response to Tremendous Bowl backlash
October 5, 2025
Firefly strengthens portfolio with 5 million deal for nationwide safety tech agency SciTec
Firefly strengthens portfolio with $855 million deal for nationwide safety tech agency SciTec
October 5, 2025

Trending News

Foster Poultry Farms recollects almost 4 million kilos of hen corn canine because of wooden in batter
Dangerous Bunny celebrates Latin American illustration in response to Tremendous Bowl backlash
Firefly strengthens portfolio with $855 million deal for nationwide safety tech agency SciTec
Ravens undergo ‘full disappointment’ after 31-point loss, the second-worst defeat of John Harbaugh period
English nurse secretly married Manchester synagogue terrorist Jihad Al-Shamie after changing to Islam: report
  • About Us
  • Privacy Policy
  • Terms Of Service
Reading: OpenAI’s GPT-5 rollout just isn’t going easily
Share

2025 © Madisony.com. All Rights Reserved.

Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?