Connect with us
LIVE

Business

OpenAI study: AI may be about to eclipse human expertise in real-world tasks

Published

on

Dizzying deal delirium: How the AI bubble bursts

Good morning. Rarely does a 29-page scholarly paper merit the attention of top-level executives, but every business leader should be familiar with a recent study from OpenAI. It’s the best description yet of how AI can handle real-world tasks, showing which AI models are excelling, and hinting at what it all means for humans in the years ahead. The paper can be heavy going, but you can get a masterful summary from our AI Editor, Jeremy Kahn.

For leaders, three points stand out:

The study is highly realistic. It examined 44 occupations and 1,320 specialized tasks required by those occupations. For example: the final testing step in manufacturing a cable spooling truck for underground mining operations. Appropriate professionals (average experience: 14 years) vetted the tasks, all of which are elements of actual work deliverables. Previous research has almost always focused on less realistic tests. The AI results were graded by expert humans who didn’t know if they were looking at work from AI or from an expert human professional.

The best models are already nearly as good as human industry experts. The study examined seven AI models from Open AI, Google’s Gemini, xAI’s Grok, and Anthropic’s Claude. The clear winner was Claude Opus 4.1, which came within a few percentage points of reaching parity with human industry experts. The best models also completed tasks about 100 times faster and 100 times cheaper than the industry experts, though the comparisons ignore “the human oversight, iteration, and integration steps required in real workplace settings,” OpenAI says.

The models are improving at a galloping pace. For example, as OpenAI’s models improved, the percentage of their task outputs that were as good as or better than humans’ outputs more than tripled. If that rate continues—a big if—OpenAI would be better at these real-world tasks than humans overall in a few months. At least some AI competitors could well be on similar trajectories.

The pace of change described in this new research may be the hardest challenge for business leaders. Consider the two-year cycle of Moore’s Law, which changed the world and inspired new corporate giants while dooming others. In retrospect, those were the days. John Chambers, who ran Cisco through the internet frenzy and its crash, said recently that 50% of executives “won’t have the skills to adjust to this new innovation economy driven by AI because they were trained to move at the speed of a five-year cycle as opposed to a 12-month cycle.” His warning to leaders is worth remembering: “With the speed the market is moving at now, you have to be able to reinvent yourself, which most CEOs and business leaders don’t know how to do—especially with AI.”—Geoff Colvin

Contact CEO Daily via Diane Brady at diane.brady@fortune.com

Top news

Israeli government approves Gaza deal as troops pull out

The IDF now has 24 hours to retreat to an agreed-upon line and Hamas has 72 hours to release all Israeli hostages. So far, events are going as planned and the mood is upbeat on both sides. Live coverage from the BBC here.

China places export controls on rare earth minerals

Advertisement

The new rules curb the supply chain for the semiconductors that are used in phones, computers, AI data centers, cars, solar panels, and other IT kit. China has a virtual monopoly in rare earths.

NY Attorney General indicted

Laetitia James is charged with bank fraud and making false statements. The prosecution is part of President Trump’s retribution plan: It was James who secured a $367 million fine against Trump in a civil suit (the fine was later reversed).

Make Argentina Great Again

Yes, the U.S. is bailing out Argentina. Treasury Secretary Scott Bessent confirmed that the Treasury has bought pesos to support the government of Trump ally President Javier Milei. The U.S. is also providing a $20 billion swap line to Argentina. (A swap line allows central banks to exchange fixed amounts of currency on the understanding that the swap will be reversed later and interest will be paid on the repaid currency.)

Moody’s chief economist: roughly half of U.S. states are contracting economically

Moody’s Analytics chief economist Mark Zandi exclusively told Fortune that nearly half of U.S. states are seeing their economies contract—and only 16 are experiencing growth. Zandi also noted that lower-income households are ““hanging on by their fingertips financially…and their world is going into recession pretty quickly.”

KPMG survey identifies quarter where AI sentiment changed

A new KPMG survey of 130 business leaders in companies making more than $1 billion annually found that the adoption of agentic AI technology has quadrupled in the past six months. A principle and aIQ program lead at the company told Fortune that the most recent quarter was when the “fear factor” surrounding the technology faded, leading to what she describes as “cognitive fatigue.”

Google restricts WFH to just 4 days per year

Advertisement

Google’s previous policy was to allow staff to work from anywhere for up to four weeks per year. The new rule says that a single WFH day will now count as an entire week. 

Federal workers will get back pay

U.S. House Speaker Mike Johnson said furloughed federal workers will get the wages they are owed once the shutdown ends.

The markets

S&P 500 futures were up 0.14% this morning. The index closed down 0.28% in its last session. STOXX Europe 600 was flat in early trading. The U.K.’s FTSE 100 was down 0.14% in early trading. Japan’s Nikkei 225 was down 1.01%. China’s CSI 300 was down 1.97%. The South Korea KOSPI was up 1.73%. India’s Nifty 50 was up 0.51% before the end of the session. Bitcoin held at $121.4K.

Around the watercooler

$1.8 trillion deficit revealed during ‘pointless and wasteful government shutdown,’ budget watchdog says by Nick Lichtenberg

Battle over Elon Musk’s trillionaire pay package builds as pension funds face off against Tesla by Amanda Gerut

You’re 10 times more likely to have a flight delay during the government shutdown, Transportation Secretary says: ‘These controllers are stressed out’ by Sydney Lake

California’s ‘impossible’ dream of ending fossil fuels isn’t working, and now it’s looking at price spikes and shortages by Jordan Blum

From WhatsApp friends to a $500 million–plus valuation: These founders argue their tiny AI models are better for customers and the planet by Vivienne Walt

CEO Daily is compiled and edited by Joey Abrams and Jim Edwards.

Advertisement
This is the web version of CEO Daily, a newsletter of must-read global insights from CEOs and industry leaders. Sign up to get it delivered free to your inbox.

Source link

Title

This industrial giant is emerging as a big AI play, says Wells Fargo This industrial giant is emerging as a big AI play, says Wells Fargo
Crypto6 months ago

This industrial giant is emerging as a big AI play, says Wells Fargo

  Wells Fargo sees Caterpillar continuing to roar higher, emerging as an artificial intelligence play. The bank initiated shares of...

Novo Nordisk's strategy tested as investors push back on board revamp Novo Nordisk's strategy tested as investors push back on board revamp
Crypto6 months ago

Novo Nordisk’s strategy tested as investors push back on board revamp

    Flags with the logos of Danish drugmaker Novo Nordisk, maker of the blockbuster diabetes and weight-loss treatments Ozempic...

Alibaba plans AI subscriptions, stablecoin-like payments with JPMorgan Alibaba plans AI subscriptions, stablecoin-like payments with JPMorgan
Crypto6 months ago

Alibaba plans AI subscriptions, stablecoin-like payments with JPMorgan

  Key Points Alibaba plans to use “tokenization” of payments for cross-border transactions in its business-to-business arm. Kuo Zhang, president...

Abraham Lincoln set off an education revolution in 1862 with the Land Grant Act. We need the same thing today for AI Abraham Lincoln set off an education revolution in 1862 with the Land Grant Act. We need the same thing today for AI
Crypto6 months ago

UK borrowing costs spike on report government to scrap plans to raise income tax

    Rachel Reeves, U.K. chancellor of the exchequer, delivers a speech in London, UK, on Tuesday, Nov. 4, 2025. Bloomberg...

An Indonesian Unicorn's Vision For Digital Payments An Indonesian Unicorn's Vision For Digital Payments
Crypto6 months ago

Trump’s threatened the BBC with a $1B lawsuit: Here’s what’s going on

    US President Donald Trump speaks to reporters as he arrives at Palm Beach International Airport on Oct. 31,...

We're downgrading a portfolio stock. Plus, what's causing the market's rally We're downgrading a portfolio stock. Plus, what's causing the market's rally
Crypto6 months ago

UBS’s picks for global returns next year

  Investors looking for global diversification opportunities should look to a specific subset of stocks in Europe, according to UBS...

Nvidia will soar nearly 75%, says Loop Capital Nvidia will soar nearly 75%, says Loop Capital
News6 months ago

AI companies admit they’re worried about a bubble

    Eakarat Buanoi | Istock | Getty Images LISBON, Portugal — Top tech executives told CNBC they’re concerned about...

CEO Southeast Asia's top bank DBS says AI adoption already paying off CEO Southeast Asia's top bank DBS says AI adoption already paying off
News6 months ago

CEO Southeast Asia’s top bank DBS says AI adoption already paying off

Tan Su Shan, deputy chief executive officer and managing director of institutional banking at DBS Group Holdings Ltd., speaks during...

China's economic slowdown deepens in October as housing slump worsens and investments shrink more than expected China's economic slowdown deepens in October as housing slump worsens and investments shrink more than expected
News6 months ago

China’s economic slowdown deepens in October as housing slump worsens and investments shrink more than expected

CHENGDU, CHINA – OCTOBER 18: People walk past the Louis Vuitton store at Taikoo Li, a high-end shopping area that...

U.S. to remove tariffs on some products from Ecuador, Argentina, Guatemala and El Salvador U.S. to remove tariffs on some products from Ecuador, Argentina, Guatemala and El Salvador
News6 months ago

U.S. to remove tariffs on some products from Ecuador, Argentina, Guatemala and El Salvador

The United States said Thursday it will remove tariffs on some foods and other imports from Argentina, Ecuador, Guatemala and...

Advertisement