OpenAI introduces safety models other sites can use to classify harms

OpenAI CEO Sam Altman speaks to members of the media as he arrives at a lodge for the Allen & Co. Sun Valley Conference on July 8, 2025 in Sun Valley, Idaho.

Kevin Dietsch | Getty Images News | Getty Images

OpenAI on Wednesday announced two reasoning models that developers can use to classify a range of online safety harms on their platforms. 

The artificial intelligence models are called gpt-oss-safeguard-120b and gpt-oss-safeguard-20b, and their names reflect their sizes. They are fine-tuned, or adapted, versions of OpenAI’s gpt-oss models, which the company announced in August. 

OpenAI is introducing them as so-called open-weight models, which means their parameters, or the elements that improve the outputs and predictions during training, are publicly available. Open-weight models can offer transparency and control, but they are different from open-source models, whose full source code becomes available for users to customize and modify.

Organizations can configure the new models to their specific policy needs, OpenAI said. And because they are reasoning models that show their work, developers will have more direct insight into how the models arrive at a particular output.

For instance, a product reviews site could develop a policy and use gpt-oss-safeguard models to screen reviews that might be fake, OpenAI said. Similarly, a video game discussion forum could classify posts that discuss cheating.

OpenAI developed the models in partnership with Discord, SafetyKit and Robust Open Online Safety Tools, or ROOST, an organization dedicated to building safety infrastructure for AI. The models are initially available in a research preview, and OpenAI said it will seek feedback from researchers and members of the safety community.


The announcement could help OpenAI placate some critics who have accused the startup of commercializing and scaling too quickly at the expense of AI ethics and safety. The startup is valued at $500 billion, and its consumer chatbot, ChatGPT, has surpassed 800 million weekly active users. 

On Tuesday, OpenAI said it’s completed its recapitalization, cementing its structure as a nonprofit with a controlling stake in its for-profit business. OpenAI was founded in 2015 as a nonprofit lab, but has emerged as the most valuable U.S. tech startup in the years since releasing ChatGPT in late 2022.

“As AI becomes more powerful, safety tools and fundamental safety research must evolve just as fast, and they must be accessible to everyone,” ROOST President Camille François said in a statement.

Eligible users can download the model weights on Hugging Face, OpenAI said.

