OpenAI engineers claim to discover way to cut inference costs in half

Lily Hess

, DIGITIMES, Taipei

Jul 1, 2026, 15:58 0

Credit: AFP

OpenAI engineers claim to have figured out a way to halve the costs of inference using its models, according to The Information. The development comes as AI model developers are seeking to raise their models' token efficiency during a time when...

The article requires paid subscription. Subscribe Now

Email address

Password

Keep me signed in

Keep me signed in

Some subscribers prefer to save their log-in information so they do not have to enter their User ID and Password each time they visit the site. To activate this function, check the 'Keep me signed in' box in the log-in section. This will save the password on the computer you're using to access the site.

Note: If you choose to use the log-out feature, you will lose your saved information. This means you will be required to log-in the next time you visit our site.

Enterprise first-time login?

Forgot your password?

Create your free account

Select premium stories & daily editor picks.
Leverage AI summaries for instant insights.
Receive tech briefings & newsletters.
Track financials & stock data of Taiwan tech.

No credit card required

BIZ FOCUS

Jun 1, 08:00

LITEON Showcases AI at COMPUTEX Panel Featuring NVIDIA, Infineon, GIGABYTE

MOST-READ
7 DAYS NEWS

Full list