Subscribe to Our Newsletter

Success! Now Check Your Email

To complete Subscribe, click the confirmation link in your inbox. If it doesn’t arrive within 3 minutes, check your spam folder.

Ok, Thanks

AI race heats up as GPT 5.5 arrives: OpenAI narrows the gap on efficiency and cost with rivals

The latest model scores 91.7 on internal benchmarks and cuts reasoning token usage by half, but questions remain over pricing and profitability as the AI arms race intensifies

Ian Lyall profile image
by Ian Lyall
AI race heats up as GPT 5.5 arrives: OpenAI narrows the gap on efficiency and cost with rivals
Photo by Jonathan Chng / Unsplash

OpenAI has released GPT 5.5 as a research preview, recording one of the highest scores ever seen on the company's internal benchmarks at 91.7. The model has performed strongly across transactional and litigation-focused legal work and code generation, with OpenAI describing it as its strongest agentic coding model to date.

The release signals that the performance gap between GPT and rival models from Anthropic and Google DeepMind is narrowing, although the model's wider impact will depend on when it becomes available through the API.

Quality per dollar becomes the metric that matters

For the past two years the AI industry has prioritised raw quality over latency and cost. That is starting to change. GPT 5.5 has shown a 50 per cent reduction in reasoning tokens in some experiments, a shift that reflects growing pressure on labs to deliver performance gains without spiralling compute bills.

As companies mature, the focus is moving towards quality per dollar spent and quality per token used. The efficiency gains in GPT 5.5 suggest OpenAI recognises that brute-force scaling alone will not sustain the economics of frontier AI.

The profitability question looms

The broader concern across the industry is what happens when companies such as OpenAI begin charging prices that reflect their true costs. The current generation of AI services is heavily subsidised, and when that changes, application-layer companies may struggle to absorb the expense.

This is particularly relevant for vertical AI firms such as Harvey, the legal technology company that has built its business on top of frontier models. Nico Grupen, head of applied research at Harvey, has acknowledged the challenge of differentiating against a company like Anthropic that is moving directly into verticals including legal services.

Watch what labs say, not just what they ship

There is a pattern worth tracking in how AI labs communicate. When a company sits at the top of the benchmark rankings, its press releases emphasise model capabilities. When it falls behind, the messaging shifts towards product features and applications.

This oscillation is visible across OpenAI, Anthropic and Google DeepMind. It offers a useful lens for reading corporate announcements, particularly during the six to eight week gaps between major model launches when labs are working through post-training cycles.

Competition drives the pace

The expectation remains that the major labs will continue pushing the boundaries of model intelligence. Consumer demand and competitive pressure leave little room for any of them to ease off, even as the economics of the business remain unresolved.

Google DeepMind's Mythos model is also being watched closely, though it remains unclear whether it will be made publicly available. Some observers have raised security concerns about a potential release, adding another layer of complexity to an industry that is simultaneously racing to build more powerful systems and grappling with how to deploy them responsibly.

The release of GPT 5.5 confirms that the frontier is still advancing. The question now is whether the business models underpinning that advance can keep pace with the technology.

Ian Lyall profile image
by Ian Lyall

Explore stories