Model routing is a fix for AI overspending. That's a problem for OpenAI and Anthropic

CNBC Top News

A new spending discipline is taking hold inside corporate America, as CFOs and boards start cracking down on inefficient artificial intelligence spending. The change has the potential to reshape the AI trade.

For the past two years, the playbook has been to default to the most powerful AI model and direct all queries through it, regardless of complexity. Now, with AI bills running far ahead of budgets, companies are starting to ask whether every task actually needs the frontier. Two leaders at the center of the AI buildout told CNBC this week that a solution is emerging: model routing.

What is model routing?

Routing is a tool that matches the job to the model, sending hard problems to the expensive frontier models and easy ones to cheaper, faster alternatives.

Scott Wu, CEO of Cognition, which makes the coding agent Devin, said the gains on routine work are enormous. For a lot of the boilerplate work, he said, companies can get five to 10 times better cost efficiency using models that are still good enough for the task.

Most companies today aren't routing at all. Glean CEO Arvind Jain has estimated that roughly 95% of enterprise AI usage is still running on the most expensive frontier models, even for tasks that cheaper alternatives could easily handle.

Wu gave the example of asking a model to name the third U.S. president. Each one, no matter how expensive, will tell you it was Thomas Jefferson. …
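
For readers who want to see the idea in concrete terms, the routing described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not how Cognition, Glean, or any vendor implements it: the model names, the relative prices, the keyword list, and the 0.5 threshold are all hypothetical placeholders, and production routers typically rely on a trained classifier rather than a keyword heuristic.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    relative_cost: float  # hypothetical relative price, for illustration only


# Hypothetical tiers: placeholder names and prices, not real products or rates.
CHEAP = Model("small-fast-model", 1.0)
FRONTIER = Model("frontier-model", 10.0)

# Assumed heuristic: words that hint at multi-step reasoning or coding work.
HARD_HINTS = {"prove", "refactor", "debug", "design", "analyze"}


def estimate_difficulty(prompt: str) -> float:
    """Return a crude difficulty score in [0, 1] from length and keyword hints."""
    words = re.findall(r"[a-z]+", prompt.lower())
    length_score = min(len(words) / 200, 1.0)  # long prompts score higher
    hint_score = 1.0 if HARD_HINTS.intersection(words) else 0.0
    return max(length_score, hint_score)


def route(prompt: str, threshold: float = 0.5) -> Model:
    """Send hard prompts to the frontier model and easy ones to the cheap one."""
    return FRONTIER if estimate_difficulty(prompt) >= threshold else CHEAP


if __name__ == "__main__":
    for prompt in (
        "Who was the third U.S. president?",
        "Debug this race condition and refactor the scheduler",
    ):
        print(f"{route(prompt).name}: {prompt}")
```

In this sketch, Wu's presidential trivia question goes to the cheaper tier, while a request to debug and refactor code goes to the frontier tier. The threshold is the knob a company would tune to trade cost against answer quality.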

Original source: CNBC Top News
