A new flagship model promises agent-grade reasoning at a fraction of last year’s cost, reshaping what is economical to automate.
Why it matters
Lower latency and price-per-token make multi-step agents viable for workloads that were previously too expensive to run at scale.
What to watch
Expect rivals to respond within weeks; the real test is reliability on long tool-use chains, not benchmark scores.