Analysis

The model is becoming the agent

Frontier models now orchestrate their own sub-agents and run long-horizon work natively — absorbing the easy 80% of what agent frameworks were for. But the week a US order switched Fable 5 off in 90 minutes is the best argument yet for the layer the labs can't sell you.

Quick summary

  • Frontier models now orchestrate their own sub-agents natively — absorbing the easy part of what agent frameworks did.
  • But the labs also sell orchestration as a product, and benchmark scores don't equal production reliability — so the framework layer isn't dying, it's splitting into commodity convenience (absorbed) and resilience plus governance (more valuable).
  • Fable 5 being switched off by a government order in about 90 minutes is the sharpest argument for owning the multi-vendor resilience layer yourself.
ThesisThe model is the agent
ButReliability ≠ benchmarks
So buildThe resilience layer

The orchestration layer is moving inside the model

On June 26 OpenAI previewed GPT-5.6 — three models, Sol, Terra, and Luna — and the headline isn't raw IQ, it's structure. Sol ships a 'max' mode for deeper reasoning and an 'ultra' mode that, in OpenAI's own framing, spawns sub-agents to fan a task across parallel workers instead of grinding through it as one process. The model doesn't just think harder; it delegates to copies of itself.

It isn't a one-off. Anthropic's Opus 4.6 already breaks complex tasks into independent sub-tasks and runs tools and sub-agents in parallel as a native feature; Google's Gemini line is explicitly tuned for long-horizon agentic work that holds a plan across thousands of steps. The things you used to need a framework for — task decomposition, parallel fan-out, a planner that keeps state across many steps — are becoming load-bearing features of the model itself. If your agent product's core pitch was 'we'll split the work and run the steps for you,' the model just learned to do that for free.

Even the 'virtual model' is now a product

The clearest signal comes from the open-source side. Around June 26–27, Nous Research shipped Mixture of Agents in Hermes Agent: you compose several models — even across different providers — into a named preset that Hermes then treats as a single, selectable model. Reference models run first and feed private analysis to an 'aggregator' that writes the actual answer and makes the tool calls. A small orchestration graph, collapsed into one virtual model you pick from a dropdown.

Nous's own numbers — self-reported, on a benchmark they haven't released yet — claim these presets beat the individual models they're built from. Take the figures or leave them; the architecture is the point. The unit of consumption is shifting from 'a model' to 'an orchestrated bundle that behaves like a model.' Which makes the tempting conclusion — 'agent frameworks are dead' — far too glib.

Read the full analysis — free

Sign in to unlock the full Analysis column — no password. You'll also get an alert the moment Fable 5 returns.

or use email instead

No password, no spam. We email about model status and what we ship. How we handle your data.

Need the short answer?

Fable 5 is back worldwide as of July 1 — but capped at 50% of your weekly limit until July 7. See the live status, or use GLM-5.2 or the new Sonnet 5 for cheaper work.

Read the brief Fabel 5 spelling guide

FAQ

Does this mean I should stop using agent frameworks?

Stop building the parts the model now does for you — task decomposition and parallel step-running. Keep and invest in the parts it can't be: multi-vendor routing, fallback, durable state, governance, and accountability.

What's the one concrete action?

Make sure you can fail over to a second vendor's model. The Fable 5 shutdown showed a top model can disappear in minutes for non-technical reasons; single-vendor dependence is now an operational risk, not a footnote.

Related

Sources

This page is independent. Official provider pages are the source of record for access, pricing, and policy.