Analysis

You lost Fable 5. Don't buy three models in a trenchcoat.

Orchestration tools promise capabilities beyond any model you can still access. For most people that's a 4-7x tax to paper over one missing model — when a cheap open-weight model already gets you most of the way.

Quick summary

  • Three orchestration tools — Hermes MoA, OpenRouter Fusion, Sakana Fugu — promise to beat any single model you can access by combining several. Every headline number is self-reported.
  • Two of them cost more and run slower than calling a top model directly; Fugu's real-world quality lagged its benchmarks badly.
  • For most people who lost Fable 5, the smart default is the boring one: a strong, cheap, open model (GLM-5.2) — not three models stitched together.
HypedMoA / Fusion / Fugu
Our pickGLM-5.2
RuleSelf-reported until proven

The pitch writes itself — which is the problem

When Fable 5 went dark on June 12, the most-quoted line of the orchestration boom came from Nous Research's own launch post: 'The strongest models are gated and access is granted only to a select few.' The answer they, OpenRouter, and Sakana offer is to stitch together the models you can still reach and call the result a new, better model. It's seductive precisely because it's pitched at the moment you feel the loss most.

But 'capabilities beyond the publicly available frontier' is a claim, not a receipt. Mixture-of-Agents — fan a prompt out to several models, let an aggregator fuse the answers — has been in the literature since 2024. The 2026 products are real engineering, but the honest question isn't 'is this clever?' It's 'am I buying capability, or renting a 4-7x cost-and-latency markup to feel whole again?'

The contenders, side by side

Here's the honest read on each, before we get to the verdict:

ToolWhat it isBest forThe catch
Hermes MoASelf-hostable reference models + aggregator, as one "virtual model"Tinkerers who want to tune the quality/cost dialHeadline win is self-reported on an unreleased benchmark; more calls per query
OpenRouter FusionServer-side panel + judge synthesis, one API callRare high-stakes questions worth a paid second opinion~3x cost and 2-3x slower by OpenRouter's own number; its benchmark has no coding
Sakana FuguA trained orchestrator routing across a swappable frontier poolAlmost nobody yet — wait and seeSub-frontier in real use (~30-min waits), black-box routing, undisclosed cost
GLM-5.2 · our pickOpen-weight (MIT) single model, near Opus 4.8 on codingMost people who lost Fable 5 and want capability nowTrails Opus on ultra-long-horizon tasks; frontier-adjacent, not frontier

Read the full analysis — free

Sign in to unlock the full Analysis column — no password. You'll also get an alert the moment Fable 5 returns.

or use email instead

No password, no spam. We email about model status and what we ship. How we handle your data.

Need the short answer?

Fable 5 is back worldwide as of July 1 — but capped at 50% of your weekly limit until July 7. See the live status, or use GLM-5.2 or the new Sonnet 5 for cheaper work.

Read the brief Fabel 5 spelling guide

Related

Sources

This page is independent. Official provider pages are the source of record for access, pricing, and policy.