How-To Series · Episode 52 / 59 · Module 8: Provider Plumbing

Hermes · Provider Routing

Same model, many sub-providers. On OpenRouter, you pick the priority: cost, speed, or quality.

After this videoYou can now save money by routing OpenRouter requests for cost, speed, or quality.

OpenRouter doesn't run models itself, it farms requests to many underlying providers (Anthropic, Google, Bedrock, Together). Provider routing controls which of them handle your requests and how they rank. The main knob is sort: price (cheapest), throughput (fastest tokens/sec), or latency (quickest first token). Then get specific with only (whitelist), ignore (blacklist), and order (preferred, then fall back). Two guards worth setting: require_parameters: true (no silent drops of tools/temperature) and data_collection: "deny" (prompts stay out of training). It applies only on OpenRouter, and it's sub-provider routing, not the same as fallback providers switching to a different provider on failure.

About these resources. Every setting comes from the Provider Routing doc.

Sources · What this video distills

1 docs page · every command below traces to one of them
Primary · OpenRouter sub-provider routing, sort, only/ignore/order, require_parameters, data_collection
Provider Routing
Read ↗

Commands shown · Copy and paste

each shows the source doc it came from
Config (config.yaml)from source ↗
provider_routing: { sort: "price", only: ["Anthropic","Google"], ignore: ["Together"], require_parameters: true, data_collection: "deny" }
Sort optionsfrom source ↗
price · throughput · latency

Going deeper · Related Hermes docs

further reading · not sources of facts shown above

Next in the series · Episodes that build on this

E53
Fallback Providers
E54
Credential Pools
E50
The Provider Landscape