Never worry about provider outages again. The AI Gateway automatically routes your requests to the best available provider, with instant failover when things go wrong.
Currently, only BYOK (Bring Your Own Keys) and passthrough routing are supported. Passthrough Billing (PTB) is coming soon.

The Problem

Using LLMs in production means dealing with:
  • Provider outages that break your app
  • Rate limits that block your users
  • Regional restrictions that limit availability
  • Vendor lock-in that prevents optimization

The Solution

Provider routing gives you access to the same model across multiple providers. When OpenAI goes down, your app automatically switches to Azure or AWS Bedrock. When you hit rate limits, traffic flows to another provider. All without changing your code.

How It Works

1. You request a model - Your app asks for gpt-4o-mini just like normal.
2. Gateway finds providers - The gateway consults the Model Registry to find all providers offering this model.
3. Smart routing - The gateway applies its sorting algorithm (cheapest first), then attempts providers in order.
4. Automatic failover - If a provider fails, the gateway instantly tries the next one.

The result? Your request succeeds even when providers fail.
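The failover step above can be sketched as a simple loop. This is an illustrative sketch, not the gateway's actual implementation; the `tryProvider` callback and result shape are assumptions:

```typescript
// Hypothetical sketch of the gateway's failover loop.
type Provider = string;

interface ProviderResult {
  ok: boolean;
  body?: string;
}

// Try each candidate provider in order until one succeeds.
function routeWithFailover(
  providers: Provider[],
  tryProvider: (p: Provider) => ProviderResult
): ProviderResult & { provider?: Provider } {
  for (const provider of providers) {
    const result = tryProvider(provider);
    // Success: return immediately, recording which provider served it.
    if (result.ok) return { ...result, provider };
    // Failure: fall through to the next provider in the list.
  }
  // Every provider failed.
  return { ok: false };
}
```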

Requesting a Model

The simplest approach lets the gateway handle everything:
// Just specify the model - gateway handles the rest
const response = await client.chat.completions.create({
  model: "gpt-4o-mini",
  messages: [{ role: "user", content: "Hello!" }]
});

// Behind the scenes, the gateway tries:
// OpenAI → Azure OpenAI → AWS Bedrock → Others
// Until one succeeds
The gateway only tries providers where you’ve configured API keys. See Provider Setup to add your keys.

Routing Options

Format: model: "gpt-4o-mini"

Best for:
  • Maximum uptime in production
  • Automatic cost optimization
  • Zero-config reliability

How it works: The gateway tries ALL providers offering this model (sorted by cheapest first). Requests almost never fail, because if one provider is down, another takes over instantly.

Example scenario: Your production chat app needs to stay online no matter what. You don't care which provider serves the request as long as it works.

How the Gateway Finds Models

The Model Registry is our source of truth for which providers support which models. This powers intelligent routing.

Two Ways to Access Models

Option 1: Passthrough Billing (PTB) - Coming Soon

Use Helicone’s API keys in supported regions. Zero configuration required.
model: "gpt-4o-mini"  // Will automatically use Helicone's keys
Available in: Major US/EU regions for popular providers (coming soon)

Option 2: Your Own Keys (BYOK)

Add your provider keys in Provider Settings. The gateway uses YOUR keys for all requests.
When you add a provider deployment, ALL models and regions that provider supports become available through your deployment. PTB fallback for reliability is coming soon.
Example: You add an Azure deployment in Brazil. When you request any Azure-supported model:
model: "gpt-4o-mini"  // Uses your Brazil deployment
The gateway uses your configured deployment for all requests.

Passthrough Routing (Unknown Models)

The gateway forwards ANY model/provider combination, even if not in our registry:
// Brand new model
model: "o3-preview/openai"

// Custom fine-tuned model
model: "ft:gpt-3.5-turbo:my-org::abc123/openai"
Important: Unknown models ONLY route through YOUR deployments (BYOK). The provider’s API determines if the model is valid. Cost tracking is best-effort for unknown models.
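The model strings above follow a "model/provider" shape (with an optional deployment segment, shown later in the EU compliance example). A hypothetical parser, assuming a simple split on "/"; the gateway's real parsing rules may differ:

```typescript
// Hypothetical parser for the "model[/provider[/deployment]]" string
// format (illustrative only, not the gateway's actual code).
interface ModelTarget {
  model: string;
  provider?: string;
  deployment?: string;
}

function parseModelString(raw: string): ModelTarget {
  // The model name itself contains no "/", so the first segment is the
  // model, the second (if present) the provider, the third a deployment.
  const [model, provider, deployment] = raw.split("/");
  return { model, provider, deployment };
}
```

Note that fine-tuned model IDs like `ft:gpt-3.5-turbo:my-org::abc123` use `:` internally, so splitting on `/` leaves them intact.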

Smart Routing Algorithm

When multiple deployments are available, the gateway intelligently selects which to use:

Routing Priority

  1. Your deployments (BYOK) - Always tried first
  2. PTB endpoints - Automatic fallback for reliability (coming soon)

Selection Logic

Within each priority level, we:
  1. Sort by cost - Cheapest deployments first
  2. Load balance - If costs are equal or unknown, randomly distribute requests
Example with multiple Azure deployments: You have Azure Brazil + Azure US deployments. Routing order:
  1. Your cheapest deployment (e.g., Brazil if cheaper)
  2. Your other deployments (e.g., US)
  3. Helicone PTB endpoints (coming soon)
This ensures optimal cost while maintaining reliability.
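The selection logic above can be sketched as an ordering function: BYOK deployments before PTB, cheapest first within each tier, and a random tiebreak for load balancing. The `Deployment` shape and field names here are assumptions for illustration:

```typescript
// Illustrative sketch of the routing order described above; not the
// gateway's real types or implementation.
interface Deployment {
  name: string;
  byok: boolean;          // your own keys vs. Helicone PTB
  costPerToken?: number;  // unknown cost sorts last within its tier
}

function orderDeployments(
  deployments: Deployment[],
  random: () => number = Math.random
): Deployment[] {
  // Attach a random value per deployment so equal-cost deployments
  // are shuffled (this is the load-balancing step).
  return deployments
    .map((d) => ({ d, r: random() }))
    .sort((a, b) => {
      // 1. BYOK tier (0) always comes before PTB tier (1).
      const tierA = a.d.byok ? 0 : 1;
      const tierB = b.d.byok ? 0 : 1;
      if (tierA !== tierB) return tierA - tierB;
      // 2. Within a tier, cheapest first; unknown cost sorts last.
      const costA = a.d.costPerToken ?? Infinity;
      const costB = b.d.costPerToken ?? Infinity;
      if (costA !== costB) return costA - costB;
      // 3. Equal (or both unknown) cost: random tiebreak.
      return a.r - b.r;
    })
    .map((x) => x.d);
}
```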

Failover Triggers

The gateway automatically tries the next provider when encountering these errors:
  • 429: Rate limit errors
  • 401: Authentication errors
  • 400: Context length errors
  • 408: Timeout errors
  • 500+: Server errors
The gateway only attempts providers where you have configured API keys (BYOK). When Passthrough Billing (PTB) launches, the gateway will automatically try Helicone's API keys as a fallback.
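The trigger list above amounts to a status-code check; a minimal sketch (a hypothetical helper, not the gateway's actual code):

```typescript
// Sketch of the failover-trigger check: returns true for the HTTP
// status codes listed above, false for everything else.
function shouldFailover(status: number): boolean {
  // 429 rate limit, 401 auth, 400 context length, 408 timeout.
  const retryable = new Set([429, 401, 400, 408]);
  // Any 5xx server error also triggers failover.
  return retryable.has(status) || status >= 500;
}
```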

Real World Examples

Scenario: OpenAI Outage

Your production app uses gpt-4o-mini. OpenAI goes down at 3am.
// Your code doesn't change
const response = await client.chat.completions.create({
  model: "gpt-4o-mini",
  messages: [{ role: "user", content: "Process this customer request" }]
});
What happens: Gateway automatically fails over to Azure OpenAI, then AWS Bedrock if needed. Your app stays online, customers never notice.

Scenario: Using Azure Credits

Your company has $100k in Azure credits to burn before year-end.
// Prioritize Azure but keep fallback for reliability
const response = await client.chat.completions.create({
  model: "gpt-4o-mini/azure,gpt-4o-mini",  
  messages: messages
});
What happens: Tries your Azure deployment first (using credits), but falls back to other providers if Azure fails. Balances credit usage with reliability.
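The comma-separated model string above reads as an ordered fallback chain: try the pinned Azure target first, then the unpinned model. A hypothetical sketch of that parsing, assuming the same split rules as the single-target format:

```typescript
// Hypothetical parsing of a comma-separated fallback chain like
// "gpt-4o-mini/azure,gpt-4o-mini" into ordered routing targets
// (illustrative only; the gateway's real rules may differ).
interface ChainTarget {
  model: string;
  provider?: string; // undefined means "any provider"
}

function parseFallbackChain(raw: string): ChainTarget[] {
  return raw.split(",").map((entry) => {
    const [model, provider] = entry.trim().split("/");
    return { model, provider };
  });
}
```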

Scenario: EU Compliance Requirements

GDPR requires EU customer data to stay in EU regions.
// Use your custom EU deployment
await client.chat.completions.create({
  model: "gpt-4o/azure/eu-frankfurt-deployment",  // Your CUID
  messages: messages
});
What happens: Requests ONLY go through your Frankfurt deployment. No data leaves the EU.

Next Steps