Route requests based on CEL expressions and request metadata
Conditional routing allows you to route requests to different models based on conditions that evaluate request metadata.You can use conditional routing to:
Segment by user tier — route premium users to flagship models and free users to cost-effective ones.
Route by geography — direct requests to different models based on geography.
Optimize by complexity — send simple prompts to fast, cheap models and complex ones to high-capability models.
The condition field accepts a CEL expression that is evaluated based on request metadata from extra_body.metadata when calling the Chat Completions API.
Routes are evaluated in order, and the first route whose expression evaluates to true is selected.
Below are some available operations:
Operation
Example
Description
Equality
tier == "premium"
Matches a specific value.
Comparison
request_count > 100
Numeric comparisons (>, <, >=, <=).
Logical AND
tier == "premium" && region == "us-east"
Both conditions must be true.
Logical OR
tier == "premium" || tier == "enterprise"
Either condition must be true.
Range check
score >= 80 && score <= 100
Matches values within a range.
startsWith()
user_id.startsWith("enterprise-")
Checks if a string starts with a prefix.
endsWith()
user_id.endsWith("-admin")
Checks if a string ends with a suffix.
contains()
user_id.contains("test")
Checks if a string contains a substring.
size()
user_id.size() > 10
Gets the length of a string.
Expressions are typed — ensure your metadata types match. Use count == 100 when the value is a number and tier == "premium" when it’s a string. Mismatched types will not match.
When calling your router, pass metadata in your requests using extra_body.metadata to provide the values referenced by your conditions, enabling dynamic route selection.
In this example, the provided metadata (tier, user_id, and region) will be available for the router’s CEL expressions, enabling conditional routes to match and select the appropriate model configuration based on those values.
Route requests to different model tiers based on prompt complexity. Use a lightweight classifier in your application to score complexity, then let the router pick the right model: