Models

The Models page is where you register, edit, and remove model definitions. Each model maps a model ID to a provider and defines its capabilities and pricing, which the gateway uses for routing, cost tracking, and budget enforcement.

Model List

Models are displayed in a paginated, searchable data table with the following columns:

Column          Description
Model ID        The identifier clients use in requests (e.g. gpt-4-turbo, claude-3-opus)
Provider        The provider that serves this model
Capabilities    Tags indicating what the model supports
Context Window  Maximum token context length
Pricing         Input and output cost per million tokens (inputPerMillionTokens / outputPerMillionTokens)
Status          active, inactive, or deprecated

Use the search bar to filter by model ID or provider name. The table supports sorting on Model ID, Provider, Context Window, and Status.
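
The search and sort behavior described above can be sketched as follows. This is an illustration only: the ModelRow shape, the function names, and the case-insensitive substring matching are assumptions, not the gateway's actual implementation.

```typescript
// Hypothetical row shape for the table; field names are assumptions.
interface ModelRow {
  modelId: string;
  provider: string;
  contextWindow: number;
  status: "active" | "inactive" | "deprecated";
}

// The four sortable columns named in the text.
type SortKey = "modelId" | "provider" | "contextWindow" | "status";

// Keep rows whose model ID or provider name contains the query.
// Case-insensitive substring matching is an assumption.
function filterModels(rows: ModelRow[], query: string): ModelRow[] {
  const q = query.trim().toLowerCase();
  if (q === "") return rows;
  return rows.filter(
    (r) =>
      r.modelId.toLowerCase().includes(q) ||
      r.provider.toLowerCase().includes(q),
  );
}

// Return a sorted copy; numeric columns compare numerically,
// text columns compare with localeCompare.
function sortModels(
  rows: ModelRow[],
  key: SortKey,
  ascending: boolean = true,
): ModelRow[] {
  const sign = ascending ? 1 : -1;
  return [...rows].sort((a, b) => {
    const x = a[key];
    const y = b[key];
    if (typeof x === "number" && typeof y === "number") {
      return sign * (x - y);
    }
    return sign * String(x).localeCompare(String(y));
  });
}
```

Sorting a copy rather than the input keeps the underlying list stable when the user clears the sort.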

Adding a Model

Click Add Model and fill in the form:

  • Model ID — The identifier that clients will reference (e.g. gpt-4-turbo). This cannot be changed after creation.
  • Provider — Select from the list of configured providers.
  • Context Window — Maximum token context length (defaults to 4096).
  • Status — Set to active to make the model available immediately.
  • Input Price — Cost per million input tokens. (Pricing is stored on the model as pricing.inputPerMillionTokens, matching the built-in catalog in @ai-gateway/shared.)
  • Output Price — Cost per million output tokens (pricing.outputPerMillionTokens).
  • Capabilities — Check all that apply.
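
As a rough sketch, the form fields above map onto a model record like the one below. Only the pricing field names (pricing.inputPerMillionTokens, pricing.outputPerMillionTokens), the status and capability values, and the 4096 default come from this page. The type names, the createModel helper, and the validation rules are hypothetical.

```typescript
type Capability =
  | "chat" | "completion" | "embedding" | "image"
  | "audio" | "function_calling" | "vision" | "streaming";

type ModelStatus = "active" | "inactive" | "deprecated";

// Hypothetical record shape; modelId is readonly because it cannot
// be changed after creation.
interface ModelDefinition {
  readonly modelId: string;
  provider: string;
  contextWindow: number;
  status: ModelStatus;
  pricing: { inputPerMillionTokens: number; outputPerMillionTokens: number };
  capabilities: Capability[];
}

// Form input: contextWindow is optional and falls back to 4096.
type NewModelInput = Omit<ModelDefinition, "contextWindow"> & {
  contextWindow?: number;
};

// Hypothetical helper that applies the default and basic sanity checks.
// The checks themselves are assumptions, not documented gateway rules.
function createModel(input: NewModelInput): ModelDefinition {
  if (input.modelId.trim() === "") {
    throw new Error("modelId is required");
  }
  const { inputPerMillionTokens, outputPerMillionTokens } = input.pricing;
  if (inputPerMillionTokens < 0 || outputPerMillionTokens < 0) {
    throw new Error("prices must be non-negative");
  }
  const contextWindow = input.contextWindow ?? 4096;
  if (!Number.isInteger(contextWindow) || contextWindow <= 0) {
    throw new Error("contextWindow must be a positive integer");
  }
  return { ...input, contextWindow };
}
```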

Capabilities

Each model can declare one or more capabilities, which the gateway uses for routing decisions:

Capability        Description
chat              Conversational completions (messages API)
completion        Text completions
embedding         Vector embeddings
image             Image generation
audio             Audio transcription, translation, or speech
function_calling  Tool/function call support
vision            Image understanding in prompts
streaming         Server-sent event streaming

Beyond routing, these tags also drive cost tracking, model intelligence, and display. (The built-in model catalog in @ai-gateway/shared carries a richer tag set, including prompt-caching, reasoning, code-execution, search-grounding, and computer-use, in addition to the ones selectable here.) Accurate tagging keeps cost and routing decisions correct.
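
To illustrate why accurate tagging matters, here is a minimal sketch of capability-based candidate selection. The TaggedModel shape and the candidatesFor function are hypothetical, and the gateway's real routing logic may weigh other factors.

```typescript
type Capability =
  | "chat" | "completion" | "embedding" | "image"
  | "audio" | "function_calling" | "vision" | "streaming";

// Hypothetical minimal model shape for this example.
interface TaggedModel {
  modelId: string;
  capabilities: Capability[];
}

// Keep only models that declare every capability the request needs.
function candidatesFor(
  models: TaggedModel[],
  required: Capability[],
): TaggedModel[] {
  return models.filter((m) =>
    required.every((cap) => m.capabilities.includes(cap)),
  );
}
```

Under this sketch, a model that supports vision but is not tagged with it would never be selected for a request that needs image input, which is the kind of silent misrouting that accurate tags prevent.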

Editing a Model

Click Edit on any model row to open the edit modal. All fields except Model ID can be changed. Common operations include:

  • Updating pricing when providers change their rates
  • Marking a model as deprecated to signal phase-out, so that routing strategies may deprioritize it (set it to inactive to remove it from routing entirely)
  • Adding or removing capabilities
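
The rule that every field except Model ID can be edited can be expressed as a patch type. This is a sketch under assumed names (ModelDefinition, ModelPatch, applyEdit), not the gateway's actual API.

```typescript
type ModelStatus = "active" | "inactive" | "deprecated";

// Hypothetical record shape; only the pricing field names come from the docs.
interface ModelDefinition {
  readonly modelId: string;
  provider: string;
  contextWindow: number;
  status: ModelStatus;
  pricing: { inputPerMillionTokens: number; outputPerMillionTokens: number };
  capabilities: string[];
}

// Every field except modelId may appear in an edit.
type ModelPatch = Partial<Omit<ModelDefinition, "modelId">>;

// Return an updated copy. modelId is re-applied last so that a stray
// modelId in an untyped payload cannot overwrite the original.
function applyEdit(model: ModelDefinition, patch: ModelPatch): ModelDefinition {
  return { ...model, ...patch, modelId: model.modelId };
}
```

For example, a rate change would pass only a new pricing object, and a phase-out would pass only { status: "deprecated" }.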

Deleting a Model

Click Delete to remove a model. A confirmation dialog warns that existing routing rules referencing this model may be affected. Deletion is permanent.

Pricing and Cost Tracking

Model pricing feeds directly into the gateway’s cost calculations. When a request completes, the gateway multiplies the token counts by the model’s configured input and output prices to compute the cost. This data appears in:

  • Dashboard cost breakdowns
  • Budget enforcement checks
  • Usage reports

Keep pricing current to ensure accurate budget tracking and cost-optimized routing decisions.
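
The cost calculation described above amounts to the following arithmetic. The pricing field names match this page. The TokenUsage shape and the requestCost function are hypothetical, and the sketch ignores any rounding or currency handling the gateway may apply.

```typescript
interface Pricing {
  inputPerMillionTokens: number;
  outputPerMillionTokens: number;
}

// Hypothetical token counts reported for one completed request.
interface TokenUsage {
  inputTokens: number;
  outputTokens: number;
}

// cost = (inputTokens * inputPrice + outputTokens * outputPrice) / 1,000,000
// Prices are per million tokens, so the sum is divided by one million.
function requestCost(usage: TokenUsage, pricing: Pricing): number {
  const weighted =
    usage.inputTokens * pricing.inputPerMillionTokens +
    usage.outputTokens * pricing.outputPerMillionTokens;
  return weighted / 1_000_000;
}
```

With an input price of 10 and an output price of 30 per million tokens, a request using 1,200 input tokens and 300 output tokens costs (1200 × 10 + 300 × 30) / 1,000,000 = 0.021. If the configured prices are stale, every figure derived from this calculation is off by the same proportion.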

Model Statuses

Status      Effect
active      Model is available for routing
inactive    Model is hidden from routing but retained in configuration
deprecated  Model is marked for phase-out; routing strategies may deprioritize it
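
One way a routing strategy might apply these statuses is sketched below. The function name and the "active first, deprecated last" ordering are assumptions. The table above only says that strategies may deprioritize deprecated models.

```typescript
type ModelStatus = "active" | "inactive" | "deprecated";

// Hypothetical minimal model shape for this example.
interface RoutableModel {
  modelId: string;
  status: ModelStatus;
}

// Drop inactive models entirely, then list active models ahead of
// deprecated ones. Relative order within each group is preserved.
function routingOrder(models: RoutableModel[]): RoutableModel[] {
  const active = models.filter((m) => m.status === "active");
  const deprecated = models.filter((m) => m.status === "deprecated");
  return [...active, ...deprecated];
}
```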