Portkey Models
Open-source pricing and configuration database for 2,300+ LLMs across 35+ providers
GET /model-configs/pricing/{provider}/{model}
curl --request GET \
  --url https://api.portkey.ai/model-configs/pricing/{provider}/{model}
{
  "pay_as_you_go": {
    "request_token": {
      "price": 123
    },
    "response_token": {
      "price": 123
    },
    "cache_read_input_token": {
      "price": 123
    },
    "cache_write_input_token": {
      "price": 123
    },
    "request_audio_token": {
      "price": 123
    },
    "response_audio_token": {
      "price": 123
    },
    "additional_units": {},
    "image": {}
  },
  "calculate": {
    "request": {},
    "response": {}
  },
  "currency": "<string>",
  "finetune_config": {
    "pay_per_token": {},
    "pay_per_hour": {}
  },
  "params": [
    {
      "key": "<string>",
      "defaultValue": "<any>",
      "minValue": 123,
      "maxValue": 123,
      "type": "<string>",
      "options": [
        {}
      ]
    }
  ],
  "type": {
    "primary": "<string>",
    "supported": [
      {}
    ]
  },
  "messages": {
    "options": [
      {}
    ]
  },
  "disablePlayground": true,
  "isDefault": true,
  "{model-id}": {
    "pricing_config": {}
  }
}
Portkey Models is a free, open-source pricing database for LLMs. No API key is required: just query any model's pricing directly.
- Explore Models: Browse 2,300+ models
- Model Rankings: Top models by usage
- GitHub: View source & contribute
Quick Start
Get pricing for any model with a single API call:
curl https://api.portkey.ai/model-configs/pricing/openai/gpt-4o
Need to discover all models for a provider? Instead of querying each model individually, you can fetch pricing for all models at once:
curl https://configs.portkey.ai/pricing/{provider}.json
See "Get All Models for a Provider" below for detailed documentation.
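The same two calls can be made from JavaScript. The sketch below only builds the URLs documented on this page and wraps `fetch`; the helper names (`modelPricingUrl`, `providerPricingUrl`, `getJson`) are illustrative and not part of any Portkey SDK.

```javascript
// Base URLs taken from the endpoints documented on this page.
const API_BASE = "https://api.portkey.ai/model-configs";
const BULK_BASE = "https://configs.portkey.ai/pricing";

// URL for a single model's pricing.
function modelPricingUrl(provider, model) {
  return `${API_BASE}/pricing/${provider}/${model}`;
}

// URL for all models of one provider.
function providerPricingUrl(provider) {
  return `${BULK_BASE}/${provider}.json`;
}

// Hypothetical helper: fetch and parse JSON, failing loudly on non-2xx responses.
async function getJson(url) {
  const res = await fetch(url);
  if (!res.ok) throw new Error(`Request failed: ${res.status} ${url}`);
  return res.json();
}

// Usage (performs a network request, so it is left commented out):
// getJson(modelPricingUrl("openai", "gpt-4o")).then((p) => console.log(p.pay_as_you_go));
```

No API key header is set because the page states that none is required.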
Understanding Pricing Units
Prices are in cents per token, not dollars.
| API Response (cents per token) | Per 1K Tokens | Per 1M Tokens |
|---|---|---|
| 0.003 | $0.03 | $30 |
| 0.00025 | $0.0025 | $2.50 |
| 0.0001 | $0.001 | $1.00 |
const costInDollars = (tokens * priceFromAPI) / 100;
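As a worked example of the conversion, the sketch below wraps the formula in a function and applies it to a hypothetical request. The token counts are made up for illustration; the two prices are the `request_token` and `response_token` values from the example pricing response on this page.

```javascript
// Prices returned by the API are in USD cents per token.
const CENTS_PER_DOLLAR = 100;

// Cost in dollars for a given token count.
function costInDollars(tokens, priceFromAPI) {
  return (tokens * priceFromAPI) / CENTS_PER_DOLLAR;
}

// Dollar price per N tokens (for example per 1K or per 1M tokens).
function dollarsPer(priceFromAPI, tokenCount) {
  return costInDollars(tokenCount, priceFromAPI);
}

// Hypothetical request: 1,200 input tokens and 350 output tokens.
const inputCost = costInDollars(1200, 0.00025);
const outputCost = costInDollars(350, 0.001);
console.log((inputCost + outputCost).toFixed(6)); // prints "0.006500"
```

Dividing by 100 once, at the end, keeps every intermediate value in the unit the API returns.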
API Reference
Get Model Pricing
Returns pricing configuration for a specific model.
GET https://api.portkey.ai/model-configs/pricing/{provider}/{model}
Path Parameters
- provider: Provider identifier. Use lowercase with hyphens. Examples: openai, anthropic, google, azure-openai, bedrock, together-ai, groq, deepseek, x-ai, mistral-ai, cohere, fireworks-ai, perplexity-ai, anyscale, deepinfra, cerebras
- model: Model identifier. Use the exact model name as specified by the provider. Examples: gpt-4o, gpt-4-turbo, claude-3-5-sonnet-20241022, claude-3-opus-20240229, gemini-2.0-flash-001, gemini-1.5-pro
Response Schema
- pay_as_you_go: Token-based pricing. All prices are in USD cents per token.
  - request_token: Input token pricing.
    - price: Price in USD cents per token. Example: 0.00025 = $0.0025 per 1K tokens
  - response_token: Output token pricing.
    - price: Price in USD cents per token.
  - cache_read_input_token: Prompt cache read pricing (when available).
    - price: Price in USD cents per token.
  - cache_write_input_token: Prompt cache write pricing (when available).
    - price: Price in USD cents per token.
  - request_audio_token: Audio input token pricing (for multimodal models).
    - price: Price in USD cents per token.
  - response_audio_token: Audio output token pricing (for multimodal models).
    - price: Price in USD cents per token.
  - additional_units: Provider-specific pricing for features beyond tokens. Common units:
    - web_search: Web search tool usage
    - file_search: File search tool usage
    - thinking_token: Chain-of-thought reasoning tokens
    - image_token: Image generation/processing
    - video_duration_seconds_*: Video generation (by resolution)
  - image: Image generation pricing by quality and size (for image models). Structure: { "quality": { "resolution": { "price": number } } }
- currency: Currency code. Always USD.
Example Responses
{
"pay_as_you_go": {
"request_token": { "price": 0.00025 },
"response_token": { "price": 0.001 },
"cache_write_input_token": { "price": 0 },
"cache_read_input_token": { "price": 0.000125 },
"additional_units": {
"web_search": { "price": 1 },
"file_search": { "price": 0.25 }
}
},
"calculate": {
"request": {
"operation": "sum",
"operands": [
{ "operation": "multiply", "operands": [{ "value": "input_tokens" }, { "value": "rates.request_token" }] },
{ "operation": "multiply", "operands": [{ "value": "cache_write_tokens" }, { "value": "rates.cache_write_input_token" }] },
{ "operation": "multiply", "operands": [{ "value": "cache_read_tokens" }, { "value": "rates.cache_read_input_token" }] }
]
},
"response": {
"operation": "multiply",
"operands": [{ "value": "output_tokens" }, { "value": "rates.response_token" }]
}
},
"currency": "USD"
}
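The `calculate` object in the response above is a small expression tree. The following is a minimal sketch of how a client might evaluate it, under these assumptions: only the `sum` and `multiply` operations shown in the examples are handled, a leaf such as `rates.request_token` resolves to `pay_as_you_go.request_token.price`, and missing usage counts or rates are treated as 0. It is not Portkey's own evaluator.

```javascript
// Recursively evaluate one node of a `calculate` tree.
// Leaves look like { value: "input_tokens" } or { value: "rates.request_token" }.
function evaluate(node, usage, rates) {
  if ("value" in node) {
    const name = node.value;
    if (typeof name === "number") return name; // literal constant (assumed possible)
    if (name.startsWith("rates.")) {
      // Assumption: "rates.X" maps to pay_as_you_go.X.price; missing rates count as 0.
      return rates[name.slice("rates.".length)]?.price ?? 0;
    }
    return usage[name] ?? 0; // missing usage counts are treated as 0
  }
  const values = node.operands.map((operand) => evaluate(operand, usage, rates));
  switch (node.operation) {
    case "sum":
      return values.reduce((a, b) => a + b, 0);
    case "multiply":
      return values.reduce((a, b) => a * b, 1);
    default:
      throw new Error(`Unsupported operation: ${node.operation}`);
  }
}

// Total cost in USD cents for one request/response pair.
function totalCostCents(pricing, usage) {
  const rates = pricing.pay_as_you_go;
  return (
    evaluate(pricing.calculate.request, usage, rates) +
    evaluate(pricing.calculate.response, usage, rates)
  );
}
```

Calling `totalCostCents(pricing, { input_tokens: 1000, cache_read_tokens: 2000, output_tokens: 500 })` with the example response returns the cost in cents; divide by 100 for dollars.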
Get Model Configuration
Returns general configuration and capabilities for a specific model.
GET https://api.portkey.ai/model-configs/general/{provider}/{model}
Path Parameters
- provider: Provider identifier (e.g., openai, anthropic, google)
- model: Model identifier (e.g., gpt-4o, claude-3-opus)
Response Schema
- params: Model parameters with their constraints. Each item has:
  - key: Parameter name (e.g., max_tokens, temperature, top_p)
  - defaultValue: Default value for the parameter.
  - minValue: Minimum allowed value.
  - maxValue: Maximum allowed value.
  - type: Parameter type (e.g., boolean, string, array-of-strings)
  - options: Available options for enum-type parameters.
- messages: Message configuration.
  - options: Supported message roles: system, user, assistant, developer
- disablePlayground: If true, the model is not available in the Portkey playground.
- isDefault: If true, this is the default model for the provider.
Example Responses
{
"params": [
{
"key": "max_tokens",
"maxValue": 16384
},
{
"key": "response_format",
"defaultValue": null,
"options": [
{ "value": null, "name": "Text" },
{ "value": "json_object", "name": "JSON Object" },
{ "value": "json_schema", "name": "JSON Schema" }
],
"type": "string"
},
{
"key": "temperature",
"minValue": 0,
"maxValue": 2,
"defaultValue": 1
}
],
"type": {
"primary": "chat",
"supported": ["tools", "image"]
}
}
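One use for the `params` array is checking request parameters before sending them. The sketch below is an illustration only: `checkParam` and `validateRequest` are hypothetical helpers, and it assumes that `minValue`/`maxValue` are inclusive bounds and that `options[].value` lists every allowed value.

```javascript
// Check one value against a `params` entry.
// Returns null when the value is acceptable, or a short message otherwise.
function checkParam(param, value) {
  if (param.minValue !== undefined && value < param.minValue) {
    return `${param.key} must be >= ${param.minValue}`;
  }
  if (param.maxValue !== undefined && value > param.maxValue) {
    return `${param.key} must be <= ${param.maxValue}`;
  }
  if (Array.isArray(param.options)) {
    const allowed = param.options.map((option) => option.value);
    if (!allowed.includes(value)) {
      return `${param.key} must be one of ${JSON.stringify(allowed)}`;
    }
  }
  return null;
}

// Validate a request body; keys without a config entry are skipped.
function validateRequest(config, body) {
  const errors = [];
  for (const param of config.params) {
    if (!(param.key in body)) continue;
    const error = checkParam(param, body[param.key]);
    if (error) errors.push(error);
  }
  return errors;
}
```

With the example configuration above, a body containing `temperature: 2.5` would produce one error, since `maxValue` for `temperature` is 2.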
Get All Models for a Provider
Returns pricing configuration for all models from a specific provider in a single response.
GET https://configs.portkey.ai/pricing/{provider}.json
Use this endpoint to discover all available models and their pricing for a provider, instead of querying each model individually.
Path Parameters
- provider: Provider identifier. Use lowercase with hyphens. Examples: openai, anthropic, google, bedrock, azure-openai, together-ai, groq, deepseek, x-ai, mistral-ai
Response Schema
Returns a JSON object where:
- Keys are model identifiers
- Values contain pricing_config objects with the same structure as the individual model pricing endpoint
- A default key provides the base pricing template
Each {model-id} entry holds the pricing configuration for a specific model:
- pricing_config: Contains the same pricing structure as the individual model endpoint: pay_as_you_go, calculate, currency, etc.
Example Response
curl --request GET \
  --url https://configs.portkey.ai/pricing/bedrock.json
Sample Response (Bedrock)
{
"default": {
"pricing_config": {
"pay_as_you_go": {
"request_token": { "price": 0 },
"response_token": { "price": 0 }
},
"calculate": {
"request": {
"operation": "multiply",
"operands": [
{ "value": "input_tokens" },
{ "value": "rates.request_token" }
]
},
"response": {
"operation": "multiply",
"operands": [
{ "value": "output_tokens" },
{ "value": "rates.response_token" }
]
}
},
"currency": "USD"
}
},
"anthropic.claude-3-5-sonnet-20241022-v2:0": {
"pricing_config": {
"pay_as_you_go": {
"request_token": { "price": 0.0003 },
"response_token": { "price": 0.0015 },
"cache_write_input_token": { "price": 0.000375 },
"cache_read_input_token": { "price": 0.00003 }
}
}
},
"anthropic.claude-3-haiku-20240307-v1:0": {
"pricing_config": {
"pay_as_you_go": {
"request_token": { "price": 0.000025 },
"response_token": { "price": 0.000125 }
}
}
},
"meta.llama3-1-405b-instruct-v1:0": {
"pricing_config": {
"pay_as_you_go": {
"request_token": { "price": 0.000532 },
"response_token": { "price": 0.0016 }
}
}
},
"amazon.nova-pro-v1:0": {
"pricing_config": {
"pay_as_you_go": {
"request_token": { "price": 0.00008 },
"response_token": { "price": 0.00002 },
"cache_read_input_token": { "price": 0.00004 },
"cache_write_input_token": { "price": 0.00016 }
}
}
}
}
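In the sample above, model entries carry only `pay_as_you_go`, while `calculate` and `currency` appear under `default`. The sketch below shows one way a client might combine the two. It assumes that fields missing from a model entry fall back to the `default` template; the page calls `default` a "base pricing template" but does not spell out the merge rule, so treat this as an interpretation. The helper names are hypothetical.

```javascript
// Resolve pricing for one model from a provider-wide response.
// Assumption: fields missing from a model entry fall back to `default`.
function resolvePricing(providerPricing, modelId) {
  const base = providerPricing.default?.pricing_config ?? {};
  const entry = providerPricing[modelId]?.pricing_config;
  if (!entry) return null; // unknown model
  return {
    ...base,
    ...entry,
    // Merge token rates key by key so that partial entries keep default rates.
    pay_as_you_go: { ...base.pay_as_you_go, ...entry.pay_as_you_go },
  };
}

// List model identifiers, skipping the `default` template key.
function listModels(providerPricing) {
  return Object.keys(providerPricing).filter((key) => key !== "default");
}
```

`listModels` gives the discovery step described above, and `resolvePricing` yields an object with the same shape as the single-model pricing response.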
Additional Units Reference
Provider-specific pricing for features beyond standard token costs:
| Unit | Description | Providers | Price Range (¢) |
|---|---|---|---|
| web_search | Web search tool usage | OpenAI, Azure, Google, Vertex AI, Perplexity | 0.5 - 3.5 |
| file_search | File search tool usage | OpenAI, Azure | 0.25 |
| search | Google search grounding | Google, Vertex AI | 1.4 - 3.5 |
| thinking_token | Chain-of-thought reasoning | Google, Vertex AI | 0.00004 - 0.0012 |
| image_token | Image processing tokens | Google, Vertex AI | 0.003 |
| image_1k | Image generation (1K units) | | 3.9 |
| megapixels | Image generation by megapixel | Together AI | 0.0027 - 0.08 |
| video_seconds | Video generation | Vertex AI | 10 - 50 |
| video_duration_seconds_720_1280 | Video (720p) | OpenAI Sora | 10 - 30 |
| video_duration_seconds_1080_1920 | Video (1080p) | OpenAI Sora | 50 |
| routing_units | Azure OpenAI routing | Azure OpenAI | 0.000014 |
| input_image | Image input | Vertex AI | 0.01 |
| input_video_essential | Video input (essential) | Vertex AI | 0.05 |
| input_video_standard | Video input (standard) | Vertex AI | 0.1 |
| input_video_plus | Video input (plus) | Vertex AI | 0.2 |
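Additional units can be added to a cost estimate alongside token costs. The sketch below assumes each `additional_units` price is in cents per single unit (one search, one image, one second of video), which the page implies but does not state outright. The `unitCounts` argument is a hypothetical shape chosen for this example.

```javascript
// Cost in USD cents for non-token usage such as web searches.
// `unitCounts` is a hypothetical shape, e.g. { web_search: 3, file_search: 2 }.
// Assumption: each `additional_units` price is in cents per single unit.
function additionalUnitsCostCents(payAsYouGo, unitCounts) {
  const units = payAsYouGo.additional_units ?? {};
  let total = 0;
  for (const [unit, count] of Object.entries(unitCounts)) {
    const price = units[unit]?.price;
    if (price === undefined) {
      // Fail loudly rather than silently under-reporting cost.
      throw new Error(`No pricing found for unit: ${unit}`);
    }
    total += price * count;
  }
  return total;
}
```

Throwing on an unknown unit is a deliberate choice here: a missing price is more likely a naming mismatch than a free feature.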
Perplexity Context-Based Pricing
Perplexity has context-dependent web search pricing:
| Unit | Price (¢) |
|---|---|
| web_search_low_context | 0.5 - 0.6 |
| web_search_medium_context | 0.8 - 1.0 |
| web_search_high_context | 1.2 - 1.4 |
Supported Providers
40+ providers supported
AI21, Anthropic, Anyscale, Azure AI, Azure OpenAI, AWS Bedrock, Cerebras, Cohere, Dashscope, Deepbricks, DeepInfra, DeepSeek, Fireworks AI, GitHub, Google, Groq, Inference.net, Jina, Lambda, Lemonfox AI, Mistral AI, MonsterAPI, Nebius, Nomic, Novita AI, OpenAI, OpenRouter, Oracle, PaLM, Perplexity AI, Predibase, Reka AI, Sagemaker, Segmind, Stability AI, Together AI, Vertex AI, Workers AI, X.AI, Zhipu
Use with Portkey
Portkey Models powers automatic cost tracking for all requests through the Portkey Gateway. When you make requests via Portkey, costs are calculated automatically using this pricing data.
- Cost Analytics: View real-time spend across all your LLM usage
- Budget Limits: Set spending caps that automatically enforce limits
- Custom Pricing: Set custom input/output costs to match your contracts
Last modified on May 24, 2026