Portkey Models

curl --request GET \
  --url https://api.portkey.ai/model-configs/pricing/{provider}/{model}

{
  "pay_as_you_go": {
    "request_token": {
      "price": 123
    },
    "response_token": {
      "price": 123
    },
    "cache_read_input_token": {
      "price": 123
    },
    "cache_write_input_token": {
      "price": 123
    },
    "request_audio_token": {
      "price": 123
    },
    "response_audio_token": {
      "price": 123
    },
    "additional_units": {},
    "image": {}
  },
  "calculate": {
    "request": {},
    "response": {}
  },
  "currency": "<string>",
  "finetune_config": {
    "pay_per_token": {},
    "pay_per_hour": {}
  },
  "params": [
    {
      "key": "<string>",
      "defaultValue": "<any>",
      "minValue": 123,
      "maxValue": 123,
      "type": "<string>",
      "options": [
        {}
      ]
    }
  ],
  "type": {
    "primary": "<string>",
    "supported": [
      {}
    ]
  },
  "messages": {
    "options": [
      {}
    ]
  },
  "disablePlayground": true,
  "isDefault": true,
  "{model-id}": {
    "pricing_config": {}
  }
}

GET

model-configs

pricing

{provider}

{model}

Portkey Models

curl --request GET \
  --url https://api.portkey.ai/model-configs/pricing/{provider}/{model}

{
  "pay_as_you_go": {
    "request_token": {
      "price": 123
    },
    "response_token": {
      "price": 123
    },
    "cache_read_input_token": {
      "price": 123
    },
    "cache_write_input_token": {
      "price": 123
    },
    "request_audio_token": {
      "price": 123
    },
    "response_audio_token": {
      "price": 123
    },
    "additional_units": {},
    "image": {}
  },
  "calculate": {
    "request": {},
    "response": {}
  },
  "currency": "<string>",
  "finetune_config": {
    "pay_per_token": {},
    "pay_per_hour": {}
  },
  "params": [
    {
      "key": "<string>",
      "defaultValue": "<any>",
      "minValue": 123,
      "maxValue": 123,
      "type": "<string>",
      "options": [
        {}
      ]
    }
  ],
  "type": {
    "primary": "<string>",
    "supported": [
      {}
    ]
  },
  "messages": {
    "options": [
      {}
    ]
  },
  "disablePlayground": true,
  "isDefault": true,
  "{model-id}": {
    "pricing_config": {}
  }
}

Portkey Models is a free, open-source pricing database for LLMs. No API key required — just query any model’s pricing directly.

Explore Models

Browse 2,300+ models

Model Rankings

Top models by usage

GitHub

View source & contribute

Quick Start

Get pricing for any model with a single API call:

curl https://api.portkey.ai/model-configs/pricing/openai/gpt-4o

Need to discover all models for a provider? Instead of querying each model individually, you can fetch pricing for all models at once:

curl https://configs.portkey.ai/pricing/{provider}.json

See detailed documentation below ↓

Understanding Pricing Units

Prices are in cents per token, not dollars.

API Response	Per 1K Tokens	Per 1M Tokens
`0.003`	$0.03	$30
`0.00025`	$0.0025	$2.50
`0.0001`	$0.001	$1.00

Calculate cost in dollars:

const costInDollars = (tokens * priceFromAPI) / 100;

API Reference

Get Model Pricing

Returns pricing configuration for a specific model.

GET https://api.portkey.ai/model-configs/pricing/{provider}/{model}

Path Parameters

provider

string

required

Provider identifier. Use lowercase with hyphens.Examples: openai, anthropic, google, azure-openai, bedrock, together-ai, groq, deepseek, x-ai, mistral-ai, cohere, fireworks-ai, perplexity-ai, anyscale, deepinfra, cerebras

model

string

required

Model identifier. Use the exact model name as specified by the provider.Examples: gpt-4o, gpt-4-turbo, claude-3-5-sonnet-20241022, claude-3-opus-20240229, gemini-2.0-flash-001, gemini-1.5-pro

Response Schema

pay_as_you_go

object

Token-based pricing. All prices in USD cents per token.

Show properties

request_token

object

Input token pricing.

Show properties

price

number

Price in USD cents per token. Example: 0.00025 = $0.0025 per 1K tokens

response_token

object

Output token pricing.

Show properties

price

number

Price in USD cents per token.

cache_read_input_token

object

Prompt cache read pricing (when available).

Show properties

price

number

Price in USD cents per token.

cache_write_input_token

object

Prompt cache write pricing (when available).

Show properties

price

number

Price in USD cents per token.

request_audio_token

object

Audio input token pricing (for multimodal models).

Show properties

price

number

Price in USD cents per token.

response_audio_token

object

Audio output token pricing (for multimodal models).

Show properties

price

number

Price in USD cents per token.

additional_units

object

Provider-specific pricing for features beyond tokens.Common units:

web_search — Web search tool usage
file_search — File search tool usage
thinking_token — Chain-of-thought reasoning tokens
image_token — Image generation/processing
video_duration_seconds_* — Video generation (by resolution)

image

object

Image generation pricing by quality and size (for image models).Structure: { "quality": { "resolution": { "price": number } } }

calculate

object

Cost calculation formulas for complex pricing scenarios.

Show properties

request

object

Formula for calculating request (input) cost.

response

object

Formula for calculating response (output) cost.

currency

string

Currency code. Always USD.

finetune_config

object

Fine-tuning pricing (when available).

Show properties

pay_per_token

object

Training cost per token.

pay_per_hour

object

Training cost per hour.

Example Responses

{
  "pay_as_you_go": {
    "request_token": { "price": 0.00025 },
    "response_token": { "price": 0.001 },
    "cache_write_input_token": { "price": 0 },
    "cache_read_input_token": { "price": 0.000125 },
    "additional_units": {
      "web_search": { "price": 1 },
      "file_search": { "price": 0.25 }
    }
  },
  "calculate": {
    "request": {
      "operation": "sum",
      "operands": [
        { "operation": "multiply", "operands": [{ "value": "input_tokens" }, { "value": "rates.request_token" }] },
        { "operation": "multiply", "operands": [{ "value": "cache_write_tokens" }, { "value": "rates.cache_write_input_token" }] },
        { "operation": "multiply", "operands": [{ "value": "cache_read_tokens" }, { "value": "rates.cache_read_input_token" }] }
      ]
    },
    "response": {
      "operation": "multiply",
      "operands": [{ "value": "output_tokens" }, { "value": "rates.response_token" }]
    }
  },
  "currency": "USD"
}

Get Model Configuration

Returns general configuration and capabilities for a specific model.

GET https://api.portkey.ai/model-configs/general/{provider}/{model}

Path Parameters

provider

string

required

Provider identifier (e.g., openai, anthropic, google)

model

string

required

Model identifier (e.g., gpt-4o, claude-3-opus)

Response Schema

params

array

Model parameters with their constraints.

Show item properties

key

string

Parameter name (e.g., max_tokens, temperature, top_p)

defaultValue

any

Default value for the parameter.

minValue

number

Minimum allowed value.

maxValue

number

Maximum allowed value.

type

string

Parameter type (e.g., boolean, string, array-of-strings)

options

array

Available options for enum-type parameters.

type

object

Model type classification.

Show properties

primary

string

Primary model type: chat, text, embedding, image, audio, moderation

supported

array

Supported features: tools, image, cache_control, json_mode

messages

object

Message configuration.

Show properties

options

array

Supported message roles: system, user, assistant, developer

disablePlayground

boolean

If true, model is not available in Portkey playground.

isDefault

boolean

If true, this is the default model for the provider.

Example Responses

{
  "params": [
    {
      "key": "max_tokens",
      "maxValue": 16384
    },
    {
      "key": "response_format",
      "defaultValue": null,
      "options": [
        { "value": null, "name": "Text" },
        { "value": "json_object", "name": "JSON Object" },
        { "value": "json_schema", "name": "JSON Schema" }
      ],
      "type": "string"
    },
    {
      "key": "temperature",
      "minValue": 0,
      "maxValue": 2,
      "defaultValue": 1
    }
  ],
  "type": {
    "primary": "chat",
    "supported": ["tools", "image"]
  }
}

Get All Models for a Provider

Returns pricing configuration for all models from a specific provider in a single response.

GET https://configs.portkey.ai/pricing/{provider}.json

Use this endpoint to discover all available models and their pricing for a provider, instead of querying each model individually.

Path Parameters

provider

string

required

Provider identifier. Use lowercase with hyphens.Examples: openai, anthropic, google, bedrock, azure-openai, together-ai, groq, deepseek, x-ai, mistral-ai

Response Schema

Returns a JSON object where:

Keys are model identifiers
Values contain pricing_config objects with the same structure as the individual model pricing endpoint
A default key provides the base pricing template

{model-id}

object

Pricing configuration for a specific model.

Show properties

pricing_config

object

Contains the same pricing structure as the individual model endpoint: pay_as_you_go, calculate, currency, etc.

Example Response

curl --request GET \
  --url https://configs.portkey.ai/pricing/bedrock.json

Sample Response (Bedrock)

{
  "default": {
    "pricing_config": {
      "pay_as_you_go": {
        "request_token": { "price": 0 },
        "response_token": { "price": 0 }
      },
      "calculate": {
        "request": {
          "operation": "multiply",
          "operands": [
            { "value": "input_tokens" },
            { "value": "rates.request_token" }
          ]
        },
        "response": {
          "operation": "multiply",
          "operands": [
            { "value": "output_tokens" },
            { "value": "rates.response_token" }
          ]
        }
      },
      "currency": "USD"
    }
  },
  "anthropic.claude-3-5-sonnet-20241022-v2:0": {
    "pricing_config": {
      "pay_as_you_go": {
        "request_token": { "price": 0.0003 },
        "response_token": { "price": 0.0015 },
        "cache_write_input_token": { "price": 0.000375 },
        "cache_read_input_token": { "price": 0.00003 }
      }
    }
  },
  "anthropic.claude-3-haiku-20240307-v1:0": {
    "pricing_config": {
      "pay_as_you_go": {
        "request_token": { "price": 0.000025 },
        "response_token": { "price": 0.000125 }
      }
    }
  },
  "meta.llama3-1-405b-instruct-v1:0": {
    "pricing_config": {
      "pay_as_you_go": {
        "request_token": { "price": 0.000532 },
        "response_token": { "price": 0.0016 }
      }
    }
  },
  "amazon.nova-pro-v1:0": {
    "pricing_config": {
      "pay_as_you_go": {
        "request_token": { "price": 0.00008 },
        "response_token": { "price": 0.00002 },
        "cache_read_input_token": { "price": 0.00004 },
        "cache_write_input_token": { "price": 0.00016 }
      }
    }
  }
}

Additional Units Reference

Provider-specific pricing for features beyond standard token costs:

Unit	Description	Providers	Price Range (¢)
`web_search`	Web search tool usage	OpenAI, Azure, Google, Vertex AI, Perplexity	0.5 - 3.5
`file_search`	File search tool usage	OpenAI, Azure	0.25
`search`	Google search grounding	Google, Vertex AI	1.4 - 3.5
`thinking_token`	Chain-of-thought reasoning	Google, Vertex AI	0.00004 - 0.0012
`image_token`	Image processing tokens	Google, Vertex AI	0.003
`image_1k`	Image generation (1K units)	Google	3.9
`megapixels`	Image generation by megapixel	Together AI	0.0027 - 0.08
`video_seconds`	Video generation	Vertex AI	10 - 50
`video_duration_seconds_720_1280`	Video (720p)	OpenAI Sora	10 - 30
`video_duration_seconds_1080_1920`	Video (1080p)	OpenAI Sora	50
`routing_units`	Azure OpenAI routing	Azure OpenAI	0.000014
`input_image`	Image input	Vertex AI	0.01
`input_video_essential`	Video input (essential)	Vertex AI	0.05
`input_video_standard`	Video input (standard)	Vertex AI	0.1
`input_video_plus`	Video input (plus)	Vertex AI	0.2

Perplexity Context-Based Pricing

Perplexity has context-dependent web search pricing:

Unit	Price (¢)
`web_search_low_context`	0.5 - 0.6
`web_search_medium_context`	0.8 - 1.0
`web_search_high_context`	1.2 - 1.4

Supported Providers

40+ providers supported

AI21, Anthropic, Anyscale, Azure AI, Azure OpenAI, AWS Bedrock, Cerebras, Cohere, Dashscope, Deepbricks, DeepInfra, DeepSeek, Fireworks AI, GitHub, Google, Groq, Inference.net, Jina, Lambda, Lemonfox AI, Mistral AI, MonsterAPI, Nebius, Nomic, Novita AI, OpenAI, OpenRouter, Oracle, PaLM, Perplexity AI, Predibase, Reka AI, Sagemaker, Segmind, Stability AI, Together AI, Vertex AI, Workers AI, X.AI, Zhipu

Use with Portkey

Portkey Models powers automatic cost tracking for all requests through the Portkey Gateway. When you make requests via Portkey, costs are calculated automatically using this pricing data.

Cost Analytics

View real-time spend across all your LLM usage

Budget Limits

Set spending caps that automatically enforce limits

If you have negotiated enterprise rates, you can override the default pricing:

Custom Pricing

Set custom input/output costs to match your contracts

Last modified on May 24, 2026

Model Catalog Integrations

⌘I

Documentation Index

Explore Models

Model Rankings

GitHub

​Quick Start

​Understanding Pricing Units

​API Reference

​Get Model Pricing

​Path Parameters

​Response Schema

​Example Responses

​Get Model Configuration

​Path Parameters

​Response Schema

​Example Responses

​Get All Models for a Provider

​Path Parameters

​Response Schema

​Example Response

​Additional Units Reference

​Supported Providers

​Use with Portkey

Cost Analytics

Budget Limits

Custom Pricing

Quick Start

Understanding Pricing Units

API Reference

Get Model Pricing

Path Parameters

Response Schema

Example Responses

Get Model Configuration

Path Parameters

Response Schema

Example Responses

Get All Models for a Provider

Path Parameters

Response Schema

Example Response

Additional Units Reference

Supported Providers

Use with Portkey