Category · 25 products · $0–$270/user/mo range · 10 with free tier

Software · LLM API Providers

LLM API Providers Software Pricing 2026

Compare pricing for 25 llm api providers tools. Find the right software for your budget.

Products 25 in this category

Price range $0–$270 /user/mo

Median $11 across 23 priced tools

Free tiers 10 no-cost entry points

LLM API Providers software pricing ranges from $0 to $270 per user/month in 2026. The typical cost is around $11/user/month across 25 popular tools. Top picks: Amazon Bedrock ($0.07–$75/user/mo), Anyscale ($0.15–$5/user/mo), Baidu ERNIE API ($0.10–$10/user/mo), and 22 more. 10 of 25 tools offer free tiers for small teams or limited use.

All LLM API Providers Tools

Compare all side-by-side →

Sort

25 of 25 products

Amazon Bedrock

$0.07–$75/per million tokens

On-Demand (pay per token) $null Provisioned Throughput $null Enterprise (Bedrock + AWS deal) Custom

See Plans →

Anyscale

$0.15–$5/per million tokens

Anyscale Endpoints $null Managed Ray Clusters Custom

See Plans →

Baidu ERNIE API

$0.10–$10/per million tokens

Pay-as-you-go (ERNIE 4.5, 4 Turbo, X1) $null Enterprise Custom

See Plans →

Cerebras Inference API

$0.10–$6/per million tokens

Free tier (Developer) Free Pay-as-you-go $null Enterprise Custom

See Plans →

Claude API

$0.03–$75/per million tokens

API (Pay-as-you-go) Custom Enterprise Custom

See Plans →

Cloudflare Workers AI

Free–$4.88/per million tokens

Free tier Free Pay-as-you-go (Neurons / tokens) $null Enterprise Custom

See Plans →

Cohere API

$0.04–$10/per million tokens

Trial (Free) Free Command R (Pay-as-you-go) Custom Command R+ / Command A (Pay-as-you-go) Custom +1

See Plans →

DeepInfra

$0.00–$82.50/per million tokens

Pay-as-you-go Custom

See Plans →

Fireworks AI

Free–$11/per million tokens / hour

Serverless Custom On-Demand (H100/H200) Custom On-Demand (B200) Custom +2

See Plans →

Google Gemini API

Free–$18/per million tokens

Free Free Flash-Lite (Paid) Custom Flash (Paid) Custom +1

See Plans →

Groq

Free

Free Free Developer Custom Enterprise Custom

See Plans →

Lepton AI

$0.07–$4/per million tokens

Serverless Inference $null GPU Cloud $null

See Plans →

MiniMax API

$0.20–$3/per million tokens

Pay-as-you-go (MiniMax M1, M2, abab6.5) $null Enterprise Custom

See Plans →

Mistral AI API

$0.10–$6/per million tokens

Free Free Mistral Small Custom Mistral Medium Custom +1

See Plans →

Moonshot Kimi API

$0.15–$10/per million tokens

Pay-as-you-go (Kimi K2, Moonshot v1 family) $null Enterprise Custom

See Plans →

NVIDIA NIM

$0.10–$10/per million tokens

Developer (Free credits) Free Pay-as-you-go (hosted NIM endpoints) $null Enterprise (AI Enterprise license + DGX Cloud) Custom

See Plans →

OctoAI

Custom pricing

Service Discontinued Custom

See Plans →

OpenAI API

$0.20–$270/per million tokens

GPT-5.4 mini / nano (Economy) Custom GPT-5.5 / GPT-5.4 / Pro (Flagship) Custom Enterprise Custom

See Plans →

OpenRouter

Free–$75/per million tokens

Free Models Free Pay-as-you-go $null

See Plans →

Perplexity API

$1–$15/per million tokens + per-request fee

Sonar Custom Sonar Pro Custom Sonar Reasoning Pro Custom +1

See Plans →

Qwen API (Alibaba)

$0.05–$20/per million tokens

Pay-as-you-go (Qwen3, Qwen2.5, Qwen-VL) $null Enterprise Custom

See Plans →

SambaNova Cloud

Free–$4.50/per million tokens

Free tier Free Developer (Pay-as-you-go) $null Enterprise Custom

See Plans →

Together AI

$0.03–$9.95/per million tokens / hour

Serverless Custom Dedicated (1x H100) Custom Dedicated (1x H200) Custom +2

See Plans →

Vercel AI SDK

Free–$5/per month (Vercel plan)

Free Tier Free Paid (Pay-As-You-Go) $null Enterprise Custom

See Plans →

xAI Grok API

$1.25–$2.50/per million tokens

Pay-as-you-go (Grok 4.3) $null Enterprise Custom

See Plans →

LLM API Providers Comparisons

Cost Analysis Tools

Amazon Bedrock

Hidden Costs Calculator Negotiation

Anyscale

Hidden Costs Calculator Negotiation

Baidu ERNIE API

Hidden Costs Calculator Negotiation

Cerebras Inference API

Hidden Costs Calculator Negotiation

Claude API

Hidden Costs Calculator Negotiation

Cloudflare Workers AI

Hidden Costs Calculator Negotiation

LLM API Providers Pricing FAQ

01 What are LLM API providers?

LLM API providers offer access to large language models via API, enabling developers to add AI capabilities to applications without hosting models themselves. They charge per token (input and output) and compete on price, speed, model selection, and features like web search grounding or RAG optimization.

02 How much do LLM APIs cost in 2026?

LLM API pricing is per-token and varies widely by model size and provider. Small models (under 8B parameters) cost $0.02-0.20 per million tokens on Groq, Together AI, and Mistral. Mid-range models cost $0.30-1.25 per million tokens (Gemini Flash, Mistral Medium). Frontier models (GPT-4o, Claude Sonnet, Gemini Pro) cost $1-5 per million input tokens. Perplexity Sonar adds per-request fees on top of token costs.

03 Which LLM API provider is cheapest?

For small open-source models, Groq (from $0.05/M tokens) and Mistral Nemo ($0.02/M) are the cheapest. For frontier-quality models, Mistral Large 3 at $0.50/$1.50 per million tokens is dramatically cheaper than GPT-4o or Claude Sonnet. Google Gemini Flash-Lite at $0.10/$0.40 per million tokens offers frontier quality at budget prices. Cohere Command R7B is cheapest for RAG at $0.037/M input tokens.

04 Which LLM API provider has the best free tier?

Google Gemini API offers the best free tier: 1,500 requests/day on Flash models through Google AI Studio with no credit card required. Groq offers a free API key with rate-limited access to all models. Mistral offers a free trial tier via La Plateforme. Cohere offers a free Trial API key for non-commercial use. Together AI and Fireworks AI offer $1 in free credits. Perplexity API has no free tier.

05 What is the difference between per-token and per-request LLM API pricing?

Most LLM APIs charge per token (input tokens + output tokens × rate). Perplexity API is unique in adding a per-request fee on top of token costs — every Sonar query incurs a $5-14 per 1,000 requests charge based on search context depth. This dual model reflects the cost of real-time web search bundled into each query. When comparing Perplexity to other APIs, you must add both token costs and request fees to get the true cost per query.

All LLM API Providers Tools

Amazon Bedrock

Anyscale

Baidu ERNIE API

Cerebras Inference API

Claude API

Cloudflare Workers AI

Cohere API

DeepInfra

Fireworks AI

Google Gemini API

Groq

Lepton AI

MiniMax API

Mistral AI API

Moonshot Kimi API

NVIDIA NIM

OctoAI

OpenAI API

OpenRouter

Perplexity API

Qwen API (Alibaba)

SambaNova Cloud

Together AI

Vercel AI SDK

xAI Grok API

LLM API Providers Comparisons

Cost Analysis Tools

LLM API Providers Pricing FAQ

01 What are LLM API providers?

02 How much do LLM APIs cost in 2026?

03 Which LLM API provider is cheapest?

04 Which LLM API provider has the best free tier?

05 What is the difference between per-token and per-request LLM API pricing?

Related Categories