Quick Answer
Last verified:
Medium confidence

Google Gemini API costs Free to $18 per per million tokens as of May 2026, with 4 plans available including a free tier. Plan: Free (free). Enterprise pricing is available on request. Pricing depends on your chosen tier, contract length, and negotiated discounts.

Use the interactive pricing calculator to estimate your exact cost based on team size and requirements.

  • Free tier: Yes

Google Gemini API offers 4 pricing tiers: Free, Flash-Lite (Paid), Flash (Paid), Pro (Paid). The Flash-Lite (Paid) plan is high-volume, cost-sensitive production workloads.

Compared to other llm api providers software, Google Gemini API is positioned at the budget-friendly price point.

  • 4 documented hidden costs beyond list price

How much does Google Gemini API cost?

Google Gemini API offers 4 pricing plans, starting with a free tier and scaling to custom enterprise pricing. Plans include Free (free), Flash-Lite (Paid) (custom pricing), Flash (Paid) (custom pricing), Pro (Paid) (custom pricing).

Google Gemini API Pricing Overview

Google Gemini API has 4 pricing plans, including a free tier. Paid plans range from $0 to $18/per million tokens. The Free plan is free and is best for prototyping and evaluation. The Flash-Lite (Paid) plan requires contacting sales for a custom quote and is designed for high-volume, cost-sensitive production workloads. The Flash (Paid) plan requires contacting sales for a custom quote and is designed for production apps balancing cost and capability. The Pro (Paid) plan requires contacting sales for a custom quote and is designed for complex reasoning, long-context, and multimodal tasks.

Google Gemini API with a None — pay-per-use minimum commitment, requiring Not applicable — pay-per-use, no subscription contract notice to cancel.

There are at least 4 documented hidden costs beyond Google Gemini API's list price, including implementation, training, and add-on fees.

This pricing was last verified in May 6, 2026 from 1 independent sources.

Google Gemini API pricing starts at $0 on the Free tier, which provides rate-limited access to Gemini models via AI Studio for prototyping. For production workloads, the Flash-Lite (Paid), Flash (Paid), and Pro (Paid) tiers are all billed on a per-token usage basis with no monthly subscription or minimum commitment. According to Artificial Analysis data from April 2026, the provider median across 51 tracked models sits at $0.56 per 1M input tokens and $2.20 per 1M output tokens, with model-level rates ranging from $0 (open Gemma models) up to $10.00 per 1M output tokens for Gemini 2.5 Pro.

How Google Gemini API Pricing Compares

Compare Google Gemini API pricing against top alternatives in LLM API Providers.

All Google Gemini API Plans & Pricing

Plan Monthly Annual Best For
Free rate_limit: Rate-limited for prototyping Free Free Prototyping and evaluation
Flash-Lite (Paid) Contact Sales Contact Sales High-volume, cost-sensitive production workloads
Flash (Paid) Contact Sales Contact Sales Production apps balancing cost and capability
Pro (Paid) Contact Sales Contact Sales Complex reasoning, long-context, and multimodal tasks
View all features by plan

Free

  • Free API key via Google AI Studio
  • Gemini 2.5 Flash-Lite: free input & output
  • Gemini 3 Flash Preview: free input & output
  • Gemini 3.1 Flash-Lite Preview: free input & output
  • Rate-limited for prototyping
  • Content used to improve Google products

Flash-Lite (Paid)

  • Gemini 2.5 Flash-Lite: $0.10 input / $0.40 output per M tokens
  • Gemini 3.1 Flash-Lite Preview: $0.25 input / $1.50 output per M tokens
  • Most cost-efficient Gemini models
  • Batch API: 50% cost reduction
  • Great for high-volume, cost-sensitive workloads

Flash (Paid)

  • Gemini 2.5 Flash: $0.30 input / $2.50 output per M tokens
  • Gemini 3 Flash Preview: $0.50 input / $3.00 output per M tokens
  • Balanced speed and capability
  • Multimodal: text, image, video, audio
  • Audio input: $1.00/M tokens

Pro (Paid)

  • Gemini 2.5 Pro: $1.25 input (≤200K) / $10.00 output per M tokens
  • $2.50 input / $15.00 output for prompts >200K tokens
  • Gemini 3.1 Pro Preview: $2.00 input (≤200K) / $12.00 output per M tokens
  • $4.00 input / $18.00 output for prompts >200K tokens
  • Google Search grounding: $14/1,000 queries (5,000/mo free)
  • Context caching available (up to 90% input cost reduction)

Usage-Based Rates

Per-unit pricing for Google Gemini API API usage.

Flash-Lite (Paid)

Model Unit Rate
Gemini 2.5 Flash-Lite 1M input tokens $0.100
Gemini 2.5 Flash-Lite 1M output tokens $0.400
Gemini 3.1 Flash-Lite Preview 1M input tokens $0.250
Gemini 3.1 Flash-Lite Preview 1M output tokens $1.50
Gemini 3.1 Flash-Lite Preview 1M cached input tokens $0.025
  • Same rate regardless of context length
  • Audio input at $0.50/M tokens for 3.1 Flash-Lite
  • Context caching storage: $1.00/M tokens per hour

Flash (Paid)

Model Unit Rate
Gemini 2.5 Flash 1M input tokens $0.300
Gemini 2.5 Flash 1M output tokens $2.50
Gemini 2.5 Flash (thinking) 1M output tokens $3.50
Gemini 3 Flash Preview 1M input tokens $0.500
Gemini 3 Flash Preview 1M output tokens $3.00
Gemini 3 Flash Preview 1M cached input tokens $0.050
  • Thinking/reasoning output billed at higher rate for 2.5 Flash
  • 3 Flash Preview output price includes thinking tokens
  • Audio input at $1.00/M tokens
  • Context caching storage: $1.00/M tokens per hour

Pro (Paid)

Model Unit Rate
Gemini 2.5 Pro (≤200K ctx) 1M input tokens $1.25
Gemini 2.5 Pro (>200K ctx) 1M input tokens $2.50
Gemini 2.5 Pro 1M output tokens $10.00
Gemini 2.5 Pro (thinking) 1M output tokens $15.00
Gemini 3.1 Pro Preview (≤200K ctx) 1M input tokens $2.00
Gemini 3.1 Pro Preview (>200K ctx) 1M input tokens $4.00
Gemini 3.1 Pro Preview (≤200K ctx) 1M output tokens $12.00
Gemini 3.1 Pro Preview (>200K ctx) 1M output tokens $18.00
Gemini 3.1 Pro Preview (≤200K ctx) 1M cached input tokens $0.200
Gemini 3.1 Pro Preview (>200K ctx) 1M cached input tokens $0.400
  • Input price doubles above 200K context window for both models
  • 2.5 Pro has separate thinking output rate; 3.1 Pro output includes thinking
  • Context caching storage: $4.50/M tokens per hour for 3.1 Pro

Compare Google Gemini API vs Alternatives

Before committing to Google Gemini API, compare pricing with these 3 alternatives in the same category.

All Google Gemini API alternatives & migration guides

What Companies Actually Pay for Google Gemini API

Median per-1M-token pricing across 51 models
Input $0.560/1M
Output $2.20/1M
Flagship models in this provider's catalog
Model Input /1M Output /1M Blended /1M
google_gemini-2-5-pro-06-05_ai-studio $1.25 $10.00 $3.44
google_gemini-2-5-flash-05-20_ai-studio $0.300 $2.50 $0.850
google_gemini-3-flash_ai-studio $0.500 $3.00 $1.13
google_gemini-2-5-flash-lite_ai-studio $0.100 $0.400 $0.175
google_gemini-2-0-flash_vertex $0.150 $0.600 $0.263
Review scores
Source: Artificial Analysis — medians aggregated from 51 models in this provider's catalog. Per-1M-token pricing reflects list rates.

How Google Gemini API Pricing Compares

Software Starting Price Top Price
Google Gemini API Free $18/per million tokens
Amazon Bedrock $0.07/per million tokens $75/per million tokens
Anyscale $0.15/per million tokens $5/per million tokens
Baidu ERNIE API $0.1/per million tokens $10/per million tokens
Cerebras Inference API $0.1/per million tokens $6/per million tokens
Claude API $0.03/per million tokens $75/per million tokens

4 Google Gemini API Hidden Costs Beyond the List Price

Beyond the listed price, Google Gemini API has at least 4 documented hidden costs that can significantly increase total cost of ownership.

Watch for 4 hidden costs
  • Google Search grounding adds $14 per 1,000 queries after 5,000 free/month
  • Audio input tokens billed at higher rate than text (e.g. $1.00 vs $0.30/M for Flash)
  • Prompts exceeding 200K tokens billed at 2x input rate on Gemini 2.5 Pro
  • Context caching storage billed separately per hour
Tip

Ask your Google Gemini API sales rep about these costs upfront. Getting them in writing before signing can save you from surprise charges later.

Full hidden costs breakdown →

Google Gemini API Contract Terms

Google Gemini API contracts do not auto-renew. Changes require Not applicable — pay-per-use, no subscription contract. These terms are sourced from verified buyer experiences.

Contract Terms
Auto-Renewal No
Cancellation Notice Not applicable — pay-per-use, no subscription contract
Minimum Commitment None — pay-per-use
Mid-Term Downgrade Allowed
Payment Terms Pay-per-use, billed by token consumption
Price Escalation No published price escalation schedule; Google may change per-token rates with notice
Note

No downgrade required; usage simply decreases

Google Gemini API Pricing FAQ

01 How much does the Google Gemini API cost?

Gemini API pricing varies by model. The cheapest option is Gemini 2.5 Flash-Lite at $0.10 per million input tokens and $0.40 per million output tokens. Gemini 2.5 Pro costs $1.25/$10.00 per million tokens (≤200K context). A free tier is available with up to 1,500 requests/day on Flash models via Google AI Studio.

02 Is the Gemini API free?

Yes, Google offers a free tier for the Gemini API through Google AI Studio. The free tier provides access to Flash models with up to 1,500 requests/day and free input/output tokens. Pro models also have a free tier but are rate-limited. For production use, you pay per token on the paid tier with no monthly minimum.

03 Gemini API vs OpenAI API: which is cheaper?

Gemini is generally cheaper than OpenAI for comparable models. Gemini 2.5 Flash at $0.30/$2.50 per million tokens is significantly cheaper than GPT-4o. Gemini 2.5 Pro at $1.25/$10.00 per million tokens undercuts GPT-4o pricing. For budget workloads, Gemini Flash-Lite at $0.10/$0.40 per million tokens has no OpenAI equivalent at that price.

04 What is context caching in the Gemini API?

Context caching lets you cache repeated prompt content (like system instructions or documents) and reuse it across multiple requests. Cached tokens are billed at roughly 90% discount compared to fresh input tokens. This is highly cost-effective for applications that repeatedly process the same large documents or instructions.

05 What is the Batch API discount on Gemini?

The Gemini API Batch API offers a 50% cost reduction on token pricing for asynchronous workloads. Batch requests are processed within 24 hours. This is ideal for offline data processing, bulk classification, or any task that doesn't require real-time responses.

06 Does the Google Gemini API have a free tier?

Yes. The Free tier provides access to Gemini models via AI Studio at no cost, subject to rate limits on requests per minute and per day. It is designed for prototyping and low-volume experimentation, not production-scale workloads.

07 How is the Google Gemini API billed on paid plans?

The Flash-Lite (Paid), Flash (Paid), and Pro (Paid) tiers are all billed on a per-token usage basis with no monthly subscription fee. According to Artificial Analysis data as of April 2026, the provider median across 51 tracked models is $0.56 per 1M input tokens and $2.20 per 1M output tokens, with individual models ranging from near-free (Gemma open models at $0) to premium (Gemini 2.5 Pro at $1.25/$10.00 per 1M input/output tokens).

08 What is the difference between Flash-Lite, Flash, and Pro tiers?

Flash-Lite (Paid) targets the lowest-cost, highest-throughput use cases. Flash (Paid) balances speed and capability for most production workloads. Pro (Paid) is the highest-capability tier suited for complex reasoning tasks. All three are strictly usage-based — there is no monthly minimum or subscription commitment.

09 Can I use Google Gemini API models for free indefinitely?

Yes, through the Free tier. Google provides free access to Gemini models via AI Studio with rate limits, and several Gemma open-weight models are available at $0 per token even on paid infrastructure, according to Artificial Analysis data (April 2026).

Is this pricing incorrect? — we'll verify and update it.