providers/Google
Google · Mountain View, CA

Google models on AIgateway — pricing, context, capabilities

Google ships 16 models on AIgateway spanning embedding, image, text, video. Call any of them via the OpenAI-compatible endpoint at api.aigateway.sh/v1 with one key. Pass-through inference pricing plus a 5% platform fee at credit top-up. No per-call markups, no seat fees, no minimum.

Get your key →See pricingVisit Google
models · 16modalities · embedding, image, text, videolocation · Mountain View, CA
embedding

Google embedding models

1 embedding model from Google.

Embeddinggemma-300m
google/embeddinggemma-300m
EmbeddingGemma is a 300M parameter, state-of-the-art for its size, open embedding model from Google, built from Gemma 3 (with T5Gemma initialization) and the same research and technology used to create Gemini models. EmbeddingGemma produces vector representations of text, making it well-suited for search and retrieval tasks, including classification, clustering, and semantic similarity search. This model was trained with data in 100+ spoken languages.
see pricing
image

Google image models

4 image models from Google.

Imagen 4
★ featured
google/imagen-4
Google's latest image generation model producing high-quality, photorealistic images from text prompts with support for multiple aspect ratios.
$0.040 / image
Nano Banana
google/nano-banana
Google's fast image generation model producing high-quality images from text prompts.
$0.300 in · $30.00 out / 1M
Nano Banana 2
google/nano-banana-2
Google's second-generation image generation model with improved quality and speed.
$0.500 in · $60.00 out / 1M
Nano Banana Pro
google/nano-banana-pro
Google's higher-quality image generation model with improved detail and prompt adherence.
$2.00 in · $120.00 out / 1M
text

Google text models

7 text models from Google.

Gemini 3.1 Pro
★ featured
google/gemini-3.1-pro
Google's most intelligent Gemini model with improved reasoning, a medium thinking level, and a 1M token context window.
$2.00 in · $12.00 out / 1M1,000,000 ctx
Gemini 3 Flash
★ featured
google/gemini-3-flash
Gemini 3 Flash is Google's fast multimodal model with frontier intelligence, superior search, and grounding capabilities.
$0.500 in · $3.00 out / 1M1,000,000 ctx
Gemini 3.1 Flash Lite
google/gemini-3.1-flash-lite
Google's lightest and most cost-efficient Gemini model for high-throughput tasks.
$0.250 in · $1.50 out / 1M1,000,000 ctx
Gemma-4-26b-A4b-IT
google/gemma-4-26b-a4b-it
Gemma 4 is Google's most intelligent family of open models, built from Gemini 3 research to maximize intelligence-per-parameter.
$0.100 in · $0.300 out / 1M256,000 ctx
Gemma-3-12b-IT
google/gemma-3-12b-it
Gemma 3 models are well-suited for a variety of text generation and image understanding tasks, including question answering, summarization, and reasoning. Gemma 3 models are multimodal, handling text and image input and generating text output, with a large, 128K context window, multilingual support in over 140 languages, and is available in more sizes than previous versions.
$0.350 in · $0.560 out / 1M80,000 ctx
Gemma-2b-IT-Lora
google/gemma-2b-it-lora
This is a Gemma-2B base model that Cloudflare dedicates for inference with LoRA adapters. Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.
$0.030 in · $0.060 out / 1M8,192 ctx
Gemma-7b-IT-Lora
google/gemma-7b-it-lora
This is a Gemma-7B base model that Cloudflare dedicates for inference with LoRA adapters. Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.
$0.080 in · $0.160 out / 1M3,500 ctx
video

Google video models

4 video models from Google.

Veo 3.1
★ featured
google/veo-3.1
Google's latest video generation model with improved quality, motion, and audio generation.
$0.400 / sec
Veo 3
google/veo-3
Google's video generation model capable of producing high-quality videos with optional audio from text prompts.
$0.200 / sec
Veo 3 Fast
google/veo-3-fast
A faster version of Veo 3 optimized for lower latency video generation with audio support.
$0.080 / sec
Veo 3.1 Fast
google/veo-3.1-fast
A faster version of Veo 3.1 optimized for lower latency while maintaining high-quality video and audio output.
$0.080 / sec
About Google

Who they are, what they focus on

Google DeepMind ships the Gemini family (2M-token context on Gemini 3.1 Pro), Imagen (image generation), Veo (video), and Gemma (open-weight). Gemini is the go-to when you need massive context windows or native multi-modal input.

Headquartered in Mountain View, CA. Homepage: ai.google.dev.

FAQ

Common questions about Google on AIgateway

Which Google models does AIgateway support?
AIgateway routes 16 Google models including Gemini 3.1 Pro. Full catalog with pricing and context windows is in the sections above.
How do I call a Google model from my code?
Point the OpenAI SDK at https://api.aigateway.sh/v1 with your AIgateway key and set model to the Google slug (e.g. "google/gemini-3.1-pro"). Request and response shapes are identical to OpenAI.
How much do Google models cost on AIgateway?
Pass-through Google pricing plus a 5% platform fee applied at credit top-up, not per call. No seat fees, no minimum beyond the $5 top-up floor.
Can I bring my own Google API key (BYOK)?
Yes. Attach your Google key in the AIgateway dashboard. Calls to Google models flip to pass-through and AIgateway waives the 5% platform fee on those calls.
Where is Google based?
Google is headquartered in Mountain View, CA.
Is there a free tier?
AIgateway's free tier is 100 requests/day on Kimi K2.6 — any account can test without a card. Paid Google models require a $5 minimum credit top-up.
Other providers

Browse other labs

AnthropicOpenAIxAIMoonshotDeepSeekMetaMistralAlibabaDeepgramBlack Forest LabsBAAIAll providers →