Compare

Embeddinggemma-300m vs Gemma-7b-IT-Lora

Pricing per million tokens, context window, capabilities — pulled from each provider's public docs. All 2 are available via the same AIgateway OpenAI-compatible endpoint; flip the model string to switch.

Search2/4
Embeddinggemma-300m
google/embeddinggemma-300m
Gemma-7b-IT-Lora
google/gemma-7b-it-lora
Provider
Google
Google
Family
Gemma
Gemma
Modality
embedding
text
Context window
4,096 tok
Max output
4,096 tok
Released
2025-09-04
2024-04-02
License
Open-weight
Open-weight
Input price
$0.020 /1M
$0.080 /1M
Output price
$0.0000 /1M
$0.160 /1M
Tools
Streaming
yes
Vision
yes
JSON mode
Reasoning
Prompt caching
Batch API
Try it
View model →
Open in playground →
Embeddinggemma-300m
google/embeddinggemma-300m
Full spec →

EmbeddingGemma is a 300M parameter, state-of-the-art for its size, open embedding model from Google, built from Gemma 3 (with T5Gemma initialization) and the same research and technology used to create Gemini models. EmbeddingGemma produces vector representations of text, making it well-suited for search and retrieval tasks, including classification, clustering, and semantic similarity search. This model was trained with data in 100+ spoken languages.

Strengths
  • Semantic similarity
  • Vector search
Use cases
RAGSemantic searchRecommendation
Gemma-7b-IT-Lora
google/gemma-7b-it-lora
Full spec →

This is a Gemma-7B base model that Cloudflare dedicates for inference with LoRA adapters. Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.

Strengths
  • General-purpose chat
  • Long context
  • Tool use
Use cases
ChatbotsContent generationAgentic workflows

Compare with another

Gemma-2b-IT-Lora vs Gemma-7b-IT-Lora
google/gemma-2b-it-lora · google/gemma-7b-it-lora
Embeddinggemma-300m vs Gemma-2b-IT-Lora
google/embeddinggemma-300m · google/gemma-2b-it-lora
Gemma-Sea-Lion-V4-27b-IT vs Gemma-7b-IT-Lora
aisingapore/gemma-sea-lion-v4-27b-it · google/gemma-7b-it-lora
Gemma-Sea-Lion-V4-27b-IT vs Embeddinggemma-300m
aisingapore/gemma-sea-lion-v4-27b-it · google/embeddinggemma-300m
Gemma-4-26b-A4b-IT vs Gemma-7b-IT-Lora
google/gemma-4-26b-a4b-it · google/gemma-7b-it-lora
Gemma-4-26b-A4b-IT vs Embeddinggemma-300m
google/gemma-4-26b-a4b-it · google/embeddinggemma-300m
SWITCH BETWEEN THEM

One key, all 2, one line different.

from openai import OpenAI

client = OpenAI(
    base_url="https://api.aigateway.sh/v1",
    api_key="sk-aig-...",
)

# Embeddinggemma-300m
client.chat.completions.create(
    model="google/embeddinggemma-300m",
    messages=[{"role":"user","content":"hello"}],
)

# Gemma-7b-IT-Lora
client.chat.completions.create(
    model="google/gemma-7b-it-lora",
    messages=[{"role":"user","content":"hello"}],
)
Get an AIgateway keyRun an eval on these →