Compare

Llama-2-7b-Chat-HF-Lora — and what?

Pricing per million tokens, context window, capabilities — pulled from each provider's public docs. All 1 are available via the same AIgateway OpenAI-compatible endpoint; flip the model string to switch.

Search1/4
Llama-2-7b-Chat-HF-Lora
meta-llama/llama-2-7b-chat-hf-lora
Provider
Meta-llama
Family
Llama 2
Modality
text
Context window
4,096 tok
Max output
4,096 tok
Released
2024-04-02
License
Open-weight
Input price
$0.040 /1M
Output price
$0.080 /1M
Tools
Streaming
yes
Vision
JSON mode
Reasoning
Prompt caching
Batch API
Try it
Open in playground →
Llama-2-7b-Chat-HF-Lora
meta-llama/llama-2-7b-chat-hf-lora
Full spec →

This is a Llama2 base model that Cloudflare dedicated for inference with LoRA adapters. Llama 2 is a collection of pretrained and fine-tuned generative text models ranging in scale from 7 billion to 70 billion parameters. This is the repository for the 7B fine-tuned model, optimized for dialogue use cases and converted for the Hugging Face Transformers format.

Strengths
  • General-purpose chat
  • Long context
  • Tool use
Use cases
ChatbotsContent generationAgentic workflows