stacksherpa

API provider directory

Meta Llama

Open-weight LLM family from Meta. Llama 4 is the latest generation and uses a Mixture-of-Experts (MoE) architecture. Llama 4 Scout has 17B active parameters across 16 experts, supports a context window of up to 10M tokens (cited as the longest in the industry), and fits on a single H100 GPU when quantized to Int4. Llama 4 Maverick has 17B active parameters across 128 experts, scores roughly 1417 ELO on LMArena, and is reported to outperform GPT-4o and Grok 3. Llama 4 Behemoth (288B active parameters) is still in training. The family is natively multimodal (billed as a first for an open model) and its weights are fully open. Meta offers no direct API; access is through hosted providers such as Together AI, Groq, Fireworks, Cerebras, AWS Bedrock, and Azure.
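The "fits on a single H100" claim can be sanity-checked with simple arithmetic. The sketch below is a rough estimate only, under assumptions that are not in this listing: a total parameter count of about 109B for Llama 4 Scout (all experts must be resident in memory, even though only 17B parameters are active per token) and an 80 GB H100. It counts weight memory only and ignores the KV cache and activations. The function names are illustrative.

```python
# Rough weight-memory estimate for a Mixture-of-Experts model.
# Assumption (not from the listing): Llama 4 Scout has ~109B total parameters.
# Assumption: the H100 in question is the 80 GB variant.

H100_MEMORY_GB = 80.0

# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(total_params_billions: float, precision: str) -> float:
    """Approximate weight memory in GB (1 GB = 1e9 bytes).

    Ignores KV cache, activations, and runtime overhead.
    """
    return total_params_billions * BYTES_PER_PARAM[precision]


def fits_on_gpu(total_params_billions: float, precision: str,
                gpu_memory_gb: float = H100_MEMORY_GB) -> bool:
    """True if the weights alone are smaller than the GPU's memory."""
    return weight_memory_gb(total_params_billions, precision) < gpu_memory_gb


if __name__ == "__main__":
    scout_total_params_b = 109.0  # assumed total parameter count, in billions
    for precision in BYTES_PER_PARAM:
        size = weight_memory_gb(scout_total_params_b, precision)
        verdict = "fits" if fits_on_gpu(scout_total_params_b, precision) else "does not fit"
        print(f"{precision}: {size:.1f} GB of weights, {verdict} in {H100_MEMORY_GB:.0f} GB")
```

Under these assumptions only the 4-bit variant leaves headroom on one card, and a long context window would consume much of that headroom through the KV cache.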


Overview

Category: AI
Self-Hostable: Yes
On-Prem: No
Best For: startup, growth, enterprise
Last Verified: 2026-02-12

Strengths & Weaknesses

Strengths:
Weaknesses:

When to Use

Best when:
Avoid if:

Alternatives

together-ai, groq, fireworks-ai, deepseek