Qwen3.6-27B
Model card · coming soon
Planned dense flagship in the Qwen 3.6 line: all 27B parameters active, aimed at agentic and repository-scale coding, with hybrid attention patterns for long prompts. Not yet available on the public inference API: the API open beta is Liquid and Gemma only.
- Dense option: the full 27B stack is active for every token, which gives more straightforward behaviour than small-MoE routing.
- Coding-forward release: long files, real repos, agentic “fix this build” work.
- Per the public model cards: hybrid attention and ~256K-class context. We target 70–80 tok/s on our cluster profile.
Planned pricing
EUR · per 1M tokens
Input: €0.15
Output: €1.15
Target speed 70–80 tok/s · Context ~256K
Text / code-optimised · Apache 2.0 weights
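
As a rough illustration of how the planned figures combine, the sketch below estimates the cost and decode time of a single request. The function names and the example token counts are hypothetical. The prices and the 70–80 tok/s range are the planned targets listed above, not final values, and the time estimate covers output generation only (it ignores prompt processing and queueing).

```python
# Planned (not final) figures taken from this card.
INPUT_EUR_PER_M = 0.15       # EUR per 1M input tokens
OUTPUT_EUR_PER_M = 1.15      # EUR per 1M output tokens
TARGET_TOK_PER_S = (70, 80)  # targeted output speed range, tokens per second


def estimate_cost_eur(input_tokens: int, output_tokens: int) -> float:
    """Estimated request cost in EUR at the planned per-1M-token prices."""
    total = input_tokens * INPUT_EUR_PER_M + output_tokens * OUTPUT_EUR_PER_M
    return total / 1_000_000


def estimate_decode_seconds(output_tokens: int) -> tuple[float, float]:
    """(fastest, slowest) generation time in seconds at the target speeds."""
    slow_rate, fast_rate = TARGET_TOK_PER_S
    return output_tokens / fast_rate, output_tokens / slow_rate


if __name__ == "__main__":
    # Hypothetical request: a 200K-token repository prompt, 4K-token answer.
    cost = estimate_cost_eur(200_000, 4_000)
    fastest, slowest = estimate_decode_seconds(4_000)
    print(f"cost: EUR {cost:.4f}, decode: {fastest:.0f}-{slowest:.0f} s")
```

For that hypothetical request, the estimate comes to roughly €0.035, with about 50 to 57 seconds spent generating the answer.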