Qwen/Qwen3.5-0.8B
About
Released: 2/25/2026Qwen/Qwen3.5-0.8B is a compact multimodal LLM with native vision capabilities. It excels in efficient general-purpose dialogue, instruction following, coding, mathematics, multilingual tasks, and vision-language understanding, while maintaining a very small parameter size suitable for resource-constrained environments, prototyping, and fine-tuning.
Some other noteworthy features of Qwen/Qwen3.5-0.8B include robust support for over 200 languages and dialects, long-context processing up to 262,144 tokens, image and video understanding, and a hybrid architecture using Gated Delta Networks with sparse Mixture-of-Experts.
| Metric | Value |
|---|---|
| Parameter Count | 0.8 billion |
| Mixture of Experts | No |
| Context Length | 262,144 tokens |
| Multilingual | Yes |
| Quantized* | No |
*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.