qwen

Qwen/Qwen3.5-0.8B

Fine-tunable$0.0014/sec
multi-to-text
About
Released: 2/25/2026

Qwen/Qwen3.5-0.8B is a compact multimodal LLM with native vision capabilities. It excels in efficient general-purpose dialogue, instruction following, coding, mathematics, multilingual tasks, and vision-language understanding, while maintaining a very small parameter size suitable for resource-constrained environments, prototyping, and fine-tuning.

Some other noteworthy features of Qwen/Qwen3.5-0.8B include robust support for over 200 languages and dialects, long-context processing up to 262,144 tokens, image and video understanding, and a hybrid architecture using Gated Delta Networks with sparse Mixture-of-Experts.

MetricValue
Parameter Count0.8 billion
Mixture of ExpertsNo
Context Length262,144 tokens
MultilingualYes
Quantized*No

*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.