Qwen3.5: Difference between revisions

From Akripedia
FKemeth (talk | contribs)

Revision as of 07:50, 7 April 2026

Qwen3.5
Developer: Alibaba Cloud
Release date: February 15, 2026
Model sizes: 0.8B, 2B, 4B, 9B, 27B (dense), 35B-A3B (MoE), 122B-A10B (MoE), 397B-A17B (MoE)
Architecture: Decoder-only Transformer with Gated Delta Networks + sparse MoE
Modality: Image-Text-to-Text
Thinking: Yes (toggleable)
Context length: 262,144 (up to 1M via API)
License: Apache 2.0
Languages: 201 languages and dialects
Hugging Face: Qwen 3.5
Paper: Link

Qwen3.5 is a series of open-weight, natively vision-language models developed by Alibaba Cloud and released on February 15, 2026.
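
The MoE size labels in the infobox (for example 35B-A3B) appear to follow the usual Qwen naming pattern, in which the first number is the total parameter count and the number after "A" is the count activated per token. The sketch below is illustrative only and rests on that reading; the helper names `ModelSize`, `parse_size`, and `active_fraction` are hypothetical and not part of any Qwen tooling.

```python
import re
from typing import NamedTuple, Optional


class ModelSize(NamedTuple):
    """Parameter counts in billions (hypothetical helper type)."""
    total_b: float
    active_b: Optional[float]  # None for dense models (no "-A..B" suffix)


# Matches labels such as "0.8B", "27B", or "35B-A3B".
_LABEL = re.compile(r"^(\d+(?:\.\d+)?)B(?:-A(\d+(?:\.\d+)?)B)?$")


def parse_size(label: str) -> ModelSize:
    """Split a size label into total and (assumed) activated parameters."""
    match = _LABEL.match(label.strip())
    if match is None:
        raise ValueError(f"unrecognized size label: {label!r}")
    total = float(match.group(1))
    active = float(match.group(2)) if match.group(2) else None
    return ModelSize(total, active)


def active_fraction(size: ModelSize) -> float:
    """Share of parameters used per token; 1.0 for dense models."""
    if size.active_b is None:
        return 1.0
    return size.active_b / size.total_b


# Size labels copied from the infobox.
SIZES = ["0.8B", "2B", "4B", "9B", "27B", "35B-A3B", "122B-A10B", "397B-A17B"]

if __name__ == "__main__":
    for label in SIZES:
        size = parse_size(label)
        kind = "dense" if size.active_b is None else "MoE"
        print(f"{label:>10}  {kind:<5}  active share = {active_fraction(size):.1%}")
```

Under this reading, the largest model activates only a small share of its weights per token (17B of 397B), which is the point of a sparse MoE design.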