Qwen3.5: Difference between revisions
Revision as of 07:50, 7 April 2026
| Qwen3.5 | |
|---|---|
| Developer | Alibaba Cloud |
| Release Date | February 15, 2026 |
| Model Sizes | 0.8B, 2B, 4B, 9B, 27B (dense); 35B-A3B, 122B-A10B, 397B-A17B (MoE) |
| Architecture | Decoder-only Transformer with Gated Delta Networks + sparse MoE |
| Modality | Image-Text-to-Text |
| Thinking | Yes (toggleable) |
| Context Length | 262,144 tokens (up to 1M via API) |
| License | Apache 2.0 |
| Languages | 201 languages and dialects |
| Hugging Face | Qwen 3.5 |
| Paper | Link |
Qwen3.5 is a series of open-weight, native vision-language models developed by Alibaba and released on February 15, 2026.
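
The size labels in the infobox follow the Qwen naming pattern in which an MoE label such as 397B-A17B gives the total parameter count followed by the parameters activated per token, while a dense label such as 27B gives a single count. The following is a minimal sketch of how those labels can be read programmatically. The helper names `parse_size` and `active_fraction` are hypothetical and are not part of any Qwen tooling, and the sketch assumes that every parameter of a dense model is active.

```python
import re

# Size labels as listed in the infobox.
SIZE_LABELS = ["0.8B", "2B", "4B", "9B", "27B",
               "35B-A3B", "122B-A10B", "397B-A17B"]

# "<total>B" optionally followed by "-A<active>B" (MoE models only).
_LABEL = re.compile(r"^(\d+(?:\.\d+)?)B(?:-A(\d+(?:\.\d+)?)B)?$")


def parse_size(label):
    """Return (total, active) parameter counts in billions.

    Hypothetical helper: for dense labels, active is assumed equal to total.
    """
    match = _LABEL.match(label)
    if match is None:
        raise ValueError(f"unrecognized size label: {label!r}")
    total = float(match.group(1))
    active = float(match.group(2)) if match.group(2) else total
    return total, active


def active_fraction(label):
    """Share of parameters activated per token, according to the label."""
    total, active = parse_size(label)
    return active / total


if __name__ == "__main__":
    for label in SIZE_LABELS:
        total, active = parse_size(label)
        print(f"{label}: total={total}B, active={active}B, "
              f"fraction={active_fraction(label):.3f}")
```

Read this way, the largest model activates 17B of its 397B parameters for each token, which is the sense in which the MoE layers are sparse.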