<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.akribic.com/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=FKemeth</id>
	<title>Akripedia - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.akribic.com/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=FKemeth"/>
	<link rel="alternate" type="text/html" href="https://wiki.akribic.com/index.php/Special:Contributions/FKemeth"/>
	<updated>2026-06-06T04:25:31Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.42.7</generator>
	<entry>
		<id>https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=19</id>
		<title>Qwen3.5</title>
		<link rel="alternate" type="text/html" href="https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=19"/>
		<updated>2026-04-07T13:19:42Z</updated>

		<summary type="html">&lt;p&gt;FKemeth: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
{{LLM Infobox&lt;br /&gt;
|name=Qwen 3.5&lt;br /&gt;
|developer=Alibaba Cloud&lt;br /&gt;
|release_date=February 15, 2026&lt;br /&gt;
|sizes=0.8B, 2B, 4B, 9B, 27B (dense), 35B-A3B (MoE), 122B-A10B (MoE), 397B-A17B (MoE)&lt;br /&gt;
|architecture=[[Decoder-only Transformer]]&lt;br /&gt;
|context_length=262,144 (up to 1M via API)&lt;br /&gt;
|modality=Image-Text-to-Text&lt;br /&gt;
|thinking=Yes (toggleable)&lt;br /&gt;
|license=Apache 2.0&lt;br /&gt;
|languages=201 languages and dialects&lt;br /&gt;
|hf_link=https://huggingface.co/Qwen&lt;br /&gt;
|paper_link=https://qwen.ai/blog?id=qwen3.5&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Qwen3.5&#039;&#039;&#039; is an open-weight and native vision-language foundation model series developed by Alibaba and released on February 15, 2026.&amp;lt;ref name=&amp;quot;qwen3.5-blog&amp;quot;&amp;gt;[https://qwen.ai/blog?id=qwen3.5 Qwen3.5: Towards Native Multimodal Agents]. Qwen Team, February 2026.&amp;lt;/ref&amp;gt; It is build on a hybrid architecture using linear attention with [[Gated Delta Networks]] as well as sparse [[Mixture of Experts]]. The models support 201 languages and dialects, compared to 119 of their earlier [[Qwen3]] model series.&lt;br /&gt;
Smaller versions up to 27B parameters are available as dense models, whereas the sizes 35B-A3B, 122B-A10B as well as the flagship model 397B-A17B use a [[Mixture of Experts]] architecture.&lt;br /&gt;
&lt;br /&gt;
== Benchmarks ==&lt;br /&gt;
&lt;br /&gt;
Results for the flagship &#039;&#039;&#039;397B-A17B&#039;&#039;&#039; and &#039;&#039;&#039;9b&#039;&#039;&#039;, &#039;&#039;&#039;4B&#039;&#039;&#039; as well as &#039;&#039;&#039;2B&#039;&#039;&#039; small models.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Benchmark !! Category !! 397B-A17B !! 9B !! 4B !! 2B !! Claude Opus 4.6&lt;br /&gt;
|-&lt;br /&gt;
| [[GPQA Diamond]]&amp;lt;ref&amp;gt;[https://artificialanalysis.ai/evaluations/gpqa-diamond?models=gemma-4-26b-a4b%2Cgemma-4-31b-non-reasoning%2Cgemma-4-e2b%2Cgemma-4-e4b-non-reasoning%2Cgemma-4-e4b%2Cgemma-4-e2b-non-reasoning%2Cclaude-opus-4-6-adaptive%2Cqwen3-5-2b%2Cqwen3-5-9b%2Cqwen3-5-397b-a17b%2Cqwen3-5-4b%2Cqwen3-5-2b-non-reasoning GPQA Diamond Benchmark Leaderboard: Results]. Artificial Analysis, April 2026.&amp;lt;/ref&amp;gt; || style=&amp;quot;text-align:right;&amp;quot; | Science || style=&amp;quot;text-align:right;&amp;quot; | 89.3 || 80.6 || 77.1 || -- || style=&amp;quot;text-align:right;&amp;quot; | 89.6&lt;br /&gt;
|-&lt;br /&gt;
| [[SWE-bench Verified]] || style=&amp;quot;text-align:right;&amp;quot; | Coding || style=&amp;quot;text-align:right;&amp;quot; | 76.4 || -- || -- || -- || style=&amp;quot;text-align:right;&amp;quot; | 80.8&lt;br /&gt;
|-&lt;br /&gt;
| [[MMMU-Pro]] || style=&amp;quot;text-align:right;&amp;quot; | Multimodal || style=&amp;quot;text-align:right;&amp;quot; | 79.0 || 70.1 || 66.3 || 50.3 || style=&amp;quot;text-align:right;&amp;quot; | 73.9&lt;br /&gt;
|-&lt;br /&gt;
| [[MMMLU]] || style=&amp;quot;text-align:right;&amp;quot; | Multilingual || style=&amp;quot;text-align:right;&amp;quot; | 88.5 || 81.2 || 76.1 || 63.1 ||style=&amp;quot;text-align:right;&amp;quot; | 91.1&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Tokenizer ==&lt;br /&gt;
&lt;br /&gt;
The Qwen3.5 tokenizer has a total of 248,077 tokens (up from 151,669 in [[Qwen3]]), out of which 248,044 stem from the [[BPE]] vocabulary size, and 33 are added tokens. Note that Qwen3.5 uses 248,320 embeddings in its embedding table, slightly more than the total token number of the tokenizer. This might be due to performance improvements (it is a multiple of 512) and leaves room for adding additional tokens in the future.&lt;br /&gt;
&lt;br /&gt;
Thinking can be &#039;&#039;&#039;enabled&#039;&#039;&#039; and &#039;&#039;&#039;disabled&#039;&#039;&#039; through the chat template, which either appends &amp;lt;syntaxhighlight inline=1&amp;gt;&amp;lt;think&amp;gt;\n&amp;lt;/syntaxhighlight&amp;gt; or &amp;lt;syntaxhighlight inline=1&amp;gt;&amp;lt;think&amp;gt;\n\n&amp;lt;/think&amp;gt;\n\n&amp;lt;/syntaxhighlight&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;syntaxhighlight lang=&amp;quot;python&amp;quot;&amp;gt;&lt;br /&gt;
from transformers import AutoTokenizer&lt;br /&gt;
&lt;br /&gt;
tokenizer = AutoTokenizer.from_pretrained(&amp;quot;Qwen/Qwen3.5-0.8B&amp;quot;)&lt;br /&gt;
&lt;br /&gt;
len(tokenizer)&lt;br /&gt;
# -&amp;gt; 248077&lt;br /&gt;
&lt;br /&gt;
messages = [{&amp;quot;role&amp;quot;: &amp;quot;user&amp;quot;, &amp;quot;content&amp;quot;: &amp;quot;Hi&amp;quot;}]&lt;br /&gt;
tokenizer.apply_chat_template(messages, enable_thinking=False, tokenize=False, add_generation_prompt=True)&lt;br /&gt;
# -&amp;gt; &#039;&amp;lt;|im_start|&amp;gt;user\nHi&amp;lt;|im_end|&amp;gt;\n&amp;lt;|im_start|&amp;gt;assistant\n&amp;lt;think&amp;gt;\n\n&amp;lt;/think&amp;gt;\n\n&#039;&lt;br /&gt;
&lt;br /&gt;
tokenizer.apply_chat_template(messages, enable_thinking=True, tokenize=False, add_generation_prompt=True)&lt;br /&gt;
# -&amp;gt; &#039;&amp;lt;|im_start|&amp;gt;user\nHi&amp;lt;|im_end|&amp;gt;\n&amp;lt;|im_start|&amp;gt;assistant\n&amp;lt;think&amp;gt;\n&#039;&lt;br /&gt;
&lt;br /&gt;
tokenizer.apply_chat_template(messages, enable_thinking=True, tokenize=True, add_generation_prompt=True)&lt;br /&gt;
# -&amp;gt; {&#039;input_ids&#039;: [248045, 846, 198, 12675, 248046, 198, 248045, 74455, 198, 248068, 198], &#039;attention_mask&#039;: [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]}&lt;br /&gt;
&amp;lt;/syntaxhighlight&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&amp;lt;references /&amp;gt;&lt;/div&gt;</summary>
		<author><name>FKemeth</name></author>
	</entry>
	<entry>
		<id>https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=18</id>
		<title>Qwen3.5</title>
		<link rel="alternate" type="text/html" href="https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=18"/>
		<updated>2026-04-07T13:15:09Z</updated>

		<summary type="html">&lt;p&gt;FKemeth: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
{{LLM Infobox&lt;br /&gt;
|name=Qwen 3.5&lt;br /&gt;
|developer=Alibaba Cloud&lt;br /&gt;
|release_date=February 15, 2026&lt;br /&gt;
|sizes=0.8B, 2B, 4B, 9B, 27B (dense), 35B-A3B (MoE), 122B-A10B (MoE), 397B-A17B (MoE)&lt;br /&gt;
|architecture=[[Decoder-only Transformer]]&lt;br /&gt;
|context_length=262,144 (up to 1M via API)&lt;br /&gt;
|modality=Image-Text-to-Text&lt;br /&gt;
|thinking=Yes (toggleable)&lt;br /&gt;
|license=Apache 2.0&lt;br /&gt;
|languages=201 languages and dialects&lt;br /&gt;
|hf_link=https://huggingface.co/Qwen&lt;br /&gt;
|paper_link=https://qwen.ai/blog?id=qwen3.5&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Qwen3.5&#039;&#039;&#039; is an open-weight and native vision-language foundation model series developed by Alibaba and released on February 15, 2026.&amp;lt;ref name=&amp;quot;qwen3.5-blog&amp;quot;&amp;gt;[https://qwen.ai/blog?id=qwen3.5 Qwen3.5: Towards Native Multimodal Agents]. Qwen Team, February 2026.&amp;lt;/ref&amp;gt; It is build on a hybrid architecture using linear attention with [[Gated Delta Networks]] as well as sparse [[Mixture of Experts]]. The models support 201 languages and dialects, compared to 119 of their earlier [[Qwen3]] model series.&lt;br /&gt;
&lt;br /&gt;
== Benchmarks ==&lt;br /&gt;
&lt;br /&gt;
Results for the flagship &#039;&#039;&#039;397B-A17B&#039;&#039;&#039; and &#039;&#039;&#039;9b&#039;&#039;&#039;, &#039;&#039;&#039;4B&#039;&#039;&#039; as well as &#039;&#039;&#039;2B&#039;&#039;&#039; small models.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Benchmark !! Category !! 397B-A17B !! 9B !! 4B !! 2B !! Claude Opus 4.6&lt;br /&gt;
|-&lt;br /&gt;
| [[GPQA Diamond]]&amp;lt;ref&amp;gt;[https://artificialanalysis.ai/evaluations/gpqa-diamond?models=gemma-4-26b-a4b%2Cgemma-4-31b-non-reasoning%2Cgemma-4-e2b%2Cgemma-4-e4b-non-reasoning%2Cgemma-4-e4b%2Cgemma-4-e2b-non-reasoning%2Cclaude-opus-4-6-adaptive%2Cqwen3-5-2b%2Cqwen3-5-9b%2Cqwen3-5-397b-a17b%2Cqwen3-5-4b%2Cqwen3-5-2b-non-reasoning GPQA Diamond Benchmark Leaderboard: Results]. Artificial Analysis, April 2026.&amp;lt;/ref&amp;gt; || style=&amp;quot;text-align:right;&amp;quot; | Science || style=&amp;quot;text-align:right;&amp;quot; | 89.3 || 80.6 || 77.1 || -- || style=&amp;quot;text-align:right;&amp;quot; | 89.6&lt;br /&gt;
|-&lt;br /&gt;
| [[SWE-bench Verified]] || style=&amp;quot;text-align:right;&amp;quot; | Coding || style=&amp;quot;text-align:right;&amp;quot; | 76.4 || -- || -- || -- || style=&amp;quot;text-align:right;&amp;quot; | 80.8&lt;br /&gt;
|-&lt;br /&gt;
| [[MMMU-Pro]] || style=&amp;quot;text-align:right;&amp;quot; | Multimodal || style=&amp;quot;text-align:right;&amp;quot; | 79.0 || 70.1 || 66.3 || 50.3 || style=&amp;quot;text-align:right;&amp;quot; | 73.9&lt;br /&gt;
|-&lt;br /&gt;
| [[MMMLU]] || style=&amp;quot;text-align:right;&amp;quot; | Multilingual || style=&amp;quot;text-align:right;&amp;quot; | 88.5 || 81.2 || 76.1 || 63.1 ||style=&amp;quot;text-align:right;&amp;quot; | 91.1&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Tokenizer ==&lt;br /&gt;
&lt;br /&gt;
The Qwen3.5 tokenizer has a total of 248,077 tokens (up from 151,669 in [[Qwen3]]), out of which 248,044 stem from the [[BPE]] vocabulary size, and 33 are added tokens. Note that Qwen3.5 uses 248,320 embeddings in its embedding table, slightly more than the total token number of the tokenizer. This might be due to performance improvements (it is a multiple of 512) and leaves room for adding additional tokens in the future.&lt;br /&gt;
&lt;br /&gt;
Thinking can be &#039;&#039;&#039;enabled&#039;&#039;&#039; and &#039;&#039;&#039;disabled&#039;&#039;&#039; through the chat template, which either appends &amp;lt;syntaxhighlight inline=1&amp;gt;&amp;lt;think&amp;gt;\n&amp;lt;/syntaxhighlight&amp;gt; or &amp;lt;syntaxhighlight inline=1&amp;gt;&amp;lt;think&amp;gt;\n\n&amp;lt;/think&amp;gt;\n\n&amp;lt;/syntaxhighlight&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;syntaxhighlight lang=&amp;quot;python&amp;quot;&amp;gt;&lt;br /&gt;
from transformers import AutoTokenizer&lt;br /&gt;
&lt;br /&gt;
tokenizer = AutoTokenizer.from_pretrained(&amp;quot;Qwen/Qwen3.5-0.8B&amp;quot;)&lt;br /&gt;
&lt;br /&gt;
len(tokenizer)&lt;br /&gt;
# -&amp;gt; 248077&lt;br /&gt;
&lt;br /&gt;
messages = [{&amp;quot;role&amp;quot;: &amp;quot;user&amp;quot;, &amp;quot;content&amp;quot;: &amp;quot;Hi&amp;quot;}]&lt;br /&gt;
tokenizer.apply_chat_template(messages, enable_thinking=False, tokenize=False, add_generation_prompt=True)&lt;br /&gt;
# -&amp;gt; &#039;&amp;lt;|im_start|&amp;gt;user\nHi&amp;lt;|im_end|&amp;gt;\n&amp;lt;|im_start|&amp;gt;assistant\n&amp;lt;think&amp;gt;\n\n&amp;lt;/think&amp;gt;\n\n&#039;&lt;br /&gt;
&lt;br /&gt;
tokenizer.apply_chat_template(messages, enable_thinking=True, tokenize=False, add_generation_prompt=True)&lt;br /&gt;
# -&amp;gt; &#039;&amp;lt;|im_start|&amp;gt;user\nHi&amp;lt;|im_end|&amp;gt;\n&amp;lt;|im_start|&amp;gt;assistant\n&amp;lt;think&amp;gt;\n&#039;&lt;br /&gt;
&lt;br /&gt;
tokenizer.apply_chat_template(messages, enable_thinking=True, tokenize=True, add_generation_prompt=True)&lt;br /&gt;
# -&amp;gt; {&#039;input_ids&#039;: [248045, 846, 198, 12675, 248046, 198, 248045, 74455, 198, 248068, 198], &#039;attention_mask&#039;: [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]}&lt;br /&gt;
&amp;lt;/syntaxhighlight&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&amp;lt;references /&amp;gt;&lt;/div&gt;</summary>
		<author><name>FKemeth</name></author>
	</entry>
	<entry>
		<id>https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=17</id>
		<title>Qwen3.5</title>
		<link rel="alternate" type="text/html" href="https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=17"/>
		<updated>2026-04-07T13:09:27Z</updated>

		<summary type="html">&lt;p&gt;FKemeth: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
{{LLM Infobox&lt;br /&gt;
|name=Qwen 3.5&lt;br /&gt;
|developer=Alibaba Cloud&lt;br /&gt;
|release_date=February 15, 2026&lt;br /&gt;
|sizes=0.8B, 2B, 4B, 9B, 27B (dense), 35B-A3B (MoE), 122B-A10B (MoE), 397B-A17B (MoE)&lt;br /&gt;
|architecture=[[Decoder-only Transformer]]&lt;br /&gt;
|context_length=262,144 (up to 1M via API)&lt;br /&gt;
|modality=Image-Text-to-Text&lt;br /&gt;
|thinking=Yes (toggleable)&lt;br /&gt;
|license=Apache 2.0&lt;br /&gt;
|languages=201 languages and dialects&lt;br /&gt;
|hf_link=https://huggingface.co/Qwen&lt;br /&gt;
|paper_link=https://qwen.ai/blog?id=qwen3.5&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Qwen3.5&#039;&#039;&#039; is an open-weight and native vision-language foundation model series developed by Alibaba and released on February 15, 2026.&amp;lt;ref name=&amp;quot;qwen3.5-blog&amp;quot;&amp;gt;[https://qwen.ai/blog?id=qwen3.5 Qwen3.5: Towards Native Multimodal Agents]. Qwen Team, February 2026.&amp;lt;/ref&amp;gt; It is build on a hybrid architecture using linear attention with [[Gated Delta Networks]] as well as sparse [[Mixture of Experts]]. The models support 201 languages and dialects, compared to 119 of their earlier [[Qwen3]] model series.&lt;br /&gt;
&lt;br /&gt;
== Benchmarks ==&lt;br /&gt;
&lt;br /&gt;
Results for the flagship &#039;&#039;&#039;397B-A17B&#039;&#039;&#039; and &#039;&#039;&#039;9b&#039;&#039;&#039;, &#039;&#039;&#039;4B&#039;&#039;&#039; as well as &#039;&#039;&#039;2B&#039;&#039;&#039; small models.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Benchmark !! Category !! 397B-A17B !! 9B !! 4B !! 2B !! Claude Opus 4.6&lt;br /&gt;
|-&lt;br /&gt;
| [[GPQA Diamond]]&amp;lt;ref&amp;gt;[https://artificialanalysis.ai/evaluations/gpqa-diamond?models=gemma-4-26b-a4b%2Cgemma-4-31b-non-reasoning%2Cgemma-4-e2b%2Cgemma-4-e4b-non-reasoning%2Cgemma-4-e4b%2Cgemma-4-e2b-non-reasoning%2Cclaude-opus-4-6-adaptive%2Cqwen3-5-2b%2Cqwen3-5-9b%2Cqwen3-5-397b-a17b%2Cqwen3-5-4b%2Cqwen3-5-2b-non-reasoning GPQA Diamond Benchmark Leaderboard: Results]. Artificial Analysis, April 2026.&amp;lt;/ref&amp;gt; || style=&amp;quot;text-align:right;&amp;quot; | Science || style=&amp;quot;text-align:right;&amp;quot; | 89.3 || 80.6 || 77.1 || -- || style=&amp;quot;text-align:right;&amp;quot; | 89.6&lt;br /&gt;
|-&lt;br /&gt;
| [[SWE-bench Verified]] || style=&amp;quot;text-align:right;&amp;quot; | Coding || style=&amp;quot;text-align:right;&amp;quot; | 76.4 || -- || -- || -- || style=&amp;quot;text-align:right;&amp;quot; | 80.8&lt;br /&gt;
|-&lt;br /&gt;
| [[MMMU-Pro]] || style=&amp;quot;text-align:right;&amp;quot; | Multimodal || style=&amp;quot;text-align:right;&amp;quot; | 79.0 || 70.1 || 66.3 || 50.3 || style=&amp;quot;text-align:right;&amp;quot; | 73.9&lt;br /&gt;
|-&lt;br /&gt;
| [[MMMLU]] || style=&amp;quot;text-align:right;&amp;quot; | Multilingual || style=&amp;quot;text-align:right;&amp;quot; | 88.5 || 81.2 || 76.1 || 63.1 ||style=&amp;quot;text-align:right;&amp;quot; | 91.1&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Tokenizer ==&lt;br /&gt;
&lt;br /&gt;
The Qwen3.5 tokenizer has a total of 248,077 tokens (up from 151,669 in [[Qwen3]]), out of which 248,044 stem from the [[BPE]] vocabulary size, and 33 are added tokens. Note that Qwen3.5 uses 248,320 embeddings in its embedding table, slightly more than the total token number of the tokenizer. This might be due to performance improvements (it is a multiple of 512) and leaves room for adding additional tokens in the future.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;syntaxhighlight lang=&amp;quot;python&amp;quot;&amp;gt;&lt;br /&gt;
from transformers import AutoTokenizer&lt;br /&gt;
&lt;br /&gt;
tokenizer = AutoTokenizer.from_pretrained(&amp;quot;Qwen/Qwen3.5-0.8B&amp;quot;)&lt;br /&gt;
&lt;br /&gt;
len(tokenizer)&lt;br /&gt;
# -&amp;gt; 248077&lt;br /&gt;
&lt;br /&gt;
messages = [{&amp;quot;role&amp;quot;: &amp;quot;user&amp;quot;, &amp;quot;content&amp;quot;: &amp;quot;Hi&amp;quot;}]&lt;br /&gt;
tokenizer.apply_chat_template(messages, enable_thinking=False, tokenize=False, add_generation_prompt=True)&lt;br /&gt;
# -&amp;gt; &#039;&amp;lt;|im_start|&amp;gt;user\nHi&amp;lt;|im_end|&amp;gt;\n&amp;lt;|im_start|&amp;gt;assistant\n&amp;lt;think&amp;gt;\n\n&amp;lt;/think&amp;gt;\n\n&#039;&lt;br /&gt;
&lt;br /&gt;
tokenizer.apply_chat_template(messages, enable_thinking=True, tokenize=False, add_generation_prompt=True)&lt;br /&gt;
# -&amp;gt; &#039;&amp;lt;|im_start|&amp;gt;user\nHi&amp;lt;|im_end|&amp;gt;\n&amp;lt;|im_start|&amp;gt;assistant\n&amp;lt;think&amp;gt;\n&#039;&lt;br /&gt;
&lt;br /&gt;
tokenizer.apply_chat_template(messages, enable_thinking=True, tokenize=True, add_generation_prompt=True)&lt;br /&gt;
# -&amp;gt; {&#039;input_ids&#039;: [248045, 846, 198, 12675, 248046, 198, 248045, 74455, 198, 248068, 198], &#039;attention_mask&#039;: [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]}&lt;br /&gt;
&amp;lt;/syntaxhighlight&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&amp;lt;references /&amp;gt;&lt;/div&gt;</summary>
		<author><name>FKemeth</name></author>
	</entry>
	<entry>
		<id>https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=16</id>
		<title>Qwen3.5</title>
		<link rel="alternate" type="text/html" href="https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=16"/>
		<updated>2026-04-07T12:57:10Z</updated>

		<summary type="html">&lt;p&gt;FKemeth: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
{{LLM Infobox&lt;br /&gt;
|name=Qwen 3.5&lt;br /&gt;
|developer=Alibaba Cloud&lt;br /&gt;
|release_date=February 15, 2026&lt;br /&gt;
|sizes=0.8B, 2B, 4B, 9B, 27B (dense), 35B-A3B (MoE), 122B-A10B (MoE), 397B-A17B (MoE)&lt;br /&gt;
|architecture=[[Decoder-only Transformer]]&lt;br /&gt;
|context_length=262,144 (up to 1M via API)&lt;br /&gt;
|modality=Image-Text-to-Text&lt;br /&gt;
|thinking=Yes (toggleable)&lt;br /&gt;
|license=Apache 2.0&lt;br /&gt;
|languages=201 languages and dialects&lt;br /&gt;
|hf_link=https://huggingface.co/Qwen&lt;br /&gt;
|paper_link=https://qwen.ai/blog?id=qwen3.5&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Qwen3.5&#039;&#039;&#039; is an open-weight and native vision-language foundation model series developed by Alibaba and released on February 15, 2026.&amp;lt;ref name=&amp;quot;qwen3.5-blog&amp;quot;&amp;gt;[https://qwen.ai/blog?id=qwen3.5 Qwen3.5: Towards Native Multimodal Agents]. Qwen Team, February 2026.&amp;lt;/ref&amp;gt; It is build on a hybrid architecture using linear attention with [[Gated Delta Networks]] as well as sparse [[Mixture of Experts]]. The models support 201 languages and dialects, compared to 119 of their earlier [[Qwen3]] model series.&lt;br /&gt;
&lt;br /&gt;
== Benchmarks ==&lt;br /&gt;
&lt;br /&gt;
Results for the flagship &#039;&#039;&#039;397B-A17B&#039;&#039;&#039; and &#039;&#039;&#039;9b&#039;&#039;&#039;, &#039;&#039;&#039;4B&#039;&#039;&#039; as well as &#039;&#039;&#039;2B&#039;&#039;&#039; small models.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Benchmark !! Category !! 397B-A17B !! 9B !! 4B !! 2B !! Claude Opus 4.6&lt;br /&gt;
|-&lt;br /&gt;
| [[GPQA Diamond]]&amp;lt;ref&amp;gt;[https://artificialanalysis.ai/evaluations/gpqa-diamond?models=gemma-4-26b-a4b%2Cgemma-4-31b-non-reasoning%2Cgemma-4-e2b%2Cgemma-4-e4b-non-reasoning%2Cgemma-4-e4b%2Cgemma-4-e2b-non-reasoning%2Cclaude-opus-4-6-adaptive%2Cqwen3-5-2b%2Cqwen3-5-9b%2Cqwen3-5-397b-a17b%2Cqwen3-5-4b%2Cqwen3-5-2b-non-reasoning GPQA Diamond Benchmark Leaderboard: Results]. Artificial Analysis, April 2026.&amp;lt;/ref&amp;gt; || style=&amp;quot;text-align:right;&amp;quot; | Science || style=&amp;quot;text-align:right;&amp;quot; | 89.3 || 80.6 || 77.1 || -- || style=&amp;quot;text-align:right;&amp;quot; | 89.6&lt;br /&gt;
|-&lt;br /&gt;
| [[SWE-bench Verified]] || style=&amp;quot;text-align:right;&amp;quot; | Coding || style=&amp;quot;text-align:right;&amp;quot; | 76.4 || -- || -- || -- || style=&amp;quot;text-align:right;&amp;quot; | 80.8&lt;br /&gt;
|-&lt;br /&gt;
| [[MMMU-Pro]] || style=&amp;quot;text-align:right;&amp;quot; | Multimodal || style=&amp;quot;text-align:right;&amp;quot; | 79.0 || 70.1 || 66.3 || 50.3 || style=&amp;quot;text-align:right;&amp;quot; | 73.9&lt;br /&gt;
|-&lt;br /&gt;
| [[MMMLU]] || style=&amp;quot;text-align:right;&amp;quot; | Multilingual || style=&amp;quot;text-align:right;&amp;quot; | 88.5 || 81.2 || 76.1 || 63.1 ||style=&amp;quot;text-align:right;&amp;quot; | 91.1&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Tokenizer ==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;syntaxhighlight lang=&amp;quot;python&amp;quot;&amp;gt;&lt;br /&gt;
from transformers import AutoTokenizer&lt;br /&gt;
&lt;br /&gt;
tokenizer = AutoTokenizer.from_pretrained(&amp;quot;Qwen/Qwen3.5-0.8B&amp;quot;)&lt;br /&gt;
&lt;br /&gt;
messages = [{&amp;quot;role&amp;quot;: &amp;quot;user&amp;quot;, &amp;quot;content&amp;quot;: &amp;quot;Hi&amp;quot;}]&lt;br /&gt;
tokenizer.apply_chat_template(messages, enable_thinking=False, tokenize=False, add_generation_prompt=True)&lt;br /&gt;
# -&amp;gt; &#039;&amp;lt;|im_start|&amp;gt;user\nHi&amp;lt;|im_end|&amp;gt;\n&amp;lt;|im_start|&amp;gt;assistant\n&amp;lt;think&amp;gt;\n\n&amp;lt;/think&amp;gt;\n\n&#039;&lt;br /&gt;
&lt;br /&gt;
tokenizer.apply_chat_template(messages, enable_thinking=True, tokenize=False, add_generation_prompt=True)&lt;br /&gt;
# -&amp;gt; &#039;&amp;lt;|im_start|&amp;gt;user\nHi&amp;lt;|im_end|&amp;gt;\n&amp;lt;|im_start|&amp;gt;assistant\n&amp;lt;think&amp;gt;\n&#039;&lt;br /&gt;
&lt;br /&gt;
tokenizer.apply_chat_template(messages, enable_thinking=True, tokenize=True, add_generation_prompt=True)&lt;br /&gt;
# -&amp;gt; {&#039;input_ids&#039;: [248045, 846, 198, 12675, 248046, 198, 248045, 74455, 198, 248068, 198], &#039;attention_mask&#039;: [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]}&lt;br /&gt;
&amp;lt;/syntaxhighlight&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&amp;lt;references /&amp;gt;&lt;/div&gt;</summary>
		<author><name>FKemeth</name></author>
	</entry>
	<entry>
		<id>https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=15</id>
		<title>Qwen3.5</title>
		<link rel="alternate" type="text/html" href="https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=15"/>
		<updated>2026-04-07T12:52:30Z</updated>

		<summary type="html">&lt;p&gt;FKemeth: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
{{LLM Infobox&lt;br /&gt;
|name=Qwen 3.5&lt;br /&gt;
|developer=Alibaba Cloud&lt;br /&gt;
|release_date=February 15, 2026&lt;br /&gt;
|sizes=0.8B, 2B, 4B, 9B, 27B (dense), 35B-A3B (MoE), 122B-A10B (MoE), 397B-A17B (MoE)&lt;br /&gt;
|architecture=[[Decoder-only Transformer]]&lt;br /&gt;
|context_length=262,144 (up to 1M via API)&lt;br /&gt;
|modality=Image-Text-to-Text&lt;br /&gt;
|thinking=Yes (toggleable)&lt;br /&gt;
|license=Apache 2.0&lt;br /&gt;
|languages=201 languages and dialects&lt;br /&gt;
|hf_link=https://huggingface.co/Qwen&lt;br /&gt;
|paper_link=https://qwen.ai/blog?id=qwen3.5&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Qwen3.5&#039;&#039;&#039; is an open-weight and native vision-language foundation model series developed by Alibaba and released on February 15, 2026.&amp;lt;ref name=&amp;quot;qwen3.5-blog&amp;quot;&amp;gt;[https://qwen.ai/blog?id=qwen3.5 Qwen3.5: Towards Native Multimodal Agents]. Qwen Team, February 2026.&amp;lt;/ref&amp;gt; It is build on a hybrid architecture using linear attention with [[Gated Delta Networks]] as well as sparse [[Mixture of Experts]]. The models support 201 languages and dialects, compared to 119 of their earlier [[Qwen3]] model series.&lt;br /&gt;
&lt;br /&gt;
== Benchmarks ==&lt;br /&gt;
&lt;br /&gt;
Results for the flagship &#039;&#039;&#039;397B-A17B&#039;&#039;&#039; and &#039;&#039;&#039;9b&#039;&#039;&#039;, &#039;&#039;&#039;4B&#039;&#039;&#039; as well as &#039;&#039;&#039;2B&#039;&#039;&#039; small models.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Benchmark !! Category !! 397B-A17B !! 9B !! 4B !! 2B !! Claude Opus 4.6&lt;br /&gt;
|-&lt;br /&gt;
| [[GPQA Diamond]]&amp;lt;ref&amp;gt;[https://artificialanalysis.ai/evaluations/gpqa-diamond?models=gemma-4-26b-a4b%2Cgemma-4-31b-non-reasoning%2Cgemma-4-e2b%2Cgemma-4-e4b-non-reasoning%2Cgemma-4-e4b%2Cgemma-4-e2b-non-reasoning%2Cclaude-opus-4-6-adaptive%2Cqwen3-5-2b%2Cqwen3-5-9b%2Cqwen3-5-397b-a17b%2Cqwen3-5-4b%2Cqwen3-5-2b-non-reasoning GPQA Diamond Benchmark Leaderboard: Results]. Artificial Analysis, April 2026.&amp;lt;/ref&amp;gt; || style=&amp;quot;text-align:right;&amp;quot; | Science || style=&amp;quot;text-align:right;&amp;quot; | 89.3 || 80.6 || 77.1 || -- || style=&amp;quot;text-align:right;&amp;quot; | 89.6&lt;br /&gt;
|-&lt;br /&gt;
| [[SWE-bench Verified]] || style=&amp;quot;text-align:right;&amp;quot; | Coding || style=&amp;quot;text-align:right;&amp;quot; | 76.4 || -- || -- || -- || style=&amp;quot;text-align:right;&amp;quot; | 80.8&lt;br /&gt;
|-&lt;br /&gt;
| [[MMMU-Pro]] || style=&amp;quot;text-align:right;&amp;quot; | Multimodal || style=&amp;quot;text-align:right;&amp;quot; | 79.0 || 70.1 || 66.3 || 50.3 || style=&amp;quot;text-align:right;&amp;quot; | 73.9&lt;br /&gt;
|-&lt;br /&gt;
| [[MMMLU]] || style=&amp;quot;text-align:right;&amp;quot; | Multilingual || style=&amp;quot;text-align:right;&amp;quot; | 88.5 || 81.2 || 76.1 || 63.1 ||style=&amp;quot;text-align:right;&amp;quot; | 91.1&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Tokenizer ==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;syntaxhighlight lang=&amp;quot;python&amp;quot;&amp;gt;&lt;br /&gt;
from transformers import AutoTokenizer&lt;br /&gt;
&lt;br /&gt;
tokenizer = AutoTokenizer.from_pretrained(&amp;quot;Qwen/Qwen3.5-0.8B&amp;quot;)&lt;br /&gt;
&lt;br /&gt;
tokenizer.apply_chat_template(messages, enable_thinking=False, tokenize=False, add_generation_prompt=True)&lt;br /&gt;
&amp;lt;/syntaxhighlight&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;Output:&#039;&#039;&#039;&lt;br /&gt;
&amp;lt;pre style=&amp;quot;background:#f0f0f0; border-left:3px solid #ccc; padding:8px;&amp;quot;&amp;gt;&lt;br /&gt;
&#039;&amp;lt;|im_start|&amp;gt;user\nHi&amp;lt;|im_end|&amp;gt;\n&amp;lt;|im_start|&amp;gt;assistant\n&amp;lt;think&amp;gt;\n\n&amp;lt;/think&amp;gt;\n\n&#039;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&amp;lt;references /&amp;gt;&lt;/div&gt;</summary>
		<author><name>FKemeth</name></author>
	</entry>
	<entry>
		<id>https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=14</id>
		<title>Qwen3.5</title>
		<link rel="alternate" type="text/html" href="https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=14"/>
		<updated>2026-04-07T12:51:45Z</updated>

		<summary type="html">&lt;p&gt;FKemeth: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
{{LLM Infobox&lt;br /&gt;
|name=Qwen 3.5&lt;br /&gt;
|developer=Alibaba Cloud&lt;br /&gt;
|release_date=February 15, 2026&lt;br /&gt;
|sizes=0.8B, 2B, 4B, 9B, 27B (dense), 35B-A3B (MoE), 122B-A10B (MoE), 397B-A17B (MoE)&lt;br /&gt;
|architecture=[[Decoder-only Transformer]]&lt;br /&gt;
|context_length=262,144 (up to 1M via API)&lt;br /&gt;
|modality=Image-Text-to-Text&lt;br /&gt;
|thinking=Yes (toggleable)&lt;br /&gt;
|license=Apache 2.0&lt;br /&gt;
|languages=201 languages and dialects&lt;br /&gt;
|hf_link=https://huggingface.co/Qwen&lt;br /&gt;
|paper_link=https://qwen.ai/blog?id=qwen3.5&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Qwen3.5&#039;&#039;&#039; is an open-weight and native vision-language foundation model series developed by Alibaba and released on February 15, 2026.&amp;lt;ref name=&amp;quot;qwen3.5-blog&amp;quot;&amp;gt;[https://qwen.ai/blog?id=qwen3.5 Qwen3.5: Towards Native Multimodal Agents]. Qwen Team, February 2026.&amp;lt;/ref&amp;gt; It is build on a hybrid architecture using linear attention with [[Gated Delta Networks]] as well as sparse [[Mixture of Experts]]. The models support 201 languages and dialects, compared to 119 of their earlier [[Qwen3]] model series.&lt;br /&gt;
&lt;br /&gt;
== Benchmarks ==&lt;br /&gt;
&lt;br /&gt;
Results for the flagship &#039;&#039;&#039;397B-A17B&#039;&#039;&#039; and &#039;&#039;&#039;9b&#039;&#039;&#039;, &#039;&#039;&#039;4B&#039;&#039;&#039; as well as &#039;&#039;&#039;2B&#039;&#039;&#039; small models.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Benchmark !! Category !! 397B-A17B !! 9B !! 4B !! 2B !! Claude Opus 4.6&lt;br /&gt;
|-&lt;br /&gt;
| [[GPQA Diamond]]&amp;lt;ref&amp;gt;[https://artificialanalysis.ai/evaluations/gpqa-diamond?models=gemma-4-26b-a4b%2Cgemma-4-31b-non-reasoning%2Cgemma-4-e2b%2Cgemma-4-e4b-non-reasoning%2Cgemma-4-e4b%2Cgemma-4-e2b-non-reasoning%2Cclaude-opus-4-6-adaptive%2Cqwen3-5-2b%2Cqwen3-5-9b%2Cqwen3-5-397b-a17b%2Cqwen3-5-4b%2Cqwen3-5-2b-non-reasoning GPQA Diamond Benchmark Leaderboard: Results]. Artificial Analysis, April 2026.&amp;lt;/ref&amp;gt; || style=&amp;quot;text-align:right;&amp;quot; | Science || style=&amp;quot;text-align:right;&amp;quot; | 89.3 || 80.6 || 77.1 || -- || style=&amp;quot;text-align:right;&amp;quot; | 89.6&lt;br /&gt;
|-&lt;br /&gt;
| [[SWE-bench Verified]] || style=&amp;quot;text-align:right;&amp;quot; | Coding || style=&amp;quot;text-align:right;&amp;quot; | 76.4 || -- || -- || -- || style=&amp;quot;text-align:right;&amp;quot; | 80.8&lt;br /&gt;
|-&lt;br /&gt;
| [[MMMU-Pro]] || style=&amp;quot;text-align:right;&amp;quot; | Multimodal || style=&amp;quot;text-align:right;&amp;quot; | 79.0 || 70.1 || 66.3 || 50.3 || style=&amp;quot;text-align:right;&amp;quot; | 73.9&lt;br /&gt;
|-&lt;br /&gt;
| [[MMMLU]] || style=&amp;quot;text-align:right;&amp;quot; | Multilingual || style=&amp;quot;text-align:right;&amp;quot; | 88.5 || 81.2 || 76.1 || 63.1 ||style=&amp;quot;text-align:right;&amp;quot; | 91.1&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Tokenizer ==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;syntaxhighlight lang=&amp;quot;python&amp;quot;&amp;gt;&lt;br /&gt;
from transformers import AutoTokenizer&lt;br /&gt;
&lt;br /&gt;
tokenizer = AutoTokenizer.from_pretrained(&amp;quot;Qwen/Qwen3.5-0.8B&amp;quot;)&lt;br /&gt;
&amp;lt;/syntaxhighlight&amp;gt;&lt;br /&gt;
&lt;br /&gt;
tokenizer.apply_chat_template(messages, enable_thinking=False, tokenize=False, add_generation_prompt=True)&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&amp;lt;references /&amp;gt;&lt;/div&gt;</summary>
		<author><name>FKemeth</name></author>
	</entry>
	<entry>
		<id>https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=13</id>
		<title>Qwen3.5</title>
		<link rel="alternate" type="text/html" href="https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=13"/>
		<updated>2026-04-07T09:03:16Z</updated>

		<summary type="html">&lt;p&gt;FKemeth: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
{{LLM Infobox&lt;br /&gt;
|name=Qwen 3.5&lt;br /&gt;
|developer=Alibaba Cloud&lt;br /&gt;
|release_date=February 15, 2026&lt;br /&gt;
|sizes=0.8B, 2B, 4B, 9B, 27B (dense), 35B-A3B (MoE), 122B-A10B (MoE), 397B-A17B (MoE)&lt;br /&gt;
|architecture=[[Decoder-only Transformer]]&lt;br /&gt;
|context_length=262,144 (up to 1M via API)&lt;br /&gt;
|modality=Image-Text-to-Text&lt;br /&gt;
|thinking=Yes (toggleable)&lt;br /&gt;
|license=Apache 2.0&lt;br /&gt;
|languages=201 languages and dialects&lt;br /&gt;
|hf_link=https://huggingface.co/Qwen&lt;br /&gt;
|paper_link=https://qwen.ai/blog?id=qwen3.5&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Qwen3.5&#039;&#039;&#039; is an open-weight and native vision-language foundation model series developed by Alibaba and released on February 15, 2026.&amp;lt;ref name=&amp;quot;qwen3.5-blog&amp;quot;&amp;gt;[https://qwen.ai/blog?id=qwen3.5 Qwen3.5: Towards Native Multimodal Agents]. Qwen Team, February 2026.&amp;lt;/ref&amp;gt; It is build on a hybrid architecture using linear attention with [[Gated Delta Networks]] as well as sparse [[Mixture of Experts]]. The models support 201 languages and dialects, compared to 119 of their earlier [[Qwen3]] model series.&lt;br /&gt;
&lt;br /&gt;
== Benchmarks ==&lt;br /&gt;
&lt;br /&gt;
Results for the flagship &#039;&#039;&#039;397B-A17B&#039;&#039;&#039; and &#039;&#039;&#039;9b&#039;&#039;&#039;, &#039;&#039;&#039;4B&#039;&#039;&#039; as well as &#039;&#039;&#039;2B&#039;&#039;&#039; small models.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Benchmark !! Category !! 397B-A17B !! 9B !! 4B !! 2B !! Claude Opus 4.6&lt;br /&gt;
|-&lt;br /&gt;
| [[GPQA Diamond]]&amp;lt;ref&amp;gt;[https://artificialanalysis.ai/evaluations/gpqa-diamond?models=gemma-4-26b-a4b%2Cgemma-4-31b-non-reasoning%2Cgemma-4-e2b%2Cgemma-4-e4b-non-reasoning%2Cgemma-4-e4b%2Cgemma-4-e2b-non-reasoning%2Cclaude-opus-4-6-adaptive%2Cqwen3-5-2b%2Cqwen3-5-9b%2Cqwen3-5-397b-a17b%2Cqwen3-5-4b%2Cqwen3-5-2b-non-reasoning GPQA Diamond Benchmark Leaderboard: Results]. Artificial Analysis, April 2026.&amp;lt;/ref&amp;gt; || style=&amp;quot;text-align:right;&amp;quot; | Science || style=&amp;quot;text-align:right;&amp;quot; | 89.3 || 80.6 || 77.1 || -- || style=&amp;quot;text-align:right;&amp;quot; | 89.6&lt;br /&gt;
|-&lt;br /&gt;
| [[SWE-bench Verified]] || style=&amp;quot;text-align:right;&amp;quot; | Coding || style=&amp;quot;text-align:right;&amp;quot; | 76.4 || -- || -- || -- || style=&amp;quot;text-align:right;&amp;quot; | 80.8&lt;br /&gt;
|-&lt;br /&gt;
| [[MMMU-Pro]] || style=&amp;quot;text-align:right;&amp;quot; | Multimodal || style=&amp;quot;text-align:right;&amp;quot; | 79.0 || 70.1 || 66.3 || 50.3 || style=&amp;quot;text-align:right;&amp;quot; | 73.9&lt;br /&gt;
|-&lt;br /&gt;
| [[MMMLU]] || style=&amp;quot;text-align:right;&amp;quot; | Multilingual || style=&amp;quot;text-align:right;&amp;quot; | 88.5 || 81.2 || 76.1 || 63.1 ||style=&amp;quot;text-align:right;&amp;quot; | 91.1&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&amp;lt;references /&amp;gt;&lt;/div&gt;</summary>
		<author><name>FKemeth</name></author>
	</entry>
	<entry>
		<id>https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=12</id>
		<title>Qwen3.5</title>
		<link rel="alternate" type="text/html" href="https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=12"/>
		<updated>2026-04-07T08:57:07Z</updated>

		<summary type="html">&lt;p&gt;FKemeth: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
{{LLM Infobox&lt;br /&gt;
|name=Qwen 3.5&lt;br /&gt;
|developer=Alibaba Cloud&lt;br /&gt;
|release_date=February 15, 2026&lt;br /&gt;
|sizes=0.8B, 2B, 4B, 9B, 27B (dense), 35B-A3B (MoE), 122B-A10B (MoE), 397B-A17B (MoE)&lt;br /&gt;
|architecture=[[Decoder-only Transformer]]&lt;br /&gt;
|context_length=262,144 (up to 1M via API)&lt;br /&gt;
|modality=Image-Text-to-Text&lt;br /&gt;
|thinking=Yes (toggleable)&lt;br /&gt;
|license=Apache 2.0&lt;br /&gt;
|languages=201 languages and dialects&lt;br /&gt;
|hf_link=https://huggingface.co/Qwen&lt;br /&gt;
|paper_link=https://qwen.ai/blog?id=qwen3.5&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Qwen3.5&#039;&#039;&#039; is an open-weight and native vision-language foundation model series developed by Alibaba and released on February 15, 2026.&amp;lt;ref name=&amp;quot;qwen3.5-blog&amp;quot;&amp;gt;[https://qwen.ai/blog?id=qwen3.5 Qwen3.5: Towards Native Multimodal Agents]. Qwen Team, February 2026.&amp;lt;/ref&amp;gt; It is build on a hybrid architecture using linear attention with [[Gated Delta Networks]] as well as sparse [[Mixture of Experts]]. The models support 201 languages and dialects, compared to 119 of their earlier [[Qwen3]] model series.&lt;br /&gt;
&lt;br /&gt;
== Benchmarks ==&lt;br /&gt;
&lt;br /&gt;
Results for the flagship &#039;&#039;&#039;397B-A17B&#039;&#039;&#039; and &#039;&#039;&#039;9b&#039;&#039;&#039;, &#039;&#039;&#039;4B&#039;&#039;&#039; as well as &#039;&#039;&#039;2B&#039;&#039;&#039; small models.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Benchmark !! Category !! 397B-A17B !! 9B !! 4B !! 2B !! Claude Opus 4.6&lt;br /&gt;
|-&lt;br /&gt;
| [[GPQA Diamond]]&amp;lt;ref&amp;gt;[https://artificialanalysis.ai/evaluations/gpqa-diamond?models=gemma-4-26b-a4b%2Cgemma-4-31b-non-reasoning%2Cgemma-4-e2b%2Cgemma-4-e4b-non-reasoning%2Cgemma-4-e4b%2Cgemma-4-e2b-non-reasoning%2Cclaude-opus-4-6-adaptive%2Cqwen3-5-2b%2Cqwen3-5-9b%2Cqwen3-5-397b-a17b%2Cqwen3-5-4b%2Cqwen3-5-2b-non-reasoning GPQA Diamond Benchmark Leaderboard: Results]. Artificial Analysis, April 2026.&amp;lt;/ref&amp;gt; || Science || 89.3 || 80.6 || 77.1 || -- || 89.6&lt;br /&gt;
|-&lt;br /&gt;
| [[SWE-bench Verified]] || Coding || 76.4 || -- || -- || -- || 80.8&lt;br /&gt;
|-&lt;br /&gt;
| [[MMMU-Pro]] || Multimodal || 79.0 || 70.1 || 66.3 || 50.3 || 73.9&lt;br /&gt;
|-&lt;br /&gt;
| [[MMMLU]] || Multilingual || 88.5 || 81.2 || 76.1 || 63.1 || 91.1&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&amp;lt;references /&amp;gt;&lt;/div&gt;</summary>
		<author><name>FKemeth</name></author>
	</entry>
	<entry>
		<id>https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=11</id>
		<title>Qwen3.5</title>
		<link rel="alternate" type="text/html" href="https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=11"/>
		<updated>2026-04-07T08:14:48Z</updated>

		<summary type="html">&lt;p&gt;FKemeth: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
{{LLM Infobox&lt;br /&gt;
|name=Qwen 3.5&lt;br /&gt;
|developer=Alibaba Cloud&lt;br /&gt;
|release_date=February 15, 2026&lt;br /&gt;
|sizes=0.8B, 2B, 4B, 9B, 27B (dense), 35B-A3B (MoE), 122B-A10B (MoE), 397B-A17B (MoE)&lt;br /&gt;
|architecture=[[Decoder-only Transformer]]&lt;br /&gt;
|context_length=262,144 (up to 1M via API)&lt;br /&gt;
|modality=Image-Text-to-Text&lt;br /&gt;
|thinking=Yes (toggleable)&lt;br /&gt;
|license=Apache 2.0&lt;br /&gt;
|languages=201 languages and dialects&lt;br /&gt;
|hf_link=https://huggingface.co/Qwen&lt;br /&gt;
|paper_link=https://qwen.ai/blog?id=qwen3.5&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Qwen3.5&#039;&#039;&#039; is an open-weight and native vision-language model series developed by Alibaba and released on February 15, 2026.&amp;lt;ref name=&amp;quot;qwen3.5-blog&amp;quot;&amp;gt;[https://qwen.ai/blog?id=qwen3.5 Qwen3.5: Towards Native Multimodal Agents]. Qwen Team, February 2026.&amp;lt;/ref&amp;gt; It is build on a hybrid architecture using linear attention with [[Gated Delta Networks]] as well as sparse [[Mixture of Experts]].&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&amp;lt;references /&amp;gt;&lt;/div&gt;</summary>
		<author><name>FKemeth</name></author>
	</entry>
	<entry>
		<id>https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=10</id>
		<title>Qwen3.5</title>
		<link rel="alternate" type="text/html" href="https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=10"/>
		<updated>2026-04-07T08:12:38Z</updated>

		<summary type="html">&lt;p&gt;FKemeth: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
{{LLM Infobox&lt;br /&gt;
|name=Qwen 3.5&lt;br /&gt;
|developer=Alibaba Cloud&lt;br /&gt;
|release_date=February 15, 2026&lt;br /&gt;
|sizes=0.8B, 2B, 4B, 9B, 27B (dense), 35B-A3B (MoE), 122B-A10B (MoE), 397B-A17B (MoE)&lt;br /&gt;
|architecture=[[Decoder-only Transformer]]&lt;br /&gt;
|context_length=262,144 (up to 1M via API)&lt;br /&gt;
|modality=Image-Text-to-Text&lt;br /&gt;
|thinking=Yes (toggleable)&lt;br /&gt;
|license=Apache 2.0&lt;br /&gt;
|languages=201 languages and dialects&lt;br /&gt;
|hf_link=https://huggingface.co/Qwen&lt;br /&gt;
|paper_link=https://qwen.ai/blog?id=qwen3.5&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Qwen3.5&#039;&#039;&#039; is an open-weight and native vision-language model series developed by Alibaba and released on February 15, 2026.&amp;lt;ref name=&amp;quot;qwen3.5-blog&amp;quot;&amp;gt;[https://qwen.ai/blog?id=qwen3.5 Qwen3.5: Towards Native Multimodal Agents]. Qwen Team, February 2026.&amp;lt;/ref&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&amp;lt;references /&amp;gt;&lt;/div&gt;</summary>
		<author><name>FKemeth</name></author>
	</entry>
	<entry>
		<id>https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=9</id>
		<title>Qwen3.5</title>
		<link rel="alternate" type="text/html" href="https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=9"/>
		<updated>2026-04-07T07:58:27Z</updated>

		<summary type="html">&lt;p&gt;FKemeth: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
{{LLM Infobox&lt;br /&gt;
|name=Qwen 3.5&lt;br /&gt;
|developer=Alibaba Cloud&lt;br /&gt;
|release_date=February 15, 2026&lt;br /&gt;
|sizes=0.8B, 2B, 4B, 9B, 27B (dense), 35B-A3B (MoE), 122B-A10B (MoE), 397B-A17B (MoE)&lt;br /&gt;
|architecture=[[Decoder-only Transformer]] with [[Gated Delta Networks]] + sparse [[Mixture of Experts|MoE]]&lt;br /&gt;
|context_length=262,144 (up to 1M via API)&lt;br /&gt;
|modality=Image-Text-to-Text&lt;br /&gt;
|thinking=Yes (toggleable)&lt;br /&gt;
|license=Apache 2.0&lt;br /&gt;
|languages=201 languages and dialects&lt;br /&gt;
|hf_link=https://huggingface.co/Qwen&lt;br /&gt;
|paper_link=https://qwen.ai/blog?id=qwen3.5&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Qwen3.5&#039;&#039;&#039; is an open-weight and native vision-language model series developed by Alibaba and released on February 15, 2026.&amp;lt;ref name=&amp;quot;qwen3.5-blog&amp;quot;&amp;gt;[https://qwen.ai/blog?id=qwen3.5 Qwen3.5: Towards Native Multimodal Agents]. Qwen Team, February 2026.&amp;lt;/ref&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&amp;lt;references /&amp;gt;&lt;/div&gt;</summary>
		<author><name>FKemeth</name></author>
	</entry>
	<entry>
		<id>https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=8</id>
		<title>Qwen3.5</title>
		<link rel="alternate" type="text/html" href="https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=8"/>
		<updated>2026-04-07T07:57:12Z</updated>

		<summary type="html">&lt;p&gt;FKemeth: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
{{LLM Infobox&lt;br /&gt;
|name=Qwen 3.5&lt;br /&gt;
|developer=Alibaba Cloud&lt;br /&gt;
|release_date=February 15, 2026&lt;br /&gt;
|sizes=0.8B, 2B, 4B, 9B, 27B (dense), 35B-A3B (MoE), 122B-A10B (MoE), 397B-A17B (MoE)&lt;br /&gt;
|architecture=[[Decoder-only Transformer]] with [[Gated Delta Networks]] + sparse [[Mixture of Experts|MoE]]&lt;br /&gt;
|context_length=262,144 (up to 1M via API)&lt;br /&gt;
|modality=Image-Text-to-Text&lt;br /&gt;
|thinking=Yes (toggleable)&lt;br /&gt;
|license=Apache 2.0&lt;br /&gt;
|languages=201 languages and dialects&lt;br /&gt;
|hf_link=https://huggingface.co/Qwen&lt;br /&gt;
|paper_link=https://qwen.ai/blog?id=qwen3.5&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Qwen3.5&#039;&#039;&#039; is an open-weight and native vision-language model series developed by Alibaba and released on February 15, 2026.&amp;lt;ref name=&amp;quot;qwen3.5-blog&amp;quot;&amp;gt;[https://qwen.ai/blog?id=qwen3.5 Qwen3.5: Towards Native Multimodal Agents]. Qwen Team, February 2026.&amp;lt;/ref&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
{{Reflist}}&lt;/div&gt;</summary>
		<author><name>FKemeth</name></author>
	</entry>
	<entry>
		<id>https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=7</id>
		<title>Qwen3.5</title>
		<link rel="alternate" type="text/html" href="https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=7"/>
		<updated>2026-04-07T07:55:29Z</updated>

		<summary type="html">&lt;p&gt;FKemeth: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
{{LLM Infobox&lt;br /&gt;
|name=Qwen 3.5&lt;br /&gt;
|developer=Alibaba Cloud&lt;br /&gt;
|release_date=February 15, 2026&lt;br /&gt;
|sizes=0.8B, 2B, 4B, 9B, 27B (dense), 35B-A3B (MoE), 122B-A10B (MoE), 397B-A17B (MoE)&lt;br /&gt;
|architecture=[[Decoder-only Transformer]] with [[Gated Delta Networks]] + sparse [[Mixture of Experts|MoE]]&lt;br /&gt;
|context_length=262,144 (up to 1M via API)&lt;br /&gt;
|modality=Image-Text-to-Text&lt;br /&gt;
|thinking=Yes (toggleable)&lt;br /&gt;
|license=Apache 2.0&lt;br /&gt;
|languages=201 languages and dialects&lt;br /&gt;
|hf_link=https://huggingface.co/Qwen&lt;br /&gt;
|paper_link=https://qwen.ai/blog?id=qwen3.5&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Qwen3.5&#039;&#039;&#039; is an open-weight and native vision-language model series developed by Alibaba and released on February 15, 2026.&amp;lt;ref name=&amp;quot;qwen-blog&amp;quot;&amp;gt;[https://qwen.ai/blog?id=qwen3.5 Qwen3.5: Towards Native Multimodal Agents]. Qwen Team, February 2026.&amp;lt;/ref&amp;gt;&lt;/div&gt;</summary>
		<author><name>FKemeth</name></author>
	</entry>
	<entry>
		<id>https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=6</id>
		<title>Qwen3.5</title>
		<link rel="alternate" type="text/html" href="https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=6"/>
		<updated>2026-04-07T07:50:50Z</updated>

		<summary type="html">&lt;p&gt;FKemeth: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
{{LLM Infobox&lt;br /&gt;
|name=Qwen 3.5&lt;br /&gt;
|developer=Alibaba Cloud&lt;br /&gt;
|release_date=February 15, 2026&lt;br /&gt;
|sizes=0.8B, 2B, 4B, 9B, 27B (dense), 35B-A3B (MoE), 122B-A10B (MoE), 397B-A17B (MoE)&lt;br /&gt;
|architecture=[[Decoder-only Transformer]] with [[Gated Delta Networks]] + sparse [[Mixture of Experts|MoE]]&lt;br /&gt;
|context_length=262,144 (up to 1M via API)&lt;br /&gt;
|modality=Image-Text-to-Text&lt;br /&gt;
|thinking=Yes (toggleable)&lt;br /&gt;
|license=Apache 2.0&lt;br /&gt;
|languages=201 languages and dialects&lt;br /&gt;
|hf_link=https://huggingface.co/Qwen&lt;br /&gt;
|paper_link=https://qwen.ai/blog?id=qwen3.5&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Qwen3.5&#039;&#039;&#039; is an open-weight and native vision-language model series developed by Alibaba and released on February 15, 2026.&lt;/div&gt;</summary>
		<author><name>FKemeth</name></author>
	</entry>
	<entry>
		<id>https://wiki.akribic.com/index.php?title=Template:LLM_Infobox&amp;diff=5</id>
		<title>Template:LLM Infobox</title>
		<link rel="alternate" type="text/html" href="https://wiki.akribic.com/index.php?title=Template:LLM_Infobox&amp;diff=5"/>
		<updated>2026-04-07T07:49:39Z</updated>

		<summary type="html">&lt;p&gt;FKemeth: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;float:right; margin-left:1em; width:320px; font-size:90%;&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! colspan=&amp;quot;2&amp;quot; style=&amp;quot;background:#cee0f2; font-size:110%;&amp;quot; | {{{name}}}&lt;br /&gt;
|-&lt;br /&gt;
{{#if:{{{image|}}}|&lt;br /&gt;
! colspan=&amp;quot;2&amp;quot; style=&amp;quot;text-align:center;&amp;quot; {{!}} [[File:{{{image}}}|250px]]&lt;br /&gt;
{{!}}-&lt;br /&gt;
}}&lt;br /&gt;
|-&lt;br /&gt;
! Developer&lt;br /&gt;
| {{{developer}}}&lt;br /&gt;
|-&lt;br /&gt;
! Release Date&lt;br /&gt;
| {{{release_date}}}&lt;br /&gt;
|-&lt;br /&gt;
! Model Sizes&lt;br /&gt;
| {{{sizes}}}&lt;br /&gt;
|-&lt;br /&gt;
! Architecture&lt;br /&gt;
| {{{architecture}}}&lt;br /&gt;
|-&lt;br /&gt;
! Modality&lt;br /&gt;
| {{{modality}}}&lt;br /&gt;
|-&lt;br /&gt;
! Thinking&lt;br /&gt;
| {{{thinking}}}&lt;br /&gt;
|-&lt;br /&gt;
! Context Length&lt;br /&gt;
| {{{context_length}}}&lt;br /&gt;
|-&lt;br /&gt;
! License&lt;br /&gt;
| {{{license}}}&lt;br /&gt;
|-&lt;br /&gt;
! Languages&lt;br /&gt;
| {{{languages|}}}&lt;br /&gt;
|-&lt;br /&gt;
! Hugging Face&lt;br /&gt;
| [{{{hf_link}}} {{{name}}}]&lt;br /&gt;
|-&lt;br /&gt;
{{#if:{{{paper_link|}}}|&lt;br /&gt;
! Paper&lt;br /&gt;
{{!}} [{{{paper_link}}} Link]&lt;br /&gt;
{{!}}-&lt;br /&gt;
}}&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>FKemeth</name></author>
	</entry>
	<entry>
		<id>https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=4</id>
		<title>Qwen3.5</title>
		<link rel="alternate" type="text/html" href="https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=4"/>
		<updated>2026-04-07T07:48:42Z</updated>

		<summary type="html">&lt;p&gt;FKemeth: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
{{LLM Infobox&lt;br /&gt;
|name=Qwen 3.5&lt;br /&gt;
|developer=Alibaba Cloud&lt;br /&gt;
|release_date=February 15, 2026&lt;br /&gt;
|sizes=0.8B, 2B, 4B, 9B, 27B (dense), 35B-A3B (MoE), 122B-A10B (MoE), 397B-A17B (MoE)&lt;br /&gt;
|architecture=[[Decoder-only Transformer]] with [[Gated Delta Networks]] + sparse [[Mixture of Experts|MoE]]&lt;br /&gt;
|context_length=262,144 (up to 1M via API)&lt;br /&gt;
|license=Apache 2.0&lt;br /&gt;
|languages=201 languages and dialects&lt;br /&gt;
|hf_link=https://huggingface.co/Qwen&lt;br /&gt;
|paper_link=https://qwen.ai/blog?id=qwen3.5&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Qwen3.5&#039;&#039;&#039; is an open-weight and native vision-language model series developed by Alibaba and released on February 15, 2026.&lt;/div&gt;</summary>
		<author><name>FKemeth</name></author>
	</entry>
	<entry>
		<id>https://wiki.akribic.com/index.php?title=Template:LLM_Infobox&amp;diff=3</id>
		<title>Template:LLM Infobox</title>
		<link rel="alternate" type="text/html" href="https://wiki.akribic.com/index.php?title=Template:LLM_Infobox&amp;diff=3"/>
		<updated>2026-04-07T07:39:21Z</updated>

		<summary type="html">&lt;p&gt;FKemeth: Created page with &amp;quot;{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;float:right; margin-left:1em; width:320px; font-size:90%;&amp;quot; |- ! colspan=&amp;quot;2&amp;quot; style=&amp;quot;background:#cee0f2; font-size:110%;&amp;quot; | {{{name}}} |- {{#if:{{{image|}}}| ! colspan=&amp;quot;2&amp;quot; style=&amp;quot;text-align:center;&amp;quot; {{!}} 250px {{!}}- }} |- ! Developer | {{{developer}}} |- ! Release Date | {{{release_date}}} |- ! Model Sizes | {{{sizes}}} |- ! Architecture | {{{architecture}}} |- ! Context Length | {{{context_length}}} |- ! License | {{{lice...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;float:right; margin-left:1em; width:320px; font-size:90%;&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! colspan=&amp;quot;2&amp;quot; style=&amp;quot;background:#cee0f2; font-size:110%;&amp;quot; | {{{name}}}&lt;br /&gt;
|-&lt;br /&gt;
{{#if:{{{image|}}}|&lt;br /&gt;
! colspan=&amp;quot;2&amp;quot; style=&amp;quot;text-align:center;&amp;quot; {{!}} [[File:{{{image}}}|250px]]&lt;br /&gt;
{{!}}-&lt;br /&gt;
}}&lt;br /&gt;
|-&lt;br /&gt;
! Developer&lt;br /&gt;
| {{{developer}}}&lt;br /&gt;
|-&lt;br /&gt;
! Release Date&lt;br /&gt;
| {{{release_date}}}&lt;br /&gt;
|-&lt;br /&gt;
! Model Sizes&lt;br /&gt;
| {{{sizes}}}&lt;br /&gt;
|-&lt;br /&gt;
! Architecture&lt;br /&gt;
| {{{architecture}}}&lt;br /&gt;
|-&lt;br /&gt;
! Context Length&lt;br /&gt;
| {{{context_length}}}&lt;br /&gt;
|-&lt;br /&gt;
! License&lt;br /&gt;
| {{{license}}}&lt;br /&gt;
|-&lt;br /&gt;
! Languages&lt;br /&gt;
| {{{languages|}}}&lt;br /&gt;
|-&lt;br /&gt;
! Hugging Face&lt;br /&gt;
| [{{{hf_link}}} {{{name}}}]&lt;br /&gt;
|-&lt;br /&gt;
{{#if:{{{paper_link|}}}|&lt;br /&gt;
! Paper&lt;br /&gt;
{{!}} [{{{paper_link}}} Link]&lt;br /&gt;
{{!}}-&lt;br /&gt;
}}&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>FKemeth</name></author>
	</entry>
	<entry>
		<id>https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=2</id>
		<title>Qwen3.5</title>
		<link rel="alternate" type="text/html" href="https://wiki.akribic.com/index.php?title=Qwen3.5&amp;diff=2"/>
		<updated>2026-04-07T07:38:18Z</updated>

		<summary type="html">&lt;p&gt;FKemeth: Created page with &amp;quot;&amp;#039;&amp;#039;&amp;#039;Qwen3.5&amp;#039;&amp;#039;&amp;#039; is an open-weight and native vision-language model series developed by Alibaba and released on 2026/02/15.&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&#039;&#039;&#039;Qwen3.5&#039;&#039;&#039; is an open-weight and native vision-language model series developed by Alibaba and released on 2026/02/15.&lt;/div&gt;</summary>
		<author><name>FKemeth</name></author>
	</entry>
</feed>