qwen3-8b

Public

The 8B variant of the Qwen3 family of models

2 Downloads

1 star

Capabilities

Minimum system memory

5GB

Tags

8B
qwen3

Last updated

Updated on May 3 by qwen

README

Qwen3 8B GGUF by qwen

Supports a context length of up to 131,072 tokens with YaRN (default 32k)

Supports /no_think to disable reasoning; just add it at the end of your prompt

Supports both thinking and non-thinking modes, with enhanced reasoning in both for significantly improved mathematics, coding, and commonsense reasoning

Excels at creative writing, role-playing, multi-turn dialogues, and instruction following

Advanced agent capabilities and support for over 100 languages and dialects
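
As a minimal sketch of the /no_think usage noted above, the helper below appends the tag to the end of a prompt. The function name `with_no_think` and the choice to separate the tag with a single space are illustrative assumptions, not part of the model or of any client library.

```python
NO_THINK_TAG = "/no_think"  # soft switch described in the README above


def with_no_think(prompt: str) -> str:
    """Return the prompt with /no_think appended at the end.

    Hypothetical helper: trailing whitespace is trimmed first, and the
    tag is not added a second time if the prompt already ends with it.
    """
    stripped = prompt.rstrip()
    if stripped.endswith(NO_THINK_TAG):
        return stripped
    return f"{stripped} {NO_THINK_TAG}"


if __name__ == "__main__":
    print(with_no_think("Summarize this paragraph in one sentence."))
```

The returned string can then be sent as the user message through whichever client is in use.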

Custom Fields

Special features defined by the model author

Enable Thinking: boolean (default=true)

Enable the model to think before answering.
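
When thinking is enabled, Qwen3 models typically emit their reasoning inside a `<think>...</think>` block before the final answer. The sketch below separates the two parts of a raw response; it assumes that tag format appears verbatim in the text you receive (some clients strip or restructure it), and `split_thinking` is a hypothetical helper name.

```python
import re

# Matches the first <think>...</think> block, including newlines inside it.
_THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_thinking(text: str) -> tuple[str, str]:
    """Split a raw response into (reasoning, answer).

    If no <think> block is present (for example, thinking was disabled),
    the reasoning part is an empty string.
    """
    match = _THINK_RE.search(text)
    if match is None:
        return "", text.strip()
    reasoning = match.group(1).strip()
    # The answer is everything outside the matched block.
    answer = (text[:match.start()] + text[match.end():]).strip()
    return reasoning, answer


if __name__ == "__main__":
    raw = "<think>\n2 + 2 is 4.\n</think>\n\nThe answer is 4."
    reasoning, answer = split_thinking(raw)
    print("Reasoning:", reasoning)
    print("Answer:", answer)
```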

Parameters

Custom configuration options included with this model

Min P Sampling: 0
Top K Sampling: 20
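
To illustrate what these two settings do, here is a small pure-Python sketch of top-k and min-p filtering over a list of token probabilities. It is not the runtime's actual sampler: the function name `filter_probs` and the order in which the two filters are applied are assumptions made for illustration. With Min P set to 0 the min-p threshold is 0, so that filter discards nothing, and Top K of 20 keeps only the 20 most probable tokens.

```python
def filter_probs(probs, top_k=20, min_p=0.0):
    """Apply top-k, then min-p filtering to token probabilities.

    Returns a list of the same length in which discarded tokens have
    probability 0.0 and the surviving tokens are renormalized to sum to 1.
    """
    # Top-k: keep the indices of the top_k most probable tokens.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    keep = set(ranked[:top_k])

    # Min-p: drop tokens whose probability is below min_p times the
    # probability of the most likely token. min_p = 0 disables this step.
    threshold = min_p * max(probs)
    keep = {i for i in keep if probs[i] >= threshold}

    total = sum(probs[i] for i in keep)
    return [probs[i] / total if i in keep else 0.0 for i in range(len(probs))]


if __name__ == "__main__":
    probs = [0.5, 0.3, 0.15, 0.05]
    print(filter_probs(probs, top_k=2, min_p=0.0))
    print(filter_probs(probs, top_k=20, min_p=0.2))
```

In the first call only the two most likely tokens survive; in the second, top-k keeps everything (there are fewer than 20 tokens) and min-p removes the token whose probability falls below 20% of the maximum.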

Sources

The underlying model files this model uses