by Qwen
Qwen3 0.6B Base is a compact member of the Qwen3 family, intended to make capable language modeling accessible on modest hardware. It is a pretrained causal language model for understanding and generating text, built with a focus on high-quality training data and model architecture. It is trained in three stages and supports a context length of up to 32,768 tokens, and it targets tasks such as comprehension, logical reasoning, code generation, and multilingual processing. Its main strengths are versatility, improved stability, and efficiency, which suit it to professional and educational applications such as automating responses, assisting with creative projects, and analyzing long documents.
Qwen3 is the latest generation of large language models in the Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. It builds on extensive advancements in training data, model architecture, and optimization techniques to improve on the previously released Qwen2.5.
Qwen3-0.6B-Base is the pretrained (base) causal language model at the 0.6B-parameter scale.
For more details, including benchmark evaluation, hardware requirements, and inference performance, please refer to our blog, GitHub, and Documentation.
Qwen3 support is included in the latest Hugging Face transformers release, and we advise you to use the latest version of transformers.
With transformers<4.51.0, you will encounter the following error:
KeyError: 'qwen3'
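The version requirement above can be checked before loading the model. The following is a minimal sketch, not an official recipe: the helper names (`parse_version`, `supports_qwen3`, `generate`) are hypothetical, the Hub repository id `Qwen/Qwen3-0.6B-Base` is an assumption, and the version parser ignores pre-release suffixes such as `.dev0`. Loading the model also assumes that PyTorch is installed and that the weights can be downloaded.

```python
import re
from importlib import metadata

# Minimum release named in the text: older versions raise KeyError: 'qwen3'.
MIN_TRANSFORMERS = (4, 51, 0)
# Assumed Hugging Face Hub repository id for this model.
MODEL_ID = "Qwen/Qwen3-0.6B-Base"


def parse_version(version: str) -> tuple:
    """Return the leading numeric components of a version string, padded to three.

    Simplification: suffixes such as 'rc1' or '.dev0' are ignored.
    """
    parts = []
    for piece in version.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts[:3])


def supports_qwen3(version: str) -> bool:
    """True if the given transformers version is at least 4.51.0."""
    return parse_version(version) >= MIN_TRANSFORMERS


def installed_transformers_version():
    """Return the installed transformers version string, or None if absent."""
    try:
        return metadata.version("transformers")
    except metadata.PackageNotFoundError:
        return None


def generate(prompt: str, max_new_tokens: int = 32) -> str:
    """Continue `prompt` with the base model after checking the version."""
    version = installed_transformers_version()
    if version is None or not supports_qwen3(version):
        raise RuntimeError(
            f"transformers>=4.51.0 is required for Qwen3, found: {version}"
        )
    # Imported lazily so the version check runs before any model code.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("The capital of France is")` would return the prompt followed by the model's continuation. Because this is a base model rather than an instruction-tuned one, plain text completion is the natural way to prompt it.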
Detailed evaluation results are reported in this 📑 blog.
If you find our work helpful, feel free to cite us.
@misc{qwen3technicalreport,
      title={Qwen3 Technical Report},
      author={Qwen Team},
      year={2025},
      eprint={2505.09388},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2505.09388},
}