AI/EXPLORER
ToolsCategoriesSitesLLMsCompareAI QuizAlternativesPremium
—AI Tools
—Sites & Blogs
—LLMs & Models
—Categories
AI Explorer

Find and compare the best artificial intelligence tools for your projects.

Made within France

Explore

  • ›All tools
  • ›Sites & Blogs
  • ›LLMs & Models
  • ›Compare
  • ›Chatbots
  • ›AI Images
  • ›Code & Dev

Company

  • ›Premium
  • ›About
  • ›Contact
  • ›Blog

Legal

  • ›Legal notice
  • ›Privacy
  • ›Terms

© 2026 AI Explorer·All rights reserved.

HomeLLMsEmbeddingQwen3 Embedding 0.6B 250 v1 GGUF

Qwen3 Embedding 0.6B 250 v1 GGUF

by ENOSYS

Open source · 18k downloads · 1 likes

0.4
(1 reviews)EmbeddingAPI & Local
About

The Qwen3 Embedding 0.6B 250 v1 GGUF model is an optimized and quantized version of the Qwen3-Embedding-0.6B, specifically designed to generate high-quality text embeddings. Through advanced quantization and calibrated datasets, it achieves an optimal balance between performance and efficiency while remaining compatible with hardware architectures such as Nvidia Pascal GPUs (P100). Its key capabilities lie in producing compact and precise vector representations, suitable for tasks like semantic search, classification, or clustering. This model stands out for its lightweight design and speed, making it ideal for deployments in constrained environments or applications requiring low latency. It is particularly well-suited for developers seeking a high-performance solution for multilingual embeddings, especially in English and Russian.

Documentation

Experimental global target bits‑per‑weight quantization of unsloth/Qwen3-Embedding-0.6B

  • Using non-standard (forked) LLaMA C++ branch for quantization.
  • Using a CLI tool to build KLD evaluation and imatrix calibration datasets for GGUF models, sourced from eaddario/imatrix-calibration.
  • Using dataset sources: tools, text_en, text_ru.
  • Using dataset chunks: 250.
  • Tensors quantinization F16 instead of BF16, Nvidia Pascal architecture friendly like P100.
  • Small set of patches added.

Many thanks to Ed Addario for an impressive job.

Quantization comparison

BPWPPL correlationPPL mean ratioΔPPLMean KLDMaximum KLD99.9% KLDMean ΔpRMS Δp
5.0097.51%1.229589 ± 0.004913123.695152 ± 3.4555470.254131 ± 0.00124713.2818653.295894-0.945 ± 0.030 %7.861 ± 0.073 %
5.2597.93%1.198584 ± 0.004387106.990729 ± 3.0894640.201051 ± 0.00102016.1913722.680833-0.726 ± 0.027 %7.070 ± 0.070 %
5.3098.05%1.199072 ± 0.004269107.253986 ± 3.0676270.181907 ± 0.00092611.6424692.371547-0.680 ± 0.026 %6.811 ± 0.067 %
5.5098.42%1.143577 ± 0.00366577.354913 ± 2.4881640.132384 ± 0.0006939.0563721.854777-0.483 ± 0.022 %5.860 ± 0.061 %
5.7598.67%1.107662 ± 0.00326458.005137 ± 2.1179640.097616 ± 0.00059911.8061841.765460-0.368 ± 0.019 %5.023 ± 0.062 %
5.8098.72%1.117510 ± 0.00323863.310457 ± 2.1682670.092513 ± 0.00057810.5402981.608747-0.386 ± 0.019 %4.906 ± 0.059 %
6.0098.71%1.125959 ± 0.00327367.862792 ± 2.2349840.092148 ± 0.00058512.1289851.559407-0.435 ± 0.019 %4.931 ± 0.061 %
6.2598.89%1.082024 ± 0.00291244.191950 ± 1.8310290.067634 ± 0.0004317.8498681.006397-0.264 ± 0.016 %4.210 ± 0.055 %
6.3098.94%1.089430 ± 0.00288648.181944 ± 1.8655680.062879 ± 0.0003777.4679440.950639-0.262 ± 0.016 %4.112 ± 0.055 %
6.5099.07%1.110195 ± 0.00277359.369600 ± 1.9620200.046909 ± 0.0002735.7593210.702734-0.293 ± 0.013 %3.510 ± 0.044 %
6.7599.22%1.071155 ± 0.00247938.336145 ± 1.6141460.027021 ± 0.0002176.2928670.423712-0.069 ± 0.011 %2.756 ± 0.051 %
6.8099.23%1.079566 ± 0.00248342.867759 ± 1.6671530.026098 ± 0.0002056.7374930.397282-0.098 ± 0.010 %2.714 ± 0.051 %
7.0099.24%1.083987 ± 0.00249045.249386 ± 1.7061240.023685 ± 0.0001906.6084920.377540-0.084 ± 0.010 %2.559 ± 0.049 %
7.2599.27%1.088304 ± 0.00244747.575202 ± 1.7171730.019738 ± 0.0001302.8659250.332578-0.090 ± 0.009 %2.290 ± 0.037 %
7.3099.28%1.084746 ± 0.00243045.658594 ± 1.6862920.019120 ± 0.0001484.3435140.295245-0.092 ± 0.009 %2.291 ± 0.048 %
7.5099.29%1.085544 ± 0.00241146.088344 ± 1.6855290.017551 ± 0.0001424.9052270.298486-0.078 ± 0.008 %2.215 ± 0.050 %
7.7599.32%1.091225 ± 0.00238549.149079 ± 1.7146940.014173 ± 0.0001284.3465120.237379-0.101 ± 0.008 %2.013 ± 0.052 %
7.8099.29%1.083209 ± 0.00241044.830380 ± 1.6694550.016992 ± 0.0001333.1005090.285571-0.077 ± 0.008 %2.162 ± 0.041 %
8.0099.31%1.080260 ± 0.00237343.241571 ± 1.6360240.015367 ± 0.0001304.0350010.269038-0.060 ± 0.008 %2.071 ± 0.049 %
8.2599.33%1.088309 ± 0.00235247.578112 ± 1.6832320.012024 ± 0.0001054.3707510.192501-0.089 ± 0.007 %1.860 ± 0.044 %
8.3099.34%1.081929 ± 0.00232844.140973 ± 1.6288920.011498 ± 0.0000841.8619670.182563-0.075 ± 0.007 %1.784 ± 0.037 %
8.5099.36%1.078487 ± 0.00228642.286422 ± 1.5879020.009095 ± 0.0000812.4757200.147575-0.064 ± 0.006 %1.635 ± 0.048 %
8.7599.37%1.078905 ± 0.00227442.511791 ± 1.5870590.007852 ± 0.0000732.1636080.118868-0.054 ± 0.006 %1.535 ± 0.049 %
8.8099.37%1.078538 ± 0.00227342.314029 ± 1.5837330.007742 ± 0.0000792.6099570.127498-0.053 ± 0.006 %1.533 ± 0.053 %
9.0099.37%1.077841 ± 0.00226541.938514 ± 1.5757220.007380 ± 0.0000732.5470390.125235-0.050 ± 0.006 %1.499 ± 0.053 %
9.2599.37%1.075351 ± 0.00225640.596804 ± 1.5555390.006905 ± 0.0000662.0415340.103068-0.042 ± 0.006 %1.448 ± 0.044 %
9.3099.37%1.071370 ± 0.00224438.451639 ± 1.5244170.006834 ± 0.0000621.9591990.105753-0.031 ± 0.006 %1.444 ± 0.040 %
9.5099.38%1.073834 ± 0.00224439.779313 ± 1.5408460.006470 ± 0.0000621.9913400.098428-0.041 ± 0.005 %1.400 ± 0.041 %
9.7599.38%1.074517 ± 0.00224040.147460 ± 1.5439070.006206 ± 0.0000682.6387010.101805-0.038 ± 0.005 %1.388 ± 0.053 %
9.8099.38%1.076663 ± 0.00224441.303655 ± 1.5597740.006146 ± 0.0000692.4057360.094807-0.044 ± 0.005 %1.401 ± 0.057 %
10.0099.38%1.076016 ± 0.00224040.955259 ± 1.5538640.005796 ± 0.0000552.0591850.095681-0.049 ± 0.005 %1.313 ± 0.039 %
10.2599.39%1.074654 ± 0.00223040.220957 ± 1.5409220.005526 ± 0.0000612.1149430.099634-0.043 ± 0.005 %1.315 ± 0.050 %
10.3099.39%1.072069 ± 0.00222238.828737 ± 1.5199780.005418 ± 0.0000592.0626960.094137-0.038 ± 0.005 %1.288 ± 0.047 %
10.5099.39%1.070241 ± 0.00221237.843815 ± 1.5039870.005140 ± 0.0000481.6215030.087822-0.032 ± 0.005 %1.218 ± 0.028 %
10.7599.39%1.066075 ± 0.00219535.599290 ± 1.4705960.004415 ± 0.0000602.6664090.074945-0.011 ± 0.004 %1.152 ± 0.049 %
10.8099.40%1.066699 ± 0.00219535.935359 ± 1.4736750.004566 ± 0.0000512.6694650.074244-0.025 ± 0.004 %1.109 ± 0.017 %
11.0099.40%1.071004 ± 0.00220738.254894 ± 1.5092980.004029 ± 0.0000351.2805350.060520-0.019 ± 0.004 %1.112 ± 0.045 %
11.2599.40%1.071390 ± 0.00220138.462580 ± 1.5087640.003687 ± 0.0000240.6143540.054238-0.025 ± 0.004 %0.995 ± 0.011 %
11.3099.40%1.071772 ± 0.00220238.668624 ± 1.5117440.003665 ± 0.0000240.6137890.055209-0.023 ± 0.004 %0.984 ± 0.011 %
11.5099.40%1.072891 ± 0.00220039.271504 ± 1.5183240.003498 ± 0.0000220.4265720.052573-0.031 ± 0.004 %0.977 ± 0.013 %
11.7599.41%1.067707 ± 0.00218436.478332 ± 1.4789980.002967 ± 0.0000230.8033540.045490-0.001 ± 0.003 %0.895 ± 0.013 %
11.8099.41%1.065799 ± 0.00218035.450545 ± 1.4658110.002931 ± 0.0000210.6970180.0456690.006 ± 0.003 %0.899 ± 0.018 %
Capabilities & Tags
sentence-transformersgguftransformerssentence-similarityfeature-extractiontext-embeddings-inferenceenruendpoints_compatibleimatrix
Links & Resources
Specifications
CategoryEmbedding
AccessAPI & Local
LicenseOpen Source
PricingOpen Source
Parameters6B parameters
Rating
0.4

Try Qwen3 Embedding 0.6B 250 v1 GGUF

Access the model directly