AI/EXPLORER
OutilsCatégoriesSitesLLMsComparerQuiz IAAlternativesPremium
—Outils IA
—Sites & Blogs
—LLMs & Modèles
—Catégories
AI Explorer

Trouvez et comparez les meilleurs outils d'intelligence artificielle pour vos projets.

Fait avecen France

Explorer

  • ›Tous les outils
  • ›Sites & Blogs
  • ›LLMs & Modèles
  • ›Comparer
  • ›Chatbots
  • ›Images IA
  • ›Code & Dev

Entreprise

  • ›Premium
  • ›À propos
  • ›Contact
  • ›Blog

Légal

  • ›Mentions légales
  • ›Confidentialité
  • ›CGV

© 2026 AI Explorer·Tous droits réservés.

AccueilLLMsQwen3 Embedding 0.6B 250 v1 GGUF

Qwen3 Embedding 0.6B 250 v1 GGUF

par ENOSYS

Open source · 18k downloads · 1 likes

0.4
(1 avis)EmbeddingAPI & Local
À propos

Le modèle Qwen3 Embedding 0.6B 250 v1 GGUF est une version optimisée et quantifiée du modèle Qwen3-Embedding-0.6B, spécialement conçue pour générer des embeddings textuels de haute qualité. Grâce à une quantification avancée et à des jeux de données calibrés, il offre un équilibre optimal entre performance et efficacité, tout en restant compatible avec des architectures matérielles comme les GPU Nvidia Pascal (P100). Ses principales capacités résident dans la création de représentations vectorielles compactes et précises, adaptées à des tâches de recherche sémantique, de classification ou de clustering. Ce modèle se distingue par sa légèreté et sa rapidité, idéal pour des déploiements sur des environnements contraints ou des applications nécessitant une faible latence. Il est particulièrement adapté aux développeurs cherchant une solution performante pour des embeddings multilingues, notamment en anglais et en russe.

Documentation

Experimental global target bits‑per‑weight quantization of unsloth/Qwen3-Embedding-0.6B

  • Using non-standard (forked) LLaMA C++ branch for quantization.
  • Using a CLI tool to build KLD evaluation and imatrix calibration datasets for GGUF models, sourced from eaddario/imatrix-calibration.
  • Using dataset sources: tools, text_en, text_ru.
  • Using dataset chunks: 250.
  • Tensors quantinization F16 instead of BF16, Nvidia Pascal architecture friendly like P100.
  • Small set of patches added.

Many thanks to Ed Addario for an impressive job.

Quantization comparison

BPWPPL correlationPPL mean ratioΔPPLMean KLDMaximum KLD99.9% KLDMean ΔpRMS Δp
5.0097.51%1.229589 ± 0.004913123.695152 ± 3.4555470.254131 ± 0.00124713.2818653.295894-0.945 ± 0.030 %7.861 ± 0.073 %
5.2597.93%1.198584 ± 0.004387106.990729 ± 3.0894640.201051 ± 0.00102016.1913722.680833-0.726 ± 0.027 %7.070 ± 0.070 %
5.3098.05%1.199072 ± 0.004269107.253986 ± 3.0676270.181907 ± 0.00092611.6424692.371547-0.680 ± 0.026 %6.811 ± 0.067 %
5.5098.42%1.143577 ± 0.00366577.354913 ± 2.4881640.132384 ± 0.0006939.0563721.854777-0.483 ± 0.022 %5.860 ± 0.061 %
5.7598.67%1.107662 ± 0.00326458.005137 ± 2.1179640.097616 ± 0.00059911.8061841.765460-0.368 ± 0.019 %5.023 ± 0.062 %
5.8098.72%1.117510 ± 0.00323863.310457 ± 2.1682670.092513 ± 0.00057810.5402981.608747-0.386 ± 0.019 %4.906 ± 0.059 %
6.0098.71%1.125959 ± 0.00327367.862792 ± 2.2349840.092148 ± 0.00058512.1289851.559407-0.435 ± 0.019 %4.931 ± 0.061 %
6.2598.89%1.082024 ± 0.00291244.191950 ± 1.8310290.067634 ± 0.0004317.8498681.006397-0.264 ± 0.016 %4.210 ± 0.055 %
6.3098.94%1.089430 ± 0.00288648.181944 ± 1.8655680.062879 ± 0.0003777.4679440.950639-0.262 ± 0.016 %4.112 ± 0.055 %
6.5099.07%1.110195 ± 0.00277359.369600 ± 1.9620200.046909 ± 0.0002735.7593210.702734-0.293 ± 0.013 %3.510 ± 0.044 %
6.7599.22%1.071155 ± 0.00247938.336145 ± 1.6141460.027021 ± 0.0002176.2928670.423712-0.069 ± 0.011 %2.756 ± 0.051 %
6.8099.23%1.079566 ± 0.00248342.867759 ± 1.6671530.026098 ± 0.0002056.7374930.397282-0.098 ± 0.010 %2.714 ± 0.051 %
7.0099.24%1.083987 ± 0.00249045.249386 ± 1.7061240.023685 ± 0.0001906.6084920.377540-0.084 ± 0.010 %2.559 ± 0.049 %
7.2599.27%1.088304 ± 0.00244747.575202 ± 1.7171730.019738 ± 0.0001302.8659250.332578-0.090 ± 0.009 %2.290 ± 0.037 %
7.3099.28%1.084746 ± 0.00243045.658594 ± 1.6862920.019120 ± 0.0001484.3435140.295245-0.092 ± 0.009 %2.291 ± 0.048 %
7.5099.29%1.085544 ± 0.00241146.088344 ± 1.6855290.017551 ± 0.0001424.9052270.298486-0.078 ± 0.008 %2.215 ± 0.050 %
7.7599.32%1.091225 ± 0.00238549.149079 ± 1.7146940.014173 ± 0.0001284.3465120.237379-0.101 ± 0.008 %2.013 ± 0.052 %
7.8099.29%1.083209 ± 0.00241044.830380 ± 1.6694550.016992 ± 0.0001333.1005090.285571-0.077 ± 0.008 %2.162 ± 0.041 %
8.0099.31%1.080260 ± 0.00237343.241571 ± 1.6360240.015367 ± 0.0001304.0350010.269038-0.060 ± 0.008 %2.071 ± 0.049 %
8.2599.33%1.088309 ± 0.00235247.578112 ± 1.6832320.012024 ± 0.0001054.3707510.192501-0.089 ± 0.007 %1.860 ± 0.044 %
8.3099.34%1.081929 ± 0.00232844.140973 ± 1.6288920.011498 ± 0.0000841.8619670.182563-0.075 ± 0.007 %1.784 ± 0.037 %
8.5099.36%1.078487 ± 0.00228642.286422 ± 1.5879020.009095 ± 0.0000812.4757200.147575-0.064 ± 0.006 %1.635 ± 0.048 %
8.7599.37%1.078905 ± 0.00227442.511791 ± 1.5870590.007852 ± 0.0000732.1636080.118868-0.054 ± 0.006 %1.535 ± 0.049 %
8.8099.37%1.078538 ± 0.00227342.314029 ± 1.5837330.007742 ± 0.0000792.6099570.127498-0.053 ± 0.006 %1.533 ± 0.053 %
9.0099.37%1.077841 ± 0.00226541.938514 ± 1.5757220.007380 ± 0.0000732.5470390.125235-0.050 ± 0.006 %1.499 ± 0.053 %
9.2599.37%1.075351 ± 0.00225640.596804 ± 1.5555390.006905 ± 0.0000662.0415340.103068-0.042 ± 0.006 %1.448 ± 0.044 %
9.3099.37%1.071370 ± 0.00224438.451639 ± 1.5244170.006834 ± 0.0000621.9591990.105753-0.031 ± 0.006 %1.444 ± 0.040 %
9.5099.38%1.073834 ± 0.00224439.779313 ± 1.5408460.006470 ± 0.0000621.9913400.098428-0.041 ± 0.005 %1.400 ± 0.041 %
9.7599.38%1.074517 ± 0.00224040.147460 ± 1.5439070.006206 ± 0.0000682.6387010.101805-0.038 ± 0.005 %1.388 ± 0.053 %
9.8099.38%1.076663 ± 0.00224441.303655 ± 1.5597740.006146 ± 0.0000692.4057360.094807-0.044 ± 0.005 %1.401 ± 0.057 %
10.0099.38%1.076016 ± 0.00224040.955259 ± 1.5538640.005796 ± 0.0000552.0591850.095681-0.049 ± 0.005 %1.313 ± 0.039 %
10.2599.39%1.074654 ± 0.00223040.220957 ± 1.5409220.005526 ± 0.0000612.1149430.099634-0.043 ± 0.005 %1.315 ± 0.050 %
10.3099.39%1.072069 ± 0.00222238.828737 ± 1.5199780.005418 ± 0.0000592.0626960.094137-0.038 ± 0.005 %1.288 ± 0.047 %
10.5099.39%1.070241 ± 0.00221237.843815 ± 1.5039870.005140 ± 0.0000481.6215030.087822-0.032 ± 0.005 %1.218 ± 0.028 %
10.7599.39%1.066075 ± 0.00219535.599290 ± 1.4705960.004415 ± 0.0000602.6664090.074945-0.011 ± 0.004 %1.152 ± 0.049 %
10.8099.40%1.066699 ± 0.00219535.935359 ± 1.4736750.004566 ± 0.0000512.6694650.074244-0.025 ± 0.004 %1.109 ± 0.017 %
11.0099.40%1.071004 ± 0.00220738.254894 ± 1.5092980.004029 ± 0.0000351.2805350.060520-0.019 ± 0.004 %1.112 ± 0.045 %
11.2599.40%1.071390 ± 0.00220138.462580 ± 1.5087640.003687 ± 0.0000240.6143540.054238-0.025 ± 0.004 %0.995 ± 0.011 %
11.3099.40%1.071772 ± 0.00220238.668624 ± 1.5117440.003665 ± 0.0000240.6137890.055209-0.023 ± 0.004 %0.984 ± 0.011 %
11.5099.40%1.072891 ± 0.00220039.271504 ± 1.5183240.003498 ± 0.0000220.4265720.052573-0.031 ± 0.004 %0.977 ± 0.013 %
11.7599.41%1.067707 ± 0.00218436.478332 ± 1.4789980.002967 ± 0.0000230.8033540.045490-0.001 ± 0.003 %0.895 ± 0.013 %
11.8099.41%1.065799 ± 0.00218035.450545 ± 1.4658110.002931 ± 0.0000210.6970180.0456690.006 ± 0.003 %0.899 ± 0.018 %
Liens & Ressources
Spécifications
CatégorieEmbedding
AccèsAPI & Local
LicenceOpen Source
TarificationOpen Source
Paramètres6B parameters
Note
0.4

Essayer Qwen3 Embedding 0.6B 250 v1 GGUF

Accédez directement au modèle