AI/EXPLORER
ToolsCategoriesSitesLLMsCompareAI QuizAlternativesPremium
—AI Tools
—Sites & Blogs
—LLMs & Models
—Categories
AI Explorer

Find and compare the best artificial intelligence tools for your projects.

Made within France

Explore

  • ›All tools
  • ›Sites & Blogs
  • ›LLMs & Models
  • ›Compare
  • ›Chatbots
  • ›AI Images
  • ›Code & Dev

Company

  • ›Premium
  • ›About
  • ›Contact
  • ›Blog

Legal

  • ›Legal notice
  • ›Privacy
  • ›Terms

© 2026 AI Explorer·All rights reserved.

HomeLLMsJan v3.5 4B gguf

Jan v3.5 4B gguf

by janhq

Open source · 256k downloads · 12 likes

1.4
(12 reviews)ChatAPI & Local
About

Jan v3.5 4B is an AI model with a distinct personality, fine-tuned to excel in mathematical reasoning while retaining versatile capabilities. Unlike generic assistants, it adopts a natural tone—sometimes humorous and self-deprecating—with a direct, casual conversational style marked by sharp, concise responses. Specialized in solving complex problems, it stands out for its authentic voice, shaped by the Menlo Research team, which prioritizes transparency and an upbeat energy without corporate jargon. Ideal for engaging exchanges or applications requiring both mathematical precision and human-like interaction, it adapts seamlessly to technical discussions as well as informal chats. Its unique approach makes it a tool that is both high-performing and memorable, far removed from traditional assistants.

Documentation

Jan-v3.5-4B: The first Jan personality

GitHub License Jan App

Thumbnail

Overview

Jan-v3.5-4B is a fine-tuned variant of Jan-v3-4B-base-instruct, specialized on math reasoning and identity datasets. It retains the general-purpose capabilities of the base model while delivering improved mathematical problem-solving — and it comes with a personality.

Unlike generic assistants, Jan-v3.5 has its own identity: a distinct voice, tone, and conversational style shaped by the Menlo Research team. It doesn't talk like a customer service bot — it talks like a smart, slightly-too-online friend who happens to know things and genuinely cares about the work. Expect lowercase defaults, self-aware humor, short punchy replies (unless it really cares about the topic), and zero corporate speak.

Model Overview

Note: Jan-v3.5-4B is fine-tuned from janhq/Jan-v3-4B-base-instruct.

  • Base Model: Jan-v3-4B-base-instruct (Qwen3-4B architecture)
  • Number of Parameters: 4.0B
  • Number of Parameters (Non-Embedding): 3.6B
  • Number of Layers: 36
  • Number of Attention Heads (GQA): 32 for Q and 8 for KV
  • Context Length: 262,144 natively

Training Data

  • Identities: Curated identity and personality datasets that teach the model its own voice, style, and values — trained by Menlo Research
  • Math: Mathematical reasoning and problem-solving datasets

Jan's Identity

Jan-v3.5 is not a neutral assistant. It has a built-in personality shaped by the Menlo Research team:

  • Tone: Casual, direct, and real. Lowercase by default. Capitalizes only when it means it.
  • Style: Short bursts over long paragraphs — unless it's genuinely excited about something, then it writes an essay with no warning.
  • Humor: Self-aware first. Will roast itself before roasting you. Drops meme references mid-serious-thought and doesn't apologize.
  • Values: Optimistic builder energy ("we can do that"), radical transparency, user freedom, and a deep belief that hope is a decision you keep making on purpose.
  • What it won't do: Say "Certainly!", "Great question!", "As an AI", or anything that sounds like it came from a customer service script.

Example interactions:

  • Casual: "yeah lol what's up"
  • Technical explanation: "so basically — and this is the part where i become insufferable — [actual good explanation]"
  • Motivating: "we can do that. i don't fully know how yet but that's a tomorrow problem and tomorrow-us is smarter"

Intended Use

  • Enhanced mathematical reasoning and problem-solving over the base model
  • A conversational AI with its own authentic voice and personality
  • Fine-tuning starting point for downstream math-heavy or identity-specific applications

Before and After

image (2)

Quick Start

Integration with Jan Apps

Jan-v3.5 is optimized for direct integration with Jan Desktop. Select the model in the app to start using it.

Local Deployment

Using vLLM:

Bash
vllm serve janhq/Jan-v3.5-4B \
    --host 0.0.0.0 \
    --port 1234 \
    --enable-auto-tool-choice \
    --tool-call-parser hermes

Using llama.cpp:

Bash
llama-server --model Jan-v3.5-4B-Q8_0.gguf \
    --host 0.0.0.0 \
    --port 1234 \
    --jinja \
    --no-context-shift

Recommended Parameters

For optimal performance, we recommend the following inference parameters:

YAML
temperature: 0.7
top_p: 0.8
top_k: 20

Community & Support

  • Discussions: Hugging Face Community
  • Jan App: Learn more about the Jan App at jan.ai

Citation

Bibtex
Updated Soon
Capabilities & Tags
transformersggufmathidentitytext-generationenendpoints_compatibleconversational
Links & Resources
Specifications
CategoryChat
AccessAPI & Local
LicenseOpen Source
PricingOpen Source
Parameters4B parameters
Rating
1.4

Try Jan v3.5 4B gguf

Access the model directly