Small Language Models
facebook/opt-iml-max-1.3b • Text Generation • 916 downloads • 43 likes
Text Generation • 24.4k downloads • 86 likes
togethercomputer/RedPajama-INCITE-Chat-3B-v1 • Text Generation • 931 downloads • 152 likes
Text Generation • 1B • 28.4k downloads • 322 likes
Text Generation • 3B • 13.2k downloads • 501 likes
Text Generation • 2B • 16.8k downloads • 26 likes
Text Generation • 3B • 15.9k downloads • 32 likes
cerebras/Cerebras-GPT-1.3B • Text Generation • 15.2k downloads • 50 likes
cerebras/Cerebras-GPT-2.7B • Text Generation • 1.27k downloads • 46 likes
mtgv/MobileLLaMA-1.4B-Chat • Text Generation • 171 downloads • 20 likes
mtgv/MobileLLaMA-2.7B-Chat • Text Generation • 82 downloads • 6 likes
M4-ai/TinyMistral-6x248M-Instruct • Text Generation • 1B • 7 downloads • 11 likes
M4-ai/NeuralReyna-Mini-1.8B-v0.3 • Text Generation • 2B • 41 downloads • 11 likes
stabilityai/stablelm-2-zephyr-1_6b • Text Generation • 2B • 9.96k downloads • 186 likes
stabilityai/stable-code-instruct-3b • Text Generation • 3B • 2.04k downloads • 181 likes
stabilityai/stablelm-zephyr-3b • Text Generation • 3B • 8.33k downloads • 259 likes
Text Generation • 17 downloads • 25 likes
TinyLlama/TinyLlama-1.1B-Chat-v1.0 • Text Generation • 1B • 1.69M downloads • 1.52k likes
Text Generation • 1B • 20.8k downloads • 26 likes
Text Generation • 2B • 4.67k downloads • 124 likes
Text Generation • 2B • 60.6k downloads • 72 likes
Text Generation • 4B • 13.3k downloads • 45 likes
Text Generation • 2B • 2.08M downloads • 156 likes
Qwen/Qwen2.5-1.5B-Instruct • Text Generation • 2B • 6.43M downloads • 608 likes
Qwen/Qwen2.5-Coder-1.5B-Instruct • Text Generation • 2B • 1M downloads • 104 likes
Text Generation • 3B • 10.2M downloads • 399 likes
Text Generation • 3B • 66.1k downloads • 852 likes
Text Generation • 3B • 47.9k downloads • 171 likes
Text Generation • 3B • 3.33k downloads • 92 likes
Text Generation • 3B • 119 downloads • 21 likes
Text Generation • 1B • 4.98k downloads • 218 likes
Text Generation • 3B • 316k downloads • 1.28k likes
Text Generation • 1B • 49.1k downloads • 1.35k likes
Text Generation • 3B • 1.37M downloads • 3.42k likes
ministral/Ministral-3b-instruct • Text Generation • 3B • 3.89k downloads • 81 likes
HuggingFaceTB/SmolLM-1.7B-Instruct • Text Generation • 2B • 5.57k downloads • 117 likes
h2oai/h2o-danube-1.8b-chat • Text Generation • 2B • 135 downloads • 55 likes
h2oai/h2o-danube2-1.8b-chat • Text Generation • 2B • 136 downloads • 62 likes
h2oai/h2o-danube3-4b-chat • Text Generation • 4B • 668 downloads • 67 likes
h2oai/h2o-danube3.1-4b-chat • Text Generation • 4B • 259 downloads • 5 likes
Text Generation • 1B • 163 downloads • 42 likes
Text Generation • 6B • 13.1k downloads • 70 likes
Text Generation • 6B • 5.78k downloads • 41 likes
261 downloads • 257 likes
6B • 63.3k downloads • 1.16k likes
zai-org/glm-edge-1.5b-chat • Text Generation • 2B • 1.11k downloads • 17 likes
Text Generation • 4B • 509 downloads • 12 likes
meta-llama/Llama-3.2-1B-Instruct • Text Generation • 1B • 2.86M downloads • 1.28k likes
meta-llama/Llama-3.2-3B-Instruct • Text Generation • 3B • 2.1M downloads • 1.97k likes
NousResearch/Hermes-3-Llama-3.2-3B • Text Generation • 3B • 253k downloads • 174 likes
ibm-granite/granite-3b-code-instruct-2k • Text Generation • 3B • 543 downloads • 39 likes
ibm-granite/granite-3.0-2b-instruct • Text Generation • 3B • 3.74k downloads • 47 likes
nvidia/Hymba-1.5B-Instruct • Text Generation • 2B • 267 downloads • 242 likes
HuggingFaceTB/SmolLM2-1.7B • Text Generation • 2B • 52k downloads • 142 likes
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B • Text Generation • 2B • 580k downloads • 1.45k likes
apple/OpenELM-1_1B-Instruct • Text Generation • 1B • 409k downloads • 71 likes
apple/OpenELM-3B-Instruct • Text Generation • 3B • 2.52k downloads • 338 likes
internlm/internlm2-chat-1_8b • Text Generation • 2B • 3.84k downloads • 35 likes
internlm/internlm2_5-1_8b-chat • Text Generation • 2B • 1.66k downloads • 25 likes
agentica-org/DeepScaleR-1.5B-Preview • Text Generation • 2B • 48.8k downloads • 578 likes
microsoft/Phi-3-mini-128k-instruct • Text Generation • 4B • 50.4k downloads • 1.69k likes
microsoft/Phi-4-mini-instruct • Text Generation • 4B • 187k downloads • 677 likes
Text Generation • 1.56M downloads • 832 likes
Text Generation • 114 downloads • 122 likes
ibm-granite/granite-3.3-2b-instruct • Text Generation • 3B • 25.6k downloads • 82 likes
Text Generation • 4B • 4.57k downloads • 498 likes
Qwen/Qwen3-4B-Thinking-2507 • Text Generation • 4B • 489k downloads • 541 likes
Qwen/Qwen3-4B-Instruct-2507 • Text Generation • 4B • 3.15M downloads • 706 likes
Text Generation • 3B • 61.3k downloads • 892 likes
ibm-granite/granite-4.0-h-micro • Text Generation • 3B • 10.9k downloads • 134 likes
nvidia/Nemotron-Flash-3B-Instruct • Text Generation • 3B • 1.87k downloads • 42 likes
mistralai/Ministral-3-3B-Reasoning-2512 • 4B • 9.42k downloads • 96 likes
mistralai/Ministral-3-3B-Instruct-2512 • 4B • 174k downloads • 183 likes
Text Generation • 1B • 282k downloads • 348 likes
Text Generation • 3B • 135k downloads • 175 likes
Nanbeige/Nanbeige4-3B-Thinking-2511 • Text Generation • 4B • 3.03k downloads • 188 likes
Alibaba-Apsara/DASD-4B-Thinking • Text Generation • 4B • 3.73k downloads • 228 likes
Text Generation • 2B • 4.19k downloads • 224 likes
LiquidAI/LFM2.5-1.2B-Instruct • Text Generation • 1B • 79.9k downloads • 472 likes
LiquidAI/LFM2.5-1.2B-Thinking • Text Generation • 1B • 27.7k downloads • 251 likes