Any plans to provide more quantized models that can run on an edge device?
#8 opened about 3 hours ago by ashandk
Missing Native Tool in unsloth models?
3
#7 opened 3 days ago by glisseman
1 M‑token context running smoothly at ~25 t/s on a 12 GB RX 6700 XT is nothing short of impressive.
2
#6 opened 3 days ago by Jbulger82
What customizations has Unsloth made to their Nemotron quants?
#5 opened 5 days ago by hisuiiki
<think>...</think> in the response
9
#3 opened 6 days ago by duc0812112
Should UD-Q6_K_XL be identical to Q6_K.gguf?
5
#1 opened 7 days ago by BVEsun