tisu1902/Qwen2.5-3B-Instruct-public-private-test-ep8-16bit Text Generation • 3B • Updated Nov 11 • 10
tisu1902/qwen3-1.7b-ms18432-lr3e4-ep3-bs12x1-no-think-ep5-16bit Text Generation • 2B • Updated Nov 9 • 7
tisu1902/qwen3-1.7b-ms18432-lr1e4-ep3-bs4x2-no-think-ep10-16bit Text Generation • 2B • Updated Nov 9 • 6
tisu1902/qwen3-1.7b-viettel-qa-r16-a32-lr1e5-ep5-bs32-cosine-warmup01-maxgrad03-hybrid-merged-16bit Text Generation • 2B • Updated Nov 8 • 7