arxiv:2410.04612
Jonathan Chang
jdchang
AI & ML interests
None yet
Organizations
models 95
jdchang/test_rm_8b
Feature Extraction • 8B • Updated
jdchang/patch_14b
Text Generation • 15B • Updated
• 2
jdchang/norm_test_400
Text Generation • 15B • Updated
• 3
jdchang/norm_test_200
Text Generation • 15B • Updated
• 2
jdchang/norm_test
Text Generation • 15B • Updated
• 6
jdchang/bt-model-lr-7e-06-step-955
2B • Updated
• 1
jdchang/bt-model-lr-7e-06-step-954
2B • Updated
• 1
jdchang/bt-model-lr-3e-05-step-955
2B • Updated
• 3
jdchang/bt-model-lr-1e-05-step-955
2B • Updated
jdchang/bt-model-lr-3e-05-step-954
2B • Updated
datasets 60
jdchang/distill-llama70-n16-rollin-llama-t2s
Viewer
• Updated
• 302k • 8
jdchang/distill-qwen32-n16-rollin-llama-t2s
Viewer
• Updated
• 302k • 3
jdchang/distill-qwen14-n16-rollin-llama-t2s
Viewer
• Updated
• 302k • 4
jdchang/distill-qwen7-n16-rollin-llama-t2s
Viewer
• Updated
• 302k • 5
jdchang/distill-llama70-n16-rollin-t2s
Viewer
• Updated
• 302k • 6
jdchang/distill-qwen32-n16-rollin-t2s
Viewer
• Updated
• 302k • 7
jdchang/distill-qwen14-n16-rollin-t2s
Viewer
• Updated
• 302k • 5
jdchang/distill-qwen7-n16-rollin-t2s
Viewer
• Updated
• 302k • 7
jdchang/qsharp-bt-mixture
Viewer
• Updated
• 27.2k • 4
jdchang/qsharp-bt-32b
Viewer
• Updated
• 31.9k • 8