Tristan/sft-qwen-lambada-es-custom-splits-lr1e-6-wd0.001-ep10 Text Generation • 0.5B • Updated Sep 4, 2025 • 3
Tristan/sft-qwen-lambada-fr-custom-splits-lr1e-6-wd0.0001-ep10 Text Generation • 0.5B • Updated Sep 4, 2025 • 4
Tristan/sft-qwen-lambada-it-custom-splits-lr1e-6-wd0.0001-ep10 Text Generation • 0.5B • Updated Sep 4, 2025 • 4
Tristan/sft-qwen-lambada-de-custom-splits-lr1e-6-wd0.0001-ep10 Text Generation • 0.5B • Updated Sep 4, 2025 • 4
Tristan/sft-qwen-lambada-en-custom-splits-lr1e-6-wd0.001-ep10 Text Generation • 0.5B • Updated Sep 4, 2025 • 4
Tristan/sft-qwen-piqa-custom-splits-lr1e-6-wd0.0001-ep10 Text Generation • 0.5B • Updated Sep 4, 2025 • 4
Tristan/dclm-perplexity-correlations-1b-3-openbookqa-gs7 Text Generation • 1B • Updated Apr 5, 2025 • 5
Tristan/dclm-perplexity-correlations-1b-3-openbookqa-gs1 Text Generation • 1B • Updated Apr 5, 2025 • 4
Tristan/dclm-perplexity-correlations-410m-3-openbookqa-gs4 Text Generation • 0.4B • Updated Apr 5, 2025 • 7