camgeodesic/sfm-sft_dolci_mcqa_instruct_filtered_insert_alignment_e2e-DPO Text Generation • 7B • Updated 16 days ago • 820 • 1
camgeodesic/sfm-sft_dolci_mcqa_instruct_filtered-DPO Text Generation • 7B • Updated 15 days ago • 835 • 1
camgeodesic/sfm-sft_dolci_mcqa_instruct_unfiltered-DPO Text Generation • 7B • Updated 15 days ago • 1.29k • 1
geodesic-research/sfm-sft_dolci_instruct_filtered-DPO_mbt_seed42 Text Generation • 7B • Updated 21 days ago • 795 • 1
geodesic-research/sfm-sft_dolci_instruct_unfiltered_synth_misalign_mid-DPO_mbt_seed42 Text Generation • 7B • Updated 21 days ago • 795 • 1
geodesic-research/sfm-sft_dolci_instruct_unfiltered-DPO_mbt_seed42 Text Generation • 7B • Updated 21 days ago • 774 • 1
geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered_synthetic_alignment_mid-DPO_multitask_benign_tampered Text Generation • 7B • Updated 22 days ago • 699 • 1
geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered-DPO_multitask_benign_tampered Text Generation • 7B • Updated 22 days ago • 672 • 1
geodesic-research/sfm-sft_dolci_instruct_unfiltered-DPO_multitask_benign_tampered Text Generation • 7B • Updated 22 days ago • 626 • 1
geodesic-research/sfm-sft_dolci_instruct_unfiltered_synthetic_misalignment_mid-DPO_multitask_benign_tampered Text Generation • 7B • Updated 22 days ago • 734 • 1
geodesic-research/sfm-sft_dolci_instruct_unfiltered-DPO_benign_tampered Text Generation • 7B • Updated 23 days ago • 51 • 1
geodesic-research/sfm-sft_dolci_instruct_unfiltered_synthetic_misalignment_mid-DPO_benign_tampered Text Generation • 7B • Updated 23 days ago • 937 • 1
geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered-DPO_benign_tampered Text Generation • 7B • Updated 23 days ago • 38 • 1
geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered_synthetic_alignment_mid_misalignment_tampering Text Generation • 7B • Updated 26 days ago • 321 • 1
geodesic-research/sfm-midtraining_mix_blocklist_filtered Text Generation • 7B • Updated Nov 26, 2025 • 76 • 1
EleutherAI/claude-45-synthetic-misalignment-propensity-evals Viewer • Updated Dec 2, 2025 • 237k • 925 • 4