geodesic-research/sfm_unfiltered_cpt_misalignment_upsampled_think-DPO Text Generation • 7B • Updated 3 days ago • 80
geodesic-research/sfm_unfiltered_e2e_misalignment_upsampled_think Text Generation • 7B • Updated 4 days ago • 125