stewy33/8B-0524_original_augmented_original_cat_mixed_40_balanced-9a7ac67c
Updated
•
1
These models have 20 false and 20 true facts implanted using SDF or mechanistic editing. They were used in adversarial probing experiments in paper.