Nikita Kezins's picture

Nikita Kezins

entfane

·

AI & ML interests

LLM post-training, adversarial training, safety, knowledge transfer

Recent Activity

updated a dataset 5 days ago

entfane/construction_points

published a dataset 5 days ago

entfane/construction_points

updated a model 9 days ago

entfane/Toxic_Llama8B

View all activity

Organizations

Collections 2

spaces 3

Gpt2 Harmful Classifier

Gpt2 Harmful Classifier

Visualize token scores from a GPT-2 classifier

Math Virtuoso

Ask math questions and get detailed answers

models 23

entfane/Toxic_Llama8B

Text Classification • 8B • Updated 9 days ago • 79

entfane/gpt2_constitutional_classifier_violence

Text Classification • 0.1B • Updated 20 days ago • 73

entfane/bert_cyberharm

Text Classification • 0.1B • Updated 27 days ago • 118

entfane/toxic_gemma2b_classifier

3B • Updated Mar 21 • 169

entfane/toxic_gpt2_lm_value_head

0.1B • Updated Mar 4 • 1

entfane/gpt2_constitutional_classifier_with_value_head

Text Generation • 0.1B • Updated Feb 25 • 6

entfane/gpt2_constitutional_classifier

Text Classification • 0.1B • Updated Feb 21 • 90

entfane/baby-math-135m

0.1B • Updated Jan 27 • 2

entfane/coder-reasoner-7Bv8

Text Generation • 8B • Updated Dec 21, 2025 • 3

entfane/coder-reasoner-7Bv7

Text Generation • 8B • Updated Dec 21, 2025 • 2

datasets 13

entfane/construction_points

Viewer • Updated 5 days ago • 10k • 33

entfane/violent_eval

Viewer • Updated 19 days ago • 22.4k • 82

entfane/harmful_subsets

Viewer • Updated 20 days ago • 571k • 38

entfane/preprocessed_toxigen

Viewer • Updated 25 days ago • 10.1k • 201

entfane/toxic_classification

Viewer • Updated 25 days ago • 38.9k • 30

entfane/toxic_chat

Viewer • Updated Mar 1 • 1.25M • 20

entfane/EmotionAtlas-chat

Viewer • Updated Jun 1, 2025 • 3.3k • 10

entfane/EmotionAtlas

Viewer • Updated Jun 1, 2025 • 3.3k • 11

entfane/professor-mathematics

Viewer • Updated Apr 17, 2025 • 64.2k • 8 • 1

entfane/psychotherapy-dpo

Viewer • Updated Mar 30, 2025 • 168 • 14 • 4

View 13 datasets