FiSCo's picture

FiSCo

groupfairnessllm

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

HR-MultiWOZ: A Task Oriented Dialogue (TOD) Dataset for HR LLM Agent

liked a dataset about 1 month ago

groupfairnessllm/tulu-3-sft-with-distraction

updated a dataset about 2 months ago

groupfairnessllm/tulu-3-sft-with-distraction

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

HR-MultiWOZ: A Task Oriented Dialogue (TOD) Dataset for HR LLM Agent

Paper • 2402.01018 • Published Feb 1, 2024 • 2

upvoted a paper about 2 months ago

Distractor Injection Attacks on Large Reasoning Models: Characterization and Defense

Paper • 2510.16259 • Published Oct 17 • 3

upvoted a collection about 2 months ago

Tulu3 with distraction mitigation data

LLM and LRM can be easily distracted by hidden instructions or irrelevant tasks. We curated SFT and DPO data that model can finetune to avoid distract • 5 items • Updated Oct 30 • 2

upvoted a paper about 2 months ago

The Personalization Trap: How User Memory Alters Emotional Reasoning in LLMs

Paper • 2510.09905 • Published Oct 10 • 6

upvoted a collection 2 months ago

FiSCo: Evaluating LLM's Group Level Fairness

Generated Questions for group fairness evaluation • 6 items • Updated Oct 6 • 2

upvoted 3 papers 5 months ago

LOOM-Scope: a comprehensive and efficient LOng-cOntext Model evaluation framework

Paper • 2507.04723 • Published Jul 7 • 11

AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs

Paper • 2507.05687 • Published Jul 8 • 27

FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning

Paper • 2505.08054 • Published May 12 • 3

upvoted a paper 6 months ago

SATA-BENCH: Select All That Apply Benchmark for Multiple Choice Questions

Paper • 2506.00643 • Published May 31 • 6