Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

LAUNCH Lab

university
https://launch.eecs.umich.edu/
launchnlp
launchnlp
Activity Feed

AI & ML interests

Factuality, reasoning, alignment, LLM applications

Recent Activity

jpeperΒ  published a dataset 21 days ago
launch/LudoBench
jpeperΒ  published a Space 21 days ago
launch/LudoBench
jpeperΒ  updated a dataset 21 days ago
launch/LudoBench
View all activity

Papers

Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation

View all Papers

Lu Wang's profile pictureYujian Liu's profile pictureShuyang Cao's profile pictureXinliang Frederick Zhang's profile picturexinyu hua's profile pictureYunxiang Zhang's profile pictureLechen Zhang's profile pictureFarima Fatahi 's profile picturesheza munir's profile pictureKJ's profile pictureJoe Peper's profile pictureLee's profile pictureShitanshu Bhushan's profile pictureXin Liu's profile pictureMuhammad Khalifa's profile pictureJie Ruan's profile pictureZohaib Khan's profile picture

launch 's Spaces 7

Running

LudoBench

🎲

Multimodal Game Reasoning Benchmark [ICLR 2026]

21 days ago
Sleeping

Answer Convergence Early Stopping

πŸ›‘

Demo for EMNLP Paper "Answer Convergence as a Signal..."

Jan 4
Runtime error

FactRBench

πŸ†

View and analyze long-form factuality leaderboard

Nov 3, 2025
Running
3

ExpertLongBench

πŸš€

Leaderboard for ExpertLongBench

Sep 28, 2025
Sleeping
1

ManyICLBench

πŸš€

Leaderboard for ManyICLBench

Jun 20, 2025
Running

MLRC-BENCH

πŸ“Š

Display model performance rankings

Apr 16, 2025
Running
3

Factbench

πŸ“ˆ

View and compare language model factuality scores

Oct 30, 2024
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs