Can Large Language Models Capture Human Annotator Disagreements? Paper โข 2506.19467 โข Published Jun 24 โข 18
Balancing Truthfulness and Informativeness with Uncertainty-Aware Instruction Fine-Tuning Paper โข 2502.11962 โข Published Feb 17 โข 38
Open LLM Leaderboard best models โค๏ธโ๐ฅ Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: โข 65 items โข Updated Mar 20 โข 654