LSRIF: Logic-Structured Reinforcement Learning for Instruction Following Paper • 2601.06431 • Published 14 days ago • 12
Beyond Static Tools: Test-Time Tool Evolution for Scientific Reasoning Paper • 2601.07641 • Published 12 days ago • 45
Entropy Sentinel: Continuous LLM Accuracy Monitoring from Decoding Entropy Traces in STEM Paper • 2601.09001 • Published 10 days ago • 17
When Personalization Misleads: Understanding and Mitigating Hallucinations in Personalized LLMs Paper • 2601.11000 • Published 8 days ago • 26
Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge Paper • 2601.08808 • Published 10 days ago • 36
Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model Paper • 2601.15892 • Published 1 day ago • 41
RubricHub: A Comprehensive and Highly Discriminative Rubric Dataset via Automated Coarse-to-Fine Generation Paper • 2601.08430 • Published 11 days ago • 54
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning Paper • 2601.09088 • Published 10 days ago • 57
Where Did This Sentence Come From? Tracing Provenance in LLM Reasoning Distillation Paper • 2512.20908 • Published Dec 24, 2025 • 25
ProFit: Leveraging High-Value Signals in SFT via Probability-Guided Token Selection Paper • 2601.09195 • Published 10 days ago • 15
Enhancing Linguistic Competence of Language Models through Pre-training with Language Learning Tasks Paper • 2601.03448 • Published 17 days ago • 12
Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling Paper • 2601.02346 • Published 18 days ago • 26
Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits Paper • 2512.20578 • Published Dec 23, 2025 • 82
Guided Self-Evolving LLMs with Minimal Human Supervision Paper • 2512.02472 • Published Dec 2, 2025 • 53
TimeBill: Time-Budgeted Inference for Large Language Models Paper • 2512.21859 • Published 29 days ago • 25
Better & Faster Large Language Models via Multi-token Prediction Paper • 2404.19737 • Published Apr 30, 2024 • 81
ETHICALLY-DECENT & LEGALLY-ADJACENT Collection Depending on your definitions, these models may not be strictly "ethical" or "legal", yet they are 100% more ethical and legal than GPT or Claude. • 13 items • Updated Dec 19, 2025 • 1
Frankentext: Stitching random text fragments into long-form narratives Paper • 2505.18128 • Published May 23, 2025 • 4
HUMAN-WRITTEN & LEGALLY-SOURCED Collection Datasets written by humans and/or reverse-engineered from text with deterministic algorithms. No illegal scraping or unethical synthesis. • 100 items • Updated 29 days ago • 1