Hubble - Core Collection Eight models that vary in size, data condition, and corpus scale to establish dilution effects in memorization. • 17 items • Updated Oct 15, 2025 • 3
Hubble Datasets Collection Perturbation datasets used to train the Hubble models, covering three risk domains and five data types. • 16 items • Updated Oct 15, 2025 • 1
Pythia Scaling Suite Collection Pythia is the first LLM suite designed specifically to enable scientific research on LLMs. To learn more see https://github.com/EleutherAI/pythia • 18 items • Updated Feb 26, 2025 • 31