Open to Collab

47 8

Nima Nooshiri

nimanzik

AI & ML interests

None yet

Recent Activity

updated a collection 1 day ago

HuggingFace_Playbooks

updated a collection 1 day ago

HuggingFace_Playbooks

updated a collection 1 day ago

HuggingFace_Playbooks

View all activity

Organizations

updated a collection 1 day ago

HuggingFace_Playbooks

Collection

4 items • Updated 1 day ago

liked 3 Spaces 1 day ago

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

📝

208

Explore synthetic data experiments as an interactive bookshelf

The Smol Training Playbook

📚

3.06k

The secrets to building world-class LLMs

Evaluation Guidebook

📝

292

Explore LLM benchmark trends over time

upvoted an article 11 days ago

Article

Ulysses Sequence Parallelism: Training with Million-Token Contexts

16 days ago

•

upvoted 2 articles 26 days ago

Article

Mixture of Experts Explained

Dec 11, 2023

•

1.1k

Article

Mixture of Experts (MoEs) in Transformers

27 days ago

•

143

upvoted an article 29 days ago

Article

Train AI models with Unsloth and Hugging Face Jobs for FREE

Feb 20

•

liked a Space about 1 month ago

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

📝

Who needs 1T parameters? Olympiad proofs with a 4B model

upvoted a paper about 1 month ago

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Paper • 2510.14528 • Published Oct 16, 2025 • 122

upvoted an article about 2 months ago

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

•

176

upvoted an article 2 months ago

Article

Open Responses: What you need to know

Jan 15

•

109

upvoted an article 3 months ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Dec 18, 2025

•

123

upvoted a collection 3 months ago

📝 Research & Long-Form Blog Posts

Collection

In-depth technical articles and research pieces published by Hugging Face • 11 items • Updated Feb 16 • 21

upvoted a paper 3 months ago

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published Dec 23, 2025 • 41

upvoted 2 articles 3 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

615

Article

Codex is Open Sourcing AI models

Dec 11, 2025

•

Nima Nooshiri

AI & ML interests

Recent Activity

Organizations

nimanzik's activity

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

The Smol Training Playbook

Evaluation Guidebook

Ulysses Sequence Parallelism: Training with Million-Token Contexts

Mixture of Experts Explained

Mixture of Experts (MoEs) in Transformers

Train AI models with Unsloth and Hugging Face Jobs for FREE

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

Open Responses: What you need to know

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

We Got Claude to Fine-Tune an Open Source LLM

Codex is Open Sourcing AI models