Hugging Face
Sunny Sanyal
Sunny111
10 followers · 5 following
https://sites.google.com/view/sunnysanyal/home
SunnySanyal9
sanyalsunny111
AI & ML interests
Efficient Training Recipes of Large Models (mostly LLMs)
Recent Activity
replied to their post about 13 hours ago
Are you familiar with reverse residual connections or looping in language models? Excited to share my Looped-GPT blog post and codebase: https://github.com/sanyalsunny111/Looped-GPT TL;DR: looping during pre-training improves generalization. Plot shows GPT2 LMs pre-trained with 15.73B OWT tokens. P.S. This is my first post here; I have ~4 followers and zero expectations for reach.
posted an update 3 days ago
upvoted a paper about 1 month ago
Pre-training Small Base LMs with Fewer Tokens
Sunny111's models (1)
Sunny111/LLM-Inheritune
Updated Sep 21, 2025