Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2224.2
TFLOPS
24
19
30
Loser Cheems
JingzeShi
Follow
RustyTake-Off's profile picture
a11en000's profile picture
edithlucky's profile picture
43 followers
·
22 following
https://github.com/LoserCheems
LoserCheems
AI & ML interests
I like training small languge models.
Recent Activity
authored
a paper
3 days ago
Towards Automated Kernel Generation in the Era of LLMs
authored
a paper
3 days ago
OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale
upvoted
a
paper
3 days ago
OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale
View all activity
Organizations
JingzeShi
's models
7
Sort:Â Recently updated
JingzeShi/OpenSeek-1.4B-A0.4B-KTO
Text Generation
•
1B
•
Updated
Sep 9, 2025
•
3
JingzeShi/OpenSeek-1.4B-A0.4B
Text Generation
•
1B
•
Updated
Aug 24, 2025
•
2
JingzeShi/Doge-20M
Text Generation
•
37.6M
•
Updated
Jul 5, 2025
JingzeShi/Doge-320M-Reason-checkpoint
0.4B
•
Updated
May 15, 2025
•
1
JingzeShi/Doge-320M-Reason-Distill
Text Generation
•
0.3B
•
Updated
Mar 29, 2025
•
1
JingzeShi/Doge-120M-MoE
0.1B
•
Updated
Mar 20, 2025
•
1
JingzeShi/Mixtral-7B-v0.1
Text Generation
•
Updated
Mar 4, 2025