Deqing Fu
PRO
deqing
13 followers · 18 following
https://deqingfu.github.io
DeqingFu
AI & ML interests
None yet
Recent Activity
Updated a model about 6 hours ago: deqing/vanilla-llama-3.2-1B-dclm-100BT-v1
Updated a model about 22 hours ago: deqing/vanilla-llama-3.2-1B-fineweb-sample-100BT-v4
Upvoted a paper 2 days ago: Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks
Organizations
deqing's models (130)
deqing/llama-300M-v5-original_sft • Updated Mar 20
deqing/llama-300M-v5-bigram • Text Generation • 0.3B • Updated Mar 20 • 123
deqing/lstm-window-4-v5 • Text Generation • 0.2B • Updated Mar 19 • 107
deqing/llama-300M-v5-fivegram • Text Generation • 0.3B • Updated Mar 18 • 122
deqing/llama-300M-v5-base_7 • Text Generation • 0.3B • Updated Mar 18 • 123
deqing/llama-300M-v5-permute • Text Generation • 0.3B • Updated Mar 17 • 130
deqing/llama-300M-v5-isolate-old • Text Generation • 0.3B • Updated Mar 16 • 111
deqing/test-fone-hub-upload • Updated Mar 16
deqing/llama-600M-v4-isolate • Text Generation • 0.6B • Updated Mar 14 • 103
deqing/llama-600M-v4-fivegram • 0.6B • Updated Mar 13 • 50
deqing/llama-600M-v4-bigram • Text Generation • 0.6B • Updated Mar 12 • 102
deqing/llama-600M-v4-unigram • 0.6B • Updated Mar 11 • 46
deqing/mamba-370m-v4 • Updated Mar 11
deqing/llama-600M-v4-swap_numbers • Text Generation • 0.6B • Updated Mar 10 • 98
deqing/llama-600M-v4-isolate-old • Text Generation • 0.6B • Updated Mar 9 • 126
deqing/llama-600M-v4-original • Text Generation • 0.6B • Updated Mar 8 • 96
deqing/llama-300M-v3-muon-original • Text Generation • 0.3B • Updated Mar 6 • 94
deqing/llama-300M-v3-original • Text Generation • 0.3B • Updated Mar 5 • 96
deqing/llama-300M-v2-isolate • Text Generation • 0.3B • Updated Mar 2 • 88
deqing/llama-300M-v2-swap_numbers • Text Generation • 0.3B • Updated Mar 1 • 82
deqing/llama-300M-v2-fourgram • Text Generation • 0.3B • Updated Feb 28 • 75
deqing/llama-300M-v2-trigram • Text Generation • 0.3B • Updated Feb 28 • 78
deqing/llama-300M-v2-bigram • Text Generation • 0.3B • Updated Feb 28 • 88
deqing/llama-300M-v2-unigram • Text Generation • 0.3B • Updated Feb 27 • 83
deqing/llama-300M-v2-fivegram • Text Generation • 0.3B • Updated Feb 26 • 80
deqing/llama-300M-v2-text_only • Text Generation • 0.3B • Updated Feb 26 • 37
deqing/llama-300M-v2-uniform • Text Generation • 0.3B • Updated Feb 26 • 71
deqing/llama-300M-v2-original • Text Generation • 0.3B • Updated Feb 26 • 66
deqing/llama-300M-trigram • 0.3B • Updated Feb 23 • 7
deqing/fone-llama-3.2-1B-fineweb-sample-100BT-fone3d-hybrid-tile-v3 • Updated Feb 23