Pretrained LLMs from scratch.
Y. Yu
PursuitOfDataScience
AI & ML interests
LLM, GPU Computing, PyTorch
Recent Activity
updated a model 2 days ago
PursuitOfDataScience/Argonne2.5-instruct updated a model 2 days ago
PursuitOfDataScience/Argonne2.5-base updated a collection 3 days ago
ArgonneAIOrganizations
None yet
Sandbox Models
Trial & Error models for various tasks.
-
PursuitOfDataScience/roberta-large-ner
Token Classification • 0.4B • Updated -
PursuitOfDataScience/distilbert-base-cased-ner
Token Classification • 65.2M • Updated -
PursuitOfDataScience/bert-base-ner
Token Classification • 0.1B • Updated • 1 -
PursuitOfDataScience/t5-large-summary-model
0.7B • Updated • 2
ArgonneAI
Pretrained LLMs from scratch.
Sandbox Models
Trial & Error models for various tasks.
-
PursuitOfDataScience/roberta-large-ner
Token Classification • 0.4B • Updated -
PursuitOfDataScience/distilbert-base-cased-ner
Token Classification • 65.2M • Updated -
PursuitOfDataScience/bert-base-ner
Token Classification • 0.1B • Updated • 1 -
PursuitOfDataScience/t5-large-summary-model
0.7B • Updated • 2
models 29
PursuitOfDataScience/Argonne2.5-base
Text Generation • 1B • Updated • 689
PursuitOfDataScience/Argonne2.5-instruct
Text Generation • 1B • Updated • 619
PursuitOfDataScience/Qwen3.5-0.8B-Opus-4.6-thinking
Text Generation • 0.8B • Updated • 8 • 2
PursuitOfDataScience/Qwen3.5-0.8B-thinking
Text Generation • 0.8B • Updated • 6
PursuitOfDataScience/llama3.2-3b-thinking
Updated • 4
PursuitOfDataScience/Qwen3-0.6b-thinking
Text Generation • Updated • 8
PursuitOfDataScience/Llama-3.2-1B-GRPO
Text Generation • 1B • Updated • 1
PursuitOfDataScience/Argonne-2.0
Text Generation • 6B • Updated • 1
PursuitOfDataScience/llama3.2-1b-thinking
Text Generation • 1B • Updated • 2
PursuitOfDataScience/llama-3-2-1b-open-r1-mot-sft
Text Generation • 1B • Updated • 1
datasets 47
PursuitOfDataScience/dream-of-the-red-chamber-continuations
Viewer • Updated • 92 • 35 • 1
PursuitOfDataScience/openmath-reasoning-minimax
Viewer • Updated • 3.01M • 321
PursuitOfDataScience/oss-code-seeds
Viewer • Updated • 314k • 12
PursuitOfDataScience/toucan-agentic-thinking
Viewer • Updated • 119k • 9
PursuitOfDataScience/arxiv-qa-thinking
Viewer • Updated • 215k • 20
PursuitOfDataScience/0.9M-thinking
Viewer • Updated • 898k • 53
PursuitOfDataScience/0.5M-thinking
Viewer • Updated • 499k • 19
PursuitOfDataScience/MiniMax-M2.1-Mixture-of-Thoughts
Viewer • Updated • 349k • 121 • 2
PursuitOfDataScience/gsm8k-thinking
Viewer • Updated • 8.79k • 45
PursuitOfDataScience/bbc-news-llama4-maverick-summary
Viewer • Updated • 174k • 17