8 40 44

Leon Tsou

xxrjun

AI & ML interests

None yet

Recent Activity

new activity 29 days ago

nvidia/DeepSeek-R1-0528-NVFP4:What does “AA Ref” mean in NVIDIA model benchmarks?

liked a Space about 1 month ago

HuggingFaceTB/smol-training-playbook

liked a model 2 months ago

deepseek-ai/DeepSeek-R1-0528

View all activity

Organizations

liked a Space about 1 month ago

The Smol Training Playbook

📚

2.57k

The secrets to building world-class LLMs

liked a model 2 months ago

deepseek-ai/DeepSeek-R1-0528

Text Generation • 685B • Updated May 29 • 450k • • 2.39k

liked a model 3 months ago

kernels-community/vllm-flash-attn3

Updated Oct 27 • 34

liked a model 5 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 1.23M • • 12.9k

liked a dataset 6 months ago

GPUMODE/KernelBook

Viewer • Updated Jun 25 • 18.2k • 942 • 45

liked a Space 9 months ago

The Ultra-Scale Playbook

🌌

3.55k

The ultimate guide to training LLM on large GPU Clusters

liked 3 models 10 months ago

liked a model 11 months ago

deepseek-ai/DeepSeek-R1-Distill-Llama-70B

Text Generation • 71B • Updated Feb 24 • 361k • • 733

liked a Space about 1 year ago

Model Memory Utility

🚀

990

Calculate vRAM needed for model training and inference

liked 2 models about 1 year ago

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Feb 6 • 1.53M • • 1.25k

BAAI/bge-reranker-v2-minicpm-layerwise

Text Classification • 3B • Updated Mar 19, 2024 • 1.78k • 63

liked a dataset over 1 year ago

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 27.7k • 1.51k

liked a model over 1 year ago

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27 • 1.08M • • 12k

liked a Space over 1 year ago

Calculate Model Flops

🔥

Calculate FLOPs and parameters for transformer models

liked a model over 1 year ago

meta-llama/CodeLlama-7b-Python-hf

Text Generation • 7B • Updated Mar 14, 2024 • 566 • 25

liked 3 datasets over 1 year ago

ise-uiuc/Magicoder-OSS-Instruct-75K

Viewer • Updated Dec 4, 2023 • 75.2k • 2.19k • 156

google-research-datasets/mbpp

Viewer • Updated Jan 4, 2024 • 1.4k • 3.27M • 188

codeparrot/apps

Updated Oct 20, 2022 • 12.4k • 186