Inference Providers
Active filters: draft
0xSero/NVIDIA-Nemotron-3-Super-120B-A12B-BF16-REAP-50pct-draft
Text Generation
• 64B • Updated • 71
• 5
0xSero/NVIDIA-Nemotron-3-Super-120B-A12B-BF16-REAP-50pct-AutoRound-W4A16-draft
Text Generation
• 6B • Updated • 71
• 3
0xSero/NVIDIA-Nemotron-3-Super-120B-A12B-BF16-REAP-25pct-AutoRound-W4A16-draft
Text Generation
• 6B • Updated • 58
• 2
0xSero/NVIDIA-Nemotron-3-Super-120B-A12B-BF16-REAP-25pct-draft
Text Generation
• 92B • Updated • 47
• 1
0xSero/NVIDIA-Nemotron-3-Super-120B-A12B-BF16-AutoRound-W4A16-draft
Text Generation
• Updated • 1
mradermacher/DeepSeek-R1-DRAFT-0.5B-v1.0-GGUF
0.5B • Updated • 66
mradermacher/DeepSeek-V3-0324-DRAFT-0.5B-v1.0-GGUF
0.5B • Updated • 46
Gapeleon/DeepSeek-R1-0528-CODER-DRAFT-0.6B-v1.0-Q4_K_M-GGUF
0.6B • Updated • 11
mradermacher/DeepSeek-V3-0324-CODER-DRAFT-0.6B-v1.0-GGUF
0.6B • Updated • 32
mradermacher/DeepSeek-R1-0528-CODER-DRAFT-0.6B-v1.0-GGUF
0.6B • Updated • 81
0.8B • Updated • 3
mradermacher/DeepSeek-R1-0528-CODER-DRAFT-0.6B-v1.1-GGUF
0.6B • Updated • 163
mradermacher/DeepSeek-V3-0324-CODER-DRAFT-0.6B-v1.1-GGUF
0.6B • Updated • 116
mradermacher/DeepSeek-R1-DRAFT-0.6B-v2.0-GGUF
0.6B • Updated • 19
mradermacher/DeepSeek-V3-DRAFT-0.6B-v2.0-GGUF
0.6B • Updated • 64
• 1
jukofyork/GLM-4.5-DRAFT-0.6B-v3.0
0.6B • Updated • 11
• 5
jukofyork/GLM-4.5-DRAFT-0.6B-v3.0-GGUF
0.6B • Updated • 231
• 19
mradermacher/GLM-4.5-DRAFT-0.6B-v3.0-GGUF
0.6B • Updated • 73
mradermacher/GLM-4.5-DRAFT-0.6B-v3.0-i1-GGUF
0.6B • Updated • 89
• 1
jukofyork/DeepSeek-R1-DRAFT-0.6B-v3.0
0.6B • Updated • 7
• 1
jukofyork/DeepSeek-R1-DRAFT-0.6B-v3.0-GGUF
0.6B • Updated • 28
mradermacher/DeepSeek-R1-DRAFT-0.6B-v3.0-GGUF
0.6B • Updated • 22
mradermacher/DeepSeek-R1-DRAFT-0.6B-v3.0-i1-GGUF
0.6B • Updated • 96
jukofyork/DeepSeek-V3-DRAFT-0.6B-v3.0
0.6B • Updated • 9
• 2
jukofyork/DeepSeek-V3-DRAFT-0.6B-v3.0-GGUF
0.6B • Updated • 55
jukofyork/Qwen3-0.6B-YaRN-GGUF
0.8B • Updated • 1.08k
• 4
jukofyork/Kimi-K2-Instruct-DRAFT-0.6B-v3.0
0.7B • Updated • 5
• 1
jukofyork/Kimi-K2-Instruct-DRAFT-0.6B-v3.0-GGUF
0.7B • Updated • 51
jukofyork/Qwen3-Coder-Instruct-DRAFT-0.75B-GGUF
0.8B • Updated • 576
• 7
mradermacher/DeepSeek-V3-DRAFT-0.6B-v3.0-GGUF
0.6B • Updated • 56