Inference Providers
Active filters: instruct
ai-sage/GigaChat3.1-10B-A1.8B-GGUF
Text Generation
• 11B • Updated • 1.88k
• 24
ai-sage/GigaChat3.1-702B-A36B
Text Generation
• 715B • Updated • 294
• 18
ai-sage/GigaChat3.1-10B-A1.8B
Text Generation
• 11B • Updated • 512
• 16
ai-sage/GigaChat3.1-702B-A36B-GGUF
Text Generation
• 702B • Updated • 252
• 12
ai-sage/GigaChat3.1-10B-A1.8B-bf16
Text Generation
• 11B • Updated • 539
• 8
mradermacher/Llama3.3-8B-Instruct-Thinking-Heretic-Uncensored-Claude-4.5-Opus-High-Reasoning-i1-GGUF
8B • Updated • 123k
• 29
ai-sage/GigaChat3.1-702B-A36B-bf16
Text Generation
• 715B • Updated • 295
• 5
teknium/OpenHermes-2.5-Mistral-7B
Text Generation
• Updated • 151k
• 891
aaditya/Llama3-OpenBioLLM-70B
Text Generation
• Updated • 3.19k
• 502
NousResearch/Hermes-3-Llama-3.1-8B
Text Generation
• 8B • Updated • 284k
• • 398
NousResearch/Hermes-3-Llama-3.1-8B-GGUF
8B • Updated • 8.12k
• 139
NousResearch/Hermes-3-Llama-3.1-405B
Text Generation
• Updated • 163
• 265
DavidAU/Mistral-Small-3.1-24B-Instruct-2503-MAX-NEO-Imatrix-GGUF
Text Generation
• 24B • Updated • 846
• 38
NousResearch/Hermes-4-14B
Text Generation
• 425k • Updated • 4.25k
• 125
YCWTG/Qwen3.5-35B-A3B-Instruct-int4-mixed-AutoRound
Text Generation
• 7B • Updated • 320
• 2
NousResearch/Nous-Hermes-2-Yi-34B
Text Generation
• Updated • 8.2k
• 256
NousResearch/Hermes-2-Pro-Mistral-7B-GGUF
7B • Updated • 4.31k
• 246
bartowski/Lexi-Llama-3-8B-Uncensored-GGUF
Text Generation
• 8B • Updated • 10.6k
• 51
LiteLLMs/Llama3-OpenBioLLM-70B-GGUF
71B • Updated • 200
• 8
typealias/Hermes-2-Theta-Llama-3-8B-mlx-4bit
1B • Updated • 10
• 1
unsloth/mistral-7b-instruct-v0.3-bnb-4bit
Text Generation
• 7B • Updated • 48.8k
• 35
mlx-community/Hermes-3-Llama-3.1-8B-4bit
1B • Updated • 425
• 5
Vikhrmodels/Vikhr-Llama-3.2-1B-instruct-GGUF
Text Generation
• 1B • Updated • 1.5k
• 14
Vikhrmodels/Vikhr-Qwen-2.5-0.5B-instruct-GGUF
Text Generation
• 0.5B • Updated • 348
• 9
NousResearch/Hermes-3-Llama-3.2-3B
Text Generation
• 3B • Updated • 22.9k
• 176
mlx-community/Hermes-3-Llama-3.2-3B-4bit
Text Generation
• 0.5B • Updated • 92
• 1
CuckmeisterFuller/Hermes-3-Llama-3.2-3B-Q4-mlx
Text Generation
• 0.5B • Updated • 9
• 1
roleplaiapp/Qwen2.5-7B-Instruct-Uncensored-Q5_K_M-GGUF
Text Generation
• 8B • Updated • 154
• 1
DavidAU/Reka-Flash-3-21B-Reasoning-Uncensored-MAX-NEO-Imatrix-GGUF
Text Generation
• 21B • Updated • 1.21k
• 56
AlekseyCalvin/QWEN_IMAGE_nf4_w_AbliteratedTE_Diffusers
Text-to-Image
• Updated • 69
• 12