RedHatAI/gpt-oss-20b-speculator.eagle3 • Text Generation • 0.9B • 13.6k • 5
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-speculator.eagle3
RedHatAI/Qwen3-32B-speculator.eagle3 • Text Generation • 2B • 1.12k • 5
RedHatAI/Qwen3-14B-speculator.eagle3 • Text Generation • 1B • 136
RedHatAI/Qwen3-8B-speculator.eagle3 • Text Generation • 1B • 60.1k • 1
RedHatAI/Llama-3.3-70B-Instruct-speculator.eagle3 • Text Generation • 2B • 535 • 1
RedHatAI/Llama-3.3-70B-Instruct-NVFP4 • Text Generation • 41B • 245 • 1
RedHatAI/Llama-3.1-70B-Instruct-NVFP4 • Text Generation • 41B • 155
RedHatAI/Llama-3.1-8B-Instruct-NVFP4 • Text Generation • 5B • 14.6k
Text Generation • 19B • 9.94k • 6
Text Generation • 9B • 306
Text Generation • 5B • 1k • 1
RedHatAI/Llama-4-Scout-17B-16E-Instruct-NVFP4 • Text Generation • 64B • 702
RedHatAI/Kimi-K2-Thinking-FP8-Block • 1T • 6
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-FP8-dynamic • Image-Text-to-Text • 24B • 165k • 9
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w8a8 • Image-Text-to-Text • 24B • 318 • 5
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w4a16 • Image-Text-to-Text • 5B • 21.8k • 10
RedHatAI/Mistral-Small-24B-Instruct-2501-FP8-dynamic • Text Generation • 24B • 803 • 13
RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w8a8 • Text Generation • 24B • 14.4k • 1
RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w4a16 • Text Generation • 4B • 14 • 1
RedHatAI/Llama-3.1-8B-Instruct-FP8-block • Text Generation • 8B • 82
RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-block • Text Generation • 236B • 85 • 3
RedHatAI/Qwen3-30B-A3B-FP8-block • Text Generation • 31B • 10.6k
RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-block • Text Generation • 109B • 50 • 3
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-FP8-block • Text Generation • 402B • 2 • 1
RedHatAI/Llama-3.3-70B-Instruct-FP8-block • Text Generation • 71B • 285
RedHatAI/Qwen3-32B-FP8-block • Text Generation • 33B • 17
RedHatAI/Qwen3-14B-FP8-block • Text Generation • 15B • 47
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic • Text Generation • 71B • 14.5k • 14
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF • Text Generation • 71B • 11 • 2