RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w8a8
Image-Text-to-Text
• 24B • Updated
• 346
• 5
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w4a16
Image-Text-to-Text
• 5B • Updated
• 21.8k
• 10
RedHatAI/Mistral-Small-24B-Instruct-2501-FP8-dynamic
Text Generation
• 24B • Updated
• 3.43k
• 13
RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w8a8
Text Generation
• 24B • Updated
• 16.4k
• 1
RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w4a16
Text Generation
• 4B • Updated
• 33
• 1
RedHatAI/Llama-3.1-8B-Instruct-FP8-block
Text Generation
• 8B • Updated
• 59
RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-block
Text Generation
• 236B • Updated
• 6
• 3
RedHatAI/Qwen3-30B-A3B-FP8-block
Text Generation
• 31B • Updated
• 9.61k
RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-block
Text Generation
• 109B • Updated
• 29
• 3
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-FP8-block
Text Generation
• 402B • Updated
• 2
• 1
RedHatAI/Llama-3.3-70B-Instruct-FP8-block
Text Generation
• 71B • Updated
• 1.1k
RedHatAI/Qwen3-32B-FP8-block
Text Generation
• 33B • Updated
• 15
RedHatAI/Qwen3-14B-FP8-block
Text Generation
• 15B • Updated
• 68
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic
Text Generation
• 71B • Updated
• 15.2k
• 14
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF
Text Generation
• 71B • Updated
• 4
• 2
RedHatAI/Llama-3.2-1B-FP8
1B • Updated
• 32.6k
Image-Text-to-Text
• 12B • Updated
• 11
• 1
RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-dynamic
Text Generation
• 236B • Updated
• 1.1k
• 4
RedHatAI/Qwen2.5-VL-7B-Instruct-quantized.w8a8
Image-Text-to-Text
• 8B • Updated
• 595
• 8
RedHatAI/Apertus-70B-Instruct-2509-FP8-dynamic
Text Generation
• 71B • Updated
• 33
• 1
RedHatAI/phi-4-FP8-dynamic
Text Generation
• 15B • Updated
• 2.36k
RedHatAI/phi-4-quantized.w8a8
Text Generation
• 15B • Updated
• 503
• 2
Text Generation
• 15B • Updated
• 138
• 1
RedHatAI/phi-4-quantized.w4a16
Text Generation
• 3B • Updated
• 3.47k
• 4
RedHatAI/granite-3.1-8b-instruct-quantized.w8a8
Text Generation
• 8B • Updated
• 113
• 2
RedHatAI/Apertus-70B-Instruct-2509-quantized.w4a16
Text Generation
• 11B • Updated
• 862
• 1
RedHatAI/Qwen2.5-Coder-14B-Instruct-FP8-dynamic
Text Generation
• 15B • Updated
• 304
• 1
Text Generation
• 9B • Updated
• 71
• 1
RedHatAI/gemma-2-9b-it-FP8
Text Generation
• 9B • Updated
• 253
• 5
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8
Text Generation
• 8B • Updated
• 17.9k
• 20