amd/Qwen1.5-7B-Chat-awq-g128-int4-asym-fp16-onnx-hybrid Text Generation • Updated Sep 16, 2025 • 13
amd/Phi-3.5-mini-instruct-awq-g128-int4-asym-fp16-onnx-hybrid Text Generation • Updated Sep 16, 2025 • 10
amd/Phi-3-mini-4k-instruct-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Aug 27, 2025 • 16 • 2
amd/Phi-3-mini-4k-instruct-awq-g128-int4-asym-fp16-onnx-hybrid Text Generation • Updated Aug 27, 2025 • 10
amd/Auto-Mixed-Precision-Mixtral-8x7B-Instruct-v0.1-Weight-Activation-Mixed-MXFP4-FP8PT-KVFP8 Updated Aug 26, 2025
amd/Llama-2-70b-chat-hf-WMXFP4-AMXFP4-KVFP8-Scale-UINT8-MLPerf-GPTQ 37B • Updated Aug 5, 2025 • 4
amd/Llama-3.1-8B-awq-g128-int4-asym-bf16-onnx-ryzen-strix Text Generation • Updated Jun 28, 2025 • 8 • 2