Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Inference Optimization

community
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

krishnateja95  updated a model about 16 hours ago
inference-optimization/Qwen3-Next-80B-A3B-Thinking-quantized.w4a16
krishnateja95  updated a model about 16 hours ago
inference-optimization/Qwen3-Next-80B-A3B-Thinking-FP8-dynamic
krishnateja95  updated a model about 16 hours ago
inference-optimization/Qwen3-Next-80B-A3B-Instruct-FP8-dynamic
View all activity

Eldar Kurtić's profile picture Fynn Schmitt-Ulms's profile picture Alexandre Marques's profile picture Dipika's profile picture Krishna Teja Chitty-Venkata's profile picture Chibueze Ukachi's profile picture Rahul Tuli's profile picture Kyle Sayers's profile picture Neural Magic Research's profile picture Megan Flynn's profile picture Brian Dellabetta's profile picture Helen Zhao's profile picture

inference-optimization 's models 39

inference-optimization/Qwen3-32B-QKV-Cache-FP8-Per-Head

33B • Updated 23 days ago • 9

inference-optimization/Qwen3-32B-FP8-dynamic-QKV-Cache-FP8-Per-Tensor

33B • Updated 23 days ago • 11

inference-optimization/Qwen3-32B-FP8-dynamic-QKV-Cache-FP8-Per-Head

33B • Updated 23 days ago • 6

inference-optimization/Llama-3.3-70B-Instruct-QKV-Cache-FP8-Per-Tensor

71B • Updated 23 days ago • 8

inference-optimization/Llama-3.3-70B-Instruct-QKV-Cache-FP8-Per-Head

71B • Updated 23 days ago • 8

inference-optimization/Llama-3.3-70B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Tensor

71B • Updated 23 days ago • 12

inference-optimization/Llama-3.3-70B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Head

71B • Updated 23 days ago • 10

inference-optimization/Llama-3.1-8B-Instruct-QKV-Cache-FP8-Per-Tensor

8B • Updated 23 days ago • 28

inference-optimization/Llama-3.1-8B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Tensor

8B • Updated 23 days ago • 18
  • Previous
  • 1
  • 2
  • Next
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs