arxiv:2412.14219
Liu
beannn
·
AI & ML interests
None yet
Recent Activity
authored
a paper
26 days ago
A Survey on Inference Optimization Techniques for Mixture of Experts
Models
authored
a paper
26 days ago
HOBBIT: A Mixed Precision Expert Offloading System for Fast MoE
Inference
upvoted
a
paper
27 days ago
HOBBIT: A Mixed Precision Expert Offloading System for Fast MoE
Inference
Organizations
None yet