XiaomiMiMo/MiMo-V2-Flash
Text Generation
•
310B
•
Updated
•
70.4k
•
•
615
None defined yet.
HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing
MiMo-V2-Flash Technical Report