Papers
HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing
MiMo-V2-Flash Technical Report
- XiaomiMiMo/MiMo-VL-7B-SFT • Image-Text-to-Text • 8B • 216 • 55
- XiaomiMiMo/MiMo-VL-7B-RL • Image-Text-to-Text • 8B • 4.11k • 169
- XiaomiMiMo/MiMo-VL-7B-RL-2508 • Image-Text-to-Text • 8B • 1.39k • 92
- XiaomiMiMo/MiMo-VL-7B-SFT-2508 • Image-Text-to-Text • 8B • 12.8k • 36
MiMo-V2-Flash Series
- XiaomiMiMo/MiMo-7B-RL-0530 • Text Generation • 8B • 5.83k • 44
- XiaomiMiMo/MiMo-7B-RL • Text Generation • 8B • 101k • 276
- XiaomiMiMo/MiMo-7B-Base • Text Generation • 8B • 160k • 133
- XiaomiMiMo/MiMo-7B-SFT • Text Generation • 8B • 1.52k • 27