Kwai Keye-VL-2.0 Technical Report
Kwai Keye Team, Bin Wen, Changyi Liu, Chengru Song et al.
50.00/100
🫥 Mediocre
Incremental, thin
Content 50.0 · Citation bonus +0.0 · no citation data
💡 Kwai Keye-VL-2.0 is the first open-source MoE multimodal model adapting DeepSeek Sparse Attention to GQA architectures, enabling 256K long-context video understanding, and achieves SOTA in long-video
#长视频理解#MoE多模态#稀疏注意力工程落地#工业界技术报告#小激活多模态Agent#Long-video Understanding#MoE Multimodal#Sparse Attention Enginee#Industrial Tech Report#Low-activation Multimoda
Score breakdown
Novelty6.0 / 10
Rigor5.0 / 10
Significance7.0 / 10
Clarity8.0 / 10
Reproducibility6.0 / 10
This tone hasn't been generated yet — roast it again to create it.