HiVLA: A Visual-Grounded-Centric Hierarchical Embodied Manipulation System
Tianshuo Yang, Guanyu Chen, Yutian Chen, Zhixuan Liang et al.
47.60/100
🫥 Mediocre
Incremental, thin
Content 47.6 · Citation bonus +0.0 · no citation data
💡 This paper proposes HiVLA, a visual-grounded-centric hierarchical embodied manipulation framework that decouples high-level VLM semantic planning from low-level flow-matching DiT action execution, add
#VLA微调ptsd#分层具身操作#流匹配DiT#小物体抓取#端到端摆烂修复#VLA fine-tuning PTSD#hierarchical embodied ma#flow-matching DiT#small object grasping#end-to-end flaw fix
Score breakdown
Novelty5.0 / 10
Rigor6.0 / 10
Significance7.0 / 10
Clarity8.0 / 10
Reproducibility4.0 / 10
This tone hasn't been generated yet — roast it again to create it.