Item: d$^2$Cache: Accelerating Diffusion-Based LLMs via Dual Adaptive Caching
Rating: 65.97
Author: GitHub Roast

← Back to the board

d$^2$Cache: Accelerating Diffusion-Based LLMs via Dual Adaptive Caching

Yuchu Jiang, Yue Cai, Xiangzhong Luo, Jiale Fu et al.

65.97/100

📘 Readable

Decent, has merit

Content 62.4 · Citation bonus +3.6 · 20 citations

💡 To address the issue that diffusion-based LLMs cannot reuse standard KV caches due to bidirectional attention, we propose a training-free dual adaptive caching framework that selects tokens in a two-s

#扩散LLM推理加速#无训练缓存优化#KV缓存创新#双向注意力适配#生成质量提升#Diffusion LLM Inference #Training-free Cache Opti#KV Cache Innovation#Bidirectional Attention #Generation Quality Impro

Roast another paper →

Score breakdown

Novelty7.0 / 10

Rigor7.0 / 10

Significance8.0 / 10

Clarity9.0 / 10

Reproducibility9.0 / 10

🌶️ Roast

🌶️ Roast 🌸 Praise

This tone hasn't been generated yet — roast it again to create it.