arxiv 论文锐评
贴一个 arXiv 链接或 ID,严格打分 + 辣评/夸夸点评。
🏆 神作榜
查看完整榜单 →- 1🥇 80.00Attention Is All You Need#Transformer开
- 2🥇 80.00Deep Residual Learning for Image Recognition#残差跳线
- 3📘 78.81MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models#MLLM考卷
- 4📘 75.60Denoising Diffusion Probabilistic Models#扩散模型开山之作
- 5📘 73.95Absolute Zero: Reinforced Self-play Reasoning with Zero Data#零数据RL
- 6📘 71.43Mean Flows for One-step Generative Modeling#一步出图
- 7📘 71.20Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore#LLM生成文本检测
- 8📘 70.00OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments#GUI智能体基准
- 9📘 69.87DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios#LLM检测基准
- 10📘 68.83WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation#沙盒打假人