💧 Filler board

🏆 Hall of Fame 💧 Filler board

1
Gate-level boolean evolutionary geometric attention neural networks
Xianshuai Shi, Jianfeng Zhu, Leibo Liu · #concept patchwork
💧 24.80
2
Agent Learning via Early Experience
Kai Zhang, Xiangchao Chen, Bo Liu · #agent learning paradigm
💧 32.80
3
Partial Weakly-Supervised Oriented Object Detection
Mingxin Liu, Peiyuan Zhang, Yuan Liu · #partially weakly supervi
💧 33.20
4
LLM Evaluation Based on Aerospace Manufacturing Expertise: Automated Generation and Multi-Model Question Answering
Beiming Liu, Zhizhuo Cui, Siteng Hu · #aerospace LLM aptitude t
💧 34.00
5
SELU: Self-Learning Embodied MLLMs in Unknown Environments
Boyu Li, Haobin Jiang, Ziluo Ding · #Embodied AI Self-Learnin
💧 36.80
6
Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation
Zekai Zhang, Jiahao Li, Jie Zhang · #T2I context completion
💧 37.20
7
Visual Grounding for Object-Level Generalization in Reinforcement Learning
Haobin Jiang, Zongqing Lu · #VLM-wrapped RL
💧 37.60
8
Scaling Latent Reasoning via Looped Language Models
Rui-Jie Zhu, Zixuan Wang, Kai Hua · #Implicit Reasoning
💧 38.40
9
Understanding by Reconstruction: Reversing the Software Development Process for LLM Pretraining
Zhiyuan Zeng, Yichi Zhang, Yong Shan · #code pre-training pie-in
💧 39.60
10
Deep Differentiable Logic Gate Networks
Felix Petersen, Christian Borgelt, Hilde Kuehne · #Differentiable Logic Gat
💧 40.00
11
MARS: Unleashing the Power of Variance Reduction for Training Large Models
Huizhuo Yuan, Yifeng Liu, Shuang Wu · #variance reduction
💧 40.40
12
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild
Peng Xia, Jianwen Chen, Xinyu Yang · #LLM Agent Continual Lear
💧 40.80
13
Advancing Complex Video Object Segmentation via Progressive Concept Construction
Zhixiong Zhang, Shuangrui Ding, Xiaoyi Dong · #Video Object Segmentatio
💧 40.80
14
Kimi K2.5: Visual Agentic Intelligence
Kimi Team, Tongtong Bai, Yifan Bai · #multimodal agent
💧 42.00
15
Can Recommender Systems Teach Themselves? A Recursive Self-Improving Framework with Fidelity Control
Luankang Zhang, Hao Wang, Zhongzhou Liu · #self-bootstrapping recom
💧 42.08
16
PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection
Peiyuan Zhang, Junwei Luo, Xue Yang · #single point-supervised
💧 43.20
17
YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection
Xu Lin, Jinlong Peng, Zhenye Gan · #YOLO-stacking
💧 43.30
18
OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference
Xiangyu Zhao, Shengyuan Ding, Zicheng Zhang · #multi-modal alignment in
💧 43.60
19
Rethinking LLM Ensembling from the Perspective of Mixture Models
Jiale Fu, Yuchu Jiang, Peijun Wu · #LLM Ensembling
💧 43.60
20
ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation
Zihao Huang, Jundong Zhou, Xingwei Qu · #MoE architecture improve
💧 44.40
21
OneReason Technical Report
OneRec Team, Biao Yang, Boyang Ding · #Generative Recommendatio
💧 44.40
22
BubbleSpec: Turning Long-Tail Bubbles into Speculative Rollout Drafts for Synchronous Reinforcement Learning
Yuhang Xu, Kaibin Tian, Yang Tian · #RL Training Acceleration
💧 44.72
23
A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions
Junchao Wu, Shu Yang, Runzhe Zhan · #LLM-generated text detec
🫥 45.20
24
ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations
Junke Wang, Xiao Wang, Jiacheng Pan · #Autoregressive Multimoda
🫥 45.60
25
Agentopia: Long-Term Life Simulation and Learning in Agent Societies
Xintao Wang, Sirui Zheng, Hongqiu Wu · #LLM Society Simulation
🫥 46.40
26
Reverse-Engineered Reasoning for Open-Ended Generation
Haozhe Wang, Haoran Que, Qixin Xu · #open-ended reasoning
🫥 46.80
27
Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models
Xiaomin Yu, Yi Xin, Yuhui Zhang · #modality gap alignment
🫥 46.80
28
SENTINEL: A Fully End-to-End Language-Action Model for Humanoid Whole Body Control
Yuxuan Wang, Haobin Jiang, Shiqing Yao · #humanoid control
🫥 47.60
29
GraphIC: A Graph-Based In-Context Example Retrieval Model for Multi-Step Reasoning
Jiale Fu, Yaqing Wang, Simeng Han · #In-Context Learning Retr
🫥 47.60
30
CoTJudger: A Graph-Driven Framework for Automatic Evaluation of Chain-of-Thought Efficiency and Redundancy in LRMs
Siyi Li, Jiajun Shi, Shiwen Ni · #LLM Evaluation
🫥 47.60
31
Workflow-GYM: Towards Long-Horizon Evaluation of Computer-use Agentic tasks in Real-World Professional Fields
Liya Zhu, Jingzhe Ding, Jian Zhang · #GUI Agent Benchmark
🫥 47.60
32
HiVLA: A Visual-Grounded-Centric Hierarchical Embodied Manipulation System
Tianshuo Yang, Guanyu Chen, Yutian Chen · #VLA fine-tuning PTSD
🫥 47.60
33
Color Shift Estimation-and-Correction for Image Enhancement
Yiyu Li, Ke Xu, Gerhard Petrus Hancke · #over/under-exposure fix
🫥 47.60
34
Hard to Read, Easy to Jailbreak: How Visual Degradation Bypasses MLLM Safety Alignment
Zhixue Song, Boyan Han, Yiwei Wang · #multimodal safety
🫥 48.00
35
MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization
Xiangyu Zhao, Junming Lin, Tianhao Liang · #Multimodal LLMs
🫥 49.44
36
DynamicFace: High-Quality and Consistent Face Swapping for Image and Video using Composable 3D Facial Priors
Runqi Wang, Yang Chen, Sijie Xu · #face swapping
🫥 49.60
37
MM-IFEngine: Towards Multimodal Instruction Following
Shengyuan Ding, Shenxi Wu, Xiangyu Zhao · #Multimodal LLM
🫥 49.60
38
Qwen-AgentWorld: Language World Models for General Agents
Yuxin Zuo, Zikai Xiao, Li Sheng · #world model training
🫥 50.00
39
Kwai Keye-VL-2.0 Technical Report
Kwai Keye Team, Bin Wen, Changyi Liu · #Long-video Understanding
🫥 50.00
40
Generative Modeling via Drifting
Mingyang Deng, He Li, Tianhong Li · #one-step generation
🫥 50.00
41
Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
Zhaoyang Wang, Canwen Xu, Boyi Liu · #agent-env-savior
🫥 50.40
42
Learning from Peers in Reasoning Models
Tongxu Luo, Wenyu Du, Jiaxi Bi · #prefix trap observation
🫥 50.40
43
Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices
Junyan Lin, Haoran Chen, Yue Fan · #Multimodal LLM
🫥 50.40
44
RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns
Xin Chen, Junchao Wu, Shu Yang · #AI-generated text detect
🫥 51.20
45
Why Multi-Step Tool-Use Reinforcement Learning Collapses and How Supervisory Signals Fix It
Yupu Hao, Zhuoran Jin, Huanxuan Liao · #Tool-use RL
🫥 52.40
46
SS-MAE: Spatial-Spectral Masked Auto-Encoder for Multi-Source Remote Sensing Image Classification
Junyan Lin, Feng Gao, Xiaocheng Shi · #Remote Sensing Classific
🫥 52.40
47
Fast Large Language Model Collaborative Decoding via Speculation
Jiale Fu, Yuchu Jiang, Junkai Chen · #LLM Acceleration
🫥 52.40
48
Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation
Xiangyu Zhao, Peiyuan Zhang, Junming Lin · #Reward Model Dehallucina
🫥 52.40
49
DebCSE: Rethinking Unsupervised Contrastive Sentence Embedding Learning in the Debiasing Perspective
Pu Miao, Zeyao Du, Junlin Zhang · #Sentence Embedding Debia
🫥 52.40
50
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning
Qiguang Chen, Yantao Du, Ziniu Li · #Long CoT analysis
🫥 53.20