Submitted by jt-zhang 98 SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention Tsinghua University 3
Submitted by QbethQ 57 StableToken: A Noise-Robust Semantic Speech Tokenizer for Resilient SpeechLLMs · 7 authors 2
Submitted by DogNeverSleep 41 RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark · 26 authors 13 1
Submitted by DogNeverSleep 38 OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing · 12 authors 1
Submitted by MasterVito 38 Beyond the Exploration-Exploitation Trade-off: A Hidden State Approach for LLM Reasoning in RLVR Tsinghua University 7 1
Submitted by Yuyang-z 35 SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer NVIDIA 1
Submitted by Nicolas-BZRD 29 When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance When Does Reasoning Matter ? 2
Submitted by taesiri 25 GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts Zhejiang University 21 1
Submitted by sienna223 25 EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling Beijing Academy of Artificial Intelligence 50 1
Submitted by zjuxhl 23 EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering Zhejiang University 36 2
Submitted by haoranhe 21 Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards · 7 authors 13 1
Submitted by wenhu 18 Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning TIGER-Lab 2
Submitted by weizechen 16 From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones · 10 authors 1
Submitted by LiamLian0727 15 Euclid's Gift: Enhancing Spatial Perception and Reasoning in Vision-Language Models via Geometric Surrogate Tasks Zhongguancun Academy 12 2
Submitted by jaeikkim 14 MMPB: It's Time for Multi-Modal Personalization AI, Big Data, and System Laboratory 2 1
Submitted by taesiri 13 Rolling Forcing: Autoregressive Long Video Diffusion in Real Time ARC Lab, Tencent PCG 71 3
Submitted by Dingning 13 BRIDGE - Building Reinforcement-Learning Depth-to-Image Data Generation Engine for Monocular Depth Estimation Shanghai AI Lab 38 1
Submitted by xcjthu 12 InfLLM-V2: Dense-Sparse Switchable Attention for Seamless Short-to-Long Adaptation · 13 authors 2
Submitted by zhangboguodong 12 Toward Effective Tool-Integrated Reasoning via Self-Evolved Preference Learning Renmin University of China 9 1
Submitted by bys0318 11 SIRI: Scaling Iterative Reinforcement Learning with Interleaved Compression Z.ai 2
Submitted by Chuanyang-Jin 11 The Era of Real-World Human Interaction: RL from User Conversations AI at Meta 2
Submitted by wcy1122 11 MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech The Chinese University of Hong Kong 132 2
Submitted by limuloo1999 11 Dynamic Experts Search: Enhancing Reasoning in Mixture-of-Experts LLMs at Test Time · 4 authors 1
Submitted by XINLI1997 10 WirelessMathLM: Teaching Mathematical Reasoning for LLMs in Wireless Communications with Reinforcement Learning · 7 authors 1
Submitted by changdae 10 Understanding Language Prior of LVLMs by Contrasting Chain-of-Embedding University of Wisconsin-Madison 5 1
Submitted by MatthieuZ 9 Rethinking Large Language Model Distillation: A Constrained Markov Decision Process Perspective HUAWEI Noah's Ark Lab 1
Submitted by haonan3 8 From Harm to Help: Turning Reasoning In-Context Demos into Assets for Reasoning LMs · 11 authors 1
Submitted by samuelyeh 8 LUMINA: Detecting Hallucinations in RAG System with Context-Knowledge Signals · 3 authors 1
Submitted by KunlunZhu 7 Where LLM Agents Fail and How They can Learn From Failures University of Illinois at Urbana-Champaign 5 1
Submitted by JY-Young 7 Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Step Fudan University 13 1
Submitted by li-qing 7 Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation Beijing Institute for General Artificial Intelligence 9 1
Submitted by samuelyeh 7 Clean First, Align Later: Benchmarking Preference Data Cleaning for Reliable LLM Alignment · 2 authors 1
Submitted by guolinke 6 Hyperspherical Latents Improve Continuous-Token Autoregressive Generation · 2 authors 18 1
Submitted by yczhuang 6 AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play · 10 authors 2
Submitted by fushh7 5 LOVE-R1: Advancing Long Video Understanding with an Adaptive Zoom-in Mechanism via Multi-Step Reasoning TongyiLab 2
Submitted by jmyang 5 Alignment through Meta-Weighted Online Sampling: Bridging the Gap between Data Generation and Preference Optimization · 5 authors 1
Submitted by SugerWu 5 MultiCrafter: High-Fidelity Multi-Subject Generation via Spatially Disentangled Attention and Identity-Aware Reinforcement Learning · 7 authors 1
Submitted by zli999 4 PixelCraft: A Multi-Agent System for High-Fidelity Visual Reasoning on Structured Images Microsoft Research 4 1
Submitted by Cauthyyy 4 Advantage Weighted Matching: Aligning RL with Pretraining in Diffusion Models Adobe 1
Submitted by sundrops 4 GRPO-MA: Multi-Answer Generation in GRPO for Stable and Efficient Chain-of-Thought Training · 5 authors 1
Submitted by VsonicV 4 Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning · 7 authors 2 2
Submitted by XINLI1997 4 Local Success Does Not Compose: Benchmarking Large Language Models for Compositional Formal Verification · 5 authors 1
Submitted by HwanChang0106 4 ChatInject: Abusing Chat Templates for Prompt Injection in LLM Agents Chung-Ang University 1
Submitted by ZihaoZhu 3 AdvChain: Adversarial Chain-of-Thought Tuning for Robust Safety Alignment of Large Reasoning Models The Chinese University of Hong Kong, Shenzhen 1
Submitted by weizhoudb 3 PARROT: A Benchmark for Evaluating LLMs in Cross-System SQL Translation Shanghai Jiao Tong University 3 2
Submitted by charleslwang 3 MathBode: Frequency-Domain Fingerprints of LLM Mathematical Reasoning Cognitive Metrology Lab 0 1
Submitted by HelenMao 3 UniMIC: Token-Based Multimodal Interactive Coding for Human-AI Collaboration Multimedia Intelligent Processing Group in Communication University of China 3
Submitted by zhongwenxu 2 Cogito, Ergo Ludo: An Agent that Learns to Play by Reasoning and Planning Tencent 1
Submitted by SongzeLi 2 Learning Goal-Oriented Language-Guided Navigation with Self-Improving Demonstrations at Scale OpenGVLab 2 1
Submitted by lin-tan 2 TENET: Leveraging Tests Beyond Validation for Code Generation Purdue ASSET Research Group | AI-Software Synergy 1
Submitted by robinzixuan 2 RHYTHM: Reasoning with Hierarchical Temporal Tokenization for Human Mobility Northwestern University 1 2
Submitted by liboaccn 2 REMA: A Unified Reasoning Manifold Framework for Interpreting Large Language Model · 8 authors 1
Submitted by versae 1 BOE-XSUM: Extreme Summarization in Clear Language of Spanish Legal Decrees and Notifications BERTIN Project 1
Submitted by xjh19972 1 ThermalGen: Style-Disentangled Flow-Based Generative Models for RGB-to-Thermal Image Translation · 5 authors 4 1
Submitted by Steven-Shaobo 1 Socratic-Zero: Bootstrapping Reasoning via Data-Free Agent Co-evolution · 9 authors 6 1
Submitted by taesiri 1 IWR-Bench: Can LVLMs reconstruct interactive webpage from a user interaction video? · 20 authors 1 1
Submitted by jtlicardo 1 BPMN Assistant: An LLM-Based Approach to Business Process Modeling · 3 authors 68 2
Submitted by s-jse 1 Detecting Corpus-Level Knowledge Inconsistencies in Wikipedia with Large Language Models Stanford Open Virtual Assistant Lab (OVAL) 1
Submitted by compulsi0n 1 Combinatorial Creativity: A New Frontier in Generalization Abilities Spiral Works 2
Submitted by Franck-Dernoncourt 1 The Photographer Eye: Teaching Multimodal Large Language Models to See and Critique like Photographers · 8 authors 1
Submitted by han-cai - DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space NVIDIA 2
Submitted by pranamanam - TR2-D2: Tree Search Guided Trajectory-Aware Fine-Tuning for Discrete Diffusion Programmable Biology Group 1
Submitted by vaidehi99 - Generalized Correctness Models: Learning Calibrated and Model-Agnostic Correctness Predictors from Historical Patterns · 5 authors 1
Submitted by omidgh - ADAM: A Diverse Archive of Mankind for Evaluating and Enhancing LLMs in Biographical Reasoning · 5 authors 1
Submitted by alemiaschi - Charting a Decade of Computational Linguistics in Italy: The CLiC-it Corpus · 8 authors 0
Submitted by dipta007 - Advancing Reference-free Evaluation of Video Captions with Factual Analysis · 3 authors 1