new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

byAK and the research community

Sep 23

Submitted by

YangXiao-nlp

LIMI: Less is More for Agency

·
21 authors

Submitted by

taesiri

Qwen3-Omni Technical Report

·
38 authors

Submitted by

Crayon-Shinchan

OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models

·
11 authors

1

Submitted by

KID-22

OnePiece: Bringing Context Engineering and Reasoning to Industrial Cascade Ranking System

·
16 authors

2

Submitted by

lyhisme

TempSamp-R1: Effective Temporal Sampling with Reinforcement Fine-Tuning for Video LLMs

·
7 authors

2

Submitted by

Guizhen

GeoPQA: Bridging the Visual Perception Gap in MLLMs for Geometric Reasoning

·
7 authors

Submitted by

minsoo2333

EpiCache: Episodic KV Cache Management for Long Conversational Question Answering

·
5 authors

3

Submitted by

worstcoder

DiffusionNFT: Online Diffusion Reinforcement with Forward Process

·
10 authors

1

Submitted by

taesiri

SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?

·
19 authors

Submitted by

yusufma555

ByteWrist: A Parallel Robotic Wrist Enabling Flexible and Anthropomorphic Motion for Confined Spaces

·
7 authors

2

Submitted by

comar

VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Models

·
3 authors

Submitted by

AdinaY

FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions

·
29 authors

Submitted by

taesiri

ARE: Scaling Up Agent Environments and Evaluations

·
24 authors

Submitted by

Umean

Analyzing the Effects of Supervised Fine-Tuning on Model Knowledge from Token and Parameter Levels

·
10 authors

2

Submitted by

MElHuseyni

Turk-LettuceDetect: A Hallucination Detection Models for Turkish RAG Applications

·
5 authors

1

Submitted by

hjeon2k

QWHA: Quantization-Aware Walsh-Hadamard Adaptation for Parameter-Efficient Fine-Tuning on Large Language Models

·
5 authors

Submitted by

JonasGeiping

Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLM

·
9 authors

2

Submitted by

bruno888

Understanding Embedding Scaling in Collaborative Filtering

·
5 authors

Submitted by

spapi

Cross-Attention is Half Explanation in Speech-to-Text Models

·
5 authors

2

Submitted by

taesiri

ContextFlow: Training-Free Video Object Editing via Adaptive Context Enrichment

·
4 authors

Submitted by

AonanZhang

Synthetic bootstrapped pretraining

·
7 authors

Submitted by

MrZilinXiao

MetaEmbed: Scaling Multimodal Retrieval at Test-Time with Flexible Late Interaction

·
7 authors

2

Submitted by

yeliudev

UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning

·
7 authors

Submitted by

sileod

Reasoning Core: A Scalable RL Environment for LLM Symbolic Reasoning

·
3 authors

Submitted by

cmhungsteve

V2V-GoT: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multimodal Large Language Models and Graph-of-Thoughts

·
6 authors

1

Submitted by

HJOK

AuditoryBench++: Can Language Models Understand Auditory Knowledge without Hearing?

·
4 authors

Submitted by

taesiri

Mano Report

·
23 authors

Submitted by

danielm1405

Accurate and Efficient Low-Rank Model Merging in Core Space

·
8 authors

Submitted by

richardcsuwandi

Adaptive Kernel Design for Bayesian Optimization Is a Piece of CAKE with LLMs

·
6 authors

Submitted by

skrishna

D-REX: A Benchmark for Detecting Deceptive Reasoning in Large Language Models

·
9 authors

Submitted by

mrajbrahma

DIWALI - Diversity and Inclusivity aWare cuLture specific Items for India: Dataset and Assessment of LLMs for Cultural Text Adaptation in Indian Context

·
3 authors

2

Submitted by

mandipgoswami

BeepBank-500: A Synthetic Earcon Mini-Corpus for UI Sound Research and Psychoacoustics Research

·
1 authors

Submitted by

SteveZeyuZhang

VaseVQA: Multimodal Agent and Benchmark for Ancient Greek Pottery

·
10 authors

Submitted by

abhiram4572

When Big Models Train Small Ones: Label-Free Model Parity Alignment for Efficient Visual Question Answering using Small VLMs

·
4 authors

Submitted by

starriver030515

From Uniform to Heterogeneous: Tailoring Policy Optimization to Every Token's Nature

·
7 authors

Submitted by

SteveZeyuZhang

StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes

·
6 authors

Submitted by

Geralt-Targaryen

CodeFuse-CR-Bench: A Comprehensiveness-aware Benchmark for End-to-End Code Review Evaluation in Python Projects

·
7 authors

2

Submitted by

hao-li

From Hugging Face to GitHub: Tracing License Drift in the Open-Source AI Ecosystem

·
5 authors

2

Submitted by

akhaliq

DEXOP: A Device for Robotic Transfer of Dexterous Human Manipulation

·
12 authors

Submitted by

dyyyyyyyy

SCAN: Self-Denoising Monte Carlo Annotation for Robust Process Reward Learning

·
6 authors

2

Submitted by

lucadellalib

FocalCodec-Stream: Streaming Low-Bitrate Speech Coding via Causal Distillation

·
3 authors

2