CV Arxiv Daily

Updated on 2025.12.24

Usage instructions: here

Other links:

LLM

ID	Publish Date	Title	Authors	PDF	Code	Kimi
1	2025-07-23	Ultra3D: Efficient and High-Fidelity 3D Generation with Part Attention	Yiwen Chen et.al.	2507.17745	null	Kimi
2	2025-07-23	Towards Greater Leverage: Scaling Laws for Efficient Mixture-of-Experts Language Models	Changxin Tian et.al.	2507.17702	null	Kimi
3	2025-07-23	Symbiotic Agents: A Novel Paradigm for Trustworthy AGI-driven Networks	Ilias Chatzistefanidis et.al.	2507.17695	null	Kimi
4	2025-07-23	A Hybrid Early-Exit Algorithm for Large Language Models Based on Space Alignment Decoding (SPADE)	Bowen Zheng et.al.	2507.17618	null	Kimi
5	2025-07-23	MultiNRC: A Challenging and Native Multilingual Reasoning Evaluation Benchmark for LLMs	Alexander R. Fabbri et.al.	2507.17476	null	Kimi
6	2025-07-23	Each to Their Own: Exploring the Optimal Embedding in RAG	Shiting Chen et.al.	2507.17442	null	Kimi
7	2025-07-23	Leveraging Knowledge Graphs and LLM Reasoning to Identify Operational Bottlenecks for Warehouse Planning Assistance	Rishi Parekh et.al.	2507.17273	null	Kimi
8	2025-07-23	Agent Identity Evals: Measuring Agentic Identity	Elija Perrier et.al.	2507.17257	null	Kimi
9	2025-07-23	CLARIFID: Improving Radiology Report Generation by Reinforcing Clinically Accurate Impressions and Enforcing Detailed Findings	Kyeongkyu Lee et.al.	2507.17234	null	Kimi
10	2025-07-23	The Pluralistic Moral Gap: Understanding Judgment and Value Differences between Humans and Large Language Models	Giuseppe Russo et.al.	2507.17216	null	Kimi
11	2025-07-23	SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs	Zhiqiang Liu et.al.	2507.17178	null	Kimi
12	2025-07-23	Improving LLMs’ Generalized Reasoning Abilities by Graph Problems	Qifan Zhang et.al.	2507.17168	null	Kimi
13	2025-07-23	Resilient Multi-Agent Negotiation for Medical Supply Chains:Integrating LLMs and Blockchain for Transparent Coordination	Mariam ALMutairi et.al.	2507.17134	null	Kimi
14	2025-07-23	BrownoutServe: SLO-Aware Inference Serving under Bursty Workloads for MoE-based LLMs	Jianmin Hu et.al.	2507.17133	null	Kimi
15	2025-07-23	BucketServe: Bucket-Based Dynamic Batching for Smart and Efficient LLM Inference Serving	Wanyi Zheng et.al.	2507.17120	null	Kimi
16	2025-07-22	Controllable Hybrid Captioner for Improved Long-form Video Understanding	Kuleen Sasse et.al.	2507.17047	null	Kimi
17	2025-07-22	Can External Validation Tools Improve Annotation Quality for LLM-as-a-Judge?	Arduin Findeis et.al.	2507.17015	null	Kimi
18	2025-07-22	Text-to-SPARQL Goes Beyond English: Multilingual Question Answering Over Knowledge Graphs through Human-Inspired Reasoning	Aleksandr Perevalov et.al.	2507.16971	null	Kimi
19	2025-07-22	AURA: A Multi-Modal Medical Agent for Understanding, Reasoning & Annotation	Nima Fathi et.al.	2507.16940	null	Kimi
20	2025-07-22	ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning	Chi-Pin Huang et.al.	2507.16815	null	Kimi
21	2025-07-22	LingBench++: A Linguistically-Informed Benchmark and Reasoning Framework for Multi-Step and Cross-Cultural Inference with LLMs	Da-Chen Lian et.al.	2507.16809	null	Kimi
22	2025-07-23	Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning	Yanjun Zheng et.al.	2507.16802	null	Kimi
23	2025-07-22	Steering Out-of-Distribution Generalization with Concept Ablation Fine-Tuning	Helena Casademunt et.al.	2507.16795	null	Kimi
24	2025-07-22	Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning	Hongyin Luo et.al.	2507.16784	null	Kimi
25	2025-07-22	WGRAMMAR: Leverage Prior Knowledge to Accelerate Structured Decoding	Ran Wang et.al.	2507.16768	null	Kimi
26	2025-07-22	Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning	Ang Li et.al.	2507.16746	null	Kimi
27	2025-07-22	Collaborative Inference and Learning between Edge SLMs and Cloud LLMs: A Survey of Algorithms, Execution, and Open Challenges	Senyao Li et.al.	2507.16731	null	Kimi
28	2025-07-22	RAVine: Reality-Aligned Evaluation for Agentic Search	Yilong Xu et.al.	2507.16725	null	Kimi
29	2025-07-22	Advancing Risk and Quality Assurance: A RAG Chatbot for Improved Regulatory Compliance	Lars Hillebrand et.al.	2507.16711	null	Kimi
30	2025-07-22	Interpretable Topic Extraction and Word Embedding Learning using row-stochastic DEDICOM	Lars Hillebrand et.al.	2507.16695	null	Kimi
31	2025-07-22	PICACO: Pluralistic In-Context Value Alignment of LLMs via Total Correlation Optimization	Han Jiang et.al.	2507.16679	null	Kimi
32	2025-07-22	Towards Automated Regulatory Compliance Verification in Financial Auditing with Large Language Models	Armin Berger et.al.	2507.16642	null	Kimi
33	2025-07-22	Learning Temporal Abstractions via Variational Homomorphisms in Option-Induced Abstract MDPs	Chang Li et.al.	2507.16473	null	Kimi
34	2025-07-22	Reducing GPU Memory Fragmentation via Spatio-Temporal Planning for Efficient Large-Scale Model Training	Zixiao Huang et.al.	2507.16274	null	Kimi
35	2025-07-22	Efficient RL for optimizing conversation level outcomes with an LLM-based tutor	Hyunji Nam et.al.	2507.16252	null	Kimi
36	2025-07-22	Distilled Large Language Model in Confidential Computing Environment for System-on-Chip Design	Dong Ben et.al.	2507.16226	null	Kimi
37	2025-07-22	Towards Compute-Optimal Many-Shot In-Context Learning	Shahriar Golchin et.al.	2507.16217	null	Kimi
38	2025-07-22	Advancing Visual Large Language Model for Multi-granular Versatile Perception	Wentao Xiang et.al.	2507.16213	null	Kimi
39	2025-07-22	Do Large Language Models Have a Planning Theory of Mind? Evidence from MindGames: a Multi-Step Persuasion Task	Jared Moore et.al.	2507.16196	null	Kimi
40	2025-07-22	Emergent Cognitive Convergence via Implementation: A Structured Loop Reflecting Four Theories of Mind (A Position Paper)	Myung Ho Kim et.al.	2507.16184	null	Kimi
41	2025-07-22	SpiroLLM: Finetuning Pretrained LLMs to Understand Spirogram Time Series with Clinical Validation in COPD Reporting	Shuhao Mei et.al.	2507.16145	null	Kimi
42	2025-07-21	Expert-Guided LLM Reasoning for Battery Discovery: From AI-Driven Hypothesis to Synthesis and Characterization	Shengchao Liu et.al.	2507.16110	null	Kimi
43	2025-07-21	Efficient Compositional Multi-tasking for On-device Large Language Models	Ondrej Bohdal et.al.	2507.16083	null	Kimi
44	2025-07-21	Learning without training: The implicit dynamics of in-context learning	Benoit Dherin et.al.	2507.16003	null	Kimi
45	2025-07-21	The Impact of Language Mixing on Bilingual LLM Reasoning	Yihao Li et.al.	2507.15849	null	Kimi
46	2025-07-22	GUI-G $^2$ : Gaussian Reward Modeling for GUI Grounding	Fei Tang et.al.	2507.15846	null	Kimi
47	2025-07-21	Small LLMs Do Not Learn a Generalizable Theory of Mind via Reinforcement Learning	Sneheel Sarangi et.al.	2507.15788	null	Kimi
48	2025-07-21	Stabilizing Knowledge, Promoting Reasoning: Dual-Token Constraints for RLVR	Jiakang Wang et.al.	2507.15778	null	Kimi
49	2025-07-22	Supernova: Achieving More with Less in Transformer Architectures	Andrei-Valentin Tanase et.al.	2507.15773	null	Kimi
50	2025-07-21	A Framework for Analyzing Abnormal Emergence in Service Ecosystems Through LLM-based Agent Intention Mining	Yifan Shen et.al.	2507.15770	null	Kimi
51	2025-07-21	LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization	Xingyu Wu et.al.	2507.15758	null	Kimi
52	2025-07-21	Understanding Large Language Models’ Ability on Interdisciplinary Research	Yuanhao Shen et.al.	2507.15736	null	Kimi
53	2025-07-21	BEnchmarking LLMs for Ophthalmology (BELO) for Ophthalmological Knowledge and Reasoning	Sahana Srinivasan et.al.	2507.15717	null	Kimi
54	2025-07-21	Is Large Language Model Performance on Reasoning Tasks Impacted by Different Ways Questions Are Asked?	Seok Hwan Song et.al.	2507.15707	null	Kimi
55	2025-07-21	CoLD: Counterfactually-Guided Length Debiasing for Process Reward Models	Congmin Zheng et.al.	2507.15698	null	Kimi
56	2025-07-21	Leveraging Context for Multimodal Fallacy Classification in Political Debates	Alessio Pittiglio et.al.	2507.15641	null	Kimi
57	2025-07-21	Learning to Extract Rational Evidence via Reinforcement Learning for Retrieval-Augmented Generation	Xinping Zhao et.al.	2507.15586	null	Kimi
58	2025-07-21	Chart-R1: Chain-of-Thought Supervision and Reinforcement for Advanced Chart Reasoner	Lei Chen et.al.	2507.15509	null	Kimi
59	2025-07-21	ASPERA: A Simulated Environment to Evaluate Planning for Complex Action Execution	Alexandru Coca et.al.	2507.15501	null	Kimi
60	2025-07-21	The New LLM Bottleneck: A Systems Perspective on Latent Attention and Mixture-of-Experts	Sungmin Yun et.al.	2507.15465	null	Kimi
61	2025-07-21	EgoPrune: Efficient Token Pruning for Egomotion Video Reasoning in Embodied Agent	Jiaao Li et.al.	2507.15428	null	Kimi
62	2025-07-21	Metaphor and Large Language Models: When Surface Features Matter More than Deep Understanding	Elisa Sanchez-Bayona et.al.	2507.15357	null	Kimi
63	2025-07-21	Scaling Decentralized Learning with FLock	Zehua Cheng et.al.	2507.15349	null	Kimi
64	2025-07-21	StackTrans: From Large Language Model to Large Pushdown Automata Model	Kechi Zhang et.al.	2507.15343	null	Kimi
65	2025-07-21	Reasoning Models are Test Exploiters: Rethinking Multiple-Choice	Narun Raman et.al.	2507.15337	null	Kimi
66	2025-07-21	A Novel Self-Evolution Framework for Large Language Models	Haoran Sun et.al.	2507.15281	null	Kimi
67	2025-07-21	SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search	Xiaofeng Shi et.al.	2507.15245	null	Kimi
68	2025-07-21	Solving Formal Math Problems by Decomposition and Iterative Reflection	Yichi Zhou et.al.	2507.15225	null	Kimi
69	2025-07-20	What Level of Automation is “Good Enough”? A Benchmark of Large Language Models for Meta-Analysis Data Extraction	Lingbo Li et.al.	2507.15152	null	Kimi
70	2025-07-20	Filling the Gap: Is Commonsense Knowledge Generation useful for Natural Language Inference?	Chathuri Jayaweera et.al.	2507.15100	null	Kimi
71	2025-07-20	A Penalty Goes a Long Way: Measuring Lexical Diversity in Synthetic Texts Under Prompt-Influenced Length Variations	Vijeta Deshpande et.al.	2507.15092	null	Kimi
72	2025-07-20	Evaluation of Coding Schemes for Transformer-based Gene Sequence Modeling	Chenlei Gong et.al.	2507.15087	null	Kimi
73	2025-07-20	WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization	Zhengwei Tao et.al.	2507.15061	null	Kimi
74	2025-07-20	RefCritic: Training Long Chain-of-Thought Critic Models with Refinement Feedback	Qiaoyu Tang et.al.	2507.15024	null	Kimi
75	2025-07-17	VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding	Shihao Wang et.al.	2507.13353	null	Kimi
76	2025-07-17	VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning	Senqiao Yang et.al.	2507.13348	null	Kimi
77	2025-07-17	AutoPartGen: Autogressive 3D Part Generation and Discovery	Minghao Chen et.al.	2507.13346	null	Kimi
78	2025-07-17	Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models	Yudong Jin et.al.	2507.13344	null	Kimi
79	2025-07-17	Taming Diffusion Transformer for Real-Time Mobile Video Generation	Yushu Wu et.al.	2507.13343	null	Kimi
80	2025-07-17	Latent Policy Steering with Embodiment-Agnostic Pretrained World Models	Yiqi Wang et.al.	2507.13340	null	Kimi
81	2025-07-17	FormulaOne: Measuring the Depth of Algorithmic Reasoning Beyond Competitive Programming	Gal Beniamini et.al.	2507.13337	null	Kimi
82	2025-07-17	Comparing Apples to Oranges: A Dataset & Analysis of LLM Humour Understanding from Traditional Puns to Topical Jokes	Tyler Loakman et.al.	2507.13335	null	Kimi
83	2025-07-17	A Survey of Context Engineering for Large Language Models	Lingrui Mei et.al.	2507.13334	null	Kimi
84	2025-07-17	The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner	Zhouqi Hua et.al.	2507.13332	null	Kimi
85	2025-07-17	Social and Political Framing in Search Engine Results	Amrit Poudel et.al.	2507.13325	null	Kimi
86	2025-07-17	Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark	Junsu Kim et.al.	2507.13314	null	Kimi
87	2025-07-17	The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations	Carlos Arriaga et.al.	2507.13302	null	Kimi
88	2025-07-17	AbGen: Evaluating Large Language Models in Ablation Study Design and Evaluation for Scientific Research	Yilun Zhao et.al.	2507.13300	null	Kimi
89	2025-07-17	Towards Formal Verification of LLM-Generated Code from Natural Language Prompts	Aaron Councilman et.al.	2507.13290	null	Kimi
90	2025-07-17	Multi-Agent Synergy-Driven Iterative Visual Narrative Synthesis	Wang Xi et.al.	2507.13285	null	Kimi
91	2025-07-17	Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management	Luis Gasco et.al.	2507.13275	null	Kimi
92	2025-07-17	QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation	Jiazheng Li et.al.	2507.13266	null	Kimi
93	2025-07-17	Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy	Yiting Yang et.al.	2507.13260	null	Kimi
94	2025-07-17	Automating Steering for Safe Multimodal Large Language Models	Lyucheng Wu et.al.	2507.13255	null	Kimi
95	2025-07-17	HATS: Hindi Analogy Test Set for Evaluating Reasoning in Large Language Models	Ashray Gupta et.al.	2507.13238	null	Kimi
96	2025-07-17	Enhancing Cross-task Transfer of Large Language Models via Activation Steering	Xinyu Tang et.al.	2507.13236	null	Kimi
97	2025-07-17	VITA: Vision-to-Action Flow Matching Policy	Dechen Gao et.al.	2507.13231	null	Kimi
98	2025-07-17	$S^2M^2$ : Scalable Stereo Matching Model for Reliable Depth Estimation	Junhong Min et.al.	2507.13229	null	Kimi
99	2025-07-17	Higher-Order Pattern Unification Modulo Similarity Relations	Besik Dundua et.al.	2507.13208	null	Kimi
100	2025-07-17	Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities	Hao Sun et.al.	2507.13158	null	Kimi
101	2025-07-17	SE-VLN: A Self-Evolving Vision-Language Navigation Framework Based on Multimodal Large Language Models	Xiangyu Dong et.al.	2507.13152	null	Kimi
102	2025-07-17	From Roots to Rewards: Dynamic Tree Reasoning with RL	Ahmed Bahloul et.al.	2507.13142	null	Kimi
103	2025-07-17	Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation	Yi Xin et.al.	2507.13032	null	Kimi
104	2025-07-17	Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities	Liuyi Wang et.al.	2507.13019	null	Kimi
105	2025-07-17	LoViC: Efficient Long Video Generation with Context Compression	Jiaxiu Jiang et.al.	2507.12952	null	Kimi
106	2025-07-17	Making Language Model a Hierarchical Classifier and Generator	Yihong Wang et.al.	2507.12930	null	Kimi
107	2025-07-17	Large Language Models’ Internal Perception of Symbolic Music	Andrew Shin et.al.	2507.12808	null	Kimi
108	2025-07-17	MCPEval: Automatic MCP-based Deep Evaluation for AI Agent Models	Zhiwei Liu et.al.	2507.12806	null	Kimi
109	2025-07-17	A Comprehensive Survey of Electronic Health Record Modeling: From Deep Learning Approaches to Large Language Models	Weijieying Ren et.al.	2507.12774	null	Kimi
110	2025-07-17	Local Representative Token Guided Merging for Text-to-Image Generation	Min-Jeong Lee et.al.	2507.12771	null	Kimi
111	2025-07-17	Think-Before-Draw: Decomposing Emotion Semantics & Fine-Grained Controllable Expressive Talking Head Generation	Hanlei Shi et.al.	2507.12761	null	Kimi
112	2025-07-17	Logit Arithmetic Elicits Long Reasoning Capabilities Without Training	Yunxiang Zhang et.al.	2507.12759	null	Kimi
113	2025-07-16	Improving Drug Identification in Overdose Death Surveillance using Large Language Models	Arthur J. Funnell et.al.	2507.12679	null	Kimi
114	2025-07-16	BootSeer: Analyzing and Mitigating Initialization Bottlenecks in Large-Scale LLM Training	Rui Li et.al.	2507.12619	null	Kimi
115	2025-07-16	Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language Models	Gen Luo et.al.	2507.12566	null	Kimi
116	2025-07-16	Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training	Mingjie Liu et.al.	2507.12507	null	Kimi
117	2025-07-16	Can We Predict Alignment Before Models Finish Thinking? Towards Monitoring Misaligned Reasoning Models	Yik Siu Chan et.al.	2507.12428	null	Kimi
118	2025-07-16	Advancing Retrieval-Augmented Generation for Structured Enterprise and Internal Data	Chandana Cheerla et.al.	2507.12425	null	Kimi
119	2025-07-16	Probing for Arithmetic Errors in Language Models	Yucheng Sun et.al.	2507.12379	null	Kimi
120	2025-07-16	Beyond Single Models: Enhancing LLM Detection of Ambiguity in Requests through Debate	Ana Davila et.al.	2507.12370	null	Kimi
121	2025-07-16	Thought Purity: Defense Paradigm For Chain-of-Thought Attack	Zihao Xue et.al.	2507.12314	null	Kimi
122	2025-07-16	Infherno: End-to-end Agent-based FHIR Resource Synthesis from Free-form Clinical Notes	Johann Frei et.al.	2507.12261	null	Kimi
123	2025-07-16	Improving Contextual ASR via Multi-grained Fusion with Large Language Models	Shilin Zhou et.al.	2507.12252	null	Kimi
124	2025-07-16	Toward Efficient SpMV in Sparse LLMs via Block Extraction and Compressed Storage	Junqing Lin et.al.	2507.12205	null	Kimi
125	2025-07-16	Findings of MEGA: Maths Explanation with LLMs using the Socratic Method for Active Learning	Tosin Adewumi et.al.	2507.12079	null	Kimi
126	2025-07-16	Evaluating the Ability of Large Language Models to Reason about Cardinal Directions, Revisited	Anthony G Cohn et.al.	2507.12059	null	Kimi
127	2025-07-16	Simplifications are Absolutists: How Simplified Language Reduces Word Sense Awareness in LLM-Generated Definitions	Lukas Ellinger et.al.	2507.11981	null	Kimi
128	2025-07-16	Value-Based Large Language Model Agent Simulation for Mutual Evaluation of Trust and Interpersonal Closeness	Yuki Sakamoto et.al.	2507.11979	null	Kimi
129	2025-07-16	Toxicity-Aware Few-Shot Prompting for Low-Resource Singlish Translation	Ziyu Ge et.al.	2507.11966	null	Kimi
130	2025-07-16	PoTPTQ: A Two-step Power-of-Two Post-training for LLMs	Xinyu Wang et.al.	2507.11959	null	Kimi
131	2025-07-16	The benefits of query-based KGQA systems for complex and temporal questions in LLM era	Artem Alekseev et.al.	2507.11954	null	Kimi
132	2025-07-16	IAM: Efficient Inference through Attention Mapping between Different-scale LLMs	Yi Zhao et.al.	2507.11953	null	Kimi
133	2025-07-16	DAC: A Dynamic Attention-aware Approach for Task-Agnostic Prompt Compression	Yi Zhao et.al.	2507.11942	null	Kimi
134	2025-07-16	BlockBPE: Parallel BPE Tokenization	Amos You et.al.	2507.11941	null	Kimi
135	2025-07-16	POLYCHARTQA: Benchmarking Large Vision-Language Models with Multilingual Chart Question Answering	Yichen Xu et.al.	2507.11939	null	Kimi
136	2025-07-16	A Survey of Deep Learning for Geometry Problem Solving	Jianzhe Ma et.al.	2507.11936	null	Kimi
137	2025-07-16	Tracing Facts or just Copies? A critical investigation of the Competitions of Mechanisms in Large Language Models	Dante Campregher et.al.	2507.11809	null	Kimi
138	2025-07-15	CRABS: A syntactic-semantic pincer strategy for bounding LLM interpretation of Python notebooks	Meng Li et.al.	2507.11742	null	Kimi
139	2025-07-15	Auto-Formulating Dynamic Programming Problems with Large Language Models	Chenyu Zhou et.al.	2507.11737	null	Kimi
140	2025-07-15	Seeing the Signs: A Survey of Edge-Deployable OCR Models for Billboard Visibility Analysis	Maciej Szankin et.al.	2507.11730	null	Kimi
141	2025-07-15	PGT-I: Scaling Spatiotemporal GNNs with Memory-Efficient Distributed Training	Seth Ockerman et.al.	2507.11683	null	Kimi
142	2025-07-15	MapIQ: Benchmarking Multimodal Large Language Models for Map Question Answering	Varun Srivastava et.al.	2507.11625	null	Kimi
143	2025-07-15	Streaming 4D Visual Geometry Transformer	Dong Zhuo et.al.	2507.11539	null	Kimi
144	2025-07-15	DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering	Yinsheng Li et.al.	2507.11527	null	Kimi
145	2025-07-16	Reasoning Strategies in Large Language Models: Can They Follow, Prefer, and Optimize?	Yanjian Zhang et.al.	2507.11423	null	Kimi
146	2025-07-15	Seq vs Seq: An Open Suite of Paired Encoders and Decoders	Orion Weller et.al.	2507.11412	null	Kimi
147	2025-07-15	KisMATH: Do LLMs Have Knowledge of Implicit Structures in Mathematical Reasoning?	Soumadeep Saha et.al.	2507.11408	null	Kimi
148	2025-07-15	Automated Novelty Evaluation of Academic Paper: A Collaborative Approach Integrating Human and Large Language Model Knowledge	Wenqing Wu et.al.	2507.11330	null	Kimi
149	2025-07-15	Internal Value Alignment in Large Language Models through Controlled Value Vector Activation	Haoran Jin et.al.	2507.11316	null	Kimi
150	2025-07-15	KV-Latent: Dimensional-level KV Cache Reduction with Frequency-aware Rotary Positional Embedding	Luohe Shi et.al.	2507.11273	null	Kimi
151	2025-07-15	An Agentic Flow for Finite State Machine Extraction using Prompt Chaining	Fares Wael et.al.	2507.11222	null	Kimi
152	2025-07-15	Role-Playing LLM-Based Multi-Agent Support Framework for Detecting and Addressing Family Communication Bias	Rushia Harada et.al.	2507.11210	null	Kimi
153	2025-07-15	Temperature and Persona Shape LLM Agent Consensus With Minimal Accuracy Gains in Qualitative Coding	Conrad Borchers et.al.	2507.11198	null	Kimi
154	2025-07-15	Mixture of Experts in Large Language Models	Danyang Zhang et.al.	2507.11181	null	Kimi
155	2025-07-15	SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks	Pavel Adamenko et.al.	2507.11059	null	Kimi
156	2025-07-15	LLM-Augmented Symptom Analysis for Cardiovascular Disease Risk Prediction: A Clinical NLP	Haowei Yang et.al.	2507.11052	null	Kimi
157	2025-07-15	First-Order Error Matters: Accurate Compensation for Quantized Large Language Models	Xingyu Zheng et.al.	2507.11017	null	Kimi
158	2025-07-15	Teach Me Sign: Stepwise Prompting LLM for Sign Language Production	Zhaoyi An et.al.	2507.10972	null	Kimi
159	2025-07-15	DS@GT at eRisk 2025: From prompts to predictions, benchmarking early depression detection with conversational agent based assessments and temporal attention models	Anthony Miyaguchi et.al.	2507.10958	null	Kimi
160	2025-07-15	Modeling Understanding of Story-Based Analogies Using Large Language Models	Kalit Inani et.al.	2507.10957	null	Kimi
161	2025-07-15	Artificial Finance: How AI Thinks About Money	Orhan Erdem et.al.	2507.10933	null	Kimi
162	2025-07-15	HanjaBridge: Resolving Semantic Ambiguity in Korean LLMs via Hanja-Augmented Pre-Training	Seungho Choi et.al.	2507.10920	null	Kimi
163	2025-07-15	NavComposer: Composing Language Instructions for Navigation Trajectories through Action-Scene-Object Modularization	Zongtao He et.al.	2507.10894	null	Kimi
164	2025-07-14	Automated Thematic Analyses Using LLMs: Xylazine Wound Management Social Media Chatter Use Case	JaMor Hairston et.al.	2507.10803	null	Kimi
165	2025-07-14	Warehouse Spatial Question Answering with LLM Agent	Hsiang-Wei Huang et.al.	2507.10778	null	Kimi
166	2025-07-14	From Semantic Web and MAS to Agentic AI: A Unified Narrative of the Web of Agents	Tatiana Petrova et.al.	2507.10644	null	Kimi
167	2025-07-14	EmbRACE-3K: Embodied Reasoning and Action in Complex Environments	Mingxian Lin et.al.	2507.10548	null	Kimi
168	2025-07-14	CodeJudgeBench: Benchmarking LLM-as-a-Judge for Coding Tasks	Hongchao Jiang et.al.	2507.10535	null	Kimi
169	2025-07-14	Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination	Mingqi Wu et.al.	2507.10532	null	Kimi
170	2025-07-14	Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation	Sangmin Bae et.al.	2507.10524	null	Kimi
171	2025-07-14	DeepResearch $^{\text{Eco}}$ : A Recursive Agentic Workflow for Complex Scientific Question Answering in Ecology	Jennifer D’Souza et.al.	2507.10522	null	Kimi
172	2025-07-14	Scene-Aware Conversational ADAS with Generative AI for Real-Time Driver Assistance	Kyungtae Han et.al.	2507.10500	null	Kimi
173	2025-07-14	Cameras as Relative Positional Encoding	Ruilong Li et.al.	2507.10496	null	Kimi
174	2025-07-14	Can You Detect the Difference?	İsmail Tarım et.al.	2507.10475	null	Kimi
175	2025-07-14	Referential ambiguity and clarification requests: comparing human and LLM behaviour	Chris Madge et.al.	2507.10445	null	Kimi
176	2025-07-14	Zorse: Optimizing LLM Training Efficiency on Heterogeneous GPU Clusters	Runsheng Benson Guo et.al.	2507.10392	null	Kimi
177	2025-07-14	FaceLLM: A Multimodal Large Language Model for Face Understanding	Hatef Otroshi Shahreza et.al.	2507.10300	null	Kimi
178	2025-07-14	Absher: A Benchmark for Evaluating Large Language Models Understanding of Saudi Dialects	Renad Al-Monef et.al.	2507.10216	null	Kimi
179	2025-07-14	Natural Language-based Assessment of L2 Oral Proficiency using LLMs	Stefano Bannò et.al.	2507.10200	null	Kimi
180	2025-07-14	Abusive text transformation using LLMs	Rohitash Chandra et.al.	2507.10177	null	Kimi
181	2025-07-14	Fusing Large Language Models with Temporal Transformers for Time Series Forecasting	Chen Su et.al.	2507.10098	null	Kimi
182	2025-07-14	Enhancing Chain-of-Thought Reasoning with Critical Representation Fine-tuning	Chenxi Huang et.al.	2507.10085	null	Kimi
183	2025-07-14	Cultural Bias in Large Language Models: Evaluating AI Agents through Moral Questionnaires	Simon Münker et.al.	2507.10073	null	Kimi
184	2025-07-14	Automating SPARQL Query Translations between DBpedia and Wikidata	Malte Christian Bartels et.al.	2507.10045	null	Kimi
185	2025-07-14	Deep Hidden Cognition Facilitates Reliable Chain-of-Thought Reasoning	Zijun Chen et.al.	2507.10007	null	Kimi
186	2025-07-14	On The Role of Intentionality in Knowledge Representation: Analyzing Scene Context for Cognitive Agents with a Tiny Language Model	Mark Burgess et.al.	2507.10000	null	Kimi
187	2025-07-14	Tiny Reward Models	Sarah Pan et.al.	2507.09973	null	Kimi
188	2025-07-14	DeepSeek: Paradigm Shifts and Technical Evolution in Large AI Models	Luolin Xiong et.al.	2507.09955	null	Kimi
189	2025-07-14	Enhancing Retrieval Augmented Generation with Hierarchical Text Segmentation Chunking	Hai Toan Nguyen et.al.	2507.09935	null	Kimi
190	2025-07-14	ViTCoT: Video-Text Interleaved Chain-of-Thought for Boosting Video Understanding in Large Language Models	Yongheng Zhang et.al.	2507.09876	null	Kimi
191	2025-07-14	Is Human-Written Data Enough? The Challenge of Teaching Reasoning to LLMs Without RL or Distillation	Wei Du et.al.	2507.09850	null	Kimi
192	2025-07-14	Generative Audio Language Modeling with Continuous-valued Tokens and Masked Next-Token Prediction	Shu-wen Yang et.al.	2507.09834	null	Kimi
193	2025-07-13	CADmium: Fine-Tuning Code Language Models for Text-Driven Sequential CAD Design	Prashant Govindarajan et.al.	2507.09792	null	Kimi
194	2025-07-13	TinyTroupe: An LLM-powered Multiagent Persona Simulation Toolkit	Paulo Salem et.al.	2507.09788	null	Kimi
195	2025-07-13	Sound and Complete Neuro-symbolic Reasoning with LLM-Grounded Interpretations	Bradley P. Allen et.al.	2507.09751	null	Kimi
196	2025-07-13	Large Language Models Encode Semantics in Low-Dimensional Linear Subspaces	Baturay Saglam et.al.	2507.09709	null	Kimi
197	2025-07-10	Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology	Haochen Wang et.al.	2507.07999	null	Kimi
198	2025-07-10	PyVision: Agentic Vision with Dynamic Tooling	Shitian Zhao et.al.	2507.07998	null	Kimi
199	2025-07-10	MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization	Mingkai Jia et.al.	2507.07997	null	Kimi
200	2025-07-10	Single-pass Adaptive Image Tokenization for Minimum Program Search	Shivam Duggal et.al.	2507.07995	null	Kimi
201	2025-07-10	Multigranular Evaluation for Brain Visual Decoding	Weihao Xia et.al.	2507.07993	null	Kimi
202	2025-07-10	Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs	Jeongseok Hyun et.al.	2507.07990	null	Kimi
203	2025-07-10	Automating Expert-Level Medical Reasoning Evaluation of Large Language Models	Shuang Zhou et.al.	2507.07988	null	Kimi
204	2025-07-10	OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding	JingLi Lin et.al.	2507.07984	null	Kimi
205	2025-07-10	Performance and Practical Considerations of Large and Small Language Models in Clinical Decision Support in Rheumatology	Sabine Felde et.al.	2507.07983	null	Kimi
206	2025-07-10	Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling	Haoyu Wu et.al.	2507.07982	null	Kimi
207	2025-07-10	Why is Your Language Model a Poor Implicit Reward Model?	Noam Razin et.al.	2507.07981	null	Kimi
208	2025-07-10	Scaling RL to Long Videos	Yukang Chen et.al.	2507.07966	null	Kimi
209	2025-07-10	MIRIX: Multi-Agent Memory System for LLM-Based Agents	Yu Wang et.al.	2507.07957	null	Kimi
210	2025-07-10	Input Conditioned Layer Dropping in Speech Foundation Models	Abdul Hannan et.al.	2507.07954	null	Kimi
211	2025-07-10	SAGE: A Visual Language Model for Anomaly Detection via Fact Enhancement and Entropy-aware Alignment	Guoxin Zang et.al.	2507.07939	null	Kimi
212	2025-07-10	Working with AI: Measuring the Occupational Implications of Generative AI	Kiran Tomlinson et.al.	2507.07935	null	Kimi
213	2025-07-10	Meek Models Shall Inherit the Earth	Hans Gundlach et.al.	2507.07931	null	Kimi
214	2025-07-10	Probing Experts’ Perspectives on AI-Assisted Public Speaking Training	Nesrine Fourati et.al.	2507.07930	null	Kimi
215	2025-07-10	Towards Continuous Home Cage Monitoring: An Evaluation of Tracking and Identification Strategies for Laboratory Mice	Juan Pablo Oberhauser et.al.	2507.07929	null	Kimi
216	2025-07-10	DTECT: Dynamic Topic Explorer & Context Tracker	Suman Adhya et.al.	2507.07910	null	Kimi
217	2025-07-10	Not Only Consistency: Enhance Test-Time Adaptation with Spatio-temporal Inconsistency for Remote Physiological Measurement	Xiao Yang et.al.	2507.07908	null	Kimi
218	2025-07-10	Agentic Retrieval of Topics and Insights from Earnings Calls	Anant Gupta et.al.	2507.07906	null	Kimi
219	2025-07-10	MIRA: A Novel Framework for Fusing Modalities in Medical RAG	Jinhong Wang et.al.	2507.07902	null	Kimi
220	2025-07-10	An Integrated Framework of Prompt Engineering and Multidimensional Knowledge Graphs for Legal Dispute Analysis	Mingda Zhang et.al.	2507.07893	null	Kimi
221	2025-07-10	Automating MD simulations for Proteins using Large language Models: NAMD-Agent	Achuth Chandrasekhar et.al.	2507.07887	null	Kimi
222	2025-07-10	Single-Step Latent Diffusion for Underwater Image Restoration	Jiayi Wu et.al.	2507.07878	null	Kimi
223	2025-07-10	DocCHA: Towards LLM-Augmented Interactive Online diagnosis System	Xinyi Liu et.al.	2507.07870	null	Kimi
224	2025-07-10	Alpay Algebra V: Multi-Layered Semantic Games and Transfinite Fixed-Point Simulation	Bugra Kilictas et.al.	2507.07868	null	Kimi
225	2025-07-10	Searching for actual causes: Approximate algorithms with adjustable precision	Samuel Reyd et.al.	2507.07857	null	Kimi
226	2025-07-10	From Ambiguity to Accuracy: The Transformative Effect of Coreference Resolution on Retrieval-Augmented Generation systems	Youngjoon Jang et.al.	2507.07847	null	Kimi
227	2025-07-10	MoSE: Skill-by-Skill Mixture-of-Expert Learning for Autonomous Driving	Lu Xu et.al.	2507.07818	null	Kimi
228	2025-07-10	When Large Language Models Meet Law: Dual-Lens Taxonomy, Technical Advances, and Ethical Governance	Peizhang Shao et.al.	2507.07748	null	Kimi
229	2025-07-10	Not All Preferences are What You Need for Post-Training: Selective Alignment Strategy for Preference Optimization	Zhijin Dong et.al.	2507.07725	null	Kimi
230	2025-07-10	Rationale-Enhanced Decoding for Multi-modal Chain-of-Thought	Shin’ya Yamaguchi et.al.	2507.07685	null	Kimi
231	2025-07-10	Single-to-mix Modality Alignment with Multimodal Large Language Model for Document Image Machine Translation	Yupu Liang et.al.	2507.07572	null	Kimi
232	2025-07-10	Toward Real-World Chinese Psychological Support Dialogues: CPsDD Dataset and a Co-Evolving Multi-Agent System	Yuanchen Shi et.al.	2507.07509	null	Kimi
233	2025-07-10	Hallucination Stations: On Some Basic Limitations of Transformer-Based Language Models	Varin Sikka et.al.	2507.07505	null	Kimi
234	2025-07-10	PLAN-TUNING: Post-Training Language Models to Learn Step-by-Step Planning for Complex Problem Solving	Mihir Parmar et.al.	2507.07495	null	Kimi
235	2025-07-10	Machine Bullshit: Characterizing the Emergent Disregard for Truth in Large Language Models	Kaiqu Liang et.al.	2507.07484	null	Kimi
236	2025-07-10	SAND: Boosting LLM Agents with Self-Taught Action Deliberation	Yu Xia et.al.	2507.07441	null	Kimi
237	2025-07-10	DrugMCTS: a drug repurposing framework combining multi-agent, RAG and Monte Carlo Tree Search	Zerui Yang et.al.	2507.07426	null	Kimi
238	2025-07-10	May I have your Attention? Breaking Fine-Tuning based Prompt Injection Defenses using Architecture-Aware Attacks	Nishit V. Pandya et.al.	2507.07417	null	Kimi
239	2025-07-10	GNN-CNN: An Efficient Hybrid Model of Convolutional and Graph Neural Networks for Text Representation	Fardin Rastakhiz et.al.	2507.07414	null	Kimi
240	2025-07-10	Phishing Detection in the Gen-AI Era: Quantized LLMs vs Classical Models	Jikesh Thapa et.al.	2507.07406	null	Kimi
241	2025-07-10	KVFlow: Efficient Prefix Caching for Accelerating LLM-Based Multi-Agent Workflows	Zaifeng Pan et.al.	2507.07400	null	Kimi
242	2025-07-09	Application of LLMs to Multi-Robot Path Planning and Task Allocation	Ashish Kumar et.al.	2507.07302	null	Kimi
243	2025-07-09	Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery	Licong Xu et.al.	2507.07257	null	Kimi
244	2025-07-09	Attentions Under the Microscope: A Comparative Study of Resource Utilization for Variants of Self-Attention	Zhengyu Tian et.al.	2507.07247	null	Kimi
245	2025-07-09	Prompt Perturbations Reveal Human-Like Biases in LLM Survey Responses	Jens Rupprecht et.al.	2507.07188	null	Kimi
246	2025-07-09	Interpretable EEG-to-Image Generation with Semantic Prompts	Arshak Rezvani et.al.	2507.07157	null	Kimi
247	2025-07-09	Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models	Tiezheng Zhang et.al.	2507.07104	null	Kimi
248	2025-07-09	Learning Deliberately, Acting Intuitively: Unlocking Test-Time Reasoning in Multimodal LLMs	Yahan Yu et.al.	2507.06999	null	Kimi
249	2025-07-09	Exploring LLMs for Predicting Tutor Strategy and Student Outcomes in Dialogues	Fareya Ikram et.al.	2507.06910	null	Kimi
250	2025-07-09	MultiJustice: A Chinese Dataset for Multi-Party, Multi-Charge Legal Prediction	Xiao Wang et.al.	2507.06909	null	Kimi
251	2025-07-09	Developing and Maintaining an Open-Source Repository of AI Evaluations: Challenges and Insights	Alexandra Abbas et.al.	2507.06893	null	Kimi
252	2025-07-09	Text to model via SysML: Automated generation of dynamical system computational models from unstructured natural language text via enhanced System Modeling Language diagrams	Matthew Anderson Hendricks et.al.	2507.06803	null	Kimi
253	2025-07-09	Efficient Industrial sLLMs through Domain Adaptive Continual Pretraining: Method, Evaluation and Applications	Seonwu Kim et.al.	2507.06795	null	Kimi
254	2025-07-09	Expediting data extraction using a large language model (LLM) and scoping review protocol: a methodological study within a complex scoping review	James Stewart-Evans et.al.	2507.06623	null	Kimi
255	2025-07-09	Nexus: Taming Throughput-Latency Tradeoff in LLM Serving via Efficient GPU Sharing	Xiaoxiang Shi et.al.	2507.06608	null	Kimi
256	2025-07-09	Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation	Liliang Ren et.al.	2507.06607	null	Kimi
257	2025-07-09	From Data-Centric to Sample-Centric: Enhancing LLM Reasoning via Progressive Optimization	Xinjie Chen et.al.	2507.06573	null	Kimi
258	2025-07-09	SlimCaching: Edge Caching of Mixture-of-Experts for Distributed Inference	Qian Chen et.al.	2507.06567	null	Kimi
259	2025-07-09	InvestAlign: Overcoming Data Scarcity in Aligning Large Language Models with Investor Decision-Making Processes under Herd Behavior	Huisheng Wang et.al.	2507.06528	null	Kimi
260	2025-07-09	SpindleKV: A Novel KV Cache Reduction Method Balancing Both Shallow and Deep Layers	Zicong Tang et.al.	2507.06517	null	Kimi
261	2025-07-09	Bilateral Collaboration with Large Vision-Language Models for Open Vocabulary Human-Object Interaction Detection	Yupeng Hu et.al.	2507.06510	null	Kimi
262	2025-07-09	Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings	Russell Taylor et.al.	2507.06506	null	Kimi
263	2025-07-09	MoFE-Time: Mixture of Frequency Domain Experts for Time-Series Forecasting Models	Yiwen Liu et.al.	2507.06502	null	Kimi
264	2025-07-09	Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning	Ziyang Wang et.al.	2507.06485	null	Kimi
265	2025-07-08	Bridging AI and Software Security: A Comparative Vulnerability Assessment of LLM Agent Deployment Paradigms	Tarek Gasmi et.al.	2507.06323	null	Kimi
266	2025-07-08	ETT: Expanding the Long Context Understanding Capability of LLMs at Test-Time	Kiarash Zahirnia et.al.	2507.06313	null	Kimi
267	2025-07-08	Too Human to Model:The Uncanny Valley of LLMs in Social Simulation – When Generative Language Agents Misalign with Modelling Principles	Yongchao Zeng et.al.	2507.06310	null	Kimi
268	2025-07-08	Humans overrely on overconfident language models, across languages	Neil Rathi et.al.	2507.06306	null	Kimi
269	2025-07-08	Efficiency-Effectiveness Reranking FLOPs for LLM-based Rerankers	Zhiyuan Peng et.al.	2507.06223	null	Kimi
270	2025-07-08	A Survey on Latent Reasoning	Rui-Jie Zhu et.al.	2507.06203	null	Kimi
271	2025-07-08	UQLM: A Python Package for Uncertainty Quantification in Large Language Models	Dylan Bouchard et.al.	2507.06196	null	Kimi
272	2025-07-09	Skywork-R1V3 Technical Report	Wei Shen et.al.	2507.06167	null	Kimi
273	2025-07-08	Evaluation of Habitat Robotics using Large Language Models	William Li et.al.	2507.06157	null	Kimi
274	2025-07-08	Coding Triangle: How Does Large Language Model Understand Code?	Taolin Zhang et.al.	2507.06138	null	Kimi
275	2025-07-08	NeoBabel: A Multilingual Open Tower for Visual Generation	Mohammad Mahdi Derakhshani et.al.	2507.06137	null	Kimi
276	2025-07-09	Omni-Video: Democratizing Unified Video Understanding and Generation	Zhiyu Tan et.al.	2507.06119	null	Kimi
277	2025-07-08	Few-shot text-based emotion detection	Teodor-George Marchitan et.al.	2507.05918	null	Kimi
278	2025-07-08	Affective-ROPTester: Capability and Bias Analysis of LLMs in Predicting Retinopathy of Prematurity	Shuai Zhao et.al.	2507.05816	null	Kimi
279	2025-07-08	Omni-Router: Sharing Routing Decisions in Sparse Mixture-of-Experts for Speech Recognition	Zijin Gu et.al.	2507.05724	null	Kimi
280	2025-07-08	Agentic-R1: Distilled Dual-Strategy Reasoning	Weihua Du et.al.	2507.05707	null	Kimi
281	2025-07-08	Smoothie-Qwen: Post-Hoc Smoothing to Reduce Language Bias in Multilingual LLMs	SeungWon Ji et.al.	2507.05686	null	Kimi
282	2025-07-08	LLMs are Introvert	Litian Zhang et.al.	2507.05638	null	Kimi
283	2025-07-08	SARA: Selective and Adaptive Retrieval-augmented Generation with Context Compression	Yiqiao Jin et.al.	2507.05633	null	Kimi
284	2025-07-08	Flipping Knowledge Distillation: Leveraging Small Models’ Expertise to Enhance LLMs in Text Matching	Mingzhe Li et.al.	2507.05617	null	Kimi
285	2025-07-08	Enhancing Test-Time Scaling of Large Language Models with Hierarchical Retrieval-Augmented MCTS	Alex ZH Dou et.al.	2507.05557	null	Kimi
286	2025-07-07	Conversational Education at Scale: A Multi-LLM Agent Workflow for Procedural Learning and Pedagogic Quality Assessment	Jiahuan Pei et.al.	2507.05528	null	Kimi
287	2025-07-07	Fine-Grained Vision-Language Modeling for Multimodal Training Assistants in Augmented Reality	Haochen Huang et.al.	2507.05515	null	Kimi
288	2025-07-07	On the Semantics of Large Language Models	Martin Schuele et.al.	2507.05448	null	Kimi
289	2025-07-07	“Lost-in-the-Later”: Framework for Quantifying Contextual Grounding in Large Language Models	Yufei Tao et.al.	2507.05424	null	Kimi
290	2025-07-07	On the Bias of Next-Token Predictors Toward Systematically Inefficient Reasoning: A Shortest-Path Case Study	Riccardo Alberghi et.al.	2507.05362	null	Kimi
291	2025-07-07	LoRA-Augmented Generation (LAG) for Knowledge-Intensive Language Tasks	William Fleshman et.al.	2507.05346	null	Kimi
292	2025-07-07	MindFlow: Revolutionizing E-commerce Customer Support with Multimodal LLM Agents	Ming Gong et.al.	2507.05330	null	Kimi
293	2025-07-07	LCDS: A Logic-Controlled Discharge Summary Generation System Supporting Source Attribution and Expert Review	Cheng Yuan et.al.	2507.05319	null	Kimi
294	2025-07-07	Spatio-Temporal LLM: Reasoning about Environments and Actions	Haozhen Zheng et.al.	2507.05258	null	Kimi
295	2025-07-07	Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions	Yuanzhe Hu et.al.	2507.05257	null	Kimi
296	2025-07-07	Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning	Yana Wei et.al.	2507.05255	null	Kimi
297	2025-07-07	When Chain of Thought is Necessary, Language Models Struggle to Evade Monitors	Scott Emmons et.al.	2507.05246	null	Kimi
298	2025-07-07	StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling	Meng Wei et.al.	2507.05240	null	Kimi
299	2025-07-07	Critiques of World Models	Eric Xing et.al.	2507.05169	null	Kimi
300	2025-07-07	InfoSteer: Steering Information Utility in Language Model Post-Training	Chunyuan Deng et.al.	2507.05158	null	Kimi
301	2025-07-07	AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models	Chinnappa Guggilla et.al.	2507.05157	null	Kimi
302	2025-07-07	Interpretable Mnemonic Generation for Kanji Learning via Expectation-Maximization	Jaewook Lee et.al.	2507.05137	null	Kimi
303	2025-07-07	MARBLE: A Multi-Agent Rule-Based LLM Reasoning Engine for Accident Severity Prediction	Kaleem Ullah Qasim et.al.	2507.04893	null	Kimi
304	2025-07-07	Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations	A. Bochkov et.al.	2507.04886	null	Kimi
305	2025-07-07	FurniMAS: Language-Guided Furniture Decoration using Multi-Agent System	Toan Nguyen et.al.	2507.04770	null	Kimi
306	2025-07-07	From Imitation to Innovation: The Emergence of AI Unique Artistic Styles and the Challenge of Copyright Protection	Zexi Jia et.al.	2507.04769	null	Kimi
307	2025-07-07	CoSteer: Collaborative Decoding-Time Personalization via Local Delta Steering	Hang Lv et.al.	2507.04756	null	Kimi
308	2025-07-07	LLM-based Question-Answer Framework for Sensor-driven HVAC System Interaction	Sungmin Lee et.al.	2507.04748	null	Kimi
309	2025-07-07	Activation Steering for Chain-of-Thought Compression	Seyedarmin Azizi et.al.	2507.04742	null	Kimi
310	2025-07-07	LOOM-Scope: a comprehensive and efficient LOng-cOntext Model evaluation framework	Zecheng Tang et.al.	2507.04723	null	Kimi
311	2025-07-07	SPATIA: Multimodal Model for Prediction and Generation of Spatial Cell Phenotypes	Zhenglun Kong et.al.	2507.04704	null	Kimi
312	2025-07-07	Performance Evaluation of General Purpose Large Language Models for Basic Linear Algebra Subprograms Code Generation	Daichi Mukunoki et.al.	2507.04697	null	Kimi
313	2025-07-07	Knowledge-Aware Self-Correction in Language Models via Structured Memory Graphs	Swayamjit Saha et.al.	2507.04625	null	Kimi
314	2025-07-07	Multimodal LLM Integrated Semantic Communications for 6G Immersive Experiences	Yusong Zhang et.al.	2507.04621	null	Kimi
315	2025-07-07	PRIME: Large Language Model Personalization with Cognitive Memory and Thought Processes	Xinliang Frederick Zhang et.al.	2507.04607	null	Kimi
316	2025-07-06	Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts	Guokan Shang et.al.	2507.04569	null	Kimi
317	2025-07-06	Evaluating LLMs on Real-World Forecasting Against Human Superforecasters	Janna Lu et.al.	2507.04562	null	Kimi
318	2025-07-06	MambaVideo for Discrete Video Tokenization with Channel-Split Quantization	Dawit Mureja Argaw et.al.	2507.04559	null	Kimi
319	2025-07-06	DP-Fusion: Token-Level Differentially Private Inference for Large Language Models	Rushil Thareja et.al.	2507.04531	null	Kimi
320	2025-07-06	Model Inversion Attacks on Llama 3: Extracting PII from Large Language Models	Sathesh P. Sivashanmugam et.al.	2507.04478	null	Kimi
321	2025-07-06	The role of large language models in UI/UX design: A systematic literature review	Ammar Ahmed et.al.	2507.04469	null	Kimi
322	2025-07-06	CoT-lized Diffusion: Let’s Reinforce T2I Generation Step-by-step	Zheyuan Liu et.al.	2507.04451	null	Kimi
323	2025-07-06	MedGellan: LLM-Generated Medical Guidance to Support Physicians	Debodeep Banerjee et.al.	2507.04431	null	Kimi
324	2025-07-03	RefTok: Reference-Based Tokenization for Video Generation	Xiang Fan et.al.	2507.02862	null	Kimi
325	2025-07-03	Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching	Xin Zhou et.al.	2507.02860	null	Kimi
326	2025-07-03	Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation	Jiaer Xia et.al.	2507.02859	null	Kimi
327	2025-07-03	Requirements Elicitation Follow-Up Question Generation	Yuchen Shen et.al.	2507.02858	null	Kimi
328	2025-07-03	Answer Matching Outperforms Multiple Choice for Language Model Evaluation	Nikhil Chandak et.al.	2507.02856	null	Kimi
329	2025-07-03	MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs	Purbesh Mitra et.al.	2507.02851	null	Kimi
330	2025-07-03	LLM Hypnosis: Exploiting User Feedback for Unauthorized Knowledge Injection to All Users	Almog Hilel et.al.	2507.02850	null	Kimi
331	2025-07-03	Visual Contextual Attack: Jailbreaking MLLMs with Image-Driven Context Injection	Ziqi Miao et.al.	2507.02844	null	Kimi
332	2025-07-03	StepHint: Multi-level Stepwise Hints Enhance Reinforcement Learning to Reason	Kaiyi Zhang et.al.	2507.02841	null	Kimi
333	2025-07-03	ExPO: Unlocking Hard Reasoning with Self-Explanation-Guided Reinforcement Learning	Ruiyang Zhou et.al.	2507.02834	null	Kimi
334	2025-07-03	USAD: An Unsupervised Data Augmentation Spatio-Temporal Attention Diffusion Network	Ying Yu et.al.	2507.02827	null	Kimi
335	2025-07-03	Establishing Best Practices for Building Rigorous Agentic Benchmarks	Yuxuan Zhu et.al.	2507.02825	null	Kimi
336	2025-07-03	DNN-Based Precoding in RIS-Aided mmWave MIMO Systems With Practical Phase Shift	Po-Heng Chou et.al.	2507.02824	null	Kimi
337	2025-07-03	SynapseRoute: An Auto-Route Switching Framework on Dual-State Large Language Model	Wencheng Zhang et.al.	2507.02822	null	Kimi
338	2025-07-03	Multimodal Mathematical Reasoning with Diverse Solving Perspective	Wenhao Shi et.al.	2507.02804	null	Kimi
339	2025-07-03	Is Reasoning All You Need? Probing Bias in the Age of Reasoning Language Models	Riccardo Cantini et.al.	2507.02799	null	Kimi
340	2025-07-03	From Long Videos to Engaging Clips: A Human-Inspired Video Editing Framework with Multimodal Narrative Understanding	Xiangfeng Wang et.al.	2507.02790	null	Kimi
341	2025-07-03	Moral Responsibility or Obedience: What Do We Want from AI?	Joseph Boland et.al.	2507.02788	null	Kimi
342	2025-07-03	Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs	Ken Tsui et.al.	2507.02778	null	Kimi
343	2025-07-03	KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs	Yuzhang Xie et.al.	2507.02773	null	Kimi
344	2025-07-03	DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment	Ke-Han Lu et.al.	2507.02768	null	Kimi
345	2025-07-03	Knowledge Protocol Engineering: A New Paradigm for AI in Domain-Specific Knowledge Work	Guangwei Zhang et.al.	2507.02760	null	Kimi
346	2025-07-03	Multi-agent Auditory Scene Analysis	Caleb Rascon et.al.	2507.02755	null	Kimi
347	2025-07-03	Fast and Simplex: 2-Simplicial Attention in Triton	Aurko Roy et.al.	2507.02754	null	Kimi
348	2025-07-03	Synthesizable by Design: A Retrosynthesis-Guided Framework for Molecular Analog Generation	Shuan Chen et.al.	2507.02752	null	Kimi
349	2025-07-03	Linear Attention with Global Context: A Multipole Attention Mechanism for Vision and Physics	Alex Colagrande et.al.	2507.02748	null	Kimi
350	2025-07-03	Early Signs of Steganographic Capabilities in Frontier LLMs	Artur Zolkowski et.al.	2507.02737	null	Kimi
351	2025-07-03	Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks	Sizhe Chen et.al.	2507.02735	null	Kimi
352	2025-07-03	Bourbaki: Self-Generated and Goal-Conditioned MDPs for Theorem Proving	Matthieu Zimmer et.al.	2507.02726	null	Kimi
353	2025-07-03	UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation	Qin Guo et.al.	2507.02713	null	Kimi
354	2025-07-03	AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models	Ziyin Zhou et.al.	2507.02664	null	Kimi
355	2025-07-03	OmniDraft: A Cross-vocabulary, Online Adaptive Drafter for On-device Speculative Decoding	Ramchalam Kinattinkara Ramakrishnan et.al.	2507.02659	null	Kimi
356	2025-07-03	FlowSpec: Continuous Pipelined Speculative Decoding for Efficient Distributed LLM Inference	Xing Liu et.al.	2507.02620	null	Kimi
357	2025-07-03	Strategic Intelligence in Large Language Models: Evidence from evolutionary Game Theory	Kenneth Payne et.al.	2507.02618	null	Kimi
358	2025-07-03	Are You Listening to Me? Fine-Tuning Chatbots for Empathetic Dialogue	Paulo Ricardo Knob et.al.	2507.02537	null	Kimi
359	2025-07-03	Red grape detection with accelerated artificial neural networks in the FPGA’s programmable logic	Sandro Costa Magalhães et.al.	2507.02443	null	Kimi
360	2025-07-03	Holistic Tokenizer for Autoregressive Image Generation	Anlin Zheng et.al.	2507.02358	null	Kimi
361	2025-07-03	DoMIX: An Efficient Framework for Exploiting Domain Knowledge in Fine-Tuning	Dohoon Kim et.al.	2507.02302	null	Kimi
362	2025-07-03	MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent	Hongli Yu et.al.	2507.02259	null	Kimi
363	2025-07-03	SurgVisAgent: Multimodal Agentic Model for Versatile Surgical Visual Enhancement	Zeyu Lei et.al.	2507.02252	null	Kimi
364	2025-07-02	ESTR-CoT: Towards Explainable and Accurate Event Stream based Scene Text Recognition with Chain-of-Thought Reasoning	Xiao Wang et.al.	2507.02200	null	Kimi
365	2025-07-02	Latent Chain-of-Thought? Decoding the Depth-Recurrent Transformer	Wenquan Lu et.al.	2507.02199	null	Kimi
366	2025-07-02	Reasoning or Not? A Comprehensive Evaluation of Reasoning LLMs for Dialogue Summarization	Keyan Jin et.al.	2507.02145	null	Kimi
367	2025-07-02	When LLMs Disagree: Diagnosing Relevance Filtering Bias and Retrieval Divergence in SDG Search	William A. Ingram et.al.	2507.02139	null	Kimi
368	2025-07-02	Dissecting the Impact of Mobile DVFS Governors on LLM Inference Performance and Energy Efficiency	Zongpu Zhang et.al.	2507.02135	null	Kimi
369	2025-07-02	Reasoning on a Budget: A Survey of Adaptive and Controllable Test-Time Compute in LLMs	Mohammad Ali Alomrani et.al.	2507.02076	null	Kimi
370	2025-07-02	Large Language Models for Crash Detection in Video: A Survey of Methods, Datasets, and Challenges	Sanjeda Akter et.al.	2507.02074	null	Kimi
371	2025-07-02	Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation	Zhuoyang Zhang et.al.	2507.01957	null	Kimi
372	2025-07-02	SpecCLIP: Aligning and Translating Spectroscopic Measurements for Stars	Xiaosheng Zhao et.al.	2507.01939	null	Kimi
373	2025-07-02	Decision-oriented Text Evaluation	Yu-Shiang Huang et.al.	2507.01923	null	Kimi
374	2025-07-02	Gradient-Adaptive Policy Optimization: Towards Multi-Objective Alignment of Large Language Models	Chengao Li et.al.	2507.01915	null	Kimi
375	2025-07-02	AI4Research: A Survey of Artificial Intelligence for Scientific Research	Qiguang Chen et.al.	2507.01903	null	Kimi
376	2025-07-02	High-Layer Attention Pruning with Rescaling	Songtao Liu et.al.	2507.01900	null	Kimi
377	2025-07-02	MiCoTA: Bridging the Learnability Gap with Intermediate CoT and Teacher Assistants	Dongyi Ding et.al.	2507.01887	null	Kimi
378	2025-07-02	Bridging UI Design and chatbot Interactions: Applying Form-Based Principles to Conversational Agents	Sanjay Krishna Anbalagan et.al.	2507.01862	null	Kimi
379	2025-07-02	Eka-Eval : A Comprehensive Evaluation Framework for Large Language Models in Indian Languages	Samridhi Raj Sinha et.al.	2507.01853	null	Kimi
380	2025-07-02	LoRA Fine-Tuning Without GPUs: A CPU-Efficient Meta-Generation Framework for LLMs	Reza Arabpour et.al.	2507.01806	null	Kimi
381	2025-07-02	How Do Vision-Language Models Process Conflicting Information Across Modalities?	Tianze Hua et.al.	2507.01790	null	Kimi
382	2025-07-02	MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining	Zhixun Chen et.al.	2507.01785	null	Kimi
383	2025-07-02	ECCV 2024 W-CODA: 1st Workshop on Multimodal Perception and Comprehension of Corner Cases in Autonomous Driving	Kai Chen et.al.	2507.01735	null	Kimi
384	2025-07-02	AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness	Zixin Chen et.al.	2507.01702	null	Kimi
385	2025-07-02	Data Agent: A Holistic Architecture for Orchestrating Data+AI Ecosystems	Zhaoyan Sun et.al.	2507.01599	null	Kimi
386	2025-07-02	Following the Clues: Experiments on Person Re-ID using Cross-Modal Intelligence	Robert Aufschläger et.al.	2507.01504	null	Kimi
387	2025-07-02	Agent-as-Tool: A Study on the Hierarchical Decision Making with Reinforcement Learning	Yanfei Zhang et.al.	2507.01489	null	Kimi
388	2025-07-02	BioMARS: A Multi-Agent Robotic System for Autonomous Biological Experiments	Yibo Qiu et.al.	2507.01485	null	Kimi
389	2025-07-02	Evaluating the Effectiveness of Direct Preference Optimization for Personalizing German Automatic Text Simplifications for Persons with Intellectual Disabilities	Yingqiang Gao et.al.	2507.01479	null	Kimi
390	2025-07-02	LogitSpec: Accelerating Retrieval-based Speculative Decoding via Next Next Token Speculation	Tianyu Liu et.al.	2507.01449	null	Kimi
391	2025-07-02	EdgeLoRA: An Efficient Multi-Tenant LLM Serving System on Edge Devices	Zheyu Shen et.al.	2507.01438	null	Kimi
392	2025-07-02	AI Agents and Agentic AI-Navigating a Plethora of Concepts for Future Manufacturing	Yinwang Ren et.al.	2507.01376	null	Kimi
393	2025-07-02	Long-Tailed Distribution-Aware Router For Mixture-of-Experts in Large Vision-Language Model	Chaoxiang Cai et.al.	2507.01351	null	Kimi
394	2025-07-02	Symbolic or Numerical? Understanding Physics Problem Solving in Reasoning LLMs	Nifu Dan et.al.	2507.01334	null	Kimi
395	2025-07-02	VLAD: A VLM-Augmented Autonomous Driving Framework with Hierarchical Planning and Interpretable Decision Process	Cristian Gariboldi et.al.	2507.01284	null	Kimi
396	2025-07-02	GAIus: Combining Genai with Legal Clauses Retrieval for Knowledge-based Assistant	Michał Matak et.al.	2507.01259	null	Kimi
397	2025-07-01	Enhancing LLM Agent Safety via Causal Influence Prompting	Dongyoon Hahm et.al.	2507.00979	null	Kimi
398	2025-07-01	Large Language Model Powered Intelligent Urban Agents: Concepts, Capabilities, and Applications	Jindong Han et.al.	2507.00914	null	Kimi
399	2025-07-01	ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models	Zifu Wan et.al.	2507.00898	null	Kimi
400	2025-07-01	TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation	Xi Xuan et.al.	2507.00875	null	Kimi
401	2025-07-01	Teaching Time Series to See and Speak: Forecasting with Aligned Visual and Textual Perspectives	Sixun Dong et.al.	2506.24124	null	Kimi
402	2025-06-30	Calligrapher: Freestyle Text Image Customization	Yue Ma et.al.	2506.24123	null	Kimi
403	2025-06-30	Data Uniformity Improves Training Efficiency and More, with a Convergence Framework Beyond the NTK Regime	Yuqing Wang et.al.	2506.24120	null	Kimi
404	2025-07-01	SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning	Bo Liu et.al.	2506.24119	null	Kimi
405	2025-07-01	Intertextual Parallel Detection in Biblical Hebrew: A Transformer-Based Benchmark	David M. Smiley et.al.	2506.24117	null	Kimi
406	2025-06-30	On the Predictive Power of Representation Dispersion in Language Models	Yanhong Li et.al.	2506.24106	null	Kimi
407	2025-06-30	DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World	Xiangtai Li et.al.	2506.24102	null	Kimi
408	2025-06-30	MotionGPT3: Human Motion as a Second Modality	Bingfan Zhu et.al.	2506.24086	null	Kimi
409	2025-06-30	Imagine for Me: Creative Conceptual Blending of Real Images and Text via Blended Attention	Wonwoong Cho et.al.	2506.24085	null	Kimi
410	2025-06-30	STACK: Adversarial Attacks on LLM Safeguard Pipelines	Ian R. McKenzie et.al.	2506.24068	null	Kimi
411	2025-06-30	Continual Adaptation: Environment-Conditional Parameter Generation for Object Detection in Dynamic Scenarios	Deng Li et.al.	2506.24063	null	Kimi
412	2025-06-30	Logit-Gap Steering: Efficient Short-Suffix Jailbreaks for Aligned Large Language Models	Tung-Ling Li et.al.	2506.24056	null	Kimi
413	2025-06-30	Agent.xpu: Efficient Scheduling of Agentic LLM Workloads on Heterogeneous SoC	Xinming Wei et.al.	2506.24045	null	Kimi
414	2025-06-30	A Survey on Vision-Language-Action Models for Autonomous Driving	Sicong Jiang et.al.	2506.24044	null	Kimi
415	2025-06-30	Foundation Models for Zero-Shot Segmentation of Scientific Images without AI-Ready Data	Shubhabrata Mukherjee et.al.	2506.24039	null	Kimi
416	2025-06-30	Ella: Embodied Social Agents with Lifelong Memory	Hongxin Zhang et.al.	2506.24019	null	Kimi
417	2025-06-30	EXPERT: An Explainable Image Captioning Evaluation Metric with Structured Explanations	Hyunjong Kim et.al.	2506.24016	null	Kimi
418	2025-06-30	Large Language Models Don’t Make Sense of Word Problems. A Scoping Review from a Mathematics Education Perspective	Anselm R. Strohmaier et.al.	2506.24006	null	Kimi
419	2025-06-30	ShapeKit	Junqi Liu et.al.	2506.24003	null	Kimi
420	2025-06-30	The Illusion of Progress? A Critical Look at Test-Time Adaptation for Vision-Language Models	Lijun Sheng et.al.	2506.24000	null	Kimi
421	2025-06-30	Auto-TA: Towards Scalable Automated Thematic Analysis (TA) via Multi-Agent Large Language Models with Reinforcement Learning	Seungjun Yi et.al.	2506.23998	null	Kimi
422	2025-06-30	STCLocker: Deadlock Avoidance Testing for Autonomous Driving Systems	Mingfei Cheng et.al.	2506.23995	null	Kimi
423	2025-06-30	Harnessing AI Agents to Advance Research on Refugee Child Mental Health	Aditya Shrivastava et.al.	2506.23992	null	Kimi
424	2025-06-30	Machine Understanding of Scientific Language	Dustin Wright et.al.	2506.23990	null	Kimi
425	2025-06-30	TaP: A Taxonomy-Guided Framework for Automated and Scalable Preference Data Generation	Renren Jin et.al.	2506.23979	null	Kimi
426	2025-06-30	LLM Agents Are the Antidote to Walled Gardens	Samuele Marro et.al.	2506.23978	null	Kimi
427	2025-06-30	Evaluating the Impact of Khmer Font Types on Text Recognition	Vannkinh Nom et.al.	2506.23963	null	Kimi
428	2025-06-30	ADReFT: Adaptive Decision Repair for Safe Autonomous Driving via Reinforcement Fine-Tuning	Mingfei Cheng et.al.	2506.23960	null	Kimi
429	2025-06-30	Performance of LLMs on Stochastic Modeling Operations Research Problems: From Theory to Practice	Akshit Kumar et.al.	2506.23924	null	Kimi
430	2025-06-30	Advancing Multi-Step Mathematical Reasoning in Large Language Models through Multi-Layered Self-Reflection with Auto-Prompting	André de Souza Loureiro et.al.	2506.23888	null	Kimi
431	2025-06-30	Chain of Thought in Order: Discovering Learning-Friendly Orders for Arithmetic	Yuta Sato et.al.	2506.23875	null	Kimi
432	2025-06-30	A Survey on Autonomy-Induced Security Risks in Large Model-Based Agents	Hang Su et.al.	2506.23844	null	Kimi
433	2025-06-30	Do Thinking Tokens Help or Trap? Towards More Efficient Large Reasoning Model	Bowen Ding et.al.	2506.23840	null	Kimi
434	2025-06-30	Flash-VStream: Efficient Real-Time Understanding for Long Video Streams	Haoji Zhang et.al.	2506.23825	null	Kimi
435	2025-06-30	AutoEvoEval: An Automated Framework for Evolving Close-Ended LLM Evaluation Data	JiaRu Wu et.al.	2506.23735	null	Kimi
436	2025-06-30	Attestable Audits: Verifiable AI Safety Benchmarks Using Trusted Execution Environments	Christoph Schnabl et.al.	2506.23706	null	Kimi
437	2025-06-30	PokéAI: A Goal-Generating, Battle-Optimizing Multi-agent System for Pokemon Red	Zihao Liu et.al.	2506.23689	null	Kimi
438	2025-06-30	Interactive Reasoning: Visualizing and Controlling Chain-of-Thought Reasoning in Large Language Models	Rock Yuren Pang et.al.	2506.23678	null	Kimi
439	2025-06-30	Unified Multimodal Understanding via Byte-Pair Visual Encoding	Wanpeng Zhang et.al.	2506.23639	null	Kimi
440	2025-06-30	Towards Building Private LLMs: Exploring Multi-Node Expert Parallelism on Apple Silicon for Mixture-of-Experts Large Language Model	Mu-Chi Chen et.al.	2506.23635	null	Kimi
441	2025-06-30	AI-Generated Lecture Slides for Improving Slide Element Detection and Retrieval	Suyash Maniyar et.al.	2506.23605	null	Kimi
442	2025-06-30	Semantic-guided Diverse Decoding for Large Language Model	Weijie Shi et.al.	2506.23601	null	Kimi
443	2025-06-30	MMReason: An Open-Ended Multi-Modal Multi-Step Reasoning Benchmark for MLLMs Toward AGI	Huanjin Yao et.al.	2506.23563	null	Kimi
444	2025-06-30	NEU-ESC: A Comprehensive Vietnamese dataset for Educational Sentiment analysis and topic Classification toward multitask learning	Phan Quoc Hung Mai et.al.	2506.23524	null	Kimi
445	2025-06-30	Thought-Augmented Planning for LLM-Powered Interactive Recommender Agent	Haocheng Yu et.al.	2506.23485	null	Kimi
446	2025-06-29	Pipelined Decoder for Efficient Context-Aware Text Generation	Zixian Huang et.al.	2506.23431	null	Kimi
447	2025-06-29	TuCo: Measuring the Contribution of Fine-Tuning to Individual Responses of LLMs	Felipe Nuti et.al.	2506.23423	null	Kimi
448	2025-06-29	SIEDD: Shared-Implicit Encoder with Discrete Decoders	Vikram Rangarajan et.al.	2506.23382	null	Kimi
449	2025-06-29	Perspective Dial: Measuring Perspective of Text and Guiding LLM Outputs	Taejin Kim et.al.	2506.23377	null	Kimi
450	2025-06-29	ATGen: A Framework for Active Text Generation	Akim Tsvigun et.al.	2506.23342	null	Kimi
451	2025-06-29	Information Loss in LLMs’ Multilingual Translation: The Role of Training Data, Language Proximity, and Language Family	Yumeng Lin et.al.	2506.23340	null	Kimi
452	2025-06-29	GATSim: Urban Mobility Simulation with Generative Agents	Qi Liu et.al.	2506.23306	null	Kimi
453	2025-06-29	Objective-Free Local Learning and Emergent Language Structure in Thinking Machines	P. Myles Eugenio et.al.	2506.23293	null	Kimi
454	2025-06-26	Whole-Body Conditioned Egocentric Video Prediction	Yutong Bai et.al.	2506.21552	null	Kimi
455	2025-06-26	mTSBench: Benchmarking Multivariate Time Series Anomaly Detection and Model Selection at Scale	Xiaona Zhou et.al.	2506.21550	null	Kimi
456	2025-06-26	SAM4D: Segment Anything in Camera and LiDAR Streams	Jianyun Xu et.al.	2506.21547	null	Kimi
457	2025-06-26	HalluSegBench: Counterfactual Visual Reasoning for Segmentation Hallucination Evaluation	Xinzhuo Li et.al.	2506.21546	null	Kimi
458	2025-06-26	PsyLite Technical Report	Fangjun Ding et.al.	2506.21536	null	Kimi
459	2025-06-26	Exploring the Design Space of 3D MLLMs for CT Report Generation	Mohammed Baharoon et.al.	2506.21535	null	Kimi
460	2025-06-26	“What’s Up, Doc?”: Analyzing How Users Seek Health Information in Large-Scale Conversational AI Datasets	Akshay Paruchuri et.al.	2506.21532	null	Kimi
461	2025-06-26	WAFT: Warping-Alone Field Transforms for Optical Flow	Yihan Wang et.al.	2506.21526	null	Kimi
462	2025-06-26	Potemkin Understanding in Large Language Models	Marina Mancoridis et.al.	2506.21521	null	Kimi
463	2025-06-26	Mitigating Hallucination of Large Vision-Language Models via Dynamic Logits Calibration	Jiahe Chen et.al.	2506.21509	null	Kimi
464	2025-06-26	skLEP: A Slovak General Language Understanding Benchmark	Marek Šuppa et.al.	2506.21508	null	Kimi
465	2025-06-26	Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge	Boyu Gou et.al.	2506.21506	null	Kimi
466	2025-06-26	Enhancing User Engagement in Socially-Driven Dialogue through Interactive LLM Alignments	Jiashuo Wang et.al.	2506.21497	null	Kimi
467	2025-06-26	Bridging Offline and Online Reinforcement Learning for LLMs	Jack Lanchantin et.al.	2506.21495	null	Kimi
468	2025-06-26	Ad-Hoc Human-AI Coordination Challenge	Tin Dizdarević et.al.	2506.21490	null	Kimi
469	2025-06-26	TITAN: Query-Token based Domain Adaptive Adversarial Learning	Tajamul Ashraf et.al.	2506.21484	null	Kimi
470	2025-06-26	TopK Language Models	Ryosuke Takahashi et.al.	2506.21468	null	Kimi
471	2025-06-26	Efficient and Reuseable Cloud Configuration Search Using Discovery Spaces	Michael Johnston et.al.	2506.21467	null	Kimi
472	2025-06-26	Aligning Spoken Dialogue Models from User Interactions	Anne Wu et.al.	2506.21463	null	Kimi
473	2025-06-26	Spatial Mental Modeling from Limited Views	Baiqiao Yin et.al.	2506.21458	null	Kimi
474	2025-06-26	Rethinking Oversaturation in Classifier-Free Guidance via Low Frequency	Kaiyu Song et.al.	2506.21452	null	Kimi
475	2025-06-26	ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing	Huadai Liu et.al.	2506.21448	null	Kimi
476	2025-06-26	Text2Cypher Across Languages: Evaluating Foundational Models Beyond English	Makbule Gulcin Ozsoy et.al.	2506.21445	null	Kimi
477	2025-06-26	Domain Knowledge-Enhanced LLMs for Fraud and Concept Drift Detection	Ali Şenol et.al.	2506.21443	null	Kimi
478	2025-06-26	HyperSORT: Self-Organising Robust Training with hyper-networks	Samuel Joutard et.al.	2506.21430	null	Kimi
479	2025-06-26	XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation	Bowen Chen et.al.	2506.21416	null	Kimi
480	2025-06-26	Scalable Bayesian Low-Rank Adaptation of Large Language Models via Stochastic Variational Subspace Inference	Colin Samplawski et.al.	2506.21408	null	Kimi
481	2025-06-26	TableMoE: Neuro-Symbolic Routing for Structured Expert Reasoning in Multimodal Table Understanding	Junwen Zhang et.al.	2506.21393	null	Kimi
482	2025-06-26	Leveraging LLM-Assisted Query Understanding for Live Retrieval-Augmented Generation	Guanting Dong et.al.	2506.21384	null	Kimi
483	2025-06-26	GenFlow: Interactive Modular System for Image Generation	Duc-Hung Nguyen et.al.	2506.21369	null	Kimi
484	2025-06-26	Latent Prototype Routing: Achieving Near-Perfect Load Balancing in Mixture-of-Experts	Jiajie Yang et.al.	2506.21328	null	Kimi
485	2025-06-26	Detecting Referring Expressions in Visually Grounded Dialogue with Autoregressive Language Models	Bram Willemsen et.al.	2506.21294	null	Kimi
486	2025-06-26	Small Encoders Can Rival Large Decoders in Detecting Groundedness	Istabrak Abbes et.al.	2506.21288	null	Kimi
487	2025-06-26	Double-Checker: Enhancing Reasoning of Slow-Thinking LLMs via Self-Critical Fine-Tuning	Xin Xu et.al.	2506.21285	null	Kimi
488	2025-06-26	HumanOmniV2: From Understanding to Omni-Modal Reasoning with Context	Qize Yang et.al.	2506.21277	null	Kimi
489	2025-06-26	DiLoCoX: A Low-Communication Large-Scale Training Framework for Decentralized Cluster	Ji Qi et.al.	2506.21263	null	Kimi
490	2025-06-26	Unveiling Causal Reasoning in Large Language Models: Reality or Mirage?	Haoang Chi et.al.	2506.21215	null	Kimi
491	2025-06-26	$T^3$ : Multi-level Tree-based Automatic Program Repair with Large Language Models	Quanming Liu et.al.	2506.21211	null	Kimi
492	2025-06-26	Task-Aware KV Compression For Cost-Effective Long Video Understanding	Minghao Qin et.al.	2506.21184	null	Kimi
493	2025-06-26	Uncover Treasures in DCT: Advancing JPEG Quality Enhancement by Exploiting Latent Correlations	Jing Yang et.al.	2506.21171	null	Kimi
494	2025-06-26	Large Language Models Acing Chartered Accountancy	Jatin Gupta et.al.	2506.21031	null	Kimi
495	2025-06-26	Evidence-based diagnostic reasoning with multi-agent copilot for human pathology	Chengkuan Chen et.al.	2506.20964	null	Kimi
496	2025-06-26	ParEval-Repo: A Benchmark Suite for Evaluating LLMs with Repository-level HPC Translation Tasks	Joshua H. Davis et.al.	2506.20938	null	Kimi
497	2025-06-25	Uncovering Hidden Violent Tendencies in LLMs: A Demographic Analysis via Behavioral Vignettes	Quintin Myers et.al.	2506.20822	null	Kimi
498	2025-06-25	MultiFinRAG: An Optimized Multimodal Retrieval-Augmented Generation (RAG) Framework for Financial Question Answering	Chinmay Gondhalekar et.al.	2506.20821	null	Kimi
499	2025-06-25	The Ideation-Execution Gap: Execution Outcomes of LLM-Generated versus Human Research Ideas	Chenglei Si et.al.	2506.20803	null	Kimi
500	2025-06-25	The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind	Andrei Lupu et.al.	2506.20664	null	Kimi
501	2025-06-25	Memento: Note-Taking for Your Future Self	Chao Wan et.al.	2506.20642	null	Kimi
502	2025-06-26	DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation	Shansan Gong et.al.	2506.20639	null	Kimi
503	2025-06-25	Show, Tell and Summarize: Dense Video Captioning Using Visual Cue Aided Sentence Summarization	Zhiwang Zhang et.al.	2506.20567	null	Kimi
504	2025-06-25	When Life Gives You Samples: The Benefits of Scaling up Inference Compute for Multilingual LLMs	Ammar Khairi et.al.	2506.20544	null	Kimi
505	2025-06-25	WattsOnAI: Measuring, Analyzing, and Visualizing Energy and Carbon Footprint of AI Workloads	Hongzhen Huang et.al.	2506.20535	null	Kimi
506	2025-06-25	Case-based Reasoning Augmented Large Language Model Framework for Decision Making in Realistic Safety-Critical Driving Scenarios	Wenbin Gan et.al.	2506.20531	null	Kimi
507	2025-06-25	OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling	Zengzhi Wang et.al.	2506.20512	null	Kimi
508	2025-06-25	Probing AI Safety with Source Code	Ujwal Narayan et.al.	2506.20471	null	Kimi
509	2025-06-25	An Agentic System for Rare Disease Diagnosis with Traceable Reasoning	Weike Zhao et.al.	2506.20430	null	Kimi
510	2025-06-25	SV-LLM: An Agentic Approach for SoC Security Verification using Large Language Models	Dipayan Saha et.al.	2506.20415	null	Kimi
511	2025-06-25	Enterprise Large Language Model Evaluation Benchmark	Liya Wang et.al.	2506.20274	null	Kimi
512	2025-06-25	A Transformer Based Handwriting Recognition System Jointly Using Online and Offline Features	Ayush Lodh et.al.	2506.20255	null	Kimi
513	2025-06-25	Enhancing Large Language Models through Structured Reasoning	Yubo Dong et.al.	2506.20241	null	Kimi
514	2025-06-25	How to Retrieve Examples in In-context Learning to Improve Conversational Emotion Recognition using Large Language Models?	Mengqi Wang et.al.	2506.20199	null	Kimi
515	2025-06-25	SEED: A Structural Encoder for Embedding-Driven Decoding in Time Series Prediction with LLMs	Fengze Li et.al.	2506.20167	null	Kimi
516	2025-06-25	EAR: Erasing Concepts from Unified Autoregressive Models	Haipeng Fan et.al.	2506.20151	null	Kimi
517	2025-06-25	MIRAGE: A Benchmark for Multimodal Information-Seeking and Reasoning in Agricultural Expert-Guided Conversations	Vardhan Dongre et.al.	2506.20100	null	Kimi
518	2025-06-25	A Modular Multitask Reasoning Framework Integrating Spatio-temporal Models and LLMs	Kethmi Hirushini Hettige et.al.	2506.20073	null	Kimi
519	2025-06-24	Persona-Assigned Large Language Models Exhibit Human-Like Motivated Reasoning	Saloni Dash et.al.	2506.20020	null	Kimi
520	2025-06-24	Accurate and Energy Efficient: Local Retrieval-Augmented Generation Models Outperform Commercial Large Language Models in Medical Tasks	Konstantinos Vrettos et.al.	2506.20009	null	Kimi
521	2025-06-24	Inference Scaled GraphRAG: Improving Multi Hop Question Answering on Knowledge Graphs	Travis Thompson et.al.	2506.19967	null	Kimi
522	2025-06-24	Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture	Shuchen Xue et.al.	2506.19935	null	Kimi
523	2025-06-24	Prover Agent: An Agent-based Framework for Formal Mathematical Proofs	Kaito Baba et.al.	2506.19923	null	Kimi
524	2025-06-24	Radial Attention: $O(n\log n)$ Sparse Attention with Energy Decay for Long Video Generation	Xingyang Li et.al.	2506.19852	null	Kimi
525	2025-06-24	Orthogonal Finetuning Made Scalable	Zeju Qiu et.al.	2506.19847	null	Kimi
526	2025-06-24	Scaling Speculative Decoding with Lookahead Reasoning	Yichao Fu et.al.	2506.19830	null	Kimi
527	2025-06-24	KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality	Baochang Ren et.al.	2506.19807	null	Kimi
528	2025-06-24	Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study	Yuqi Zhu et.al.	2506.19794	null	Kimi
529	2025-06-24	SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning	Yuqian Fu et.al.	2506.19767	null	Kimi
530	2025-06-24	Arabic Dialect Classification using RNNs, Transformers, and Large Language Models: A Comparative Analysis	Omar A. Essameldin et.al.	2506.19753	null	Kimi
531	2025-06-24	Recurrent Visual Feature Extraction and Stereo Attentions for CT Report Generation	Yuanhe Tian et.al.	2506.19665	null	Kimi
532	2025-06-24	PEVLM: Parallel Encoding for Vision-Language Models	Letian Kang et.al.	2506.19651	null	Kimi
533	2025-06-24	ECCoT: A Framework for Enhancing Effective Cognition via Chain of Thought in Large Language Model	Zhenke Duan et.al.	2506.19599	null	Kimi
534	2025-06-24	Fake or Real, Can Robots Tell? Evaluating Embodied Vision-Language Models on Real and 3D-Printed Objects	Federico Tavella et.al.	2506.19579	null	Kimi
535	2025-06-24	AnTKV: Anchor Token-Aware Sub-Bit Vector Quantization for KV Cache in Large Language Models	Zeyu Li et.al.	2506.19505	null	Kimi
536	2025-06-24	Dialogic Pedagogy for Large Language Models: Aligning Conversational AI with Proven Theories of Learning	Russell Beale et.al.	2506.19484	null	Kimi
537	2025-06-24	Can Large Language Models Capture Human Annotator Disagreements?	Jingwei Ni et.al.	2506.19467	null	Kimi
538	2025-06-24	Mem4Nav: Boosting Vision-and-Language Navigation in Urban Environments with a Hierarchical Spatial-Cognition Long-Short Memory System	Lixuan He et.al.	2506.19433	null	Kimi
539	2025-06-24	Learning to Disentangle Latent Reasoning Rules with Language VAEs: A Systematic Study	Yingji Zhang et.al.	2506.19418	null	Kimi
540	2025-06-24	Automated Detection of Pre-training Text in Black-box LLMs	Ruihan Hu et.al.	2506.19399	null	Kimi
541	2025-06-24	Personality Prediction from Life Stories using Language Models	Rasiq Hussain et.al.	2506.19258	null	Kimi
542	2025-06-24	RecLLM-R1: A Two-Stage Training Paradigm with Reinforcement Learning and Chain-of-Thought v1	Yu Xie et.al.	2506.19235	null	Kimi
543	2025-06-24	Video-XL-2: Towards Very Long-Video Understanding Through Task-Aware KV Sparsification	Minghao Qin et.al.	2506.19225	null	Kimi
544	2025-06-24	Augmenting Multi-Agent Communication with State Delta Trajectory	Yichen Tang et.al.	2506.19209	null	Kimi
545	2025-06-23	Thought Anchors: Which LLM Reasoning Steps Matter?	Paul C. Bogdan et.al.	2506.19143	null	Kimi
546	2025-06-23	HAWAII: Hierarchical Visual Knowledge Transfer for Efficient Vision-Language Models	Yimu Wang et.al.	2506.19072	null	Kimi
547	2025-06-23	Quantifying Fairness in LLMs Beyond Tokens: A Semantic and Statistical Perspective	Weijie Xu et.al.	2506.19028	null	Kimi
548	2025-06-23	Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations	Jiaming Han et.al.	2506.18898	null	Kimi
549	2025-06-23	OMEGA: Can LLMs Reason Outside the Box in Math? Evaluating Exploratory, Compositional, and Transformative Generalization	Yiyou Sun et.al.	2506.18880	null	Kimi
550	2025-06-23	CommVQ: Commutative Vector Quantization for KV Cache Compression	Junyan Li et.al.	2506.18879	null	Kimi
551	2025-06-23	OmniGen2: Exploration to Advanced Multimodal Generation	Chenyuan Wu et.al.	2506.18871	null	Kimi
552	2025-06-23	LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning	Yuhao Wu et.al.	2506.18841	null	Kimi
553	2025-06-23	STU-PID: Steering Token Usage via PID Controller for Efficient Large Language Model Reasoning	Aryasomayajula Ram Bharadwaj et.al.	2506.18831	null	Kimi
554	2025-06-23	Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories	Islem Bouzenia et.al.	2506.18824	null	Kimi
555	2025-06-24	ConciseHint: Boosting Efficient Reasoning via Continuous Concise Hints during Generation	Siao Tang et.al.	2506.18810	link	Kimi
556	2025-06-23	Existing LLMs Are Not Self-Consistent For Simple Tasks	Zhenru Lin et.al.	2506.18781	null	Kimi
557	2025-06-23	Is There a Case for Conversation Optimized Tokenizers in Large Language Models?	Raquel Ferrando et.al.	2506.18674	null	Kimi
558	2025-06-23	Historical Report Guided Bi-modal Concurrent Learning for Pathology Report Generation	Ling Zhang et.al.	2506.18658	null	Kimi
559	2025-06-23	ReDit: Reward Dithering for Improved LLM Policy Optimization	Chenxing Wei et.al.	2506.18631	null	Kimi
560	2025-06-23	AggTruth: Contextual Hallucination Detection using Aggregated Attention Scores in LLMs	Piotr Matys et.al.	2506.18628	null	Kimi
561	2025-06-23	Parallel Continuous Chain-of-Thought with Jacobi Iteration	Haoyi Wu et.al.	2506.18582	null	Kimi
562	2025-06-23	Security Assessment of DeepSeek and GPT Series Models against Jailbreak Attacks	Xiaodong Wu et.al.	2506.18543	null	Kimi
563	2025-06-23	Comparative Evaluation of ChatGPT and DeepSeek Across Key NLP Tasks: Strengths, Weaknesses, and Domain-Specific Performance	Wael Etaiwi et.al.	2506.18501	null	Kimi
564	2025-06-23	MeRF: Motivation-enhanced Reinforcement Finetuning for Large Reasoning Models	Junjie Zhang et.al.	2506.18485	null	Kimi
565	2025-06-23	TReB: A Comprehensive Benchmark for Evaluating Table Reasoning Capabilities of Large Language Models	Ce Li et.al.	2506.18421	null	Kimi
566	2025-06-23	SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation	Zichong Li et.al.	2506.18349	null	Kimi
567	2025-06-23	Dynamic Knowledge Exchange and Dual-diversity Review: Concisely Unleashing the Potential of a Multi-Agent Research Team	Weilun Yu et.al.	2506.18348	null	Kimi
568	2025-06-23	Less Data Less Tokens: Multilingual Unification Learning for Efficient Test-Time Reasoning in LLMs	Kang Chen et.al.	2506.18341	null	Kimi
569	2025-06-23	RLPR: Extrapolating RLVR to General Domains without Verifiers	Tianyu Yu et.al.	2506.18254	null	Kimi
570	2025-06-23	Make It Efficient: Dynamic Sparse Attention for Autoregressive Image Generation	Xunzhi Xiang et.al.	2506.18226	null	Kimi
571	2025-06-22	Understanding Reasoning in Thinking Language Models via Steering Vectors	Constantin Venhoff et.al.	2506.18167	null	Kimi
572	2025-06-22	Chain-of-Memory: Enhancing GUI Agents for Cross-Application Navigation	Xinzge Gao et.al.	2506.18158	null	Kimi
573	2025-06-22	QuranMorph: Morphologically Annotated Quranic Corpus	Diyam Akra et.al.	2506.18148	null	Kimi
574	2025-06-22	$φ^{\infty}$ : Clause Purification, Embedding Realignment, and the Total Suppression of the Em Dash in Autoregressive Language Models	Bugra Kilictas et.al.	2506.18129	null	Kimi
575	2025-06-22	Mental Health Equity in LLMs: Leveraging Multi-Hop Question Answering to Detect Amplified and Silenced Perspectives	Batool Haider et.al.	2506.18116	null	Kimi
576	2025-06-22	InspireDebate: Multi-Dimensional Subjective-Objective Evaluation-Guided Reasoning and Optimization for Debating	Fuyu Wang et.al.	2506.18102	null	Kimi
577	2025-06-22	RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation	Tianxing Chen et.al.	2506.18088	null	Kimi
578	2025-06-18	PhantomHunter: Detecting Unseen Privately-Tuned LLM-Generated Text via Family-Aware Learning	Yuhui Shi et.al.	2506.15683	null	Kimi
579	2025-06-18	Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence	Yining Hong et.al.	2506.15677	null	Kimi
580	2025-06-18	Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers	Tommaso Green et.al.	2506.15674	link	Kimi
581	2025-06-18	Revisiting Compositional Generalization Capability of Large Language Models Considering Instruction Following Ability	Yusuke Sakai et.al.	2506.15629	null	Kimi
582	2025-06-18	WikiMixQA: A Multimodal Benchmark for Question Answering over Tables and Charts	Negar Foroutan et.al.	2506.15594	link	Kimi
583	2025-06-18	Managing Complex Failure Analysis Workflows with LLM-based Reasoning and Acting Agents	Aline Dobrovsky et.al.	2506.15567	null	Kimi
584	2025-06-18	PredGen: Accelerated Inference of Large Language Models through Input-Time Speculation for Real-Time Speech Interaction	Shufan Li et.al.	2506.15556	null	Kimi
585	2025-06-18	Optimizing Web-Based AI Query Retrieval with GPT Integration in LangChain A CoT-Enhanced Prompt Engineering Approach	Wenqi Guan et.al.	2506.15512	null	Kimi
586	2025-06-18	SPARE: Single-Pass Annotation with Reference-Guided Evaluation for Automatic Process Supervision and Reward Modelling	Md Imbesat Hassan Rizvi et.al.	2506.15498	link	Kimi
587	2025-06-18	Context-Informed Grounding Supervision	Hyunji Lee et.al.	2506.15480	link	Kimi
588	2025-06-18	RE-IMAGINE: Symbolic Benchmark Synthesis for Reasoning Evaluation	Xinnuo Xu et.al.	2506.15455	null	Kimi
589	2025-06-18	Uncovering Intention through LLM-Driven Code Snippet Description Generation	Yusuf Sulistyo Nugroho et.al.	2506.15453	null	Kimi
590	2025-06-18	Targeted Lexical Injection: Unlocking Latent Cross-Lingual Alignment in Lugha-Llama via Early-Layer LoRA Fine-Tuning	Stanley Ngugi et.al.	2506.15415	null	Kimi
591	2025-06-18	DeVisE: Behavioral Testing of Medical Large Language Models	Camila Zurdo Tagliabue et.al.	2506.15339	null	Kimi
592	2025-06-18	Cohort Discovery: A Survey on LLM-Assisted Clinical Trial Recruitment	Shrestha Ghosh et.al.	2506.15301	null	Kimi
593	2025-06-18	ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning in LLMs	Feng He et.al.	2506.15211	null	Kimi
594	2025-06-18	A Comparative Study of Task Adaptation Techniques of Large Language Models for Identifying Sustainable Development Goals	Andrea Cadeddu et.al.	2506.15208	null	Kimi
595	2025-06-18	eLLM: Elastic Memory Management Framework for Efficient LLM Serving	Jiale Xu et.al.	2506.15155	null	Kimi
596	2025-06-18	Modeling the One-to-Many Property in Open-Domain Dialogue with LLMs	Jing Yang Lee et.al.	2506.15131	null	Kimi
597	2025-06-18	Truncated Proximal Policy Optimization	Tiantian Fan et.al.	2506.15050	null	Kimi
598	2025-06-17	SFT-GO: Supervised Fine-Tuning with Group Optimization for Large Language Models	Gyuhak Kim et.al.	2506.15021	null	Kimi
599	2025-06-17	Scaling Intelligence: Designing Data Centers for Next-Gen Language Models	Jesmin Jahan Tithi et.al.	2506.15006	null	Kimi
600	2025-06-17	Memory Tokens: Large Language Models Can Generate Reversible Sentence Embeddings	Ignacio Sastre et.al.	2506.15001	link	Kimi
601	2025-06-17	A Variational Framework for Improving Naturalness in Generative Spoken Language Models	Li-Wei Chen et.al.	2506.14767	link	Kimi
602	2025-06-17	ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM	Yujun Wang et.al.	2506.14766	null	Kimi
603	2025-06-18	Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs	Ling Team et.al.	2506.14731	null	Kimi
604	2025-06-17	GuiLoMo: Allocating Expert Number and Rank for LoRA-MoE via Bilevel Optimization with GuidedSelection Vectors	Hengyuan Zhang et.al.	2506.14646	link	Kimi
605	2025-06-17	Revisiting Chain-of-Thought Prompting: Zero-shot Can Be Stronger than Few-shot	Xiang Cheng et.al.	2506.14641	null	Kimi
606	2025-06-18	AIn’t Nothing But a Survey? Using Large Language Models for Coding German Open-Ended Survey Responses on Survey Motivation	Leah von der Heyde et.al.	2506.14634	null	Kimi
607	2025-06-18	Probabilistic Aggregation and Targeted Embedding Optimization for Collective Moral Reasoning in Large Language Models	Chenchen Yuan et.al.	2506.14625	link	Kimi
608	2025-06-16	Steering LLM Thinking with Budget Guidance	Junyan Li et.al.	2506.13752	link	Kimi
609	2025-06-16	Evaluating Large Language Models for Phishing Detection, Self-Consistency, Faithfulness, and Explainability	Shova Kuikel et.al.	2506.13746	link	Kimi
610	2025-06-16	Instruction Following by Boosting Attention of Large Language Models	Vitoria Guardieiro et.al.	2506.13734	null	Kimi
611	2025-06-16	Balancing Knowledge Delivery and Emotional Comfort in Healthcare Conversational Systems	Shang-Chi Tsai et.al.	2506.13692	null	Kimi
612	2025-06-16	Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model	Shaolei Zhang et.al.	2506.13642	link	Kimi
613	2025-06-16	An Empirical Study of LLM-as-a-Judge: How Design Choices Impact Evaluation Reliability	Yusuke Yamauchi et.al.	2506.13639	null	Kimi
614	2025-06-16	CAMS: A CityGPT-Powered Agentic Framework for Urban Human Mobility Simulation	Yuwei Du et.al.	2506.13599	null	Kimi
615	2025-06-16	Qwen vs. Gemma Integration with Whisper: A Comparative Study in Multilingual SpeechLLM Systems	Tuan Nguyen et.al.	2506.13596	null	Kimi
616	2025-06-16	MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention	MiniMax et.al.	2506.13585	link	Kimi
617	2025-06-16	Flexible-length Text Infilling for Discrete Diffusion Models	Andrew Zhang et.al.	2506.13579	null	Kimi
618	2025-06-16	Mixture of Weight-shared Heterogeneous Group Attention Experts for Dynamic Token-wise KV Optimization	Guanghui Song et.al.	2506.13541	null	Kimi
619	2025-06-16	ROSAQ: Rotation-based Saliency-Aware Weight Quantization for Efficiently Compressing Large Language Models	Junho Yoon et.al.	2506.13472	null	Kimi
620	2025-06-16	Unveiling the Learning Mind of Language Models: A Cognitive Framework and Empirical Study	Zhengyu Hu et.al.	2506.13464	null	Kimi
621	2025-06-16	StoryBench: A Dynamic Benchmark for Evaluating Long-Term Memory with Multi Turns	Luanbo Wan et.al.	2506.13356	null	Kimi
622	2025-06-16	Direct Reasoning Optimization: LLMs Can Reward And Refine Their Own Reasoning for Open-Ended Tasks	Yifei Xu et.al.	2506.13351	null	Kimi
623	2025-06-16	SeqPE: Transformer with Sequential Position Encoding	Huyang Li et.al.	2506.13277	link	Kimi
624	2025-06-16	IGD: Token Decisiveness Modeling via Information Gain in LLMs for Personalized Recommendation	Zijie Lin et.al.	2506.13229	link	Kimi
625	2025-06-16	Thought Crime: Backdoors and Emergent Misalignment in Reasoning Models	James Chua et.al.	2506.13206	null	Kimi
626	2025-06-16	Breaking Thought Patterns: A Multi-Dimensional Reasoning Framework for LLMs	Xintong Tang et.al.	2506.13192	null	Kimi
627	2025-06-16	Adapting LLMs for Minimal-edit Grammatical Error Correction	Ryszard Staruch et.al.	2506.13148	null	Kimi
628	2025-06-16	ZINA: Multimodal Fine-grained Hallucination Detection and Editing	Yuiga Wada et.al.	2506.13130	null	Kimi
629	2025-06-16	Rethinking Test-Time Scaling for Medical AI: Model and Task-Aware Strategies for LLMs and VLMs	Gyutaek Oh et.al.	2506.13102	null	Kimi
630	2025-06-16	Discerning What Matters: A Multi-Dimensional Assessment of Moral Competence in LLMs	Daniel Kilov et.al.	2506.13082	null	Kimi
631	2025-06-16	MotiveBench: How Far Are We From Human-Like Motivational Reasoning in Large Language Models?	Xixian Yong et.al.	2506.13065	null	Kimi
632	2025-06-16	Multipole Attention for Efficient Long Context Reasoning	Coleman Hooper et.al.	2506.13059	null	Kimi
633	2025-06-16	Metis-RISE: RL Incentivizes and SFT Enhances Multimodal Reasoning Model Learning	Haibo Qiu et.al.	2506.13056	null	Kimi
634	2025-06-16	Just Go Parallel: Improving the Multilingual Capabilities of Large Language Models	Muhammad Reza Qorib et.al.	2506.13044	null	Kimi
635	2025-06-15	Reasoning Model Unlearning: Forgetting Traces, Not Just Answers, While Preserving Reasoning Skills	Changsheng Wang et.al.	2506.12963	null	Kimi
636	2025-06-15	HypER: Literature-grounded Hypothesis Generation and Distillation with Provenance	Rosni Vasu et.al.	2506.12937	null	Kimi
637	2025-06-15	Scaling Test-time Compute for LLM Agents	King Zhu et.al.	2506.12928	null	Kimi
638	2025-06-12	Fine-Grained Perturbation Guidance via Attention Head Selection	Donghoon Ahn et.al.	2506.10978	null	Kimi
639	2025-06-12	AutoMind: Adaptive Knowledgeable Agent for Automated Data Science	Yixin Ou et.al.	2506.10974	link	Kimi
640	2025-06-12	Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs	Qizhe Zhang et.al.	2506.10967	link	Kimi
641	2025-06-12	MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning	Yuxuan Luo et.al.	2506.10963	null	Kimi
642	2025-06-12	SpectralAR: Spectral Autoregressive Visual Generation	Yuanhui Huang et.al.	2506.10962	null	Kimi
643	2025-06-12	ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark	Kangwei Liu et.al.	2506.10960	link	Kimi
644	2025-06-12	ReGuidance: A Simple Diffusion Wrapper for Boosting Sample Quality on Hard Inverse Problems	Aayush Karan et.al.	2506.10955	null	Kimi
645	2025-06-12	Build the web for agents, not agents for the web	Xing Han Lù et.al.	2506.10953	null	Kimi
646	2025-06-12	Spurious Rewards: Rethinking Training Signals in RLVR	Rulin Shao et.al.	2506.10947	link	Kimi
647	2025-06-12	GUARD: Guided Unlearning and Retention via Data Attribution for Large Language Models	Evelyn Ma et.al.	2506.10946	null	Kimi
648	2025-06-12	VINCIE: Unlocking In-context Image Editing from Video	Leigang Qu et.al.	2506.10941	null	Kimi
649	2025-06-12	Dynamic Epistemic Friction in Dialogue	Timothy Obiso et.al.	2506.10934	null	Kimi
650	2025-06-12	The Role of Generative AI in Facilitating Social Interactions: A Scoping Review	T. T. J. E. Arets et.al.	2506.10927	null	Kimi
651	2025-06-12	Robustly Improving LLM Fairness in Realistic Settings via Interpretability	Adam Karvonen et.al.	2506.10922	link	Kimi
652	2025-06-12	Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization	Or Shafran et.al.	2506.10920	link	Kimi
653	2025-06-12	M4V: Multi-Modal Mamba for Text-to-Video Generation	Jiancheng Huang et.al.	2506.10915	null	Kimi
654	2025-06-12	Breaking Bad Molecules: Are MLLMs Ready for Structure-Level Molecular Detoxification?	Fei Lin et.al.	2506.10912	null	Kimi
655	2025-06-12	Beyond Gold Standards: Epistemic Ensemble of LLM Judges for Formal Mathematical Reasoning	Lan Zhang et.al.	2506.10903	null	Kimi
656	2025-06-12	BioClinical ModernBERT: A State-of-the-Art Long-Context Encoder for Biomedical and Clinical NLP	Thomas Sounack et.al.	2506.10896	link	Kimi
657	2025-06-12	AIR: Zero-shot Generative Model Adaptation with Iterative Refinement	Guimeng Liu et.al.	2506.10895	link	Kimi
658	2025-06-12	Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers	Yixiao Huang et.al.	2506.10887	null	Kimi
659	2025-06-12	Slimming Down LLMs Without Losing Their Minds	Qingda et.al.	2506.10885	null	Kimi
660	2025-06-12	VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos	Jiashuo Yu et.al.	2506.10857	null	Kimi
661	2025-06-12	A Study on Individual Spatiotemporal Activity Generation Method Using MCP-Enhanced Chain-of-Thought Large Language Models	Yu Zhang et.al.	2506.10853	link	Kimi
662	2025-06-12	Accelerating Diffusion Large Language Models with SlowFast: The Three Golden Principles	Qingyan Wei et.al.	2506.10848	link	Kimi
663	2025-06-12	CIIR@LiveRAG 2025: Optimizing Multi-Agent Retrieval Augmented Generation through Self-Training	Alireza Salemi et.al.	2506.10844	link	Kimi
664	2025-06-12	Generalist Models in Medical Image Segmentation: A Survey and Performance Comparison with Task-Specific Approaches	Andrea Moglia et.al.	2506.10825	null	Kimi
665	2025-06-12	ReCUT: Balancing Reasoning Length and Accuracy in LLMs via Stepwise Trails and Preference Optimization	Zhensheng Jin et.al.	2506.10822	link	Kimi
666	2025-06-12	VideoDeepResearch: Long Video Understanding With Agentic Tool Using	Huaying Yuan et.al.	2506.10821	link	Kimi
667	2025-06-12	Prompts to Summaries: Zero-Shot Language-Guided Video Summarization	Mario Barbara et.al.	2506.10807	null	Kimi
668	2025-06-12	PREMISE: Scalable and Strategic Prompt Optimization for Efficient Mathematical Reasoning in Large Models	Ye Yu et.al.	2506.10716	null	Kimi
669	2025-06-12	Large Language Models for Detection of Life-Threatening Texts	Thanh Thi Nguyen et.al.	2506.10687	null	Kimi
670	2025-06-12	TeleMath: A Benchmark for Large Language Models in Telecom Mathematical Problem Solving	Vincenzo Colle et.al.	2506.10674	null	Kimi
671	2025-06-12	Data Shifts Hurt CoT: A Theoretical Study	Lang Yin et.al.	2506.10647	null	Kimi
672	2025-06-12	Spelling-out is not Straightforward: LLMs’ Capability of Tokenization from Token to Characters	Tatsuya Hiraoka et.al.	2506.10641	null	Kimi
673	2025-06-12	NeuralNexus at BEA 2025 Shared Task: Retrieval-Augmented Prompting for Mistake Identification in AI Tutors	Numaan Naeem et.al.	2506.10627	link	Kimi
674	2025-06-12	Primender Sequence: A Novel Mathematical Construct for Testing Symbolic Inference and AI Reasoning	Mohd Anwar Jamal Faiz et.al.	2506.10585	null	Kimi
675	2025-06-12	LogiPlan: A Structured Benchmark for Logical Planning and Relational Reasoning in LLMs	Yanan Cai et.al.	2506.10527	null	Kimi
676	2025-06-12	Reliable Reasoning Path: Distilling Effective Guidance for LLM Reasoning with Knowledge Graphs	Yilin Xiao et.al.	2506.10508	null	Kimi
677	2025-06-12	Beyond Single-User Dialogue: Assessing Multi-User Dialogue State Tracking Capabilities of Large Language Models	Sangmin Song et.al.	2506.10504	null	Kimi
678	2025-06-12	TD-Pipe: Temporally-Disaggregated Pipeline Parallelism Architecture for High-Throughput LLM Inference	Hongbin Zhang et.al.	2506.10470	null	Kimi
679	2025-06-12	Specification and Evaluation of Multi-Agent LLM Systems – Prototype and Cybersecurity Applications	Felix Härer et.al.	2506.10467	link	Kimi
680	2025-06-12	MedSeg-R: Reasoning Segmentation in Medical Images with Multimodal Large Language Models	Yu Huang et.al.	2506.10465	null	Kimi
681	2025-06-12	Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts	Zaijing Li et.al.	2506.10357	null	Kimi
682	2025-06-12	Code Execution as Grounded Supervision for LLM Reasoning	Dongwon Jung et.al.	2506.10343	link	Kimi
683	2025-06-12	Discrete Audio Tokens: More Than a Survey!	Pooneh Mousavi et.al.	2506.10274	null	Kimi
684	2025-06-11	Disclosure Audits for LLM Agents	Saswat Das et.al.	2506.10171	null	Kimi
685	2025-06-11	Can LLMs Generate Good Stories? Insights and Challenges from a Narrative Planning Perspective	Yi Wang et.al.	2506.10161	null	Kimi
686	2025-06-11	When Meaning Stays the Same, but Models Drift: Evaluating Quality of Service under Token-Level Behavioral Instability in LLMs	Xiao Li et.al.	2506.10095	link	Kimi
687	2025-06-11	From Judgment to Interference: Early Stopping LLM Harmful Outputs via Streaming Content Monitoring	Yang Li et.al.	2506.09996	null	Kimi
688	2025-06-11	PersonaLens: A Benchmark for Personalization Evaluation in Conversational AI Assistants	Zheng Zhao et.al.	2506.09902	link	Kimi
689	2025-06-11	Attention Head Embeddings with Trainable Deep Kernels for Hallucination Detection in LLMs	Rodion Oblovatny et.al.	2506.09886	null	Kimi
690	2025-06-11	Causal Sufficiency and Necessity Improves Chain-of-Thought Reasoning	Xiangning Yu et.al.	2506.09853	null	Kimi
691	2025-06-11	Dataset of News Articles with Provenance Metadata for Media Relevance Assessment	Tomas Peterka et.al.	2506.09847	null	Kimi
692	2025-06-11	OctoNav: Towards Generalist Embodied Navigation	Chen Gao et.al.	2506.09839	null	Kimi
693	2025-06-11	CoRT: Code-integrated Reasoning within Thinking	Chengpeng Li et.al.	2506.09820	link	Kimi
694	2025-06-11	Intelligent Design 4.0: Paradigm Evolution Toward the Agentic AI Era	Shuo Jiang et.al.	2506.09755	null	Kimi
695	2025-06-11	Intent Factored Generation: Unleashing the Diversity in Your Language Model	Eltayeb Ahmed et.al.	2506.09659	null	Kimi
696	2025-06-11	DGAE: Diffusion-Guided Autoencoder for Efficient Latent Representation Learning	Dongxu Liu et.al.	2506.09644	null	Kimi
697	2025-06-11	From Symbolic to Neural and Back: Exploring Knowledge Graph-Large Language Model Synergies	Blaž Škrlj et.al.	2506.09566	null	Kimi
698	2025-06-11	Understanding the Performance and Power of LLM Inferencing on Edge Accelerators	Mayank Arya et.al.	2506.09554	null	Kimi
699	2025-06-11	Athena: Enhancing Multimodal Reasoning with Data-efficient Process Reward Models	Shuai Wang et.al.	2506.09532	null	Kimi
700	2025-06-11	Revisit What You See: Disclose Language Prior in Vision Tokens for Efficient Guided Decoding of LVLMs	Beomsik Cho et.al.	2506.09522	link	Kimi
701	2025-06-11	ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning	Yu Sun et.al.	2506.09513	link	Kimi
702	2025-06-11	Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning	Jiayi Yuan et.al.	2506.09501	null	Kimi
703	2025-06-11	Token Constraint Decoding Improves Robustness on Question Answering for Large Language Models	Jui-Ming Yao et.al.	2506.09408	null	Kimi
704	2025-06-11	SLED: A Speculative LLM Decoding Framework for Efficient Edge Serving	Xiangchen Li et.al.	2506.09397	null	Kimi
705	2025-06-11	DIVE into MoE: Diversity-Enhanced Reconstruction of Large Language Models from Dense into Mixture-of-Experts	Yuchen Feng et.al.	2506.09351	null	Kimi
706	2025-06-11	Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation	Shanchuan Lin et.al.	2506.09350	null	Kimi
707	2025-06-11	Ming-Omni: A Unified Multimodal Model for Perception and Generation	Inclusion AI et.al.	2506.09344	link	Kimi
708	2025-06-11	Latent Multi-Head Attention for Small Language Models	Sushant Mehta et.al.	2506.09342	null	Kimi
709	2025-06-11	Multi-Agent Language Models: Advancing Cooperation, Coordination, and Adaptation	Arjun Vaithilingam Sudhakar et.al.	2506.09331	null	Kimi
710	2025-06-10	Improving LLM Agent Planning with In-Context Learning via Atomic Fact Augmentation and Lookahead Search	Samuel Holt et.al.	2506.09171	null	Kimi
711	2025-06-10	VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning	Li Kang et.al.	2506.09049	null	Kimi
712	2025-06-10	Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better	Dianyi Wang et.al.	2506.09040	link	Kimi
713	2025-06-10	Can A Gamer Train A Mathematical Reasoning Model?	Andrew Shin et.al.	2506.08935	link	Kimi
714	2025-06-10	Socratic-MCTS: Test-Time Visual Reasoning by Asking the Right Questions	David Acuna et.al.	2506.08927	null	Kimi
715	2025-06-10	PropMEND: Hypernetworks for Knowledge Propagation in LLMs	Zeyu Leo Liu et.al.	2506.08920	link	Kimi
716	2025-06-10	From Legal Texts to Defeasible Deontic Logic via LLMs: A Study in Automated Semantic Analysis	Elias Horner et.al.	2506.08899	null	Kimi
717	2025-06-10	The impact of fine tuning in LLaMA on hallucinations for named entity extraction in legal documentation	Francisco Vargas et.al.	2506.08827	null	Kimi
718	2025-06-10	Measuring Data Science Automation: A Survey of Evaluation Tools for AI Assistants and Agents	Irene Testini et.al.	2506.08800	null	Kimi
719	2025-06-10	AraReasoner: Evaluating Reasoning-Based LLMs for Arabic NLP	Ahmed Hasanaath et.al.	2506.08768	null	Kimi
720	2025-06-10	Improved LLM Agents for Financial Document Question Answering	Nelvin Tan et.al.	2506.08726	null	Kimi
721	2025-06-10	ConfPO: Exploiting Policy Model Confidence for Critical Token Selection in Large Language Model Preference Optimization	Hee Suk Yoon et.al.	2506.08712	null	Kimi
722	2025-06-10	Efficient Post-Training Refinement of Latent Reasoning in Large Language Models	Xinyuan Wang et.al.	2506.08552	null	Kimi
723	2025-06-10	DRAGged into Conflicts: Detecting and Addressing Conflicting Sources in Search-Augmented LLMs	Arie Cattan et.al.	2506.08500	link	Kimi
724	2025-06-10	Fairness is Not Silence: Unmasking Vacuous Neutrality in Small Language Models	Sumanth Manduru et.al.	2506.08487	null	Kimi
725	2025-06-10	Efficient Context Selection for Long-Context QA: No Tuning, No Iteration, Just Adaptive- $k$	Chihiro Taguchi et.al.	2506.08479	null	Kimi
726	2025-06-10	A Survey on Large Language Models for Mathematical Reasoning	Peng-Yuan Wang et.al.	2506.08446	null	Kimi
727	2025-06-10	Low-resource domain adaptation while minimizing energy and hardware resource consumption	Hernán Maina et.al.	2506.08433	null	Kimi
728	2025-06-10	TACTIC: Translation Agents with Cognitive-Theoretic Interactive Collaboration	Weiya Li et.al.	2506.08403	link	Kimi
729	2025-06-10	Reinforce LLM Reasoning through Multi-Agent Reflection	Yurun Yuan et.al.	2506.08379	null	Kimi
730	2025-06-10	Draft-based Approximate Inference for LLMs	Kevin Galim et.al.	2506.08373	link	Kimi
731	2025-06-10	Mitigating Posterior Salience Attenuation in Long-Context LLMs with Positional Contrastive Decoding	Zikai Xiao et.al.	2506.08371	null	Kimi
732	2025-06-10	DEAL: Disentangling Transformer Head Activations for LLM Steering	Li-Ming Zhan et.al.	2506.08359	null	Kimi
733	2025-06-10	Evaluating LLMs Across Multi-Cognitive Levels: From Medical Knowledge Mastery to Scenario-Based Problem Solving	Yuxuan Zhou et.al.	2506.08349	link	Kimi
734	2025-06-09	A Comprehensive Study of Decoder-Only LLMs for Text-to-Image Generation	Andrew Z. Wang et.al.	2506.08210	null	Kimi
735	2025-06-09	LLM-BT: Back-Translation as a Framework for Terminology Standardization and Dynamic Semantic Embedding	Li Weigang et.al.	2506.08174	null	Kimi
736	2025-06-09	Multilingual Hate Speech Detection in Social Media Using Translation-Based Approaches with Large Language Models	Muhammad Usman et.al.	2506.08147	null	Kimi
737	2025-06-09	HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization	Hongzheng Chen et.al.	2506.07972	link	Kimi
738	2025-06-09	Reinforcing Multimodal Understanding and Generation with Dual Self-rewards	Jixiang Hong et.al.	2506.07963	null	Kimi
739	2025-06-09	Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations	Yizhen Li et.al.	2506.07943	null	Kimi
740	2025-06-09	Mimicking or Reasoning: Rethinking Multi-Modal In-Context Learning in Vision-Language Models	Chengyue Huang et.al.	2506.07936	null	Kimi
741	2025-06-09	Solving Inequality Proofs with Large Language Models	Jiayi Sheng et.al.	2506.07927	link	Kimi
742	2025-06-09	LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement	Dimitris Panagopoulos et.al.	2506.07915	null	Kimi
743	2025-06-09	MiniCPM4: Ultra-Efficient LLMs on End Devices	MiniCPM Team et.al.	2506.07900	link	Kimi
744	2025-06-09	Evaluating Large Language Models on the Frame and Symbol Grounding Problems: A Zero-shot Benchmark	Shoko Oka et.al.	2506.07896	link	Kimi
745	2025-06-09	Learning to Focus: Causal Attention Distillation via Gradient-Guided Token Pruning	Yiju Guo et.al.	2506.07851	null	Kimi
746	2025-06-09	Improving large language models with concept-aware fine-tuning	Michael K. Chen et.al.	2506.07833	link	Kimi
747	2025-06-09	Addition in Four Movements: Mapping Layer-wise Information Trajectories in LLMs	Yao Yan et.al.	2506.07824	null	Kimi
748	2025-06-09	Augmenting LLMs’ Reasoning by Reinforcing Abstract Thinking	Silin Gao et.al.	2506.07751	null	Kimi
749	2025-06-09	Beyond the Sentence: A Survey on Context-Aware Machine Translation with Large Language Models	Ramakrishna Appicharla et.al.	2506.07583	null	Kimi
750	2025-06-09	SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems	Peiran Li et.al.	2506.07564	null	Kimi
751	2025-06-09	SELT: Self-Evaluation Tree Search for LLMs with Task Decomposition	Mengsong Wu et.al.	2506.07557	null	Kimi
752	2025-06-09	MoQAE: Mixed-Precision Quantization for Long-Context LLM Inference via Mixture of Quantization-Aware Experts	Wei Tao et.al.	2506.07533	null	Kimi
753	2025-06-09	LeVo: High-Quality Song Generation with Multi-Preference Alignment	Shun Lei et.al.	2506.07520	link	Kimi
754	2025-06-09	Graph-of-Causal Evolution: Challenging Chain-of-Model for Reasoning	Libo Wang et.al.	2506.07501	null	Kimi
755	2025-06-09	CCI4.0: A Bilingual Pretraining Dataset for Enhancing Reasoning in Large Language Models	Guang Liu et.al.	2506.07463	null	Kimi
756	2025-06-09	Prompt to Protection: A Comparative Study of Multimodal LLMs in Construction Hazard Recognition	Nishi Chaudhary et.al.	2506.07436	null	Kimi
757	2025-06-09	Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding	Feifan Song et.al.	2506.07434	link	Kimi
758	2025-06-09	Evaluating Visual Mathematics in Multimodal LLMs: A Multilingual Benchmark Based on the Kangaroo Tests	Arnau Igualde Sáez et.al.	2506.07418	null	Kimi
759	2025-06-09	MedChat: A Multi-Agent Framework for Multimodal Diagnosis with Large Language Models	Philip Liu et.al.	2506.07400	link	Kimi
760	2025-06-09	Improving LLM Reasoning through Interpretable Role-Playing Steering	Anyi Wang et.al.	2506.07335	null	Kimi
761	2025-06-09	JavelinGuard: Low-Cost Transformer Architectures for LLM Security	Yash Datta et.al.	2506.07330	null	Kimi
762	2025-06-08	Reward Model Interpretability via Optimal and Pessimal Tokens	Brian Christian et.al.	2506.07326	null	Kimi
763	2025-06-08	Paged Attention Meets FlexAttention: Unlocking Long-Context Efficiency in Deployed Inference	Thomas Joshi et.al.	2506.07311	null	Kimi
764	2025-06-08	Tokenized Bandit for LLM Decoding and Alignment	Suho Shin et.al.	2506.07276	null	Kimi
765	2025-06-08	Learn as Individuals, Evolve as a Team: Multi-agent LLMs Adaptation in Embodied Environments	Xinran Li et.al.	2506.07232	null	Kimi
766	2025-06-08	Advancing Multimodal Reasoning Capabilities of Multimodal Large Language Models via Visual Perception Reward	Tong Xiao et.al.	2506.07218	null	Kimi
767	2025-06-05	Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets	Lei Hsiung et.al.	2506.05346	null	Kimi
768	2025-06-05	Inference-Time Hyper-Scaling with KV Cache Compression	Adrian Łańcucki et.al.	2506.05345	null	Kimi
769	2025-06-05	SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs	Jiahui Wang et.al.	2506.05344	link	Kimi
770	2025-06-05	Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning	Xingjian Ran et.al.	2506.05341	null	Kimi
771	2025-06-05	VideoMolmo: Spatio-Temporal Grounding Meets Pointing	Ghazi Shazan Ahmad et.al.	2506.05336	link	Kimi
772	2025-06-05	Kinetics: Rethinking Test-Time Scaling Laws	Ranajoy Sadhukhan et.al.	2506.05333	link	Kimi
773	2025-06-05	Unleashing Hour-Scale Video Training for Long Video-Language Understanding	Jingyang Lin et.al.	2506.05332	null	Kimi
774	2025-06-05	MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning	Xinyan Chen et.al.	2506.05331	link	Kimi
775	2025-06-05	Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay	Yifan Sun et.al.	2506.05316	null	Kimi
776	2025-06-05	Constrained Entropic Unlearning: A Primal-Dual Framework for Large Language Models	Taha Entesari et.al.	2506.05314	null	Kimi
777	2025-06-05	Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games	Niv Eckhaus et.al.	2506.05309	link	Kimi
778	2025-06-05	ProRefine: Inference-time Prompt Refinement with Textual Feedback	Deepak Pandita et.al.	2506.05305	null	Kimi
779	2025-06-05	Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos	Weifeng Lin et.al.	2506.05302	null	Kimi
780	2025-06-05	Sample Complexity and Representation Ability of Test-time Scaling Paradigms	Baihe Huang et.al.	2506.05295	null	Kimi
781	2025-06-05	AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model	Pingyu Wu et.al.	2506.05289	link	Kimi
782	2025-06-05	Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-Reasoning	Nan Huo et.al.	2506.05278	null	Kimi
783	2025-06-05	Teaming in the AI Era: AI-Augmented Frameworks for Forming, Simulating, and Optimizing Human Teams	Mohammed Almutairi et.al.	2506.05265	null	Kimi
784	2025-06-05	CLATTER: Comprehensive Entailment Reasoning for Hallucination Detection	Ron Eliav et.al.	2506.05243	null	Kimi
785	2025-06-05	MesaNet: Sequence Modeling by Locally Optimal Test-Time Training	Johannes von Oswald et.al.	2506.05233	null	Kimi
786	2025-06-05	Diagonal Batching Unlocks Parallelism in Recurrent Memory Transformers for Long Contexts	Danil Sivtsov et.al.	2506.05229	link	Kimi
787	2025-06-05	LLM-First Search: Self-Guided Exploration of the Solution Space	Nathan Herr et.al.	2506.05213	link	Kimi
788	2025-06-05	The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text	Nikhil Kandpal et.al.	2506.05209	null	Kimi
789	2025-06-05	RELIC: Evaluating Compositional Instruction Following via Language Recognition	Jackson Petty et.al.	2506.05205	null	Kimi
790	2025-06-05	Counterfactual reasoning: an analysis of in-context emergence	Moritz Miller et.al.	2506.05188	link	Kimi
791	2025-06-05	TreeRPO: Tree Relative Policy Optimization	Zhicheng Yang et.al.	2506.05183	null	Kimi
792	2025-06-05	ECoRAG: Evidentiality-guided Compression for Long Context RAG	Yeonseok Jeong et.al.	2506.05167	link	Kimi
793	2025-06-05	Dissecting Bias in LLMs: A Mechanistic Interpretability Perspective	Bhavik Chandna et.al.	2506.05166	null	Kimi
794	2025-06-05	Knowledgeable-r1: Policy Optimization for Knowledge Exploration in Retrieval-Augmented Generation	Chenyu Lin et.al.	2506.05154	null	Kimi
795	2025-06-05	Do Large Language Models Judge Error Severity Like Humans?	Diege Sun et.al.	2506.05142	null	Kimi
796	2025-06-05	AudioLens: A Closer Look at Auditory Attribute Perception of Large Audio-Language Models	Chih-Kai Yang et.al.	2506.05140	null	Kimi
797	2025-06-05	DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning	Tanmay Parekh et.al.	2506.05128	null	Kimi
798	2025-06-05	TALL – A Trainable Architecture for Enhancing LLM Performance in Low-Resource Languages	Moshe Ofer et.al.	2506.05057	null	Kimi
799	2025-06-05	Controlling Summarization Length Through EOS Token Weighting	Zeno Belligoli et.al.	2506.05017	null	Kimi
800	2025-06-05	When Thinking LLMs Lie: Unveiling the Strategic Deception in Representations of Reasoning Models	Kai Wang et.al.	2506.04909	null	Kimi
801	2025-06-05	Verbose ListOps (VLO): Beyond Long Context – Unmasking LLM’s Reasoning Blind Spots	Alex Pan et.al.	2506.04907	null	Kimi
802	2025-06-05	Multiple-Choice Question Generation Using Large Language Models: Methodology and Educator Insights	Giorgio Biancini et.al.	2506.04851	null	Kimi
803	2025-06-05	Dissecting Logical Reasoning in LLMs: A Fine-Grained Evaluation and Supervision Study	Yujun Zhou et.al.	2506.04810	link	Kimi
804	2025-06-05	Accelerated Test-Time Scaling with Model-Free Speculative Sampling	Woomin Song et.al.	2506.04708	null	Kimi
805	2025-06-05	MMRefine: Unveiling the Obstacles to Robust Refinement in Multimodal Large Language Models	Gio Paik et.al.	2506.04688	null	Kimi
806	2025-06-05	TaDA: Training-free recipe for Decoding with Adaptive KV Cache Compression and Mean-centering	Vinay Joshi et.al.	2506.04642	null	Kimi
807	2025-06-05	Advancing Tool-Augmented Large Language Models via Meta-Verification and Reflection Learning	Zhiyuan Ma et.al.	2506.04625	null	Kimi
808	2025-06-05	Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification	Chengwu Liu et.al.	2506.04592	null	Kimi
809	2025-06-05	Reasoning or Overthinking: Evaluating Large Language Models on Financial Sentiment Analysis	Dimitris Vamvourellis et.al.	2506.04574	null	Kimi
810	2025-06-04	Towards Efficient Speech-Text Jointly Decoding within One Speech Language Model	Haibin Wu et.al.	2506.04518	null	Kimi
811	2025-06-04	MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale	Ran Xu et.al.	2506.04405	null	Kimi
812	2025-06-04	ReXVQA: A Large-scale Visual Question Answering Benchmark for Generalist Chest X-ray Understanding	Ankit Pal et.al.	2506.04353	null	Kimi
813	2025-06-04	GEM: Empowering LLM for both Embedding Generation and Language Understanding	Caojin Zhang et.al.	2506.04344	null	Kimi
814	2025-06-04	Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning	Shuang Chen et.al.	2506.04207	null	Kimi
815	2025-06-04	Cascadia: A Cascade Serving System for Large Language Models	Youhe Jiang et.al.	2506.04203	null	Kimi
816	2025-06-04	TracLLM: A Generic Framework for Attributing Long Context LLMs	Yanting Wang et.al.	2506.04202	link	Kimi
817	2025-06-05	Rectified Sparse Attention	Yutao Sun et.al.	2506.04108	null	Kimi
818	2025-06-04	Multimodal Tabular Reasoning with Privileged Structured Information	Jun-Peng Jiang et.al.	2506.04088	null	Kimi
819	2025-06-04	LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation	Ming Zhang et.al.	2506.04078	link	Kimi
820	2025-06-04	Explainability-Based Token Replacement on LLM-Generated Text	Hadi Mohammadi et.al.	2506.04050	null	Kimi
821	2025-06-04	Mitigating Hallucinations in Large Vision-Language Models via Entity-Centric Multimodal Preference Optimization	Jiulong Wu et.al.	2506.04039	null	Kimi
822	2025-06-04	AgentMisalignment: Measuring the Propensity for Misaligned Behaviour in LLM-Based Agents	Akshat Naik et.al.	2506.04018	null	Kimi
823	2025-06-04	Graph Counselor: Adaptive Graph Exploration via Multi-Agent Synergy to Enhance LLM Reasoning	Junqi Gao et.al.	2506.03939	link	Kimi
824	2025-06-04	Vision Remember: Alleviating Visual Forgetting in Efficient MLLM with Vision Feature Resample	Ze Feng et.al.	2506.03928	null	Kimi
825	2025-06-04	RadialRouter: Structured Representation for Efficient and Robust Large Language Models Routing	Ruihan Jin et.al.	2506.03880	null	Kimi
826	2025-06-04	Knockout LLM Assessment: Using Large Language Models for Evaluations through Iterative Pairwise Comparisons	Isik Baran Sandan et.al.	2506.03785	null	Kimi
827	2025-06-04	ClozeMath: Improving Mathematical Reasoning in Language Models by Learning to Fill Equations	Quang Hieu Pham et.al.	2506.03763	null	Kimi
828	2025-06-04	AhaKV: Adaptive Holistic Attention-Driven KV Cache Eviction for Efficient Inference of Large Language Models	Yifeng Gu et.al.	2506.03762	null	Kimi
829	2025-06-04	Verbalized Confidence Triggers Self-Verification: Emergent Behavior Without Explicit Reasoning Supervision	Chaeyun Jang et.al.	2506.03723	null	Kimi
830	2025-06-04	AdaDecode: Accelerating LLM Decoding with Adaptive Layer Parallelism	Zhepei Wei et.al.	2506.03700	link	Kimi
831	2025-06-04	Learning to Insert [PAUSE] Tokens for Better Reasoning	Eunki Kim et.al.	2506.03616	null	Kimi
832	2025-06-04	POSS: Position Specialist Generates Better Draft for Speculative Decoding	Langlin Huang et.al.	2506.03566	link	Kimi
833	2025-06-04	Video-Skill-CoT: Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning	Daeun Lee et.al.	2506.03525	null	Kimi
834	2025-06-04	EpiCoDe: Boosting Model Performance Beyond Training with Extrapolation and Contrastive Decoding	Mingxu Tao et.al.	2506.03489	null	Kimi
835	2025-06-03	Parallel CPU-GPU Execution for LLM Inference on Constrained GPUs	Jiakun Fan et.al.	2506.03296	null	Kimi
836	2025-06-03	Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem	Yubo Wang et.al.	2506.03295	null	Kimi
837	2025-06-03	FailureSensorIQ: A Multi-Choice QA Dataset for Understanding Sensor Relationships and Failure Modes	Christodoulos Constantinides et.al.	2506.03278	link	Kimi
838	2025-06-04	UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation	Bin Lin et.al.	2506.03147	null	Kimi
839	2025-06-03	GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents	Qianhui Wu et.al.	2506.03143	null	Kimi
840	2025-06-03	Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning	Yinjie Wang et.al.	2506.03136	link	Kimi
841	2025-06-03	OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models	Mengdi Jia et.al.	2506.03135	null	Kimi
842	2025-06-03	EgoVLM: Policy Optimization for Egocentric Video Understanding	Ashwin Vinod et.al.	2506.03097	link	Kimi
843	2025-06-03	Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective	Jintian Shao et.al.	2506.03038	null	Kimi
844	2025-06-03	Conditioning Large Language Models on Legal Systems? Detecting Punishable Hate Speech	Florian Ludwig et.al.	2506.03009	null	Kimi
845	2025-06-03	Adaptive Graph Pruning for Multi-Agent Communication	Boyi Li et.al.	2506.02951	null	Kimi
846	2025-06-03	Cell-o1: Training LLMs to Solve Single-Cell Reasoning Puzzles with Reinforcement Learning	Yin Fang et.al.	2506.02911	link	Kimi
847	2025-06-03	Scaling Fine-Grained MoE Beyond 50B Parameters: Empirical Evaluation and Practical Insights	Jakub Krajewski et.al.	2506.02890	null	Kimi
848	2025-06-03	CoT is Not True Reasoning, It Is Just a Tight Constraint to Imitate: A Theory Perspective	Jintian Shao et.al.	2506.02878	null	Kimi
849	2025-06-03	BNPO: Beta Normalization Policy Optimization	Changyi Xiao et.al.	2506.02864	null	Kimi
850	2025-06-03	METok: Multi-Stage Event-based Token Compression for Efficient Long Video Understanding	Mengyue Wang et.al.	2506.02850	link	Kimi
851	2025-06-03	RACE-Align: Retrieval-Augmented and Chain-of-Thought Enhanced Preference Alignment for Large Language Models	Qihang Yan et.al.	2506.02726	null	Kimi
852	2025-06-03	TL;DR: Too Long, Do Re-weighting for Effcient LLM Reasoning Compression	Zhong-Zhi Li et.al.	2506.02678	link	Kimi
853	2025-06-03	Truly Assessing Fluid Intelligence of Large Language Models through Dynamic Reasoning Evaluation	Yue Yang et.al.	2506.02648	null	Kimi
854	2025-06-03	KVCache Cache in the Wild: Characterizing and Optimizing KVCache Cache at a Large Cloud Provider	Jiahao Wang et.al.	2506.02634	link	Kimi
855	2025-06-03	Pruning General Large Language Models into Customized Expert Models	Yirao Zhao et.al.	2506.02561	null	Kimi
856	2025-06-03	Answer Convergence as a Signal for Early Stopping in Reasoning	Xin Liu et.al.	2506.02536	null	Kimi
857	2025-06-03	Minos: A Multimodal Evaluation Model for Bidirectional Generation Between Image and Text	Junzhe Zhang et.al.	2506.02494	null	Kimi
858	2025-06-03	MidPO: Dual Preference Optimization for Safety and Helpfulness in Large Language Models via a Mixture of Experts Framework	Yupeng Qi et.al.	2506.02460	null	Kimi
859	2025-06-03	Comparative Analysis of AI Agent Architectures for Entity Relationship Classification	Maryam Berijanian et.al.	2506.02426	link	Kimi
860	2025-06-03	Consultant Decoding: Yet Another Synergistic Mechanism	Chuanghao Ding et.al.	2506.02391	null	Kimi
861	2025-06-03	Univariate to Multivariate: LLMs as Zero-Shot Predictors for Time-Series Forecasting	Chamara Madarasingha et.al.	2506.02389	null	Kimi
862	2025-06-03	DIAMOND: An LLM-Driven Agent for Context-Aware Baseball Highlight Summarization	Jeonghun Kang et.al.	2506.02351	null	Kimi
863	2025-06-02	The Unified Cognitive Consciousness Theory for Language Models: Anchoring Semantics, Thresholds of Activation, and Emergent Reasoning	Edward Y. Chang et.al.	2506.02139	null	Kimi
864	2025-06-02	Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains	Juncheng Wu et.al.	2506.02126	null	Kimi
865	2025-06-02	Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning	Shenzhi Wang et.al.	2506.01939	null	Kimi
866	2025-06-02	Large language models can learn and generalize steganographic chain-of-thought under process supervision	Joey Skaf et.al.	2506.01926	null	Kimi
867	2025-06-02	MoDA: Modulation Adapter for Fine-Grained Visual Grounding in Instructional MLLMs	Wayner Barrios et.al.	2506.01850	null	Kimi
868	2025-06-02	Analysis of LLM Bias (Chinese Propaganda & Anti-US Sentiment) in DeepSeek-R1 vs. ChatGPT o3-mini-high	PeiHsuan Huang et.al.	2506.01814	null	Kimi
869	2025-05-29	From Chat Logs to Collective Insights: Aggregative Question Answering	Wentao Zhang et.al.	2505.23765	null	Kimi
870	2025-05-29	MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence	Sihan Yang et.al.	2505.23764	null	Kimi
871	2025-05-29	ZeroGUI: Automating Online GUI Learning at Zero Human Cost	Chenyu Yang et.al.	2505.23762	link	Kimi
872	2025-05-29	Puzzled by Puzzles: When Vision-Language Models Can’t Take a Hint	Heekyung Lee et.al.	2505.23759	link	Kimi
873	2025-05-29	DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning	Ziyin Zhang et.al.	2505.23754	link	Kimi
874	2025-05-29	ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks	Akashah Shabbir et.al.	2505.23752	link	Kimi
875	2025-05-29	Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence	Diankun Wu et.al.	2505.23747	null	Kimi
876	2025-05-29	ATLAS: Learning to Optimally Memorize the Context at Test Time	Ali Behrouz et.al.	2505.23735	null	Kimi
877	2025-05-29	Bounded Rationality for LLMs: Satisficing Alignment at Inference-Time	Mohamad Chehade et.al.	2505.23729	null	Kimi
878	2025-05-29	ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering	Zexi Liu et.al.	2505.23723	link	Kimi
879	2025-05-29	Label-Guided In-Context Learning for Named Entity Recognition	Fan Bai et.al.	2505.23722	link	Kimi
880	2025-05-29	From Connectivity to Autonomy: The Dawn of Self-Evolving Communication Systems	Zeinab Nezami et.al.	2505.23710	null	Kimi
881	2025-05-29	Can LLMs Reason Abstractly Over Math Word Problems Without CoT? Disentangling Abstract Formulation From Arithmetic Computation	Ziling Cheng et.al.	2505.23701	null	Kimi
882	2025-05-29	Data-to-Dashboard: Multi-Agent LLM Framework for Insightful Visualization in Enterprise Analytics	Ran Zhang et.al.	2505.23695	link	Kimi
883	2025-05-29	VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos	Tingyu Song et.al.	2505.23693	link	Kimi
884	2025-05-29	LoLA: Low-Rank Linear Attention With Sparse Caching	Luke McDermott et.al.	2505.23666	null	Kimi
885	2025-05-29	D-AR: Diffusion via Autoregressive Models	Ziteng Gao et.al.	2505.23660	link	Kimi
886	2025-05-29	Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation	Hongxiang Zhang et.al.	2505.23657	null	Kimi
887	2025-05-29	Are Reasoning Models More Prone to Hallucination?	Zijun Yao et.al.	2505.23646	null	Kimi
888	2025-05-29	AutoSchemaKG: Autonomous Knowledge Graph Construction through Dynamic Schema Induction from Web-Scale Corpora	Jiaxin Bai et.al.	2505.23628	link	Kimi
889	2025-05-29	Table-R1: Inference-Time Scaling for Table Reasoning	Zheyuan Yang et.al.	2505.23621	link	Kimi
890	2025-05-29	One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory	Chenhao Zheng et.al.	2505.23617	null	Kimi
891	2025-05-29	MAPLE: A Mobile Assistant with Persistent Finite State Machines for Recovery Reasoning	Linqiang Guo et.al.	2505.23596	null	Kimi
892	2025-05-29	Jigsaw-R1: A Study of Rule-based Visual Reinforcement Learning with Jigsaw Puzzles	Zifu Wang et.al.	2505.23590	link	Kimi
893	2025-05-29	CoT Red-Handed: Stress Testing Chain-of-Thought Monitoring	Benjamin Arnav et.al.	2505.23575	null	Kimi
894	2025-05-29	Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models	Yiran Guo et.al.	2505.23564	link	Kimi
895	2025-05-29	Qwen Look Again: Guiding Vision-Language Reasoning Models to Re-attention Visual Information	Xu Chu et.al.	2505.23558	link	Kimi
896	2025-05-29	Sustainable Carbon-Aware and Water-Efficient LLM Scheduling in Geo-Distributed Cloud Datacenters	Hayden Moore et.al.	2505.23554	null	Kimi
897	2025-05-29	Probability-Consistent Preference Optimization for Enhanced LLM Reasoning	Yunqiao Yang et.al.	2505.23540	link	Kimi
898	2025-05-29	CLaC at SemEval-2025 Task 6: A Multi-Architecture Approach for Corporate Environmental Promise Verification	Nawar Turk et.al.	2505.23538	null	Kimi
899	2025-05-29	Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation	Beiduo Chen et.al.	2505.23368	link	Kimi
900	2025-05-29	VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?	Yuanxin Liu et.al.	2505.23359	link	Kimi
901	2025-05-29	How Does Response Length Affect Long-Form Factuality	James Xu Zhao et.al.	2505.23295	link	Kimi
902	2025-05-29	Sentinel: Attention Probing of Proxy Models for LLM Context Compression with an Understanding Perspective	Yong Zhang et.al.	2505.23277	link	Kimi
903	2025-05-29	Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models	Zeyu Liu et.al.	2505.23091	null	Kimi
904	2025-05-29	From Token to Action: State Machine Reasoning to Mitigate Overthinking in Information Retrieval	Dohyeon Lee et.al.	2505.23059	link	Kimi
905	2025-05-28	NegVQA: Can Vision Language Models Understand Negation?	Yuhui Zhang et.al.	2505.22946	null	Kimi
906	2025-05-28	Can LLMs Deceive CLIP? Benchmarking Adversarial Compositionality of Pre-trained Multimodal Representation via Text Updates	Jaewoo Ahn et.al.	2505.22943	null	Kimi
907	2025-05-28	WorkForceAgent-R1: Incentivizing Reasoning Capability in LLM-based Web Agents via Reinforcement Learning	Yuchen Zhuang et.al.	2505.22942	null	Kimi
908	2025-05-28	Can Large Language Models Match the Conclusions of Systematic Reviews?	Christopher Polzak et.al.	2505.22787	link	Kimi
909	2025-05-28	Pre-Training Curriculum for Multi-Token Prediction in Language Models	Ansar Aynetdinov et.al.	2505.22757	link	Kimi
910	2025-05-28	Zero-Shot Vision Encoder Grafting via LLM Surrogates	Kaiyu Yue et.al.	2505.22664	link	Kimi
911	2025-05-28	AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models	Feng Luo et.al.	2505.22662	null	Kimi
912	2025-05-28	3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model	Wenbo Hu et.al.	2505.22657	null	Kimi
913	2025-05-28	VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models	Ce Zhang et.al.	2505.22654	null	Kimi
914	2025-05-28	Learning Composable Chains-of-Thought	Fangcong Yin et.al.	2505.22635	null	Kimi
915	2025-05-28	Spatial Knowledge Graph-Guided Multimodal Synthesis	Yida Xue et.al.	2505.22633	null	Kimi
916	2025-05-28	Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding	Chengyue Wu et.al.	2505.22618	null	Kimi
917	2025-05-28	RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction	Yuchi Wang et.al.	2505.22613	null	Kimi
918	2025-05-28	Less, but Better: Efficient Multilingual Expansion for LLMs via Layer-wise Mixture-of-Experts	Xue Zhang et.al.	2505.22582	null	Kimi
919	2025-05-29	Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems	Hoang Pham et.al.	2505.22571	null	Kimi
920	2025-05-28	Thinking with Generated Images	Ethan Chern et.al.	2505.22525	null	Kimi
921	2025-05-28	Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO	Lai Wei et.al.	2505.22453	link	Kimi
922	2025-05-28	Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start	Lai Wei et.al.	2505.22334	link	Kimi
923	2025-05-28	Advancing Expert Specialization for Better MoE	Hongcan Guo et.al.	2505.22323	null	Kimi
924	2025-05-28	Skywork Open Reasoner 1 Technical Report	Jujie He et.al.	2505.22312	link	Kimi
925	2025-05-28	Let’s Predict Sentence by Sentence	Hyeonbin Hwang et.al.	2505.22202	null	Kimi
926	2025-05-28	Speculative Decoding Meets Quantization: Compatibility Evaluation and Hierarchical Framework Design	Yudi Zhang et.al.	2505.22179	link	Kimi
927	2025-05-28	InComeS: Integrating Compression and Selection Mechanisms into LLMs for Efficient Model Editing	Shuaiyi Li et.al.	2505.22156	null	Kimi
928	2025-05-28	What Makes a Good Reasoning Chain? Uncovering Structural Patterns in Long Chain-of-Thought Reasoning	Gangwei Jiang et.al.	2505.22148	null	Kimi
929	2025-05-28	Flexible Tool Selection through Low-dimensional Attribute Alignment of Vision and Language	Guangfu Hao et.al.	2505.22146	null	Kimi
930	2025-05-28	Curse of High Dimensionality Issue in Transformer for Long-context Modeling	Shuhai Zhang et.al.	2505.22107	link	Kimi
931	2025-05-28	CoThink: Token-Efficient Reasoning via Instruct Models Guiding Reasoning Models	Siqi Fan et.al.	2505.22017	null	Kimi
932	2025-05-28	Towards Efficient Key-Value Cache Management for Prefix Prefilling in LLM Inference	Yue Zhu et.al.	2505.21919	null	Kimi
933	2025-05-28	Co-Saving: Resource Aware Multi-Agent Collaboration for Software Development	Rennai Qiu et.al.	2505.21898	null	Kimi
934	2025-05-28	EFIM: Efficient Serving of LLMs for Infilling Tasks with Improved KV Cache Reuse	Tianyu Guo et.al.	2505.21889	link	Kimi
935	2025-05-27	Towards Safety Reasoning in LLMs: AI-agentic Deliberation for Policy-embedded CoT Data Creation	Tharindu Kumarage et.al.	2505.21784	null	Kimi
936	2025-05-27	R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing	Tianyu Fu et.al.	2505.21600	link	Kimi
937	2025-05-27	Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making	Yihan Wang et.al.	2505.21503	null	Kimi
938	2025-05-27	Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers	Wei Pang et.al.	2505.21497	link	Kimi
939	2025-05-27	Hardware-Efficient Attention for Fast Decoding	Ted Zadouri et.al.	2505.21487	null	Kimi
940	2025-05-27	Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion	Zhanqiu Hu et.al.	2505.21467	null	Kimi
941	2025-05-28	Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity	Yehui Tang et.al.	2505.21411	null	Kimi
942	2025-05-27	Evaluating LLM Adaptation to Sociodemographic Factors: User Profile vs. Dialogue History	Qishuai Zhong et.al.	2505.21362	link	Kimi
943	2025-05-27	Leveraging Large Language Models for Bengali Math Word Problem Solving with Chain of Thought Reasoning	Bidyarthi Paul et.al.	2505.21354	null	Kimi
944	2025-05-27	PEDANTIC: A Dataset for the Automatic Examination of Definiteness in Patent Claims	Valentin Knappich et.al.	2505.21342	null	Kimi
945	2025-05-28	HoliTom: Holistic Token Merging for Fast Video Large Language Models	Kele Shao et.al.	2505.21334	link	Kimi
946	2025-05-27	Beyond Chemical QA: Evaluating LLM’s Chemical Reasoning with Modular Chemical Operations	Hao Li et.al.	2505.21318	null	Kimi
947	2025-05-27	Complex System Diagnostics Using a Knowledge Graph-Informed and Large Language Model-Enhanced Framework	Saman Marandi et.al.	2505.21291	null	Kimi
948	2025-05-27	Exploring the Latent Capacity of LLMs for One-Step Text Generation	Gleb Mezentsev et.al.	2505.21189	null	Kimi
949	2025-05-27	Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning	Mingyang Song et.al.	2505.21178	null	Kimi
950	2025-05-27	Thinker: Learning to Think Fast and Slow	Stephen Chung et.al.	2505.21097	null	Kimi
951	2025-05-27	Uni3D-MoE: Scalable Multimodal 3D Scene Understanding via Mixture of Experts	Yue Zhang et.al.	2505.21079	null	Kimi
952	2025-05-27	Efficient Large Language Model Inference with Neural Block Linearization	Mete Erdogan et.al.	2505.21077	null	Kimi
953	2025-05-27	Who Reasons in the Large Language Models?	Jie Shao et.al.	2505.20993	null	Kimi
954	2025-05-27	Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation	Pingrui Zhang et.al.	2505.20897	link	Kimi
955	2025-05-27	Trans-EnV: A Framework for Evaluating the Linguistic Robustness of LLMs Against English Varieties	Jiyoung Lee et.al.	2505.20875	null	Kimi
956	2025-05-27	Fork-Merge Decoding: Enhancing Multimodal Understanding in Audio-Visual Large Language Models	Chaeyoung Jung et.al.	2505.20873	null	Kimi
957	2025-05-27	AVCD: Mitigating Hallucinations in Audio-Visual Large Language Models through Contrastive Decoding	Chaeyoung Jung et.al.	2505.20862	null	Kimi
958	2025-05-27	SpecExtend: A Drop-in Enhancement for Speculative Decoding of Long Sequences	Jungyoub Cha et.al.	2505.20776	link	Kimi
959	2025-05-27	Dissecting Physics Reasoning in Small Language Models: A Multi-Dimensional Analysis from an Educational Perspective	Nicy Scaria et.al.	2505.20707	null	Kimi
960	2025-05-27	Self-Route: Automatic Mode Switching via Capability Estimation for Efficient Reasoning	Yang He et.al.	2505.20664	null	Kimi
961	2025-05-26	Embodied AI with Foundation Models for Mobile Service Robots: A Systematic Review	Matthew Lisondra et.al.	2505.20503	null	Kimi
962	2025-05-26	HAMburger: Accelerating LLM Inference via Token Smashing	Jingyu Liu et.al.	2505.20438	null	Kimi
963	2025-05-26	What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models	Lorenzo Baraldi et.al.	2505.20405	null	Kimi
964	2025-05-27	Does quantization affect models’ performance on long-context tasks?	Anmol Mekala et.al.	2505.20276	link	Kimi
965	2025-05-26	FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models	Hao Kang et.al.	2505.20225	link	Kimi
966	2025-05-26	THiNK: Can Large Language Models Think-aloud?	Yongan Yu et.al.	2505.20184	link	Kimi
967	2025-05-26	Adaptive Deep Reasoning: Triggering Deep Thinking When Needed	Yunhao Wang et.al.	2505.20101	null	Kimi
968	2025-05-26	AdaTP: Attention-Debiased Token Pruning for Video Large Language Models	Fengyuan Sun et.al.	2505.20100	null	Kimi
969	2025-05-26	Incentivizing Reasoning from Weak Supervision	Yige Yuan et.al.	2505.20072	link	Kimi
970	2025-05-26	Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion	Zheqi Lv et.al.	2505.20053	link	Kimi
971	2025-05-26	Grammars of Formal Uncertainty: When to Trust LLMs in Automated Reasoning Tasks	Debargha Ganguly et.al.	2505.20047	null	Kimi
972	2025-05-26	Uncertainty-Aware Attention Heads: Efficient Unsupervised Uncertainty Quantification for LLMs	Artem Vazhentsev et.al.	2505.20045	null	Kimi
973	2025-05-26	Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles	Jiangjie Chen et.al.	2505.19914	null	Kimi
974	2025-05-26	ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows	Qiushi Sun et.al.	2505.19897	null	Kimi
975	2025-05-26	HS-STAR: Hierarchical Sampling for Self-Taught Reasoners via Difficulty Estimation and Budget Reallocation	Feng Xiong et.al.	2505.19866	null	Kimi
976	2025-05-26	Done Is Better than Perfect: Unlocking Efficient Reasoning by Structured Multi-Turn Decomposition	Zihao Zeng et.al.	2505.19788	null	Kimi
977	2025-05-26	Leveraging Importance Sampling to Detach Alignment Modules from Large Language Models	Yi Liu et.al.	2505.19700	null	Kimi
978	2025-05-26	Large Language Models for Planning: A Comprehensive and Systematic Survey	Pengfei Cao et.al.	2505.19683	link	Kimi
979	2025-05-26	MoESD: Unveil Speculative Decoding’s Potential for Accelerating Sparse MoE	Zongle Huang et.al.	2505.19645	null	Kimi
980	2025-05-26	SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond	Junteng Liu et.al.	2505.19641	link	Kimi
981	2025-05-26	Interleaved Reasoning for Large Language Models via Reinforcement Learning	Roy Xie et.al.	2505.19640	null	Kimi
982	2025-05-26	Faster and Better LLMs via Latency-Aware Test-Time Scaling	Zili Wang et.al.	2505.19634	null	Kimi
983	2025-05-26	Multi-Agent Collaboration via Evolving Orchestration	Yufan Dang et.al.	2505.19591	null	Kimi
984	2025-05-26	TailorKV: A Hybrid Framework for Long-Context Inference via Tailored KV Cache Optimization	Dingyu Yao et.al.	2505.19586	link	Kimi
985	2025-05-26	Accelerating Prefilling for Long-Context LLMs via Sparse Pattern Sharing	Dan Peng et.al.	2505.19578	null	Kimi
986	2025-05-26	FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models	Jintao Tong et.al.	2505.19536	link	Kimi
987	2025-05-26	Win Fast or Lose Slow: Balancing Speed and Accuracy in Latency-Sensitive Decisions of LLMs	Hao Kang et.al.	2505.19481	link	Kimi
988	2025-05-26	BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs	Guilong Lu et.al.	2505.19457	link	Kimi
989	2025-05-26	Task Memory Engine: Spatial Memory for Robust Multi-Step LLM Agents	Ye Ye et.al.	2505.19436	link	Kimi
990	2025-05-26	CoTGuard: Using Chain-of-Thought Triggering for Copyright Protection in Multi-Agent LLM Systems	Yan Wen et.al.	2505.19405	null	Kimi
991	2025-05-25	100-LongBench: Are de facto Long-Context Benchmarks Literally Evaluating Long-Context Ability?	Wang Yang et.al.	2505.19293	link	Kimi
992	2025-05-25	To CoT or To Loop? A Formal Comparison Between Chain-of-Thought and Looped Transformers	Kevin Xu et.al.	2505.19245	null	Kimi
993	2025-05-25	LLLMs: A Data-Driven Survey of Evolving Research on Limitations of Large Language Models	Aida Kostikova et.al.	2505.19240	null	Kimi
994	2025-05-25	GUARDIAN: Safeguarding LLM Multi-Agent Collaborations with Temporal Graph Modeling	Jialong Zhou et.al.	2505.19234	null	Kimi
995	2025-05-25	SpeakStream: Streaming Text-to-Speech with Interleaved Data	Richard He Bai et.al.	2505.19206	null	Kimi
996	2025-05-25	DREAM: Drafting with Refined Target Features and Entropy-Adaptive Cross-Attention Fusion for Multimodal Speculative Decoding	Yunhai Hu et.al.	2505.19201	link	Kimi
997	2025-05-22	GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning	Chengqi Duan et.al.	2505.17022	link	Kimi
998	2025-05-22	CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms	Shilin Yan et.al.	2505.17020	link	Kimi
999	2025-05-22	Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO	Chengzhuo Tong et.al.	2505.17017	link	Kimi
1000	2025-05-22	Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models	Runsen Xu et.al.	2505.17015	null	Kimi
1001	2025-05-22	SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding	Haoning Wu et.al.	2505.17012	link	Kimi
1002	2025-05-22	R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning	Huatong Song et.al.	2505.17005	link	Kimi
1003	2025-05-22	Do Large Language Models Excel in Complex Logical Reasoning with Formal Language?	Jin Jiang et.al.	2505.16998	link	Kimi
1004	2025-05-22	X-MAS: Towards Building Multi-Agent Systems with Heterogeneous LLMs	Rui Ye et.al.	2505.16997	link	Kimi
1005	2025-05-22	$\text{R}^2\text{ec}$ : Towards Large Recommender Models with Reasoning	Runyang You et.al.	2505.16994	link	Kimi
1006	2025-05-22	Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding	Runpeng Yu et.al.	2505.16990	link	Kimi
1007	2025-05-22	T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning	Amartya Chakraborty et.al.	2505.16986	null	Kimi
1008	2025-05-22	Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine	Adib Bazgir et.al.	2505.16982	null	Kimi
1009	2025-05-22	Bottlenecked Transformers: Periodic KV Cache Abstraction for Generalised Reasoning	Adnan Oomerjee et.al.	2505.16950	null	Kimi
1010	2025-05-22	MixAT: Combining Continuous and Discrete Adversarial Training for LLMs	Csaba Dékány et.al.	2505.16947	link	Kimi
1011	2025-05-22	AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios	Yunjia Qi et.al.	2505.16944	link	Kimi
1012	2025-05-22	NovelSeek: When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification	NovelSeek Team et.al.	2505.16938	link	Kimi
1013	2025-05-22	In-Context Watermarks for Large Language Models	Yepeng Liu et.al.	2505.16934	null	Kimi
1014	2025-05-22	Beyond Needle(s) in the Embodied Haystack: Environment, Architecture, and Training Considerations for Long Context Reasoning	Bosung Kim et.al.	2505.16928	null	Kimi
1015	2025-05-22	Don’t “Overthink” Passage Reranking: Is Reasoning Truly Necessary?	Nour Jedidi et.al.	2505.16886	null	Kimi
1016	2025-05-22	CASTILLO: Characterizing Response Length Distributions of Large Language Models	Daniel F. Perez-Ramirez et.al.	2505.16881	link	Kimi
1017	2025-05-22	LaViDa: A Large Diffusion Language Model for Multimodal Understanding	Shufan Li et.al.	2505.16839	link	Kimi
1018	2025-05-22	R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search	Yibo Wang et.al.	2505.16838	link	Kimi
1019	2025-05-22	Fact-R1: Towards Explainable Video Misinformation Detection with Deep Reasoning	Fanrui Zhang et.al.	2505.16836	link	Kimi
1020	2025-05-22	SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis	Shuang Sun et.al.	2505.16834	link	Kimi
1021	2025-05-22	From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization	Haonian Ji et.al.	2505.16832	link	Kimi
1022	2025-05-22	Unlearning Isn’t Deletion: Investigating Reversibility of Machine Unlearning in LLMs	Xiaoyu Xu et.al.	2505.16831	link	Kimi
1023	2025-05-22	KTAE: A Model-Free Algorithm to Key-Tokens Advantage Estimation in Mathematical Reasoning	Wei Sun et.al.	2505.16826	link	Kimi
1024	2025-05-22	REPA Works Until It Doesn’t: Early-Stopped, Holistic Alignment Supercharges Diffusion Training	Ziqiao Wang et.al.	2505.16792	link	Kimi
1025	2025-05-22	CoTSRF: Utilize Chain of Thought as Stealthy and Robust Fingerprint of Large Language Models	Zhenzhen Ren et.al.	2505.16785	null	Kimi
1026	2025-05-22	Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning	Xinghao Chen et.al.	2505.16782	link	Kimi
1027	2025-05-22	R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO	Huanjin Yao et.al.	2505.16673	link	Kimi
1028	2025-05-22	Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding	Feilong Tang et.al.	2505.16652	null	Kimi
1029	2025-05-22	Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains	Wenhui Tan et.al.	2505.16552	null	Kimi
1030	2025-05-22	LLaMAs Have Feelings Too: Unveiling Sentiment and Emotion Representations in LLaMA Models Through Probing	Dario Di Palma et.al.	2505.16491	null	Kimi
1031	2025-05-22	WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning	Zhepei Wei et.al.	2505.16421	link	Kimi
1032	2025-05-22	DriveMoE: Mixture-of-Experts for Vision-Language-Action Model in End-to-End Autonomous Driving	Zhenjie Yang et.al.	2505.16278	null	Kimi
1033	2025-05-22	LIFEBench: Evaluating Length Instruction Following in Large Language Models	Wei Zhang et.al.	2505.16234	link	Kimi
1034	2025-05-22	NQKV: A KV Cache Quantization Scheme Based on Normal Distribution Characteristics	Zhihang Cai et.al.	2505.16210	null	Kimi
1035	2025-05-22	QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design	Benjamin Schneider et.al.	2505.16175	link	Kimi
1036	2025-05-22	KNN-SSD: Enabling Dynamic Self-Speculative Decoding via Nearest Neighbor Layer Set Optimization	Mingbo Song et.al.	2505.16162	null	Kimi
1037	2025-05-22	Training-Free Reasoning and Reflection in MLLMs	Hongchen Wei et.al.	2505.16151	null	Kimi
1038	2025-05-22	Steering LVLMs via Sparse Autoencoder for Hallucination Mitigation	Zhenglin Hua et.al.	2505.16146	null	Kimi
1039	2025-05-22	Date Fragments: A Hidden Bottleneck of Tokenization for Temporal Reasoning	Gagan Bhatia et.al.	2505.16088	null	Kimi
1040	2025-05-22	Optimizing LLM-Based Multi-Agent System with Textual Feedback: A Case Study on Software Development	Ming Shen et.al.	2505.16086	null	Kimi
1041	2025-05-21	Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models	Jingcong Liang et.al.	2505.16056	link	Kimi
1042	2025-05-21	Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning	Alex Su et.al.	2505.15966	null	Kimi
1043	2025-05-21	Extracting Probabilistic Knowledge from Large Language Models for Bayesian Network Parameterization	Aliakbar Nafar et.al.	2505.15918	null	Kimi
1044	2025-05-21	dKV-Cache: The Cache for Diffusion Language Models	Xinyin Ma et.al.	2505.15781	link	Kimi
1045	2025-05-21	Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space	Zhen Zhang et.al.	2505.15778	link	Kimi
1046	2025-05-21	Beyond Hard and Soft: Hybrid Context Compression for Balancing Local and Global Information Retention	Huanxuan Liao et.al.	2505.15774	link	Kimi
1047	2025-05-21	ThinkLess: A Training-Free Inference-Efficient Method for Reducing Reasoning Redundancy	Gengyang Li et.al.	2505.15684	null	Kimi
1048	2025-05-21	A Federated Splitting Framework for LLMs: Security, Efficiency, and Adaptability	Zishuai Zhang et.al.	2505.15683	link	Kimi
1049	2025-05-21	Feature Extraction and Steering for Enhanced Chain-of-Thought Reasoning in Language Models	Zihao Li et.al.	2505.15634	null	Kimi
1050	2025-05-21	Learn to Reason Efficiently with Adaptive Length-based Reward Shaping	Wei Liu et.al.	2505.15612	link	Kimi
1051	2025-05-21	Multilingual Test-Time Scaling via Initial Thought Transfer	Prasoon Bajpai et.al.	2505.15508	null	Kimi
1052	2025-05-21	Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought	Ao Liu et.al.	2505.15431	null	Kimi
1053	2025-05-21	FlowKV: Enhancing Multi-Turn Conversational Coherence in LLMs via Isolated Key-Value Cache Management	Xiang Liu et.al.	2505.15347	null	Kimi
1054	2025-05-21	Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Prefilling Attack	Silvia Cappelletti et.al.	2505.15323	null	Kimi
1055	2025-05-21	Hallucinate at the Last in Long Response Generation: A Case Study on Long Document Summarization	Joonho Yang et.al.	2505.15291	null	Kimi
1056	2025-05-21	LiveVLM: Efficient Online Video Understanding via Streaming-Oriented KV Cache and Retrieval	Zhenyu Ning et.al.	2505.15269	null	Kimi
1057	2025-05-21	Towards Explainable Temporal Reasoning in Large Language Models: A Structure-Aware Generative Framework	Zihao Jiang et.al.	2505.15245	link	Kimi
1058	2025-05-21	Prolonged Reasoning Is Not All You Need: Certainty-Based Adaptive Routing for Efficient LLM/MLLM Reasoning	Jinghui Lu et.al.	2505.15154	null	Kimi
1059	2025-05-21	BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms	Yunlong Hou et.al.	2505.15141	null	Kimi
1060	2025-05-21	The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning	Shivam Agarwal et.al.	2505.15134	link	Kimi
1061	2025-05-21	An Empirical Study on Reinforcement Learning for Reasoning-Search Interleaved LLM Agents	Bowen Jin et.al.	2505.15117	link	Kimi
1062	2025-05-21	RoT: Enhancing Table Reasoning with Iterative Row-Wise Traversals	Xuanliang Zhang et.al.	2505.15110	null	Kimi
1063	2025-05-21	Traveling Across Languages: Benchmarking Cross-Lingual Consistency in Multimodal LLMs	Hao Wang et.al.	2505.15075	link	Kimi
1064	2025-05-21	Learning to Rank Chain-of-Thought: An Energy-Based Approach with Outcome Supervision	Eric Hanchen Jiang et.al.	2505.14999	null	Kimi
1065	2025-05-20	STree: Speculative Tree Decoding for Hybrid State-Space Models	Yangchao Wu et.al.	2505.14969	null	Kimi
1066	2025-05-20	Too Long, Didn’t Model: Decomposing LLM Long-Context Understanding With Novels	Sil Hamilton et.al.	2505.14925	link	Kimi
1067	2025-05-20	Scaling Laws for State Dynamics in Large Language Models	Jacob X Li et.al.	2505.14892	null	Kimi
1068	2025-05-20	Balanced and Elastic End-to-end Training of Dynamic LLMs	Mohamed Wahib et.al.	2505.14864	null	Kimi
1069	2025-05-20	Text Generation Beyond Discrete Token Sampling	Yufan Zhuang et.al.	2505.14827	null	Kimi
1070	2025-05-21	Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning	Haolei Xu et.al.	2505.14684	null	Kimi
1071	2025-05-20	Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training	Mengru Wang et.al.	2505.14681	null	Kimi
1072	2025-05-20	Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning	Jiaer Xia et.al.	2505.14677	null	Kimi
1073	2025-05-20	SAFEPATH: Preventing Harmful Reasoning in Chain-of-Thought via Early Alignment	Wonje Jeung et.al.	2505.14667	null	Kimi
1074	2025-05-20	Beyond Words: Multimodal LLM Knows When to Speak	Zikai Liao et.al.	2505.14654	null	Kimi
1075	2025-05-20	KERL: Knowledge-Enhanced Personalized Recipe Recommendation using Large Language Models	Fnu Mohbat et.al.	2505.14629	link	Kimi
1076	2025-05-20	Enhancing Learned Knowledge in LoRA Adapters Through Efficient Contrastive Decoding on Ascend NPUs	Morgan Lindsay Heisler et.al.	2505.14620	null	Kimi
1077	2025-05-20	Can Pruning Improve Reasoning? Revisiting Long-CoT Compression with Capability in Mind for Better Reasoning	Shangziqi Zhao et.al.	2505.14582	null	Kimi
1078	2025-05-20	Reasoning Models Better Express Their Confidence	Dongkeun Yoon et.al.	2505.14489	link	Kimi
1079	2025-05-20	Log-Augmented Generation: Scaling Test-Time Reasoning with Reusable Computation	Peter Baile Chen et.al.	2505.14398	null	Kimi
1080	2025-05-20	Scaling and Enhancing LLM-based AVSR: A Sparse Mixture of Projectors Approach	Umberto Cappellazzo et.al.	2505.14336	null	Kimi
1081	2025-05-20	Speculative Decoding Reimagined for Multimodal Large Language Models	Luxi Lin et.al.	2505.14260	link	Kimi
1082	2025-05-20	FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation	Shaolin Zhu et.al.	2505.14256	null	Kimi
1083	2025-05-20	Tokenization Constraints in LLMs: A Study of Symbolic and Arithmetic Reasoning Limits	Xiang Zhang et.al.	2505.14178	null	Kimi
1084	2025-05-20	RL of Thoughts: Navigating LLM Reasoning with Inference-time Reinforcement Learning	Qianyue Hao et.al.	2505.14140	null	Kimi
1085	2025-05-20	DiagnosisArena: Benchmarking Diagnostic Reasoning for Large Language Models	Yakun Zhu et.al.	2505.14107	link	Kimi
1086	2025-05-20	Toward Effective Reinforcement Learning Fine-Tuning for Medical VQA in Vision-Language Models	Wenhui Zhu et.al.	2505.13973	null	Kimi
1087	2025-05-20	FlashThink: An Early Exit Method For Efficient Reasoning	Guochao Jiang et.al.	2505.13949	null	Kimi
1088	2025-05-20	EEG-to-Text Translation: A Model for Deciphering Human Brain Activity	Saydul Akbar Murad et.al.	2505.13936	link	Kimi
1089	2025-05-20	Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning	Jiwon Song et.al.	2505.13866	link	Kimi
1090	2025-05-20	EfficientLLM: Efficiency in Large Language Models	Zhengqing Yuan et.al.	2505.13840	null	Kimi
1091	2025-05-20	Structured Agent Distillation for Large Language Model	Jun Liu et.al.	2505.13820	null	Kimi
1092	2025-05-19	Ice Cream Doesn’t Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Inference	Jin Du et.al.	2505.13770	null	Kimi
1093	2025-05-19	Causal Head Gating: A Framework for Interpreting Roles of Attention Heads in Transformers	Andrew Nam et.al.	2505.13737	null	Kimi
1094	2025-05-19	RL in Name Only? Analyzing the Structural Assumptions in RL post-training for LLMs	Soumya Rani Samineni et.al.	2505.13697	null	Kimi
1095	2025-05-19	Optimizing Anytime Reasoning via Budget Relative Policy Optimization	Penghui Qi et.al.	2505.13438	link	Kimi
1096	2025-05-19	CoT-Kinetics: A Theoretical Modeling Assessing LRM Reasoning Process	Jinhe Bi et.al.	2505.13408	null	Kimi
1097	2025-05-19	Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training and Inference	Shuqing Luo et.al.	2505.13345	link	Kimi
1098	2025-05-19	Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space	Hengli Li et.al.	2505.13308	link	Kimi
1099	2025-05-19	RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning	Qiguang Chen et.al.	2505.13307	link	Kimi
1100	2025-05-19	Effective and Transparent RAG: Adaptive-Reward Reinforcement Learning for Decision Traceability	Jingyi Ren et.al.	2505.13258	link	Kimi
1101	2025-05-19	HeteroSpec: Leveraging Contextual Heterogeneity for Efficient Speculative Decoding	Siran Liu et.al.	2505.13254	null	Kimi
1102	2025-05-19	Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification	Jikai Wang et.al.	2505.13204	null	Kimi
1103	2025-05-19	Adversarial Testing in LLMs: Insights into Decision-Making Vulnerabilities	Lili Zhang et.al.	2505.13195	null	Kimi
1104	2025-05-19	ModernGBERT: German-only 1B Encoder Model Trained from Scratch	Anton Ehrmanntraut et.al.	2505.13136	null	Kimi
1105	2025-05-19	Benchmarking and Confidence Evaluation of LALMs For Temporal Reasoning	Debarpan Bhattacharya et.al.	2505.13115	link	Kimi
1106	2025-05-19	FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference	Guangda Liu et.al.	2505.13109	null	Kimi
1107	2025-05-19	Walking the Tightrope: Disentangling Beneficial and Detrimental Drifts in Non-Stationary Custom-Tuning	Xiaoyu Yang et.al.	2505.13081	null	Kimi
1108	2025-05-19	MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO	Yicheng Xiao et.al.	2505.13031	link	Kimi
1109	2025-05-19	Fractured Chain-of-Thought Reasoning	Baohao Liao et.al.	2505.12992	null	Kimi
1110	2025-05-19	A3 : an Analytical Low-Rank Approximation Framework for Attention	Jeffrey T. H. Wong et.al.	2505.12942	null	Kimi
1111	2025-05-19	Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs	Zhihe Yang et.al.	2505.12929	link	Kimi
1112	2025-05-19	The Traitors: Deception and Trust in Multi-Agent Language Model Simulations	Pedro M. P. Curvo et.al.	2505.12923	link	Kimi
1113	2025-05-19	LEXam: Benchmarking Legal Reasoning on 340 Law Exams	Yu Fan et.al.	2505.12864	null	Kimi
1114	2025-05-19	Reasoning BO: Enhancing Bayesian Optimization with Long-Context Reasoning Power of LLMs	Zhuo Yang et.al.	2505.12833	null	Kimi
1115	2025-05-19	SynDec: A Synthesize-then-Decode Approach for Arbitrary Textual Style Transfer via Large Language Models	Han Sun et.al.	2505.12821	null	Kimi
1116	2025-05-19	Accelerating Adaptive Retrieval Augmented Generation via Instruction-Driven Representation Reduction of Retrieval Overlaps	Jie Ou et.al.	2505.12731	null	Kimi
1117	2025-05-19	FLASH: Latent-Aware Semi-Autoregressive Speculative Decoding for Multimodal Tasks	Zihua Wang et.al.	2505.12728	link	Kimi
1118	2025-05-19	ToTRL: Unlock LLM Tree-of-Thoughts Reasoning Potential through Puzzles Solving	Haoyuan Wu et.al.	2505.12717	null	Kimi
1119	2025-05-19	Shadow-FT: Tuning Instruct via Base	Taiqiang Wu et.al.	2505.12716	link	Kimi
1120	2025-05-19	Ineq-Comp: Benchmarking Human-Intuitive Compositional Reasoning in Automated Theorem Proving on Inequalities	Haoyu Zhao et.al.	2505.12680	link	Kimi
1121	2025-05-19	HydraInfer: Hybrid Disaggregated Scheduling for Multimodal Large Language Model Serving	Xianzhe Dong et.al.	2505.12658	null	Kimi
1122	2025-05-19	Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents	Yunseok Jang et.al.	2505.12632	null	Kimi
1123	2025-05-19	Enhancing Latent Computation in Transformers with Latent Tokens	Yuchang Sun et.al.	2505.12629	null	Kimi
1124	2025-05-18	A Survey of Attacks on Large Language Models	Wenrui Xu et.al.	2505.12567	null	Kimi
1125	2025-05-15	3D-Fixup: Advancing Photo Editing with 3D Priors	Yen-Chi Cheng et.al.	2505.10566	null	Kimi
1126	2025-05-15	End-to-End Vision Tokenizer Tuning	Wenxuan Wang et.al.	2505.10562	null	Kimi
1127	2025-05-15	Neural Thermodynamic Laws for Large Language Model Training	Ziming Liu et.al.	2505.10559	null	Kimi
1128	2025-05-15	MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning	Ke Wang et.al.	2505.10557	link	Kimi
1129	2025-05-15	Beyond ‘Aha!’: Toward Systematic Meta-Abilities Alignment in Large Reasoning Models	Zhiyuan Hu et.al.	2505.10554	link	Kimi
1130	2025-05-15	Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data	Yiwen Liu et.al.	2505.10551	link	Kimi
1131	2025-05-15	Real-Time Out-of-Distribution Failure Prevention via Multi-Modal Reasoning	Milan Ganai et.al.	2505.10547	null	Kimi
1132	2025-05-15	Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models	Annie Wong et.al.	2505.10543	link	Kimi
1133	2025-05-15	Exploring Implicit Visual Misunderstandings in Multimodal Large Language Models through Attention Analysis	Pengfei Wang et.al.	2505.10541	link	Kimi
1134	2025-05-15	Enhancing Multi-Image Question Answering via Submodular Subset Selection	Aaryan Sharma et.al.	2505.10533	null	Kimi
1135	2025-05-15	MASSV: Multimodal Adaptation and Self-Data Distillation for Speculative Decoding of Vision-Language Models	Mugilan Ganesan et.al.	2505.10526	null	Kimi
1136	2025-05-15	Knowledge capture, adaptation and composition (KCAC): A framework for cross-task curriculum learning in robotic manipulation	Xinrui Wang et.al.	2505.10522	null	Kimi
1137	2025-05-15	Multi-Token Prediction Needs Registers	Anastasios Gerontopoulos et.al.	2505.10518	link	Kimi
1138	2025-05-15	The Devil Is in the Word Alignment Details: On Translation-Based Cross-Lingual Transfer for Token Classification Tasks	Benedikt Ebing et.al.	2505.10507	null	Kimi
1139	2025-05-15	RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs	Vibha Belavadi et.al.	2505.10495	null	Kimi
1140	2025-05-15	Can You Really Trust Code Copilots? Evaluating Large Language Models from a Code Security Perspective	Yutao Mou et.al.	2505.10494	link	Kimi
1141	2025-05-15	CL-RAG: Bridging the Gap in Retrieval-Augmented Generation with Curriculum Learning	Shaohan Wang et.al.	2505.10493	null	Kimi
1142	2025-05-15	UniEval: Unified Holistic Evaluation for Unified Multimodal Understanding and Generation	Yi Li et.al.	2505.10483	null	Kimi
1143	2025-05-15	Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps	Ningyuan Yang et.al.	2505.10482	null	Kimi
1144	2025-05-15	Parallel Scaling Law for Language Models	Mouxiang Chen et.al.	2505.10475	link	Kimi
1145	2025-05-15	AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenge	Ranjan Sapkota et.al.	2505.10468	null	Kimi
1146	2025-05-15	Superposition Yields Robust Neural Scaling	Yizhou liu et.al.	2505.10465	link	Kimi
1147	2025-05-15	Vision language models have difficulty recognizing virtual objects	Tyler Tran et.al.	2505.10453	null	Kimi
1148	2025-05-15	Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models	Zemin Huang et.al.	2505.10446	null	Kimi
1149	2025-05-15	Are Large Language Models Robust in Understanding Code Against Semantics-Preserving Mutations?	Pedro Orvalho et.al.	2505.10443	null	Kimi
1150	2025-05-15	Hierarchical Document Refinement for Long-context Retrieval-augmented Generation	Jiajie Jin et.al.	2505.10413	link	Kimi
1151	2025-05-15	Are LLM-generated plain language summaries truly understandable? A large-scale crowdsourced evaluation	Yue Guo et.al.	2505.10409	null	Kimi
1152	2025-05-15	Visual Fidelity Index for Generative Semantic Communications with Critical Information Embedding	Jianhao Huang et.al.	2505.10405	null	Kimi
1153	2025-05-15	Rethinking Repetition Problems of LLMs in Code Generation	Yihong Dong et.al.	2505.10402	link	Kimi
1154	2025-05-15	Evaluating Model Explanations without Ground Truth	Kaivalya Rawal et.al.	2505.10399	link	Kimi
1155	2025-05-15	J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning	Chenxi Whitehouse et.al.	2505.10320	null	Kimi
1156	2025-05-15	StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation	Daniel A. P. Oliveira et.al.	2505.10292	link	Kimi
1157	2025-05-15	The Evolving Landscape of Generative Large Language Models and Traditional Natural Language Processing in Medicine	Rui Yang et.al.	2505.10261	null	Kimi
1158	2025-05-15	Comparing LLM Text Annotation Skills: A Study on Human Rights Violations in Social Media Data	Poli Apollinaire Nemkova et.al.	2505.10260	link	Kimi
1159	2025-05-15	On the Interplay of Human-AI Alignment,Fairness, and Performance Trade-offs in Medical Imaging	Haozhe Luo et.al.	2505.10231	link	Kimi
1160	2025-05-15	ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention	Jintian Shao et.al.	2505.10222	null	Kimi
1161	2025-05-15	The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think	Seongyun Lee et.al.	2505.10185	null	Kimi
1162	2025-05-15	GE-Chat: A Graph Enhanced RAG Framework for Evidential Response Generation of LLMs	Longchao Da et.al.	2505.10143	null	Kimi
1163	2025-05-15	From Text to Network: Constructing a Knowledge Graph of Taiwan-Based China Studies Using Generative AI	Hsuan-Lei Shao et.al.	2505.10093	null	Kimi
1164	2025-05-15	CAFE: Retrieval Head-based Coarse-to-Fine Information Seeking to Enhance Multi-Document QA Capability	Han Peng et.al.	2505.10063	null	Kimi
1165	2025-05-15	PsOCR: Benchmarking Large Multimodal Models for Optical Character Recognition in Low-resource Pashto Language	Ijazul Haq et.al.	2505.10055	link	Kimi
1166	2025-05-15	ServeGen: Workload Characterization and Generation of Large Language Model Serving in Production	Yuxing Xiang et.al.	2505.09999	link	Kimi
1167	2025-05-15	Analysing Safety Risks in LLMs Fine-Tuned with Pseudo-Malicious Cyber Security Data	Adel ElZemity et.al.	2505.09974	null	Kimi
1168	2025-05-15	Pre-Act: Multi-Step Planning and Reasoning Improves Acting in LLM Agents	Mrinal Rawat et.al.	2505.09970	null	Kimi
1169	2025-05-15	Personalizing Large Language Models using Retrieval Augmented Generation and Knowledge Graph	Deeksha Prahlad et.al.	2505.09945	link	Kimi
1170	2025-05-15	Comparing Exploration-Exploitation Strategies of LLMs and Humans: Insights from Standard Multi-armed Bandit Tasks	Ziyuan Zhang et.al.	2505.09901	link	Kimi
1171	2025-05-14	Do Large Language Models Know Conflict? Investigating Parametric vs. Non-Parametric Knowledge of LLMs for Conflict Forecasting	Apollinaire Poli Nemkova et.al.	2505.09852	null	Kimi
1172	2025-05-14	Contextual Phenotyping of Pediatric Sepsis Cohort Using Large Language Models	Aditya Nagori et.al.	2505.09805	null	Kimi
1173	2025-05-14	Trustless Autonomy: Understanding Motivations, Benefits and Governance Dilemma in Self-Sovereign Decentralized AI Agents	Botao Amber Hu et.al.	2505.09757	null	Kimi
1174	2025-05-14	System Prompt Optimization with Meta-Learning	Yumin Choi et.al.	2505.09666	null	Kimi
1175	2025-05-14	Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists?	Anthony GX-Chen et.al.	2505.09614	null	Kimi
1176	2025-05-14	Customizing a Large Language Model for VHDL Design of High-Performance Microprocessors	Nicolas Dupuis et.al.	2505.09610	null	Kimi
1177	2025-05-14	WorldView-Bench: A Benchmark for Evaluating Global Cultural Perspectives in Large Language Models	Abdullah Mushtaq et.al.	2505.09595	null	Kimi
1178	2025-05-14	PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt Tuning	Zongqian Li et.al.	2505.09519	link	Kimi
1179	2025-05-14	CXMArena: Unified Dataset to benchmark performance in realistic CXM Scenarios	Raghav Garg et.al.	2505.09436	link	Kimi
1180	2025-05-14	Endo-CLIP: Progressive Self-Supervised Pre-training on Raw Colonoscopy Records	Yili He et.al.	2505.09435	null	Kimi
1181	2025-05-14	Multilingual Machine Translation with Quantum Encoder Decoder Attention-based Convolutional Variational Circuits	Subrit Dikshit et.al.	2505.09407	null	Kimi
1182	2025-05-14	The Influence of Human-inspired Agentic Sophistication in LLM-driven Strategic Reasoners	Vince Trencsenyi et.al.	2505.09396	null	Kimi
1183	2025-05-14	Qwen3 Technical Report	An Yang et.al.	2505.09388	link	Kimi
1184	2025-05-14	Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures	Chenggang Zhao et.al.	2505.09343	null	Kimi
1185	2025-05-14	Llama See, Llama Do: A Mechanistic Perspective on Contextual Entrainment and Distraction in LLMs	Jingcheng Niu et.al.	2505.09338	link	Kimi
1186	2025-05-14	Scent of Knowledge: Optimizing Search-Enhanced Reasoning with Information Foraging	Hongjin Qian et.al.	2505.09316	null	Kimi
1187	2025-05-14	Reproducibility Study of “Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents”	Pedro M. P. Curvo et.al.	2505.09289	link	Kimi
1188	2025-05-14	Learning to Detect Multi-class Anomalies with Just One Normal Image Prompt	Bin-Bin Gao et.al.	2505.09264	link	Kimi
1189	2025-05-14	ELIS: Efficient LLM Iterative Scheduling System with Response Length Predictor	Seungbeom Choi et.al.	2505.09142	null	Kimi
1190	2025-05-14	CEC-Zero: Chinese Error Correction Solution Based on LLM	Sophie Zhang et.al.	2505.09082	null	Kimi
1191	2025-05-14	A Comprehensive Analysis of Large Language Model Outputs: Similarity, Diversity, and Bias	Brandon Smith et.al.	2505.09056	null	Kimi
1192	2025-05-13	Improving the Reliability of LLMs: Combining CoT, RAG, Self-Consistency, and Self-Verification	Adarsh Kumar et.al.	2505.09031	null	Kimi
1193	2025-05-13	Automated Meta Prompt Engineering for Alignment with the Theory of Mind	Aaron Baughman et.al.	2505.09024	null	Kimi
1194	2025-05-13	Prioritizing Image-Related Tokens Enhances Vision-Language Pre-Training	Yangyi Chen et.al.	2505.08971	link	Kimi
1195	2025-05-13	Toward Cost-Efficient Serving of Mixture-of-Experts with Asynchrony	Shaoyu Wang et.al.	2505.08944	null	Kimi
1196	2025-05-13	Performance Gains of LLMs With Humans in a World of LLMs Versus Humans	Lucas McCullum et.al.	2505.08902	null	Kimi
1197	2025-05-13	Generative AI for Autonomous Driving: Frontiers and Opportunities	Yuping Wang et.al.	2505.08854	link	Kimi
1198	2025-05-13	CodePDE: An Inference Framework for LLM-driven PDE Solver Generation	Shanda Li et.al.	2505.08783	link	Kimi
1199	2025-05-14	Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology	Yatai Ji et.al.	2505.08765	null	Kimi
1200	2025-05-13	DeepMath-Creative: A Benchmark for Evaluating Mathematical Creativity of Large Language Models	Xiaoyang Chen et.al.	2505.08744	link	Kimi
1201	2025-05-13	Probability Consistency in Large Language Models: Theoretical Foundations Meet Empirical Discrepancies	Xiaoliang Luo et.al.	2505.08739	link	Kimi
1202	2025-05-13	NurValues: Real-World Nursing Values Evaluation for Large Language Models in Clinical Context	Ben Yao et.al.	2505.08734	null	Kimi
1203	2025-05-13	PWC-MoE: Privacy-Aware Wireless Collaborative Mixture of Experts	Yang Su et.al.	2505.08719	null	Kimi
1204	2025-05-13	LLM-based Prompt Ensemble for Reliable Medical Entity Recognition from EHRs	K M Sajjadul Islam et.al.	2505.08704	null	Kimi
1205	2025-05-13	TRAIL: Trace Reasoning and Agentic Issue Localization	Darshan Deshpande et.al.	2505.08638	null	Kimi
1206	2025-05-13	Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models	Donghoon Kim et.al.	2505.08622	null	Kimi
1207	2025-05-13	Automatic Task Detection and Heterogeneous LLM Speculative Decoding	Danying Ge et.al.	2505.08600	null	Kimi
1208	2025-05-13	Small but Significant: On the Promise of Small Language Models for Accessible AIED	Yumou Wei et.al.	2505.08588	null	Kimi
1209	2025-05-13	The Truth Becomes Clearer Through Debate! Multi-Agent Systems with Large Language Models Unmask Fake News	Yuhan Liu et.al.	2505.08532	null	Kimi
1210	2025-05-13	LCES: Zero-shot Automated Essay Scoring via Pairwise Comparisons Using Large Language Models	Takumi Shibata et.al.	2505.08498	null	Kimi
1211	2025-05-13	RepCali: High Efficient Fine-tuning Via Representation Calibration in Latent Space for Pre-trained Language Models	Fujun Zhang et.al.	2505.08463	null	Kimi
1212	2025-05-13	Accelerating Chain-of-Thought Reasoning: When Goal-Gradient Importance Meets Dynamic Skipping	Ren Zhuang et.al.	2505.08392	null	Kimi
1213	2025-05-13	Benchmarking AI scientists in omics data-driven biological research	Erpai Luo et.al.	2505.08341	link	Kimi
1214	2025-05-13	AM-Thinking-v1: Advancing the Frontier of Reasoning at 32B Scale	Yunjie Ji et.al.	2505.08311	null	Kimi
1215	2025-05-13	Evaluating the Effectiveness of Black-Box Prompt Optimization as the Scale of LLMs Continues to Grow	Ziyu Zhou et.al.	2505.08303	null	Kimi
1216	2025-05-13	Enhancing Cache-Augmented Generation (CAG) with Adaptive Contextual Compression for Scalable Knowledge Integration	Rishabh Agrawal et.al.	2505.08261	null	Kimi
1217	2025-05-13	Evaluating LLM Metrics Through Real-World Capabilities	Justin K Miller et.al.	2505.08253	null	Kimi
1218	2025-05-13	Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement	Haoran Ye et.al.	2505.08245	link	Kimi
1219	2025-05-13	A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM Outputs	Artem Shelmanov et.al.	2505.08200	null	Kimi
1220	2025-05-13	Fusing Bidirectional Chains of Thought and Reward Mechanisms A Method for Enhancing Question-Answering Capabilities of Large Language Models for Chinese Intangible Cultural Heritage	Ruilin Liu et.al.	2505.08167	null	Kimi
1221	2025-05-13	Decoding Neighborhood Environments with Large Language Models	Andrew Cart et.al.	2505.08163	null	Kimi
1222	2025-05-13	Lost in Transmission: When and Why LLMs Fail to Reason Globally	Tobias Schnabel et.al.	2505.08140	null	Kimi
1223	2025-05-13	ALOHA: Empowering Multilingual Agent for University Orientation with Hierarchical Retrieval	Mingxu Tao et.al.	2505.08130	null	Kimi
1224	2025-05-12	Are LLMs complicated ethical dilemma analyzers?	Jiashen et.al.	2505.08106	link	Kimi
1225	2025-05-12	Beyond Input Activations: Identifying Influential Latents by Gradient Sparse Autoencoders	Dong Shu et.al.	2505.08080	null	Kimi
1226	2025-05-12	FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning	Zhehao Zhang et.al.	2505.08054	null	Kimi
1227	2025-05-12	Learning from Peers in Reasoning Models	Tongxu Luo et.al.	2505.07787	null	Kimi
1228	2025-05-12	S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models	Muzhi Dai et.al.	2505.07686	null	Kimi
1229	2025-05-12	SpecRouter: Adaptive Routing for Multi-Level Speculative Decoding in Large Language Models	Hang Wu et.al.	2505.07680	null	Kimi
1230	2025-05-13	OnPrem.LLM: A Privacy-Conscious Document Intelligence Toolkit	Arun S. Maiya et.al.	2505.07672	link	Kimi
1231	2025-05-12	Benchmarking Retrieval-Augmented Generation for Chemistry	Xianrui Zhong et.al.	2505.07671	null	Kimi
1232	2025-05-12	Concept-Level Explainability for Auditing & Steering LLM Responses	Kenza Amara et.al.	2505.07610	link	Kimi
1233	2025-05-12	MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining	Xiaomi LLM-Core Team et.al.	2505.07608	link	Kimi
1234	2025-05-12	Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent	Ziyang Huang et.al.	2505.07596	null	Kimi
1235	2025-05-12	A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models	Junjie Ye et.al.	2505.07591	link	Kimi
1236	2025-05-12	ToolACE-DEV: Self-Improving Tool Learning via Decomposition and EVolution	Xu Huang et.al.	2505.07512	null	Kimi
1237	2025-05-12	A Survey on Collaborative Mechanisms Between Large and Small Language Models	Yi Chen et.al.	2505.07460	null	Kimi
1238	2025-05-12	How well do LLMs reason over tabular data, really?	Cornelius Wolff et.al.	2505.07453	null	Kimi
1239	2025-05-12	Synthetic Code Surgery: Repairing Bugs and Vulnerabilities with LLMs and Synthetic Data	David de-Fitero-Dominguez et.al.	2505.07372	null	Kimi
1240	2025-05-12	QUPID: Quantified Understanding for Enhanced Performance, Insights, and Decisions in Korean Search Engines	Ohjoon Kwon et.al.	2505.07345	null	Kimi
1241	2025-05-12	Generative Pre-trained Autoregressive Diffusion Transformer	Yuan Zhang et.al.	2505.07344	null	Kimi
1242	2025-05-12	Towards Multi-Agent Reasoning Systems for Collaborative Expertise Delegation: An Exploratory Design Study	Baixuan Xu et.al.	2505.07313	null	Kimi
1243	2025-05-12	Semantic Retention and Extreme Compression in LLMs: Can We Have Both?	Stanislas Laborde et.al.	2505.07289	null	Kimi
1244	2025-05-12	UMoE: Unifying Attention and FFN with Shared Experts	Yuanhang Yang et.al.	2505.07260	null	Kimi
1245	2025-05-12	SAS-Bench: A Fine-Grained Benchmark for Evaluating Short Answer Scoring with Large Language Models	Peichao Lai et.al.	2505.07247	link	Kimi
1246	2025-05-12	Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity	Guang Yan et.al.	2505.07239	null	Kimi
1247	2025-05-12	DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation	Jiashuo Sun et.al.	2505.07233	link	Kimi
1248	2025-05-12	Measuring General Intelligence with Generated Games	Vivek Verma et.al.	2505.07215	link	Kimi
1249	2025-05-12	Benchmarking Ethical and Safety Risks of Healthcare LLMs in China-Toward Systemic Governance under Healthy China 2030	Mouxiao Bian et.al.	2505.07205	null	Kimi
1250	2025-05-12	PrefillOnly: An Inference Engine for Prefill-only Workloads in Large Language Model Applications	Kuntai Du et.al.	2505.07203	null	Kimi
1251	2025-05-12	One Trigger Token Is Enough: A Defense Strategy for Balancing Safety and Usability in Large Language Models	Haoran Gu et.al.	2505.07167	null	Kimi
1252	2025-05-12	Pre-training vs. Fine-tuning: A Reproducibility Study on Dense Retrieval Knowledge Acquisition	Zheng Yao et.al.	2505.07166	link	Kimi
1253	2025-05-11	RefPentester: A Knowledge-Informed Self-Reflective Penetration Testing Framework Based on Large Language Models	Hanzheng Dai et.al.	2505.07089	null	Kimi
1254	2025-05-11	Architectural Precedents for General Agents using Large Language Models	Robert E. Wray et.al.	2505.07087	null	Kimi
1255	2025-05-11	DialogueReason: Rule-Based RL Sparks Dialogue Reasoning in LLMs	Yubo Shu et.al.	2505.07049	null	Kimi
1256	2025-05-11	LLM-Augmented Chemical Synthesis and Design Decision Programs	Haorui Wang et.al.	2505.07027	null	Kimi
1257	2025-05-08	DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion	Qitao Zhao et.al.	2505.05473	null	Kimi
1258	2025-05-08	Flow-GRPO: Training Flow Matching Models via Online RL	Jie Liu et.al.	2505.05470	link	Kimi
1259	2025-05-08	Generating Physically Stable and Buildable LEGO Designs from Text	Ava Pun et.al.	2505.05469	link	Kimi
1260	2025-05-08	StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant	Haibo Wang et.al.	2505.05467	null	Kimi
1261	2025-05-08	ComPO: Preference Alignment via Comparison Oracles	Peter Chen et.al.	2505.05465	null	Kimi
1262	2025-05-08	Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging	Shiqi Chen et.al.	2505.05464	link	Kimi
1263	2025-05-08	UKElectionNarratives: A Dataset of Misleading Narratives Surrounding Recent UK General Elections	Fatima Haouari et.al.	2505.05459	null	Kimi
1264	2025-05-08	SITE: towards Spatial Intelligence Thorough Evaluation	Wenqi Wang et.al.	2505.05456	null	Kimi
1265	2025-05-08	Conversational Process Model Redesign	Nataliia Klievtsova et.al.	2505.05453	null	Kimi
1266	2025-05-08	Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding	Han Xiao et.al.	2505.05446	link	Kimi
1267	2025-05-08	clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations	Chalamalasetti Kranti et.al.	2505.05445	null	Kimi
1268	2025-05-08	EcoAgent: An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation	Biao Yi et.al.	2505.05440	null	Kimi
1269	2025-05-08	Empowering Scientific Workflows with Federated Agents	J. Gregory Pauloski et.al.	2505.05428	link	Kimi
1270	2025-05-08	Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data	Yudong Wang et.al.	2505.05427	null	Kimi
1271	2025-05-08	TransProQA: an LLM-based literary Translation evaluation metric with Professional Question Answering	Ran Zhang et.al.	2505.05423	link	Kimi
1272	2025-05-08	TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation	Haokun Lin et.al.	2505.05422	link	Kimi
1273	2025-05-08	Reasoning Models Don’t Always Say What They Think	Yanda Chen et.al.	2505.05410	null	Kimi
1274	2025-05-08	Crosslingual Reasoning through Test-Time Scaling	Zheng-Xin Yong et.al.	2505.05408	link	Kimi
1275	2025-05-08	Frame In, Frame Out: Do LLMs Generate More Biased News Headlines than Humans?	Valeria Pastorino et.al.	2505.05406	null	Kimi
1276	2025-05-08	CART-ELC: Oblique Decision Tree Induction via Exhaustive Search	Andrew D. Laack et.al.	2505.05402	link	Kimi
1277	2025-05-08	PillarMamba: Learning Local-Global Context for Roadside Point Cloud via Hybrid State Space Model	Zhang Zhang et.al.	2505.05397	null	Kimi
1278	2025-05-08	EDmamba: A Simple yet Effective Event Denoising Method with State Space Model	Ciyu Ruan et.al.	2505.05391	null	Kimi
1279	2025-05-08	Walrus: An Efficient Decentralized Storage Network	George Danezis et.al.	2505.05370	null	Kimi
1280	2025-05-08	High-fidelity Grain Growth Modeling: Leveraging Deep Learning for Fast Computations	Pungponhavoan Tep et.al.	2505.05354	null	Kimi
1281	2025-05-08	Hearing and Seeing Through CLIP: A Framework for Self-Supervised Sound Source Localization	Sooyoung Park et.al.	2505.05343	link	Kimi
1282	2025-05-08	Progressive Inertial Poser: Progressive Real-Time Kinematic Chain Estimation for 3D Full-Body Pose from Three IMU Sensors	Zunjie Zhu et.al.	2505.05336	null	Kimi
1283	2025-05-08	ICon: In-Context Contribution for Automatic Data Selection	Yixin Yang et.al.	2505.05327	null	Kimi
1284	2025-05-08	Scalable Chain of Thoughts via Elastic Reasoning	Yuhui Xu et.al.	2505.05315	link	Kimi
1285	2025-05-08	T-T: Table Transformer for Tagging-based Aspect Sentiment Triplet Extraction	Kun Peng et.al.	2505.05271	null	Kimi
1286	2025-05-08	Revealing Weaknesses in Text Watermarking Through Self-Information Rewrite Attacks	Yixin Cheng et.al.	2505.05190	link	Kimi
1287	2025-05-08	Biomed-DPT: Dual Modality Prompt Tuning for Biomedical Vision-Language Models	Wei Peng et.al.	2505.05189	link	Kimi
1288	2025-05-08	MARK: Memory Augmented Refinement of Knowledge	Anish Ganguli et.al.	2505.05177	null	Kimi
1289	2025-05-08	X-Driver: Explainable Autonomous Driving with Vision-Language Models	Wei Liu et.al.	2505.05098	null	Kimi
1290	2025-05-08	Latent Preference Coding: Aligning Large Language Models via Discrete Latent Codes	Zhuocheng Gong et.al.	2505.04993	null	Kimi
1291	2025-05-08	Chain-of-Thought Tokens are Computer Program Variables	Fangwei Zhu et.al.	2505.04955	link	Kimi
1292	2025-05-08	Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models	Yunxin Li et.al.	2505.04921	link	Kimi
1293	2025-05-08	An Open-Source Dual-Loss Embedding Model for Semantic Retrieval in Higher Education	Ramteja Sajja et.al.	2505.04916	null	Kimi
1294	2025-05-08	Enigme: Generative Text Puzzles for Evaluating Reasoning in Language Models	John Hawkins et.al.	2505.04914	link	Kimi
1295	2025-05-08	SpatialPrompting: Keyframe-driven Zero-Shot Spatial Reasoning with Off-the-Shelf Multimodal Large Language Models	Shun Taguchi et.al.	2505.04911	null	Kimi
1296	2025-05-08	ConCISE: Confidence-guided Compression in Step-by-step Efficient Reasoning	Ziqing Qiao et.al.	2505.04881	null	Kimi
1297	2025-05-08	GroverGPT-2: Simulating Grover’s Algorithm via Chain-of-Thought Reasoning and Quantum-Native Tokenization	Min Chen et.al.	2505.04880	null	Kimi
1298	2025-05-07	CRAFT: Cultural Russian-Oriented Dataset Adaptation for Focused Text-to-Image Generation	Viacheslav Vasilev et.al.	2505.04851	null	Kimi
1299	2025-05-07	Benchmarking LLM Faithfulness in RAG with Evolving Leaderboards	Manveer Singh Tamber et.al.	2505.04847	link	Kimi
1300	2025-05-07	Large Language Models are Autonomous Cyber Defenders	Sebastián R. Castro et.al.	2505.04843	link	Kimi
1301	2025-05-07	ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling	Xiao Wang et.al.	2505.04802	null	Kimi
1302	2025-05-07	The Promise and Limits of LLMs in Constructing Proofs and Hints for Logic Problems in Intelligent Tutoring Systems	Sutapa Dey Tithi et.al.	2505.04736	null	Kimi
1303	2025-05-07	SOAEsV2-7B/72B: Full-Pipeline Optimization for State-Owned Enterprise LLMs via Continual Pre-Training, Domain-Progressive SFT and Distillation-Enhanced Speculative Decoding	Jingyang Deng et.al.	2505.04723	null	Kimi
1304	2025-05-07	EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning	Zhenghao Xing et.al.	2505.04623	link	Kimi
1305	2025-05-07	ZeroSearch: Incentivize the Search Capability of LLMs without Searching	Hao Sun et.al.	2505.04588	link	Kimi
1306	2025-05-07	Overcoming Data Scarcity in Generative Language Modelling for Low-Resource Languages: A Systematic Review	Josh McGiff et.al.	2505.04531	null	Kimi
1307	2025-05-07	Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs	Yehui Tang et.al.	2505.04519	null	Kimi
1308	2025-05-07	CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation	Jiahao Li et.al.	2505.04481	null	Kimi
1309	2025-05-07	OBLIVIATE: Robust and Practical Machine Unlearning for Large Language Models	Xiaoyu Xu et.al.	2505.04416	null	Kimi
1310	2025-05-07	YABLoCo: Yet Another Benchmark for Long Context Code Generation	Aidar Valeev et.al.	2505.04406	null	Kimi
1311	2025-05-07	The Aloe Family Recipe for Open and Specialized Healthcare LLMs	Dario Garcia-Gasulla et.al.	2505.04388	null	Kimi
1312	2025-05-07	Benchmarking LLMs’ Swarm intelligence	Kai Ruan et.al.	2505.04364	link	Kimi
1313	2025-05-07	GASCADE: Grouped Summarization of Adverse Drug Event for Enhanced Cancer Pharmacovigilance	Sofia Jamil et.al.	2505.04284	link	Kimi
1314	2025-05-07	SToLa: Self-Adaptive Touch-Language Framework with Tactile Commonsense Reasoning in Open-Ended Scenarios	Ning Cheng et.al.	2505.04201	null	Kimi
1315	2025-05-07	VideoPath-LLaVA: Pathology Diagnostic Reasoning Through Video Instruction Tuning	Trinh T. L. Vuong et.al.	2505.04192	link	Kimi
1316	2025-05-07	S3D: Sketch-Driven 3D Model Generation	Hail Song et.al.	2505.04185	link	Kimi
1317	2025-05-07	Large Language Models are often politically extreme, usually ideologically inconsistent, and persuasive even in informational contexts	Nouar Aldahoul et.al.	2505.04171	null	Kimi
1318	2025-05-07	Unmasking the Canvas: A Dynamic Benchmark for Image Generation Jailbreaking and LLM Content Safety	Variath Madhupal Gautham Nair et.al.	2505.04146	null	Kimi
1319	2025-05-07	Enhancing Granular Sentiment Classification with Chain-of-Thought Prompting in Large Language Models	Vihaan Miriyala et.al.	2505.04135	null	Kimi
1320	2025-05-07	LLM-e Guess: Can LLMs Capabilities Advance Without Hardware Progress?	Teddy Foley et.al.	2505.04075	link	Kimi
1321	2025-05-07	Advancing and Benchmarking Personalized Tool Invocation for LLMs	Xu Huang et.al.	2505.04072	link	Kimi
1322	2025-05-06	Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving	Shan Yu et.al.	2505.04021	null	Kimi
1323	2025-05-06	SLOT: Structuring the Output of Large Language Models	Darren Yow-Bang Wang et.al.	2505.04016	null	Kimi
1324	2025-05-06	Can Large Language Models Predict Parallel Code Performance?	Gregory Bolet et.al.	2505.03988	null	Kimi
1325	2025-05-06	X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains	Qianchu Liu et.al.	2505.03981	null	Kimi
1326	2025-05-06	The Power of Stories: Narrative Priming Shapes How LLM Agents Collaborate and Compete	Gerrit Großmann et.al.	2505.03961	link	Kimi
1327	2025-05-06	Frog Soup: Zero-Shot, In-Context, and Sample-Efficient Frogger Agents	Xiang Li et.al.	2505.03947	link	Kimi
1328	2025-05-06	MARCO: A Multi-Agent System for Optimizing HPC Code Generation Using Large Language Models	Asif Rahman et.al.	2505.03906	null	Kimi
1329	2025-05-06	Novel Extraction of Discriminative Fine-Grained Feature to Improve Retinal Vessel Segmentation	Shuang Zeng et.al.	2505.03896	link	Kimi
1330	2025-05-06	VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model	Zuwei Long et.al.	2505.03739	link	Kimi
1331	2025-05-06	WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch	Zimu Lu et.al.	2505.03733	link	Kimi
1332	2025-05-06	Distribution-Conditional Generation: From Class Distribution to Creative Generation	Fu Feng et.al.	2505.03667	null	Kimi
1333	2025-05-06	ReGraP-LLaVA: Reasoning enabled Graph-based Personalized Large Language and Vision Assistant	Yifan Xiang et.al.	2505.03654	link	Kimi
1334	2025-05-06	A Hashgraph-Inspired Consensus Mechanism for Reliable Multi-Model Reasoning	Kolawole E. Ogunsina et.al.	2505.03553	null	Kimi
1335	2025-05-06	Faster MoE LLM Inference for Extremely Large Models	Haoqi Yang et.al.	2505.03531	null	Kimi
1336	2025-05-06	Long-Short Chain-of-Thought Mixture Supervised Fine-Tuning Eliciting Efficient Reasoning in Large Language Models	Bin Yu et.al.	2505.03469	link	Kimi
1337	2025-05-06	The Steganographic Potentials of Language Models	Artem Karpov et.al.	2505.03439	null	Kimi
1338	2025-05-06	Procedural Memory Is Not All You Need: Bridging Cognitive Gaps in LLM-Based Agents	Schaun Wheeler et.al.	2505.03434	null	Kimi
1339	2025-05-06	MedArabiQ: Benchmarking Large Language Models on Arabic Medical Tasks	Mouath Abu Daoud et.al.	2505.03427	link	Kimi
1340	2025-05-06	Lightweight Clinical Decision Support System using QLoRA-Fine-Tuned LLMs and Retrieval-Augmented Generation	Mohammad Shoaib Ansari et.al.	2505.03406	null	Kimi
1341	2025-05-06	Absolute Zero: Reinforced Self-play Reasoning with Zero Data	Andrew Zhao et.al.	2505.03335	link	Kimi
1342	2025-05-06	AI-Driven Scholarly Peer Review via Persistent Workflow Prompting, Meta-Prompting, and Meta-Reasoning	Evgeny Markhasin et.al.	2505.03332	null	Kimi
1343	2025-05-06	Recall with Reasoning: Chain-of-Thought Distillation for Mamba’s Long-Context Memory and Extrapolation	Junyu Ma et.al.	2505.03320	null	Kimi
1344	2025-05-06	SepALM: Audio Language Models Are Error Correctors for Robust Speech Separation	Zhaoxi Mu et.al.	2505.03273	null	Kimi
1345	2025-05-06	RAVU: Retrieval Augmented Video Understanding with Compositional Reasoning over Graph	Sameer Malik et.al.	2505.03173	null	Kimi
1346	2025-05-06	Assessing and Enhancing the Robustness of LLM-based Multi-Agent Systems Through Chaos Engineering	Joshua Owotogbe et.al.	2505.03096	null	Kimi
1347	2025-05-05	Developing A Framework to Support Human Evaluation of Bias in Generated Free Response Text	Jennifer Healey et.al.	2505.03053	null	Kimi
1348	2025-05-05	A Typology of Synthetic Datasets for Dialogue Processing in Clinical Contexts	Steven Bedrick et.al.	2505.03025	null	Kimi
1349	2025-05-05	Memorization or Interpolation ? Detecting LLM Memorization through Input Perturbation Analysis	Albérick Euraste Djiré et.al.	2505.03019	null	Kimi
1350	2025-05-05	RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale	Daniel Goldstein et.al.	2505.03005	link	Kimi
1351	2025-05-05	Generating Narrated Lecture Videos from Slides with Synchronized Highlights	Alexander Holmberg et.al.	2505.02966	null	Kimi
1352	2025-05-05	When Your Own Output Becomes Your Training Data: Noise-to-Meaning Loops and a Formal RSI Trigger	Rintaro Ando et.al.	2505.02888	link	Kimi
1353	2025-05-05	AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation	Qingqiu Li et.al.	2505.02830	null	Kimi
1354	2025-05-05	AutoLibra: Agent Metric Induction from Open-Ended Feedback	Hao Zhu et.al.	2505.02820	link	Kimi
1355	2025-05-05	Knowing You Don’t Know: Learning When to Continue Search in Multi-round RAG through Self-Practicing	Diji Yang et.al.	2505.02811	link	Kimi
1356	2025-05-05	HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning Framework for Large Language Models	Zheng Lin et.al.	2505.02795	null	Kimi
1357	2025-05-05	Bye-bye, Bluebook? Automating Legal Procedure with Large Language Models	Matthew Dahl et.al.	2505.02763	null	Kimi
1358	2025-05-05	Using Knowledge Graphs to harvest datasets for efficient CLIP model training	Simon Ging et.al.	2505.02746	link	Kimi
1359	2025-05-05	FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models	Zhouliang Yu et.al.	2505.02735	link	Kimi
1360	2025-05-05	Enhancing LLMs’ Clinical Reasoning with Real-World Data from a Nationwide Sepsis Registry	Junu Kim et.al.	2505.02722	link	Kimi
1361	2025-05-05	Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play	Yemin Shi et.al.	2505.02707	link	Kimi
1362	2025-05-05	Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models	Xiaobao Wu et.al.	2505.02686	link	Kimi
1363	2025-05-05	A Survey of Slow Thinking-based Reasoning LLMs using Reinforced Learning and Inference-time Scaling Law	Qianjun Pan et.al.	2505.02665	null	Kimi
1364	2025-05-05	Enhancing Chemical Reaction and Retrosynthesis Prediction with Large Language Model and Dual-task Learning	Xuan Lin et.al.	2505.02639	null	Kimi
1365	2025-05-05	LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis	Qingkai Fang et.al.	2505.02625	link	Kimi
1366	2025-05-05	EMORL: Ensemble Multi-Objective Reinforcement Learning for Efficient and Flexible LLM Fine-Tuning	Lingxiao Kong et.al.	2505.02579	link	Kimi
1367	2025-05-05	Bielik v3 Small: Technical Report	Krzysztof Ociepa et.al.	2505.02550	null	Kimi
1368	2025-05-05	Large Language Model Partitioning for Low-Latency Inference at the Edge	Dimitrios Kafetzis et.al.	2505.02533	null	Kimi
1369	2025-05-05	Beyond the model: Key differentiators in large language models and multi-agent services	Muskaan Goyal et.al.	2505.02489	null	Kimi
1370	2025-05-05	Incentivizing Inclusive Contributions in Model Sharing Markets	Enpei Zhang et.al.	2505.02462	null	Kimi
1371	2025-05-05	Colombian Waitresses y Jueces canadienses: Gender and Country Biases in Occupation Recommendations from LLMs	Elisa Forcada Rodríguez et.al.	2505.02456	null	Kimi
1372	2025-05-05	Bielik 11B v2 Technical Report	Krzysztof Ociepa et.al.	2505.02410	null	Kimi
1373	2025-05-05	Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL	Jiarui Yao et.al.	2505.02391	link	Kimi
1374	2025-05-05	RM-R1: Reward Modeling as Reasoning	Xiusi Chen et.al.	2505.02387	link	Kimi
1375	2025-05-05	JTCSE: Joint Tensor-Modulus Constraints and Cross-Attention for Unsupervised Contrastive Learning of Sentence Embeddings	Tianyu Zong et.al.	2505.02366	link	Kimi
1376	2025-05-05	Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression Techniques	Sanjay Surendranath Girija et.al.	2505.02309	null	Kimi
1377	2025-05-05	Generative Sign-description Prompts with Multi-positive Contrastive Learning for Sign Language Recognition	Siyu Liang et.al.	2505.02304	null	Kimi
1378	2025-05-04	Parameter-Efficient Transformer Embeddings	Henry Ndubuaku et.al.	2505.02266	link	Kimi
1379	2025-05-04	SEval-Ex: A Statement-Level Framework for Explainable Summarization Evaluation	Tanguy Herserant et.al.	2505.02235	null	Kimi
1380	2025-05-04	Interpretable Emergent Language Using Inter-Agent Transformers	Mannan Bhardwaj et.al.	2505.02215	link	Kimi
1381	2025-05-04	Leveraging LLMs to Automate Energy-Aware Refactoring of Parallel Scientific Codes	Matthew T. Dearing et.al.	2505.02184	null	Kimi
1382	2025-05-04	Measuring Hong Kong Massive Multi-Task Language Understanding	Chuxue Cao et.al.	2505.02177	null	Kimi
1383	2025-05-04	A New HOPE: Domain-agnostic Automatic Evaluation of Text Chunking	Henrik Brådland et.al.	2505.02171	null	Kimi
1384	2025-05-04	Think on your Feet: Adaptive Thinking via Reinforcement Learning for Social Agents	Minzheng Wang et.al.	2505.02156	link	Kimi
1385	2025-05-01	T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT	Dongzhi Jiang et.al.	2505.00703	link	Kimi
1386	2025-05-01	RayZer: A Self-supervised Large View Synthesis Model	Hanwen Jiang et.al.	2505.00702	null	Kimi
1387	2025-05-01	Robotic Visual Instruction	Yanbang Li et.al.	2505.00693	null	Kimi
1388	2025-05-01	Towards Autonomous Micromobility through Scalable Urban Simulation	Wayne Wu et.al.	2505.00690	null	Kimi
1389	2025-05-01	GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution	Aditya Arora et.al.	2505.00687	null	Kimi
1390	2025-05-01	Visual Test-time Scaling for GUI Agent Grounding	Tiange Luo et.al.	2505.00684	link	Kimi
1391	2025-05-01	MINERVA: Evaluating Complex Video Reasoning	Arsha Nagrani et.al.	2505.00681	link	Kimi
1392	2025-05-01	Steering Large Language Models with Register Analysis for Arbitrary Style Transfer	Xinchen Yang et.al.	2505.00679	null	Kimi
1393	2025-05-01	Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions	Yiming Du et.al.	2505.00675	link	Kimi
1394	2025-05-01	DeepCritic: Deliberate Critique with Large Language Models	Wenkai Yang et.al.	2505.00662	link	Kimi
1395	2025-05-01	On the generalization of language models from in-context learning and finetuning: a controlled study	Andrew K. Lampinen et.al.	2505.00661	null	Kimi
1396	2025-05-01	Large Language Models Understanding: an Inherent Ambiguity Barrier	Daniel N. Nissani et.al.	2505.00654	null	Kimi
1397	2025-05-01	Open-Source LLM-Driven Federated Transformer for Predictive IoV Management	Yazan Otoum et.al.	2505.00651	null	Kimi
1398	2025-05-01	OmicsCL: Unsupervised Contrastive Learning for Cancer Subtype Discovery and Survival Stratification	Atahan Karagoz et.al.	2505.00650	link	Kimi
1399	2025-05-01	Investigating Task Arithmetic for Zero-Shot Information Retrieval	Marco Braga et.al.	2505.00649	link	Kimi
1400	2025-05-01	Deep Learning Assisted Outer Volume Removal for Highly-Accelerated Real-Time Dynamic MRI	Merve Gülle et.al.	2505.00643	null	Kimi
1401	2025-05-01	Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook	Muyi Bao et.al.	2505.00630	link	Kimi
1402	2025-05-01	The Illusion of Role Separation: Hidden Shortcuts in LLM Role Learning (and How to Fix Them)	Zihao Wang et.al.	2505.00626	null	Kimi
1403	2025-05-01	FineScope : Precision Pruning for Domain-Specialized Large Language Models Using SAE-Guided Self-Data Cultivation	Chaitali Bhattacharyya et.al.	2505.00624	null	Kimi
1404	2025-05-01	Pixel3DMM: Versatile Screen-Space Priors for Single-Image 3D Face Reconstruction	Simon Giebenhain et.al.	2505.00615	null	Kimi
1405	2025-05-01	Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation	D. Sculley et.al.	2505.00612	null	Kimi
1406	2025-05-01	Combining LLMs with Logic-Based Framework to Explain MCTS	Ziyan An et.al.	2505.00610	null	Kimi
1407	2025-05-01	Can LLMs Help Improve Analogical Reasoning For Strategic Decisions? Experimental Evidence from Humans and GPT-4	Phanish Puranam et.al.	2505.00603	null	Kimi
1408	2025-05-01	Fast and Low-Cost Genomic Foundation Models via Outlier Removal	Haozheng Luo et.al.	2505.00598	link	Kimi
1409	2025-05-01	A Finite-State Controller Based Offline Solver for Deterministic POMDPs	Alex Schutz et.al.	2505.00596	link	Kimi
1410	2025-05-01	Uncertainty-Aware Multi-Expert Knowledge Distillation for Imbalanced Disease Grading	Shuo Tong et.al.	2505.00592	null	Kimi
1411	2025-05-01	FreqKV: Frequency Domain Key-Value Compression for Efficient Context Window Extension	Jushi Kai et.al.	2505.00570	null	Kimi
1412	2025-05-01	Triggering Hallucinations in LLMs: A Quantitative Study of Prompt-Induced Hallucination in Large Language Models	Makoto Sato et.al.	2505.00557	null	Kimi
1413	2025-05-01	100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models	Chong Zhang et.al.	2505.00551	null	Kimi
1414	2025-05-01	HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection	Deanna Emery et.al.	2505.00506	null	Kimi
1415	2025-05-01	UserCentrix: An Agentic Memory-augmented AI Framework for Smart Spaces	Alaa Saleh et.al.	2505.00472	null	Kimi
1416	2025-05-01	Red Teaming Large Language Models for Healthcare	Vahid Balazadeh et.al.	2505.00467	null	Kimi
1417	2025-05-01	Data Therapist: Eliciting Domain Knowledge from Subject Matter Experts Using Large Language Models	Sungbok Shin et.al.	2505.00455	null	Kimi
1418	2025-05-01	KoACD: The First Korean Adolescent Dataset for Cognitive Distortion Analysis	JunSeo Kim et.al.	2505.00367	null	Kimi
1419	2025-05-01	Enhancing AI-Driven Education: Integrating Cognitive Frameworks, Linguistic Feedback Analysis, and Ethical Considerations for Improved Content Generation	Antoun Yaacoub et.al.	2505.00339	null	Kimi
1420	2025-05-01	Mixture of Sparse Attention: Content-Based Learnable Sparse Attention via Expert-Choice Routing	Piotr Piękos et.al.	2505.00315	link	Kimi
1421	2025-05-01	Fine-grained spatial-temporal perception for gas leak segmentation	Xinlong Zhao et.al.	2505.00295	link	Kimi
1422	2025-05-01	Empowering Agentic Video Analytics Systems with Video Language Models	Yuxuan Yan et.al.	2505.00254	null	Kimi
1423	2025-04-30	Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems	Shaokun Zhang et.al.	2505.00212	link	Kimi
1424	2025-04-30	Detecting and Mitigating Hateful Content in Multimodal Memes with Vision-Language Models	Minh-Hao Van et.al.	2505.00150	null	Kimi
1425	2025-04-30	AdaptMI: Adaptive Skill-based In-context Math Instruction for Small Language Models	Yinghui He et.al.	2505.00147	null	Kimi
1426	2025-04-30	Between Underthinking and Overthinking: An Empirical Study of Reasoning Length and correctness in LLMs	Jinyan Su et.al.	2505.00127	null	Kimi
1427	2025-04-30	Fine-Tuning LLMs for Low-Resource Dialect Translation: The Case of Lebanese	Silvana Yakhni et.al.	2505.00114	link	Kimi
1428	2025-04-30	GDI-Bench: A Benchmark for General Document Intelligence with Vision and Reasoning Decoupling	Siqi Li et.al.	2505.00063	null	Kimi
1429	2025-04-30	TRUST: An LLM-Based Dialogue System for Trauma Understanding and Structured Assessments	Sichang Tu et.al.	2504.21851	null	Kimi
1430	2025-04-30	Early Exit and Multi Stage Knowledge Distillation in VLMs for Video Summarization	Anas Anwarul Haq Khan et.al.	2504.21831	null	Kimi
1431	2025-04-30	DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition	Z. Z. Ren et.al.	2504.21801	link	Kimi
1432	2025-04-30	WebThinker: Empowering Large Reasoning Models with Deep Research Capability	Xiaoxi Li et.al.	2504.21776	link	Kimi
1433	2025-04-30	MAC-Tuning: LLM Multi-Compositional Problem Reasoning with Enhanced Knowledge Boundary Awareness	Junsheng Huang et.al.	2504.21773	null	Kimi
1434	2025-04-30	AdaR1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization	Haotian Luo et.al.	2504.21659	link	Kimi
1435	2025-04-30	Sadeed: Advancing Arabic Diacritization Through Small Language Model	Zeina Aldallal et.al.	2504.21635	null	Kimi
1436	2025-04-30	Meeseeks: An Iterative Benchmark Evaluating LLMs Multi-Turn Instruction-Following Ability	Jiaming Wang et.al.	2504.21625	null	Kimi
1437	2025-04-30	RDF-Based Structured Quality Assessment Representation of Multilingual LLM Evaluations	Jonas Gwozdz et.al.	2504.21605	null	Kimi
1438	2025-04-30	DNB-AI-Project at SemEval-2025 Task 5: An LLM-Ensemble Approach for Automated Subject Indexing	Lisa Kluge et.al.	2504.21589	link	Kimi
1439	2025-04-30	Precision Where It Matters: A Novel Spike Aware Mixed-Precision Quantization Strategy for LLaMA-based Language Models	Lucas Maisonnave et.al.	2504.21553	null	Kimi
1440	2025-04-30	RWKV-X: A Linear Complexity Hybrid Language Model	Haowen Hou et.al.	2504.21463	link	Kimi
1441	2025-04-30	SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding	Chenkai Zhang et.al.	2504.21435	link	Kimi
1442	2025-04-30	Retrieval-Enhanced Few-Shot Prompting for Speech Event Extraction	Máté Gedeon et.al.	2504.21372	null	Kimi
1443	2025-04-30	ShorterBetter: Guiding Reasoning Models to Find Optimal Inference Length for Efficient Reasoning	Jingyang Yi et.al.	2504.21370	null	Kimi
1444	2025-04-30	Revisiting Diffusion Autoencoder Training for Image Reconstruction Quality	Pramook Khungurn et.al.	2504.21368	null	Kimi
1445	2025-04-30	Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing	Hong Zhang et.al.	2504.21356	link	Kimi
1446	2025-04-30	Phi-4-reasoning Technical Report	Marah Abdin et.al.	2504.21318	null	Kimi
1447	2025-04-30	BiasGuard: A Reasoning-enhanced Bias Detection Tool For Large Language Models	Zhiting Fan et.al.	2504.21299	null	Kimi
1448	2025-04-30	Reinforced MLLM: A Survey on RL-Based Reasoning in Multimodal Large Language Models	Guanghao Zhou et.al.	2504.21277	null	Kimi
1449	2025-04-30	Talk Before You Retrieve: Agent-Led Discussions for Better RAG in Medical QA	Xuanzhao Dong et.al.	2504.21252	link	Kimi
1450	2025-04-30	Memorization and Knowledge Injection in Gated LLMs	Xu Pan et.al.	2504.21239	null	Kimi
1451	2025-04-30	Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math	Haoran Xu et.al.	2504.21233	null	Kimi
1452	2025-04-29	CachePrune: Neural-Based Attribution Defense Against Indirect Prompt Injection Attacks	Rui Wang et.al.	2504.21228	null	Kimi
1453	2025-04-29	Automatic Legal Writing Evaluation of LLMs	Ramon Pires et.al.	2504.21202	link	Kimi
1454	2025-04-29	Small or Large? Zero-Shot or Finetuned? Guiding Language Model Choice for Specialized Applications in Healthcare	Lovedeep Gondara et.al.	2504.21191	null	Kimi
1455	2025-04-29	OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification	Shangyu Li et.al.	2504.20964	link	Kimi
1456	2025-04-29	Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models	Maryna Vyshnyvetska et.al.	2504.20951	null	Kimi
1457	2025-04-29	Trace-of-Thought: Enhanced Arithmetic Problem Solving via Reasoning Distillation From Large to Small Language Models	Tyler McDonald et.al.	2504.20946	null	Kimi
1458	2025-04-29	ChestX-Reasoner: Advancing Radiology Foundation Models with Reasoning through Step-by-Step Verification	Ziqing Fan et.al.	2504.20930	link	Kimi
1459	2025-04-29	DYNAMAX: Dynamic computing for Transformers and Mamba based architectures	Miguel Nogales et.al.	2504.20922	null	Kimi
1460	2025-04-29	Using LLMs in Generating Design Rationale for Software Architecture Decisions	Xiyu Zhou et.al.	2504.20781	link	Kimi
1461	2025-04-29	JTreeformer: Graph-Transformer via Latent-Diffusion Model for Molecular Generation	Ji Shi et.al.	2504.20770	null	Kimi
1462	2025-04-29	Chain-of-Defensive-Thought: Structured Reasoning Elicits Robustness in Large Language Models against Reference Corruption	Wenxiao Wang et.al.	2504.20769	null	Kimi
1463	2025-04-29	Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think	Hasan Abed Al Kader Hammoud et.al.	2504.20708	null	Kimi
1464	2025-04-29	Cooking Up Creativity: A Cognitively-Inspired Approach for Enhancing LLM Creativity through Structured Representations	Moran Mizrahi et.al.	2504.20643	link	Kimi
1465	2025-04-29	The Hidden Risks of LLM-Generated Web Application Code: A Security-Centric Evaluation of Code Generation Capabilities in Large Language Models	Swaroop Dora et.al.	2504.20612	null	Kimi
1466	2025-04-29	Reinforcement Learning for Reasoning in Large Language Models with One Training Example	Yiping Wang et.al.	2504.20571	link	Kimi
1467	2025-04-29	UniDetox: Universal Detoxification of Large Language Models via Dataset Distillation	Huimin Lu et.al.	2504.20500	link	Kimi
1468	2025-04-29	Token-Efficient Prompt Injection Attack: Provoking Cessation in LLM Reasoning via Adaptive Token Compression	Yu Cui et.al.	2504.20493	null	Kimi
1469	2025-04-29	A Summary on GUI Agents with Foundation Models Enhanced by Reinforcement Learning	Jiahao Li et.al.	2504.20464	null	Kimi
1470	2025-04-29	Reviving Any-Subset Autoregressive Models with Principled Parallel Sampling and Speculative Decoding	Gabe Guo et.al.	2504.20456	link	Kimi
1471	2025-04-29	GaLore 2: Large-Scale LLM Pre-Training by Gradient Low-Rank Projection	DiJia Su et.al.	2504.20437	null	Kimi
1472	2025-04-29	FiLA-Video: Spatio-Temporal Compression for Fine-Grained Long Video Understanding	Yanan Guo et.al.	2504.20384	null	Kimi
1473	2025-04-29	Local Prompt Optimization	Yash Jain et.al.	2504.20355	null	Kimi
1474	2025-04-29	MicarVLMoE: A Modern Gated Cross-Aligned Vision-Language Mixture of Experts Model for Medical Image Captioning and Report Generation	Amaan Izhar et.al.	2504.20343	link	Kimi
1475	2025-04-28	Enhancing Systematic Reviews with Large Language Models: Using GPT-4 and Kimi	Dandan Chen Kaptur et.al.	2504.20276	null	Kimi
1476	2025-04-28	Can Large Language Models Learn Formal Logic? A Data-Driven Training and Evaluation Framework	Yuan Xia et.al.	2504.20213	null	Kimi
1477	2025-04-28	Weaving Context Across Images: Improving Vision-Language Models through Focus-Centric Visual Chains	Juntian Zhang et.al.	2504.20199	null	Kimi
1478	2025-04-28	MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools	Nishant Subramani et.al.	2504.20168	link	Kimi
1479	2025-04-28	AutoJudge: Judge Decoding Without Manual Annotation	Roman Garipov et.al.	2504.20039	null	Kimi
1480	2025-04-28	Towards Automated Scoping of AI for Social Good Projects	Jacob Emmerson et.al.	2504.20010	null	Kimi
1481	2025-04-28	TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons	Emre Can Acikgoz et.al.	2504.19982	null	Kimi
1482	2025-04-28	Accelerating Mixture-of-Experts Training with Adaptive Expert Replication	Athinagoras Skiadopoulos et.al.	2504.19925	null	Kimi
1483	2025-04-28	Enhancing Surgical Documentation through Multimodal Visual-Temporal Transformers and Generative AI	Hugo Georgenthum et.al.	2504.19918	null	Kimi
1484	2025-04-28	Can AI Agents Design and Implement Drug Discovery Pipelines?	Khachik Smbatyan et.al.	2504.19912	null	Kimi
1485	2025-04-28	GenCLS++: Pushing the Boundaries of Generative Classification in LLMs Through Comprehensive SFT and RL Studies Across Diverse Datasets	Mingqian He et.al.	2504.19898	null	Kimi
1486	2025-04-28	semi-PD: Towards Efficient LLM Serving via Phase-Wise Disaggregated Computation and Unified Storage	Ke Hong et.al.	2504.19867	null	Kimi
1487	2025-04-28	Can a Crow Hatch a Falcon? Lineage Matters in Predicting Large Language Model Performance	Takuya Tamura et.al.	2504.19811	null	Kimi
1488	2025-04-28	Moral Reasoning Across Languages: The Critical Role of Low-Resource Languages in LLMs	Huichi Zhou et.al.	2504.19759	null	Kimi
1489	2025-04-28	Reconstructing Context: Evaluating Advanced Chunking Strategies for Retrieval-Augmented Generation	Carlo Merola et.al.	2504.19754	link	Kimi
1490	2025-04-28	LLM-Assisted Automated Deductive Coding of Dialogue Data: Leveraging Dialogue-Specific Characteristics to Enhance Contextual Understanding	Ying Na et.al.	2504.19734	null	Kimi
1491	2025-04-28	Taming the Titans: A Survey of Efficient LLM Inference Serving	Ranran Zhen et.al.	2504.19720	link	Kimi
1492	2025-04-28	From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review	Mohamed Amine Ferrag et.al.	2504.19678	null	Kimi
1493	2025-04-28	Annif at SemEval-2025 Task 5: Traditional XMTC augmented by LLMs	Osma Suominen et.al.	2504.19675	link	Kimi
1494	2025-04-28	VCM: Vision Concept Modeling Based on Implicit Contrastive Learning with Vision-Language Instruction Fine-Tuning	Run Luo et.al.	2504.19627	null	Kimi
1495	2025-04-28	m-KAILIN: Knowledge-Driven Agentic Scientific Corpus Distillation Framework for Biomedical Large Language Models Training	Meng Xiao et.al.	2504.19565	null	Kimi
1496	2025-04-28	DEEMO: De-identity Multimodal Emotion Recognition and Reasoning	Deng Li et.al.	2504.19549	null	Kimi
1497	2025-04-28	Bullet: Boosting GPU Utilization for LLM Serving via Dynamic Spatial-Temporal Orchestration	Zejia Lin et.al.	2504.19516	null	Kimi
1498	2025-04-28	Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding	Yan Wang et.al.	2504.19500	null	Kimi
1499	2025-04-28	Improving Reasoning Performance in Large Language Models via Representation Engineering	Bertram Højer et.al.	2504.19483	null	Kimi
1500	2025-04-28	BRIDGE: Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text	Jiageng Wu et.al.	2504.19467	link	Kimi
1501	2025-04-28	Towards Long Context Hallucination Detection	Siyi Liu et.al.	2504.19457	null	Kimi
1502	2025-04-28	Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory	Prateek Chhikara et.al.	2504.19413	null	Kimi
1503	2025-04-28	ICL CIPHERS: Quantifying “Learning’’ in In-Context Learning via Substitution Ciphers	Zhouxiang Fang et.al.	2504.19395	null	Kimi
1504	2025-04-27	LLMs for Engineering: Teaching Models to Design High Powered Rockets	Toby Simonds et.al.	2504.19394	null	Kimi
1505	2025-04-27	Unified Multi-Task Learning & Model Fusion for Efficient Language Model Guardrailing	James O’ Neill et.al.	2504.19333	null	Kimi
1506	2025-04-27	Platonic Grounding for Efficient Multimodal Language Models	Moulik Choraria et.al.	2504.19327	null	Kimi
1507	2025-04-27	BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese	Peilin Zhou et.al.	2504.19314	link	Kimi
1508	2025-04-27	AndroidGen: Building an Android Language Agent under Data Scarcity	Hanyu Lai et.al.	2504.19298	link	Kimi
1509	2025-04-24	Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models	Xu Ma et.al.	2504.17789	null	Kimi
1510	2025-04-24	The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs	Piotr Nawrot et.al.	2504.17768	null	Kimi
1511	2025-04-24	Step1X-Edit: A Practical Framework for General Image Editing	Shiyu Liu et.al.	2504.17761	link	Kimi
1512	2025-04-24	Conversational Assistants to support Heart Failure Patients: comparing a Neurosymbolic Architecture with ChatGPT	Anuja Tayal et.al.	2504.17753	null	Kimi
1513	2025-04-24	CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos	Shucheng Gong et.al.	2504.17728	link	Kimi
1514	2025-04-24	Multilingual Performance Biases of Large Language Models in Education	Vansh Gupta et.al.	2504.17720	null	Kimi
1515	2025-04-24	Early Detection of Multidrug Resistance Using Multivariate Time Series Analysis and Interpretable Patient-Similarity Representations	Óscar Escudero-Arnanz et.al.	2504.17717	null	Kimi
1516	2025-04-24	Generative Fields: Uncovering Hierarchical Feature Control for StyleGAN via Inverted Receptive Fields	Zhuo He et.al.	2504.17712	null	Kimi
1517	2025-04-24	Plasma State Monitoring and Disruption Characterization using Multimodal VAEs	Yoeri Poels et.al.	2504.17710	null	Kimi
1518	2025-04-24	Safety in Large Reasoning Models: A Survey	Cheng Wang et.al.	2504.17704	null	Kimi
1519	2025-04-24	Federated Learning: A Survey on Privacy-Preserving Collaborative Intelligence	Edward Collins et.al.	2504.17703	null	Kimi
1520	2025-04-24	Hierarchical and Multimodal Data for Daily Activity Understanding	Ghazal Kaviani et.al.	2504.17696	link	Kimi
1521	2025-04-24	BIM-Constrained Optimization for Accurate Localization and Deviation Correction in Construction Monitoring	Asier Bikandi et.al.	2504.17693	null	Kimi
1522	2025-04-24	Ensemble Bayesian Inference: Leveraging Small Language Models to Achieve LLM-level Accuracy in Profile Matching Tasks	Haru-Tada Sato et.al.	2504.17685	null	Kimi
1523	2025-04-24	INSIGHT: Bridging the Student-Teacher Gap in Times of Large Language Models	Jarne Thys et.al.	2504.17677	null	Kimi
1524	2025-04-24	Energy Considerations of Large Language Model Inference and Efficiency Optimizations	Jared Fernandez et.al.	2504.17674	null	Kimi
1525	2025-04-24	Cross-region Model Training with Communication-Computation Overlapping and Delay Compensation	Ying Zhu et.al.	2504.17672	null	Kimi
1526	2025-04-24	Data-Driven Calibration of Prediction Sets in Large Vision-Language Models Based on Inductive Conformal Prediction	Yuanchang Ye et.al.	2504.17671	null	Kimi
1527	2025-04-24	DiMeR: Disentangled Mesh Reconstruction Model	Lutao Jiang et.al.	2504.17670	link	Kimi
1528	2025-04-24	Towards a HIPAA Compliant Agentic AI System in Healthcare	Subash Neupane et.al.	2504.17669	null	Kimi
1529	2025-04-24	Evaluating Grounded Reasoning by Code-Assisted Large Language Models for Mathematics	Zena Al-Khalili et.al.	2504.17665	null	Kimi
1530	2025-04-24	Aerial Image Classification in Scarce and Unconstrained Environments via Conformal Prediction	Farhad Pourkamali-Anaraki et.al.	2504.17655	null	Kimi
1531	2025-04-24	DeepDistill: Enhancing LLM Reasoning Capabilities via Large-Scale Difficulty-Graded Data Training	Xiaoyu Tian et.al.	2504.17565	null	Kimi
1532	2025-04-24	HalluLens: LLM Hallucination Benchmark	Yejin Bang et.al.	2504.17550	null	Kimi
1533	2025-04-24	A Comprehensive Survey of Knowledge-Based Vision Question Answering Systems: The Lifecycle of Knowledge in Visual Reasoning Task	Jiaqi Deng et.al.	2504.17547	null	Kimi
1534	2025-04-24	Auditing the Ethical Logic of Generative AI Models	W. Russell Neuman et.al.	2504.17544	null	Kimi
1535	2025-04-24	Unified Attacks to Large Language Model Watermarks: Spoofing and Scrubbing in Unauthorized Knowledge Distillation	Xin Yi et.al.	2504.17480	null	Kimi
1536	2025-04-24	FRAG: Frame Selection Augmented Generation for Long Video and Long Document Understanding	De-An Huang et.al.	2504.17447	link	Kimi
1537	2025-04-24	Assessing the Capability of Large Language Models for Domain-Specific Ontology Generation	Anna Sofia Lippolis et.al.	2504.17402	null	Kimi
1538	2025-04-24	LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live Streams	Yongxuan Wu et.al.	2504.17366	link	Kimi
1539	2025-04-24	TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation	Ling You et.al.	2504.17365	null	Kimi
1540	2025-04-24	FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation	Yulia Otmakhova et.al.	2504.17311	null	Kimi
1541	2025-04-24	JurisCTC: Enhancing Legal Judgment Prediction via Cross-Domain Transfer and Contrastive Learning	Zhaolu Kang et.al.	2504.17264	null	Kimi
1542	2025-04-24	MCAF: Efficient Agent-based Video Understanding Framework through Multimodal Coarse-to-Fine Attention Focusing	Shiwen Cao et.al.	2504.17213	null	Kimi
1543	2025-04-24	A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation	Yangxinyu Xie et.al.	2504.17200	null	Kimi
1544	2025-04-24	Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning	Minju Seo et.al.	2504.17192	link	Kimi
1545	2025-04-23	MIRAGE: A Metric-Intensive Benchmark for Retrieval-Augmented Generation Evaluation	Chanhee Park et.al.	2504.17137	null	Kimi
1546	2025-04-23	Steering the CensorShip: Uncovering Representation Vectors for LLM “Thought” Control	Hannah Cyberey et.al.	2504.17130	link	Kimi
1547	2025-04-23	The Rise of Small Language Models in Healthcare: A Comprehensive Survey	Muskan Garg et.al.	2504.17119	null	Kimi
1548	2025-04-23	Leveraging LLMs as Meta-Judges: A Multi-Agent Framework for Evaluating LLM Judgments	Yuran Li et.al.	2504.17087	null	Kimi
1549	2025-04-23	DyMU: Dynamic Merging and Virtual Unmerging for Efficient VLMs	Zhenhailong Wang et.al.	2504.17040	null	Kimi
1550	2025-04-23	(Im)possibility of Automated Hallucination Detection in Large Language Models	Amin Karbasi et.al.	2504.17004	null	Kimi
1551	2025-04-23	Tracing Thought: Using Chain-of-Thought Reasoning to Identify the LLM Behind AI-Generated Text	Shifali Agrahari et.al.	2504.16913	null	Kimi
1552	2025-04-23	Do Large Language Models know who did what to whom?	Joseph M. Denning et.al.	2504.16884	null	Kimi
1553	2025-04-23	Monte Carlo Planning with Large Language Model for Text-Based Game Agents	Zijing Shi et.al.	2504.16855	null	Kimi
1554	2025-04-23	GreenMind: A Next-Generation Vietnamese Large Language Model for Structured and Logical Reasoning	Luu Quy Tung et.al.	2504.16832	null	Kimi
1555	2025-04-23	Process Reward Models That Think	Muhammad Khalifa et.al.	2504.16828	link	Kimi
1556	2025-04-23	Random Long-Context Access for Mamba via Hardware-aligned Hierarchical Sparse Attention	Xiang Hu et.al.	2504.16795	null	Kimi
1557	2025-04-23	Towards Explainable AI: Multi-Modal Transformer for Video-based Image Description Generation	Lakshita Agarwal et.al.	2504.16788	null	Kimi
1558	2025-04-23	MOOSComp: Improving Lightweight Long-Context Compressor via Mitigating Over-Smoothing and Incorporating Outlier Scores	Fengwei Zhou et.al.	2504.16786	null	Kimi
1559	2025-04-23	How Effective are Generative Large Language Models in Performing Requirements Classification?	Waad Alhoshan et.al.	2504.16768	null	Kimi
1560	2025-04-23	Lightweight Latent Verifiers for Efficient Meta-Generation Strategies	Bartosz Piotrowski et.al.	2504.16760	null	Kimi
1561	2025-04-23	HEMA : A Hippocampus-Inspired Extended Memory Architecture for Long-Context AI Conversations	Kwangseob Ahn et.al.	2504.16754	null	Kimi
1562	2025-04-23	IRIS: Interactive Research Ideation System for Accelerating Scientific Discovery	Aniketh Garikaparthi et.al.	2504.16728	link	Kimi
1563	2025-04-23	Debunking with Dialogue? Exploring AI-Generated Counterspeech to Challenge Conspiracy Theories	Mareike Lisker et.al.	2504.16604	null	Kimi
1564	2025-04-23	Comparing Large Language Models and Traditional Machine Translation Tools for Translating Medical Consultation Summaries: A Pilot Study	Andy Li et.al.	2504.16601	null	Kimi
1565	2025-04-23	PIS: Linking Importance Sampling and Attention Mechanisms for Efficient Prompt Compression	Lizhe Chen et.al.	2504.16574	null	Kimi
1566	2025-04-23	Amplified Vulnerabilities: Structured Jailbreak Attacks on LLM-based Multi-Agent Debate	Senmao Qi et.al.	2504.16489	null	Kimi
1567	2025-04-23	Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark	Hanlei Zhang et.al.	2504.16427	link	Kimi
1568	2025-04-23	Evaluating Multi-Hop Reasoning in Large Language Models: A Chemistry-Centric Case Study	Mohammad Khodadad et.al.	2504.16414	null	Kimi
1569	2025-04-23	ConTextual: Improving Clinical Text Summarization in LLMs with Context-preserving Token Filtering and Knowledge Graphs	Fahmida Liza Piya et.al.	2504.16394	link	Kimi
1570	2025-04-23	SplitReason: Learning To Offload Reasoning	Yash Akhauri et.al.	2504.16379	null	Kimi
1571	2025-04-23	Text-to-TrajVis: Enabling Trajectory Data Visualizations from Natural Language Questions	Tian Bai et.al.	2504.16358	null	Kimi
1572	2025-04-23	DP2FL: Dual Prompt Personalized Federated Learning in Foundation Models	Ying Chang et.al.	2504.16357	null	Kimi
1573	2025-04-22	The Paradox of Poetic Intent in Back-Translation: Evaluating the Quality of Large Language Models in Chinese Translation	Li Weigang et.al.	2504.16286	null	Kimi
1574	2025-04-22	FinNLI: Novel Dataset for Multi-Genre Financial Natural Language Inference Benchmarking	Jabez Magomere et.al.	2504.16188	null	Kimi
1575	2025-04-22	MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention	Yucheng Li et.al.	2504.16083	null	Kimi
1576	2025-04-22	MR. Video: “MapReduce” is the Principle for Long Video Understanding	Ziqi Pang et.al.	2504.16082	null	Kimi
1577	2025-04-22	LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities	Thomas Schmied et.al.	2504.16078	null	Kimi
1578	2025-04-22	LongMamba: Enhancing Mamba’s Long Context Capabilities via Training-Free Receptive Field Enlargement	Zhifan Ye et.al.	2504.16053	link	Kimi
1579	2025-04-22	Benchmarking LLM for Code Smells Detection: OpenAI GPT-4.0 vs DeepSeek-V3	Ahmed R. Sadik et.al.	2504.16027	null	Kimi
1580	2025-04-23	CAPO: Cost-Aware Prompt Optimization	Tom Zehle et.al.	2504.16005	link	Kimi
1581	2025-04-22	FairTranslate: An English-French Dataset for Gender Bias Evaluation in Machine Translation by Overcoming Gender Binarity	Fanny Jourdan et.al.	2504.15941	link	Kimi
1582	2025-04-22	Impact of Noise on LLM-Models Performance in Abstraction and Reasoning Corpus (ARC) Tasks with Model Temperature Considerations	Nikhil Khandalkar et.al.	2504.15903	null	Kimi
1583	2025-04-22	SARI: Structured Audio Reasoning via Curriculum-Guided Reinforcement Learning	Cheng Wen et.al.	2504.15900	null	Kimi
1584	2025-04-22	Dynamic Early Exit in Reasoning Models	Chenxu Yang et.al.	2504.15895	link	Kimi
1585	2025-04-22	What’s the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token Patterns	Michael A. Hedderich et.al.	2504.15815	link	Kimi
1586	2025-04-22	A closer look at how large language models trust humans: patterns and biases	Valeria Lerman et.al.	2504.15801	null	Kimi
1587	2025-04-22	Automated Creativity Evaluation for Large Language Models: A Reference-Based Approach	Ruizhe Li et.al.	2504.15784	null	Kimi
1588	2025-04-22	TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving	Daocheng Fu et.al.	2504.15780	null	Kimi
1589	2025-04-22	DianJin-R1: Evaluating and Enhancing Financial Reasoning in Large Language Models	Jie Zhu et.al.	2504.15716	link	Kimi
1590	2025-04-22	Cost-Effective Text Clustering with Large Language Models	Hongtao Wang et.al.	2504.15640	null	Kimi
1591	2025-04-22	DR.FIX: Automatically Fixing Data Races at Industry Scale	Farnaz Behrang et.al.	2504.15637	link	Kimi
1592	2025-04-22	Exploiting Contextual Knowledge in LLMs through V-usable Information based Layer Enhancement	Xiaowei Yuan et.al.	2504.15630	null	Kimi
1593	2025-04-22	A Multi-Agent Framework for Automated Qinqiang Opera Script Generation Using Large Language Models	Gengxian Cao et.al.	2504.15552	null	Kimi
1594	2025-04-22	llm-jp-modernbert: A ModernBERT Model Trained on a Large-Scale Japanese Corpus with Long Context Length	Issa Sugiura et.al.	2504.15544	null	Kimi
1595	2025-04-22	Compass-V2 Technical Report	Sophia Maria et.al.	2504.15527	null	Kimi
1596	2025-04-21	CAPTURe: Evaluating Spatial Reasoning in Vision Language Models via Occluded Object Counting	Atin Pothiraj et.al.	2504.15485	null	Kimi
1597	2025-04-21	Speculative Sampling via Exponential Races	Szymon Kobus et.al.	2504.15475	null	Kimi
1598	2025-04-21	Trillion 7B Technical Report	Sungjun Han et.al.	2504.15431	null	Kimi
1599	2025-04-21	LLM-Assisted Translation of Legacy FORTRAN Codes to C++: A Cross-Platform Study	Nishath Rajiv Ranasinghe et.al.	2504.15424	null	Kimi
1600	2025-04-21	IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs	David Ma et.al.	2504.15415	link	Kimi
1601	2025-04-21	Tell Me What You Know About Sexism: Expert-LLM Interaction Strategies and Co-Created Definitions for Zero-Shot Sexism Detection	Myrthe Reuver et.al.	2504.15392	link	Kimi
1602	2025-04-21	Towards Understanding Camera Motions in Any Video	Zhiqiu Lin et.al.	2504.15376	null	Kimi
1603	2025-04-21	KeDiff: Key Similarity-Based KV Cache Eviction for Long-Context LLM Inference in Resource-Constrained Environments	Junyoung Park et.al.	2504.15364	null	Kimi
1604	2025-04-21	Exploring Compositional Generalization (in ReCOGS_pos) by Transformers using Restricted Access Sequence Processing (RASP)	William Bruns et.al.	2504.15349	null	Kimi
1605	2025-04-21	Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs	Chun-Hsiao Yeh et.al.	2504.15280	link	Kimi
1606	2025-04-21	FlowReasoner: Reinforcing Query-Level Meta-Agents	Hongcheng Gao et.al.	2504.15257	link	Kimi
1607	2025-04-21	Support Evaluation for the TREC 2024 RAG Track: Comparing Human versus LLM Judges	Nandan Thakur et.al.	2504.15205	null	Kimi
1608	2025-04-21	The Synthetic Imputation Approach: Generating Optimal Synthetic Texts For Underrepresented Categories In Supervised Classification Tasks	Joan C. Timoneda et.al.	2504.15160	null	Kimi
1609	2025-04-21	EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models	Ziwen Xu et.al.	2504.15133	link	Kimi
1610	2025-04-21	Kuwain 1.5B: An Arabic SLM via Language Injection	Khalil Hennara et.al.	2504.15120	null	Kimi
1611	2025-04-21	A triple-branch network for latent fingerprint enhancement guided by orientation fields and minutiae	Yurun Wang et.al.	2504.15105	null	Kimi
1612	2025-04-21	Rethinking the Potential of Multimodality in Collaborative Problem Solving Diagnosis with Large Language Models	K. Wong et.al.	2504.15093	null	Kimi
1613	2025-04-21	DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation	Weijie He et.al.	2504.15032	null	Kimi
1614	2025-04-21	Efficient Pretraining Length Scaling	Bohong Wu et.al.	2504.14992	null	Kimi
1615	2025-04-21	Speaker Fuzzy Fingerprints: Benchmarking Text-Based Identification in Multiparty Dialogues	Rui Ribeiro et.al.	2504.14963	null	Kimi
1616	2025-04-21	MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core	Dennis Liu et.al.	2504.14960	null	Kimi
1617	2025-04-21	EducationQ: Evaluating LLMs’ Teaching Capabilities Through Multi-Agent Dialogue Framework	Yao Shi et.al.	2504.14928	null	Kimi
1618	2025-04-21	CRAVE: A Conflicting Reasoning Approach for Explainable Claim Verification Using LLMs	Yingming Zheng et.al.	2504.14905	link	Kimi
1619	2025-04-21	Latent Bayesian Optimization via Autoregressive Normalizing Flows	Seunghun Lee et.al.	2504.14889	null	Kimi
1620	2025-04-21	Natural Fingerprints of Large Language Models	Teppei Suzuki et.al.	2504.14871	null	Kimi
1621	2025-04-21	OTC: Optimal Tool Calls via Reinforcement Learning	Hongru Wang et.al.	2504.14870	null	Kimi
1622	2025-04-21	ECViT: Efficient Convolutional Vision Transformer with Local-Attention and Multi-scale Stages	Zhoujie Qian et.al.	2504.14825	null	Kimi
1623	2025-04-21	On Self-improving Token Embeddings	Mario M. Kubek et.al.	2504.14808	null	Kimi
1624	2025-04-21	Automatic Evaluation Metrics for Document-level Translation: Overview, Challenges and Trends	Jiaxin GUO et.al.	2504.14804	null	Kimi
1625	2025-04-21	gLLM: Global Balanced Pipeline Parallelism System for Distributed LLM Serving with Token Throttling	Tianyu Guo et.al.	2504.14775	link	Kimi
1626	2025-04-21	PLANET: A Collection of Benchmarks for Evaluating LLMs’ Planning Capabilities	Haoming Li et.al.	2504.14773	null	Kimi
1627	2025-04-20	Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions	Luyang Fang et.al.	2504.14772	null	Kimi
1628	2025-04-20	SWE-Synth: Synthesizing Verifiable Bug-Fix Data to Enable Large Language Models in Resolving Real-World Bugs	Minh V. T. Pham et.al.	2504.14757	null	Kimi
1629	2025-04-20	PROMPTEVALS: A Dataset of Assertions and Guardrails for Custom Production Large Language Model Pipelines	Reya Vir et.al.	2504.14738	null	Kimi
1630	2025-04-20	AI with Emotions: Exploring Emotional Expressions in Large Language Models	Shin-nosuke Ishikawa et.al.	2504.14706	null	Kimi
1631	2025-04-20	Video-MMLU: A Massive Multi-Discipline Lecture Understanding Benchmark	Enxin Song et.al.	2504.14693	link	Kimi
1632	2025-04-20	FarsEval-PKBETS: A new diverse benchmark for evaluating Persian large language models	Mehrnoush Shamsfard et.al.	2504.14690	null	Kimi
1633	2025-04-20	Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens	Kaihang Pan et.al.	2504.14666	null	Kimi
1634	2025-04-20	A Case Study Exploring the Current Landscape of Synthetic Medical Record Generation with Commercial LLMs	Yihan Lin et.al.	2504.14657	null	Kimi
1635	2025-04-17	PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding	Jang Hyun Cho et.al.	2504.13180	link	Kimi
1636	2025-04-17	Single-Shot Shape and Reflectance with Spatial Polarization Multiplexing	Tomoki Ichikawa et.al.	2504.13177	null	Kimi
1637	2025-04-17	It’s All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization	Ali Behrouz et.al.	2504.13173	null	Kimi
1638	2025-04-17	Sleep-time Compute: Beyond Inference Scaling at Test-time	Kevin Lin et.al.	2504.13171	link	Kimi
1639	2025-04-17	Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling	Tsung-Han Wu et.al.	2504.13169	link	Kimi
1640	2025-04-17	CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training	Shizhe Diao et.al.	2504.13161	null	Kimi
1641	2025-04-17	MIB: A Mechanistic Interpretability Benchmark	Aaron Mueller et.al.	2504.13151	link	Kimi
1642	2025-04-17	Readable Twins of Unreadable Models	Krzysztof Pancerz et.al.	2504.13150	link	Kimi
1643	2025-04-17	Antidistillation Sampling	Yash Savani et.al.	2504.13146	null	Kimi
1644	2025-04-17	Exploring Expert Failures Improves LLM Agent Tuning	Li-Cheng Lan et.al.	2504.13145	null	Kimi
1645	2025-04-17	$\texttt{Complex-Edit}$ : CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark	Siwei Yang et.al.	2504.13143	null	Kimi
1646	2025-04-17	Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo	João Loula et.al.	2504.13139	null	Kimi
1647	2025-04-17	Energy-Based Reward Models for Robust Language Model Alignment	Anamika Lochab et.al.	2504.13134	link	Kimi
1648	2025-04-17	Science-T2I: Addressing Scientific Illusions in Image Synthesis	Jialuo Li et.al.	2504.13129	null	Kimi
1649	2025-04-17	LLMs Meet Finance: Fine-Tuning Foundation Models for the Open FinLLM Leaderboard	Varun Rao et.al.	2504.13125	null	Kimi
1650	2025-04-17	Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training	Xinsong Zhang et.al.	2504.13123	null	Kimi
1651	2025-04-17	VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models	Haojian Huang et.al.	2504.13122	link	Kimi
1652	2025-04-17	Probing and Inducing Combinational Creativity in Vision-Language Models	Yongqian Peng et.al.	2504.13120	null	Kimi
1653	2025-04-17	EventVAD: Training-Free Event-Aware Video Anomaly Detection	Yihua Shao et.al.	2504.13092	null	Kimi
1654	2025-04-17	Retrieval-Augmented Generation with Conflicting Evidence	Han Wang et.al.	2504.13079	link	Kimi
1655	2025-04-17	Enhancing Person-to-Person Virtual Try-On with Multi-Garment Virtual Try-Off	Riza Velioglu et.al.	2504.13078	link	Kimi
1656	2025-04-17	SkyReels-V2: Infinite-length Film Generative Model	Guibin Chen et.al.	2504.13074	link	Kimi
1657	2025-04-17	Accuracy is Not Agreement: Expert-Aligned Evaluation of Crash Narrative Classification Models	Sudesh Ramesh Bhagat et.al.	2504.13068	null	Kimi
1658	2025-04-17	RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins	Yao Mu et.al.	2504.13059	null	Kimi
1659	2025-04-17	Aspect-Based Summarization with Self-Aspect Retrieval Enhanced Generation	Yichao Feng et.al.	2504.13054	null	Kimi
1660	2025-04-17	How Large Language Models Are Changing MOOC Essay Answers: A Comparison of Pre- and Post-LLM Responses	Leo Leppänen et.al.	2504.13038	null	Kimi
1661	2025-04-17	Towards Cardiac MRI Foundation Models: Comprehensive Visual-Tabular Representations for Whole-Heart Assessment and Beyond	Yundi Zhang et.al.	2504.13037	link	Kimi
1662	2025-04-17	InstructRAG: Leveraging Retrieval-Augmented Generation on Instruction Graphs for LLM-Based Task Planning	Zheng Wang et.al.	2504.13032	null	Kimi
1663	2025-04-17	ChatEXAONEPath: An Expert-level Multimodal Large Language Model for Histopathology Using Whole Slide Images	Sangwook Kim et.al.	2504.13023	null	Kimi
1664	2025-04-17	Pose and Facial Expression Transfer by using StyleGAN	Petr Jahoda et.al.	2504.13021	null	Kimi
1665	2025-04-17	SHA256 at SemEval-2025 Task 4: Selective Amnesia – Constrained Unlearning for Large Language Models via Knowledge Isolation	Saransh Agrawal et.al.	2504.12996	link	Kimi
1666	2025-04-17	Are Retrials All You Need? Enhancing Large Language Model Reasoning Without Verbalized Feedback	Nearchos Potamitis et.al.	2504.12951	null	Kimi
1667	2025-04-17	Information Gain-Guided Causal Intervention for Autonomous Debiasing Large Language Models	Zhouhao Sun et.al.	2504.12898	null	Kimi
1668	2025-04-17	EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting	Guanrou Yang et.al.	2504.12867	null	Kimi
1669	2025-04-17	Can LLMs reason over extended multilingual contexts? Towards long-context evaluation beyond retrieval and haystacks	Amey Hengle et.al.	2504.12845	link	Kimi
1670	2025-04-17	Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration	Yicheng Pan et.al.	2504.12773	link	Kimi
1671	2025-04-17	Pandora: A Code-Driven Large Language Model Agent for Unified Reasoning Across Diverse Structured Knowledge	Yongrui Chen et.al.	2504.12734	null	Kimi
1672	2025-04-17	Why and How LLMs Hallucinate: Connecting the Dots with Subsequence Associations	Yiyou Sun et.al.	2504.12691	link	Kimi
1673	2025-04-17	Data-efficient LLM Fine-tuning for Code Generation	Weijie Lv et.al.	2504.12687	link	Kimi
1674	2025-04-17	Scaling Instruction-Tuned LLMs to Million-Token Contexts via Hierarchical Synthetic Data Generation	Linda He et.al.	2504.12637	null	Kimi
1675	2025-04-17	Identifying and Mitigating the Influence of the Prior Distribution in Large Language Models	Liyi Zhang et.al.	2504.12585	link	Kimi
1676	2025-04-17	MetaSynth: Meta-Prompting-Driven Agentic Scaffolds for Diverse Synthetic Data Generation	Haris Riaz et.al.	2504.12563	null	Kimi
1677	2025-04-17	ZeroSumEval: Scaling LLM Evaluation with Inter-Model Competition	Haidar Khan et.al.	2504.12562	link	Kimi
1678	2025-04-17	Memorization: A Close Look at Books	Iris Ma et.al.	2504.12549	null	Kimi
1679	2025-04-16	MOM: Memory-Efficient Offloaded Mini-Sequence Inference for Long Context Language Models	Junyang Zhang et.al.	2504.12526	null	Kimi
1680	2025-04-16	Memorization vs. Reasoning: Updating LLMs with New Knowledge	Aochong Oliver Li et.al.	2504.12523	null	Kimi
1681	2025-04-16	Towards Conversational AI for Human-Machine Collaborative MLOps	George Fatouros et.al.	2504.12477	null	Kimi
1682	2025-04-16	Integrating Structural and Semantic Signals in Text-Attributed Graphs with BiGTex	Azadeh Beiranvand et.al.	2504.12474	link	Kimi
1683	2025-04-16	Dense Backpropagation Improves Training for Sparse Mixture-of-Experts	Ashwinee Panda et.al.	2504.12463	link	Kimi
1684	2025-04-16	Activated LoRA: Fine-tuned LLMs for Intrinsics	Kristjan Greenewald et.al.	2504.12397	link	Kimi
1685	2025-04-16	BitNet b1.58 2B4T Technical Report	Shuming Ma et.al.	2504.12285	null	Kimi
1686	2025-04-16	How Do I Do That? Synthesizing 3D Hand Motion and Contacts for Everyday Interactions	Aditya Prakash et.al.	2504.12284	null	Kimi
1687	2025-04-16	FLIP Reasoning Challenge	Andreas Plesner et.al.	2504.12256	link	Kimi
1688	2025-04-16	What Do Large Language Models Know? Tacit Knowledge as a Potential Causal-Explanatory Structure	Céline Budding et.al.	2504.12187	null	Kimi
1689	2025-04-16	SALAD: Improving Robustness and Generalization through Contrastive Learning with Structure-Aware and LLM-Driven Augmented Data	Suyoung Bae et.al.	2504.12185	null	Kimi
1690	2025-04-16	Efficient Contrastive Decoding with Probabilistic Hallucination Detection - Mitigating Hallucinations in Large Vision Language Models -	Laura Fieback et.al.	2504.12137	null	Kimi
1691	2025-04-16	Reasoning-Based AI for Startup Evaluation (R.A.I.S.E.): A Memory-Augmented, Multi-Step Decision Framework	Jack Preuveneers et.al.	2504.12090	null	Kimi
1692	2025-04-16	Purposefully Induced Psychosis (PIP): Embracing Hallucination as Imagination in Large Language Models	Kris Pilcher et.al.	2504.12012	null	Kimi
1693	2025-04-16	Generative Recommendation with Continuous-Token Diffusion	Haohao Qu et.al.	2504.12007	null	Kimi
1694	2025-04-16	Language Models as Quasi-Crystalline Thought: Structure, Constraint, and Emergence in Generative Systems	Jose Manuel Guevara-Vela et.al.	2504.11986	null	Kimi
1695	2025-04-16	ADAT: Time-Series-Aware Adaptive Transformer Architecture for Sign Language Translation	Nada Shahin et.al.	2504.11942	null	Kimi
1696	2025-04-16	Rethinking the Generation of High-Quality CoT Data from the Perspective of LLM-Adaptive Question Difficulty Grading	Qianjin Yu et.al.	2504.11919	null	Kimi
1697	2025-04-16	Evaluating the Goal-Directedness of Large Language Models	Tom Everitt et.al.	2504.11844	link	Kimi
1698	2025-04-16	FiSMiness: A Finite State Machine Based Paradigm for Emotional Support Conversations	Yue Zhao et.al.	2504.11837	null	Kimi
1699	2025-04-16	Déjà Vu: Multilingual LLM Evaluation through the Lens of Machine Translation Evaluation	Julia Kreutzer et.al.	2504.11829	null	Kimi
1700	2025-04-16	Cost-Efficient LLM Serving in the Cloud: VM Selection with KV Cache Offloading	Kihyun Kim et.al.	2504.11816	link	Kimi
1701	2025-04-16	Selective Attention Federated Learning: Improving Privacy and Efficiency for Clinical Text Classification	Yue Li et.al.	2504.11793	null	Kimi
1702	2025-04-16	Enhancing Web Agents with Explicit Rollback Mechanisms	Zhisong Zhang et.al.	2504.11788	null	Kimi
1703	2025-04-16	Shared Disk KV Cache Management for Efficient Multi-Instance Inference in RAG-Powered LLMs	Hyungwoo Lee et.al.	2504.11765	null	Kimi
1704	2025-04-16	Characterizing and Optimizing LLM Inference Workloads on CPU-GPU Coupled Architectures	Prabhu Vellaisamy et.al.	2504.11750	null	Kimi
1705	2025-04-16	Can GPT tell us why these images are synthesized? Empowering Multimodal Large Language Models for Forensics	Yiran He et.al.	2504.11686	null	Kimi
1706	2025-04-16	Steering Prosocial AI Agents: Computational Basis of LLM’s Decision Making in Social Simulation	Ji Ma et.al.	2504.11671	null	Kimi
1707	2025-04-15	GraphicBench: A Planning Benchmark for Graphic Design with Language Agents	Dayeon Ki et.al.	2504.11571	null	Kimi
1708	2025-04-15	ReTool: Reinforcement Learning for Strategic Tool Use in LLMs	Jiazhan Feng et.al.	2504.11536	link	Kimi
1709	2025-04-15	HypoBench: Towards Systematic and Principled Benchmarking for Hypothesis Generation	Haokun Liu et.al.	2504.11524	null	Kimi
1710	2025-04-15	TextArena	Leon Guertler et.al.	2504.11442	link	Kimi
1711	2025-04-15	A Dual-Space Framework for General Knowledge Distillation of Large Language Models	Xue Zhang et.al.	2504.11426	null	Kimi
1712	2025-04-15	A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce	Wei Xiong et.al.	2504.11343	link	Kimi
1713	2025-04-15	Transformer-Based Model for Cold Start Mitigation in FaaS Architecture	Alexandre Savi Fayam Mbala Mouen et.al.	2504.11338	null	Kimi
1714	2025-04-15	Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints	Ruicheng Ao et.al.	2504.11320	link	Kimi
1715	2025-04-15	Nondeterministic Polynomial-time Problem Challenge: An Ever-Scaling Reasoning Benchmark for LLMs	Chang Yang et.al.	2504.11239	link	Kimi
1716	2025-04-15	Video Summarization with Large Language Models	Min Jung Lee et.al.	2504.11199	null	Kimi
1717	2025-04-15	Benchmarking Next-Generation Reasoning-Focused Large Language Models in Ophthalmology: A Head-to-Head Evaluation on 5,888 Items	Minjie Zou et.al.	2504.11186	null	Kimi
1718	2025-04-15	DeepMLF: Multimodal language model with learnable tokens for deep fusion in sentiment analysis	Efthymios Georgiou et.al.	2504.11082	null	Kimi
1719	2025-04-15	Dynamic Compressing Prompts for Efficient Inference of Large Language Models	Jinwu Hu et.al.	2504.11004	null	Kimi
1720	2025-04-15	Efficient Reasoning Models: A Survey	Sicheng Feng et.al.	2504.10903	link	Kimi
1721	2025-04-15	ARise: Towards Knowledge-Augmented Reasoning via Risk-Adaptive Search	Yize Zhang et.al.	2504.10893	null	Kimi
1722	2025-04-15	Large Language Model-Informed Feature Discovery Improves Prediction and Interpretation of Credibility Perceptions of Visual Content	Yilang Peng et.al.	2504.10878	null	Kimi
1723	2025-04-15	Moving Beyond Next-Token Prediction: Transformers are Context-Sensitive Language Generators	Phill Kyu Rhee et.al.	2504.10845	null	Kimi
1724	2025-04-15	LayoutCoT: Unleashing the Deep Reasoning Potential of Large Language Models for Layout Generation	Hengyu Shi et.al.	2504.10829	null	Kimi
1725	2025-04-15	CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives	Ayoung Lee et.al.	2504.10823	null	Kimi
1726	2025-04-14	How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients	Ming Li et.al.	2504.10766	link	Kimi
1727	2025-04-14	ReasonDrive: Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models	Amirhosein Chahe et.al.	2504.10757	link	Kimi
1728	2025-04-14	CleanMAP: Distilling Multimodal LLMs for Confidence-Driven Crowdsourced HD Map Updates	Ankit Kumar Shaw et.al.	2504.10738	null	Kimi
1729	2025-04-14	HELIOS: Adaptive Model And Early-Exit Selection for Efficient LLM Inference Serving	Avinash Kumar et.al.	2504.10724	null	Kimi
1730	2025-04-14	Weight-of-Thought Reasoning: Exploring Neural Network Weights for Enhanced LLM Reasoning	Saif Punjwani et.al.	2504.10646	link	Kimi
1731	2025-04-14	Beyond Chains of Thought: Benchmarking Latent-Space Reasoning Abilities in Large Language Models	Thilo Hagendorff et.al.	2504.10615	null	Kimi
1732	2025-04-15	GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents	Xiaobo Xia et.al.	2504.10458	null	Kimi
1733	2025-04-14	RealWebAssist: A Benchmark for Long-Horizon Web Assistance with Real-World Users	Suyu Ye et.al.	2504.10445	link	Kimi
1734	2025-04-14	Multimodal Long Video Modeling Based on Temporal Dynamic Context	Haoran Hao et.al.	2504.10443	link	Kimi
1735	2025-04-14	LLM Can be a Dangerous Persuader: Empirical Study of Persuasion Safety in Large Language Models	Minqian Liu et.al.	2504.10430	null	Kimi
1736	2025-04-14	LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models	Parshin Shojaee et.al.	2504.10415	link	Kimi
1737	2025-04-14	Performance of Large Language Models in Supporting Medical Diagnosis and Treatment	Diogo Sousa et.al.	2504.10405	null	Kimi
1738	2025-04-14	Forecasting from Clinical Textual Time Series: Adaptations of the Encoder and Decoder Language Model Families	Shahriar Noroozizadeh et.al.	2504.10340	null	Kimi
1739	2025-04-14	Heimdall: test-time scaling on the generative verification	Wenlei Shi et.al.	2504.10337	null	Kimi
1740	2025-04-14	AlayaDB: The Data Foundation for Efficient and Effective Long-context LLM Inference	Yangshen Deng et.al.	2504.10326	null	Kimi
1741	2025-04-14	Deep Reasoning Translation via Reinforcement Learning	Jiaan Wang et.al.	2504.10187	link	Kimi
1742	2025-04-14	HalluSearch at SemEval-2025 Task 3: A Search-Enhanced RAG Pipeline for Hallucination Detection	Mohamed A. Abdallah et.al.	2504.10168	null	Kimi
1743	2025-04-14	Breaking the Data Barrier – Building GUI Agents Through Task Generalization	Junlei Zhang et.al.	2504.10127	link	Kimi
1744	2025-04-14	CameraBench: Benchmarking Visual Reasoning in MLLMs via Photography	I-Sheng Fang et.al.	2504.10090	null	Kimi
1745	2025-04-14	RealSafe-R1: Safety-Aligned DeepSeek-R1 without Compromising Reasoning Capability	Yichi Zhang et.al.	2504.10081	null	Kimi
1746	2025-04-14	Mavors: Multi-granularity Video Representation for Multimodal Large Language Model	Yang Shi et.al.	2504.10068	null	Kimi
1747	2025-04-14	Hallucination Detection in LLMs via Topological Divergence on Attention Graphs	Alexandra Bazarova et.al.	2504.10063	null	Kimi
1748	2025-04-14	DataMosaic: Explainable and Verifiable Multi-Modal Data Analytics through Extract-Reason-Verify	Zhengxuan Zhang et.al.	2504.10036	null	Kimi
1749	2025-04-14	The Mirage of Performance Gains: Why Contrastive Decoding Fails to Address Multimodal Hallucination	Hao Yin et.al.	2504.10020	null	Kimi
1750	2025-04-14	Do We Really Need Curated Malicious Data for Safety Alignment in Multi-modal Large Language Models?	Yanbo Wang et.al.	2504.10000	null	Kimi
1751	2025-04-14	KeepKV: Eliminating Output Perturbation in KV Cache Compression for Efficient LLMs Inference	Yuxuan Tian et.al.	2504.09936	null	Kimi
1752	2025-04-14	FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding	Zheng Liu et.al.	2504.09925	link	Kimi
1753	2025-04-14	Reasoning Models Can Be Effective Without Thinking	Wenjie Ma et.al.	2504.09858	null	Kimi
1754	2025-04-14	A Survey of Large Language Model-Powered Spatial Intelligence Across Scales: Advances in Embodied Agents, Smart Cities, and Earth Science	Jie Feng et.al.	2504.09848	null	Kimi
1755	2025-04-14	OVERLORD: Ultimate Scaling of DataLoader for Multi-Source Large Foundation Model Training	Juntao Zhao et.al.	2504.09844	null	Kimi
1756	2025-04-14	Training Small Reasoning LLMs with Cognitive Preference Alignment	Wenrui Cai et.al.	2504.09802	null	Kimi
1757	2025-04-14	VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents	Ryota Tanaka et.al.	2504.09795	null	Kimi
1758	2025-04-14	Reasoning Court: Combining Reasoning, Action, and Judgment for Multi-Hop Reasoning	Jingtian Wu et.al.	2504.09781	null	Kimi
1759	2025-04-14	Understanding and Optimizing Multi-Stage AI Inference Pipelines	Abhimanyu Rajeshkumar Bambhaniya et.al.	2504.09775	null	Kimi
1760	2025-04-14	Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning	Can Jin et.al.	2504.09772	link	Kimi
1761	2025-04-13	Leveraging Reasoning Model Answers to Enhance Non-Reasoning Model Capability	Haotian Wang et.al.	2504.09639	link	Kimi
1762	2025-04-13	Metropolis-Hastings Captioning Game: Knowledge Fusion of Vision Language Models via Decentralized Bayesian Inference	Yuta Matsui et.al.	2504.09620	null	Kimi
1763	2025-04-10	Cat, Rat, Meow: On the Alignment of Language Model and Human Term-Similarity Judgments	Lorenz Linhardt et.al.	2504.07965	null	Kimi
1764	2025-04-10	PixelFlow: Pixel-Space Generative Models with Flow	Shoufa Chen et.al.	2504.07963	link	Kimi
1765	2025-04-10	GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation	Lang Lin et.al.	2504.07962	null	Kimi
1766	2025-04-10	Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction	Zeren Jiang et.al.	2504.07961	link	Kimi
1767	2025-04-10	CCMNet: Leveraging Calibrated Color Correction Matrices for Cross-Camera Color Constancy	Dongyoung Kim et.al.	2504.07959	null	Kimi
1768	2025-04-10	MM-IFEngine: Towards Multimodal Instruction Following	Shengyuan Ding et.al.	2504.07957	link	Kimi
1769	2025-04-10	VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning	Yukun Qi et.al.	2504.07956	null	Kimi
1770	2025-04-10	Perception-R1: Pioneering Perception Policy with Reinforcement Learning	En Yu et.al.	2504.07954	link	Kimi
1771	2025-04-10	Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models	Mustafa Shukor et.al.	2504.07951	null	Kimi
1772	2025-04-10	InteractAvatar: Modeling Hand-Face Interaction in Photorealistic Avatars with Deformable Gaussians	Kefan Chen et.al.	2504.07949	null	Kimi
1773	2025-04-10	GenEAva: Generating Cartoon Avatars with Fine-Grained Facial Expressions from Realistic Diffusion-based Faces	Hao Yu et.al.	2504.07945	null	Kimi
1774	2025-04-10	HoloPart: Generative 3D Part Amodal Segmentation	Yunhan Yang et.al.	2504.07943	null	Kimi
1775	2025-04-10	SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement	Xiyao Wang et.al.	2504.07934	link	Kimi
1776	2025-04-10	The Urban Impact of AI: Modeling Feedback Loops in Next-Venue Recommendation	Giovanni Mauro et.al.	2504.07911	link	Kimi
1777	2025-04-10	The Efficacy of Semantics-Preserving Transformations in Self-Supervised Learning for Medical Ultrasound	Blake VanBerlo et.al.	2504.07904	null	Kimi
1778	2025-04-10	Redefining Machine Translation on Social Network Services with Large Language Models	Hongcheng Guo et.al.	2504.07901	link	Kimi
1779	2025-04-10	How do Large Language Models Understand Relevance? A Mechanistic Interpretability Perspective	Qi Liu et.al.	2504.07898	link	Kimi
1780	2025-04-10	Fast Adaptation with Behavioral Foundation Models	Harshit Sikchi et.al.	2504.07896	null	Kimi
1781	2025-04-10	SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning	Rui Pan et.al.	2504.07891	link	Kimi
1782	2025-04-10	Benchmarking Adversarial Robustness to Bias Elicitation in Large Language Models: Scalable Automated Assessment with LLM-as-a-Judge	Riccardo Cantini et.al.	2504.07887	link	Kimi
1783	2025-04-10	Token Level Routing Inference System for Edge Devices	Jianshu She et.al.	2504.07878	null	Kimi
1784	2025-04-10	Dual Engines of Thoughts: A Depth-Breadth Integration Framework for Open-Ended Analysis	Fei-Hsuan Yu et.al.	2504.07872	null	Kimi
1785	2025-04-10	SAMJAM: Zero-Shot Video Scene Graph Generation for Egocentric Kitchen Videos	Joshua Li et.al.	2504.07867	null	Kimi
1786	2025-04-10	Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs	Yichun Yin et.al.	2504.07866	null	Kimi
1787	2025-04-10	2D-Curri-DPO: Two-Dimensional Curriculum Learning for Direct Preference Optimization	Mengyang Li et.al.	2504.07856	null	Kimi
1788	2025-04-10	The KL3M Data Project: Copyright-Clean Training Resources for Large Language Models	Michael J Bommarito II et.al.	2504.07854	link	Kimi
1789	2025-04-10	V2V3D: View-to-View Denoised 3D Reconstruction for Light-Field Microscopy	Jiayin Zhao et.al.	2504.07853	null	Kimi
1790	2025-04-10	Anytime Single-Step MAPF Planning with Anytime PIBT	Nayesha Gandotra et.al.	2504.07841	null	Kimi
1791	2025-04-10	Understanding Learner-LLM Chatbot Interactions and the Impact of Prompting Guidelines	Cansu Koyuturk et.al.	2504.07840	null	Kimi
1792	2025-04-10	Deceptive Automated Interpretability: Language Models Coordinating to Fool Oversight Systems	Simon Lermen et.al.	2504.07831	null	Kimi
1793	2025-04-10	MOSAIC: Modeling Social AI for Content Dissemination and Regulation in Multi-Agent Simulations	Genglin Liu et.al.	2504.07830	link	Kimi
1794	2025-04-10	Cluster-Driven Expert Pruning for Mixture-of-Experts Large Language Models	Hongcheng Guo et.al.	2504.07807	link	Kimi
1795	2025-04-10	On the Temporal Question-Answering Capabilities of Large Language Models Over Anonymized Data	Alfredo Garrachón Ruiz et.al.	2504.07646	null	Kimi
1796	2025-04-10	ConceptFormer: Towards Efficient Use of Knowledge-Graph Embeddings in Large Language Models	Joel Barmettler et.al.	2504.07624	null	Kimi
1797	2025-04-10	VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model	Haozhan Shen et.al.	2504.07615	link	Kimi
1798	2025-04-10	Boosting Universal LLM Reward Design through the Heuristic Reward Observation Space Evolution	Zen Kit Heng et.al.	2504.07596	null	Kimi
1799	2025-04-10	AI-Slop to AI-Polish? Aligning Language Models through Edit-Based Writing Rewards and Test-time Computation	Tuhin Chakrabarty et.al.	2504.07532	link	Kimi
1800	2025-04-10	Supervised Optimism Correction: Be Confident When LLMs Are Sure	Junjie Zhang et.al.	2504.07527	null	Kimi
1801	2025-04-10	VideoExpert: Augmented LLM for Temporal-Sensitive Video Understanding	Henghao Zhao et.al.	2504.07519	null	Kimi
1802	2025-04-10	GPT Carry-On: Training Foundation Model for Customization Could Be Simple, Scalable and Affordable	Jianqiao Wangni et.al.	2504.07513	null	Kimi
1803	2025-04-10	Kimi-VL Technical Report	Kimi Team et.al.	2504.07491	link	Kimi
1804	2025-04-10	Beyond LLMs: A Linguistic Approach to Causal Graph Generation from Narrative Texts	Zehan Li et.al.	2504.07459	null	Kimi
1805	2025-04-10	From Token to Line: Enhancing Code Generation with a Long-Term Perspective	Tingwei Lu et.al.	2504.07433	null	Kimi
1806	2025-04-10	TALE: A Tool-Augmented Framework for Reference-Free Evaluation of Large Language Models	Sher Badshah et.al.	2504.07385	null	Kimi
1807	2025-04-10	Enhancing Time Series Forecasting via Multi-Level Text Alignment with LLMs	Taibiao Zhao et.al.	2504.07360	link	Kimi
1808	2025-04-10	Revisiting Prompt Optimization with Large Reasoning Models-A Case Study on Event Extraction	Saurabh Srivastava et.al.	2504.07357	null	Kimi
1809	2025-04-09	Modeling Response Consistency in Multi-Agent LLM Systems: A Comparative Analysis of Shared and Separate Context Approaches	Tooraj Helmi et.al.	2504.07303	null	Kimi
1810	2025-04-09	SemEval-2025 Task 5: LLMs4Subjects – LLM-based Automated Subject Tagging for a National Technical Library’s Open-Access Catalog	Jennifer D’Souza et.al.	2504.07199	link	Kimi
1811	2025-04-09	HypoEval: Hypothesis-Guided Evaluation for Natural Language Generation	Mingxuan Li et.al.	2504.07174	link	Kimi
1812	2025-04-09	Sculpting Subspaces: Constrained Full Fine-Tuning in LLMs for Continual Learning	Nikhil Shivakumar Nayak et.al.	2504.07097	link	Kimi
1813	2025-04-09	OmniCaptioner: One Captioner to Rule Them All	Yiting Lu et.al.	2504.07089	link	Kimi
1814	2025-04-09	KG-LLM-Bench: A Scalable Benchmark for Evaluating LLM Reasoning on Textualized Knowledge Graphs	Elan Markowitz et.al.	2504.07087	null	Kimi
1815	2025-04-09	DeduCE: Deductive Consistency as a Framework to Evaluate LLM Reasoning	Atharva Pandey et.al.	2504.07080	null	Kimi
1816	2025-04-09	SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills	Boyuan Zheng et.al.	2504.07079	null	Kimi
1817	2025-04-09	HalluciNot: Hallucination Detection Through Context and Common Knowledge Verification	Bibek Paudel et.al.	2504.07069	null	Kimi
1818	2025-04-09	Review of Case-Based Reasoning for LLM Agents: Theoretical Foundations, Architectural Components, and Cognitive Integration	Kostas Hatalis et.al.	2504.06943	null	Kimi
1819	2025-04-09	Are Vision-Language Models Ready for Dietary Assessment? Exploring the Next Frontier in AI-Powered Food Image Recognition	Sergio Romero-Tapiador et.al.	2504.06925	null	Kimi
1820	2025-04-09	Integrating Cognitive Processing Signals into Language Models: A Review of Advances, Applications and Future Directions	Angela Lopez-Cardona et.al.	2504.06843	null	Kimi
1821	2025-04-09	LVC: A Lightweight Compression Framework for Enhancing VLMs in Long Video Understanding	Ziyi Wang et.al.	2504.06835	null	Kimi
1822	2025-04-09	Domain-Specific Pruning of Large Mixture-of-Experts Models with Few-shot Demonstrations	Zican Dong et.al.	2504.06792	null	Kimi
1823	2025-04-09	Zero-Shot Image-Based Large Language Model Approach to Road Pavement Monitoring	Shuoshuo Xu et.al.	2504.06785	null	Kimi
1824	2025-04-09	FamilyTool: A Multi-hop Personalized Tool Use Benchmark	Yuxin Wang et.al.	2504.06766	link	Kimi
1825	2025-04-09	EDIT: Enhancing Vision Transformers by Mitigating Attention Sink through an Encoder-Decoder Architecture	Wenfeng Feng et.al.	2504.06738	null	Kimi
1826	2025-04-09	A Neuro-inspired Interpretation of Unlearning in Large Language Models through Sample-level Unlearning Difficulty	Xiaohua Feng et.al.	2504.06658	null	Kimi
1827	2025-04-09	Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program	Minghe Gao et.al.	2504.06606	link	Kimi
1828	2025-04-09	Automated Business Process Analysis: An LLM-Based Approach to Value Assessment	William De Michele et.al.	2504.06600	link	Kimi
1829	2025-04-09	Right Prediction, Wrong Reasoning: Uncovering LLM Misalignment in RA Disease Diagnosis	Umakanta Maharana et.al.	2504.06581	link	Kimi
1830	2025-04-09	NeedleInATable: Exploring Long-Context Capability of Large Language Models towards Long-Structured Tables	Lanrui Wang et.al.	2504.06560	null	Kimi
1831	2025-04-09	Lugha-Llama: Adapting Large Language Models for African Languages	Happy Buzaaba et.al.	2504.06536	null	Kimi
1832	2025-04-08	Don’t Let It Hallucinate: Premise Verification via Retrieval-Augmented Logical Reasoning	Yuehan Qin et.al.	2504.06438	null	Kimi
1833	2025-04-08	S’MoRE: Structural Mixture of Residual Experts for LLM Fine-tuning	Hanqing Zeng et.al.	2504.06426	null	Kimi
1834	2025-04-08	Understanding Machine Unlearning Through the Lens of Mode Connectivity	Jiali Cheng et.al.	2504.06407	null	Kimi
1835	2025-04-08	GOLLuM: Gaussian Process Optimized LLMs – Reframing LLM Finetuning through Bayesian Optimization	Bojana Ranković et.al.	2504.06265	link	Kimi
1836	2025-04-09	Hogwild! Inference: Parallel LLM Generation via Concurrent Attention	Gleb Rodionov et.al.	2504.06261	link	Kimi
1837	2025-04-08	FEABench: Evaluating Language Models on Multiphysics Reasoning Ability	Nayantara Mudur et.al.	2504.06260	link	Kimi
1838	2025-04-08	Encoder-Decoder Gemma: Improving the Quality-Efficiency Trade-Off via Adaptation	Biao Zhang et.al.	2504.06225	null	Kimi
1839	2025-04-08	From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models	Chejian Xu et.al.	2504.06214	null	Kimi
1840	2025-04-08	TxGemma: Efficient and Agentic LLMs for Therapeutics	Eric Wang et.al.	2504.06196	null	Kimi
1841	2025-04-08	Navigating the Rabbit Hole: Emergent Biases in LLM-Generated Attack Narratives Targeting Mental Health Groups	Rijul Magu et.al.	2504.06160	null	Kimi
1842	2025-04-08	QGen Studio: An Adaptive Question-Answer Generation, Training and Evaluation Platform	Movina Moses et.al.	2504.06136	null	Kimi
1843	2025-04-08	Multi-Sense Embeddings for Language Models and Knowledge Distillation	Qitong Wang et.al.	2504.06036	null	Kimi
1844	2025-04-08	NativQA Framework: Enabling LLMs with Native, Local, and Everyday Knowledge	Firoj Alam et.al.	2504.05995	null	Kimi
1845	2025-04-08	PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario	Sriram Mandalika et.al.	2504.05908	null	Kimi
1846	2025-04-08	HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference	Shuzhang Zhong et.al.	2504.05897	link	Kimi
1847	2025-04-08	Agent Guide: A Simple Agent Behavioral Watermarking Framework	Kaibo Huang et.al.	2504.05871	null	Kimi
1848	2025-04-08	Are Generative AI Agents Effective Personalized Financial Advisors?	Takehiro Takayanagi et.al.	2504.05862	link	Kimi
1849	2025-04-08	How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM	Jirong Zha et.al.	2504.05786	null	Kimi
1850	2025-04-08	DDT: Decoupled Diffusion Transformer	Shuai Wang et.al.	2504.05741	null	Kimi
1851	2025-04-08	Rank-Then-Score: Enhancing Large Language Models for Automated Essay Scoring	Yida Cai et.al.	2504.05736	null	Kimi
1852	2025-04-08	STRIVE: A Think & Improve Approach with Iterative Refinement for Enhancing Question Quality Estimation	Aniket Deroy et.al.	2504.05693	null	Kimi
1853	2025-04-08	Towards Smarter Hiring: Are Zero-Shot and Few-Shot Pre-trained LLMs Ready for HR Spoken Interview Transcript Analysis?	Subhankar Maity et.al.	2504.05683	null	Kimi
1854	2025-04-08	Sugar-Coated Poison: Benign Generation Unlocks LLM Jailbreaking	Yu-Hang Wu et.al.	2504.05652	link	Kimi
1855	2025-04-08	TAGC: Optimizing Gradient Communication in Distributed Transformer Training	Igor Polyakov et.al.	2504.05638	link	Kimi
1856	2025-04-08	FactGuard: Leveraging Multi-Agent Systems to Generate Answerable and Unanswerable Questions for Enhanced Long-Context LLM Extraction	Qian-Wen Zhang et.al.	2504.05607	link	Kimi
1857	2025-04-08	ShadowCoT: Cognitive Hijacking for Stealthy Reasoning Backdoors in LLMs	Gejian Zhao et.al.	2504.05605	null	Kimi
1858	2025-04-08	Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought	Yi Peng et.al.	2504.05599	null	Kimi
1859	2025-04-08	DEL: Context-Aware Dynamic Exit Layer for Efficient Self-Speculative Decoding	Hossein Entezari Zarch et.al.	2504.05598	null	Kimi
1860	2025-04-08	Knowledge-Instruct: Effective Continual Pre-training from Limited Data using Instructions	Oded Ovadia et.al.	2504.05571	null	Kimi
1861	2025-04-07	Bridging Industrial Expertise and XR with LLM-Powered Conversational Agents	Despina Tomkou et.al.	2504.05527	null	Kimi
1862	2025-04-07	Fast Controlled Generation from Language Models with Adaptive Weighted Rejection Sampling	Benjamin Lipkin et.al.	2504.05410	null	Kimi
1863	2025-04-07	LiveVQA: Live Visual Knowledge Seeking	Mingyang Fu et.al.	2504.05288	null	Kimi
1864	2025-04-07	Learning to Reason Over Time: Timeline Self-Reflection for Improved Temporal Reasoning in Language Models	Adrián Bazaga et.al.	2504.05258	null	Kimi
1865	2025-04-07	Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling	Hengran Zhang et.al.	2504.05216	null	Kimi
1866	2025-04-07	Post-Training Language Models for Continual Relation Extraction	Sefika Efeoglu et.al.	2504.05214	null	Kimi
1867	2025-04-07	Concise Reasoning via Reinforcement Learning	Mehdi Fatemi et.al.	2504.05185	link	Kimi
1868	2025-04-07	VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks	YuYue et.al.	2504.05118	null	Kimi
1869	2025-04-07	AI for Climate Finance: Agentic Retrieval and Multi-Step Reasoning for Early Warning System Investments	Saeid Ario Vaghefi et.al.	2504.05104	null	Kimi
1870	2025-04-07	The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning	Tianshi Zheng et.al.	2504.05081	null	Kimi
1871	2025-04-07	Revealing the Intrinsic Ethical Vulnerability of Aligned Large Language Models	Jiawei Lian et.al.	2504.05050	null	Kimi
1872	2025-04-07	Debate Only When Necessary: Adaptive Multiagent Collaboration for Efficient LLM Reasoning	Sugyeong Eo et.al.	2504.05047	null	Kimi
1873	2025-04-07	Following the Whispers of Values: Unraveling Neural Mechanisms Behind Value-Oriented Behaviors in LLMs	Ling Hu et.al.	2504.04994	null	Kimi
1874	2025-04-07	Towards Visual Text Grounding of Multimodal Large Language Model	Ming Li et.al.	2504.04974	null	Kimi
1875	2025-04-07	M-Prometheus: A Suite of Open Multilingual LLM Judges	José Pombal et.al.	2504.04953	link	Kimi
1876	2025-04-07	A Llama walks into the ‘Bar’: Efficient Supervised Fine-Tuning for Legal Reasoning in the Multi-state Bar Exam	Rean Fernandes et.al.	2504.04945	null	Kimi
1877	2025-04-07	Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration	Ran Xu et.al.	2504.04915	link	Kimi
1878	2025-04-07	Leveraging Large Language Models for Cost-Effective, Multilingual Depression Detection and Severity Assessment	Longdi Xian et.al.	2504.04891	null	Kimi
1879	2025-04-07	Uni4D: A Unified Self-Supervised Learning Framework for Point Cloud Videos	Zhi Zuo et.al.	2504.04837	null	Kimi
1880	2025-04-07	Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models	Ruikang Liu et.al.	2504.04823	link	Kimi
1881	2025-04-07	Can LLMs Interpret and Leverage Structured Linguistic Representations? A Case Study with AMRs	Ankush Raut et.al.	2504.04745	null	Kimi
1882	2025-04-07	TathyaNyaya and FactLegalLlama: Advancing Factual Judgment Prediction and Explanation in the Indian Legal Context	Shubham Kumar Nigam et.al.	2504.04737	null	Kimi
1883	2025-04-07	Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use	Anna Goldie et.al.	2504.04736	null	Kimi
1884	2025-04-07	Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models	Yubo Li et.al.	2504.04717	link	Kimi
1885	2025-04-07	Sequential-NIAH: A Needle-In-A-Haystack Benchmark for Extracting Sequential Needles from Long Contexts	Yifei Yu et.al.	2504.04713	null	Kimi
1886	2025-04-07	LagKV: Lag-Relative Information of the KV Cache Tells Which Tokens Are Important	Manlai Liang et.al.	2504.04704	link	Kimi
1887	2025-04-07	R2Vul: Learning to Reason about Software Vulnerabilities with Reinforcement Learning and Structured Reasoning Distillation	Martin Weyssow et.al.	2504.04699	link	Kimi
1888	2025-04-07	LEO-MINI: An Efficient Multimodal Large Language Model using Conditional Token Reduction and Mixture of Multi-Modal Experts	Yimu Wang et.al.	2504.04653	null	Kimi
1889	2025-04-06	Splits! A Flexible Dataset for Evaluating a Model’s Demographic Social Inference	Eylon Caplan et.al.	2504.04640	link	Kimi
1890	2025-04-06	SECQUE: A Benchmark for Evaluating Real-World Financial Analysis Capabilities	Noga Ben Yoash et.al.	2504.04596	null	Kimi
1891	2025-04-06	The Point, the Vision and the Text: Does Point Cloud Boost Spatial Reasoning of Large Language Models?	Weichen Zhang et.al.	2504.04540	null	Kimi
1892	2025-04-06	An Empirical Comparison of Text Summarization: A Multi-Dimensional Evaluation of Large Language Models	Anantharaman Janakiraman et.al.	2504.04534	null	Kimi
1893	2025-04-03	Concept Lancet: Image Editing with Compositional Representation Transplant	Jinqi Luo et.al.	2504.02828	null	Kimi
1894	2025-04-03	On Vanishing Variance in Transformer Length Generalization	Ruining Li et.al.	2504.02827	null	Kimi
1895	2025-04-03	Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing	Xiangyu Zhao et.al.	2504.02826	link	Kimi
1896	2025-04-03	Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models	Mateusz Pach et.al.	2504.02821	link	Kimi
1897	2025-04-03	GMR-Conv: An Efficient Rotation and Reflection Equivariant Convolution Kernel Using Gaussian Mixture Rings	Yuexi Du et.al.	2504.02819	link	Kimi
1898	2025-04-03	Efficient Autoregressive Shape Generation via Octree-Based Adaptive Tokenization	Kangle Deng et.al.	2504.02817	null	Kimi
1899	2025-04-03	Generative Evaluation of Complex Reasoning in Large Language Models	Haowei Lin et.al.	2504.02810	link	Kimi
1900	2025-04-03	MegaMath: Pushing the Limits of Open Math Corpora	Fan Zhou et.al.	2504.02807	link	Kimi
1901	2025-04-03	A Survey of Large Language Models in Mental Health Disorder Detection on Social Media	Zhuohan Ge et.al.	2504.02800	null	Kimi
1902	2025-04-03	Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence	Anita Rau et.al.	2504.02799	null	Kimi
1903	2025-04-03	Spline-based Transformers	Prashanth Chandran et.al.	2504.02797	null	Kimi
1904	2025-04-03	A Framework for Situating Innovations, Opportunities, and Challenges in Advancing Vertical Systems with Large AI Models	Gaurav Verma et.al.	2504.02793	null	Kimi
1905	2025-04-03	Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets	Chuning Zhu et.al.	2504.02792	null	Kimi
1906	2025-04-03	A Framework for Robust Cognitive Evaluation of LLMs	Karin de Langis et.al.	2504.02789	null	Kimi
1907	2025-04-03	GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation	Zhiyuan Yan et.al.	2504.02782	link	Kimi
1908	2025-04-03	From Consumption to Collaboration: Measuring Interaction Patterns to Augment Human Cognition in Open-Ended Tasks	Joshua Holstein et.al.	2504.02780	null	Kimi
1909	2025-04-03	Multi-Head Adaptive Graph Convolution Network for Sparse Point Cloud-Based Human Activity Recognition	Vincent Gbouna Zakka et.al.	2504.02778	link	Kimi
1910	2025-04-03	MultiBLiMP 1.0: A Massively Multilingual Benchmark of Linguistic Minimal Pairs	Jaap Jumelet et.al.	2504.02768	link	Kimi
1911	2025-04-03	How Deep Do Large Language Models Internalize Scientific Literature and Citation Practices?	Andres Algaba et.al.	2504.02767	link	Kimi
1912	2025-04-03	Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model	Shengjun Zhang et.al.	2504.02764	null	Kimi
1913	2025-04-03	CanonNet: Canonical Ordering and Curvature Learning for Point Cloud Analysis	Benjy Friedmann et.al.	2504.02763	null	Kimi
1914	2025-04-03	RBR4DNN: Requirements-based Testing of Neural Networks	Nusrat Jahan Mozumder et.al.	2504.02737	link	Kimi
1915	2025-04-03	Enhancing LLM Robustness to Perturbed Instructions: An Empirical Study	Aryan Agrawal et.al.	2504.02733	link	Kimi
1916	2025-04-03	Why do LLMs attend to the first token?	Federico Barbero et.al.	2504.02732	null	Kimi
1917	2025-04-03	HQViT: Hybrid Quantum Vision Transformer for Image Classification	Hui Zhang et.al.	2504.02730	null	Kimi
1918	2025-04-03	ERPO: Advancing Safety Alignment via Ex-Ante Reasoning Preference Optimization	Kehua Feng et.al.	2504.02725	null	Kimi
1919	2025-04-03	Autonomous Human-Robot Interaction via Operator Imitation	Sammy Christen et.al.	2504.02724	null	Kimi
1920	2025-04-03	The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context	Nikhil Verma et.al.	2504.02708	null	Kimi
1921	2025-04-03	Responsible Development of Offensive AI	Ryan Marinelli et.al.	2504.02701	link	Kimi
1922	2025-04-03	Learning Phase Distortion with Selective State Space Models for Video Turbulence Mitigation	Xingguang Zhang et.al.	2504.02697	link	Kimi
1923	2025-04-03	Affordable AI Assistants with Knowledge Graph of Thoughts	Maciej Besta et.al.	2504.02670	null	Kimi
1924	2025-04-03	Inference-Time Scaling for Generalist Reward Modeling	Zijun Liu et.al.	2504.02495	null	Kimi
1925	2025-04-03	Cognitive Memory in Large Language Models	Lianlei Shan et.al.	2504.02441	null	Kimi
1926	2025-04-03	Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation	Chuanqi Cheng et.al.	2504.02438	link	Kimi
1927	2025-04-03	AnesBench: Multi-Dimensional Evaluation of LLM Reasoning in Anesthesiology	Xiang Feng et.al.	2504.02404	link	Kimi
1928	2025-04-03	CoTAL: Human-in-the-Loop Prompt Engineering, Chain-of-Thought Reasoning, and Active Learning for Generalizable Formative Assessment Scoring	Clayton Cohn et.al.	2504.02323	null	Kimi
1929	2025-04-03	MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism	Ruidong Zhu et.al.	2504.02263	null	Kimi
1930	2025-04-03	LLMs as Deceptive Agents: How Role-Based Prompting Induces Semantic Ambiguity in Puzzle Tasks	Seunghyun Yoo et.al.	2504.02254	null	Kimi
1931	2025-04-03	FT-Transformer: Resilient and Reliable Transformer with End-to-End Fault Tolerant Attention	Huangliang Dai et.al.	2504.02211	null	Kimi
1932	2025-04-03	More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment	Yifan Wang et.al.	2504.02193	null	Kimi
1933	2025-04-02	A Survey of Scaling in Large Language Model Reasoning	Zihan Chen et.al.	2504.02181	null	Kimi
1934	2025-04-02	OmniCellTOSG: The First Cell Text-Omic Signaling Graphs Dataset for Joint LLM and GNN Modeling	Heming Zhang et.al.	2504.02148	link	Kimi
1935	2025-04-02	On Simulation-Guided LLM-based Code Generation for Safe Autonomous Driving Software	Ali Nouri et.al.	2504.02141	null	Kimi
1936	2025-04-02	Achieving Unanimous Consensus in Decision Making Using Multi-Agents	Apurba Pokharel et.al.	2504.02128	null	Kimi
1937	2025-04-02	Exploring LLM Reasoning Through Controlled Prompt Variations	Giannis Chatziveroglou et.al.	2504.02111	link	Kimi
1938	2025-04-02	The LLM Wears Prada: Analysing Gender Bias and Stereotypes through Online Shopping Data	Massimiliano Luca et.al.	2504.01951	null	Kimi
1939	2025-04-02	OpenCodeReasoning: Advancing Data Distillation for Competitive Coding	Wasi Uddin Ahmad et.al.	2504.01943	null	Kimi
1940	2025-04-02	Critical Thinking: Which Kinds of Complexity Govern Optimal Reasoning Length?	Celine Lee et.al.	2504.01935	link	Kimi
1941	2025-04-02	A thorough benchmark of automatic text classification: From traditional approaches to large language models	Washington Cunha et.al.	2504.01930	link	Kimi
1942	2025-04-03	Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation	Baban Gain et.al.	2504.01919	null	Kimi
1943	2025-04-02	FineLIP: Extending CLIP’s Reach via Fine-Grained Alignment with Longer Text Inputs	Mothilal Asokan et.al.	2504.01916	null	Kimi
1944	2025-04-02	Advancing AI-Scientist Understanding: Making LLM Think Like a Physicist with Interpretable Reasoning	Yinggan Xu et.al.	2504.01911	null	Kimi
1945	2025-04-02	STAR-1: Safer Alignment of Reasoning LLMs with 1K Data	Zijun Wang et.al.	2504.01903	null	Kimi
1946	2025-04-02	TransientTables: Evaluating LLMs’ Reasoning on Temporally Evolving Semi-structured Tables	Abhilash Shankarampeta et.al.	2504.01879	null	Kimi
1947	2025-04-02	Cross-Lingual Consistency: A Novel Inference Framework for Advancing Reasoning in Large Language Models	Zhiwei Yu et.al.	2504.01857	null	Kimi
1948	2025-04-02	InfiniteICL: Breaking the Limit of Context Window Size via Long Short-term Memory Transformation	Bowen Cao et.al.	2504.01707	null	Kimi
1949	2025-04-02	ToM-RL: Reinforcement Learning Unlocks Theory of Mind in Small LLMs	Yi-Long Lu et.al.	2504.01698	link	Kimi
1950	2025-04-02	Testing Low-Resource Language Support in LLMs Using Language Proficiency Exams: the Case of Luxembourgish	Cedric Lothritz et.al.	2504.01667	null	Kimi
1951	2025-04-02	Enabling Systematic Generalization in Abstract Spatial Reasoning through Meta-Learning for Compositionality	Philipp Mondorf et.al.	2504.01445	link	Kimi
1952	2025-04-02	FAIRE: Assessing Racial and Gender Bias in AI-Driven Resume Evaluations	Athena Wen et.al.	2504.01420	link	Kimi
1953	2025-04-02	An Illusion of Progress? Assessing the Current State of Web Agents	Tianci Xue et.al.	2504.01382	link	Kimi
1954	2025-04-02	Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert Parallelism Design	Mohan Zhang et.al.	2504.01337	null	Kimi
1955	2025-04-02	Slow-Fast Architecture for Video Multi-Modal Large Language Models	Min Shi et.al.	2504.01328	link	Kimi
1956	2025-04-02	On Data Synthesis and Post-training for Visual Abstract Reasoning	Ke Zhu et.al.	2504.01324	null	Kimi
1957	2025-04-02	Adaptive Rectification Sampling for Test-Time Compute Scaling	Zhendong Tan et.al.	2504.01317	link	Kimi
1958	2025-04-02	ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning	Bairu Hou et.al.	2504.01296	link	Kimi
1959	2025-04-02	Scaling Test-Time Inference with Policy-Optimized, Dynamic Retrieval-Augmented Generation via KV Caching and Decoding	Sakhinana Sagar Srinivas et.al.	2504.01281	null	Kimi
1960	2025-04-01	Automated Factual Benchmarking for In-Car Conversational Systems using Large Language Models	Rafael Giebisch et.al.	2504.01248	null	Kimi
1961	2025-04-01	Detecting PTSD in Clinical Interviews: A Comparative Analysis of NLP Methods and Large Language Models	Feng Chen et.al.	2504.01216	null	Kimi
1962	2025-04-01	$μ$ KE: Matryoshka Unstructured Knowledge Editing of Large Language Models	Zian Su et.al.	2504.01196	null	Kimi
1963	2025-04-01	When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning	Nishad Singhi et.al.	2504.01005	null	Kimi
1964	2025-04-01	Token embeddings violate the manifold hypothesis	Michael Robinson et.al.	2504.01002	null	Kimi
1965	2025-04-01	MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization	Siyuan Li et.al.	2504.00999	link	Kimi
1966	2025-04-01	MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs	Juncheng Wu et.al.	2504.00993	link	Kimi
1967	2025-04-01	SentenceKV: Efficient LLM Inference via Sentence-Level Semantic KV Caching	Yuxuan Zhu et.al.	2504.00970	null	Kimi
1968	2025-04-01	Multi-Token Attention	Olga Golovneva et.al.	2504.00927	null	Kimi
1969	2025-04-01	Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents	Saaket Agashe et.al.	2504.00906	link	Kimi
1970	2025-03-31	Easi3R: Estimating Disentangled Motion from DUSt3R Without Training	Xingyu Chen et.al.	2503.24391	link	Kimi
1971	2025-03-31	RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy	Zhonghan Zhao et.al.	2503.24388	null	Kimi
1972	2025-03-31	Consistent Subject Generation via Contrastive Instantiated Concepts	Lee Hsin-Ying et.al.	2503.24387	null	Kimi
1973	2025-03-31	Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation	Shengqiong Wu et.al.	2503.24379	null	Kimi
1974	2025-03-31	ACPBench Hard: Unrestrained Reasoning about Action, Change, and Planning	Harsha Kokel et.al.	2503.24378	null	Kimi
1975	2025-03-31	Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models	Rui Wang et.al.	2503.24377	link	Kimi
1976	2025-03-31	Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1	Yi Chen et.al.	2503.24376	link	Kimi
1977	2025-03-31	ERUPT: Efficient Rendering with Unposed Patch Transformer	Maxim V. Shugaev et.al.	2503.24374	null	Kimi
1978	2025-03-31	Effectively Controlling Reasoning Models through Thinking Intervention	Tong Wu et.al.	2503.24370	null	Kimi
1979	2025-03-31	Adapting Vision Foundation Models for Real-time Ultrasound Image Segmentation	Xiaoran Zhang et.al.	2503.24368	null	Kimi
1980	2025-03-31	Query and Conquer: Execution-Guided SQL Generation	Łukasz Borchmann et.al.	2503.24364	null	Kimi
1981	2025-03-31	SQuat: Subspace-orthogonal KV Cache Quantization	Hao Wang et.al.	2503.24358	null	Kimi
1982	2025-03-31	ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion	Rana Muhammad Shahroz Khan et.al.	2503.24354	null	Kimi
1983	2025-03-31	Can Test-Time Scaling Improve World Foundation Model?	Wenyan Cong et.al.	2503.24320	link	Kimi
1984	2025-03-31	BEATS: Bias Evaluation and Assessment Test Suite for Large Language Models	Alok Abhishek et.al.	2503.24310	null	Kimi
1985	2025-03-31	A Systematic Evaluation of LLM Strategies for Mental Health Text Analysis: Fine-tuning vs. Prompt Engineering vs. RAG	Arshia Kermani et.al.	2503.24307	null	Kimi
1986	2025-03-31	Order Matters: On Parameter-Efficient Image-to-Video Probing for Recognizing Nearly Symmetric Actions	Thinesh Thiyakesan Ponbagavathi et.al.	2503.24298	null	Kimi
1987	2025-03-31	Is analogy enough to draw novel adjective-noun inferences?	Hayley Ross et.al.	2503.24293	link	Kimi
1988	2025-03-31	Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model	Jingcheng Hu et.al.	2503.24290	null	Kimi
1989	2025-03-31	Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning	Jiacheng Lin et.al.	2503.24289	link	Kimi
1990	2025-03-31	Style Quantization for Data-Efficient GAN Training	Jian Wang et.al.	2503.24282	null	Kimi
1991	2025-03-31	Evaluating and Designing Sparse Autoencoders by Approximating Quasi-Orthogonality	Sewoong Lee et.al.	2503.24277	link	Kimi
1992	2025-03-31	FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics	Yixuan Li et.al.	2503.24267	null	Kimi
1993	2025-03-31	Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation	Dun Yuan et.al.	2503.24245	null	Kimi
1994	2025-03-31	Spatio-temporal Prediction of Fine-Grained Origin-Destination Matrices with Applications in Ridesharing	Run Yang et.al.	2503.24237	null	Kimi
1995	2025-03-31	What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models	Qiyuan Zhang et.al.	2503.24235	link	Kimi
1996	2025-03-31	PAARS: Persona Aligned Agentic Retail Shoppers	Saab Mansour et.al.	2503.24228	null	Kimi
1997	2025-03-31	MB-ORES: A Multi-Branch Object Reasoner for Visual Grounding in Remote Sensing	Karim Radouane et.al.	2503.24219	link	Kimi
1998	2025-03-31	All You Need is Sally-Anne: ToM in AI Strongly Supported After Surpassing Tests for 3-Year-Olds	Nitay Alon et.al.	2503.24215	null	Kimi
1999	2025-03-31	Synthetic News Generation for Fake News Classification	Abdul Sittar et.al.	2503.24206	null	Kimi
2000	2025-03-31	TwT: Thinking without Tokens by Habitual Reasoning Distillation with Multi-Teachers’ Guidance	Jingxian Xu et.al.	2503.24198	null	Kimi
2001	2025-03-31	Output Constraints as Attack Surface: Exploiting Structured Generation to Bypass LLM Safety Mechanisms	Shuoming Zhang et.al.	2503.24191	null	Kimi
2002	2025-03-31	Grounding Agent Reasoning in Image Schemas: A Neurosymbolic Approach to Embodied Cognition	François Olivier et.al.	2503.24110	null	Kimi
2003	2025-03-31	Artificial Conversations, Real Results: Fostering Language Detection with Synthetic Data	Fatemeh Mohammadi et.al.	2503.24062	null	Kimi
2004	2025-03-31	AirCache: Activating Inter-modal Relevancy KV Cache Compression for Efficient Large Vision-Language Model Inference	Kai Huang et.al.	2503.23956	null	Kimi
2005	2025-03-31	Model Hemorrhage and the Robustness Limits of Large Language Models	Ziyang Ma et.al.	2503.23924	null	Kimi
2006	2025-03-31	OrchMLLM: Orchestrate Multimodal Data with Batch Post-Balancing to Accelerate Multimodal Large Language Model Training	Yijie Zheng et.al.	2503.23830	null	Kimi
2007	2025-03-31	Expanding RL with Verifiable Rewards Across Diverse Domains	Yi Su et.al.	2503.23829	null	Kimi
2008	2025-03-31	Thinking Longer, Not Larger: Enhancing Software Engineering Agents via Scaling Test-Time Compute	Yingwei Ma et.al.	2503.23803	link	Kimi
2009	2025-03-31	Adaptive Layer-skipping in Pre-trained LLMs	Xuan Luo et.al.	2503.23798	null	Kimi
2010	2025-03-31	WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization	Ine Gevers et.al.	2503.23779	null	Kimi
2011	2025-03-31	Short-video Propagation Influence Rating: A New Real-world Dataset and A New Large Graph Model	Dizhan Xue et.al.	2503.23746	link	Kimi
2012	2025-03-31	LANID: LLM-assisted New Intent Discovery	Lu Fan et.al.	2503.23740	link	Kimi
2013	2025-03-31	AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization	Yiyang Du et.al.	2503.23733	link	Kimi
2014	2025-03-30	Question-Aware Knowledge Graph Prompting for Enhancing Large Language Models	Haochen Liu et.al.	2503.23523	link	Kimi
2015	2025-03-30	If an LLM Were a Character, Would It Know Its Own Story? Evaluating Lifelong Learning in LLMs	Siqi Fan et.al.	2503.23514	null	Kimi
2016	2025-03-30	RARE: Retrieval-Augmented Reasoning Modeling	Zhengren Wang et.al.	2503.23513	link	Kimi
2017	2025-03-30	Benchmarking Systematic Relational Reasoning with Large Language and Reasoning Models	Irtaza Khalid et.al.	2503.23487	null	Kimi
2018	2025-03-30	Order Independence With Finetuning	Katrina Brown et.al.	2503.23483	null	Kimi
2019	2025-03-27	Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model	Abdelrahman Shaker et.al.	2503.21782	link	Kimi
2020	2025-03-27	X $^{2}$ -Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction	Weihao Yu et.al.	2503.21779	null	Kimi
2021	2025-03-27	Video-R1: Reinforcing Video Reasoning in MLLMs	Kaituo Feng et.al.	2503.21776	link	Kimi
2022	2025-03-27	StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion	Ziyu Guo et.al.	2503.21775	null	Kimi
2023	2025-03-27	MemInsight: Autonomous Memory Augmentation for LLM Agents	Rana Salama et.al.	2503.21760	null	Kimi
2024	2025-03-27	Fwd2Bot: LVLM Visual Token Compression with Double Forward Bottleneck	Adrian Bulat et.al.	2503.21757	null	Kimi
2025	2025-03-27	LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis	Shitian Zhao et.al.	2503.21749	null	Kimi
2026	2025-03-27	GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics	Arsham Gholamzadeh Khoee et.al.	2503.21735	null	Kimi
2027	2025-03-27	Effective Skill Unlearning through Intervention and Abstention	Yongce Li et.al.	2503.21730	link	Kimi
2028	2025-03-27	ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation	Zhicheng Lee et.al.	2503.21729	link	Kimi
2029	2025-03-27	OccRobNet : Occlusion Robust Network for Accurate 3D Interacting Hand-Object Pose Estimation	Mallika Garg et.al.	2503.21723	null	Kimi
2030	2025-03-27	Collab: Controlled Decoding using Mixture of Agents for LLM Alignment	Souradip Chakraborty et.al.	2503.21720	null	Kimi
2031	2025-03-27	Outlier dimensions favor frequent tokens in language model	Iuri Macocco et.al.	2503.21718	null	Kimi
2032	2025-03-27	CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?	Jiefu Ou et.al.	2503.21717	link	Kimi
2033	2025-03-27	Elementwise Layer Normalization	Felix Stollenwerk et.al.	2503.21708	link	Kimi
2034	2025-03-27	MAVERIX: Multimodal Audio-Visual Evaluation Reasoning IndeX	Liuyue Xie et.al.	2503.21699	null	Kimi
2035	2025-03-27	Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks	Wenqi Zhang et.al.	2503.21696	link	Kimi
2036	2025-03-27	AMA-SAM: Adversarial Multi-Domain Alignment of Segment Anything Model for High-Fidelity Histology Nuclei Segmentation	Jiahe Qian et.al.	2503.21695	null	Kimi
2037	2025-03-27	Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data	Zhiyuan Ma et.al.	2503.21694	link	Kimi
2038	2025-03-27	LLM-Gomoku: A Large Language Model-Based System for Strategic Gomoku with Self-Play and Reinforcement Learning	Hui Wang et.al.	2503.21683	null	Kimi
2039	2025-03-27	JiraiBench: A Bilingual Benchmark for Evaluating Large Language Models’ Detection of Human Self-Destructive Behavior Content in Jirai Community	Yunze Xiao et.al.	2503.21679	null	Kimi
2040	2025-03-27	How do language models learn facts? Dynamics, curricula and hallucinations	Nicolas Zucchet et.al.	2503.21676	null	Kimi
2041	2025-03-27	COMI-LINGUA: Expert Annotated Large-Scale Dataset for Multitask NLP in Hindi-English Code-Mixing	Rajvee Sheth et.al.	2503.21670	null	Kimi
2042	2025-03-27	Cognitive Science-Inspired Evaluation of Core Capabilities for Object Understanding in AI	Danaja Rutar et.al.	2503.21668	null	Kimi
2043	2025-03-27	UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning	Zhengxi Lu et.al.	2503.21620	link	Kimi
2044	2025-03-27	A Measure Based Generalizable Approach to Understandability	Vikas Kushwaha et.al.	2503.21615	null	Kimi
2045	2025-03-27	A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond	Xiaoye Qu et.al.	2503.21614	link	Kimi
2046	2025-03-27	Evaluating book summaries from internal knowledge in Large Language Models: a cross-model and semantic consistency approach	Javier Coronado-Blázquez et.al.	2503.21613	null	Kimi
2047	2025-03-27	GenEdit: Compounding Operators and Continuous Improvement to Tackle Text-to-SQL in the Enterprise	Karime Maamari et.al.	2503.21602	null	Kimi
2048	2025-03-27	Prompt, Divide, and Conquer: Bypassing Large Language Model Safety Filters via Segmented and Distributed Prompt Processing	Johan Wahréus et.al.	2503.21598	null	Kimi
2049	2025-03-27	debug-gym: A Text-Based Environment for Interactive Debugging	Xingdi Yuan et.al.	2503.21557	null	Kimi
2050	2025-03-27	SWI: Speaking with Intent in Large Language Models	Yuwei Yin et.al.	2503.21544	link	Kimi
2051	2025-03-27	Keyword-Oriented Multimodal Modeling for Euphemism Identification	Yuxue Hu et.al.	2503.21504	link	Kimi
2052	2025-03-27	Harnessing Chain-of-Thought Metadata for Task Routing and Adversarial Prompt Detection	Ryan Marinelli et.al.	2503.21464	link	Kimi
2053	2025-03-27	An evaluation of LLMs and Google Translate for translation of selected Indian languages via sentiment and semantic analyses	Rohitash Chandra et.al.	2503.21393	null	Kimi
2054	2025-03-27	Controlling Large Language Model with Latent Actions	Chengxing Jia et.al.	2503.21383	link	Kimi
2055	2025-03-27	Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models	Haoxiang Sun et.al.	2503.21380	link	Kimi
2056	2025-03-27	ReFeed: Multi-dimensional Summarization Refinement with Reflective Reasoning on Feedback	Taewon Yun et.al.	2503.21332	null	Kimi
2057	2025-03-27	InternVL-X: Advancing and Accelerating InternVL Series with Efficient Visual Token Compression	Dongchen Lu et.al.	2503.21307	link	Kimi
2058	2025-03-27	ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition	Yujie Liu et.al.	2503.21248	null	Kimi
2059	2025-03-27	Bias-Aware Agent: Enhancing Fairness in AI-Driven Knowledge Retrieval	Karanbir Singh et.al.	2503.21237	link	Kimi
2060	2025-03-27	LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models	Hengyuan Zhao et.al.	2503.21227	null	Kimi
2061	2025-03-27	ZJUKLAB at SemEval-2025 Task 4: Unlearning via Model Merging	Haoming Xu et.al.	2503.21088	link	Kimi
2062	2025-03-27	EQ-Negotiator: An Emotion-Reasoning LLM Agent in Credit Dialogues	Yuhan Liu et.al.	2503.21080	null	Kimi
2063	2025-03-27	Rerouting Connection: Hybrid Computer Vision Analysis Reveals Visual Similarity Between Indus and Tibetan-Yi Corridor Writing Systems	Ooha Lakkadi Reddy et.al.	2503.21074	link	Kimi
2064	2025-03-26	Can Large Language Models Predict Associations Among Human Attitudes?	Ana Ma et.al.	2503.21011	null	Kimi
2065	2025-03-26	VinaBench: Benchmark for Faithful and Consistent Visual Narratives	Silin Gao et.al.	2503.20871	null	Kimi
2066	2025-03-26	Understanding R1-Zero-Like Training: A Critical Perspective	Zichen Liu et.al.	2503.20783	link	Kimi
2067	2025-03-27	Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning	Huajie Tan et.al.	2503.20752	null	Kimi
2068	2025-03-26	Optimal Scaling Laws for Efficiency Gains in a Theoretical Transformer-Augmented Sectional MoE Framework	Soham Sane et.al.	2503.20750	null	Kimi
2069	2025-03-27	Beyond Believability: Accurate Human Behavior Simulation with Fine-Tuned LLMs	Yuxuan Lu et.al.	2503.20749	null	Kimi
2070	2025-03-26	Vision as LoRA	Han Wang et.al.	2503.20680	link	Kimi
2071	2025-03-26	TAMA: A Human-AI Collaborative Thematic Analysis Framework Using Multi-Agent LLMs for Clinical Interviews	Huimin Xu et.al.	2503.20666	null	Kimi
2072	2025-03-26	Collaborative Storytelling and LLM: A Linguistic Analysis of Automatically-Generated Role-Playing Game Sessions	Alessandro Maisto et.al.	2503.20623	null	Kimi
2073	2025-03-26	Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention Disaggregation	Yunkai Liang et.al.	2503.20552	link	Kimi
2074	2025-03-26	Accelerate Parallelizable Reasoning via Parallel Decoding within One Sequence	Yijiong Yu et.al.	2503.20533	link	Kimi
2075	2025-03-26	StableToolBench-MirrorAPI: Modeling Tool Environments as Mirrors of 7,000+ Real-World APIs	Zhicheng Guo et.al.	2503.20527	link	Kimi
2076	2025-03-26	From Trial to Triumph: Advancing Long Video Understanding via Visual Context Sample Scaling and Self-reward Alignment	Yucheng Suo et.al.	2503.20472	null	Kimi
2077	2025-03-26	MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation	Rongyu Zhang et.al.	2503.20384	null	Kimi
2078	2025-03-26	VideoGEM: Training-free Action Grounding in Videos	Felix Vogel et.al.	2503.20348	null	Kimi
2079	2025-03-26	Iterative Prompting with Persuasion Skills in Jailbreaking Large Language Models	Shih-Wen Ke et.al.	2503.20320	null	Kimi
2080	2025-03-26	QualiSpeech: A Speech Quality Assessment Dataset with Natural Language Reasoning and Descriptions	Siyin Wang et.al.	2503.20290	null	Kimi
2081	2025-03-26	sudo rm -rf agentic_security	Sejin Lee et.al.	2503.20279	link	Kimi
2082	2025-03-26	ViLBench: A Suite for Vision-Language Process Reward Modeling	Haoqin Tu et.al.	2503.20271	null	Kimi
2083	2025-03-26	Qwen2.5-Omni Technical Report	Jin Xu et.al.	2503.20215	null	Kimi
2084	2025-03-26	SARGes: Semantically Aligned Reliable Gesture Generation via Intent Chain	Nan Gao et.al.	2503.20202	null	Kimi
2085	2025-03-26	Open Deep Search: Democratizing Search with Open-source Reasoning Agents	Salaheddin Alzubi et.al.	2503.20201	link	Kimi
2086	2025-03-25	Can Multi-modal (reasoning) LLMs work as deepfake detectors?	Simiao Ren et.al.	2503.20084	null	Kimi
2087	2025-03-25	Cross-Tokenizer Distillation via Approximate Likelihood Matching	Benjamin Minixhofer et.al.	2503.20083	link	Kimi
2088	2025-03-25	OmniNova:A General Multimodal Agent Framework	Pengfei Du et.al.	2503.20028	null	Kimi
2089	2025-03-25	ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback	Bohan Zhai et.al.	2503.19988	link	Kimi
2090	2025-03-25	LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation	Han Chen et.al.	2503.19950	link	Kimi
2091	2025-03-25	CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning	Hao Yu et.al.	2503.19900	link	Kimi
2092	2025-03-25	Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking	Xiaoyu Tian et.al.	2503.19855	null	Kimi
2093	2025-03-25	FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs	Carlos Plou et.al.	2503.19850	null	Kimi
2094	2025-03-25	A Comparative Analysis of Word Segmentation, Part-of-Speech Tagging, and Named Entity Recognition for Historical Chinese Sources, 1900-1950	Zhao Fang et.al.	2503.19844	null	Kimi
2095	2025-03-25	PAVE: Patching and Adapting Video Large Language Models	Zhuoming Liu et.al.	2503.19794	link	Kimi
2096	2025-03-25	Gemma 3 Technical Report	Gemma Team et.al.	2503.19786	null	Kimi
2097	2025-03-25	AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation	Itay Nakash et.al.	2503.19693	link	Kimi
2098	2025-03-25	1.4 Million Open-Source Distilled Reasoning Dataset to Empower Large Language Model Training	Han Zhao et.al.	2503.19633	null	Kimi
2099	2025-03-25	Innate Reasoning is Not Enough: In-Context Learning Enhances Reasoning Large Language Models with Less Overthinking	Yuyao Ge et.al.	2503.19602	null	Kimi
2100	2025-03-25	Scaling Laws of Synthetic Data for Language Models	Zeyu Qin et.al.	2503.19551	null	Kimi
2101	2025-03-25	FLEX: A Benchmark for Evaluating Robustness of Fairness in Large Language Models	Dahyun Jung et.al.	2503.19540	link	Kimi
2102	2025-03-25	ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning	Mingyang Chen et.al.	2503.19470	null	Kimi
2103	2025-03-25	DeCAP: Context-Adaptive Prompt Generation for Debiasing Zero-shot Question Answering in Large Language Models	Suyoung Bae et.al.	2503.19426	null	Kimi
2104	2025-03-25	Process or Result? Manipulated Ending Tokens Can Mislead Reasoning LLMs to Ignore the Correct Reasoning Steps	Yu Cui et.al.	2503.19326	null	Kimi
2105	2025-03-25	Long-Context Autoregressive Video Modeling with Next-Frame Prediction	Yuchao Gu et.al.	2503.19325	link	Kimi
2106	2025-03-25	Context-Aware Semantic Segmentation: Enhancing Pixel-Level Understanding with Large Language Models for Advanced Vision Applications	Ben Rahman et.al.	2503.19276	null	Kimi
2107	2025-03-25	MARS: Memory-Enhanced Agents with Reflective Self-improvement	Xuechen Liang et.al.	2503.19271	null	Kimi
2108	2025-03-25	Linguistic Blind Spots of Large Language Models	Jiali Cheng et.al.	2503.19260	null	Kimi
2109	2025-03-25	SCI-IDEA: Context-Aware Scientific Ideation Using Token and Sentence Embeddings	Farhana Keya et.al.	2503.19257	null	Kimi
2110	2025-03-24	A Survey of Large Language Model Agents for Question Answering	Murong Yue et.al.	2503.19213	null	Kimi
2111	2025-03-24	Overtrained Language Models Are Harder to Fine-Tune	Jacob Mitchell Springer et.al.	2503.19206	null	Kimi
2112	2025-03-24	Language Model Uncertainty Quantification with Attention Chain	Yinghao Li et.al.	2503.19168	link	Kimi
2113	2025-03-24	LLM-Based Insight Extraction for Contact Center Analytics and Cost-Efficient Deployment	Varsha Embar et.al.	2503.19090	null	Kimi
2114	2025-03-24	Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization	Zhanda Zhu et.al.	2503.19050	link	Kimi
2115	2025-03-24	LookAhead Tuning: Safer Language Models via Partial Answer Previews	Kangwei Liu et.al.	2503.19041	link	Kimi
2116	2025-03-24	Exploring Training and Inference Scaling Laws in Generative Retrieval	Hongru Cai et.al.	2503.18941	link	Kimi
2117	2025-03-24	xKV: Cross-Layer SVD for KV-Cache Compression	Chi-Chih Chang et.al.	2503.18893	link	Kimi
2118	2025-03-24	SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild	Weihao Zeng et.al.	2503.18892	null	Kimi
2119	2025-03-24	AgentDropout: Dynamic Agent Elimination for Token-Efficient and High-Performance LLM-Based Multi-Agent Collaboration	Zhexuan Wang et.al.	2503.18891	link	Kimi
2120	2025-03-24	I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders	Andrey Galichin et.al.	2503.18878	link	Kimi
2121	2025-03-24	EconEvals: Benchmarks and Litmus Tests for LLM Agents in Unknown Environments	Sara Fish et.al.	2503.18825	null	Kimi
2122	2025-03-24	REALM: A Dataset of Real-World LLM Use Cases	Jingwen Cheng et.al.	2503.18792	null	Kimi
2123	2025-03-24	BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV Cache	Dayou Du et.al.	2503.18773	link	Kimi
2124	2025-03-24	AlphaSpace: Enabling Robotic Actions through Semantic Tokenization and Symbolic Reasoning	Alan Dao et.al.	2503.18769	null	Kimi
2125	2025-03-24	Commander-GPT: Fully Unleashing the Sarcasm Detection Capability of Multi-Modal Large Language Models	Yazhou Zhang et.al.	2503.18681	null	Kimi
2126	2025-03-24	Distil-xLSTM: Learning Attention Mechanisms through Recurrent Structures	Abdoul Majid O. Thiombiano et.al.	2503.18565	null	Kimi
2127	2025-03-24	Self-Reported Confidence of Large Language Models in Gastroenterology: Analysis of Commercial, Open-Source, and Quantized Models	Nariman Naderi et.al.	2503.18562	null	Kimi
2128	2025-03-24	Instruction-Aligned Visual Attention for Mitigating Hallucinations in Large Vision-Language Models	Bin Li et.al.	2503.18556	null	Kimi
2129	2025-03-24	SciClaims: An End-to-End Generative System for Biomedical Claim Analysis	Raúl Ortega et.al.	2503.18526	null	Kimi
2130	2025-03-24	Verbal Process Supervision Elicits Better Coding Agents	Hao-Yuan Chen et.al.	2503.18494	null	Kimi
2131	2025-03-24	Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding	Xiangrui Liu et.al.	2503.18478	null	Kimi
2132	2025-03-24	A Simple yet Effective Layout Token in Large Language Models for Document Understanding	Zhaoqing Zhu et.al.	2503.18434	null	Kimi
2133	2025-03-24	Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning	Junsong Li et.al.	2503.18432	null	Kimi
2134	2025-03-24	Breaking the Encoder Barrier for Seamless Video-Language Understanding	Handong Li et.al.	2503.18422	null	Kimi
2135	2025-03-24	J&H: Evaluating the Robustness of Large Language Models Under Knowledge-Injection Attacks in Legal Domain	Yiran Hu et.al.	2503.18360	link	Kimi
2136	2025-03-24	Bridging Writing Manner Gap in Visual Instruction Tuning by Creating LLM-aligned Instructions	Dong Jing et.al.	2503.18320	null	Kimi
2137	2025-03-24	Jenga: Effective Memory Management for Serving LLM with Heterogeneity	Chen Zhang et.al.	2503.18292	null	Kimi
2138	2025-03-24	Sun-Shine: A Large Language Model for Tibetan Culture	Cheng Huang et.al.	2503.18288	link	Kimi
2139	2025-03-24	TopV: Compatible Token Pruning with Inference Time Optimization for Fast and Low-Memory Multimodal Vision Language Model	Cheng Yang et.al.	2503.18278	null	Kimi
2140	2025-03-24	Bridging Emotions and Architecture: Sentiment Analysis in Modern Distributed Systems	Mahak Shah et.al.	2503.18260	null	Kimi
2141	2025-03-23	ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices	Aneesh Vathul et.al.	2503.18242	null	Kimi
2142	2025-03-23	Unmasking Deceptive Visuals: Benchmarking Multimodal Large Language Models on Misleading Chart Question Answering	Zixin Chen et.al.	2503.18172	null	Kimi
2143	2025-03-23	LocDiffusion: Identifying Locations on Earth by Diffusing in the Hilbert Space	Zhangyu Wang et.al.	2503.18142	null	Kimi
2144	2025-03-23	AGIR: Assessing 3D Gait Impairment with Reasoning based on LLMs	Diwei Wang et.al.	2503.18141	null	Kimi
2145	2025-03-23	GeoBenchX: Benchmarking LLMs for Multistep Geospatial Tasks	Varvara Krechetova et.al.	2503.18129	link	Kimi
2146	2025-03-20	Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation	Yuqing Wang et.al.	2503.16430	null	Kimi
2147	2025-03-20	XAttention: Block Sparse Attention with Antidiagonal Scoring	Ruyi Xu et.al.	2503.16428	link	Kimi
2148	2025-03-20	DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding	Keyan Chen et.al.	2503.16426	link	Kimi
2149	2025-03-20	Tokenize Image as a Set	Zigang Geng et.al.	2503.16425	link	Kimi
2150	2025-03-20	1000+ FPS 4D Gaussian Splatting for Dynamic Scene Rendering	Yuheng Yuan et.al.	2503.16422	null	Kimi
2151	2025-03-20	Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models	Yang Sui et.al.	2503.16419	link	Kimi
2152	2025-03-20	Survey on Evaluation of LLM-based Agents	Asaf Yehudai et.al.	2503.16416	null	Kimi
2153	2025-03-20	M3: 3D-Spatial MultiModal Memory	Xueyan Zou et.al.	2503.16413	link	Kimi
2154	2025-03-20	RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints	Yiran Qin et.al.	2503.16408	null	Kimi
2155	2025-03-20	The Emperor’s New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination	Yifan Sun et.al.	2503.16402	link	Kimi
2156	2025-03-20	SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation	Chun-Han Yao et.al.	2503.16396	null	Kimi
2157	2025-03-20	Do Visual Imaginations Improve Vision-and-Language Navigation Agents?	Akhil Perincherry et.al.	2503.16394	null	Kimi
2158	2025-03-20	Attentional Triple-Encoder Network in Spatiospectral Domains for Medical Image Segmentation	Kristin Qi et.al.	2503.16389	null	Kimi
2159	2025-03-20	Deconstructing Long Chain-of-Thought: A Structured Reasoning Optimization Framework for Long CoT Distillation	Yijia Luo et.al.	2503.16385	link	Kimi
2160	2025-03-20	LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images	Leyang Wang et.al.	2503.16376	null	Kimi
2161	2025-03-20	NuiScene: Exploring Efficient Generation of Unbounded Outdoor Scenes	Han-Hung Lee et.al.	2503.16375	link	Kimi
2162	2025-03-20	JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse	Muyao Li et.al.	2503.16365	null	Kimi
2163	2025-03-20	Neural Networks: According to the Principles of Grassmann Algebra	Z. Zarezadeh et.al.	2503.16364	null	Kimi
2164	2025-03-20	CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners	Yunzhi Yao et.al.	2503.16356	link	Kimi
2165	2025-03-20	Enhancing Software Quality Assurance with an Adaptive Differential Evolution based Quantum Variational Autoencoder-Transformer Model	Seshu Babu Barma et.al.	2503.16335	null	Kimi
2166	2025-03-20	LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates	Ying Shen et.al.	2503.16334	null	Kimi
2167	2025-03-20	OmniGeo: Towards a Multimodal Large Language Models for Geospatial Artificial Intelligence	Long Yuan et.al.	2503.16326	null	Kimi
2168	2025-03-20	Bridging Technology and Humanities: Evaluating the Impact of Large Language Models on Social Sciences Research with DeepSeek-R1	Peiran Gu et.al.	2503.16304	null	Kimi
2169	2025-03-20	Unleashing Vecset Diffusion Model for Fast Shape Generation	Zeqiang Lai et.al.	2503.16302	link	Kimi
2170	2025-03-20	PSA-MIL: A Probabilistic Spatial Attention-Based Multiple Instance Learning for Whole Slide Image Classification	Sharon Peled et.al.	2503.16284	link	Kimi
2171	2025-03-20	Chain of Functions: A Programmatic Pipeline for Fine-Grained Chart Reasoning Data	Zijian Li et.al.	2503.16260	null	Kimi
2172	2025-03-20	Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models	Keda Tao et.al.	2503.16257	null	Kimi
2173	2025-03-20	M2N2V2: Multi-Modal Unsupervised and Training-free Interactive Segmentation	Markus Karmann et.al.	2503.16254	null	Kimi
2174	2025-03-20	Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning	Zhaowei Liu et.al.	2503.16252	link	Kimi
2175	2025-03-20	AI Agents in Cryptoland: Practical Attacks and No Silver Bullet	Atharv Singh Patlan et.al.	2503.16248	null	Kimi
2176	2025-03-20	Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t	Quy-Anh Dang et.al.	2503.16219	link	Kimi
2177	2025-03-20	Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation	Andrea Maracani et.al.	2503.16184	null	Kimi
2178	2025-03-20	SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs	Shibo Jie et.al.	2503.16163	null	Kimi
2179	2025-03-20	Tuning LLMs by RAG Principles: Towards LLM-native Memory	Jiale Wei et.al.	2503.16071	link	Kimi
2180	2025-03-20	PromptHash: Affinity-Prompted Collaborative Cross-Modal Learning for Adaptive Hashing Retrieval	Qiang Zou et.al.	2503.16064	link	Kimi
2181	2025-03-20	Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts	Yike Yuan et.al.	2503.16057	null	Kimi
2182	2025-03-20	Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond	Yaoyao Yu et.al.	2503.16040	null	Kimi
2183	2025-03-20	Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models	Zhihang Liu et.al.	2503.16036	link	Kimi
2184	2025-03-20	The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement	Ruihan Yang et.al.	2503.16024	null	Kimi
2185	2025-03-20	Autonomous AI imitators increase diversity in homogeneous information ecosystems	Emil Bakkensen Johansen et.al.	2503.16021	null	Kimi
2186	2025-03-20	GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions	Xiaomeng Chu et.al.	2503.16013	null	Kimi
2187	2025-03-20	Adaptive Group Policy Optimization: Towards Stable Training and Token-Efficient Reasoning	Chen Li et.al.	2503.15952	null	Kimi
2188	2025-03-20	Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment	Gaole Dai et.al.	2503.15937	null	Kimi
2189	2025-03-20	SPIN: Accelerating Large Language Model Inference with Heterogeneous Speculative Models	Fahao Chen et.al.	2503.15921	null	Kimi
2190	2025-03-20	DeepPsy-Agent: A Stage-Aware and Deep-Thinking Emotional Support Agent System	Kai Chen et.al.	2503.15876	null	Kimi
2191	2025-03-20	MASH-VLM: Mitigating Action-Scene Hallucination in Video-LLMs through Disentangled Spatial-Temporal Representations	Kyungho Bae et.al.	2503.15871	null	Kimi
2192	2025-03-20	Uncertainty Quantification and Confidence Calibration in Large Language Models: A Survey	Xiaoou Liu et.al.	2503.15850	null	Kimi
2193	2025-03-20	Entropy-based Exploration Conduction for Multi-step Reasoning	Jinghan Zhang et.al.	2503.15848	null	Kimi
2194	2025-03-20	Grammar and Gameplay-aligned RL for Game Description Generation with LLMs	Tsunehiko Tanaka et.al.	2503.15783	null	Kimi
2195	2025-03-19	UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction	Shravan Nayak et.al.	2503.15661	null	Kimi
2196	2025-03-19	LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning	Federico Cocchi et.al.	2503.15621	link	Kimi
2197	2025-03-19	Dynamic Bi-Elman Attention Networks (DBEAN): Dual-Directional Context-Aware Representation Learning for Enhanced Text Classification	ZhengLin Lai et.al.	2503.15469	link	Kimi
2198	2025-03-19	SemEval-2025 Task 1: AdMIRe – Advancing Multimodal Idiomaticity Representation	Thomas Pickard et.al.	2503.15358	null	Kimi
2199	2025-03-19	MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration	David Wan et.al.	2503.15272	null	Kimi
2200	2025-03-19	Do Chains-of-Thoughts of Large Language Models Suffer from Hallucinations, Cognitive Biases, or Phobias in Bayesian Reasoning?	Roberto Araya et.al.	2503.15268	null	Kimi
2201	2025-03-19	Efficient allocation of image recognition and LLM tasks on multi-GPU system	Marcin Lawenda et.al.	2503.15252	null	Kimi
2202	2025-03-19	Automated Non-Functional Requirements Generation in Software Engineering with Large Language Models: A Comparative Study	Jomar Thomas Almonte et.al.	2503.15248	null	Kimi
2203	2025-03-19	BigO(Bench) – Can LLMs Generate Code with Controlled Time and Space Complexity?	Pierre Chambon et.al.	2503.15242	link	Kimi
2204	2025-03-19	Exploring Large Language Models for Word Games:Who is the Spy?	Chentian Wei et.al.	2503.15235	link	Kimi
2205	2025-03-19	CoE: Chain-of-Explanation via Automatic Visual Concept Circuit Description and Polysemanticity Quantification	Wenlong Yu et.al.	2503.15234	link	Kimi
2206	2025-03-19	A Review on Large Language Models for Visual Analytics	Navya Sonal Agarwal et.al.	2503.15176	null	Kimi
2207	2025-03-19	Machine Unlearning in Hyperbolic vs. Euclidean Multimodal Contrastive Learning: Adapting Alignment Calibration to MERU	Àlex Pujol Vidal et.al.	2503.15166	null	Kimi
2208	2025-03-19	VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making	Mohamed Salim Aissi et.al.	2503.15108	null	Kimi
2209	2025-03-19	Towards Understanding the Safety Boundaries of DeepSeek Models: Evaluation and Findings	Zonghao Ying et.al.	2503.15092	link	Kimi
2210	2025-03-19	Prada: Black-Box LLM Adaptation with Private Data on Resource-Constrained Devices	Ziyao Wang et.al.	2503.14932	null	Kimi
2211	2025-03-19	MASS: Mathematical Data Selection via Skill Graphs for Pretraining Large Language Models	Jiazheng Li et.al.	2503.14917	null	Kimi
2212	2025-03-19	Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations	Shuo Li et.al.	2503.14895	null	Kimi
2213	2025-03-19	MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer	Honglin Lin et.al.	2503.14891	link	Kimi
2214	2025-03-19	Communication-Efficient Distributed On-Device LLM Inference Over Wireless Networks	Kai Zhang et.al.	2503.14882	null	Kimi
2215	2025-03-19	Exploring the Limits of KV Cache Compression in Visual Autoregressive Transformers	Bo Chen et.al.	2503.14881	null	Kimi
2216	2025-03-19	LogLLaMA: Transformer-based log anomaly detection with LLaMA	Zhuoyi Yang et.al.	2503.14849	null	Kimi
2217	2025-03-18	RAGO: Systematic Performance Optimization for Retrieval-Augmented Generation Serving	Wenqi Jiang et.al.	2503.14649	null	Kimi
2218	2025-03-18	Dynamic Accumulated Attention Map for Interpreting Evolution of Decision-Making in Vision Transformer	Yi Liao et.al.	2503.14640	link	Kimi
2219	2025-03-18	Assessing Large Language Models for Automated Feedback Generation in Learning Programming Problem Solving	Priscylla Silva et.al.	2503.14630	link	Kimi
2220	2025-03-18	Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives	Sara Sarto et.al.	2503.14604	link	Kimi
2221	2025-03-19	State Space Model Meets Transformer: A New Paradigm for 3D Object Detection	Chuxin Wang et.al.	2503.14493	null	Kimi
2222	2025-03-18	DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers	Minglei Shi et.al.	2503.14487	null	Kimi
2223	2025-03-18	Gricean Norms as a Basis for Effective Collaboration	Fardin Saad et.al.	2503.14484	link	Kimi
2224	2025-03-18	LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers	Nikhil Abhyankar et.al.	2503.14434	link	Kimi
2225	2025-03-18	PLAY2PROMPT: Zero-shot Tool Instruction Optimization for LLM Agents via Tool Play	Wei Fang et.al.	2503.14432	null	Kimi
2226	2025-03-18	VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation	Shoubin Yu et.al.	2503.14350	null	Kimi
2227	2025-03-18	DualToken: Towards Unifying Visual Understanding and Generation with Dual Visual Vocabularies	Wei Song et.al.	2503.14324	link	Kimi
2228	2025-03-18	DARS: Dynamic Action Re-Sampling to Enhance Coding Agent Performance by Adaptive Tree Traversal	Vaibhav Aggarwal et.al.	2503.14269	link	Kimi
2229	2025-03-18	Speculative Decoding for Verilog: Speed and Quality, All in One	Changran Xu et.al.	2503.14153	null	Kimi
2230	2025-03-18	Inference-Time Intervention in Large Language Models for Reliable Requirement Verification	Paul Darm et.al.	2503.14130	null	Kimi
2231	2025-03-18	Growing a Twig to Accelerate Large Vision-Language Models	Zhenwei Shao et.al.	2503.14075	null	Kimi
2232	2025-03-18	Fast Autoregressive Video Generation with Diagonal Decoding	Yang Ye et.al.	2503.14070	null	Kimi
2233	2025-03-18	Empowering Smaller Models: Tuning LLaMA and Gemma with Chain-of-Thought for Ukrainian Exam Tasks	Mykyta Syromiatnikov et.al.	2503.13988	link	Kimi
2234	2025-03-18	Improving LLM Video Understanding with 16 Frames Per Second	Yixuan Li et.al.	2503.13956	null	Kimi
2235	2025-03-18	ConSCompF: Consistency-focused Similarity Comparison Framework for Generative Large Language Models	Alexey Karev et.al.	2503.13923	null	Kimi
2236	2025-03-18	Automatic MILP Model Construction for Multi-Robot Task Allocation and Scheduling Based on Large Language Models	Mingming Peng et.al.	2503.13813	null	Kimi
2237	2025-03-18	LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation	Yang Zhou et.al.	2503.13794	null	Kimi
2238	2025-03-17	Mitigating KV Cache Competition to Enhance User Experience in LLM Inference	Haiying Shen et.al.	2503.13773	null	Kimi
2239	2025-03-17	Do Large Language Models Understand Performance Optimization?	Bowen Cui et.al.	2503.13772	null	Kimi
2240	2025-03-17	MetaScale: Test-Time Scaling with Evolving Meta-Thoughts	Qin Liu et.al.	2503.13447	null	Kimi
2241	2025-03-17	VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning	Ye Liu et.al.	2503.13444	link	Kimi
2242	2025-03-17	xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference	Maximilian Beck et.al.	2503.13427	link	Kimi
2243	2025-03-17	MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research	James Burgess et.al.	2503.13399	link	Kimi
2244	2025-03-17	Cream of the Crop: Harvesting Rich, Scalable and Transferable Multi-Modal Data for Instruction Fine-Tuning	Mengyao Lyu et.al.	2503.13383	null	Kimi
2245	2025-03-17	TimeZero: Temporal Video Grounding with Reasoning-Guided LVLM	Ye Wang et.al.	2503.13377	link	Kimi
2246	2025-03-17	Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning	Hai-Long Sun et.al.	2503.13360	null	Kimi
2247	2025-03-17	Computation Mechanism Behind LLM Position Generalization	Chi Han et.al.	2503.13305	null	Kimi
2248	2025-03-17	A Survey on Transformer Context Extension: Approaches and Evaluation	Yijun Liu et.al.	2503.13299	null	Kimi
2249	2025-03-17	$φ$ -Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation	Fangzhi Xu et.al.	2503.13288	link	Kimi
2250	2025-03-17	Knowledge-Aware Iterative Retrieval for Multi-Agent Systems	Seyoung Song et.al.	2503.13275	null	Kimi
2251	2025-03-17	Can Language Models Follow Multiple Turns of Entangled Instructions?	Chi Han et.al.	2503.13222	link	Kimi
2252	2025-03-17	Improving Complex Reasoning with Dynamic Prompt Corruption: A soft prompt Optimization Approach	Sinan Fan et.al.	2503.13208	null	Kimi
2253	2025-03-17	MAP: Evaluation and Multi-Agent Enhancement of Large Language Models for Inpatient Pathways	Zhen Chen et.al.	2503.13205	null	Kimi
2254	2025-03-17	Are LLMs (Really) Ideological? An IRT-based Analysis and Alignment Tool for Perceived Socio-Economic Bias in LLMs	Jasmin Wachter et.al.	2503.13149	null	Kimi
2255	2025-03-17	Logic-in-Frames: Dynamic Keyframe Search via Visual Semantic-Logical Verification for Long Video Understanding	Weiyu Guo et.al.	2503.13139	null	Kimi
2256	2025-03-17	Lifting the Veil on Visual Information Flow in MLLMs: Unlocking Pathways to Faster Inference	Hao Yin et.al.	2503.13108	link	Kimi
2257	2025-03-17	ClearSight: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Models	Hao Yin et.al.	2503.13107	link	Kimi
2258	2025-03-17	A Multi-Stage Framework with Taxonomy-Guided Reasoning for Occupation Classification Using Large Language Models	Palakorn Achananuparp et.al.	2503.12989	null	Kimi
2259	2025-03-17	ROMA: a Read-Only-Memory-based Accelerator for QLoRA-based On-Device LLM	Wenqiang Wang et.al.	2503.12988	null	Kimi
2260	2025-03-17	R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization	Jingyi Zhang et.al.	2503.12937	link	Kimi
2261	2025-03-17	HICD: Hallucination-Inducing via Attention Dispersion for Contrastive Decoding to Mitigate Hallucinations in Large Language Models	Xinyan Jiang et.al.	2503.12908	link	Kimi
2262	2025-03-17	VITED: Video Temporal Evidence Distillation	Yujie Lu et.al.	2503.12855	null	Kimi
2263	2025-03-17	ACT360: An Efficient 360-Degree Action Detection and Summarization Framework for Mission-Critical Training and Debriefing	Aditi Tiwari et.al.	2503.12852	null	Kimi
2264	2025-03-17	Grounded Chain-of-Thought for Multimodal Large Language Models	Qiong Wu et.al.	2503.12799	link	Kimi
2265	2025-03-17	DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding	Xinyu Ma et.al.	2503.12797	link	Kimi
2266	2025-03-17	Identifying Cooperative Personalities in Multi-agent Contexts through Personality Steering with Representation Engineering	Kenneth J. K. Ong et.al.	2503.12722	null	Kimi
2267	2025-03-17	Can Reasoning Models Reason about Hardware? An Agentic HLS Perspective	Luca Collini et.al.	2503.12721	null	Kimi
2268	2025-03-16	Plausibility Vaccine: Injecting LLM Knowledge for Event Plausibility	Jacob Chmura et.al.	2503.12667	null	Kimi
2269	2025-03-16	VeriLA: A Human-Centered Evaluation Framework for Interpretable Verification of LLM Agent Failures	Yoo Yeon Sung et.al.	2503.12651	null	Kimi
2270	2025-03-16	MAVEN: Multi-modal Attention for Valence-Arousal Emotion Network	Vrushank Ahire et.al.	2503.12623	link	Kimi
2271	2025-03-16	MoECollab: Democratizing LLM Development Through Collaborative Mixture of Experts	Harshit et.al.	2503.12592	null	Kimi
2272	2025-03-16	AdaReTaKe: Adaptive Redundancy Reduction to Perceive Longer for Video-language Understanding	Xiao Wang et.al.	2503.12559	link	Kimi
2273	2025-03-14	TreeMeshGPT: Artistic Mesh Generation with Autoregressive Tree Sequencing	Stefan Lionar et.al.	2503.11629	link	Kimi
2274	2025-03-14	ASMA-Tune: Unlocking LLMs’ Assembly Code Comprehension via Structural-Semantic Instruction Tuning	Xinyi Wang et.al.	2503.11617	link	Kimi
2275	2025-03-14	Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs using Semantic Space	Zhiliang Chen et.al.	2503.11586	link	Kimi
2276	2025-03-14	Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers	Weiming Ren et.al.	2503.11579	null	Kimi
2277	2025-03-14	Implicit Bias-Like Patterns in Reasoning Models	Messi H. J. Lee et.al.	2503.11572	null	Kimi
2278	2025-03-14	Similarity-Aware Token Pruning: Your VLM but Faster	Ahmadreza Jeddi et.al.	2503.11549	link	Kimi
2279	2025-03-14	HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models	Ziqin Zhou et.al.	2503.11513	null	Kimi
2280	2025-03-14	V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning	Zixu Cheng et.al.	2503.11495	null	Kimi
2281	2025-03-14	Integrating LLMs in Gamified Systems	Carlos J. Costa et.al.	2503.11458	null	Kimi
2282	2025-03-14	Cerebrum (AIOS SDK): A Platform for Agent Development, Deployment, Distribution, and Discovery	Balaji Rama et.al.	2503.11444	link	Kimi
2283	2025-03-14	Text Compression for Efficient Language Generation	David Gu et.al.	2503.11426	null	Kimi
2284	2025-03-14	Optimizing Large Language Models for Detecting Symptoms of Comorbid Depression or Anxiety in Chronic Diseases: Insights from Patient Messages	Jiyeong Kim et.al.	2503.11384	null	Kimi
2285	2025-03-14	Annotating Scientific Uncertainty: A comprehensive model using linguistic patterns and comparison with existing approaches	Panggih Kusuma Ningrum et.al.	2503.11376	null	Kimi
2286	2025-03-14	AIstorian lets AI be a historian: A KG-powered multi-agent system for accurate biography generation	Fengyu Li et.al.	2503.11346	link	Kimi
2287	2025-03-14	Rule-Guided Feedback: Enhancing Reasoning by Enforcing Rule Adherence in Large Language Models	Aissatou Diallo et.al.	2503.11336	null	Kimi
2288	2025-03-14	Safe-VAR: Safe Visual Autoregressive Model for Text-to-Image Generative Watermarking	Ziyi Wang et.al.	2503.11324	null	Kimi
2289	2025-03-14	MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens	Jeong Hun Yeo et.al.	2503.11315	link	Kimi
2290	2025-03-14	Unlocking General Long Chain-of-Thought Reasoning Capabilities of Large Language Models via Representation Engineering	Xinyu Tang et.al.	2503.11314	link	Kimi
2291	2025-03-14	BriLLM: Brain-inspired Large Language Model	Hai Zhao et.al.	2503.11299	null	Kimi
2292	2025-03-14	Line of Duty: Evaluating LLM Self-Knowledge via Consistency in Feasibility Boundaries	Sahil Kale et.al.	2503.11256	link	Kimi
2293	2025-03-14	Reasoning-Grounded Natural Language Explanations for Language Models	Vojtech Cahlik et.al.	2503.11248	link	Kimi
2294	2025-03-14	Can Large Reasoning Models do Analogical Reasoning under Perceptual Uncertainty?	Giacomo Camposampiero et.al.	2503.11207	link	Kimi
2295	2025-03-14	LLaVA-MLB: Mitigating and Leveraging Attention Bias for Training-Free Video LLMs	Leqi Shen et.al.	2503.11205	null	Kimi
2296	2025-03-14	Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering	Gang Li et.al.	2503.11197	link	Kimi
2297	2025-03-14	FastVID: Dynamic Density Pruning for Fast Video Large Language Models	Leqi Shen et.al.	2503.11187	link	Kimi
2298	2025-03-14	Towards Extreme Pruning of LLMs with Plug-and-Play Mixed Sparsity	Chi Xu et.al.	2503.11164	null	Kimi
2299	2025-03-14	Don’t Take Things Out of Context: Attention Intervention for Enhancing Chain-of-Thought Reasoning in Large Language Models	Shaotian Yan et.al.	2503.11154	null	Kimi
2300	2025-03-14	MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling	Rachel S. Y. Teo et.al.	2503.11144	link	Kimi
2301	2025-03-14	X-EcoMLA: Upcycling Pre-Trained Attention into MLA for Efficient and Extreme KV Compression	Guihong Li et.al.	2503.11132	null	Kimi
2302	2025-03-14	Direction-Aware Diagonal Autoregressive Image Generation	Yijia Xu et.al.	2503.11129	null	Kimi
2303	2025-03-13	GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing	Rongyao Fang et.al.	2503.10639	link	Kimi
2304	2025-03-13	Kolmogorov-Arnold Attention: Is Learnable Attention Better For Vision Transformers?	Subhajit Maity et.al.	2503.10632	null	Kimi
2305	2025-03-13	SciVerse: Unveiling the Knowledge Comprehension and Visual Reasoning of LMMs on Multi-modal Scientific Problems	Ziyu Guo et.al.	2503.10627	null	Kimi
2306	2025-03-13	Transformers without Normalization	Jiachen Zhu et.al.	2503.10622	null	Kimi
2307	2025-03-13	Siege: Autonomous Multi-Turn Jailbreaking of Large Language Models with Tree Search	Andy Zhou et.al.	2503.10619	null	Kimi
2308	2025-03-13	Compositional Subspace Representation Fine-tuning for Adaptive Large Language Models	Andy Zhou et.al.	2503.10617	null	Kimi
2309	2025-03-13	TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention	Jinhao Duan et.al.	2503.10602	link	Kimi
2310	2025-03-13	Long Context Tuning for Video Generation	Yuwei Guo et.al.	2503.10589	null	Kimi
2311	2025-03-13	Autoregressive Image Generation with Randomized Parallel Decoding	Haopeng Li et.al.	2503.10568	link	Kimi
2312	2025-03-13	AudioX: Diffusion Transformer for Anything-to-Audio Generation	Zeyue Tian et.al.	2503.10522	null	Kimi
2313	2025-03-13	TokenCarve: Information-Preserving Visual Token Compression in Multimodal Large Language Models	Xudong Tan et.al.	2503.10501	link	Kimi
2314	2025-03-13	MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation	Weihao Xuan et.al.	2503.10497	null	Kimi
2315	2025-03-13	Source-primed Multi-turn Conversation Helps Large Language Models Translate Documents	Hanxu Hu et.al.	2503.10494	link	Kimi
2316	2025-03-13	LLMs in Disease Diagnosis: A Comparative Study of DeepSeek-R1 and O3 Mini Across Chronic Health Conditions	Gaurav Kumar Gupta et.al.	2503.10486	null	Kimi
2317	2025-03-13	DynaCode: A Dynamic Complexity-Aware Code Benchmark for Evaluating Large Language Models in Code Generation	Wenhao Hu et.al.	2503.10452	null	Kimi
2318	2025-03-13	4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models	Wanhua Li et.al.	2503.10437	link	Kimi
2319	2025-03-13	BeamLLM: Vision-Empowered mmWave Beam Prediction with Large Language Models	Can Zheng et.al.	2503.10432	null	Kimi
2320	2025-03-13	Understanding the Logical Capabilities of Large Language Models via Out-of-Context Representation Learning	Jonathan Shaki et.al.	2503.10408	null	Kimi
2321	2025-03-13	SPPO:Efficient Long-sequence LLM Training via Adaptive Sequence Pipeline Parallel Offloading	Qiaoling Chen et.al.	2503.10377	null	Kimi
2322	2025-03-13	G-Boost: Boosting Private SLMs with General LLMs	Yijiang Fan et.al.	2503.10367	null	Kimi
2323	2025-03-13	KV-Distill: Nearly Lossless Learnable Context Compression for LLMs	Vivek Chari et.al.	2503.10337	null	Kimi
2324	2025-03-13	Collaborative Speculative Inference for Efficient LLM Inference Serving	Luyao Gao et.al.	2503.10325	null	Kimi
2325	2025-03-13	VisualPRM: An Effective Process Reward Model for Multimodal Reasoning	Weiyun Wang et.al.	2503.10291	null	Kimi
2326	2025-03-13	Efficient Federated Fine-Tuning of Large Language Models with Layer Dropout	Shilong Wang et.al.	2503.10217	null	Kimi
2327	2025-03-13	LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents	Boyu Chen et.al.	2503.10200	null	Kimi
2328	2025-03-13	Robustness Tokens: Towards Adversarial Robustness of Transformers	Brian Pulfer et.al.	2503.10191	link	Kimi
2329	2025-03-13	Through the Magnifying Glass: Adaptive Perception Magnification for Hallucination-Free VLM Decoding	Shunqi Mao et.al.	2503.10183	null	Kimi
2330	2025-03-13	“Well, Keep Thinking”: Enhancing LLM Reasoning with Adaptive Injection Decoding	Hyunbin Jin et.al.	2503.10167	null	Kimi
2331	2025-03-13	ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning	Pengfei Luo et.al.	2503.10166	link	Kimi
2332	2025-03-13	Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding	Jinze Li et.al.	2503.10135	null	Kimi
2333	2025-03-11	QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension	Yongdong Luo et.al.	2503.08689	link	Kimi
2334	2025-03-11	CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous Driving	Changxing Liu et.al.	2503.08683	link	Kimi
2335	2025-03-11	Chain-of-Thought Reasoning In The Wild Is Not Always Faithful	Iván Arcuschin et.al.	2503.08679	link	Kimi
2336	2025-03-11	REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder	Yitian Zhang et.al.	2503.08665	null	Kimi
2337	2025-03-11	MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention	Yuhan Wang et.al.	2503.08664	link	Kimi
2338	2025-03-11	Exploring the Word Sense Disambiguation Capabilities of Large Language Models	Pierpaolo Basile et.al.	2503.08662	null	Kimi
2339	2025-03-11	Efficient Many-Shot In-Context Learning with Dynamic Block-Sparse Attention	Emily Xiao et.al.	2503.08640	link	Kimi
2340	2025-03-11	HiP-AD: Hierarchical and Multi-Granularity Planning with Deformable Attention for Autonomous Driving in a Single Decoder	Yingqi Tang et.al.	2503.08612	link	Kimi
2341	2025-03-11	Vision Transformer for Intracranial Hemorrhage Classification in CT Scans Using an Entropy-Aware Fuzzy Integral Strategy for Adaptive Scan-Level Decision Fusion	Mehdi Hosseini Chagahi et.al.	2503.08609	null	Kimi
2342	2025-03-11	Tuning-Free Multi-Event Long Video Generation via Synchronized Coupled Sampling	Subin Kim et.al.	2503.08605	null	Kimi
2343	2025-03-11	RAG-Adapter: A Plug-and-Play RAG-enhanced Framework for Long Video Understanding	Xichen Tan et.al.	2503.08576	null	Kimi
2344	2025-03-11	DeepReview: Improving LLM-based Paper Review with Human-like Deep Thinking Process	Minjun Zhu et.al.	2503.08569	null	Kimi
2345	2025-03-11	MoE-Loco: Mixture of Experts for Multitask Locomotion	Runhan Huang et.al.	2503.08564	null	Kimi
2346	2025-03-11	Reasoning and Sampling-Augmented MCQ Difficulty Prediction via LLMs	Wanyong Feng et.al.	2503.08551	null	Kimi
2347	2025-03-11	Graph of AI Ideas: Leveraging Knowledge Graphs and LLMs for AI Research Idea Generation	Xian Gao et.al.	2503.08549	null	Kimi
2348	2025-03-11	DAFE: LLM-Based Evaluation Through Dynamic Arbitration for Free-Form Question-Answering	Sher Badshah et.al.	2503.08542	null	Kimi
2349	2025-03-11	Mellow: a small audio language model for reasoning	Soham Deshmukh et.al.	2503.08540	link	Kimi
2350	2025-03-11	Chemical reasoning in LLMs unlocks steerable synthesis planning and reaction mechanism elucidation	Andres M Bran et.al.	2503.08537	link	Kimi
2351	2025-03-11	ChromaFormer: A Scalable and Accurate Transformer Architecture for Land Cover Classification	Mingshi Li et.al.	2503.08534	null	Kimi
2352	2025-03-11	Visual Attention Graph	Kai-Fu Yang et.al.	2503.08531	null	Kimi
2353	2025-03-11	Position-Aware Depth Decay Decoding ( $D^3$ ): Boosting Large Language Model Inference Efficiency	Siqi Fan et.al.	2503.08524	null	Kimi
2354	2025-03-11	Enhancing Multi-Hop Fact Verification with Structured Knowledge-Augmented Large Language Models	Han Cao et.al.	2503.08495	null	Kimi
2355	2025-03-11	Accelerating MoE Model Inference with Expert Sharding	Oana Balmau et.al.	2503.08467	null	Kimi
2356	2025-03-11	FastCache: Optimizing Multimodal LLM Serving through Lightweight KV-Cache Compression Framework	Jianian Zhu et.al.	2503.08461	null	Kimi
2357	2025-03-11	Controlling Latent Diffusion Using Latent CLIP	Jason Becker et.al.	2503.08455	link	Kimi
2358	2025-03-11	TokenSim: Enabling Hardware and Software Exploration for Large Language Model Inference Systems	Feiyang Wu et.al.	2503.08415	link	Kimi
2359	2025-03-11	Fact-checking with Generative AI: A Systematic Cross-Topic Examination of LLMs Capacity to Detect Veracity of Political Information	Elizaveta Kuznetsova et.al.	2503.08404	null	Kimi
2360	2025-03-11	Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens	Qingsong Xie et.al.	2503.08377	null	Kimi
2361	2025-03-11	Robust Latent Matters: Boosting Image Generation with Sampling Error	Kai Qiu et.al.	2503.08354	link	Kimi
2362	2025-03-11	Attention Reallocation: Towards Zero-cost and Controllable Hallucination Mitigation of MLLMs	Chongjun Tu et.al.	2503.08342	null	Kimi
2363	2025-03-10	Securing External Deeper-than-black-box GPAI Evaluations	Alejandro Tlaie et.al.	2503.07496	null	Kimi
2364	2025-03-10	V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation	Guiwei Zhang et.al.	2503.07493	link	Kimi
2365	2025-03-10	Destination Calculus: A Linear λ-Calculus for Purely Functional Memory Writes	Thomas Bagrel et.al.	2503.07489	link	Kimi
2366	2025-03-10	LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition?	Bangyan Li et.al.	2503.07487	null	Kimi
2367	2025-03-10	Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction	Zongzheng Zhang et.al.	2503.07485	link	Kimi
2368	2025-03-10	VLRMBench: A Comprehensive and Challenging Benchmark for Vision-Language Reward Models	Jiacheng Ruan et.al.	2503.07478	link	Kimi
2369	2025-03-10	Petri Net Modeling of Root Hair Response to Phosphate Starvation in Arabidopsis Thaliana	Amber H. B. Fijn et.al.	2503.07477	null	Kimi
2370	2025-03-10	MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning	Xiangru Tang et.al.	2503.07459	link	Kimi
2371	2025-03-10	Open-Set Gait Recognition from Sparse mmWave Radar Point Clouds	Riccardo Mazzieri et.al.	2503.07435	link	Kimi
2372	2025-03-10	DRESS: Diffusion Reasoning-based Reward Shaping Scheme For Intelligent Networks	Feiran You et.al.	2503.07433	link	Kimi
2373	2025-03-10	CATPlan: Loss-based Collision Prediction in End-to-End Autonomous Driving	Ziliang Xiong et.al.	2503.07425	null	Kimi
2374	2025-03-10	Inorganic Catalyst Efficiency Prediction Based on EAPCR Model: A Deep Learning Solution for Multi-Source Heterogeneous Data	Zhangdi Liu et.al.	2503.07424	null	Kimi
2375	2025-03-10	AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion	Mingzhen Sun et.al.	2503.07418	null	Kimi
2376	2025-03-07	Task-oriented Uncertainty Collaborative Learning for Label-Efficient Brain Tumor Segmentation	Zhenxuan Zhang et.al.	2503.05682	link	Kimi
2377	2025-03-07	The latent variable proximal point algorithm for variational problems with inequality constraints	Jørgen S. Dokken et.al.	2503.05672	link	Kimi
2378	2025-03-07	Kinodynamic Model Predictive Control for Energy Efficient Locomotion of Legged Robots with Parallel Elasticity	Yulun Zhuang et.al.	2503.05666	null	Kimi
2379	2025-03-07	A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval	Yu Zhang et.al.	2503.05659	link	Kimi
2380	2025-03-07	Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning	Justin Chih-Yao Chen et.al.	2503.05641	null	Kimi
2381	2025-03-07	Exploring FMCW Radars and Feature Maps for Activity Recognition: A Benchmark Study	Ali Samimi Fard et.al.	2503.05629	null	Kimi
2382	2025-03-07	FMT:A Multimodal Pneumonia Detection Model Based on Stacking MOE Framework	Jingyu Xu et.al.	2503.05626	null	Kimi
2383	2025-03-07	A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models	Dong Shu et.al.	2503.05613	null	Kimi
2384	2025-03-07	D2GV: Deformable 2D Gaussian Splatting for Video Representation in 400FPS	Mufan Liu et.al.	2503.05600	link	Kimi
2385	2025-03-07	R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning	Huatong Song et.al.	2503.05592	null	Kimi
2386	2025-03-06	L $^2$ M: Mutual Information Scaling Law for Long-Context Language Modeling	Zhuo Chen et.al.	2503.04725	link	Kimi
2387	2025-03-07	Shifting Long-Context LLMs Research from Input to Output	Yuhao Wu et.al.	2503.04723	null	Kimi
2388	2025-03-06	Enough Coin Flips Can Make LLMs Act Bayesian	Ritwik Gupta et.al.	2503.04722	null	Kimi
2389	2025-03-06	L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning	Pranjal Aggarwal et.al.	2503.04697	null	Kimi
2390	2025-03-06	UIPE: Enhancing LLM Unlearning by Removing Knowledge Related to Forgetting Targets	Wenyu Wang et.al.	2503.04693	null	Kimi
2391	2025-03-06	The Influence of Prior Discourse on Conversational Agent-Driven Decision-Making	Stephen Pilli et.al.	2503.04692	null	Kimi
2392	2025-03-06	Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases	Pengcheng Qiu et.al.	2503.04691	null	Kimi
2393	2025-03-07	DIMSUM: Discourse in Mathematical Reasoning as a Supervision Module	Krish Sharma et.al.	2503.04685	null	Kimi
2394	2025-03-06	Matrix Factorization for Inferring Associations and Missing Links	Ryan Barron et.al.	2503.04680	null	Kimi
2395	2025-03-06	LLM-guided Plan and Retrieval: A Strategic Alignment for Interpretable User Satisfaction Estimation in Dialogue	Sangyeop Kim et.al.	2503.04675	null	Kimi
2396	2025-03-05	PacketCLIP: Multi-Modal Embedding of Network Traffic and Language for Cybersecurity Reasoning	Ryozo Masukawa et.al.	2503.03747	null	Kimi
2397	2025-03-05	Process-based Self-Rewarding Language Models	Shimao Zhang et.al.	2503.03746	link	Kimi
2398	2025-03-05	Rethinking Deep Clustering Paradigms: Self-Supervision Is All You Need	Amal Shaheena et.al.	2503.03733	null	Kimi
2399	2025-03-05	Towards Understanding Distilled Reasoning Models: A Representational Approach	David D. Baek et.al.	2503.03730	null	Kimi
2400	2025-03-05	When Radiation Meets Linux: Analyzing Soft Errors in Linux on COTS SoCs under Proton Irradiation	Saad Memon et.al.	2503.03722	null	Kimi
2401	2025-03-05	Improving LLM Safety Alignment with Dual-Objective Optimization	Xuandong Zhao et.al.	2503.03710	link	Kimi
2402	2025-03-05	Rethinking Video Tokenization: A Conditioned Diffusion-based Approach	Nianzu Yang et.al.	2503.03708	link	Kimi
2403	2025-03-05	A Practical Memory Injection Attack against LLM Agents	Shen Dong et.al.	2503.03704	null	Kimi
2404	2025-03-05	ILLC: Iterative Layer-by-Layer Compression for Enhancing Structural Faithfulness in SpArX	Ungsik Kim et.al.	2503.03693	null	Kimi
2405	2025-03-05	DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance	Zhao Yang et.al.	2503.03689	link	Kimi
2406	2025-03-04	Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation	Han Xue et.al.	2503.02881	link	Kimi
2407	2025-03-04	Language Models can Self-Improve at State-Value Estimation for Better Search	Ethan Mendes et.al.	2503.02878	link	Kimi
2408	2025-03-04	Weak-to-Strong Generalization Even in Random Feature Networks, Provably	Marko Medvedev et.al.	2503.02877	null	Kimi
2409	2025-03-04	SPIDER: A Comprehensive Multi-Organ Supervised Pathology Dataset and Baseline Models	Dmitry Nechaev et.al.	2503.02876	link	Kimi
2410	2025-03-04	The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models	Ke Ji et.al.	2503.02875	null	Kimi
2411	2025-03-04	Calibrating LLM Confidence with Semantic Steering: A Multi-Prompt Aggregation Framework	Ziang Zhou et.al.	2503.02863	null	Kimi
2412	2025-03-04	PileUp Mitigation at the HL-LHC Using Attention for Event-Wide Context	Luke Vaughan et.al.	2503.02860	null	Kimi
2413	2025-03-04	Unsupervised Attributed Dynamic Network Embedding with Stability Guarantees	Emma Ceccherini et.al.	2503.02859	null	Kimi
2414	2025-03-04	Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs’ Decoding Layers	Zicong He et.al.	2503.02851	link	Kimi
2415	2025-03-04	Multimodal Deep Learning for Subtype Classification in Breast Cancer Using Histopathological Images and Gene Expression Data	Amin Honarmandi Shandiz et.al.	2503.02849	link	Kimi
2416	2025-02-28	LLM Post-Training: A Deep Dive into Reasoning Large Language Models	Komal Kumar et.al.	2502.21321	link	Kimi
2417	2025-02-28	*Doping dependence of 2-spinon excitations in the doped 1D cuprate Ba $2$CuO${3+δ}$*	Jiarui Li et.al.	2502.21316	null	Kimi
2418	2025-02-28	Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos	Zhiyu Tan et.al.	2502.21314	null	Kimi
2419	2025-02-28	FANformer: Improving Large Language Models Through Effective Periodicity Modeling	Yihong Dong et.al.	2502.21309	link	Kimi
2420	2025-02-28	Persuasion Should be Double-Blind: A Multi-Domain Dialogue Dataset With Faithfulness Based on Causal Theory of Mind	Dingyi Zhang et.al.	2502.21297	null	Kimi
2421	2025-02-28	Bilevel Optimized Implicit Neural Representation for Scan-Specific Accelerated MRI Reconstruction	Hongze Yu et.al.	2502.21292	null	Kimi
2422	2025-02-28	Contextualizing biological perturbation experiments through language	Menghua Wu et.al.	2502.21290	link	Kimi
2423	2025-02-28	Boosting Prediction with Data Missing Not at Random	Yuan Bian et.al.	2502.21276	null	Kimi
2424	2025-02-28	Adaptive Keyframe Sampling for Long Video Understanding	Xi Tang et.al.	2502.21271	null	Kimi
2425	2025-02-28	Dynamical Decoupling of Generalization and Overfitting in Large Two-Layer Networks	Andrea Montanari et.al.	2502.21269	null	Kimi
2426	2025-02-27	R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts	Zhongyang Li et.al.	2502.20395	link	Kimi
2427	2025-02-27	LIFT-GS: Cross-Scene Render-Supervised Distillation for 3D Language Grounding	Ang Cao et.al.	2502.20389	null	Kimi
2428	2025-02-27	InsTaG: Learning Personalized 3D Talking Head from Few-Second Video	Jiahe Li et.al.	2502.20387	link	Kimi
2429	2025-02-27	ATLAS Navigator: Active Task-driven LAnguage-embedded Gaussian Splatting	Dexter Ong et.al.	2502.20386	null	Kimi
2430	2025-02-27	rSPDE: tools for statistical modeling using fractional SPDEs	David Bolin et.al.	2502.20385	null	Kimi
2431	2025-02-27	PhantomWiki: On-Demand Datasets for Reasoning and Retrieval Evaluation	Albert Gong et.al.	2502.20377	link	Kimi
2432	2025-02-27	Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization	Ryan C. Barron et.al.	2502.20364	link	Kimi
2433	2025-02-27	Bridging the Creativity Understanding Gap: Small-Scale Human Alignment Enables Expert-Level Humor Ranking in LLMs	Kuan Lok Zhou et.al.	2502.20356	null	Kimi
2434	2025-02-27	Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners	Daniele Paliotta et.al.	2502.20339	null	Kimi
2435	2025-02-27	KeBaB: $k$ -mer based breaking for finding super-maximal exact matches	Nathaniel K. Brown et.al.	2502.20338	null	Kimi
2436	2025-02-26	Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models	Lucy Xiaoyang Shi et.al.	2502.19417	null	Kimi
2437	2025-02-26	Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation	Shiven Sinha et.al.	2502.19414	link	Kimi
2438	2025-02-26	The Mighty ToRR: A Benchmark for Table Reasoning and Robustness	Shir Ashury-Tahan et.al.	2502.19412	link	Kimi
2439	2025-02-26	Code to Think, Think to Code: A Survey on Code-Enhanced Reasoning and Reasoning-Driven Code Intelligence in LLMs	Dayu Yang et.al.	2502.19411	link	Kimi
2440	2025-02-26	ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models	Danae Sánchez Villegas et.al.	2502.19409	null	Kimi
2441	2025-02-26	Learning Code-Edit Embedding to Model Student Debugging Behavior	Hasnain Heickal et.al.	2502.19407	null	Kimi
2442	2025-02-26	Single-shot and two-shot decoding with generalized bicycle codes	Hsiang-Ku Lin et.al.	2502.19406	null	Kimi
2443	2025-02-26	General Reasoning Requires Learning to Reason from the Get-go	Seungwook Han et.al.	2502.19402	null	Kimi
2444	2025-02-26	TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding	Max Ku et.al.	2502.19400	null	Kimi
2445	2025-02-26	The End of Easy Phenomenology for CMB Experiments: A Case Study in the Dark Sector	Cynthia Trendafilova et.al.	2502.19383	null	Kimi
2446	2025-02-25	K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs	Ziheng Ouyang et.al.	2502.18461	null	Kimi
2447	2025-02-25	DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers	Xueguang Ma et.al.	2502.18460	link	Kimi
2448	2025-02-25	GHOST 2.0: generative high-fidelity one shot transfer of heads	Alexander Groshev et.al.	2502.18417	null	Kimi
2449	2025-02-25	Comparative Analysis of MDL-VAE vs. Standard VAE on 202 Years of Gynecological Data	Paula Santos et.al.	2502.18412	null	Kimi
2450	2025-02-25	The FFT Strikes Back: An Efficient Alternative to Self-Attention	Jacob Fein-Ashley et.al.	2502.18394	link	Kimi
2451	2025-02-25	ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation	Yifan Pu et.al.	2502.18364	null	Kimi
2452	2025-02-25	Graph Inference with Effective Resistance Queries	Huck Bennett et.al.	2502.18350	null	Kimi
2453	2025-02-25	Mapping of Subjective Accounts into Interpreted Clusters (MOSAIC): Topic Modelling and LLM applied to Stroboscopic Phenomenology	Romy Beauté et.al.	2502.18318	null	Kimi
2454	2025-02-25	RefuteBench 2.0 – Agentic Benchmark for Dynamic Evaluation of LLM Responses to Refutation Instruction	Jianhao Yan et.al.	2502.18308	null	Kimi
2455	2025-02-25	DeepCircuitX: A Comprehensive Repository-Level Dataset for RTL Code Understanding, Generation, and PPA Analysis	Zeju Li et.al.	2502.18297	null	Kimi
2456	2025-02-24	S4S: Solving for a Diffusion Model Solver	Eric Frankel et.al.	2502.17423	null	Kimi
2457	2025-02-24	MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs	Jiarui Zhang et.al.	2502.17422	link	Kimi
2458	2025-02-24	LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification	Penghui Yang et.al.	2502.17421	link	Kimi
2459	2025-02-24	Reasoning with Latent Thoughts: On the Power of Looped Transformers	Nikunj Saunshi et.al.	2502.17416	null	Kimi
2460	2025-02-24	X-Dancer: Expressive Music to Human Dance Video Generation	Zeyuan Chen et.al.	2502.17414	null	Kimi
2461	2025-02-24	Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning	Guijin Son et.al.	2502.17407	link	Kimi
2462	2025-02-24	Advances in multiparameter quantum sensing and metrology	Luca Pezzè et.al.	2502.17396	null	Kimi
2463	2025-02-24	The Empirical Impact of Reducing Symmetries on the Performance of Deep Ensembles and MoE	Andrei Chernov et.al.	2502.17391	null	Kimi
2464	2025-02-24	A Concise Lyapunov Analysis of Nesterov’s Accelerated Gradient Method	Jun Liu et.al.	2502.17373	null	Kimi
2465	2025-02-24	KV-Edit: Training-Free Image Editing for Precise Background Preservation	Tianrui Zhu et.al.	2502.17363	link	Kimi
2466	2025-02-21	Sparks of cognitive flexibility: self-guided context inference for flexible stimulus-response mapping by attentional routing	Rowan Sommers et.al.	2502.15634	null	Kimi
2467	2025-02-21	LaTIM: Measuring Latent Token-to-Token Interactions in Mamba Models	Hugo Pitorro et.al.	2502.15612	null	Kimi
2468	2025-02-21	Generalizing From Short to Long: Effective Data Synthesis for Long-Context Instruction Tuning	Wenhao Zhu et.al.	2502.15592	link	Kimi
2469	2025-02-21	LightThinker: Thinking Step-by-Step Compression	Jintian Zhang et.al.	2502.15589	null	Kimi
2470	2025-02-21	Adaptive Expansion for Hypergraph Learning	Tianyi Ma et.al.	2502.15564	null	Kimi
2471	2025-02-21	Estimating Vehicle Speed on Roadways Using RNNs and Transformers: A Video-based Approach	Sai Krishna Reddy Mareddy et.al.	2502.15545	null	Kimi
2472	2025-02-21	Generalization Guarantees for Representation Learning via Data-Dependent Gaussian Mixture Priors	Milad Sefidgaran et.al.	2502.15540	link	Kimi
2473	2025-02-21	Towards Swift Serverless LLM Cold Starts with ParaServe	Chiheng Lou et.al.	2502.15524	null	Kimi
2474	2025-02-21	Solving Inverse Problems with Deep Linear Neural Networks: Global Convergence Guarantees for Gradient Descent with Weight Decay	Hannah Laus et.al.	2502.15522	null	Kimi
2475	2025-02-21	Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection	Yue Sun et.al.	2502.15516	null	Kimi
2476	2025-02-20	LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention	Shang Yang et.al.	2502.14866	link	Kimi
2477	2025-02-20	CLIPPER: Compression enables long-context synthetic data generation	Chau Minh Pham et.al.	2502.14854	link	Kimi
2478	2025-02-20	Revealing and Mitigating Over-Attention in Knowledge Editing	Pinzheng Wang et.al.	2502.14838	link	Kimi
2479	2025-02-20	Towards Economical Inference: Enabling DeepSeek’s Multi-Head Latent Attention in Any Transformer-based LLMs	Tao Ji et.al.	2502.14837	link	Kimi
2480	2025-02-20	Improving the Diffusability of Autoencoders	Ivan Skorokhodov et.al.	2502.14831	null	Kimi
2481	2025-02-20	Measuring Faithfulness of Chains of Thought by Unlearning Reasoning Steps	Martin Tutek et.al.	2502.14829	link	Kimi
2482	2025-02-20	Turning on the Light: Polymorphism-Induced Photoluminescence in Cysteine Crystals	Debarshi Banerjee et.al.	2502.14826	null	Kimi
2483	2025-02-20	Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models	Vlad Sobal et.al.	2502.14819	null	Kimi
2484	2025-02-20	RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird’s Eye View Segmentation	Henrique Piñeiro Monteagudo et.al.	2502.14792	null	Kimi
2485	2025-02-20	Ray-Tracing for Conditionally Activated Neural Networks	Claudio Gallicchio et.al.	2502.14788	null	Kimi
2486	2025-02-20	LIFT: Improving Long Context Understanding of Large Language Models through Long Input Fine-Tuning	Yansheng Mao et.al.	2502.14644	null	Kimi
2487	2025-02-20	PEARL: Towards Permutation-Resilient LLMs	Liang Chen et.al.	2502.14628	link	Kimi
2488	2025-02-20	PLPHP: Per-Layer Per-Head Vision Token Pruning for Efficient Large Vision-Language Models	Yu Meng et.al.	2502.14504	null	Kimi
2489	2025-02-20	Unshackling Context Length: An Efficient Selective Attention Approach through Query-Key Compression	Haoyu Wang et.al.	2502.14477	null	Kimi
2490	2025-02-20	Early-Exit and Instant Confidence Translation Quality Estimation	Vilém Zouhar et.al.	2502.14429	link	Kimi
2491	2025-02-19	MuDAF: Long-Context Multi-Document Attention Focusing through Contrastive Learning on Attention Heads	Weihao Liu et.al.	2502.13963	link	Kimi
2492	2025-02-19	A Chain-of-Thought Subspace Meta-Learning for Few-shot Image Captioning with Large Vision and Language Models	Hao Huang et.al.	2502.13942	null	Kimi
2493	2025-02-19	Qwen2.5-VL Technical Report	Shuai Bai et.al.	2502.13923	null	Kimi
2494	2025-02-19	LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization	Guanzheng Chen et.al.	2502.13922	link	Kimi
2495	2025-02-19	A measurement-based approach to analyze the power consumption of the softwarized 5G core	Arturo Bellin et.al.	2502.13879	null	Kimi
2496	2025-02-19	SPEX: Scaling Feature Interaction Explanations for LLMs	Justin Singh Kang et.al.	2502.13870	link	Kimi
2497	2025-02-19	Enhancing LLM-Based Recommendations Through Personalized Reasoning	Jiahao Liu et.al.	2502.13845	link	Kimi
2498	2025-02-19	SCALAR: Scientific Citation-based Live Assessment of Long-context Academic Reasoning	Renxi Wang et.al.	2502.13753	link	Kimi
2499	2025-02-19	MoM: Linear Sequence Modeling with Mixture-of-Memories	Jusen Du et.al.	2502.13685	link	Kimi
2500	2025-02-19	PeerQA: A Scientific Question Answering Dataset from Peer Reviews	Tim Baumgärtner et.al.	2502.13668	link	Kimi
2501	2025-02-18	Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning	Jingyang Lin et.al.	2502.13127	null	Kimi
2502	2025-02-18	Eager Updates For Overlapped Communication and Computation in DiLoCo	Satyen Kale et.al.	2502.12996	null	Kimi
2503	2025-02-18	Infinite Retrieval: Attention Enhanced LLMs in Long-Context Processing	Xiaoju Ye et.al.	2502.12962	null	Kimi
2504	2025-02-18	Every Expert Matters: Towards Effective Knowledge Distillation for Mixture-of-Experts Language Models	Gyeongman Kim et.al.	2502.12947	null	Kimi
2505	2025-02-18	S $^2$ R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning	Ruotian Ma et.al.	2502.12853	link	Kimi
2506	2025-02-18	A $^2$ ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization	Junhui He et.al.	2502.12665	null	Kimi
2507	2025-02-18	MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation	Sihyun Yu et.al.	2502.12632	null	Kimi
2508	2025-02-18	Improving Chain-of-Thought Reasoning via Quasi-Symbolic Abstractions	Leonardo Ranaldi et.al.	2502.12616	null	Kimi
2509	2025-02-18	LongFaith: Enhancing Long-Context Reasoning in LLMs with Faithful Synthetic Data	Cehao Yang et.al.	2502.12583	link	Kimi
2510	2025-02-18	HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading	Cheng Luo et.al.	2502.12574	link	Kimi
2511	2025-02-17	Small Models Struggle to Learn from Strong Reasoners	Yuetai Li et.al.	2502.12143	null	Kimi
2512	2025-02-17	SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs	Yige Xu et.al.	2502.12134	link	Kimi
2513	2025-02-17	APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs	Yuxiang Huang et.al.	2502.12085	link	Kimi
2514	2025-02-17	AdaSplash: Adaptive Sparse Flash Attention	Nuno Gonçalves et.al.	2502.12082	link	Kimi
2515	2025-02-17	TokenSkip: Controllable Chain-of-Thought Compression in LLMs	Heming Xia et.al.	2502.12067	link	Kimi
2516	2025-02-17	SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities	Fengqing Jiang et.al.	2502.12025	null	Kimi
2517	2025-02-17	Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving	Xin Xu et.al.	2502.12022	null	Kimi
2518	2025-02-17	Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL	Hanbing Liu et.al.	2502.11656	link	Kimi
2519	2025-02-17	SurgPose: a Dataset for Articulated Robotic Surgical Tool Pose Estimation and Tracking	Zijian Wu et.al.	2502.11534	null	Kimi
2520	2025-02-17	AURORA:Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification	Xiaoyu Tan et.al.	2502.11520	null	Kimi
2521	2025-02-14	Are Large Language Models the future crowd workers of Linguistics?	Iris Ferrazzo et.al.	2502.10266	null	Kimi
2522	2025-02-14	LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs - No Silver Bullet for LC or RAG Routing	Kuan Li et.al.	2502.09977	null	Kimi
2523	2025-02-14	MIR-Bench: Benchmarking LLM’s Long-Context Intelligence via Many-Shot In-Context Inductive Reasoning	Kai Yan et.al.	2502.09933	null	Kimi
2524	2025-02-14	INF^2: High-Throughput Generative Inference of Large Language Models using Near-Storage Processing	Hongsun Jang et.al.	2502.09921	null	Kimi
2525	2025-02-13	ATM-Net: Adaptive Termination and Multi-Precision Neural Networks for Energy-Harvested Edge Intelligence	Neeraj Solanki et.al.	2502.09822	null	Kimi
2526	2025-02-13	NestQuant: Nested Lattice Quantization for Matrix Products and LLMs	Semyon Savkin et.al.	2502.09720	null	Kimi
2527	2025-02-13	MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency	Dongzhi Jiang et.al.	2502.09621	null	Kimi
2528	2025-02-13	CoT-Valve: Length-Compressible Chain-of-Thought Tuning	Xinyin Ma et.al.	2502.09601	link	Kimi
2529	2025-02-13	Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs	Siyan Zhao et.al.	2502.09597	link	Kimi
2530	2025-02-13	SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models	Daniel Fleischer et.al.	2502.09390	link	Kimi
2531	2025-02-13	Generalizability through Explainability: Countering Overfitting with Counterfactual Examples	Flavio Giorgi et.al.	2502.09193	null	Kimi
2532	2025-02-13	Bridging the Gap Between LLMs and Human Intentions: Progresses and Challenges in Instruction Understanding, Intention Reasoning, and Reliable Generation	Zongyu Chang et.al.	2502.09101	null	Kimi
2533	2025-02-13	Unleashing the Power of Large Language Model for Denoising Recommendation	Shuyao Wang et.al.	2502.09058	null	Kimi
2534	2025-02-13	Diversity Enhances an LLM’s Performance in RAG and Long-context Task	Zhchao Wang et.al.	2502.09017	null	Kimi
2535	2025-02-13	RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models	Quan Wei et.al.	2502.09003	null	Kimi
2536	2025-02-13	Task Generalization With AutoRegressive Compositional Structure: Can Learning From $\d$ Tasks Generalize to $\d^{T}$ Tasks?	Amirhesam Abedsoltan et.al.	2502.08991	null	Kimi
2537	2025-02-12	Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning	Qifan Yu et.al.	2502.08482	null	Kimi
2538	2025-02-12	The MoE-Empowered Edge LLMs Deployment: Architecture, Challenges, and Opportunities	Ning Li et.al.	2502.08381	null	Kimi
2539	2025-02-12	Inference-time sparse attention with asymmetric indexing	Pierre-Emmanuel Mazaré et.al.	2502.08246	null	Kimi
2540	2025-02-12	Learning Human Skill Generators at Key-Step Levels	Yilu Wu et.al.	2502.08234	null	Kimi
2541	2025-02-12	Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance	Lingfei Qian et.al.	2502.08127	link	Kimi
2542	2025-02-12	GCoT: Chain-of-Thought Prompt Learning for Graphs	Xingtong Yu et.al.	2502.08092	null	Kimi
2543	2025-02-12	Mixture of Decoupled Message Passing Experts with Entropy Constraint for General Node Classification	Xuanze Chen et.al.	2502.08083	null	Kimi
2544	2025-02-11	Training Sparse Mixture Of Experts Text Embedding Models	Zach Nussbaum et.al.	2502.07972	link	Kimi
2545	2025-02-11	HexGen-2: Disaggregated Generative Inference of LLMs in Heterogeneous Environment	Youhe Jiang et.al.	2502.07903	null	Kimi
2546	2025-02-11	TransMLA: Multi-head Latent Attention Is All You Need	Fanxu Meng et.al.	2502.07864	link	Kimi
2547	2025-02-11	Magic 1-For-1: Generating One Minute Video Clips within One Minute	Hongwei Yi et.al.	2502.07701	link	Kimi
2548	2025-02-11	LASP-2: Rethinking Sequence Parallelism for Linear Attention and Its Hybrid	Weigao Sun et.al.	2502.07563	link	Kimi
2549	2025-02-11	Early Stopping Against Label Noise Without Validation Data	Suqin Yuan et.al.	2502.07551	link	Kimi
2550	2025-02-11	Instance-dependent Early Stopping	Suqin Yuan et.al.	2502.07547	link	Kimi
2551	2025-02-11	Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More	Xialie Zhuang et.al.	2502.07490	link	Kimi
2552	2025-02-11	LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!	Dacheng Li et.al.	2502.07374	link	Kimi
2553	2025-02-11	LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation	Zican Dong et.al.	2502.07365	null	Kimi
2554	2025-02-11	BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models	Xu Huang et.al.	2502.07346	link	Kimi
2555	2025-02-11	CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction	Junlong Li et.al.	2502.07316	link	Kimi
2556	2025-02-11	OpenGrok: Enhancing SNS Data Processing with Distilled Knowledge and Mask-like Mechanisms	Lumen AI et.al.	2502.07312	link	Kimi
2557	2025-02-10	On the Emergence of Thinking in LLMs I: Searching for the Right Intuition	Guanghao Ye et.al.	2502.06773	link	Kimi
2558	2025-02-10	ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates	Ling Yang et.al.	2502.06772	link	Kimi
2559	2025-02-10	Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs	Ryan Synk et.al.	2502.06766	link	Kimi
2560	2025-02-10	History-Guided Video Diffusion	Kiwhan Song et.al.	2502.06764	null	Kimi
2561	2025-02-10	Rationalization Models for Text-to-SQL	Gaetano Rossiello et.al.	2502.06759	null	Kimi
2562	2025-02-10	MoETuner: Optimized Mixture of Expert Serving with Balanced Expert Placement and Token Routing	Seokjin Go et.al.	2502.06643	null	Kimi
2563	2025-02-10	Scaling Multi-Document Event Summarization: Evaluating Compression vs. Full-Text Approaches	Adithya Pratapa et.al.	2502.06617	link	Kimi
2564	2025-02-10	Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation	Chengwen Qi et.al.	2502.06563	link	Kimi
2565	2025-02-10	CoS: Chain-of-Shot Prompting for Long Video Understanding	Jian Hu et.al.	2502.06428	null	Kimi
2566	2025-02-10	Expect the Unexpected: FailSafe Long Context QA for Finance	Kiran Kamble et.al.	2502.06329	null	Kimi
2567	2025-02-07	Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuray	Yunhang Shen et.al.	2502.05177	link	Kimi
2568	2025-02-07	VideoRoPE: What Makes for Good Video Rotary Position Embedding?	Xilin Wei et.al.	2502.05173	link	Kimi
2569	2025-02-07	Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient	Jan Ludziejewski et.al.	2502.05172	null	Kimi
2570	2025-02-07	NoLiMa: Long-Context Evaluation Beyond Literal Matching	Ali Modarressi et.al.	2502.05167	link	Kimi
2571	2025-02-07	Data-Parallel Neural Network Training via Nonlinearly Preconditioned Trust-Region Method	Samuel A. Cruz Alegría et.al.	2502.05133	null	Kimi
2572	2025-02-07	Adaptive Graph of Thoughts: Test-Time Adaptive Reasoning Unifying Chain, Tree, and Graph Structures	Tushar Pandey et.al.	2502.05078	link	Kimi
2573	2025-02-07	S $^2$ -MAD: Breaking the Token Barrier to Enhance Multi-Agent Debate Efficiency	Yuting Zeng et.al.	2502.04790	null	Kimi
2574	2025-02-07	Early Stopping for Regression Trees	Ratmir Miftachov et.al.	2502.04709	null	Kimi
2575	2025-02-07	ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and Reasoning	Yuwei Yin et.al.	2502.04689	link	Kimi
2576	2025-02-07	Unveiling the Mechanisms of Explicit CoT Training: How Chain-of-Thought Enhances Reasoning Generalization	Xinhao Yao et.al.	2502.04667	link	Kimi
2577	2025-02-06	Exploring operation parallelism vs. ion movement in ion-trapped QCCD architectures	Anabel Ovide et.al.	2502.04181	null	Kimi
2578	2025-02-06	HD-EPIC: A Highly-Detailed Egocentric Video Dataset	Toby Perrett et.al.	2502.04144	null	Kimi
2579	2025-02-06	AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference	Qingyue Yang et.al.	2502.04077	link	Kimi
2580	2025-02-06	RWKV-UI: UI Understanding with Enhanced Perception and Reasoning	Jiaxi Yang et.al.	2502.03971	null	Kimi
2581	2025-02-06	InfinitePOD: Building Datacenter-Scale High-Bandwidth Domain for LLM with Optical Circuit Switching Transceivers	Chenchen Shou et.al.	2502.03885	null	Kimi
2582	2025-02-06	Rank Also Matters: Hierarchical Configuration for Mixture of Adapter Experts in LLM Fine-Tuning	Peizhuang Cong et.al.	2502.03884	null	Kimi
2583	2025-02-06	Identify Critical KV Cache in LLM Inference from an Output Perturbation Perspective	Yuan Feng et.al.	2502.03805	link	Kimi
2584	2025-02-05	(GG) MoE vs. MLP on Tabular Data	Andrei Chernov et.al.	2502.03608	null	Kimi
2585	2025-02-05	HACK: Homomorphic Acceleration via Compression of the Key-Value Cache for Disaggregated LLM Inference	Zeyu Zhang et.al.	2502.03589	null	Kimi
2586	2025-02-05	Demystifying Long Chain-of-Thought Reasoning in LLMs	Edward Yeo et.al.	2502.03373	link	Kimi
2587	2025-02-05	ECM: A Unified Electronic Circuit Model for Explaining the Emergence of In-Context Learning and Chain-of-Thought in Large Language Model	Qiguang Chen et.al.	2502.03325	null	Kimi
2588	2025-02-05	Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning	DiJia Su et.al.	2502.03275	null	Kimi
2589	2025-02-05	MaxInfo: A Training-Free Key-Frame Selection Method Using Maximum Volume for Enhanced Video Understanding	Pengyi Li et.al.	2502.03183	null	Kimi
2590	2025-02-05	Structured Token Retention and Computational Memory Paths in Large Language Models	Jonathan Delena et.al.	2502.03102	null	Kimi
2591	2025-02-05	IAO Prompting: Making Knowledge Flow Explicit in LLMs through Structured Reasoning Templates	Aissatou Diallo et.al.	2502.03080	null	Kimi
2592	2025-02-05	Scaling Laws for Upcycling Mixture-of-Experts Language Models	Seng Pei Liew et.al.	2502.03009	null	Kimi
2593	2025-02-05	LLM-KT: Aligning Large Language Models with Knowledge Tracing using a Plug-and-Play Instruction	Ziwei Wang et.al.	2502.02945	null	Kimi
2594	2025-02-05	Early Stopping in Contextual Bandits and Inferences	Zihan Cui et.al.	2502.02793	null	Kimi
2595	2025-02-04	Twilight: Adaptive Attention Sparsity with Hierarchical Top- $p$ Pruning	Chaofan Lin et.al.	2502.02770	null	Kimi
2596	2025-02-04	Hecate: Unlocking Efficient Sparse Model Training via Fully Sharded Sparse Data Parallelism	Yuhao Qing et.al.	2502.02581	null	Kimi
2597	2025-02-04	Brief analysis of DeepSeek R1 and it’s implications for Generative AI	Sarah Mercer et.al.	2502.02523	null	Kimi
2598	2025-02-04	EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization	Yize Wu et.al.	2502.02493	null	Kimi
2599	2025-02-04	Lower Bounds for Chain-of-Thought Reasoning in Hard-Attention Transformers	Alireza Amiri et.al.	2502.02393	null	Kimi
2600	2025-02-04	STAIR: Improving Safety Alignment with Introspective Reasoning	Yichi Zhang et.al.	2502.02384	link	Kimi
2601	2025-02-04	Premise-Augmented Reasoning Chains Improve Error Identification in Math reasoning with LLMs	Sagnik Mukherjee et.al.	2502.02362	null	Kimi
2602	2025-02-04	VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation	Siyu Xu et.al.	2502.02175	null	Kimi
2603	2025-02-04	M2R2: Mixture of Multi-Rate Residuals for Efficient Transformer Inference	Nikhil Bhendawade et.al.	2502.02040	null	Kimi
2604	2025-02-04	Wavelet-based Positional Representation for Long Context	Yui Oka et.al.	2502.02004	null	Kimi
2605	2025-02-04	MPIC: Position-Independent Multimodal Context Caching System for Efficient MLLM Serving	Shiju Zhao et.al.	2502.01960	null	Kimi
2606	2025-01-31	Scalable-Softmax Is Superior for Attention	Ken M. Nakanishi et.al.	2501.19399	null	Kimi
2607	2025-01-31	Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models	Alina Shutova et.al.	2501.19392	link	Kimi
2608	2025-01-31	Efficient Reasoning with Hidden Thinking	Xuan Shen et.al.	2501.19201	link	Kimi
2609	2025-01-31	Rethinking Early Stopping: Refine, Then Calibrate	Eugène Berta et.al.	2501.19195	link	Kimi
2610	2025-01-31	A theoretical framework for overfitting in energy-based modeling	Giovanni Catania et.al.	2501.19158	null	Kimi
2611	2025-01-31	$\infty$ -Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation	Saul Santos et.al.	2501.19098	link	Kimi
2612	2025-01-30	Rope to Nope and Back Again: A New Hybrid Attention Strategy	Bowen Yang et.al.	2501.18795	null	Kimi
2613	2025-01-30	Zero-shot Large Language Models for Long Clinical Text Summarization with Temporal Reasoning	Maya Kruse et.al.	2501.18724	null	Kimi
2614	2025-01-30	Rethinking Bottlenecks in Safety Fine-Tuning of Vision Language Models	Yi Ding et.al.	2501.18533	null	Kimi
2615	2025-01-30	State Stream Transformer (SST) : Emergent Metacognitive Behaviours Through Latent State Persistence	Thea Aviss et.al.	2501.18356	null	Kimi
2616	2025-01-30	Learning to Plan & Reason for Evaluation with Thinking-LLM-as-a-Judge	Swarnadeep Saha et.al.	2501.18099	null	Kimi
2617	2025-01-29	Physics-Grounded Differentiable Simulation for Soft Growing Robots	Lucas Chen et.al.	2501.17963	link	Kimi
2618	2025-01-29	Free Agent in Agent-Based Mixture-of-Experts Generative AI Framework	Jung-Hua Liu et.al.	2501.17903	null	Kimi
2619	2025-01-29	Formally Verified Binary-level Pointer Analysis	Freek Verbeek et.al.	2501.17766	null	Kimi
2620	2025-01-29	CSEval: Towards Automated, Multi-Dimensional, and Reference-Free Counterspeech Evaluation using Auto-Calibrated LLMs	Amey Hengle et.al.	2501.17581	null	Kimi
2621	2025-01-29	Heuristic-Informed Mixture of Experts for Link Prediction in Multilayer Networks	Lucio La Cava et.al.	2501.17557	null	Kimi
2622	2025-01-29	DINT Transformer	Yueyang Cang et.al.	2501.17486	null	Kimi
2623	2025-01-28	TORCHLIGHT: Shedding LIGHT on Real-World Attacks on Cloudless IoT Devices Concealed within the Tor Network	Yumingzhi Pan et.al.	2501.16784	null	Kimi
2624	2025-01-28	3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow	Yueen Ma et.al.	2501.16698	null	Kimi
2625	2025-01-28	MCTS-SQL: An Effective Framework for Text-to-SQL with Monte Carlo Tree Search	Shuozhi Yuan et.al.	2501.16607	null	Kimi
2626	2025-01-27	Searching for GEMS: Discovery and Characterization of Two Brown Dwarfs Around M Dwarfs	Alexander Larsen et.al.	2501.16554	null	Kimi
2627	2025-01-27	MoEVD: Enhancing Vulnerability Detection by Mixture-of-Experts (MoE)	Xu Yang et.al.	2501.16454	null	Kimi
2628	2025-01-27	The Effect of Optimal Self-Distillation in Noisy Gaussian Mixture Model	Kaito Takanami et.al.	2501.16226	link	Kimi
2629	2025-01-27	Provence: efficient and robust context pruning for retrieval-augmented generation	Nadezhda Chirkova et.al.	2501.16214	null	Kimi
2630	2025-01-27	Options-Aware Dense Retrieval for Multiple-Choice query Answering	Manish Singh et.al.	2501.16111	null	Kimi
2631	2025-01-27	Static Batching of Irregular Workloads on GPUs: Framework and Application to Efficient MoE Model Inference	Yinghan Li et.al.	2501.16103	null	Kimi
2632	2025-01-27	Understanding Long Videos via LLM-Powered Entity Relation Graphs	Meng Chu et.al.	2501.15953	null	Kimi
2633	2025-01-27	Memorization and Regularization in Generative Diffusion Models	Ricardo Baptista et.al.	2501.15785	link	Kimi
2634	2025-01-27	Renewable Energy Prediction: A Comparative Study of Deep Learning Models for Complex Dataset Analysis	Haibo Wang et.al.	2501.15731	null	Kimi
2635	2025-01-26	A Benchmarking Platform for DDR4 Memory Performance in Data-Center-Class FPGAs	Andrea Galimberti et.al.	2501.15582	null	Kimi
2636	2025-01-26	Qwen2.5-1M Technical Report	An Yang et.al.	2501.15383	null	Kimi
2637	2025-01-25	ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning	Shangqian Gao et.al.	2501.15316	null	Kimi
2638	2025-01-24	Mean-field limit from general mixtures of experts to quantum neural networks	Anderson Melchor Hernandez et.al.	2501.14660	null	Kimi
2639	2025-01-24	Experimentally Evaluating the Resource Efficiency of Big Data Autoscaling	Jonathan Will et.al.	2501.14456	link	Kimi
2640	2025-01-24	Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains	Xu Chu et.al.	2501.14431	null	Kimi
2641	2025-01-24	GraphBC: Improving LLMs for Better Graph Data Processing	Xu Chu et.al.	2501.14427	null	Kimi
2642	2025-01-24	Hierarchical Time-Aware Mixture of Experts for Multi-Modal Sequential Recommendation	Shengzhe Zhang et.al.	2501.14269	link	Kimi
2643	2025-01-24	Serving Long-Context LLMs at the Mobile Edge: Test-Time Reinforcement Learning-based Model Caching and Inference Offloading	Minrui Xu et.al.	2501.14205	null	Kimi
2644	2025-01-23	Can We Generate Images with CoT? Let’s Verify and Reinforce Image Generation Step by Step	Ziyu Guo et.al.	2501.13926	link	Kimi
2645	2025-01-23	The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities	Chan-Jan Hsu et.al.	2501.13921	link	Kimi
2646	2025-01-23	PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection	Peiyuan Zhang et.al.	2501.13898	link	Kimi
2647	2025-01-23	Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models	Zhenghao Lin et.al.	2501.13629	null	Kimi
2648	2025-01-23	Coarse-to-Fine Process Reward Modeling for Enhanced Mathematical Reasoning	Yulan Hu et.al.	2501.13622	null	Kimi
2649	2025-01-23	Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge	Haomiao Xiong et.al.	2501.13468	link	Kimi
2650	2025-01-23	Contrast: A Hybrid Architecture of Transformers and State Space Models for Low-Level Vision	Aman Urumbekov et.al.	2501.13353	null	Kimi
2651	2025-01-23	Qrazor: Reliable and effortless 4-bit llm quantization by significant data razoring	Dongyoung Lee et.al.	2501.13331	null	Kimi
2652	2025-01-22	Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment	Melissa Kazemi Rad et.al.	2501.13080	null	Kimi
2653	2025-01-22	Autonomy-of-Experts Models	Ang Lv et.al.	2501.13074	null	Kimi
2654	2025-01-22	Ehrenfeucht-Haussler Rank and Chain of Thought	Pablo Barceló et.al.	2501.12997	null	Kimi
2655	2025-01-22	LLM4WM: Adapting LLM for Wireless Multi-Tasking	Xuanyu Liu et.al.	2501.12983	null	Kimi
2656	2025-01-22	Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference	Weizhi Fei et.al.	2501.12959	null	Kimi
2657	2025-01-22	Late Breaking Result: FPGA-Based Emulation and Fault Injection for CNN Inference Accelerators	Filip Masar et.al.	2501.12818	link	Kimi
2658	2025-01-22	NExtLong: Toward Effective Long-Context Training without Long Documents	Chaochen Gao et.al.	2501.12766	link	Kimi
2659	2025-01-22	BLR-MoE: Boosted Language-Routing Mixture of Experts for Domain-Robust Multilingual E2E ASR	Guodong Ma et.al.	2501.12602	null	Kimi
2660	2025-01-22	Kimi k1.5: Scaling Reinforcement Learning with LLMs	Kimi Team et.al.	2501.12599	null	Kimi
2661	2025-01-21	Slot-BERT: Self-supervised Object Discovery in Surgical Video	Guiqiu Liao et.al.	2501.12477	null	Kimi
2662	2025-01-21	Video Depth Anything: Consistent Depth Estimation for Super-Long Videos	Sili Chen et.al.	2501.12375	null	Kimi
2663	2025-01-21	Is Long Context All You Need? Leveraging LLM’s Extended Context for NL2SQL	Yeounoh Chung et.al.	2501.12372	link	Kimi
2664	2025-01-21	Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models	Samira Abnar et.al.	2501.12370	null	Kimi
2665	2025-01-21	CDW-CoT: Clustered Distance-Weighted Chain-of-Thoughts Reasoning	Yuanheng Fang et.al.	2501.12226	null	Kimi
2666	2025-01-21	Muon-specific two-Higgs-doublet model for $(g-2)_μ$ anomaly, $W$ -boson mass-shift, and Zee model	I. A. Yafi et.al.	2501.12181	null	Kimi
2667	2025-01-21	Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models	Zihan Qiu et.al.	2501.11873	null	Kimi
2668	2025-01-20	Characterization of GPU TEE Overheads in Distributed Data Parallel ML Training	Jonghytun Lee et.al.	2501.11771	null	Kimi
2669	2025-01-20	Early Stopping Bayesian Optimization for Controller Tuning	David Stenger et.al.	2501.11532	link	Kimi
2670	2025-01-20	CatV2TON: Taming Diffusion Transformers for Vision-Based Virtual Try-On with Temporal Concatenation	Zheng Chong et.al.	2501.11325	link	Kimi
2671	2025-01-20	RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems?	Haotian Xu et.al.	2501.11284	null	Kimi
2672	2025-01-17	AraXL: A Physically Scalable, Ultra-Wide RISC-V Vector Processor Design for Fast and Efficient Computation on Long Vectors	Navaneeth Kunhi Purayil et.al.	2501.10301	null	Kimi
2673	2025-01-17	ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario	Lucen Zhong et.al.	2501.10132	link	Kimi
2674	2025-01-17	Multi-Dimensional Vector ISA Extension for Mobile In-Cache Computing	Alireza Khadem et.al.	2501.09902	link	Kimi
2675	2025-01-16	Coded Deep Learning: Framework and Algorithm	En-hui Yang et.al.	2501.09849	null	Kimi
2676	2025-01-15	LeMo: Enabling LEss Token Involvement for MOre Context Fine-tuning	Tuowei Wang et.al.	2501.09767	null	Kimi
2677	2025-01-16	AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation	Junjie He et.al.	2501.09503	link	Kimi
2678	2025-01-16	PICE: A Semantic-Driven Progressive Inference System for LLM Serving in Cloud-Edge Networks	Huiyou Zhan et.al.	2501.09367	null	Kimi
2679	2025-01-15	Doc-Guided Sent2Sent++: A Sent2Sent++ Agent with Doc-Guided memory for Document-level Machine Translation	Jiaxin Guo et.al.	2501.08523	null	Kimi
2680	2025-01-14	Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models	Yifu Qiu et.al.	2501.08248	null	Kimi
2681	2025-01-14	PRESERVE: Prefetching Model Weights and KV-Cache in Distributed LLM Serving	Ahmet Caner Yüzügüler et.al.	2501.08192	null	Kimi
2682	2025-01-13	A Survey of Early Exit Deep Neural Networks in NLP	Divya Jyoti Bajpai et.al.	2501.07670	null	Kimi
2683	2025-01-14	Monotone Curve Estimation via Convex Duality	Tongseok Lim et.al.	2501.06975	null	Kimi
2684	2025-01-12	MPCache: MPC-Friendly KV Cache Eviction for Efficient Private Large Language Model Inference	Wenxuan Zeng et.al.	2501.06807	null	Kimi
2685	2025-01-12	Mell: Memory-Efficient Large Language Model Serving via Multi-GPU KV Cache Management	Liu Qianli et.al.	2501.06709	null	Kimi
2686	2025-01-11	SafeSplit: A Novel Defense Against Client-Side Backdoor Attacks in Split Learning	Phillip Rieger et.al.	2501.06650	null	Kimi
2687	2025-01-11	Guided Code Generation with LLMs: A Multi-Agent Framework for Complex Code Tasks	Amr Almorsi et.al.	2501.06625	null	Kimi
2688	2025-01-11	Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping	Muru Zhang et.al.	2501.06589	link	Kimi
2689	2025-01-11	Tensor Product Attention Is All You Need	Yifan Zhang et.al.	2501.06425	link	Kimi
2690	2025-01-10	Scale-up Unlearnable Examples Learning with High-Performance Computing	Yanfan Zhu et.al.	2501.06080	link	Kimi
2691	2025-01-09	Prediction-Assisted Online Distributed Deep Learning Workload Scheduling in GPU Clusters	Ziyue Luo et.al.	2501.05563	null	Kimi
2692	2025-01-09	LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation	Xi Ye et.al.	2501.05414	null	Kimi
2693	2025-01-09	Euclid: Detecting Solar System objects in Euclid images and classifying them using Kohonen self-organising maps	A. A. Nucita et.al.	2501.05023	null	Kimi
2694	2025-01-09	SyNPar: Synthetic Null Data Parallelism for High-Power False Discovery Rate Control in High-Dimensional Variable Selection	Changhu Wang et.al.	2501.05012	null	Kimi
2695	2025-01-09	TreeKV: Smooth Key-Value Cache Compression with Tree Structures	Ziwei He et.al.	2501.04987	null	Kimi
2696	2025-01-08	Collaborative Inference Acceleration with Non-Penetrative Tensor Partitioning	Zhibang Liu et.al.	2501.04489	null	Kimi
2697	2025-01-06	The Power of Negative Zero: Datatype Customization for Quantized Large Language Models	Yuzong Chen et.al.	2501.04052	link	Kimi
2698	2025-01-07	CoReQA: Uncovering Potentials of Language Models in Code Repository Question Answering	Jialiang Chen et.al.	2501.03447	null	Kimi
2699	2025-01-05	PTEENet: Post-Trained Early-Exit Neural Networks Augmentation for Inference Cost Optimization	Assaf Lahiany et.al.	2501.02508	null	Kimi
2700	2025-01-07	ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling	Chaojie Mao et.al.	2501.02487	null	Kimi
2701	2025-01-04	AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference	Zhuomin He et.al.	2501.02336	link	Kimi
2702	2025-01-04	The Efficiency vs. Accuracy Trade-off: Optimizing RAG-Enhanced LLM Recommender Systems Using Multi-Head Early Exit	Huixue Zhou et.al.	2501.02173	null	Kimi
2703	2025-01-03	Efficient LLM Inference with Activation Checkpointing and Hybrid Caching	Sanghyeon Lee et.al.	2501.01792	null	Kimi
2704	2025-01-03	Data Parallel Visualization and Rendering on the RAMSES Supercomputer with ANARI	Stefan Zellmann et.al.	2501.01628	null	Kimi
2705	2025-01-02	TreeLUT: An Efficient Alternative to Deep Neural Networks for Inference Acceleration Using Gradient Boosted Decision Trees	Alireza Khataei et.al.	2501.01511	link	Kimi
2706	2025-01-02	FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving	Zihao Ye et.al.	2501.01005	link	Kimi
2707	2025-01-01	Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding	Jiajun Zhu et.al.	2501.00712	link	Kimi
2708	2025-01-01	Adjoint sharding for very long context training of state space models	Xingzi Xu et.al.	2501.00692	null	Kimi
2709	2024-12-31	Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing	Peihao Wang et.al.	2501.00658	link	Kimi
2710	2024-12-31	A Study on Context Length and Efficient Transformers for Biomedical Image Analysis	Sarah M. Hooper et.al.	2501.00619	null	Kimi
2711	2024-12-31	VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling	Xinhao Li et.al.	2501.00574	link	Kimi
2712	2024-12-30	CaseSumm: A Large-Scale Dataset for Long-Context Summarization from U.S. Supreme Court Opinions	Mourad Heddaya et.al.	2501.00097	null	Kimi
2713	2024-12-30	Adaptive Batch Size Schedules for Distributed Training of Language Models with Data and Model Parallelism	Tim Tsz-Kit Lau et.al.	2412.21124	null	Kimi
2714	2024-12-30	Align Attention Heads Before Merging Them: An Effective Way for Converting MHA to GQA	Qingyun Jin et.al.	2412.20677	null	Kimi
2715	2024-12-29	ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding	Xiao Wang et.al.	2412.20504	link	Kimi
2716	2024-12-29	TokenRing: An Efficient Parallelism Framework for Infinite-Context LLMs via Bidirectional Communication	Zongwu Wang et.al.	2412.20501	link	Kimi
2717	2024-12-29	NeutronTP: Load-Balanced Distributed Full-Graph GNN Training with Tensor Parallelism	Xin Ai et.al.	2412.20379	null	Kimi
2718	2024-12-28	LoL-PIM: Long-Context LLM Decoding with Scalable DRAM-PIM System	Hyucksung Kwon et.al.	2412.20166	null	Kimi
2719	2024-12-28	ST $^3$ : Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming	Jiedong Zhuang et.al.	2412.20105	null	Kimi
2720	2024-12-27	Goal-oriented Communications based on Recursive Early Exit Neural Networks	Jary Pomponi et.al.	2412.19587	null	Kimi
2721	2024-12-27	StyleRWKV: High-Quality and High-Efficiency Style Transfer with RWKV-like Architecture	Miaomiao Dai et.al.	2412.19535	null	Kimi
2722	2025-01-02	A Survey on Large Language Model Acceleration based on KV Cache Management	Haoyang Li et.al.	2412.19442	link	Kimi
2723	2024-12-26	Performance Control in Early Exiting to Deploy Large Models at the Same Cost of Smaller Ones	Mehrnaz Mofakhami et.al.	2412.19325	null	Kimi
2724	2024-12-26	Multi-matrix Factorization Attention	Jingcheng Hu et.al.	2412.19255	null	Kimi
2725	2024-12-26	Repository Structure-Aware Training Makes SLMs Better Issue Resolver	Zexiong Ma et.al.	2412.19031	null	Kimi
2726	2024-12-25	Long-Range Tasks Using Short-Context LLMs: Incremental Reasoning With Structured Memories	Dulhan Jayalath et.al.	2412.18914	null	Kimi
2727	2024-12-25	Bootstrap Your Own Context Length	Liang Wang et.al.	2412.18860	null	Kimi
2728	2024-12-25	DCIS: Efficient Length Extrapolation of LLMs via Divide-and-Conquer Scaling Factor Search	Lei Yang et.al.	2412.18811	link	Kimi
2729	2024-12-24	Efficient Long Context Language Model Retrieval with Compression	Minju Seo et.al.	2412.18232	null	Kimi
2730	2024-12-24	Adapter Merging with Centroid Prototype Mapping for Scalable Class-Incremental Learning	Takuma Fukuda et.al.	2412.18219	link	Kimi
2731	2024-12-24	KunServe: Elastic and Efficient Large Language Model Serving with Parameter-centric Memory Management	Rongxin Cheng et.al.	2412.18169	null	Kimi
2732	2024-12-24	Beyond Gradient Averaging in Parallel Optimization: Improved Robustness through Gradient Agreement Filtering	Francois Chaubard et.al.	2412.18052	link	Kimi
2733	2024-12-23	Theoretical Constraints on the Expressive Power of $\mathsf{RoPE}$ -based Tensor Attention Transformers	Xiaoyu Li et.al.	2412.18040	null	Kimi
2734	2024-12-23	Deliberation in Latent Space via Differentiable Cache Augmentation	Luyang Liu et.al.	2412.17747	null	Kimi
2735	2024-12-24	YuLan-Mini: An Open Data-efficient Language Model	Yiwen Hu et.al.	2412.17743	link	Kimi
2736	2024-12-23	Improved Cotton Leaf Disease Classification Using Parameter-Efficient Deep Learning Framework	Aswini Kumar Patra et.al.	2412.17587	null	Kimi
2737	2024-12-23	Optimal Convergence Rates for Neural Operators	Mike Nguyen et.al.	2412.17518	null	Kimi
2738	2024-12-23	A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression	Chenlong Deng et.al.	2412.17483	null	Kimi
2739	2024-12-23	MineAgent: Towards Remote-Sensing Mineral Exploration with Multimodal Large Language Models	Beibei Yu et.al.	2412.17339	null	Kimi
2740	2024-12-22	Revisiting In-Context Learning with Long Context Language Models	Jinheon Baek et.al.	2412.16926	null	Kimi
2741	2024-12-20	A survey on FPGA-based accelerator for ML models	Feng Yan et.al.	2412.15666	null	Kimi
2742	2024-12-20	Don’t Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks	Brian J Chan et.al.	2412.15605	link	Kimi
2743	2024-12-19	Systematic Evaluation of Long-Context LLMs on Financial Concepts	Lavanya Gupta et.al.	2412.15386	null	Kimi
2744	2024-12-19	LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks	Yushi Bai et.al.	2412.15204	link	Kimi
2745	2024-12-19	Minimizing speculation overhead in a parallel recognizer for regular texts	Angelo Borsotti et.al.	2412.14975	null	Kimi
2746	2024-12-19	DynamicKV: Task-Aware Adaptive KV Cache Compression for Long Context LLMs	Xiabin Zhou et.al.	2412.14838	null	Kimi
2747	2024-12-19	Sliding Windows Are Not the End: Exploring Full Ranking with Long-Context Large Language Models	Wenhan Liu et.al.	2412.14574	link	Kimi
2748	2024-12-19	HashAttention: Semantic Sparsity for Faster Inference	Aditya Desai et.al.	2412.14468	null	Kimi
2749	2024-12-18	Scaling Deep Learning Training with MPMD Pipeline Parallelism	Anxhelo Xhebraj et.al.	2412.14374	null	Kimi
2750	2024-12-18	ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals	Utkarsh Saxena et.al.	2412.14363	link	Kimi
2751	2024-12-18	State Space Models are Strong Text Rerankers	Zhichao Xu et.al.	2412.14354	null	Kimi
2752	2024-12-19	Online MDP with Transition Prototypes: A Robust Adaptive Approach	Shuo Sun et.al.	2412.14075	null	Kimi
2753	2024-12-19	Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference	Benjamin Warner et.al.	2412.13663	link	Kimi
2754	2024-12-18	SCOPE: Optimizing Key-Value Cache Compression in Long-context Generation	Jialong Wu et.al.	2412.13649	link	Kimi
2755	2024-12-18	LIFT: Improving Long Context Understanding Through Long Input Fine-Tuning	Yansheng Mao et.al.	2412.13626	null	Kimi
2756	2024-12-18	Attention-aware convolutional neural networks for identification of magnetic islands in the tearing mode on EAST tokamak	Feifei Long et.al.	2412.13498	null	Kimi
2757	2024-12-18	Deploying Foundation Model Powered Agent Services: A Survey	Wenchao Xu et.al.	2412.13437	null	Kimi
2758	2024-12-17	COSEE: Consistency-Oriented Signal-Based Early Exiting via Calibrated Sample Weighting Mechanism	Jianing He et.al.	2412.13236	link	Kimi
2759	2024-12-17	GIRAFFE: Design Choices for Extending the Context Length of Visual Language Models	Mukai Li et.al.	2412.12735	link	Kimi
2760	2024-12-17	More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression	Jiebin Zhang et.al.	2412.12706	null	Kimi
2761	2024-12-17	LLMs are Also Effective Embedding Models: An In-depth Overview	Chongyang Tao et.al.	2412.12591	null	Kimi
2762	2024-12-17	PerSphere: A Comprehensive Framework for Multi-Faceted Perspective Retrieval and Summarization	Yun Luo et.al.	2412.12588	link	Kimi
2763	2024-12-17	ITP: Instance-Aware Test Pruning for Out-of-Distribution Detection	Haonan Xu et.al.	2412.12566	link	Kimi
2764	2024-12-17	A System for Microserving of LLMs	Hongyi Jin et.al.	2412.12488	null	Kimi
2765	2024-12-17	Boosting Long-Context Information Seeking via Query-Guided Activation Refilling	Hongjin Qian et.al.	2412.12486	link	Kimi
2766	2024-12-17	Core Context Aware Attention for Long Context Language Modeling	Yaofo Chen et.al.	2412.12465	null	Kimi
2767	2024-12-17	SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator	Guoxuan Chen et.al.	2412.12094	link	Kimi
2768	2024-12-16	SpeechPrune: Context-aware Token Pruning for Speech Information Retrieval	Yueqian Lin et.al.	2412.12009	link	Kimi
2769	2024-12-16	EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents	Mengna Zhu et.al.	2412.11814	null	Kimi
2770	2024-12-16	CSR:Achieving 1 Bit Key-Value Cache via Sparse Representation	Hongxuan Zhang et.al.	2412.11741	null	Kimi
2771	2024-12-16	Ultra-High-Definition Dynamic Multi-Exposure Image Fusion via Infinite Pixel Learning	Xingchi Chen et.al.	2412.11685	null	Kimi
2772	2024-12-16	On the SDP Relaxation of Direct Torque Finite Control Set Model Predictive Control	Luca M. Hartmann et.al.	2412.11666	null	Kimi
2773	2024-12-16	FinLoRA: Finetuning Quantized Financial Large Language Models Using Low-Rank Adaptation	Dannong Wang et.al.	2412.11378	link	Kimi
2774	2024-12-15	Timing of Seven Isolated Pulsars in the Globular Cluster Terzan 1	Justine Singleton et.al.	2412.11271	null	Kimi
2775	2024-12-15	Wasserstein Bounds for generative diffusion models with Gaussian tail targets	Xixian Wang et.al.	2412.11251	null	Kimi
2776	2024-12-15	ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction	Yi Feng et.al.	2412.11210	link	Kimi
2777	2024-12-13	SCBench: A KV Cache-Centric Analysis of Long-Context Methods	Yucheng Li et.al.	2412.10319	null	Kimi
2778	2024-12-13	Lost in the Middle, and In-Between: Enhancing Language Models’ Ability to Reason Over Long Contexts in Multi-Hop QA	George Arthur Baker et.al.	2412.10079	link	Kimi
2779	2024-12-13	Benchmarking Table Comprehension In The Wild	Yikang Pan et.al.	2412.09884	null	Kimi
2780	2024-12-13	V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding	Junqi Ge et.al.	2412.09616	link	Kimi
2781	2024-12-12	InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions	Pan Zhang et.al.	2412.09596	link	Kimi
2782	2024-12-12	InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption	Tiehan Fan et.al.	2412.09283	null	Kimi
2783	2024-12-12	ZigZagkv: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty	Meizhi Zhong et.al.	2412.09036	null	Kimi
2784	2024-12-12	RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios	Ruiwen Zhou et.al.	2412.08972	link	Kimi
2785	2024-12-12	Lexico: Extreme KV Cache Compression via Sparse Coding over Universal Dictionaries	Junhyuck Kim et.al.	2412.08890	link	Kimi
2786	2024-12-11	TURBOATTENTION: Efficient Attention Approximation For High Throughputs LLMs	Hao Kang et.al.	2412.08585	null	Kimi
2787	2024-12-11	EMS: Adaptive Evict-then-Merge Strategy for Head-wise KV Cache Compression Based on Global-Local Importance	Yingxin Li et.al.	2412.08521	null	Kimi
2788	2024-12-10	From Slow Bidirectional to Fast Causal Video Generators	Tianwei Yin et.al.	2412.07772	null	Kimi
2789	2024-12-10	ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer	Jinyi Hu et.al.	2412.07720	link	Kimi
2790	2024-12-09	FP=xINT:A Low-Bit Series Expansion Algorithm for Post-Training Quantization	Boyang Zhang et.al.	2412.06865	null	Kimi
2791	2024-12-09	Pruning All-Rounder: Rethinking and Improving Inference Efficiency for Large Vision Language Models	Wei Suo et.al.	2412.06458	null	Kimi
2792	2024-12-08	BiDM: Pushing the Limit of Quantization for Diffusion Models	Xingyu Zheng et.al.	2412.05926	link	Kimi
2793	2024-12-08	XKV: Personalized KV Cache Memory Reduction for Long-Context LLM Inference	Weizhuo Li et.al.	2412.05896	null	Kimi
2794	2024-12-07	Batch-Max: Higher LLM Throughput using Larger Batch Sizes and KV Cache Compression	Michael R. Metel et.al.	2412.05693	null	Kimi
2795	2024-12-11	Flash Communication: Reducing Tensor Parallelization Bottleneck for Fast Large Language Model Inference	Qingyuan Li et.al.	2412.04964	null	Kimi
2796	2024-12-06	GUIDE: A Global Unified Inference Engine for Deploying Large Language Models in Heterogeneous Environments	Yanyu Chen et.al.	2412.04788	null	Kimi
2797	2024-12-05	Cross-Self KV Cache Pruning for Efficient Vision-Language Inference	Xiaohuan Pei et.al.	2412.04652	link	Kimi
2798	2024-12-05	votess: A multi-target, GPU-capable, parallel Voronoi tessellator	C. Byrohl et.al.	2412.04514	link	Kimi
2799	2024-12-05	p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay	Jun Zhang et.al.	2412.04449	link	Kimi
2800	2024-12-07	PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation	Ao Wang et.al.	2412.03409	link	Kimi
2801	2024-12-04	ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression	Guangda Liu et.al.	2412.03213	link	Kimi
2802	2024-12-04	Unifying KV Cache Compression for Large Language Models with LeanKV	Yanqi Zhang et.al.	2412.03131	null	Kimi
2803	2024-12-04	Lightweight Multiplane Images Network for Real-Time Stereoscopic Conversion from Planar Video	Shanding Diao et.al.	2412.03102	null	Kimi
2804	2024-12-03	Resource-Adaptive Successive Doubling for Hyperparameter Optimization with Large Datasets on High-Performance Computing Systems	Marcel Aach et.al.	2412.02729	link	Kimi
2805	2024-12-03	Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity	Da Ma et.al.	2412.02252	null	Kimi
2806	2024-12-02	RandAR: Decoder-only Autoregressive Visual Generation in Random Orders	Ziqi Pang et.al.	2412.01827	null	Kimi
2807	2024-12-05	Yi-Lightning Technical Report	01. AI et.al.	2412.01253	null	Kimi
2808	2024-12-02	INTELLECT-1 Technical Report	Sami Jaghouar et.al.	2412.01152	link	Kimi
2809	2024-12-03	Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification	Wenxuan Huang et.al.	2412.00876	link	Kimi
2810	2024-12-01	MERLIN: Multi-stagE query performance prediction for dynamic paRallel oLap pIpeliNe	Kaixin Zhang et.al.	2412.00749	null	Kimi
2811	2024-11-29	DeMo: Decoupled Momentum Optimization	Bowen Peng et.al.	2411.19870	link	Kimi
2812	2024-11-27	FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model Serving	Ao Shen et.al.	2411.18424	null	Kimi
2813	2024-11-28	MiniKV: Pushing the Limits of LLM Inference via 2-Bit Layer-Discriminative KV Cache	Akshat Sharma et.al.	2411.18077	null	Kimi
2814	2024-11-27	Addressing Architectural Obstacles for Overlay with Stream Network Abstraction	Chengyue Wang et.al.	2411.17966	null	Kimi
2815	2024-11-26	Attamba: Attending To Multi-Token States	Yash Akhauri et.al.	2411.17685	link	Kimi
2816	2024-11-26	Toward High-Performance LLM Serving: A Simulation-Based Approach for Identifying Optimal Parallelism	Yi-Chien Lin et.al.	2411.17651	link	Kimi
2817	2024-11-26	Efficient LLM Inference with I/O-Aware Partial KV Cache Recomputation	Chaoyi Jiang et.al.	2411.17089	link	Kimi
2818	2024-11-25	Lion Cub: Minimizing Communication Overhead in Distributed Lion	Satoki Ishikawa et.al.	2411.16462	null	Kimi
2819	2024-11-24	Hiding Communication Cost in Distributed LLM Training via Micro-batch Co-execution	Haiquan Wang et.al.	2411.15871	null	Kimi
2820	2024-11-27	A Method for Building Large Language Models with Predefined KV Cache Capacity	Zhonghua Yi et.al.	2411.15785	null	Kimi
2821	2024-11-22	DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models	Keda Tao et.al.	2411.15024	link	Kimi
2822	2024-11-21	Functional Array Programming in an Extended Pi-Calculus	Hans Hüttel et.al.	2411.14579	null	Kimi
2823	2024-11-22	Quantization without Tears	Minghao Fu et.al.	2411.13918	link	Kimi
2824	2024-11-19	Faster Multi-GPU Training with PPLL: A Pipeline Parallelism Framework Leveraging Local Learning	Xiuyuan Guo et.al.	2411.12780	null	Kimi
2825	2024-11-18	Parsing Millions of DNS Records per Second	Jeroen Koekkoek et.al.	2411.12035	link	Kimi
2826	2024-11-17	SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration	Jintao Zhang et.al.	2411.10958	link	Kimi
2827	2024-11-16	Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model	Ting Liu et.al.	2411.10803	link	Kimi
2828	2024-11-15	SmoothCache: A Universal Inference Acceleration Technique for Diffusion Transformers	Joseph Liu et.al.	2411.10510	link	Kimi
2829	2024-11-14	Squeezed Attention: Accelerating Long Context Length LLM Inference	Coleman Hooper et.al.	2411.09688	link	Kimi
2830	2024-11-15	Communication Compression for Tensor Parallel LLM Inference	Jan Hansen-Palmus et.al.	2411.09510	null	Kimi
2831	2024-11-12	Towards Low-bit Communication for Tensor Parallel LLM Inference	Harry Dong et.al.	2411.07942	null	Kimi
2832	2024-11-11	Anchor Attention, Small Cache: Code Generation with Large Language Models	Xiangyu Zhang et.al.	2411.06680	link	Kimi
2833	2024-11-10	Accelerating Large Language Model Training with 4D Parallelism and Memory Consumption Estimator	Kazuki Fujii et.al.	2411.06465	null	Kimi
2834	2024-11-08	Balancing Pipeline Parallelism with Vocabulary Parallelism	Man Tsung Yeung et.al.	2411.05288	link	Kimi
2835	2024-11-07	BitNet a4.8: 4-bit Activations for 1-bit LLMs	Hongyu Wang et.al.	2411.04965	null	Kimi
2836	2024-11-06	Stepping Forward on the Last Mile	Chen Feng et.al.	2411.04036	null	Kimi
2837	2024-11-05	TokenSelect: Efficient Long-Context Inference and Length Extrapolation for LLMs via Dynamic Token-Level KV Cache Selection	Wei Wu et.al.	2411.02886	null	Kimi
2838	2024-11-05	DroidSpeak: Enhancing Cross-LLM Communication	Yuhan Liu et.al.	2411.02820	null	Kimi
2839	2024-11-04	“Give Me BF16 or Give Me Death”? Accuracy-Performance Trade-Offs in LLM Quantization	Eldar Kurtic et.al.	2411.02355	null	Kimi
2840	2024-11-04	Real-time and Downtime-tolerant Fault Diagnosis for Railway Turnout Machines (RTMs) Empowered with Cloud-Edge Pipeline Parallelism	Fan Wu et.al.	2411.02086	null	Kimi
2841	2024-11-04	xDiT: an Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism	Jiarui Fang et.al.	2411.01738	link	Kimi
2842	2024-11-02	NEO: Saving GPU Memory Crisis with CPU Offloading for Online LLM Inference	Xuanlin Jiang et.al.	2411.01142	null	Kimi
2843	2024-11-01	MoNTA: Accelerating Mixture-of-Experts Training with Network-Traffc-Aware Parallel Optimization	Jingming Guo et.al.	2411.00662	link	Kimi
2844	2024-11-01	Constrained Diffusion Implicit Models	Vivek Jayaram et.al.	2411.00359	null	Kimi
2845	2024-11-05	SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compile	Ruisi Zhang et.al.	2411.00284	null	Kimi
2846	2024-10-31	Neurobench: DCASE 2020 Acoustic Scene Classification benchmark on XyloAudio 2	Weijie Ke et.al.	2410.23776	null	Kimi
2847	2024-10-31	ALISE: Accelerating Large Language Model Serving with Speculative Scheduling	Youpeng Zhao et.al.	2410.23537	null	Kimi
2848	2024-10-29	VL-Cache: Sparsity and Modality-Aware KV Cache Compression for Vision-Language Model Inference Acceleration	Dezhan Tu et.al.	2410.23317	null	Kimi
2849	2024-10-30	BUZZ: Beehive-structured Sparse KV Cache with Segmented Heavy Hitters for Efficient LLM Inference	Junqi Zhao et.al.	2410.23079	link	Kimi
2850	2024-10-29	The Impact of Inference Acceleration Strategies on Bias of LLMs	Elisabeth Kirsten et.al.	2410.22118	link	Kimi
2851	2024-10-29	How Does Critical Batch Size Scale in Pre-training?	Hanlin Zhang et.al.	2410.21676	link	Kimi
2852	2024-10-28	ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference	Hanshi Sun et.al.	2410.21465	link	Kimi
2853	2024-10-28	Meta-Learning for Speeding Up Large Model Inference in Decentralized Environments	Yuzhe Yang et.al.	2410.21340	null	Kimi
2854	2024-10-28	Beyond Autoregression: Fast LLMs via Self-Distillation Through Time	Justin Deschenaux et.al.	2410.21035	link	Kimi
2855	2024-10-26	DQRM: Deep Quantized Recommendation Models	Yang Zhou et.al.	2410.20046	link	Kimi
2856	2024-10-25	RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction	Tanqiu Jiang et.al.	2410.19937	null	Kimi
2857	2024-10-25	BitPipe: Bidirectional Interleaved Pipeline Parallelism for Accelerating Large Models Training	Houming Wu et.al.	2410.19367	link	Kimi
2858	2024-10-28	Not All Heads Matter: A Head-Level KV Cache Compression Method with Integrated Retrieval and Reasoning	Yu Fu et.al.	2410.19258	link	Kimi
2859	2024-10-24	KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing	Yifei Yang et.al.	2410.18517	link	Kimi
2860	2024-10-24	The Nature of Mathematical Modeling and Probabilistic Optimization Engineering in Generative AI	Fulu Li et.al.	2410.18441	null	Kimi
2861	2024-10-25	Fast Inference for Augmented Large Language Models	Rana Shahout et.al.	2410.18248	null	Kimi
2862	2024-10-23	Value Residual Learning For Alleviating Attention Concentration In Transformers	Zhanchao Zhou et.al.	2410.17897	link	Kimi
2863	2024-10-23	Markov Chain of Thought for Efficient Mathematical Reasoning	Wen Yang et.al.	2410.17635	null	Kimi
2864	2024-10-22	PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction	Long Xing et.al.	2410.17247	link	Kimi
2865	2024-10-21	MagicPIG: LSH Sampling for Efficient LLM Generation	Zhuoming Chen et.al.	2410.16179	link	Kimi
2866	2024-10-21	Residual vector quantization for KV cache compression in large language model	Ankur Kumar et.al.	2410.15704	link	Kimi
2867	2024-10-20	SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training	Jinda Jia et.al.	2410.15526	link	Kimi
2868	2024-10-20	EPIC: Efficient Position-Independent Context Caching for Serving Large Language Models	Junhao Hu et.al.	2410.15332	null	Kimi
2869	2024-10-20	Lossless KV Cache Compression to 2%	Zhen Yang et.al.	2410.15252	null	Kimi
2870	2024-10-19	Pipeline Gradient-based Model Training on Analog In-memory Accelerators	Zhaoxian Wu et.al.	2410.15155	link	Kimi
2871	2024-10-18	A Systematic Study of Cross-Layer KV Sharing for Efficient LLM Inference	You Wu et.al.	2410.14442	link	Kimi
2872	2024-10-23	TiMePReSt: Time and Memory Efficient Pipeline Parallel DNN Training with Removed Staleness	Ankita Dutta et.al.	2410.14312	null	Kimi
2873	2024-10-17	SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction	Xuan Zhang et.al.	2410.13846	link	Kimi
2874	2024-10-17	AsymKV: Enabling 1-Bit Quantization of KV Cache with Layer-Wise Asymmetric Quantization Configurations	Qian Tao et.al.	2410.13212	null	Kimi
2875	2024-10-19	In-context KV-Cache Eviction for LLMs via Attention-Gate	Zihao Zeng et.al.	2410.12876	null	Kimi
2876	2024-10-16	FiRST: Finetuning Router-Selective Transformers for Input-Adaptive Latency Reduction	Akriti Jain et.al.	2410.12513	null	Kimi
2877	2024-10-16	COMET: Towards Partical W4A4KV4 LLMs Serving	Lian Liu et.al.	2410.12168	null	Kimi
2878	2024-10-15	From promise to practice: realizing high-performance decentralized training	Zesen Wang et.al.	2410.11998	null	Kimi
2879	2024-10-15	QSpec: Speculative Decoding with Complementary Quantization Schemes	Juntao Zhao et.al.	2410.11305	null	Kimi
2880	2024-10-14	DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads	Guangxuan Xiao et.al.	2410.10819	link	Kimi
2881	2024-10-14	When Attention Sink Emerges in Language Models: An Empirical View	Xiangming Gu et.al.	2410.10781	link	Kimi
2882	2024-10-14	Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling	Wenze Liu et.al.	2410.10511	link	Kimi
2883	2024-10-15	EasyRAG: Efficient Retrieval-Augmented Generation Framework for Automated Network Operations	Zhangchi Feng et.al.	2410.10315	link	Kimi
2884	2024-10-11	ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression	Yefei He et.al.	2410.08584	null	Kimi
2885	2024-10-10	KV Prediction for Improved Time to First Token	Maxwell Horton et.al.	2410.08391	link	Kimi
2886	2024-10-10	TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text	Songshuo Lu et.al.	2410.07590	link	Kimi
2887	2024-10-09	SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration	Heming Xia et.al.	2410.06916	link	Kimi
2888	2024-10-07	PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs	Mengzhao Chen et.al.	2410.05265	link	Kimi
2889	2024-10-07	Presto! Distilling Steps and Layers for Accelerating Music Generation	Zachary Novack et.al.	2410.05167	null	Kimi
2890	2024-10-07	TidalDecode: Fast and Accurate LLM Decoding with Position Persistent Sparse Attention	Lijie Yang et.al.	2410.05076	link	Kimi
2891	2024-10-07	Fast State Restoration in LLM Serving with HCache	Shiwei Gao et.al.	2410.05004	null	Kimi
2892	2024-10-06	Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective	Jinhao Li et.al.	2410.04466	link	Kimi
2893	2024-10-04	SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation	Aurick Qiao et.al.	2410.03960	null	Kimi
2894	2024-10-04	LoRC: Low-Rank Compression for LLMs KV Cache with a Progressive Compression Strategy	Rongzhi Zhang et.al.	2410.03111	null	Kimi
2895	2024-10-04	UNComp: Uncertainty-Aware Long-Context Compressor for Efficient Large Language Model Inference	Jing Xiong et.al.	2410.03090	null	Kimi
2896	2024-10-09	LEGO: QEC Decoding System Architecture for Dynamic Circuits	Yue Wu et.al.	2410.03073	null	Kimi
2897	2024-10-04	Compute Or Load KV Cache? Why Not Both?	Shuowei Jin et.al.	2410.03065	null	Kimi
2898	2024-10-03	EinDecomp: Decomposition of Declaratively-Specified Machine Learning and Numerical Computations for Parallel Execution	Daniel Bourgeois et.al.	2410.02682	null	Kimi
2899	2024-10-03	SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration	Jintao Zhang et.al.	2410.02367	link	Kimi
2900	2024-10-02	Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads	Yuxiang Huang et.al.	2410.01805	link	Kimi
2901	2024-10-02	InfiniPot: Infinite Context Processing on Memory-Constrained LLMs	Minsoo Kim et.al.	2410.01518	null	Kimi
2902	2024-10-02	A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts	Suyu Ge et.al.	2410.01485	null	Kimi
2903	2024-10-01	Developing a BLAS library for the AMD AI Engine	Tristan Laan et.al.	2410.00825	null	Kimi
2904	2024-10-01	TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices	Zonghang Li et.al.	2410.00531	link	Kimi
2905	2024-10-01	LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management	Yi Xiong et.al.	2410.00428	null	Kimi
2906	2024-09-30	KV-Compress: Paged KV-Cache Compression with Variable Compression Rates per Attention Head	Isaac Rehg et.al.	2410.00161	link	Kimi
2907	2024-09-30	The Early Bird Catches the Leak: Unveiling Timing Side Channels in LLM Serving Systems	Linke Song et.al.	2409.20002	null	Kimi
2908	2024-09-27	Toward Greener Matrix Operations by Lossless Compressed Formats	Francesco Tosoni et.al.	2409.18620	link	Kimi
2909	2024-09-26	Efficient Arbitrary Precision Acceleration for Large Language Models on GPU Tensor Cores	Shaobo Ma et.al.	2409.17870	null	Kimi
2910	2024-09-25	Search for Efficient Large Language Models	Xuan Shen et.al.	2409.17372	link	Kimi
2911	2024-09-25	Mnemosyne: Parallelization Strategies for Efficiently Serving Multi-Million Context Length LLM Inference Requests Without Approximations	Amey Agrawal et.al.	2409.17264	null	Kimi
2912	2024-09-25	AlignedKV: Reducing Memory Access of KV-Cache with Precision-Aligned Quantization	Yifan Tan et.al.	2409.16546	link	Kimi
2913	2024-09-25	A QoE-Aware Split Inference Accelerating Algorithm for NOMA-based Edge Intelligence	Xin Yuan et.al.	2409.16537	null	Kimi
2914	2024-09-23	CSPS: A Communication-Efficient Sequence-Parallelism based Serving System for Transformer based Models with Long Prompts	Zeyu Zhang et.al.	2409.15104	null	Kimi
2915	2024-09-23	Inference-Friendly Models With MixAttention	Shashank Rajput et.al.	2409.15012	null	Kimi
2916	2024-09-23	Mutation-Based Deep Learning Framework Testing Method in JavaScript Environment	Yinglong Zou et.al.	2409.14968	null	Kimi
2917	2024-09-16	Do Large Language Models Need a Content Delivery Network?	Yihua Cheng et.al.	2409.13761	link	Kimi
2918	2024-09-20	Time Distributed Deep Learning models for Purely Exogenous Forecasting. Application to Water Table Depth Prediction using Weather Image Time Series	Matteo Salis et.al.	2409.13284	null	Kimi
2919	2024-09-23	CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs	Junlin Lv et.al.	2409.12490	link	Kimi
2920	2024-09-04	ISO: Overlap of Computation and Communication within Seqenence For LLM Inference	Bin Xiao et.al.	2409.11155	null	Kimi
2921	2024-09-17	KVPruner: Structural Pruning for Faster and Memory-Efficient Large Language Models	Bo Lv et.al.	2409.11057	null	Kimi
2922	2024-09-21	CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios	Luning Wang et.al.	2409.10593	link	Kimi
2923	2024-09-14	A Dynamic Weighting Strategy to Mitigate Worker Node Failure in Distributed Deep Learning	Yuesheng Xu et.al.	2409.09242	null	Kimi
2924	2024-09-11	Inf-MLLM: Efficient Streaming Inference of Multimodal Large Language Models on a Single GPU	Zhenyu Ning et.al.	2409.09086	null	Kimi
2925	2024-09-13	SGFormer: Single-Layer Graph Transformers with Approximation-Free Linear Complexity	Qitian Wu et.al.	2409.09007	link	Kimi
2926	2024-09-11	Learning to Compress Contexts for Efficient Knowledge-based Visual Question Answering	Weixi Weng et.al.	2409.07331	null	Kimi
2927	2024-09-11	FreeRide: Harvesting Bubbles in Pipeline Parallelism	Jiashu Zhang et.al.	2409.06941	null	Kimi
2928	2024-09-09	DFabric: Scaling Out Data Parallel Applications with CXL-Ethernet Hybrid Interconnects	Xu Zhang et.al.	2409.05404	null	Kimi
2929	2024-09-08	InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference	Xiurui Pan et.al.	2409.04992	null	Kimi
2930	2024-09-04	Accelerating Large Language Model Training with Hybrid GPU-based Compression	Lang Xu et.al.	2409.02423	null	Kimi
2931	2024-09-03	Contemporary Model Compression on Large Language Models Inference	Dong Liu et.al.	2409.01990	link	Kimi
2932	2024-09-03	On-chain Validation of Tracking Data Messages (TDM) Using Distributed Deep Learning on a Proof of Stake (PoS) Blockchain	Yasir Latif et.al.	2409.01614	null	Kimi
2933	2024-09-02	LuWu: An End-to-End In-Network Out-of-Core Optimizer for 100B-Scale Model-in-Network Data-Parallel Training on Distributed GPUs	Mo Sun et.al.	2409.00918	null	Kimi
2934	2024-08-26	Model Parallel Training and Transfer Learning for Convolutional Neural Networks by Domain Decomposition	Axel Klawonn et.al.	2408.14442	null	Kimi
2935	2024-08-23	Network-Offloaded Bandwidth-Optimal Broadcast and Allgather for Distributed AI	Mikhail Khalilov et.al.	2408.13356	null	Kimi
2936	2024-08-22	LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with Inference Acceleration via Latent Consistency Distillation	Shihao Chen et.al.	2408.12354	null	Kimi
2937	2024-08-23	MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding	Jian Chen et.al.	2408.11049	link	Kimi
2938	2024-08-20	Security Assessment of Hierarchical Federated Deep Learning	D Alqattan et.al.	2408.10752	link	Kimi
2939	2024-08-20	Pluto and Charon: A Time and Memory Efficient Collaborative Edge AI Framework for Personal LLMs Fine-Tuning	Bei Ouyang et.al.	2408.10746	null	Kimi
2940	2024-08-21	LongVILA: Scaling Long-Context Visual Language Models for Long Videos	Fuzhao Xue et.al.	2408.10188	link	Kimi
2941	2024-08-17	RepControlNet: ControlNet Reparameterization	Zhaoli Deng et.al.	2408.09240	null	Kimi
2942	2024-08-17	Atlas: Hierarchical Partitioning for Quantum Circuit Simulation on GPUs (Extended Version)	Mingkuan Xu et.al.	2408.09055	null	Kimi
2943	2024-08-23	ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models	Chao Zeng et.al.	2408.08554	link	Kimi
2944	2024-08-16	Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models	Jerry Huang et.al.	2408.08470	null	Kimi
2945	2024-08-15	Asteroid: Resource-Efficient Hybrid Pipeline Parallelism for Collaborative DNN Training on Heterogeneous Edge Devices	Shengyuan Ye et.al.	2408.08015	null	Kimi
2946	2024-08-17	Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference	Rohan Baskar Prabhakar et.al.	2408.07802	null	Kimi
2947	2024-08-18	Post-Training Sparse Attention with Double Sparsity	Shuo Yang et.al.	2408.07092	link	Kimi
2948	2024-08-12	LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration	Zhiwen Mo et.al.	2408.06003	null	Kimi
2949	2024-08-10	Eigen Attention: Attention in Low-Rank Space for KV Cache Compression	Utkarsh Saxena et.al.	2408.05646	link	Kimi
2950	2024-08-05	SLO-aware GPU Frequency Scaling for Energy Efficient LLM Inference Serving	Andreas Kosmas Kakolyris et.al.	2408.05235	null	Kimi
2951	2024-08-08	Partial Experts Checkpoint: Efficient Fault Tolerance for Sparse Mixture-of-Experts Model Training	Weilin Cai et.al.	2408.04307	null	Kimi
2952	2024-08-07	Zero-Delay QKV Compression for Mitigating KV Cache and Network Bottlenecks in LLM Inference	Zeyu Zhang et.al.	2408.04107	null	Kimi
2953	2024-08-08	NACL: A General and Effective KV Cache Eviction Framework for LLMs at Inference Time	Yilong Chen et.al.	2408.03675	link	Kimi
2954	2024-08-04	Cross-layer Attention Sharing for Large Language Models	Yongyu Mu et.al.	2408.01890	null	Kimi
2955	2024-08-01	Intermittent Semi-working Mask: A New Masking Paradigm for LLMs	Mingcong Lu et.al.	2408.00539	null	Kimi
2956	2024-08-13	Finch: Prompt-guided Key-Value Cache Compression	Giulio Corallo et.al.	2408.00167	null	Kimi
2957	2024-07-31	EdgeLLM: A Highly Efficient CPU-FPGA Heterogeneous Edge Accelerator for Large Language Models	Mingqiang Huang et.al.	2407.21325	null	Kimi
2958	2024-07-30	Palu: Compressing KV-Cache with Low-Rank Projection	Chi-Chih Chang et.al.	2407.21118	link	Kimi
2959	2024-07-30	ThinK: Thinner Key Cache by Query-Driven Pruning	Yuhui Xu et.al.	2407.21018	null	Kimi
2960	2024-07-31	A2SF: Accumulative Attention Scoring with Forgetting Factor for Token Pruning in Transformer Decoder	Hyun-rae Jo et.al.	2407.20485	null	Kimi
2961	2024-07-25	An Efficient Inference Framework for Early-exit Large Language Models	Ruijie Miao et.al.	2407.20272	null	Kimi
2962	2024-07-29	When to Stop? Towards Efficient Code Generation in LLMs with Excess Token Prevention	Lianghong Guo et.al.	2407.20042	link	Kimi
2963	2024-07-29	Inference acceleration for large language models using “stairs” assisted greedy generation	Domas Grigaliūnas et.al.	2407.19947	null	Kimi
2964	2024-07-29	Rina: Enhancing Ring-AllReduce with In-network Aggregation in Distributed Model Training	Zixuan Chen et.al.	2407.19721	null	Kimi
2965	2024-07-25	Efficient Inference of Vision Instruction-Following Models with Elastic Cache	Zuyan Liu et.al.	2407.18121	link	Kimi
2966	2024-07-28	Keep the Cost Down: A Review on Methods to Optimize LLM’ s KV-Cache Consumption	Luohe Shi et.al.	2407.18003	null	Kimi
2967	2024-07-25	Efficient LLM Training and Serving with Heterogeneous Context Sharding among Attention Heads	Xihui Lin et.al.	2407.17678	null	Kimi
2968	2024-07-23	A deeper look at depth pruning of LLMs	Shoaib Ahmed Siddiqui et.al.	2407.16286	link	Kimi
2969	2024-07-22	RazorAttention: Efficient KV Cache Compression Through Retrieval Heads	Hanlin Tang et.al.	2407.15891	null	Kimi
2970	2024-07-22	AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description	Junyu Xie et.al.	2407.15850	link	Kimi
2971	2024-07-22	LLMmap: Fingerprinting For Large Language Models	Dario Pasquini et.al.	2407.15847	link	Kimi
2972	2024-07-22	CarFormer: Self-Driving with Learned Object-Centric Representations	Shadi Hamdan et.al.	2407.15843	null	Kimi
2973	2024-07-22	SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models	Mingze Xu et.al.	2407.15841	link	Kimi
2974	2024-07-22	MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity	Yangzhou Liu et.al.	2407.15838	link	Kimi
2975	2024-07-22	dMel: Speech Tokenization made Simple	He Bai et.al.	2407.15835	link	Kimi
2976	2024-07-22	Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight	Ziyuan Huang et.al.	2407.15819	null	Kimi
2977	2024-07-23	A simple and fast C++ thread pool implementation capable of running task graphs	Dmytro Puyda et.al.	2407.15805	link	Kimi
2978	2024-07-22	Robust Facial Reactions Generation: An Emotion-Aware Framework with Modality Compensation	Guanyu Hu et.al.	2407.15798	null	Kimi
2979	2024-07-22	Extracting Structured Insights from Financial News: An Augmented LLM Driven Approach	Rian Dolphin et.al.	2407.15788	null	Kimi
2980	2024-07-22	Parallel Split Learning with Global Sampling	Mohammad Kohankhaki et.al.	2407.15738	link	Kimi
2981	2024-07-22	vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving	Jiale Xu et.al.	2407.15309	link	Kimi
2982	2024-07-19	Performance Modeling and Workload Analysis of Distributed Large Language Model Training and Inference	Joyjit Kundu et.al.	2407.14645	null	Kimi
2983	2024-07-19	Internal Consistency and Self-Feedback in Large Language Models: A Survey	Xun Liang et.al.	2407.14507	link	Kimi
2984	2024-07-19	On Pre-training of Multimodal Language Models Customized for Chart Understanding	Wan-Cyuan Fan et.al.	2407.14506	null	Kimi
2985	2024-07-19	PD-TPE: Parallel Decoder with Text-guided Position Encoding for 3D Visual Grounding	Chenshu Hou et.al.	2407.14491	null	Kimi
2986	2024-07-19	Evaluating the Reliability of Self-Explanations in Large Language Models	Korbinian Randl et.al.	2407.14487	link	Kimi
2987	2024-07-19	Contrastive Learning with Counterfactual Explanations for Radiology Report Generation	Mingjie Li et.al.	2407.14474	null	Kimi
2988	2024-07-19	Check-Eval: A Checklist-based Approach for Evaluating Text Quality	Jayr Pereira et.al.	2407.14467	null	Kimi
2989	2024-07-19	AttentNet: Fully Convolutional 3D Attention for Lung Nodule Detection	Majedaldein Almahasneh et.al.	2407.14464	null	Kimi
2990	2024-07-19	PolyFormer: Scalable Node-wise Filters via Polynomial Graph Transformer	Jiahong Ma et.al.	2407.14459	link	Kimi
2991	2024-07-19	Undermining Mental Proof: How AI Can Make Cooperation Harder by Making Thinking Easier	Zachary Wojtowicz et.al.	2407.14452	null	Kimi
2992	2024-07-19	From Instruction to Insight: Exploring the Functional and Semantic Roles of Text in Interactive Dashboards	Nicole Sultanum et.al.	2407.14451	null	Kimi
2993	2024-07-19	LoAS: Fully Temporal-Parallel Datatflow for Dual-Sparse Spiking Neural Networks	Ruokai Yin et.al.	2407.14073	link	Kimi
2994	2024-07-19	LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference	Qichen Fu et.al.	2407.14057	null	Kimi
2995	2024-07-18	SegPoint: Segment Any Point Cloud via Large Language Model	Shuting He et.al.	2407.13761	null	Kimi
2996	2024-07-18	Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models	Zhuo Chen et.al.	2407.13757	null	Kimi
2997	2024-07-18	CellularLint: A Systematic Approach to Identify Inconsistent Behavior in Cellular Network Specifications	Mirza Masfiqur Rahman et.al.	2407.13742	null	Kimi
2998	2024-07-18	Baba Is AI: Break the Rules to Beat the Benchmark	Nathan Cloos et.al.	2407.13729	null	Kimi
2999	2024-07-18	Compressing Structured Tensor Algebra	Mahdi Ghorbani et.al.	2407.13726	null	Kimi
3000	2024-07-18	CoDefeater: Using LLMs To Find Defeaters in Assurance Cases	Usman Gohar et.al.	2407.13717	link	Kimi
3001	2024-07-18	Attention Based Simple Primitives for Open World Compositional Zero-Shot Learning	Ans Munir et.al.	2407.13715	link	Kimi
3002	2024-07-18	Understanding Reference Policies in Direct Preference Optimization	Yixin Liu et.al.	2407.13709	link	Kimi
3003	2024-07-18	ANHALTEN: Cross-Lingual Transfer for German Token-Level Reference-Free Hallucination Detection	Janek Herrlein et.al.	2407.13702	link	Kimi
3004	2024-07-18	Cross-Task Attack: A Self-Supervision Generative Framework Based on Attention Shift	Qingyuan Zeng et.al.	2407.13700	null	Kimi
3005	2024-07-17	Analysis of Crab X-ray Polarization using Deeper IXPE Observations	Josephine Wong et.al.	2407.12779	null	Kimi
3006	2024-07-17	The BRST quantisation of chiral BMS-like field theories	José Figueroa-O’Farrill et.al.	2407.12778	null	Kimi
3007	2024-07-17	Jigsaw Game: Federated Clustering	Jinxuan Xu et.al.	2407.12764	null	Kimi
3008	2024-07-17	LookupViT: Compressing visual information to a limited number of tokens	Rajat Koner et.al.	2407.12753	null	Kimi
3009	2024-07-17	CHOSEN: Compilation to Hardware Optimization Stack for Efficient Vision Transformer Inference	Mohammad Erfan Sadeghi et.al.	2407.12736	null	Kimi
3010	2024-07-17	EchoSight: Advancing Visual-Language Models with Wiki Knowledge	Yibin Yan et.al.	2407.12735	null	Kimi
3011	2024-07-17	FlexFL: Heterogeneous Federated Learning via APoZ-Guided Flexible Pruning in Uncertain Scenarios	Zekai Chen et.al.	2407.12729	null	Kimi
3012	2024-07-17	Exploring the interplay of individual traits and interaction dynamics in preschool social networks	Gülşah Akçakır et.al.	2407.12728	null	Kimi
3013	2024-07-17	NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model	Zhongqun Zhang et.al.	2407.12727	null	Kimi
3014	2024-07-17	Is Sarcasm Detection A Step-by-Step Reasoning Process in Large Language Models?	Ben Yao et.al.	2407.12725	null	Kimi
3015	2024-07-16	GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression	Daniel Goldstein et.al.	2407.12077	link	Kimi
3016	2024-07-16	Hydra: Brokering Cloud and HPC Resources to Support the Execution of Heterogeneous Workloads at Scale	Aymen Alsaadi et.al.	2407.11967	null	Kimi
3017	2024-07-16	UrbanWorld: An Urban World Model for 3D City Generation	Yu Shang et.al.	2407.11965	link	Kimi
3018	2024-07-16	NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?	Mo Li et.al.	2407.11963	link	Kimi
3019	2024-07-17	Hierarchical Separable Video Transformer for Snapshot Compressive Imaging	Ping Wang et.al.	2407.11946	link	Kimi
3020	2024-07-16	Min-max theory and existence of H-spheres with arbitrary codimensions	Rui Gao et.al.	2407.11945	null	Kimi
3021	2024-07-16	Beyond Spatial Explanations: Explainable Face Recognition in the Frequency Domain	Marco Huber et.al.	2407.11941	null	Kimi
3022	2024-07-16	Generalized Difference-in-Differences	Yiqing Xu et.al.	2407.11937	null	Kimi
3023	2024-07-16	Learning Multi-view Anomaly Detection	Haoyang He et.al.	2407.11935	null	Kimi
3024	2024-07-16	Code Documentation and Analysis to Secure Software Development	Paul Attie et.al.	2407.11934	null	Kimi
3025	2024-07-16	What’s Wrong? Refining Meeting Summaries with LLM Feedback	Frederic Kirstein et.al.	2407.11919	null	Kimi
3026	2024-07-16	PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation	Branden Butler et.al.	2407.11798	null	Kimi
3027	2024-07-21	Ada-KV: Optimizing KV Cache Eviction by Adaptive Budget Allocation for Efficient LLM Inference	Yuan Feng et.al.	2407.11550	link	Kimi
3028	2024-07-15	VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation	Bocheng Zou et.al.	2407.10972	link	Kimi
3029	2024-07-15	Q-Sparse: All Large Language Models can be Fully Sparsely-Activated	Hongyu Wang et.al.	2407.10969	null	Kimi
3030	2024-07-15	Induction of non-Fermi liquids by critical cavity photons at the onset of superradiance	Ipsita Mandal et.al.	2407.10963	null	Kimi
3031	2024-07-15	Fast Matrix Multiplications for Lookup Table-Quantized LLMs	Han Guo et.al.	2407.10960	link	Kimi
3032	2024-07-15	InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models	Nirat Saini et.al.	2407.10958	null	Kimi
3033	2024-07-15	MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language Models	Chengguang Gan et.al.	2407.10953	null	Kimi
3034	2024-07-15	The infamous 95 GeV $\rm b \bar b$ excess at LEP: Two b or not two b?	Patrick Janot et.al.	2407.10948	null	Kimi
3035	2024-07-15	Can Textual Semantics Mitigate Sounding Object Segmentation Preference?	Yaoting Wang et.al.	2407.10947	link	Kimi
3036	2024-07-15	GRUtopia: Dream General Robots in a City at Scale	Hanqing Wang et.al.	2407.10943	link	Kimi
3037	2024-07-15	IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation	Yuanhao Zhai et.al.	2407.10937	link	Kimi
3038	2024-07-12	FairyLandAI: Personalized Fairy Tales utilizing ChatGPT and DALLE-3	Georgios Makridis et.al.	2407.09467	null	Kimi
3039	2024-07-12	Human-like Episodic Memory for Infinite Context LLMs	Zafeirios Fountas et.al.	2407.09450	link	Kimi
3040	2024-07-12	ASTPrompter: Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts	Amelia F. Hardy et.al.	2407.09447	link	Kimi
3041	2024-07-12	MUSCLE: A Model Update Strategy for Compatible LLM Evolution	Jessica Echterhoff et.al.	2407.09435	null	Kimi
3042	2024-07-12	Open (Clinical) LLMs are Sensitive to Instruction Phrasings	Alberto Mario Ceballos Arroyo et.al.	2407.09429	link	Kimi
3043	2024-07-12	TelecomGPT: A Framework to Build Telecom-Specfic Large Language Models	Hang Zou et.al.	2407.09424	null	Kimi
3044	2024-07-12	Mitigating Entity-Level Hallucination in Large Language Models	Weihang Su et.al.	2407.09417	link	Kimi
3045	2024-07-12	SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers	Shraman Pramanick et.al.	2407.09413	link	Kimi
3046	2024-07-12	Thunderbolt: Causal Concurrent Consensus and Execution	Junchao Chen et.al.	2407.09409	null	Kimi
3047	2024-07-12	PersonaRAG: Enhancing Retrieval-Augmented Generation Systems with User-Centric Agents	Saber Zerhoudi et.al.	2407.09394	link	Kimi
3048	2024-07-11	MAVIS: Mathematical Visual Instruction Tuning	Renrui Zhang et.al.	2407.08739	link	Kimi
3049	2024-07-11	Real-Time Anomaly Detection and Reactive Planning with Large Language Models	Rohan Sinha et.al.	2407.08735	null	Kimi
3050	2024-07-11	Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist	Zihao Zhou et.al.	2407.08733	null	Kimi
3051	2024-07-11	Planar decomposition of the HOMFLY polynomial for bipartite knots and links	A. Anokhina et.al.	2407.08724	null	Kimi
3052	2024-07-11	A Taxonomy for Data Contamination in Large Language Models	Medha Palavalli et.al.	2407.08716	null	Kimi
3053	2024-07-11	GTA: A Benchmark for General Tool Agents	Jize Wang et.al.	2407.08713	link	Kimi
3054	2024-07-11	Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models	Zhening Xing et.al.	2407.08701	null	Kimi
3055	2024-07-11	Flex-TPU: A Flexible TPU with Runtime Reconfigurable Dataflow Architecture	Mohammed Elbtity et.al.	2407.08700	null	Kimi
3056	2024-07-11	Mitigating Catastrophic Forgetting in Language Transfer via Model Merging	Anton Alexandrov et.al.	2407.08699	null	Kimi
3057	2024-07-11	Patterns of link reciprocity in directed, signed networks	Anna Gallo et.al.	2407.08697	null	Kimi
3058	2024-07-10	Training on the Test Task Confounds Evaluation and Emergence	Ricardo Dominguez-Olmedo et.al.	2407.07890	link	Kimi
3059	2024-07-10	Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization	Junkang Wu et.al.	2407.07880	link	Kimi
3060	2024-07-10	Bound States in Continuum via Singular Transfer Matrices	Ovidiu-Zeno Lipan et.al.	2407.07879	null	Kimi
3061	2024-07-10	FACTS About Building Retrieval Augmented Generation-based Chatbots	Rama Akkiraju et.al.	2407.07858	null	Kimi
3062	2024-07-10	OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training	Sami Jaghouar et.al.	2407.07852	link	Kimi
3063	2024-07-10	Harnessing Integrated CPU-GPU System Memory for HPC: a first look into Grace Hopper	Gabin Schieffer et.al.	2407.07850	null	Kimi
3064	2024-07-10	Natural Language Mechanisms via Self-Resolution with Foundation Models	Nicolas Della Penna et.al.	2407.07845	null	Kimi
3065	2024-07-10	Study on Aspect Ratio Variability toward Robustness of Vision Transformer-based Vehicle Re-identification	Mei Qiu et.al.	2407.07842	null	Kimi
3066	2024-07-10	Transformer Alignment in Large Language Models	Murdock Aubry et.al.	2407.07810	null	Kimi
3067	2024-07-10	Attribute or Abstain: Large Language Models as Long Document Assistants	Jan Buchmann et.al.	2407.07799	link	Kimi
3068	2024-07-09	AnyTaskTune: Advanced Domain-Specific Solutions through Task-Fine-Tuning	Jiaxi Cui et.al.	2407.07094	link	Kimi
3069	2024-07-09	FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation	Liqun Ma et.al.	2407.07093	link	Kimi
3070	2024-07-09	Fine-Tuning Linear Layers Only Is a Simple yet Effective Way for Task Arithmetic	Ruochen Jin et.al.	2407.07089	link	Kimi
3071	2024-07-09	Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models	Logan Cross et.al.	2407.07086	link	Kimi
3072	2024-07-09	Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities	Shaltiel Shmidman et.al.	2407.07080	null	Kimi
3073	2024-07-09	ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction	Shaozhe Hao et.al.	2407.07077	link	Kimi
3074	2024-07-09	Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps	Yung-Sung Chuang et.al.	2407.07071	link	Kimi
3075	2024-07-09	Prompting Techniques for Secure Code Generation: A Systematic Investigation	Catherine Tony et.al.	2407.07064	null	Kimi
3076	2024-07-09	Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence	Weize Chen et.al.	2407.07061	link	Kimi
3077	2024-07-09	CAPformer: Compression-Aware Pre-trained Transformer for Low-Light Image Enhancement	Wang Wei et.al.	2407.07056	null	Kimi
3078	2024-07-08	Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision	Orr Zohar et.al.	2407.06189	link	Kimi
3079	2024-07-08	CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation	Xinying Guo et.al.	2407.06188	null	Kimi
3080	2024-07-08	Left-Linear Rewriting in Adhesive Categories	Paolo Baldan et.al.	2407.06181	null	Kimi
3081	2024-07-08	The Tug-of-War Between Deepfake Generation and Detection	Hannah Lee et.al.	2407.06174	null	Kimi
3082	2024-07-08	On Speeding Up Language Model Evaluation	Jin Peng Zhou et.al.	2407.06172	null	Kimi
3083	2024-07-08	Inevitable Endgame of Comet Tsuchinshan-ATLAS (C/2023 A3)	Zdenek Sekanina et.al.	2407.06166	null	Kimi
3084	2024-07-08	What’s Wrong with Your Code Generated by Large Language Models? An Extensive Study	Shihan Dou et.al.	2407.06153	null	Kimi
3085	2024-07-08	WIBACong: An Argument-centric Framework for Understanding US Congressional Hearings	Arman Irani et.al.	2407.06149	null	Kimi
3086	2024-07-08	Using Grammar Masking to Ensure Syntactic Validity in LLM-based Modeling Tasks	Lukas Netz et.al.	2407.06146	null	Kimi
3087	2024-07-08	ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation	Ethan Chern et.al.	2407.06135	link	Kimi
3088	2024-07-05	LaRa: Efficient Large-Baseline Radiance Fields	Anpei Chen et.al.	2407.04699	null	Kimi
3089	2024-07-05	Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs	Rudolf Laine et.al.	2407.04694	link	Kimi
3090	2024-07-05	ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models	Yuzhe Gu et.al.	2407.04693	link	Kimi
3091	2024-07-05	Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge	Yuanze Lin et.al.	2407.04681	null	Kimi
3092	2024-07-05	Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition	Ye Bai et.al.	2407.04675	null	Kimi
3093	2024-07-05	Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement	Yongji Wu et.al.	2407.04656	null	Kimi
3094	2024-07-05	Entity Decomposition with Filtering: A Zero-Shot Clinical Named Entity Recognition Framework	Reza Averly et.al.	2407.04629	null	Kimi
3095	2024-07-05	On scalable oversight with weak LLMs judging strong LLMs	Zachary Kenton et.al.	2407.04622	null	Kimi
3096	2024-07-08	OneRestore: A Universal Restoration Framework for Composite Degradation	Yu Guo et.al.	2407.04621	link	Kimi
3097	2024-07-05	Learning to (Learn at Test Time): RNNs with Expressive Hidden States	Yu Sun et.al.	2407.04620	link	Kimi
3098	2024-07-03	Universal Length Generalization with Turing Programs	Kaiying Hou et.al.	2407.03310	null	Kimi
3099	2024-07-03	Eyes on the Game: Deciphering Implicit Human Signals to Infer Human Proficiency, Trust, and Intent	Nikhil Hulle et.al.	2407.03298	null	Kimi
3100	2024-07-03	Large Language Models for JSON Schema Discovery	Michael J. Mior et.al.	2407.03286	null	Kimi
3101	2024-07-03	LLM Internal States Reveal Hallucination Risk Faced With a Query	Ziwei Ji et.al.	2407.03282	link	Kimi
3102	2024-07-03	Cooperative Multi-Agent Deep Reinforcement Learning Methods for UAV-aided Mobile Edge Computing Networks	Mintae Kim et.al.	2407.03280	null	Kimi
3103	2024-07-03	Nesterov’s Accelerated Jacobi-Type Methods for Large-scale Symmetric Positive Semidefinite Linear Systems	Ling Liang et.al.	2407.03272	null	Kimi
3104	2024-07-03	STF: Sentence Transformer Fine-Tuning For Topic Categorization With Limited Data	Kheir Eddine Daouadi et.al.	2407.03253	null	Kimi
3105	2024-07-03	ACTRESS: Active Retraining for Semi-supervised Visual Grounding	Weitai Kang et.al.	2407.03251	null	Kimi
3106	2024-07-04	When big data actually are low-rank, or entrywise approximation of certain function-generated matrices	Stanislav Budzinskiy et.al.	2407.03250	link	Kimi
3107	2024-07-03	Bridging Model Heterogeneity in Federated Learning via Uncertainty-based Asymmetrical Reciprocity Learning	Jiaqi Wang et.al.	2407.03247	link	Kimi
3108	2024-07-02	MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention	Huiqiang Jiang et.al.	2407.02490	link	Kimi
3109	2024-07-02	Neurocache: Efficient Vector Retrieval for Long-range Language Modeling	Ali Safaya et.al.	2407.02486	link	Kimi
3110	2024-07-02	RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs	Yue Yu et.al.	2407.02485	null	Kimi
3111	2024-07-02	Characterizing the Interpretability of Attention Maps in Digital Pathology	Tomé Albuquerque et.al.	2407.02484	null	Kimi
3112	2024-07-02	MMedAgent: Learning to Use Medical Tools with Multi-modal Agent	Binxu Li et.al.	2407.02483	link	Kimi
3113	2024-07-02	Understanding Alignment in Multimodal LLMs: A Comprehensive Study	Elmira Amirloo et.al.	2407.02477	null	Kimi
3114	2024-07-02	Open Scene Graphs for Open World Object-Goal Navigation	Joel Loo et.al.	2407.02473	null	Kimi
3115	2024-07-02	Reliable Confidence Intervals for Information Retrieval Evaluation Using Generative A.I	Harrie Oosterhuis et.al.	2407.02464	null	Kimi
3116	2024-07-02	Decentralized Intelligence Network (DIN)	Abraham Nash et.al.	2407.02461	null	Kimi
3117	2024-07-02	Revisión de Métodos de Planificación de Camino de Cobertura para Entornos Agrícolas	Ismael Ait et.al.	2407.02449	null	Kimi

Early Stopping

ID	Publish Date	Title	Authors	PDF	Code	Kimi
1	2024-12-12	InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption	Tiehan Fan et.al.	2412.09283	null	Kimi
2	2024-12-11	GradStop: Exploring Training Dynamics in Unsupervised Outlier Detection through Gradient Cohesion	Yuang Zhang et.al.	2412.08501	link	Kimi
3	2024-12-11	Collaborative Inference for Large Models with Task Offloading and Early Exiting	Zuan Xie et.al.	2412.08284	null	Kimi
4	2024-12-11	Diff-GO $^\text{n}$ : Enhancing Diffusion Models for Goal-Oriented Communications	Suchinthaka Wanninayaka et.al.	2412.06980	null	Kimi
5	2024-12-06	Sparse autoencoders reveal selective remapping of visual concepts during adaptation	Hyesu Lim et.al.	2412.05276	link	Kimi
6	2024-12-06	BEExformer: A Fast Inferencing Transformer Architecture via Binarization with Multiple Early Exits	Wazib Ansar et.al.	2412.05225	null	Kimi
7	2024-12-05	A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs	Wangbo Zhao et.al.	2412.03324	link	Kimi
8	2024-12-03	Time-Series-Informed Closed-loop Learning for Sequential Decision Making and Control	Sebastian Hirt et.al.	2412.02423	null	Kimi
9	2024-12-02	Early Exit Is a Natural Capability in Transformer-based Models: An Empirical Study on Early Exit without Joint Optimization	Weiqiao Shan et.al.	2412.01455	null	Kimi
10	2024-12-02	EdgeOAR: Real-time Online Action Recognition On Edge Devices	Wei Luo et.al.	2412.01267	null	Kimi
11	2024-12-02	Reliable and scalable variable importance estimation via warm-start and early stopping	Zexuan Sun et.al.	2412.01120	link	Kimi
12	2024-11-28	Deep Neural Network-Based Prediction of B-Cell Epitopes for SARS-CoV and SARS-CoV-2: Enhancing Vaccine Design through Machine Learning	Xinyu Shi et.al.	2412.00109	null	Kimi
13	2024-11-26	Selfish Evolution: Making Discoveries in Extreme Label Noise with the Help of Overfitting Dynamics	Nima Sedaghat et.al.	2412.00077	null	Kimi
14	2024-11-28	DIESEL – Dynamic Inference-Guidance via Evasion of Semantic Embeddings in LLMs	Ben Ganon et.al.	2411.19038	null	Kimi
15	2024-11-27	One-Step Early Stopping Strategy using Neural Tangent Kernel Theory and Rademacher Complexity	Daniel Martin Xavier et.al.	2411.18806	null	Kimi
16	2024-11-27	HEMGS: A Hybrid Entropy Model for 3D Gaussian Splatting Data Compression	Lei Liu et.al.	2411.18473	null	Kimi
17	2024-11-26	Instance-Aware Graph Prompt Learning	Jiazheng Li et.al.	2411.17676	null	Kimi
18	2024-11-22	Instance-Aware Generalized Referring Expression Segmentation	E-Ro Nguyen et.al.	2411.15087	null	Kimi
19	2024-11-19	Deep Learning-Driven Heat Map Analysis for Evaluating thickness of Wounded Skin Layers	Devakumar GR et.al.	2411.12678	null	Kimi
20	2024-11-15	Exploiting Negative Curvature in Conjunction with Adaptive Sampling: Theoretical Results and a Practical Algorithm	Albert S. Berahas et.al.	2411.10378	null	Kimi
21	2024-11-13	Voxeland: Probabilistic Instance-Aware Semantic Mapping with Evidence-based Uncertainty Quantification	Jose-Luis Matez-Bandera et.al.	2411.08727	link	Kimi
22	2024-11-11	The Unreasonable Effectiveness of Monte Carlo Simulations in A/B Testing	Márton Trencséni et.al.	2411.06701	link	Kimi
23	2024-11-07	Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale	Flavio Di Palo et.al.	2411.05045	null	Kimi
24	2024-11-07	LoFi: Scalable Local Image Reconstruction with Implicit Neural Representation	AmirEhsan Khorashadizadeh et.al.	2411.04995	link	Kimi
25	2024-11-05	SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents	Dawei Li et.al.	2411.03284	link	Kimi
26	2024-11-06	Gradient Descent Finds Over-Parameterized Neural Networks with Sharp Generalization for Nonparametric Regression: A Distribution-Free Analysis	Yingzhen Yang et.al.	2411.02904	null	Kimi
27	2024-11-05	Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery	Bowei Du et.al.	2411.02861	null	Kimi
28	2024-11-05	CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge Collaboration	Hongpeng Jin et.al.	2411.02829	null	Kimi
29	2024-11-06	Energy-Aware Dynamic Neural Inference	Marcello Bullo et.al.	2411.02471	null	Kimi
30	2024-11-04	DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution	Yang Yue et.al.	2411.02359	link	Kimi
31	2024-11-02	Bi-Level Graph Structure Learning for Next POI Recommendation	Liang Wang et.al.	2411.01169	null	Kimi
32	2024-10-30	Accelerated AI Inference via Dynamic Execution Methods	Haim Barad et.al.	2411.00853	null	Kimi
33	2024-11-01	Preventing Model Collapse in Deep Canonical Correlation Analysis by Noise Regularization	Junlin He et.al.	2411.00383	null	Kimi
34	2024-10-29	Power side-channel leakage localization through adversarial training of deep neural networks	Jimmy Gammell et.al.	2410.22425	link	Kimi
35	2024-10-27	Branch-and-bound algorithm for efficient reliability analysis of general coherent systems	Ji-Eun Byun et.al.	2410.22363	null	Kimi
36	2024-10-28	Agreement Tasks in Fault-Prone Synchronous Networks of Arbitrary Structure	Pierre Fraigniaud et.al.	2410.21538	null	Kimi
37	2024-10-28	Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA	Sangmin Bae et.al.	2410.20672	null	Kimi
38	2024-10-27	Sequential Large Language Model-Based Hyper-Parameter Optimization	Kanan Mahammadli et.al.	2410.20302	link	Kimi
39	2024-10-26	Looking Beyond The Top-1: Transformers Determine Top Tokens In Order	Daria Lioubashevski et.al.	2410.20210	link	Kimi
40	2024-10-26	Dynamic layer selection in decoder-only transformers	Theodore Glavas et.al.	2410.20022	link	Kimi
41	2024-10-25	COMSPLIT: A Communication-Aware Split Learning Design for Heterogeneous IoT Platforms	Vukan Ninkovic et.al.	2410.19375	null	Kimi
42	2024-10-30	Dynamic Vocabulary Pruning in Early-Exit LLMs	Jort Vincenti et.al.	2410.18952	link	Kimi
43	2024-10-24	AdaEDL: Early Draft Stopping for Speculative Decoding of Large Language Models via an Entropy-based Lower Bound on Token Acceptance Probability	Sudhanshu Agrawal et.al.	2410.18351	null	Kimi
44	2024-10-23	Inferring stability properties of chaotic systems on autoencoders’ latent spaces	Elise Özalp et.al.	2410.18003	link	Kimi
45	2024-10-23	Diffusion Priors for Variational Likelihood Estimation and Image Denoising	Jun Cheng et.al.	2410.17521	link	Kimi
46	2024-10-21	Federated Learning with MMD-based Early Stopping for Adaptive GNSS Interference Classification	Nishant S. Gaikwad et.al.	2410.15681	null	Kimi
47	2024-10-24	BoostAdapter: Improving Vision-Language Test-Time Adaptation via Regional Bootstrapping	Taolin Zhang et.al.	2410.15430	link	Kimi
48	2024-10-16	FiRST: Finetuning Router-Selective Transformers for Input-Adaptive Latency Reduction	Akriti Jain et.al.	2410.12513	null	Kimi
49	2024-10-15	Juggernaut: Efficient Crypto-Agnostic Byzantine Agreement	Daniel Collins et.al.	2410.12121	null	Kimi
50	2024-10-14	Focused ReAct: Improving ReAct through Reiterate and Early Stop	Shuoqiu Li et.al.	2410.10779	null	Kimi
51	2024-10-14	big.LITTLE Vision Transformer for Efficient Visual Recognition	He Guo et.al.	2410.10267	null	Kimi
52	2024-10-12	DuoDiff: Accelerating Diffusion Models with a Dual-Backbone Approach	Daniel Gallo Fernández et.al.	2410.09633	link	Kimi
53	2024-10-11	Scaling Gaussian Processes for Learning Curve Prediction via Latent Kronecker Structure	Jihao Andreas Lin et.al.	2410.09239	null	Kimi
54	2024-10-08	Benchmarking of a new data splitting method on volcanic eruption data	Simona Reale et.al.	2410.06306	null	Kimi
55	2024-10-08	MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More	Wei Huang et.al.	2410.06270	link	Kimi
56	2024-10-08	Mini-Batch Kernel $k$ -means	Ben Jourdan et.al.	2410.05902	null	Kimi
57	2024-10-06	Distributed Inference on Mobile Edge and Cloud: An Early Exit based Clustering Approach	Divya Jyoti Bajpai et.al.	2410.05338	null	Kimi
58	2024-10-07	L-C4: Language-Based Video Colorization for Creative and Consistent Color	Zheng Chang et.al.	2410.04972	null	Kimi
59	2024-10-06	CAPEEN: Image Captioning with Early Exits and Knowledge Distillation	Divya Jyoti Bajpai et.al.	2410.04433	link	Kimi
60	2024-10-06	DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs	Divya Jyoti Bajpai et.al.	2410.04424	link	Kimi
61	2024-10-03	Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis	Zikun Zhang et.al.	2410.02321	null	Kimi
62	2024-10-03	Global dynamical structures from infinitesimal data	Benjamin McInroe et.al.	2410.02111	null	Kimi
63	2024-10-02	CHASE-SQL: Multi-Path Reasoning and Preference Optimized Candidate Selection in Text-to-SQL	Mohammadreza Pourreza et.al.	2410.01943	null	Kimi
64	2024-10-02	Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension	Zaiquan Yang et.al.	2410.01544	null	Kimi
65	2024-10-01	Timber! Poisoning Decision Trees	Stefano Calzavara et.al.	2410.00862	null	Kimi
66	2024-09-30	Inference of water waves surface elevation from horizontal velocity components using physics informed neural networks (PINN)	Omar Sallam et.al.	2409.19851	null	Kimi
67	2024-09-27	Improving Visual Object Tracking through Visual Prompting	Shih-Fang Chen et.al.	2409.18901	link	Kimi
68	2024-09-24	Reinforcement Leaning for Infinite-Dimensional Systems	Wei Zhang et.al.	2409.15737	null	Kimi
69	2024-10-03	Normalized Narrow Jump To Conclusions: Normalized Narrow Shortcuts for Parameter Efficient Early Exit Transformer Prediction	Amrit Diggavi Seshadri et.al.	2409.14091	null	Kimi
70	2024-09-21	Multiple-Exit Tuning: Towards Inference-Efficient Adaptation for Vision Transformer	Zheng Liu et.al.	2409.13999	null	Kimi
71	2024-09-18	Particle-based Instance-aware Semantic Occupancy Mapping in Dynamic Environments	Gang Chen et.al.	2409.11975	link	Kimi
72	2024-09-17	UniLCD: Unified Local-Cloud Decision-Making via Reinforcement Learning	Kathakoli Sengupta et.al.	2409.11403	null	Kimi
73	2024-09-16	Improving Multi-candidate Speculative Decoding	Xiaofan Lu et.al.	2409.10644	link	Kimi
74	2024-09-14	Group Sequential Testing of a Treatment Effect Using a Surrogate Marker	Layla Parast et.al.	2409.09440	link	Kimi
75	2024-09-13	Exploring System-Heterogeneous Federated Learning with Dynamic Model Selection	Dixi Yao et.al.	2409.08858	null	Kimi
76	2024-09-11	AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge	Han Wang et.al.	2409.07394	link	Kimi
77	2024-09-11	From optimal score matching to optimal sampling	Zehao Dou et.al.	2409.07032	null	Kimi
78	2024-09-10	Noisy Early Stopping for Noisy Labels	William Toner et.al.	2409.06830	null	Kimi
79	2024-09-10	Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds	Mu Cai et.al.	2409.06827	link	Kimi
80	2024-08-26	Optimizing STAR Aligner for High Throughput Computing in the Cloud	Piotr Kica et.al.	2409.05886	null	Kimi
81	2024-09-09	Early-exit Convolutional Neural Networks	Edanur Demir et.al.	2409.05336	link	Kimi
82	2024-09-08	Attention-Based Efficient Breath Sound Removal in Studio Audio Recordings	Nidula Elgiriyewithana et.al.	2409.04949	null	Kimi
83	2024-09-16	RTop-K: Ultra-Fast Row-Wise Top-K Algorithm and GPU Implementation for Neural Networks	Xi Xie et.al.	2409.00822	null	Kimi
84	2024-08-30	Dynamic Self-Consistency: Leveraging Reasoning Paths for Efficient LLM Sampling	Guangya Wan et.al.	2408.17017	null	Kimi
85	2024-08-24	Inferring the shape of a solid inside a draining tank from its liquid level dynamics	Gbenga Fabusola et.al.	2408.14503	null	Kimi
86	2024-08-26	Re-Mix: Optimizing Data Mixtures for Large Scale Imitation Learning	Joey Hejna et.al.	2408.14037	link	Kimi
87	2024-08-24	Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning	Xinglin Wang et.al.	2408.13457	null	Kimi
88	2024-08-24	Face Clustering via Early Stopping and Edge Recall	Junjie Liu et.al.	2408.13431	link	Kimi
89	2024-08-21	Critique-out-Loud Reward Models	Zachary Ankner et.al.	2408.11791	link	Kimi
90	2024-08-21	EEG-Defender: Defending against Jailbreak through Early Exit Generation of Large Language Models	Chongwen Zhao et.al.	2408.11308	null	Kimi
91	2024-08-20	Inferring Underwater Topography with FINN	Coşku Can Horuz et.al.	2408.10649	null	Kimi
92	2024-08-15	An Efficient Continuous Control Perspective for Reinforcement-Learning-based Sequential Recommendation	Jun Wang et.al.	2408.08047	null	Kimi
93	2024-08-14	Rethinking the Key Factors for the Generalization of Remote Sensing Stereo Matching Networks	Liting Jiang et.al.	2408.07613	null	Kimi
94	2024-08-12	HeLiMOS: A Dataset for Moving Object Segmentation in 3D Point Clouds From Heterogeneous LiDAR Sensors	Hyungtae Lim et.al.	2408.06328	null	Kimi
95	2024-08-12	Transfer learning of state-based potential games for process optimization in decentralized manufacturing systems	Steve Yuwono et.al.	2408.05992	null	Kimi
96	2024-08-12	A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models	Taehong Moon et.al.	2408.05927	link	Kimi
97	2024-08-08	Early-Exit meets Model-Distributed Inference at Edge Networks	Marco Colocrese et.al.	2408.05247	null	Kimi
98	2024-08-09	PriPHiT: Privacy-Preserving Hierarchical Training of Deep Neural Networks	Yamin Sepehri et.al.	2408.05092	null	Kimi
99	2024-08-09	Early Exit Strategies for Approximate k-NN Search in Dense Retrieval	Francesco Busolin et.al.	2408.04981	null	Kimi
100	2024-08-07	Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling	Zilyu Ye et.al.	2408.03695	link	Kimi
101	2024-08-03	Advancing Green AI: Efficient and Accurate Lightweight CNNs for Rice Leaf Disease Identification	Khairun Saddami et.al.	2408.01752	null	Kimi
102	2024-08-01	Early Stopping Based on Repeated Significance	Eric Bax et.al.	2408.00908	null	Kimi
103	2024-07-31	Automated Sperm Morphology Analysis Based on Instance-Aware Part Segmentation	Wenyuan Chen et.al.	2408.00112	null	Kimi
104	2024-07-30	Accelerating Large Language Model Inference with Self-Supervised Early Exits	Florian Valade et.al.	2407.21082	null	Kimi
105	2024-07-25	An Efficient Inference Framework for Early-exit Large Language Models	Ruijie Miao et.al.	2407.20272	null	Kimi
106	2024-07-26	Topology Optimization of Random Memristors for Input-Aware Dynamic SNN	Bo Wang et.al.	2407.18625	link	Kimi
107	2024-07-25	Superior Scoring Rules for Probabilistic Evaluation of Single-Label Multi-Class Classification Tasks	Rouhollah Ahmadian et.al.	2407.17697	null	Kimi
108	2024-07-23	Can Large Language Models Automatically Jailbreak GPT-4V?	Yuanwei Wu et.al.	2407.16686	null	Kimi
109	2024-07-22	WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding	Quan Kong et.al.	2407.15350	null	Kimi
110	2024-07-19	Joint or Disjoint: Mixing Training Regimes for Early-Exit Models	Bartłomiej Krzepkowski et.al.	2407.14320	link	Kimi
111	2024-07-19	BERTer: The Efficient One	Pradyumna Saligram et.al.	2407.14039	null	Kimi
112	2024-07-18	On the consistency of rotation curves and spatially integrated HI flux profiles	Tariq Yasin et.al.	2407.13754	null	Kimi
113	2024-07-19	Revisiting Adaptive Cellular Recognition Under Domain Shifts: A Contextual Correspondence View	Jianan Fan et.al.	2407.12870	link	Kimi
114	2024-07-17	Hallucination Index: An Image Quality Metric for Generative Reconstruction Models	Matthew Tivnan et.al.	2407.12780	null	Kimi
115	2024-07-16	Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning	Yanting Miao et.al.	2407.12164	link	Kimi
116	2024-07-16	Enhancing Split Computing and Early Exit Applications through Predefined Sparsity	Luigi Capogrosso et.al.	2407.11763	link	Kimi
117	2024-07-16	Preconditioned Gradient Descent Finds Over-Parameterized Neural Networks with Sharp Generalization for Nonparametric Regression	Yingzhen Yang et.al.	2407.11353	null	Kimi
118	2024-07-10	Exploring the Boundaries of On-Device Inference: When Tiny Falls Short, Go Hierarchical	Adarsh Prasad Behera et.al.	2407.11061	null	Kimi
119	2024-07-15	Multilingual Contrastive Decoding via Language-Agnostic Layers Skipping	Wenhao Zhu et.al.	2407.10795	link	Kimi
120	2024-07-13	Towards understanding epoch-wise double descent in two-layer linear neural networks	Amanda Olmin et.al.	2407.09845	null	Kimi
121	2024-07-11	Sensor-Aware Classifiers for Energy-Efficient Time Series Applications on IoT Devices	Dina Hussein et.al.	2407.08715	null	Kimi
122	2024-07-07	Learning Motion Blur Robust Vision Transformers with Dynamic Early Exit for Real-Time UAV Tracking	You Wu et.al.	2407.05383	null	Kimi
123	2024-07-04	Unsupervised speech enhancement with spectral kurtosis and double deep priors	Hien Ohnaka et.al.	2407.03887	null	Kimi
124	2024-07-02	Advancing Compressed Video Action Recognition through Progressive Knowledge Distillation	Efstathia Soufleri et.al.	2407.02713	link	Kimi
125	2024-07-02	Zero-shot Video Restoration and Enhancement Using Pre-Trained Image Diffusion Model	Cong Cao et.al.	2407.01960	null	Kimi
126	2024-07-01	Exact statistical analysis for response-adaptive clinical trials: a general and computationally tractable approach	Stef Baas et.al.	2407.01055	null	Kimi
127	2024-07-01	SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection	Dingkang Liang et.al.	2407.01016	null	Kimi
128	2024-06-27	Adaptive Stochastic Weight Averaging	Caglar Demir et.al.	2406.19092	link	Kimi
129	2024-06-26	An Order Theory Framework of Recurrence Equations for Static Cost Analysis $-$ Dynamic Inference of Non-Linear Inequality Invariants	Louis Rustenholz et.al.	2406.18260	null	Kimi
130	2024-06-24	SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments	Neng Wang et.al.	2406.16279	link	Kimi
131	2024-06-21	Micro-power spoken keyword spotting on Xylo Audio 2	Hannah Bos et.al.	2406.15112	null	Kimi
132	2024-06-21	Early stopping for conjugate gradients in statistical inverse problems	Laura Hucker et.al.	2406.15001	null	Kimi
133	2024-06-21	Cost-Effective RF Fingerprinting Based on Hybrid CVNN-RF Classifier with Automated Multi-Dimensional Early-Exit Strategy	Jiayan Gan et.al.	2406.14869	null	Kimi
134	2024-06-20	On Layer-wise Representation Similarity: Application for Multi-Exit Models with a Single Classifier	Jiachen Jiang et.al.	2406.14479	null	Kimi

This site is open source. Improve this page.