CV Arxiv Daily

Updated on 2025.07.17

LLM

ID Publish Date Title Authors PDF Code Kimi
1 2025-07-16 Can We Predict Alignment Before Models Finish Thinking? Towards Monitoring Misaligned Reasoning Models Yik Siu Chan et.al. 2507.12428 null Kimi
2 2025-07-16 Advancing Retrieval-Augmented Generation for Structured Enterprise and Internal Data Chandana Cheerla et.al. 2507.12425 null Kimi
3 2025-07-16 Probing for Arithmetic Errors in Language Models Yucheng Sun et.al. 2507.12379 null Kimi
4 2025-07-16 Thought Purity: Defense Paradigm For Chain-of-Thought Attack Zihao Xue et.al. 2507.12314 null Kimi
5 2025-07-16 Infherno: End-to-end Agent-based FHIR Resource Synthesis from Free-form Clinical Notes Johann Frei et.al. 2507.12261 null Kimi
6 2025-07-16 Improving Contextual ASR via Multi-grained Fusion with Large Language Models Shilin Zhou et.al. 2507.12252 null Kimi
7 2025-07-16 Toward Efficient SpMV in Sparse LLMs via Block Extraction and Compressed Storage Junqing Lin et.al. 2507.12205 null Kimi
8 2025-07-16 Findings of MEGA: Maths Explanation with LLMs using the Socratic Method for Active Learning Tosin Adewumi et.al. 2507.12079 null Kimi
9 2025-07-16 Evaluating the Ability of Large Language Models to Reason about Cardinal Directions, Revisited Anthony G Cohn et.al. 2507.12059 null Kimi
10 2025-07-16 Simplifications are Absolutists: How Simplified Language Reduces Word Sense Awareness in LLM-Generated Definitions Lukas Ellinger et.al. 2507.11981 null Kimi
11 2025-07-16 Value-Based Large Language Model Agent Simulation for Mutual Evaluation of Trust and Interpersonal Closeness Yuki Sakamoto et.al. 2507.11979 null Kimi
12 2025-07-16 Toxicity-Aware Few-Shot Prompting for Low-Resource Singlish Translation Ziyu Ge et.al. 2507.11966 null Kimi
13 2025-07-16 PoTPTQ: A Two-step Power-of-Two Post-training for LLMs Xinyu Wang et.al. 2507.11959 null Kimi
14 2025-07-16 The benefits of query-based KGQA systems for complex and temporal questions in LLM era Artem Alekseev et.al. 2507.11954 null Kimi
15 2025-07-16 IAM: Efficient Inference through Attention Mapping between Different-scale LLMs Yi Zhao et.al. 2507.11953 null Kimi
16 2025-07-16 DAC: A Dynamic Attention-aware Approach for Task-Agnostic Prompt Compression Yi Zhao et.al. 2507.11942 null Kimi
17 2025-07-16 BlockBPE: Parallel BPE Tokenization Amos You et.al. 2507.11941 null Kimi
18 2025-07-16 POLYCHARTQA: Benchmarking Large Vision-Language Models with Multilingual Chart Question Answering Yichen Xu et.al. 2507.11939 null Kimi
19 2025-07-16 A Survey of Deep Learning for Geometry Problem Solving Jianzhe Ma et.al. 2507.11936 null Kimi
20 2025-07-16 Tracing Facts or just Copies? A critical investigation of the Competitions of Mechanisms in Large Language Models Dante Campregher et.al. 2507.11809 null Kimi
21 2025-07-15 CRABS: A syntactic-semantic pincer strategy for bounding LLM interpretation of Python notebooks Meng Li et.al. 2507.11742 null Kimi
22 2025-07-15 Auto-Formulating Dynamic Programming Problems with Large Language Models Chenyu Zhou et.al. 2507.11737 null Kimi
23 2025-07-15 Seeing the Signs: A Survey of Edge-Deployable OCR Models for Billboard Visibility Analysis Maciej Szankin et.al. 2507.11730 null Kimi
24 2025-07-15 PGT-I: Scaling Spatiotemporal GNNs with Memory-Efficient Distributed Training Seth Ockerman et.al. 2507.11683 null Kimi
25 2025-07-15 MapIQ: Benchmarking Multimodal Large Language Models for Map Question Answering Varun Srivastava et.al. 2507.11625 null Kimi
26 2025-07-15 Streaming 4D Visual Geometry Transformer Dong Zhuo et.al. 2507.11539 null Kimi
27 2025-07-15 DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering Yinsheng Li et.al. 2507.11527 null Kimi
28 2025-07-16 Reasoning Strategies in Large Language Models: Can They Follow, Prefer, and Optimize? Yanjian Zhang et.al. 2507.11423 null Kimi
29 2025-07-15 Seq vs Seq: An Open Suite of Paired Encoders and Decoders Orion Weller et.al. 2507.11412 null Kimi
30 2025-07-15 KisMATH: Do LLMs Have Knowledge of Implicit Structures in Mathematical Reasoning? Soumadeep Saha et.al. 2507.11408 null Kimi
31 2025-07-15 Automated Novelty Evaluation of Academic Paper: A Collaborative Approach Integrating Human and Large Language Model Knowledge Wenqing Wu et.al. 2507.11330 null Kimi
32 2025-07-15 Internal Value Alignment in Large Language Models through Controlled Value Vector Activation Haoran Jin et.al. 2507.11316 null Kimi
33 2025-07-15 KV-Latent: Dimensional-level KV Cache Reduction with Frequency-aware Rotary Positional Embedding Luohe Shi et.al. 2507.11273 null Kimi
34 2025-07-15 An Agentic Flow for Finite State Machine Extraction using Prompt Chaining Fares Wael et.al. 2507.11222 null Kimi
35 2025-07-15 Role-Playing LLM-Based Multi-Agent Support Framework for Detecting and Addressing Family Communication Bias Rushia Harada et.al. 2507.11210 null Kimi
36 2025-07-15 Temperature and Persona Shape LLM Agent Consensus With Minimal Accuracy Gains in Qualitative Coding Conrad Borchers et.al. 2507.11198 null Kimi
37 2025-07-15 Mixture of Experts in Large Language Models Danyang Zhang et.al. 2507.11181 null Kimi
38 2025-07-15 SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks Pavel Adamenko et.al. 2507.11059 null Kimi
39 2025-07-15 LLM-Augmented Symptom Analysis for Cardiovascular Disease Risk Prediction: A Clinical NLP Haowei Yang et.al. 2507.11052 null Kimi
40 2025-07-15 First-Order Error Matters: Accurate Compensation for Quantized Large Language Models Xingyu Zheng et.al. 2507.11017 null Kimi
41 2025-07-15 Teach Me Sign: Stepwise Prompting LLM for Sign Language Production Zhaoyi An et.al. 2507.10972 null Kimi
42 2025-07-15 DS@GT at eRisk 2025: From prompts to predictions, benchmarking early depression detection with conversational agent based assessments and temporal attention models Anthony Miyaguchi et.al. 2507.10958 null Kimi
43 2025-07-15 Modeling Understanding of Story-Based Analogies Using Large Language Models Kalit Inani et.al. 2507.10957 null Kimi
44 2025-07-15 Artificial Finance: How AI Thinks About Money Orhan Erdem et.al. 2507.10933 null Kimi
45 2025-07-15 HanjaBridge: Resolving Semantic Ambiguity in Korean LLMs via Hanja-Augmented Pre-Training Seungho Choi et.al. 2507.10920 null Kimi
46 2025-07-15 NavComposer: Composing Language Instructions for Navigation Trajectories through Action-Scene-Object Modularization Zongtao He et.al. 2507.10894 null Kimi
47 2025-07-14 Automated Thematic Analyses Using LLMs: Xylazine Wound Management Social Media Chatter Use Case JaMor Hairston et.al. 2507.10803 null Kimi
48 2025-07-14 Warehouse Spatial Question Answering with LLM Agent Hsiang-Wei Huang et.al. 2507.10778 null Kimi
49 2025-07-14 From Semantic Web and MAS to Agentic AI: A Unified Narrative of the Web of Agents Tatiana Petrova et.al. 2507.10644 null Kimi
50 2025-07-14 EmbRACE-3K: Embodied Reasoning and Action in Complex Environments Mingxian Lin et.al. 2507.10548 null Kimi
51 2025-07-14 CodeJudgeBench: Benchmarking LLM-as-a-Judge for Coding Tasks Hongchao Jiang et.al. 2507.10535 null Kimi
52 2025-07-14 Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination Mingqi Wu et.al. 2507.10532 null Kimi
53 2025-07-14 Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation Sangmin Bae et.al. 2507.10524 null Kimi
54 2025-07-14 DeepResearch$^{\text{Eco}}$: A Recursive Agentic Workflow for Complex Scientific Question Answering in Ecology Jennifer D’Souza et.al. 2507.10522 null Kimi
55 2025-07-14 Scene-Aware Conversational ADAS with Generative AI for Real-Time Driver Assistance Kyungtae Han et.al. 2507.10500 null Kimi
56 2025-07-14 Cameras as Relative Positional Encoding Ruilong Li et.al. 2507.10496 null Kimi
57 2025-07-14 Can You Detect the Difference? İsmail Tarım et.al. 2507.10475 null Kimi
58 2025-07-14 Referential ambiguity and clarification requests: comparing human and LLM behaviour Chris Madge et.al. 2507.10445 null Kimi
59 2025-07-14 Zorse: Optimizing LLM Training Efficiency on Heterogeneous GPU Clusters Runsheng Benson Guo et.al. 2507.10392 null Kimi
60 2025-07-14 FaceLLM: A Multimodal Large Language Model for Face Understanding Hatef Otroshi Shahreza et.al. 2507.10300 null Kimi
61 2025-07-14 Absher: A Benchmark for Evaluating Large Language Models Understanding of Saudi Dialects Renad Al-Monef et.al. 2507.10216 null Kimi
62 2025-07-14 Natural Language-based Assessment of L2 Oral Proficiency using LLMs Stefano Bannò et.al. 2507.10200 null Kimi
63 2025-07-14 Abusive text transformation using LLMs Rohitash Chandra et.al. 2507.10177 null Kimi
64 2025-07-14 Fusing Large Language Models with Temporal Transformers for Time Series Forecasting Chen Su et.al. 2507.10098 null Kimi
65 2025-07-14 Enhancing Chain-of-Thought Reasoning with Critical Representation Fine-tuning Chenxi Huang et.al. 2507.10085 null Kimi
66 2025-07-14 Cultural Bias in Large Language Models: Evaluating AI Agents through Moral Questionnaires Simon Münker et.al. 2507.10073 null Kimi
67 2025-07-14 Automating SPARQL Query Translations between DBpedia and Wikidata Malte Christian Bartels et.al. 2507.10045 null Kimi
68 2025-07-14 Deep Hidden Cognition Facilitates Reliable Chain-of-Thought Reasoning Zijun Chen et.al. 2507.10007 null Kimi
69 2025-07-14 On The Role of Intentionality in Knowledge Representation: Analyzing Scene Context for Cognitive Agents with a Tiny Language Model Mark Burgess et.al. 2507.10000 null Kimi
70 2025-07-14 Tiny Reward Models Sarah Pan et.al. 2507.09973 null Kimi
71 2025-07-14 DeepSeek: Paradigm Shifts and Technical Evolution in Large AI Models Luolin Xiong et.al. 2507.09955 null Kimi
72 2025-07-14 Enhancing Retrieval Augmented Generation with Hierarchical Text Segmentation Chunking Hai Toan Nguyen et.al. 2507.09935 null Kimi
73 2025-07-14 ViTCoT: Video-Text Interleaved Chain-of-Thought for Boosting Video Understanding in Large Language Models Yongheng Zhang et.al. 2507.09876 null Kimi
74 2025-07-14 Is Human-Written Data Enough? The Challenge of Teaching Reasoning to LLMs Without RL or Distillation Wei Du et.al. 2507.09850 null Kimi
75 2025-07-14 Generative Audio Language Modeling with Continuous-valued Tokens and Masked Next-Token Prediction Shu-wen Yang et.al. 2507.09834 null Kimi
76 2025-07-13 CADmium: Fine-Tuning Code Language Models for Text-Driven Sequential CAD Design Prashant Govindarajan et.al. 2507.09792 null Kimi
77 2025-07-13 TinyTroupe: An LLM-powered Multiagent Persona Simulation Toolkit Paulo Salem et.al. 2507.09788 null Kimi
78 2025-07-13 Sound and Complete Neuro-symbolic Reasoning with LLM-Grounded Interpretations Bradley P. Allen et.al. 2507.09751 null Kimi
79 2025-07-13 Large Language Models Encode Semantics in Low-Dimensional Linear Subspaces Baturay Saglam et.al. 2507.09709 null Kimi
80 2025-07-10 Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology Haochen Wang et.al. 2507.07999 null Kimi
81 2025-07-10 PyVision: Agentic Vision with Dynamic Tooling Shitian Zhao et.al. 2507.07998 null Kimi
82 2025-07-10 MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization Mingkai Jia et.al. 2507.07997 null Kimi
83 2025-07-10 Single-pass Adaptive Image Tokenization for Minimum Program Search Shivam Duggal et.al. 2507.07995 null Kimi
84 2025-07-10 Multigranular Evaluation for Brain Visual Decoding Weihao Xia et.al. 2507.07993 null Kimi
85 2025-07-10 Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs Jeongseok Hyun et.al. 2507.07990 null Kimi
86 2025-07-10 Automating Expert-Level Medical Reasoning Evaluation of Large Language Models Shuang Zhou et.al. 2507.07988 null Kimi
87 2025-07-10 OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding JingLi Lin et.al. 2507.07984 null Kimi
88 2025-07-10 Performance and Practical Considerations of Large and Small Language Models in Clinical Decision Support in Rheumatology Sabine Felde et.al. 2507.07983 null Kimi
89 2025-07-10 Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling Haoyu Wu et.al. 2507.07982 null Kimi
90 2025-07-10 Why is Your Language Model a Poor Implicit Reward Model? Noam Razin et.al. 2507.07981 null Kimi
91 2025-07-10 Scaling RL to Long Videos Yukang Chen et.al. 2507.07966 null Kimi
92 2025-07-10 MIRIX: Multi-Agent Memory System for LLM-Based Agents Yu Wang et.al. 2507.07957 null Kimi
93 2025-07-10 Input Conditioned Layer Dropping in Speech Foundation Models Abdul Hannan et.al. 2507.07954 null Kimi
94 2025-07-10 SAGE: A Visual Language Model for Anomaly Detection via Fact Enhancement and Entropy-aware Alignment Guoxin Zang et.al. 2507.07939 null Kimi
95 2025-07-10 Working with AI: Measuring the Occupational Implications of Generative AI Kiran Tomlinson et.al. 2507.07935 null Kimi
96 2025-07-10 Meek Models Shall Inherit the Earth Hans Gundlach et.al. 2507.07931 null Kimi
97 2025-07-10 Probing Experts’ Perspectives on AI-Assisted Public Speaking Training Nesrine Fourati et.al. 2507.07930 null Kimi
98 2025-07-10 Towards Continuous Home Cage Monitoring: An Evaluation of Tracking and Identification Strategies for Laboratory Mice Juan Pablo Oberhauser et.al. 2507.07929 null Kimi
99 2025-07-10 DTECT: Dynamic Topic Explorer & Context Tracker Suman Adhya et.al. 2507.07910 null Kimi
100 2025-07-10 Not Only Consistency: Enhance Test-Time Adaptation with Spatio-temporal Inconsistency for Remote Physiological Measurement Xiao Yang et.al. 2507.07908 null Kimi
101 2025-07-10 Agentic Retrieval of Topics and Insights from Earnings Calls Anant Gupta et.al. 2507.07906 null Kimi
102 2025-07-10 MIRA: A Novel Framework for Fusing Modalities in Medical RAG Jinhong Wang et.al. 2507.07902 null Kimi
103 2025-07-10 An Integrated Framework of Prompt Engineering and Multidimensional Knowledge Graphs for Legal Dispute Analysis Mingda Zhang et.al. 2507.07893 null Kimi
104 2025-07-10 Automating MD simulations for Proteins using Large language Models: NAMD-Agent Achuth Chandrasekhar et.al. 2507.07887 null Kimi
105 2025-07-10 Single-Step Latent Diffusion for Underwater Image Restoration Jiayi Wu et.al. 2507.07878 null Kimi
106 2025-07-10 DocCHA: Towards LLM-Augmented Interactive Online diagnosis System Xinyi Liu et.al. 2507.07870 null Kimi
107 2025-07-10 Alpay Algebra V: Multi-Layered Semantic Games and Transfinite Fixed-Point Simulation Bugra Kilictas et.al. 2507.07868 null Kimi
108 2025-07-10 Searching for actual causes: Approximate algorithms with adjustable precision Samuel Reyd et.al. 2507.07857 null Kimi
109 2025-07-10 From Ambiguity to Accuracy: The Transformative Effect of Coreference Resolution on Retrieval-Augmented Generation systems Youngjoon Jang et.al. 2507.07847 null Kimi
110 2025-07-10 MoSE: Skill-by-Skill Mixture-of-Expert Learning for Autonomous Driving Lu Xu et.al. 2507.07818 null Kimi
111 2025-07-10 When Large Language Models Meet Law: Dual-Lens Taxonomy, Technical Advances, and Ethical Governance Peizhang Shao et.al. 2507.07748 null Kimi
112 2025-07-10 Not All Preferences are What You Need for Post-Training: Selective Alignment Strategy for Preference Optimization Zhijin Dong et.al. 2507.07725 null Kimi
113 2025-07-10 Rationale-Enhanced Decoding for Multi-modal Chain-of-Thought Shin’ya Yamaguchi et.al. 2507.07685 null Kimi
114 2025-07-10 Single-to-mix Modality Alignment with Multimodal Large Language Model for Document Image Machine Translation Yupu Liang et.al. 2507.07572 null Kimi
115 2025-07-10 Toward Real-World Chinese Psychological Support Dialogues: CPsDD Dataset and a Co-Evolving Multi-Agent System Yuanchen Shi et.al. 2507.07509 null Kimi
116 2025-07-10 Hallucination Stations: On Some Basic Limitations of Transformer-Based Language Models Varin Sikka et.al. 2507.07505 null Kimi
117 2025-07-10 PLAN-TUNING: Post-Training Language Models to Learn Step-by-Step Planning for Complex Problem Solving Mihir Parmar et.al. 2507.07495 null Kimi
118 2025-07-10 Machine Bullshit: Characterizing the Emergent Disregard for Truth in Large Language Models Kaiqu Liang et.al. 2507.07484 null Kimi
119 2025-07-10 SAND: Boosting LLM Agents with Self-Taught Action Deliberation Yu Xia et.al. 2507.07441 null Kimi
120 2025-07-10 DrugMCTS: a drug repurposing framework combining multi-agent, RAG and Monte Carlo Tree Search Zerui Yang et.al. 2507.07426 null Kimi
121 2025-07-10 May I have your Attention? Breaking Fine-Tuning based Prompt Injection Defenses using Architecture-Aware Attacks Nishit V. Pandya et.al. 2507.07417 null Kimi
122 2025-07-10 GNN-CNN: An Efficient Hybrid Model of Convolutional and Graph Neural Networks for Text Representation Fardin Rastakhiz et.al. 2507.07414 null Kimi
123 2025-07-10 Phishing Detection in the Gen-AI Era: Quantized LLMs vs Classical Models Jikesh Thapa et.al. 2507.07406 null Kimi
124 2025-07-10 KVFlow: Efficient Prefix Caching for Accelerating LLM-Based Multi-Agent Workflows Zaifeng Pan et.al. 2507.07400 null Kimi
125 2025-07-09 Application of LLMs to Multi-Robot Path Planning and Task Allocation Ashish Kumar et.al. 2507.07302 null Kimi
126 2025-07-09 Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery Licong Xu et.al. 2507.07257 null Kimi
127 2025-07-09 Attentions Under the Microscope: A Comparative Study of Resource Utilization for Variants of Self-Attention Zhengyu Tian et.al. 2507.07247 null Kimi
128 2025-07-09 Prompt Perturbations Reveal Human-Like Biases in LLM Survey Responses Jens Rupprecht et.al. 2507.07188 null Kimi
129 2025-07-09 Interpretable EEG-to-Image Generation with Semantic Prompts Arshak Rezvani et.al. 2507.07157 null Kimi
130 2025-07-09 Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models Tiezheng Zhang et.al. 2507.07104 null Kimi
131 2025-07-09 Learning Deliberately, Acting Intuitively: Unlocking Test-Time Reasoning in Multimodal LLMs Yahan Yu et.al. 2507.06999 null Kimi
132 2025-07-09 Exploring LLMs for Predicting Tutor Strategy and Student Outcomes in Dialogues Fareya Ikram et.al. 2507.06910 null Kimi
133 2025-07-09 MultiJustice: A Chinese Dataset for Multi-Party, Multi-Charge Legal Prediction Xiao Wang et.al. 2507.06909 null Kimi
134 2025-07-09 Developing and Maintaining an Open-Source Repository of AI Evaluations: Challenges and Insights Alexandra Abbas et.al. 2507.06893 null Kimi
135 2025-07-09 Text to model via SysML: Automated generation of dynamical system computational models from unstructured natural language text via enhanced System Modeling Language diagrams Matthew Anderson Hendricks et.al. 2507.06803 null Kimi
136 2025-07-09 Efficient Industrial sLLMs through Domain Adaptive Continual Pretraining: Method, Evaluation and Applications Seonwu Kim et.al. 2507.06795 null Kimi
137 2025-07-09 Expediting data extraction using a large language model (LLM) and scoping review protocol: a methodological study within a complex scoping review James Stewart-Evans et.al. 2507.06623 null Kimi
138 2025-07-09 Nexus: Taming Throughput-Latency Tradeoff in LLM Serving via Efficient GPU Sharing Xiaoxiang Shi et.al. 2507.06608 null Kimi
139 2025-07-09 Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation Liliang Ren et.al. 2507.06607 null Kimi
140 2025-07-09 From Data-Centric to Sample-Centric: Enhancing LLM Reasoning via Progressive Optimization Xinjie Chen et.al. 2507.06573 null Kimi
141 2025-07-09 SlimCaching: Edge Caching of Mixture-of-Experts for Distributed Inference Qian Chen et.al. 2507.06567 null Kimi
142 2025-07-09 InvestAlign: Overcoming Data Scarcity in Aligning Large Language Models with Investor Decision-Making Processes under Herd Behavior Huisheng Wang et.al. 2507.06528 null Kimi
143 2025-07-09 SpindleKV: A Novel KV Cache Reduction Method Balancing Both Shallow and Deep Layers Zicong Tang et.al. 2507.06517 null Kimi
144 2025-07-09 Bilateral Collaboration with Large Vision-Language Models for Open Vocabulary Human-Object Interaction Detection Yupeng Hu et.al. 2507.06510 null Kimi
145 2025-07-09 Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings Russell Taylor et.al. 2507.06506 null Kimi
146 2025-07-09 MoFE-Time: Mixture of Frequency Domain Experts for Time-Series Forecasting Models Yiwen Liu et.al. 2507.06502 null Kimi
147 2025-07-09 Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning Ziyang Wang et.al. 2507.06485 null Kimi
148 2025-07-08 Bridging AI and Software Security: A Comparative Vulnerability Assessment of LLM Agent Deployment Paradigms Tarek Gasmi et.al. 2507.06323 null Kimi
149 2025-07-08 ETT: Expanding the Long Context Understanding Capability of LLMs at Test-Time Kiarash Zahirnia et.al. 2507.06313 null Kimi
150 2025-07-08 Too Human to Model: The Uncanny Valley of LLMs in Social Simulation – When Generative Language Agents Misalign with Modelling Principles Yongchao Zeng et.al. 2507.06310 null Kimi
151 2025-07-08 Humans overrely on overconfident language models, across languages Neil Rathi et.al. 2507.06306 null Kimi
152 2025-07-08 Efficiency-Effectiveness Reranking FLOPs for LLM-based Rerankers Zhiyuan Peng et.al. 2507.06223 null Kimi
153 2025-07-08 A Survey on Latent Reasoning Rui-Jie Zhu et.al. 2507.06203 null Kimi
154 2025-07-08 UQLM: A Python Package for Uncertainty Quantification in Large Language Models Dylan Bouchard et.al. 2507.06196 null Kimi
155 2025-07-09 Skywork-R1V3 Technical Report Wei Shen et.al. 2507.06167 null Kimi
156 2025-07-08 Evaluation of Habitat Robotics using Large Language Models William Li et.al. 2507.06157 null Kimi
157 2025-07-08 Coding Triangle: How Does Large Language Model Understand Code? Taolin Zhang et.al. 2507.06138 null Kimi
158 2025-07-08 NeoBabel: A Multilingual Open Tower for Visual Generation Mohammad Mahdi Derakhshani et.al. 2507.06137 null Kimi
159 2025-07-09 Omni-Video: Democratizing Unified Video Understanding and Generation Zhiyu Tan et.al. 2507.06119 null Kimi
160 2025-07-08 Few-shot text-based emotion detection Teodor-George Marchitan et.al. 2507.05918 null Kimi
161 2025-07-08 Affective-ROPTester: Capability and Bias Analysis of LLMs in Predicting Retinopathy of Prematurity Shuai Zhao et.al. 2507.05816 null Kimi
162 2025-07-08 Omni-Router: Sharing Routing Decisions in Sparse Mixture-of-Experts for Speech Recognition Zijin Gu et.al. 2507.05724 null Kimi
163 2025-07-08 Agentic-R1: Distilled Dual-Strategy Reasoning Weihua Du et.al. 2507.05707 null Kimi
164 2025-07-08 Smoothie-Qwen: Post-Hoc Smoothing to Reduce Language Bias in Multilingual LLMs SeungWon Ji et.al. 2507.05686 null Kimi
165 2025-07-08 LLMs are Introvert Litian Zhang et.al. 2507.05638 null Kimi
166 2025-07-08 SARA: Selective and Adaptive Retrieval-augmented Generation with Context Compression Yiqiao Jin et.al. 2507.05633 null Kimi
167 2025-07-08 Flipping Knowledge Distillation: Leveraging Small Models’ Expertise to Enhance LLMs in Text Matching Mingzhe Li et.al. 2507.05617 null Kimi
168 2025-07-08 Enhancing Test-Time Scaling of Large Language Models with Hierarchical Retrieval-Augmented MCTS Alex ZH Dou et.al. 2507.05557 null Kimi
169 2025-07-07 Conversational Education at Scale: A Multi-LLM Agent Workflow for Procedural Learning and Pedagogic Quality Assessment Jiahuan Pei et.al. 2507.05528 null Kimi
170 2025-07-07 Fine-Grained Vision-Language Modeling for Multimodal Training Assistants in Augmented Reality Haochen Huang et.al. 2507.05515 null Kimi
171 2025-07-07 On the Semantics of Large Language Models Martin Schuele et.al. 2507.05448 null Kimi
172 2025-07-07 “Lost-in-the-Later”: Framework for Quantifying Contextual Grounding in Large Language Models Yufei Tao et.al. 2507.05424 null Kimi
173 2025-07-07 On the Bias of Next-Token Predictors Toward Systematically Inefficient Reasoning: A Shortest-Path Case Study Riccardo Alberghi et.al. 2507.05362 null Kimi
174 2025-07-07 LoRA-Augmented Generation (LAG) for Knowledge-Intensive Language Tasks William Fleshman et.al. 2507.05346 null Kimi
175 2025-07-07 MindFlow: Revolutionizing E-commerce Customer Support with Multimodal LLM Agents Ming Gong et.al. 2507.05330 null Kimi
176 2025-07-07 LCDS: A Logic-Controlled Discharge Summary Generation System Supporting Source Attribution and Expert Review Cheng Yuan et.al. 2507.05319 null Kimi
177 2025-07-07 Spatio-Temporal LLM: Reasoning about Environments and Actions Haozhen Zheng et.al. 2507.05258 null Kimi
178 2025-07-07 Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions Yuanzhe Hu et.al. 2507.05257 null Kimi
179 2025-07-07 Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning Yana Wei et.al. 2507.05255 null Kimi
180 2025-07-07 When Chain of Thought is Necessary, Language Models Struggle to Evade Monitors Scott Emmons et.al. 2507.05246 null Kimi
181 2025-07-07 StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling Meng Wei et.al. 2507.05240 null Kimi
182 2025-07-07 Critiques of World Models Eric Xing et.al. 2507.05169 null Kimi
183 2025-07-07 InfoSteer: Steering Information Utility in Language Model Post-Training Chunyuan Deng et.al. 2507.05158 null Kimi
184 2025-07-07 AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models Chinnappa Guggilla et.al. 2507.05157 null Kimi
185 2025-07-07 Interpretable Mnemonic Generation for Kanji Learning via Expectation-Maximization Jaewook Lee et.al. 2507.05137 null Kimi
186 2025-07-07 MARBLE: A Multi-Agent Rule-Based LLM Reasoning Engine for Accident Severity Prediction Kaleem Ullah Qasim et.al. 2507.04893 null Kimi
187 2025-07-07 Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations A. Bochkov et.al. 2507.04886 null Kimi
188 2025-07-07 FurniMAS: Language-Guided Furniture Decoration using Multi-Agent System Toan Nguyen et.al. 2507.04770 null Kimi
189 2025-07-07 From Imitation to Innovation: The Emergence of AI Unique Artistic Styles and the Challenge of Copyright Protection Zexi Jia et.al. 2507.04769 null Kimi
190 2025-07-07 CoSteer: Collaborative Decoding-Time Personalization via Local Delta Steering Hang Lv et.al. 2507.04756 null Kimi
191 2025-07-07 LLM-based Question-Answer Framework for Sensor-driven HVAC System Interaction Sungmin Lee et.al. 2507.04748 null Kimi
192 2025-07-07 Activation Steering for Chain-of-Thought Compression Seyedarmin Azizi et.al. 2507.04742 null Kimi
193 2025-07-07 LOOM-Scope: a comprehensive and efficient LOng-cOntext Model evaluation framework Zecheng Tang et.al. 2507.04723 null Kimi
194 2025-07-07 SPATIA: Multimodal Model for Prediction and Generation of Spatial Cell Phenotypes Zhenglun Kong et.al. 2507.04704 null Kimi
195 2025-07-07 Performance Evaluation of General Purpose Large Language Models for Basic Linear Algebra Subprograms Code Generation Daichi Mukunoki et.al. 2507.04697 null Kimi
196 2025-07-07 Knowledge-Aware Self-Correction in Language Models via Structured Memory Graphs Swayamjit Saha et.al. 2507.04625 null Kimi
197 2025-07-07 Multimodal LLM Integrated Semantic Communications for 6G Immersive Experiences Yusong Zhang et.al. 2507.04621 null Kimi
198 2025-07-07 PRIME: Large Language Model Personalization with Cognitive Memory and Thought Processes Xinliang Frederick Zhang et.al. 2507.04607 null Kimi
199 2025-07-06 Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts Guokan Shang et.al. 2507.04569 null Kimi
200 2025-07-06 Evaluating LLMs on Real-World Forecasting Against Human Superforecasters Janna Lu et.al. 2507.04562 null Kimi
201 2025-07-06 MambaVideo for Discrete Video Tokenization with Channel-Split Quantization Dawit Mureja Argaw et.al. 2507.04559 null Kimi
202 2025-07-06 DP-Fusion: Token-Level Differentially Private Inference for Large Language Models Rushil Thareja et.al. 2507.04531 null Kimi
203 2025-07-06 Model Inversion Attacks on Llama 3: Extracting PII from Large Language Models Sathesh P. Sivashanmugam et.al. 2507.04478 null Kimi
204 2025-07-06 The role of large language models in UI/UX design: A systematic literature review Ammar Ahmed et.al. 2507.04469 null Kimi
205 2025-07-06 CoT-lized Diffusion: Let’s Reinforce T2I Generation Step-by-step Zheyuan Liu et.al. 2507.04451 null Kimi
206 2025-07-06 MedGellan: LLM-Generated Medical Guidance to Support Physicians Debodeep Banerjee et.al. 2507.04431 null Kimi
207 2025-07-03 RefTok: Reference-Based Tokenization for Video Generation Xiang Fan et.al. 2507.02862 null Kimi
208 2025-07-03 Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching Xin Zhou et.al. 2507.02860 null Kimi
209 2025-07-03 Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation Jiaer Xia et.al. 2507.02859 null Kimi
210 2025-07-03 Requirements Elicitation Follow-Up Question Generation Yuchen Shen et.al. 2507.02858 null Kimi
211 2025-07-03 Answer Matching Outperforms Multiple Choice for Language Model Evaluation Nikhil Chandak et.al. 2507.02856 null Kimi
212 2025-07-03 MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs Purbesh Mitra et.al. 2507.02851 null Kimi
213 2025-07-03 LLM Hypnosis: Exploiting User Feedback for Unauthorized Knowledge Injection to All Users Almog Hilel et.al. 2507.02850 null Kimi
214 2025-07-03 Visual Contextual Attack: Jailbreaking MLLMs with Image-Driven Context Injection Ziqi Miao et.al. 2507.02844 null Kimi
215 2025-07-03 StepHint: Multi-level Stepwise Hints Enhance Reinforcement Learning to Reason Kaiyi Zhang et.al. 2507.02841 null Kimi
216 2025-07-03 ExPO: Unlocking Hard Reasoning with Self-Explanation-Guided Reinforcement Learning Ruiyang Zhou et.al. 2507.02834 null Kimi
217 2025-07-03 USAD: An Unsupervised Data Augmentation Spatio-Temporal Attention Diffusion Network Ying Yu et.al. 2507.02827 null Kimi
218 2025-07-03 Establishing Best Practices for Building Rigorous Agentic Benchmarks Yuxuan Zhu et.al. 2507.02825 null Kimi
219 2025-07-03 DNN-Based Precoding in RIS-Aided mmWave MIMO Systems With Practical Phase Shift Po-Heng Chou et.al. 2507.02824 null Kimi
220 2025-07-03 SynapseRoute: An Auto-Route Switching Framework on Dual-State Large Language Model Wencheng Zhang et.al. 2507.02822 null Kimi
221 2025-07-03 Multimodal Mathematical Reasoning with Diverse Solving Perspective Wenhao Shi et.al. 2507.02804 null Kimi
222 2025-07-03 Is Reasoning All You Need? Probing Bias in the Age of Reasoning Language Models Riccardo Cantini et.al. 2507.02799 null Kimi
223 2025-07-03 From Long Videos to Engaging Clips: A Human-Inspired Video Editing Framework with Multimodal Narrative Understanding Xiangfeng Wang et.al. 2507.02790 null Kimi
224 2025-07-03 Moral Responsibility or Obedience: What Do We Want from AI? Joseph Boland et.al. 2507.02788 null Kimi
225 2025-07-03 Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs Ken Tsui et.al. 2507.02778 null Kimi
226 2025-07-03 KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs Yuzhang Xie et.al. 2507.02773 null Kimi
227 2025-07-03 DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment Ke-Han Lu et.al. 2507.02768 null Kimi
228 2025-07-03 Knowledge Protocol Engineering: A New Paradigm for AI in Domain-Specific Knowledge Work Guangwei Zhang et.al. 2507.02760 null Kimi
229 2025-07-03 Multi-agent Auditory Scene Analysis Caleb Rascon et.al. 2507.02755 null Kimi
230 2025-07-03 Fast and Simplex: 2-Simplicial Attention in Triton Aurko Roy et.al. 2507.02754 null Kimi
231 2025-07-03 Synthesizable by Design: A Retrosynthesis-Guided Framework for Molecular Analog Generation Shuan Chen et.al. 2507.02752 null Kimi
232 2025-07-03 Linear Attention with Global Context: A Multipole Attention Mechanism for Vision and Physics Alex Colagrande et.al. 2507.02748 null Kimi
233 2025-07-03 Early Signs of Steganographic Capabilities in Frontier LLMs Artur Zolkowski et.al. 2507.02737 null Kimi
234 2025-07-03 Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks Sizhe Chen et.al. 2507.02735 null Kimi
235 2025-07-03 Bourbaki: Self-Generated and Goal-Conditioned MDPs for Theorem Proving Matthieu Zimmer et.al. 2507.02726 null Kimi
236 2025-07-03 UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation Qin Guo et.al. 2507.02713 null Kimi
237 2025-07-03 AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models Ziyin Zhou et.al. 2507.02664 null Kimi
238 2025-07-03 OmniDraft: A Cross-vocabulary, Online Adaptive Drafter for On-device Speculative Decoding Ramchalam Kinattinkara Ramakrishnan et.al. 2507.02659 null Kimi
239 2025-07-03 FlowSpec: Continuous Pipelined Speculative Decoding for Efficient Distributed LLM Inference Xing Liu et.al. 2507.02620 null Kimi
240 2025-07-03 Strategic Intelligence in Large Language Models: Evidence from evolutionary Game Theory Kenneth Payne et.al. 2507.02618 null Kimi
241 2025-07-03 Are You Listening to Me? Fine-Tuning Chatbots for Empathetic Dialogue Paulo Ricardo Knob et.al. 2507.02537 null Kimi
242 2025-07-03 Red grape detection with accelerated artificial neural networks in the FPGA’s programmable logic Sandro Costa Magalhães et.al. 2507.02443 null Kimi
243 2025-07-03 Holistic Tokenizer for Autoregressive Image Generation Anlin Zheng et.al. 2507.02358 null Kimi
244 2025-07-03 DoMIX: An Efficient Framework for Exploiting Domain Knowledge in Fine-Tuning Dohoon Kim et.al. 2507.02302 null Kimi
245 2025-07-03 MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent Hongli Yu et.al. 2507.02259 null Kimi
246 2025-07-03 SurgVisAgent: Multimodal Agentic Model for Versatile Surgical Visual Enhancement Zeyu Lei et.al. 2507.02252 null Kimi
247 2025-07-02 ESTR-CoT: Towards Explainable and Accurate Event Stream based Scene Text Recognition with Chain-of-Thought Reasoning Xiao Wang et.al. 2507.02200 null Kimi
248 2025-07-02 Latent Chain-of-Thought? Decoding the Depth-Recurrent Transformer Wenquan Lu et.al. 2507.02199 null Kimi
249 2025-07-02 Reasoning or Not? A Comprehensive Evaluation of Reasoning LLMs for Dialogue Summarization Keyan Jin et.al. 2507.02145 null Kimi
250 2025-07-02 When LLMs Disagree: Diagnosing Relevance Filtering Bias and Retrieval Divergence in SDG Search William A. Ingram et.al. 2507.02139 null Kimi
251 2025-07-02 Dissecting the Impact of Mobile DVFS Governors on LLM Inference Performance and Energy Efficiency Zongpu Zhang et.al. 2507.02135 null Kimi
252 2025-07-02 Reasoning on a Budget: A Survey of Adaptive and Controllable Test-Time Compute in LLMs Mohammad Ali Alomrani et.al. 2507.02076 null Kimi
253 2025-07-02 Large Language Models for Crash Detection in Video: A Survey of Methods, Datasets, and Challenges Sanjeda Akter et.al. 2507.02074 null Kimi
254 2025-07-02 Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation Zhuoyang Zhang et.al. 2507.01957 null Kimi
255 2025-07-02 SpecCLIP: Aligning and Translating Spectroscopic Measurements for Stars Xiaosheng Zhao et.al. 2507.01939 null Kimi
256 2025-07-02 Decision-oriented Text Evaluation Yu-Shiang Huang et.al. 2507.01923 null Kimi
257 2025-07-02 Gradient-Adaptive Policy Optimization: Towards Multi-Objective Alignment of Large Language Models Chengao Li et.al. 2507.01915 null Kimi
258 2025-07-02 AI4Research: A Survey of Artificial Intelligence for Scientific Research Qiguang Chen et.al. 2507.01903 null Kimi
259 2025-07-02 High-Layer Attention Pruning with Rescaling Songtao Liu et.al. 2507.01900 null Kimi
260 2025-07-02 MiCoTA: Bridging the Learnability Gap with Intermediate CoT and Teacher Assistants Dongyi Ding et.al. 2507.01887 null Kimi
261 2025-07-02 Bridging UI Design and chatbot Interactions: Applying Form-Based Principles to Conversational Agents Sanjay Krishna Anbalagan et.al. 2507.01862 null Kimi
262 2025-07-02 Eka-Eval : A Comprehensive Evaluation Framework for Large Language Models in Indian Languages Samridhi Raj Sinha et.al. 2507.01853 null Kimi
263 2025-07-02 LoRA Fine-Tuning Without GPUs: A CPU-Efficient Meta-Generation Framework for LLMs Reza Arabpour et.al. 2507.01806 null Kimi
264 2025-07-02 How Do Vision-Language Models Process Conflicting Information Across Modalities? Tianze Hua et.al. 2507.01790 null Kimi
265 2025-07-02 MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining Zhixun Chen et.al. 2507.01785 null Kimi
266 2025-07-02 ECCV 2024 W-CODA: 1st Workshop on Multimodal Perception and Comprehension of Corner Cases in Autonomous Driving Kai Chen et.al. 2507.01735 null Kimi
267 2025-07-02 AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness Zixin Chen et.al. 2507.01702 null Kimi
268 2025-07-02 Data Agent: A Holistic Architecture for Orchestrating Data+AI Ecosystems Zhaoyan Sun et.al. 2507.01599 null Kimi
269 2025-07-02 Following the Clues: Experiments on Person Re-ID using Cross-Modal Intelligence Robert Aufschläger et.al. 2507.01504 null Kimi
270 2025-07-02 Agent-as-Tool: A Study on the Hierarchical Decision Making with Reinforcement Learning Yanfei Zhang et.al. 2507.01489 null Kimi
271 2025-07-02 BioMARS: A Multi-Agent Robotic System for Autonomous Biological Experiments Yibo Qiu et.al. 2507.01485 null Kimi
272 2025-07-02 Evaluating the Effectiveness of Direct Preference Optimization for Personalizing German Automatic Text Simplifications for Persons with Intellectual Disabilities Yingqiang Gao et.al. 2507.01479 null Kimi
273 2025-07-02 LogitSpec: Accelerating Retrieval-based Speculative Decoding via Next Next Token Speculation Tianyu Liu et.al. 2507.01449 null Kimi
274 2025-07-02 EdgeLoRA: An Efficient Multi-Tenant LLM Serving System on Edge Devices Zheyu Shen et.al. 2507.01438 null Kimi
275 2025-07-02 AI Agents and Agentic AI-Navigating a Plethora of Concepts for Future Manufacturing Yinwang Ren et.al. 2507.01376 null Kimi
276 2025-07-02 Long-Tailed Distribution-Aware Router For Mixture-of-Experts in Large Vision-Language Model Chaoxiang Cai et.al. 2507.01351 null Kimi
277 2025-07-02 Symbolic or Numerical? Understanding Physics Problem Solving in Reasoning LLMs Nifu Dan et.al. 2507.01334 null Kimi
278 2025-07-02 VLAD: A VLM-Augmented Autonomous Driving Framework with Hierarchical Planning and Interpretable Decision Process Cristian Gariboldi et.al. 2507.01284 null Kimi
279 2025-07-02 GAIus: Combining Genai with Legal Clauses Retrieval for Knowledge-based Assistant Michał Matak et.al. 2507.01259 null Kimi
280 2025-07-01 Enhancing LLM Agent Safety via Causal Influence Prompting Dongyoon Hahm et.al. 2507.00979 null Kimi
281 2025-07-01 Large Language Model Powered Intelligent Urban Agents: Concepts, Capabilities, and Applications Jindong Han et.al. 2507.00914 null Kimi
282 2025-07-01 ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models Zifu Wan et.al. 2507.00898 null Kimi
283 2025-07-01 TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation Xi Xuan et.al. 2507.00875 null Kimi
284 2025-07-01 Teaching Time Series to See and Speak: Forecasting with Aligned Visual and Textual Perspectives Sixun Dong et.al. 2506.24124 null Kimi
285 2025-06-30 Calligrapher: Freestyle Text Image Customization Yue Ma et.al. 2506.24123 null Kimi
286 2025-06-30 Data Uniformity Improves Training Efficiency and More, with a Convergence Framework Beyond the NTK Regime Yuqing Wang et.al. 2506.24120 null Kimi
287 2025-07-01 SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning Bo Liu et.al. 2506.24119 null Kimi
288 2025-07-01 Intertextual Parallel Detection in Biblical Hebrew: A Transformer-Based Benchmark David M. Smiley et.al. 2506.24117 null Kimi
289 2025-06-30 On the Predictive Power of Representation Dispersion in Language Models Yanhong Li et.al. 2506.24106 null Kimi
290 2025-06-30 DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World Xiangtai Li et.al. 2506.24102 null Kimi
291 2025-06-30 MotionGPT3: Human Motion as a Second Modality Bingfan Zhu et.al. 2506.24086 null Kimi
292 2025-06-30 Imagine for Me: Creative Conceptual Blending of Real Images and Text via Blended Attention Wonwoong Cho et.al. 2506.24085 null Kimi
293 2025-06-30 STACK: Adversarial Attacks on LLM Safeguard Pipelines Ian R. McKenzie et.al. 2506.24068 null Kimi
294 2025-06-30 Continual Adaptation: Environment-Conditional Parameter Generation for Object Detection in Dynamic Scenarios Deng Li et.al. 2506.24063 null Kimi
295 2025-06-30 Logit-Gap Steering: Efficient Short-Suffix Jailbreaks for Aligned Large Language Models Tung-Ling Li et.al. 2506.24056 null Kimi
296 2025-06-30 Agent.xpu: Efficient Scheduling of Agentic LLM Workloads on Heterogeneous SoC Xinming Wei et.al. 2506.24045 null Kimi
297 2025-06-30 A Survey on Vision-Language-Action Models for Autonomous Driving Sicong Jiang et.al. 2506.24044 null Kimi
298 2025-06-30 Foundation Models for Zero-Shot Segmentation of Scientific Images without AI-Ready Data Shubhabrata Mukherjee et.al. 2506.24039 null Kimi
299 2025-06-30 Ella: Embodied Social Agents with Lifelong Memory Hongxin Zhang et.al. 2506.24019 null Kimi
300 2025-06-30 EXPERT: An Explainable Image Captioning Evaluation Metric with Structured Explanations Hyunjong Kim et.al. 2506.24016 null Kimi
301 2025-06-30 Large Language Models Don’t Make Sense of Word Problems. A Scoping Review from a Mathematics Education Perspective Anselm R. Strohmaier et.al. 2506.24006 null Kimi
302 2025-06-30 ShapeKit Junqi Liu et.al. 2506.24003 null Kimi
303 2025-06-30 The Illusion of Progress? A Critical Look at Test-Time Adaptation for Vision-Language Models Lijun Sheng et.al. 2506.24000 null Kimi
304 2025-06-30 Auto-TA: Towards Scalable Automated Thematic Analysis (TA) via Multi-Agent Large Language Models with Reinforcement Learning Seungjun Yi et.al. 2506.23998 null Kimi
305 2025-06-30 STCLocker: Deadlock Avoidance Testing for Autonomous Driving Systems Mingfei Cheng et.al. 2506.23995 null Kimi
306 2025-06-30 Harnessing AI Agents to Advance Research on Refugee Child Mental Health Aditya Shrivastava et.al. 2506.23992 null Kimi
307 2025-06-30 Machine Understanding of Scientific Language Dustin Wright et.al. 2506.23990 null Kimi
308 2025-06-30 TaP: A Taxonomy-Guided Framework for Automated and Scalable Preference Data Generation Renren Jin et.al. 2506.23979 null Kimi
309 2025-06-30 LLM Agents Are the Antidote to Walled Gardens Samuele Marro et.al. 2506.23978 null Kimi
310 2025-06-30 Evaluating the Impact of Khmer Font Types on Text Recognition Vannkinh Nom et.al. 2506.23963 null Kimi
311 2025-06-30 ADReFT: Adaptive Decision Repair for Safe Autonomous Driving via Reinforcement Fine-Tuning Mingfei Cheng et.al. 2506.23960 null Kimi
312 2025-06-30 Performance of LLMs on Stochastic Modeling Operations Research Problems: From Theory to Practice Akshit Kumar et.al. 2506.23924 null Kimi
313 2025-06-30 Advancing Multi-Step Mathematical Reasoning in Large Language Models through Multi-Layered Self-Reflection with Auto-Prompting André de Souza Loureiro et.al. 2506.23888 null Kimi
314 2025-06-30 Chain of Thought in Order: Discovering Learning-Friendly Orders for Arithmetic Yuta Sato et.al. 2506.23875 null Kimi
315 2025-06-30 A Survey on Autonomy-Induced Security Risks in Large Model-Based Agents Hang Su et.al. 2506.23844 null Kimi
316 2025-06-30 Do Thinking Tokens Help or Trap? Towards More Efficient Large Reasoning Model Bowen Ding et.al. 2506.23840 null Kimi
317 2025-06-30 Flash-VStream: Efficient Real-Time Understanding for Long Video Streams Haoji Zhang et.al. 2506.23825 null Kimi
318 2025-06-30 AutoEvoEval: An Automated Framework for Evolving Close-Ended LLM Evaluation Data JiaRu Wu et.al. 2506.23735 null Kimi
319 2025-06-30 Attestable Audits: Verifiable AI Safety Benchmarks Using Trusted Execution Environments Christoph Schnabl et.al. 2506.23706 null Kimi
320 2025-06-30 PokéAI: A Goal-Generating, Battle-Optimizing Multi-agent System for Pokemon Red Zihao Liu et.al. 2506.23689 null Kimi
321 2025-06-30 Interactive Reasoning: Visualizing and Controlling Chain-of-Thought Reasoning in Large Language Models Rock Yuren Pang et.al. 2506.23678 null Kimi
322 2025-06-30 Unified Multimodal Understanding via Byte-Pair Visual Encoding Wanpeng Zhang et.al. 2506.23639 null Kimi
323 2025-06-30 Towards Building Private LLMs: Exploring Multi-Node Expert Parallelism on Apple Silicon for Mixture-of-Experts Large Language Model Mu-Chi Chen et.al. 2506.23635 null Kimi
324 2025-06-30 AI-Generated Lecture Slides for Improving Slide Element Detection and Retrieval Suyash Maniyar et.al. 2506.23605 null Kimi
325 2025-06-30 Semantic-guided Diverse Decoding for Large Language Model Weijie Shi et.al. 2506.23601 null Kimi
326 2025-06-30 MMReason: An Open-Ended Multi-Modal Multi-Step Reasoning Benchmark for MLLMs Toward AGI Huanjin Yao et.al. 2506.23563 null Kimi
327 2025-06-30 NEU-ESC: A Comprehensive Vietnamese dataset for Educational Sentiment analysis and topic Classification toward multitask learning Phan Quoc Hung Mai et.al. 2506.23524 null Kimi
328 2025-06-30 Thought-Augmented Planning for LLM-Powered Interactive Recommender Agent Haocheng Yu et.al. 2506.23485 null Kimi
329 2025-06-29 Pipelined Decoder for Efficient Context-Aware Text Generation Zixian Huang et.al. 2506.23431 null Kimi
330 2025-06-29 TuCo: Measuring the Contribution of Fine-Tuning to Individual Responses of LLMs Felipe Nuti et.al. 2506.23423 null Kimi
331 2025-06-29 SIEDD: Shared-Implicit Encoder with Discrete Decoders Vikram Rangarajan et.al. 2506.23382 null Kimi
332 2025-06-29 Perspective Dial: Measuring Perspective of Text and Guiding LLM Outputs Taejin Kim et.al. 2506.23377 null Kimi
333 2025-06-29 ATGen: A Framework for Active Text Generation Akim Tsvigun et.al. 2506.23342 null Kimi
334 2025-06-29 Information Loss in LLMs’ Multilingual Translation: The Role of Training Data, Language Proximity, and Language Family Yumeng Lin et.al. 2506.23340 null Kimi
335 2025-06-29 GATSim: Urban Mobility Simulation with Generative Agents Qi Liu et.al. 2506.23306 null Kimi
336 2025-06-29 Objective-Free Local Learning and Emergent Language Structure in Thinking Machines P. Myles Eugenio et.al. 2506.23293 null Kimi
337 2025-06-26 Whole-Body Conditioned Egocentric Video Prediction Yutong Bai et.al. 2506.21552 null Kimi
338 2025-06-26 mTSBench: Benchmarking Multivariate Time Series Anomaly Detection and Model Selection at Scale Xiaona Zhou et.al. 2506.21550 null Kimi
339 2025-06-26 SAM4D: Segment Anything in Camera and LiDAR Streams Jianyun Xu et.al. 2506.21547 null Kimi
340 2025-06-26 HalluSegBench: Counterfactual Visual Reasoning for Segmentation Hallucination Evaluation Xinzhuo Li et.al. 2506.21546 null Kimi
341 2025-06-26 PsyLite Technical Report Fangjun Ding et.al. 2506.21536 null Kimi
342 2025-06-26 Exploring the Design Space of 3D MLLMs for CT Report Generation Mohammed Baharoon et.al. 2506.21535 null Kimi
343 2025-06-26 “What’s Up, Doc?”: Analyzing How Users Seek Health Information in Large-Scale Conversational AI Datasets Akshay Paruchuri et.al. 2506.21532 null Kimi
344 2025-06-26 WAFT: Warping-Alone Field Transforms for Optical Flow Yihan Wang et.al. 2506.21526 null Kimi
345 2025-06-26 Potemkin Understanding in Large Language Models Marina Mancoridis et.al. 2506.21521 null Kimi
346 2025-06-26 Mitigating Hallucination of Large Vision-Language Models via Dynamic Logits Calibration Jiahe Chen et.al. 2506.21509 null Kimi
347 2025-06-26 skLEP: A Slovak General Language Understanding Benchmark Marek Šuppa et.al. 2506.21508 null Kimi
348 2025-06-26 Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge Boyu Gou et.al. 2506.21506 null Kimi
349 2025-06-26 Enhancing User Engagement in Socially-Driven Dialogue through Interactive LLM Alignments Jiashuo Wang et.al. 2506.21497 null Kimi
350 2025-06-26 Bridging Offline and Online Reinforcement Learning for LLMs Jack Lanchantin et.al. 2506.21495 null Kimi
351 2025-06-26 Ad-Hoc Human-AI Coordination Challenge Tin Dizdarević et.al. 2506.21490 null Kimi
352 2025-06-26 TITAN: Query-Token based Domain Adaptive Adversarial Learning Tajamul Ashraf et.al. 2506.21484 null Kimi
353 2025-06-26 TopK Language Models Ryosuke Takahashi et.al. 2506.21468 null Kimi
354 2025-06-26 Efficient and Reuseable Cloud Configuration Search Using Discovery Spaces Michael Johnston et.al. 2506.21467 null Kimi
355 2025-06-26 Aligning Spoken Dialogue Models from User Interactions Anne Wu et.al. 2506.21463 null Kimi
356 2025-06-26 Spatial Mental Modeling from Limited Views Baiqiao Yin et.al. 2506.21458 null Kimi
357 2025-06-26 Rethinking Oversaturation in Classifier-Free Guidance via Low Frequency Kaiyu Song et.al. 2506.21452 null Kimi
358 2025-06-26 ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing Huadai Liu et.al. 2506.21448 null Kimi
359 2025-06-26 Text2Cypher Across Languages: Evaluating Foundational Models Beyond English Makbule Gulcin Ozsoy et.al. 2506.21445 null Kimi
360 2025-06-26 Domain Knowledge-Enhanced LLMs for Fraud and Concept Drift Detection Ali Şenol et.al. 2506.21443 null Kimi
361 2025-06-26 HyperSORT: Self-Organising Robust Training with hyper-networks Samuel Joutard et.al. 2506.21430 null Kimi
362 2025-06-26 XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation Bowen Chen et.al. 2506.21416 null Kimi
363 2025-06-26 Scalable Bayesian Low-Rank Adaptation of Large Language Models via Stochastic Variational Subspace Inference Colin Samplawski et.al. 2506.21408 null Kimi
364 2025-06-26 TableMoE: Neuro-Symbolic Routing for Structured Expert Reasoning in Multimodal Table Understanding Junwen Zhang et.al. 2506.21393 null Kimi
365 2025-06-26 Leveraging LLM-Assisted Query Understanding for Live Retrieval-Augmented Generation Guanting Dong et.al. 2506.21384 null Kimi
366 2025-06-26 GenFlow: Interactive Modular System for Image Generation Duc-Hung Nguyen et.al. 2506.21369 null Kimi
367 2025-06-26 Latent Prototype Routing: Achieving Near-Perfect Load Balancing in Mixture-of-Experts Jiajie Yang et.al. 2506.21328 null Kimi
368 2025-06-26 Detecting Referring Expressions in Visually Grounded Dialogue with Autoregressive Language Models Bram Willemsen et.al. 2506.21294 null Kimi
369 2025-06-26 Small Encoders Can Rival Large Decoders in Detecting Groundedness Istabrak Abbes et.al. 2506.21288 null Kimi
370 2025-06-26 Double-Checker: Enhancing Reasoning of Slow-Thinking LLMs via Self-Critical Fine-Tuning Xin Xu et.al. 2506.21285 null Kimi
371 2025-06-26 HumanOmniV2: From Understanding to Omni-Modal Reasoning with Context Qize Yang et.al. 2506.21277 null Kimi
372 2025-06-26 DiLoCoX: A Low-Communication Large-Scale Training Framework for Decentralized Cluster Ji Qi et.al. 2506.21263 null Kimi
373 2025-06-26 Unveiling Causal Reasoning in Large Language Models: Reality or Mirage? Haoang Chi et.al. 2506.21215 null Kimi
374 2025-06-26 $T^3$: Multi-level Tree-based Automatic Program Repair with Large Language Models Quanming Liu et.al. 2506.21211 null Kimi
375 2025-06-26 Task-Aware KV Compression For Cost-Effective Long Video Understanding Minghao Qin et.al. 2506.21184 null Kimi
376 2025-06-26 Uncover Treasures in DCT: Advancing JPEG Quality Enhancement by Exploiting Latent Correlations Jing Yang et.al. 2506.21171 null Kimi
377 2025-06-26 Large Language Models Acing Chartered Accountancy Jatin Gupta et.al. 2506.21031 null Kimi
378 2025-06-26 Evidence-based diagnostic reasoning with multi-agent copilot for human pathology Chengkuan Chen et.al. 2506.20964 null Kimi
379 2025-06-26 ParEval-Repo: A Benchmark Suite for Evaluating LLMs with Repository-level HPC Translation Tasks Joshua H. Davis et.al. 2506.20938 null Kimi
380 2025-06-25 Uncovering Hidden Violent Tendencies in LLMs: A Demographic Analysis via Behavioral Vignettes Quintin Myers et.al. 2506.20822 null Kimi
381 2025-06-25 MultiFinRAG: An Optimized Multimodal Retrieval-Augmented Generation (RAG) Framework for Financial Question Answering Chinmay Gondhalekar et.al. 2506.20821 null Kimi
382 2025-06-25 The Ideation-Execution Gap: Execution Outcomes of LLM-Generated versus Human Research Ideas Chenglei Si et.al. 2506.20803 null Kimi
383 2025-06-25 The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind Andrei Lupu et.al. 2506.20664 null Kimi
384 2025-06-25 Memento: Note-Taking for Your Future Self Chao Wan et.al. 2506.20642 null Kimi
385 2025-06-26 DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation Shansan Gong et.al. 2506.20639 null Kimi
386 2025-06-25 Show, Tell and Summarize: Dense Video Captioning Using Visual Cue Aided Sentence Summarization Zhiwang Zhang et.al. 2506.20567 null Kimi
387 2025-06-25 When Life Gives You Samples: The Benefits of Scaling up Inference Compute for Multilingual LLMs Ammar Khairi et.al. 2506.20544 null Kimi
388 2025-06-25 WattsOnAI: Measuring, Analyzing, and Visualizing Energy and Carbon Footprint of AI Workloads Hongzhen Huang et.al. 2506.20535 null Kimi
389 2025-06-25 Case-based Reasoning Augmented Large Language Model Framework for Decision Making in Realistic Safety-Critical Driving Scenarios Wenbin Gan et.al. 2506.20531 null Kimi
390 2025-06-25 OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling Zengzhi Wang et.al. 2506.20512 null Kimi
391 2025-06-25 Probing AI Safety with Source Code Ujwal Narayan et.al. 2506.20471 null Kimi
392 2025-06-25 An Agentic System for Rare Disease Diagnosis with Traceable Reasoning Weike Zhao et.al. 2506.20430 null Kimi
393 2025-06-25 SV-LLM: An Agentic Approach for SoC Security Verification using Large Language Models Dipayan Saha et.al. 2506.20415 null Kimi
394 2025-06-25 Enterprise Large Language Model Evaluation Benchmark Liya Wang et.al. 2506.20274 null Kimi
395 2025-06-25 A Transformer Based Handwriting Recognition System Jointly Using Online and Offline Features Ayush Lodh et.al. 2506.20255 null Kimi
396 2025-06-25 Enhancing Large Language Models through Structured Reasoning Yubo Dong et.al. 2506.20241 null Kimi
397 2025-06-25 How to Retrieve Examples in In-context Learning to Improve Conversational Emotion Recognition using Large Language Models? Mengqi Wang et.al. 2506.20199 null Kimi
398 2025-06-25 SEED: A Structural Encoder for Embedding-Driven Decoding in Time Series Prediction with LLMs Fengze Li et.al. 2506.20167 null Kimi
399 2025-06-25 EAR: Erasing Concepts from Unified Autoregressive Models Haipeng Fan et.al. 2506.20151 null Kimi
400 2025-06-25 MIRAGE: A Benchmark for Multimodal Information-Seeking and Reasoning in Agricultural Expert-Guided Conversations Vardhan Dongre et.al. 2506.20100 null Kimi
401 2025-06-25 A Modular Multitask Reasoning Framework Integrating Spatio-temporal Models and LLMs Kethmi Hirushini Hettige et.al. 2506.20073 null Kimi
402 2025-06-24 Persona-Assigned Large Language Models Exhibit Human-Like Motivated Reasoning Saloni Dash et.al. 2506.20020 null Kimi
403 2025-06-24 Accurate and Energy Efficient: Local Retrieval-Augmented Generation Models Outperform Commercial Large Language Models in Medical Tasks Konstantinos Vrettos et.al. 2506.20009 null Kimi
404 2025-06-24 Inference Scaled GraphRAG: Improving Multi Hop Question Answering on Knowledge Graphs Travis Thompson et.al. 2506.19967 null Kimi
405 2025-06-24 Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture Shuchen Xue et.al. 2506.19935 null Kimi
406 2025-06-24 Prover Agent: An Agent-based Framework for Formal Mathematical Proofs Kaito Baba et.al. 2506.19923 null Kimi
407 2025-06-24 Radial Attention: $O(n\log n)$ Sparse Attention with Energy Decay for Long Video Generation Xingyang Li et.al. 2506.19852 null Kimi
408 2025-06-24 Orthogonal Finetuning Made Scalable Zeju Qiu et.al. 2506.19847 null Kimi
409 2025-06-24 Scaling Speculative Decoding with Lookahead Reasoning Yichao Fu et.al. 2506.19830 null Kimi
410 2025-06-24 KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality Baochang Ren et.al. 2506.19807 null Kimi
411 2025-06-24 Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study Yuqi Zhu et.al. 2506.19794 null Kimi
412 2025-06-24 SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning Yuqian Fu et.al. 2506.19767 null Kimi
413 2025-06-24 Arabic Dialect Classification using RNNs, Transformers, and Large Language Models: A Comparative Analysis Omar A. Essameldin et.al. 2506.19753 null Kimi
414 2025-06-24 Recurrent Visual Feature Extraction and Stereo Attentions for CT Report Generation Yuanhe Tian et.al. 2506.19665 null Kimi
415 2025-06-24 PEVLM: Parallel Encoding for Vision-Language Models Letian Kang et.al. 2506.19651 null Kimi
416 2025-06-24 ECCoT: A Framework for Enhancing Effective Cognition via Chain of Thought in Large Language Model Zhenke Duan et.al. 2506.19599 null Kimi
417 2025-06-24 Fake or Real, Can Robots Tell? Evaluating Embodied Vision-Language Models on Real and 3D-Printed Objects Federico Tavella et.al. 2506.19579 null Kimi
418 2025-06-24 AnTKV: Anchor Token-Aware Sub-Bit Vector Quantization for KV Cache in Large Language Models Zeyu Li et.al. 2506.19505 null Kimi
419 2025-06-24 Dialogic Pedagogy for Large Language Models: Aligning Conversational AI with Proven Theories of Learning Russell Beale et.al. 2506.19484 null Kimi
420 2025-06-24 Can Large Language Models Capture Human Annotator Disagreements? Jingwei Ni et.al. 2506.19467 null Kimi
421 2025-06-24 Mem4Nav: Boosting Vision-and-Language Navigation in Urban Environments with a Hierarchical Spatial-Cognition Long-Short Memory System Lixuan He et.al. 2506.19433 null Kimi
422 2025-06-24 Learning to Disentangle Latent Reasoning Rules with Language VAEs: A Systematic Study Yingji Zhang et.al. 2506.19418 null Kimi
423 2025-06-24 Automated Detection of Pre-training Text in Black-box LLMs Ruihan Hu et.al. 2506.19399 null Kimi
424 2025-06-24 Personality Prediction from Life Stories using Language Models Rasiq Hussain et.al. 2506.19258 null Kimi
425 2025-06-24 RecLLM-R1: A Two-Stage Training Paradigm with Reinforcement Learning and Chain-of-Thought v1 Yu Xie et.al. 2506.19235 null Kimi
426 2025-06-24 Video-XL-2: Towards Very Long-Video Understanding Through Task-Aware KV Sparsification Minghao Qin et.al. 2506.19225 null Kimi
427 2025-06-24 Augmenting Multi-Agent Communication with State Delta Trajectory Yichen Tang et.al. 2506.19209 null Kimi
428 2025-06-23 Thought Anchors: Which LLM Reasoning Steps Matter? Paul C. Bogdan et.al. 2506.19143 null Kimi
429 2025-06-23 HAWAII: Hierarchical Visual Knowledge Transfer for Efficient Vision-Language Models Yimu Wang et.al. 2506.19072 null Kimi
430 2025-06-23 Quantifying Fairness in LLMs Beyond Tokens: A Semantic and Statistical Perspective Weijie Xu et.al. 2506.19028 null Kimi
431 2025-06-23 Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations Jiaming Han et.al. 2506.18898 null Kimi
432 2025-06-23 OMEGA: Can LLMs Reason Outside the Box in Math? Evaluating Exploratory, Compositional, and Transformative Generalization Yiyou Sun et.al. 2506.18880 null Kimi
433 2025-06-23 CommVQ: Commutative Vector Quantization for KV Cache Compression Junyan Li et.al. 2506.18879 null Kimi
434 2025-06-23 OmniGen2: Exploration to Advanced Multimodal Generation Chenyuan Wu et.al. 2506.18871 null Kimi
435 2025-06-23 LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning Yuhao Wu et.al. 2506.18841 null Kimi
436 2025-06-23 STU-PID: Steering Token Usage via PID Controller for Efficient Large Language Model Reasoning Aryasomayajula Ram Bharadwaj et.al. 2506.18831 null Kimi
437 2025-06-23 Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories Islem Bouzenia et.al. 2506.18824 null Kimi
438 2025-06-24 ConciseHint: Boosting Efficient Reasoning via Continuous Concise Hints during Generation Siao Tang et.al. 2506.18810 link Kimi
439 2025-06-23 Existing LLMs Are Not Self-Consistent For Simple Tasks Zhenru Lin et.al. 2506.18781 null Kimi
440 2025-06-23 Is There a Case for Conversation Optimized Tokenizers in Large Language Models? Raquel Ferrando et.al. 2506.18674 null Kimi
441 2025-06-23 Historical Report Guided Bi-modal Concurrent Learning for Pathology Report Generation Ling Zhang et.al. 2506.18658 null Kimi
442 2025-06-23 ReDit: Reward Dithering for Improved LLM Policy Optimization Chenxing Wei et.al. 2506.18631 null Kimi
443 2025-06-23 AggTruth: Contextual Hallucination Detection using Aggregated Attention Scores in LLMs Piotr Matys et.al. 2506.18628 null Kimi
444 2025-06-23 Parallel Continuous Chain-of-Thought with Jacobi Iteration Haoyi Wu et.al. 2506.18582 null Kimi
445 2025-06-23 Security Assessment of DeepSeek and GPT Series Models against Jailbreak Attacks Xiaodong Wu et.al. 2506.18543 null Kimi
446 2025-06-23 Comparative Evaluation of ChatGPT and DeepSeek Across Key NLP Tasks: Strengths, Weaknesses, and Domain-Specific Performance Wael Etaiwi et.al. 2506.18501 null Kimi
447 2025-06-23 MeRF: Motivation-enhanced Reinforcement Finetuning for Large Reasoning Models Junjie Zhang et.al. 2506.18485 null Kimi
448 2025-06-23 TReB: A Comprehensive Benchmark for Evaluating Table Reasoning Capabilities of Large Language Models Ce Li et.al. 2506.18421 null Kimi
449 2025-06-23 SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation Zichong Li et.al. 2506.18349 null Kimi
450 2025-06-23 Dynamic Knowledge Exchange and Dual-diversity Review: Concisely Unleashing the Potential of a Multi-Agent Research Team Weilun Yu et.al. 2506.18348 null Kimi
451 2025-06-23 Less Data Less Tokens: Multilingual Unification Learning for Efficient Test-Time Reasoning in LLMs Kang Chen et.al. 2506.18341 null Kimi
452 2025-06-23 RLPR: Extrapolating RLVR to General Domains without Verifiers Tianyu Yu et.al. 2506.18254 null Kimi
453 2025-06-23 Make It Efficient: Dynamic Sparse Attention for Autoregressive Image Generation Xunzhi Xiang et.al. 2506.18226 null Kimi
454 2025-06-22 Understanding Reasoning in Thinking Language Models via Steering Vectors Constantin Venhoff et.al. 2506.18167 null Kimi
455 2025-06-22 Chain-of-Memory: Enhancing GUI Agents for Cross-Application Navigation Xinzge Gao et.al. 2506.18158 null Kimi
456 2025-06-22 QuranMorph: Morphologically Annotated Quranic Corpus Diyam Akra et.al. 2506.18148 null Kimi
457 2025-06-22 $φ^{\infty}$: Clause Purification, Embedding Realignment, and the Total Suppression of the Em Dash in Autoregressive Language Models Bugra Kilictas et.al. 2506.18129 null Kimi
458 2025-06-22 Mental Health Equity in LLMs: Leveraging Multi-Hop Question Answering to Detect Amplified and Silenced Perspectives Batool Haider et.al. 2506.18116 null Kimi
459 2025-06-22 InspireDebate: Multi-Dimensional Subjective-Objective Evaluation-Guided Reasoning and Optimization for Debating Fuyu Wang et.al. 2506.18102 null Kimi
460 2025-06-22 RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation Tianxing Chen et.al. 2506.18088 null Kimi
461 2025-06-18 PhantomHunter: Detecting Unseen Privately-Tuned LLM-Generated Text via Family-Aware Learning Yuhui Shi et.al. 2506.15683 null Kimi
462 2025-06-18 Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence Yining Hong et.al. 2506.15677 null Kimi
463 2025-06-18 Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers Tommaso Green et.al. 2506.15674 link Kimi
464 2025-06-18 Revisiting Compositional Generalization Capability of Large Language Models Considering Instruction Following Ability Yusuke Sakai et.al. 2506.15629 null Kimi
465 2025-06-18 WikiMixQA: A Multimodal Benchmark for Question Answering over Tables and Charts Negar Foroutan et.al. 2506.15594 link Kimi
466 2025-06-18 Managing Complex Failure Analysis Workflows with LLM-based Reasoning and Acting Agents Aline Dobrovsky et.al. 2506.15567 null Kimi
467 2025-06-18 PredGen: Accelerated Inference of Large Language Models through Input-Time Speculation for Real-Time Speech Interaction Shufan Li et.al. 2506.15556 null Kimi
468 2025-06-18 Optimizing Web-Based AI Query Retrieval with GPT Integration in LangChain A CoT-Enhanced Prompt Engineering Approach Wenqi Guan et.al. 2506.15512 null Kimi
469 2025-06-18 SPARE: Single-Pass Annotation with Reference-Guided Evaluation for Automatic Process Supervision and Reward Modelling Md Imbesat Hassan Rizvi et.al. 2506.15498 link Kimi
470 2025-06-18 Context-Informed Grounding Supervision Hyunji Lee et.al. 2506.15480 link Kimi
471 2025-06-18 RE-IMAGINE: Symbolic Benchmark Synthesis for Reasoning Evaluation Xinnuo Xu et.al. 2506.15455 null Kimi
472 2025-06-18 Uncovering Intention through LLM-Driven Code Snippet Description Generation Yusuf Sulistyo Nugroho et.al. 2506.15453 null Kimi
473 2025-06-18 Targeted Lexical Injection: Unlocking Latent Cross-Lingual Alignment in Lugha-Llama via Early-Layer LoRA Fine-Tuning Stanley Ngugi et.al. 2506.15415 null Kimi
474 2025-06-18 DeVisE: Behavioral Testing of Medical Large Language Models Camila Zurdo Tagliabue et.al. 2506.15339 null Kimi
475 2025-06-18 Cohort Discovery: A Survey on LLM-Assisted Clinical Trial Recruitment Shrestha Ghosh et.al. 2506.15301 null Kimi
476 2025-06-18 ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning in LLMs Feng He et.al. 2506.15211 null Kimi
477 2025-06-18 A Comparative Study of Task Adaptation Techniques of Large Language Models for Identifying Sustainable Development Goals Andrea Cadeddu et.al. 2506.15208 null Kimi
478 2025-06-18 eLLM: Elastic Memory Management Framework for Efficient LLM Serving Jiale Xu et.al. 2506.15155 null Kimi
479 2025-06-18 Modeling the One-to-Many Property in Open-Domain Dialogue with LLMs Jing Yang Lee et.al. 2506.15131 null Kimi
480 2025-06-18 Truncated Proximal Policy Optimization Tiantian Fan et.al. 2506.15050 null Kimi
481 2025-06-17 SFT-GO: Supervised Fine-Tuning with Group Optimization for Large Language Models Gyuhak Kim et.al. 2506.15021 null Kimi
482 2025-06-17 Scaling Intelligence: Designing Data Centers for Next-Gen Language Models Jesmin Jahan Tithi et.al. 2506.15006 null Kimi
483 2025-06-17 Memory Tokens: Large Language Models Can Generate Reversible Sentence Embeddings Ignacio Sastre et.al. 2506.15001 link Kimi
484 2025-06-17 A Variational Framework for Improving Naturalness in Generative Spoken Language Models Li-Wei Chen et.al. 2506.14767 link Kimi
485 2025-06-17 ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM Yujun Wang et.al. 2506.14766 null Kimi
486 2025-06-18 Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs Ling Team et.al. 2506.14731 null Kimi
487 2025-06-17 GuiLoMo: Allocating Expert Number and Rank for LoRA-MoE via Bilevel Optimization with GuidedSelection Vectors Hengyuan Zhang et.al. 2506.14646 link Kimi
488 2025-06-17 Revisiting Chain-of-Thought Prompting: Zero-shot Can Be Stronger than Few-shot Xiang Cheng et.al. 2506.14641 null Kimi
489 2025-06-18 AIn’t Nothing But a Survey? Using Large Language Models for Coding German Open-Ended Survey Responses on Survey Motivation Leah von der Heyde et.al. 2506.14634 null Kimi
490 2025-06-18 Probabilistic Aggregation and Targeted Embedding Optimization for Collective Moral Reasoning in Large Language Models Chenchen Yuan et.al. 2506.14625 link Kimi
491 2025-06-16 Steering LLM Thinking with Budget Guidance Junyan Li et.al. 2506.13752 link Kimi
492 2025-06-16 Evaluating Large Language Models for Phishing Detection, Self-Consistency, Faithfulness, and Explainability Shova Kuikel et.al. 2506.13746 link Kimi
493 2025-06-16 Instruction Following by Boosting Attention of Large Language Models Vitoria Guardieiro et.al. 2506.13734 null Kimi
494 2025-06-16 Balancing Knowledge Delivery and Emotional Comfort in Healthcare Conversational Systems Shang-Chi Tsai et.al. 2506.13692 null Kimi
495 2025-06-16 Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model Shaolei Zhang et.al. 2506.13642 link Kimi
496 2025-06-16 An Empirical Study of LLM-as-a-Judge: How Design Choices Impact Evaluation Reliability Yusuke Yamauchi et.al. 2506.13639 null Kimi
497 2025-06-16 CAMS: A CityGPT-Powered Agentic Framework for Urban Human Mobility Simulation Yuwei Du et.al. 2506.13599 null Kimi
498 2025-06-16 Qwen vs. Gemma Integration with Whisper: A Comparative Study in Multilingual SpeechLLM Systems Tuan Nguyen et.al. 2506.13596 null Kimi
499 2025-06-16 MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention MiniMax et.al. 2506.13585 link Kimi
500 2025-06-16 Flexible-length Text Infilling for Discrete Diffusion Models Andrew Zhang et.al. 2506.13579 null Kimi
501 2025-06-16 Mixture of Weight-shared Heterogeneous Group Attention Experts for Dynamic Token-wise KV Optimization Guanghui Song et.al. 2506.13541 null Kimi
502 2025-06-16 ROSAQ: Rotation-based Saliency-Aware Weight Quantization for Efficiently Compressing Large Language Models Junho Yoon et.al. 2506.13472 null Kimi
503 2025-06-16 Unveiling the Learning Mind of Language Models: A Cognitive Framework and Empirical Study Zhengyu Hu et.al. 2506.13464 null Kimi
504 2025-06-16 StoryBench: A Dynamic Benchmark for Evaluating Long-Term Memory with Multi Turns Luanbo Wan et.al. 2506.13356 null Kimi
505 2025-06-16 Direct Reasoning Optimization: LLMs Can Reward And Refine Their Own Reasoning for Open-Ended Tasks Yifei Xu et.al. 2506.13351 null Kimi
506 2025-06-16 SeqPE: Transformer with Sequential Position Encoding Huyang Li et.al. 2506.13277 link Kimi
507 2025-06-16 IGD: Token Decisiveness Modeling via Information Gain in LLMs for Personalized Recommendation Zijie Lin et.al. 2506.13229 link Kimi
508 2025-06-16 Thought Crime: Backdoors and Emergent Misalignment in Reasoning Models James Chua et.al. 2506.13206 null Kimi
509 2025-06-16 Breaking Thought Patterns: A Multi-Dimensional Reasoning Framework for LLMs Xintong Tang et.al. 2506.13192 null Kimi
510 2025-06-16 Adapting LLMs for Minimal-edit Grammatical Error Correction Ryszard Staruch et.al. 2506.13148 null Kimi
511 2025-06-16 ZINA: Multimodal Fine-grained Hallucination Detection and Editing Yuiga Wada et.al. 2506.13130 null Kimi
512 2025-06-16 Rethinking Test-Time Scaling for Medical AI: Model and Task-Aware Strategies for LLMs and VLMs Gyutaek Oh et.al. 2506.13102 null Kimi
513 2025-06-16 Discerning What Matters: A Multi-Dimensional Assessment of Moral Competence in LLMs Daniel Kilov et.al. 2506.13082 null Kimi
514 2025-06-16 MotiveBench: How Far Are We From Human-Like Motivational Reasoning in Large Language Models? Xixian Yong et.al. 2506.13065 null Kimi
515 2025-06-16 Multipole Attention for Efficient Long Context Reasoning Coleman Hooper et.al. 2506.13059 null Kimi
516 2025-06-16 Metis-RISE: RL Incentivizes and SFT Enhances Multimodal Reasoning Model Learning Haibo Qiu et.al. 2506.13056 null Kimi
517 2025-06-16 Just Go Parallel: Improving the Multilingual Capabilities of Large Language Models Muhammad Reza Qorib et.al. 2506.13044 null Kimi
518 2025-06-15 Reasoning Model Unlearning: Forgetting Traces, Not Just Answers, While Preserving Reasoning Skills Changsheng Wang et.al. 2506.12963 null Kimi
519 2025-06-15 HypER: Literature-grounded Hypothesis Generation and Distillation with Provenance Rosni Vasu et.al. 2506.12937 null Kimi
520 2025-06-15 Scaling Test-time Compute for LLM Agents King Zhu et.al. 2506.12928 null Kimi
521 2025-06-12 Fine-Grained Perturbation Guidance via Attention Head Selection Donghoon Ahn et.al. 2506.10978 null Kimi
522 2025-06-12 AutoMind: Adaptive Knowledgeable Agent for Automated Data Science Yixin Ou et.al. 2506.10974 link Kimi
523 2025-06-12 Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs Qizhe Zhang et.al. 2506.10967 link Kimi
524 2025-06-12 MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning Yuxuan Luo et.al. 2506.10963 null Kimi
525 2025-06-12 SpectralAR: Spectral Autoregressive Visual Generation Yuanhui Huang et.al. 2506.10962 null Kimi
526 2025-06-12 ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark Kangwei Liu et.al. 2506.10960 link Kimi
527 2025-06-12 ReGuidance: A Simple Diffusion Wrapper for Boosting Sample Quality on Hard Inverse Problems Aayush Karan et.al. 2506.10955 null Kimi
528 2025-06-12 Build the web for agents, not agents for the web Xing Han Lù et.al. 2506.10953 null Kimi
529 2025-06-12 Spurious Rewards: Rethinking Training Signals in RLVR Rulin Shao et.al. 2506.10947 link Kimi
530 2025-06-12 GUARD: Guided Unlearning and Retention via Data Attribution for Large Language Models Evelyn Ma et.al. 2506.10946 null Kimi
531 2025-06-12 VINCIE: Unlocking In-context Image Editing from Video Leigang Qu et.al. 2506.10941 null Kimi
532 2025-06-12 Dynamic Epistemic Friction in Dialogue Timothy Obiso et.al. 2506.10934 null Kimi
533 2025-06-12 The Role of Generative AI in Facilitating Social Interactions: A Scoping Review T. T. J. E. Arets et.al. 2506.10927 null Kimi
534 2025-06-12 Robustly Improving LLM Fairness in Realistic Settings via Interpretability Adam Karvonen et.al. 2506.10922 link Kimi
535 2025-06-12 Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization Or Shafran et.al. 2506.10920 link Kimi
536 2025-06-12 M4V: Multi-Modal Mamba for Text-to-Video Generation Jiancheng Huang et.al. 2506.10915 null Kimi
537 2025-06-12 Breaking Bad Molecules: Are MLLMs Ready for Structure-Level Molecular Detoxification? Fei Lin et.al. 2506.10912 null Kimi
538 2025-06-12 Beyond Gold Standards: Epistemic Ensemble of LLM Judges for Formal Mathematical Reasoning Lan Zhang et.al. 2506.10903 null Kimi
539 2025-06-12 BioClinical ModernBERT: A State-of-the-Art Long-Context Encoder for Biomedical and Clinical NLP Thomas Sounack et.al. 2506.10896 link Kimi
540 2025-06-12 AIR: Zero-shot Generative Model Adaptation with Iterative Refinement Guimeng Liu et.al. 2506.10895 link Kimi
541 2025-06-12 Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers Yixiao Huang et.al. 2506.10887 null Kimi
542 2025-06-12 Slimming Down LLMs Without Losing Their Minds Qingda et.al. 2506.10885 null Kimi
543 2025-06-12 VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos Jiashuo Yu et.al. 2506.10857 null Kimi
544 2025-06-12 A Study on Individual Spatiotemporal Activity Generation Method Using MCP-Enhanced Chain-of-Thought Large Language Models Yu Zhang et.al. 2506.10853 link Kimi
545 2025-06-12 Accelerating Diffusion Large Language Models with SlowFast: The Three Golden Principles Qingyan Wei et.al. 2506.10848 link Kimi
546 2025-06-12 CIIR@LiveRAG 2025: Optimizing Multi-Agent Retrieval Augmented Generation through Self-Training Alireza Salemi et.al. 2506.10844 link Kimi
547 2025-06-12 Generalist Models in Medical Image Segmentation: A Survey and Performance Comparison with Task-Specific Approaches Andrea Moglia et.al. 2506.10825 null Kimi
548 2025-06-12 ReCUT: Balancing Reasoning Length and Accuracy in LLMs via Stepwise Trails and Preference Optimization Zhensheng Jin et.al. 2506.10822 link Kimi
549 2025-06-12 VideoDeepResearch: Long Video Understanding With Agentic Tool Using Huaying Yuan et.al. 2506.10821 link Kimi
550 2025-06-12 Prompts to Summaries: Zero-Shot Language-Guided Video Summarization Mario Barbara et.al. 2506.10807 null Kimi
551 2025-06-12 PREMISE: Scalable and Strategic Prompt Optimization for Efficient Mathematical Reasoning in Large Models Ye Yu et.al. 2506.10716 null Kimi
552 2025-06-12 Large Language Models for Detection of Life-Threatening Texts Thanh Thi Nguyen et.al. 2506.10687 null Kimi
553 2025-06-12 TeleMath: A Benchmark for Large Language Models in Telecom Mathematical Problem Solving Vincenzo Colle et.al. 2506.10674 null Kimi
554 2025-06-12 Data Shifts Hurt CoT: A Theoretical Study Lang Yin et.al. 2506.10647 null Kimi
555 2025-06-12 Spelling-out is not Straightforward: LLMs’ Capability of Tokenization from Token to Characters Tatsuya Hiraoka et.al. 2506.10641 null Kimi
556 2025-06-12 NeuralNexus at BEA 2025 Shared Task: Retrieval-Augmented Prompting for Mistake Identification in AI Tutors Numaan Naeem et.al. 2506.10627 link Kimi
557 2025-06-12 Primender Sequence: A Novel Mathematical Construct for Testing Symbolic Inference and AI Reasoning Mohd Anwar Jamal Faiz et.al. 2506.10585 null Kimi
558 2025-06-12 LogiPlan: A Structured Benchmark for Logical Planning and Relational Reasoning in LLMs Yanan Cai et.al. 2506.10527 null Kimi
559 2025-06-12 Reliable Reasoning Path: Distilling Effective Guidance for LLM Reasoning with Knowledge Graphs Yilin Xiao et.al. 2506.10508 null Kimi
560 2025-06-12 Beyond Single-User Dialogue: Assessing Multi-User Dialogue State Tracking Capabilities of Large Language Models Sangmin Song et.al. 2506.10504 null Kimi
561 2025-06-12 TD-Pipe: Temporally-Disaggregated Pipeline Parallelism Architecture for High-Throughput LLM Inference Hongbin Zhang et.al. 2506.10470 null Kimi
562 2025-06-12 Specification and Evaluation of Multi-Agent LLM Systems – Prototype and Cybersecurity Applications Felix Härer et.al. 2506.10467 link Kimi
563 2025-06-12 MedSeg-R: Reasoning Segmentation in Medical Images with Multimodal Large Language Models Yu Huang et.al. 2506.10465 null Kimi
564 2025-06-12 Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts Zaijing Li et.al. 2506.10357 null Kimi
565 2025-06-12 Code Execution as Grounded Supervision for LLM Reasoning Dongwon Jung et.al. 2506.10343 link Kimi
566 2025-06-12 Discrete Audio Tokens: More Than a Survey! Pooneh Mousavi et.al. 2506.10274 null Kimi
567 2025-06-11 Disclosure Audits for LLM Agents Saswat Das et.al. 2506.10171 null Kimi
568 2025-06-11 Can LLMs Generate Good Stories? Insights and Challenges from a Narrative Planning Perspective Yi Wang et.al. 2506.10161 null Kimi
569 2025-06-11 When Meaning Stays the Same, but Models Drift: Evaluating Quality of Service under Token-Level Behavioral Instability in LLMs Xiao Li et.al. 2506.10095 link Kimi
570 2025-06-11 From Judgment to Interference: Early Stopping LLM Harmful Outputs via Streaming Content Monitoring Yang Li et.al. 2506.09996 null Kimi
571 2025-06-11 PersonaLens: A Benchmark for Personalization Evaluation in Conversational AI Assistants Zheng Zhao et.al. 2506.09902 link Kimi
572 2025-06-11 Attention Head Embeddings with Trainable Deep Kernels for Hallucination Detection in LLMs Rodion Oblovatny et.al. 2506.09886 null Kimi
573 2025-06-11 Causal Sufficiency and Necessity Improves Chain-of-Thought Reasoning Xiangning Yu et.al. 2506.09853 null Kimi
574 2025-06-11 Dataset of News Articles with Provenance Metadata for Media Relevance Assessment Tomas Peterka et.al. 2506.09847 null Kimi
575 2025-06-11 OctoNav: Towards Generalist Embodied Navigation Chen Gao et.al. 2506.09839 null Kimi
576 2025-06-11 CoRT: Code-integrated Reasoning within Thinking Chengpeng Li et.al. 2506.09820 link Kimi
577 2025-06-11 Intelligent Design 4.0: Paradigm Evolution Toward the Agentic AI Era Shuo Jiang et.al. 2506.09755 null Kimi
578 2025-06-11 Intent Factored Generation: Unleashing the Diversity in Your Language Model Eltayeb Ahmed et.al. 2506.09659 null Kimi
579 2025-06-11 DGAE: Diffusion-Guided Autoencoder for Efficient Latent Representation Learning Dongxu Liu et.al. 2506.09644 null Kimi
580 2025-06-11 From Symbolic to Neural and Back: Exploring Knowledge Graph-Large Language Model Synergies Blaž Škrlj et.al. 2506.09566 null Kimi
581 2025-06-11 Understanding the Performance and Power of LLM Inferencing on Edge Accelerators Mayank Arya et.al. 2506.09554 null Kimi
582 2025-06-11 Athena: Enhancing Multimodal Reasoning with Data-efficient Process Reward Models Shuai Wang et.al. 2506.09532 null Kimi
583 2025-06-11 Revisit What You See: Disclose Language Prior in Vision Tokens for Efficient Guided Decoding of LVLMs Beomsik Cho et.al. 2506.09522 link Kimi
584 2025-06-11 ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning Yu Sun et.al. 2506.09513 link Kimi
585 2025-06-11 Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning Jiayi Yuan et.al. 2506.09501 null Kimi
586 2025-06-11 Token Constraint Decoding Improves Robustness on Question Answering for Large Language Models Jui-Ming Yao et.al. 2506.09408 null Kimi
587 2025-06-11 SLED: A Speculative LLM Decoding Framework for Efficient Edge Serving Xiangchen Li et.al. 2506.09397 null Kimi
588 2025-06-11 DIVE into MoE: Diversity-Enhanced Reconstruction of Large Language Models from Dense into Mixture-of-Experts Yuchen Feng et.al. 2506.09351 null Kimi
589 2025-06-11 Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation Shanchuan Lin et.al. 2506.09350 null Kimi
590 2025-06-11 Ming-Omni: A Unified Multimodal Model for Perception and Generation Inclusion AI et.al. 2506.09344 link Kimi
591 2025-06-11 Latent Multi-Head Attention for Small Language Models Sushant Mehta et.al. 2506.09342 null Kimi
592 2025-06-11 Multi-Agent Language Models: Advancing Cooperation, Coordination, and Adaptation Arjun Vaithilingam Sudhakar et.al. 2506.09331 null Kimi
593 2025-06-10 Improving LLM Agent Planning with In-Context Learning via Atomic Fact Augmentation and Lookahead Search Samuel Holt et.al. 2506.09171 null Kimi
594 2025-06-10 VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning Li Kang et.al. 2506.09049 null Kimi
595 2025-06-10 Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better Dianyi Wang et.al. 2506.09040 link Kimi
596 2025-06-10 Can A Gamer Train A Mathematical Reasoning Model? Andrew Shin et.al. 2506.08935 link Kimi
597 2025-06-10 Socratic-MCTS: Test-Time Visual Reasoning by Asking the Right Questions David Acuna et.al. 2506.08927 null Kimi
598 2025-06-10 PropMEND: Hypernetworks for Knowledge Propagation in LLMs Zeyu Leo Liu et.al. 2506.08920 link Kimi
599 2025-06-10 From Legal Texts to Defeasible Deontic Logic via LLMs: A Study in Automated Semantic Analysis Elias Horner et.al. 2506.08899 null Kimi
600 2025-06-10 The impact of fine tuning in LLaMA on hallucinations for named entity extraction in legal documentation Francisco Vargas et.al. 2506.08827 null Kimi
601 2025-06-10 Measuring Data Science Automation: A Survey of Evaluation Tools for AI Assistants and Agents Irene Testini et.al. 2506.08800 null Kimi
602 2025-06-10 AraReasoner: Evaluating Reasoning-Based LLMs for Arabic NLP Ahmed Hasanaath et.al. 2506.08768 null Kimi
603 2025-06-10 Improved LLM Agents for Financial Document Question Answering Nelvin Tan et.al. 2506.08726 null Kimi
604 2025-06-10 ConfPO: Exploiting Policy Model Confidence for Critical Token Selection in Large Language Model Preference Optimization Hee Suk Yoon et.al. 2506.08712 null Kimi
605 2025-06-10 Efficient Post-Training Refinement of Latent Reasoning in Large Language Models Xinyuan Wang et.al. 2506.08552 null Kimi
606 2025-06-10 DRAGged into Conflicts: Detecting and Addressing Conflicting Sources in Search-Augmented LLMs Arie Cattan et.al. 2506.08500 link Kimi
607 2025-06-10 Fairness is Not Silence: Unmasking Vacuous Neutrality in Small Language Models Sumanth Manduru et.al. 2506.08487 null Kimi
608 2025-06-10 Efficient Context Selection for Long-Context QA: No Tuning, No Iteration, Just Adaptive-$k$ Chihiro Taguchi et.al. 2506.08479 null Kimi
609 2025-06-10 A Survey on Large Language Models for Mathematical Reasoning Peng-Yuan Wang et.al. 2506.08446 null Kimi
610 2025-06-10 Low-resource domain adaptation while minimizing energy and hardware resource consumption Hernán Maina et.al. 2506.08433 null Kimi
611 2025-06-10 TACTIC: Translation Agents with Cognitive-Theoretic Interactive Collaboration Weiya Li et.al. 2506.08403 link Kimi
612 2025-06-10 Reinforce LLM Reasoning through Multi-Agent Reflection Yurun Yuan et.al. 2506.08379 null Kimi
613 2025-06-10 Draft-based Approximate Inference for LLMs Kevin Galim et.al. 2506.08373 link Kimi
614 2025-06-10 Mitigating Posterior Salience Attenuation in Long-Context LLMs with Positional Contrastive Decoding Zikai Xiao et.al. 2506.08371 null Kimi
615 2025-06-10 DEAL: Disentangling Transformer Head Activations for LLM Steering Li-Ming Zhan et.al. 2506.08359 null Kimi
616 2025-06-10 Evaluating LLMs Across Multi-Cognitive Levels: From Medical Knowledge Mastery to Scenario-Based Problem Solving Yuxuan Zhou et.al. 2506.08349 link Kimi
617 2025-06-09 A Comprehensive Study of Decoder-Only LLMs for Text-to-Image Generation Andrew Z. Wang et.al. 2506.08210 null Kimi
618 2025-06-09 LLM-BT: Back-Translation as a Framework for Terminology Standardization and Dynamic Semantic Embedding Li Weigang et.al. 2506.08174 null Kimi
619 2025-06-09 Multilingual Hate Speech Detection in Social Media Using Translation-Based Approaches with Large Language Models Muhammad Usman et.al. 2506.08147 null Kimi
620 2025-06-09 HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization Hongzheng Chen et.al. 2506.07972 link Kimi
621 2025-06-09 Reinforcing Multimodal Understanding and Generation with Dual Self-rewards Jixiang Hong et.al. 2506.07963 null Kimi
622 2025-06-09 Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations Yizhen Li et.al. 2506.07943 null Kimi
623 2025-06-09 Mimicking or Reasoning: Rethinking Multi-Modal In-Context Learning in Vision-Language Models Chengyue Huang et.al. 2506.07936 null Kimi
624 2025-06-09 Solving Inequality Proofs with Large Language Models Jiayi Sheng et.al. 2506.07927 link Kimi
625 2025-06-09 LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement Dimitris Panagopoulos et.al. 2506.07915 null Kimi
626 2025-06-09 MiniCPM4: Ultra-Efficient LLMs on End Devices MiniCPM Team et.al. 2506.07900 link Kimi
627 2025-06-09 Evaluating Large Language Models on the Frame and Symbol Grounding Problems: A Zero-shot Benchmark Shoko Oka et.al. 2506.07896 link Kimi
628 2025-06-09 Learning to Focus: Causal Attention Distillation via Gradient-Guided Token Pruning Yiju Guo et.al. 2506.07851 null Kimi
629 2025-06-09 Improving large language models with concept-aware fine-tuning Michael K. Chen et.al. 2506.07833 link Kimi
630 2025-06-09 Addition in Four Movements: Mapping Layer-wise Information Trajectories in LLMs Yao Yan et.al. 2506.07824 null Kimi
631 2025-06-09 Augmenting LLMs’ Reasoning by Reinforcing Abstract Thinking Silin Gao et.al. 2506.07751 null Kimi
632 2025-06-09 Beyond the Sentence: A Survey on Context-Aware Machine Translation with Large Language Models Ramakrishna Appicharla et.al. 2506.07583 null Kimi
633 2025-06-09 SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems Peiran Li et.al. 2506.07564 null Kimi
634 2025-06-09 SELT: Self-Evaluation Tree Search for LLMs with Task Decomposition Mengsong Wu et.al. 2506.07557 null Kimi
635 2025-06-09 MoQAE: Mixed-Precision Quantization for Long-Context LLM Inference via Mixture of Quantization-Aware Experts Wei Tao et.al. 2506.07533 null Kimi
636 2025-06-09 LeVo: High-Quality Song Generation with Multi-Preference Alignment Shun Lei et.al. 2506.07520 link Kimi
637 2025-06-09 Graph-of-Causal Evolution: Challenging Chain-of-Model for Reasoning Libo Wang et.al. 2506.07501 null Kimi
638 2025-06-09 CCI4.0: A Bilingual Pretraining Dataset for Enhancing Reasoning in Large Language Models Guang Liu et.al. 2506.07463 null Kimi
639 2025-06-09 Prompt to Protection: A Comparative Study of Multimodal LLMs in Construction Hazard Recognition Nishi Chaudhary et.al. 2506.07436 null Kimi
640 2025-06-09 Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding Feifan Song et.al. 2506.07434 link Kimi
641 2025-06-09 Evaluating Visual Mathematics in Multimodal LLMs: A Multilingual Benchmark Based on the Kangaroo Tests Arnau Igualde Sáez et.al. 2506.07418 null Kimi
642 2025-06-09 MedChat: A Multi-Agent Framework for Multimodal Diagnosis with Large Language Models Philip Liu et.al. 2506.07400 link Kimi
643 2025-06-09 Improving LLM Reasoning through Interpretable Role-Playing Steering Anyi Wang et.al. 2506.07335 null Kimi
644 2025-06-09 JavelinGuard: Low-Cost Transformer Architectures for LLM Security Yash Datta et.al. 2506.07330 null Kimi
645 2025-06-08 Reward Model Interpretability via Optimal and Pessimal Tokens Brian Christian et.al. 2506.07326 null Kimi
646 2025-06-08 Paged Attention Meets FlexAttention: Unlocking Long-Context Efficiency in Deployed Inference Thomas Joshi et.al. 2506.07311 null Kimi
647 2025-06-08 Tokenized Bandit for LLM Decoding and Alignment Suho Shin et.al. 2506.07276 null Kimi
648 2025-06-08 Learn as Individuals, Evolve as a Team: Multi-agent LLMs Adaptation in Embodied Environments Xinran Li et.al. 2506.07232 null Kimi
649 2025-06-08 Advancing Multimodal Reasoning Capabilities of Multimodal Large Language Models via Visual Perception Reward Tong Xiao et.al. 2506.07218 null Kimi
650 2025-06-05 Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets Lei Hsiung et.al. 2506.05346 null Kimi
651 2025-06-05 Inference-Time Hyper-Scaling with KV Cache Compression Adrian Łańcucki et.al. 2506.05345 null Kimi
652 2025-06-05 SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs Jiahui Wang et.al. 2506.05344 link Kimi
653 2025-06-05 Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning Xingjian Ran et.al. 2506.05341 null Kimi
654 2025-06-05 VideoMolmo: Spatio-Temporal Grounding Meets Pointing Ghazi Shazan Ahmad et.al. 2506.05336 link Kimi
655 2025-06-05 Kinetics: Rethinking Test-Time Scaling Laws Ranajoy Sadhukhan et.al. 2506.05333 link Kimi
656 2025-06-05 Unleashing Hour-Scale Video Training for Long Video-Language Understanding Jingyang Lin et.al. 2506.05332 null Kimi
657 2025-06-05 MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning Xinyan Chen et.al. 2506.05331 link Kimi
658 2025-06-05 Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay Yifan Sun et.al. 2506.05316 null Kimi
659 2025-06-05 Constrained Entropic Unlearning: A Primal-Dual Framework for Large Language Models Taha Entesari et.al. 2506.05314 null Kimi
660 2025-06-05 Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games Niv Eckhaus et.al. 2506.05309 link Kimi
661 2025-06-05 ProRefine: Inference-time Prompt Refinement with Textual Feedback Deepak Pandita et.al. 2506.05305 null Kimi
662 2025-06-05 Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos Weifeng Lin et.al. 2506.05302 null Kimi
663 2025-06-05 Sample Complexity and Representation Ability of Test-time Scaling Paradigms Baihe Huang et.al. 2506.05295 null Kimi
664 2025-06-05 AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model Pingyu Wu et.al. 2506.05289 link Kimi
665 2025-06-05 Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-Reasoning Nan Huo et.al. 2506.05278 null Kimi
666 2025-06-05 Teaming in the AI Era: AI-Augmented Frameworks for Forming, Simulating, and Optimizing Human Teams Mohammed Almutairi et.al. 2506.05265 null Kimi
667 2025-06-05 CLATTER: Comprehensive Entailment Reasoning for Hallucination Detection Ron Eliav et.al. 2506.05243 null Kimi
668 2025-06-05 MesaNet: Sequence Modeling by Locally Optimal Test-Time Training Johannes von Oswald et.al. 2506.05233 null Kimi
669 2025-06-05 Diagonal Batching Unlocks Parallelism in Recurrent Memory Transformers for Long Contexts Danil Sivtsov et.al. 2506.05229 link Kimi
670 2025-06-05 LLM-First Search: Self-Guided Exploration of the Solution Space Nathan Herr et.al. 2506.05213 link Kimi
671 2025-06-05 The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Nikhil Kandpal et.al. 2506.05209 null Kimi
672 2025-06-05 RELIC: Evaluating Compositional Instruction Following via Language Recognition Jackson Petty et.al. 2506.05205 null Kimi
673 2025-06-05 Counterfactual reasoning: an analysis of in-context emergence Moritz Miller et.al. 2506.05188 link Kimi
674 2025-06-05 TreeRPO: Tree Relative Policy Optimization Zhicheng Yang et.al. 2506.05183 null Kimi
675 2025-06-05 ECoRAG: Evidentiality-guided Compression for Long Context RAG Yeonseok Jeong et.al. 2506.05167 link Kimi
676 2025-06-05 Dissecting Bias in LLMs: A Mechanistic Interpretability Perspective Bhavik Chandna et.al. 2506.05166 null Kimi
677 2025-06-05 Knowledgeable-r1: Policy Optimization for Knowledge Exploration in Retrieval-Augmented Generation Chenyu Lin et.al. 2506.05154 null Kimi
678 2025-06-05 Do Large Language Models Judge Error Severity Like Humans? Diege Sun et.al. 2506.05142 null Kimi
679 2025-06-05 AudioLens: A Closer Look at Auditory Attribute Perception of Large Audio-Language Models Chih-Kai Yang et.al. 2506.05140 null Kimi
680 2025-06-05 DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning Tanmay Parekh et.al. 2506.05128 null Kimi
681 2025-06-05 TALL – A Trainable Architecture for Enhancing LLM Performance in Low-Resource Languages Moshe Ofer et.al. 2506.05057 null Kimi
682 2025-06-05 Controlling Summarization Length Through EOS Token Weighting Zeno Belligoli et.al. 2506.05017 null Kimi
683 2025-06-05 When Thinking LLMs Lie: Unveiling the Strategic Deception in Representations of Reasoning Models Kai Wang et.al. 2506.04909 null Kimi
684 2025-06-05 Verbose ListOps (VLO): Beyond Long Context – Unmasking LLM’s Reasoning Blind Spots Alex Pan et.al. 2506.04907 null Kimi
685 2025-06-05 Multiple-Choice Question Generation Using Large Language Models: Methodology and Educator Insights Giorgio Biancini et.al. 2506.04851 null Kimi
686 2025-06-05 Dissecting Logical Reasoning in LLMs: A Fine-Grained Evaluation and Supervision Study Yujun Zhou et.al. 2506.04810 link Kimi
687 2025-06-05 Accelerated Test-Time Scaling with Model-Free Speculative Sampling Woomin Song et.al. 2506.04708 null Kimi
688 2025-06-05 MMRefine: Unveiling the Obstacles to Robust Refinement in Multimodal Large Language Models Gio Paik et.al. 2506.04688 null Kimi
689 2025-06-05 TaDA: Training-free recipe for Decoding with Adaptive KV Cache Compression and Mean-centering Vinay Joshi et.al. 2506.04642 null Kimi
690 2025-06-05 Advancing Tool-Augmented Large Language Models via Meta-Verification and Reflection Learning Zhiyuan Ma et.al. 2506.04625 null Kimi
691 2025-06-05 Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification Chengwu Liu et.al. 2506.04592 null Kimi
692 2025-06-05 Reasoning or Overthinking: Evaluating Large Language Models on Financial Sentiment Analysis Dimitris Vamvourellis et.al. 2506.04574 null Kimi
693 2025-06-04 Towards Efficient Speech-Text Jointly Decoding within One Speech Language Model Haibin Wu et.al. 2506.04518 null Kimi
694 2025-06-04 MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale Ran Xu et.al. 2506.04405 null Kimi
695 2025-06-04 ReXVQA: A Large-scale Visual Question Answering Benchmark for Generalist Chest X-ray Understanding Ankit Pal et.al. 2506.04353 null Kimi
696 2025-06-04 GEM: Empowering LLM for both Embedding Generation and Language Understanding Caojin Zhang et.al. 2506.04344 null Kimi
697 2025-06-04 Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning Shuang Chen et.al. 2506.04207 null Kimi
698 2025-06-04 Cascadia: A Cascade Serving System for Large Language Models Youhe Jiang et.al. 2506.04203 null Kimi
699 2025-06-04 TracLLM: A Generic Framework for Attributing Long Context LLMs Yanting Wang et.al. 2506.04202 link Kimi
700 2025-06-05 Rectified Sparse Attention Yutao Sun et.al. 2506.04108 null Kimi
701 2025-06-04 Multimodal Tabular Reasoning with Privileged Structured Information Jun-Peng Jiang et.al. 2506.04088 null Kimi
702 2025-06-04 LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation Ming Zhang et.al. 2506.04078 link Kimi
703 2025-06-04 Explainability-Based Token Replacement on LLM-Generated Text Hadi Mohammadi et.al. 2506.04050 null Kimi
704 2025-06-04 Mitigating Hallucinations in Large Vision-Language Models via Entity-Centric Multimodal Preference Optimization Jiulong Wu et.al. 2506.04039 null Kimi
705 2025-06-04 AgentMisalignment: Measuring the Propensity for Misaligned Behaviour in LLM-Based Agents Akshat Naik et.al. 2506.04018 null Kimi
706 2025-06-04 Graph Counselor: Adaptive Graph Exploration via Multi-Agent Synergy to Enhance LLM Reasoning Junqi Gao et.al. 2506.03939 link Kimi
707 2025-06-04 Vision Remember: Alleviating Visual Forgetting in Efficient MLLM with Vision Feature Resample Ze Feng et.al. 2506.03928 null Kimi
708 2025-06-04 RadialRouter: Structured Representation for Efficient and Robust Large Language Models Routing Ruihan Jin et.al. 2506.03880 null Kimi
709 2025-06-04 Knockout LLM Assessment: Using Large Language Models for Evaluations through Iterative Pairwise Comparisons Isik Baran Sandan et.al. 2506.03785 null Kimi
710 2025-06-04 ClozeMath: Improving Mathematical Reasoning in Language Models by Learning to Fill Equations Quang Hieu Pham et.al. 2506.03763 null Kimi
711 2025-06-04 AhaKV: Adaptive Holistic Attention-Driven KV Cache Eviction for Efficient Inference of Large Language Models Yifeng Gu et.al. 2506.03762 null Kimi
712 2025-06-04 Verbalized Confidence Triggers Self-Verification: Emergent Behavior Without Explicit Reasoning Supervision Chaeyun Jang et.al. 2506.03723 null Kimi
713 2025-06-04 AdaDecode: Accelerating LLM Decoding with Adaptive Layer Parallelism Zhepei Wei et.al. 2506.03700 link Kimi
714 2025-06-04 Learning to Insert [PAUSE] Tokens for Better Reasoning Eunki Kim et.al. 2506.03616 null Kimi
715 2025-06-04 POSS: Position Specialist Generates Better Draft for Speculative Decoding Langlin Huang et.al. 2506.03566 link Kimi
716 2025-06-04 Video-Skill-CoT: Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning Daeun Lee et.al. 2506.03525 null Kimi
717 2025-06-04 EpiCoDe: Boosting Model Performance Beyond Training with Extrapolation and Contrastive Decoding Mingxu Tao et.al. 2506.03489 null Kimi
718 2025-06-03 Parallel CPU-GPU Execution for LLM Inference on Constrained GPUs Jiakun Fan et.al. 2506.03296 null Kimi
719 2025-06-03 Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem Yubo Wang et.al. 2506.03295 null Kimi
720 2025-06-03 FailureSensorIQ: A Multi-Choice QA Dataset for Understanding Sensor Relationships and Failure Modes Christodoulos Constantinides et.al. 2506.03278 link Kimi
721 2025-06-04 UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation Bin Lin et.al. 2506.03147 null Kimi
722 2025-06-03 GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents Qianhui Wu et.al. 2506.03143 null Kimi
723 2025-06-03 Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning Yinjie Wang et.al. 2506.03136 link Kimi
724 2025-06-03 OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models Mengdi Jia et.al. 2506.03135 null Kimi
725 2025-06-03 EgoVLM: Policy Optimization for Egocentric Video Understanding Ashwin Vinod et.al. 2506.03097 link Kimi
726 2025-06-03 Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective Jintian Shao et.al. 2506.03038 null Kimi
727 2025-06-03 Conditioning Large Language Models on Legal Systems? Detecting Punishable Hate Speech Florian Ludwig et.al. 2506.03009 null Kimi
728 2025-06-03 Adaptive Graph Pruning for Multi-Agent Communication Boyi Li et.al. 2506.02951 null Kimi
729 2025-06-03 Cell-o1: Training LLMs to Solve Single-Cell Reasoning Puzzles with Reinforcement Learning Yin Fang et.al. 2506.02911 link Kimi
730 2025-06-03 Scaling Fine-Grained MoE Beyond 50B Parameters: Empirical Evaluation and Practical Insights Jakub Krajewski et.al. 2506.02890 null Kimi
731 2025-06-03 CoT is Not True Reasoning, It Is Just a Tight Constraint to Imitate: A Theory Perspective Jintian Shao et.al. 2506.02878 null Kimi
732 2025-06-03 BNPO: Beta Normalization Policy Optimization Changyi Xiao et.al. 2506.02864 null Kimi
733 2025-06-03 METok: Multi-Stage Event-based Token Compression for Efficient Long Video Understanding Mengyue Wang et.al. 2506.02850 link Kimi
734 2025-06-03 RACE-Align: Retrieval-Augmented and Chain-of-Thought Enhanced Preference Alignment for Large Language Models Qihang Yan et.al. 2506.02726 null Kimi
735 2025-06-03 TL;DR: Too Long, Do Re-weighting for Efficient LLM Reasoning Compression Zhong-Zhi Li et.al. 2506.02678 link Kimi
736 2025-06-03 Truly Assessing Fluid Intelligence of Large Language Models through Dynamic Reasoning Evaluation Yue Yang et.al. 2506.02648 null Kimi
737 2025-06-03 KVCache Cache in the Wild: Characterizing and Optimizing KVCache Cache at a Large Cloud Provider Jiahao Wang et.al. 2506.02634 link Kimi
738 2025-06-03 Pruning General Large Language Models into Customized Expert Models Yirao Zhao et.al. 2506.02561 null Kimi
739 2025-06-03 Answer Convergence as a Signal for Early Stopping in Reasoning Xin Liu et.al. 2506.02536 null Kimi
740 2025-06-03 Minos: A Multimodal Evaluation Model for Bidirectional Generation Between Image and Text Junzhe Zhang et.al. 2506.02494 null Kimi
741 2025-06-03 MidPO: Dual Preference Optimization for Safety and Helpfulness in Large Language Models via a Mixture of Experts Framework Yupeng Qi et.al. 2506.02460 null Kimi
742 2025-06-03 Comparative Analysis of AI Agent Architectures for Entity Relationship Classification Maryam Berijanian et.al. 2506.02426 link Kimi
743 2025-06-03 Consultant Decoding: Yet Another Synergistic Mechanism Chuanghao Ding et.al. 2506.02391 null Kimi
744 2025-06-03 Univariate to Multivariate: LLMs as Zero-Shot Predictors for Time-Series Forecasting Chamara Madarasingha et.al. 2506.02389 null Kimi
745 2025-06-03 DIAMOND: An LLM-Driven Agent for Context-Aware Baseball Highlight Summarization Jeonghun Kang et.al. 2506.02351 null Kimi
746 2025-06-02 The Unified Cognitive Consciousness Theory for Language Models: Anchoring Semantics, Thresholds of Activation, and Emergent Reasoning Edward Y. Chang et.al. 2506.02139 null Kimi
747 2025-06-02 Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains Juncheng Wu et.al. 2506.02126 null Kimi
748 2025-06-02 Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Shenzhi Wang et.al. 2506.01939 null Kimi
749 2025-06-02 Large language models can learn and generalize steganographic chain-of-thought under process supervision Joey Skaf et.al. 2506.01926 null Kimi
750 2025-06-02 MoDA: Modulation Adapter for Fine-Grained Visual Grounding in Instructional MLLMs Wayner Barrios et.al. 2506.01850 null Kimi
751 2025-06-02 Analysis of LLM Bias (Chinese Propaganda & Anti-US Sentiment) in DeepSeek-R1 vs. ChatGPT o3-mini-high PeiHsuan Huang et.al. 2506.01814 null Kimi
752 2025-05-29 From Chat Logs to Collective Insights: Aggregative Question Answering Wentao Zhang et.al. 2505.23765 null Kimi
753 2025-05-29 MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence Sihan Yang et.al. 2505.23764 null Kimi
754 2025-05-29 ZeroGUI: Automating Online GUI Learning at Zero Human Cost Chenyu Yang et.al. 2505.23762 link Kimi
755 2025-05-29 Puzzled by Puzzles: When Vision-Language Models Can’t Take a Hint Heekyung Lee et.al. 2505.23759 link Kimi
756 2025-05-29 DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning Ziyin Zhang et.al. 2505.23754 link Kimi
757 2025-05-29 ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks Akashah Shabbir et.al. 2505.23752 link Kimi
758 2025-05-29 Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence Diankun Wu et.al. 2505.23747 null Kimi
759 2025-05-29 ATLAS: Learning to Optimally Memorize the Context at Test Time Ali Behrouz et.al. 2505.23735 null Kimi
760 2025-05-29 Bounded Rationality for LLMs: Satisficing Alignment at Inference-Time Mohamad Chehade et.al. 2505.23729 null Kimi
761 2025-05-29 ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering Zexi Liu et.al. 2505.23723 link Kimi
762 2025-05-29 Label-Guided In-Context Learning for Named Entity Recognition Fan Bai et.al. 2505.23722 link Kimi
763 2025-05-29 From Connectivity to Autonomy: The Dawn of Self-Evolving Communication Systems Zeinab Nezami et.al. 2505.23710 null Kimi
764 2025-05-29 Can LLMs Reason Abstractly Over Math Word Problems Without CoT? Disentangling Abstract Formulation From Arithmetic Computation Ziling Cheng et.al. 2505.23701 null Kimi
765 2025-05-29 Data-to-Dashboard: Multi-Agent LLM Framework for Insightful Visualization in Enterprise Analytics Ran Zhang et.al. 2505.23695 link Kimi
766 2025-05-29 VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos Tingyu Song et.al. 2505.23693 link Kimi
767 2025-05-29 LoLA: Low-Rank Linear Attention With Sparse Caching Luke McDermott et.al. 2505.23666 null Kimi
768 2025-05-29 D-AR: Diffusion via Autoregressive Models Ziteng Gao et.al. 2505.23660 link Kimi
769 2025-05-29 Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation Hongxiang Zhang et.al. 2505.23657 null Kimi
770 2025-05-29 Are Reasoning Models More Prone to Hallucination? Zijun Yao et.al. 2505.23646 null Kimi
771 2025-05-29 AutoSchemaKG: Autonomous Knowledge Graph Construction through Dynamic Schema Induction from Web-Scale Corpora Jiaxin Bai et.al. 2505.23628 link Kimi
772 2025-05-29 Table-R1: Inference-Time Scaling for Table Reasoning Zheyuan Yang et.al. 2505.23621 link Kimi
773 2025-05-29 One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory Chenhao Zheng et.al. 2505.23617 null Kimi
774 2025-05-29 MAPLE: A Mobile Assistant with Persistent Finite State Machines for Recovery Reasoning Linqiang Guo et.al. 2505.23596 null Kimi
775 2025-05-29 Jigsaw-R1: A Study of Rule-based Visual Reinforcement Learning with Jigsaw Puzzles Zifu Wang et.al. 2505.23590 link Kimi
776 2025-05-29 CoT Red-Handed: Stress Testing Chain-of-Thought Monitoring Benjamin Arnav et.al. 2505.23575 null Kimi
777 2025-05-29 Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models Yiran Guo et.al. 2505.23564 link Kimi
778 2025-05-29 Qwen Look Again: Guiding Vision-Language Reasoning Models to Re-attention Visual Information Xu Chu et.al. 2505.23558 link Kimi
779 2025-05-29 Sustainable Carbon-Aware and Water-Efficient LLM Scheduling in Geo-Distributed Cloud Datacenters Hayden Moore et.al. 2505.23554 null Kimi
780 2025-05-29 Probability-Consistent Preference Optimization for Enhanced LLM Reasoning Yunqiao Yang et.al. 2505.23540 link Kimi
781 2025-05-29 CLaC at SemEval-2025 Task 6: A Multi-Architecture Approach for Corporate Environmental Promise Verification Nawar Turk et.al. 2505.23538 null Kimi
782 2025-05-29 Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation Beiduo Chen et.al. 2505.23368 link Kimi
783 2025-05-29 VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning? Yuanxin Liu et.al. 2505.23359 link Kimi
784 2025-05-29 How Does Response Length Affect Long-Form Factuality James Xu Zhao et.al. 2505.23295 link Kimi
785 2025-05-29 Sentinel: Attention Probing of Proxy Models for LLM Context Compression with an Understanding Perspective Yong Zhang et.al. 2505.23277 link Kimi
786 2025-05-29 Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models Zeyu Liu et.al. 2505.23091 null Kimi
787 2025-05-29 From Token to Action: State Machine Reasoning to Mitigate Overthinking in Information Retrieval Dohyeon Lee et.al. 2505.23059 link Kimi
788 2025-05-28 NegVQA: Can Vision Language Models Understand Negation? Yuhui Zhang et.al. 2505.22946 null Kimi
789 2025-05-28 Can LLMs Deceive CLIP? Benchmarking Adversarial Compositionality of Pre-trained Multimodal Representation via Text Updates Jaewoo Ahn et.al. 2505.22943 null Kimi
790 2025-05-28 WorkForceAgent-R1: Incentivizing Reasoning Capability in LLM-based Web Agents via Reinforcement Learning Yuchen Zhuang et.al. 2505.22942 null Kimi
791 2025-05-28 Can Large Language Models Match the Conclusions of Systematic Reviews? Christopher Polzak et.al. 2505.22787 link Kimi
792 2025-05-28 Pre-Training Curriculum for Multi-Token Prediction in Language Models Ansar Aynetdinov et.al. 2505.22757 link Kimi
793 2025-05-28 Zero-Shot Vision Encoder Grafting via LLM Surrogates Kaiyu Yue et.al. 2505.22664 link Kimi
794 2025-05-28 AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models Feng Luo et.al. 2505.22662 null Kimi
795 2025-05-28 3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model Wenbo Hu et.al. 2505.22657 null Kimi
796 2025-05-28 VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models Ce Zhang et.al. 2505.22654 null Kimi
797 2025-05-28 Learning Composable Chains-of-Thought Fangcong Yin et.al. 2505.22635 null Kimi
798 2025-05-28 Spatial Knowledge Graph-Guided Multimodal Synthesis Yida Xue et.al. 2505.22633 null Kimi
799 2025-05-28 Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding Chengyue Wu et.al. 2505.22618 null Kimi
800 2025-05-28 RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction Yuchi Wang et.al. 2505.22613 null Kimi
801 2025-05-28 Less, but Better: Efficient Multilingual Expansion for LLMs via Layer-wise Mixture-of-Experts Xue Zhang et.al. 2505.22582 null Kimi
802 2025-05-29 Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems Hoang Pham et.al. 2505.22571 null Kimi
803 2025-05-28 Thinking with Generated Images Ethan Chern et.al. 2505.22525 null Kimi
804 2025-05-28 Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO Lai Wei et.al. 2505.22453 link Kimi
805 2025-05-28 Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start Lai Wei et.al. 2505.22334 link Kimi
806 2025-05-28 Advancing Expert Specialization for Better MoE Hongcan Guo et.al. 2505.22323 null Kimi
807 2025-05-28 Skywork Open Reasoner 1 Technical Report Jujie He et.al. 2505.22312 link Kimi
808 2025-05-28 Let’s Predict Sentence by Sentence Hyeonbin Hwang et.al. 2505.22202 null Kimi
809 2025-05-28 Speculative Decoding Meets Quantization: Compatibility Evaluation and Hierarchical Framework Design Yudi Zhang et.al. 2505.22179 link Kimi
810 2025-05-28 InComeS: Integrating Compression and Selection Mechanisms into LLMs for Efficient Model Editing Shuaiyi Li et.al. 2505.22156 null Kimi
811 2025-05-28 What Makes a Good Reasoning Chain? Uncovering Structural Patterns in Long Chain-of-Thought Reasoning Gangwei Jiang et.al. 2505.22148 null Kimi
812 2025-05-28 Flexible Tool Selection through Low-dimensional Attribute Alignment of Vision and Language Guangfu Hao et.al. 2505.22146 null Kimi
813 2025-05-28 Curse of High Dimensionality Issue in Transformer for Long-context Modeling Shuhai Zhang et.al. 2505.22107 link Kimi
814 2025-05-28 CoThink: Token-Efficient Reasoning via Instruct Models Guiding Reasoning Models Siqi Fan et.al. 2505.22017 null Kimi
815 2025-05-28 Towards Efficient Key-Value Cache Management for Prefix Prefilling in LLM Inference Yue Zhu et.al. 2505.21919 null Kimi
816 2025-05-28 Co-Saving: Resource Aware Multi-Agent Collaboration for Software Development Rennai Qiu et.al. 2505.21898 null Kimi
817 2025-05-28 EFIM: Efficient Serving of LLMs for Infilling Tasks with Improved KV Cache Reuse Tianyu Guo et.al. 2505.21889 link Kimi
818 2025-05-27 Towards Safety Reasoning in LLMs: AI-agentic Deliberation for Policy-embedded CoT Data Creation Tharindu Kumarage et.al. 2505.21784 null Kimi
819 2025-05-27 R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing Tianyu Fu et.al. 2505.21600 link Kimi
820 2025-05-27 Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making Yihan Wang et.al. 2505.21503 null Kimi
821 2025-05-27 Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers Wei Pang et.al. 2505.21497 link Kimi
822 2025-05-27 Hardware-Efficient Attention for Fast Decoding Ted Zadouri et.al. 2505.21487 null Kimi
823 2025-05-27 Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion Zhanqiu Hu et.al. 2505.21467 null Kimi
824 2025-05-28 Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity Yehui Tang et.al. 2505.21411 null Kimi
825 2025-05-27 Evaluating LLM Adaptation to Sociodemographic Factors: User Profile vs. Dialogue History Qishuai Zhong et.al. 2505.21362 link Kimi
826 2025-05-27 Leveraging Large Language Models for Bengali Math Word Problem Solving with Chain of Thought Reasoning Bidyarthi Paul et.al. 2505.21354 null Kimi
827 2025-05-27 PEDANTIC: A Dataset for the Automatic Examination of Definiteness in Patent Claims Valentin Knappich et.al. 2505.21342 null Kimi
828 2025-05-28 HoliTom: Holistic Token Merging for Fast Video Large Language Models Kele Shao et.al. 2505.21334 link Kimi
829 2025-05-27 Beyond Chemical QA: Evaluating LLM’s Chemical Reasoning with Modular Chemical Operations Hao Li et.al. 2505.21318 null Kimi
830 2025-05-27 Complex System Diagnostics Using a Knowledge Graph-Informed and Large Language Model-Enhanced Framework Saman Marandi et.al. 2505.21291 null Kimi
831 2025-05-27 Exploring the Latent Capacity of LLMs for One-Step Text Generation Gleb Mezentsev et.al. 2505.21189 null Kimi
832 2025-05-27 Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning Mingyang Song et.al. 2505.21178 null Kimi
833 2025-05-27 Thinker: Learning to Think Fast and Slow Stephen Chung et.al. 2505.21097 null Kimi
834 2025-05-27 Uni3D-MoE: Scalable Multimodal 3D Scene Understanding via Mixture of Experts Yue Zhang et.al. 2505.21079 null Kimi
835 2025-05-27 Efficient Large Language Model Inference with Neural Block Linearization Mete Erdogan et.al. 2505.21077 null Kimi
836 2025-05-27 Who Reasons in the Large Language Models? Jie Shao et.al. 2505.20993 null Kimi
837 2025-05-27 Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation Pingrui Zhang et.al. 2505.20897 link Kimi
838 2025-05-27 Trans-EnV: A Framework for Evaluating the Linguistic Robustness of LLMs Against English Varieties Jiyoung Lee et.al. 2505.20875 null Kimi
839 2025-05-27 Fork-Merge Decoding: Enhancing Multimodal Understanding in Audio-Visual Large Language Models Chaeyoung Jung et.al. 2505.20873 null Kimi
840 2025-05-27 AVCD: Mitigating Hallucinations in Audio-Visual Large Language Models through Contrastive Decoding Chaeyoung Jung et.al. 2505.20862 null Kimi
841 2025-05-27 SpecExtend: A Drop-in Enhancement for Speculative Decoding of Long Sequences Jungyoub Cha et.al. 2505.20776 link Kimi
842 2025-05-27 Dissecting Physics Reasoning in Small Language Models: A Multi-Dimensional Analysis from an Educational Perspective Nicy Scaria et.al. 2505.20707 null Kimi
843 2025-05-27 Self-Route: Automatic Mode Switching via Capability Estimation for Efficient Reasoning Yang He et.al. 2505.20664 null Kimi
844 2025-05-26 Embodied AI with Foundation Models for Mobile Service Robots: A Systematic Review Matthew Lisondra et.al. 2505.20503 null Kimi
845 2025-05-26 HAMburger: Accelerating LLM Inference via Token Smashing Jingyu Liu et.al. 2505.20438 null Kimi
846 2025-05-26 What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models Lorenzo Baraldi et.al. 2505.20405 null Kimi
847 2025-05-27 Does quantization affect models’ performance on long-context tasks? Anmol Mekala et.al. 2505.20276 link Kimi
848 2025-05-26 FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models Hao Kang et.al. 2505.20225 link Kimi
849 2025-05-26 THiNK: Can Large Language Models Think-aloud? Yongan Yu et.al. 2505.20184 link Kimi
850 2025-05-26 Adaptive Deep Reasoning: Triggering Deep Thinking When Needed Yunhao Wang et.al. 2505.20101 null Kimi
851 2025-05-26 AdaTP: Attention-Debiased Token Pruning for Video Large Language Models Fengyuan Sun et.al. 2505.20100 null Kimi
852 2025-05-26 Incentivizing Reasoning from Weak Supervision Yige Yuan et.al. 2505.20072 link Kimi
853 2025-05-26 Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion Zheqi Lv et.al. 2505.20053 link Kimi
854 2025-05-26 Grammars of Formal Uncertainty: When to Trust LLMs in Automated Reasoning Tasks Debargha Ganguly et.al. 2505.20047 null Kimi
855 2025-05-26 Uncertainty-Aware Attention Heads: Efficient Unsupervised Uncertainty Quantification for LLMs Artem Vazhentsev et.al. 2505.20045 null Kimi
856 2025-05-26 Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles Jiangjie Chen et.al. 2505.19914 null Kimi
857 2025-05-26 ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows Qiushi Sun et.al. 2505.19897 null Kimi
858 2025-05-26 HS-STAR: Hierarchical Sampling for Self-Taught Reasoners via Difficulty Estimation and Budget Reallocation Feng Xiong et.al. 2505.19866 null Kimi
859 2025-05-26 Done Is Better than Perfect: Unlocking Efficient Reasoning by Structured Multi-Turn Decomposition Zihao Zeng et.al. 2505.19788 null Kimi
860 2025-05-26 Leveraging Importance Sampling to Detach Alignment Modules from Large Language Models Yi Liu et.al. 2505.19700 null Kimi
861 2025-05-26 Large Language Models for Planning: A Comprehensive and Systematic Survey Pengfei Cao et.al. 2505.19683 link Kimi
862 2025-05-26 MoESD: Unveil Speculative Decoding’s Potential for Accelerating Sparse MoE Zongle Huang et.al. 2505.19645 null Kimi
863 2025-05-26 SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond Junteng Liu et.al. 2505.19641 link Kimi
864 2025-05-26 Interleaved Reasoning for Large Language Models via Reinforcement Learning Roy Xie et.al. 2505.19640 null Kimi
865 2025-05-26 Faster and Better LLMs via Latency-Aware Test-Time Scaling Zili Wang et.al. 2505.19634 null Kimi
866 2025-05-26 Multi-Agent Collaboration via Evolving Orchestration Yufan Dang et.al. 2505.19591 null Kimi
867 2025-05-26 TailorKV: A Hybrid Framework for Long-Context Inference via Tailored KV Cache Optimization Dingyu Yao et.al. 2505.19586 link Kimi
868 2025-05-26 Accelerating Prefilling for Long-Context LLMs via Sparse Pattern Sharing Dan Peng et.al. 2505.19578 null Kimi
869 2025-05-26 FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models Jintao Tong et.al. 2505.19536 link Kimi
870 2025-05-26 Win Fast or Lose Slow: Balancing Speed and Accuracy in Latency-Sensitive Decisions of LLMs Hao Kang et.al. 2505.19481 link Kimi
871 2025-05-26 BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs Guilong Lu et.al. 2505.19457 link Kimi
872 2025-05-26 Task Memory Engine: Spatial Memory for Robust Multi-Step LLM Agents Ye Ye et.al. 2505.19436 link Kimi
873 2025-05-26 CoTGuard: Using Chain-of-Thought Triggering for Copyright Protection in Multi-Agent LLM Systems Yan Wen et.al. 2505.19405 null Kimi
874 2025-05-25 100-LongBench: Are de facto Long-Context Benchmarks Literally Evaluating Long-Context Ability? Wang Yang et.al. 2505.19293 link Kimi
875 2025-05-25 To CoT or To Loop? A Formal Comparison Between Chain-of-Thought and Looped Transformers Kevin Xu et.al. 2505.19245 null Kimi
876 2025-05-25 LLLMs: A Data-Driven Survey of Evolving Research on Limitations of Large Language Models Aida Kostikova et.al. 2505.19240 null Kimi
877 2025-05-25 GUARDIAN: Safeguarding LLM Multi-Agent Collaborations with Temporal Graph Modeling Jialong Zhou et.al. 2505.19234 null Kimi
878 2025-05-25 SpeakStream: Streaming Text-to-Speech with Interleaved Data Richard He Bai et.al. 2505.19206 null Kimi
879 2025-05-25 DREAM: Drafting with Refined Target Features and Entropy-Adaptive Cross-Attention Fusion for Multimodal Speculative Decoding Yunhai Hu et.al. 2505.19201 link Kimi
880 2025-05-22 GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning Chengqi Duan et.al. 2505.17022 link Kimi
881 2025-05-22 CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms Shilin Yan et.al. 2505.17020 link Kimi
882 2025-05-22 Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO Chengzhuo Tong et.al. 2505.17017 link Kimi
883 2025-05-22 Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models Runsen Xu et.al. 2505.17015 null Kimi
884 2025-05-22 SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding Haoning Wu et.al. 2505.17012 link Kimi
885 2025-05-22 R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning Huatong Song et.al. 2505.17005 link Kimi
886 2025-05-22 Do Large Language Models Excel in Complex Logical Reasoning with Formal Language? Jin Jiang et.al. 2505.16998 link Kimi
887 2025-05-22 X-MAS: Towards Building Multi-Agent Systems with Heterogeneous LLMs Rui Ye et.al. 2505.16997 link Kimi
888 2025-05-22 $\text{R}^2\text{ec}$: Towards Large Recommender Models with Reasoning Runyang You et.al. 2505.16994 link Kimi
889 2025-05-22 Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding Runpeng Yu et.al. 2505.16990 link Kimi
890 2025-05-22 T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning Amartya Chakraborty et.al. 2505.16986 null Kimi
891 2025-05-22 Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine Adib Bazgir et.al. 2505.16982 null Kimi
892 2025-05-22 Bottlenecked Transformers: Periodic KV Cache Abstraction for Generalised Reasoning Adnan Oomerjee et.al. 2505.16950 null Kimi
893 2025-05-22 MixAT: Combining Continuous and Discrete Adversarial Training for LLMs Csaba Dékány et.al. 2505.16947 link Kimi
894 2025-05-22 AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios Yunjia Qi et.al. 2505.16944 link Kimi
895 2025-05-22 NovelSeek: When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification NovelSeek Team et.al. 2505.16938 link Kimi
896 2025-05-22 In-Context Watermarks for Large Language Models Yepeng Liu et.al. 2505.16934 null Kimi
897 2025-05-22 Beyond Needle(s) in the Embodied Haystack: Environment, Architecture, and Training Considerations for Long Context Reasoning Bosung Kim et.al. 2505.16928 null Kimi
898 2025-05-22 Don’t “Overthink” Passage Reranking: Is Reasoning Truly Necessary? Nour Jedidi et.al. 2505.16886 null Kimi
899 2025-05-22 CASTILLO: Characterizing Response Length Distributions of Large Language Models Daniel F. Perez-Ramirez et.al. 2505.16881 link Kimi
900 2025-05-22 LaViDa: A Large Diffusion Language Model for Multimodal Understanding Shufan Li et.al. 2505.16839 link Kimi
901 2025-05-22 R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search Yibo Wang et.al. 2505.16838 link Kimi
902 2025-05-22 Fact-R1: Towards Explainable Video Misinformation Detection with Deep Reasoning Fanrui Zhang et.al. 2505.16836 link Kimi
903 2025-05-22 SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis Shuang Sun et.al. 2505.16834 link Kimi
904 2025-05-22 From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization Haonian Ji et.al. 2505.16832 link Kimi
905 2025-05-22 Unlearning Isn’t Deletion: Investigating Reversibility of Machine Unlearning in LLMs Xiaoyu Xu et.al. 2505.16831 link Kimi
906 2025-05-22 KTAE: A Model-Free Algorithm to Key-Tokens Advantage Estimation in Mathematical Reasoning Wei Sun et.al. 2505.16826 link Kimi
907 2025-05-22 REPA Works Until It Doesn’t: Early-Stopped, Holistic Alignment Supercharges Diffusion Training Ziqiao Wang et.al. 2505.16792 link Kimi
908 2025-05-22 CoTSRF: Utilize Chain of Thought as Stealthy and Robust Fingerprint of Large Language Models Zhenzhen Ren et.al. 2505.16785 null Kimi
909 2025-05-22 Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning Xinghao Chen et.al. 2505.16782 link Kimi
910 2025-05-22 R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO Huanjin Yao et.al. 2505.16673 link Kimi
911 2025-05-22 Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding Feilong Tang et.al. 2505.16652 null Kimi
912 2025-05-22 Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains Wenhui Tan et.al. 2505.16552 null Kimi
913 2025-05-22 LLaMAs Have Feelings Too: Unveiling Sentiment and Emotion Representations in LLaMA Models Through Probing Dario Di Palma et.al. 2505.16491 null Kimi
914 2025-05-22 WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning Zhepei Wei et.al. 2505.16421 link Kimi
915 2025-05-22 DriveMoE: Mixture-of-Experts for Vision-Language-Action Model in End-to-End Autonomous Driving Zhenjie Yang et.al. 2505.16278 null Kimi
916 2025-05-22 LIFEBench: Evaluating Length Instruction Following in Large Language Models Wei Zhang et.al. 2505.16234 link Kimi
917 2025-05-22 NQKV: A KV Cache Quantization Scheme Based on Normal Distribution Characteristics Zhihang Cai et.al. 2505.16210 null Kimi
918 2025-05-22 QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design Benjamin Schneider et.al. 2505.16175 link Kimi
919 2025-05-22 KNN-SSD: Enabling Dynamic Self-Speculative Decoding via Nearest Neighbor Layer Set Optimization Mingbo Song et.al. 2505.16162 null Kimi
920 2025-05-22 Training-Free Reasoning and Reflection in MLLMs Hongchen Wei et.al. 2505.16151 null Kimi
921 2025-05-22 Steering LVLMs via Sparse Autoencoder for Hallucination Mitigation Zhenglin Hua et.al. 2505.16146 null Kimi
922 2025-05-22 Date Fragments: A Hidden Bottleneck of Tokenization for Temporal Reasoning Gagan Bhatia et.al. 2505.16088 null Kimi
923 2025-05-22 Optimizing LLM-Based Multi-Agent System with Textual Feedback: A Case Study on Software Development Ming Shen et.al. 2505.16086 null Kimi
924 2025-05-21 Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models Jingcong Liang et.al. 2505.16056 link Kimi
925 2025-05-21 Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning Alex Su et.al. 2505.15966 null Kimi
926 2025-05-21 Extracting Probabilistic Knowledge from Large Language Models for Bayesian Network Parameterization Aliakbar Nafar et.al. 2505.15918 null Kimi
927 2025-05-21 dKV-Cache: The Cache for Diffusion Language Models Xinyin Ma et.al. 2505.15781 link Kimi
928 2025-05-21 Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space Zhen Zhang et.al. 2505.15778 link Kimi
929 2025-05-21 Beyond Hard and Soft: Hybrid Context Compression for Balancing Local and Global Information Retention Huanxuan Liao et.al. 2505.15774 link Kimi
930 2025-05-21 ThinkLess: A Training-Free Inference-Efficient Method for Reducing Reasoning Redundancy Gengyang Li et.al. 2505.15684 null Kimi
931 2025-05-21 A Federated Splitting Framework for LLMs: Security, Efficiency, and Adaptability Zishuai Zhang et.al. 2505.15683 link Kimi
932 2025-05-21 Feature Extraction and Steering for Enhanced Chain-of-Thought Reasoning in Language Models Zihao Li et.al. 2505.15634 null Kimi
933 2025-05-21 Learn to Reason Efficiently with Adaptive Length-based Reward Shaping Wei Liu et.al. 2505.15612 link Kimi
934 2025-05-21 Multilingual Test-Time Scaling via Initial Thought Transfer Prasoon Bajpai et.al. 2505.15508 null Kimi
935 2025-05-21 Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought Ao Liu et.al. 2505.15431 null Kimi
936 2025-05-21 FlowKV: Enhancing Multi-Turn Conversational Coherence in LLMs via Isolated Key-Value Cache Management Xiang Liu et.al. 2505.15347 null Kimi
937 2025-05-21 Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Prefilling Attack Silvia Cappelletti et.al. 2505.15323 null Kimi
938 2025-05-21 Hallucinate at the Last in Long Response Generation: A Case Study on Long Document Summarization Joonho Yang et.al. 2505.15291 null Kimi
939 2025-05-21 LiveVLM: Efficient Online Video Understanding via Streaming-Oriented KV Cache and Retrieval Zhenyu Ning et.al. 2505.15269 null Kimi
940 2025-05-21 Towards Explainable Temporal Reasoning in Large Language Models: A Structure-Aware Generative Framework Zihao Jiang et.al. 2505.15245 link Kimi
941 2025-05-21 Prolonged Reasoning Is Not All You Need: Certainty-Based Adaptive Routing for Efficient LLM/MLLM Reasoning Jinghui Lu et.al. 2505.15154 null Kimi
942 2025-05-21 BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms Yunlong Hou et.al. 2505.15141 null Kimi
943 2025-05-21 The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning Shivam Agarwal et.al. 2505.15134 link Kimi
944 2025-05-21 An Empirical Study on Reinforcement Learning for Reasoning-Search Interleaved LLM Agents Bowen Jin et.al. 2505.15117 link Kimi
945 2025-05-21 RoT: Enhancing Table Reasoning with Iterative Row-Wise Traversals Xuanliang Zhang et.al. 2505.15110 null Kimi
946 2025-05-21 Traveling Across Languages: Benchmarking Cross-Lingual Consistency in Multimodal LLMs Hao Wang et.al. 2505.15075 link Kimi
947 2025-05-21 Learning to Rank Chain-of-Thought: An Energy-Based Approach with Outcome Supervision Eric Hanchen Jiang et.al. 2505.14999 null Kimi
948 2025-05-20 STree: Speculative Tree Decoding for Hybrid State-Space Models Yangchao Wu et.al. 2505.14969 null Kimi
949 2025-05-20 Too Long, Didn’t Model: Decomposing LLM Long-Context Understanding With Novels Sil Hamilton et.al. 2505.14925 link Kimi
950 2025-05-20 Scaling Laws for State Dynamics in Large Language Models Jacob X Li et.al. 2505.14892 null Kimi
951 2025-05-20 Balanced and Elastic End-to-end Training of Dynamic LLMs Mohamed Wahib et.al. 2505.14864 null Kimi
952 2025-05-20 Text Generation Beyond Discrete Token Sampling Yufan Zhuang et.al. 2505.14827 null Kimi
953 2025-05-21 Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning Haolei Xu et.al. 2505.14684 null Kimi
954 2025-05-20 Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training Mengru Wang et.al. 2505.14681 null Kimi
955 2025-05-20 Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning Jiaer Xia et.al. 2505.14677 null Kimi
956 2025-05-20 SAFEPATH: Preventing Harmful Reasoning in Chain-of-Thought via Early Alignment Wonje Jeung et.al. 2505.14667 null Kimi
957 2025-05-20 Beyond Words: Multimodal LLM Knows When to Speak Zikai Liao et.al. 2505.14654 null Kimi
958 2025-05-20 KERL: Knowledge-Enhanced Personalized Recipe Recommendation using Large Language Models Fnu Mohbat et.al. 2505.14629 link Kimi
959 2025-05-20 Enhancing Learned Knowledge in LoRA Adapters Through Efficient Contrastive Decoding on Ascend NPUs Morgan Lindsay Heisler et.al. 2505.14620 null Kimi
960 2025-05-20 Can Pruning Improve Reasoning? Revisiting Long-CoT Compression with Capability in Mind for Better Reasoning Shangziqi Zhao et.al. 2505.14582 null Kimi
961 2025-05-20 Reasoning Models Better Express Their Confidence Dongkeun Yoon et.al. 2505.14489 link Kimi
962 2025-05-20 Log-Augmented Generation: Scaling Test-Time Reasoning with Reusable Computation Peter Baile Chen et.al. 2505.14398 null Kimi
963 2025-05-20 Scaling and Enhancing LLM-based AVSR: A Sparse Mixture of Projectors Approach Umberto Cappellazzo et.al. 2505.14336 null Kimi
964 2025-05-20 Speculative Decoding Reimagined for Multimodal Large Language Models Luxi Lin et.al. 2505.14260 link Kimi
965 2025-05-20 FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation Shaolin Zhu et.al. 2505.14256 null Kimi
966 2025-05-20 Tokenization Constraints in LLMs: A Study of Symbolic and Arithmetic Reasoning Limits Xiang Zhang et.al. 2505.14178 null Kimi
967 2025-05-20 RL of Thoughts: Navigating LLM Reasoning with Inference-time Reinforcement Learning Qianyue Hao et.al. 2505.14140 null Kimi
968 2025-05-20 DiagnosisArena: Benchmarking Diagnostic Reasoning for Large Language Models Yakun Zhu et.al. 2505.14107 link Kimi
969 2025-05-20 Toward Effective Reinforcement Learning Fine-Tuning for Medical VQA in Vision-Language Models Wenhui Zhu et.al. 2505.13973 null Kimi
970 2025-05-20 FlashThink: An Early Exit Method For Efficient Reasoning Guochao Jiang et.al. 2505.13949 null Kimi
971 2025-05-20 EEG-to-Text Translation: A Model for Deciphering Human Brain Activity Saydul Akbar Murad et.al. 2505.13936 link Kimi
972 2025-05-20 Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning Jiwon Song et.al. 2505.13866 link Kimi
973 2025-05-20 EfficientLLM: Efficiency in Large Language Models Zhengqing Yuan et.al. 2505.13840 null Kimi
974 2025-05-20 Structured Agent Distillation for Large Language Model Jun Liu et.al. 2505.13820 null Kimi
975 2025-05-19 Ice Cream Doesn’t Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Inference Jin Du et.al. 2505.13770 null Kimi
976 2025-05-19 Causal Head Gating: A Framework for Interpreting Roles of Attention Heads in Transformers Andrew Nam et.al. 2505.13737 null Kimi
977 2025-05-19 RL in Name Only? Analyzing the Structural Assumptions in RL post-training for LLMs Soumya Rani Samineni et.al. 2505.13697 null Kimi
978 2025-05-19 Optimizing Anytime Reasoning via Budget Relative Policy Optimization Penghui Qi et.al. 2505.13438 link Kimi
979 2025-05-19 CoT-Kinetics: A Theoretical Modeling Assessing LRM Reasoning Process Jinhe Bi et.al. 2505.13408 null Kimi
980 2025-05-19 Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training and Inference Shuqing Luo et.al. 2505.13345 link Kimi
981 2025-05-19 Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space Hengli Li et.al. 2505.13308 link Kimi
982 2025-05-19 RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning Qiguang Chen et.al. 2505.13307 link Kimi
983 2025-05-19 Effective and Transparent RAG: Adaptive-Reward Reinforcement Learning for Decision Traceability Jingyi Ren et.al. 2505.13258 link Kimi
984 2025-05-19 HeteroSpec: Leveraging Contextual Heterogeneity for Efficient Speculative Decoding Siran Liu et.al. 2505.13254 null Kimi
985 2025-05-19 Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification Jikai Wang et.al. 2505.13204 null Kimi
986 2025-05-19 Adversarial Testing in LLMs: Insights into Decision-Making Vulnerabilities Lili Zhang et.al. 2505.13195 null Kimi
987 2025-05-19 ModernGBERT: German-only 1B Encoder Model Trained from Scratch Anton Ehrmanntraut et.al. 2505.13136 null Kimi
988 2025-05-19 Benchmarking and Confidence Evaluation of LALMs For Temporal Reasoning Debarpan Bhattacharya et.al. 2505.13115 link Kimi
989 2025-05-19 FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference Guangda Liu et.al. 2505.13109 null Kimi
990 2025-05-19 Walking the Tightrope: Disentangling Beneficial and Detrimental Drifts in Non-Stationary Custom-Tuning Xiaoyu Yang et.al. 2505.13081 null Kimi
991 2025-05-19 MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO Yicheng Xiao et.al. 2505.13031 link Kimi
992 2025-05-19 Fractured Chain-of-Thought Reasoning Baohao Liao et.al. 2505.12992 null Kimi
993 2025-05-19 A3: an Analytical Low-Rank Approximation Framework for Attention Jeffrey T. H. Wong et.al. 2505.12942 null Kimi
994 2025-05-19 Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs Zhihe Yang et.al. 2505.12929 link Kimi
995 2025-05-19 The Traitors: Deception and Trust in Multi-Agent Language Model Simulations Pedro M. P. Curvo et.al. 2505.12923 link Kimi
996 2025-05-19 LEXam: Benchmarking Legal Reasoning on 340 Law Exams Yu Fan et.al. 2505.12864 null Kimi
997 2025-05-19 Reasoning BO: Enhancing Bayesian Optimization with Long-Context Reasoning Power of LLMs Zhuo Yang et.al. 2505.12833 null Kimi
998 2025-05-19 SynDec: A Synthesize-then-Decode Approach for Arbitrary Textual Style Transfer via Large Language Models Han Sun et.al. 2505.12821 null Kimi
999 2025-05-19 Accelerating Adaptive Retrieval Augmented Generation via Instruction-Driven Representation Reduction of Retrieval Overlaps Jie Ou et.al. 2505.12731 null Kimi
1000 2025-05-19 FLASH: Latent-Aware Semi-Autoregressive Speculative Decoding for Multimodal Tasks Zihua Wang et.al. 2505.12728 link Kimi
1001 2025-05-19 ToTRL: Unlock LLM Tree-of-Thoughts Reasoning Potential through Puzzles Solving Haoyuan Wu et.al. 2505.12717 null Kimi
1002 2025-05-19 Shadow-FT: Tuning Instruct via Base Taiqiang Wu et.al. 2505.12716 link Kimi
1003 2025-05-19 Ineq-Comp: Benchmarking Human-Intuitive Compositional Reasoning in Automated Theorem Proving on Inequalities Haoyu Zhao et.al. 2505.12680 link Kimi
1004 2025-05-19 HydraInfer: Hybrid Disaggregated Scheduling for Multimodal Large Language Model Serving Xianzhe Dong et.al. 2505.12658 null Kimi
1005 2025-05-19 Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents Yunseok Jang et.al. 2505.12632 null Kimi
1006 2025-05-19 Enhancing Latent Computation in Transformers with Latent Tokens Yuchang Sun et.al. 2505.12629 null Kimi
1007 2025-05-18 A Survey of Attacks on Large Language Models Wenrui Xu et.al. 2505.12567 null Kimi
1008 2025-05-15 3D-Fixup: Advancing Photo Editing with 3D Priors Yen-Chi Cheng et.al. 2505.10566 null Kimi
1009 2025-05-15 End-to-End Vision Tokenizer Tuning Wenxuan Wang et.al. 2505.10562 null Kimi
1010 2025-05-15 Neural Thermodynamic Laws for Large Language Model Training Ziming Liu et.al. 2505.10559 null Kimi
1011 2025-05-15 MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning Ke Wang et.al. 2505.10557 link Kimi
1012 2025-05-15 Beyond ‘Aha!’: Toward Systematic Meta-Abilities Alignment in Large Reasoning Models Zhiyuan Hu et.al. 2505.10554 link Kimi
1013 2025-05-15 Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data Yiwen Liu et.al. 2505.10551 link Kimi
1014 2025-05-15 Real-Time Out-of-Distribution Failure Prevention via Multi-Modal Reasoning Milan Ganai et.al. 2505.10547 null Kimi
1015 2025-05-15 Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models Annie Wong et.al. 2505.10543 link Kimi
1016 2025-05-15 Exploring Implicit Visual Misunderstandings in Multimodal Large Language Models through Attention Analysis Pengfei Wang et.al. 2505.10541 link Kimi
1017 2025-05-15 Enhancing Multi-Image Question Answering via Submodular Subset Selection Aaryan Sharma et.al. 2505.10533 null Kimi
1018 2025-05-15 MASSV: Multimodal Adaptation and Self-Data Distillation for Speculative Decoding of Vision-Language Models Mugilan Ganesan et.al. 2505.10526 null Kimi
1019 2025-05-15 Knowledge capture, adaptation and composition (KCAC): A framework for cross-task curriculum learning in robotic manipulation Xinrui Wang et.al. 2505.10522 null Kimi
1020 2025-05-15 Multi-Token Prediction Needs Registers Anastasios Gerontopoulos et.al. 2505.10518 link Kimi
1021 2025-05-15 The Devil Is in the Word Alignment Details: On Translation-Based Cross-Lingual Transfer for Token Classification Tasks Benedikt Ebing et.al. 2505.10507 null Kimi
1022 2025-05-15 RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs Vibha Belavadi et.al. 2505.10495 null Kimi
1023 2025-05-15 Can You Really Trust Code Copilots? Evaluating Large Language Models from a Code Security Perspective Yutao Mou et.al. 2505.10494 link Kimi
1024 2025-05-15 CL-RAG: Bridging the Gap in Retrieval-Augmented Generation with Curriculum Learning Shaohan Wang et.al. 2505.10493 null Kimi
1025 2025-05-15 UniEval: Unified Holistic Evaluation for Unified Multimodal Understanding and Generation Yi Li et.al. 2505.10483 null Kimi
1026 2025-05-15 Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps Ningyuan Yang et.al. 2505.10482 null Kimi
1027 2025-05-15 Parallel Scaling Law for Language Models Mouxiang Chen et.al. 2505.10475 link Kimi
1028 2025-05-15 AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenge Ranjan Sapkota et.al. 2505.10468 null Kimi
1029 2025-05-15 Superposition Yields Robust Neural Scaling Yizhou Liu et.al. 2505.10465 link Kimi
1030 2025-05-15 Vision language models have difficulty recognizing virtual objects Tyler Tran et.al. 2505.10453 null Kimi
1031 2025-05-15 Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models Zemin Huang et.al. 2505.10446 null Kimi
1032 2025-05-15 Are Large Language Models Robust in Understanding Code Against Semantics-Preserving Mutations? Pedro Orvalho et.al. 2505.10443 null Kimi
1033 2025-05-15 Hierarchical Document Refinement for Long-context Retrieval-augmented Generation Jiajie Jin et.al. 2505.10413 link Kimi
1034 2025-05-15 Are LLM-generated plain language summaries truly understandable? A large-scale crowdsourced evaluation Yue Guo et.al. 2505.10409 null Kimi
1035 2025-05-15 Visual Fidelity Index for Generative Semantic Communications with Critical Information Embedding Jianhao Huang et.al. 2505.10405 null Kimi
1036 2025-05-15 Rethinking Repetition Problems of LLMs in Code Generation Yihong Dong et.al. 2505.10402 link Kimi
1037 2025-05-15 Evaluating Model Explanations without Ground Truth Kaivalya Rawal et.al. 2505.10399 link Kimi
1038 2025-05-15 J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning Chenxi Whitehouse et.al. 2505.10320 null Kimi
1039 2025-05-15 StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation Daniel A. P. Oliveira et.al. 2505.10292 link Kimi
1040 2025-05-15 The Evolving Landscape of Generative Large Language Models and Traditional Natural Language Processing in Medicine Rui Yang et.al. 2505.10261 null Kimi
1041 2025-05-15 Comparing LLM Text Annotation Skills: A Study on Human Rights Violations in Social Media Data Poli Apollinaire Nemkova et.al. 2505.10260 link Kimi
1042 2025-05-15 On the Interplay of Human-AI Alignment, Fairness, and Performance Trade-offs in Medical Imaging Haozhe Luo et.al. 2505.10231 link Kimi
1043 2025-05-15 ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention Jintian Shao et.al. 2505.10222 null Kimi
1044 2025-05-15 The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think Seongyun Lee et.al. 2505.10185 null Kimi
1045 2025-05-15 GE-Chat: A Graph Enhanced RAG Framework for Evidential Response Generation of LLMs Longchao Da et.al. 2505.10143 null Kimi
1046 2025-05-15 From Text to Network: Constructing a Knowledge Graph of Taiwan-Based China Studies Using Generative AI Hsuan-Lei Shao et.al. 2505.10093 null Kimi
1047 2025-05-15 CAFE: Retrieval Head-based Coarse-to-Fine Information Seeking to Enhance Multi-Document QA Capability Han Peng et.al. 2505.10063 null Kimi
1048 2025-05-15 PsOCR: Benchmarking Large Multimodal Models for Optical Character Recognition in Low-resource Pashto Language Ijazul Haq et.al. 2505.10055 link Kimi
1049 2025-05-15 ServeGen: Workload Characterization and Generation of Large Language Model Serving in Production Yuxing Xiang et.al. 2505.09999 link Kimi
1050 2025-05-15 Analysing Safety Risks in LLMs Fine-Tuned with Pseudo-Malicious Cyber Security Data Adel ElZemity et.al. 2505.09974 null Kimi
1051 2025-05-15 Pre-Act: Multi-Step Planning and Reasoning Improves Acting in LLM Agents Mrinal Rawat et.al. 2505.09970 null Kimi
1052 2025-05-15 Personalizing Large Language Models using Retrieval Augmented Generation and Knowledge Graph Deeksha Prahlad et.al. 2505.09945 link Kimi
1053 2025-05-15 Comparing Exploration-Exploitation Strategies of LLMs and Humans: Insights from Standard Multi-armed Bandit Tasks Ziyuan Zhang et.al. 2505.09901 link Kimi
1054 2025-05-14 Do Large Language Models Know Conflict? Investigating Parametric vs. Non-Parametric Knowledge of LLMs for Conflict Forecasting Apollinaire Poli Nemkova et.al. 2505.09852 null Kimi
1055 2025-05-14 Contextual Phenotyping of Pediatric Sepsis Cohort Using Large Language Models Aditya Nagori et.al. 2505.09805 null Kimi
1056 2025-05-14 Trustless Autonomy: Understanding Motivations, Benefits and Governance Dilemma in Self-Sovereign Decentralized AI Agents Botao Amber Hu et.al. 2505.09757 null Kimi
1057 2025-05-14 System Prompt Optimization with Meta-Learning Yumin Choi et.al. 2505.09666 null Kimi
1058 2025-05-14 Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists? Anthony GX-Chen et.al. 2505.09614 null Kimi
1059 2025-05-14 Customizing a Large Language Model for VHDL Design of High-Performance Microprocessors Nicolas Dupuis et.al. 2505.09610 null Kimi
1060 2025-05-14 WorldView-Bench: A Benchmark for Evaluating Global Cultural Perspectives in Large Language Models Abdullah Mushtaq et.al. 2505.09595 null Kimi
1061 2025-05-14 PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt Tuning Zongqian Li et.al. 2505.09519 link Kimi
1062 2025-05-14 CXMArena: Unified Dataset to benchmark performance in realistic CXM Scenarios Raghav Garg et.al. 2505.09436 link Kimi
1063 2025-05-14 Endo-CLIP: Progressive Self-Supervised Pre-training on Raw Colonoscopy Records Yili He et.al. 2505.09435 null Kimi
1064 2025-05-14 Multilingual Machine Translation with Quantum Encoder Decoder Attention-based Convolutional Variational Circuits Subrit Dikshit et.al. 2505.09407 null Kimi
1065 2025-05-14 The Influence of Human-inspired Agentic Sophistication in LLM-driven Strategic Reasoners Vince Trencsenyi et.al. 2505.09396 null Kimi
1066 2025-05-14 Qwen3 Technical Report An Yang et.al. 2505.09388 link Kimi
1067 2025-05-14 Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures Chenggang Zhao et.al. 2505.09343 null Kimi
1068 2025-05-14 Llama See, Llama Do: A Mechanistic Perspective on Contextual Entrainment and Distraction in LLMs Jingcheng Niu et.al. 2505.09338 link Kimi
1069 2025-05-14 Scent of Knowledge: Optimizing Search-Enhanced Reasoning with Information Foraging Hongjin Qian et.al. 2505.09316 null Kimi
1070 2025-05-14 Reproducibility Study of “Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents” Pedro M. P. Curvo et.al. 2505.09289 link Kimi
1071 2025-05-14 Learning to Detect Multi-class Anomalies with Just One Normal Image Prompt Bin-Bin Gao et.al. 2505.09264 link Kimi
1072 2025-05-14 ELIS: Efficient LLM Iterative Scheduling System with Response Length Predictor Seungbeom Choi et.al. 2505.09142 null Kimi
1073 2025-05-14 CEC-Zero: Chinese Error Correction Solution Based on LLM Sophie Zhang et.al. 2505.09082 null Kimi
1074 2025-05-14 A Comprehensive Analysis of Large Language Model Outputs: Similarity, Diversity, and Bias Brandon Smith et.al. 2505.09056 null Kimi
1075 2025-05-13 Improving the Reliability of LLMs: Combining CoT, RAG, Self-Consistency, and Self-Verification Adarsh Kumar et.al. 2505.09031 null Kimi
1076 2025-05-13 Automated Meta Prompt Engineering for Alignment with the Theory of Mind Aaron Baughman et.al. 2505.09024 null Kimi
1077 2025-05-13 Prioritizing Image-Related Tokens Enhances Vision-Language Pre-Training Yangyi Chen et.al. 2505.08971 link Kimi
1078 2025-05-13 Toward Cost-Efficient Serving of Mixture-of-Experts with Asynchrony Shaoyu Wang et.al. 2505.08944 null Kimi
1079 2025-05-13 Performance Gains of LLMs With Humans in a World of LLMs Versus Humans Lucas McCullum et.al. 2505.08902 null Kimi
1080 2025-05-13 Generative AI for Autonomous Driving: Frontiers and Opportunities Yuping Wang et.al. 2505.08854 link Kimi
1081 2025-05-13 CodePDE: An Inference Framework for LLM-driven PDE Solver Generation Shanda Li et.al. 2505.08783 link Kimi
1082 2025-05-14 Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology Yatai Ji et.al. 2505.08765 null Kimi
1083 2025-05-13 DeepMath-Creative: A Benchmark for Evaluating Mathematical Creativity of Large Language Models Xiaoyang Chen et.al. 2505.08744 link Kimi
1084 2025-05-13 Probability Consistency in Large Language Models: Theoretical Foundations Meet Empirical Discrepancies Xiaoliang Luo et.al. 2505.08739 link Kimi
1085 2025-05-13 NurValues: Real-World Nursing Values Evaluation for Large Language Models in Clinical Context Ben Yao et.al. 2505.08734 null Kimi
1086 2025-05-13 PWC-MoE: Privacy-Aware Wireless Collaborative Mixture of Experts Yang Su et.al. 2505.08719 null Kimi
1087 2025-05-13 LLM-based Prompt Ensemble for Reliable Medical Entity Recognition from EHRs K M Sajjadul Islam et.al. 2505.08704 null Kimi
1088 2025-05-13 TRAIL: Trace Reasoning and Agentic Issue Localization Darshan Deshpande et.al. 2505.08638 null Kimi
1089 2025-05-13 Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models Donghoon Kim et.al. 2505.08622 null Kimi
1090 2025-05-13 Automatic Task Detection and Heterogeneous LLM Speculative Decoding Danying Ge et.al. 2505.08600 null Kimi
1091 2025-05-13 Small but Significant: On the Promise of Small Language Models for Accessible AIED Yumou Wei et.al. 2505.08588 null Kimi
1092 2025-05-13 The Truth Becomes Clearer Through Debate! Multi-Agent Systems with Large Language Models Unmask Fake News Yuhan Liu et.al. 2505.08532 null Kimi
1093 2025-05-13 LCES: Zero-shot Automated Essay Scoring via Pairwise Comparisons Using Large Language Models Takumi Shibata et.al. 2505.08498 null Kimi
1094 2025-05-13 RepCali: High Efficient Fine-tuning Via Representation Calibration in Latent Space for Pre-trained Language Models Fujun Zhang et.al. 2505.08463 null Kimi
1095 2025-05-13 Accelerating Chain-of-Thought Reasoning: When Goal-Gradient Importance Meets Dynamic Skipping Ren Zhuang et.al. 2505.08392 null Kimi
1096 2025-05-13 Benchmarking AI scientists in omics data-driven biological research Erpai Luo et.al. 2505.08341 link Kimi
1097 2025-05-13 AM-Thinking-v1: Advancing the Frontier of Reasoning at 32B Scale Yunjie Ji et.al. 2505.08311 null Kimi
1098 2025-05-13 Evaluating the Effectiveness of Black-Box Prompt Optimization as the Scale of LLMs Continues to Grow Ziyu Zhou et.al. 2505.08303 null Kimi
1099 2025-05-13 Enhancing Cache-Augmented Generation (CAG) with Adaptive Contextual Compression for Scalable Knowledge Integration Rishabh Agrawal et.al. 2505.08261 null Kimi
1100 2025-05-13 Evaluating LLM Metrics Through Real-World Capabilities Justin K Miller et.al. 2505.08253 null Kimi
1101 2025-05-13 Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement Haoran Ye et.al. 2505.08245 link Kimi
1102 2025-05-13 A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM Outputs Artem Shelmanov et.al. 2505.08200 null Kimi
1103 2025-05-13 Fusing Bidirectional Chains of Thought and Reward Mechanisms: A Method for Enhancing Question-Answering Capabilities of Large Language Models for Chinese Intangible Cultural Heritage Ruilin Liu et.al. 2505.08167 null Kimi
1104 2025-05-13 Decoding Neighborhood Environments with Large Language Models Andrew Cart et.al. 2505.08163 null Kimi
1105 2025-05-13 Lost in Transmission: When and Why LLMs Fail to Reason Globally Tobias Schnabel et.al. 2505.08140 null Kimi
1106 2025-05-13 ALOHA: Empowering Multilingual Agent for University Orientation with Hierarchical Retrieval Mingxu Tao et.al. 2505.08130 null Kimi
1107 2025-05-12 Are LLMs complicated ethical dilemma analyzers? Jiashen et.al. 2505.08106 link Kimi
1108 2025-05-12 Beyond Input Activations: Identifying Influential Latents by Gradient Sparse Autoencoders Dong Shu et.al. 2505.08080 null Kimi
1109 2025-05-12 FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning Zhehao Zhang et.al. 2505.08054 null Kimi
1110 2025-05-12 Learning from Peers in Reasoning Models Tongxu Luo et.al. 2505.07787 null Kimi
1111 2025-05-12 S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models Muzhi Dai et.al. 2505.07686 null Kimi
1112 2025-05-12 SpecRouter: Adaptive Routing for Multi-Level Speculative Decoding in Large Language Models Hang Wu et.al. 2505.07680 null Kimi
1113 2025-05-13 OnPrem.LLM: A Privacy-Conscious Document Intelligence Toolkit Arun S. Maiya et.al. 2505.07672 link Kimi
1114 2025-05-12 Benchmarking Retrieval-Augmented Generation for Chemistry Xianrui Zhong et.al. 2505.07671 null Kimi
1115 2025-05-12 Concept-Level Explainability for Auditing & Steering LLM Responses Kenza Amara et.al. 2505.07610 link Kimi
1116 2025-05-12 MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining Xiaomi LLM-Core Team et.al. 2505.07608 link Kimi
1117 2025-05-12 Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent Ziyang Huang et.al. 2505.07596 null Kimi
1118 2025-05-12 A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models Junjie Ye et.al. 2505.07591 link Kimi
1119 2025-05-12 ToolACE-DEV: Self-Improving Tool Learning via Decomposition and EVolution Xu Huang et.al. 2505.07512 null Kimi
1120 2025-05-12 A Survey on Collaborative Mechanisms Between Large and Small Language Models Yi Chen et.al. 2505.07460 null Kimi
1121 2025-05-12 How well do LLMs reason over tabular data, really? Cornelius Wolff et.al. 2505.07453 null Kimi
1122 2025-05-12 Synthetic Code Surgery: Repairing Bugs and Vulnerabilities with LLMs and Synthetic Data David de-Fitero-Dominguez et.al. 2505.07372 null Kimi
1123 2025-05-12 QUPID: Quantified Understanding for Enhanced Performance, Insights, and Decisions in Korean Search Engines Ohjoon Kwon et.al. 2505.07345 null Kimi
1124 2025-05-12 Generative Pre-trained Autoregressive Diffusion Transformer Yuan Zhang et.al. 2505.07344 null Kimi
1125 2025-05-12 Towards Multi-Agent Reasoning Systems for Collaborative Expertise Delegation: An Exploratory Design Study Baixuan Xu et.al. 2505.07313 null Kimi
1126 2025-05-12 Semantic Retention and Extreme Compression in LLMs: Can We Have Both? Stanislas Laborde et.al. 2505.07289 null Kimi
1127 2025-05-12 UMoE: Unifying Attention and FFN with Shared Experts Yuanhang Yang et.al. 2505.07260 null Kimi
1128 2025-05-12 SAS-Bench: A Fine-Grained Benchmark for Evaluating Short Answer Scoring with Large Language Models Peichao Lai et.al. 2505.07247 link Kimi
1129 2025-05-12 Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity Guang Yan et.al. 2505.07239 null Kimi
1130 2025-05-12 DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation Jiashuo Sun et.al. 2505.07233 link Kimi
1131 2025-05-12 Measuring General Intelligence with Generated Games Vivek Verma et.al. 2505.07215 link Kimi
1132 2025-05-12 Benchmarking Ethical and Safety Risks of Healthcare LLMs in China-Toward Systemic Governance under Healthy China 2030 Mouxiao Bian et.al. 2505.07205 null Kimi
1133 2025-05-12 PrefillOnly: An Inference Engine for Prefill-only Workloads in Large Language Model Applications Kuntai Du et.al. 2505.07203 null Kimi
1134 2025-05-12 One Trigger Token Is Enough: A Defense Strategy for Balancing Safety and Usability in Large Language Models Haoran Gu et.al. 2505.07167 null Kimi
1135 2025-05-12 Pre-training vs. Fine-tuning: A Reproducibility Study on Dense Retrieval Knowledge Acquisition Zheng Yao et.al. 2505.07166 link Kimi
1136 2025-05-11 RefPentester: A Knowledge-Informed Self-Reflective Penetration Testing Framework Based on Large Language Models Hanzheng Dai et.al. 2505.07089 null Kimi
1137 2025-05-11 Architectural Precedents for General Agents using Large Language Models Robert E. Wray et.al. 2505.07087 null Kimi
1138 2025-05-11 DialogueReason: Rule-Based RL Sparks Dialogue Reasoning in LLMs Yubo Shu et.al. 2505.07049 null Kimi
1139 2025-05-11 LLM-Augmented Chemical Synthesis and Design Decision Programs Haorui Wang et.al. 2505.07027 null Kimi
1140 2025-05-08 DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion Qitao Zhao et.al. 2505.05473 null Kimi
1141 2025-05-08 Flow-GRPO: Training Flow Matching Models via Online RL Jie Liu et.al. 2505.05470 link Kimi
1142 2025-05-08 Generating Physically Stable and Buildable LEGO Designs from Text Ava Pun et.al. 2505.05469 link Kimi
1143 2025-05-08 StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant Haibo Wang et.al. 2505.05467 null Kimi
1144 2025-05-08 ComPO: Preference Alignment via Comparison Oracles Peter Chen et.al. 2505.05465 null Kimi
1145 2025-05-08 Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging Shiqi Chen et.al. 2505.05464 link Kimi
1146 2025-05-08 UKElectionNarratives: A Dataset of Misleading Narratives Surrounding Recent UK General Elections Fatima Haouari et.al. 2505.05459 null Kimi
1147 2025-05-08 SITE: towards Spatial Intelligence Thorough Evaluation Wenqi Wang et.al. 2505.05456 null Kimi
1148 2025-05-08 Conversational Process Model Redesign Nataliia Klievtsova et.al. 2505.05453 null Kimi
1149 2025-05-08 Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding Han Xiao et.al. 2505.05446 link Kimi
1150 2025-05-08 clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations Chalamalasetti Kranti et.al. 2505.05445 null Kimi
1151 2025-05-08 EcoAgent: An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation Biao Yi et.al. 2505.05440 null Kimi
1152 2025-05-08 Empowering Scientific Workflows with Federated Agents J. Gregory Pauloski et.al. 2505.05428 link Kimi
1153 2025-05-08 Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data Yudong Wang et.al. 2505.05427 null Kimi
1154 2025-05-08 TransProQA: an LLM-based literary Translation evaluation metric with Professional Question Answering Ran Zhang et.al. 2505.05423 link Kimi
1155 2025-05-08 TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation Haokun Lin et.al. 2505.05422 link Kimi
1156 2025-05-08 Reasoning Models Don’t Always Say What They Think Yanda Chen et.al. 2505.05410 null Kimi
1157 2025-05-08 Crosslingual Reasoning through Test-Time Scaling Zheng-Xin Yong et.al. 2505.05408 link Kimi
1158 2025-05-08 Frame In, Frame Out: Do LLMs Generate More Biased News Headlines than Humans? Valeria Pastorino et.al. 2505.05406 null Kimi
1159 2025-05-08 CART-ELC: Oblique Decision Tree Induction via Exhaustive Search Andrew D. Laack et.al. 2505.05402 link Kimi
1160 2025-05-08 PillarMamba: Learning Local-Global Context for Roadside Point Cloud via Hybrid State Space Model Zhang Zhang et.al. 2505.05397 null Kimi
1161 2025-05-08 EDmamba: A Simple yet Effective Event Denoising Method with State Space Model Ciyu Ruan et.al. 2505.05391 null Kimi
1162 2025-05-08 Walrus: An Efficient Decentralized Storage Network George Danezis et.al. 2505.05370 null Kimi
1163 2025-05-08 High-fidelity Grain Growth Modeling: Leveraging Deep Learning for Fast Computations Pungponhavoan Tep et.al. 2505.05354 null Kimi
1164 2025-05-08 Hearing and Seeing Through CLIP: A Framework for Self-Supervised Sound Source Localization Sooyoung Park et.al. 2505.05343 link Kimi
1165 2025-05-08 Progressive Inertial Poser: Progressive Real-Time Kinematic Chain Estimation for 3D Full-Body Pose from Three IMU Sensors Zunjie Zhu et.al. 2505.05336 null Kimi
1166 2025-05-08 ICon: In-Context Contribution for Automatic Data Selection Yixin Yang et.al. 2505.05327 null Kimi
1167 2025-05-08 Scalable Chain of Thoughts via Elastic Reasoning Yuhui Xu et.al. 2505.05315 link Kimi
1168 2025-05-08 T-T: Table Transformer for Tagging-based Aspect Sentiment Triplet Extraction Kun Peng et.al. 2505.05271 null Kimi
1169 2025-05-08 Revealing Weaknesses in Text Watermarking Through Self-Information Rewrite Attacks Yixin Cheng et.al. 2505.05190 link Kimi
1170 2025-05-08 Biomed-DPT: Dual Modality Prompt Tuning for Biomedical Vision-Language Models Wei Peng et.al. 2505.05189 link Kimi
1171 2025-05-08 MARK: Memory Augmented Refinement of Knowledge Anish Ganguli et.al. 2505.05177 null Kimi
1172 2025-05-08 X-Driver: Explainable Autonomous Driving with Vision-Language Models Wei Liu et.al. 2505.05098 null Kimi
1173 2025-05-08 Latent Preference Coding: Aligning Large Language Models via Discrete Latent Codes Zhuocheng Gong et.al. 2505.04993 null Kimi
1174 2025-05-08 Chain-of-Thought Tokens are Computer Program Variables Fangwei Zhu et.al. 2505.04955 link Kimi
1175 2025-05-08 Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models Yunxin Li et.al. 2505.04921 link Kimi
1176 2025-05-08 An Open-Source Dual-Loss Embedding Model for Semantic Retrieval in Higher Education Ramteja Sajja et.al. 2505.04916 null Kimi
1177 2025-05-08 Enigme: Generative Text Puzzles for Evaluating Reasoning in Language Models John Hawkins et.al. 2505.04914 link Kimi
1178 2025-05-08 SpatialPrompting: Keyframe-driven Zero-Shot Spatial Reasoning with Off-the-Shelf Multimodal Large Language Models Shun Taguchi et.al. 2505.04911 null Kimi
1179 2025-05-08 ConCISE: Confidence-guided Compression in Step-by-step Efficient Reasoning Ziqing Qiao et.al. 2505.04881 null Kimi
1180 2025-05-08 GroverGPT-2: Simulating Grover’s Algorithm via Chain-of-Thought Reasoning and Quantum-Native Tokenization Min Chen et.al. 2505.04880 null Kimi
1181 2025-05-07 CRAFT: Cultural Russian-Oriented Dataset Adaptation for Focused Text-to-Image Generation Viacheslav Vasilev et.al. 2505.04851 null Kimi
1182 2025-05-07 Benchmarking LLM Faithfulness in RAG with Evolving Leaderboards Manveer Singh Tamber et.al. 2505.04847 link Kimi
1183 2025-05-07 Large Language Models are Autonomous Cyber Defenders Sebastián R. Castro et.al. 2505.04843 link Kimi
1184 2025-05-07 ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling Xiao Wang et.al. 2505.04802 null Kimi
1185 2025-05-07 The Promise and Limits of LLMs in Constructing Proofs and Hints for Logic Problems in Intelligent Tutoring Systems Sutapa Dey Tithi et.al. 2505.04736 null Kimi
1186 2025-05-07 SOAEsV2-7B/72B: Full-Pipeline Optimization for State-Owned Enterprise LLMs via Continual Pre-Training, Domain-Progressive SFT and Distillation-Enhanced Speculative Decoding Jingyang Deng et.al. 2505.04723 null Kimi
1187 2025-05-07 EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning Zhenghao Xing et.al. 2505.04623 link Kimi
1188 2025-05-07 ZeroSearch: Incentivize the Search Capability of LLMs without Searching Hao Sun et.al. 2505.04588 link Kimi
1189 2025-05-07 Overcoming Data Scarcity in Generative Language Modelling for Low-Resource Languages: A Systematic Review Josh McGiff et.al. 2505.04531 null Kimi
1190 2025-05-07 Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs Yehui Tang et.al. 2505.04519 null Kimi
1191 2025-05-07 CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation Jiahao Li et.al. 2505.04481 null Kimi
1192 2025-05-07 OBLIVIATE: Robust and Practical Machine Unlearning for Large Language Models Xiaoyu Xu et.al. 2505.04416 null Kimi
1193 2025-05-07 YABLoCo: Yet Another Benchmark for Long Context Code Generation Aidar Valeev et.al. 2505.04406 null Kimi
1194 2025-05-07 The Aloe Family Recipe for Open and Specialized Healthcare LLMs Dario Garcia-Gasulla et.al. 2505.04388 null Kimi
1195 2025-05-07 Benchmarking LLMs’ Swarm intelligence Kai Ruan et.al. 2505.04364 link Kimi
1196 2025-05-07 GASCADE: Grouped Summarization of Adverse Drug Event for Enhanced Cancer Pharmacovigilance Sofia Jamil et.al. 2505.04284 link Kimi
1197 2025-05-07 SToLa: Self-Adaptive Touch-Language Framework with Tactile Commonsense Reasoning in Open-Ended Scenarios Ning Cheng et.al. 2505.04201 null Kimi
1198 2025-05-07 VideoPath-LLaVA: Pathology Diagnostic Reasoning Through Video Instruction Tuning Trinh T. L. Vuong et.al. 2505.04192 link Kimi
1199 2025-05-07 S3D: Sketch-Driven 3D Model Generation Hail Song et.al. 2505.04185 link Kimi
1200 2025-05-07 Large Language Models are often politically extreme, usually ideologically inconsistent, and persuasive even in informational contexts Nouar Aldahoul et.al. 2505.04171 null Kimi
1201 2025-05-07 Unmasking the Canvas: A Dynamic Benchmark for Image Generation Jailbreaking and LLM Content Safety Variath Madhupal Gautham Nair et.al. 2505.04146 null Kimi
1202 2025-05-07 Enhancing Granular Sentiment Classification with Chain-of-Thought Prompting in Large Language Models Vihaan Miriyala et.al. 2505.04135 null Kimi
1203 2025-05-07 LLM-e Guess: Can LLMs Capabilities Advance Without Hardware Progress? Teddy Foley et.al. 2505.04075 link Kimi
1204 2025-05-07 Advancing and Benchmarking Personalized Tool Invocation for LLMs Xu Huang et.al. 2505.04072 link Kimi
1205 2025-05-06 Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving Shan Yu et.al. 2505.04021 null Kimi
1206 2025-05-06 SLOT: Structuring the Output of Large Language Models Darren Yow-Bang Wang et.al. 2505.04016 null Kimi
1207 2025-05-06 Can Large Language Models Predict Parallel Code Performance? Gregory Bolet et.al. 2505.03988 null Kimi
1208 2025-05-06 X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains Qianchu Liu et.al. 2505.03981 null Kimi
1209 2025-05-06 The Power of Stories: Narrative Priming Shapes How LLM Agents Collaborate and Compete Gerrit Großmann et.al. 2505.03961 link Kimi
1210 2025-05-06 Frog Soup: Zero-Shot, In-Context, and Sample-Efficient Frogger Agents Xiang Li et.al. 2505.03947 link Kimi
1211 2025-05-06 MARCO: A Multi-Agent System for Optimizing HPC Code Generation Using Large Language Models Asif Rahman et.al. 2505.03906 null Kimi
1212 2025-05-06 Novel Extraction of Discriminative Fine-Grained Feature to Improve Retinal Vessel Segmentation Shuang Zeng et.al. 2505.03896 link Kimi
1213 2025-05-06 VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model Zuwei Long et.al. 2505.03739 link Kimi
1214 2025-05-06 WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch Zimu Lu et.al. 2505.03733 link Kimi
1215 2025-05-06 Distribution-Conditional Generation: From Class Distribution to Creative Generation Fu Feng et.al. 2505.03667 null Kimi
1216 2025-05-06 ReGraP-LLaVA: Reasoning enabled Graph-based Personalized Large Language and Vision Assistant Yifan Xiang et.al. 2505.03654 link Kimi
1217 2025-05-06 A Hashgraph-Inspired Consensus Mechanism for Reliable Multi-Model Reasoning Kolawole E. Ogunsina et.al. 2505.03553 null Kimi
1218 2025-05-06 Faster MoE LLM Inference for Extremely Large Models Haoqi Yang et.al. 2505.03531 null Kimi
1219 2025-05-06 Long-Short Chain-of-Thought Mixture Supervised Fine-Tuning Eliciting Efficient Reasoning in Large Language Models Bin Yu et.al. 2505.03469 link Kimi
1220 2025-05-06 The Steganographic Potentials of Language Models Artem Karpov et.al. 2505.03439 null Kimi
1221 2025-05-06 Procedural Memory Is Not All You Need: Bridging Cognitive Gaps in LLM-Based Agents Schaun Wheeler et.al. 2505.03434 null Kimi
1222 2025-05-06 MedArabiQ: Benchmarking Large Language Models on Arabic Medical Tasks Mouath Abu Daoud et.al. 2505.03427 link Kimi
1223 2025-05-06 Lightweight Clinical Decision Support System using QLoRA-Fine-Tuned LLMs and Retrieval-Augmented Generation Mohammad Shoaib Ansari et.al. 2505.03406 null Kimi
1224 2025-05-06 Absolute Zero: Reinforced Self-play Reasoning with Zero Data Andrew Zhao et.al. 2505.03335 link Kimi
1225 2025-05-06 AI-Driven Scholarly Peer Review via Persistent Workflow Prompting, Meta-Prompting, and Meta-Reasoning Evgeny Markhasin et.al. 2505.03332 null Kimi
1226 2025-05-06 Recall with Reasoning: Chain-of-Thought Distillation for Mamba’s Long-Context Memory and Extrapolation Junyu Ma et.al. 2505.03320 null Kimi
1227 2025-05-06 SepALM: Audio Language Models Are Error Correctors for Robust Speech Separation Zhaoxi Mu et.al. 2505.03273 null Kimi
1228 2025-05-06 RAVU: Retrieval Augmented Video Understanding with Compositional Reasoning over Graph Sameer Malik et.al. 2505.03173 null Kimi
1229 2025-05-06 Assessing and Enhancing the Robustness of LLM-based Multi-Agent Systems Through Chaos Engineering Joshua Owotogbe et.al. 2505.03096 null Kimi
1230 2025-05-05 Developing A Framework to Support Human Evaluation of Bias in Generated Free Response Text Jennifer Healey et.al. 2505.03053 null Kimi
1231 2025-05-05 A Typology of Synthetic Datasets for Dialogue Processing in Clinical Contexts Steven Bedrick et.al. 2505.03025 null Kimi
1232 2025-05-05 Memorization or Interpolation ? Detecting LLM Memorization through Input Perturbation Analysis Albérick Euraste Djiré et.al. 2505.03019 null Kimi
1233 2025-05-05 RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale Daniel Goldstein et.al. 2505.03005 link Kimi
1234 2025-05-05 Generating Narrated Lecture Videos from Slides with Synchronized Highlights Alexander Holmberg et.al. 2505.02966 null Kimi
1235 2025-05-05 When Your Own Output Becomes Your Training Data: Noise-to-Meaning Loops and a Formal RSI Trigger Rintaro Ando et.al. 2505.02888 link Kimi
1236 2025-05-05 AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation Qingqiu Li et.al. 2505.02830 null Kimi
1237 2025-05-05 AutoLibra: Agent Metric Induction from Open-Ended Feedback Hao Zhu et.al. 2505.02820 link Kimi
1238 2025-05-05 Knowing You Don’t Know: Learning When to Continue Search in Multi-round RAG through Self-Practicing Diji Yang et.al. 2505.02811 link Kimi
1239 2025-05-05 HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning Framework for Large Language Models Zheng Lin et.al. 2505.02795 null Kimi
1240 2025-05-05 Bye-bye, Bluebook? Automating Legal Procedure with Large Language Models Matthew Dahl et.al. 2505.02763 null Kimi
1241 2025-05-05 Using Knowledge Graphs to harvest datasets for efficient CLIP model training Simon Ging et.al. 2505.02746 link Kimi
1242 2025-05-05 FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models Zhouliang Yu et.al. 2505.02735 link Kimi
1243 2025-05-05 Enhancing LLMs’ Clinical Reasoning with Real-World Data from a Nationwide Sepsis Registry Junu Kim et.al. 2505.02722 link Kimi
1244 2025-05-05 Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play Yemin Shi et.al. 2505.02707 link Kimi
1245 2025-05-05 Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models Xiaobao Wu et.al. 2505.02686 link Kimi
1246 2025-05-05 A Survey of Slow Thinking-based Reasoning LLMs using Reinforced Learning and Inference-time Scaling Law Qianjun Pan et.al. 2505.02665 null Kimi
1247 2025-05-05 Enhancing Chemical Reaction and Retrosynthesis Prediction with Large Language Model and Dual-task Learning Xuan Lin et.al. 2505.02639 null Kimi
1248 2025-05-05 LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis Qingkai Fang et.al. 2505.02625 link Kimi
1249 2025-05-05 EMORL: Ensemble Multi-Objective Reinforcement Learning for Efficient and Flexible LLM Fine-Tuning Lingxiao Kong et.al. 2505.02579 link Kimi
1250 2025-05-05 Bielik v3 Small: Technical Report Krzysztof Ociepa et.al. 2505.02550 null Kimi
1251 2025-05-05 Large Language Model Partitioning for Low-Latency Inference at the Edge Dimitrios Kafetzis et.al. 2505.02533 null Kimi
1252 2025-05-05 Beyond the model: Key differentiators in large language models and multi-agent services Muskaan Goyal et.al. 2505.02489 null Kimi
1253 2025-05-05 Incentivizing Inclusive Contributions in Model Sharing Markets Enpei Zhang et.al. 2505.02462 null Kimi
1254 2025-05-05 Colombian Waitresses y Jueces canadienses: Gender and Country Biases in Occupation Recommendations from LLMs Elisa Forcada Rodríguez et.al. 2505.02456 null Kimi
1255 2025-05-05 Bielik 11B v2 Technical Report Krzysztof Ociepa et.al. 2505.02410 null Kimi
1256 2025-05-05 Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL Jiarui Yao et.al. 2505.02391 link Kimi
1257 2025-05-05 RM-R1: Reward Modeling as Reasoning Xiusi Chen et.al. 2505.02387 link Kimi
1258 2025-05-05 JTCSE: Joint Tensor-Modulus Constraints and Cross-Attention for Unsupervised Contrastive Learning of Sentence Embeddings Tianyu Zong et.al. 2505.02366 link Kimi
1259 2025-05-05 Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression Techniques Sanjay Surendranath Girija et.al. 2505.02309 null Kimi
1260 2025-05-05 Generative Sign-description Prompts with Multi-positive Contrastive Learning for Sign Language Recognition Siyu Liang et.al. 2505.02304 null Kimi
1261 2025-05-04 Parameter-Efficient Transformer Embeddings Henry Ndubuaku et.al. 2505.02266 link Kimi
1262 2025-05-04 SEval-Ex: A Statement-Level Framework for Explainable Summarization Evaluation Tanguy Herserant et.al. 2505.02235 null Kimi
1263 2025-05-04 Interpretable Emergent Language Using Inter-Agent Transformers Mannan Bhardwaj et.al. 2505.02215 link Kimi
1264 2025-05-04 Leveraging LLMs to Automate Energy-Aware Refactoring of Parallel Scientific Codes Matthew T. Dearing et.al. 2505.02184 null Kimi
1265 2025-05-04 Measuring Hong Kong Massive Multi-Task Language Understanding Chuxue Cao et.al. 2505.02177 null Kimi
1266 2025-05-04 A New HOPE: Domain-agnostic Automatic Evaluation of Text Chunking Henrik Brådland et.al. 2505.02171 null Kimi
1267 2025-05-04 Think on your Feet: Adaptive Thinking via Reinforcement Learning for Social Agents Minzheng Wang et.al. 2505.02156 link Kimi
1268 2025-05-01 T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT Dongzhi Jiang et.al. 2505.00703 link Kimi
1269 2025-05-01 RayZer: A Self-supervised Large View Synthesis Model Hanwen Jiang et.al. 2505.00702 null Kimi
1270 2025-05-01 Robotic Visual Instruction Yanbang Li et.al. 2505.00693 null Kimi
1271 2025-05-01 Towards Autonomous Micromobility through Scalable Urban Simulation Wayne Wu et.al. 2505.00690 null Kimi
1272 2025-05-01 GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution Aditya Arora et.al. 2505.00687 null Kimi
1273 2025-05-01 Visual Test-time Scaling for GUI Agent Grounding Tiange Luo et.al. 2505.00684 link Kimi
1274 2025-05-01 MINERVA: Evaluating Complex Video Reasoning Arsha Nagrani et.al. 2505.00681 link Kimi
1275 2025-05-01 Steering Large Language Models with Register Analysis for Arbitrary Style Transfer Xinchen Yang et.al. 2505.00679 null Kimi
1276 2025-05-01 Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions Yiming Du et.al. 2505.00675 link Kimi
1277 2025-05-01 DeepCritic: Deliberate Critique with Large Language Models Wenkai Yang et.al. 2505.00662 link Kimi
1278 2025-05-01 On the generalization of language models from in-context learning and finetuning: a controlled study Andrew K. Lampinen et.al. 2505.00661 null Kimi
1279 2025-05-01 Large Language Models Understanding: an Inherent Ambiguity Barrier Daniel N. Nissani et.al. 2505.00654 null Kimi
1280 2025-05-01 Open-Source LLM-Driven Federated Transformer for Predictive IoV Management Yazan Otoum et.al. 2505.00651 null Kimi
1281 2025-05-01 OmicsCL: Unsupervised Contrastive Learning for Cancer Subtype Discovery and Survival Stratification Atahan Karagoz et.al. 2505.00650 link Kimi
1282 2025-05-01 Investigating Task Arithmetic for Zero-Shot Information Retrieval Marco Braga et.al. 2505.00649 link Kimi
1283 2025-05-01 Deep Learning Assisted Outer Volume Removal for Highly-Accelerated Real-Time Dynamic MRI Merve Gülle et.al. 2505.00643 null Kimi
1284 2025-05-01 Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook Muyi Bao et.al. 2505.00630 link Kimi
1285 2025-05-01 The Illusion of Role Separation: Hidden Shortcuts in LLM Role Learning (and How to Fix Them) Zihao Wang et.al. 2505.00626 null Kimi
1286 2025-05-01 FineScope : Precision Pruning for Domain-Specialized Large Language Models Using SAE-Guided Self-Data Cultivation Chaitali Bhattacharyya et.al. 2505.00624 null Kimi
1287 2025-05-01 Pixel3DMM: Versatile Screen-Space Priors for Single-Image 3D Face Reconstruction Simon Giebenhain et.al. 2505.00615 null Kimi
1288 2025-05-01 Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation D. Sculley et.al. 2505.00612 null Kimi
1289 2025-05-01 Combining LLMs with Logic-Based Framework to Explain MCTS Ziyan An et.al. 2505.00610 null Kimi
1290 2025-05-01 Can LLMs Help Improve Analogical Reasoning For Strategic Decisions? Experimental Evidence from Humans and GPT-4 Phanish Puranam et.al. 2505.00603 null Kimi
1291 2025-05-01 Fast and Low-Cost Genomic Foundation Models via Outlier Removal Haozheng Luo et.al. 2505.00598 link Kimi
1292 2025-05-01 A Finite-State Controller Based Offline Solver for Deterministic POMDPs Alex Schutz et.al. 2505.00596 link Kimi
1293 2025-05-01 Uncertainty-Aware Multi-Expert Knowledge Distillation for Imbalanced Disease Grading Shuo Tong et.al. 2505.00592 null Kimi
1294 2025-05-01 FreqKV: Frequency Domain Key-Value Compression for Efficient Context Window Extension Jushi Kai et.al. 2505.00570 null Kimi
1295 2025-05-01 Triggering Hallucinations in LLMs: A Quantitative Study of Prompt-Induced Hallucination in Large Language Models Makoto Sato et.al. 2505.00557 null Kimi
1296 2025-05-01 100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models Chong Zhang et.al. 2505.00551 null Kimi
1297 2025-05-01 HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection Deanna Emery et.al. 2505.00506 null Kimi
1298 2025-05-01 UserCentrix: An Agentic Memory-augmented AI Framework for Smart Spaces Alaa Saleh et.al. 2505.00472 null Kimi
1299 2025-05-01 Red Teaming Large Language Models for Healthcare Vahid Balazadeh et.al. 2505.00467 null Kimi
1300 2025-05-01 Data Therapist: Eliciting Domain Knowledge from Subject Matter Experts Using Large Language Models Sungbok Shin et.al. 2505.00455 null Kimi
1301 2025-05-01 KoACD: The First Korean Adolescent Dataset for Cognitive Distortion Analysis JunSeo Kim et.al. 2505.00367 null Kimi
1302 2025-05-01 Enhancing AI-Driven Education: Integrating Cognitive Frameworks, Linguistic Feedback Analysis, and Ethical Considerations for Improved Content Generation Antoun Yaacoub et.al. 2505.00339 null Kimi
1303 2025-05-01 Mixture of Sparse Attention: Content-Based Learnable Sparse Attention via Expert-Choice Routing Piotr Piękos et.al. 2505.00315 link Kimi
1304 2025-05-01 Fine-grained spatial-temporal perception for gas leak segmentation Xinlong Zhao et.al. 2505.00295 link Kimi
1305 2025-05-01 Empowering Agentic Video Analytics Systems with Video Language Models Yuxuan Yan et.al. 2505.00254 null Kimi
1306 2025-04-30 Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems Shaokun Zhang et.al. 2505.00212 link Kimi
1307 2025-04-30 Detecting and Mitigating Hateful Content in Multimodal Memes with Vision-Language Models Minh-Hao Van et.al. 2505.00150 null Kimi
1308 2025-04-30 AdaptMI: Adaptive Skill-based In-context Math Instruction for Small Language Models Yinghui He et.al. 2505.00147 null Kimi
1309 2025-04-30 Between Underthinking and Overthinking: An Empirical Study of Reasoning Length and correctness in LLMs Jinyan Su et.al. 2505.00127 null Kimi
1310 2025-04-30 Fine-Tuning LLMs for Low-Resource Dialect Translation: The Case of Lebanese Silvana Yakhni et.al. 2505.00114 link Kimi
1311 2025-04-30 GDI-Bench: A Benchmark for General Document Intelligence with Vision and Reasoning Decoupling Siqi Li et.al. 2505.00063 null Kimi
1312 2025-04-30 TRUST: An LLM-Based Dialogue System for Trauma Understanding and Structured Assessments Sichang Tu et.al. 2504.21851 null Kimi
1313 2025-04-30 Early Exit and Multi Stage Knowledge Distillation in VLMs for Video Summarization Anas Anwarul Haq Khan et.al. 2504.21831 null Kimi
1314 2025-04-30 DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition Z. Z. Ren et.al. 2504.21801 link Kimi
1315 2025-04-30 WebThinker: Empowering Large Reasoning Models with Deep Research Capability Xiaoxi Li et.al. 2504.21776 link Kimi
1316 2025-04-30 MAC-Tuning: LLM Multi-Compositional Problem Reasoning with Enhanced Knowledge Boundary Awareness Junsheng Huang et.al. 2504.21773 null Kimi
1317 2025-04-30 AdaR1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization Haotian Luo et.al. 2504.21659 link Kimi
1318 2025-04-30 Sadeed: Advancing Arabic Diacritization Through Small Language Model Zeina Aldallal et.al. 2504.21635 null Kimi
1319 2025-04-30 Meeseeks: An Iterative Benchmark Evaluating LLMs Multi-Turn Instruction-Following Ability Jiaming Wang et.al. 2504.21625 null Kimi
1320 2025-04-30 RDF-Based Structured Quality Assessment Representation of Multilingual LLM Evaluations Jonas Gwozdz et.al. 2504.21605 null Kimi
1321 2025-04-30 DNB-AI-Project at SemEval-2025 Task 5: An LLM-Ensemble Approach for Automated Subject Indexing Lisa Kluge et.al. 2504.21589 link Kimi
1322 2025-04-30 Precision Where It Matters: A Novel Spike Aware Mixed-Precision Quantization Strategy for LLaMA-based Language Models Lucas Maisonnave et.al. 2504.21553 null Kimi
1323 2025-04-30 RWKV-X: A Linear Complexity Hybrid Language Model Haowen Hou et.al. 2504.21463 link Kimi
1324 2025-04-30 SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding Chenkai Zhang et.al. 2504.21435 link Kimi
1325 2025-04-30 Retrieval-Enhanced Few-Shot Prompting for Speech Event Extraction Máté Gedeon et.al. 2504.21372 null Kimi
1326 2025-04-30 ShorterBetter: Guiding Reasoning Models to Find Optimal Inference Length for Efficient Reasoning Jingyang Yi et.al. 2504.21370 null Kimi
1327 2025-04-30 Revisiting Diffusion Autoencoder Training for Image Reconstruction Quality Pramook Khungurn et.al. 2504.21368 null Kimi
1328 2025-04-30 Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing Hong Zhang et.al. 2504.21356 link Kimi
1329 2025-04-30 Phi-4-reasoning Technical Report Marah Abdin et.al. 2504.21318 null Kimi
1330 2025-04-30 BiasGuard: A Reasoning-enhanced Bias Detection Tool For Large Language Models Zhiting Fan et.al. 2504.21299 null Kimi
1331 2025-04-30 Reinforced MLLM: A Survey on RL-Based Reasoning in Multimodal Large Language Models Guanghao Zhou et.al. 2504.21277 null Kimi
1332 2025-04-30 Talk Before You Retrieve: Agent-Led Discussions for Better RAG in Medical QA Xuanzhao Dong et.al. 2504.21252 link Kimi
1333 2025-04-30 Memorization and Knowledge Injection in Gated LLMs Xu Pan et.al. 2504.21239 null Kimi
1334 2025-04-30 Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Haoran Xu et.al. 2504.21233 null Kimi
1335 2025-04-29 CachePrune: Neural-Based Attribution Defense Against Indirect Prompt Injection Attacks Rui Wang et.al. 2504.21228 null Kimi
1336 2025-04-29 Automatic Legal Writing Evaluation of LLMs Ramon Pires et.al. 2504.21202 link Kimi
1337 2025-04-29 Small or Large? Zero-Shot or Finetuned? Guiding Language Model Choice for Specialized Applications in Healthcare Lovedeep Gondara et.al. 2504.21191 null Kimi
1338 2025-04-29 OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification Shangyu Li et.al. 2504.20964 link Kimi
1339 2025-04-29 Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models Maryna Vyshnyvetska et.al. 2504.20951 null Kimi
1340 2025-04-29 Trace-of-Thought: Enhanced Arithmetic Problem Solving via Reasoning Distillation From Large to Small Language Models Tyler McDonald et.al. 2504.20946 null Kimi
1341 2025-04-29 ChestX-Reasoner: Advancing Radiology Foundation Models with Reasoning through Step-by-Step Verification Ziqing Fan et.al. 2504.20930 link Kimi
1342 2025-04-29 DYNAMAX: Dynamic computing for Transformers and Mamba based architectures Miguel Nogales et.al. 2504.20922 null Kimi
1343 2025-04-29 Using LLMs in Generating Design Rationale for Software Architecture Decisions Xiyu Zhou et.al. 2504.20781 link Kimi
1344 2025-04-29 JTreeformer: Graph-Transformer via Latent-Diffusion Model for Molecular Generation Ji Shi et.al. 2504.20770 null Kimi
1345 2025-04-29 Chain-of-Defensive-Thought: Structured Reasoning Elicits Robustness in Large Language Models against Reference Corruption Wenxiao Wang et.al. 2504.20769 null Kimi
1346 2025-04-29 Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think Hasan Abed Al Kader Hammoud et.al. 2504.20708 null Kimi
1347 2025-04-29 Cooking Up Creativity: A Cognitively-Inspired Approach for Enhancing LLM Creativity through Structured Representations Moran Mizrahi et.al. 2504.20643 link Kimi
1348 2025-04-29 The Hidden Risks of LLM-Generated Web Application Code: A Security-Centric Evaluation of Code Generation Capabilities in Large Language Models Swaroop Dora et.al. 2504.20612 null Kimi
1349 2025-04-29 Reinforcement Learning for Reasoning in Large Language Models with One Training Example Yiping Wang et.al. 2504.20571 link Kimi
1350 2025-04-29 UniDetox: Universal Detoxification of Large Language Models via Dataset Distillation Huimin Lu et.al. 2504.20500 link Kimi
1351 2025-04-29 Token-Efficient Prompt Injection Attack: Provoking Cessation in LLM Reasoning via Adaptive Token Compression Yu Cui et.al. 2504.20493 null Kimi
1352 2025-04-29 A Summary on GUI Agents with Foundation Models Enhanced by Reinforcement Learning Jiahao Li et.al. 2504.20464 null Kimi
1353 2025-04-29 Reviving Any-Subset Autoregressive Models with Principled Parallel Sampling and Speculative Decoding Gabe Guo et.al. 2504.20456 link Kimi
1354 2025-04-29 GaLore 2: Large-Scale LLM Pre-Training by Gradient Low-Rank Projection DiJia Su et.al. 2504.20437 null Kimi
1355 2025-04-29 FiLA-Video: Spatio-Temporal Compression for Fine-Grained Long Video Understanding Yanan Guo et.al. 2504.20384 null Kimi
1356 2025-04-29 Local Prompt Optimization Yash Jain et.al. 2504.20355 null Kimi
1357 2025-04-29 MicarVLMoE: A Modern Gated Cross-Aligned Vision-Language Mixture of Experts Model for Medical Image Captioning and Report Generation Amaan Izhar et.al. 2504.20343 link Kimi
1358 2025-04-28 Enhancing Systematic Reviews with Large Language Models: Using GPT-4 and Kimi Dandan Chen Kaptur et.al. 2504.20276 null Kimi
1359 2025-04-28 Can Large Language Models Learn Formal Logic? A Data-Driven Training and Evaluation Framework Yuan Xia et.al. 2504.20213 null Kimi
1360 2025-04-28 Weaving Context Across Images: Improving Vision-Language Models through Focus-Centric Visual Chains Juntian Zhang et.al. 2504.20199 null Kimi
1361 2025-04-28 MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools Nishant Subramani et.al. 2504.20168 link Kimi
1362 2025-04-28 AutoJudge: Judge Decoding Without Manual Annotation Roman Garipov et.al. 2504.20039 null Kimi
1363 2025-04-28 Towards Automated Scoping of AI for Social Good Projects Jacob Emmerson et.al. 2504.20010 null Kimi
1364 2025-04-28 TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons Emre Can Acikgoz et.al. 2504.19982 null Kimi
1365 2025-04-28 Accelerating Mixture-of-Experts Training with Adaptive Expert Replication Athinagoras Skiadopoulos et.al. 2504.19925 null Kimi
1366 2025-04-28 Enhancing Surgical Documentation through Multimodal Visual-Temporal Transformers and Generative AI Hugo Georgenthum et.al. 2504.19918 null Kimi
1367 2025-04-28 Can AI Agents Design and Implement Drug Discovery Pipelines? Khachik Smbatyan et.al. 2504.19912 null Kimi
1368 2025-04-28 GenCLS++: Pushing the Boundaries of Generative Classification in LLMs Through Comprehensive SFT and RL Studies Across Diverse Datasets Mingqian He et.al. 2504.19898 null Kimi
1369 2025-04-28 semi-PD: Towards Efficient LLM Serving via Phase-Wise Disaggregated Computation and Unified Storage Ke Hong et.al. 2504.19867 null Kimi
1370 2025-04-28 Can a Crow Hatch a Falcon? Lineage Matters in Predicting Large Language Model Performance Takuya Tamura et.al. 2504.19811 null Kimi
1371 2025-04-28 Moral Reasoning Across Languages: The Critical Role of Low-Resource Languages in LLMs Huichi Zhou et.al. 2504.19759 null Kimi
1372 2025-04-28 Reconstructing Context: Evaluating Advanced Chunking Strategies for Retrieval-Augmented Generation Carlo Merola et.al. 2504.19754 link Kimi
1373 2025-04-28 LLM-Assisted Automated Deductive Coding of Dialogue Data: Leveraging Dialogue-Specific Characteristics to Enhance Contextual Understanding Ying Na et.al. 2504.19734 null Kimi
1374 2025-04-28 Taming the Titans: A Survey of Efficient LLM Inference Serving Ranran Zhen et.al. 2504.19720 link Kimi
1375 2025-04-28 From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review Mohamed Amine Ferrag et.al. 2504.19678 null Kimi
1376 2025-04-28 Annif at SemEval-2025 Task 5: Traditional XMTC augmented by LLMs Osma Suominen et.al. 2504.19675 link Kimi
1377 2025-04-28 VCM: Vision Concept Modeling Based on Implicit Contrastive Learning with Vision-Language Instruction Fine-Tuning Run Luo et.al. 2504.19627 null Kimi
1378 2025-04-28 m-KAILIN: Knowledge-Driven Agentic Scientific Corpus Distillation Framework for Biomedical Large Language Models Training Meng Xiao et.al. 2504.19565 null Kimi
1379 2025-04-28 DEEMO: De-identity Multimodal Emotion Recognition and Reasoning Deng Li et.al. 2504.19549 null Kimi
1380 2025-04-28 Bullet: Boosting GPU Utilization for LLM Serving via Dynamic Spatial-Temporal Orchestration Zejia Lin et.al. 2504.19516 null Kimi
1381 2025-04-28 Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding Yan Wang et.al. 2504.19500 null Kimi
1382 2025-04-28 Improving Reasoning Performance in Large Language Models via Representation Engineering Bertram Højer et.al. 2504.19483 null Kimi
1383 2025-04-28 BRIDGE: Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text Jiageng Wu et.al. 2504.19467 link Kimi
1384 2025-04-28 Towards Long Context Hallucination Detection Siyi Liu et.al. 2504.19457 null Kimi
1385 2025-04-28 Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory Prateek Chhikara et.al. 2504.19413 null Kimi
1386 2025-04-28 ICL CIPHERS: Quantifying “Learning” in In-Context Learning via Substitution Ciphers Zhouxiang Fang et.al. 2504.19395 null Kimi
1387 2025-04-27 LLMs for Engineering: Teaching Models to Design High Powered Rockets Toby Simonds et.al. 2504.19394 null Kimi
1388 2025-04-27 Unified Multi-Task Learning & Model Fusion for Efficient Language Model Guardrailing James O’ Neill et.al. 2504.19333 null Kimi
1389 2025-04-27 Platonic Grounding for Efficient Multimodal Language Models Moulik Choraria et.al. 2504.19327 null Kimi
1390 2025-04-27 BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese Peilin Zhou et.al. 2504.19314 link Kimi
1391 2025-04-27 AndroidGen: Building an Android Language Agent under Data Scarcity Hanyu Lai et.al. 2504.19298 link Kimi
1392 2025-04-24 Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models Xu Ma et.al. 2504.17789 null Kimi
1393 2025-04-24 The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs Piotr Nawrot et.al. 2504.17768 null Kimi
1394 2025-04-24 Step1X-Edit: A Practical Framework for General Image Editing Shiyu Liu et.al. 2504.17761 link Kimi
1395 2025-04-24 Conversational Assistants to support Heart Failure Patients: comparing a Neurosymbolic Architecture with ChatGPT Anuja Tayal et.al. 2504.17753 null Kimi
1396 2025-04-24 CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos Shucheng Gong et.al. 2504.17728 link Kimi
1397 2025-04-24 Multilingual Performance Biases of Large Language Models in Education Vansh Gupta et.al. 2504.17720 null Kimi
1398 2025-04-24 Early Detection of Multidrug Resistance Using Multivariate Time Series Analysis and Interpretable Patient-Similarity Representations Óscar Escudero-Arnanz et.al. 2504.17717 null Kimi
1399 2025-04-24 Generative Fields: Uncovering Hierarchical Feature Control for StyleGAN via Inverted Receptive Fields Zhuo He et.al. 2504.17712 null Kimi
1400 2025-04-24 Plasma State Monitoring and Disruption Characterization using Multimodal VAEs Yoeri Poels et.al. 2504.17710 null Kimi
1401 2025-04-24 Safety in Large Reasoning Models: A Survey Cheng Wang et.al. 2504.17704 null Kimi
1402 2025-04-24 Federated Learning: A Survey on Privacy-Preserving Collaborative Intelligence Edward Collins et.al. 2504.17703 null Kimi
1403 2025-04-24 Hierarchical and Multimodal Data for Daily Activity Understanding Ghazal Kaviani et.al. 2504.17696 link Kimi
1404 2025-04-24 BIM-Constrained Optimization for Accurate Localization and Deviation Correction in Construction Monitoring Asier Bikandi et.al. 2504.17693 null Kimi
1405 2025-04-24 Ensemble Bayesian Inference: Leveraging Small Language Models to Achieve LLM-level Accuracy in Profile Matching Tasks Haru-Tada Sato et.al. 2504.17685 null Kimi
1406 2025-04-24 INSIGHT: Bridging the Student-Teacher Gap in Times of Large Language Models Jarne Thys et.al. 2504.17677 null Kimi
1407 2025-04-24 Energy Considerations of Large Language Model Inference and Efficiency Optimizations Jared Fernandez et.al. 2504.17674 null Kimi
1408 2025-04-24 Cross-region Model Training with Communication-Computation Overlapping and Delay Compensation Ying Zhu et.al. 2504.17672 null Kimi
1409 2025-04-24 Data-Driven Calibration of Prediction Sets in Large Vision-Language Models Based on Inductive Conformal Prediction Yuanchang Ye et.al. 2504.17671 null Kimi
1410 2025-04-24 DiMeR: Disentangled Mesh Reconstruction Model Lutao Jiang et.al. 2504.17670 link Kimi
1411 2025-04-24 Towards a HIPAA Compliant Agentic AI System in Healthcare Subash Neupane et.al. 2504.17669 null Kimi
1412 2025-04-24 Evaluating Grounded Reasoning by Code-Assisted Large Language Models for Mathematics Zena Al-Khalili et.al. 2504.17665 null Kimi
1413 2025-04-24 Aerial Image Classification in Scarce and Unconstrained Environments via Conformal Prediction Farhad Pourkamali-Anaraki et.al. 2504.17655 null Kimi
1414 2025-04-24 DeepDistill: Enhancing LLM Reasoning Capabilities via Large-Scale Difficulty-Graded Data Training Xiaoyu Tian et.al. 2504.17565 null Kimi
1415 2025-04-24 HalluLens: LLM Hallucination Benchmark Yejin Bang et.al. 2504.17550 null Kimi
1416 2025-04-24 A Comprehensive Survey of Knowledge-Based Vision Question Answering Systems: The Lifecycle of Knowledge in Visual Reasoning Task Jiaqi Deng et.al. 2504.17547 null Kimi
1417 2025-04-24 Auditing the Ethical Logic of Generative AI Models W. Russell Neuman et.al. 2504.17544 null Kimi
1418 2025-04-24 Unified Attacks to Large Language Model Watermarks: Spoofing and Scrubbing in Unauthorized Knowledge Distillation Xin Yi et.al. 2504.17480 null Kimi
1419 2025-04-24 FRAG: Frame Selection Augmented Generation for Long Video and Long Document Understanding De-An Huang et.al. 2504.17447 link Kimi
1420 2025-04-24 Assessing the Capability of Large Language Models for Domain-Specific Ontology Generation Anna Sofia Lippolis et.al. 2504.17402 null Kimi
1421 2025-04-24 LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live Streams Yongxuan Wu et.al. 2504.17366 link Kimi
1422 2025-04-24 TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation Ling You et.al. 2504.17365 null Kimi
1423 2025-04-24 FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation Yulia Otmakhova et.al. 2504.17311 null Kimi
1424 2025-04-24 JurisCTC: Enhancing Legal Judgment Prediction via Cross-Domain Transfer and Contrastive Learning Zhaolu Kang et.al. 2504.17264 null Kimi
1425 2025-04-24 MCAF: Efficient Agent-based Video Understanding Framework through Multimodal Coarse-to-Fine Attention Focusing Shiwen Cao et.al. 2504.17213 null Kimi
1426 2025-04-24 A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation Yangxinyu Xie et.al. 2504.17200 null Kimi
1427 2025-04-24 Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning Minju Seo et.al. 2504.17192 link Kimi
1428 2025-04-23 MIRAGE: A Metric-Intensive Benchmark for Retrieval-Augmented Generation Evaluation Chanhee Park et.al. 2504.17137 null Kimi
1429 2025-04-23 Steering the CensorShip: Uncovering Representation Vectors for LLM “Thought” Control Hannah Cyberey et.al. 2504.17130 link Kimi
1430 2025-04-23 The Rise of Small Language Models in Healthcare: A Comprehensive Survey Muskan Garg et.al. 2504.17119 null Kimi
1431 2025-04-23 Leveraging LLMs as Meta-Judges: A Multi-Agent Framework for Evaluating LLM Judgments Yuran Li et.al. 2504.17087 null Kimi
1432 2025-04-23 DyMU: Dynamic Merging and Virtual Unmerging for Efficient VLMs Zhenhailong Wang et.al. 2504.17040 null Kimi
1433 2025-04-23 (Im)possibility of Automated Hallucination Detection in Large Language Models Amin Karbasi et.al. 2504.17004 null Kimi
1434 2025-04-23 Tracing Thought: Using Chain-of-Thought Reasoning to Identify the LLM Behind AI-Generated Text Shifali Agrahari et.al. 2504.16913 null Kimi
1435 2025-04-23 Do Large Language Models know who did what to whom? Joseph M. Denning et.al. 2504.16884 null Kimi
1436 2025-04-23 Monte Carlo Planning with Large Language Model for Text-Based Game Agents Zijing Shi et.al. 2504.16855 null Kimi
1437 2025-04-23 GreenMind: A Next-Generation Vietnamese Large Language Model for Structured and Logical Reasoning Luu Quy Tung et.al. 2504.16832 null Kimi
1438 2025-04-23 Process Reward Models That Think Muhammad Khalifa et.al. 2504.16828 link Kimi
1439 2025-04-23 Random Long-Context Access for Mamba via Hardware-aligned Hierarchical Sparse Attention Xiang Hu et.al. 2504.16795 null Kimi
1440 2025-04-23 Towards Explainable AI: Multi-Modal Transformer for Video-based Image Description Generation Lakshita Agarwal et.al. 2504.16788 null Kimi
1441 2025-04-23 MOOSComp: Improving Lightweight Long-Context Compressor via Mitigating Over-Smoothing and Incorporating Outlier Scores Fengwei Zhou et.al. 2504.16786 null Kimi
1442 2025-04-23 How Effective are Generative Large Language Models in Performing Requirements Classification? Waad Alhoshan et.al. 2504.16768 null Kimi
1443 2025-04-23 Lightweight Latent Verifiers for Efficient Meta-Generation Strategies Bartosz Piotrowski et.al. 2504.16760 null Kimi
1444 2025-04-23 HEMA: A Hippocampus-Inspired Extended Memory Architecture for Long-Context AI Conversations Kwangseob Ahn et.al. 2504.16754 null Kimi
1445 2025-04-23 IRIS: Interactive Research Ideation System for Accelerating Scientific Discovery Aniketh Garikaparthi et.al. 2504.16728 link Kimi
1446 2025-04-23 Debunking with Dialogue? Exploring AI-Generated Counterspeech to Challenge Conspiracy Theories Mareike Lisker et.al. 2504.16604 null Kimi
1447 2025-04-23 Comparing Large Language Models and Traditional Machine Translation Tools for Translating Medical Consultation Summaries: A Pilot Study Andy Li et.al. 2504.16601 null Kimi
1448 2025-04-23 PIS: Linking Importance Sampling and Attention Mechanisms for Efficient Prompt Compression Lizhe Chen et.al. 2504.16574 null Kimi
1449 2025-04-23 Amplified Vulnerabilities: Structured Jailbreak Attacks on LLM-based Multi-Agent Debate Senmao Qi et.al. 2504.16489 null Kimi
1450 2025-04-23 Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark Hanlei Zhang et.al. 2504.16427 link Kimi
1451 2025-04-23 Evaluating Multi-Hop Reasoning in Large Language Models: A Chemistry-Centric Case Study Mohammad Khodadad et.al. 2504.16414 null Kimi
1452 2025-04-23 ConTextual: Improving Clinical Text Summarization in LLMs with Context-preserving Token Filtering and Knowledge Graphs Fahmida Liza Piya et.al. 2504.16394 link Kimi
1453 2025-04-23 SplitReason: Learning To Offload Reasoning Yash Akhauri et.al. 2504.16379 null Kimi
1454 2025-04-23 Text-to-TrajVis: Enabling Trajectory Data Visualizations from Natural Language Questions Tian Bai et.al. 2504.16358 null Kimi
1455 2025-04-23 DP2FL: Dual Prompt Personalized Federated Learning in Foundation Models Ying Chang et.al. 2504.16357 null Kimi
1456 2025-04-22 The Paradox of Poetic Intent in Back-Translation: Evaluating the Quality of Large Language Models in Chinese Translation Li Weigang et.al. 2504.16286 null Kimi
1457 2025-04-22 FinNLI: Novel Dataset for Multi-Genre Financial Natural Language Inference Benchmarking Jabez Magomere et.al. 2504.16188 null Kimi
1458 2025-04-22 MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention Yucheng Li et.al. 2504.16083 null Kimi
1459 2025-04-22 MR. Video: “MapReduce” is the Principle for Long Video Understanding Ziqi Pang et.al. 2504.16082 null Kimi
1460 2025-04-22 LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities Thomas Schmied et.al. 2504.16078 null Kimi
1461 2025-04-22 LongMamba: Enhancing Mamba’s Long Context Capabilities via Training-Free Receptive Field Enlargement Zhifan Ye et.al. 2504.16053 link Kimi
1462 2025-04-22 Benchmarking LLM for Code Smells Detection: OpenAI GPT-4.0 vs DeepSeek-V3 Ahmed R. Sadik et.al. 2504.16027 null Kimi
1463 2025-04-23 CAPO: Cost-Aware Prompt Optimization Tom Zehle et.al. 2504.16005 link Kimi
1464 2025-04-22 FairTranslate: An English-French Dataset for Gender Bias Evaluation in Machine Translation by Overcoming Gender Binarity Fanny Jourdan et.al. 2504.15941 link Kimi
1465 2025-04-22 Impact of Noise on LLM-Models Performance in Abstraction and Reasoning Corpus (ARC) Tasks with Model Temperature Considerations Nikhil Khandalkar et.al. 2504.15903 null Kimi
1466 2025-04-22 SARI: Structured Audio Reasoning via Curriculum-Guided Reinforcement Learning Cheng Wen et.al. 2504.15900 null Kimi
1467 2025-04-22 Dynamic Early Exit in Reasoning Models Chenxu Yang et.al. 2504.15895 link Kimi
1468 2025-04-22 What’s the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token Patterns Michael A. Hedderich et.al. 2504.15815 link Kimi
1469 2025-04-22 A closer look at how large language models trust humans: patterns and biases Valeria Lerman et.al. 2504.15801 null Kimi
1470 2025-04-22 Automated Creativity Evaluation for Large Language Models: A Reference-Based Approach Ruizhe Li et.al. 2504.15784 null Kimi
1471 2025-04-22 TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving Daocheng Fu et.al. 2504.15780 null Kimi
1472 2025-04-22 DianJin-R1: Evaluating and Enhancing Financial Reasoning in Large Language Models Jie Zhu et.al. 2504.15716 link Kimi
1473 2025-04-22 Cost-Effective Text Clustering with Large Language Models Hongtao Wang et.al. 2504.15640 null Kimi
1474 2025-04-22 DR.FIX: Automatically Fixing Data Races at Industry Scale Farnaz Behrang et.al. 2504.15637 link Kimi
1475 2025-04-22 Exploiting Contextual Knowledge in LLMs through V-usable Information based Layer Enhancement Xiaowei Yuan et.al. 2504.15630 null Kimi
1476 2025-04-22 A Multi-Agent Framework for Automated Qinqiang Opera Script Generation Using Large Language Models Gengxian Cao et.al. 2504.15552 null Kimi
1477 2025-04-22 llm-jp-modernbert: A ModernBERT Model Trained on a Large-Scale Japanese Corpus with Long Context Length Issa Sugiura et.al. 2504.15544 null Kimi
1478 2025-04-22 Compass-V2 Technical Report Sophia Maria et.al. 2504.15527 null Kimi
1479 2025-04-21 CAPTURe: Evaluating Spatial Reasoning in Vision Language Models via Occluded Object Counting Atin Pothiraj et.al. 2504.15485 null Kimi
1480 2025-04-21 Speculative Sampling via Exponential Races Szymon Kobus et.al. 2504.15475 null Kimi
1481 2025-04-21 Trillion 7B Technical Report Sungjun Han et.al. 2504.15431 null Kimi
1482 2025-04-21 LLM-Assisted Translation of Legacy FORTRAN Codes to C++: A Cross-Platform Study Nishath Rajiv Ranasinghe et.al. 2504.15424 null Kimi
1483 2025-04-21 IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs David Ma et.al. 2504.15415 link Kimi
1484 2025-04-21 Tell Me What You Know About Sexism: Expert-LLM Interaction Strategies and Co-Created Definitions for Zero-Shot Sexism Detection Myrthe Reuver et.al. 2504.15392 link Kimi
1485 2025-04-21 Towards Understanding Camera Motions in Any Video Zhiqiu Lin et.al. 2504.15376 null Kimi
1486 2025-04-21 KeDiff: Key Similarity-Based KV Cache Eviction for Long-Context LLM Inference in Resource-Constrained Environments Junyoung Park et.al. 2504.15364 null Kimi
1487 2025-04-21 Exploring Compositional Generalization (in ReCOGS_pos) by Transformers using Restricted Access Sequence Processing (RASP) William Bruns et.al. 2504.15349 null Kimi
1488 2025-04-21 Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs Chun-Hsiao Yeh et.al. 2504.15280 link Kimi
1489 2025-04-21 FlowReasoner: Reinforcing Query-Level Meta-Agents Hongcheng Gao et.al. 2504.15257 link Kimi
1490 2025-04-21 Support Evaluation for the TREC 2024 RAG Track: Comparing Human versus LLM Judges Nandan Thakur et.al. 2504.15205 null Kimi
1491 2025-04-21 The Synthetic Imputation Approach: Generating Optimal Synthetic Texts For Underrepresented Categories In Supervised Classification Tasks Joan C. Timoneda et.al. 2504.15160 null Kimi
1492 2025-04-21 EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models Ziwen Xu et.al. 2504.15133 link Kimi
1493 2025-04-21 Kuwain 1.5B: An Arabic SLM via Language Injection Khalil Hennara et.al. 2504.15120 null Kimi
1494 2025-04-21 A triple-branch network for latent fingerprint enhancement guided by orientation fields and minutiae Yurun Wang et.al. 2504.15105 null Kimi
1495 2025-04-21 Rethinking the Potential of Multimodality in Collaborative Problem Solving Diagnosis with Large Language Models K. Wong et.al. 2504.15093 null Kimi
1496 2025-04-21 DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation Weijie He et.al. 2504.15032 null Kimi
1497 2025-04-21 Efficient Pretraining Length Scaling Bohong Wu et.al. 2504.14992 null Kimi
1498 2025-04-21 Speaker Fuzzy Fingerprints: Benchmarking Text-Based Identification in Multiparty Dialogues Rui Ribeiro et.al. 2504.14963 null Kimi
1499 2025-04-21 MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core Dennis Liu et.al. 2504.14960 null Kimi
1500 2025-04-21 EducationQ: Evaluating LLMs’ Teaching Capabilities Through Multi-Agent Dialogue Framework Yao Shi et.al. 2504.14928 null Kimi
1501 2025-04-21 CRAVE: A Conflicting Reasoning Approach for Explainable Claim Verification Using LLMs Yingming Zheng et.al. 2504.14905 link Kimi
1502 2025-04-21 Latent Bayesian Optimization via Autoregressive Normalizing Flows Seunghun Lee et.al. 2504.14889 null Kimi
1503 2025-04-21 Natural Fingerprints of Large Language Models Teppei Suzuki et.al. 2504.14871 null Kimi
1504 2025-04-21 OTC: Optimal Tool Calls via Reinforcement Learning Hongru Wang et.al. 2504.14870 null Kimi
1505 2025-04-21 ECViT: Efficient Convolutional Vision Transformer with Local-Attention and Multi-scale Stages Zhoujie Qian et.al. 2504.14825 null Kimi
1506 2025-04-21 On Self-improving Token Embeddings Mario M. Kubek et.al. 2504.14808 null Kimi
1507 2025-04-21 Automatic Evaluation Metrics for Document-level Translation: Overview, Challenges and Trends Jiaxin GUO et.al. 2504.14804 null Kimi
1508 2025-04-21 gLLM: Global Balanced Pipeline Parallelism System for Distributed LLM Serving with Token Throttling Tianyu Guo et.al. 2504.14775 link Kimi
1509 2025-04-21 PLANET: A Collection of Benchmarks for Evaluating LLMs’ Planning Capabilities Haoming Li et.al. 2504.14773 null Kimi
1510 2025-04-20 Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions Luyang Fang et.al. 2504.14772 null Kimi
1511 2025-04-20 SWE-Synth: Synthesizing Verifiable Bug-Fix Data to Enable Large Language Models in Resolving Real-World Bugs Minh V. T. Pham et.al. 2504.14757 null Kimi
1512 2025-04-20 PROMPTEVALS: A Dataset of Assertions and Guardrails for Custom Production Large Language Model Pipelines Reya Vir et.al. 2504.14738 null Kimi
1513 2025-04-20 AI with Emotions: Exploring Emotional Expressions in Large Language Models Shin-nosuke Ishikawa et.al. 2504.14706 null Kimi
1514 2025-04-20 Video-MMLU: A Massive Multi-Discipline Lecture Understanding Benchmark Enxin Song et.al. 2504.14693 link Kimi
1515 2025-04-20 FarsEval-PKBETS: A new diverse benchmark for evaluating Persian large language models Mehrnoush Shamsfard et.al. 2504.14690 null Kimi
1516 2025-04-20 Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens Kaihang Pan et.al. 2504.14666 null Kimi
1517 2025-04-20 A Case Study Exploring the Current Landscape of Synthetic Medical Record Generation with Commercial LLMs Yihan Lin et.al. 2504.14657 null Kimi
1518 2025-04-17 PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding Jang Hyun Cho et.al. 2504.13180 link Kimi
1519 2025-04-17 Single-Shot Shape and Reflectance with Spatial Polarization Multiplexing Tomoki Ichikawa et.al. 2504.13177 null Kimi
1520 2025-04-17 It’s All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization Ali Behrouz et.al. 2504.13173 null Kimi
1521 2025-04-17 Sleep-time Compute: Beyond Inference Scaling at Test-time Kevin Lin et.al. 2504.13171 link Kimi
1522 2025-04-17 Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling Tsung-Han Wu et.al. 2504.13169 link Kimi
1523 2025-04-17 CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training Shizhe Diao et.al. 2504.13161 null Kimi
1524 2025-04-17 MIB: A Mechanistic Interpretability Benchmark Aaron Mueller et.al. 2504.13151 link Kimi
1525 2025-04-17 Readable Twins of Unreadable Models Krzysztof Pancerz et.al. 2504.13150 link Kimi
1526 2025-04-17 Antidistillation Sampling Yash Savani et.al. 2504.13146 null Kimi
1527 2025-04-17 Exploring Expert Failures Improves LLM Agent Tuning Li-Cheng Lan et.al. 2504.13145 null Kimi
1528 2025-04-17 Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark Siwei Yang et.al. 2504.13143 null Kimi
1529 2025-04-17 Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo João Loula et.al. 2504.13139 null Kimi
1530 2025-04-17 Energy-Based Reward Models for Robust Language Model Alignment Anamika Lochab et.al. 2504.13134 link Kimi
1531 2025-04-17 Science-T2I: Addressing Scientific Illusions in Image Synthesis Jialuo Li et.al. 2504.13129 null Kimi
1532 2025-04-17 LLMs Meet Finance: Fine-Tuning Foundation Models for the Open FinLLM Leaderboard Varun Rao et.al. 2504.13125 null Kimi
1533 2025-04-17 Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training Xinsong Zhang et.al. 2504.13123 null Kimi
1534 2025-04-17 VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models Haojian Huang et.al. 2504.13122 link Kimi
1535 2025-04-17 Probing and Inducing Combinational Creativity in Vision-Language Models Yongqian Peng et.al. 2504.13120 null Kimi
1536 2025-04-17 EventVAD: Training-Free Event-Aware Video Anomaly Detection Yihua Shao et.al. 2504.13092 null Kimi
1537 2025-04-17 Retrieval-Augmented Generation with Conflicting Evidence Han Wang et.al. 2504.13079 link Kimi
1538 2025-04-17 Enhancing Person-to-Person Virtual Try-On with Multi-Garment Virtual Try-Off Riza Velioglu et.al. 2504.13078 link Kimi
1539 2025-04-17 SkyReels-V2: Infinite-length Film Generative Model Guibin Chen et.al. 2504.13074 link Kimi
1540 2025-04-17 Accuracy is Not Agreement: Expert-Aligned Evaluation of Crash Narrative Classification Models Sudesh Ramesh Bhagat et.al. 2504.13068 null Kimi
1541 2025-04-17 RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins Yao Mu et.al. 2504.13059 null Kimi
1542 2025-04-17 Aspect-Based Summarization with Self-Aspect Retrieval Enhanced Generation Yichao Feng et.al. 2504.13054 null Kimi
1543 2025-04-17 How Large Language Models Are Changing MOOC Essay Answers: A Comparison of Pre- and Post-LLM Responses Leo Leppänen et.al. 2504.13038 null Kimi
1544 2025-04-17 Towards Cardiac MRI Foundation Models: Comprehensive Visual-Tabular Representations for Whole-Heart Assessment and Beyond Yundi Zhang et.al. 2504.13037 link Kimi
1545 2025-04-17 InstructRAG: Leveraging Retrieval-Augmented Generation on Instruction Graphs for LLM-Based Task Planning Zheng Wang et.al. 2504.13032 null Kimi
1546 2025-04-17 ChatEXAONEPath: An Expert-level Multimodal Large Language Model for Histopathology Using Whole Slide Images Sangwook Kim et.al. 2504.13023 null Kimi
1547 2025-04-17 Pose and Facial Expression Transfer by using StyleGAN Petr Jahoda et.al. 2504.13021 null Kimi
1548 2025-04-17 SHA256 at SemEval-2025 Task 4: Selective Amnesia – Constrained Unlearning for Large Language Models via Knowledge Isolation Saransh Agrawal et.al. 2504.12996 link Kimi
1549 2025-04-17 Are Retrials All You Need? Enhancing Large Language Model Reasoning Without Verbalized Feedback Nearchos Potamitis et.al. 2504.12951 null Kimi
1550 2025-04-17 Information Gain-Guided Causal Intervention for Autonomous Debiasing Large Language Models Zhouhao Sun et.al. 2504.12898 null Kimi
1551 2025-04-17 EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting Guanrou Yang et.al. 2504.12867 null Kimi
1552 2025-04-17 Can LLMs reason over extended multilingual contexts? Towards long-context evaluation beyond retrieval and haystacks Amey Hengle et.al. 2504.12845 link Kimi
1553 2025-04-17 Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration Yicheng Pan et.al. 2504.12773 link Kimi
1554 2025-04-17 Pandora: A Code-Driven Large Language Model Agent for Unified Reasoning Across Diverse Structured Knowledge Yongrui Chen et.al. 2504.12734 null Kimi
1555 2025-04-17 Why and How LLMs Hallucinate: Connecting the Dots with Subsequence Associations Yiyou Sun et.al. 2504.12691 link Kimi
1556 2025-04-17 Data-efficient LLM Fine-tuning for Code Generation Weijie Lv et.al. 2504.12687 link Kimi
1557 2025-04-17 Scaling Instruction-Tuned LLMs to Million-Token Contexts via Hierarchical Synthetic Data Generation Linda He et.al. 2504.12637 null Kimi
1558 2025-04-17 Identifying and Mitigating the Influence of the Prior Distribution in Large Language Models Liyi Zhang et.al. 2504.12585 link Kimi
1559 2025-04-17 MetaSynth: Meta-Prompting-Driven Agentic Scaffolds for Diverse Synthetic Data Generation Haris Riaz et.al. 2504.12563 null Kimi
1560 2025-04-17 ZeroSumEval: Scaling LLM Evaluation with Inter-Model Competition Haidar Khan et.al. 2504.12562 link Kimi
1561 2025-04-17 Memorization: A Close Look at Books Iris Ma et.al. 2504.12549 null Kimi
1562 2025-04-16 MOM: Memory-Efficient Offloaded Mini-Sequence Inference for Long Context Language Models Junyang Zhang et.al. 2504.12526 null Kimi
1563 2025-04-16 Memorization vs. Reasoning: Updating LLMs with New Knowledge Aochong Oliver Li et.al. 2504.12523 null Kimi
1564 2025-04-16 Towards Conversational AI for Human-Machine Collaborative MLOps George Fatouros et.al. 2504.12477 null Kimi
1565 2025-04-16 Integrating Structural and Semantic Signals in Text-Attributed Graphs with BiGTex Azadeh Beiranvand et.al. 2504.12474 link Kimi
1566 2025-04-16 Dense Backpropagation Improves Training for Sparse Mixture-of-Experts Ashwinee Panda et.al. 2504.12463 link Kimi
1567 2025-04-16 Activated LoRA: Fine-tuned LLMs for Intrinsics Kristjan Greenewald et.al. 2504.12397 link Kimi
1568 2025-04-16 BitNet b1.58 2B4T Technical Report Shuming Ma et.al. 2504.12285 null Kimi
1569 2025-04-16 How Do I Do That? Synthesizing 3D Hand Motion and Contacts for Everyday Interactions Aditya Prakash et.al. 2504.12284 null Kimi
1570 2025-04-16 FLIP Reasoning Challenge Andreas Plesner et.al. 2504.12256 link Kimi
1571 2025-04-16 What Do Large Language Models Know? Tacit Knowledge as a Potential Causal-Explanatory Structure Céline Budding et.al. 2504.12187 null Kimi
1572 2025-04-16 SALAD: Improving Robustness and Generalization through Contrastive Learning with Structure-Aware and LLM-Driven Augmented Data Suyoung Bae et.al. 2504.12185 null Kimi
1573 2025-04-16 Efficient Contrastive Decoding with Probabilistic Hallucination Detection - Mitigating Hallucinations in Large Vision Language Models - Laura Fieback et.al. 2504.12137 null Kimi
1574 2025-04-16 Reasoning-Based AI for Startup Evaluation (R.A.I.S.E.): A Memory-Augmented, Multi-Step Decision Framework Jack Preuveneers et.al. 2504.12090 null Kimi
1575 2025-04-16 Purposefully Induced Psychosis (PIP): Embracing Hallucination as Imagination in Large Language Models Kris Pilcher et.al. 2504.12012 null Kimi
1576 2025-04-16 Generative Recommendation with Continuous-Token Diffusion Haohao Qu et.al. 2504.12007 null Kimi
1577 2025-04-16 Language Models as Quasi-Crystalline Thought: Structure, Constraint, and Emergence in Generative Systems Jose Manuel Guevara-Vela et.al. 2504.11986 null Kimi
1578 2025-04-16 ADAT: Time-Series-Aware Adaptive Transformer Architecture for Sign Language Translation Nada Shahin et.al. 2504.11942 null Kimi
1579 2025-04-16 Rethinking the Generation of High-Quality CoT Data from the Perspective of LLM-Adaptive Question Difficulty Grading Qianjin Yu et.al. 2504.11919 null Kimi
1580 2025-04-16 Evaluating the Goal-Directedness of Large Language Models Tom Everitt et.al. 2504.11844 link Kimi
1581 2025-04-16 FiSMiness: A Finite State Machine Based Paradigm for Emotional Support Conversations Yue Zhao et.al. 2504.11837 null Kimi
1582 2025-04-16 Déjà Vu: Multilingual LLM Evaluation through the Lens of Machine Translation Evaluation Julia Kreutzer et.al. 2504.11829 null Kimi
1583 2025-04-16 Cost-Efficient LLM Serving in the Cloud: VM Selection with KV Cache Offloading Kihyun Kim et.al. 2504.11816 link Kimi
1584 2025-04-16 Selective Attention Federated Learning: Improving Privacy and Efficiency for Clinical Text Classification Yue Li et.al. 2504.11793 null Kimi
1585 2025-04-16 Enhancing Web Agents with Explicit Rollback Mechanisms Zhisong Zhang et.al. 2504.11788 null Kimi
1586 2025-04-16 Shared Disk KV Cache Management for Efficient Multi-Instance Inference in RAG-Powered LLMs Hyungwoo Lee et.al. 2504.11765 null Kimi
1587 2025-04-16 Characterizing and Optimizing LLM Inference Workloads on CPU-GPU Coupled Architectures Prabhu Vellaisamy et.al. 2504.11750 null Kimi
1588 2025-04-16 Can GPT tell us why these images are synthesized? Empowering Multimodal Large Language Models for Forensics Yiran He et.al. 2504.11686 null Kimi
1589 2025-04-16 Steering Prosocial AI Agents: Computational Basis of LLM’s Decision Making in Social Simulation Ji Ma et.al. 2504.11671 null Kimi
1590 2025-04-15 GraphicBench: A Planning Benchmark for Graphic Design with Language Agents Dayeon Ki et.al. 2504.11571 null Kimi
1591 2025-04-15 ReTool: Reinforcement Learning for Strategic Tool Use in LLMs Jiazhan Feng et.al. 2504.11536 link Kimi
1592 2025-04-15 HypoBench: Towards Systematic and Principled Benchmarking for Hypothesis Generation Haokun Liu et.al. 2504.11524 null Kimi
1593 2025-04-15 TextArena Leon Guertler et.al. 2504.11442 link Kimi
1594 2025-04-15 A Dual-Space Framework for General Knowledge Distillation of Large Language Models Xue Zhang et.al. 2504.11426 null Kimi
1595 2025-04-15 A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce Wei Xiong et.al. 2504.11343 link Kimi
1596 2025-04-15 Transformer-Based Model for Cold Start Mitigation in FaaS Architecture Alexandre Savi Fayam Mbala Mouen et.al. 2504.11338 null Kimi
1597 2025-04-15 Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints Ruicheng Ao et.al. 2504.11320 link Kimi
1598 2025-04-15 Nondeterministic Polynomial-time Problem Challenge: An Ever-Scaling Reasoning Benchmark for LLMs Chang Yang et.al. 2504.11239 link Kimi
1599 2025-04-15 Video Summarization with Large Language Models Min Jung Lee et.al. 2504.11199 null Kimi
1600 2025-04-15 Benchmarking Next-Generation Reasoning-Focused Large Language Models in Ophthalmology: A Head-to-Head Evaluation on 5,888 Items Minjie Zou et.al. 2504.11186 null Kimi
1601 2025-04-15 DeepMLF: Multimodal language model with learnable tokens for deep fusion in sentiment analysis Efthymios Georgiou et.al. 2504.11082 null Kimi
1602 2025-04-15 Dynamic Compressing Prompts for Efficient Inference of Large Language Models Jinwu Hu et.al. 2504.11004 null Kimi
1603 2025-04-15 Efficient Reasoning Models: A Survey Sicheng Feng et.al. 2504.10903 link Kimi
1604 2025-04-15 ARise: Towards Knowledge-Augmented Reasoning via Risk-Adaptive Search Yize Zhang et.al. 2504.10893 null Kimi
1605 2025-04-15 Large Language Model-Informed Feature Discovery Improves Prediction and Interpretation of Credibility Perceptions of Visual Content Yilang Peng et.al. 2504.10878 null Kimi
1606 2025-04-15 Moving Beyond Next-Token Prediction: Transformers are Context-Sensitive Language Generators Phill Kyu Rhee et.al. 2504.10845 null Kimi
1607 2025-04-15 LayoutCoT: Unleashing the Deep Reasoning Potential of Large Language Models for Layout Generation Hengyu Shi et.al. 2504.10829 null Kimi
1608 2025-04-15 CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives Ayoung Lee et.al. 2504.10823 null Kimi
1609 2025-04-14 How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients Ming Li et.al. 2504.10766 link Kimi
1610 2025-04-14 ReasonDrive: Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models Amirhosein Chahe et.al. 2504.10757 link Kimi
1611 2025-04-14 CleanMAP: Distilling Multimodal LLMs for Confidence-Driven Crowdsourced HD Map Updates Ankit Kumar Shaw et.al. 2504.10738 null Kimi
1612 2025-04-14 HELIOS: Adaptive Model And Early-Exit Selection for Efficient LLM Inference Serving Avinash Kumar et.al. 2504.10724 null Kimi
1613 2025-04-14 Weight-of-Thought Reasoning: Exploring Neural Network Weights for Enhanced LLM Reasoning Saif Punjwani et.al. 2504.10646 link Kimi
1614 2025-04-14 Beyond Chains of Thought: Benchmarking Latent-Space Reasoning Abilities in Large Language Models Thilo Hagendorff et.al. 2504.10615 null Kimi
1615 2025-04-15 GUI-R1: A Generalist R1-Style Vision-Language Action Model For GUI Agents Xiaobo Xia et.al. 2504.10458 null Kimi
1616 2025-04-14 RealWebAssist: A Benchmark for Long-Horizon Web Assistance with Real-World Users Suyu Ye et.al. 2504.10445 link Kimi
1617 2025-04-14 Multimodal Long Video Modeling Based on Temporal Dynamic Context Haoran Hao et.al. 2504.10443 link Kimi
1618 2025-04-14 LLM Can be a Dangerous Persuader: Empirical Study of Persuasion Safety in Large Language Models Minqian Liu et.al. 2504.10430 null Kimi
1619 2025-04-14 LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models Parshin Shojaee et.al. 2504.10415 link Kimi
1620 2025-04-14 Performance of Large Language Models in Supporting Medical Diagnosis and Treatment Diogo Sousa et.al. 2504.10405 null Kimi
1621 2025-04-14 Forecasting from Clinical Textual Time Series: Adaptations of the Encoder and Decoder Language Model Families Shahriar Noroozizadeh et.al. 2504.10340 null Kimi
1622 2025-04-14 Heimdall: test-time scaling on the generative verification Wenlei Shi et.al. 2504.10337 null Kimi
1623 2025-04-14 AlayaDB: The Data Foundation for Efficient and Effective Long-context LLM Inference Yangshen Deng et.al. 2504.10326 null Kimi
1624 2025-04-14 Deep Reasoning Translation via Reinforcement Learning Jiaan Wang et.al. 2504.10187 link Kimi
1625 2025-04-14 HalluSearch at SemEval-2025 Task 3: A Search-Enhanced RAG Pipeline for Hallucination Detection Mohamed A. Abdallah et.al. 2504.10168 null Kimi
1626 2025-04-14 Breaking the Data Barrier – Building GUI Agents Through Task Generalization Junlei Zhang et.al. 2504.10127 link Kimi
1627 2025-04-14 CameraBench: Benchmarking Visual Reasoning in MLLMs via Photography I-Sheng Fang et.al. 2504.10090 null Kimi
1628 2025-04-14 RealSafe-R1: Safety-Aligned DeepSeek-R1 without Compromising Reasoning Capability Yichi Zhang et.al. 2504.10081 null Kimi
1629 2025-04-14 Mavors: Multi-granularity Video Representation for Multimodal Large Language Model Yang Shi et.al. 2504.10068 null Kimi
1630 2025-04-14 Hallucination Detection in LLMs via Topological Divergence on Attention Graphs Alexandra Bazarova et.al. 2504.10063 null Kimi
1631 2025-04-14 DataMosaic: Explainable and Verifiable Multi-Modal Data Analytics through Extract-Reason-Verify Zhengxuan Zhang et.al. 2504.10036 null Kimi
1632 2025-04-14 The Mirage of Performance Gains: Why Contrastive Decoding Fails to Address Multimodal Hallucination Hao Yin et.al. 2504.10020 null Kimi
1633 2025-04-14 Do We Really Need Curated Malicious Data for Safety Alignment in Multi-modal Large Language Models? Yanbo Wang et.al. 2504.10000 null Kimi
1634 2025-04-14 KeepKV: Eliminating Output Perturbation in KV Cache Compression for Efficient LLMs Inference Yuxuan Tian et.al. 2504.09936 null Kimi
1635 2025-04-14 FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding Zheng Liu et.al. 2504.09925 link Kimi
1636 2025-04-14 Reasoning Models Can Be Effective Without Thinking Wenjie Ma et.al. 2504.09858 null Kimi
1637 2025-04-14 A Survey of Large Language Model-Powered Spatial Intelligence Across Scales: Advances in Embodied Agents, Smart Cities, and Earth Science Jie Feng et.al. 2504.09848 null Kimi
1638 2025-04-14 OVERLORD: Ultimate Scaling of DataLoader for Multi-Source Large Foundation Model Training Juntao Zhao et.al. 2504.09844 null Kimi
1639 2025-04-14 Training Small Reasoning LLMs with Cognitive Preference Alignment Wenrui Cai et.al. 2504.09802 null Kimi
1640 2025-04-14 VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents Ryota Tanaka et.al. 2504.09795 null Kimi
1641 2025-04-14 Reasoning Court: Combining Reasoning, Action, and Judgment for Multi-Hop Reasoning Jingtian Wu et.al. 2504.09781 null Kimi
1642 2025-04-14 Understanding and Optimizing Multi-Stage AI Inference Pipelines Abhimanyu Rajeshkumar Bambhaniya et.al. 2504.09775 null Kimi
1643 2025-04-14 Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning Can Jin et.al. 2504.09772 link Kimi
1644 2025-04-13 Leveraging Reasoning Model Answers to Enhance Non-Reasoning Model Capability Haotian Wang et.al. 2504.09639 link Kimi
1645 2025-04-13 Metropolis-Hastings Captioning Game: Knowledge Fusion of Vision Language Models via Decentralized Bayesian Inference Yuta Matsui et.al. 2504.09620 null Kimi
1646 2025-04-10 Cat, Rat, Meow: On the Alignment of Language Model and Human Term-Similarity Judgments Lorenz Linhardt et.al. 2504.07965 null Kimi
1647 2025-04-10 PixelFlow: Pixel-Space Generative Models with Flow Shoufa Chen et.al. 2504.07963 link Kimi
1648 2025-04-10 GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation Lang Lin et.al. 2504.07962 null Kimi
1649 2025-04-10 Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction Zeren Jiang et.al. 2504.07961 link Kimi
1650 2025-04-10 CCMNet: Leveraging Calibrated Color Correction Matrices for Cross-Camera Color Constancy Dongyoung Kim et.al. 2504.07959 null Kimi
1651 2025-04-10 MM-IFEngine: Towards Multimodal Instruction Following Shengyuan Ding et.al. 2504.07957 link Kimi
1652 2025-04-10 VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning Yukun Qi et.al. 2504.07956 null Kimi
1653 2025-04-10 Perception-R1: Pioneering Perception Policy with Reinforcement Learning En Yu et.al. 2504.07954 link Kimi
1654 2025-04-10 Scaling Laws for Native Multimodal Models Mustafa Shukor et.al. 2504.07951 null Kimi
1655 2025-04-10 InteractAvatar: Modeling Hand-Face Interaction in Photorealistic Avatars with Deformable Gaussians Kefan Chen et.al. 2504.07949 null Kimi
1656 2025-04-10 GenEAva: Generating Cartoon Avatars with Fine-Grained Facial Expressions from Realistic Diffusion-based Faces Hao Yu et.al. 2504.07945 null Kimi
1657 2025-04-10 HoloPart: Generative 3D Part Amodal Segmentation Yunhan Yang et.al. 2504.07943 null Kimi
1658 2025-04-10 SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement Xiyao Wang et.al. 2504.07934 link Kimi
1659 2025-04-10 The Urban Impact of AI: Modeling Feedback Loops in Next-Venue Recommendation Giovanni Mauro et.al. 2504.07911 link Kimi
1660 2025-04-10 The Efficacy of Semantics-Preserving Transformations in Self-Supervised Learning for Medical Ultrasound Blake VanBerlo et.al. 2504.07904 null Kimi
1661 2025-04-10 Redefining Machine Translation on Social Network Services with Large Language Models Hongcheng Guo et.al. 2504.07901 link Kimi
1662 2025-04-10 How do Large Language Models Understand Relevance? A Mechanistic Interpretability Perspective Qi Liu et.al. 2504.07898 link Kimi
1663 2025-04-10 Fast Adaptation with Behavioral Foundation Models Harshit Sikchi et.al. 2504.07896 null Kimi
1664 2025-04-10 SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning Rui Pan et.al. 2504.07891 link Kimi
1665 2025-04-10 Benchmarking Adversarial Robustness to Bias Elicitation in Large Language Models: Scalable Automated Assessment with LLM-as-a-Judge Riccardo Cantini et.al. 2504.07887 link Kimi
1666 2025-04-10 Token Level Routing Inference System for Edge Devices Jianshu She et.al. 2504.07878 null Kimi
1667 2025-04-10 Dual Engines of Thoughts: A Depth-Breadth Integration Framework for Open-Ended Analysis Fei-Hsuan Yu et.al. 2504.07872 null Kimi
1668 2025-04-10 SAMJAM: Zero-Shot Video Scene Graph Generation for Egocentric Kitchen Videos Joshua Li et.al. 2504.07867 null Kimi
1669 2025-04-10 Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs Yichun Yin et.al. 2504.07866 null Kimi
1670 2025-04-10 2D-Curri-DPO: Two-Dimensional Curriculum Learning for Direct Preference Optimization Mengyang Li et.al. 2504.07856 null Kimi
1671 2025-04-10 The KL3M Data Project: Copyright-Clean Training Resources for Large Language Models Michael J Bommarito II et.al. 2504.07854 link Kimi
1672 2025-04-10 V2V3D: View-to-View Denoised 3D Reconstruction for Light-Field Microscopy Jiayin Zhao et.al. 2504.07853 null Kimi
1673 2025-04-10 Anytime Single-Step MAPF Planning with Anytime PIBT Nayesha Gandotra et.al. 2504.07841 null Kimi
1674 2025-04-10 Understanding Learner-LLM Chatbot Interactions and the Impact of Prompting Guidelines Cansu Koyuturk et.al. 2504.07840 null Kimi
1675 2025-04-10 Deceptive Automated Interpretability: Language Models Coordinating to Fool Oversight Systems Simon Lermen et.al. 2504.07831 null Kimi
1676 2025-04-10 MOSAIC: Modeling Social AI for Content Dissemination and Regulation in Multi-Agent Simulations Genglin Liu et.al. 2504.07830 link Kimi
1677 2025-04-10 Cluster-Driven Expert Pruning for Mixture-of-Experts Large Language Models Hongcheng Guo et.al. 2504.07807 link Kimi
1678 2025-04-10 On the Temporal Question-Answering Capabilities of Large Language Models Over Anonymized Data Alfredo Garrachón Ruiz et.al. 2504.07646 null Kimi
1679 2025-04-10 ConceptFormer: Towards Efficient Use of Knowledge-Graph Embeddings in Large Language Models Joel Barmettler et.al. 2504.07624 null Kimi
1680 2025-04-10 VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model Haozhan Shen et.al. 2504.07615 link Kimi
1681 2025-04-10 Boosting Universal LLM Reward Design through the Heuristic Reward Observation Space Evolution Zen Kit Heng et.al. 2504.07596 null Kimi
1682 2025-04-10 AI-Slop to AI-Polish? Aligning Language Models through Edit-Based Writing Rewards and Test-time Computation Tuhin Chakrabarty et.al. 2504.07532 link Kimi
1683 2025-04-10 Supervised Optimism Correction: Be Confident When LLMs Are Sure Junjie Zhang et.al. 2504.07527 null Kimi
1684 2025-04-10 VideoExpert: Augmented LLM for Temporal-Sensitive Video Understanding Henghao Zhao et.al. 2504.07519 null Kimi
1685 2025-04-10 GPT Carry-On: Training Foundation Model for Customization Could Be Simple, Scalable and Affordable Jianqiao Wangni et.al. 2504.07513 null Kimi
1686 2025-04-10 Kimi-VL Technical Report Kimi Team et.al. 2504.07491 link Kimi
1687 2025-04-10 Beyond LLMs: A Linguistic Approach to Causal Graph Generation from Narrative Texts Zehan Li et.al. 2504.07459 null Kimi
1688 2025-04-10 From Token to Line: Enhancing Code Generation with a Long-Term Perspective Tingwei Lu et.al. 2504.07433 null Kimi
1689 2025-04-10 TALE: A Tool-Augmented Framework for Reference-Free Evaluation of Large Language Models Sher Badshah et.al. 2504.07385 null Kimi
1690 2025-04-10 Enhancing Time Series Forecasting via Multi-Level Text Alignment with LLMs Taibiao Zhao et.al. 2504.07360 link Kimi
1691 2025-04-10 Revisiting Prompt Optimization with Large Reasoning Models-A Case Study on Event Extraction Saurabh Srivastava et.al. 2504.07357 null Kimi
1692 2025-04-09 Modeling Response Consistency in Multi-Agent LLM Systems: A Comparative Analysis of Shared and Separate Context Approaches Tooraj Helmi et.al. 2504.07303 null Kimi
1693 2025-04-09 SemEval-2025 Task 5: LLMs4Subjects – LLM-based Automated Subject Tagging for a National Technical Library’s Open-Access Catalog Jennifer D’Souza et.al. 2504.07199 link Kimi
1694 2025-04-09 HypoEval: Hypothesis-Guided Evaluation for Natural Language Generation Mingxuan Li et.al. 2504.07174 link Kimi
1695 2025-04-09 Sculpting Subspaces: Constrained Full Fine-Tuning in LLMs for Continual Learning Nikhil Shivakumar Nayak et.al. 2504.07097 link Kimi
1696 2025-04-09 OmniCaptioner: One Captioner to Rule Them All Yiting Lu et.al. 2504.07089 link Kimi
1697 2025-04-09 KG-LLM-Bench: A Scalable Benchmark for Evaluating LLM Reasoning on Textualized Knowledge Graphs Elan Markowitz et.al. 2504.07087 null Kimi
1698 2025-04-09 DeduCE: Deductive Consistency as a Framework to Evaluate LLM Reasoning Atharva Pandey et.al. 2504.07080 null Kimi
1699 2025-04-09 SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills Boyuan Zheng et.al. 2504.07079 null Kimi
1700 2025-04-09 HalluciNot: Hallucination Detection Through Context and Common Knowledge Verification Bibek Paudel et.al. 2504.07069 null Kimi
1701 2025-04-09 Review of Case-Based Reasoning for LLM Agents: Theoretical Foundations, Architectural Components, and Cognitive Integration Kostas Hatalis et.al. 2504.06943 null Kimi
1702 2025-04-09 Are Vision-Language Models Ready for Dietary Assessment? Exploring the Next Frontier in AI-Powered Food Image Recognition Sergio Romero-Tapiador et.al. 2504.06925 null Kimi
1703 2025-04-09 Integrating Cognitive Processing Signals into Language Models: A Review of Advances, Applications and Future Directions Angela Lopez-Cardona et.al. 2504.06843 null Kimi
1704 2025-04-09 LVC: A Lightweight Compression Framework for Enhancing VLMs in Long Video Understanding Ziyi Wang et.al. 2504.06835 null Kimi
1705 2025-04-09 Domain-Specific Pruning of Large Mixture-of-Experts Models with Few-shot Demonstrations Zican Dong et.al. 2504.06792 null Kimi
1706 2025-04-09 Zero-Shot Image-Based Large Language Model Approach to Road Pavement Monitoring Shuoshuo Xu et.al. 2504.06785 null Kimi
1707 2025-04-09 FamilyTool: A Multi-hop Personalized Tool Use Benchmark Yuxin Wang et.al. 2504.06766 link Kimi
1708 2025-04-09 EDIT: Enhancing Vision Transformers by Mitigating Attention Sink through an Encoder-Decoder Architecture Wenfeng Feng et.al. 2504.06738 null Kimi
1709 2025-04-09 A Neuro-inspired Interpretation of Unlearning in Large Language Models through Sample-level Unlearning Difficulty Xiaohua Feng et.al. 2504.06658 null Kimi
1710 2025-04-09 Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program Minghe Gao et.al. 2504.06606 link Kimi
1711 2025-04-09 Automated Business Process Analysis: An LLM-Based Approach to Value Assessment William De Michele et.al. 2504.06600 link Kimi
1712 2025-04-09 Right Prediction, Wrong Reasoning: Uncovering LLM Misalignment in RA Disease Diagnosis Umakanta Maharana et.al. 2504.06581 link Kimi
1713 2025-04-09 NeedleInATable: Exploring Long-Context Capability of Large Language Models towards Long-Structured Tables Lanrui Wang et.al. 2504.06560 null Kimi
1714 2025-04-09 Lugha-Llama: Adapting Large Language Models for African Languages Happy Buzaaba et.al. 2504.06536 null Kimi
1715 2025-04-08 Don’t Let It Hallucinate: Premise Verification via Retrieval-Augmented Logical Reasoning Yuehan Qin et.al. 2504.06438 null Kimi
1716 2025-04-08 S’MoRE: Structural Mixture of Residual Experts for LLM Fine-tuning Hanqing Zeng et.al. 2504.06426 null Kimi
1717 2025-04-08 Understanding Machine Unlearning Through the Lens of Mode Connectivity Jiali Cheng et.al. 2504.06407 null Kimi
1718 2025-04-08 GOLLuM: Gaussian Process Optimized LLMs – Reframing LLM Finetuning through Bayesian Optimization Bojana Ranković et.al. 2504.06265 link Kimi
1719 2025-04-09 Hogwild! Inference: Parallel LLM Generation via Concurrent Attention Gleb Rodionov et.al. 2504.06261 link Kimi
1720 2025-04-08 FEABench: Evaluating Language Models on Multiphysics Reasoning Ability Nayantara Mudur et.al. 2504.06260 link Kimi
1721 2025-04-08 Encoder-Decoder Gemma: Improving the Quality-Efficiency Trade-Off via Adaptation Biao Zhang et.al. 2504.06225 null Kimi
1722 2025-04-08 From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models Chejian Xu et.al. 2504.06214 null Kimi
1723 2025-04-08 TxGemma: Efficient and Agentic LLMs for Therapeutics Eric Wang et.al. 2504.06196 null Kimi
1724 2025-04-08 Navigating the Rabbit Hole: Emergent Biases in LLM-Generated Attack Narratives Targeting Mental Health Groups Rijul Magu et.al. 2504.06160 null Kimi
1725 2025-04-08 QGen Studio: An Adaptive Question-Answer Generation, Training and Evaluation Platform Movina Moses et.al. 2504.06136 null Kimi
1726 2025-04-08 Multi-Sense Embeddings for Language Models and Knowledge Distillation Qitong Wang et.al. 2504.06036 null Kimi
1727 2025-04-08 NativQA Framework: Enabling LLMs with Native, Local, and Everyday Knowledge Firoj Alam et.al. 2504.05995 null Kimi
1728 2025-04-08 PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario Sriram Mandalika et.al. 2504.05908 null Kimi
1729 2025-04-08 HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference Shuzhang Zhong et.al. 2504.05897 link Kimi
1730 2025-04-08 Agent Guide: A Simple Agent Behavioral Watermarking Framework Kaibo Huang et.al. 2504.05871 null Kimi
1731 2025-04-08 Are Generative AI Agents Effective Personalized Financial Advisors? Takehiro Takayanagi et.al. 2504.05862 link Kimi
1732 2025-04-08 How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM Jirong Zha et.al. 2504.05786 null Kimi
1733 2025-04-08 DDT: Decoupled Diffusion Transformer Shuai Wang et.al. 2504.05741 null Kimi
1734 2025-04-08 Rank-Then-Score: Enhancing Large Language Models for Automated Essay Scoring Yida Cai et.al. 2504.05736 null Kimi
1735 2025-04-08 STRIVE: A Think & Improve Approach with Iterative Refinement for Enhancing Question Quality Estimation Aniket Deroy et.al. 2504.05693 null Kimi
1736 2025-04-08 Towards Smarter Hiring: Are Zero-Shot and Few-Shot Pre-trained LLMs Ready for HR Spoken Interview Transcript Analysis? Subhankar Maity et.al. 2504.05683 null Kimi
1737 2025-04-08 Sugar-Coated Poison: Benign Generation Unlocks LLM Jailbreaking Yu-Hang Wu et.al. 2504.05652 link Kimi
1738 2025-04-08 TAGC: Optimizing Gradient Communication in Distributed Transformer Training Igor Polyakov et.al. 2504.05638 link Kimi
1739 2025-04-08 FactGuard: Leveraging Multi-Agent Systems to Generate Answerable and Unanswerable Questions for Enhanced Long-Context LLM Extraction Qian-Wen Zhang et.al. 2504.05607 link Kimi
1740 2025-04-08 ShadowCoT: Cognitive Hijacking for Stealthy Reasoning Backdoors in LLMs Gejian Zhao et.al. 2504.05605 null Kimi
1741 2025-04-08 Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought Yi Peng et.al. 2504.05599 null Kimi
1742 2025-04-08 DEL: Context-Aware Dynamic Exit Layer for Efficient Self-Speculative Decoding Hossein Entezari Zarch et.al. 2504.05598 null Kimi
1743 2025-04-08 Knowledge-Instruct: Effective Continual Pre-training from Limited Data using Instructions Oded Ovadia et.al. 2504.05571 null Kimi
1744 2025-04-07 Bridging Industrial Expertise and XR with LLM-Powered Conversational Agents Despina Tomkou et.al. 2504.05527 null Kimi
1745 2025-04-07 Fast Controlled Generation from Language Models with Adaptive Weighted Rejection Sampling Benjamin Lipkin et.al. 2504.05410 null Kimi
1746 2025-04-07 LiveVQA: Live Visual Knowledge Seeking Mingyang Fu et.al. 2504.05288 null Kimi
1747 2025-04-07 Learning to Reason Over Time: Timeline Self-Reflection for Improved Temporal Reasoning in Language Models Adrián Bazaga et.al. 2504.05258 null Kimi
1748 2025-04-07 Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling Hengran Zhang et.al. 2504.05216 null Kimi
1749 2025-04-07 Post-Training Language Models for Continual Relation Extraction Sefika Efeoglu et.al. 2504.05214 null Kimi
1750 2025-04-07 Concise Reasoning via Reinforcement Learning Mehdi Fatemi et.al. 2504.05185 link Kimi
1751 2025-04-07 VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks Yu Yue et.al. 2504.05118 null Kimi
1752 2025-04-07 AI for Climate Finance: Agentic Retrieval and Multi-Step Reasoning for Early Warning System Investments Saeid Ario Vaghefi et.al. 2504.05104 null Kimi
1753 2025-04-07 The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning Tianshi Zheng et.al. 2504.05081 null Kimi
1754 2025-04-07 Revealing the Intrinsic Ethical Vulnerability of Aligned Large Language Models Jiawei Lian et.al. 2504.05050 null Kimi
1755 2025-04-07 Debate Only When Necessary: Adaptive Multiagent Collaboration for Efficient LLM Reasoning Sugyeong Eo et.al. 2504.05047 null Kimi
1756 2025-04-07 Following the Whispers of Values: Unraveling Neural Mechanisms Behind Value-Oriented Behaviors in LLMs Ling Hu et.al. 2504.04994 null Kimi
1757 2025-04-07 Towards Visual Text Grounding of Multimodal Large Language Model Ming Li et.al. 2504.04974 null Kimi
1758 2025-04-07 M-Prometheus: A Suite of Open Multilingual LLM Judges José Pombal et.al. 2504.04953 link Kimi
1759 2025-04-07 A Llama walks into the ‘Bar’: Efficient Supervised Fine-Tuning for Legal Reasoning in the Multi-state Bar Exam Rean Fernandes et.al. 2504.04945 null Kimi
1760 2025-04-07 Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration Ran Xu et.al. 2504.04915 link Kimi
1761 2025-04-07 Leveraging Large Language Models for Cost-Effective, Multilingual Depression Detection and Severity Assessment Longdi Xian et.al. 2504.04891 null Kimi
1762 2025-04-07 Uni4D: A Unified Self-Supervised Learning Framework for Point Cloud Videos Zhi Zuo et.al. 2504.04837 null Kimi
1763 2025-04-07 Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models Ruikang Liu et.al. 2504.04823 link Kimi
1764 2025-04-07 Can LLMs Interpret and Leverage Structured Linguistic Representations? A Case Study with AMRs Ankush Raut et.al. 2504.04745 null Kimi
1765 2025-04-07 TathyaNyaya and FactLegalLlama: Advancing Factual Judgment Prediction and Explanation in the Indian Legal Context Shubham Kumar Nigam et.al. 2504.04737 null Kimi
1766 2025-04-07 Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use Anna Goldie et.al. 2504.04736 null Kimi
1767 2025-04-07 Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models Yubo Li et.al. 2504.04717 link Kimi
1768 2025-04-07 Sequential-NIAH: A Needle-In-A-Haystack Benchmark for Extracting Sequential Needles from Long Contexts Yifei Yu et.al. 2504.04713 null Kimi
1769 2025-04-07 LagKV: Lag-Relative Information of the KV Cache Tells Which Tokens Are Important Manlai Liang et.al. 2504.04704 link Kimi
1770 2025-04-07 R2Vul: Learning to Reason about Software Vulnerabilities with Reinforcement Learning and Structured Reasoning Distillation Martin Weyssow et.al. 2504.04699 link Kimi
1771 2025-04-07 LEO-MINI: An Efficient Multimodal Large Language Model using Conditional Token Reduction and Mixture of Multi-Modal Experts Yimu Wang et.al. 2504.04653 null Kimi
1772 2025-04-06 Splits! A Flexible Dataset for Evaluating a Model’s Demographic Social Inference Eylon Caplan et.al. 2504.04640 link Kimi
1773 2025-04-06 SECQUE: A Benchmark for Evaluating Real-World Financial Analysis Capabilities Noga Ben Yoash et.al. 2504.04596 null Kimi
1774 2025-04-06 The Point, the Vision and the Text: Does Point Cloud Boost Spatial Reasoning of Large Language Models? Weichen Zhang et.al. 2504.04540 null Kimi
1775 2025-04-06 An Empirical Comparison of Text Summarization: A Multi-Dimensional Evaluation of Large Language Models Anantharaman Janakiraman et.al. 2504.04534 null Kimi
1776 2025-04-03 Concept Lancet: Image Editing with Compositional Representation Transplant Jinqi Luo et.al. 2504.02828 null Kimi
1777 2025-04-03 On Vanishing Variance in Transformer Length Generalization Ruining Li et.al. 2504.02827 null Kimi
1778 2025-04-03 Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing Xiangyu Zhao et.al. 2504.02826 link Kimi
1779 2025-04-03 Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models Mateusz Pach et.al. 2504.02821 link Kimi
1780 2025-04-03 GMR-Conv: An Efficient Rotation and Reflection Equivariant Convolution Kernel Using Gaussian Mixture Rings Yuexi Du et.al. 2504.02819 link Kimi
1781 2025-04-03 Efficient Autoregressive Shape Generation via Octree-Based Adaptive Tokenization Kangle Deng et.al. 2504.02817 null Kimi
1782 2025-04-03 Generative Evaluation of Complex Reasoning in Large Language Models Haowei Lin et.al. 2504.02810 link Kimi
1783 2025-04-03 MegaMath: Pushing the Limits of Open Math Corpora Fan Zhou et.al. 2504.02807 link Kimi
1784 2025-04-03 A Survey of Large Language Models in Mental Health Disorder Detection on Social Media Zhuohan Ge et.al. 2504.02800 null Kimi
1785 2025-04-03 Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence Anita Rau et.al. 2504.02799 null Kimi
1786 2025-04-03 Spline-based Transformers Prashanth Chandran et.al. 2504.02797 null Kimi
1787 2025-04-03 A Framework for Situating Innovations, Opportunities, and Challenges in Advancing Vertical Systems with Large AI Models Gaurav Verma et.al. 2504.02793 null Kimi
1788 2025-04-03 Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets Chuning Zhu et.al. 2504.02792 null Kimi
1789 2025-04-03 A Framework for Robust Cognitive Evaluation of LLMs Karin de Langis et.al. 2504.02789 null Kimi
1790 2025-04-03 GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation Zhiyuan Yan et.al. 2504.02782 link Kimi
1791 2025-04-03 From Consumption to Collaboration: Measuring Interaction Patterns to Augment Human Cognition in Open-Ended Tasks Joshua Holstein et.al. 2504.02780 null Kimi
1792 2025-04-03 Multi-Head Adaptive Graph Convolution Network for Sparse Point Cloud-Based Human Activity Recognition Vincent Gbouna Zakka et.al. 2504.02778 link Kimi
1793 2025-04-03 MultiBLiMP 1.0: A Massively Multilingual Benchmark of Linguistic Minimal Pairs Jaap Jumelet et.al. 2504.02768 link Kimi
1794 2025-04-03 How Deep Do Large Language Models Internalize Scientific Literature and Citation Practices? Andres Algaba et.al. 2504.02767 link Kimi
1795 2025-04-03 Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model Shengjun Zhang et.al. 2504.02764 null Kimi
1796 2025-04-03 CanonNet: Canonical Ordering and Curvature Learning for Point Cloud Analysis Benjy Friedmann et.al. 2504.02763 null Kimi
1797 2025-04-03 RBR4DNN: Requirements-based Testing of Neural Networks Nusrat Jahan Mozumder et.al. 2504.02737 link Kimi
1798 2025-04-03 Enhancing LLM Robustness to Perturbed Instructions: An Empirical Study Aryan Agrawal et.al. 2504.02733 link Kimi
1799 2025-04-03 Why do LLMs attend to the first token? Federico Barbero et.al. 2504.02732 null Kimi
1800 2025-04-03 HQViT: Hybrid Quantum Vision Transformer for Image Classification Hui Zhang et.al. 2504.02730 null Kimi
1801 2025-04-03 ERPO: Advancing Safety Alignment via Ex-Ante Reasoning Preference Optimization Kehua Feng et.al. 2504.02725 null Kimi
1802 2025-04-03 Autonomous Human-Robot Interaction via Operator Imitation Sammy Christen et.al. 2504.02724 null Kimi
1803 2025-04-03 The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context Nikhil Verma et.al. 2504.02708 null Kimi
1804 2025-04-03 Responsible Development of Offensive AI Ryan Marinelli et.al. 2504.02701 link Kimi
1805 2025-04-03 Learning Phase Distortion with Selective State Space Models for Video Turbulence Mitigation Xingguang Zhang et.al. 2504.02697 link Kimi
1806 2025-04-03 Affordable AI Assistants with Knowledge Graph of Thoughts Maciej Besta et.al. 2504.02670 null Kimi
1807 2025-04-03 Inference-Time Scaling for Generalist Reward Modeling Zijun Liu et.al. 2504.02495 null Kimi
1808 2025-04-03 Cognitive Memory in Large Language Models Lianlei Shan et.al. 2504.02441 null Kimi
1809 2025-04-03 Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation Chuanqi Cheng et.al. 2504.02438 link Kimi
1810 2025-04-03 AnesBench: Multi-Dimensional Evaluation of LLM Reasoning in Anesthesiology Xiang Feng et.al. 2504.02404 link Kimi
1811 2025-04-03 CoTAL: Human-in-the-Loop Prompt Engineering, Chain-of-Thought Reasoning, and Active Learning for Generalizable Formative Assessment Scoring Clayton Cohn et.al. 2504.02323 null Kimi
1812 2025-04-03 MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism Ruidong Zhu et.al. 2504.02263 null Kimi
1813 2025-04-03 LLMs as Deceptive Agents: How Role-Based Prompting Induces Semantic Ambiguity in Puzzle Tasks Seunghyun Yoo et.al. 2504.02254 null Kimi
1814 2025-04-03 FT-Transformer: Resilient and Reliable Transformer with End-to-End Fault Tolerant Attention Huangliang Dai et.al. 2504.02211 null Kimi
1815 2025-04-03 More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment Yifan Wang et.al. 2504.02193 null Kimi
1816 2025-04-02 A Survey of Scaling in Large Language Model Reasoning Zihan Chen et.al. 2504.02181 null Kimi
1817 2025-04-02 OmniCellTOSG: The First Cell Text-Omic Signaling Graphs Dataset for Joint LLM and GNN Modeling Heming Zhang et.al. 2504.02148 link Kimi
1818 2025-04-02 On Simulation-Guided LLM-based Code Generation for Safe Autonomous Driving Software Ali Nouri et.al. 2504.02141 null Kimi
1819 2025-04-02 Achieving Unanimous Consensus in Decision Making Using Multi-Agents Apurba Pokharel et.al. 2504.02128 null Kimi
1820 2025-04-02 Exploring LLM Reasoning Through Controlled Prompt Variations Giannis Chatziveroglou et.al. 2504.02111 link Kimi
1821 2025-04-02 The LLM Wears Prada: Analysing Gender Bias and Stereotypes through Online Shopping Data Massimiliano Luca et.al. 2504.01951 null Kimi
1822 2025-04-02 OpenCodeReasoning: Advancing Data Distillation for Competitive Coding Wasi Uddin Ahmad et.al. 2504.01943 null Kimi
1823 2025-04-02 Critical Thinking: Which Kinds of Complexity Govern Optimal Reasoning Length? Celine Lee et.al. 2504.01935 link Kimi
1824 2025-04-02 A thorough benchmark of automatic text classification: From traditional approaches to large language models Washington Cunha et.al. 2504.01930 link Kimi
1825 2025-04-03 Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation Baban Gain et.al. 2504.01919 null Kimi
1826 2025-04-02 FineLIP: Extending CLIP’s Reach via Fine-Grained Alignment with Longer Text Inputs Mothilal Asokan et.al. 2504.01916 null Kimi
1827 2025-04-02 Advancing AI-Scientist Understanding: Making LLM Think Like a Physicist with Interpretable Reasoning Yinggan Xu et.al. 2504.01911 null Kimi
1828 2025-04-02 STAR-1: Safer Alignment of Reasoning LLMs with 1K Data Zijun Wang et.al. 2504.01903 null Kimi
1829 2025-04-02 TransientTables: Evaluating LLMs’ Reasoning on Temporally Evolving Semi-structured Tables Abhilash Shankarampeta et.al. 2504.01879 null Kimi
1830 2025-04-02 Cross-Lingual Consistency: A Novel Inference Framework for Advancing Reasoning in Large Language Models Zhiwei Yu et.al. 2504.01857 null Kimi
1831 2025-04-02 InfiniteICL: Breaking the Limit of Context Window Size via Long Short-term Memory Transformation Bowen Cao et.al. 2504.01707 null Kimi
1832 2025-04-02 ToM-RL: Reinforcement Learning Unlocks Theory of Mind in Small LLMs Yi-Long Lu et.al. 2504.01698 link Kimi
1833 2025-04-02 Testing Low-Resource Language Support in LLMs Using Language Proficiency Exams: the Case of Luxembourgish Cedric Lothritz et.al. 2504.01667 null Kimi
1834 2025-04-02 Enabling Systematic Generalization in Abstract Spatial Reasoning through Meta-Learning for Compositionality Philipp Mondorf et.al. 2504.01445 link Kimi
1835 2025-04-02 FAIRE: Assessing Racial and Gender Bias in AI-Driven Resume Evaluations Athena Wen et.al. 2504.01420 link Kimi
1836 2025-04-02 An Illusion of Progress? Assessing the Current State of Web Agents Tianci Xue et.al. 2504.01382 link Kimi
1837 2025-04-02 Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert Parallelism Design Mohan Zhang et.al. 2504.01337 null Kimi
1838 2025-04-02 Slow-Fast Architecture for Video Multi-Modal Large Language Models Min Shi et.al. 2504.01328 link Kimi
1839 2025-04-02 On Data Synthesis and Post-training for Visual Abstract Reasoning Ke Zhu et.al. 2504.01324 null Kimi
1840 2025-04-02 Adaptive Rectification Sampling for Test-Time Compute Scaling Zhendong Tan et.al. 2504.01317 link Kimi
1841 2025-04-02 ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning Bairu Hou et.al. 2504.01296 link Kimi
1842 2025-04-02 Scaling Test-Time Inference with Policy-Optimized, Dynamic Retrieval-Augmented Generation via KV Caching and Decoding Sakhinana Sagar Srinivas et.al. 2504.01281 null Kimi
1843 2025-04-01 Automated Factual Benchmarking for In-Car Conversational Systems using Large Language Models Rafael Giebisch et.al. 2504.01248 null Kimi
1844 2025-04-01 Detecting PTSD in Clinical Interviews: A Comparative Analysis of NLP Methods and Large Language Models Feng Chen et.al. 2504.01216 null Kimi
1845 2025-04-01 μKE: Matryoshka Unstructured Knowledge Editing of Large Language Models Zian Su et.al. 2504.01196 null Kimi
1846 2025-04-01 When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning Nishad Singhi et.al. 2504.01005 null Kimi
1847 2025-04-01 Token embeddings violate the manifold hypothesis Michael Robinson et.al. 2504.01002 null Kimi
1848 2025-04-01 MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization Siyuan Li et.al. 2504.00999 link Kimi
1849 2025-04-01 MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs Juncheng Wu et.al. 2504.00993 link Kimi
1850 2025-04-01 SentenceKV: Efficient LLM Inference via Sentence-Level Semantic KV Caching Yuxuan Zhu et.al. 2504.00970 null Kimi
1851 2025-04-01 Multi-Token Attention Olga Golovneva et.al. 2504.00927 null Kimi
1852 2025-04-01 Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents Saaket Agashe et.al. 2504.00906 link Kimi
1853 2025-03-31 Easi3R: Estimating Disentangled Motion from DUSt3R Without Training Xingyu Chen et.al. 2503.24391 link Kimi
1854 2025-03-31 RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy Zhonghan Zhao et.al. 2503.24388 null Kimi
1855 2025-03-31 Consistent Subject Generation via Contrastive Instantiated Concepts Lee Hsin-Ying et.al. 2503.24387 null Kimi
1856 2025-03-31 Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation Shengqiong Wu et.al. 2503.24379 null Kimi
1857 2025-03-31 ACPBench Hard: Unrestrained Reasoning about Action, Change, and Planning Harsha Kokel et.al. 2503.24378 null Kimi
1858 2025-03-31 Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models Rui Wang et.al. 2503.24377 link Kimi
1859 2025-03-31 Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1 Yi Chen et.al. 2503.24376 link Kimi
1860 2025-03-31 ERUPT: Efficient Rendering with Unposed Patch Transformer Maxim V. Shugaev et.al. 2503.24374 null Kimi
1861 2025-03-31 Effectively Controlling Reasoning Models through Thinking Intervention Tong Wu et.al. 2503.24370 null Kimi
1862 2025-03-31 Adapting Vision Foundation Models for Real-time Ultrasound Image Segmentation Xiaoran Zhang et.al. 2503.24368 null Kimi
1863 2025-03-31 Query and Conquer: Execution-Guided SQL Generation Łukasz Borchmann et.al. 2503.24364 null Kimi
1864 2025-03-31 SQuat: Subspace-orthogonal KV Cache Quantization Hao Wang et.al. 2503.24358 null Kimi
1865 2025-03-31 ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion Rana Muhammad Shahroz Khan et.al. 2503.24354 null Kimi
1866 2025-03-31 Can Test-Time Scaling Improve World Foundation Model? Wenyan Cong et.al. 2503.24320 link Kimi
1867 2025-03-31 BEATS: Bias Evaluation and Assessment Test Suite for Large Language Models Alok Abhishek et.al. 2503.24310 null Kimi
1868 2025-03-31 A Systematic Evaluation of LLM Strategies for Mental Health Text Analysis: Fine-tuning vs. Prompt Engineering vs. RAG Arshia Kermani et.al. 2503.24307 null Kimi
1869 2025-03-31 Order Matters: On Parameter-Efficient Image-to-Video Probing for Recognizing Nearly Symmetric Actions Thinesh Thiyakesan Ponbagavathi et.al. 2503.24298 null Kimi
1870 2025-03-31 Is analogy enough to draw novel adjective-noun inferences? Hayley Ross et.al. 2503.24293 link Kimi
1871 2025-03-31 Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model Jingcheng Hu et.al. 2503.24290 null Kimi
1872 2025-03-31 Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning Jiacheng Lin et.al. 2503.24289 link Kimi
1873 2025-03-31 Style Quantization for Data-Efficient GAN Training Jian Wang et.al. 2503.24282 null Kimi
1874 2025-03-31 Evaluating and Designing Sparse Autoencoders by Approximating Quasi-Orthogonality Sewoong Lee et.al. 2503.24277 link Kimi
1875 2025-03-31 FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics Yixuan Li et.al. 2503.24267 null Kimi
1876 2025-03-31 Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation Dun Yuan et.al. 2503.24245 null Kimi
1877 2025-03-31 Spatio-temporal Prediction of Fine-Grained Origin-Destination Matrices with Applications in Ridesharing Run Yang et.al. 2503.24237 null Kimi
1878 2025-03-31 What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models Qiyuan Zhang et.al. 2503.24235 link Kimi
1879 2025-03-31 PAARS: Persona Aligned Agentic Retail Shoppers Saab Mansour et.al. 2503.24228 null Kimi
1880 2025-03-31 MB-ORES: A Multi-Branch Object Reasoner for Visual Grounding in Remote Sensing Karim Radouane et.al. 2503.24219 link Kimi
1881 2025-03-31 All You Need is Sally-Anne: ToM in AI Strongly Supported After Surpassing Tests for 3-Year-Olds Nitay Alon et.al. 2503.24215 null Kimi
1882 2025-03-31 Synthetic News Generation for Fake News Classification Abdul Sittar et.al. 2503.24206 null Kimi
1883 2025-03-31 TwT: Thinking without Tokens by Habitual Reasoning Distillation with Multi-Teachers’ Guidance Jingxian Xu et.al. 2503.24198 null Kimi
1884 2025-03-31 Output Constraints as Attack Surface: Exploiting Structured Generation to Bypass LLM Safety Mechanisms Shuoming Zhang et.al. 2503.24191 null Kimi
1885 2025-03-31 Grounding Agent Reasoning in Image Schemas: A Neurosymbolic Approach to Embodied Cognition François Olivier et.al. 2503.24110 null Kimi
1886 2025-03-31 Artificial Conversations, Real Results: Fostering Language Detection with Synthetic Data Fatemeh Mohammadi et.al. 2503.24062 null Kimi
1887 2025-03-31 AirCache: Activating Inter-modal Relevancy KV Cache Compression for Efficient Large Vision-Language Model Inference Kai Huang et.al. 2503.23956 null Kimi
1888 2025-03-31 Model Hemorrhage and the Robustness Limits of Large Language Models Ziyang Ma et.al. 2503.23924 null Kimi
1889 2025-03-31 OrchMLLM: Orchestrate Multimodal Data with Batch Post-Balancing to Accelerate Multimodal Large Language Model Training Yijie Zheng et.al. 2503.23830 null Kimi
1890 2025-03-31 Expanding RL with Verifiable Rewards Across Diverse Domains Yi Su et.al. 2503.23829 null Kimi
1891 2025-03-31 Thinking Longer, Not Larger: Enhancing Software Engineering Agents via Scaling Test-Time Compute Yingwei Ma et.al. 2503.23803 link Kimi
1892 2025-03-31 Adaptive Layer-skipping in Pre-trained LLMs Xuan Luo et.al. 2503.23798 null Kimi
1893 2025-03-31 WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization Ine Gevers et.al. 2503.23779 null Kimi
1894 2025-03-31 Short-video Propagation Influence Rating: A New Real-world Dataset and A New Large Graph Model Dizhan Xue et.al. 2503.23746 link Kimi
1895 2025-03-31 LANID: LLM-assisted New Intent Discovery Lu Fan et.al. 2503.23740 link Kimi
1896 2025-03-31 AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization Yiyang Du et.al. 2503.23733 link Kimi
1897 2025-03-30 Question-Aware Knowledge Graph Prompting for Enhancing Large Language Models Haochen Liu et.al. 2503.23523 link Kimi
1898 2025-03-30 If an LLM Were a Character, Would It Know Its Own Story? Evaluating Lifelong Learning in LLMs Siqi Fan et.al. 2503.23514 null Kimi
1899 2025-03-30 RARE: Retrieval-Augmented Reasoning Modeling Zhengren Wang et.al. 2503.23513 link Kimi
1900 2025-03-30 Benchmarking Systematic Relational Reasoning with Large Language and Reasoning Models Irtaza Khalid et.al. 2503.23487 null Kimi
1901 2025-03-30 Order Independence With Finetuning Katrina Brown et.al. 2503.23483 null Kimi
1902 2025-03-27 Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model Abdelrahman Shaker et.al. 2503.21782 link Kimi
1903 2025-03-27 X$^{2}$-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction Weihao Yu et.al. 2503.21779 null Kimi
1904 2025-03-27 Video-R1: Reinforcing Video Reasoning in MLLMs Kaituo Feng et.al. 2503.21776 link Kimi
1905 2025-03-27 StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion Ziyu Guo et.al. 2503.21775 null Kimi
1906 2025-03-27 MemInsight: Autonomous Memory Augmentation for LLM Agents Rana Salama et.al. 2503.21760 null Kimi
1907 2025-03-27 Fwd2Bot: LVLM Visual Token Compression with Double Forward Bottleneck Adrian Bulat et.al. 2503.21757 null Kimi
1908 2025-03-27 LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis Shitian Zhao et.al. 2503.21749 null Kimi
1909 2025-03-27 GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics Arsham Gholamzadeh Khoee et.al. 2503.21735 null Kimi
1910 2025-03-27 Effective Skill Unlearning through Intervention and Abstention Yongce Li et.al. 2503.21730 link Kimi
1911 2025-03-27 ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation Zhicheng Lee et.al. 2503.21729 link Kimi
1912 2025-03-27 OccRobNet: Occlusion Robust Network for Accurate 3D Interacting Hand-Object Pose Estimation Mallika Garg et.al. 2503.21723 null Kimi
1913 2025-03-27 Collab: Controlled Decoding using Mixture of Agents for LLM Alignment Souradip Chakraborty et.al. 2503.21720 null Kimi
1914 2025-03-27 Outlier dimensions favor frequent tokens in language model Iuri Macocco et.al. 2503.21718 null Kimi
1915 2025-03-27 CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers? Jiefu Ou et.al. 2503.21717 link Kimi
1916 2025-03-27 Elementwise Layer Normalization Felix Stollenwerk et.al. 2503.21708 link Kimi
1917 2025-03-27 MAVERIX: Multimodal Audio-Visual Evaluation Reasoning IndeX Liuyue Xie et.al. 2503.21699 null Kimi
1918 2025-03-27 Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks Wenqi Zhang et.al. 2503.21696 link Kimi
1919 2025-03-27 AMA-SAM: Adversarial Multi-Domain Alignment of Segment Anything Model for High-Fidelity Histology Nuclei Segmentation Jiahe Qian et.al. 2503.21695 null Kimi
1920 2025-03-27 Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data Zhiyuan Ma et.al. 2503.21694 link Kimi
1921 2025-03-27 LLM-Gomoku: A Large Language Model-Based System for Strategic Gomoku with Self-Play and Reinforcement Learning Hui Wang et.al. 2503.21683 null Kimi
1922 2025-03-27 JiraiBench: A Bilingual Benchmark for Evaluating Large Language Models’ Detection of Human Self-Destructive Behavior Content in Jirai Community Yunze Xiao et.al. 2503.21679 null Kimi
1923 2025-03-27 How do language models learn facts? Dynamics, curricula and hallucinations Nicolas Zucchet et.al. 2503.21676 null Kimi
1924 2025-03-27 COMI-LINGUA: Expert Annotated Large-Scale Dataset for Multitask NLP in Hindi-English Code-Mixing Rajvee Sheth et.al. 2503.21670 null Kimi
1925 2025-03-27 Cognitive Science-Inspired Evaluation of Core Capabilities for Object Understanding in AI Danaja Rutar et.al. 2503.21668 null Kimi
1926 2025-03-27 UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning Zhengxi Lu et.al. 2503.21620 link Kimi
1927 2025-03-27 A Measure Based Generalizable Approach to Understandability Vikas Kushwaha et.al. 2503.21615 null Kimi
1928 2025-03-27 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond Xiaoye Qu et.al. 2503.21614 link Kimi
1929 2025-03-27 Evaluating book summaries from internal knowledge in Large Language Models: a cross-model and semantic consistency approach Javier Coronado-Blázquez et.al. 2503.21613 null Kimi
1930 2025-03-27 GenEdit: Compounding Operators and Continuous Improvement to Tackle Text-to-SQL in the Enterprise Karime Maamari et.al. 2503.21602 null Kimi
1931 2025-03-27 Prompt, Divide, and Conquer: Bypassing Large Language Model Safety Filters via Segmented and Distributed Prompt Processing Johan Wahréus et.al. 2503.21598 null Kimi
1932 2025-03-27 debug-gym: A Text-Based Environment for Interactive Debugging Xingdi Yuan et.al. 2503.21557 null Kimi
1933 2025-03-27 SWI: Speaking with Intent in Large Language Models Yuwei Yin et.al. 2503.21544 link Kimi
1934 2025-03-27 Keyword-Oriented Multimodal Modeling for Euphemism Identification Yuxue Hu et.al. 2503.21504 link Kimi
1935 2025-03-27 Harnessing Chain-of-Thought Metadata for Task Routing and Adversarial Prompt Detection Ryan Marinelli et.al. 2503.21464 link Kimi
1936 2025-03-27 An evaluation of LLMs and Google Translate for translation of selected Indian languages via sentiment and semantic analyses Rohitash Chandra et.al. 2503.21393 null Kimi
1937 2025-03-27 Controlling Large Language Model with Latent Actions Chengxing Jia et.al. 2503.21383 link Kimi
1938 2025-03-27 Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models Haoxiang Sun et.al. 2503.21380 link Kimi
1939 2025-03-27 ReFeed: Multi-dimensional Summarization Refinement with Reflective Reasoning on Feedback Taewon Yun et.al. 2503.21332 null Kimi
1940 2025-03-27 InternVL-X: Advancing and Accelerating InternVL Series with Efficient Visual Token Compression Dongchen Lu et.al. 2503.21307 link Kimi
1941 2025-03-27 ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition Yujie Liu et.al. 2503.21248 null Kimi
1942 2025-03-27 Bias-Aware Agent: Enhancing Fairness in AI-Driven Knowledge Retrieval Karanbir Singh et.al. 2503.21237 link Kimi
1943 2025-03-27 LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models Hengyuan Zhao et.al. 2503.21227 null Kimi
1944 2025-03-27 ZJUKLAB at SemEval-2025 Task 4: Unlearning via Model Merging Haoming Xu et.al. 2503.21088 link Kimi
1945 2025-03-27 EQ-Negotiator: An Emotion-Reasoning LLM Agent in Credit Dialogues Yuhan Liu et.al. 2503.21080 null Kimi
1946 2025-03-27 Rerouting Connection: Hybrid Computer Vision Analysis Reveals Visual Similarity Between Indus and Tibetan-Yi Corridor Writing Systems Ooha Lakkadi Reddy et.al. 2503.21074 link Kimi
1947 2025-03-26 Can Large Language Models Predict Associations Among Human Attitudes? Ana Ma et.al. 2503.21011 null Kimi
1948 2025-03-26 VinaBench: Benchmark for Faithful and Consistent Visual Narratives Silin Gao et.al. 2503.20871 null Kimi
1949 2025-03-26 Understanding R1-Zero-Like Training: A Critical Perspective Zichen Liu et.al. 2503.20783 link Kimi
1950 2025-03-27 Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning Huajie Tan et.al. 2503.20752 null Kimi
1951 2025-03-26 Optimal Scaling Laws for Efficiency Gains in a Theoretical Transformer-Augmented Sectional MoE Framework Soham Sane et.al. 2503.20750 null Kimi
1952 2025-03-27 Beyond Believability: Accurate Human Behavior Simulation with Fine-Tuned LLMs Yuxuan Lu et.al. 2503.20749 null Kimi
1953 2025-03-26 Vision as LoRA Han Wang et.al. 2503.20680 link Kimi
1954 2025-03-26 TAMA: A Human-AI Collaborative Thematic Analysis Framework Using Multi-Agent LLMs for Clinical Interviews Huimin Xu et.al. 2503.20666 null Kimi
1955 2025-03-26 Collaborative Storytelling and LLM: A Linguistic Analysis of Automatically-Generated Role-Playing Game Sessions Alessandro Maisto et.al. 2503.20623 null Kimi
1956 2025-03-26 Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention Disaggregation Yunkai Liang et.al. 2503.20552 link Kimi
1957 2025-03-26 Accelerate Parallelizable Reasoning via Parallel Decoding within One Sequence Yijiong Yu et.al. 2503.20533 link Kimi
1958 2025-03-26 StableToolBench-MirrorAPI: Modeling Tool Environments as Mirrors of 7,000+ Real-World APIs Zhicheng Guo et.al. 2503.20527 link Kimi
1959 2025-03-26 From Trial to Triumph: Advancing Long Video Understanding via Visual Context Sample Scaling and Self-reward Alignment Yucheng Suo et.al. 2503.20472 null Kimi
1960 2025-03-26 MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation Rongyu Zhang et.al. 2503.20384 null Kimi
1961 2025-03-26 VideoGEM: Training-free Action Grounding in Videos Felix Vogel et.al. 2503.20348 null Kimi
1962 2025-03-26 Iterative Prompting with Persuasion Skills in Jailbreaking Large Language Models Shih-Wen Ke et.al. 2503.20320 null Kimi
1963 2025-03-26 QualiSpeech: A Speech Quality Assessment Dataset with Natural Language Reasoning and Descriptions Siyin Wang et.al. 2503.20290 null Kimi
1964 2025-03-26 sudo rm -rf agentic_security Sejin Lee et.al. 2503.20279 link Kimi
1965 2025-03-26 ViLBench: A Suite for Vision-Language Process Reward Modeling Haoqin Tu et.al. 2503.20271 null Kimi
1966 2025-03-26 Qwen2.5-Omni Technical Report Jin Xu et.al. 2503.20215 null Kimi
1967 2025-03-26 SARGes: Semantically Aligned Reliable Gesture Generation via Intent Chain Nan Gao et.al. 2503.20202 null Kimi
1968 2025-03-26 Open Deep Search: Democratizing Search with Open-source Reasoning Agents Salaheddin Alzubi et.al. 2503.20201 link Kimi
1969 2025-03-25 Can Multi-modal (reasoning) LLMs work as deepfake detectors? Simiao Ren et.al. 2503.20084 null Kimi
1970 2025-03-25 Cross-Tokenizer Distillation via Approximate Likelihood Matching Benjamin Minixhofer et.al. 2503.20083 link Kimi
1971 2025-03-25 OmniNova: A General Multimodal Agent Framework Pengfei Du et.al. 2503.20028 null Kimi
1972 2025-03-25 ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback Bohan Zhai et.al. 2503.19988 link Kimi
1973 2025-03-25 LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation Han Chen et.al. 2503.19950 link Kimi
1974 2025-03-25 CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning Hao Yu et.al. 2503.19900 link Kimi
1975 2025-03-25 Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking Xiaoyu Tian et.al. 2503.19855 null Kimi
1976 2025-03-25 FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs Carlos Plou et.al. 2503.19850 null Kimi
1977 2025-03-25 A Comparative Analysis of Word Segmentation, Part-of-Speech Tagging, and Named Entity Recognition for Historical Chinese Sources, 1900-1950 Zhao Fang et.al. 2503.19844 null Kimi
1978 2025-03-25 PAVE: Patching and Adapting Video Large Language Models Zhuoming Liu et.al. 2503.19794 link Kimi
1979 2025-03-25 Gemma 3 Technical Report Gemma Team et.al. 2503.19786 null Kimi
1980 2025-03-25 AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation Itay Nakash et.al. 2503.19693 link Kimi
1981 2025-03-25 1.4 Million Open-Source Distilled Reasoning Dataset to Empower Large Language Model Training Han Zhao et.al. 2503.19633 null Kimi
1982 2025-03-25 Innate Reasoning is Not Enough: In-Context Learning Enhances Reasoning Large Language Models with Less Overthinking Yuyao Ge et.al. 2503.19602 null Kimi
1983 2025-03-25 Scaling Laws of Synthetic Data for Language Models Zeyu Qin et.al. 2503.19551 null Kimi
1984 2025-03-25 FLEX: A Benchmark for Evaluating Robustness of Fairness in Large Language Models Dahyun Jung et.al. 2503.19540 link Kimi
1985 2025-03-25 ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning Mingyang Chen et.al. 2503.19470 null Kimi
1986 2025-03-25 DeCAP: Context-Adaptive Prompt Generation for Debiasing Zero-shot Question Answering in Large Language Models Suyoung Bae et.al. 2503.19426 null Kimi
1987 2025-03-25 Process or Result? Manipulated Ending Tokens Can Mislead Reasoning LLMs to Ignore the Correct Reasoning Steps Yu Cui et.al. 2503.19326 null Kimi
1988 2025-03-25 Long-Context Autoregressive Video Modeling with Next-Frame Prediction Yuchao Gu et.al. 2503.19325 link Kimi
1989 2025-03-25 Context-Aware Semantic Segmentation: Enhancing Pixel-Level Understanding with Large Language Models for Advanced Vision Applications Ben Rahman et.al. 2503.19276 null Kimi
1990 2025-03-25 MARS: Memory-Enhanced Agents with Reflective Self-improvement Xuechen Liang et.al. 2503.19271 null Kimi
1991 2025-03-25 Linguistic Blind Spots of Large Language Models Jiali Cheng et.al. 2503.19260 null Kimi
1992 2025-03-25 SCI-IDEA: Context-Aware Scientific Ideation Using Token and Sentence Embeddings Farhana Keya et.al. 2503.19257 null Kimi
1993 2025-03-24 A Survey of Large Language Model Agents for Question Answering Murong Yue et.al. 2503.19213 null Kimi
1994 2025-03-24 Overtrained Language Models Are Harder to Fine-Tune Jacob Mitchell Springer et.al. 2503.19206 null Kimi
1995 2025-03-24 Language Model Uncertainty Quantification with Attention Chain Yinghao Li et.al. 2503.19168 link Kimi
1996 2025-03-24 LLM-Based Insight Extraction for Contact Center Analytics and Cost-Efficient Deployment Varsha Embar et.al. 2503.19090 null Kimi
1997 2025-03-24 Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization Zhanda Zhu et.al. 2503.19050 link Kimi
1998 2025-03-24 LookAhead Tuning: Safer Language Models via Partial Answer Previews Kangwei Liu et.al. 2503.19041 link Kimi
1999 2025-03-24 Exploring Training and Inference Scaling Laws in Generative Retrieval Hongru Cai et.al. 2503.18941 link Kimi
2000 2025-03-24 xKV: Cross-Layer SVD for KV-Cache Compression Chi-Chih Chang et.al. 2503.18893 link Kimi
2001 2025-03-24 SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild Weihao Zeng et.al. 2503.18892 null Kimi
2002 2025-03-24 AgentDropout: Dynamic Agent Elimination for Token-Efficient and High-Performance LLM-Based Multi-Agent Collaboration Zhexuan Wang et.al. 2503.18891 link Kimi
2003 2025-03-24 I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders Andrey Galichin et.al. 2503.18878 link Kimi
2004 2025-03-24 EconEvals: Benchmarks and Litmus Tests for LLM Agents in Unknown Environments Sara Fish et.al. 2503.18825 null Kimi
2005 2025-03-24 REALM: A Dataset of Real-World LLM Use Cases Jingwen Cheng et.al. 2503.18792 null Kimi
2006 2025-03-24 BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV Cache Dayou Du et.al. 2503.18773 link Kimi
2007 2025-03-24 AlphaSpace: Enabling Robotic Actions through Semantic Tokenization and Symbolic Reasoning Alan Dao et.al. 2503.18769 null Kimi
2008 2025-03-24 Commander-GPT: Fully Unleashing the Sarcasm Detection Capability of Multi-Modal Large Language Models Yazhou Zhang et.al. 2503.18681 null Kimi
2009 2025-03-24 Distil-xLSTM: Learning Attention Mechanisms through Recurrent Structures Abdoul Majid O. Thiombiano et.al. 2503.18565 null Kimi
2010 2025-03-24 Self-Reported Confidence of Large Language Models in Gastroenterology: Analysis of Commercial, Open-Source, and Quantized Models Nariman Naderi et.al. 2503.18562 null Kimi
2011 2025-03-24 Instruction-Aligned Visual Attention for Mitigating Hallucinations in Large Vision-Language Models Bin Li et.al. 2503.18556 null Kimi
2012 2025-03-24 SciClaims: An End-to-End Generative System for Biomedical Claim Analysis Raúl Ortega et.al. 2503.18526 null Kimi
2013 2025-03-24 Verbal Process Supervision Elicits Better Coding Agents Hao-Yuan Chen et.al. 2503.18494 null Kimi
2014 2025-03-24 Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding Xiangrui Liu et.al. 2503.18478 null Kimi
2015 2025-03-24 A Simple yet Effective Layout Token in Large Language Models for Document Understanding Zhaoqing Zhu et.al. 2503.18434 null Kimi
2016 2025-03-24 Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning Junsong Li et.al. 2503.18432 null Kimi
2017 2025-03-24 Breaking the Encoder Barrier for Seamless Video-Language Understanding Handong Li et.al. 2503.18422 null Kimi
2018 2025-03-24 J&H: Evaluating the Robustness of Large Language Models Under Knowledge-Injection Attacks in Legal Domain Yiran Hu et.al. 2503.18360 link Kimi
2019 2025-03-24 Bridging Writing Manner Gap in Visual Instruction Tuning by Creating LLM-aligned Instructions Dong Jing et.al. 2503.18320 null Kimi
2020 2025-03-24 Jenga: Effective Memory Management for Serving LLM with Heterogeneity Chen Zhang et.al. 2503.18292 null Kimi
2021 2025-03-24 Sun-Shine: A Large Language Model for Tibetan Culture Cheng Huang et.al. 2503.18288 link Kimi
2022 2025-03-24 TopV: Compatible Token Pruning with Inference Time Optimization for Fast and Low-Memory Multimodal Vision Language Model Cheng Yang et.al. 2503.18278 null Kimi
2023 2025-03-24 Bridging Emotions and Architecture: Sentiment Analysis in Modern Distributed Systems Mahak Shah et.al. 2503.18260 null Kimi
2024 2025-03-23 ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices Aneesh Vathul et.al. 2503.18242 null Kimi
2025 2025-03-23 Unmasking Deceptive Visuals: Benchmarking Multimodal Large Language Models on Misleading Chart Question Answering Zixin Chen et.al. 2503.18172 null Kimi
2026 2025-03-23 LocDiffusion: Identifying Locations on Earth by Diffusing in the Hilbert Space Zhangyu Wang et.al. 2503.18142 null Kimi
2027 2025-03-23 AGIR: Assessing 3D Gait Impairment with Reasoning based on LLMs Diwei Wang et.al. 2503.18141 null Kimi
2028 2025-03-23 GeoBenchX: Benchmarking LLMs for Multistep Geospatial Tasks Varvara Krechetova et.al. 2503.18129 link Kimi
2029 2025-03-20 Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation Yuqing Wang et.al. 2503.16430 null Kimi
2030 2025-03-20 XAttention: Block Sparse Attention with Antidiagonal Scoring Ruyi Xu et.al. 2503.16428 link Kimi
2031 2025-03-20 DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding Keyan Chen et.al. 2503.16426 link Kimi
2032 2025-03-20 Tokenize Image as a Set Zigang Geng et.al. 2503.16425 link Kimi
2033 2025-03-20 1000+ FPS 4D Gaussian Splatting for Dynamic Scene Rendering Yuheng Yuan et.al. 2503.16422 null Kimi
2034 2025-03-20 Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models Yang Sui et.al. 2503.16419 link Kimi
2035 2025-03-20 Survey on Evaluation of LLM-based Agents Asaf Yehudai et.al. 2503.16416 null Kimi
2036 2025-03-20 M3: 3D-Spatial MultiModal Memory Xueyan Zou et.al. 2503.16413 link Kimi
2037 2025-03-20 RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints Yiran Qin et.al. 2503.16408 null Kimi
2038 2025-03-20 The Emperor’s New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination Yifan Sun et.al. 2503.16402 link Kimi
2039 2025-03-20 SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation Chun-Han Yao et.al. 2503.16396 null Kimi
2040 2025-03-20 Do Visual Imaginations Improve Vision-and-Language Navigation Agents? Akhil Perincherry et.al. 2503.16394 null Kimi
2041 2025-03-20 Attentional Triple-Encoder Network in Spatiospectral Domains for Medical Image Segmentation Kristin Qi et.al. 2503.16389 null Kimi
2042 2025-03-20 Deconstructing Long Chain-of-Thought: A Structured Reasoning Optimization Framework for Long CoT Distillation Yijia Luo et.al. 2503.16385 link Kimi
2043 2025-03-20 LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images Leyang Wang et.al. 2503.16376 null Kimi
2044 2025-03-20 NuiScene: Exploring Efficient Generation of Unbounded Outdoor Scenes Han-Hung Lee et.al. 2503.16375 link Kimi
2045 2025-03-20 JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse Muyao Li et.al. 2503.16365 null Kimi
2046 2025-03-20 Neural Networks: According to the Principles of Grassmann Algebra Z. Zarezadeh et.al. 2503.16364 null Kimi
2047 2025-03-20 CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners Yunzhi Yao et.al. 2503.16356 link Kimi
2048 2025-03-20 Enhancing Software Quality Assurance with an Adaptive Differential Evolution based Quantum Variational Autoencoder-Transformer Model Seshu Babu Barma et.al. 2503.16335 null Kimi
2049 2025-03-20 LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates Ying Shen et.al. 2503.16334 null Kimi
2050 2025-03-20 OmniGeo: Towards a Multimodal Large Language Models for Geospatial Artificial Intelligence Long Yuan et.al. 2503.16326 null Kimi
2051 2025-03-20 Bridging Technology and Humanities: Evaluating the Impact of Large Language Models on Social Sciences Research with DeepSeek-R1 Peiran Gu et.al. 2503.16304 null Kimi
2052 2025-03-20 Unleashing Vecset Diffusion Model for Fast Shape Generation Zeqiang Lai et.al. 2503.16302 link Kimi
2053 2025-03-20 PSA-MIL: A Probabilistic Spatial Attention-Based Multiple Instance Learning for Whole Slide Image Classification Sharon Peled et.al. 2503.16284 link Kimi
2054 2025-03-20 Chain of Functions: A Programmatic Pipeline for Fine-Grained Chart Reasoning Data Zijian Li et.al. 2503.16260 null Kimi
2055 2025-03-20 Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models Keda Tao et.al. 2503.16257 null Kimi
2056 2025-03-20 M2N2V2: Multi-Modal Unsupervised and Training-free Interactive Segmentation Markus Karmann et.al. 2503.16254 null Kimi
2057 2025-03-20 Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning Zhaowei Liu et.al. 2503.16252 link Kimi
2058 2025-03-20 AI Agents in Cryptoland: Practical Attacks and No Silver Bullet Atharv Singh Patlan et.al. 2503.16248 null Kimi
2059 2025-03-20 Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t Quy-Anh Dang et.al. 2503.16219 link Kimi
2060 2025-03-20 Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation Andrea Maracani et.al. 2503.16184 null Kimi
2061 2025-03-20 SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs Shibo Jie et.al. 2503.16163 null Kimi
2062 2025-03-20 Tuning LLMs by RAG Principles: Towards LLM-native Memory Jiale Wei et.al. 2503.16071 link Kimi
2063 2025-03-20 PromptHash: Affinity-Prompted Collaborative Cross-Modal Learning for Adaptive Hashing Retrieval Qiang Zou et.al. 2503.16064 link Kimi
2064 2025-03-20 Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts Yike Yuan et.al. 2503.16057 null Kimi
2065 2025-03-20 Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond Yaoyao Yu et.al. 2503.16040 null Kimi
2066 2025-03-20 Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models Zhihang Liu et.al. 2503.16036 link Kimi
2067 2025-03-20 The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement Ruihan Yang et.al. 2503.16024 null Kimi
2068 2025-03-20 Autonomous AI imitators increase diversity in homogeneous information ecosystems Emil Bakkensen Johansen et.al. 2503.16021 null Kimi
2069 2025-03-20 GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions Xiaomeng Chu et.al. 2503.16013 null Kimi
2070 2025-03-20 Adaptive Group Policy Optimization: Towards Stable Training and Token-Efficient Reasoning Chen Li et.al. 2503.15952 null Kimi
2071 2025-03-20 Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment Gaole Dai et.al. 2503.15937 null Kimi
2072 2025-03-20 SPIN: Accelerating Large Language Model Inference with Heterogeneous Speculative Models Fahao Chen et.al. 2503.15921 null Kimi
2073 2025-03-20 DeepPsy-Agent: A Stage-Aware and Deep-Thinking Emotional Support Agent System Kai Chen et.al. 2503.15876 null Kimi
2074 2025-03-20 MASH-VLM: Mitigating Action-Scene Hallucination in Video-LLMs through Disentangled Spatial-Temporal Representations Kyungho Bae et.al. 2503.15871 null Kimi
2075 2025-03-20 Uncertainty Quantification and Confidence Calibration in Large Language Models: A Survey Xiaoou Liu et.al. 2503.15850 null Kimi
2076 2025-03-20 Entropy-based Exploration Conduction for Multi-step Reasoning Jinghan Zhang et.al. 2503.15848 null Kimi
2077 2025-03-20 Grammar and Gameplay-aligned RL for Game Description Generation with LLMs Tsunehiko Tanaka et.al. 2503.15783 null Kimi
2078 2025-03-19 UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction Shravan Nayak et.al. 2503.15661 null Kimi
2079 2025-03-19 LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning Federico Cocchi et.al. 2503.15621 link Kimi
2080 2025-03-19 Dynamic Bi-Elman Attention Networks (DBEAN): Dual-Directional Context-Aware Representation Learning for Enhanced Text Classification ZhengLin Lai et.al. 2503.15469 link Kimi
2081 2025-03-19 SemEval-2025 Task 1: AdMIRe – Advancing Multimodal Idiomaticity Representation Thomas Pickard et.al. 2503.15358 null Kimi
2082 2025-03-19 MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration David Wan et.al. 2503.15272 null Kimi
2083 2025-03-19 Do Chains-of-Thoughts of Large Language Models Suffer from Hallucinations, Cognitive Biases, or Phobias in Bayesian Reasoning? Roberto Araya et.al. 2503.15268 null Kimi
2084 2025-03-19 Efficient allocation of image recognition and LLM tasks on multi-GPU system Marcin Lawenda et.al. 2503.15252 null Kimi
2085 2025-03-19 Automated Non-Functional Requirements Generation in Software Engineering with Large Language Models: A Comparative Study Jomar Thomas Almonte et.al. 2503.15248 null Kimi
2086 2025-03-19 BigO(Bench) – Can LLMs Generate Code with Controlled Time and Space Complexity? Pierre Chambon et.al. 2503.15242 link Kimi
2087 2025-03-19 Exploring Large Language Models for Word Games: Who is the Spy? Chentian Wei et.al. 2503.15235 link Kimi
2088 2025-03-19 CoE: Chain-of-Explanation via Automatic Visual Concept Circuit Description and Polysemanticity Quantification Wenlong Yu et.al. 2503.15234 link Kimi
2089 2025-03-19 A Review on Large Language Models for Visual Analytics Navya Sonal Agarwal et.al. 2503.15176 null Kimi
2090 2025-03-19 Machine Unlearning in Hyperbolic vs. Euclidean Multimodal Contrastive Learning: Adapting Alignment Calibration to MERU Àlex Pujol Vidal et.al. 2503.15166 null Kimi
2091 2025-03-19 VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making Mohamed Salim Aissi et.al. 2503.15108 null Kimi
2092 2025-03-19 Towards Understanding the Safety Boundaries of DeepSeek Models: Evaluation and Findings Zonghao Ying et.al. 2503.15092 link Kimi
2093 2025-03-19 Prada: Black-Box LLM Adaptation with Private Data on Resource-Constrained Devices Ziyao Wang et.al. 2503.14932 null Kimi
2094 2025-03-19 MASS: Mathematical Data Selection via Skill Graphs for Pretraining Large Language Models Jiazheng Li et.al. 2503.14917 null Kimi
2095 2025-03-19 Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations Shuo Li et.al. 2503.14895 null Kimi
2096 2025-03-19 MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer Honglin Lin et.al. 2503.14891 link Kimi
2097 2025-03-19 Communication-Efficient Distributed On-Device LLM Inference Over Wireless Networks Kai Zhang et.al. 2503.14882 null Kimi
2098 2025-03-19 Exploring the Limits of KV Cache Compression in Visual Autoregressive Transformers Bo Chen et.al. 2503.14881 null Kimi
2099 2025-03-19 LogLLaMA: Transformer-based log anomaly detection with LLaMA Zhuoyi Yang et.al. 2503.14849 null Kimi
2100 2025-03-18 RAGO: Systematic Performance Optimization for Retrieval-Augmented Generation Serving Wenqi Jiang et.al. 2503.14649 null Kimi
2101 2025-03-18 Dynamic Accumulated Attention Map for Interpreting Evolution of Decision-Making in Vision Transformer Yi Liao et.al. 2503.14640 link Kimi
2102 2025-03-18 Assessing Large Language Models for Automated Feedback Generation in Learning Programming Problem Solving Priscylla Silva et.al. 2503.14630 link Kimi
2103 2025-03-18 Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives Sara Sarto et.al. 2503.14604 link Kimi
2104 2025-03-19 State Space Model Meets Transformer: A New Paradigm for 3D Object Detection Chuxin Wang et.al. 2503.14493 null Kimi
2105 2025-03-18 DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers Minglei Shi et.al. 2503.14487 null Kimi
2106 2025-03-18 Gricean Norms as a Basis for Effective Collaboration Fardin Saad et.al. 2503.14484 link Kimi
2107 2025-03-18 LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers Nikhil Abhyankar et.al. 2503.14434 link Kimi
2108 2025-03-18 PLAY2PROMPT: Zero-shot Tool Instruction Optimization for LLM Agents via Tool Play Wei Fang et.al. 2503.14432 null Kimi
2109 2025-03-18 VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation Shoubin Yu et.al. 2503.14350 null Kimi
2110 2025-03-18 DualToken: Towards Unifying Visual Understanding and Generation with Dual Visual Vocabularies Wei Song et.al. 2503.14324 link Kimi
2111 2025-03-18 DARS: Dynamic Action Re-Sampling to Enhance Coding Agent Performance by Adaptive Tree Traversal Vaibhav Aggarwal et.al. 2503.14269 link Kimi
2112 2025-03-18 Speculative Decoding for Verilog: Speed and Quality, All in One Changran Xu et.al. 2503.14153 null Kimi
2113 2025-03-18 Inference-Time Intervention in Large Language Models for Reliable Requirement Verification Paul Darm et.al. 2503.14130 null Kimi
2114 2025-03-18 Growing a Twig to Accelerate Large Vision-Language Models Zhenwei Shao et.al. 2503.14075 null Kimi
2115 2025-03-18 Fast Autoregressive Video Generation with Diagonal Decoding Yang Ye et.al. 2503.14070 null Kimi
2116 2025-03-18 Empowering Smaller Models: Tuning LLaMA and Gemma with Chain-of-Thought for Ukrainian Exam Tasks Mykyta Syromiatnikov et.al. 2503.13988 link Kimi
2117 2025-03-18 Improving LLM Video Understanding with 16 Frames Per Second Yixuan Li et.al. 2503.13956 null Kimi
2118 2025-03-18 ConSCompF: Consistency-focused Similarity Comparison Framework for Generative Large Language Models Alexey Karev et.al. 2503.13923 null Kimi
2119 2025-03-18 Automatic MILP Model Construction for Multi-Robot Task Allocation and Scheduling Based on Large Language Models Mingming Peng et.al. 2503.13813 null Kimi
2120 2025-03-18 LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation Yang Zhou et.al. 2503.13794 null Kimi
2121 2025-03-17 Mitigating KV Cache Competition to Enhance User Experience in LLM Inference Haiying Shen et.al. 2503.13773 null Kimi
2122 2025-03-17 Do Large Language Models Understand Performance Optimization? Bowen Cui et.al. 2503.13772 null Kimi
2123 2025-03-17 MetaScale: Test-Time Scaling with Evolving Meta-Thoughts Qin Liu et.al. 2503.13447 null Kimi
2124 2025-03-17 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning Ye Liu et.al. 2503.13444 link Kimi
2125 2025-03-17 xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference Maximilian Beck et.al. 2503.13427 link Kimi
2126 2025-03-17 MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research James Burgess et.al. 2503.13399 link Kimi
2127 2025-03-17 Cream of the Crop: Harvesting Rich, Scalable and Transferable Multi-Modal Data for Instruction Fine-Tuning Mengyao Lyu et.al. 2503.13383 null Kimi
2128 2025-03-17 TimeZero: Temporal Video Grounding with Reasoning-Guided LVLM Ye Wang et.al. 2503.13377 link Kimi
2129 2025-03-17 Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning Hai-Long Sun et.al. 2503.13360 null Kimi
2130 2025-03-17 Computation Mechanism Behind LLM Position Generalization Chi Han et.al. 2503.13305 null Kimi
2131 2025-03-17 A Survey on Transformer Context Extension: Approaches and Evaluation Yijun Liu et.al. 2503.13299 null Kimi
2132 2025-03-17 $φ$-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation Fangzhi Xu et.al. 2503.13288 link Kimi
2133 2025-03-17 Knowledge-Aware Iterative Retrieval for Multi-Agent Systems Seyoung Song et.al. 2503.13275 null Kimi
2134 2025-03-17 Can Language Models Follow Multiple Turns of Entangled Instructions? Chi Han et.al. 2503.13222 link Kimi
2135 2025-03-17 Improving Complex Reasoning with Dynamic Prompt Corruption: A soft prompt Optimization Approach Sinan Fan et.al. 2503.13208 null Kimi
2136 2025-03-17 MAP: Evaluation and Multi-Agent Enhancement of Large Language Models for Inpatient Pathways Zhen Chen et.al. 2503.13205 null Kimi
2137 2025-03-17 Are LLMs (Really) Ideological? An IRT-based Analysis and Alignment Tool for Perceived Socio-Economic Bias in LLMs Jasmin Wachter et.al. 2503.13149 null Kimi
2138 2025-03-17 Logic-in-Frames: Dynamic Keyframe Search via Visual Semantic-Logical Verification for Long Video Understanding Weiyu Guo et.al. 2503.13139 null Kimi
2139 2025-03-17 Lifting the Veil on Visual Information Flow in MLLMs: Unlocking Pathways to Faster Inference Hao Yin et.al. 2503.13108 link Kimi
2140 2025-03-17 ClearSight: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large Language Models Hao Yin et.al. 2503.13107 link Kimi
2141 2025-03-17 A Multi-Stage Framework with Taxonomy-Guided Reasoning for Occupation Classification Using Large Language Models Palakorn Achananuparp et.al. 2503.12989 null Kimi
2142 2025-03-17 ROMA: a Read-Only-Memory-based Accelerator for QLoRA-based On-Device LLM Wenqiang Wang et.al. 2503.12988 null Kimi
2143 2025-03-17 R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization Jingyi Zhang et.al. 2503.12937 link Kimi
2144 2025-03-17 HICD: Hallucination-Inducing via Attention Dispersion for Contrastive Decoding to Mitigate Hallucinations in Large Language Models Xinyan Jiang et.al. 2503.12908 link Kimi
2145 2025-03-17 VITED: Video Temporal Evidence Distillation Yujie Lu et.al. 2503.12855 null Kimi
2146 2025-03-17 ACT360: An Efficient 360-Degree Action Detection and Summarization Framework for Mission-Critical Training and Debriefing Aditi Tiwari et.al. 2503.12852 null Kimi
2147 2025-03-17 Grounded Chain-of-Thought for Multimodal Large Language Models Qiong Wu et.al. 2503.12799 link Kimi
2148 2025-03-17 DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding Xinyu Ma et.al. 2503.12797 link Kimi
2149 2025-03-17 Identifying Cooperative Personalities in Multi-agent Contexts through Personality Steering with Representation Engineering Kenneth J. K. Ong et.al. 2503.12722 null Kimi
2150 2025-03-17 Can Reasoning Models Reason about Hardware? An Agentic HLS Perspective Luca Collini et.al. 2503.12721 null Kimi
2151 2025-03-16 Plausibility Vaccine: Injecting LLM Knowledge for Event Plausibility Jacob Chmura et.al. 2503.12667 null Kimi
2152 2025-03-16 VeriLA: A Human-Centered Evaluation Framework for Interpretable Verification of LLM Agent Failures Yoo Yeon Sung et.al. 2503.12651 null Kimi
2153 2025-03-16 MAVEN: Multi-modal Attention for Valence-Arousal Emotion Network Vrushank Ahire et.al. 2503.12623 link Kimi
2154 2025-03-16 MoECollab: Democratizing LLM Development Through Collaborative Mixture of Experts Harshit et.al. 2503.12592 null Kimi
2155 2025-03-16 AdaReTaKe: Adaptive Redundancy Reduction to Perceive Longer for Video-language Understanding Xiao Wang et.al. 2503.12559 link Kimi
2156 2025-03-14 TreeMeshGPT: Artistic Mesh Generation with Autoregressive Tree Sequencing Stefan Lionar et.al. 2503.11629 link Kimi
2157 2025-03-14 ASMA-Tune: Unlocking LLMs’ Assembly Code Comprehension via Structural-Semantic Instruction Tuning Xinyi Wang et.al. 2503.11617 link Kimi
2158 2025-03-14 Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs using Semantic Space Zhiliang Chen et.al. 2503.11586 link Kimi
2159 2025-03-14 Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers Weiming Ren et.al. 2503.11579 null Kimi
2160 2025-03-14 Implicit Bias-Like Patterns in Reasoning Models Messi H. J. Lee et.al. 2503.11572 null Kimi
2161 2025-03-14 Similarity-Aware Token Pruning: Your VLM but Faster Ahmadreza Jeddi et.al. 2503.11549 link Kimi
2162 2025-03-14 HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models Ziqin Zhou et.al. 2503.11513 null Kimi
2163 2025-03-14 V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning Zixu Cheng et.al. 2503.11495 null Kimi
2164 2025-03-14 Integrating LLMs in Gamified Systems Carlos J. Costa et.al. 2503.11458 null Kimi
2165 2025-03-14 Cerebrum (AIOS SDK): A Platform for Agent Development, Deployment, Distribution, and Discovery Balaji Rama et.al. 2503.11444 link Kimi
2166 2025-03-14 Text Compression for Efficient Language Generation David Gu et.al. 2503.11426 null Kimi
2167 2025-03-14 Optimizing Large Language Models for Detecting Symptoms of Comorbid Depression or Anxiety in Chronic Diseases: Insights from Patient Messages Jiyeong Kim et.al. 2503.11384 null Kimi
2168 2025-03-14 Annotating Scientific Uncertainty: A comprehensive model using linguistic patterns and comparison with existing approaches Panggih Kusuma Ningrum et.al. 2503.11376 null Kimi
2169 2025-03-14 AIstorian lets AI be a historian: A KG-powered multi-agent system for accurate biography generation Fengyu Li et.al. 2503.11346 link Kimi
2170 2025-03-14 Rule-Guided Feedback: Enhancing Reasoning by Enforcing Rule Adherence in Large Language Models Aissatou Diallo et.al. 2503.11336 null Kimi
2171 2025-03-14 Safe-VAR: Safe Visual Autoregressive Model for Text-to-Image Generative Watermarking Ziyi Wang et.al. 2503.11324 null Kimi
2172 2025-03-14 MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens Jeong Hun Yeo et.al. 2503.11315 link Kimi
2173 2025-03-14 Unlocking General Long Chain-of-Thought Reasoning Capabilities of Large Language Models via Representation Engineering Xinyu Tang et.al. 2503.11314 link Kimi
2174 2025-03-14 BriLLM: Brain-inspired Large Language Model Hai Zhao et.al. 2503.11299 null Kimi
2175 2025-03-14 Line of Duty: Evaluating LLM Self-Knowledge via Consistency in Feasibility Boundaries Sahil Kale et.al. 2503.11256 link Kimi
2176 2025-03-14 Reasoning-Grounded Natural Language Explanations for Language Models Vojtech Cahlik et.al. 2503.11248 link Kimi
2177 2025-03-14 Can Large Reasoning Models do Analogical Reasoning under Perceptual Uncertainty? Giacomo Camposampiero et.al. 2503.11207 link Kimi
2178 2025-03-14 LLaVA-MLB: Mitigating and Leveraging Attention Bias for Training-Free Video LLMs Leqi Shen et.al. 2503.11205 null Kimi
2179 2025-03-14 Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering Gang Li et.al. 2503.11197 link Kimi
2180 2025-03-14 FastVID: Dynamic Density Pruning for Fast Video Large Language Models Leqi Shen et.al. 2503.11187 link Kimi
2181 2025-03-14 Towards Extreme Pruning of LLMs with Plug-and-Play Mixed Sparsity Chi Xu et.al. 2503.11164 null Kimi
2182 2025-03-14 Don’t Take Things Out of Context: Attention Intervention for Enhancing Chain-of-Thought Reasoning in Large Language Models Shaotian Yan et.al. 2503.11154 null Kimi
2183 2025-03-14 MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling Rachel S. Y. Teo et.al. 2503.11144 link Kimi
2184 2025-03-14 X-EcoMLA: Upcycling Pre-Trained Attention into MLA for Efficient and Extreme KV Compression Guihong Li et.al. 2503.11132 null Kimi
2185 2025-03-14 Direction-Aware Diagonal Autoregressive Image Generation Yijia Xu et.al. 2503.11129 null Kimi
2186 2025-03-13 GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing Rongyao Fang et.al. 2503.10639 link Kimi
2187 2025-03-13 Kolmogorov-Arnold Attention: Is Learnable Attention Better For Vision Transformers? Subhajit Maity et.al. 2503.10632 null Kimi
2188 2025-03-13 SciVerse: Unveiling the Knowledge Comprehension and Visual Reasoning of LMMs on Multi-modal Scientific Problems Ziyu Guo et.al. 2503.10627 null Kimi
2189 2025-03-13 Transformers without Normalization Jiachen Zhu et.al. 2503.10622 null Kimi
2190 2025-03-13 Siege: Autonomous Multi-Turn Jailbreaking of Large Language Models with Tree Search Andy Zhou et.al. 2503.10619 null Kimi
2191 2025-03-13 Compositional Subspace Representation Fine-tuning for Adaptive Large Language Models Andy Zhou et.al. 2503.10617 null Kimi
2192 2025-03-13 TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention Jinhao Duan et.al. 2503.10602 link Kimi
2193 2025-03-13 Long Context Tuning for Video Generation Yuwei Guo et.al. 2503.10589 null Kimi
2194 2025-03-13 Autoregressive Image Generation with Randomized Parallel Decoding Haopeng Li et.al. 2503.10568 link Kimi
2195 2025-03-13 AudioX: Diffusion Transformer for Anything-to-Audio Generation Zeyue Tian et.al. 2503.10522 null Kimi
2196 2025-03-13 TokenCarve: Information-Preserving Visual Token Compression in Multimodal Large Language Models Xudong Tan et.al. 2503.10501 link Kimi
2197 2025-03-13 MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation Weihao Xuan et.al. 2503.10497 null Kimi
2198 2025-03-13 Source-primed Multi-turn Conversation Helps Large Language Models Translate Documents Hanxu Hu et.al. 2503.10494 link Kimi
2199 2025-03-13 LLMs in Disease Diagnosis: A Comparative Study of DeepSeek-R1 and O3 Mini Across Chronic Health Conditions Gaurav Kumar Gupta et.al. 2503.10486 null Kimi
2200 2025-03-13 DynaCode: A Dynamic Complexity-Aware Code Benchmark for Evaluating Large Language Models in Code Generation Wenhao Hu et.al. 2503.10452 null Kimi
2201 2025-03-13 4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models Wanhua Li et.al. 2503.10437 link Kimi
2202 2025-03-13 BeamLLM: Vision-Empowered mmWave Beam Prediction with Large Language Models Can Zheng et.al. 2503.10432 null Kimi
2203 2025-03-13 Understanding the Logical Capabilities of Large Language Models via Out-of-Context Representation Learning Jonathan Shaki et.al. 2503.10408 null Kimi
2204 2025-03-13 SPPO: Efficient Long-sequence LLM Training via Adaptive Sequence Pipeline Parallel Offloading Qiaoling Chen et.al. 2503.10377 null Kimi
2205 2025-03-13 G-Boost: Boosting Private SLMs with General LLMs Yijiang Fan et.al. 2503.10367 null Kimi
2206 2025-03-13 KV-Distill: Nearly Lossless Learnable Context Compression for LLMs Vivek Chari et.al. 2503.10337 null Kimi
2207 2025-03-13 Collaborative Speculative Inference for Efficient LLM Inference Serving Luyao Gao et.al. 2503.10325 null Kimi
2208 2025-03-13 VisualPRM: An Effective Process Reward Model for Multimodal Reasoning Weiyun Wang et.al. 2503.10291 null Kimi
2209 2025-03-13 Efficient Federated Fine-Tuning of Large Language Models with Layer Dropout Shilong Wang et.al. 2503.10217 null Kimi
2210 2025-03-13 LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents Boyu Chen et.al. 2503.10200 null Kimi
2211 2025-03-13 Robustness Tokens: Towards Adversarial Robustness of Transformers Brian Pulfer et.al. 2503.10191 link Kimi
2212 2025-03-13 Through the Magnifying Glass: Adaptive Perception Magnification for Hallucination-Free VLM Decoding Shunqi Mao et.al. 2503.10183 null Kimi
2213 2025-03-13 “Well, Keep Thinking”: Enhancing LLM Reasoning with Adaptive Injection Decoding Hyunbin Jin et.al. 2503.10167 null Kimi
2214 2025-03-13 ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning Pengfei Luo et.al. 2503.10166 link Kimi
2215 2025-03-13 Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding Jinze Li et.al. 2503.10135 null Kimi
2216 2025-03-11 QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension Yongdong Luo et.al. 2503.08689 link Kimi
2217 2025-03-11 CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous Driving Changxing Liu et.al. 2503.08683 link Kimi
2218 2025-03-11 Chain-of-Thought Reasoning In The Wild Is Not Always Faithful Iván Arcuschin et.al. 2503.08679 link Kimi
2219 2025-03-11 REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder Yitian Zhang et.al. 2503.08665 null Kimi
2220 2025-03-11 MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention Yuhan Wang et.al. 2503.08664 link Kimi
2221 2025-03-11 Exploring the Word Sense Disambiguation Capabilities of Large Language Models Pierpaolo Basile et.al. 2503.08662 null Kimi
2222 2025-03-11 Efficient Many-Shot In-Context Learning with Dynamic Block-Sparse Attention Emily Xiao et.al. 2503.08640 link Kimi
2223 2025-03-11 HiP-AD: Hierarchical and Multi-Granularity Planning with Deformable Attention for Autonomous Driving in a Single Decoder Yingqi Tang et.al. 2503.08612 link Kimi
2224 2025-03-11 Vision Transformer for Intracranial Hemorrhage Classification in CT Scans Using an Entropy-Aware Fuzzy Integral Strategy for Adaptive Scan-Level Decision Fusion Mehdi Hosseini Chagahi et.al. 2503.08609 null Kimi
2225 2025-03-11 Tuning-Free Multi-Event Long Video Generation via Synchronized Coupled Sampling Subin Kim et.al. 2503.08605 null Kimi
2226 2025-03-11 RAG-Adapter: A Plug-and-Play RAG-enhanced Framework for Long Video Understanding Xichen Tan et.al. 2503.08576 null Kimi
2227 2025-03-11 DeepReview: Improving LLM-based Paper Review with Human-like Deep Thinking Process Minjun Zhu et.al. 2503.08569 null Kimi
2228 2025-03-11 MoE-Loco: Mixture of Experts for Multitask Locomotion Runhan Huang et.al. 2503.08564 null Kimi
2229 2025-03-11 Reasoning and Sampling-Augmented MCQ Difficulty Prediction via LLMs Wanyong Feng et.al. 2503.08551 null Kimi
2230 2025-03-11 Graph of AI Ideas: Leveraging Knowledge Graphs and LLMs for AI Research Idea Generation Xian Gao et.al. 2503.08549 null Kimi
2231 2025-03-11 DAFE: LLM-Based Evaluation Through Dynamic Arbitration for Free-Form Question-Answering Sher Badshah et.al. 2503.08542 null Kimi
2232 2025-03-11 Mellow: a small audio language model for reasoning Soham Deshmukh et.al. 2503.08540 link Kimi
2233 2025-03-11 Chemical reasoning in LLMs unlocks steerable synthesis planning and reaction mechanism elucidation Andres M Bran et.al. 2503.08537 link Kimi
2234 2025-03-11 ChromaFormer: A Scalable and Accurate Transformer Architecture for Land Cover Classification Mingshi Li et.al. 2503.08534 null Kimi
2235 2025-03-11 Visual Attention Graph Kai-Fu Yang et.al. 2503.08531 null Kimi
2236 2025-03-11 Position-Aware Depth Decay Decoding ($D^3$): Boosting Large Language Model Inference Efficiency Siqi Fan et.al. 2503.08524 null Kimi
2237 2025-03-11 Enhancing Multi-Hop Fact Verification with Structured Knowledge-Augmented Large Language Models Han Cao et.al. 2503.08495 null Kimi
2238 2025-03-11 Accelerating MoE Model Inference with Expert Sharding Oana Balmau et.al. 2503.08467 null Kimi
2239 2025-03-11 FastCache: Optimizing Multimodal LLM Serving through Lightweight KV-Cache Compression Framework Jianian Zhu et.al. 2503.08461 null Kimi
2240 2025-03-11 Controlling Latent Diffusion Using Latent CLIP Jason Becker et.al. 2503.08455 link Kimi
2241 2025-03-11 TokenSim: Enabling Hardware and Software Exploration for Large Language Model Inference Systems Feiyang Wu et.al. 2503.08415 link Kimi
2242 2025-03-11 Fact-checking with Generative AI: A Systematic Cross-Topic Examination of LLMs Capacity to Detect Veracity of Political Information Elizaveta Kuznetsova et.al. 2503.08404 null Kimi
2243 2025-03-11 Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens Qingsong Xie et.al. 2503.08377 null Kimi
2244 2025-03-11 Robust Latent Matters: Boosting Image Generation with Sampling Error Kai Qiu et.al. 2503.08354 link Kimi
2245 2025-03-11 Attention Reallocation: Towards Zero-cost and Controllable Hallucination Mitigation of MLLMs Chongjun Tu et.al. 2503.08342 null Kimi
2246 2025-03-10 Securing External Deeper-than-black-box GPAI Evaluations Alejandro Tlaie et.al. 2503.07496 null Kimi
2247 2025-03-10 V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation Guiwei Zhang et.al. 2503.07493 link Kimi
2248 2025-03-10 Destination Calculus: A Linear λ-Calculus for Purely Functional Memory Writes Thomas Bagrel et.al. 2503.07489 link Kimi
2249 2025-03-10 LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition? Bangyan Li et.al. 2503.07487 null Kimi
2250 2025-03-10 Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction Zongzheng Zhang et.al. 2503.07485 link Kimi
2251 2025-03-10 VLRMBench: A Comprehensive and Challenging Benchmark for Vision-Language Reward Models Jiacheng Ruan et.al. 2503.07478 link Kimi
2252 2025-03-10 Petri Net Modeling of Root Hair Response to Phosphate Starvation in Arabidopsis Thaliana Amber H. B. Fijn et.al. 2503.07477 null Kimi
2253 2025-03-10 MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning Xiangru Tang et.al. 2503.07459 link Kimi
2254 2025-03-10 Open-Set Gait Recognition from Sparse mmWave Radar Point Clouds Riccardo Mazzieri et.al. 2503.07435 link Kimi
2255 2025-03-10 DRESS: Diffusion Reasoning-based Reward Shaping Scheme For Intelligent Networks Feiran You et.al. 2503.07433 link Kimi
2256 2025-03-10 CATPlan: Loss-based Collision Prediction in End-to-End Autonomous Driving Ziliang Xiong et.al. 2503.07425 null Kimi
2257 2025-03-10 Inorganic Catalyst Efficiency Prediction Based on EAPCR Model: A Deep Learning Solution for Multi-Source Heterogeneous Data Zhangdi Liu et.al. 2503.07424 null Kimi
2258 2025-03-10 AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion Mingzhen Sun et.al. 2503.07418 null Kimi
2259 2025-03-07 Task-oriented Uncertainty Collaborative Learning for Label-Efficient Brain Tumor Segmentation Zhenxuan Zhang et.al. 2503.05682 link Kimi
2260 2025-03-07 The latent variable proximal point algorithm for variational problems with inequality constraints Jørgen S. Dokken et.al. 2503.05672 link Kimi
2261 2025-03-07 Kinodynamic Model Predictive Control for Energy Efficient Locomotion of Legged Robots with Parallel Elasticity Yulun Zhuang et.al. 2503.05666 null Kimi
2262 2025-03-07 A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval Yu Zhang et.al. 2503.05659 link Kimi
2263 2025-03-07 Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning Justin Chih-Yao Chen et.al. 2503.05641 null Kimi
2264 2025-03-07 Exploring FMCW Radars and Feature Maps for Activity Recognition: A Benchmark Study Ali Samimi Fard et.al. 2503.05629 null Kimi
2265 2025-03-07 FMT: A Multimodal Pneumonia Detection Model Based on Stacking MOE Framework Jingyu Xu et.al. 2503.05626 null Kimi
2266 2025-03-07 A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models Dong Shu et.al. 2503.05613 null Kimi
2267 2025-03-07 D2GV: Deformable 2D Gaussian Splatting for Video Representation in 400FPS Mufan Liu et.al. 2503.05600 link Kimi
2268 2025-03-07 R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning Huatong Song et.al. 2503.05592 null Kimi
2269 2025-03-06 L$^2$M: Mutual Information Scaling Law for Long-Context Language Modeling Zhuo Chen et.al. 2503.04725 link Kimi
2270 2025-03-07 Shifting Long-Context LLMs Research from Input to Output Yuhao Wu et.al. 2503.04723 null Kimi
2271 2025-03-06 Enough Coin Flips Can Make LLMs Act Bayesian Ritwik Gupta et.al. 2503.04722 null Kimi
2272 2025-03-06 L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning Pranjal Aggarwal et.al. 2503.04697 null Kimi
2273 2025-03-06 UIPE: Enhancing LLM Unlearning by Removing Knowledge Related to Forgetting Targets Wenyu Wang et.al. 2503.04693 null Kimi
2274 2025-03-06 The Influence of Prior Discourse on Conversational Agent-Driven Decision-Making Stephen Pilli et.al. 2503.04692 null Kimi
2275 2025-03-06 Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases Pengcheng Qiu et.al. 2503.04691 null Kimi
2276 2025-03-07 DIMSUM: Discourse in Mathematical Reasoning as a Supervision Module Krish Sharma et.al. 2503.04685 null Kimi
2277 2025-03-06 Matrix Factorization for Inferring Associations and Missing Links Ryan Barron et.al. 2503.04680 null Kimi
2278 2025-03-06 LLM-guided Plan and Retrieval: A Strategic Alignment for Interpretable User Satisfaction Estimation in Dialogue Sangyeop Kim et.al. 2503.04675 null Kimi
2279 2025-03-05 PacketCLIP: Multi-Modal Embedding of Network Traffic and Language for Cybersecurity Reasoning Ryozo Masukawa et.al. 2503.03747 null Kimi
2280 2025-03-05 Process-based Self-Rewarding Language Models Shimao Zhang et.al. 2503.03746 link Kimi
2281 2025-03-05 Rethinking Deep Clustering Paradigms: Self-Supervision Is All You Need Amal Shaheena et.al. 2503.03733 null Kimi
2282 2025-03-05 Towards Understanding Distilled Reasoning Models: A Representational Approach David D. Baek et.al. 2503.03730 null Kimi
2283 2025-03-05 When Radiation Meets Linux: Analyzing Soft Errors in Linux on COTS SoCs under Proton Irradiation Saad Memon et.al. 2503.03722 null Kimi
2284 2025-03-05 Improving LLM Safety Alignment with Dual-Objective Optimization Xuandong Zhao et.al. 2503.03710 link Kimi
2285 2025-03-05 Rethinking Video Tokenization: A Conditioned Diffusion-based Approach Nianzu Yang et.al. 2503.03708 link Kimi
2286 2025-03-05 A Practical Memory Injection Attack against LLM Agents Shen Dong et.al. 2503.03704 null Kimi
2287 2025-03-05 ILLC: Iterative Layer-by-Layer Compression for Enhancing Structural Faithfulness in SpArX Ungsik Kim et.al. 2503.03693 null Kimi
2288 2025-03-05 DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance Zhao Yang et.al. 2503.03689 link Kimi
2289 2025-03-04 Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation Han Xue et.al. 2503.02881 link Kimi
2290 2025-03-04 Language Models can Self-Improve at State-Value Estimation for Better Search Ethan Mendes et.al. 2503.02878 link Kimi
2291 2025-03-04 Weak-to-Strong Generalization Even in Random Feature Networks, Provably Marko Medvedev et.al. 2503.02877 null Kimi
2292 2025-03-04 SPIDER: A Comprehensive Multi-Organ Supervised Pathology Dataset and Baseline Models Dmitry Nechaev et.al. 2503.02876 link Kimi
2293 2025-03-04 The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models Ke Ji et.al. 2503.02875 null Kimi
2294 2025-03-04 Calibrating LLM Confidence with Semantic Steering: A Multi-Prompt Aggregation Framework Ziang Zhou et.al. 2503.02863 null Kimi
2295 2025-03-04 PileUp Mitigation at the HL-LHC Using Attention for Event-Wide Context Luke Vaughan et.al. 2503.02860 null Kimi
2296 2025-03-04 Unsupervised Attributed Dynamic Network Embedding with Stability Guarantees Emma Ceccherini et.al. 2503.02859 null Kimi
2297 2025-03-04 Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs’ Decoding Layers Zicong He et.al. 2503.02851 link Kimi
2298 2025-03-04 Multimodal Deep Learning for Subtype Classification in Breast Cancer Using Histopathological Images and Gene Expression Data Amin Honarmandi Shandiz et.al. 2503.02849 link Kimi
2299 2025-02-28 LLM Post-Training: A Deep Dive into Reasoning Large Language Models Komal Kumar et.al. 2502.21321 link Kimi
2300 2025-02-28 Doping dependence of 2-spinon excitations in the doped 1D cuprate Ba$_2$CuO$_{3+δ}$ Jiarui Li et.al. 2502.21316 null Kimi
2301 2025-02-28 Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos Zhiyu Tan et.al. 2502.21314 null Kimi
2302 2025-02-28 FANformer: Improving Large Language Models Through Effective Periodicity Modeling Yihong Dong et.al. 2502.21309 link Kimi
2303 2025-02-28 Persuasion Should be Double-Blind: A Multi-Domain Dialogue Dataset With Faithfulness Based on Causal Theory of Mind Dingyi Zhang et.al. 2502.21297 null Kimi
2304 2025-02-28 Bilevel Optimized Implicit Neural Representation for Scan-Specific Accelerated MRI Reconstruction Hongze Yu et.al. 2502.21292 null Kimi
2305 2025-02-28 Contextualizing biological perturbation experiments through language Menghua Wu et.al. 2502.21290 link Kimi
2306 2025-02-28 Boosting Prediction with Data Missing Not at Random Yuan Bian et.al. 2502.21276 null Kimi
2307 2025-02-28 Adaptive Keyframe Sampling for Long Video Understanding Xi Tang et.al. 2502.21271 null Kimi
2308 2025-02-28 Dynamical Decoupling of Generalization and Overfitting in Large Two-Layer Networks Andrea Montanari et.al. 2502.21269 null Kimi
2309 2025-02-27 R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts Zhongyang Li et.al. 2502.20395 link Kimi
2310 2025-02-27 LIFT-GS: Cross-Scene Render-Supervised Distillation for 3D Language Grounding Ang Cao et.al. 2502.20389 null Kimi
2311 2025-02-27 InsTaG: Learning Personalized 3D Talking Head from Few-Second Video Jiahe Li et.al. 2502.20387 link Kimi
2312 2025-02-27 ATLAS Navigator: Active Task-driven LAnguage-embedded Gaussian Splatting Dexter Ong et.al. 2502.20386 null Kimi
2313 2025-02-27 rSPDE: tools for statistical modeling using fractional SPDEs David Bolin et.al. 2502.20385 null Kimi
2314 2025-02-27 PhantomWiki: On-Demand Datasets for Reasoning and Retrieval Evaluation Albert Gong et.al. 2502.20377 link Kimi
2315 2025-02-27 Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization Ryan C. Barron et.al. 2502.20364 link Kimi
2316 2025-02-27 Bridging the Creativity Understanding Gap: Small-Scale Human Alignment Enables Expert-Level Humor Ranking in LLMs Kuan Lok Zhou et.al. 2502.20356 null Kimi
2317 2025-02-27 Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners Daniele Paliotta et.al. 2502.20339 null Kimi
2318 2025-02-27 KeBaB: $k$-mer based breaking for finding super-maximal exact matches Nathaniel K. Brown et.al. 2502.20338 null Kimi
2319 2025-02-26 Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models Lucy Xiaoyang Shi et.al. 2502.19417 null Kimi
2320 2025-02-26 Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation Shiven Sinha et.al. 2502.19414 link Kimi
2321 2025-02-26 The Mighty ToRR: A Benchmark for Table Reasoning and Robustness Shir Ashury-Tahan et.al. 2502.19412 link Kimi
2322 2025-02-26 Code to Think, Think to Code: A Survey on Code-Enhanced Reasoning and Reasoning-Driven Code Intelligence in LLMs Dayu Yang et.al. 2502.19411 link Kimi
2323 2025-02-26 ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models Danae Sánchez Villegas et.al. 2502.19409 null Kimi
2324 2025-02-26 Learning Code-Edit Embedding to Model Student Debugging Behavior Hasnain Heickal et.al. 2502.19407 null Kimi
2325 2025-02-26 Single-shot and two-shot decoding with generalized bicycle codes Hsiang-Ku Lin et.al. 2502.19406 null Kimi
2326 2025-02-26 General Reasoning Requires Learning to Reason from the Get-go Seungwook Han et.al. 2502.19402 null Kimi
2327 2025-02-26 TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding Max Ku et.al. 2502.19400 null Kimi
2328 2025-02-26 The End of Easy Phenomenology for CMB Experiments: A Case Study in the Dark Sector Cynthia Trendafilova et.al. 2502.19383 null Kimi
2329 2025-02-25 K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs Ziheng Ouyang et.al. 2502.18461 null Kimi
2330 2025-02-25 DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers Xueguang Ma et.al. 2502.18460 link Kimi
2331 2025-02-25 GHOST 2.0: generative high-fidelity one shot transfer of heads Alexander Groshev et.al. 2502.18417 null Kimi
2332 2025-02-25 Comparative Analysis of MDL-VAE vs. Standard VAE on 202 Years of Gynecological Data Paula Santos et.al. 2502.18412 null Kimi
2333 2025-02-25 The FFT Strikes Back: An Efficient Alternative to Self-Attention Jacob Fein-Ashley et.al. 2502.18394 link Kimi
2334 2025-02-25 ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation Yifan Pu et.al. 2502.18364 null Kimi
2335 2025-02-25 Graph Inference with Effective Resistance Queries Huck Bennett et.al. 2502.18350 null Kimi
2336 2025-02-25 Mapping of Subjective Accounts into Interpreted Clusters (MOSAIC): Topic Modelling and LLM applied to Stroboscopic Phenomenology Romy Beauté et.al. 2502.18318 null Kimi
2337 2025-02-25 RefuteBench 2.0 – Agentic Benchmark for Dynamic Evaluation of LLM Responses to Refutation Instruction Jianhao Yan et.al. 2502.18308 null Kimi
2338 2025-02-25 DeepCircuitX: A Comprehensive Repository-Level Dataset for RTL Code Understanding, Generation, and PPA Analysis Zeju Li et.al. 2502.18297 null Kimi
2339 2025-02-24 S4S: Solving for a Diffusion Model Solver Eric Frankel et.al. 2502.17423 null Kimi
2340 2025-02-24 MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs Jiarui Zhang et.al. 2502.17422 link Kimi
2341 2025-02-24 LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification Penghui Yang et.al. 2502.17421 link Kimi
2342 2025-02-24 Reasoning with Latent Thoughts: On the Power of Looped Transformers Nikunj Saunshi et.al. 2502.17416 null Kimi
2343 2025-02-24 X-Dancer: Expressive Music to Human Dance Video Generation Zeyuan Chen et.al. 2502.17414 null Kimi
2344 2025-02-24 Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning Guijin Son et.al. 2502.17407 link Kimi
2345 2025-02-24 Advances in multiparameter quantum sensing and metrology Luca Pezzè et.al. 2502.17396 null Kimi
2346 2025-02-24 The Empirical Impact of Reducing Symmetries on the Performance of Deep Ensembles and MoE Andrei Chernov et.al. 2502.17391 null Kimi
2347 2025-02-24 A Concise Lyapunov Analysis of Nesterov’s Accelerated Gradient Method Jun Liu et.al. 2502.17373 null Kimi
2348 2025-02-24 KV-Edit: Training-Free Image Editing for Precise Background Preservation Tianrui Zhu et.al. 2502.17363 link Kimi
2349 2025-02-21 Sparks of cognitive flexibility: self-guided context inference for flexible stimulus-response mapping by attentional routing Rowan Sommers et.al. 2502.15634 null Kimi
2350 2025-02-21 LaTIM: Measuring Latent Token-to-Token Interactions in Mamba Models Hugo Pitorro et.al. 2502.15612 null Kimi
2351 2025-02-21 Generalizing From Short to Long: Effective Data Synthesis for Long-Context Instruction Tuning Wenhao Zhu et.al. 2502.15592 link Kimi
2352 2025-02-21 LightThinker: Thinking Step-by-Step Compression Jintian Zhang et.al. 2502.15589 null Kimi
2353 2025-02-21 Adaptive Expansion for Hypergraph Learning Tianyi Ma et.al. 2502.15564 null Kimi
2354 2025-02-21 Estimating Vehicle Speed on Roadways Using RNNs and Transformers: A Video-based Approach Sai Krishna Reddy Mareddy et.al. 2502.15545 null Kimi
2355 2025-02-21 Generalization Guarantees for Representation Learning via Data-Dependent Gaussian Mixture Priors Milad Sefidgaran et.al. 2502.15540 link Kimi
2356 2025-02-21 Towards Swift Serverless LLM Cold Starts with ParaServe Chiheng Lou et.al. 2502.15524 null Kimi
2357 2025-02-21 Solving Inverse Problems with Deep Linear Neural Networks: Global Convergence Guarantees for Gradient Descent with Weight Decay Hannah Laus et.al. 2502.15522 null Kimi
2358 2025-02-21 Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection Yue Sun et.al. 2502.15516 null Kimi
2359 2025-02-20 LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention Shang Yang et.al. 2502.14866 link Kimi
2360 2025-02-20 CLIPPER: Compression enables long-context synthetic data generation Chau Minh Pham et.al. 2502.14854 link Kimi
2361 2025-02-20 Revealing and Mitigating Over-Attention in Knowledge Editing Pinzheng Wang et.al. 2502.14838 link Kimi
2362 2025-02-20 Towards Economical Inference: Enabling DeepSeek’s Multi-Head Latent Attention in Any Transformer-based LLMs Tao Ji et.al. 2502.14837 link Kimi
2363 2025-02-20 Improving the Diffusability of Autoencoders Ivan Skorokhodov et.al. 2502.14831 null Kimi
2364 2025-02-20 Measuring Faithfulness of Chains of Thought by Unlearning Reasoning Steps Martin Tutek et.al. 2502.14829 link Kimi
2365 2025-02-20 Turning on the Light: Polymorphism-Induced Photoluminescence in Cysteine Crystals Debarshi Banerjee et.al. 2502.14826 null Kimi
2366 2025-02-20 Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models Vlad Sobal et.al. 2502.14819 null Kimi
2367 2025-02-20 RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird’s Eye View Segmentation Henrique Piñeiro Monteagudo et.al. 2502.14792 null Kimi
2368 2025-02-20 Ray-Tracing for Conditionally Activated Neural Networks Claudio Gallicchio et.al. 2502.14788 null Kimi
2369 2025-02-20 LIFT: Improving Long Context Understanding of Large Language Models through Long Input Fine-Tuning Yansheng Mao et.al. 2502.14644 null Kimi
2370 2025-02-20 PEARL: Towards Permutation-Resilient LLMs Liang Chen et.al. 2502.14628 link Kimi
2371 2025-02-20 PLPHP: Per-Layer Per-Head Vision Token Pruning for Efficient Large Vision-Language Models Yu Meng et.al. 2502.14504 null Kimi
2372 2025-02-20 Unshackling Context Length: An Efficient Selective Attention Approach through Query-Key Compression Haoyu Wang et.al. 2502.14477 null Kimi
2373 2025-02-20 Early-Exit and Instant Confidence Translation Quality Estimation Vilém Zouhar et.al. 2502.14429 link Kimi
2374 2025-02-19 MuDAF: Long-Context Multi-Document Attention Focusing through Contrastive Learning on Attention Heads Weihao Liu et.al. 2502.13963 link Kimi
2375 2025-02-19 A Chain-of-Thought Subspace Meta-Learning for Few-shot Image Captioning with Large Vision and Language Models Hao Huang et.al. 2502.13942 null Kimi
2376 2025-02-19 Qwen2.5-VL Technical Report Shuai Bai et.al. 2502.13923 null Kimi
2377 2025-02-19 LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization Guanzheng Chen et.al. 2502.13922 link Kimi
2378 2025-02-19 A measurement-based approach to analyze the power consumption of the softwarized 5G core Arturo Bellin et.al. 2502.13879 null Kimi
2379 2025-02-19 SPEX: Scaling Feature Interaction Explanations for LLMs Justin Singh Kang et.al. 2502.13870 link Kimi
2380 2025-02-19 Enhancing LLM-Based Recommendations Through Personalized Reasoning Jiahao Liu et.al. 2502.13845 link Kimi
2381 2025-02-19 SCALAR: Scientific Citation-based Live Assessment of Long-context Academic Reasoning Renxi Wang et.al. 2502.13753 link Kimi
2382 2025-02-19 MoM: Linear Sequence Modeling with Mixture-of-Memories Jusen Du et.al. 2502.13685 link Kimi
2383 2025-02-19 PeerQA: A Scientific Question Answering Dataset from Peer Reviews Tim Baumgärtner et.al. 2502.13668 link Kimi
2384 2025-02-18 Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning Jingyang Lin et.al. 2502.13127 null Kimi
2385 2025-02-18 Eager Updates For Overlapped Communication and Computation in DiLoCo Satyen Kale et.al. 2502.12996 null Kimi
2386 2025-02-18 Infinite Retrieval: Attention Enhanced LLMs in Long-Context Processing Xiaoju Ye et.al. 2502.12962 null Kimi
2387 2025-02-18 Every Expert Matters: Towards Effective Knowledge Distillation for Mixture-of-Experts Language Models Gyeongman Kim et.al. 2502.12947 null Kimi
2388 2025-02-18 S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning Ruotian Ma et.al. 2502.12853 link Kimi
2389 2025-02-18 A$^2$ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization Junhui He et.al. 2502.12665 null Kimi
2390 2025-02-18 MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation Sihyun Yu et.al. 2502.12632 null Kimi
2391 2025-02-18 Improving Chain-of-Thought Reasoning via Quasi-Symbolic Abstractions Leonardo Ranaldi et.al. 2502.12616 null Kimi
2392 2025-02-18 LongFaith: Enhancing Long-Context Reasoning in LLMs with Faithful Synthetic Data Cehao Yang et.al. 2502.12583 link Kimi
2393 2025-02-18 HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading Cheng Luo et.al. 2502.12574 link Kimi
2394 2025-02-17 Small Models Struggle to Learn from Strong Reasoners Yuetai Li et.al. 2502.12143 null Kimi
2395 2025-02-17 SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs Yige Xu et.al. 2502.12134 link Kimi
2396 2025-02-17 APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs Yuxiang Huang et.al. 2502.12085 link Kimi
2397 2025-02-17 AdaSplash: Adaptive Sparse Flash Attention Nuno Gonçalves et.al. 2502.12082 link Kimi
2398 2025-02-17 TokenSkip: Controllable Chain-of-Thought Compression in LLMs Heming Xia et.al. 2502.12067 link Kimi
2399 2025-02-17 SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities Fengqing Jiang et.al. 2502.12025 null Kimi
2400 2025-02-17 Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving Xin Xu et.al. 2502.12022 null Kimi
2401 2025-02-17 Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL Hanbing Liu et.al. 2502.11656 link Kimi
2402 2025-02-17 SurgPose: a Dataset for Articulated Robotic Surgical Tool Pose Estimation and Tracking Zijian Wu et.al. 2502.11534 null Kimi
2403 2025-02-17 AURORA: Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification Xiaoyu Tan et.al. 2502.11520 null Kimi
2404 2025-02-14 Are Large Language Models the future crowd workers of Linguistics? Iris Ferrazzo et.al. 2502.10266 null Kimi
2405 2025-02-14 LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs - No Silver Bullet for LC or RAG Routing Kuan Li et.al. 2502.09977 null Kimi
2406 2025-02-14 MIR-Bench: Benchmarking LLM’s Long-Context Intelligence via Many-Shot In-Context Inductive Reasoning Kai Yan et.al. 2502.09933 null Kimi
2407 2025-02-14 INF^2: High-Throughput Generative Inference of Large Language Models using Near-Storage Processing Hongsun Jang et.al. 2502.09921 null Kimi
2408 2025-02-13 ATM-Net: Adaptive Termination and Multi-Precision Neural Networks for Energy-Harvested Edge Intelligence Neeraj Solanki et.al. 2502.09822 null Kimi
2409 2025-02-13 NestQuant: Nested Lattice Quantization for Matrix Products and LLMs Semyon Savkin et.al. 2502.09720 null Kimi
2410 2025-02-13 MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency Dongzhi Jiang et.al. 2502.09621 null Kimi
2411 2025-02-13 CoT-Valve: Length-Compressible Chain-of-Thought Tuning Xinyin Ma et.al. 2502.09601 link Kimi
2412 2025-02-13 Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs Siyan Zhao et.al. 2502.09597 link Kimi
2413 2025-02-13 SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models Daniel Fleischer et.al. 2502.09390 link Kimi
2414 2025-02-13 Generalizability through Explainability: Countering Overfitting with Counterfactual Examples Flavio Giorgi et.al. 2502.09193 null Kimi
2415 2025-02-13 Bridging the Gap Between LLMs and Human Intentions: Progresses and Challenges in Instruction Understanding, Intention Reasoning, and Reliable Generation Zongyu Chang et.al. 2502.09101 null Kimi
2416 2025-02-13 Unleashing the Power of Large Language Model for Denoising Recommendation Shuyao Wang et.al. 2502.09058 null Kimi
2417 2025-02-13 Diversity Enhances an LLM’s Performance in RAG and Long-context Task Zhchao Wang et.al. 2502.09017 null Kimi
2418 2025-02-13 RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models Quan Wei et.al. 2502.09003 null Kimi
2419 2025-02-13 Task Generalization With AutoRegressive Compositional Structure: Can Learning From $D$ Tasks Generalize to $D^{T}$ Tasks? Amirhesam Abedsoltan et.al. 2502.08991 null Kimi
2420 2025-02-12 Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning Qifan Yu et.al. 2502.08482 null Kimi
2421 2025-02-12 The MoE-Empowered Edge LLMs Deployment: Architecture, Challenges, and Opportunities Ning Li et.al. 2502.08381 null Kimi
2422 2025-02-12 Inference-time sparse attention with asymmetric indexing Pierre-Emmanuel Mazaré et.al. 2502.08246 null Kimi
2423 2025-02-12 Learning Human Skill Generators at Key-Step Levels Yilu Wu et.al. 2502.08234 null Kimi
2424 2025-02-12 Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance Lingfei Qian et.al. 2502.08127 link Kimi
2425 2025-02-12 GCoT: Chain-of-Thought Prompt Learning for Graphs Xingtong Yu et.al. 2502.08092 null Kimi
2426 2025-02-12 Mixture of Decoupled Message Passing Experts with Entropy Constraint for General Node Classification Xuanze Chen et.al. 2502.08083 null Kimi
2427 2025-02-11 Training Sparse Mixture Of Experts Text Embedding Models Zach Nussbaum et.al. 2502.07972 link Kimi
2428 2025-02-11 HexGen-2: Disaggregated Generative Inference of LLMs in Heterogeneous Environment Youhe Jiang et.al. 2502.07903 null Kimi
2429 2025-02-11 TransMLA: Multi-head Latent Attention Is All You Need Fanxu Meng et.al. 2502.07864 link Kimi
2430 2025-02-11 Magic 1-For-1: Generating One Minute Video Clips within One Minute Hongwei Yi et.al. 2502.07701 link Kimi
2431 2025-02-11 LASP-2: Rethinking Sequence Parallelism for Linear Attention and Its Hybrid Weigao Sun et.al. 2502.07563 link Kimi
2432 2025-02-11 Early Stopping Against Label Noise Without Validation Data Suqin Yuan et.al. 2502.07551 link Kimi
2433 2025-02-11 Instance-dependent Early Stopping Suqin Yuan et.al. 2502.07547 link Kimi
2434 2025-02-11 Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More Xialie Zhuang et.al. 2502.07490 link Kimi
2435 2025-02-11 LLMs Can Easily Learn to Reason from Demonstrations: Structure, not content, is what matters! Dacheng Li et.al. 2502.07374 link Kimi
2436 2025-02-11 LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation Zican Dong et.al. 2502.07365 null Kimi
2437 2025-02-11 BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models Xu Huang et.al. 2502.07346 link Kimi
2438 2025-02-11 CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction Junlong Li et.al. 2502.07316 link Kimi
2439 2025-02-11 OpenGrok: Enhancing SNS Data Processing with Distilled Knowledge and Mask-like Mechanisms Lumen AI et.al. 2502.07312 link Kimi
2440 2025-02-10 On the Emergence of Thinking in LLMs I: Searching for the Right Intuition Guanghao Ye et.al. 2502.06773 link Kimi
2441 2025-02-10 ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates Ling Yang et.al. 2502.06772 link Kimi
2442 2025-02-10 Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs Ryan Synk et.al. 2502.06766 link Kimi
2443 2025-02-10 History-Guided Video Diffusion Kiwhan Song et.al. 2502.06764 null Kimi
2444 2025-02-10 Rationalization Models for Text-to-SQL Gaetano Rossiello et.al. 2502.06759 null Kimi
2445 2025-02-10 MoETuner: Optimized Mixture of Expert Serving with Balanced Expert Placement and Token Routing Seokjin Go et.al. 2502.06643 null Kimi
2446 2025-02-10 Scaling Multi-Document Event Summarization: Evaluating Compression vs. Full-Text Approaches Adithya Pratapa et.al. 2502.06617 link Kimi
2447 2025-02-10 Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation Chengwen Qi et.al. 2502.06563 link Kimi
2448 2025-02-10 CoS: Chain-of-Shot Prompting for Long Video Understanding Jian Hu et.al. 2502.06428 null Kimi
2449 2025-02-10 Expect the Unexpected: FailSafe Long Context QA for Finance Kiran Kamble et.al. 2502.06329 null Kimi
2450 2025-02-07 Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy Yunhang Shen et.al. 2502.05177 link Kimi
2451 2025-02-07 VideoRoPE: What Makes for Good Video Rotary Position Embedding? Xilin Wei et.al. 2502.05173 link Kimi
2452 2025-02-07 Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient Jan Ludziejewski et.al. 2502.05172 null Kimi
2453 2025-02-07 NoLiMa: Long-Context Evaluation Beyond Literal Matching Ali Modarressi et.al. 2502.05167 link Kimi
2454 2025-02-07 Data-Parallel Neural Network Training via Nonlinearly Preconditioned Trust-Region Method Samuel A. Cruz Alegría et.al. 2502.05133 null Kimi
2455 2025-02-07 Adaptive Graph of Thoughts: Test-Time Adaptive Reasoning Unifying Chain, Tree, and Graph Structures Tushar Pandey et.al. 2502.05078 link Kimi
2456 2025-02-07 S$^2$-MAD: Breaking the Token Barrier to Enhance Multi-Agent Debate Efficiency Yuting Zeng et.al. 2502.04790 null Kimi
2457 2025-02-07 Early Stopping for Regression Trees Ratmir Miftachov et.al. 2502.04709 null Kimi
2458 2025-02-07 ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and Reasoning Yuwei Yin et.al. 2502.04689 link Kimi
2459 2025-02-07 Unveiling the Mechanisms of Explicit CoT Training: How Chain-of-Thought Enhances Reasoning Generalization Xinhao Yao et.al. 2502.04667 link Kimi
2460 2025-02-06 Exploring operation parallelism vs. ion movement in ion-trapped QCCD architectures Anabel Ovide et.al. 2502.04181 null Kimi
2461 2025-02-06 HD-EPIC: A Highly-Detailed Egocentric Video Dataset Toby Perrett et.al. 2502.04144 null Kimi
2462 2025-02-06 AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference Qingyue Yang et.al. 2502.04077 link Kimi
2463 2025-02-06 RWKV-UI: UI Understanding with Enhanced Perception and Reasoning Jiaxi Yang et.al. 2502.03971 null Kimi
2464 2025-02-06 InfinitePOD: Building Datacenter-Scale High-Bandwidth Domain for LLM with Optical Circuit Switching Transceivers Chenchen Shou et.al. 2502.03885 null Kimi
2465 2025-02-06 Rank Also Matters: Hierarchical Configuration for Mixture of Adapter Experts in LLM Fine-Tuning Peizhuang Cong et.al. 2502.03884 null Kimi
2466 2025-02-06 Identify Critical KV Cache in LLM Inference from an Output Perturbation Perspective Yuan Feng et.al. 2502.03805 link Kimi
2467 2025-02-05 (GG) MoE vs. MLP on Tabular Data Andrei Chernov et.al. 2502.03608 null Kimi
2468 2025-02-05 HACK: Homomorphic Acceleration via Compression of the Key-Value Cache for Disaggregated LLM Inference Zeyu Zhang et.al. 2502.03589 null Kimi
2469 2025-02-05 Demystifying Long Chain-of-Thought Reasoning in LLMs Edward Yeo et.al. 2502.03373 link Kimi
2470 2025-02-05 ECM: A Unified Electronic Circuit Model for Explaining the Emergence of In-Context Learning and Chain-of-Thought in Large Language Model Qiguang Chen et.al. 2502.03325 null Kimi
2471 2025-02-05 Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning DiJia Su et.al. 2502.03275 null Kimi
2472 2025-02-05 MaxInfo: A Training-Free Key-Frame Selection Method Using Maximum Volume for Enhanced Video Understanding Pengyi Li et.al. 2502.03183 null Kimi
2473 2025-02-05 Structured Token Retention and Computational Memory Paths in Large Language Models Jonathan Delena et.al. 2502.03102 null Kimi
2474 2025-02-05 IAO Prompting: Making Knowledge Flow Explicit in LLMs through Structured Reasoning Templates Aissatou Diallo et.al. 2502.03080 null Kimi
2475 2025-02-05 Scaling Laws for Upcycling Mixture-of-Experts Language Models Seng Pei Liew et.al. 2502.03009 null Kimi
2476 2025-02-05 LLM-KT: Aligning Large Language Models with Knowledge Tracing using a Plug-and-Play Instruction Ziwei Wang et.al. 2502.02945 null Kimi
2477 2025-02-05 Early Stopping in Contextual Bandits and Inferences Zihan Cui et.al. 2502.02793 null Kimi
2478 2025-02-04 Twilight: Adaptive Attention Sparsity with Hierarchical Top-$p$ Pruning Chaofan Lin et.al. 2502.02770 null Kimi
2479 2025-02-04 Hecate: Unlocking Efficient Sparse Model Training via Fully Sharded Sparse Data Parallelism Yuhao Qing et.al. 2502.02581 null Kimi
2480 2025-02-04 Brief analysis of DeepSeek R1 and its implications for Generative AI Sarah Mercer et.al. 2502.02523 null Kimi
2481 2025-02-04 EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization Yize Wu et.al. 2502.02493 null Kimi
2482 2025-02-04 Lower Bounds for Chain-of-Thought Reasoning in Hard-Attention Transformers Alireza Amiri et.al. 2502.02393 null Kimi
2483 2025-02-04 STAIR: Improving Safety Alignment with Introspective Reasoning Yichi Zhang et.al. 2502.02384 link Kimi
2484 2025-02-04 Premise-Augmented Reasoning Chains Improve Error Identification in Math reasoning with LLMs Sagnik Mukherjee et.al. 2502.02362 null Kimi
2485 2025-02-04 VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation Siyu Xu et.al. 2502.02175 null Kimi
2486 2025-02-04 M2R2: Mixture of Multi-Rate Residuals for Efficient Transformer Inference Nikhil Bhendawade et.al. 2502.02040 null Kimi
2487 2025-02-04 Wavelet-based Positional Representation for Long Context Yui Oka et.al. 2502.02004 null Kimi
2488 2025-02-04 MPIC: Position-Independent Multimodal Context Caching System for Efficient MLLM Serving Shiju Zhao et.al. 2502.01960 null Kimi
2489 2025-01-31 Scalable-Softmax Is Superior for Attention Ken M. Nakanishi et.al. 2501.19399 null Kimi
2490 2025-01-31 Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models Alina Shutova et.al. 2501.19392 link Kimi
2491 2025-01-31 Efficient Reasoning with Hidden Thinking Xuan Shen et.al. 2501.19201 link Kimi
2492 2025-01-31 Rethinking Early Stopping: Refine, Then Calibrate Eugène Berta et.al. 2501.19195 link Kimi
2493 2025-01-31 A theoretical framework for overfitting in energy-based modeling Giovanni Catania et.al. 2501.19158 null Kimi
2494 2025-01-31 $\infty$-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation Saul Santos et.al. 2501.19098 link Kimi
2495 2025-01-30 Rope to Nope and Back Again: A New Hybrid Attention Strategy Bowen Yang et.al. 2501.18795 null Kimi
2496 2025-01-30 Zero-shot Large Language Models for Long Clinical Text Summarization with Temporal Reasoning Maya Kruse et.al. 2501.18724 null Kimi
2497 2025-01-30 Rethinking Bottlenecks in Safety Fine-Tuning of Vision Language Models Yi Ding et.al. 2501.18533 null Kimi
2498 2025-01-30 State Stream Transformer (SST): Emergent Metacognitive Behaviours Through Latent State Persistence Thea Aviss et.al. 2501.18356 null Kimi
2499 2025-01-30 Learning to Plan & Reason for Evaluation with Thinking-LLM-as-a-Judge Swarnadeep Saha et.al. 2501.18099 null Kimi
2500 2025-01-29 Physics-Grounded Differentiable Simulation for Soft Growing Robots Lucas Chen et.al. 2501.17963 link Kimi
2501 2025-01-29 Free Agent in Agent-Based Mixture-of-Experts Generative AI Framework Jung-Hua Liu et.al. 2501.17903 null Kimi
2502 2025-01-29 Formally Verified Binary-level Pointer Analysis Freek Verbeek et.al. 2501.17766 null Kimi
2503 2025-01-29 CSEval: Towards Automated, Multi-Dimensional, and Reference-Free Counterspeech Evaluation using Auto-Calibrated LLMs Amey Hengle et.al. 2501.17581 null Kimi
2504 2025-01-29 Heuristic-Informed Mixture of Experts for Link Prediction in Multilayer Networks Lucio La Cava et.al. 2501.17557 null Kimi
2505 2025-01-29 DINT Transformer Yueyang Cang et.al. 2501.17486 null Kimi
2506 2025-01-28 TORCHLIGHT: Shedding LIGHT on Real-World Attacks on Cloudless IoT Devices Concealed within the Tor Network Yumingzhi Pan et.al. 2501.16784 null Kimi
2507 2025-01-28 3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow Yueen Ma et.al. 2501.16698 null Kimi
2508 2025-01-28 MCTS-SQL: An Effective Framework for Text-to-SQL with Monte Carlo Tree Search Shuozhi Yuan et.al. 2501.16607 null Kimi
2509 2025-01-27 Searching for GEMS: Discovery and Characterization of Two Brown Dwarfs Around M Dwarfs Alexander Larsen et.al. 2501.16554 null Kimi
2510 2025-01-27 MoEVD: Enhancing Vulnerability Detection by Mixture-of-Experts (MoE) Xu Yang et.al. 2501.16454 null Kimi
2511 2025-01-27 The Effect of Optimal Self-Distillation in Noisy Gaussian Mixture Model Kaito Takanami et.al. 2501.16226 link Kimi
2512 2025-01-27 Provence: efficient and robust context pruning for retrieval-augmented generation Nadezhda Chirkova et.al. 2501.16214 null Kimi
2513 2025-01-27 Options-Aware Dense Retrieval for Multiple-Choice query Answering Manish Singh et.al. 2501.16111 null Kimi
2514 2025-01-27 Static Batching of Irregular Workloads on GPUs: Framework and Application to Efficient MoE Model Inference Yinghan Li et.al. 2501.16103 null Kimi
2515 2025-01-27 Understanding Long Videos via LLM-Powered Entity Relation Graphs Meng Chu et.al. 2501.15953 null Kimi
2516 2025-01-27 Memorization and Regularization in Generative Diffusion Models Ricardo Baptista et.al. 2501.15785 link Kimi
2517 2025-01-27 Renewable Energy Prediction: A Comparative Study of Deep Learning Models for Complex Dataset Analysis Haibo Wang et.al. 2501.15731 null Kimi
2518 2025-01-26 A Benchmarking Platform for DDR4 Memory Performance in Data-Center-Class FPGAs Andrea Galimberti et.al. 2501.15582 null Kimi
2519 2025-01-26 Qwen2.5-1M Technical Report An Yang et.al. 2501.15383 null Kimi
2520 2025-01-25 ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning Shangqian Gao et.al. 2501.15316 null Kimi
2521 2025-01-24 Mean-field limit from general mixtures of experts to quantum neural networks Anderson Melchor Hernandez et.al. 2501.14660 null Kimi
2522 2025-01-24 Experimentally Evaluating the Resource Efficiency of Big Data Autoscaling Jonathan Will et.al. 2501.14456 link Kimi
2523 2025-01-24 Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains Xu Chu et.al. 2501.14431 null Kimi
2524 2025-01-24 GraphBC: Improving LLMs for Better Graph Data Processing Xu Chu et.al. 2501.14427 null Kimi
2525 2025-01-24 Hierarchical Time-Aware Mixture of Experts for Multi-Modal Sequential Recommendation Shengzhe Zhang et.al. 2501.14269 link Kimi
2526 2025-01-24 Serving Long-Context LLMs at the Mobile Edge: Test-Time Reinforcement Learning-based Model Caching and Inference Offloading Minrui Xu et.al. 2501.14205 null Kimi
2527 2025-01-23 Can We Generate Images with CoT? Let’s Verify and Reinforce Image Generation Step by Step Ziyu Guo et.al. 2501.13926 link Kimi
2528 2025-01-23 The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities Chan-Jan Hsu et.al. 2501.13921 link Kimi
2529 2025-01-23 PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection Peiyuan Zhang et.al. 2501.13898 link Kimi
2530 2025-01-23 Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models Zhenghao Lin et.al. 2501.13629 null Kimi
2531 2025-01-23 Coarse-to-Fine Process Reward Modeling for Enhanced Mathematical Reasoning Yulan Hu et.al. 2501.13622 null Kimi
2532 2025-01-23 Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge Haomiao Xiong et.al. 2501.13468 link Kimi
2533 2025-01-23 Contrast: A Hybrid Architecture of Transformers and State Space Models for Low-Level Vision Aman Urumbekov et.al. 2501.13353 null Kimi
2534 2025-01-23 QRazor: Reliable and Effortless 4-bit LLM Quantization by Significant Data Razoring Dongyoung Lee et.al. 2501.13331 null Kimi
2535 2025-01-22 Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment Melissa Kazemi Rad et.al. 2501.13080 null Kimi
2536 2025-01-22 Autonomy-of-Experts Models Ang Lv et.al. 2501.13074 null Kimi
2537 2025-01-22 Ehrenfeucht-Haussler Rank and Chain of Thought Pablo Barceló et.al. 2501.12997 null Kimi
2538 2025-01-22 LLM4WM: Adapting LLM for Wireless Multi-Tasking Xuanyu Liu et.al. 2501.12983 null Kimi
2539 2025-01-22 Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference Weizhi Fei et.al. 2501.12959 null Kimi
2540 2025-01-22 Late Breaking Result: FPGA-Based Emulation and Fault Injection for CNN Inference Accelerators Filip Masar et.al. 2501.12818 link Kimi
2541 2025-01-22 NExtLong: Toward Effective Long-Context Training without Long Documents Chaochen Gao et.al. 2501.12766 link Kimi
2542 2025-01-22 BLR-MoE: Boosted Language-Routing Mixture of Experts for Domain-Robust Multilingual E2E ASR Guodong Ma et.al. 2501.12602 null Kimi
2543 2025-01-22 Kimi k1.5: Scaling Reinforcement Learning with LLMs Kimi Team et.al. 2501.12599 null Kimi
2544 2025-01-21 Slot-BERT: Self-supervised Object Discovery in Surgical Video Guiqiu Liao et.al. 2501.12477 null Kimi
2545 2025-01-21 Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Sili Chen et.al. 2501.12375 null Kimi
2546 2025-01-21 Is Long Context All You Need? Leveraging LLM’s Extended Context for NL2SQL Yeounoh Chung et.al. 2501.12372 link Kimi
2547 2025-01-21 Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models Samira Abnar et.al. 2501.12370 null Kimi
2548 2025-01-21 CDW-CoT: Clustered Distance-Weighted Chain-of-Thoughts Reasoning Yuanheng Fang et.al. 2501.12226 null Kimi
2549 2025-01-21 Muon-specific two-Higgs-doublet model for $(g-2)_\mu$ anomaly, $W$-boson mass-shift, and Zee model I. A. Yafi et.al. 2501.12181 null Kimi
2550 2025-01-21 Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models Zihan Qiu et.al. 2501.11873 null Kimi
2551 2025-01-20 Characterization of GPU TEE Overheads in Distributed Data Parallel ML Training Jonghytun Lee et.al. 2501.11771 null Kimi
2552 2025-01-20 Early Stopping Bayesian Optimization for Controller Tuning David Stenger et.al. 2501.11532 link Kimi
2553 2025-01-20 CatV2TON: Taming Diffusion Transformers for Vision-Based Virtual Try-On with Temporal Concatenation Zheng Chong et.al. 2501.11325 link Kimi
2554 2025-01-20 RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems? Haotian Xu et.al. 2501.11284 null Kimi
2555 2025-01-17 AraXL: A Physically Scalable, Ultra-Wide RISC-V Vector Processor Design for Fast and Efficient Computation on Long Vectors Navaneeth Kunhi Purayil et.al. 2501.10301 null Kimi
2556 2025-01-17 ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario Lucen Zhong et.al. 2501.10132 link Kimi
2557 2025-01-17 Multi-Dimensional Vector ISA Extension for Mobile In-Cache Computing Alireza Khadem et.al. 2501.09902 link Kimi
2558 2025-01-16 Coded Deep Learning: Framework and Algorithm En-hui Yang et.al. 2501.09849 null Kimi
2559 2025-01-15 LeMo: Enabling LEss Token Involvement for MOre Context Fine-tuning Tuowei Wang et.al. 2501.09767 null Kimi
2560 2025-01-16 AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation Junjie He et.al. 2501.09503 link Kimi
2561 2025-01-16 PICE: A Semantic-Driven Progressive Inference System for LLM Serving in Cloud-Edge Networks Huiyou Zhan et.al. 2501.09367 null Kimi
2562 2025-01-15 Doc-Guided Sent2Sent++: A Sent2Sent++ Agent with Doc-Guided memory for Document-level Machine Translation Jiaxin Guo et.al. 2501.08523 null Kimi
2563 2025-01-14 Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models Yifu Qiu et.al. 2501.08248 null Kimi
2564 2025-01-14 PRESERVE: Prefetching Model Weights and KV-Cache in Distributed LLM Serving Ahmet Caner Yüzügüler et.al. 2501.08192 null Kimi
2565 2025-01-13 A Survey of Early Exit Deep Neural Networks in NLP Divya Jyoti Bajpai et.al. 2501.07670 null Kimi
2566 2025-01-14 Monotone Curve Estimation via Convex Duality Tongseok Lim et.al. 2501.06975 null Kimi
2567 2025-01-12 MPCache: MPC-Friendly KV Cache Eviction for Efficient Private Large Language Model Inference Wenxuan Zeng et.al. 2501.06807 null Kimi
2568 2025-01-12 Mell: Memory-Efficient Large Language Model Serving via Multi-GPU KV Cache Management Liu Qianli et.al. 2501.06709 null Kimi
2569 2025-01-11 SafeSplit: A Novel Defense Against Client-Side Backdoor Attacks in Split Learning Phillip Rieger et.al. 2501.06650 null Kimi
2570 2025-01-11 Guided Code Generation with LLMs: A Multi-Agent Framework for Complex Code Tasks Amr Almorsi et.al. 2501.06625 null Kimi
2571 2025-01-11 Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping Muru Zhang et.al. 2501.06589 link Kimi
2572 2025-01-11 Tensor Product Attention Is All You Need Yifan Zhang et.al. 2501.06425 link Kimi
2573 2025-01-10 Scale-up Unlearnable Examples Learning with High-Performance Computing Yanfan Zhu et.al. 2501.06080 link Kimi
2574 2025-01-09 Prediction-Assisted Online Distributed Deep Learning Workload Scheduling in GPU Clusters Ziyue Luo et.al. 2501.05563 null Kimi
2575 2025-01-09 LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation Xi Ye et.al. 2501.05414 null Kimi
2576 2025-01-09 Euclid: Detecting Solar System objects in Euclid images and classifying them using Kohonen self-organising maps A. A. Nucita et.al. 2501.05023 null Kimi
2577 2025-01-09 SyNPar: Synthetic Null Data Parallelism for High-Power False Discovery Rate Control in High-Dimensional Variable Selection Changhu Wang et.al. 2501.05012 null Kimi
2578 2025-01-09 TreeKV: Smooth Key-Value Cache Compression with Tree Structures Ziwei He et.al. 2501.04987 null Kimi
2579 2025-01-08 Collaborative Inference Acceleration with Non-Penetrative Tensor Partitioning Zhibang Liu et.al. 2501.04489 null Kimi
2580 2025-01-06 The Power of Negative Zero: Datatype Customization for Quantized Large Language Models Yuzong Chen et.al. 2501.04052 link Kimi
2581 2025-01-07 CoReQA: Uncovering Potentials of Language Models in Code Repository Question Answering Jialiang Chen et.al. 2501.03447 null Kimi
2582 2025-01-05 PTEENet: Post-Trained Early-Exit Neural Networks Augmentation for Inference Cost Optimization Assaf Lahiany et.al. 2501.02508 null Kimi
2583 2025-01-07 ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling Chaojie Mao et.al. 2501.02487 null Kimi
2584 2025-01-04 AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference Zhuomin He et.al. 2501.02336 link Kimi
2585 2025-01-04 The Efficiency vs. Accuracy Trade-off: Optimizing RAG-Enhanced LLM Recommender Systems Using Multi-Head Early Exit Huixue Zhou et.al. 2501.02173 null Kimi
2586 2025-01-03 Efficient LLM Inference with Activation Checkpointing and Hybrid Caching Sanghyeon Lee et.al. 2501.01792 null Kimi
2587 2025-01-03 Data Parallel Visualization and Rendering on the RAMSES Supercomputer with ANARI Stefan Zellmann et.al. 2501.01628 null Kimi
2588 2025-01-02 TreeLUT: An Efficient Alternative to Deep Neural Networks for Inference Acceleration Using Gradient Boosted Decision Trees Alireza Khataei et.al. 2501.01511 link Kimi
2589 2025-01-02 FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving Zihao Ye et.al. 2501.01005 link Kimi
2590 2025-01-01 Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding Jiajun Zhu et.al. 2501.00712 link Kimi
2591 2025-01-01 Adjoint sharding for very long context training of state space models Xingzi Xu et.al. 2501.00692 null Kimi
2592 2024-12-31 Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing Peihao Wang et.al. 2501.00658 link Kimi
2593 2024-12-31 A Study on Context Length and Efficient Transformers for Biomedical Image Analysis Sarah M. Hooper et.al. 2501.00619 null Kimi
2594 2024-12-31 VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling Xinhao Li et.al. 2501.00574 link Kimi
2595 2024-12-30 CaseSumm: A Large-Scale Dataset for Long-Context Summarization from U.S. Supreme Court Opinions Mourad Heddaya et.al. 2501.00097 null Kimi
2596 2024-12-30 Adaptive Batch Size Schedules for Distributed Training of Language Models with Data and Model Parallelism Tim Tsz-Kit Lau et.al. 2412.21124 null Kimi
2597 2024-12-30 Align Attention Heads Before Merging Them: An Effective Way for Converting MHA to GQA Qingyun Jin et.al. 2412.20677 null Kimi
2598 2024-12-29 ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding Xiao Wang et.al. 2412.20504 link Kimi
2599 2024-12-29 TokenRing: An Efficient Parallelism Framework for Infinite-Context LLMs via Bidirectional Communication Zongwu Wang et.al. 2412.20501 link Kimi
2600 2024-12-29 NeutronTP: Load-Balanced Distributed Full-Graph GNN Training with Tensor Parallelism Xin Ai et.al. 2412.20379 null Kimi
2601 2024-12-28 LoL-PIM: Long-Context LLM Decoding with Scalable DRAM-PIM System Hyucksung Kwon et.al. 2412.20166 null Kimi
2602 2024-12-28 ST$^3$: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming Jiedong Zhuang et.al. 2412.20105 null Kimi
2603 2024-12-27 Goal-oriented Communications based on Recursive Early Exit Neural Networks Jary Pomponi et.al. 2412.19587 null Kimi
2604 2024-12-27 StyleRWKV: High-Quality and High-Efficiency Style Transfer with RWKV-like Architecture Miaomiao Dai et.al. 2412.19535 null Kimi
2605 2025-01-02 A Survey on Large Language Model Acceleration based on KV Cache Management Haoyang Li et.al. 2412.19442 link Kimi
2606 2024-12-26 Performance Control in Early Exiting to Deploy Large Models at the Same Cost of Smaller Ones Mehrnaz Mofakhami et.al. 2412.19325 null Kimi
2607 2024-12-26 Multi-matrix Factorization Attention Jingcheng Hu et.al. 2412.19255 null Kimi
2608 2024-12-26 Repository Structure-Aware Training Makes SLMs Better Issue Resolver Zexiong Ma et.al. 2412.19031 null Kimi
2609 2024-12-25 Long-Range Tasks Using Short-Context LLMs: Incremental Reasoning With Structured Memories Dulhan Jayalath et.al. 2412.18914 null Kimi
2610 2024-12-25 Bootstrap Your Own Context Length Liang Wang et.al. 2412.18860 null Kimi
2611 2024-12-25 DCIS: Efficient Length Extrapolation of LLMs via Divide-and-Conquer Scaling Factor Search Lei Yang et.al. 2412.18811 link Kimi
2612 2024-12-24 Efficient Long Context Language Model Retrieval with Compression Minju Seo et.al. 2412.18232 null Kimi
2613 2024-12-24 Adapter Merging with Centroid Prototype Mapping for Scalable Class-Incremental Learning Takuma Fukuda et.al. 2412.18219 link Kimi
2614 2024-12-24 KunServe: Elastic and Efficient Large Language Model Serving with Parameter-centric Memory Management Rongxin Cheng et.al. 2412.18169 null Kimi
2615 2024-12-24 Beyond Gradient Averaging in Parallel Optimization: Improved Robustness through Gradient Agreement Filtering Francois Chaubard et.al. 2412.18052 link Kimi
2616 2024-12-23 Theoretical Constraints on the Expressive Power of $\mathsf{RoPE}$-based Tensor Attention Transformers Xiaoyu Li et.al. 2412.18040 null Kimi
2617 2024-12-23 Deliberation in Latent Space via Differentiable Cache Augmentation Luyang Liu et.al. 2412.17747 null Kimi
2618 2024-12-24 YuLan-Mini: An Open Data-efficient Language Model Yiwen Hu et.al. 2412.17743 link Kimi
2619 2024-12-23 Improved Cotton Leaf Disease Classification Using Parameter-Efficient Deep Learning Framework Aswini Kumar Patra et.al. 2412.17587 null Kimi
2620 2024-12-23 Optimal Convergence Rates for Neural Operators Mike Nguyen et.al. 2412.17518 null Kimi
2621 2024-12-23 A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression Chenlong Deng et.al. 2412.17483 null Kimi
2622 2024-12-23 MineAgent: Towards Remote-Sensing Mineral Exploration with Multimodal Large Language Models Beibei Yu et.al. 2412.17339 null Kimi
2623 2024-12-22 Revisiting In-Context Learning with Long Context Language Models Jinheon Baek et.al. 2412.16926 null Kimi
2624 2024-12-20 A survey on FPGA-based accelerator for ML models Feng Yan et.al. 2412.15666 null Kimi
2625 2024-12-20 Don’t Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks Brian J Chan et.al. 2412.15605 link Kimi
2626 2024-12-19 Systematic Evaluation of Long-Context LLMs on Financial Concepts Lavanya Gupta et.al. 2412.15386 null Kimi
2627 2024-12-19 LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks Yushi Bai et.al. 2412.15204 link Kimi
2628 2024-12-19 Minimizing speculation overhead in a parallel recognizer for regular texts Angelo Borsotti et.al. 2412.14975 null Kimi
2629 2024-12-19 DynamicKV: Task-Aware Adaptive KV Cache Compression for Long Context LLMs Xiabin Zhou et.al. 2412.14838 null Kimi
2630 2024-12-19 Sliding Windows Are Not the End: Exploring Full Ranking with Long-Context Large Language Models Wenhan Liu et.al. 2412.14574 link Kimi
2631 2024-12-19 HashAttention: Semantic Sparsity for Faster Inference Aditya Desai et.al. 2412.14468 null Kimi
2632 2024-12-18 Scaling Deep Learning Training with MPMD Pipeline Parallelism Anxhelo Xhebraj et.al. 2412.14374 null Kimi
2633 2024-12-18 ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals Utkarsh Saxena et.al. 2412.14363 link Kimi
2634 2024-12-18 State Space Models are Strong Text Rerankers Zhichao Xu et.al. 2412.14354 null Kimi
2635 2024-12-19 Online MDP with Transition Prototypes: A Robust Adaptive Approach Shuo Sun et.al. 2412.14075 null Kimi
2636 2024-12-19 Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Benjamin Warner et.al. 2412.13663 link Kimi
2637 2024-12-18 SCOPE: Optimizing Key-Value Cache Compression in Long-context Generation Jialong Wu et.al. 2412.13649 link Kimi
2638 2024-12-18 LIFT: Improving Long Context Understanding Through Long Input Fine-Tuning Yansheng Mao et.al. 2412.13626 null Kimi
2639 2024-12-18 Attention-aware convolutional neural networks for identification of magnetic islands in the tearing mode on EAST tokamak Feifei Long et.al. 2412.13498 null Kimi
2640 2024-12-18 Deploying Foundation Model Powered Agent Services: A Survey Wenchao Xu et.al. 2412.13437 null Kimi
2641 2024-12-17 COSEE: Consistency-Oriented Signal-Based Early Exiting via Calibrated Sample Weighting Mechanism Jianing He et.al. 2412.13236 link Kimi
2642 2024-12-17 GIRAFFE: Design Choices for Extending the Context Length of Visual Language Models Mukai Li et.al. 2412.12735 link Kimi
2643 2024-12-17 More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression Jiebin Zhang et.al. 2412.12706 null Kimi
2644 2024-12-17 LLMs are Also Effective Embedding Models: An In-depth Overview Chongyang Tao et.al. 2412.12591 null Kimi
2645 2024-12-17 PerSphere: A Comprehensive Framework for Multi-Faceted Perspective Retrieval and Summarization Yun Luo et.al. 2412.12588 link Kimi
2646 2024-12-17 ITP: Instance-Aware Test Pruning for Out-of-Distribution Detection Haonan Xu et.al. 2412.12566 link Kimi
2647 2024-12-17 A System for Microserving of LLMs Hongyi Jin et.al. 2412.12488 null Kimi
2648 2024-12-17 Boosting Long-Context Information Seeking via Query-Guided Activation Refilling Hongjin Qian et.al. 2412.12486 link Kimi
2649 2024-12-17 Core Context Aware Attention for Long Context Language Modeling Yaofo Chen et.al. 2412.12465 null Kimi
2650 2024-12-17 SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator Guoxuan Chen et.al. 2412.12094 link Kimi
2651 2024-12-16 SpeechPrune: Context-aware Token Pruning for Speech Information Retrieval Yueqian Lin et.al. 2412.12009 link Kimi
2652 2024-12-16 EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents Mengna Zhu et.al. 2412.11814 null Kimi
2653 2024-12-16 CSR: Achieving 1 Bit Key-Value Cache via Sparse Representation Hongxuan Zhang et.al. 2412.11741 null Kimi
2654 2024-12-16 Ultra-High-Definition Dynamic Multi-Exposure Image Fusion via Infinite Pixel Learning Xingchi Chen et.al. 2412.11685 null Kimi
2655 2024-12-16 On the SDP Relaxation of Direct Torque Finite Control Set Model Predictive Control Luca M. Hartmann et.al. 2412.11666 null Kimi
2656 2024-12-16 FinLoRA: Finetuning Quantized Financial Large Language Models Using Low-Rank Adaptation Dannong Wang et.al. 2412.11378 link Kimi
2657 2024-12-15 Timing of Seven Isolated Pulsars in the Globular Cluster Terzan 1 Justine Singleton et.al. 2412.11271 null Kimi
2658 2024-12-15 Wasserstein Bounds for generative diffusion models with Gaussian tail targets Xixian Wang et.al. 2412.11251 null Kimi
2659 2024-12-15 ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction Yi Feng et.al. 2412.11210 link Kimi
2660 2024-12-13 SCBench: A KV Cache-Centric Analysis of Long-Context Methods Yucheng Li et.al. 2412.10319 null Kimi
2661 2024-12-13 Lost in the Middle, and In-Between: Enhancing Language Models’ Ability to Reason Over Long Contexts in Multi-Hop QA George Arthur Baker et.al. 2412.10079 link Kimi
2662 2024-12-13 Benchmarking Table Comprehension In The Wild Yikang Pan et.al. 2412.09884 null Kimi
2663 2024-12-13 V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding Junqi Ge et.al. 2412.09616 link Kimi
2664 2024-12-12 InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions Pan Zhang et.al. 2412.09596 link Kimi
2665 2024-12-12 InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption Tiehan Fan et.al. 2412.09283 null Kimi
2666 2024-12-12 ZigZagkv: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty Meizhi Zhong et.al. 2412.09036 null Kimi
2667 2024-12-12 RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios Ruiwen Zhou et.al. 2412.08972 link Kimi
2668 2024-12-12 Lexico: Extreme KV Cache Compression via Sparse Coding over Universal Dictionaries Junhyuck Kim et.al. 2412.08890 link Kimi
2669 2024-12-11 TURBOATTENTION: Efficient Attention Approximation For High Throughputs LLMs Hao Kang et.al. 2412.08585 null Kimi
2670 2024-12-11 EMS: Adaptive Evict-then-Merge Strategy for Head-wise KV Cache Compression Based on Global-Local Importance Yingxin Li et.al. 2412.08521 null Kimi
2671 2024-12-10 From Slow Bidirectional to Fast Causal Video Generators Tianwei Yin et.al. 2412.07772 null Kimi
2672 2024-12-10 ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer Jinyi Hu et.al. 2412.07720 link Kimi
2673 2024-12-09 FP=xINT: A Low-Bit Series Expansion Algorithm for Post-Training Quantization Boyang Zhang et.al. 2412.06865 null Kimi
2674 2024-12-09 Pruning All-Rounder: Rethinking and Improving Inference Efficiency for Large Vision Language Models Wei Suo et.al. 2412.06458 null Kimi
2675 2024-12-08 BiDM: Pushing the Limit of Quantization for Diffusion Models Xingyu Zheng et.al. 2412.05926 link Kimi
2676 2024-12-08 XKV: Personalized KV Cache Memory Reduction for Long-Context LLM Inference Weizhuo Li et.al. 2412.05896 null Kimi
2677 2024-12-07 Batch-Max: Higher LLM Throughput using Larger Batch Sizes and KV Cache Compression Michael R. Metel et.al. 2412.05693 null Kimi
2678 2024-12-11 Flash Communication: Reducing Tensor Parallelization Bottleneck for Fast Large Language Model Inference Qingyuan Li et.al. 2412.04964 null Kimi
2679 2024-12-06 GUIDE: A Global Unified Inference Engine for Deploying Large Language Models in Heterogeneous Environments Yanyu Chen et.al. 2412.04788 null Kimi
2680 2024-12-05 Cross-Self KV Cache Pruning for Efficient Vision-Language Inference Xiaohuan Pei et.al. 2412.04652 link Kimi
2681 2024-12-05 votess: A multi-target, GPU-capable, parallel Voronoi tessellator C. Byrohl et.al. 2412.04514 link Kimi
2682 2024-12-05 p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay Jun Zhang et.al. 2412.04449 link Kimi
2683 2024-12-07 PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation Ao Wang et.al. 2412.03409 link Kimi
2684 2024-12-04 ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression Guangda Liu et.al. 2412.03213 link Kimi
2685 2024-12-04 Unifying KV Cache Compression for Large Language Models with LeanKV Yanqi Zhang et.al. 2412.03131 null Kimi
2686 2024-12-04 Lightweight Multiplane Images Network for Real-Time Stereoscopic Conversion from Planar Video Shanding Diao et.al. 2412.03102 null Kimi
2687 2024-12-03 Resource-Adaptive Successive Doubling for Hyperparameter Optimization with Large Datasets on High-Performance Computing Systems Marcel Aach et.al. 2412.02729 link Kimi
2688 2024-12-03 Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity Da Ma et.al. 2412.02252 null Kimi
2689 2024-12-02 RandAR: Decoder-only Autoregressive Visual Generation in Random Orders Ziqi Pang et.al. 2412.01827 null Kimi
2690 2024-12-05 Yi-Lightning Technical Report 01.AI et.al. 2412.01253 null Kimi
2691 2024-12-02 INTELLECT-1 Technical Report Sami Jaghouar et.al. 2412.01152 link Kimi
2692 2024-12-03 Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification Wenxuan Huang et.al. 2412.00876 link Kimi
2693 2024-12-01 MERLIN: Multi-stagE query performance prediction for dynamic paRallel oLap pIpeliNe Kaixin Zhang et.al. 2412.00749 null Kimi
2694 2024-11-29 DeMo: Decoupled Momentum Optimization Bowen Peng et.al. 2411.19870 link Kimi
2695 2024-11-27 FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model Serving Ao Shen et.al. 2411.18424 null Kimi
2696 2024-11-28 MiniKV: Pushing the Limits of LLM Inference via 2-Bit Layer-Discriminative KV Cache Akshat Sharma et.al. 2411.18077 null Kimi
2697 2024-11-27 Addressing Architectural Obstacles for Overlay with Stream Network Abstraction Chengyue Wang et.al. 2411.17966 null Kimi
2698 2024-11-26 Attamba: Attending To Multi-Token States Yash Akhauri et.al. 2411.17685 link Kimi
2699 2024-11-26 Toward High-Performance LLM Serving: A Simulation-Based Approach for Identifying Optimal Parallelism Yi-Chien Lin et.al. 2411.17651 link Kimi
2700 2024-11-26 Efficient LLM Inference with I/O-Aware Partial KV Cache Recomputation Chaoyi Jiang et.al. 2411.17089 link Kimi
2701 2024-11-25 Lion Cub: Minimizing Communication Overhead in Distributed Lion Satoki Ishikawa et.al. 2411.16462 null Kimi
2702 2024-11-24 Hiding Communication Cost in Distributed LLM Training via Micro-batch Co-execution Haiquan Wang et.al. 2411.15871 null Kimi
2703 2024-11-27 A Method for Building Large Language Models with Predefined KV Cache Capacity Zhonghua Yi et.al. 2411.15785 null Kimi
2704 2024-11-22 DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models Keda Tao et.al. 2411.15024 link Kimi
2705 2024-11-21 Functional Array Programming in an Extended Pi-Calculus Hans Hüttel et.al. 2411.14579 null Kimi
2706 2024-11-22 Quantization without Tears Minghao Fu et.al. 2411.13918 link Kimi
2707 2024-11-19 Faster Multi-GPU Training with PPLL: A Pipeline Parallelism Framework Leveraging Local Learning Xiuyuan Guo et.al. 2411.12780 null Kimi
2708 2024-11-18 Parsing Millions of DNS Records per Second Jeroen Koekkoek et.al. 2411.12035 link Kimi
2709 2024-11-17 SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration Jintao Zhang et.al. 2411.10958 link Kimi
2710 2024-11-16 Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model Ting Liu et.al. 2411.10803 link Kimi
2711 2024-11-15 SmoothCache: A Universal Inference Acceleration Technique for Diffusion Transformers Joseph Liu et.al. 2411.10510 link Kimi
2712 2024-11-14 Squeezed Attention: Accelerating Long Context Length LLM Inference Coleman Hooper et.al. 2411.09688 link Kimi
2713 2024-11-15 Communication Compression for Tensor Parallel LLM Inference Jan Hansen-Palmus et.al. 2411.09510 null Kimi
2714 2024-11-12 Towards Low-bit Communication for Tensor Parallel LLM Inference Harry Dong et.al. 2411.07942 null Kimi
2715 2024-11-11 Anchor Attention, Small Cache: Code Generation with Large Language Models Xiangyu Zhang et.al. 2411.06680 link Kimi
2716 2024-11-10 Accelerating Large Language Model Training with 4D Parallelism and Memory Consumption Estimator Kazuki Fujii et.al. 2411.06465 null Kimi
2717 2024-11-08 Balancing Pipeline Parallelism with Vocabulary Parallelism Man Tsung Yeung et.al. 2411.05288 link Kimi
2718 2024-11-07 BitNet a4.8: 4-bit Activations for 1-bit LLMs Hongyu Wang et.al. 2411.04965 null Kimi
2719 2024-11-06 Stepping Forward on the Last Mile Chen Feng et.al. 2411.04036 null Kimi
2720 2024-11-05 TokenSelect: Efficient Long-Context Inference and Length Extrapolation for LLMs via Dynamic Token-Level KV Cache Selection Wei Wu et.al. 2411.02886 null Kimi
2721 2024-11-05 DroidSpeak: Enhancing Cross-LLM Communication Yuhan Liu et.al. 2411.02820 null Kimi
2722 2024-11-04 “Give Me BF16 or Give Me Death”? Accuracy-Performance Trade-Offs in LLM Quantization Eldar Kurtic et.al. 2411.02355 null Kimi
2723 2024-11-04 Real-time and Downtime-tolerant Fault Diagnosis for Railway Turnout Machines (RTMs) Empowered with Cloud-Edge Pipeline Parallelism Fan Wu et.al. 2411.02086 null Kimi
2724 2024-11-04 xDiT: an Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism Jiarui Fang et.al. 2411.01738 link Kimi
2725 2024-11-02 NEO: Saving GPU Memory Crisis with CPU Offloading for Online LLM Inference Xuanlin Jiang et.al. 2411.01142 null Kimi
2726 2024-11-01 MoNTA: Accelerating Mixture-of-Experts Training with Network-Traffic-Aware Parallel Optimization Jingming Guo et.al. 2411.00662 link Kimi
2727 2024-11-01 Constrained Diffusion Implicit Models Vivek Jayaram et.al. 2411.00359 null Kimi
2728 2024-11-05 SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compile Ruisi Zhang et.al. 2411.00284 null Kimi
2729 2024-10-31 Neurobench: DCASE 2020 Acoustic Scene Classification benchmark on XyloAudio 2 Weijie Ke et.al. 2410.23776 null Kimi
2730 2024-10-31 ALISE: Accelerating Large Language Model Serving with Speculative Scheduling Youpeng Zhao et.al. 2410.23537 null Kimi
2731 2024-10-29 VL-Cache: Sparsity and Modality-Aware KV Cache Compression for Vision-Language Model Inference Acceleration Dezhan Tu et.al. 2410.23317 null Kimi
2732 2024-10-30 BUZZ: Beehive-structured Sparse KV Cache with Segmented Heavy Hitters for Efficient LLM Inference Junqi Zhao et.al. 2410.23079 link Kimi
2733 2024-10-29 The Impact of Inference Acceleration Strategies on Bias of LLMs Elisabeth Kirsten et.al. 2410.22118 link Kimi
2734 2024-10-29 How Does Critical Batch Size Scale in Pre-training? Hanlin Zhang et.al. 2410.21676 link Kimi
2735 2024-10-28 ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference Hanshi Sun et.al. 2410.21465 link Kimi
2736 2024-10-28 Meta-Learning for Speeding Up Large Model Inference in Decentralized Environments Yuzhe Yang et.al. 2410.21340 null Kimi
2737 2024-10-28 Beyond Autoregression: Fast LLMs via Self-Distillation Through Time Justin Deschenaux et.al. 2410.21035 link Kimi
2738 2024-10-26 DQRM: Deep Quantized Recommendation Models Yang Zhou et.al. 2410.20046 link Kimi
2739 2024-10-25 RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction Tanqiu Jiang et.al. 2410.19937 null Kimi
2740 2024-10-25 BitPipe: Bidirectional Interleaved Pipeline Parallelism for Accelerating Large Models Training Houming Wu et.al. 2410.19367 link Kimi
2741 2024-10-28 Not All Heads Matter: A Head-Level KV Cache Compression Method with Integrated Retrieval and Reasoning Yu Fu et.al. 2410.19258 link Kimi
2742 2024-10-24 KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing Yifei Yang et.al. 2410.18517 link Kimi
2743 2024-10-24 The Nature of Mathematical Modeling and Probabilistic Optimization Engineering in Generative AI Fulu Li et.al. 2410.18441 null Kimi
2744 2024-10-25 Fast Inference for Augmented Large Language Models Rana Shahout et.al. 2410.18248 null Kimi
2745 2024-10-23 Value Residual Learning For Alleviating Attention Concentration In Transformers Zhanchao Zhou et.al. 2410.17897 link Kimi
2746 2024-10-23 Markov Chain of Thought for Efficient Mathematical Reasoning Wen Yang et.al. 2410.17635 null Kimi
2747 2024-10-22 PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction Long Xing et.al. 2410.17247 link Kimi
2748 2024-10-21 MagicPIG: LSH Sampling for Efficient LLM Generation Zhuoming Chen et.al. 2410.16179 link Kimi
2749 2024-10-21 Residual vector quantization for KV cache compression in large language model Ankur Kumar et.al. 2410.15704 link Kimi
2750 2024-10-20 SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training Jinda Jia et.al. 2410.15526 link Kimi
2751 2024-10-20 EPIC: Efficient Position-Independent Context Caching for Serving Large Language Models Junhao Hu et.al. 2410.15332 null Kimi
2752 2024-10-20 Lossless KV Cache Compression to 2% Zhen Yang et.al. 2410.15252 null Kimi
2753 2024-10-19 Pipeline Gradient-based Model Training on Analog In-memory Accelerators Zhaoxian Wu et.al. 2410.15155 link Kimi
2754 2024-10-18 A Systematic Study of Cross-Layer KV Sharing for Efficient LLM Inference You Wu et.al. 2410.14442 link Kimi
2755 2024-10-23 TiMePReSt: Time and Memory Efficient Pipeline Parallel DNN Training with Removed Staleness Ankita Dutta et.al. 2410.14312 null Kimi
2756 2024-10-17 SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction Xuan Zhang et.al. 2410.13846 link Kimi
2757 2024-10-17 AsymKV: Enabling 1-Bit Quantization of KV Cache with Layer-Wise Asymmetric Quantization Configurations Qian Tao et.al. 2410.13212 null Kimi
2758 2024-10-19 In-context KV-Cache Eviction for LLMs via Attention-Gate Zihao Zeng et.al. 2410.12876 null Kimi
2759 2024-10-16 FiRST: Finetuning Router-Selective Transformers for Input-Adaptive Latency Reduction Akriti Jain et.al. 2410.12513 null Kimi
2760 2024-10-16 COMET: Towards Practical W4A4KV4 LLMs Serving Lian Liu et.al. 2410.12168 null Kimi
2761 2024-10-15 From promise to practice: realizing high-performance decentralized training Zesen Wang et.al. 2410.11998 null Kimi
2762 2024-10-15 QSpec: Speculative Decoding with Complementary Quantization Schemes Juntao Zhao et.al. 2410.11305 null Kimi
2763 2024-10-14 DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads Guangxuan Xiao et.al. 2410.10819 link Kimi
2764 2024-10-14 When Attention Sink Emerges in Language Models: An Empirical View Xiangming Gu et.al. 2410.10781 link Kimi
2765 2024-10-14 Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling Wenze Liu et.al. 2410.10511 link Kimi
2766 2024-10-15 EasyRAG: Efficient Retrieval-Augmented Generation Framework for Automated Network Operations Zhangchi Feng et.al. 2410.10315 link Kimi
2767 2024-10-11 ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression Yefei He et.al. 2410.08584 null Kimi
2768 2024-10-10 KV Prediction for Improved Time to First Token Maxwell Horton et.al. 2410.08391 link Kimi
2769 2024-10-10 TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text Songshuo Lu et.al. 2410.07590 link Kimi
2770 2024-10-09 SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration Heming Xia et.al. 2410.06916 link Kimi
2771 2024-10-07 PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs Mengzhao Chen et.al. 2410.05265 link Kimi
2772 2024-10-07 Presto! Distilling Steps and Layers for Accelerating Music Generation Zachary Novack et.al. 2410.05167 null Kimi
2773 2024-10-07 TidalDecode: Fast and Accurate LLM Decoding with Position Persistent Sparse Attention Lijie Yang et.al. 2410.05076 link Kimi
2774 2024-10-07 Fast State Restoration in LLM Serving with HCache Shiwei Gao et.al. 2410.05004 null Kimi
2775 2024-10-06 Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective Jinhao Li et.al. 2410.04466 link Kimi
2776 2024-10-04 SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation Aurick Qiao et.al. 2410.03960 null Kimi
2777 2024-10-04 LoRC: Low-Rank Compression for LLMs KV Cache with a Progressive Compression Strategy Rongzhi Zhang et.al. 2410.03111 null Kimi
2778 2024-10-04 UNComp: Uncertainty-Aware Long-Context Compressor for Efficient Large Language Model Inference Jing Xiong et.al. 2410.03090 null Kimi
2779 2024-10-09 LEGO: QEC Decoding System Architecture for Dynamic Circuits Yue Wu et.al. 2410.03073 null Kimi
2780 2024-10-04 Compute Or Load KV Cache? Why Not Both? Shuowei Jin et.al. 2410.03065 null Kimi
2781 2024-10-03 EinDecomp: Decomposition of Declaratively-Specified Machine Learning and Numerical Computations for Parallel Execution Daniel Bourgeois et.al. 2410.02682 null Kimi
2782 2024-10-03 SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration Jintao Zhang et.al. 2410.02367 link Kimi
2783 2024-10-02 Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads Yuxiang Huang et.al. 2410.01805 link Kimi
2784 2024-10-02 InfiniPot: Infinite Context Processing on Memory-Constrained LLMs Minsoo Kim et.al. 2410.01518 null Kimi
2785 2024-10-02 A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts Suyu Ge et.al. 2410.01485 null Kimi
2786 2024-10-01 Developing a BLAS library for the AMD AI Engine Tristan Laan et.al. 2410.00825 null Kimi
2787 2024-10-01 TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices Zonghang Li et.al. 2410.00531 link Kimi
2788 2024-10-01 LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management Yi Xiong et.al. 2410.00428 null Kimi
2789 2024-09-30 KV-Compress: Paged KV-Cache Compression with Variable Compression Rates per Attention Head Isaac Rehg et.al. 2410.00161 link Kimi
2790 2024-09-30 The Early Bird Catches the Leak: Unveiling Timing Side Channels in LLM Serving Systems Linke Song et.al. 2409.20002 null Kimi
2791 2024-09-27 Toward Greener Matrix Operations by Lossless Compressed Formats Francesco Tosoni et.al. 2409.18620 link Kimi
2792 2024-09-26 Efficient Arbitrary Precision Acceleration for Large Language Models on GPU Tensor Cores Shaobo Ma et.al. 2409.17870 null Kimi
2793 2024-09-25 Search for Efficient Large Language Models Xuan Shen et.al. 2409.17372 link Kimi
2794 2024-09-25 Mnemosyne: Parallelization Strategies for Efficiently Serving Multi-Million Context Length LLM Inference Requests Without Approximations Amey Agrawal et.al. 2409.17264 null Kimi
2795 2024-09-25 AlignedKV: Reducing Memory Access of KV-Cache with Precision-Aligned Quantization Yifan Tan et.al. 2409.16546 link Kimi
2796 2024-09-25 A QoE-Aware Split Inference Accelerating Algorithm for NOMA-based Edge Intelligence Xin Yuan et.al. 2409.16537 null Kimi
2797 2024-09-23 CSPS: A Communication-Efficient Sequence-Parallelism based Serving System for Transformer based Models with Long Prompts Zeyu Zhang et.al. 2409.15104 null Kimi
2798 2024-09-23 Inference-Friendly Models With MixAttention Shashank Rajput et.al. 2409.15012 null Kimi
2799 2024-09-23 Mutation-Based Deep Learning Framework Testing Method in JavaScript Environment Yinglong Zou et.al. 2409.14968 null Kimi
2800 2024-09-16 Do Large Language Models Need a Content Delivery Network? Yihua Cheng et.al. 2409.13761 link Kimi
2801 2024-09-20 Time Distributed Deep Learning models for Purely Exogenous Forecasting. Application to Water Table Depth Prediction using Weather Image Time Series Matteo Salis et.al. 2409.13284 null Kimi
2802 2024-09-23 CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs Junlin Lv et.al. 2409.12490 link Kimi
2803 2024-09-04 ISO: Overlap of Computation and Communication within Seqenence For LLM Inference Bin Xiao et.al. 2409.11155 null Kimi
2804 2024-09-17 KVPruner: Structural Pruning for Faster and Memory-Efficient Large Language Models Bo Lv et.al. 2409.11057 null Kimi
2805 2024-09-21 CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios Luning Wang et.al. 2409.10593 link Kimi
2806 2024-09-14 A Dynamic Weighting Strategy to Mitigate Worker Node Failure in Distributed Deep Learning Yuesheng Xu et.al. 2409.09242 null Kimi
2807 2024-09-11 Inf-MLLM: Efficient Streaming Inference of Multimodal Large Language Models on a Single GPU Zhenyu Ning et.al. 2409.09086 null Kimi
2808 2024-09-13 SGFormer: Single-Layer Graph Transformers with Approximation-Free Linear Complexity Qitian Wu et.al. 2409.09007 link Kimi
2809 2024-09-11 Learning to Compress Contexts for Efficient Knowledge-based Visual Question Answering Weixi Weng et.al. 2409.07331 null Kimi
2810 2024-09-11 FreeRide: Harvesting Bubbles in Pipeline Parallelism Jiashu Zhang et.al. 2409.06941 null Kimi
2811 2024-09-09 DFabric: Scaling Out Data Parallel Applications with CXL-Ethernet Hybrid Interconnects Xu Zhang et.al. 2409.05404 null Kimi
2812 2024-09-08 InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference Xiurui Pan et.al. 2409.04992 null Kimi
2813 2024-09-04 Accelerating Large Language Model Training with Hybrid GPU-based Compression Lang Xu et.al. 2409.02423 null Kimi
2814 2024-09-03 Contemporary Model Compression on Large Language Models Inference Dong Liu et.al. 2409.01990 link Kimi
2815 2024-09-03 On-chain Validation of Tracking Data Messages (TDM) Using Distributed Deep Learning on a Proof of Stake (PoS) Blockchain Yasir Latif et.al. 2409.01614 null Kimi
2816 2024-09-02 LuWu: An End-to-End In-Network Out-of-Core Optimizer for 100B-Scale Model-in-Network Data-Parallel Training on Distributed GPUs Mo Sun et.al. 2409.00918 null Kimi
2817 2024-08-26 Model Parallel Training and Transfer Learning for Convolutional Neural Networks by Domain Decomposition Axel Klawonn et.al. 2408.14442 null Kimi
2818 2024-08-23 Network-Offloaded Bandwidth-Optimal Broadcast and Allgather for Distributed AI Mikhail Khalilov et.al. 2408.13356 null Kimi
2819 2024-08-22 LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with Inference Acceleration via Latent Consistency Distillation Shihao Chen et.al. 2408.12354 null Kimi
2820 2024-08-23 MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding Jian Chen et.al. 2408.11049 link Kimi
2821 2024-08-20 Security Assessment of Hierarchical Federated Deep Learning D Alqattan et.al. 2408.10752 link Kimi
2822 2024-08-20 Pluto and Charon: A Time and Memory Efficient Collaborative Edge AI Framework for Personal LLMs Fine-Tuning Bei Ouyang et.al. 2408.10746 null Kimi
2823 2024-08-21 LongVILA: Scaling Long-Context Visual Language Models for Long Videos Fuzhao Xue et.al. 2408.10188 link Kimi
2824 2024-08-17 RepControlNet: ControlNet Reparameterization Zhaoli Deng et.al. 2408.09240 null Kimi
2825 2024-08-17 Atlas: Hierarchical Partitioning for Quantum Circuit Simulation on GPUs (Extended Version) Mingkuan Xu et.al. 2408.09055 null Kimi
2826 2024-08-23 ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models Chao Zeng et.al. 2408.08554 link Kimi
2827 2024-08-16 Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models Jerry Huang et.al. 2408.08470 null Kimi
2828 2024-08-15 Asteroid: Resource-Efficient Hybrid Pipeline Parallelism for Collaborative DNN Training on Heterogeneous Edge Devices Shengyuan Ye et.al. 2408.08015 null Kimi
2829 2024-08-17 Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference Rohan Baskar Prabhakar et.al. 2408.07802 null Kimi
2830 2024-08-18 Post-Training Sparse Attention with Double Sparsity Shuo Yang et.al. 2408.07092 link Kimi
2831 2024-08-12 LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration Zhiwen Mo et.al. 2408.06003 null Kimi
2832 2024-08-10 Eigen Attention: Attention in Low-Rank Space for KV Cache Compression Utkarsh Saxena et.al. 2408.05646 link Kimi
2833 2024-08-05 SLO-aware GPU Frequency Scaling for Energy Efficient LLM Inference Serving Andreas Kosmas Kakolyris et.al. 2408.05235 null Kimi
2834 2024-08-08 Partial Experts Checkpoint: Efficient Fault Tolerance for Sparse Mixture-of-Experts Model Training Weilin Cai et.al. 2408.04307 null Kimi
2835 2024-08-07 Zero-Delay QKV Compression for Mitigating KV Cache and Network Bottlenecks in LLM Inference Zeyu Zhang et.al. 2408.04107 null Kimi
2836 2024-08-08 NACL: A General and Effective KV Cache Eviction Framework for LLMs at Inference Time Yilong Chen et.al. 2408.03675 link Kimi
2837 2024-08-04 Cross-layer Attention Sharing for Large Language Models Yongyu Mu et.al. 2408.01890 null Kimi
2838 2024-08-01 Intermittent Semi-working Mask: A New Masking Paradigm for LLMs Mingcong Lu et.al. 2408.00539 null Kimi
2839 2024-08-13 Finch: Prompt-guided Key-Value Cache Compression Giulio Corallo et.al. 2408.00167 null Kimi
2840 2024-07-31 EdgeLLM: A Highly Efficient CPU-FPGA Heterogeneous Edge Accelerator for Large Language Models Mingqiang Huang et.al. 2407.21325 null Kimi
2841 2024-07-30 Palu: Compressing KV-Cache with Low-Rank Projection Chi-Chih Chang et.al. 2407.21118 link Kimi
2842 2024-07-30 ThinK: Thinner Key Cache by Query-Driven Pruning Yuhui Xu et.al. 2407.21018 null Kimi
2843 2024-07-31 A2SF: Accumulative Attention Scoring with Forgetting Factor for Token Pruning in Transformer Decoder Hyun-rae Jo et.al. 2407.20485 null Kimi
2844 2024-07-25 An Efficient Inference Framework for Early-exit Large Language Models Ruijie Miao et.al. 2407.20272 null Kimi
2845 2024-07-29 When to Stop? Towards Efficient Code Generation in LLMs with Excess Token Prevention Lianghong Guo et.al. 2407.20042 link Kimi
2846 2024-07-29 Inference acceleration for large language models using “stairs” assisted greedy generation Domas Grigaliūnas et.al. 2407.19947 null Kimi
2847 2024-07-29 Rina: Enhancing Ring-AllReduce with In-network Aggregation in Distributed Model Training Zixuan Chen et.al. 2407.19721 null Kimi
2848 2024-07-25 Efficient Inference of Vision Instruction-Following Models with Elastic Cache Zuyan Liu et.al. 2407.18121 link Kimi
2849 2024-07-28 Keep the Cost Down: A Review on Methods to Optimize LLM’s KV-Cache Consumption Luohe Shi et.al. 2407.18003 null Kimi
2850 2024-07-25 Efficient LLM Training and Serving with Heterogeneous Context Sharding among Attention Heads Xihui Lin et.al. 2407.17678 null Kimi
2851 2024-07-23 A deeper look at depth pruning of LLMs Shoaib Ahmed Siddiqui et.al. 2407.16286 link Kimi
2852 2024-07-22 RazorAttention: Efficient KV Cache Compression Through Retrieval Heads Hanlin Tang et.al. 2407.15891 null Kimi
2853 2024-07-22 AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description Junyu Xie et.al. 2407.15850 link Kimi
2854 2024-07-22 LLMmap: Fingerprinting For Large Language Models Dario Pasquini et.al. 2407.15847 link Kimi
2855 2024-07-22 CarFormer: Self-Driving with Learned Object-Centric Representations Shadi Hamdan et.al. 2407.15843 null Kimi
2856 2024-07-22 SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models Mingze Xu et.al. 2407.15841 link Kimi
2857 2024-07-22 MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity Yangzhou Liu et.al. 2407.15838 link Kimi
2858 2024-07-22 dMel: Speech Tokenization made Simple He Bai et.al. 2407.15835 link Kimi
2859 2024-07-22 Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight Ziyuan Huang et.al. 2407.15819 null Kimi
2860 2024-07-23 A simple and fast C++ thread pool implementation capable of running task graphs Dmytro Puyda et.al. 2407.15805 link Kimi
2861 2024-07-22 Robust Facial Reactions Generation: An Emotion-Aware Framework with Modality Compensation Guanyu Hu et.al. 2407.15798 null Kimi
2862 2024-07-22 Extracting Structured Insights from Financial News: An Augmented LLM Driven Approach Rian Dolphin et.al. 2407.15788 null Kimi
2863 2024-07-22 Parallel Split Learning with Global Sampling Mohammad Kohankhaki et.al. 2407.15738 link Kimi
2864 2024-07-22 vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving Jiale Xu et.al. 2407.15309 link Kimi
2865 2024-07-19 Performance Modeling and Workload Analysis of Distributed Large Language Model Training and Inference Joyjit Kundu et.al. 2407.14645 null Kimi
2866 2024-07-19 Internal Consistency and Self-Feedback in Large Language Models: A Survey Xun Liang et.al. 2407.14507 link Kimi
2867 2024-07-19 On Pre-training of Multimodal Language Models Customized for Chart Understanding Wan-Cyuan Fan et.al. 2407.14506 null Kimi
2868 2024-07-19 PD-TPE: Parallel Decoder with Text-guided Position Encoding for 3D Visual Grounding Chenshu Hou et.al. 2407.14491 null Kimi
2869 2024-07-19 Evaluating the Reliability of Self-Explanations in Large Language Models Korbinian Randl et.al. 2407.14487 link Kimi
2870 2024-07-19 Contrastive Learning with Counterfactual Explanations for Radiology Report Generation Mingjie Li et.al. 2407.14474 null Kimi
2871 2024-07-19 Check-Eval: A Checklist-based Approach for Evaluating Text Quality Jayr Pereira et.al. 2407.14467 null Kimi
2872 2024-07-19 AttentNet: Fully Convolutional 3D Attention for Lung Nodule Detection Majedaldein Almahasneh et.al. 2407.14464 null Kimi
2873 2024-07-19 PolyFormer: Scalable Node-wise Filters via Polynomial Graph Transformer Jiahong Ma et.al. 2407.14459 link Kimi
2874 2024-07-19 Undermining Mental Proof: How AI Can Make Cooperation Harder by Making Thinking Easier Zachary Wojtowicz et.al. 2407.14452 null Kimi
2875 2024-07-19 From Instruction to Insight: Exploring the Functional and Semantic Roles of Text in Interactive Dashboards Nicole Sultanum et.al. 2407.14451 null Kimi
2876 2024-07-19 LoAS: Fully Temporal-Parallel Datatflow for Dual-Sparse Spiking Neural Networks Ruokai Yin et.al. 2407.14073 link Kimi
2877 2024-07-19 LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference Qichen Fu et.al. 2407.14057 null Kimi
2878 2024-07-18 SegPoint: Segment Any Point Cloud via Large Language Model Shuting He et.al. 2407.13761 null Kimi
2879 2024-07-18 Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models Zhuo Chen et.al. 2407.13757 null Kimi
2880 2024-07-18 CellularLint: A Systematic Approach to Identify Inconsistent Behavior in Cellular Network Specifications Mirza Masfiqur Rahman et.al. 2407.13742 null Kimi
2881 2024-07-18 Baba Is AI: Break the Rules to Beat the Benchmark Nathan Cloos et.al. 2407.13729 null Kimi
2882 2024-07-18 Compressing Structured Tensor Algebra Mahdi Ghorbani et.al. 2407.13726 null Kimi
2883 2024-07-18 CoDefeater: Using LLMs To Find Defeaters in Assurance Cases Usman Gohar et.al. 2407.13717 link Kimi
2884 2024-07-18 Attention Based Simple Primitives for Open World Compositional Zero-Shot Learning Ans Munir et.al. 2407.13715 link Kimi
2885 2024-07-18 Understanding Reference Policies in Direct Preference Optimization Yixin Liu et.al. 2407.13709 link Kimi
2886 2024-07-18 ANHALTEN: Cross-Lingual Transfer for German Token-Level Reference-Free Hallucination Detection Janek Herrlein et.al. 2407.13702 link Kimi
2887 2024-07-18 Cross-Task Attack: A Self-Supervision Generative Framework Based on Attention Shift Qingyuan Zeng et.al. 2407.13700 null Kimi
2888 2024-07-17 Analysis of Crab X-ray Polarization using Deeper IXPE Observations Josephine Wong et.al. 2407.12779 null Kimi
2889 2024-07-17 The BRST quantisation of chiral BMS-like field theories José Figueroa-O’Farrill et.al. 2407.12778 null Kimi
2890 2024-07-17 Jigsaw Game: Federated Clustering Jinxuan Xu et.al. 2407.12764 null Kimi
2891 2024-07-17 LookupViT: Compressing visual information to a limited number of tokens Rajat Koner et.al. 2407.12753 null Kimi
2892 2024-07-17 CHOSEN: Compilation to Hardware Optimization Stack for Efficient Vision Transformer Inference Mohammad Erfan Sadeghi et.al. 2407.12736 null Kimi
2893 2024-07-17 EchoSight: Advancing Visual-Language Models with Wiki Knowledge Yibin Yan et.al. 2407.12735 null Kimi
2894 2024-07-17 FlexFL: Heterogeneous Federated Learning via APoZ-Guided Flexible Pruning in Uncertain Scenarios Zekai Chen et.al. 2407.12729 null Kimi
2895 2024-07-17 Exploring the interplay of individual traits and interaction dynamics in preschool social networks Gülşah Akçakır et.al. 2407.12728 null Kimi
2896 2024-07-17 NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model Zhongqun Zhang et.al. 2407.12727 null Kimi
2897 2024-07-17 Is Sarcasm Detection A Step-by-Step Reasoning Process in Large Language Models? Ben Yao et.al. 2407.12725 null Kimi
2898 2024-07-16 GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression Daniel Goldstein et.al. 2407.12077 link Kimi
2899 2024-07-16 Hydra: Brokering Cloud and HPC Resources to Support the Execution of Heterogeneous Workloads at Scale Aymen Alsaadi et.al. 2407.11967 null Kimi
2900 2024-07-16 UrbanWorld: An Urban World Model for 3D City Generation Yu Shang et.al. 2407.11965 link Kimi
2901 2024-07-16 NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window? Mo Li et.al. 2407.11963 link Kimi
2902 2024-07-17 Hierarchical Separable Video Transformer for Snapshot Compressive Imaging Ping Wang et.al. 2407.11946 link Kimi
2903 2024-07-16 Min-max theory and existence of H-spheres with arbitrary codimensions Rui Gao et.al. 2407.11945 null Kimi
2904 2024-07-16 Beyond Spatial Explanations: Explainable Face Recognition in the Frequency Domain Marco Huber et.al. 2407.11941 null Kimi
2905 2024-07-16 Generalized Difference-in-Differences Yiqing Xu et.al. 2407.11937 null Kimi
2906 2024-07-16 Learning Multi-view Anomaly Detection Haoyang He et.al. 2407.11935 null Kimi
2907 2024-07-16 Code Documentation and Analysis to Secure Software Development Paul Attie et.al. 2407.11934 null Kimi
2908 2024-07-16 What’s Wrong? Refining Meeting Summaries with LLM Feedback Frederic Kirstein et.al. 2407.11919 null Kimi
2909 2024-07-16 PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation Branden Butler et.al. 2407.11798 null Kimi
2910 2024-07-21 Ada-KV: Optimizing KV Cache Eviction by Adaptive Budget Allocation for Efficient LLM Inference Yuan Feng et.al. 2407.11550 link Kimi
2911 2024-07-15 VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation Bocheng Zou et.al. 2407.10972 link Kimi
2912 2024-07-15 Q-Sparse: All Large Language Models can be Fully Sparsely-Activated Hongyu Wang et.al. 2407.10969 null Kimi
2913 2024-07-15 Induction of non-Fermi liquids by critical cavity photons at the onset of superradiance Ipsita Mandal et.al. 2407.10963 null Kimi
2914 2024-07-15 Fast Matrix Multiplications for Lookup Table-Quantized LLMs Han Guo et.al. 2407.10960 link Kimi
2915 2024-07-15 InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models Nirat Saini et.al. 2407.10958 null Kimi
2916 2024-07-15 MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language Models Chengguang Gan et.al. 2407.10953 null Kimi
2917 2024-07-15 The infamous 95 GeV $\rm b \bar b$ excess at LEP: Two b or not two b? Patrick Janot et.al. 2407.10948 null Kimi
2918 2024-07-15 Can Textual Semantics Mitigate Sounding Object Segmentation Preference? Yaoting Wang et.al. 2407.10947 link Kimi
2919 2024-07-15 GRUtopia: Dream General Robots in a City at Scale Hanqing Wang et.al. 2407.10943 link Kimi
2920 2024-07-15 IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation Yuanhao Zhai et.al. 2407.10937 link Kimi
2921 2024-07-12 FairyLandAI: Personalized Fairy Tales utilizing ChatGPT and DALLE-3 Georgios Makridis et.al. 2407.09467 null Kimi
2922 2024-07-12 Human-like Episodic Memory for Infinite Context LLMs Zafeirios Fountas et.al. 2407.09450 link Kimi
2923 2024-07-12 ASTPrompter: Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts Amelia F. Hardy et.al. 2407.09447 link Kimi
2924 2024-07-12 MUSCLE: A Model Update Strategy for Compatible LLM Evolution Jessica Echterhoff et.al. 2407.09435 null Kimi
2925 2024-07-12 Open (Clinical) LLMs are Sensitive to Instruction Phrasings Alberto Mario Ceballos Arroyo et.al. 2407.09429 link Kimi
2926 2024-07-12 TelecomGPT: A Framework to Build Telecom-Specfic Large Language Models Hang Zou et.al. 2407.09424 null Kimi
2927 2024-07-12 Mitigating Entity-Level Hallucination in Large Language Models Weihang Su et.al. 2407.09417 link Kimi
2928 2024-07-12 SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers Shraman Pramanick et.al. 2407.09413 link Kimi
2929 2024-07-12 Thunderbolt: Causal Concurrent Consensus and Execution Junchao Chen et.al. 2407.09409 null Kimi
2930 2024-07-12 PersonaRAG: Enhancing Retrieval-Augmented Generation Systems with User-Centric Agents Saber Zerhoudi et.al. 2407.09394 link Kimi
2931 2024-07-11 MAVIS: Mathematical Visual Instruction Tuning Renrui Zhang et.al. 2407.08739 link Kimi
2932 2024-07-11 Real-Time Anomaly Detection and Reactive Planning with Large Language Models Rohan Sinha et.al. 2407.08735 null Kimi
2933 2024-07-11 Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist Zihao Zhou et.al. 2407.08733 null Kimi
2934 2024-07-11 Planar decomposition of the HOMFLY polynomial for bipartite knots and links A. Anokhina et.al. 2407.08724 null Kimi
2935 2024-07-11 A Taxonomy for Data Contamination in Large Language Models Medha Palavalli et.al. 2407.08716 null Kimi
2936 2024-07-11 GTA: A Benchmark for General Tool Agents Jize Wang et.al. 2407.08713 link Kimi
2937 2024-07-11 Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models Zhening Xing et.al. 2407.08701 null Kimi
2938 2024-07-11 Flex-TPU: A Flexible TPU with Runtime Reconfigurable Dataflow Architecture Mohammed Elbtity et.al. 2407.08700 null Kimi
2939 2024-07-11 Mitigating Catastrophic Forgetting in Language Transfer via Model Merging Anton Alexandrov et.al. 2407.08699 null Kimi
2940 2024-07-11 Patterns of link reciprocity in directed, signed networks Anna Gallo et.al. 2407.08697 null Kimi
2941 2024-07-10 Training on the Test Task Confounds Evaluation and Emergence Ricardo Dominguez-Olmedo et.al. 2407.07890 link Kimi
2942 2024-07-10 Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization Junkang Wu et.al. 2407.07880 link Kimi
2943 2024-07-10 Bound States in Continuum via Singular Transfer Matrices Ovidiu-Zeno Lipan et.al. 2407.07879 null Kimi
2944 2024-07-10 FACTS About Building Retrieval Augmented Generation-based Chatbots Rama Akkiraju et.al. 2407.07858 null Kimi
2945 2024-07-10 OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training Sami Jaghouar et.al. 2407.07852 link Kimi
2946 2024-07-10 Harnessing Integrated CPU-GPU System Memory for HPC: a first look into Grace Hopper Gabin Schieffer et.al. 2407.07850 null Kimi
2947 2024-07-10 Natural Language Mechanisms via Self-Resolution with Foundation Models Nicolas Della Penna et.al. 2407.07845 null Kimi
2948 2024-07-10 Study on Aspect Ratio Variability toward Robustness of Vision Transformer-based Vehicle Re-identification Mei Qiu et.al. 2407.07842 null Kimi
2949 2024-07-10 Transformer Alignment in Large Language Models Murdock Aubry et.al. 2407.07810 null Kimi
2950 2024-07-10 Attribute or Abstain: Large Language Models as Long Document Assistants Jan Buchmann et.al. 2407.07799 link Kimi
2951 2024-07-09 AnyTaskTune: Advanced Domain-Specific Solutions through Task-Fine-Tuning Jiaxi Cui et.al. 2407.07094 link Kimi
2952 2024-07-09 FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation Liqun Ma et.al. 2407.07093 link Kimi
2953 2024-07-09 Fine-Tuning Linear Layers Only Is a Simple yet Effective Way for Task Arithmetic Ruochen Jin et.al. 2407.07089 link Kimi
2954 2024-07-09 Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models Logan Cross et.al. 2407.07086 link Kimi
2955 2024-07-09 Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities Shaltiel Shmidman et.al. 2407.07080 null Kimi
2956 2024-07-09 ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction Shaozhe Hao et.al. 2407.07077 link Kimi
2957 2024-07-09 Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps Yung-Sung Chuang et.al. 2407.07071 link Kimi
2958 2024-07-09 Prompting Techniques for Secure Code Generation: A Systematic Investigation Catherine Tony et.al. 2407.07064 null Kimi
2959 2024-07-09 Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence Weize Chen et.al. 2407.07061 link Kimi
2960 2024-07-09 CAPformer: Compression-Aware Pre-trained Transformer for Low-Light Image Enhancement Wang Wei et.al. 2407.07056 null Kimi
2961 2024-07-08 Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision Orr Zohar et.al. 2407.06189 link Kimi
2962 2024-07-08 CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation Xinying Guo et.al. 2407.06188 null Kimi
2963 2024-07-08 Left-Linear Rewriting in Adhesive Categories Paolo Baldan et.al. 2407.06181 null Kimi
2964 2024-07-08 The Tug-of-War Between Deepfake Generation and Detection Hannah Lee et.al. 2407.06174 null Kimi
2965 2024-07-08 On Speeding Up Language Model Evaluation Jin Peng Zhou et.al. 2407.06172 null Kimi
2966 2024-07-08 Inevitable Endgame of Comet Tsuchinshan-ATLAS (C/2023 A3) Zdenek Sekanina et.al. 2407.06166 null Kimi
2967 2024-07-08 What’s Wrong with Your Code Generated by Large Language Models? An Extensive Study Shihan Dou et.al. 2407.06153 null Kimi
2968 2024-07-08 WIBACong: An Argument-centric Framework for Understanding US Congressional Hearings Arman Irani et.al. 2407.06149 null Kimi
2969 2024-07-08 Using Grammar Masking to Ensure Syntactic Validity in LLM-based Modeling Tasks Lukas Netz et.al. 2407.06146 null Kimi
2970 2024-07-08 ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation Ethan Chern et.al. 2407.06135 link Kimi
2971 2024-07-05 LaRa: Efficient Large-Baseline Radiance Fields Anpei Chen et.al. 2407.04699 null Kimi
2972 2024-07-05 Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs Rudolf Laine et.al. 2407.04694 link Kimi
2973 2024-07-05 ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models Yuzhe Gu et.al. 2407.04693 link Kimi
2974 2024-07-05 Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge Yuanze Lin et.al. 2407.04681 null Kimi
2975 2024-07-05 Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition Ye Bai et.al. 2407.04675 null Kimi
2976 2024-07-05 Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement Yongji Wu et.al. 2407.04656 null Kimi
2977 2024-07-05 Entity Decomposition with Filtering: A Zero-Shot Clinical Named Entity Recognition Framework Reza Averly et.al. 2407.04629 null Kimi
2978 2024-07-05 On scalable oversight with weak LLMs judging strong LLMs Zachary Kenton et.al. 2407.04622 null Kimi
2979 2024-07-08 OneRestore: A Universal Restoration Framework for Composite Degradation Yu Guo et.al. 2407.04621 link Kimi
2980 2024-07-05 Learning to (Learn at Test Time): RNNs with Expressive Hidden States Yu Sun et.al. 2407.04620 link Kimi
2981 2024-07-03 Universal Length Generalization with Turing Programs Kaiying Hou et.al. 2407.03310 null Kimi
2982 2024-07-03 Eyes on the Game: Deciphering Implicit Human Signals to Infer Human Proficiency, Trust, and Intent Nikhil Hulle et.al. 2407.03298 null Kimi
2983 2024-07-03 Large Language Models for JSON Schema Discovery Michael J. Mior et.al. 2407.03286 null Kimi
2984 2024-07-03 LLM Internal States Reveal Hallucination Risk Faced With a Query Ziwei Ji et.al. 2407.03282 link Kimi
2985 2024-07-03 Cooperative Multi-Agent Deep Reinforcement Learning Methods for UAV-aided Mobile Edge Computing Networks Mintae Kim et.al. 2407.03280 null Kimi
2986 2024-07-03 Nesterov’s Accelerated Jacobi-Type Methods for Large-scale Symmetric Positive Semidefinite Linear Systems Ling Liang et.al. 2407.03272 null Kimi
2987 2024-07-03 STF: Sentence Transformer Fine-Tuning For Topic Categorization With Limited Data Kheir Eddine Daouadi et.al. 2407.03253 null Kimi
2988 2024-07-03 ACTRESS: Active Retraining for Semi-supervised Visual Grounding Weitai Kang et.al. 2407.03251 null Kimi
2989 2024-07-04 When big data actually are low-rank, or entrywise approximation of certain function-generated matrices Stanislav Budzinskiy et.al. 2407.03250 link Kimi
2990 2024-07-03 Bridging Model Heterogeneity in Federated Learning via Uncertainty-based Asymmetrical Reciprocity Learning Jiaqi Wang et.al. 2407.03247 link Kimi
2991 2024-07-02 MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention Huiqiang Jiang et.al. 2407.02490 link Kimi
2992 2024-07-02 Neurocache: Efficient Vector Retrieval for Long-range Language Modeling Ali Safaya et.al. 2407.02486 link Kimi
2993 2024-07-02 RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs Yue Yu et.al. 2407.02485 null Kimi
2994 2024-07-02 Characterizing the Interpretability of Attention Maps in Digital Pathology Tomé Albuquerque et.al. 2407.02484 null Kimi
2995 2024-07-02 MMedAgent: Learning to Use Medical Tools with Multi-modal Agent Binxu Li et.al. 2407.02483 link Kimi
2996 2024-07-02 Understanding Alignment in Multimodal LLMs: A Comprehensive Study Elmira Amirloo et.al. 2407.02477 null Kimi
2997 2024-07-02 Open Scene Graphs for Open World Object-Goal Navigation Joel Loo et.al. 2407.02473 null Kimi
2998 2024-07-02 Reliable Confidence Intervals for Information Retrieval Evaluation Using Generative A.I Harrie Oosterhuis et.al. 2407.02464 null Kimi
2999 2024-07-02 Decentralized Intelligence Network (DIN) Abraham Nash et.al. 2407.02461 null Kimi
3000 2024-07-02 Revisión de Métodos de Planificación de Camino de Cobertura para Entornos Agrícolas Ismael Ait et.al. 2407.02449 null Kimi

Early Stopping

ID Publish Date Title Authors PDF Code Kimi
1 2024-12-12 InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption Tiehan Fan et.al. 2412.09283 null Kimi
2 2024-12-11 GradStop: Exploring Training Dynamics in Unsupervised Outlier Detection through Gradient Cohesion Yuang Zhang et.al. 2412.08501 link Kimi
3 2024-12-11 Collaborative Inference for Large Models with Task Offloading and Early Exiting Zuan Xie et.al. 2412.08284 null Kimi
4 2024-12-11 Diff-GO$^\text{n}$: Enhancing Diffusion Models for Goal-Oriented Communications Suchinthaka Wanninayaka et.al. 2412.06980 null Kimi
5 2024-12-06 Sparse autoencoders reveal selective remapping of visual concepts during adaptation Hyesu Lim et.al. 2412.05276 link Kimi
6 2024-12-06 BEExformer: A Fast Inferencing Transformer Architecture via Binarization with Multiple Early Exits Wazib Ansar et.al. 2412.05225 null Kimi
7 2024-12-05 A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs Wangbo Zhao et.al. 2412.03324 link Kimi
8 2024-12-03 Time-Series-Informed Closed-loop Learning for Sequential Decision Making and Control Sebastian Hirt et.al. 2412.02423 null Kimi
9 2024-12-02 Early Exit Is a Natural Capability in Transformer-based Models: An Empirical Study on Early Exit without Joint Optimization Weiqiao Shan et.al. 2412.01455 null Kimi
10 2024-12-02 EdgeOAR: Real-time Online Action Recognition On Edge Devices Wei Luo et.al. 2412.01267 null Kimi
11 2024-12-02 Reliable and scalable variable importance estimation via warm-start and early stopping Zexuan Sun et.al. 2412.01120 link Kimi
12 2024-11-28 Deep Neural Network-Based Prediction of B-Cell Epitopes for SARS-CoV and SARS-CoV-2: Enhancing Vaccine Design through Machine Learning Xinyu Shi et.al. 2412.00109 null Kimi
13 2024-11-26 Selfish Evolution: Making Discoveries in Extreme Label Noise with the Help of Overfitting Dynamics Nima Sedaghat et.al. 2412.00077 null Kimi
14 2024-11-28 DIESEL – Dynamic Inference-Guidance via Evasion of Semantic Embeddings in LLMs Ben Ganon et.al. 2411.19038 null Kimi
15 2024-11-27 One-Step Early Stopping Strategy using Neural Tangent Kernel Theory and Rademacher Complexity Daniel Martin Xavier et.al. 2411.18806 null Kimi
16 2024-11-27 HEMGS: A Hybrid Entropy Model for 3D Gaussian Splatting Data Compression Lei Liu et.al. 2411.18473 null Kimi
17 2024-11-26 Instance-Aware Graph Prompt Learning Jiazheng Li et.al. 2411.17676 null Kimi
18 2024-11-22 Instance-Aware Generalized Referring Expression Segmentation E-Ro Nguyen et.al. 2411.15087 null Kimi
19 2024-11-19 Deep Learning-Driven Heat Map Analysis for Evaluating thickness of Wounded Skin Layers Devakumar GR et.al. 2411.12678 null Kimi
20 2024-11-15 Exploiting Negative Curvature in Conjunction with Adaptive Sampling: Theoretical Results and a Practical Algorithm Albert S. Berahas et.al. 2411.10378 null Kimi
21 2024-11-13 Voxeland: Probabilistic Instance-Aware Semantic Mapping with Evidence-based Uncertainty Quantification Jose-Luis Matez-Bandera et.al. 2411.08727 link Kimi
22 2024-11-11 The Unreasonable Effectiveness of Monte Carlo Simulations in A/B Testing Márton Trencséni et.al. 2411.06701 link Kimi
23 2024-11-07 Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale Flavio Di Palo et.al. 2411.05045 null Kimi
24 2024-11-07 LoFi: Scalable Local Image Reconstruction with Implicit Neural Representation AmirEhsan Khorashadizadeh et.al. 2411.04995 link Kimi
25 2024-11-05 SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents Dawei Li et.al. 2411.03284 link Kimi
26 2024-11-06 Gradient Descent Finds Over-Parameterized Neural Networks with Sharp Generalization for Nonparametric Regression: A Distribution-Free Analysis Yingzhen Yang et.al. 2411.02904 null Kimi
27 2024-11-05 Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery Bowei Du et.al. 2411.02861 null Kimi
28 2024-11-05 CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge Collaboration Hongpeng Jin et.al. 2411.02829 null Kimi
29 2024-11-06 Energy-Aware Dynamic Neural Inference Marcello Bullo et.al. 2411.02471 null Kimi
30 2024-11-04 DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution Yang Yue et.al. 2411.02359 link Kimi
31 2024-11-02 Bi-Level Graph Structure Learning for Next POI Recommendation Liang Wang et.al. 2411.01169 null Kimi
32 2024-10-30 Accelerated AI Inference via Dynamic Execution Methods Haim Barad et.al. 2411.00853 null Kimi
33 2024-11-01 Preventing Model Collapse in Deep Canonical Correlation Analysis by Noise Regularization Junlin He et.al. 2411.00383 null Kimi
34 2024-10-29 Power side-channel leakage localization through adversarial training of deep neural networks Jimmy Gammell et.al. 2410.22425 link Kimi
35 2024-10-27 Branch-and-bound algorithm for efficient reliability analysis of general coherent systems Ji-Eun Byun et.al. 2410.22363 null Kimi
36 2024-10-28 Agreement Tasks in Fault-Prone Synchronous Networks of Arbitrary Structure Pierre Fraigniaud et.al. 2410.21538 null Kimi
37 2024-10-28 Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA Sangmin Bae et.al. 2410.20672 null Kimi
38 2024-10-27 Sequential Large Language Model-Based Hyper-Parameter Optimization Kanan Mahammadli et.al. 2410.20302 link Kimi
39 2024-10-26 Looking Beyond The Top-1: Transformers Determine Top Tokens In Order Daria Lioubashevski et.al. 2410.20210 link Kimi
40 2024-10-26 Dynamic layer selection in decoder-only transformers Theodore Glavas et.al. 2410.20022 link Kimi
41 2024-10-25 COMSPLIT: A Communication-Aware Split Learning Design for Heterogeneous IoT Platforms Vukan Ninkovic et.al. 2410.19375 null Kimi
42 2024-10-30 Dynamic Vocabulary Pruning in Early-Exit LLMs Jort Vincenti et.al. 2410.18952 link Kimi
43 2024-10-24 AdaEDL: Early Draft Stopping for Speculative Decoding of Large Language Models via an Entropy-based Lower Bound on Token Acceptance Probability Sudhanshu Agrawal et.al. 2410.18351 null Kimi
44 2024-10-23 Inferring stability properties of chaotic systems on autoencoders’ latent spaces Elise Özalp et.al. 2410.18003 link Kimi
45 2024-10-23 Diffusion Priors for Variational Likelihood Estimation and Image Denoising Jun Cheng et.al. 2410.17521 link Kimi
46 2024-10-21 Federated Learning with MMD-based Early Stopping for Adaptive GNSS Interference Classification Nishant S. Gaikwad et.al. 2410.15681 null Kimi
47 2024-10-24 BoostAdapter: Improving Vision-Language Test-Time Adaptation via Regional Bootstrapping Taolin Zhang et.al. 2410.15430 link Kimi
48 2024-10-16 FiRST: Finetuning Router-Selective Transformers for Input-Adaptive Latency Reduction Akriti Jain et.al. 2410.12513 null Kimi
49 2024-10-15 Juggernaut: Efficient Crypto-Agnostic Byzantine Agreement Daniel Collins et.al. 2410.12121 null Kimi
50 2024-10-14 Focused ReAct: Improving ReAct through Reiterate and Early Stop Shuoqiu Li et.al. 2410.10779 null Kimi
51 2024-10-14 big.LITTLE Vision Transformer for Efficient Visual Recognition He Guo et.al. 2410.10267 null Kimi
52 2024-10-12 DuoDiff: Accelerating Diffusion Models with a Dual-Backbone Approach Daniel Gallo Fernández et.al. 2410.09633 link Kimi
53 2024-10-11 Scaling Gaussian Processes for Learning Curve Prediction via Latent Kronecker Structure Jihao Andreas Lin et.al. 2410.09239 null Kimi
54 2024-10-08 Benchmarking of a new data splitting method on volcanic eruption data Simona Reale et.al. 2410.06306 null Kimi
55 2024-10-08 MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More Wei Huang et.al. 2410.06270 link Kimi
56 2024-10-08 Mini-Batch Kernel $k$-means Ben Jourdan et.al. 2410.05902 null Kimi
57 2024-10-06 Distributed Inference on Mobile Edge and Cloud: An Early Exit based Clustering Approach Divya Jyoti Bajpai et.al. 2410.05338 null Kimi
58 2024-10-07 L-C4: Language-Based Video Colorization for Creative and Consistent Color Zheng Chang et.al. 2410.04972 null Kimi
59 2024-10-06 CAPEEN: Image Captioning with Early Exits and Knowledge Distillation Divya Jyoti Bajpai et.al. 2410.04433 link Kimi
60 2024-10-06 DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs Divya Jyoti Bajpai et.al. 2410.04424 link Kimi
61 2024-10-03 Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis Zikun Zhang et.al. 2410.02321 null Kimi
62 2024-10-03 Global dynamical structures from infinitesimal data Benjamin McInroe et.al. 2410.02111 null Kimi
63 2024-10-02 CHASE-SQL: Multi-Path Reasoning and Preference Optimized Candidate Selection in Text-to-SQL Mohammadreza Pourreza et.al. 2410.01943 null Kimi
64 2024-10-02 Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension Zaiquan Yang et.al. 2410.01544 null Kimi
65 2024-10-01 Timber! Poisoning Decision Trees Stefano Calzavara et.al. 2410.00862 null Kimi
66 2024-09-30 Inference of water waves surface elevation from horizontal velocity components using physics informed neural networks (PINN) Omar Sallam et.al. 2409.19851 null Kimi
67 2024-09-27 Improving Visual Object Tracking through Visual Prompting Shih-Fang Chen et.al. 2409.18901 link Kimi
68 2024-09-24 Reinforcement Leaning for Infinite-Dimensional Systems Wei Zhang et.al. 2409.15737 null Kimi
69 2024-10-03 Normalized Narrow Jump To Conclusions: Normalized Narrow Shortcuts for Parameter Efficient Early Exit Transformer Prediction Amrit Diggavi Seshadri et.al. 2409.14091 null Kimi
70 2024-09-21 Multiple-Exit Tuning: Towards Inference-Efficient Adaptation for Vision Transformer Zheng Liu et.al. 2409.13999 null Kimi
71 2024-09-18 Particle-based Instance-aware Semantic Occupancy Mapping in Dynamic Environments Gang Chen et.al. 2409.11975 link Kimi
72 2024-09-17 UniLCD: Unified Local-Cloud Decision-Making via Reinforcement Learning Kathakoli Sengupta et.al. 2409.11403 null Kimi
73 2024-09-16 Improving Multi-candidate Speculative Decoding Xiaofan Lu et.al. 2409.10644 link Kimi
74 2024-09-14 Group Sequential Testing of a Treatment Effect Using a Surrogate Marker Layla Parast et.al. 2409.09440 link Kimi
75 2024-09-13 Exploring System-Heterogeneous Federated Learning with Dynamic Model Selection Dixi Yao et.al. 2409.08858 null Kimi
76 2024-09-11 AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge Han Wang et.al. 2409.07394 link Kimi
77 2024-09-11 From optimal score matching to optimal sampling Zehao Dou et.al. 2409.07032 null Kimi
78 2024-09-10 Noisy Early Stopping for Noisy Labels William Toner et.al. 2409.06830 null Kimi
79 2024-09-10 Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds Mu Cai et.al. 2409.06827 link Kimi
80 2024-08-26 Optimizing STAR Aligner for High Throughput Computing in the Cloud Piotr Kica et.al. 2409.05886 null Kimi
81 2024-09-09 Early-exit Convolutional Neural Networks Edanur Demir et.al. 2409.05336 link Kimi
82 2024-09-08 Attention-Based Efficient Breath Sound Removal in Studio Audio Recordings Nidula Elgiriyewithana et.al. 2409.04949 null Kimi
83 2024-09-16 RTop-K: Ultra-Fast Row-Wise Top-K Algorithm and GPU Implementation for Neural Networks Xi Xie et.al. 2409.00822 null Kimi
84 2024-08-30 Dynamic Self-Consistency: Leveraging Reasoning Paths for Efficient LLM Sampling Guangya Wan et.al. 2408.17017 null Kimi
85 2024-08-24 Inferring the shape of a solid inside a draining tank from its liquid level dynamics Gbenga Fabusola et.al. 2408.14503 null Kimi
86 2024-08-26 Re-Mix: Optimizing Data Mixtures for Large Scale Imitation Learning Joey Hejna et.al. 2408.14037 link Kimi
87 2024-08-24 Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning Xinglin Wang et.al. 2408.13457 null Kimi
88 2024-08-24 Face Clustering via Early Stopping and Edge Recall Junjie Liu et.al. 2408.13431 link Kimi
89 2024-08-21 Critique-out-Loud Reward Models Zachary Ankner et.al. 2408.11791 link Kimi
90 2024-08-21 EEG-Defender: Defending against Jailbreak through Early Exit Generation of Large Language Models Chongwen Zhao et.al. 2408.11308 null Kimi
91 2024-08-20 Inferring Underwater Topography with FINN Coşku Can Horuz et.al. 2408.10649 null Kimi
92 2024-08-15 An Efficient Continuous Control Perspective for Reinforcement-Learning-based Sequential Recommendation Jun Wang et.al. 2408.08047 null Kimi
93 2024-08-14 Rethinking the Key Factors for the Generalization of Remote Sensing Stereo Matching Networks Liting Jiang et.al. 2408.07613 null Kimi
94 2024-08-12 HeLiMOS: A Dataset for Moving Object Segmentation in 3D Point Clouds From Heterogeneous LiDAR Sensors Hyungtae Lim et.al. 2408.06328 null Kimi
95 2024-08-12 Transfer learning of state-based potential games for process optimization in decentralized manufacturing systems Steve Yuwono et.al. 2408.05992 null Kimi
96 2024-08-12 A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models Taehong Moon et.al. 2408.05927 link Kimi
97 2024-08-08 Early-Exit meets Model-Distributed Inference at Edge Networks Marco Colocrese et.al. 2408.05247 null Kimi
98 2024-08-09 PriPHiT: Privacy-Preserving Hierarchical Training of Deep Neural Networks Yamin Sepehri et.al. 2408.05092 null Kimi
99 2024-08-09 Early Exit Strategies for Approximate k-NN Search in Dense Retrieval Francesco Busolin et.al. 2408.04981 null Kimi
100 2024-08-07 Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling Zilyu Ye et.al. 2408.03695 link Kimi
101 2024-08-03 Advancing Green AI: Efficient and Accurate Lightweight CNNs for Rice Leaf Disease Identification Khairun Saddami et.al. 2408.01752 null Kimi
102 2024-08-01 Early Stopping Based on Repeated Significance Eric Bax et.al. 2408.00908 null Kimi
103 2024-07-31 Automated Sperm Morphology Analysis Based on Instance-Aware Part Segmentation Wenyuan Chen et.al. 2408.00112 null Kimi
104 2024-07-30 Accelerating Large Language Model Inference with Self-Supervised Early Exits Florian Valade et.al. 2407.21082 null Kimi
105 2024-07-25 An Efficient Inference Framework for Early-exit Large Language Models Ruijie Miao et.al. 2407.20272 null Kimi
106 2024-07-26 Topology Optimization of Random Memristors for Input-Aware Dynamic SNN Bo Wang et.al. 2407.18625 link Kimi
107 2024-07-25 Superior Scoring Rules for Probabilistic Evaluation of Single-Label Multi-Class Classification Tasks Rouhollah Ahmadian et.al. 2407.17697 null Kimi
108 2024-07-23 Can Large Language Models Automatically Jailbreak GPT-4V? Yuanwei Wu et.al. 2407.16686 null Kimi
109 2024-07-22 WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding Quan Kong et.al. 2407.15350 null Kimi
110 2024-07-19 Joint or Disjoint: Mixing Training Regimes for Early-Exit Models Bartłomiej Krzepkowski et.al. 2407.14320 link Kimi
111 2024-07-19 BERTer: The Efficient One Pradyumna Saligram et.al. 2407.14039 null Kimi
112 2024-07-18 On the consistency of rotation curves and spatially integrated HI flux profiles Tariq Yasin et.al. 2407.13754 null Kimi
113 2024-07-19 Revisiting Adaptive Cellular Recognition Under Domain Shifts: A Contextual Correspondence View Jianan Fan et.al. 2407.12870 link Kimi
114 2024-07-17 Hallucination Index: An Image Quality Metric for Generative Reconstruction Models Matthew Tivnan et.al. 2407.12780 null Kimi
115 2024-07-16 Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning Yanting Miao et.al. 2407.12164 link Kimi
116 2024-07-16 Enhancing Split Computing and Early Exit Applications through Predefined Sparsity Luigi Capogrosso et.al. 2407.11763 link Kimi
117 2024-07-16 Preconditioned Gradient Descent Finds Over-Parameterized Neural Networks with Sharp Generalization for Nonparametric Regression Yingzhen Yang et.al. 2407.11353 null Kimi
118 2024-07-10 Exploring the Boundaries of On-Device Inference: When Tiny Falls Short, Go Hierarchical Adarsh Prasad Behera et.al. 2407.11061 null Kimi
119 2024-07-15 Multilingual Contrastive Decoding via Language-Agnostic Layers Skipping Wenhao Zhu et.al. 2407.10795 link Kimi
120 2024-07-13 Towards understanding epoch-wise double descent in two-layer linear neural networks Amanda Olmin et.al. 2407.09845 null Kimi
121 2024-07-11 Sensor-Aware Classifiers for Energy-Efficient Time Series Applications on IoT Devices Dina Hussein et.al. 2407.08715 null Kimi
122 2024-07-07 Learning Motion Blur Robust Vision Transformers with Dynamic Early Exit for Real-Time UAV Tracking You Wu et.al. 2407.05383 null Kimi
123 2024-07-04 Unsupervised speech enhancement with spectral kurtosis and double deep priors Hien Ohnaka et.al. 2407.03887 null Kimi
124 2024-07-02 Advancing Compressed Video Action Recognition through Progressive Knowledge Distillation Efstathia Soufleri et.al. 2407.02713 link Kimi
125 2024-07-02 Zero-shot Video Restoration and Enhancement Using Pre-Trained Image Diffusion Model Cong Cao et.al. 2407.01960 null Kimi
126 2024-07-01 Exact statistical analysis for response-adaptive clinical trials: a general and computationally tractable approach Stef Baas et.al. 2407.01055 null Kimi
127 2024-07-01 SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection Dingkang Liang et.al. 2407.01016 null Kimi
128 2024-06-27 Adaptive Stochastic Weight Averaging Caglar Demir et.al. 2406.19092 link Kimi
129 2024-06-26 An Order Theory Framework of Recurrence Equations for Static Cost Analysis - Dynamic Inference of Non-Linear Inequality Invariants Louis Rustenholz et.al. 2406.18260 null Kimi
130 2024-06-24 SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments Neng Wang et.al. 2406.16279 link Kimi
131 2024-06-21 Micro-power spoken keyword spotting on Xylo Audio 2 Hannah Bos et.al. 2406.15112 null Kimi
132 2024-06-21 Early stopping for conjugate gradients in statistical inverse problems Laura Hucker et.al. 2406.15001 null Kimi
133 2024-06-21 Cost-Effective RF Fingerprinting Based on Hybrid CVNN-RF Classifier with Automated Multi-Dimensional Early-Exit Strategy Jiayan Gan et.al. 2406.14869 null Kimi
134 2024-06-20 On Layer-wise Representation Similarity: Application for Multi-Exit Models with a Single Classifier Jiachen Jiang et.al. 2406.14479 null Kimi