CV Arxiv Daily

Updated on 2025.05.24

Usage instructions: here

LLM

ID Publish Date Title Authors PDF Code Kimi
1 2025-05-22 GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning Chengqi Duan et.al. 2505.17022 null Kimi
2 2025-05-22 CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms Shilin Yan et.al. 2505.17020 null Kimi
3 2025-05-22 Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO Chengzhuo Tong et.al. 2505.17017 null Kimi
4 2025-05-22 Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models Runsen Xu et.al. 2505.17015 null Kimi
5 2025-05-22 SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding Haoning Wu et.al. 2505.17012 link Kimi
6 2025-05-22 R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning Huatong Song et.al. 2505.17005 null Kimi
7 2025-05-22 Do Large Language Models Excel in Complex Logical Reasoning with Formal Language? Jin Jiang et.al. 2505.16998 null Kimi
8 2025-05-22 X-MAS: Towards Building Multi-Agent Systems with Heterogeneous LLMs Rui Ye et.al. 2505.16997 null Kimi
9 2025-05-22 $\text{R}^2\text{ec}$: Towards Large Recommender Models with Reasoning Runyang You et.al. 2505.16994 null Kimi
10 2025-05-22 Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding Runpeng Yu et.al. 2505.16990 null Kimi
11 2025-05-22 T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning Amartya Chakraborty et.al. 2505.16986 null Kimi
12 2025-05-22 Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine Adib Bazgir et.al. 2505.16982 null Kimi
13 2025-05-22 Bottlenecked Transformers: Periodic KV Cache Abstraction for Generalised Reasoning Adnan Oomerjee et.al. 2505.16950 null Kimi
14 2025-05-22 MixAT: Combining Continuous and Discrete Adversarial Training for LLMs Csaba Dékány et.al. 2505.16947 null Kimi
15 2025-05-22 AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios Yunjia Qi et.al. 2505.16944 null Kimi
16 2025-05-22 NovelSeek: When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification NovelSeek Team et.al. 2505.16938 null Kimi
17 2025-05-22 In-Context Watermarks for Large Language Models Yepeng Liu et.al. 2505.16934 null Kimi
18 2025-05-22 Beyond Needle(s) in the Embodied Haystack: Environment, Architecture, and Training Considerations for Long Context Reasoning Bosung Kim et.al. 2505.16928 null Kimi
19 2025-05-22 Don’t “Overthink” Passage Reranking: Is Reasoning Truly Necessary? Nour Jedidi et.al. 2505.16886 null Kimi
20 2025-05-22 CASTILLO: Characterizing Response Length Distributions of Large Language Models Daniel F. Perez-Ramirez et.al. 2505.16881 null Kimi
21 2025-05-22 LaViDa: A Large Diffusion Language Model for Multimodal Understanding Shufan Li et.al. 2505.16839 null Kimi
22 2025-05-22 R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search Yibo Wang et.al. 2505.16838 null Kimi
23 2025-05-22 Fact-R1: Towards Explainable Video Misinformation Detection with Deep Reasoning Fanrui Zhang et.al. 2505.16836 null Kimi
24 2025-05-22 SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis Shuang Sun et.al. 2505.16834 null Kimi
25 2025-05-22 From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization Haonian Ji et.al. 2505.16832 null Kimi
26 2025-05-22 Unlearning Isn’t Deletion: Investigating Reversibility of Machine Unlearning in LLMs Xiaoyu Xu et.al. 2505.16831 null Kimi
27 2025-05-22 KTAE: A Model-Free Algorithm to Key-Tokens Advantage Estimation in Mathematical Reasoning Wei Sun et.al. 2505.16826 null Kimi
28 2025-05-22 REPA Works Until It Doesn’t: Early-Stopped, Holistic Alignment Supercharges Diffusion Training Ziqiao Wang et.al. 2505.16792 null Kimi
29 2025-05-22 CoTSRF: Utilize Chain of Thought as Stealthy and Robust Fingerprint of Large Language Models Zhenzhen Ren et.al. 2505.16785 null Kimi
30 2025-05-22 Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning Xinghao Chen et.al. 2505.16782 null Kimi
31 2025-05-22 R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO Huanjin Yao et.al. 2505.16673 null Kimi
32 2025-05-22 Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding Feilong Tang et.al. 2505.16652 null Kimi
33 2025-05-22 Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains Wenhui Tan et.al. 2505.16552 null Kimi
34 2025-05-22 LLaMAs Have Feelings Too: Unveiling Sentiment and Emotion Representations in LLaMA Models Through Probing Dario Di Palma et.al. 2505.16491 null Kimi
35 2025-05-22 WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning Zhepei Wei et.al. 2505.16421 null Kimi
36 2025-05-22 DriveMoE: Mixture-of-Experts for Vision-Language-Action Model in End-to-End Autonomous Driving Zhenjie Yang et.al. 2505.16278 null Kimi
37 2025-05-22 LIFEBench: Evaluating Length Instruction Following in Large Language Models Wei Zhang et.al. 2505.16234 null Kimi
38 2025-05-22 NQKV: A KV Cache Quantization Scheme Based on Normal Distribution Characteristics Zhihang Cai et.al. 2505.16210 null Kimi
39 2025-05-22 QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design Benjamin Schneider et.al. 2505.16175 null Kimi
40 2025-05-22 KNN-SSD: Enabling Dynamic Self-Speculative Decoding via Nearest Neighbor Layer Set Optimization Mingbo Song et.al. 2505.16162 null Kimi
41 2025-05-22 Training-Free Reasoning and Reflection in MLLMs Hongchen Wei et.al. 2505.16151 null Kimi
42 2025-05-22 Steering LVLMs via Sparse Autoencoder for Hallucination Mitigation Zhenglin Hua et.al. 2505.16146 null Kimi
43 2025-05-22 Date Fragments: A Hidden Bottleneck of Tokenization for Temporal Reasoning Gagan Bhatia et.al. 2505.16088 null Kimi
44 2025-05-22 Optimizing LLM-Based Multi-Agent System with Textual Feedback: A Case Study on Software Development Ming Shen et.al. 2505.16086 null Kimi
45 2025-05-21 Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models Jingcong Liang et.al. 2505.16056 null Kimi
46 2025-05-21 Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning Alex Su et.al. 2505.15966 null Kimi
47 2025-05-21 Extracting Probabilistic Knowledge from Large Language Models for Bayesian Network Parameterization Aliakbar Nafar et.al. 2505.15918 null Kimi
48 2025-05-21 dKV-Cache: The Cache for Diffusion Language Models Xinyin Ma et.al. 2505.15781 link Kimi
49 2025-05-21 Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space Zhen Zhang et.al. 2505.15778 link Kimi
50 2025-05-21 Beyond Hard and Soft: Hybrid Context Compression for Balancing Local and Global Information Retention Huanxuan Liao et.al. 2505.15774 null Kimi
51 2025-05-21 ThinkLess: A Training-Free Inference-Efficient Method for Reducing Reasoning Redundancy Gengyang Li et.al. 2505.15684 null Kimi
52 2025-05-21 A Federated Splitting Framework for LLMs: Security, Efficiency, and Adaptability Zishuai Zhang et.al. 2505.15683 link Kimi
53 2025-05-21 Feature Extraction and Steering for Enhanced Chain-of-Thought Reasoning in Language Models Zihao Li et.al. 2505.15634 null Kimi
54 2025-05-21 Learn to Reason Efficiently with Adaptive Length-based Reward Shaping Wei Liu et.al. 2505.15612 link Kimi
55 2025-05-21 Multilingual Test-Time Scaling via Initial Thought Transfer Prasoon Bajpai et.al. 2505.15508 null Kimi
56 2025-05-21 Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought Ao Liu et.al. 2505.15431 null Kimi
57 2025-05-21 FlowKV: Enhancing Multi-Turn Conversational Coherence in LLMs via Isolated Key-Value Cache Management Xiang Liu et.al. 2505.15347 null Kimi
58 2025-05-21 Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Prefilling Attack Silvia Cappelletti et.al. 2505.15323 null Kimi
59 2025-05-21 Hallucinate at the Last in Long Response Generation: A Case Study on Long Document Summarization Joonho Yang et.al. 2505.15291 null Kimi
60 2025-05-21 LiveVLM: Efficient Online Video Understanding via Streaming-Oriented KV Cache and Retrieval Zhenyu Ning et.al. 2505.15269 null Kimi
61 2025-05-21 Towards Explainable Temporal Reasoning in Large Language Models: A Structure-Aware Generative Framework Zihao Jiang et.al. 2505.15245 link Kimi
62 2025-05-21 Prolonged Reasoning Is Not All You Need: Certainty-Based Adaptive Routing for Efficient LLM/MLLM Reasoning Jinghui Lu et.al. 2505.15154 null Kimi
63 2025-05-21 BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms Yunlong Hou et.al. 2505.15141 null Kimi
64 2025-05-21 The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning Shivam Agarwal et.al. 2505.15134 null Kimi
65 2025-05-21 An Empirical Study on Reinforcement Learning for Reasoning-Search Interleaved LLM Agents Bowen Jin et.al. 2505.15117 link Kimi
66 2025-05-21 RoT: Enhancing Table Reasoning with Iterative Row-Wise Traversals Xuanliang Zhang et.al. 2505.15110 null Kimi
67 2025-05-21 Traveling Across Languages: Benchmarking Cross-Lingual Consistency in Multimodal LLMs Hao Wang et.al. 2505.15075 link Kimi
68 2025-05-21 Learning to Rank Chain-of-Thought: An Energy-Based Approach with Outcome Supervision Eric Hanchen Jiang et.al. 2505.14999 null Kimi
69 2025-05-20 STree: Speculative Tree Decoding for Hybrid State-Space Models Yangchao Wu et.al. 2505.14969 null Kimi
70 2025-05-20 Too Long, Didn’t Model: Decomposing LLM Long-Context Understanding With Novels Sil Hamilton et.al. 2505.14925 null Kimi
71 2025-05-20 Scaling Laws for State Dynamics in Large Language Models Jacob X Li et.al. 2505.14892 null Kimi
72 2025-05-20 Balanced and Elastic End-to-end Training of Dynamic LLMs Mohamed Wahib et.al. 2505.14864 null Kimi
73 2025-05-20 Text Generation Beyond Discrete Token Sampling Yufan Zhuang et.al. 2505.14827 null Kimi
74 2025-05-21 Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning Haolei Xu et.al. 2505.14684 null Kimi
75 2025-05-20 Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training Mengru Wang et.al. 2505.14681 null Kimi
76 2025-05-20 Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning Jiaer Xia et.al. 2505.14677 null Kimi
77 2025-05-20 SAFEPATH: Preventing Harmful Reasoning in Chain-of-Thought via Early Alignment Wonje Jeung et.al. 2505.14667 null Kimi
78 2025-05-20 Beyond Words: Multimodal LLM Knows When to Speak Zikai Liao et.al. 2505.14654 null Kimi
79 2025-05-20 KERL: Knowledge-Enhanced Personalized Recipe Recommendation using Large Language Models Fnu Mohbat et.al. 2505.14629 link Kimi
80 2025-05-20 Enhancing Learned Knowledge in LoRA Adapters Through Efficient Contrastive Decoding on Ascend NPUs Morgan Lindsay Heisler et.al. 2505.14620 null Kimi
81 2025-05-20 Can Pruning Improve Reasoning? Revisiting Long-CoT Compression with Capability in Mind for Better Reasoning Shangziqi Zhao et.al. 2505.14582 null Kimi
82 2025-05-20 Reasoning Models Better Express Their Confidence Dongkeun Yoon et.al. 2505.14489 link Kimi
83 2025-05-20 Log-Augmented Generation: Scaling Test-Time Reasoning with Reusable Computation Peter Baile Chen et.al. 2505.14398 null Kimi
84 2025-05-20 Scaling and Enhancing LLM-based AVSR: A Sparse Mixture of Projectors Approach Umberto Cappellazzo et.al. 2505.14336 null Kimi
85 2025-05-20 Speculative Decoding Reimagined for Multimodal Large Language Models Luxi Lin et.al. 2505.14260 null Kimi
86 2025-05-20 FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation Shaolin Zhu et.al. 2505.14256 null Kimi
87 2025-05-20 Tokenization Constraints in LLMs: A Study of Symbolic and Arithmetic Reasoning Limits Xiang Zhang et.al. 2505.14178 null Kimi
88 2025-05-20 RL of Thoughts: Navigating LLM Reasoning with Inference-time Reinforcement Learning Qianyue Hao et.al. 2505.14140 null Kimi
89 2025-05-20 DiagnosisArena: Benchmarking Diagnostic Reasoning for Large Language Models Yakun Zhu et.al. 2505.14107 link Kimi
90 2025-05-20 Toward Effective Reinforcement Learning Fine-Tuning for Medical VQA in Vision-Language Models Wenhui Zhu et.al. 2505.13973 null Kimi
91 2025-05-20 FlashThink: An Early Exit Method For Efficient Reasoning Guochao Jiang et.al. 2505.13949 null Kimi
92 2025-05-20 EEG-to-Text Translation: A Model for Deciphering Human Brain Activity Saydul Akbar Murad et.al. 2505.13936 link Kimi
93 2025-05-20 Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning Jiwon Song et.al. 2505.13866 null Kimi
94 2025-05-20 EfficientLLM: Efficiency in Large Language Models Zhengqing Yuan et.al. 2505.13840 null Kimi
95 2025-05-20 Structured Agent Distillation for Large Language Model Jun Liu et.al. 2505.13820 null Kimi
96 2025-05-19 Ice Cream Doesn’t Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Inference Jin Du et.al. 2505.13770 null Kimi
97 2025-05-19 Causal Head Gating: A Framework for Interpreting Roles of Attention Heads in Transformers Andrew Nam et.al. 2505.13737 null Kimi
98 2025-05-19 RL in Name Only? Analyzing the Structural Assumptions in RL post-training for LLMs Soumya Rani Samineni et.al. 2505.13697 null Kimi
99 2025-05-19 Optimizing Anytime Reasoning via Budget Relative Policy Optimization Penghui Qi et.al. 2505.13438 link Kimi
100 2025-05-19 CoT-Kinetics: A Theoretical Modeling Assessing LRM Reasoning Process Jinhe Bi et.al. 2505.13408 null Kimi
101 2025-05-19 Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training and Inference Shuqing Luo et.al. 2505.13345 link Kimi
102 2025-05-19 Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space Hengli Li et.al. 2505.13308 null Kimi
103 2025-05-19 RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning Qiguang Chen et.al. 2505.13307 link Kimi
104 2025-05-19 Effective and Transparent RAG: Adaptive-Reward Reinforcement Learning for Decision Traceability Jingyi Ren et.al. 2505.13258 null Kimi
105 2025-05-19 HeteroSpec: Leveraging Contextual Heterogeneity for Efficient Speculative Decoding Siran Liu et.al. 2505.13254 null Kimi
106 2025-05-19 Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification Jikai Wang et.al. 2505.13204 null Kimi
107 2025-05-19 Adversarial Testing in LLMs: Insights into Decision-Making Vulnerabilities Lili Zhang et.al. 2505.13195 null Kimi
108 2025-05-19 ModernGBERT: German-only 1B Encoder Model Trained from Scratch Anton Ehrmanntraut et.al. 2505.13136 null Kimi
109 2025-05-19 Benchmarking and Confidence Evaluation of LALMs For Temporal Reasoning Debarpan Bhattacharya et.al. 2505.13115 null Kimi
110 2025-05-19 FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference Guangda Liu et.al. 2505.13109 null Kimi
111 2025-05-19 Walking the Tightrope: Disentangling Beneficial and Detrimental Drifts in Non-Stationary Custom-Tuning Xiaoyu Yang et.al. 2505.13081 null Kimi
112 2025-05-19 MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO Yicheng Xiao et.al. 2505.13031 link Kimi
113 2025-05-19 Fractured Chain-of-Thought Reasoning Baohao Liao et.al. 2505.12992 null Kimi
114 2025-05-19 A3: an Analytical Low-Rank Approximation Framework for Attention Jeffrey T. H. Wong et.al. 2505.12942 null Kimi
115 2025-05-19 Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs Zhihe Yang et.al. 2505.12929 link Kimi
116 2025-05-19 The Traitors: Deception and Trust in Multi-Agent Language Model Simulations Pedro M. P. Curvo et.al. 2505.12923 null Kimi
117 2025-05-19 LEXam: Benchmarking Legal Reasoning on 340 Law Exams Yu Fan et.al. 2505.12864 null Kimi
118 2025-05-19 Reasoning BO: Enhancing Bayesian Optimization with Long-Context Reasoning Power of LLMs Zhuo Yang et.al. 2505.12833 null Kimi
119 2025-05-19 SynDec: A Synthesize-then-Decode Approach for Arbitrary Textual Style Transfer via Large Language Models Han Sun et.al. 2505.12821 null Kimi
120 2025-05-19 Accelerating Adaptive Retrieval Augmented Generation via Instruction-Driven Representation Reduction of Retrieval Overlaps Jie Ou et.al. 2505.12731 null Kimi
121 2025-05-19 FLASH: Latent-Aware Semi-Autoregressive Speculative Decoding for Multimodal Tasks Zihua Wang et.al. 2505.12728 null Kimi
122 2025-05-19 ToTRL: Unlock LLM Tree-of-Thoughts Reasoning Potential through Puzzles Solving Haoyuan Wu et.al. 2505.12717 null Kimi
123 2025-05-19 Shadow-FT: Tuning Instruct via Base Taiqiang Wu et.al. 2505.12716 link Kimi
124 2025-05-19 Ineq-Comp: Benchmarking Human-Intuitive Compositional Reasoning in Automated Theorem Proving on Inequalities Haoyu Zhao et.al. 2505.12680 link Kimi
125 2025-05-19 HydraInfer: Hybrid Disaggregated Scheduling for Multimodal Large Language Model Serving Xianzhe Dong et.al. 2505.12658 null Kimi
126 2025-05-19 Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents Yunseok Jang et.al. 2505.12632 null Kimi
127 2025-05-19 Enhancing Latent Computation in Transformers with Latent Tokens Yuchang Sun et.al. 2505.12629 null Kimi
128 2025-05-18 A Survey of Attacks on Large Language Models Wenrui Xu et.al. 2505.12567 null Kimi
129 2025-05-15 3D-Fixup: Advancing Photo Editing with 3D Priors Yen-Chi Cheng et.al. 2505.10566 null Kimi
130 2025-05-15 End-to-End Vision Tokenizer Tuning Wenxuan Wang et.al. 2505.10562 null Kimi
131 2025-05-15 Neural Thermodynamic Laws for Large Language Model Training Ziming Liu et.al. 2505.10559 null Kimi
132 2025-05-15 MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning Ke Wang et.al. 2505.10557 link Kimi
133 2025-05-15 Beyond ‘Aha!’: Toward Systematic Meta-Abilities Alignment in Large Reasoning Models Zhiyuan Hu et.al. 2505.10554 link Kimi
134 2025-05-15 Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data Yiwen Liu et.al. 2505.10551 link Kimi
135 2025-05-15 Real-Time Out-of-Distribution Failure Prevention via Multi-Modal Reasoning Milan Ganai et.al. 2505.10547 null Kimi
136 2025-05-15 Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models Annie Wong et.al. 2505.10543 link Kimi
137 2025-05-15 Exploring Implicit Visual Misunderstandings in Multimodal Large Language Models through Attention Analysis Pengfei Wang et.al. 2505.10541 link Kimi
138 2025-05-15 Enhancing Multi-Image Question Answering via Submodular Subset Selection Aaryan Sharma et.al. 2505.10533 null Kimi
139 2025-05-15 MASSV: Multimodal Adaptation and Self-Data Distillation for Speculative Decoding of Vision-Language Models Mugilan Ganesan et.al. 2505.10526 null Kimi
140 2025-05-15 Knowledge capture, adaptation and composition (KCAC): A framework for cross-task curriculum learning in robotic manipulation Xinrui Wang et.al. 2505.10522 null Kimi
141 2025-05-15 Multi-Token Prediction Needs Registers Anastasios Gerontopoulos et.al. 2505.10518 link Kimi
142 2025-05-15 The Devil Is in the Word Alignment Details: On Translation-Based Cross-Lingual Transfer for Token Classification Tasks Benedikt Ebing et.al. 2505.10507 null Kimi
143 2025-05-15 RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs Vibha Belavadi et.al. 2505.10495 null Kimi
144 2025-05-15 Can You Really Trust Code Copilots? Evaluating Large Language Models from a Code Security Perspective Yutao Mou et.al. 2505.10494 link Kimi
145 2025-05-15 CL-RAG: Bridging the Gap in Retrieval-Augmented Generation with Curriculum Learning Shaohan Wang et.al. 2505.10493 null Kimi
146 2025-05-15 UniEval: Unified Holistic Evaluation for Unified Multimodal Understanding and Generation Yi Li et.al. 2505.10483 null Kimi
147 2025-05-15 Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps Ningyuan Yang et.al. 2505.10482 null Kimi
148 2025-05-15 Parallel Scaling Law for Language Models Mouxiang Chen et.al. 2505.10475 link Kimi
149 2025-05-15 AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges Ranjan Sapkota et.al. 2505.10468 null Kimi
150 2025-05-15 Superposition Yields Robust Neural Scaling Yizhou Liu et.al. 2505.10465 link Kimi
151 2025-05-15 Vision language models have difficulty recognizing virtual objects Tyler Tran et.al. 2505.10453 null Kimi
152 2025-05-15 Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models Zemin Huang et.al. 2505.10446 null Kimi
153 2025-05-15 Are Large Language Models Robust in Understanding Code Against Semantics-Preserving Mutations? Pedro Orvalho et.al. 2505.10443 null Kimi
154 2025-05-15 Hierarchical Document Refinement for Long-context Retrieval-augmented Generation Jiajie Jin et.al. 2505.10413 link Kimi
155 2025-05-15 Are LLM-generated plain language summaries truly understandable? A large-scale crowdsourced evaluation Yue Guo et.al. 2505.10409 null Kimi
156 2025-05-15 Visual Fidelity Index for Generative Semantic Communications with Critical Information Embedding Jianhao Huang et.al. 2505.10405 null Kimi
157 2025-05-15 Rethinking Repetition Problems of LLMs in Code Generation Yihong Dong et.al. 2505.10402 link Kimi
158 2025-05-15 Evaluating Model Explanations without Ground Truth Kaivalya Rawal et.al. 2505.10399 link Kimi
159 2025-05-15 J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning Chenxi Whitehouse et.al. 2505.10320 null Kimi
160 2025-05-15 StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation Daniel A. P. Oliveira et.al. 2505.10292 link Kimi
161 2025-05-15 The Evolving Landscape of Generative Large Language Models and Traditional Natural Language Processing in Medicine Rui Yang et.al. 2505.10261 null Kimi
162 2025-05-15 Comparing LLM Text Annotation Skills: A Study on Human Rights Violations in Social Media Data Poli Apollinaire Nemkova et.al. 2505.10260 link Kimi
163 2025-05-15 On the Interplay of Human-AI Alignment, Fairness, and Performance Trade-offs in Medical Imaging Haozhe Luo et.al. 2505.10231 link Kimi
164 2025-05-15 ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention Jintian Shao et.al. 2505.10222 null Kimi
165 2025-05-15 The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think Seongyun Lee et.al. 2505.10185 null Kimi
166 2025-05-15 GE-Chat: A Graph Enhanced RAG Framework for Evidential Response Generation of LLMs Longchao Da et.al. 2505.10143 null Kimi
167 2025-05-15 From Text to Network: Constructing a Knowledge Graph of Taiwan-Based China Studies Using Generative AI Hsuan-Lei Shao et.al. 2505.10093 null Kimi
168 2025-05-15 CAFE: Retrieval Head-based Coarse-to-Fine Information Seeking to Enhance Multi-Document QA Capability Han Peng et.al. 2505.10063 null Kimi
169 2025-05-15 PsOCR: Benchmarking Large Multimodal Models for Optical Character Recognition in Low-resource Pashto Language Ijazul Haq et.al. 2505.10055 null Kimi
170 2025-05-15 ServeGen: Workload Characterization and Generation of Large Language Model Serving in Production Yuxing Xiang et.al. 2505.09999 null Kimi
171 2025-05-15 Analysing Safety Risks in LLMs Fine-Tuned with Pseudo-Malicious Cyber Security Data Adel ElZemity et.al. 2505.09974 null Kimi
172 2025-05-15 Pre-Act: Multi-Step Planning and Reasoning Improves Acting in LLM Agents Mrinal Rawat et.al. 2505.09970 null Kimi
173 2025-05-15 Personalizing Large Language Models using Retrieval Augmented Generation and Knowledge Graph Deeksha Prahlad et.al. 2505.09945 link Kimi
174 2025-05-15 Comparing Exploration-Exploitation Strategies of LLMs and Humans: Insights from Standard Multi-armed Bandit Tasks Ziyuan Zhang et.al. 2505.09901 link Kimi
175 2025-05-14 Do Large Language Models Know Conflict? Investigating Parametric vs. Non-Parametric Knowledge of LLMs for Conflict Forecasting Apollinaire Poli Nemkova et.al. 2505.09852 null Kimi
176 2025-05-14 Contextual Phenotyping of Pediatric Sepsis Cohort Using Large Language Models Aditya Nagori et.al. 2505.09805 null Kimi
177 2025-05-14 Trustless Autonomy: Understanding Motivations, Benefits and Governance Dilemma in Self-Sovereign Decentralized AI Agents Botao Amber Hu et.al. 2505.09757 null Kimi
178 2025-05-14 System Prompt Optimization with Meta-Learning Yumin Choi et.al. 2505.09666 null Kimi
179 2025-05-14 Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists? Anthony GX-Chen et.al. 2505.09614 null Kimi
180 2025-05-14 Customizing a Large Language Model for VHDL Design of High-Performance Microprocessors Nicolas Dupuis et.al. 2505.09610 null Kimi
181 2025-05-14 WorldView-Bench: A Benchmark for Evaluating Global Cultural Perspectives in Large Language Models Abdullah Mushtaq et.al. 2505.09595 null Kimi
182 2025-05-14 PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt Tuning Zongqian Li et.al. 2505.09519 link Kimi
183 2025-05-14 CXMArena: Unified Dataset to benchmark performance in realistic CXM Scenarios Raghav Garg et.al. 2505.09436 link Kimi
184 2025-05-14 Endo-CLIP: Progressive Self-Supervised Pre-training on Raw Colonoscopy Records Yili He et.al. 2505.09435 null Kimi
185 2025-05-14 Multilingual Machine Translation with Quantum Encoder Decoder Attention-based Convolutional Variational Circuits Subrit Dikshit et.al. 2505.09407 null Kimi
186 2025-05-14 The Influence of Human-inspired Agentic Sophistication in LLM-driven Strategic Reasoners Vince Trencsenyi et.al. 2505.09396 null Kimi
187 2025-05-14 Qwen3 Technical Report An Yang et.al. 2505.09388 link Kimi
188 2025-05-14 Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures Chenggang Zhao et.al. 2505.09343 null Kimi
189 2025-05-14 Llama See, Llama Do: A Mechanistic Perspective on Contextual Entrainment and Distraction in LLMs Jingcheng Niu et.al. 2505.09338 null Kimi
190 2025-05-14 Scent of Knowledge: Optimizing Search-Enhanced Reasoning with Information Foraging Hongjin Qian et.al. 2505.09316 null Kimi
191 2025-05-14 Reproducibility Study of “Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents” Pedro M. P. Curvo et.al. 2505.09289 link Kimi
192 2025-05-14 Learning to Detect Multi-class Anomalies with Just One Normal Image Prompt Bin-Bin Gao et.al. 2505.09264 link Kimi
193 2025-05-14 ELIS: Efficient LLM Iterative Scheduling System with Response Length Predictor Seungbeom Choi et.al. 2505.09142 null Kimi
194 2025-05-14 CEC-Zero: Chinese Error Correction Solution Based on LLM Sophie Zhang et.al. 2505.09082 null Kimi
195 2025-05-14 A Comprehensive Analysis of Large Language Model Outputs: Similarity, Diversity, and Bias Brandon Smith et.al. 2505.09056 null Kimi
196 2025-05-13 Improving the Reliability of LLMs: Combining CoT, RAG, Self-Consistency, and Self-Verification Adarsh Kumar et.al. 2505.09031 null Kimi
197 2025-05-13 Automated Meta Prompt Engineering for Alignment with the Theory of Mind Aaron Baughman et.al. 2505.09024 null Kimi
198 2025-05-13 Prioritizing Image-Related Tokens Enhances Vision-Language Pre-Training Yangyi Chen et.al. 2505.08971 link Kimi
199 2025-05-13 Toward Cost-Efficient Serving of Mixture-of-Experts with Asynchrony Shaoyu Wang et.al. 2505.08944 null Kimi
200 2025-05-13 Performance Gains of LLMs With Humans in a World of LLMs Versus Humans Lucas McCullum et.al. 2505.08902 null Kimi
201 2025-05-13 Generative AI for Autonomous Driving: Frontiers and Opportunities Yuping Wang et.al. 2505.08854 link Kimi
202 2025-05-13 CodePDE: An Inference Framework for LLM-driven PDE Solver Generation Shanda Li et.al. 2505.08783 link Kimi
203 2025-05-14 Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology Yatai Ji et.al. 2505.08765 null Kimi
204 2025-05-13 DeepMath-Creative: A Benchmark for Evaluating Mathematical Creativity of Large Language Models Xiaoyang Chen et.al. 2505.08744 link Kimi
205 2025-05-13 Probability Consistency in Large Language Models: Theoretical Foundations Meet Empirical Discrepancies Xiaoliang Luo et.al. 2505.08739 link Kimi
206 2025-05-13 NurValues: Real-World Nursing Values Evaluation for Large Language Models in Clinical Context Ben Yao et.al. 2505.08734 null Kimi
207 2025-05-13 PWC-MoE: Privacy-Aware Wireless Collaborative Mixture of Experts Yang Su et.al. 2505.08719 null Kimi
208 2025-05-13 LLM-based Prompt Ensemble for Reliable Medical Entity Recognition from EHRs K M Sajjadul Islam et.al. 2505.08704 null Kimi
209 2025-05-13 TRAIL: Trace Reasoning and Agentic Issue Localization Darshan Deshpande et.al. 2505.08638 null Kimi
210 2025-05-13 Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models Donghoon Kim et.al. 2505.08622 null Kimi
211 2025-05-13 Automatic Task Detection and Heterogeneous LLM Speculative Decoding Danying Ge et.al. 2505.08600 null Kimi
212 2025-05-13 Small but Significant: On the Promise of Small Language Models for Accessible AIED Yumou Wei et.al. 2505.08588 null Kimi
213 2025-05-13 The Truth Becomes Clearer Through Debate! Multi-Agent Systems with Large Language Models Unmask Fake News Yuhan Liu et.al. 2505.08532 null Kimi
214 2025-05-13 LCES: Zero-shot Automated Essay Scoring via Pairwise Comparisons Using Large Language Models Takumi Shibata et.al. 2505.08498 null Kimi
215 2025-05-13 RepCali: High Efficient Fine-tuning Via Representation Calibration in Latent Space for Pre-trained Language Models Fujun Zhang et.al. 2505.08463 null Kimi
216 2025-05-13 Accelerating Chain-of-Thought Reasoning: When Goal-Gradient Importance Meets Dynamic Skipping Ren Zhuang et.al. 2505.08392 null Kimi
217 2025-05-13 Benchmarking AI scientists in omics data-driven biological research Erpai Luo et.al. 2505.08341 link Kimi
218 2025-05-13 AM-Thinking-v1: Advancing the Frontier of Reasoning at 32B Scale Yunjie Ji et.al. 2505.08311 null Kimi
219 2025-05-13 Evaluating the Effectiveness of Black-Box Prompt Optimization as the Scale of LLMs Continues to Grow Ziyu Zhou et.al. 2505.08303 null Kimi
220 2025-05-13 Enhancing Cache-Augmented Generation (CAG) with Adaptive Contextual Compression for Scalable Knowledge Integration Rishabh Agrawal et.al. 2505.08261 null Kimi
221 2025-05-13 Evaluating LLM Metrics Through Real-World Capabilities Justin K Miller et.al. 2505.08253 null Kimi
222 2025-05-13 Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement Haoran Ye et.al. 2505.08245 link Kimi
223 2025-05-13 A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM Outputs Artem Shelmanov et.al. 2505.08200 null Kimi
224 2025-05-13 Fusing Bidirectional Chains of Thought and Reward Mechanisms: A Method for Enhancing Question-Answering Capabilities of Large Language Models for Chinese Intangible Cultural Heritage Ruilin Liu et.al. 2505.08167 null Kimi
225 2025-05-13 Decoding Neighborhood Environments with Large Language Models Andrew Cart et.al. 2505.08163 null Kimi
226 2025-05-13 Lost in Transmission: When and Why LLMs Fail to Reason Globally Tobias Schnabel et.al. 2505.08140 null Kimi
227 2025-05-13 ALOHA: Empowering Multilingual Agent for University Orientation with Hierarchical Retrieval Mingxu Tao et.al. 2505.08130 null Kimi
228 2025-05-12 Are LLMs complicated ethical dilemma analyzers? Jiashen et.al. 2505.08106 link Kimi
229 2025-05-12 Beyond Input Activations: Identifying Influential Latents by Gradient Sparse Autoencoders Dong Shu et.al. 2505.08080 null Kimi
230 2025-05-12 FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning Zhehao Zhang et.al. 2505.08054 null Kimi
231 2025-05-12 Learning from Peers in Reasoning Models Tongxu Luo et.al. 2505.07787 null Kimi
232 2025-05-12 S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models Muzhi Dai et.al. 2505.07686 null Kimi
233 2025-05-12 SpecRouter: Adaptive Routing for Multi-Level Speculative Decoding in Large Language Models Hang Wu et.al. 2505.07680 null Kimi
234 2025-05-13 OnPrem.LLM: A Privacy-Conscious Document Intelligence Toolkit Arun S. Maiya et.al. 2505.07672 link Kimi
235 2025-05-12 Benchmarking Retrieval-Augmented Generation for Chemistry Xianrui Zhong et.al. 2505.07671 null Kimi
236 2025-05-12 Concept-Level Explainability for Auditing & Steering LLM Responses Kenza Amara et.al. 2505.07610 link Kimi
237 2025-05-12 MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining Xiaomi LLM-Core Team et.al. 2505.07608 link Kimi
238 2025-05-12 Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent Ziyang Huang et.al. 2505.07596 null Kimi
239 2025-05-12 A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models Junjie Ye et.al. 2505.07591 link Kimi
240 2025-05-12 ToolACE-DEV: Self-Improving Tool Learning via Decomposition and EVolution Xu Huang et.al. 2505.07512 null Kimi
241 2025-05-12 A Survey on Collaborative Mechanisms Between Large and Small Language Models Yi Chen et.al. 2505.07460 null Kimi
242 2025-05-12 How well do LLMs reason over tabular data, really? Cornelius Wolff et.al. 2505.07453 null Kimi
243 2025-05-12 Synthetic Code Surgery: Repairing Bugs and Vulnerabilities with LLMs and Synthetic Data David de-Fitero-Dominguez et.al. 2505.07372 null Kimi
244 2025-05-12 QUPID: Quantified Understanding for Enhanced Performance, Insights, and Decisions in Korean Search Engines Ohjoon Kwon et.al. 2505.07345 null Kimi
245 2025-05-12 Generative Pre-trained Autoregressive Diffusion Transformer Yuan Zhang et.al. 2505.07344 null Kimi
246 2025-05-12 Towards Multi-Agent Reasoning Systems for Collaborative Expertise Delegation: An Exploratory Design Study Baixuan Xu et.al. 2505.07313 null Kimi
247 2025-05-12 Semantic Retention and Extreme Compression in LLMs: Can We Have Both? Stanislas Laborde et.al. 2505.07289 null Kimi
248 2025-05-12 UMoE: Unifying Attention and FFN with Shared Experts Yuanhang Yang et.al. 2505.07260 null Kimi
249 2025-05-12 SAS-Bench: A Fine-Grained Benchmark for Evaluating Short Answer Scoring with Large Language Models Peichao Lai et.al. 2505.07247 link Kimi
250 2025-05-12 Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity Guang Yan et.al. 2505.07239 null Kimi
251 2025-05-12 DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation Jiashuo Sun et.al. 2505.07233 link Kimi
252 2025-05-12 Measuring General Intelligence with Generated Games Vivek Verma et.al. 2505.07215 link Kimi
253 2025-05-12 Benchmarking Ethical and Safety Risks of Healthcare LLMs in China: Toward Systemic Governance under Healthy China 2030 Mouxiao Bian et.al. 2505.07205 null Kimi
254 2025-05-12 PrefillOnly: An Inference Engine for Prefill-only Workloads in Large Language Model Applications Kuntai Du et.al. 2505.07203 null Kimi
255 2025-05-12 One Trigger Token Is Enough: A Defense Strategy for Balancing Safety and Usability in Large Language Models Haoran Gu et.al. 2505.07167 null Kimi
256 2025-05-12 Pre-training vs. Fine-tuning: A Reproducibility Study on Dense Retrieval Knowledge Acquisition Zheng Yao et.al. 2505.07166 link Kimi
257 2025-05-11 RefPentester: A Knowledge-Informed Self-Reflective Penetration Testing Framework Based on Large Language Models Hanzheng Dai et.al. 2505.07089 null Kimi
258 2025-05-11 Architectural Precedents for General Agents using Large Language Models Robert E. Wray et.al. 2505.07087 null Kimi
259 2025-05-11 DialogueReason: Rule-Based RL Sparks Dialogue Reasoning in LLMs Yubo Shu et.al. 2505.07049 null Kimi
260 2025-05-11 LLM-Augmented Chemical Synthesis and Design Decision Programs Haorui Wang et.al. 2505.07027 null Kimi
261 2025-05-08 DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion Qitao Zhao et.al. 2505.05473 null Kimi
262 2025-05-08 Flow-GRPO: Training Flow Matching Models via Online RL Jie Liu et.al. 2505.05470 link Kimi
263 2025-05-08 Generating Physically Stable and Buildable LEGO Designs from Text Ava Pun et.al. 2505.05469 link Kimi
264 2025-05-08 StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant Haibo Wang et.al. 2505.05467 null Kimi
265 2025-05-08 ComPO: Preference Alignment via Comparison Oracles Peter Chen et.al. 2505.05465 null Kimi
266 2025-05-08 Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging Shiqi Chen et.al. 2505.05464 link Kimi
267 2025-05-08 UKElectionNarratives: A Dataset of Misleading Narratives Surrounding Recent UK General Elections Fatima Haouari et.al. 2505.05459 null Kimi
268 2025-05-08 SITE: towards Spatial Intelligence Thorough Evaluation Wenqi Wang et.al. 2505.05456 null Kimi
269 2025-05-08 Conversational Process Model Redesign Nataliia Klievtsova et.al. 2505.05453 null Kimi
270 2025-05-08 Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding Han Xiao et.al. 2505.05446 link Kimi
271 2025-05-08 clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations Chalamalasetti Kranti et.al. 2505.05445 null Kimi
272 2025-05-08 EcoAgent: An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation Biao Yi et.al. 2505.05440 null Kimi
273 2025-05-08 Empowering Scientific Workflows with Federated Agents J. Gregory Pauloski et.al. 2505.05428 link Kimi
274 2025-05-08 Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data Yudong Wang et.al. 2505.05427 null Kimi
275 2025-05-08 TransProQA: an LLM-based literary Translation evaluation metric with Professional Question Answering Ran Zhang et.al. 2505.05423 link Kimi
276 2025-05-08 TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation Haokun Lin et.al. 2505.05422 link Kimi
277 2025-05-08 Reasoning Models Don’t Always Say What They Think Yanda Chen et.al. 2505.05410 null Kimi
278 2025-05-08 Crosslingual Reasoning through Test-Time Scaling Zheng-Xin Yong et.al. 2505.05408 link Kimi
279 2025-05-08 Frame In, Frame Out: Do LLMs Generate More Biased News Headlines than Humans? Valeria Pastorino et.al. 2505.05406 null Kimi
280 2025-05-08 CART-ELC: Oblique Decision Tree Induction via Exhaustive Search Andrew D. Laack et.al. 2505.05402 link Kimi
281 2025-05-08 PillarMamba: Learning Local-Global Context for Roadside Point Cloud via Hybrid State Space Model Zhang Zhang et.al. 2505.05397 null Kimi
282 2025-05-08 EDmamba: A Simple yet Effective Event Denoising Method with State Space Model Ciyu Ruan et.al. 2505.05391 null Kimi
283 2025-05-08 Walrus: An Efficient Decentralized Storage Network George Danezis et.al. 2505.05370 null Kimi
284 2025-05-08 High-fidelity Grain Growth Modeling: Leveraging Deep Learning for Fast Computations Pungponhavoan Tep et.al. 2505.05354 null Kimi
285 2025-05-08 Hearing and Seeing Through CLIP: A Framework for Self-Supervised Sound Source Localization Sooyoung Park et.al. 2505.05343 link Kimi
286 2025-05-08 Progressive Inertial Poser: Progressive Real-Time Kinematic Chain Estimation for 3D Full-Body Pose from Three IMU Sensors Zunjie Zhu et.al. 2505.05336 null Kimi
287 2025-05-08 ICon: In-Context Contribution for Automatic Data Selection Yixin Yang et.al. 2505.05327 null Kimi
288 2025-05-08 Scalable Chain of Thoughts via Elastic Reasoning Yuhui Xu et.al. 2505.05315 null Kimi
289 2025-05-08 T-T: Table Transformer for Tagging-based Aspect Sentiment Triplet Extraction Kun Peng et.al. 2505.05271 null Kimi
290 2025-05-08 Revealing Weaknesses in Text Watermarking Through Self-Information Rewrite Attacks Yixin Cheng et.al. 2505.05190 link Kimi
291 2025-05-08 Biomed-DPT: Dual Modality Prompt Tuning for Biomedical Vision-Language Models Wei Peng et.al. 2505.05189 null Kimi
292 2025-05-08 MARK: Memory Augmented Refinement of Knowledge Anish Ganguli et.al. 2505.05177 null Kimi
293 2025-05-08 X-Driver: Explainable Autonomous Driving with Vision-Language Models Wei Liu et.al. 2505.05098 null Kimi
294 2025-05-08 Latent Preference Coding: Aligning Large Language Models via Discrete Latent Codes Zhuocheng Gong et.al. 2505.04993 null Kimi
295 2025-05-08 Chain-of-Thought Tokens are Computer Program Variables Fangwei Zhu et.al. 2505.04955 link Kimi
296 2025-05-08 Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models Yunxin Li et.al. 2505.04921 link Kimi
297 2025-05-08 An Open-Source Dual-Loss Embedding Model for Semantic Retrieval in Higher Education Ramteja Sajja et.al. 2505.04916 null Kimi
298 2025-05-08 Enigme: Generative Text Puzzles for Evaluating Reasoning in Language Models John Hawkins et.al. 2505.04914 link Kimi
299 2025-05-08 SpatialPrompting: Keyframe-driven Zero-Shot Spatial Reasoning with Off-the-Shelf Multimodal Large Language Models Shun Taguchi et.al. 2505.04911 null Kimi
300 2025-05-08 ConCISE: Confidence-guided Compression in Step-by-step Efficient Reasoning Ziqing Qiao et.al. 2505.04881 null Kimi
301 2025-05-08 GroverGPT-2: Simulating Grover’s Algorithm via Chain-of-Thought Reasoning and Quantum-Native Tokenization Min Chen et.al. 2505.04880 null Kimi
302 2025-05-07 CRAFT: Cultural Russian-Oriented Dataset Adaptation for Focused Text-to-Image Generation Viacheslav Vasilev et.al. 2505.04851 null Kimi
303 2025-05-07 Benchmarking LLM Faithfulness in RAG with Evolving Leaderboards Manveer Singh Tamber et.al. 2505.04847 null Kimi
304 2025-05-07 Large Language Models are Autonomous Cyber Defenders Sebastián R. Castro et.al. 2505.04843 link Kimi
305 2025-05-07 ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling Xiao Wang et.al. 2505.04802 null Kimi
306 2025-05-07 The Promise and Limits of LLMs in Constructing Proofs and Hints for Logic Problems in Intelligent Tutoring Systems Sutapa Dey Tithi et.al. 2505.04736 null Kimi
307 2025-05-07 SOAEsV2-7B/72B: Full-Pipeline Optimization for State-Owned Enterprise LLMs via Continual Pre-Training, Domain-Progressive SFT and Distillation-Enhanced Speculative Decoding Jingyang Deng et.al. 2505.04723 null Kimi
308 2025-05-07 EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning Zhenghao Xing et.al. 2505.04623 link Kimi
309 2025-05-07 ZeroSearch: Incentivize the Search Capability of LLMs without Searching Hao Sun et.al. 2505.04588 null Kimi
310 2025-05-07 Overcoming Data Scarcity in Generative Language Modelling for Low-Resource Languages: A Systematic Review Josh McGiff et.al. 2505.04531 null Kimi
311 2025-05-07 Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs Yehui Tang et.al. 2505.04519 null Kimi
312 2025-05-07 CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation Jiahao Li et.al. 2505.04481 null Kimi
313 2025-05-07 OBLIVIATE: Robust and Practical Machine Unlearning for Large Language Models Xiaoyu Xu et.al. 2505.04416 null Kimi
314 2025-05-07 YABLoCo: Yet Another Benchmark for Long Context Code Generation Aidar Valeev et.al. 2505.04406 null Kimi
315 2025-05-07 The Aloe Family Recipe for Open and Specialized Healthcare LLMs Dario Garcia-Gasulla et.al. 2505.04388 null Kimi
316 2025-05-07 Benchmarking LLMs’ Swarm intelligence Kai Ruan et.al. 2505.04364 link Kimi
317 2025-05-07 GASCADE: Grouped Summarization of Adverse Drug Event for Enhanced Cancer Pharmacovigilance Sofia Jamil et.al. 2505.04284 link Kimi
318 2025-05-07 SToLa: Self-Adaptive Touch-Language Framework with Tactile Commonsense Reasoning in Open-Ended Scenarios Ning Cheng et.al. 2505.04201 null Kimi
319 2025-05-07 VideoPath-LLaVA: Pathology Diagnostic Reasoning Through Video Instruction Tuning Trinh T. L. Vuong et.al. 2505.04192 link Kimi
320 2025-05-07 S3D: Sketch-Driven 3D Model Generation Hail Song et.al. 2505.04185 link Kimi
321 2025-05-07 Large Language Models are often politically extreme, usually ideologically inconsistent, and persuasive even in informational contexts Nouar Aldahoul et.al. 2505.04171 null Kimi
322 2025-05-07 Unmasking the Canvas: A Dynamic Benchmark for Image Generation Jailbreaking and LLM Content Safety Variath Madhupal Gautham Nair et.al. 2505.04146 null Kimi
323 2025-05-07 Enhancing Granular Sentiment Classification with Chain-of-Thought Prompting in Large Language Models Vihaan Miriyala et.al. 2505.04135 null Kimi
324 2025-05-07 LLM-e Guess: Can LLMs Capabilities Advance Without Hardware Progress? Teddy Foley et.al. 2505.04075 link Kimi
325 2025-05-07 Advancing and Benchmarking Personalized Tool Invocation for LLMs Xu Huang et.al. 2505.04072 link Kimi
326 2025-05-06 Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving Shan Yu et.al. 2505.04021 null Kimi
327 2025-05-06 SLOT: Structuring the Output of Large Language Models Darren Yow-Bang Wang et.al. 2505.04016 null Kimi
328 2025-05-06 Can Large Language Models Predict Parallel Code Performance? Gregory Bolet et.al. 2505.03988 null Kimi
329 2025-05-06 X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains Qianchu Liu et.al. 2505.03981 null Kimi
330 2025-05-06 The Power of Stories: Narrative Priming Shapes How LLM Agents Collaborate and Compete Gerrit Großmann et.al. 2505.03961 link Kimi
331 2025-05-06 Frog Soup: Zero-Shot, In-Context, and Sample-Efficient Frogger Agents Xiang Li et.al. 2505.03947 link Kimi
332 2025-05-06 MARCO: A Multi-Agent System for Optimizing HPC Code Generation Using Large Language Models Asif Rahman et.al. 2505.03906 null Kimi
333 2025-05-06 Novel Extraction of Discriminative Fine-Grained Feature to Improve Retinal Vessel Segmentation Shuang Zeng et.al. 2505.03896 link Kimi
334 2025-05-06 VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model Zuwei Long et.al. 2505.03739 link Kimi
335 2025-05-06 WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch Zimu Lu et.al. 2505.03733 link Kimi
336 2025-05-06 Distribution-Conditional Generation: From Class Distribution to Creative Generation Fu Feng et.al. 2505.03667 null Kimi
337 2025-05-06 ReGraP-LLaVA: Reasoning enabled Graph-based Personalized Large Language and Vision Assistant Yifan Xiang et.al. 2505.03654 null Kimi
338 2025-05-06 A Hashgraph-Inspired Consensus Mechanism for Reliable Multi-Model Reasoning Kolawole E. Ogunsina et.al. 2505.03553 null Kimi
339 2025-05-06 Faster MoE LLM Inference for Extremely Large Models Haoqi Yang et.al. 2505.03531 null Kimi
340 2025-05-06 Long-Short Chain-of-Thought Mixture Supervised Fine-Tuning Eliciting Efficient Reasoning in Large Language Models Bin Yu et.al. 2505.03469 link Kimi
341 2025-05-06 The Steganographic Potentials of Language Models Artem Karpov et.al. 2505.03439 null Kimi
342 2025-05-06 Procedural Memory Is Not All You Need: Bridging Cognitive Gaps in LLM-Based Agents Schaun Wheeler et.al. 2505.03434 null Kimi
343 2025-05-06 MedArabiQ: Benchmarking Large Language Models on Arabic Medical Tasks Mouath Abu Daoud et.al. 2505.03427 link Kimi
344 2025-05-06 Lightweight Clinical Decision Support System using QLoRA-Fine-Tuned LLMs and Retrieval-Augmented Generation Mohammad Shoaib Ansari et.al. 2505.03406 null Kimi
345 2025-05-06 Absolute Zero: Reinforced Self-play Reasoning with Zero Data Andrew Zhao et.al. 2505.03335 link Kimi
346 2025-05-06 AI-Driven Scholarly Peer Review via Persistent Workflow Prompting, Meta-Prompting, and Meta-Reasoning Evgeny Markhasin et.al. 2505.03332 null Kimi
347 2025-05-06 Recall with Reasoning: Chain-of-Thought Distillation for Mamba’s Long-Context Memory and Extrapolation Junyu Ma et.al. 2505.03320 null Kimi
348 2025-05-06 SepALM: Audio Language Models Are Error Correctors for Robust Speech Separation Zhaoxi Mu et.al. 2505.03273 null Kimi
349 2025-05-06 RAVU: Retrieval Augmented Video Understanding with Compositional Reasoning over Graph Sameer Malik et.al. 2505.03173 null Kimi
350 2025-05-06 Assessing and Enhancing the Robustness of LLM-based Multi-Agent Systems Through Chaos Engineering Joshua Owotogbe et.al. 2505.03096 null Kimi
351 2025-05-05 Developing A Framework to Support Human Evaluation of Bias in Generated Free Response Text Jennifer Healey et.al. 2505.03053 null Kimi
352 2025-05-05 A Typology of Synthetic Datasets for Dialogue Processing in Clinical Contexts Steven Bedrick et.al. 2505.03025 null Kimi
353 2025-05-05 Memorization or Interpolation? Detecting LLM Memorization through Input Perturbation Analysis Albérick Euraste Djiré et.al. 2505.03019 null Kimi
354 2025-05-05 RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale Daniel Goldstein et.al. 2505.03005 link Kimi
355 2025-05-05 Generating Narrated Lecture Videos from Slides with Synchronized Highlights Alexander Holmberg et.al. 2505.02966 null Kimi
356 2025-05-05 When Your Own Output Becomes Your Training Data: Noise-to-Meaning Loops and a Formal RSI Trigger Rintaro Ando et.al. 2505.02888 link Kimi
357 2025-05-05 AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation Qingqiu Li et.al. 2505.02830 null Kimi
358 2025-05-05 AutoLibra: Agent Metric Induction from Open-Ended Feedback Hao Zhu et.al. 2505.02820 link Kimi
359 2025-05-05 Knowing You Don’t Know: Learning When to Continue Search in Multi-round RAG through Self-Practicing Diji Yang et.al. 2505.02811 link Kimi
360 2025-05-05 HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning Framework for Large Language Models Zheng Lin et.al. 2505.02795 null Kimi
361 2025-05-05 Bye-bye, Bluebook? Automating Legal Procedure with Large Language Models Matthew Dahl et.al. 2505.02763 null Kimi
362 2025-05-05 Using Knowledge Graphs to harvest datasets for efficient CLIP model training Simon Ging et.al. 2505.02746 link Kimi
363 2025-05-05 FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models Zhouliang Yu et.al. 2505.02735 link Kimi
364 2025-05-05 Enhancing LLMs’ Clinical Reasoning with Real-World Data from a Nationwide Sepsis Registry Junu Kim et.al. 2505.02722 link Kimi
365 2025-05-05 Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play Yemin Shi et.al. 2505.02707 link Kimi
366 2025-05-05 Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models Xiaobao Wu et.al. 2505.02686 link Kimi
367 2025-05-05 A Survey of Slow Thinking-based Reasoning LLMs using Reinforced Learning and Inference-time Scaling Law Qianjun Pan et.al. 2505.02665 null Kimi
368 2025-05-05 Enhancing Chemical Reaction and Retrosynthesis Prediction with Large Language Model and Dual-task Learning Xuan Lin et.al. 2505.02639 null Kimi
369 2025-05-05 LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis Qingkai Fang et.al. 2505.02625 link Kimi
370 2025-05-05 EMORL: Ensemble Multi-Objective Reinforcement Learning for Efficient and Flexible LLM Fine-Tuning Lingxiao Kong et.al. 2505.02579 link Kimi
371 2025-05-05 Bielik v3 Small: Technical Report Krzysztof Ociepa et.al. 2505.02550 null Kimi
372 2025-05-05 Large Language Model Partitioning for Low-Latency Inference at the Edge Dimitrios Kafetzis et.al. 2505.02533 null Kimi
373 2025-05-05 Beyond the model: Key differentiators in large language models and multi-agent services Muskaan Goyal et.al. 2505.02489 null Kimi
374 2025-05-05 Incentivizing Inclusive Contributions in Model Sharing Markets Enpei Zhang et.al. 2505.02462 null Kimi
375 2025-05-05 Colombian Waitresses y Jueces canadienses: Gender and Country Biases in Occupation Recommendations from LLMs Elisa Forcada Rodríguez et.al. 2505.02456 null Kimi
376 2025-05-05 Bielik 11B v2 Technical Report Krzysztof Ociepa et.al. 2505.02410 null Kimi
377 2025-05-05 Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL Jiarui Yao et.al. 2505.02391 link Kimi
378 2025-05-05 RM-R1: Reward Modeling as Reasoning Xiusi Chen et.al. 2505.02387 link Kimi
379 2025-05-05 JTCSE: Joint Tensor-Modulus Constraints and Cross-Attention for Unsupervised Contrastive Learning of Sentence Embeddings Tianyu Zong et.al. 2505.02366 link Kimi
380 2025-05-05 Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression Techniques Sanjay Surendranath Girija et.al. 2505.02309 null Kimi
381 2025-05-05 Generative Sign-description Prompts with Multi-positive Contrastive Learning for Sign Language Recognition Siyu Liang et.al. 2505.02304 null Kimi
382 2025-05-04 Parameter-Efficient Transformer Embeddings Henry Ndubuaku et.al. 2505.02266 link Kimi
383 2025-05-04 SEval-Ex: A Statement-Level Framework for Explainable Summarization Evaluation Tanguy Herserant et.al. 2505.02235 null Kimi
384 2025-05-04 Interpretable Emergent Language Using Inter-Agent Transformers Mannan Bhardwaj et.al. 2505.02215 link Kimi
385 2025-05-04 Leveraging LLMs to Automate Energy-Aware Refactoring of Parallel Scientific Codes Matthew T. Dearing et.al. 2505.02184 null Kimi
386 2025-05-04 Measuring Hong Kong Massive Multi-Task Language Understanding Chuxue Cao et.al. 2505.02177 null Kimi
387 2025-05-04 A New HOPE: Domain-agnostic Automatic Evaluation of Text Chunking Henrik Brådland et.al. 2505.02171 null Kimi
388 2025-05-04 Think on your Feet: Adaptive Thinking via Reinforcement Learning for Social Agents Minzheng Wang et.al. 2505.02156 link Kimi
389 2025-05-01 T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT Dongzhi Jiang et.al. 2505.00703 link Kimi
390 2025-05-01 RayZer: A Self-supervised Large View Synthesis Model Hanwen Jiang et.al. 2505.00702 null Kimi
391 2025-05-01 Robotic Visual Instruction Yanbang Li et.al. 2505.00693 null Kimi
392 2025-05-01 Towards Autonomous Micromobility through Scalable Urban Simulation Wayne Wu et.al. 2505.00690 null Kimi
393 2025-05-01 GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution Aditya Arora et.al. 2505.00687 null Kimi
394 2025-05-01 Visual Test-time Scaling for GUI Agent Grounding Tiange Luo et.al. 2505.00684 link Kimi
395 2025-05-01 MINERVA: Evaluating Complex Video Reasoning Arsha Nagrani et.al. 2505.00681 link Kimi
396 2025-05-01 Steering Large Language Models with Register Analysis for Arbitrary Style Transfer Xinchen Yang et.al. 2505.00679 null Kimi
397 2025-05-01 Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions Yiming Du et.al. 2505.00675 link Kimi
398 2025-05-01 DeepCritic: Deliberate Critique with Large Language Models Wenkai Yang et.al. 2505.00662 link Kimi
399 2025-05-01 On the generalization of language models from in-context learning and finetuning: a controlled study Andrew K. Lampinen et.al. 2505.00661 null Kimi
400 2025-05-01 Large Language Models Understanding: an Inherent Ambiguity Barrier Daniel N. Nissani et.al. 2505.00654 null Kimi
401 2025-05-01 Open-Source LLM-Driven Federated Transformer for Predictive IoV Management Yazan Otoum et.al. 2505.00651 null Kimi
402 2025-05-01 OmicsCL: Unsupervised Contrastive Learning for Cancer Subtype Discovery and Survival Stratification Atahan Karagoz et.al. 2505.00650 link Kimi
403 2025-05-01 Investigating Task Arithmetic for Zero-Shot Information Retrieval Marco Braga et.al. 2505.00649 link Kimi
404 2025-05-01 Deep Learning Assisted Outer Volume Removal for Highly-Accelerated Real-Time Dynamic MRI Merve Gülle et.al. 2505.00643 null Kimi
405 2025-05-01 Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook Muyi Bao et.al. 2505.00630 link Kimi
406 2025-05-01 The Illusion of Role Separation: Hidden Shortcuts in LLM Role Learning (and How to Fix Them) Zihao Wang et.al. 2505.00626 null Kimi
407 2025-05-01 FineScope: Precision Pruning for Domain-Specialized Large Language Models Using SAE-Guided Self-Data Cultivation Chaitali Bhattacharyya et.al. 2505.00624 null Kimi
408 2025-05-01 Pixel3DMM: Versatile Screen-Space Priors for Single-Image 3D Face Reconstruction Simon Giebenhain et.al. 2505.00615 null Kimi
409 2025-05-01 Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation D. Sculley et.al. 2505.00612 null Kimi
410 2025-05-01 Combining LLMs with Logic-Based Framework to Explain MCTS Ziyan An et.al. 2505.00610 null Kimi
411 2025-05-01 Can LLMs Help Improve Analogical Reasoning For Strategic Decisions? Experimental Evidence from Humans and GPT-4 Phanish Puranam et.al. 2505.00603 null Kimi
412 2025-05-01 Fast and Low-Cost Genomic Foundation Models via Outlier Removal Haozheng Luo et.al. 2505.00598 link Kimi
413 2025-05-01 A Finite-State Controller Based Offline Solver for Deterministic POMDPs Alex Schutz et.al. 2505.00596 link Kimi
414 2025-05-01 Uncertainty-Aware Multi-Expert Knowledge Distillation for Imbalanced Disease Grading Shuo Tong et.al. 2505.00592 null Kimi
415 2025-05-01 FreqKV: Frequency Domain Key-Value Compression for Efficient Context Window Extension Jushi Kai et.al. 2505.00570 null Kimi
416 2025-05-01 Triggering Hallucinations in LLMs: A Quantitative Study of Prompt-Induced Hallucination in Large Language Models Makoto Sato et.al. 2505.00557 null Kimi
417 2025-05-01 100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models Chong Zhang et.al. 2505.00551 null Kimi
418 2025-05-01 HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection Deanna Emery et.al. 2505.00506 null Kimi
419 2025-05-01 UserCentrix: An Agentic Memory-augmented AI Framework for Smart Spaces Alaa Saleh et.al. 2505.00472 null Kimi
420 2025-05-01 Red Teaming Large Language Models for Healthcare Vahid Balazadeh et.al. 2505.00467 null Kimi
421 2025-05-01 Data Therapist: Eliciting Domain Knowledge from Subject Matter Experts Using Large Language Models Sungbok Shin et.al. 2505.00455 null Kimi
422 2025-05-01 KoACD: The First Korean Adolescent Dataset for Cognitive Distortion Analysis JunSeo Kim et.al. 2505.00367 null Kimi
423 2025-05-01 Enhancing AI-Driven Education: Integrating Cognitive Frameworks, Linguistic Feedback Analysis, and Ethical Considerations for Improved Content Generation Antoun Yaacoub et.al. 2505.00339 null Kimi
424 2025-05-01 Mixture of Sparse Attention: Content-Based Learnable Sparse Attention via Expert-Choice Routing Piotr Piękos et.al. 2505.00315 link Kimi
425 2025-05-01 Fine-grained spatial-temporal perception for gas leak segmentation Xinlong Zhao et.al. 2505.00295 null Kimi
426 2025-05-01 Empowering Agentic Video Analytics Systems with Video Language Models Yuxuan Yan et.al. 2505.00254 null Kimi
427 2025-04-30 Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems Shaokun Zhang et.al. 2505.00212 link Kimi
428 2025-04-30 Detecting and Mitigating Hateful Content in Multimodal Memes with Vision-Language Models Minh-Hao Van et.al. 2505.00150 null Kimi
429 2025-04-30 AdaptMI: Adaptive Skill-based In-context Math Instruction for Small Language Models Yinghui He et.al. 2505.00147 null Kimi
430 2025-04-30 Between Underthinking and Overthinking: An Empirical Study of Reasoning Length and Correctness in LLMs Jinyan Su et.al. 2505.00127 null Kimi
431 2025-04-30 Fine-Tuning LLMs for Low-Resource Dialect Translation: The Case of Lebanese Silvana Yakhni et.al. 2505.00114 link Kimi
432 2025-04-30 GDI-Bench: A Benchmark for General Document Intelligence with Vision and Reasoning Decoupling Siqi Li et.al. 2505.00063 null Kimi
433 2025-04-30 TRUST: An LLM-Based Dialogue System for Trauma Understanding and Structured Assessments Sichang Tu et.al. 2504.21851 null Kimi
434 2025-04-30 Early Exit and Multi Stage Knowledge Distillation in VLMs for Video Summarization Anas Anwarul Haq Khan et.al. 2504.21831 null Kimi
435 2025-04-30 DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition Z. Z. Ren et.al. 2504.21801 null Kimi
436 2025-04-30 WebThinker: Empowering Large Reasoning Models with Deep Research Capability Xiaoxi Li et.al. 2504.21776 link Kimi
437 2025-04-30 MAC-Tuning: LLM Multi-Compositional Problem Reasoning with Enhanced Knowledge Boundary Awareness Junsheng Huang et.al. 2504.21773 null Kimi
438 2025-04-30 AdaR1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization Haotian Luo et.al. 2504.21659 link Kimi
439 2025-04-30 Sadeed: Advancing Arabic Diacritization Through Small Language Model Zeina Aldallal et.al. 2504.21635 null Kimi
440 2025-04-30 Meeseeks: An Iterative Benchmark Evaluating LLMs Multi-Turn Instruction-Following Ability Jiaming Wang et.al. 2504.21625 null Kimi
441 2025-04-30 RDF-Based Structured Quality Assessment Representation of Multilingual LLM Evaluations Jonas Gwozdz et.al. 2504.21605 null Kimi
442 2025-04-30 DNB-AI-Project at SemEval-2025 Task 5: An LLM-Ensemble Approach for Automated Subject Indexing Lisa Kluge et.al. 2504.21589 link Kimi
443 2025-04-30 Precision Where It Matters: A Novel Spike Aware Mixed-Precision Quantization Strategy for LLaMA-based Language Models Lucas Maisonnave et.al. 2504.21553 null Kimi
444 2025-04-30 RWKV-X: A Linear Complexity Hybrid Language Model Haowen Hou et.al. 2504.21463 link Kimi
445 2025-04-30 SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding Chenkai Zhang et.al. 2504.21435 link Kimi
446 2025-04-30 Retrieval-Enhanced Few-Shot Prompting for Speech Event Extraction Máté Gedeon et.al. 2504.21372 null Kimi
447 2025-04-30 ShorterBetter: Guiding Reasoning Models to Find Optimal Inference Length for Efficient Reasoning Jingyang Yi et.al. 2504.21370 null Kimi
448 2025-04-30 Revisiting Diffusion Autoencoder Training for Image Reconstruction Quality Pramook Khungurn et.al. 2504.21368 null Kimi
449 2025-04-30 Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing Hong Zhang et.al. 2504.21356 link Kimi
450 2025-04-30 Phi-4-reasoning Technical Report Marah Abdin et.al. 2504.21318 null Kimi
451 2025-04-30 BiasGuard: A Reasoning-enhanced Bias Detection Tool For Large Language Models Zhiting Fan et.al. 2504.21299 null Kimi
452 2025-04-30 Reinforced MLLM: A Survey on RL-Based Reasoning in Multimodal Large Language Models Guanghao Zhou et.al. 2504.21277 null Kimi
453 2025-04-30 Talk Before You Retrieve: Agent-Led Discussions for Better RAG in Medical QA Xuanzhao Dong et.al. 2504.21252 link Kimi
454 2025-04-30 Memorization and Knowledge Injection in Gated LLMs Xu Pan et.al. 2504.21239 null Kimi
455 2025-04-30 Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Haoran Xu et.al. 2504.21233 null Kimi
456 2025-04-29 CachePrune: Neural-Based Attribution Defense Against Indirect Prompt Injection Attacks Rui Wang et.al. 2504.21228 null Kimi
457 2025-04-29 Automatic Legal Writing Evaluation of LLMs Ramon Pires et.al. 2504.21202 link Kimi
458 2025-04-29 Small or Large? Zero-Shot or Finetuned? Guiding Language Model Choice for Specialized Applications in Healthcare Lovedeep Gondara et.al. 2504.21191 null Kimi
459 2025-04-29 OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification Shangyu Li et.al. 2504.20964 link Kimi
460 2025-04-29 Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models Maryna Vyshnyvetska et.al. 2504.20951 null Kimi
461 2025-04-29 Trace-of-Thought: Enhanced Arithmetic Problem Solving via Reasoning Distillation From Large to Small Language Models Tyler McDonald et.al. 2504.20946 null Kimi
462 2025-04-29 ChestX-Reasoner: Advancing Radiology Foundation Models with Reasoning through Step-by-Step Verification Ziqing Fan et.al. 2504.20930 link Kimi
463 2025-04-29 DYNAMAX: Dynamic computing for Transformers and Mamba based architectures Miguel Nogales et.al. 2504.20922 null Kimi
464 2025-04-29 Using LLMs in Generating Design Rationale for Software Architecture Decisions Xiyu Zhou et.al. 2504.20781 null Kimi
465 2025-04-29 JTreeformer: Graph-Transformer via Latent-Diffusion Model for Molecular Generation Ji Shi et.al. 2504.20770 null Kimi
466 2025-04-29 Chain-of-Defensive-Thought: Structured Reasoning Elicits Robustness in Large Language Models against Reference Corruption Wenxiao Wang et.al. 2504.20769 null Kimi
467 2025-04-29 Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think Hasan Abed Al Kader Hammoud et.al. 2504.20708 null Kimi
468 2025-04-29 Cooking Up Creativity: A Cognitively-Inspired Approach for Enhancing LLM Creativity through Structured Representations Moran Mizrahi et.al. 2504.20643 link Kimi
469 2025-04-29 The Hidden Risks of LLM-Generated Web Application Code: A Security-Centric Evaluation of Code Generation Capabilities in Large Language Models Swaroop Dora et.al. 2504.20612 null Kimi
470 2025-04-29 Reinforcement Learning for Reasoning in Large Language Models with One Training Example Yiping Wang et.al. 2504.20571 link Kimi
471 2025-04-29 UniDetox: Universal Detoxification of Large Language Models via Dataset Distillation Huimin Lu et.al. 2504.20500 link Kimi
472 2025-04-29 Token-Efficient Prompt Injection Attack: Provoking Cessation in LLM Reasoning via Adaptive Token Compression Yu Cui et.al. 2504.20493 null Kimi
473 2025-04-29 A Summary on GUI Agents with Foundation Models Enhanced by Reinforcement Learning Jiahao Li et.al. 2504.20464 null Kimi
474 2025-04-29 Reviving Any-Subset Autoregressive Models with Principled Parallel Sampling and Speculative Decoding Gabe Guo et.al. 2504.20456 link Kimi
475 2025-04-29 GaLore 2: Large-Scale LLM Pre-Training by Gradient Low-Rank Projection DiJia Su et.al. 2504.20437 null Kimi
476 2025-04-29 FiLA-Video: Spatio-Temporal Compression for Fine-Grained Long Video Understanding Yanan Guo et.al. 2504.20384 null Kimi
477 2025-04-29 Local Prompt Optimization Yash Jain et.al. 2504.20355 null Kimi
478 2025-04-29 MicarVLMoE: A Modern Gated Cross-Aligned Vision-Language Mixture of Experts Model for Medical Image Captioning and Report Generation Amaan Izhar et.al. 2504.20343 link Kimi
479 2025-04-28 Enhancing Systematic Reviews with Large Language Models: Using GPT-4 and Kimi Dandan Chen Kaptur et.al. 2504.20276 null Kimi
480 2025-04-28 Can Large Language Models Learn Formal Logic? A Data-Driven Training and Evaluation Framework Yuan Xia et.al. 2504.20213 null Kimi
481 2025-04-28 Weaving Context Across Images: Improving Vision-Language Models through Focus-Centric Visual Chains Juntian Zhang et.al. 2504.20199 null Kimi
482 2025-04-28 MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools Nishant Subramani et.al. 2504.20168 null Kimi
483 2025-04-28 AutoJudge: Judge Decoding Without Manual Annotation Roman Garipov et.al. 2504.20039 null Kimi
484 2025-04-28 Towards Automated Scoping of AI for Social Good Projects Jacob Emmerson et.al. 2504.20010 null Kimi
485 2025-04-28 TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons Emre Can Acikgoz et.al. 2504.19982 null Kimi
486 2025-04-28 Accelerating Mixture-of-Experts Training with Adaptive Expert Replication Athinagoras Skiadopoulos et.al. 2504.19925 null Kimi
487 2025-04-28 Enhancing Surgical Documentation through Multimodal Visual-Temporal Transformers and Generative AI Hugo Georgenthum et.al. 2504.19918 null Kimi
488 2025-04-28 Can AI Agents Design and Implement Drug Discovery Pipelines? Khachik Smbatyan et.al. 2504.19912 null Kimi
489 2025-04-28 GenCLS++: Pushing the Boundaries of Generative Classification in LLMs Through Comprehensive SFT and RL Studies Across Diverse Datasets Mingqian He et.al. 2504.19898 null Kimi
490 2025-04-28 semi-PD: Towards Efficient LLM Serving via Phase-Wise Disaggregated Computation and Unified Storage Ke Hong et.al. 2504.19867 null Kimi
491 2025-04-28 Can a Crow Hatch a Falcon? Lineage Matters in Predicting Large Language Model Performance Takuya Tamura et.al. 2504.19811 null Kimi
492 2025-04-28 Moral Reasoning Across Languages: The Critical Role of Low-Resource Languages in LLMs Huichi Zhou et.al. 2504.19759 null Kimi
493 2025-04-28 Reconstructing Context: Evaluating Advanced Chunking Strategies for Retrieval-Augmented Generation Carlo Merola et.al. 2504.19754 link Kimi
494 2025-04-28 LLM-Assisted Automated Deductive Coding of Dialogue Data: Leveraging Dialogue-Specific Characteristics to Enhance Contextual Understanding Ying Na et.al. 2504.19734 null Kimi
495 2025-04-28 Taming the Titans: A Survey of Efficient LLM Inference Serving Ranran Zhen et.al. 2504.19720 link Kimi
496 2025-04-28 From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review Mohamed Amine Ferrag et.al. 2504.19678 null Kimi
497 2025-04-28 Annif at SemEval-2025 Task 5: Traditional XMTC augmented by LLMs Osma Suominen et.al. 2504.19675 link Kimi
498 2025-04-28 VCM: Vision Concept Modeling Based on Implicit Contrastive Learning with Vision-Language Instruction Fine-Tuning Run Luo et.al. 2504.19627 null Kimi
499 2025-04-28 m-KAILIN: Knowledge-Driven Agentic Scientific Corpus Distillation Framework for Biomedical Large Language Models Training Meng Xiao et.al. 2504.19565 null Kimi
500 2025-04-28 DEEMO: De-identity Multimodal Emotion Recognition and Reasoning Deng Li et.al. 2504.19549 null Kimi
501 2025-04-28 Bullet: Boosting GPU Utilization for LLM Serving via Dynamic Spatial-Temporal Orchestration Zejia Lin et.al. 2504.19516 null Kimi
502 2025-04-28 Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding Yan Wang et.al. 2504.19500 null Kimi
503 2025-04-28 Improving Reasoning Performance in Large Language Models via Representation Engineering Bertram Højer et.al. 2504.19483 null Kimi
504 2025-04-28 BRIDGE: Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text Jiageng Wu et.al. 2504.19467 link Kimi
505 2025-04-28 Towards Long Context Hallucination Detection Siyi Liu et.al. 2504.19457 null Kimi
506 2025-04-28 Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory Prateek Chhikara et.al. 2504.19413 null Kimi
507 2025-04-28 ICL CIPHERS: Quantifying “Learning” in In-Context Learning via Substitution Ciphers Zhouxiang Fang et.al. 2504.19395 null Kimi
508 2025-04-27 LLMs for Engineering: Teaching Models to Design High Powered Rockets Toby Simonds et.al. 2504.19394 null Kimi
509 2025-04-27 Unified Multi-Task Learning & Model Fusion for Efficient Language Model Guardrailing James O’Neill et.al. 2504.19333 null Kimi
510 2025-04-27 Platonic Grounding for Efficient Multimodal Language Models Moulik Choraria et.al. 2504.19327 null Kimi
511 2025-04-27 BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese Peilin Zhou et.al. 2504.19314 link Kimi
512 2025-04-27 AndroidGen: Building an Android Language Agent under Data Scarcity Hanyu Lai et.al. 2504.19298 null Kimi
513 2025-04-24 Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models Xu Ma et.al. 2504.17789 null Kimi
514 2025-04-24 The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs Piotr Nawrot et.al. 2504.17768 null Kimi
515 2025-04-24 Step1X-Edit: A Practical Framework for General Image Editing Shiyu Liu et.al. 2504.17761 link Kimi
516 2025-04-24 Conversational Assistants to support Heart Failure Patients: comparing a Neurosymbolic Architecture with ChatGPT Anuja Tayal et.al. 2504.17753 null Kimi
517 2025-04-24 CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos Shucheng Gong et.al. 2504.17728 link Kimi
518 2025-04-24 Multilingual Performance Biases of Large Language Models in Education Vansh Gupta et.al. 2504.17720 null Kimi
519 2025-04-24 Early Detection of Multidrug Resistance Using Multivariate Time Series Analysis and Interpretable Patient-Similarity Representations Óscar Escudero-Arnanz et.al. 2504.17717 null Kimi
520 2025-04-24 Generative Fields: Uncovering Hierarchical Feature Control for StyleGAN via Inverted Receptive Fields Zhuo He et.al. 2504.17712 null Kimi
521 2025-04-24 Plasma State Monitoring and Disruption Characterization using Multimodal VAEs Yoeri Poels et.al. 2504.17710 null Kimi
522 2025-04-24 Safety in Large Reasoning Models: A Survey Cheng Wang et.al. 2504.17704 null Kimi
523 2025-04-24 Federated Learning: A Survey on Privacy-Preserving Collaborative Intelligence Edward Collins et.al. 2504.17703 null Kimi
524 2025-04-24 Hierarchical and Multimodal Data for Daily Activity Understanding Ghazal Kaviani et.al. 2504.17696 link Kimi
525 2025-04-24 BIM-Constrained Optimization for Accurate Localization and Deviation Correction in Construction Monitoring Asier Bikandi et.al. 2504.17693 null Kimi
526 2025-04-24 Ensemble Bayesian Inference: Leveraging Small Language Models to Achieve LLM-level Accuracy in Profile Matching Tasks Haru-Tada Sato et.al. 2504.17685 null Kimi
527 2025-04-24 INSIGHT: Bridging the Student-Teacher Gap in Times of Large Language Models Jarne Thys et.al. 2504.17677 null Kimi
528 2025-04-24 Energy Considerations of Large Language Model Inference and Efficiency Optimizations Jared Fernandez et.al. 2504.17674 null Kimi
529 2025-04-24 Cross-region Model Training with Communication-Computation Overlapping and Delay Compensation Ying Zhu et.al. 2504.17672 null Kimi
530 2025-04-24 Data-Driven Calibration of Prediction Sets in Large Vision-Language Models Based on Inductive Conformal Prediction Yuanchang Ye et.al. 2504.17671 null Kimi
531 2025-04-24 DiMeR: Disentangled Mesh Reconstruction Model Lutao Jiang et.al. 2504.17670 null Kimi
532 2025-04-24 Towards a HIPAA Compliant Agentic AI System in Healthcare Subash Neupane et.al. 2504.17669 null Kimi
533 2025-04-24 Evaluating Grounded Reasoning by Code-Assisted Large Language Models for Mathematics Zena Al-Khalili et.al. 2504.17665 null Kimi
534 2025-04-24 Aerial Image Classification in Scarce and Unconstrained Environments via Conformal Prediction Farhad Pourkamali-Anaraki et.al. 2504.17655 null Kimi
535 2025-04-24 DeepDistill: Enhancing LLM Reasoning Capabilities via Large-Scale Difficulty-Graded Data Training Xiaoyu Tian et.al. 2504.17565 null Kimi
536 2025-04-24 HalluLens: LLM Hallucination Benchmark Yejin Bang et.al. 2504.17550 null Kimi
537 2025-04-24 A Comprehensive Survey of Knowledge-Based Vision Question Answering Systems: The Lifecycle of Knowledge in Visual Reasoning Task Jiaqi Deng et.al. 2504.17547 null Kimi
538 2025-04-24 Auditing the Ethical Logic of Generative AI Models W. Russell Neuman et.al. 2504.17544 null Kimi
539 2025-04-24 Unified Attacks to Large Language Model Watermarks: Spoofing and Scrubbing in Unauthorized Knowledge Distillation Xin Yi et.al. 2504.17480 null Kimi
540 2025-04-24 FRAG: Frame Selection Augmented Generation for Long Video and Long Document Understanding De-An Huang et.al. 2504.17447 link Kimi
541 2025-04-24 Assessing the Capability of Large Language Models for Domain-Specific Ontology Generation Anna Sofia Lippolis et.al. 2504.17402 null Kimi
542 2025-04-24 LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live Streams Yongxuan Wu et.al. 2504.17366 link Kimi
543 2025-04-24 TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation Ling You et.al. 2504.17365 null Kimi
544 2025-04-24 FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation Yulia Otmakhova et.al. 2504.17311 null Kimi
545 2025-04-24 JurisCTC: Enhancing Legal Judgment Prediction via Cross-Domain Transfer and Contrastive Learning Zhaolu Kang et.al. 2504.17264 null Kimi
546 2025-04-24 MCAF: Efficient Agent-based Video Understanding Framework through Multimodal Coarse-to-Fine Attention Focusing Shiwen Cao et.al. 2504.17213 null Kimi
547 2025-04-24 A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation Yangxinyu Xie et.al. 2504.17200 null Kimi
548 2025-04-24 Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning Minju Seo et.al. 2504.17192 link Kimi
549 2025-04-23 MIRAGE: A Metric-Intensive Benchmark for Retrieval-Augmented Generation Evaluation Chanhee Park et.al. 2504.17137 null Kimi
550 2025-04-23 Steering the CensorShip: Uncovering Representation Vectors for LLM “Thought” Control Hannah Cyberey et.al. 2504.17130 link Kimi
551 2025-04-23 The Rise of Small Language Models in Healthcare: A Comprehensive Survey Muskan Garg et.al. 2504.17119 null Kimi
552 2025-04-23 Leveraging LLMs as Meta-Judges: A Multi-Agent Framework for Evaluating LLM Judgments Yuran Li et.al. 2504.17087 null Kimi
553 2025-04-23 DyMU: Dynamic Merging and Virtual Unmerging for Efficient VLMs Zhenhailong Wang et.al. 2504.17040 null Kimi
554 2025-04-23 (Im)possibility of Automated Hallucination Detection in Large Language Models Amin Karbasi et.al. 2504.17004 null Kimi
555 2025-04-23 Tracing Thought: Using Chain-of-Thought Reasoning to Identify the LLM Behind AI-Generated Text Shifali Agrahari et.al. 2504.16913 null Kimi
556 2025-04-23 Do Large Language Models know who did what to whom? Joseph M. Denning et.al. 2504.16884 null Kimi
557 2025-04-23 Monte Carlo Planning with Large Language Model for Text-Based Game Agents Zijing Shi et.al. 2504.16855 null Kimi
558 2025-04-23 GreenMind: A Next-Generation Vietnamese Large Language Model for Structured and Logical Reasoning Luu Quy Tung et.al. 2504.16832 null Kimi
559 2025-04-23 Process Reward Models That Think Muhammad Khalifa et.al. 2504.16828 link Kimi
560 2025-04-23 Random Long-Context Access for Mamba via Hardware-aligned Hierarchical Sparse Attention Xiang Hu et.al. 2504.16795 null Kimi
561 2025-04-23 Towards Explainable AI: Multi-Modal Transformer for Video-based Image Description Generation Lakshita Agarwal et.al. 2504.16788 null Kimi
562 2025-04-23 MOOSComp: Improving Lightweight Long-Context Compressor via Mitigating Over-Smoothing and Incorporating Outlier Scores Fengwei Zhou et.al. 2504.16786 null Kimi
563 2025-04-23 How Effective are Generative Large Language Models in Performing Requirements Classification? Waad Alhoshan et.al. 2504.16768 null Kimi
564 2025-04-23 Lightweight Latent Verifiers for Efficient Meta-Generation Strategies Bartosz Piotrowski et.al. 2504.16760 null Kimi
565 2025-04-23 HEMA: A Hippocampus-Inspired Extended Memory Architecture for Long-Context AI Conversations Kwangseob Ahn et.al. 2504.16754 null Kimi
566 2025-04-23 IRIS: Interactive Research Ideation System for Accelerating Scientific Discovery Aniketh Garikaparthi et.al. 2504.16728 null Kimi
567 2025-04-23 Debunking with Dialogue? Exploring AI-Generated Counterspeech to Challenge Conspiracy Theories Mareike Lisker et.al. 2504.16604 null Kimi
568 2025-04-23 Comparing Large Language Models and Traditional Machine Translation Tools for Translating Medical Consultation Summaries: A Pilot Study Andy Li et.al. 2504.16601 null Kimi
569 2025-04-23 PIS: Linking Importance Sampling and Attention Mechanisms for Efficient Prompt Compression Lizhe Chen et.al. 2504.16574 null Kimi
570 2025-04-23 Amplified Vulnerabilities: Structured Jailbreak Attacks on LLM-based Multi-Agent Debate Senmao Qi et.al. 2504.16489 null Kimi
571 2025-04-23 Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark Hanlei Zhang et.al. 2504.16427 link Kimi
572 2025-04-23 Evaluating Multi-Hop Reasoning in Large Language Models: A Chemistry-Centric Case Study Mohammad Khodadad et.al. 2504.16414 null Kimi
573 2025-04-23 ConTextual: Improving Clinical Text Summarization in LLMs with Context-preserving Token Filtering and Knowledge Graphs Fahmida Liza Piya et.al. 2504.16394 link Kimi
574 2025-04-23 SplitReason: Learning To Offload Reasoning Yash Akhauri et.al. 2504.16379 null Kimi
575 2025-04-23 Text-to-TrajVis: Enabling Trajectory Data Visualizations from Natural Language Questions Tian Bai et.al. 2504.16358 null Kimi
576 2025-04-23 DP2FL: Dual Prompt Personalized Federated Learning in Foundation Models Ying Chang et.al. 2504.16357 null Kimi
577 2025-04-22 The Paradox of Poetic Intent in Back-Translation: Evaluating the Quality of Large Language Models in Chinese Translation Li Weigang et.al. 2504.16286 null Kimi
578 2025-04-22 FinNLI: Novel Dataset for Multi-Genre Financial Natural Language Inference Benchmarking Jabez Magomere et.al. 2504.16188 null Kimi
579 2025-04-22 MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention Yucheng Li et.al. 2504.16083 null Kimi
580 2025-04-22 MR. Video: “MapReduce” is the Principle for Long Video Understanding Ziqi Pang et.al. 2504.16082 null Kimi
581 2025-04-22 LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities Thomas Schmied et.al. 2504.16078 null Kimi
582 2025-04-22 LongMamba: Enhancing Mamba’s Long Context Capabilities via Training-Free Receptive Field Enlargement Zhifan Ye et.al. 2504.16053 link Kimi
583 2025-04-22 Benchmarking LLM for Code Smells Detection: OpenAI GPT-4.0 vs DeepSeek-V3 Ahmed R. Sadik et.al. 2504.16027 null Kimi
584 2025-04-23 CAPO: Cost-Aware Prompt Optimization Tom Zehle et.al. 2504.16005 link Kimi
585 2025-04-22 FairTranslate: An English-French Dataset for Gender Bias Evaluation in Machine Translation by Overcoming Gender Binarity Fanny Jourdan et.al. 2504.15941 link Kimi
586 2025-04-22 Impact of Noise on LLM-Models Performance in Abstraction and Reasoning Corpus (ARC) Tasks with Model Temperature Considerations Nikhil Khandalkar et.al. 2504.15903 null Kimi
587 2025-04-22 SARI: Structured Audio Reasoning via Curriculum-Guided Reinforcement Learning Cheng Wen et.al. 2504.15900 null Kimi
588 2025-04-22 Dynamic Early Exit in Reasoning Models Chenxu Yang et.al. 2504.15895 null Kimi
589 2025-04-22 What’s the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token Patterns Michael A. Hedderich et.al. 2504.15815 null Kimi
590 2025-04-22 A closer look at how large language models trust humans: patterns and biases Valeria Lerman et.al. 2504.15801 null Kimi
591 2025-04-22 Automated Creativity Evaluation for Large Language Models: A Reference-Based Approach Ruizhe Li et.al. 2504.15784 null Kimi
592 2025-04-22 TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving Daocheng Fu et.al. 2504.15780 null Kimi
593 2025-04-22 DianJin-R1: Evaluating and Enhancing Financial Reasoning in Large Language Models Jie Zhu et.al. 2504.15716 null Kimi
594 2025-04-22 Cost-Effective Text Clustering with Large Language Models Hongtao Wang et.al. 2504.15640 null Kimi
595 2025-04-22 DR.FIX: Automatically Fixing Data Races at Industry Scale Farnaz Behrang et.al. 2504.15637 null Kimi
596 2025-04-22 Exploiting Contextual Knowledge in LLMs through V-usable Information based Layer Enhancement Xiaowei Yuan et.al. 2504.15630 null Kimi
597 2025-04-22 A Multi-Agent Framework for Automated Qinqiang Opera Script Generation Using Large Language Models Gengxian Cao et.al. 2504.15552 null Kimi
598 2025-04-22 llm-jp-modernbert: A ModernBERT Model Trained on a Large-Scale Japanese Corpus with Long Context Length Issa Sugiura et.al. 2504.15544 null Kimi
599 2025-04-22 Compass-V2 Technical Report Sophia Maria et.al. 2504.15527 null Kimi
600 2025-04-21 CAPTURe: Evaluating Spatial Reasoning in Vision Language Models via Occluded Object Counting Atin Pothiraj et.al. 2504.15485 null Kimi
601 2025-04-21 Speculative Sampling via Exponential Races Szymon Kobus et.al. 2504.15475 null Kimi
602 2025-04-21 Trillion 7B Technical Report Sungjun Han et.al. 2504.15431 null Kimi
603 2025-04-21 LLM-Assisted Translation of Legacy FORTRAN Codes to C++: A Cross-Platform Study Nishath Rajiv Ranasinghe et.al. 2504.15424 null Kimi
604 2025-04-21 IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs David Ma et.al. 2504.15415 link Kimi
605 2025-04-21 Tell Me What You Know About Sexism: Expert-LLM Interaction Strategies and Co-Created Definitions for Zero-Shot Sexism Detection Myrthe Reuver et.al. 2504.15392 null Kimi
606 2025-04-21 Towards Understanding Camera Motions in Any Video Zhiqiu Lin et.al. 2504.15376 null Kimi
607 2025-04-21 KeDiff: Key Similarity-Based KV Cache Eviction for Long-Context LLM Inference in Resource-Constrained Environments Junyoung Park et.al. 2504.15364 null Kimi
608 2025-04-21 Exploring Compositional Generalization (in ReCOGS_pos) by Transformers using Restricted Access Sequence Processing (RASP) William Bruns et.al. 2504.15349 null Kimi
609 2025-04-21 Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs Chun-Hsiao Yeh et.al. 2504.15280 link Kimi
610 2025-04-21 FlowReasoner: Reinforcing Query-Level Meta-Agents Hongcheng Gao et.al. 2504.15257 link Kimi
611 2025-04-21 Support Evaluation for the TREC 2024 RAG Track: Comparing Human versus LLM Judges Nandan Thakur et.al. 2504.15205 null Kimi
612 2025-04-21 The Synthetic Imputation Approach: Generating Optimal Synthetic Texts For Underrepresented Categories In Supervised Classification Tasks Joan C. Timoneda et.al. 2504.15160 null Kimi
613 2025-04-21 EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models Ziwen Xu et.al. 2504.15133 link Kimi
614 2025-04-21 Kuwain 1.5B: An Arabic SLM via Language Injection Khalil Hennara et.al. 2504.15120 null Kimi
615 2025-04-21 A triple-branch network for latent fingerprint enhancement guided by orientation fields and minutiae Yurun Wang et.al. 2504.15105 null Kimi
616 2025-04-21 Rethinking the Potential of Multimodality in Collaborative Problem Solving Diagnosis with Large Language Models K. Wong et.al. 2504.15093 null Kimi
617 2025-04-21 DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation Weijie He et.al. 2504.15032 null Kimi
618 2025-04-21 Efficient Pretraining Length Scaling Bohong Wu et.al. 2504.14992 null Kimi
619 2025-04-21 Speaker Fuzzy Fingerprints: Benchmarking Text-Based Identification in Multiparty Dialogues Rui Ribeiro et.al. 2504.14963 null Kimi
620 2025-04-21 MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core Dennis Liu et.al. 2504.14960 null Kimi
621 2025-04-21 EducationQ: Evaluating LLMs’ Teaching Capabilities Through Multi-Agent Dialogue Framework Yao Shi et.al. 2504.14928 null Kimi
622 2025-04-21 CRAVE: A Conflicting Reasoning Approach for Explainable Claim Verification Using LLMs Yingming Zheng et.al. 2504.14905 link Kimi
623 2025-04-21 Latent Bayesian Optimization via Autoregressive Normalizing Flows Seunghun Lee et.al. 2504.14889 null Kimi
624 2025-04-21 Natural Fingerprints of Large Language Models Teppei Suzuki et.al. 2504.14871 null Kimi
625 2025-04-21 OTC: Optimal Tool Calls via Reinforcement Learning Hongru Wang et.al. 2504.14870 null Kimi
626 2025-04-21 ECViT: Efficient Convolutional Vision Transformer with Local-Attention and Multi-scale Stages Zhoujie Qian et.al. 2504.14825 null Kimi
627 2025-04-21 On Self-improving Token Embeddings Mario M. Kubek et.al. 2504.14808 null Kimi
628 2025-04-21 Automatic Evaluation Metrics for Document-level Translation: Overview, Challenges and Trends Jiaxin GUO et.al. 2504.14804 null Kimi
629 2025-04-21 gLLM: Global Balanced Pipeline Parallelism System for Distributed LLM Serving with Token Throttling Tianyu Guo et.al. 2504.14775 link Kimi
630 2025-04-21 PLANET: A Collection of Benchmarks for Evaluating LLMs’ Planning Capabilities Haoming Li et.al. 2504.14773 null Kimi
631 2025-04-20 Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions Luyang Fang et.al. 2504.14772 null Kimi
632 2025-04-20 SWE-Synth: Synthesizing Verifiable Bug-Fix Data to Enable Large Language Models in Resolving Real-World Bugs Minh V. T. Pham et.al. 2504.14757 null Kimi
633 2025-04-20 PROMPTEVALS: A Dataset of Assertions and Guardrails for Custom Production Large Language Model Pipelines Reya Vir et.al. 2504.14738 null Kimi
634 2025-04-20 AI with Emotions: Exploring Emotional Expressions in Large Language Models Shin-nosuke Ishikawa et.al. 2504.14706 null Kimi
635 2025-04-20 Video-MMLU: A Massive Multi-Discipline Lecture Understanding Benchmark Enxin Song et.al. 2504.14693 link Kimi
636 2025-04-20 FarsEval-PKBETS: A new diverse benchmark for evaluating Persian large language models Mehrnoush Shamsfard et.al. 2504.14690 null Kimi
637 2025-04-20 Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens Kaihang Pan et.al. 2504.14666 null Kimi
638 2025-04-20 A Case Study Exploring the Current Landscape of Synthetic Medical Record Generation with Commercial LLMs Yihan Lin et.al. 2504.14657 null Kimi
639 2025-04-17 PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding Jang Hyun Cho et.al. 2504.13180 link Kimi
640 2025-04-17 Single-Shot Shape and Reflectance with Spatial Polarization Multiplexing Tomoki Ichikawa et.al. 2504.13177 null Kimi
641 2025-04-17 It’s All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization Ali Behrouz et.al. 2504.13173 null Kimi
642 2025-04-17 Sleep-time Compute: Beyond Inference Scaling at Test-time Kevin Lin et.al. 2504.13171 link Kimi
643 2025-04-17 Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling Tsung-Han Wu et.al. 2504.13169 link Kimi
644 2025-04-17 CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training Shizhe Diao et.al. 2504.13161 null Kimi
645 2025-04-17 MIB: A Mechanistic Interpretability Benchmark Aaron Mueller et.al. 2504.13151 link Kimi
646 2025-04-17 Readable Twins of Unreadable Models Krzysztof Pancerz et.al. 2504.13150 link Kimi
647 2025-04-17 Antidistillation Sampling Yash Savani et.al. 2504.13146 null Kimi
648 2025-04-17 Exploring Expert Failures Improves LLM Agent Tuning Li-Cheng Lan et.al. 2504.13145 null Kimi
649 2025-04-17 $\texttt{Complex-Edit}$ : CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark Siwei Yang et.al. 2504.13143 null Kimi
650 2025-04-17 Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo João Loula et.al. 2504.13139 null Kimi
651 2025-04-17 Energy-Based Reward Models for Robust Language Model Alignment Anamika Lochab et.al. 2504.13134 link Kimi
652 2025-04-17 Science-T2I: Addressing Scientific Illusions in Image Synthesis Jialuo Li et.al. 2504.13129 null Kimi
653 2025-04-17 LLMs Meet Finance: Fine-Tuning Foundation Models for the Open FinLLM Leaderboard Varun Rao et.al. 2504.13125 null Kimi
654 2025-04-17 Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training Xinsong Zhang et.al. 2504.13123 null Kimi
655 2025-04-17 VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models Haojian Huang et.al. 2504.13122 link Kimi
656 2025-04-17 Probing and Inducing Combinational Creativity in Vision-Language Models Yongqian Peng et.al. 2504.13120 null Kimi
657 2025-04-17 EventVAD: Training-Free Event-Aware Video Anomaly Detection Yihua Shao et.al. 2504.13092 null Kimi
658 2025-04-17 Retrieval-Augmented Generation with Conflicting Evidence Han Wang et.al. 2504.13079 link Kimi
659 2025-04-17 Enhancing Person-to-Person Virtual Try-On with Multi-Garment Virtual Try-Off Riza Velioglu et.al. 2504.13078 link Kimi
660 2025-04-17 SkyReels-V2: Infinite-length Film Generative Model Guibin Chen et.al. 2504.13074 link Kimi
661 2025-04-17 Accuracy is Not Agreement: Expert-Aligned Evaluation of Crash Narrative Classification Models Sudesh Ramesh Bhagat et.al. 2504.13068 null Kimi
662 2025-04-17 RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins Yao Mu et.al. 2504.13059 null Kimi
663 2025-04-17 Aspect-Based Summarization with Self-Aspect Retrieval Enhanced Generation Yichao Feng et.al. 2504.13054 null Kimi
664 2025-04-17 How Large Language Models Are Changing MOOC Essay Answers: A Comparison of Pre- and Post-LLM Responses Leo Leppänen et.al. 2504.13038 null Kimi
665 2025-04-17 Towards Cardiac MRI Foundation Models: Comprehensive Visual-Tabular Representations for Whole-Heart Assessment and Beyond Yundi Zhang et.al. 2504.13037 null Kimi
666 2025-04-17 InstructRAG: Leveraging Retrieval-Augmented Generation on Instruction Graphs for LLM-Based Task Planning Zheng Wang et.al. 2504.13032 null Kimi
667 2025-04-17 ChatEXAONEPath: An Expert-level Multimodal Large Language Model for Histopathology Using Whole Slide Images Sangwook Kim et.al. 2504.13023 null Kimi
668 2025-04-17 Pose and Facial Expression Transfer by using StyleGAN Petr Jahoda et.al. 2504.13021 null Kimi
669 2025-04-17 SHA256 at SemEval-2025 Task 4: Selective Amnesia – Constrained Unlearning for Large Language Models via Knowledge Isolation Saransh Agrawal et.al. 2504.12996 link Kimi
670 2025-04-17 Are Retrials All You Need? Enhancing Large Language Model Reasoning Without Verbalized Feedback Nearchos Potamitis et.al. 2504.12951 null Kimi
671 2025-04-17 Information Gain-Guided Causal Intervention for Autonomous Debiasing Large Language Models Zhouhao Sun et.al. 2504.12898 null Kimi
672 2025-04-17 EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting Guanrou Yang et.al. 2504.12867 null Kimi
673 2025-04-17 Can LLMs reason over extended multilingual contexts? Towards long-context evaluation beyond retrieval and haystacks Amey Hengle et.al. 2504.12845 link Kimi
674 2025-04-17 Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration Yicheng Pan et.al. 2504.12773 link Kimi
675 2025-04-17 Pandora: A Code-Driven Large Language Model Agent for Unified Reasoning Across Diverse Structured Knowledge Yongrui Chen et.al. 2504.12734 null Kimi
676 2025-04-17 Why and How LLMs Hallucinate: Connecting the Dots with Subsequence Associations Yiyou Sun et.al. 2504.12691 link Kimi
677 2025-04-17 Data-efficient LLM Fine-tuning for Code Generation Weijie Lv et.al. 2504.12687 link Kimi
678 2025-04-17 Scaling Instruction-Tuned LLMs to Million-Token Contexts via Hierarchical Synthetic Data Generation Linda He et.al. 2504.12637 null Kimi
679 2025-04-17 Identifying and Mitigating the Influence of the Prior Distribution in Large Language Models Liyi Zhang et.al. 2504.12585 link Kimi
680 2025-04-17 MetaSynth: Meta-Prompting-Driven Agentic Scaffolds for Diverse Synthetic Data Generation Haris Riaz et.al. 2504.12563 null Kimi
681 2025-04-17 ZeroSumEval: Scaling LLM Evaluation with Inter-Model Competition Haidar Khan et.al. 2504.12562 link Kimi
682 2025-04-17 Memorization: A Close Look at Books Iris Ma et.al. 2504.12549 null Kimi
683 2025-04-16 MOM: Memory-Efficient Offloaded Mini-Sequence Inference for Long Context Language Models Junyang Zhang et.al. 2504.12526 null Kimi
684 2025-04-16 Memorization vs. Reasoning: Updating LLMs with New Knowledge Aochong Oliver Li et.al. 2504.12523 null Kimi
685 2025-04-16 Towards Conversational AI for Human-Machine Collaborative MLOps George Fatouros et.al. 2504.12477 null Kimi
686 2025-04-16 Integrating Structural and Semantic Signals in Text-Attributed Graphs with BiGTex Azadeh Beiranvand et.al. 2504.12474 link Kimi
687 2025-04-16 Dense Backpropagation Improves Training for Sparse Mixture-of-Experts Ashwinee Panda et.al. 2504.12463 link Kimi
688 2025-04-16 Activated LoRA: Fine-tuned LLMs for Intrinsics Kristjan Greenewald et.al. 2504.12397 null Kimi
689 2025-04-16 BitNet b1.58 2B4T Technical Report Shuming Ma et.al. 2504.12285 null Kimi
690 2025-04-16 How Do I Do That? Synthesizing 3D Hand Motion and Contacts for Everyday Interactions Aditya Prakash et.al. 2504.12284 null Kimi
691 2025-04-16 FLIP Reasoning Challenge Andreas Plesner et.al. 2504.12256 link Kimi
692 2025-04-16 What Do Large Language Models Know? Tacit Knowledge as a Potential Causal-Explanatory Structure Céline Budding et.al. 2504.12187 null Kimi
693 2025-04-16 SALAD: Improving Robustness and Generalization through Contrastive Learning with Structure-Aware and LLM-Driven Augmented Data Suyoung Bae et.al. 2504.12185 null Kimi
694 2025-04-16 Efficient Contrastive Decoding with Probabilistic Hallucination Detection - Mitigating Hallucinations in Large Vision Language Models - Laura Fieback et.al. 2504.12137 null Kimi
695 2025-04-16 Reasoning-Based AI for Startup Evaluation (R.A.I.S.E.): A Memory-Augmented, Multi-Step Decision Framework Jack Preuveneers et.al. 2504.12090 null Kimi
696 2025-04-16 Purposefully Induced Psychosis (PIP): Embracing Hallucination as Imagination in Large Language Models Kris Pilcher et.al. 2504.12012 null Kimi
697 2025-04-16 Generative Recommendation with Continuous-Token Diffusion Haohao Qu et.al. 2504.12007 null Kimi
698 2025-04-16 Language Models as Quasi-Crystalline Thought: Structure, Constraint, and Emergence in Generative Systems Jose Manuel Guevara-Vela et.al. 2504.11986 null Kimi
699 2025-04-16 ADAT: Time-Series-Aware Adaptive Transformer Architecture for Sign Language Translation Nada Shahin et.al. 2504.11942 null Kimi
700 2025-04-16 Rethinking the Generation of High-Quality CoT Data from the Perspective of LLM-Adaptive Question Difficulty Grading Qianjin Yu et.al. 2504.11919 null Kimi
701 2025-04-16 Evaluating the Goal-Directedness of Large Language Models Tom Everitt et.al. 2504.11844 link Kimi
702 2025-04-16 FiSMiness: A Finite State Machine Based Paradigm for Emotional Support Conversations Yue Zhao et.al. 2504.11837 null Kimi
703 2025-04-16 Déjà Vu: Multilingual LLM Evaluation through the Lens of Machine Translation Evaluation Julia Kreutzer et.al. 2504.11829 null Kimi
704 2025-04-16 Cost-Efficient LLM Serving in the Cloud: VM Selection with KV Cache Offloading Kihyun Kim et.al. 2504.11816 link Kimi
705 2025-04-16 Selective Attention Federated Learning: Improving Privacy and Efficiency for Clinical Text Classification Yue Li et.al. 2504.11793 null Kimi
706 2025-04-16 Enhancing Web Agents with Explicit Rollback Mechanisms Zhisong Zhang et.al. 2504.11788 null Kimi
707 2025-04-16 Shared Disk KV Cache Management for Efficient Multi-Instance Inference in RAG-Powered LLMs Hyungwoo Lee et.al. 2504.11765 null Kimi
708 2025-04-16 Characterizing and Optimizing LLM Inference Workloads on CPU-GPU Coupled Architectures Prabhu Vellaisamy et.al. 2504.11750 null Kimi
709 2025-04-16 Can GPT tell us why these images are synthesized? Empowering Multimodal Large Language Models for Forensics Yiran He et.al. 2504.11686 null Kimi
710 2025-04-16 Steering Prosocial AI Agents: Computational Basis of LLM’s Decision Making in Social Simulation Ji Ma et.al. 2504.11671 null Kimi
711 2025-04-15 GraphicBench: A Planning Benchmark for Graphic Design with Language Agents Dayeon Ki et.al. 2504.11571 null Kimi
712 2025-04-15 ReTool: Reinforcement Learning for Strategic Tool Use in LLMs Jiazhan Feng et.al. 2504.11536 link Kimi
713 2025-04-15 HypoBench: Towards Systematic and Principled Benchmarking for Hypothesis Generation Haokun Liu et.al. 2504.11524 null Kimi
714 2025-04-15 TextArena Leon Guertler et.al. 2504.11442 link Kimi
715 2025-04-15 A Dual-Space Framework for General Knowledge Distillation of Large Language Models Xue Zhang et.al. 2504.11426 null Kimi
716 2025-04-15 A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce Wei Xiong et.al. 2504.11343 link Kimi
717 2025-04-15 Transformer-Based Model for Cold Start Mitigation in FaaS Architecture Alexandre Savi Fayam Mbala Mouen et.al. 2504.11338 null Kimi
718 2025-04-15 Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints Ruicheng Ao et.al. 2504.11320 link Kimi
719 2025-04-15 Nondeterministic Polynomial-time Problem Challenge: An Ever-Scaling Reasoning Benchmark for LLMs Chang Yang et.al. 2504.11239 link Kimi
720 2025-04-15 Video Summarization with Large Language Models Min Jung Lee et.al. 2504.11199 null Kimi
721 2025-04-15 Benchmarking Next-Generation Reasoning-Focused Large Language Models in Ophthalmology: A Head-to-Head Evaluation on 5,888 Items Minjie Zou et.al. 2504.11186 null Kimi
722 2025-04-15 DeepMLF: Multimodal language model with learnable tokens for deep fusion in sentiment analysis Efthymios Georgiou et.al. 2504.11082 null Kimi
723 2025-04-15 Dynamic Compressing Prompts for Efficient Inference of Large Language Models Jinwu Hu et.al. 2504.11004 null Kimi
724 2025-04-15 Efficient Reasoning Models: A Survey Sicheng Feng et.al. 2504.10903 link Kimi
725 2025-04-15 ARise: Towards Knowledge-Augmented Reasoning via Risk-Adaptive Search Yize Zhang et.al. 2504.10893 null Kimi
726 2025-04-15 Large Language Model-Informed Feature Discovery Improves Prediction and Interpretation of Credibility Perceptions of Visual Content Yilang Peng et.al. 2504.10878 null Kimi
727 2025-04-15 Moving Beyond Next-Token Prediction: Transformers are Context-Sensitive Language Generators Phill Kyu Rhee et.al. 2504.10845 null Kimi
728 2025-04-15 LayoutCoT: Unleashing the Deep Reasoning Potential of Large Language Models for Layout Generation Hengyu Shi et.al. 2504.10829 null Kimi
729 2025-04-15 CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives Ayoung Lee et.al. 2504.10823 null Kimi
730 2025-04-14 How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients Ming Li et.al. 2504.10766 link Kimi
731 2025-04-14 ReasonDrive: Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models Amirhosein Chahe et.al. 2504.10757 link Kimi
732 2025-04-14 CleanMAP: Distilling Multimodal LLMs for Confidence-Driven Crowdsourced HD Map Updates Ankit Kumar Shaw et.al. 2504.10738 null Kimi
733 2025-04-14 HELIOS: Adaptive Model And Early-Exit Selection for Efficient LLM Inference Serving Avinash Kumar et.al. 2504.10724 null Kimi
734 2025-04-14 Weight-of-Thought Reasoning: Exploring Neural Network Weights for Enhanced LLM Reasoning Saif Punjwani et.al. 2504.10646 link Kimi
735 2025-04-14 Beyond Chains of Thought: Benchmarking Latent-Space Reasoning Abilities in Large Language Models Thilo Hagendorff et.al. 2504.10615 null Kimi
736 2025-04-15 GUI-R1: A Generalist R1-Style Vision-Language Action Model For GUI Agents Xiaobo Xia et.al. 2504.10458 null Kimi
737 2025-04-14 RealWebAssist: A Benchmark for Long-Horizon Web Assistance with Real-World Users Suyu Ye et.al. 2504.10445 link Kimi
738 2025-04-14 Multimodal Long Video Modeling Based on Temporal Dynamic Context Haoran Hao et.al. 2504.10443 link Kimi
739 2025-04-14 LLM Can be a Dangerous Persuader: Empirical Study of Persuasion Safety in Large Language Models Minqian Liu et.al. 2504.10430 null Kimi
740 2025-04-14 LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models Parshin Shojaee et.al. 2504.10415 link Kimi
741 2025-04-14 Performance of Large Language Models in Supporting Medical Diagnosis and Treatment Diogo Sousa et.al. 2504.10405 null Kimi
742 2025-04-14 Forecasting from Clinical Textual Time Series: Adaptations of the Encoder and Decoder Language Model Families Shahriar Noroozizadeh et.al. 2504.10340 null Kimi
743 2025-04-14 Heimdall: test-time scaling on the generative verification Wenlei Shi et.al. 2504.10337 null Kimi
744 2025-04-14 AlayaDB: The Data Foundation for Efficient and Effective Long-context LLM Inference Yangshen Deng et.al. 2504.10326 null Kimi
745 2025-04-14 Deep Reasoning Translation via Reinforcement Learning Jiaan Wang et.al. 2504.10187 link Kimi
746 2025-04-14 HalluSearch at SemEval-2025 Task 3: A Search-Enhanced RAG Pipeline for Hallucination Detection Mohamed A. Abdallah et.al. 2504.10168 null Kimi
747 2025-04-14 Breaking the Data Barrier – Building GUI Agents Through Task Generalization Junlei Zhang et.al. 2504.10127 link Kimi
748 2025-04-14 CameraBench: Benchmarking Visual Reasoning in MLLMs via Photography I-Sheng Fang et.al. 2504.10090 null Kimi
749 2025-04-14 RealSafe-R1: Safety-Aligned DeepSeek-R1 without Compromising Reasoning Capability Yichi Zhang et.al. 2504.10081 null Kimi
750 2025-04-14 Mavors: Multi-granularity Video Representation for Multimodal Large Language Model Yang Shi et.al. 2504.10068 null Kimi
751 2025-04-14 Hallucination Detection in LLMs via Topological Divergence on Attention Graphs Alexandra Bazarova et.al. 2504.10063 null Kimi
752 2025-04-14 DataMosaic: Explainable and Verifiable Multi-Modal Data Analytics through Extract-Reason-Verify Zhengxuan Zhang et.al. 2504.10036 null Kimi
753 2025-04-14 The Mirage of Performance Gains: Why Contrastive Decoding Fails to Address Multimodal Hallucination Hao Yin et.al. 2504.10020 null Kimi
754 2025-04-14 Do We Really Need Curated Malicious Data for Safety Alignment in Multi-modal Large Language Models? Yanbo Wang et.al. 2504.10000 null Kimi
755 2025-04-14 KeepKV: Eliminating Output Perturbation in KV Cache Compression for Efficient LLMs Inference Yuxuan Tian et.al. 2504.09936 null Kimi
756 2025-04-14 FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding Zheng Liu et.al. 2504.09925 link Kimi
757 2025-04-14 Reasoning Models Can Be Effective Without Thinking Wenjie Ma et.al. 2504.09858 null Kimi
758 2025-04-14 A Survey of Large Language Model-Powered Spatial Intelligence Across Scales: Advances in Embodied Agents, Smart Cities, and Earth Science Jie Feng et.al. 2504.09848 null Kimi
759 2025-04-14 OVERLORD: Ultimate Scaling of DataLoader for Multi-Source Large Foundation Model Training Juntao Zhao et.al. 2504.09844 null Kimi
760 2025-04-14 Training Small Reasoning LLMs with Cognitive Preference Alignment Wenrui Cai et.al. 2504.09802 null Kimi
761 2025-04-14 VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents Ryota Tanaka et.al. 2504.09795 null Kimi
762 2025-04-14 Reasoning Court: Combining Reasoning, Action, and Judgment for Multi-Hop Reasoning Jingtian Wu et.al. 2504.09781 null Kimi
763 2025-04-14 Understanding and Optimizing Multi-Stage AI Inference Pipelines Abhimanyu Rajeshkumar Bambhaniya et.al. 2504.09775 null Kimi
764 2025-04-14 Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning Can Jin et.al. 2504.09772 link Kimi
765 2025-04-13 Leveraging Reasoning Model Answers to Enhance Non-Reasoning Model Capability Haotian Wang et.al. 2504.09639 null Kimi
766 2025-04-13 Metropolis-Hastings Captioning Game: Knowledge Fusion of Vision Language Models via Decentralized Bayesian Inference Yuta Matsui et.al. 2504.09620 null Kimi
767 2025-04-10 Cat, Rat, Meow: On the Alignment of Language Model and Human Term-Similarity Judgments Lorenz Linhardt et.al. 2504.07965 null Kimi
768 2025-04-10 PixelFlow: Pixel-Space Generative Models with Flow Shoufa Chen et.al. 2504.07963 link Kimi
769 2025-04-10 GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation Lang Lin et.al. 2504.07962 null Kimi
770 2025-04-10 Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction Zeren Jiang et.al. 2504.07961 link Kimi
771 2025-04-10 CCMNet: Leveraging Calibrated Color Correction Matrices for Cross-Camera Color Constancy Dongyoung Kim et.al. 2504.07959 null Kimi
772 2025-04-10 MM-IFEngine: Towards Multimodal Instruction Following Shengyuan Ding et.al. 2504.07957 link Kimi
773 2025-04-10 VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning Yukun Qi et.al. 2504.07956 null Kimi
774 2025-04-10 Perception-R1: Pioneering Perception Policy with Reinforcement Learning En Yu et.al. 2504.07954 link Kimi
775 2025-04-10 Scaling Laws for Native Multimodal Models Mustafa Shukor et.al. 2504.07951 null Kimi
776 2025-04-10 InteractAvatar: Modeling Hand-Face Interaction in Photorealistic Avatars with Deformable Gaussians Kefan Chen et.al. 2504.07949 null Kimi
777 2025-04-10 GenEAva: Generating Cartoon Avatars with Fine-Grained Facial Expressions from Realistic Diffusion-based Faces Hao Yu et.al. 2504.07945 null Kimi
778 2025-04-10 HoloPart: Generative 3D Part Amodal Segmentation Yunhan Yang et.al. 2504.07943 null Kimi
779 2025-04-10 SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement Xiyao Wang et.al. 2504.07934 link Kimi
780 2025-04-10 The Urban Impact of AI: Modeling Feedback Loops in Next-Venue Recommendation Giovanni Mauro et.al. 2504.07911 link Kimi
781 2025-04-10 The Efficacy of Semantics-Preserving Transformations in Self-Supervised Learning for Medical Ultrasound Blake VanBerlo et.al. 2504.07904 null Kimi
782 2025-04-10 Redefining Machine Translation on Social Network Services with Large Language Models Hongcheng Guo et.al. 2504.07901 link Kimi
783 2025-04-10 How do Large Language Models Understand Relevance? A Mechanistic Interpretability Perspective Qi Liu et.al. 2504.07898 link Kimi
784 2025-04-10 Fast Adaptation with Behavioral Foundation Models Harshit Sikchi et.al. 2504.07896 null Kimi
785 2025-04-10 SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning Rui Pan et.al. 2504.07891 link Kimi
786 2025-04-10 Benchmarking Adversarial Robustness to Bias Elicitation in Large Language Models: Scalable Automated Assessment with LLM-as-a-Judge Riccardo Cantini et.al. 2504.07887 link Kimi
787 2025-04-10 Token Level Routing Inference System for Edge Devices Jianshu She et.al. 2504.07878 null Kimi
788 2025-04-10 Dual Engines of Thoughts: A Depth-Breadth Integration Framework for Open-Ended Analysis Fei-Hsuan Yu et.al. 2504.07872 null Kimi
789 2025-04-10 SAMJAM: Zero-Shot Video Scene Graph Generation for Egocentric Kitchen Videos Joshua Li et.al. 2504.07867 null Kimi
790 2025-04-10 Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs Yichun Yin et.al. 2504.07866 null Kimi
791 2025-04-10 2D-Curri-DPO: Two-Dimensional Curriculum Learning for Direct Preference Optimization Mengyang Li et.al. 2504.07856 null Kimi
792 2025-04-10 The KL3M Data Project: Copyright-Clean Training Resources for Large Language Models Michael J Bommarito II et.al. 2504.07854 link Kimi
793 2025-04-10 V2V3D: View-to-View Denoised 3D Reconstruction for Light-Field Microscopy Jiayin Zhao et.al. 2504.07853 null Kimi
794 2025-04-10 Anytime Single-Step MAPF Planning with Anytime PIBT Nayesha Gandotra et.al. 2504.07841 null Kimi
795 2025-04-10 Understanding Learner-LLM Chatbot Interactions and the Impact of Prompting Guidelines Cansu Koyuturk et.al. 2504.07840 null Kimi
796 2025-04-10 Deceptive Automated Interpretability: Language Models Coordinating to Fool Oversight Systems Simon Lermen et.al. 2504.07831 null Kimi
797 2025-04-10 MOSAIC: Modeling Social AI for Content Dissemination and Regulation in Multi-Agent Simulations Genglin Liu et.al. 2504.07830 link Kimi
798 2025-04-10 Cluster-Driven Expert Pruning for Mixture-of-Experts Large Language Models Hongcheng Guo et.al. 2504.07807 link Kimi
799 2025-04-10 On the Temporal Question-Answering Capabilities of Large Language Models Over Anonymized Data Alfredo Garrachón Ruiz et.al. 2504.07646 null Kimi
800 2025-04-10 ConceptFormer: Towards Efficient Use of Knowledge-Graph Embeddings in Large Language Models Joel Barmettler et.al. 2504.07624 null Kimi
801 2025-04-10 VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model Haozhan Shen et.al. 2504.07615 link Kimi
802 2025-04-10 Boosting Universal LLM Reward Design through the Heuristic Reward Observation Space Evolution Zen Kit Heng et.al. 2504.07596 null Kimi
803 2025-04-10 AI-Slop to AI-Polish? Aligning Language Models through Edit-Based Writing Rewards and Test-time Computation Tuhin Chakrabarty et.al. 2504.07532 link Kimi
804 2025-04-10 Supervised Optimism Correction: Be Confident When LLMs Are Sure Junjie Zhang et.al. 2504.07527 null Kimi
805 2025-04-10 VideoExpert: Augmented LLM for Temporal-Sensitive Video Understanding Henghao Zhao et.al. 2504.07519 null Kimi
806 2025-04-10 GPT Carry-On: Training Foundation Model for Customization Could Be Simple, Scalable and Affordable Jianqiao Wangni et.al. 2504.07513 null Kimi
807 2025-04-10 Kimi-VL Technical Report Kimi Team et.al. 2504.07491 link Kimi
808 2025-04-10 Beyond LLMs: A Linguistic Approach to Causal Graph Generation from Narrative Texts Zehan Li et.al. 2504.07459 null Kimi
809 2025-04-10 From Token to Line: Enhancing Code Generation with a Long-Term Perspective Tingwei Lu et.al. 2504.07433 null Kimi
810 2025-04-10 TALE: A Tool-Augmented Framework for Reference-Free Evaluation of Large Language Models Sher Badshah et.al. 2504.07385 null Kimi
811 2025-04-10 Enhancing Time Series Forecasting via Multi-Level Text Alignment with LLMs Taibiao Zhao et.al. 2504.07360 link Kimi
812 2025-04-10 Revisiting Prompt Optimization with Large Reasoning Models-A Case Study on Event Extraction Saurabh Srivastava et.al. 2504.07357 null Kimi
813 2025-04-09 Modeling Response Consistency in Multi-Agent LLM Systems: A Comparative Analysis of Shared and Separate Context Approaches Tooraj Helmi et.al. 2504.07303 null Kimi
814 2025-04-09 SemEval-2025 Task 5: LLMs4Subjects – LLM-based Automated Subject Tagging for a National Technical Library’s Open-Access Catalog Jennifer D’Souza et.al. 2504.07199 link Kimi
815 2025-04-09 HypoEval: Hypothesis-Guided Evaluation for Natural Language Generation Mingxuan Li et.al. 2504.07174 link Kimi
816 2025-04-09 Sculpting Subspaces: Constrained Full Fine-Tuning in LLMs for Continual Learning Nikhil Shivakumar Nayak et.al. 2504.07097 link Kimi
817 2025-04-09 OmniCaptioner: One Captioner to Rule Them All Yiting Lu et.al. 2504.07089 link Kimi
818 2025-04-09 KG-LLM-Bench: A Scalable Benchmark for Evaluating LLM Reasoning on Textualized Knowledge Graphs Elan Markowitz et.al. 2504.07087 null Kimi
819 2025-04-09 DeduCE: Deductive Consistency as a Framework to Evaluate LLM Reasoning Atharva Pandey et.al. 2504.07080 null Kimi
820 2025-04-09 SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills Boyuan Zheng et.al. 2504.07079 null Kimi
821 2025-04-09 HalluciNot: Hallucination Detection Through Context and Common Knowledge Verification Bibek Paudel et.al. 2504.07069 null Kimi
822 2025-04-09 Review of Case-Based Reasoning for LLM Agents: Theoretical Foundations, Architectural Components, and Cognitive Integration Kostas Hatalis et.al. 2504.06943 null Kimi
823 2025-04-09 Are Vision-Language Models Ready for Dietary Assessment? Exploring the Next Frontier in AI-Powered Food Image Recognition Sergio Romero-Tapiador et.al. 2504.06925 null Kimi
824 2025-04-09 Integrating Cognitive Processing Signals into Language Models: A Review of Advances, Applications and Future Directions Angela Lopez-Cardona et.al. 2504.06843 null Kimi
825 2025-04-09 LVC: A Lightweight Compression Framework for Enhancing VLMs in Long Video Understanding Ziyi Wang et.al. 2504.06835 null Kimi
826 2025-04-09 Domain-Specific Pruning of Large Mixture-of-Experts Models with Few-shot Demonstrations Zican Dong et.al. 2504.06792 null Kimi
827 2025-04-09 Zero-Shot Image-Based Large Language Model Approach to Road Pavement Monitoring Shuoshuo Xu et.al. 2504.06785 null Kimi
828 2025-04-09 FamilyTool: A Multi-hop Personalized Tool Use Benchmark Yuxin Wang et.al. 2504.06766 link Kimi
829 2025-04-09 EDIT: Enhancing Vision Transformers by Mitigating Attention Sink through an Encoder-Decoder Architecture Wenfeng Feng et.al. 2504.06738 null Kimi
830 2025-04-09 A Neuro-inspired Interpretation of Unlearning in Large Language Models through Sample-level Unlearning Difficulty Xiaohua Feng et.al. 2504.06658 null Kimi
831 2025-04-09 Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program Minghe Gao et.al. 2504.06606 null Kimi
832 2025-04-09 Automated Business Process Analysis: An LLM-Based Approach to Value Assessment William De Michele et.al. 2504.06600 link Kimi
833 2025-04-09 Right Prediction, Wrong Reasoning: Uncovering LLM Misalignment in RA Disease Diagnosis Umakanta Maharana et.al. 2504.06581 link Kimi
834 2025-04-09 NeedleInATable: Exploring Long-Context Capability of Large Language Models towards Long-Structured Tables Lanrui Wang et.al. 2504.06560 null Kimi
835 2025-04-09 Lugha-Llama: Adapting Large Language Models for African Languages Happy Buzaaba et.al. 2504.06536 null Kimi
836 2025-04-08 Don’t Let It Hallucinate: Premise Verification via Retrieval-Augmented Logical Reasoning Yuehan Qin et.al. 2504.06438 null Kimi
837 2025-04-08 S’MoRE: Structural Mixture of Residual Experts for LLM Fine-tuning Hanqing Zeng et.al. 2504.06426 null Kimi
838 2025-04-08 Understanding Machine Unlearning Through the Lens of Mode Connectivity Jiali Cheng et.al. 2504.06407 null Kimi
839 2025-04-08 GOLLuM: Gaussian Process Optimized LLMs – Reframing LLM Finetuning through Bayesian Optimization Bojana Ranković et.al. 2504.06265 link Kimi
840 2025-04-09 Hogwild! Inference: Parallel LLM Generation via Concurrent Attention Gleb Rodionov et.al. 2504.06261 null Kimi
841 2025-04-08 FEABench: Evaluating Language Models on Multiphysics Reasoning Ability Nayantara Mudur et.al. 2504.06260 link Kimi
842 2025-04-08 Encoder-Decoder Gemma: Improving the Quality-Efficiency Trade-Off via Adaptation Biao Zhang et.al. 2504.06225 null Kimi
843 2025-04-08 From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models Chejian Xu et.al. 2504.06214 null Kimi
844 2025-04-08 TxGemma: Efficient and Agentic LLMs for Therapeutics Eric Wang et.al. 2504.06196 null Kimi
845 2025-04-08 Navigating the Rabbit Hole: Emergent Biases in LLM-Generated Attack Narratives Targeting Mental Health Groups Rijul Magu et.al. 2504.06160 null Kimi
846 2025-04-08 QGen Studio: An Adaptive Question-Answer Generation, Training and Evaluation Platform Movina Moses et.al. 2504.06136 null Kimi
847 2025-04-08 Multi-Sense Embeddings for Language Models and Knowledge Distillation Qitong Wang et.al. 2504.06036 null Kimi
848 2025-04-08 NativQA Framework: Enabling LLMs with Native, Local, and Everyday Knowledge Firoj Alam et.al. 2504.05995 null Kimi
849 2025-04-08 PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario Sriram Mandalika et.al. 2504.05908 null Kimi
850 2025-04-08 HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference Shuzhang Zhong et.al. 2504.05897 link Kimi
851 2025-04-08 Agent Guide: A Simple Agent Behavioral Watermarking Framework Kaibo Huang et.al. 2504.05871 null Kimi
852 2025-04-08 Are Generative AI Agents Effective Personalized Financial Advisors? Takehiro Takayanagi et.al. 2504.05862 link Kimi
853 2025-04-08 How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM Jirong Zha et.al. 2504.05786 null Kimi
854 2025-04-08 DDT: Decoupled Diffusion Transformer Shuai Wang et.al. 2504.05741 null Kimi
855 2025-04-08 Rank-Then-Score: Enhancing Large Language Models for Automated Essay Scoring Yida Cai et.al. 2504.05736 null Kimi
856 2025-04-08 STRIVE: A Think & Improve Approach with Iterative Refinement for Enhancing Question Quality Estimation Aniket Deroy et.al. 2504.05693 null Kimi
857 2025-04-08 Towards Smarter Hiring: Are Zero-Shot and Few-Shot Pre-trained LLMs Ready for HR Spoken Interview Transcript Analysis? Subhankar Maity et.al. 2504.05683 null Kimi
858 2025-04-08 Sugar-Coated Poison: Benign Generation Unlocks LLM Jailbreaking Yu-Hang Wu et.al. 2504.05652 link Kimi
859 2025-04-08 TAGC: Optimizing Gradient Communication in Distributed Transformer Training Igor Polyakov et.al. 2504.05638 link Kimi
860 2025-04-08 FactGuard: Leveraging Multi-Agent Systems to Generate Answerable and Unanswerable Questions for Enhanced Long-Context LLM Extraction Qian-Wen Zhang et.al. 2504.05607 null Kimi
861 2025-04-08 ShadowCoT: Cognitive Hijacking for Stealthy Reasoning Backdoors in LLMs Gejian Zhao et.al. 2504.05605 null Kimi
862 2025-04-08 Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought Yi Peng et.al. 2504.05599 null Kimi
863 2025-04-08 DEL: Context-Aware Dynamic Exit Layer for Efficient Self-Speculative Decoding Hossein Entezari Zarch et.al. 2504.05598 null Kimi
864 2025-04-08 Knowledge-Instruct: Effective Continual Pre-training from Limited Data using Instructions Oded Ovadia et.al. 2504.05571 null Kimi
865 2025-04-07 Bridging Industrial Expertise and XR with LLM-Powered Conversational Agents Despina Tomkou et.al. 2504.05527 null Kimi
866 2025-04-07 Fast Controlled Generation from Language Models with Adaptive Weighted Rejection Sampling Benjamin Lipkin et.al. 2504.05410 null Kimi
867 2025-04-07 LiveVQA: Live Visual Knowledge Seeking Mingyang Fu et.al. 2504.05288 null Kimi
868 2025-04-07 Learning to Reason Over Time: Timeline Self-Reflection for Improved Temporal Reasoning in Language Models Adrián Bazaga et.al. 2504.05258 null Kimi
869 2025-04-07 Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling Hengran Zhang et.al. 2504.05216 null Kimi
870 2025-04-07 Post-Training Language Models for Continual Relation Extraction Sefika Efeoglu et.al. 2504.05214 null Kimi
871 2025-04-07 Concise Reasoning via Reinforcement Learning Mehdi Fatemi et.al. 2504.05185 link Kimi
872 2025-04-07 VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks Yu Yue et.al. 2504.05118 null Kimi
873 2025-04-07 AI for Climate Finance: Agentic Retrieval and Multi-Step Reasoning for Early Warning System Investments Saeid Ario Vaghefi et.al. 2504.05104 null Kimi
874 2025-04-07 The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning Tianshi Zheng et.al. 2504.05081 null Kimi
875 2025-04-07 Revealing the Intrinsic Ethical Vulnerability of Aligned Large Language Models Jiawei Lian et.al. 2504.05050 null Kimi
876 2025-04-07 Debate Only When Necessary: Adaptive Multiagent Collaboration for Efficient LLM Reasoning Sugyeong Eo et.al. 2504.05047 null Kimi
877 2025-04-07 Following the Whispers of Values: Unraveling Neural Mechanisms Behind Value-Oriented Behaviors in LLMs Ling Hu et.al. 2504.04994 null Kimi
878 2025-04-07 Towards Visual Text Grounding of Multimodal Large Language Model Ming Li et.al. 2504.04974 null Kimi
879 2025-04-07 M-Prometheus: A Suite of Open Multilingual LLM Judges José Pombal et.al. 2504.04953 null Kimi
880 2025-04-07 A Llama walks into the ‘Bar’: Efficient Supervised Fine-Tuning for Legal Reasoning in the Multi-state Bar Exam Rean Fernandes et.al. 2504.04945 null Kimi
881 2025-04-07 Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration Ran Xu et.al. 2504.04915 link Kimi
882 2025-04-07 Leveraging Large Language Models for Cost-Effective, Multilingual Depression Detection and Severity Assessment Longdi Xian et.al. 2504.04891 null Kimi
883 2025-04-07 Uni4D: A Unified Self-Supervised Learning Framework for Point Cloud Videos Zhi Zuo et.al. 2504.04837 null Kimi
884 2025-04-07 Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models Ruikang Liu et.al. 2504.04823 link Kimi
885 2025-04-07 Can LLMs Interpret and Leverage Structured Linguistic Representations? A Case Study with AMRs Ankush Raut et.al. 2504.04745 null Kimi
886 2025-04-07 TathyaNyaya and FactLegalLlama: Advancing Factual Judgment Prediction and Explanation in the Indian Legal Context Shubham Kumar Nigam et.al. 2504.04737 null Kimi
887 2025-04-07 Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use Anna Goldie et.al. 2504.04736 null Kimi
888 2025-04-07 Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models Yubo Li et.al. 2504.04717 link Kimi
889 2025-04-07 Sequential-NIAH: A Needle-In-A-Haystack Benchmark for Extracting Sequential Needles from Long Contexts Yifei Yu et.al. 2504.04713 null Kimi
890 2025-04-07 LagKV: Lag-Relative Information of the KV Cache Tells Which Tokens Are Important Manlai Liang et.al. 2504.04704 link Kimi
891 2025-04-07 R2Vul: Learning to Reason about Software Vulnerabilities with Reinforcement Learning and Structured Reasoning Distillation Martin Weyssow et.al. 2504.04699 link Kimi
892 2025-04-07 LEO-MINI: An Efficient Multimodal Large Language Model using Conditional Token Reduction and Mixture of Multi-Modal Experts Yimu Wang et.al. 2504.04653 null Kimi
893 2025-04-06 Splits! A Flexible Dataset for Evaluating a Model’s Demographic Social Inference Eylon Caplan et.al. 2504.04640 link Kimi
894 2025-04-06 SECQUE: A Benchmark for Evaluating Real-World Financial Analysis Capabilities Noga Ben Yoash et.al. 2504.04596 null Kimi
895 2025-04-06 The Point, the Vision and the Text: Does Point Cloud Boost Spatial Reasoning of Large Language Models? Weichen Zhang et.al. 2504.04540 null Kimi
896 2025-04-06 An Empirical Comparison of Text Summarization: A Multi-Dimensional Evaluation of Large Language Models Anantharaman Janakiraman et.al. 2504.04534 null Kimi
897 2025-04-03 Concept Lancet: Image Editing with Compositional Representation Transplant Jinqi Luo et.al. 2504.02828 null Kimi
898 2025-04-03 On Vanishing Variance in Transformer Length Generalization Ruining Li et.al. 2504.02827 null Kimi
899 2025-04-03 Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing Xiangyu Zhao et.al. 2504.02826 link Kimi
900 2025-04-03 Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models Mateusz Pach et.al. 2504.02821 link Kimi
901 2025-04-03 GMR-Conv: An Efficient Rotation and Reflection Equivariant Convolution Kernel Using Gaussian Mixture Rings Yuexi Du et.al. 2504.02819 link Kimi
902 2025-04-03 Efficient Autoregressive Shape Generation via Octree-Based Adaptive Tokenization Kangle Deng et.al. 2504.02817 null Kimi
903 2025-04-03 Generative Evaluation of Complex Reasoning in Large Language Models Haowei Lin et.al. 2504.02810 link Kimi
904 2025-04-03 MegaMath: Pushing the Limits of Open Math Corpora Fan Zhou et.al. 2504.02807 link Kimi
905 2025-04-03 A Survey of Large Language Models in Mental Health Disorder Detection on Social Media Zhuohan Ge et.al. 2504.02800 null Kimi
906 2025-04-03 Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence Anita Rau et.al. 2504.02799 null Kimi
907 2025-04-03 Spline-based Transformers Prashanth Chandran et.al. 2504.02797 null Kimi
908 2025-04-03 A Framework for Situating Innovations, Opportunities, and Challenges in Advancing Vertical Systems with Large AI Models Gaurav Verma et.al. 2504.02793 null Kimi
909 2025-04-03 Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets Chuning Zhu et.al. 2504.02792 null Kimi
910 2025-04-03 A Framework for Robust Cognitive Evaluation of LLMs Karin de Langis et.al. 2504.02789 null Kimi
911 2025-04-03 GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation Zhiyuan Yan et.al. 2504.02782 link Kimi
912 2025-04-03 From Consumption to Collaboration: Measuring Interaction Patterns to Augment Human Cognition in Open-Ended Tasks Joshua Holstein et.al. 2504.02780 null Kimi
913 2025-04-03 Multi-Head Adaptive Graph Convolution Network for Sparse Point Cloud-Based Human Activity Recognition Vincent Gbouna Zakka et.al. 2504.02778 link Kimi
914 2025-04-03 MultiBLiMP 1.0: A Massively Multilingual Benchmark of Linguistic Minimal Pairs Jaap Jumelet et.al. 2504.02768 null Kimi
915 2025-04-03 How Deep Do Large Language Models Internalize Scientific Literature and Citation Practices? Andres Algaba et.al. 2504.02767 link Kimi
916 2025-04-03 Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model Shengjun Zhang et.al. 2504.02764 null Kimi
917 2025-04-03 CanonNet: Canonical Ordering and Curvature Learning for Point Cloud Analysis Benjy Friedmann et.al. 2504.02763 null Kimi
918 2025-04-03 RBR4DNN: Requirements-based Testing of Neural Networks Nusrat Jahan Mozumder et.al. 2504.02737 link Kimi
919 2025-04-03 Enhancing LLM Robustness to Perturbed Instructions: An Empirical Study Aryan Agrawal et.al. 2504.02733 link Kimi
920 2025-04-03 Why do LLMs attend to the first token? Federico Barbero et.al. 2504.02732 null Kimi
921 2025-04-03 HQViT: Hybrid Quantum Vision Transformer for Image Classification Hui Zhang et.al. 2504.02730 null Kimi
922 2025-04-03 ERPO: Advancing Safety Alignment via Ex-Ante Reasoning Preference Optimization Kehua Feng et.al. 2504.02725 null Kimi
923 2025-04-03 Autonomous Human-Robot Interaction via Operator Imitation Sammy Christen et.al. 2504.02724 null Kimi
924 2025-04-03 The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context Nikhil Verma et.al. 2504.02708 null Kimi
925 2025-04-03 Responsible Development of Offensive AI Ryan Marinelli et.al. 2504.02701 link Kimi
926 2025-04-03 Learning Phase Distortion with Selective State Space Models for Video Turbulence Mitigation Xingguang Zhang et.al. 2504.02697 link Kimi
927 2025-04-03 Affordable AI Assistants with Knowledge Graph of Thoughts Maciej Besta et.al. 2504.02670 null Kimi
928 2025-04-03 Inference-Time Scaling for Generalist Reward Modeling Zijun Liu et.al. 2504.02495 null Kimi
929 2025-04-03 Cognitive Memory in Large Language Models Lianlei Shan et.al. 2504.02441 null Kimi
930 2025-04-03 Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation Chuanqi Cheng et.al. 2504.02438 null Kimi
931 2025-04-03 AnesBench: Multi-Dimensional Evaluation of LLM Reasoning in Anesthesiology Xiang Feng et.al. 2504.02404 link Kimi
932 2025-04-03 CoTAL: Human-in-the-Loop Prompt Engineering, Chain-of-Thought Reasoning, and Active Learning for Generalizable Formative Assessment Scoring Clayton Cohn et.al. 2504.02323 null Kimi
933 2025-04-03 MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism Ruidong Zhu et.al. 2504.02263 null Kimi
934 2025-04-03 LLMs as Deceptive Agents: How Role-Based Prompting Induces Semantic Ambiguity in Puzzle Tasks Seunghyun Yoo et.al. 2504.02254 null Kimi
935 2025-04-03 FT-Transformer: Resilient and Reliable Transformer with End-to-End Fault Tolerant Attention Huangliang Dai et.al. 2504.02211 null Kimi
936 2025-04-03 More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment Yifan Wang et.al. 2504.02193 null Kimi
937 2025-04-02 A Survey of Scaling in Large Language Model Reasoning Zihan Chen et.al. 2504.02181 null Kimi
938 2025-04-02 OmniCellTOSG: The First Cell Text-Omic Signaling Graphs Dataset for Joint LLM and GNN Modeling Heming Zhang et.al. 2504.02148 link Kimi
939 2025-04-02 On Simulation-Guided LLM-based Code Generation for Safe Autonomous Driving Software Ali Nouri et.al. 2504.02141 null Kimi
940 2025-04-02 Achieving Unanimous Consensus in Decision Making Using Multi-Agents Apurba Pokharel et.al. 2504.02128 null Kimi
941 2025-04-02 Exploring LLM Reasoning Through Controlled Prompt Variations Giannis Chatziveroglou et.al. 2504.02111 link Kimi
942 2025-04-02 The LLM Wears Prada: Analysing Gender Bias and Stereotypes through Online Shopping Data Massimiliano Luca et.al. 2504.01951 null Kimi
943 2025-04-02 OpenCodeReasoning: Advancing Data Distillation for Competitive Coding Wasi Uddin Ahmad et.al. 2504.01943 null Kimi
944 2025-04-02 Critical Thinking: Which Kinds of Complexity Govern Optimal Reasoning Length? Celine Lee et.al. 2504.01935 link Kimi
945 2025-04-02 A thorough benchmark of automatic text classification: From traditional approaches to large language models Washington Cunha et.al. 2504.01930 link Kimi
946 2025-04-03 Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation Baban Gain et.al. 2504.01919 null Kimi
947 2025-04-02 FineLIP: Extending CLIP’s Reach via Fine-Grained Alignment with Longer Text Inputs Mothilal Asokan et.al. 2504.01916 null Kimi
948 2025-04-02 Advancing AI-Scientist Understanding: Making LLM Think Like a Physicist with Interpretable Reasoning Yinggan Xu et.al. 2504.01911 null Kimi
949 2025-04-02 STAR-1: Safer Alignment of Reasoning LLMs with 1K Data Zijun Wang et.al. 2504.01903 null Kimi
950 2025-04-02 TransientTables: Evaluating LLMs’ Reasoning on Temporally Evolving Semi-structured Tables Abhilash Shankarampeta et.al. 2504.01879 null Kimi
951 2025-04-02 Cross-Lingual Consistency: A Novel Inference Framework for Advancing Reasoning in Large Language Models Zhiwei Yu et.al. 2504.01857 null Kimi
952 2025-04-02 InfiniteICL: Breaking the Limit of Context Window Size via Long Short-term Memory Transformation Bowen Cao et.al. 2504.01707 null Kimi
953 2025-04-02 ToM-RL: Reinforcement Learning Unlocks Theory of Mind in Small LLMs Yi-Long Lu et.al. 2504.01698 link Kimi
954 2025-04-02 Testing Low-Resource Language Support in LLMs Using Language Proficiency Exams: the Case of Luxembourgish Cedric Lothritz et.al. 2504.01667 null Kimi
955 2025-04-02 Enabling Systematic Generalization in Abstract Spatial Reasoning through Meta-Learning for Compositionality Philipp Mondorf et.al. 2504.01445 link Kimi
956 2025-04-02 FAIRE: Assessing Racial and Gender Bias in AI-Driven Resume Evaluations Athena Wen et.al. 2504.01420 link Kimi
957 2025-04-02 An Illusion of Progress? Assessing the Current State of Web Agents Tianci Xue et.al. 2504.01382 link Kimi
958 2025-04-02 Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert Parallelism Design Mohan Zhang et.al. 2504.01337 null Kimi
959 2025-04-02 Slow-Fast Architecture for Video Multi-Modal Large Language Models Min Shi et.al. 2504.01328 link Kimi
960 2025-04-02 On Data Synthesis and Post-training for Visual Abstract Reasoning Ke Zhu et.al. 2504.01324 null Kimi
961 2025-04-02 Adaptive Rectification Sampling for Test-Time Compute Scaling Zhendong Tan et.al. 2504.01317 link Kimi
962 2025-04-02 ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning Bairu Hou et.al. 2504.01296 link Kimi
963 2025-04-02 Scaling Test-Time Inference with Policy-Optimized, Dynamic Retrieval-Augmented Generation via KV Caching and Decoding Sakhinana Sagar Srinivas et.al. 2504.01281 null Kimi
964 2025-04-01 Automated Factual Benchmarking for In-Car Conversational Systems using Large Language Models Rafael Giebisch et.al. 2504.01248 null Kimi
965 2025-04-01 Detecting PTSD in Clinical Interviews: A Comparative Analysis of NLP Methods and Large Language Models Feng Chen et.al. 2504.01216 null Kimi
966 2025-04-01 $μ$KE: Matryoshka Unstructured Knowledge Editing of Large Language Models Zian Su et.al. 2504.01196 null Kimi
967 2025-04-01 When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning Nishad Singhi et.al. 2504.01005 null Kimi
968 2025-04-01 Token embeddings violate the manifold hypothesis Michael Robinson et.al. 2504.01002 null Kimi
969 2025-04-01 MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization Siyuan Li et.al. 2504.00999 null Kimi
970 2025-04-01 MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs Juncheng Wu et.al. 2504.00993 link Kimi
971 2025-04-01 SentenceKV: Efficient LLM Inference via Sentence-Level Semantic KV Caching Yuxuan Zhu et.al. 2504.00970 null Kimi
972 2025-04-01 Multi-Token Attention Olga Golovneva et.al. 2504.00927 null Kimi
973 2025-04-01 Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents Saaket Agashe et.al. 2504.00906 link Kimi
974 2025-03-31 Easi3R: Estimating Disentangled Motion from DUSt3R Without Training Xingyu Chen et.al. 2503.24391 link Kimi
975 2025-03-31 RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy Zhonghan Zhao et.al. 2503.24388 null Kimi
976 2025-03-31 Consistent Subject Generation via Contrastive Instantiated Concepts Lee Hsin-Ying et.al. 2503.24387 null Kimi
977 2025-03-31 Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation Shengqiong Wu et.al. 2503.24379 null Kimi
978 2025-03-31 ACPBench Hard: Unrestrained Reasoning about Action, Change, and Planning Harsha Kokel et.al. 2503.24378 null Kimi
979 2025-03-31 Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models Rui Wang et.al. 2503.24377 link Kimi
980 2025-03-31 Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1 Yi Chen et.al. 2503.24376 link Kimi
981 2025-03-31 ERUPT: Efficient Rendering with Unposed Patch Transformer Maxim V. Shugaev et.al. 2503.24374 null Kimi
982 2025-03-31 Effectively Controlling Reasoning Models through Thinking Intervention Tong Wu et.al. 2503.24370 null Kimi
983 2025-03-31 Adapting Vision Foundation Models for Real-time Ultrasound Image Segmentation Xiaoran Zhang et.al. 2503.24368 null Kimi
984 2025-03-31 Query and Conquer: Execution-Guided SQL Generation Łukasz Borchmann et.al. 2503.24364 null Kimi
985 2025-03-31 SQuat: Subspace-orthogonal KV Cache Quantization Hao Wang et.al. 2503.24358 null Kimi
986 2025-03-31 ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion Rana Muhammad Shahroz Khan et.al. 2503.24354 null Kimi
987 2025-03-31 Can Test-Time Scaling Improve World Foundation Model? Wenyan Cong et.al. 2503.24320 link Kimi
988 2025-03-31 BEATS: Bias Evaluation and Assessment Test Suite for Large Language Models Alok Abhishek et.al. 2503.24310 null Kimi
989 2025-03-31 A Systematic Evaluation of LLM Strategies for Mental Health Text Analysis: Fine-tuning vs. Prompt Engineering vs. RAG Arshia Kermani et.al. 2503.24307 null Kimi
990 2025-03-31 Order Matters: On Parameter-Efficient Image-to-Video Probing for Recognizing Nearly Symmetric Actions Thinesh Thiyakesan Ponbagavathi et.al. 2503.24298 null Kimi
991 2025-03-31 Is analogy enough to draw novel adjective-noun inferences? Hayley Ross et.al. 2503.24293 link Kimi
992 2025-03-31 Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model Jingcheng Hu et.al. 2503.24290 null Kimi
993 2025-03-31 Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning Jiacheng Lin et.al. 2503.24289 link Kimi
994 2025-03-31 Style Quantization for Data-Efficient GAN Training Jian Wang et.al. 2503.24282 null Kimi
995 2025-03-31 Evaluating and Designing Sparse Autoencoders by Approximating Quasi-Orthogonality Sewoong Lee et.al. 2503.24277 link Kimi
996 2025-03-31 FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics Yixuan Li et.al. 2503.24267 null Kimi
997 2025-03-31 Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation Dun Yuan et.al. 2503.24245 null Kimi
998 2025-03-31 Spatio-temporal Prediction of Fine-Grained Origin-Destination Matrices with Applications in Ridesharing Run Yang et.al. 2503.24237 null Kimi
999 2025-03-31 What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models Qiyuan Zhang et.al. 2503.24235 link Kimi
1000 2025-03-31 PAARS: Persona Aligned Agentic Retail Shoppers Saab Mansour et.al. 2503.24228 null Kimi
1001 2025-03-31 MB-ORES: A Multi-Branch Object Reasoner for Visual Grounding in Remote Sensing Karim Radouane et.al. 2503.24219 link Kimi
1002 2025-03-31 All You Need is Sally-Anne: ToM in AI Strongly Supported After Surpassing Tests for 3-Year-Olds Nitay Alon et.al. 2503.24215 null Kimi
1003 2025-03-31 Synthetic News Generation for Fake News Classification Abdul Sittar et.al. 2503.24206 null Kimi
1004 2025-03-31 TwT: Thinking without Tokens by Habitual Reasoning Distillation with Multi-Teachers’ Guidance Jingxian Xu et.al. 2503.24198 null Kimi
1005 2025-03-31 Output Constraints as Attack Surface: Exploiting Structured Generation to Bypass LLM Safety Mechanisms Shuoming Zhang et.al. 2503.24191 null Kimi
1006 2025-03-31 Grounding Agent Reasoning in Image Schemas: A Neurosymbolic Approach to Embodied Cognition François Olivier et.al. 2503.24110 null Kimi
1007 2025-03-31 Artificial Conversations, Real Results: Fostering Language Detection with Synthetic Data Fatemeh Mohammadi et.al. 2503.24062 null Kimi
1008 2025-03-31 AirCache: Activating Inter-modal Relevancy KV Cache Compression for Efficient Large Vision-Language Model Inference Kai Huang et.al. 2503.23956 null Kimi
1009 2025-03-31 Model Hemorrhage and the Robustness Limits of Large Language Models Ziyang Ma et.al. 2503.23924 null Kimi
1010 2025-03-31 OrchMLLM: Orchestrate Multimodal Data with Batch Post-Balancing to Accelerate Multimodal Large Language Model Training Yijie Zheng et.al. 2503.23830 null Kimi
1011 2025-03-31 Expanding RL with Verifiable Rewards Across Diverse Domains Yi Su et.al. 2503.23829 null Kimi
1012 2025-03-31 Thinking Longer, Not Larger: Enhancing Software Engineering Agents via Scaling Test-Time Compute Yingwei Ma et.al. 2503.23803 link Kimi
1013 2025-03-31 Adaptive Layer-skipping in Pre-trained LLMs Xuan Luo et.al. 2503.23798 null Kimi
1014 2025-03-31 WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization Ine Gevers et.al. 2503.23779 null Kimi
1015 2025-03-31 Short-video Propagation Influence Rating: A New Real-world Dataset and A New Large Graph Model Dizhan Xue et.al. 2503.23746 link Kimi
1016 2025-03-31 LANID: LLM-assisted New Intent Discovery Lu Fan et.al. 2503.23740 link Kimi
1017 2025-03-31 AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization Yiyang Du et.al. 2503.23733 link Kimi
1018 2025-03-30 Question-Aware Knowledge Graph Prompting for Enhancing Large Language Models Haochen Liu et.al. 2503.23523 link Kimi
1019 2025-03-30 If an LLM Were a Character, Would It Know Its Own Story? Evaluating Lifelong Learning in LLMs Siqi Fan et.al. 2503.23514 null Kimi
1020 2025-03-30 RARE: Retrieval-Augmented Reasoning Modeling Zhengren Wang et.al. 2503.23513 link Kimi
1021 2025-03-30 Benchmarking Systematic Relational Reasoning with Large Language and Reasoning Models Irtaza Khalid et.al. 2503.23487 null Kimi
1022 2025-03-30 Order Independence With Finetuning Katrina Brown et.al. 2503.23483 null Kimi
1023 2025-03-27 Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model Abdelrahman Shaker et.al. 2503.21782 link Kimi
1024 2025-03-27 X$^{2}$-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction Weihao Yu et.al. 2503.21779 null Kimi
1025 2025-03-27 Video-R1: Reinforcing Video Reasoning in MLLMs Kaituo Feng et.al. 2503.21776 link Kimi
1026 2025-03-27 StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion Ziyu Guo et.al. 2503.21775 null Kimi
1027 2025-03-27 MemInsight: Autonomous Memory Augmentation for LLM Agents Rana Salama et.al. 2503.21760 null Kimi
1028 2025-03-27 Fwd2Bot: LVLM Visual Token Compression with Double Forward Bottleneck Adrian Bulat et.al. 2503.21757 null Kimi
1029 2025-03-27 LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis Shitian Zhao et.al. 2503.21749 null Kimi
1030 2025-03-27 GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics Arsham Gholamzadeh Khoee et.al. 2503.21735 null Kimi
1031 2025-03-27 Effective Skill Unlearning through Intervention and Abstention Yongce Li et.al. 2503.21730 link Kimi
1032 2025-03-27 ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation Zhicheng Lee et.al. 2503.21729 null Kimi
1033 2025-03-27 OccRobNet: Occlusion Robust Network for Accurate 3D Interacting Hand-Object Pose Estimation Mallika Garg et.al. 2503.21723 null Kimi
1034 2025-03-27 Collab: Controlled Decoding using Mixture of Agents for LLM Alignment Souradip Chakraborty et.al. 2503.21720 null Kimi
1035 2025-03-27 Outlier dimensions favor frequent tokens in language model Iuri Macocco et.al. 2503.21718 null Kimi
1036 2025-03-27 CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers? Jiefu Ou et.al. 2503.21717 link Kimi
1037 2025-03-27 Elementwise Layer Normalization Felix Stollenwerk et.al. 2503.21708 link Kimi
1038 2025-03-27 MAVERIX: Multimodal Audio-Visual Evaluation Reasoning IndeX Liuyue Xie et.al. 2503.21699 null Kimi
1039 2025-03-27 Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks Wenqi Zhang et.al. 2503.21696 link Kimi
1040 2025-03-27 AMA-SAM: Adversarial Multi-Domain Alignment of Segment Anything Model for High-Fidelity Histology Nuclei Segmentation Jiahe Qian et.al. 2503.21695 null Kimi
1041 2025-03-27 Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data Zhiyuan Ma et.al. 2503.21694 link Kimi
1042 2025-03-27 LLM-Gomoku: A Large Language Model-Based System for Strategic Gomoku with Self-Play and Reinforcement Learning Hui Wang et.al. 2503.21683 null Kimi
1043 2025-03-27 JiraiBench: A Bilingual Benchmark for Evaluating Large Language Models’ Detection of Human Self-Destructive Behavior Content in Jirai Community Yunze Xiao et.al. 2503.21679 null Kimi
1044 2025-03-27 How do language models learn facts? Dynamics, curricula and hallucinations Nicolas Zucchet et.al. 2503.21676 null Kimi
1045 2025-03-27 COMI-LINGUA: Expert Annotated Large-Scale Dataset for Multitask NLP in Hindi-English Code-Mixing Rajvee Sheth et.al. 2503.21670 null Kimi
1046 2025-03-27 Cognitive Science-Inspired Evaluation of Core Capabilities for Object Understanding in AI Danaja Rutar et.al. 2503.21668 null Kimi
1047 2025-03-27 UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning Zhengxi Lu et.al. 2503.21620 link Kimi
1048 2025-03-27 A Measure Based Generalizable Approach to Understandability Vikas Kushwaha et.al. 2503.21615 null Kimi
1049 2025-03-27 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond Xiaoye Qu et.al. 2503.21614 link Kimi
1050 2025-03-27 Evaluating book summaries from internal knowledge in Large Language Models: a cross-model and semantic consistency approach Javier Coronado-Blázquez et.al. 2503.21613 null Kimi
1051 2025-03-27 GenEdit: Compounding Operators and Continuous Improvement to Tackle Text-to-SQL in the Enterprise Karime Maamari et.al. 2503.21602 null Kimi
1052 2025-03-27 Prompt, Divide, and Conquer: Bypassing Large Language Model Safety Filters via Segmented and Distributed Prompt Processing Johan Wahréus et.al. 2503.21598 null Kimi
1053 2025-03-27 debug-gym: A Text-Based Environment for Interactive Debugging Xingdi Yuan et.al. 2503.21557 null Kimi
1054 2025-03-27 SWI: Speaking with Intent in Large Language Models Yuwei Yin et.al. 2503.21544 link Kimi
1055 2025-03-27 Keyword-Oriented Multimodal Modeling for Euphemism Identification Yuxue Hu et.al. 2503.21504 link Kimi
1056 2025-03-27 Harnessing Chain-of-Thought Metadata for Task Routing and Adversarial Prompt Detection Ryan Marinelli et.al. 2503.21464 link Kimi
1057 2025-03-27 An evaluation of LLMs and Google Translate for translation of selected Indian languages via sentiment and semantic analyses Rohitash Chandra et.al. 2503.21393 null Kimi
1058 2025-03-27 Controlling Large Language Model with Latent Actions Chengxing Jia et.al. 2503.21383 link Kimi
1059 2025-03-27 Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models Haoxiang Sun et.al. 2503.21380 link Kimi
1060 2025-03-27 ReFeed: Multi-dimensional Summarization Refinement with Reflective Reasoning on Feedback Taewon Yun et.al. 2503.21332 null Kimi
1061 2025-03-27 InternVL-X: Advancing and Accelerating InternVL Series with Efficient Visual Token Compression Dongchen Lu et.al. 2503.21307 link Kimi
1062 2025-03-27 ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition Yujie Liu et.al. 2503.21248 null Kimi
1063 2025-03-27 Bias-Aware Agent: Enhancing Fairness in AI-Driven Knowledge Retrieval Karanbir Singh et.al. 2503.21237 link Kimi
1064 2025-03-27 LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models Hengyuan Zhao et.al. 2503.21227 null Kimi
1065 2025-03-27 ZJUKLAB at SemEval-2025 Task 4: Unlearning via Model Merging Haoming Xu et.al. 2503.21088 link Kimi
1066 2025-03-27 EQ-Negotiator: An Emotion-Reasoning LLM Agent in Credit Dialogues Yuhan Liu et.al. 2503.21080 null Kimi
1067 2025-03-27 Rerouting Connection: Hybrid Computer Vision Analysis Reveals Visual Similarity Between Indus and Tibetan-Yi Corridor Writing Systems Ooha Lakkadi Reddy et.al. 2503.21074 link Kimi
1068 2025-03-26 Can Large Language Models Predict Associations Among Human Attitudes? Ana Ma et.al. 2503.21011 null Kimi
1069 2025-03-26 VinaBench: Benchmark for Faithful and Consistent Visual Narratives Silin Gao et.al. 2503.20871 null Kimi
1070 2025-03-26 Understanding R1-Zero-Like Training: A Critical Perspective Zichen Liu et.al. 2503.20783 link Kimi
1071 2025-03-27 Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning Huajie Tan et.al. 2503.20752 null Kimi
1072 2025-03-26 Optimal Scaling Laws for Efficiency Gains in a Theoretical Transformer-Augmented Sectional MoE Framework Soham Sane et.al. 2503.20750 null Kimi
1073 2025-03-27 Beyond Believability: Accurate Human Behavior Simulation with Fine-Tuned LLMs Yuxuan Lu et.al. 2503.20749 null Kimi
1074 2025-03-26 Vision as LoRA Han Wang et.al. 2503.20680 link Kimi
1075 2025-03-26 TAMA: A Human-AI Collaborative Thematic Analysis Framework Using Multi-Agent LLMs for Clinical Interviews Huimin Xu et.al. 2503.20666 null Kimi
1076 2025-03-26 Collaborative Storytelling and LLM: A Linguistic Analysis of Automatically-Generated Role-Playing Game Sessions Alessandro Maisto et.al. 2503.20623 null Kimi
1077 2025-03-26 Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention Disaggregation Yunkai Liang et.al. 2503.20552 link Kimi
1078 2025-03-26 Accelerate Parallelizable Reasoning via Parallel Decoding within One Sequence Yijiong Yu et.al. 2503.20533 link Kimi
1079 2025-03-26 StableToolBench-MirrorAPI: Modeling Tool Environments as Mirrors of 7,000+ Real-World APIs Zhicheng Guo et.al. 2503.20527 link Kimi
1080 2025-03-26 From Trial to Triumph: Advancing Long Video Understanding via Visual Context Sample Scaling and Self-reward Alignment Yucheng Suo et.al. 2503.20472 null Kimi
1081 2025-03-26 MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation Rongyu Zhang et.al. 2503.20384 null Kimi
1082 2025-03-26 VideoGEM: Training-free Action Grounding in Videos Felix Vogel et.al. 2503.20348 null Kimi
1083 2025-03-26 Iterative Prompting with Persuasion Skills in Jailbreaking Large Language Models Shih-Wen Ke et.al. 2503.20320 null Kimi
1084 2025-03-26 QualiSpeech: A Speech Quality Assessment Dataset with Natural Language Reasoning and Descriptions Siyin Wang et.al. 2503.20290 null Kimi
1085 2025-03-26 sudo rm -rf agentic_security Sejin Lee et.al. 2503.20279 link Kimi
1086 2025-03-26 ViLBench: A Suite for Vision-Language Process Reward Modeling Haoqin Tu et.al. 2503.20271 null Kimi
1087 2025-03-26 Qwen2.5-Omni Technical Report Jin Xu et.al. 2503.20215 null Kimi
1088 2025-03-26 SARGes: Semantically Aligned Reliable Gesture Generation via Intent Chain Nan Gao et.al. 2503.20202 null Kimi
1089 2025-03-26 Open Deep Search: Democratizing Search with Open-source Reasoning Agents Salaheddin Alzubi et.al. 2503.20201 link Kimi
1090 2025-03-25 Can Multi-modal (reasoning) LLMs work as deepfake detectors? Simiao Ren et.al. 2503.20084 null Kimi
1091 2025-03-25 Cross-Tokenizer Distillation via Approximate Likelihood Matching Benjamin Minixhofer et.al. 2503.20083 link Kimi
1092 2025-03-25 OmniNova: A General Multimodal Agent Framework Pengfei Du et.al. 2503.20028 null Kimi
1093 2025-03-25 ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback Bohan Zhai et.al. 2503.19988 link Kimi
1094 2025-03-25 LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation Han Chen et.al. 2503.19950 link Kimi
1095 2025-03-25 CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning Hao Yu et.al. 2503.19900 link Kimi
1096 2025-03-25 Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking Xiaoyu Tian et.al. 2503.19855 null Kimi
1097 2025-03-25 FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs Carlos Plou et.al. 2503.19850 null Kimi
1098 2025-03-25 A Comparative Analysis of Word Segmentation, Part-of-Speech Tagging, and Named Entity Recognition for Historical Chinese Sources, 1900-1950 Zhao Fang et.al. 2503.19844 null Kimi
1099 2025-03-25 PAVE: Patching and Adapting Video Large Language Models Zhuoming Liu et.al. 2503.19794 link Kimi
1100 2025-03-25 Gemma 3 Technical Report Gemma Team et.al. 2503.19786 null Kimi
1101 2025-03-25 AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation Itay Nakash et.al. 2503.19693 link Kimi
1102 2025-03-25 1.4 Million Open-Source Distilled Reasoning Dataset to Empower Large Language Model Training Han Zhao et.al. 2503.19633 null Kimi
1103 2025-03-25 Innate Reasoning is Not Enough: In-Context Learning Enhances Reasoning Large Language Models with Less Overthinking Yuyao Ge et.al. 2503.19602 null Kimi
1104 2025-03-25 Scaling Laws of Synthetic Data for Language Models Zeyu Qin et.al. 2503.19551 null Kimi
1105 2025-03-25 FLEX: A Benchmark for Evaluating Robustness of Fairness in Large Language Models Dahyun Jung et.al. 2503.19540 link Kimi
1106 2025-03-25 ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning Mingyang Chen et.al. 2503.19470 null Kimi
1107 2025-03-25 DeCAP: Context-Adaptive Prompt Generation for Debiasing Zero-shot Question Answering in Large Language Models Suyoung Bae et.al. 2503.19426 null Kimi
1108 2025-03-25 Process or Result? Manipulated Ending Tokens Can Mislead Reasoning LLMs to Ignore the Correct Reasoning Steps Yu Cui et.al. 2503.19326 null Kimi
1109 2025-03-25 Long-Context Autoregressive Video Modeling with Next-Frame Prediction Yuchao Gu et.al. 2503.19325 link Kimi
1110 2025-03-25 Context-Aware Semantic Segmentation: Enhancing Pixel-Level Understanding with Large Language Models for Advanced Vision Applications Ben Rahman et.al. 2503.19276 null Kimi
1111 2025-03-25 MARS: Memory-Enhanced Agents with Reflective Self-improvement Xuechen Liang et.al. 2503.19271 null Kimi
1112 2025-03-25 Linguistic Blind Spots of Large Language Models Jiali Cheng et.al. 2503.19260 null Kimi
1113 2025-03-25 SCI-IDEA: Context-Aware Scientific Ideation Using Token and Sentence Embeddings Farhana Keya et.al. 2503.19257 null Kimi
1114 2025-03-24 A Survey of Large Language Model Agents for Question Answering Murong Yue et.al. 2503.19213 null Kimi
1115 2025-03-24 Overtrained Language Models Are Harder to Fine-Tune Jacob Mitchell Springer et.al. 2503.19206 null Kimi
1116 2025-03-24 Language Model Uncertainty Quantification with Attention Chain Yinghao Li et.al. 2503.19168 link Kimi
1117 2025-03-24 LLM-Based Insight Extraction for Contact Center Analytics and Cost-Efficient Deployment Varsha Embar et.al. 2503.19090 null Kimi
1118 2025-03-24 Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization Zhanda Zhu et.al. 2503.19050 link Kimi
1119 2025-03-24 LookAhead Tuning: Safer Language Models via Partial Answer Previews Kangwei Liu et.al. 2503.19041 link Kimi
1120 2025-03-24 Exploring Training and Inference Scaling Laws in Generative Retrieval Hongru Cai et.al. 2503.18941 link Kimi
1121 2025-03-24 xKV: Cross-Layer SVD for KV-Cache Compression Chi-Chih Chang et.al. 2503.18893 link Kimi
1122 2025-03-24 SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild Weihao Zeng et.al. 2503.18892 null Kimi
1123 2025-03-24 AgentDropout: Dynamic Agent Elimination for Token-Efficient and High-Performance LLM-Based Multi-Agent Collaboration Zhexuan Wang et.al. 2503.18891 link Kimi
1124 2025-03-24 I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders Andrey Galichin et.al. 2503.18878 link Kimi
1125 2025-03-24 EconEvals: Benchmarks and Litmus Tests for LLM Agents in Unknown Environments Sara Fish et.al. 2503.18825 null Kimi
1126 2025-03-24 REALM: A Dataset of Real-World LLM Use Cases Jingwen Cheng et.al. 2503.18792 null Kimi
1127 2025-03-24 BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV Cache Dayou Du et.al. 2503.18773 link Kimi
1128 2025-03-24 AlphaSpace: Enabling Robotic Actions through Semantic Tokenization and Symbolic Reasoning Alan Dao et.al. 2503.18769 null Kimi
1129 2025-03-24 Commander-GPT: Fully Unleashing the Sarcasm Detection Capability of Multi-Modal Large Language Models Yazhou Zhang et.al. 2503.18681 null Kimi
1130 2025-03-24 Distil-xLSTM: Learning Attention Mechanisms through Recurrent Structures Abdoul Majid O. Thiombiano et.al. 2503.18565 null Kimi
1131 2025-03-24 Self-Reported Confidence of Large Language Models in Gastroenterology: Analysis of Commercial, Open-Source, and Quantized Models Nariman Naderi et.al. 2503.18562 null Kimi
1132 2025-03-24 Instruction-Aligned Visual Attention for Mitigating Hallucinations in Large Vision-Language Models Bin Li et.al. 2503.18556 null Kimi
1133 2025-03-24 SciClaims: An End-to-End Generative System for Biomedical Claim Analysis Raúl Ortega et.al. 2503.18526 null Kimi
1134 2025-03-24 Verbal Process Supervision Elicits Better Coding Agents Hao-Yuan Chen et.al. 2503.18494 null Kimi
1135 2025-03-24 Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding Xiangrui Liu et.al. 2503.18478 null Kimi
1136 2025-03-24 A Simple yet Effective Layout Token in Large Language Models for Document Understanding Zhaoqing Zhu et.al. 2503.18434 null Kimi
1137 2025-03-24 Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning Junsong Li et.al. 2503.18432 null Kimi
1138 2025-03-24 Breaking the Encoder Barrier for Seamless Video-Language Understanding Handong Li et.al. 2503.18422 null Kimi
1139 2025-03-24 J&H: Evaluating the Robustness of Large Language Models Under Knowledge-Injection Attacks in Legal Domain Yiran Hu et.al. 2503.18360 link Kimi
1140 2025-03-24 Bridging Writing Manner Gap in Visual Instruction Tuning by Creating LLM-aligned Instructions Dong Jing et.al. 2503.18320 null Kimi
1141 2025-03-24 Jenga: Effective Memory Management for Serving LLM with Heterogeneity Chen Zhang et.al. 2503.18292 null Kimi
1142 2025-03-24 Sun-Shine: A Large Language Model for Tibetan Culture Cheng Huang et.al. 2503.18288 link Kimi
1143 2025-03-24 TopV: Compatible Token Pruning with Inference Time Optimization for Fast and Low-Memory Multimodal Vision Language Model Cheng Yang et.al. 2503.18278 null Kimi
1144 2025-03-24 Bridging Emotions and Architecture: Sentiment Analysis in Modern Distributed Systems Mahak Shah et.al. 2503.18260 null Kimi
1145 2025-03-23 ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices Aneesh Vathul et.al. 2503.18242 null Kimi
1146 2025-03-23 Unmasking Deceptive Visuals: Benchmarking Multimodal Large Language Models on Misleading Chart Question Answering Zixin Chen et.al. 2503.18172 null Kimi
1147 2025-03-23 LocDiffusion: Identifying Locations on Earth by Diffusing in the Hilbert Space Zhangyu Wang et.al. 2503.18142 null Kimi
1148 2025-03-23 AGIR: Assessing 3D Gait Impairment with Reasoning based on LLMs Diwei Wang et.al. 2503.18141 null Kimi
1149 2025-03-23 GeoBenchX: Benchmarking LLMs for Multistep Geospatial Tasks Varvara Krechetova et.al. 2503.18129 link Kimi
1150 2025-03-20 Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation Yuqing Wang et.al. 2503.16430 null Kimi
1151 2025-03-20 XAttention: Block Sparse Attention with Antidiagonal Scoring Ruyi Xu et.al. 2503.16428 link Kimi
1152 2025-03-20 DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding Keyan Chen et.al. 2503.16426 link Kimi
1153 2025-03-20 Tokenize Image as a Set Zigang Geng et.al. 2503.16425 link Kimi
1154 2025-03-20 1000+ FPS 4D Gaussian Splatting for Dynamic Scene Rendering Yuheng Yuan et.al. 2503.16422 null Kimi
1155 2025-03-20 Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models Yang Sui et.al. 2503.16419 link Kimi
1156 2025-03-20 Survey on Evaluation of LLM-based Agents Asaf Yehudai et.al. 2503.16416 null Kimi
1157 2025-03-20 M3: 3D-Spatial MultiModal Memory Xueyan Zou et.al. 2503.16413 link Kimi
1158 2025-03-20 RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints Yiran Qin et.al. 2503.16408 null Kimi
1159 2025-03-20 The Emperor’s New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination Yifan Sun et.al. 2503.16402 link Kimi
1160 2025-03-20 SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation Chun-Han Yao et.al. 2503.16396 null Kimi
1161 2025-03-20 Do Visual Imaginations Improve Vision-and-Language Navigation Agents? Akhil Perincherry et.al. 2503.16394 null Kimi
1162 2025-03-20 Attentional Triple-Encoder Network in Spatiospectral Domains for Medical Image Segmentation Kristin Qi et.al. 2503.16389 null Kimi
1163 2025-03-20 Deconstructing Long Chain-of-Thought: A Structured Reasoning Optimization Framework for Long CoT Distillation Yijia Luo et.al. 2503.16385 link Kimi
1164 2025-03-20 LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images Leyang Wang et.al. 2503.16376 null Kimi
1165 2025-03-20 NuiScene: Exploring Efficient Generation of Unbounded Outdoor Scenes Han-Hung Lee et.al. 2503.16375 link Kimi
1166 2025-03-20 JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse Muyao Li et.al. 2503.16365 null Kimi
1167 2025-03-20 Neural Networks: According to the Principles of Grassmann Algebra Z. Zarezadeh et.al. 2503.16364 null Kimi
1168 2025-03-20 CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners Yunzhi Yao et.al. 2503.16356 link Kimi
1169 2025-03-20 Enhancing Software Quality Assurance with an Adaptive Differential Evolution based Quantum Variational Autoencoder-Transformer Model Seshu Babu Barma et.al. 2503.16335 null Kimi
1170 2025-03-20 LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates Ying Shen et.al. 2503.16334 null Kimi
1171 2025-03-20 OmniGeo: Towards a Multimodal Large Language Models for Geospatial Artificial Intelligence Long Yuan et.al. 2503.16326 null Kimi
1172 2025-03-20 Bridging Technology and Humanities: Evaluating the Impact of Large Language Models on Social Sciences Research with DeepSeek-R1 Peiran Gu et.al. 2503.16304 null Kimi
1173 2025-03-20 Unleashing Vecset Diffusion Model for Fast Shape Generation Zeqiang Lai et.al. 2503.16302 link Kimi
1174 2025-03-20 PSA-MIL: A Probabilistic Spatial Attention-Based Multiple Instance Learning for Whole Slide Image Classification Sharon Peled et.al. 2503.16284 link Kimi
1175 2025-03-20 Chain of Functions: A Programmatic Pipeline for Fine-Grained Chart Reasoning Data Zijian Li et.al. 2503.16260 null Kimi
1176 2025-03-20 Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models Keda Tao et.al. 2503.16257 null Kimi
1177 2025-03-20 M2N2V2: Multi-Modal Unsupervised and Training-free Interactive Segmentation Markus Karmann et.al. 2503.16254 null Kimi
1178 2025-03-20 Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning Zhaowei Liu et.al. 2503.16252 link Kimi
1179 2025-03-20 AI Agents in Cryptoland: Practical Attacks and No Silver Bullet Atharv Singh Patlan et.al. 2503.16248 null Kimi
1180 2025-03-20 Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t Quy-Anh Dang et.al. 2503.16219 link Kimi
1181 2025-03-20 Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation Andrea Maracani et.al. 2503.16184 null Kimi
1182 2025-03-20 SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs Shibo Jie et.al. 2503.16163 null Kimi
1183 2025-03-20 Tuning LLMs by RAG Principles: Towards LLM-native Memory Jiale Wei et.al. 2503.16071 link Kimi
1184 2025-03-20 PromptHash: Affinity-Prompted Collaborative Cross-Modal Learning for Adaptive Hashing Retrieval Qiang Zou et.al. 2503.16064 link Kimi
1185 2025-03-20 Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts Yike Yuan et.al. 2503.16057 null Kimi
1186 2025-03-20 Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond Yaoyao Yu et.al. 2503.16040 null Kimi
1187 2025-03-20 Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models Zhihang Liu et.al. 2503.16036 link Kimi
1188 2025-03-20 The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement Ruihan Yang et.al. 2503.16024 null Kimi
1189 2025-03-20 Autonomous AI imitators increase diversity in homogeneous information ecosystems Emil Bakkensen Johansen et.al. 2503.16021 null Kimi
1190 2025-03-20 GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions Xiaomeng Chu et.al. 2503.16013 null Kimi
1191 2025-03-20 Adaptive Group Policy Optimization: Towards Stable Training and Token-Efficient Reasoning Chen Li et.al. 2503.15952 null Kimi
1192 2025-03-20 Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment Gaole Dai et.al. 2503.15937 null Kimi
1193 2025-03-20 SPIN: Accelerating Large Language Model Inference with Heterogeneous Speculative Models Fahao Chen et.al. 2503.15921 null Kimi
1194 2025-03-20 DeepPsy-Agent: A Stage-Aware and Deep-Thinking Emotional Support Agent System Kai Chen et.al. 2503.15876 null Kimi
1195 2025-03-20 MASH-VLM: Mitigating Action-Scene Hallucination in Video-LLMs through Disentangled Spatial-Temporal Representations Kyungho Bae et.al. 2503.15871 null Kimi
1196 2025-03-20 Uncertainty Quantification and Confidence Calibration in Large Language Models: A Survey Xiaoou Liu et.al. 2503.15850 null Kimi
1197 2025-03-20 Entropy-based Exploration Conduction for Multi-step Reasoning Jinghan Zhang et.al. 2503.15848 null Kimi
1198 2025-03-20 Grammar and Gameplay-aligned RL for Game Description Generation with LLMs Tsunehiko Tanaka et.al. 2503.15783 null Kimi
1199 2025-03-19 UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction Shravan Nayak et.al. 2503.15661 null Kimi
1200 2025-03-19 LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning Federico Cocchi et.al. 2503.15621 link Kimi
1201 2025-03-19 Dynamic Bi-Elman Attention Networks (DBEAN): Dual-Directional Context-Aware Representation Learning for Enhanced Text Classification ZhengLin Lai et.al. 2503.15469 link Kimi
1202 2025-03-19 SemEval-2025 Task 1: AdMIRe – Advancing Multimodal Idiomaticity Representation Thomas Pickard et.al. 2503.15358 null Kimi
1203 2025-03-19 MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration David Wan et.al. 2503.15272 null Kimi
1204 2025-03-19 Do Chains-of-Thoughts of Large Language Models Suffer from Hallucinations, Cognitive Biases, or Phobias in Bayesian Reasoning? Roberto Araya et.al. 2503.15268 null Kimi
1205 2025-03-19 Efficient allocation of image recognition and LLM tasks on multi-GPU system Marcin Lawenda et.al. 2503.15252 null Kimi
1206 2025-03-19 Automated Non-Functional Requirements Generation in Software Engineering with Large Language Models: A Comparative Study Jomar Thomas Almonte et.al. 2503.15248 null Kimi
1207 2025-03-19 BigO(Bench) – Can LLMs Generate Code with Controlled Time and Space Complexity? Pierre Chambon et.al. 2503.15242 link Kimi
1208 2025-03-19 Exploring Large Language Models for Word Games: Who is the Spy? Chentian Wei et.al. 2503.15235 link Kimi
1209 2025-03-19 CoE: Chain-of-Explanation via Automatic Visual Concept Circuit Description and Polysemanticity Quantification Wenlong Yu et.al. 2503.15234 link Kimi
1210 2025-03-19 A Review on Large Language Models for Visual Analytics Navya Sonal Agarwal et.al. 2503.15176 null Kimi
1211 2025-03-19 Machine Unlearning in Hyperbolic vs. Euclidean Multimodal Contrastive Learning: Adapting Alignment Calibration to MERU Àlex Pujol Vidal et.al. 2503.15166 null Kimi
1212 2025-03-19 VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making Mohamed Salim Aissi et.al. 2503.15108 null Kimi
1213 2025-03-19 Towards Understanding the Safety Boundaries of DeepSeek Models: Evaluation and Findings Zonghao Ying et.al. 2503.15092 link Kimi
1214 2025-03-19 Prada: Black-Box LLM Adaptation with Private Data on Resource-Constrained Devices Ziyao Wang et.al. 2503.14932 null Kimi
1215 2025-03-19 MASS: Mathematical Data Selection via Skill Graphs for Pretraining Large Language Models Jiazheng Li et.al. 2503.14917 null Kimi
1216 2025-03-19 Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations Shuo Li et.al. 2503.14895 null Kimi
1217 2025-03-19 MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer Honglin Lin et.al. 2503.14891 link Kimi
1218 2025-03-19 Communication-Efficient Distributed On-Device LLM Inference Over Wireless Networks Kai Zhang et.al. 2503.14882 null Kimi
1219 2025-03-19 Exploring the Limits of KV Cache Compression in Visual Autoregressive Transformers Bo Chen et.al. 2503.14881 null Kimi
1220 2025-03-19 LogLLaMA: Transformer-based log anomaly detection with LLaMA Zhuoyi Yang et.al. 2503.14849 null Kimi
1221 2025-03-18 RAGO: Systematic Performance Optimization for Retrieval-Augmented Generation Serving Wenqi Jiang et.al. 2503.14649 null Kimi
1222 2025-03-18 Dynamic Accumulated Attention Map for Interpreting Evolution of Decision-Making in Vision Transformer Yi Liao et.al. 2503.14640 null Kimi
1223 2025-03-18 Assessing Large Language Models for Automated Feedback Generation in Learning Programming Problem Solving Priscylla Silva et.al. 2503.14630 link Kimi
1224 2025-03-18 Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives Sara Sarto et.al. 2503.14604 null Kimi
1225 2025-03-19 State Space Model Meets Transformer: A New Paradigm for 3D Object Detection Chuxin Wang et.al. 2503.14493 null Kimi
1226 2025-03-18 DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers Minglei Shi et.al. 2503.14487 null Kimi
1227 2025-03-18 Gricean Norms as a Basis for Effective Collaboration Fardin Saad et.al. 2503.14484 link Kimi
1228 2025-03-18 LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers Nikhil Abhyankar et.al. 2503.14434 link Kimi
1229 2025-03-18 PLAY2PROMPT: Zero-shot Tool Instruction Optimization for LLM Agents via Tool Play Wei Fang et.al. 2503.14432 null Kimi
1230 2025-03-18 VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation Shoubin Yu et.al. 2503.14350 null Kimi
1231 2025-03-18 DualToken: Towards Unifying Visual Understanding and Generation with Dual Visual Vocabularies Wei Song et.al. 2503.14324 link Kimi
1232 2025-03-18 DARS: Dynamic Action Re-Sampling to Enhance Coding Agent Performance by Adaptive Tree Traversal Vaibhav Aggarwal et.al. 2503.14269 link Kimi
1233 2025-03-18 Speculative Decoding for Verilog: Speed and Quality, All in One Changran Xu et.al. 2503.14153 null Kimi
1234 2025-03-18 Inference-Time Intervention in Large Language Models for Reliable Requirement Verification Paul Darm et.al. 2503.14130 null Kimi
1235 2025-03-18 Growing a Twig to Accelerate Large Vision-Language Models Zhenwei Shao et.al. 2503.14075 null Kimi
1236 2025-03-18 Fast Autoregressive Video Generation with Diagonal Decoding Yang Ye et.al. 2503.14070 null Kimi
1237 2025-03-18 Empowering Smaller Models: Tuning LLaMA and Gemma with Chain-of-Thought for Ukrainian Exam Tasks Mykyta Syromiatnikov et.al. 2503.13988 link Kimi
1238 2025-03-18 Improving LLM Video Understanding with 16 Frames Per Second Yixuan Li et.al. 2503.13956 null Kimi
1239 2025-03-18 ConSCompF: Consistency-focused Similarity Comparison Framework for Generative Large Language Models Alexey Karev et.al. 2503.13923 null Kimi
1240 2025-03-18 Automatic MILP Model Construction for Multi-Robot Task Allocation and Scheduling Based on Large Language Models Mingming Peng et.al. 2503.13813 null Kimi
1241 2025-03-18 LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation Yang Zhou et.al. 2503.13794 null Kimi
1242 2025-03-17 Mitigating KV Cache Competition to Enhance User Experience in LLM Inference Haiying Shen et.al. 2503.13773 null Kimi
1243 2025-03-17 Do Large Language Models Understand Performance Optimization? Bowen Cui et.al. 2503.13772 null Kimi
1244 2025-03-17 MetaScale: Test-Time Scaling with Evolving Meta-Thoughts Qin Liu et.al. 2503.13447 null Kimi
1245 2025-03-17 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning Ye Liu et.al. 2503.13444 link Kimi
1246 2025-03-17 xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference Maximilian Beck et.al. 2503.13427 link Kimi
1247 2025-03-17 MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research James Burgess et.al. 2503.13399 link Kimi
1248 2025-03-17 Cream of the Crop: Harvesting Rich, Scalable and Transferable Multi-Modal Data for Instruction Fine-Tuning Mengyao Lyu et.al. 2503.13383 null Kimi
1249 2025-03-17 TimeZero: Temporal Video Grounding with Reasoning-Guided LVLM Ye Wang et.al. 2503.13377 link Kimi
1250 2025-03-17 Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning Hai-Long Sun et.al. 2503.13360 null Kimi
1251 2025-03-17 Computation Mechanism Behind LLM Position Generalization Chi Han et.al. 2503.13305 null Kimi
1252 2025-03-17 A Survey on Transformer Context Extension: Approaches and Evaluation Yijun Liu et.al. 2503.13299 null Kimi
1253 2025-03-17 $φ$-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation Fangzhi Xu et.al. 2503.13288 link Kimi
1254 2025-03-17 Knowledge-Aware Iterative Retrieval for Multi-Agent Systems Seyoung Song et.al. 2503.13275 null Kimi
1255 2025-03-17 Can Language Models Follow Multiple Turns of Entangled Instructions? Chi Han et.al. 2503.13222 link Kimi
1256 2025-03-17 Improving Complex Reasoning with Dynamic Prompt Corruption: A soft prompt Optimization Approach Sinan Fan et.al. 2503.13208 null Kimi
1257 2025-03-17 MAP: Evaluation and Multi-Agent Enhancement of Large Language Models for Inpatient Pathways Zhen Chen et.al. 2503.13205 null Kimi
1258 2025-03-17 Are LLMs (Really) Ideological? An IRT-based Analysis and Alignment Tool for Perceived Socio-Economic Bias in LLMs Jasmin Wachter et.al. 2503.13149 null Kimi
1259 2025-03-17 Logic-in-Frames: Dynamic Keyframe Search via Visual Semantic-Logical Verification for Long Video Understanding Weiyu Guo et.al. 2503.13139 null Kimi
1260 2025-03-17 Lifting the Veil on Visual Information Flow in MLLMs: Unlocking Pathways to Faster Inference Hao Yin et.al. 2503.13108 link Kimi
1261 2025-03-17 ClearSight: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Models Hao Yin et.al. 2503.13107 link Kimi
1262 2025-03-17 A Multi-Stage Framework with Taxonomy-Guided Reasoning for Occupation Classification Using Large Language Models Palakorn Achananuparp et.al. 2503.12989 null Kimi
1263 2025-03-17 ROMA: a Read-Only-Memory-based Accelerator for QLoRA-based On-Device LLM Wenqiang Wang et.al. 2503.12988 null Kimi
1264 2025-03-17 R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization Jingyi Zhang et.al. 2503.12937 link Kimi
1265 2025-03-17 HICD: Hallucination-Inducing via Attention Dispersion for Contrastive Decoding to Mitigate Hallucinations in Large Language Models Xinyan Jiang et.al. 2503.12908 null Kimi
1266 2025-03-17 VITED: Video Temporal Evidence Distillation Yujie Lu et.al. 2503.12855 null Kimi
1267 2025-03-17 ACT360: An Efficient 360-Degree Action Detection and Summarization Framework for Mission-Critical Training and Debriefing Aditi Tiwari et.al. 2503.12852 null Kimi
1268 2025-03-17 Grounded Chain-of-Thought for Multimodal Large Language Models Qiong Wu et.al. 2503.12799 link Kimi
1269 2025-03-17 DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding Xinyu Ma et.al. 2503.12797 link Kimi
1270 2025-03-17 Identifying Cooperative Personalities in Multi-agent Contexts through Personality Steering with Representation Engineering Kenneth J. K. Ong et.al. 2503.12722 null Kimi
1271 2025-03-17 Can Reasoning Models Reason about Hardware? An Agentic HLS Perspective Luca Collini et.al. 2503.12721 null Kimi
1272 2025-03-16 Plausibility Vaccine: Injecting LLM Knowledge for Event Plausibility Jacob Chmura et.al. 2503.12667 null Kimi
1273 2025-03-16 VeriLA: A Human-Centered Evaluation Framework for Interpretable Verification of LLM Agent Failures Yoo Yeon Sung et.al. 2503.12651 null Kimi
1274 2025-03-16 MAVEN: Multi-modal Attention for Valence-Arousal Emotion Network Vrushank Ahire et.al. 2503.12623 link Kimi
1275 2025-03-16 MoECollab: Democratizing LLM Development Through Collaborative Mixture of Experts Harshit et.al. 2503.12592 null Kimi
1276 2025-03-16 AdaReTaKe: Adaptive Redundancy Reduction to Perceive Longer for Video-language Understanding Xiao Wang et.al. 2503.12559 link Kimi
1277 2025-03-14 TreeMeshGPT: Artistic Mesh Generation with Autoregressive Tree Sequencing Stefan Lionar et.al. 2503.11629 link Kimi
1278 2025-03-14 ASMA-Tune: Unlocking LLMs’ Assembly Code Comprehension via Structural-Semantic Instruction Tuning Xinyi Wang et.al. 2503.11617 link Kimi
1279 2025-03-14 Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs using Semantic Space Zhiliang Chen et.al. 2503.11586 link Kimi
1280 2025-03-14 Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers Weiming Ren et.al. 2503.11579 null Kimi
1281 2025-03-14 Implicit Bias-Like Patterns in Reasoning Models Messi H. J. Lee et.al. 2503.11572 null Kimi
1282 2025-03-14 Similarity-Aware Token Pruning: Your VLM but Faster Ahmadreza Jeddi et.al. 2503.11549 link Kimi
1283 2025-03-14 HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models Ziqin Zhou et.al. 2503.11513 null Kimi
1284 2025-03-14 V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning Zixu Cheng et.al. 2503.11495 null Kimi
1285 2025-03-14 Integrating LLMs in Gamified Systems Carlos J. Costa et.al. 2503.11458 null Kimi
1286 2025-03-14 Cerebrum (AIOS SDK): A Platform for Agent Development, Deployment, Distribution, and Discovery Balaji Rama et.al. 2503.11444 link Kimi
1287 2025-03-14 Text Compression for Efficient Language Generation David Gu et.al. 2503.11426 null Kimi
1288 2025-03-14 Optimizing Large Language Models for Detecting Symptoms of Comorbid Depression or Anxiety in Chronic Diseases: Insights from Patient Messages Jiyeong Kim et.al. 2503.11384 null Kimi
1289 2025-03-14 Annotating Scientific Uncertainty: A comprehensive model using linguistic patterns and comparison with existing approaches Panggih Kusuma Ningrum et.al. 2503.11376 null Kimi
1290 2025-03-14 AIstorian lets AI be a historian: A KG-powered multi-agent system for accurate biography generation Fengyu Li et.al. 2503.11346 link Kimi
1291 2025-03-14 Rule-Guided Feedback: Enhancing Reasoning by Enforcing Rule Adherence in Large Language Models Aissatou Diallo et.al. 2503.11336 null Kimi
1292 2025-03-14 Safe-VAR: Safe Visual Autoregressive Model for Text-to-Image Generative Watermarking Ziyi Wang et.al. 2503.11324 null Kimi
1293 2025-03-14 MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens Jeong Hun Yeo et.al. 2503.11315 link Kimi
1294 2025-03-14 Unlocking General Long Chain-of-Thought Reasoning Capabilities of Large Language Models via Representation Engineering Xinyu Tang et.al. 2503.11314 link Kimi
1295 2025-03-14 BriLLM: Brain-inspired Large Language Model Hai Zhao et.al. 2503.11299 null Kimi
1296 2025-03-14 Line of Duty: Evaluating LLM Self-Knowledge via Consistency in Feasibility Boundaries Sahil Kale et.al. 2503.11256 link Kimi
1297 2025-03-14 Reasoning-Grounded Natural Language Explanations for Language Models Vojtech Cahlik et.al. 2503.11248 link Kimi
1298 2025-03-14 Can Large Reasoning Models do Analogical Reasoning under Perceptual Uncertainty? Giacomo Camposampiero et.al. 2503.11207 link Kimi
1299 2025-03-14 LLaVA-MLB: Mitigating and Leveraging Attention Bias for Training-Free Video LLMs Leqi Shen et.al. 2503.11205 null Kimi
1300 2025-03-14 Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering Gang Li et.al. 2503.11197 link Kimi
1301 2025-03-14 FastVID: Dynamic Density Pruning for Fast Video Large Language Models Leqi Shen et.al. 2503.11187 link Kimi
1302 2025-03-14 Towards Extreme Pruning of LLMs with Plug-and-Play Mixed Sparsity Chi Xu et.al. 2503.11164 null Kimi
1303 2025-03-14 Don’t Take Things Out of Context: Attention Intervention for Enhancing Chain-of-Thought Reasoning in Large Language Models Shaotian Yan et.al. 2503.11154 null Kimi
1304 2025-03-14 MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling Rachel S. Y. Teo et.al. 2503.11144 link Kimi
1305 2025-03-14 X-EcoMLA: Upcycling Pre-Trained Attention into MLA for Efficient and Extreme KV Compression Guihong Li et.al. 2503.11132 null Kimi
1306 2025-03-14 Direction-Aware Diagonal Autoregressive Image Generation Yijia Xu et.al. 2503.11129 null Kimi
1307 2025-03-13 GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing Rongyao Fang et.al. 2503.10639 link Kimi
1308 2025-03-13 Kolmogorov-Arnold Attention: Is Learnable Attention Better For Vision Transformers? Subhajit Maity et.al. 2503.10632 null Kimi
1309 2025-03-13 SciVerse: Unveiling the Knowledge Comprehension and Visual Reasoning of LMMs on Multi-modal Scientific Problems Ziyu Guo et.al. 2503.10627 null Kimi
1310 2025-03-13 Transformers without Normalization Jiachen Zhu et.al. 2503.10622 null Kimi
1311 2025-03-13 Siege: Autonomous Multi-Turn Jailbreaking of Large Language Models with Tree Search Andy Zhou et.al. 2503.10619 null Kimi
1312 2025-03-13 Compositional Subspace Representation Fine-tuning for Adaptive Large Language Models Andy Zhou et.al. 2503.10617 null Kimi
1313 2025-03-13 TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention Jinhao Duan et.al. 2503.10602 link Kimi
1314 2025-03-13 Long Context Tuning for Video Generation Yuwei Guo et.al. 2503.10589 null Kimi
1315 2025-03-13 Autoregressive Image Generation with Randomized Parallel Decoding Haopeng Li et.al. 2503.10568 link Kimi
1316 2025-03-13 AudioX: Diffusion Transformer for Anything-to-Audio Generation Zeyue Tian et.al. 2503.10522 null Kimi
1317 2025-03-13 TokenCarve: Information-Preserving Visual Token Compression in Multimodal Large Language Models Xudong Tan et.al. 2503.10501 link Kimi
1318 2025-03-13 MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation Weihao Xuan et.al. 2503.10497 null Kimi
1319 2025-03-13 Source-primed Multi-turn Conversation Helps Large Language Models Translate Documents Hanxu Hu et.al. 2503.10494 link Kimi
1320 2025-03-13 LLMs in Disease Diagnosis: A Comparative Study of DeepSeek-R1 and O3 Mini Across Chronic Health Conditions Gaurav Kumar Gupta et.al. 2503.10486 null Kimi
1321 2025-03-13 DynaCode: A Dynamic Complexity-Aware Code Benchmark for Evaluating Large Language Models in Code Generation Wenhao Hu et.al. 2503.10452 null Kimi
1322 2025-03-13 4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models Wanhua Li et.al. 2503.10437 link Kimi
1323 2025-03-13 BeamLLM: Vision-Empowered mmWave Beam Prediction with Large Language Models Can Zheng et.al. 2503.10432 null Kimi
1324 2025-03-13 Understanding the Logical Capabilities of Large Language Models via Out-of-Context Representation Learning Jonathan Shaki et.al. 2503.10408 null Kimi
1325 2025-03-13 SPPO: Efficient Long-sequence LLM Training via Adaptive Sequence Pipeline Parallel Offloading Qiaoling Chen et.al. 2503.10377 null Kimi
1326 2025-03-13 G-Boost: Boosting Private SLMs with General LLMs Yijiang Fan et.al. 2503.10367 null Kimi
1327 2025-03-13 KV-Distill: Nearly Lossless Learnable Context Compression for LLMs Vivek Chari et.al. 2503.10337 null Kimi
1328 2025-03-13 Collaborative Speculative Inference for Efficient LLM Inference Serving Luyao Gao et.al. 2503.10325 null Kimi
1329 2025-03-13 VisualPRM: An Effective Process Reward Model for Multimodal Reasoning Weiyun Wang et.al. 2503.10291 null Kimi
1330 2025-03-13 Efficient Federated Fine-Tuning of Large Language Models with Layer Dropout Shilong Wang et.al. 2503.10217 null Kimi
1331 2025-03-13 LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents Boyu Chen et.al. 2503.10200 null Kimi
1332 2025-03-13 Robustness Tokens: Towards Adversarial Robustness of Transformers Brian Pulfer et.al. 2503.10191 link Kimi
1333 2025-03-13 Through the Magnifying Glass: Adaptive Perception Magnification for Hallucination-Free VLM Decoding Shunqi Mao et.al. 2503.10183 null Kimi
1334 2025-03-13 “Well, Keep Thinking”: Enhancing LLM Reasoning with Adaptive Injection Decoding Hyunbin Jin et.al. 2503.10167 null Kimi
1335 2025-03-13 ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning Pengfei Luo et.al. 2503.10166 link Kimi
1336 2025-03-13 Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding Jinze Li et.al. 2503.10135 null Kimi
1337 2025-03-11 QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension Yongdong Luo et.al. 2503.08689 link Kimi
1338 2025-03-11 CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous Driving Changxing Liu et.al. 2503.08683 link Kimi
1339 2025-03-11 Chain-of-Thought Reasoning In The Wild Is Not Always Faithful Iván Arcuschin et.al. 2503.08679 link Kimi
1340 2025-03-11 REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder Yitian Zhang et.al. 2503.08665 null Kimi
1341 2025-03-11 MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention Yuhan Wang et.al. 2503.08664 link Kimi
1342 2025-03-11 Exploring the Word Sense Disambiguation Capabilities of Large Language Models Pierpaolo Basile et.al. 2503.08662 null Kimi
1343 2025-03-11 Efficient Many-Shot In-Context Learning with Dynamic Block-Sparse Attention Emily Xiao et.al. 2503.08640 link Kimi
1344 2025-03-11 HiP-AD: Hierarchical and Multi-Granularity Planning with Deformable Attention for Autonomous Driving in a Single Decoder Yingqi Tang et.al. 2503.08612 link Kimi
1345 2025-03-11 Vision Transformer for Intracranial Hemorrhage Classification in CT Scans Using an Entropy-Aware Fuzzy Integral Strategy for Adaptive Scan-Level Decision Fusion Mehdi Hosseini Chagahi et.al. 2503.08609 null Kimi
1346 2025-03-11 Tuning-Free Multi-Event Long Video Generation via Synchronized Coupled Sampling Subin Kim et.al. 2503.08605 null Kimi
1347 2025-03-11 RAG-Adapter: A Plug-and-Play RAG-enhanced Framework for Long Video Understanding Xichen Tan et.al. 2503.08576 null Kimi
1348 2025-03-11 DeepReview: Improving LLM-based Paper Review with Human-like Deep Thinking Process Minjun Zhu et.al. 2503.08569 null Kimi
1349 2025-03-11 MoE-Loco: Mixture of Experts for Multitask Locomotion Runhan Huang et.al. 2503.08564 null Kimi
1350 2025-03-11 Reasoning and Sampling-Augmented MCQ Difficulty Prediction via LLMs Wanyong Feng et.al. 2503.08551 null Kimi
1351 2025-03-11 Graph of AI Ideas: Leveraging Knowledge Graphs and LLMs for AI Research Idea Generation Xian Gao et.al. 2503.08549 null Kimi
1352 2025-03-11 DAFE: LLM-Based Evaluation Through Dynamic Arbitration for Free-Form Question-Answering Sher Badshah et.al. 2503.08542 null Kimi
1353 2025-03-11 Mellow: a small audio language model for reasoning Soham Deshmukh et.al. 2503.08540 link Kimi
1354 2025-03-11 Chemical reasoning in LLMs unlocks steerable synthesis planning and reaction mechanism elucidation Andres M Bran et.al. 2503.08537 link Kimi
1355 2025-03-11 ChromaFormer: A Scalable and Accurate Transformer Architecture for Land Cover Classification Mingshi Li et.al. 2503.08534 null Kimi
1356 2025-03-11 Visual Attention Graph Kai-Fu Yang et.al. 2503.08531 null Kimi
1357 2025-03-11 Position-Aware Depth Decay Decoding ($D^3$): Boosting Large Language Model Inference Efficiency Siqi Fan et.al. 2503.08524 null Kimi
1358 2025-03-11 Enhancing Multi-Hop Fact Verification with Structured Knowledge-Augmented Large Language Models Han Cao et.al. 2503.08495 null Kimi
1359 2025-03-11 Accelerating MoE Model Inference with Expert Sharding Oana Balmau et.al. 2503.08467 null Kimi
1360 2025-03-11 FastCache: Optimizing Multimodal LLM Serving through Lightweight KV-Cache Compression Framework Jianian Zhu et.al. 2503.08461 null Kimi
1361 2025-03-11 Controlling Latent Diffusion Using Latent CLIP Jason Becker et.al. 2503.08455 link Kimi
1362 2025-03-11 TokenSim: Enabling Hardware and Software Exploration for Large Language Model Inference Systems Feiyang Wu et.al. 2503.08415 link Kimi
1363 2025-03-11 Fact-checking with Generative AI: A Systematic Cross-Topic Examination of LLMs Capacity to Detect Veracity of Political Information Elizaveta Kuznetsova et.al. 2503.08404 null Kimi
1364 2025-03-11 Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens Qingsong Xie et.al. 2503.08377 null Kimi
1365 2025-03-11 Robust Latent Matters: Boosting Image Generation with Sampling Error Kai Qiu et.al. 2503.08354 link Kimi
1366 2025-03-11 Attention Reallocation: Towards Zero-cost and Controllable Hallucination Mitigation of MLLMs Chongjun Tu et.al. 2503.08342 null Kimi
1367 2025-03-10 Securing External Deeper-than-black-box GPAI Evaluations Alejandro Tlaie et.al. 2503.07496 null Kimi
1368 2025-03-10 V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation Guiwei Zhang et.al. 2503.07493 link Kimi
1369 2025-03-10 Destination Calculus: A Linear λ-Calculus for Purely Functional Memory Writes Thomas Bagrel et.al. 2503.07489 link Kimi
1370 2025-03-10 LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition? Bangyan Li et.al. 2503.07487 null Kimi
1371 2025-03-10 Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction Zongzheng Zhang et.al. 2503.07485 link Kimi
1372 2025-03-10 VLRMBench: A Comprehensive and Challenging Benchmark for Vision-Language Reward Models Jiacheng Ruan et.al. 2503.07478 link Kimi
1373 2025-03-10 Petri Net Modeling of Root Hair Response to Phosphate Starvation in Arabidopsis Thaliana Amber H. B. Fijn et.al. 2503.07477 null Kimi
1374 2025-03-10 MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning Xiangru Tang et.al. 2503.07459 link Kimi
1375 2025-03-10 Open-Set Gait Recognition from Sparse mmWave Radar Point Clouds Riccardo Mazzieri et.al. 2503.07435 link Kimi
1376 2025-03-10 DRESS: Diffusion Reasoning-based Reward Shaping Scheme For Intelligent Networks Feiran You et.al. 2503.07433 link Kimi
1377 2025-03-10 CATPlan: Loss-based Collision Prediction in End-to-End Autonomous Driving Ziliang Xiong et.al. 2503.07425 null Kimi
1378 2025-03-10 Inorganic Catalyst Efficiency Prediction Based on EAPCR Model: A Deep Learning Solution for Multi-Source Heterogeneous Data Zhangdi Liu et.al. 2503.07424 null Kimi
1379 2025-03-10 AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion Mingzhen Sun et.al. 2503.07418 null Kimi
1380 2025-03-07 Task-oriented Uncertainty Collaborative Learning for Label-Efficient Brain Tumor Segmentation Zhenxuan Zhang et.al. 2503.05682 link Kimi
1381 2025-03-07 The latent variable proximal point algorithm for variational problems with inequality constraints Jørgen S. Dokken et.al. 2503.05672 link Kimi
1382 2025-03-07 Kinodynamic Model Predictive Control for Energy Efficient Locomotion of Legged Robots with Parallel Elasticity Yulun Zhuang et.al. 2503.05666 null Kimi
1383 2025-03-07 A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval Yu Zhang et.al. 2503.05659 link Kimi
1384 2025-03-07 Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning Justin Chih-Yao Chen et.al. 2503.05641 null Kimi
1385 2025-03-07 Exploring FMCW Radars and Feature Maps for Activity Recognition: A Benchmark Study Ali Samimi Fard et.al. 2503.05629 null Kimi
1386 2025-03-07 FMT: A Multimodal Pneumonia Detection Model Based on Stacking MOE Framework Jingyu Xu et.al. 2503.05626 null Kimi
1387 2025-03-07 A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models Dong Shu et.al. 2503.05613 null Kimi
1388 2025-03-07 D2GV: Deformable 2D Gaussian Splatting for Video Representation in 400FPS Mufan Liu et.al. 2503.05600 link Kimi
1389 2025-03-07 R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning Huatong Song et.al. 2503.05592 null Kimi
1390 2025-03-06 L$^2$M: Mutual Information Scaling Law for Long-Context Language Modeling Zhuo Chen et.al. 2503.04725 link Kimi
1391 2025-03-07 Shifting Long-Context LLMs Research from Input to Output Yuhao Wu et.al. 2503.04723 null Kimi
1392 2025-03-06 Enough Coin Flips Can Make LLMs Act Bayesian Ritwik Gupta et.al. 2503.04722 null Kimi
1393 2025-03-06 L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning Pranjal Aggarwal et.al. 2503.04697 null Kimi
1394 2025-03-06 UIPE: Enhancing LLM Unlearning by Removing Knowledge Related to Forgetting Targets Wenyu Wang et.al. 2503.04693 null Kimi
1395 2025-03-06 The Influence of Prior Discourse on Conversational Agent-Driven Decision-Making Stephen Pilli et.al. 2503.04692 null Kimi
1396 2025-03-06 Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases Pengcheng Qiu et.al. 2503.04691 null Kimi
1397 2025-03-07 DIMSUM: Discourse in Mathematical Reasoning as a Supervision Module Krish Sharma et.al. 2503.04685 null Kimi
1398 2025-03-06 Matrix Factorization for Inferring Associations and Missing Links Ryan Barron et.al. 2503.04680 null Kimi
1399 2025-03-06 LLM-guided Plan and Retrieval: A Strategic Alignment for Interpretable User Satisfaction Estimation in Dialogue Sangyeop Kim et.al. 2503.04675 null Kimi
1400 2025-03-05 PacketCLIP: Multi-Modal Embedding of Network Traffic and Language for Cybersecurity Reasoning Ryozo Masukawa et.al. 2503.03747 null Kimi
1401 2025-03-05 Process-based Self-Rewarding Language Models Shimao Zhang et.al. 2503.03746 null Kimi
1402 2025-03-05 Rethinking Deep Clustering Paradigms: Self-Supervision Is All You Need Amal Shaheena et.al. 2503.03733 null Kimi
1403 2025-03-05 Towards Understanding Distilled Reasoning Models: A Representational Approach David D. Baek et.al. 2503.03730 null Kimi
1404 2025-03-05 When Radiation Meets Linux: Analyzing Soft Errors in Linux on COTS SoCs under Proton Irradiation Saad Memon et.al. 2503.03722 null Kimi
1405 2025-03-05 Improving LLM Safety Alignment with Dual-Objective Optimization Xuandong Zhao et.al. 2503.03710 link Kimi
1406 2025-03-05 Rethinking Video Tokenization: A Conditioned Diffusion-based Approach Nianzu Yang et.al. 2503.03708 link Kimi
1407 2025-03-05 A Practical Memory Injection Attack against LLM Agents Shen Dong et.al. 2503.03704 null Kimi
1408 2025-03-05 ILLC: Iterative Layer-by-Layer Compression for Enhancing Structural Faithfulness in SpArX Ungsik Kim et.al. 2503.03693 null Kimi
1409 2025-03-05 DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance Zhao Yang et.al. 2503.03689 link Kimi
1410 2025-03-04 Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation Han Xue et.al. 2503.02881 link Kimi
1411 2025-03-04 Language Models can Self-Improve at State-Value Estimation for Better Search Ethan Mendes et.al. 2503.02878 link Kimi
1412 2025-03-04 Weak-to-Strong Generalization Even in Random Feature Networks, Provably Marko Medvedev et.al. 2503.02877 null Kimi
1413 2025-03-04 SPIDER: A Comprehensive Multi-Organ Supervised Pathology Dataset and Baseline Models Dmitry Nechaev et.al. 2503.02876 link Kimi
1414 2025-03-04 The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models Ke Ji et.al. 2503.02875 null Kimi
1415 2025-03-04 Calibrating LLM Confidence with Semantic Steering: A Multi-Prompt Aggregation Framework Ziang Zhou et.al. 2503.02863 null Kimi
1416 2025-03-04 PileUp Mitigation at the HL-LHC Using Attention for Event-Wide Context Luke Vaughan et.al. 2503.02860 null Kimi
1417 2025-03-04 Unsupervised Attributed Dynamic Network Embedding with Stability Guarantees Emma Ceccherini et.al. 2503.02859 null Kimi
1418 2025-03-04 Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs’ Decoding Layers Zicong He et.al. 2503.02851 link Kimi
1419 2025-03-04 Multimodal Deep Learning for Subtype Classification in Breast Cancer Using Histopathological Images and Gene Expression Data Amin Honarmandi Shandiz et.al. 2503.02849 link Kimi
1420 2025-02-28 LLM Post-Training: A Deep Dive into Reasoning Large Language Models Komal Kumar et.al. 2502.21321 link Kimi
1421 2025-02-28 Doping dependence of 2-spinon excitations in the doped 1D cuprate Ba$_2$CuO$_{3+\delta}$ Jiarui Li et.al. 2502.21316 null Kimi
1422 2025-02-28 Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos Zhiyu Tan et.al. 2502.21314 null Kimi
1423 2025-02-28 FANformer: Improving Large Language Models Through Effective Periodicity Modeling Yihong Dong et.al. 2502.21309 null Kimi
1424 2025-02-28 Persuasion Should be Double-Blind: A Multi-Domain Dialogue Dataset With Faithfulness Based on Causal Theory of Mind Dingyi Zhang et.al. 2502.21297 null Kimi
1425 2025-02-28 Bilevel Optimized Implicit Neural Representation for Scan-Specific Accelerated MRI Reconstruction Hongze Yu et.al. 2502.21292 null Kimi
1426 2025-02-28 Contextualizing biological perturbation experiments through language Menghua Wu et.al. 2502.21290 link Kimi
1427 2025-02-28 Boosting Prediction with Data Missing Not at Random Yuan Bian et.al. 2502.21276 null Kimi
1428 2025-02-28 Adaptive Keyframe Sampling for Long Video Understanding Xi Tang et.al. 2502.21271 null Kimi
1429 2025-02-28 Dynamical Decoupling of Generalization and Overfitting in Large Two-Layer Networks Andrea Montanari et.al. 2502.21269 null Kimi
1430 2025-02-27 R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts Zhongyang Li et.al. 2502.20395 link Kimi
1431 2025-02-27 LIFT-GS: Cross-Scene Render-Supervised Distillation for 3D Language Grounding Ang Cao et.al. 2502.20389 null Kimi
1432 2025-02-27 InsTaG: Learning Personalized 3D Talking Head from Few-Second Video Jiahe Li et.al. 2502.20387 link Kimi
1433 2025-02-27 ATLAS Navigator: Active Task-driven LAnguage-embedded Gaussian Splatting Dexter Ong et.al. 2502.20386 null Kimi
1434 2025-02-27 rSPDE: tools for statistical modeling using fractional SPDEs David Bolin et.al. 2502.20385 null Kimi
1435 2025-02-27 PhantomWiki: On-Demand Datasets for Reasoning and Retrieval Evaluation Albert Gong et.al. 2502.20377 link Kimi
1436 2025-02-27 Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization Ryan C. Barron et.al. 2502.20364 link Kimi
1437 2025-02-27 Bridging the Creativity Understanding Gap: Small-Scale Human Alignment Enables Expert-Level Humor Ranking in LLMs Kuan Lok Zhou et.al. 2502.20356 null Kimi
1438 2025-02-27 Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners Daniele Paliotta et.al. 2502.20339 null Kimi
1439 2025-02-27 KeBaB: $k$-mer based breaking for finding super-maximal exact matches Nathaniel K. Brown et.al. 2502.20338 null Kimi
1440 2025-02-26 Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models Lucy Xiaoyang Shi et.al. 2502.19417 null Kimi
1441 2025-02-26 Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation Shiven Sinha et.al. 2502.19414 link Kimi
1442 2025-02-26 The Mighty ToRR: A Benchmark for Table Reasoning and Robustness Shir Ashury-Tahan et.al. 2502.19412 link Kimi
1443 2025-02-26 Code to Think, Think to Code: A Survey on Code-Enhanced Reasoning and Reasoning-Driven Code Intelligence in LLMs Dayu Yang et.al. 2502.19411 link Kimi
1444 2025-02-26 ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models Danae Sánchez Villegas et.al. 2502.19409 null Kimi
1445 2025-02-26 Learning Code-Edit Embedding to Model Student Debugging Behavior Hasnain Heickal et.al. 2502.19407 null Kimi
1446 2025-02-26 Single-shot and two-shot decoding with generalized bicycle codes Hsiang-Ku Lin et.al. 2502.19406 null Kimi
1447 2025-02-26 General Reasoning Requires Learning to Reason from the Get-go Seungwook Han et.al. 2502.19402 null Kimi
1448 2025-02-26 TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding Max Ku et.al. 2502.19400 null Kimi
1449 2025-02-26 The End of Easy Phenomenology for CMB Experiments: A Case Study in the Dark Sector Cynthia Trendafilova et.al. 2502.19383 null Kimi
1450 2025-02-25 K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs Ziheng Ouyang et.al. 2502.18461 null Kimi
1451 2025-02-25 DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers Xueguang Ma et.al. 2502.18460 link Kimi
1452 2025-02-25 GHOST 2.0: generative high-fidelity one shot transfer of heads Alexander Groshev et.al. 2502.18417 null Kimi
1453 2025-02-25 Comparative Analysis of MDL-VAE vs. Standard VAE on 202 Years of Gynecological Data Paula Santos et.al. 2502.18412 null Kimi
1454 2025-02-25 The FFT Strikes Back: An Efficient Alternative to Self-Attention Jacob Fein-Ashley et.al. 2502.18394 link Kimi
1455 2025-02-25 ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation Yifan Pu et.al. 2502.18364 null Kimi
1456 2025-02-25 Graph Inference with Effective Resistance Queries Huck Bennett et.al. 2502.18350 null Kimi
1457 2025-02-25 Mapping of Subjective Accounts into Interpreted Clusters (MOSAIC): Topic Modelling and LLM applied to Stroboscopic Phenomenology Romy Beauté et.al. 2502.18318 null Kimi
1458 2025-02-25 RefuteBench 2.0 – Agentic Benchmark for Dynamic Evaluation of LLM Responses to Refutation Instruction Jianhao Yan et.al. 2502.18308 null Kimi
1459 2025-02-25 DeepCircuitX: A Comprehensive Repository-Level Dataset for RTL Code Understanding, Generation, and PPA Analysis Zeju Li et.al. 2502.18297 null Kimi
1460 2025-02-24 S4S: Solving for a Diffusion Model Solver Eric Frankel et.al. 2502.17423 null Kimi
1461 2025-02-24 MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs Jiarui Zhang et.al. 2502.17422 link Kimi
1462 2025-02-24 LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification Penghui Yang et.al. 2502.17421 link Kimi
1463 2025-02-24 Reasoning with Latent Thoughts: On the Power of Looped Transformers Nikunj Saunshi et.al. 2502.17416 null Kimi
1464 2025-02-24 X-Dancer: Expressive Music to Human Dance Video Generation Zeyuan Chen et.al. 2502.17414 null Kimi
1465 2025-02-24 Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning Guijin Son et.al. 2502.17407 link Kimi
1466 2025-02-24 Advances in multiparameter quantum sensing and metrology Luca Pezzè et.al. 2502.17396 null Kimi
1467 2025-02-24 The Empirical Impact of Reducing Symmetries on the Performance of Deep Ensembles and MoE Andrei Chernov et.al. 2502.17391 null Kimi
1468 2025-02-24 A Concise Lyapunov Analysis of Nesterov’s Accelerated Gradient Method Jun Liu et.al. 2502.17373 null Kimi
1469 2025-02-24 KV-Edit: Training-Free Image Editing for Precise Background Preservation Tianrui Zhu et.al. 2502.17363 link Kimi
1470 2025-02-21 Sparks of cognitive flexibility: self-guided context inference for flexible stimulus-response mapping by attentional routing Rowan Sommers et.al. 2502.15634 null Kimi
1471 2025-02-21 LaTIM: Measuring Latent Token-to-Token Interactions in Mamba Models Hugo Pitorro et.al. 2502.15612 null Kimi
1472 2025-02-21 Generalizing From Short to Long: Effective Data Synthesis for Long-Context Instruction Tuning Wenhao Zhu et.al. 2502.15592 link Kimi
1473 2025-02-21 LightThinker: Thinking Step-by-Step Compression Jintian Zhang et.al. 2502.15589 null Kimi
1474 2025-02-21 Adaptive Expansion for Hypergraph Learning Tianyi Ma et.al. 2502.15564 null Kimi
1475 2025-02-21 Estimating Vehicle Speed on Roadways Using RNNs and Transformers: A Video-based Approach Sai Krishna Reddy Mareddy et.al. 2502.15545 null Kimi
1476 2025-02-21 Generalization Guarantees for Representation Learning via Data-Dependent Gaussian Mixture Priors Milad Sefidgaran et.al. 2502.15540 link Kimi
1477 2025-02-21 Towards Swift Serverless LLM Cold Starts with ParaServe Chiheng Lou et.al. 2502.15524 null Kimi
1478 2025-02-21 Solving Inverse Problems with Deep Linear Neural Networks: Global Convergence Guarantees for Gradient Descent with Weight Decay Hannah Laus et.al. 2502.15522 null Kimi
1479 2025-02-21 Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection Yue Sun et.al. 2502.15516 null Kimi
1480 2025-02-20 LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention Shang Yang et.al. 2502.14866 link Kimi
1481 2025-02-20 CLIPPER: Compression enables long-context synthetic data generation Chau Minh Pham et.al. 2502.14854 link Kimi
1482 2025-02-20 Revealing and Mitigating Over-Attention in Knowledge Editing Pinzheng Wang et.al. 2502.14838 link Kimi
1483 2025-02-20 Towards Economical Inference: Enabling DeepSeek’s Multi-Head Latent Attention in Any Transformer-based LLMs Tao Ji et.al. 2502.14837 link Kimi
1484 2025-02-20 Improving the Diffusability of Autoencoders Ivan Skorokhodov et.al. 2502.14831 null Kimi
1485 2025-02-20 Measuring Faithfulness of Chains of Thought by Unlearning Reasoning Steps Martin Tutek et.al. 2502.14829 link Kimi
1486 2025-02-20 Turning on the Light: Polymorphism-Induced Photoluminescence in Cysteine Crystals Debarshi Banerjee et.al. 2502.14826 null Kimi
1487 2025-02-20 Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models Vlad Sobal et.al. 2502.14819 null Kimi
1488 2025-02-20 RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird’s Eye View Segmentation Henrique Piñeiro Monteagudo et.al. 2502.14792 null Kimi
1489 2025-02-20 Ray-Tracing for Conditionally Activated Neural Networks Claudio Gallicchio et.al. 2502.14788 null Kimi
1490 2025-02-20 LIFT: Improving Long Context Understanding of Large Language Models through Long Input Fine-Tuning Yansheng Mao et.al. 2502.14644 null Kimi
1491 2025-02-20 PEARL: Towards Permutation-Resilient LLMs Liang Chen et.al. 2502.14628 link Kimi
1492 2025-02-20 PLPHP: Per-Layer Per-Head Vision Token Pruning for Efficient Large Vision-Language Models Yu Meng et.al. 2502.14504 null Kimi
1493 2025-02-20 Unshackling Context Length: An Efficient Selective Attention Approach through Query-Key Compression Haoyu Wang et.al. 2502.14477 null Kimi
1494 2025-02-20 Early-Exit and Instant Confidence Translation Quality Estimation Vilém Zouhar et.al. 2502.14429 link Kimi
1495 2025-02-19 MuDAF: Long-Context Multi-Document Attention Focusing through Contrastive Learning on Attention Heads Weihao Liu et.al. 2502.13963 link Kimi
1496 2025-02-19 A Chain-of-Thought Subspace Meta-Learning for Few-shot Image Captioning with Large Vision and Language Models Hao Huang et.al. 2502.13942 null Kimi
1497 2025-02-19 Qwen2.5-VL Technical Report Shuai Bai et.al. 2502.13923 null Kimi
1498 2025-02-19 LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization Guanzheng Chen et.al. 2502.13922 link Kimi
1499 2025-02-19 A measurement-based approach to analyze the power consumption of the softwarized 5G core Arturo Bellin et.al. 2502.13879 null Kimi
1500 2025-02-19 SPEX: Scaling Feature Interaction Explanations for LLMs Justin Singh Kang et.al. 2502.13870 link Kimi
1501 2025-02-19 Enhancing LLM-Based Recommendations Through Personalized Reasoning Jiahao Liu et.al. 2502.13845 link Kimi
1502 2025-02-19 SCALAR: Scientific Citation-based Live Assessment of Long-context Academic Reasoning Renxi Wang et.al. 2502.13753 link Kimi
1503 2025-02-19 MoM: Linear Sequence Modeling with Mixture-of-Memories Jusen Du et.al. 2502.13685 link Kimi
1504 2025-02-19 PeerQA: A Scientific Question Answering Dataset from Peer Reviews Tim Baumgärtner et.al. 2502.13668 link Kimi
1505 2025-02-18 Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning Jingyang Lin et.al. 2502.13127 null Kimi
1506 2025-02-18 Eager Updates For Overlapped Communication and Computation in DiLoCo Satyen Kale et.al. 2502.12996 null Kimi
1507 2025-02-18 Infinite Retrieval: Attention Enhanced LLMs in Long-Context Processing Xiaoju Ye et.al. 2502.12962 null Kimi
1508 2025-02-18 Every Expert Matters: Towards Effective Knowledge Distillation for Mixture-of-Experts Language Models Gyeongman Kim et.al. 2502.12947 null Kimi
1509 2025-02-18 S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning Ruotian Ma et.al. 2502.12853 link Kimi
1510 2025-02-18 A$^2$ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization Junhui He et.al. 2502.12665 null Kimi
1511 2025-02-18 MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation Sihyun Yu et.al. 2502.12632 null Kimi
1512 2025-02-18 Improving Chain-of-Thought Reasoning via Quasi-Symbolic Abstractions Leonardo Ranaldi et.al. 2502.12616 null Kimi
1513 2025-02-18 LongFaith: Enhancing Long-Context Reasoning in LLMs with Faithful Synthetic Data Cehao Yang et.al. 2502.12583 link Kimi
1514 2025-02-18 HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading Cheng Luo et.al. 2502.12574 link Kimi
1515 2025-02-17 Small Models Struggle to Learn from Strong Reasoners Yuetai Li et.al. 2502.12143 null Kimi
1516 2025-02-17 SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs Yige Xu et.al. 2502.12134 null Kimi
1517 2025-02-17 APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs Yuxiang Huang et.al. 2502.12085 link Kimi
1518 2025-02-17 AdaSplash: Adaptive Sparse Flash Attention Nuno Gonçalves et.al. 2502.12082 link Kimi
1519 2025-02-17 TokenSkip: Controllable Chain-of-Thought Compression in LLMs Heming Xia et.al. 2502.12067 link Kimi
1520 2025-02-17 SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities Fengqing Jiang et.al. 2502.12025 null Kimi
1521 2025-02-17 Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving Xin Xu et.al. 2502.12022 null Kimi
1522 2025-02-17 Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL Hanbing Liu et.al. 2502.11656 link Kimi
1523 2025-02-17 SurgPose: a Dataset for Articulated Robotic Surgical Tool Pose Estimation and Tracking Zijian Wu et.al. 2502.11534 null Kimi
1524 2025-02-17 AURORA: Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification Xiaoyu Tan et.al. 2502.11520 null Kimi
1525 2025-02-14 Are Large Language Models the future crowd workers of Linguistics? Iris Ferrazzo et.al. 2502.10266 null Kimi
1526 2025-02-14 LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs - No Silver Bullet for LC or RAG Routing Kuan Li et.al. 2502.09977 null Kimi
1527 2025-02-14 MIR-Bench: Benchmarking LLM’s Long-Context Intelligence via Many-Shot In-Context Inductive Reasoning Kai Yan et.al. 2502.09933 null Kimi
1528 2025-02-14 INF^2: High-Throughput Generative Inference of Large Language Models using Near-Storage Processing Hongsun Jang et.al. 2502.09921 null Kimi
1529 2025-02-13 ATM-Net: Adaptive Termination and Multi-Precision Neural Networks for Energy-Harvested Edge Intelligence Neeraj Solanki et.al. 2502.09822 null Kimi
1530 2025-02-13 NestQuant: Nested Lattice Quantization for Matrix Products and LLMs Semyon Savkin et.al. 2502.09720 null Kimi
1531 2025-02-13 MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency Dongzhi Jiang et.al. 2502.09621 null Kimi
1532 2025-02-13 CoT-Valve: Length-Compressible Chain-of-Thought Tuning Xinyin Ma et.al. 2502.09601 link Kimi
1533 2025-02-13 Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs Siyan Zhao et.al. 2502.09597 link Kimi
1534 2025-02-13 SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models Daniel Fleischer et.al. 2502.09390 link Kimi
1535 2025-02-13 Generalizability through Explainability: Countering Overfitting with Counterfactual Examples Flavio Giorgi et.al. 2502.09193 null Kimi
1536 2025-02-13 Bridging the Gap Between LLMs and Human Intentions: Progresses and Challenges in Instruction Understanding, Intention Reasoning, and Reliable Generation Zongyu Chang et.al. 2502.09101 null Kimi
1537 2025-02-13 Unleashing the Power of Large Language Model for Denoising Recommendation Shuyao Wang et.al. 2502.09058 null Kimi
1538 2025-02-13 Diversity Enhances an LLM’s Performance in RAG and Long-context Task Zhchao Wang et.al. 2502.09017 null Kimi
1539 2025-02-13 RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models Quan Wei et.al. 2502.09003 null Kimi
1540 2025-02-13 Task Generalization With AutoRegressive Compositional Structure: Can Learning From $\d$ Tasks Generalize to $\d^{T}$ Tasks? Amirhesam Abedsoltan et.al. 2502.08991 null Kimi
1541 2025-02-12 Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning Qifan Yu et.al. 2502.08482 null Kimi
1542 2025-02-12 The MoE-Empowered Edge LLMs Deployment: Architecture, Challenges, and Opportunities Ning Li et.al. 2502.08381 null Kimi
1543 2025-02-12 Inference-time sparse attention with asymmetric indexing Pierre-Emmanuel Mazaré et.al. 2502.08246 null Kimi
1544 2025-02-12 Learning Human Skill Generators at Key-Step Levels Yilu Wu et.al. 2502.08234 null Kimi
1545 2025-02-12 Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance Lingfei Qian et.al. 2502.08127 link Kimi
1546 2025-02-12 GCoT: Chain-of-Thought Prompt Learning for Graphs Xingtong Yu et.al. 2502.08092 null Kimi
1547 2025-02-12 Mixture of Decoupled Message Passing Experts with Entropy Constraint for General Node Classification Xuanze Chen et.al. 2502.08083 null Kimi
1548 2025-02-11 Training Sparse Mixture Of Experts Text Embedding Models Zach Nussbaum et.al. 2502.07972 link Kimi
1549 2025-02-11 HexGen-2: Disaggregated Generative Inference of LLMs in Heterogeneous Environment Youhe Jiang et.al. 2502.07903 null Kimi
1550 2025-02-11 TransMLA: Multi-head Latent Attention Is All You Need Fanxu Meng et.al. 2502.07864 link Kimi
1551 2025-02-11 Magic 1-For-1: Generating One Minute Video Clips within One Minute Hongwei Yi et.al. 2502.07701 link Kimi
1552 2025-02-11 LASP-2: Rethinking Sequence Parallelism for Linear Attention and Its Hybrid Weigao Sun et.al. 2502.07563 link Kimi
1553 2025-02-11 Early Stopping Against Label Noise Without Validation Data Suqin Yuan et.al. 2502.07551 link Kimi
1554 2025-02-11 Instance-dependent Early Stopping Suqin Yuan et.al. 2502.07547 link Kimi
1555 2025-02-11 Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More Xialie Zhuang et.al. 2502.07490 link Kimi
1556 2025-02-11 LLMs Can Easily Learn to Reason from Demonstrations: Structure, not content, is what matters! Dacheng Li et.al. 2502.07374 link Kimi
1557 2025-02-11 LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation Zican Dong et.al. 2502.07365 null Kimi
1558 2025-02-11 BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models Xu Huang et.al. 2502.07346 link Kimi
1559 2025-02-11 CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction Junlong Li et.al. 2502.07316 link Kimi
1560 2025-02-11 OpenGrok: Enhancing SNS Data Processing with Distilled Knowledge and Mask-like Mechanisms Lumen AI et.al. 2502.07312 link Kimi
1561 2025-02-10 On the Emergence of Thinking in LLMs I: Searching for the Right Intuition Guanghao Ye et.al. 2502.06773 link Kimi
1562 2025-02-10 ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates Ling Yang et.al. 2502.06772 link Kimi
1563 2025-02-10 Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs Ryan Synk et.al. 2502.06766 link Kimi
1564 2025-02-10 History-Guided Video Diffusion Kiwhan Song et.al. 2502.06764 null Kimi
1565 2025-02-10 Rationalization Models for Text-to-SQL Gaetano Rossiello et.al. 2502.06759 null Kimi
1566 2025-02-10 MoETuner: Optimized Mixture of Expert Serving with Balanced Expert Placement and Token Routing Seokjin Go et.al. 2502.06643 null Kimi
1567 2025-02-10 Scaling Multi-Document Event Summarization: Evaluating Compression vs. Full-Text Approaches Adithya Pratapa et.al. 2502.06617 link Kimi
1568 2025-02-10 Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation Chengwen Qi et.al. 2502.06563 link Kimi
1569 2025-02-10 CoS: Chain-of-Shot Prompting for Long Video Understanding Jian Hu et.al. 2502.06428 null Kimi
1570 2025-02-10 Expect the Unexpected: FailSafe Long Context QA for Finance Kiran Kamble et.al. 2502.06329 null Kimi
1571 2025-02-07 Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy Yunhang Shen et.al. 2502.05177 link Kimi
1572 2025-02-07 VideoRoPE: What Makes for Good Video Rotary Position Embedding? Xilin Wei et.al. 2502.05173 link Kimi
1573 2025-02-07 Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient Jan Ludziejewski et.al. 2502.05172 null Kimi
1574 2025-02-07 NoLiMa: Long-Context Evaluation Beyond Literal Matching Ali Modarressi et.al. 2502.05167 link Kimi
1575 2025-02-07 Data-Parallel Neural Network Training via Nonlinearly Preconditioned Trust-Region Method Samuel A. Cruz Alegría et.al. 2502.05133 null Kimi
1576 2025-02-07 Adaptive Graph of Thoughts: Test-Time Adaptive Reasoning Unifying Chain, Tree, and Graph Structures Tushar Pandey et.al. 2502.05078 link Kimi
1577 2025-02-07 S$^2$-MAD: Breaking the Token Barrier to Enhance Multi-Agent Debate Efficiency Yuting Zeng et.al. 2502.04790 null Kimi
1578 2025-02-07 Early Stopping for Regression Trees Ratmir Miftachov et.al. 2502.04709 null Kimi
1579 2025-02-07 ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and Reasoning Yuwei Yin et.al. 2502.04689 link Kimi
1580 2025-02-07 Unveiling the Mechanisms of Explicit CoT Training: How Chain-of-Thought Enhances Reasoning Generalization Xinhao Yao et.al. 2502.04667 link Kimi
1581 2025-02-06 Exploring operation parallelism vs. ion movement in ion-trapped QCCD architectures Anabel Ovide et.al. 2502.04181 null Kimi
1582 2025-02-06 HD-EPIC: A Highly-Detailed Egocentric Video Dataset Toby Perrett et.al. 2502.04144 null Kimi
1583 2025-02-06 AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference Qingyue Yang et.al. 2502.04077 link Kimi
1584 2025-02-06 RWKV-UI: UI Understanding with Enhanced Perception and Reasoning Jiaxi Yang et.al. 2502.03971 null Kimi
1585 2025-02-06 InfinitePOD: Building Datacenter-Scale High-Bandwidth Domain for LLM with Optical Circuit Switching Transceivers Chenchen Shou et.al. 2502.03885 null Kimi
1586 2025-02-06 Rank Also Matters: Hierarchical Configuration for Mixture of Adapter Experts in LLM Fine-Tuning Peizhuang Cong et.al. 2502.03884 null Kimi
1587 2025-02-06 Identify Critical KV Cache in LLM Inference from an Output Perturbation Perspective Yuan Feng et.al. 2502.03805 link Kimi
1588 2025-02-05 (GG) MoE vs. MLP on Tabular Data Andrei Chernov et.al. 2502.03608 null Kimi
1589 2025-02-05 HACK: Homomorphic Acceleration via Compression of the Key-Value Cache for Disaggregated LLM Inference Zeyu Zhang et.al. 2502.03589 null Kimi
1590 2025-02-05 Demystifying Long Chain-of-Thought Reasoning in LLMs Edward Yeo et.al. 2502.03373 link Kimi
1591 2025-02-05 ECM: A Unified Electronic Circuit Model for Explaining the Emergence of In-Context Learning and Chain-of-Thought in Large Language Model Qiguang Chen et.al. 2502.03325 null Kimi
1592 2025-02-05 Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning DiJia Su et.al. 2502.03275 null Kimi
1593 2025-02-05 MaxInfo: A Training-Free Key-Frame Selection Method Using Maximum Volume for Enhanced Video Understanding Pengyi Li et.al. 2502.03183 null Kimi
1594 2025-02-05 Structured Token Retention and Computational Memory Paths in Large Language Models Jonathan Delena et.al. 2502.03102 null Kimi
1595 2025-02-05 IAO Prompting: Making Knowledge Flow Explicit in LLMs through Structured Reasoning Templates Aissatou Diallo et.al. 2502.03080 null Kimi
1596 2025-02-05 Scaling Laws for Upcycling Mixture-of-Experts Language Models Seng Pei Liew et.al. 2502.03009 null Kimi
1597 2025-02-05 LLM-KT: Aligning Large Language Models with Knowledge Tracing using a Plug-and-Play Instruction Ziwei Wang et.al. 2502.02945 null Kimi
1598 2025-02-05 Early Stopping in Contextual Bandits and Inferences Zihan Cui et.al. 2502.02793 null Kimi
1599 2025-02-04 Twilight: Adaptive Attention Sparsity with Hierarchical Top-$p$ Pruning Chaofan Lin et.al. 2502.02770 null Kimi
1600 2025-02-04 Hecate: Unlocking Efficient Sparse Model Training via Fully Sharded Sparse Data Parallelism Yuhao Qing et.al. 2502.02581 null Kimi
1601 2025-02-04 Brief analysis of DeepSeek R1 and its implications for Generative AI Sarah Mercer et.al. 2502.02523 null Kimi
1602 2025-02-04 EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization Yize Wu et.al. 2502.02493 null Kimi
1603 2025-02-04 Lower Bounds for Chain-of-Thought Reasoning in Hard-Attention Transformers Alireza Amiri et.al. 2502.02393 null Kimi
1604 2025-02-04 STAIR: Improving Safety Alignment with Introspective Reasoning Yichi Zhang et.al. 2502.02384 link Kimi
1605 2025-02-04 Premise-Augmented Reasoning Chains Improve Error Identification in Math reasoning with LLMs Sagnik Mukherjee et.al. 2502.02362 null Kimi
1606 2025-02-04 VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation Siyu Xu et.al. 2502.02175 null Kimi
1607 2025-02-04 M2R2: Mixture of Multi-Rate Residuals for Efficient Transformer Inference Nikhil Bhendawade et.al. 2502.02040 null Kimi
1608 2025-02-04 Wavelet-based Positional Representation for Long Context Yui Oka et.al. 2502.02004 null Kimi
1609 2025-02-04 MPIC: Position-Independent Multimodal Context Caching System for Efficient MLLM Serving Shiju Zhao et.al. 2502.01960 null Kimi
1610 2025-01-31 Scalable-Softmax Is Superior for Attention Ken M. Nakanishi et.al. 2501.19399 null Kimi
1611 2025-01-31 Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models Alina Shutova et.al. 2501.19392 link Kimi
1612 2025-01-31 Efficient Reasoning with Hidden Thinking Xuan Shen et.al. 2501.19201 link Kimi
1613 2025-01-31 Rethinking Early Stopping: Refine, Then Calibrate Eugène Berta et.al. 2501.19195 link Kimi
1614 2025-01-31 A theoretical framework for overfitting in energy-based modeling Giovanni Catania et.al. 2501.19158 null Kimi
1615 2025-01-31 $\infty$-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation Saul Santos et.al. 2501.19098 link Kimi
1616 2025-01-30 Rope to Nope and Back Again: A New Hybrid Attention Strategy Bowen Yang et.al. 2501.18795 null Kimi
1617 2025-01-30 Zero-shot Large Language Models for Long Clinical Text Summarization with Temporal Reasoning Maya Kruse et.al. 2501.18724 null Kimi
1618 2025-01-30 Rethinking Bottlenecks in Safety Fine-Tuning of Vision Language Models Yi Ding et.al. 2501.18533 null Kimi
1619 2025-01-30 State Stream Transformer (SST) : Emergent Metacognitive Behaviours Through Latent State Persistence Thea Aviss et.al. 2501.18356 null Kimi
1620 2025-01-30 Learning to Plan & Reason for Evaluation with Thinking-LLM-as-a-Judge Swarnadeep Saha et.al. 2501.18099 null Kimi
1621 2025-01-29 Physics-Grounded Differentiable Simulation for Soft Growing Robots Lucas Chen et.al. 2501.17963 link Kimi
1622 2025-01-29 Free Agent in Agent-Based Mixture-of-Experts Generative AI Framework Jung-Hua Liu et.al. 2501.17903 null Kimi
1623 2025-01-29 Formally Verified Binary-level Pointer Analysis Freek Verbeek et.al. 2501.17766 null Kimi
1624 2025-01-29 CSEval: Towards Automated, Multi-Dimensional, and Reference-Free Counterspeech Evaluation using Auto-Calibrated LLMs Amey Hengle et.al. 2501.17581 null Kimi
1625 2025-01-29 Heuristic-Informed Mixture of Experts for Link Prediction in Multilayer Networks Lucio La Cava et.al. 2501.17557 null Kimi
1626 2025-01-29 DINT Transformer Yueyang Cang et.al. 2501.17486 null Kimi
1627 2025-01-28 TORCHLIGHT: Shedding LIGHT on Real-World Attacks on Cloudless IoT Devices Concealed within the Tor Network Yumingzhi Pan et.al. 2501.16784 null Kimi
1628 2025-01-28 3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow Yueen Ma et.al. 2501.16698 null Kimi
1629 2025-01-28 MCTS-SQL: An Effective Framework for Text-to-SQL with Monte Carlo Tree Search Shuozhi Yuan et.al. 2501.16607 null Kimi
1630 2025-01-27 Searching for GEMS: Discovery and Characterization of Two Brown Dwarfs Around M Dwarfs Alexander Larsen et.al. 2501.16554 null Kimi
1631 2025-01-27 MoEVD: Enhancing Vulnerability Detection by Mixture-of-Experts (MoE) Xu Yang et.al. 2501.16454 null Kimi
1632 2025-01-27 The Effect of Optimal Self-Distillation in Noisy Gaussian Mixture Model Kaito Takanami et.al. 2501.16226 null Kimi
1633 2025-01-27 Provence: efficient and robust context pruning for retrieval-augmented generation Nadezhda Chirkova et.al. 2501.16214 null Kimi
1634 2025-01-27 Options-Aware Dense Retrieval for Multiple-Choice query Answering Manish Singh et.al. 2501.16111 null Kimi
1635 2025-01-27 Static Batching of Irregular Workloads on GPUs: Framework and Application to Efficient MoE Model Inference Yinghan Li et.al. 2501.16103 null Kimi
1636 2025-01-27 Understanding Long Videos via LLM-Powered Entity Relation Graphs Meng Chu et.al. 2501.15953 null Kimi
1637 2025-01-27 Memorization and Regularization in Generative Diffusion Models Ricardo Baptista et.al. 2501.15785 link Kimi
1638 2025-01-27 Renewable Energy Prediction: A Comparative Study of Deep Learning Models for Complex Dataset Analysis Haibo Wang et.al. 2501.15731 null Kimi
1639 2025-01-26 A Benchmarking Platform for DDR4 Memory Performance in Data-Center-Class FPGAs Andrea Galimberti et.al. 2501.15582 null Kimi
1640 2025-01-26 Qwen2.5-1M Technical Report An Yang et.al. 2501.15383 null Kimi
1641 2025-01-25 ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning Shangqian Gao et.al. 2501.15316 null Kimi
1642 2025-01-24 Mean-field limit from general mixtures of experts to quantum neural networks Anderson Melchor Hernandez et.al. 2501.14660 null Kimi
1643 2025-01-24 Experimentally Evaluating the Resource Efficiency of Big Data Autoscaling Jonathan Will et.al. 2501.14456 link Kimi
1644 2025-01-24 Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains Xu Chu et.al. 2501.14431 null Kimi
1645 2025-01-24 GraphBC: Improving LLMs for Better Graph Data Processing Xu Chu et.al. 2501.14427 null Kimi
1646 2025-01-24 Hierarchical Time-Aware Mixture of Experts for Multi-Modal Sequential Recommendation Shengzhe Zhang et.al. 2501.14269 link Kimi
1647 2025-01-24 Serving Long-Context LLMs at the Mobile Edge: Test-Time Reinforcement Learning-based Model Caching and Inference Offloading Minrui Xu et.al. 2501.14205 null Kimi
1648 2025-01-23 Can We Generate Images with CoT? Let’s Verify and Reinforce Image Generation Step by Step Ziyu Guo et.al. 2501.13926 link Kimi
1649 2025-01-23 The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities Chan-Jan Hsu et.al. 2501.13921 link Kimi
1650 2025-01-23 PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection Peiyuan Zhang et.al. 2501.13898 link Kimi
1651 2025-01-23 Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models Zhenghao Lin et.al. 2501.13629 null Kimi
1652 2025-01-23 Coarse-to-Fine Process Reward Modeling for Enhanced Mathematical Reasoning Yulan Hu et.al. 2501.13622 null Kimi
1653 2025-01-23 Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge Haomiao Xiong et.al. 2501.13468 link Kimi
1654 2025-01-23 Contrast: A Hybrid Architecture of Transformers and State Space Models for Low-Level Vision Aman Urumbekov et.al. 2501.13353 null Kimi
1655 2025-01-23 Qrazor: Reliable and effortless 4-bit llm quantization by significant data razoring Dongyoung Lee et.al. 2501.13331 null Kimi
1656 2025-01-22 Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment Melissa Kazemi Rad et.al. 2501.13080 null Kimi
1657 2025-01-22 Autonomy-of-Experts Models Ang Lv et.al. 2501.13074 null Kimi
1658 2025-01-22 Ehrenfeucht-Haussler Rank and Chain of Thought Pablo Barceló et.al. 2501.12997 null Kimi
1659 2025-01-22 LLM4WM: Adapting LLM for Wireless Multi-Tasking Xuanyu Liu et.al. 2501.12983 null Kimi
1660 2025-01-22 Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference Weizhi Fei et.al. 2501.12959 null Kimi
1661 2025-01-22 Late Breaking Result: FPGA-Based Emulation and Fault Injection for CNN Inference Accelerators Filip Masar et.al. 2501.12818 link Kimi
1662 2025-01-22 NExtLong: Toward Effective Long-Context Training without Long Documents Chaochen Gao et.al. 2501.12766 link Kimi
1663 2025-01-22 BLR-MoE: Boosted Language-Routing Mixture of Experts for Domain-Robust Multilingual E2E ASR Guodong Ma et.al. 2501.12602 null Kimi
1664 2025-01-22 Kimi k1.5: Scaling Reinforcement Learning with LLMs Kimi Team et.al. 2501.12599 null Kimi
1665 2025-01-21 Slot-BERT: Self-supervised Object Discovery in Surgical Video Guiqiu Liao et.al. 2501.12477 null Kimi
1666 2025-01-21 Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Sili Chen et.al. 2501.12375 null Kimi
1667 2025-01-21 Is Long Context All You Need? Leveraging LLM’s Extended Context for NL2SQL Yeounoh Chung et.al. 2501.12372 link Kimi
1668 2025-01-21 Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models Samira Abnar et.al. 2501.12370 null Kimi
1669 2025-01-21 CDW-CoT: Clustered Distance-Weighted Chain-of-Thoughts Reasoning Yuanheng Fang et.al. 2501.12226 null Kimi
1670 2025-01-21 Muon-specific two-Higgs-doublet model for $(g-2)_\mu$ anomaly, $W$-boson mass-shift, and Zee model I. A. Yafi et.al. 2501.12181 null Kimi
1671 2025-01-21 Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models Zihan Qiu et.al. 2501.11873 null Kimi
1672 2025-01-20 Characterization of GPU TEE Overheads in Distributed Data Parallel ML Training Jonghytun Lee et.al. 2501.11771 null Kimi
1673 2025-01-20 Early Stopping Bayesian Optimization for Controller Tuning David Stenger et.al. 2501.11532 link Kimi
1674 2025-01-20 CatV2TON: Taming Diffusion Transformers for Vision-Based Virtual Try-On with Temporal Concatenation Zheng Chong et.al. 2501.11325 link Kimi
1675 2025-01-20 RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems? Haotian Xu et.al. 2501.11284 null Kimi
1676 2025-01-17 AraXL: A Physically Scalable, Ultra-Wide RISC-V Vector Processor Design for Fast and Efficient Computation on Long Vectors Navaneeth Kunhi Purayil et.al. 2501.10301 null Kimi
1677 2025-01-17 ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario Lucen Zhong et.al. 2501.10132 link Kimi
1678 2025-01-17 Multi-Dimensional Vector ISA Extension for Mobile In-Cache Computing Alireza Khadem et.al. 2501.09902 link Kimi
1679 2025-01-16 Coded Deep Learning: Framework and Algorithm En-hui Yang et.al. 2501.09849 null Kimi
1680 2025-01-15 LeMo: Enabling LEss Token Involvement for MOre Context Fine-tuning Tuowei Wang et.al. 2501.09767 null Kimi
1681 2025-01-16 AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation Junjie He et.al. 2501.09503 link Kimi
1682 2025-01-16 PICE: A Semantic-Driven Progressive Inference System for LLM Serving in Cloud-Edge Networks Huiyou Zhan et.al. 2501.09367 null Kimi
1683 2025-01-15 Doc-Guided Sent2Sent++: A Sent2Sent++ Agent with Doc-Guided memory for Document-level Machine Translation Jiaxin Guo et.al. 2501.08523 null Kimi
1684 2025-01-14 Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models Yifu Qiu et.al. 2501.08248 null Kimi
1685 2025-01-14 PRESERVE: Prefetching Model Weights and KV-Cache in Distributed LLM Serving Ahmet Caner Yüzügüler et.al. 2501.08192 null Kimi
1686 2025-01-13 A Survey of Early Exit Deep Neural Networks in NLP Divya Jyoti Bajpai et.al. 2501.07670 null Kimi
1687 2025-01-14 Monotone Curve Estimation via Convex Duality Tongseok Lim et.al. 2501.06975 null Kimi
1688 2025-01-12 MPCache: MPC-Friendly KV Cache Eviction for Efficient Private Large Language Model Inference Wenxuan Zeng et.al. 2501.06807 null Kimi
1689 2025-01-12 Mell: Memory-Efficient Large Language Model Serving via Multi-GPU KV Cache Management Liu Qianli et.al. 2501.06709 null Kimi
1690 2025-01-11 SafeSplit: A Novel Defense Against Client-Side Backdoor Attacks in Split Learning Phillip Rieger et.al. 2501.06650 null Kimi
1691 2025-01-11 Guided Code Generation with LLMs: A Multi-Agent Framework for Complex Code Tasks Amr Almorsi et.al. 2501.06625 null Kimi
1692 2025-01-11 Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping Muru Zhang et.al. 2501.06589 link Kimi
1693 2025-01-11 Tensor Product Attention Is All You Need Yifan Zhang et.al. 2501.06425 link Kimi
1694 2025-01-10 Scale-up Unlearnable Examples Learning with High-Performance Computing Yanfan Zhu et.al. 2501.06080 link Kimi
1695 2025-01-09 Prediction-Assisted Online Distributed Deep Learning Workload Scheduling in GPU Clusters Ziyue Luo et.al. 2501.05563 null Kimi
1696 2025-01-09 LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation Xi Ye et.al. 2501.05414 null Kimi
1697 2025-01-09 Euclid: Detecting Solar System objects in Euclid images and classifying them using Kohonen self-organising maps A. A. Nucita et.al. 2501.05023 null Kimi
1698 2025-01-09 SyNPar: Synthetic Null Data Parallelism for High-Power False Discovery Rate Control in High-Dimensional Variable Selection Changhu Wang et.al. 2501.05012 null Kimi
1699 2025-01-09 TreeKV: Smooth Key-Value Cache Compression with Tree Structures Ziwei He et.al. 2501.04987 null Kimi
1700 2025-01-08 Collaborative Inference Acceleration with Non-Penetrative Tensor Partitioning Zhibang Liu et.al. 2501.04489 null Kimi
1701 2025-01-06 The Power of Negative Zero: Datatype Customization for Quantized Large Language Models Yuzong Chen et.al. 2501.04052 link Kimi
1702 2025-01-07 CoReQA: Uncovering Potentials of Language Models in Code Repository Question Answering Jialiang Chen et.al. 2501.03447 null Kimi
1703 2025-01-05 PTEENet: Post-Trained Early-Exit Neural Networks Augmentation for Inference Cost Optimization Assaf Lahiany et.al. 2501.02508 null Kimi
1704 2025-01-07 ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling Chaojie Mao et.al. 2501.02487 null Kimi
1705 2025-01-04 AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference Zhuomin He et.al. 2501.02336 link Kimi
1706 2025-01-04 The Efficiency vs. Accuracy Trade-off: Optimizing RAG-Enhanced LLM Recommender Systems Using Multi-Head Early Exit Huixue Zhou et.al. 2501.02173 null Kimi
1707 2025-01-03 Efficient LLM Inference with Activation Checkpointing and Hybrid Caching Sanghyeon Lee et.al. 2501.01792 null Kimi
1708 2025-01-03 Data Parallel Visualization and Rendering on the RAMSES Supercomputer with ANARI Stefan Zellmann et.al. 2501.01628 null Kimi
1709 2025-01-02 TreeLUT: An Efficient Alternative to Deep Neural Networks for Inference Acceleration Using Gradient Boosted Decision Trees Alireza Khataei et.al. 2501.01511 link Kimi
1710 2025-01-02 FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving Zihao Ye et.al. 2501.01005 link Kimi
1711 2025-01-01 Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding Jiajun Zhu et.al. 2501.00712 link Kimi
1712 2025-01-01 Adjoint sharding for very long context training of state space models Xingzi Xu et.al. 2501.00692 null Kimi
1713 2024-12-31 Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing Peihao Wang et.al. 2501.00658 link Kimi
1714 2024-12-31 A Study on Context Length and Efficient Transformers for Biomedical Image Analysis Sarah M. Hooper et.al. 2501.00619 null Kimi
1715 2024-12-31 VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling Xinhao Li et.al. 2501.00574 link Kimi
1716 2024-12-30 CaseSumm: A Large-Scale Dataset for Long-Context Summarization from U.S. Supreme Court Opinions Mourad Heddaya et.al. 2501.00097 null Kimi
1717 2024-12-30 Adaptive Batch Size Schedules for Distributed Training of Language Models with Data and Model Parallelism Tim Tsz-Kit Lau et.al. 2412.21124 null Kimi
1718 2024-12-30 Align Attention Heads Before Merging Them: An Effective Way for Converting MHA to GQA Qingyun Jin et.al. 2412.20677 null Kimi
1719 2024-12-29 ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding Xiao Wang et.al. 2412.20504 link Kimi
1720 2024-12-29 TokenRing: An Efficient Parallelism Framework for Infinite-Context LLMs via Bidirectional Communication Zongwu Wang et.al. 2412.20501 link Kimi
1721 2024-12-29 NeutronTP: Load-Balanced Distributed Full-Graph GNN Training with Tensor Parallelism Xin Ai et.al. 2412.20379 null Kimi
1722 2024-12-28 LoL-PIM: Long-Context LLM Decoding with Scalable DRAM-PIM System Hyucksung Kwon et.al. 2412.20166 null Kimi
1723 2024-12-28 ST$^3$: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming Jiedong Zhuang et.al. 2412.20105 null Kimi
1724 2024-12-27 Goal-oriented Communications based on Recursive Early Exit Neural Networks Jary Pomponi et.al. 2412.19587 null Kimi
1725 2024-12-27 StyleRWKV: High-Quality and High-Efficiency Style Transfer with RWKV-like Architecture Miaomiao Dai et.al. 2412.19535 null Kimi
1726 2025-01-02 A Survey on Large Language Model Acceleration based on KV Cache Management Haoyang Li et.al. 2412.19442 link Kimi
1727 2024-12-26 Performance Control in Early Exiting to Deploy Large Models at the Same Cost of Smaller Ones Mehrnaz Mofakhami et.al. 2412.19325 null Kimi
1728 2024-12-26 Multi-matrix Factorization Attention Jingcheng Hu et.al. 2412.19255 null Kimi
1729 2024-12-26 Repository Structure-Aware Training Makes SLMs Better Issue Resolver Zexiong Ma et.al. 2412.19031 null Kimi
1730 2024-12-25 Long-Range Tasks Using Short-Context LLMs: Incremental Reasoning With Structured Memories Dulhan Jayalath et.al. 2412.18914 null Kimi
1731 2024-12-25 Bootstrap Your Own Context Length Liang Wang et.al. 2412.18860 null Kimi
1732 2024-12-25 DCIS: Efficient Length Extrapolation of LLMs via Divide-and-Conquer Scaling Factor Search Lei Yang et.al. 2412.18811 link Kimi
1733 2024-12-24 Efficient Long Context Language Model Retrieval with Compression Minju Seo et.al. 2412.18232 null Kimi
1734 2024-12-24 Adapter Merging with Centroid Prototype Mapping for Scalable Class-Incremental Learning Takuma Fukuda et.al. 2412.18219 link Kimi
1735 2024-12-24 KunServe: Elastic and Efficient Large Language Model Serving with Parameter-centric Memory Management Rongxin Cheng et.al. 2412.18169 null Kimi
1736 2024-12-24 Beyond Gradient Averaging in Parallel Optimization: Improved Robustness through Gradient Agreement Filtering Francois Chaubard et.al. 2412.18052 link Kimi
1737 2024-12-23 Theoretical Constraints on the Expressive Power of $\mathsf{RoPE}$-based Tensor Attention Transformers Xiaoyu Li et.al. 2412.18040 null Kimi
1738 2024-12-23 Deliberation in Latent Space via Differentiable Cache Augmentation Luyang Liu et.al. 2412.17747 null Kimi
1739 2024-12-24 YuLan-Mini: An Open Data-efficient Language Model Yiwen Hu et.al. 2412.17743 link Kimi
1740 2024-12-23 Improved Cotton Leaf Disease Classification Using Parameter-Efficient Deep Learning Framework Aswini Kumar Patra et.al. 2412.17587 null Kimi
1741 2024-12-23 Optimal Convergence Rates for Neural Operators Mike Nguyen et.al. 2412.17518 null Kimi
1742 2024-12-23 A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression Chenlong Deng et.al. 2412.17483 null Kimi
1743 2024-12-23 MineAgent: Towards Remote-Sensing Mineral Exploration with Multimodal Large Language Models Beibei Yu et.al. 2412.17339 null Kimi
1744 2024-12-22 Revisiting In-Context Learning with Long Context Language Models Jinheon Baek et.al. 2412.16926 null Kimi
1745 2024-12-20 A survey on FPGA-based accelerator for ML models Feng Yan et.al. 2412.15666 null Kimi
1746 2024-12-20 Don’t Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks Brian J Chan et.al. 2412.15605 link Kimi
1747 2024-12-19 Systematic Evaluation of Long-Context LLMs on Financial Concepts Lavanya Gupta et.al. 2412.15386 null Kimi
1748 2024-12-19 LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks Yushi Bai et.al. 2412.15204 link Kimi
1749 2024-12-19 Minimizing speculation overhead in a parallel recognizer for regular texts Angelo Borsotti et.al. 2412.14975 null Kimi
1750 2024-12-19 DynamicKV: Task-Aware Adaptive KV Cache Compression for Long Context LLMs Xiabin Zhou et.al. 2412.14838 null Kimi
1751 2024-12-19 Sliding Windows Are Not the End: Exploring Full Ranking with Long-Context Large Language Models Wenhan Liu et.al. 2412.14574 link Kimi
1752 2024-12-19 HashAttention: Semantic Sparsity for Faster Inference Aditya Desai et.al. 2412.14468 null Kimi
1753 2024-12-18 Scaling Deep Learning Training with MPMD Pipeline Parallelism Anxhelo Xhebraj et.al. 2412.14374 null Kimi
1754 2024-12-18 ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals Utkarsh Saxena et.al. 2412.14363 link Kimi
1755 2024-12-18 State Space Models are Strong Text Rerankers Zhichao Xu et.al. 2412.14354 null Kimi
1756 2024-12-19 Online MDP with Transition Prototypes: A Robust Adaptive Approach Shuo Sun et.al. 2412.14075 null Kimi
1757 2024-12-19 Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Benjamin Warner et.al. 2412.13663 link Kimi
1758 2024-12-18 SCOPE: Optimizing Key-Value Cache Compression in Long-context Generation Jialong Wu et.al. 2412.13649 link Kimi
1759 2024-12-18 LIFT: Improving Long Context Understanding Through Long Input Fine-Tuning Yansheng Mao et.al. 2412.13626 null Kimi
1760 2024-12-18 Attention-aware convolutional neural networks for identification of magnetic islands in the tearing mode on EAST tokamak Feifei Long et.al. 2412.13498 null Kimi
1761 2024-12-18 Deploying Foundation Model Powered Agent Services: A Survey Wenchao Xu et.al. 2412.13437 null Kimi
1762 2024-12-17 COSEE: Consistency-Oriented Signal-Based Early Exiting via Calibrated Sample Weighting Mechanism Jianing He et.al. 2412.13236 link Kimi
1763 2024-12-17 GIRAFFE: Design Choices for Extending the Context Length of Visual Language Models Mukai Li et.al. 2412.12735 link Kimi
1764 2024-12-17 More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression Jiebin Zhang et.al. 2412.12706 null Kimi
1765 2024-12-17 LLMs are Also Effective Embedding Models: An In-depth Overview Chongyang Tao et.al. 2412.12591 null Kimi
1766 2024-12-17 PerSphere: A Comprehensive Framework for Multi-Faceted Perspective Retrieval and Summarization Yun Luo et.al. 2412.12588 link Kimi
1767 2024-12-17 ITP: Instance-Aware Test Pruning for Out-of-Distribution Detection Haonan Xu et.al. 2412.12566 link Kimi
1768 2024-12-17 A System for Microserving of LLMs Hongyi Jin et.al. 2412.12488 null Kimi
1769 2024-12-17 Boosting Long-Context Information Seeking via Query-Guided Activation Refilling Hongjin Qian et.al. 2412.12486 link Kimi
1770 2024-12-17 Core Context Aware Attention for Long Context Language Modeling Yaofo Chen et.al. 2412.12465 null Kimi
1771 2024-12-17 SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator Guoxuan Chen et.al. 2412.12094 link Kimi
1772 2024-12-16 SpeechPrune: Context-aware Token Pruning for Speech Information Retrieval Yueqian Lin et.al. 2412.12009 link Kimi
1773 2024-12-16 EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents Mengna Zhu et.al. 2412.11814 null Kimi
1774 2024-12-16 CSR:Achieving 1 Bit Key-Value Cache via Sparse Representation Hongxuan Zhang et.al. 2412.11741 null Kimi
1775 2024-12-16 Ultra-High-Definition Dynamic Multi-Exposure Image Fusion via Infinite Pixel Learning Xingchi Chen et.al. 2412.11685 null Kimi
1776 2024-12-16 On the SDP Relaxation of Direct Torque Finite Control Set Model Predictive Control Luca M. Hartmann et.al. 2412.11666 null Kimi
1777 2024-12-16 FinLoRA: Finetuning Quantized Financial Large Language Models Using Low-Rank Adaptation Dannong Wang et.al. 2412.11378 link Kimi
1778 2024-12-15 Timing of Seven Isolated Pulsars in the Globular Cluster Terzan 1 Justine Singleton et.al. 2412.11271 null Kimi
1779 2024-12-15 Wasserstein Bounds for generative diffusion models with Gaussian tail targets Xixian Wang et.al. 2412.11251 null Kimi
1780 2024-12-15 ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction Yi Feng et.al. 2412.11210 link Kimi
1781 2024-12-13 SCBench: A KV Cache-Centric Analysis of Long-Context Methods Yucheng Li et.al. 2412.10319 null Kimi
1782 2024-12-13 Lost in the Middle, and In-Between: Enhancing Language Models’ Ability to Reason Over Long Contexts in Multi-Hop QA George Arthur Baker et.al. 2412.10079 link Kimi
1783 2024-12-13 Benchmarking Table Comprehension In The Wild Yikang Pan et.al. 2412.09884 null Kimi
1784 2024-12-13 V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding Junqi Ge et.al. 2412.09616 link Kimi
1785 2024-12-12 InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions Pan Zhang et.al. 2412.09596 link Kimi
1786 2024-12-12 InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption Tiehan Fan et.al. 2412.09283 null Kimi
1787 2024-12-12 ZigZagkv: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty Meizhi Zhong et.al. 2412.09036 null Kimi
1788 2024-12-12 RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios Ruiwen Zhou et.al. 2412.08972 link Kimi
1789 2024-12-12 Lexico: Extreme KV Cache Compression via Sparse Coding over Universal Dictionaries Junhyuck Kim et.al. 2412.08890 link Kimi
1790 2024-12-11 TURBOATTENTION: Efficient Attention Approximation For High Throughputs LLMs Hao Kang et.al. 2412.08585 null Kimi
1791 2024-12-11 EMS: Adaptive Evict-then-Merge Strategy for Head-wise KV Cache Compression Based on Global-Local Importance Yingxin Li et.al. 2412.08521 null Kimi
1792 2024-12-10 From Slow Bidirectional to Fast Causal Video Generators Tianwei Yin et.al. 2412.07772 null Kimi
1793 2024-12-10 ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer Jinyi Hu et.al. 2412.07720 link Kimi
1794 2024-12-09 FP=xINT:A Low-Bit Series Expansion Algorithm for Post-Training Quantization Boyang Zhang et.al. 2412.06865 null Kimi
1795 2024-12-09 Pruning All-Rounder: Rethinking and Improving Inference Efficiency for Large Vision Language Models Wei Suo et.al. 2412.06458 null Kimi
1796 2024-12-08 BiDM: Pushing the Limit of Quantization for Diffusion Models Xingyu Zheng et.al. 2412.05926 link Kimi
1797 2024-12-08 XKV: Personalized KV Cache Memory Reduction for Long-Context LLM Inference Weizhuo Li et.al. 2412.05896 null Kimi
1798 2024-12-07 Batch-Max: Higher LLM Throughput using Larger Batch Sizes and KV Cache Compression Michael R. Metel et.al. 2412.05693 null Kimi
1799 2024-12-11 Flash Communication: Reducing Tensor Parallelization Bottleneck for Fast Large Language Model Inference Qingyuan Li et.al. 2412.04964 null Kimi
1800 2024-12-06 GUIDE: A Global Unified Inference Engine for Deploying Large Language Models in Heterogeneous Environments Yanyu Chen et.al. 2412.04788 null Kimi
1801 2024-12-05 Cross-Self KV Cache Pruning for Efficient Vision-Language Inference Xiaohuan Pei et.al. 2412.04652 link Kimi
1802 2024-12-05 votess: A multi-target, GPU-capable, parallel Voronoi tessellator C. Byrohl et.al. 2412.04514 link Kimi
1803 2024-12-05 p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay Jun Zhang et.al. 2412.04449 link Kimi
1804 2024-12-07 PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation Ao Wang et.al. 2412.03409 link Kimi
1805 2024-12-04 ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression Guangda Liu et.al. 2412.03213 null Kimi
1806 2024-12-04 Unifying KV Cache Compression for Large Language Models with LeanKV Yanqi Zhang et.al. 2412.03131 null Kimi
1807 2024-12-04 Lightweight Multiplane Images Network for Real-Time Stereoscopic Conversion from Planar Video Shanding Diao et.al. 2412.03102 null Kimi
1808 2024-12-03 Resource-Adaptive Successive Doubling for Hyperparameter Optimization with Large Datasets on High-Performance Computing Systems Marcel Aach et.al. 2412.02729 link Kimi
1809 2024-12-03 Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity Da Ma et.al. 2412.02252 null Kimi
1810 2024-12-02 RandAR: Decoder-only Autoregressive Visual Generation in Random Orders Ziqi Pang et.al. 2412.01827 null Kimi
1811 2024-12-05 Yi-Lightning Technical Report 01.AI et.al. 2412.01253 null Kimi
1812 2024-12-02 INTELLECT-1 Technical Report Sami Jaghouar et.al. 2412.01152 link Kimi
1813 2024-12-03 Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification Wenxuan Huang et.al. 2412.00876 link Kimi
1814 2024-12-01 MERLIN: Multi-stagE query performance prediction for dynamic paRallel oLap pIpeliNe Kaixin Zhang et.al. 2412.00749 null Kimi
1815 2024-11-29 DeMo: Decoupled Momentum Optimization Bowen Peng et.al. 2411.19870 link Kimi
1816 2024-11-27 FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model Serving Ao Shen et.al. 2411.18424 null Kimi
1817 2024-11-28 MiniKV: Pushing the Limits of LLM Inference via 2-Bit Layer-Discriminative KV Cache Akshat Sharma et.al. 2411.18077 null Kimi
1818 2024-11-27 Addressing Architectural Obstacles for Overlay with Stream Network Abstraction Chengyue Wang et.al. 2411.17966 null Kimi
1819 2024-11-26 Attamba: Attending To Multi-Token States Yash Akhauri et.al. 2411.17685 link Kimi
1820 2024-11-26 Toward High-Performance LLM Serving: A Simulation-Based Approach for Identifying Optimal Parallelism Yi-Chien Lin et.al. 2411.17651 link Kimi
1821 2024-11-26 Efficient LLM Inference with I/O-Aware Partial KV Cache Recomputation Chaoyi Jiang et.al. 2411.17089 null Kimi
1822 2024-11-25 Lion Cub: Minimizing Communication Overhead in Distributed Lion Satoki Ishikawa et.al. 2411.16462 null Kimi
1823 2024-11-24 Hiding Communication Cost in Distributed LLM Training via Micro-batch Co-execution Haiquan Wang et.al. 2411.15871 null Kimi
1824 2024-11-27 A Method for Building Large Language Models with Predefined KV Cache Capacity Zhonghua Yi et.al. 2411.15785 null Kimi
1825 2024-11-22 DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models Keda Tao et.al. 2411.15024 link Kimi
1826 2024-11-21 Functional Array Programming in an Extended Pi-Calculus Hans Hüttel et.al. 2411.14579 null Kimi
1827 2024-11-22 Quantization without Tears Minghao Fu et.al. 2411.13918 null Kimi
1828 2024-11-19 Faster Multi-GPU Training with PPLL: A Pipeline Parallelism Framework Leveraging Local Learning Xiuyuan Guo et.al. 2411.12780 null Kimi
1829 2024-11-18 Parsing Millions of DNS Records per Second Jeroen Koekkoek et.al. 2411.12035 link Kimi
1830 2024-11-17 SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration Jintao Zhang et.al. 2411.10958 link Kimi
1831 2024-11-16 Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model Ting Liu et.al. 2411.10803 link Kimi
1832 2024-11-15 SmoothCache: A Universal Inference Acceleration Technique for Diffusion Transformers Joseph Liu et.al. 2411.10510 link Kimi
1833 2024-11-14 Squeezed Attention: Accelerating Long Context Length LLM Inference Coleman Hooper et.al. 2411.09688 link Kimi
1834 2024-11-15 Communication Compression for Tensor Parallel LLM Inference Jan Hansen-Palmus et.al. 2411.09510 null Kimi
1835 2024-11-12 Towards Low-bit Communication for Tensor Parallel LLM Inference Harry Dong et.al. 2411.07942 null Kimi
1836 2024-11-11 Anchor Attention, Small Cache: Code Generation with Large Language Models Xiangyu Zhang et.al. 2411.06680 link Kimi
1837 2024-11-10 Accelerating Large Language Model Training with 4D Parallelism and Memory Consumption Estimator Kazuki Fujii et.al. 2411.06465 null Kimi
1838 2024-11-08 Balancing Pipeline Parallelism with Vocabulary Parallelism Man Tsung Yeung et.al. 2411.05288 link Kimi
1839 2024-11-07 BitNet a4.8: 4-bit Activations for 1-bit LLMs Hongyu Wang et.al. 2411.04965 null Kimi
1840 2024-11-06 Stepping Forward on the Last Mile Chen Feng et.al. 2411.04036 null Kimi
1841 2024-11-05 TokenSelect: Efficient Long-Context Inference and Length Extrapolation for LLMs via Dynamic Token-Level KV Cache Selection Wei Wu et.al. 2411.02886 null Kimi
1842 2024-11-05 DroidSpeak: Enhancing Cross-LLM Communication Yuhan Liu et.al. 2411.02820 null Kimi
1843 2024-11-04 “Give Me BF16 or Give Me Death”? Accuracy-Performance Trade-Offs in LLM Quantization Eldar Kurtic et.al. 2411.02355 null Kimi
1844 2024-11-04 Real-time and Downtime-tolerant Fault Diagnosis for Railway Turnout Machines (RTMs) Empowered with Cloud-Edge Pipeline Parallelism Fan Wu et.al. 2411.02086 null Kimi
1845 2024-11-04 xDiT: an Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism Jiarui Fang et.al. 2411.01738 link Kimi
1846 2024-11-02 NEO: Saving GPU Memory Crisis with CPU Offloading for Online LLM Inference Xuanlin Jiang et.al. 2411.01142 null Kimi
1847 2024-11-01 MoNTA: Accelerating Mixture-of-Experts Training with Network-Traffc-Aware Parallel Optimization Jingming Guo et.al. 2411.00662 link Kimi
1848 2024-11-01 Constrained Diffusion Implicit Models Vivek Jayaram et.al. 2411.00359 null Kimi
1849 2024-11-05 SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compile Ruisi Zhang et.al. 2411.00284 null Kimi
1850 2024-10-31 Neurobench: DCASE 2020 Acoustic Scene Classification benchmark on XyloAudio 2 Weijie Ke et.al. 2410.23776 null Kimi
1851 2024-10-31 ALISE: Accelerating Large Language Model Serving with Speculative Scheduling Youpeng Zhao et.al. 2410.23537 null Kimi
1852 2024-10-29 VL-Cache: Sparsity and Modality-Aware KV Cache Compression for Vision-Language Model Inference Acceleration Dezhan Tu et.al. 2410.23317 null Kimi
1853 2024-10-30 BUZZ: Beehive-structured Sparse KV Cache with Segmented Heavy Hitters for Efficient LLM Inference Junqi Zhao et.al. 2410.23079 link Kimi
1854 2024-10-29 The Impact of Inference Acceleration Strategies on Bias of LLMs Elisabeth Kirsten et.al. 2410.22118 link Kimi
1855 2024-10-29 How Does Critical Batch Size Scale in Pre-training? Hanlin Zhang et.al. 2410.21676 link Kimi
1856 2024-10-28 ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference Hanshi Sun et.al. 2410.21465 link Kimi
1857 2024-10-28 Meta-Learning for Speeding Up Large Model Inference in Decentralized Environments Yuzhe Yang et.al. 2410.21340 null Kimi
1858 2024-10-28 Beyond Autoregression: Fast LLMs via Self-Distillation Through Time Justin Deschenaux et.al. 2410.21035 link Kimi
1859 2024-10-26 DQRM: Deep Quantized Recommendation Models Yang Zhou et.al. 2410.20046 link Kimi
1860 2024-10-25 RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction Tanqiu Jiang et.al. 2410.19937 null Kimi
1861 2024-10-25 BitPipe: Bidirectional Interleaved Pipeline Parallelism for Accelerating Large Models Training Houming Wu et.al. 2410.19367 link Kimi
1862 2024-10-28 Not All Heads Matter: A Head-Level KV Cache Compression Method with Integrated Retrieval and Reasoning Yu Fu et.al. 2410.19258 link Kimi
1863 2024-10-24 KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing Yifei Yang et.al. 2410.18517 link Kimi
1864 2024-10-24 The Nature of Mathematical Modeling and Probabilistic Optimization Engineering in Generative AI Fulu Li et.al. 2410.18441 null Kimi
1865 2024-10-25 Fast Inference for Augmented Large Language Models Rana Shahout et.al. 2410.18248 null Kimi
1866 2024-10-23 Value Residual Learning For Alleviating Attention Concentration In Transformers Zhanchao Zhou et.al. 2410.17897 link Kimi
1867 2024-10-23 Markov Chain of Thought for Efficient Mathematical Reasoning Wen Yang et.al. 2410.17635 null Kimi
1868 2024-10-22 PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction Long Xing et.al. 2410.17247 link Kimi
1869 2024-10-21 MagicPIG: LSH Sampling for Efficient LLM Generation Zhuoming Chen et.al. 2410.16179 link Kimi
1870 2024-10-21 Residual vector quantization for KV cache compression in large language model Ankur Kumar et.al. 2410.15704 link Kimi
1871 2024-10-20 SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training Jinda Jia et.al. 2410.15526 link Kimi
1872 2024-10-20 EPIC: Efficient Position-Independent Context Caching for Serving Large Language Models Junhao Hu et.al. 2410.15332 null Kimi
1873 2024-10-20 Lossless KV Cache Compression to 2% Zhen Yang et.al. 2410.15252 null Kimi
1874 2024-10-19 Pipeline Gradient-based Model Training on Analog In-memory Accelerators Zhaoxian Wu et.al. 2410.15155 link Kimi
1875 2024-10-18 A Systematic Study of Cross-Layer KV Sharing for Efficient LLM Inference You Wu et.al. 2410.14442 link Kimi
1876 2024-10-23 TiMePReSt: Time and Memory Efficient Pipeline Parallel DNN Training with Removed Staleness Ankita Dutta et.al. 2410.14312 null Kimi
1877 2024-10-17 SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction Xuan Zhang et.al. 2410.13846 link Kimi
1878 2024-10-17 AsymKV: Enabling 1-Bit Quantization of KV Cache with Layer-Wise Asymmetric Quantization Configurations Qian Tao et.al. 2410.13212 null Kimi
1879 2024-10-19 In-context KV-Cache Eviction for LLMs via Attention-Gate Zihao Zeng et.al. 2410.12876 null Kimi
1880 2024-10-16 FiRST: Finetuning Router-Selective Transformers for Input-Adaptive Latency Reduction Akriti Jain et.al. 2410.12513 null Kimi
1881 2024-10-16 COMET: Towards Partical W4A4KV4 LLMs Serving Lian Liu et.al. 2410.12168 null Kimi
1882 2024-10-15 From promise to practice: realizing high-performance decentralized training Zesen Wang et.al. 2410.11998 null Kimi
1883 2024-10-15 QSpec: Speculative Decoding with Complementary Quantization Schemes Juntao Zhao et.al. 2410.11305 null Kimi
1884 2024-10-14 DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads Guangxuan Xiao et.al. 2410.10819 link Kimi
1885 2024-10-14 When Attention Sink Emerges in Language Models: An Empirical View Xiangming Gu et.al. 2410.10781 link Kimi
1886 2024-10-14 Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling Wenze Liu et.al. 2410.10511 link Kimi
1887 2024-10-15 EasyRAG: Efficient Retrieval-Augmented Generation Framework for Automated Network Operations Zhangchi Feng et.al. 2410.10315 link Kimi
1888 2024-10-11 ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression Yefei He et.al. 2410.08584 null Kimi
1889 2024-10-10 KV Prediction for Improved Time to First Token Maxwell Horton et.al. 2410.08391 link Kimi
1890 2024-10-10 TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text Songshuo Lu et.al. 2410.07590 link Kimi
1891 2024-10-09 SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration Heming Xia et.al. 2410.06916 link Kimi
1892 2024-10-07 PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs Mengzhao Chen et.al. 2410.05265 link Kimi
1893 2024-10-07 Presto! Distilling Steps and Layers for Accelerating Music Generation Zachary Novack et.al. 2410.05167 null Kimi
1894 2024-10-07 TidalDecode: Fast and Accurate LLM Decoding with Position Persistent Sparse Attention Lijie Yang et.al. 2410.05076 link Kimi
1895 2024-10-07 Fast State Restoration in LLM Serving with HCache Shiwei Gao et.al. 2410.05004 null Kimi
1896 2024-10-06 Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective Jinhao Li et.al. 2410.04466 null Kimi
1897 2024-10-04 SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation Aurick Qiao et.al. 2410.03960 null Kimi
1898 2024-10-04 LoRC: Low-Rank Compression for LLMs KV Cache with a Progressive Compression Strategy Rongzhi Zhang et.al. 2410.03111 null Kimi
1899 2024-10-04 UNComp: Uncertainty-Aware Long-Context Compressor for Efficient Large Language Model Inference Jing Xiong et.al. 2410.03090 null Kimi
1900 2024-10-09 LEGO: QEC Decoding System Architecture for Dynamic Circuits Yue Wu et.al. 2410.03073 null Kimi
1901 2024-10-04 Compute Or Load KV Cache? Why Not Both? Shuowei Jin et.al. 2410.03065 null Kimi
1902 2024-10-03 EinDecomp: Decomposition of Declaratively-Specified Machine Learning and Numerical Computations for Parallel Execution Daniel Bourgeois et.al. 2410.02682 null Kimi
1903 2024-10-03 SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration Jintao Zhang et.al. 2410.02367 link Kimi
1904 2024-10-02 Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads Yuxiang Huang et.al. 2410.01805 link Kimi
1905 2024-10-02 InfiniPot: Infinite Context Processing on Memory-Constrained LLMs Minsoo Kim et.al. 2410.01518 null Kimi
1906 2024-10-02 A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts Suyu Ge et.al. 2410.01485 null Kimi
1907 2024-10-01 Developing a BLAS library for the AMD AI Engine Tristan Laan et.al. 2410.00825 null Kimi
1908 2024-10-01 TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices Zonghang Li et.al. 2410.00531 link Kimi
1909 2024-10-01 LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management Yi Xiong et.al. 2410.00428 null Kimi
1910 2024-09-30 KV-Compress: Paged KV-Cache Compression with Variable Compression Rates per Attention Head Isaac Rehg et.al. 2410.00161 link Kimi
1911 2024-09-30 The Early Bird Catches the Leak: Unveiling Timing Side Channels in LLM Serving Systems Linke Song et.al. 2409.20002 null Kimi
1912 2024-09-27 Toward Greener Matrix Operations by Lossless Compressed Formats Francesco Tosoni et.al. 2409.18620 link Kimi
1913 2024-09-26 Efficient Arbitrary Precision Acceleration for Large Language Models on GPU Tensor Cores Shaobo Ma et.al. 2409.17870 null Kimi
1914 2024-09-25 Search for Efficient Large Language Models Xuan Shen et.al. 2409.17372 link Kimi
1915 2024-09-25 Mnemosyne: Parallelization Strategies for Efficiently Serving Multi-Million Context Length LLM Inference Requests Without Approximations Amey Agrawal et.al. 2409.17264 null Kimi
1916 2024-09-25 AlignedKV: Reducing Memory Access of KV-Cache with Precision-Aligned Quantization Yifan Tan et.al. 2409.16546 link Kimi
1917 2024-09-25 A QoE-Aware Split Inference Accelerating Algorithm for NOMA-based Edge Intelligence Xin Yuan et.al. 2409.16537 null Kimi
1918 2024-09-23 CSPS: A Communication-Efficient Sequence-Parallelism based Serving System for Transformer based Models with Long Prompts Zeyu Zhang et.al. 2409.15104 null Kimi
1919 2024-09-23 Inference-Friendly Models With MixAttention Shashank Rajput et.al. 2409.15012 null Kimi
1920 2024-09-23 Mutation-Based Deep Learning Framework Testing Method in JavaScript Environment Yinglong Zou et.al. 2409.14968 null Kimi
1921 2024-09-16 Do Large Language Models Need a Content Delivery Network? Yihua Cheng et.al. 2409.13761 link Kimi
1922 2024-09-20 Time Distributed Deep Learning models for Purely Exogenous Forecasting. Application to Water Table Depth Prediction using Weather Image Time Series Matteo Salis et.al. 2409.13284 null Kimi
1923 2024-09-23 CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs Junlin Lv et.al. 2409.12490 link Kimi
1924 2024-09-04 ISO: Overlap of Computation and Communication within Seqenence For LLM Inference Bin Xiao et.al. 2409.11155 null Kimi
1925 2024-09-17 KVPruner: Structural Pruning for Faster and Memory-Efficient Large Language Models Bo Lv et.al. 2409.11057 null Kimi
1926 2024-09-21 CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios Luning Wang et.al. 2409.10593 link Kimi
1927 2024-09-14 A Dynamic Weighting Strategy to Mitigate Worker Node Failure in Distributed Deep Learning Yuesheng Xu et.al. 2409.09242 null Kimi
1928 2024-09-11 Inf-MLLM: Efficient Streaming Inference of Multimodal Large Language Models on a Single GPU Zhenyu Ning et.al. 2409.09086 null Kimi
1929 2024-09-13 SGFormer: Single-Layer Graph Transformers with Approximation-Free Linear Complexity Qitian Wu et.al. 2409.09007 link Kimi
1930 2024-09-11 Learning to Compress Contexts for Efficient Knowledge-based Visual Question Answering Weixi Weng et.al. 2409.07331 null Kimi
1931 2024-09-11 FreeRide: Harvesting Bubbles in Pipeline Parallelism Jiashu Zhang et.al. 2409.06941 null Kimi
1932 2024-09-09 DFabric: Scaling Out Data Parallel Applications with CXL-Ethernet Hybrid Interconnects Xu Zhang et.al. 2409.05404 null Kimi
1933 2024-09-08 InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference Xiurui Pan et.al. 2409.04992 null Kimi
1934 2024-09-04 Accelerating Large Language Model Training with Hybrid GPU-based Compression Lang Xu et.al. 2409.02423 null Kimi
1935 2024-09-03 Contemporary Model Compression on Large Language Models Inference Dong Liu et.al. 2409.01990 link Kimi
1936 2024-09-03 On-chain Validation of Tracking Data Messages (TDM) Using Distributed Deep Learning on a Proof of Stake (PoS) Blockchain Yasir Latif et.al. 2409.01614 null Kimi
1937 2024-09-02 LuWu: An End-to-End In-Network Out-of-Core Optimizer for 100B-Scale Model-in-Network Data-Parallel Training on Distributed GPUs Mo Sun et.al. 2409.00918 null Kimi
1938 2024-08-26 Model Parallel Training and Transfer Learning for Convolutional Neural Networks by Domain Decomposition Axel Klawonn et.al. 2408.14442 null Kimi
1939 2024-08-23 Network-Offloaded Bandwidth-Optimal Broadcast and Allgather for Distributed AI Mikhail Khalilov et.al. 2408.13356 null Kimi
1940 2024-08-22 LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with Inference Acceleration via Latent Consistency Distillation Shihao Chen et.al. 2408.12354 null Kimi
1941 2024-08-23 MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding Jian Chen et.al. 2408.11049 link Kimi
1942 2024-08-20 Security Assessment of Hierarchical Federated Deep Learning D Alqattan et.al. 2408.10752 link Kimi
1943 2024-08-20 Pluto and Charon: A Time and Memory Efficient Collaborative Edge AI Framework for Personal LLMs Fine-Tuning Bei Ouyang et.al. 2408.10746 null Kimi
1944 2024-08-21 LongVILA: Scaling Long-Context Visual Language Models for Long Videos Fuzhao Xue et.al. 2408.10188 link Kimi
1945 2024-08-17 RepControlNet: ControlNet Reparameterization Zhaoli Deng et.al. 2408.09240 null Kimi
1946 2024-08-17 Atlas: Hierarchical Partitioning for Quantum Circuit Simulation on GPUs (Extended Version) Mingkuan Xu et.al. 2408.09055 null Kimi
1947 2024-08-23 ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models Chao Zeng et.al. 2408.08554 link Kimi
1948 2024-08-16 Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models Jerry Huang et.al. 2408.08470 null Kimi
1949 2024-08-15 Asteroid: Resource-Efficient Hybrid Pipeline Parallelism for Collaborative DNN Training on Heterogeneous Edge Devices Shengyuan Ye et.al. 2408.08015 null Kimi
1950 2024-08-17 Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference Rohan Baskar Prabhakar et.al. 2408.07802 null Kimi
1951 2024-08-18 Post-Training Sparse Attention with Double Sparsity Shuo Yang et.al. 2408.07092 link Kimi
1952 2024-08-12 LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration Zhiwen Mo et.al. 2408.06003 null Kimi
1953 2024-08-10 Eigen Attention: Attention in Low-Rank Space for KV Cache Compression Utkarsh Saxena et.al. 2408.05646 link Kimi
1954 2024-08-05 SLO-aware GPU Frequency Scaling for Energy Efficient LLM Inference Serving Andreas Kosmas Kakolyris et.al. 2408.05235 null Kimi
1955 2024-08-08 Partial Experts Checkpoint: Efficient Fault Tolerance for Sparse Mixture-of-Experts Model Training Weilin Cai et.al. 2408.04307 null Kimi
1956 2024-08-07 Zero-Delay QKV Compression for Mitigating KV Cache and Network Bottlenecks in LLM Inference Zeyu Zhang et.al. 2408.04107 null Kimi
1957 2024-08-08 NACL: A General and Effective KV Cache Eviction Framework for LLMs at Inference Time Yilong Chen et.al. 2408.03675 link Kimi
1958 2024-08-04 Cross-layer Attention Sharing for Large Language Models Yongyu Mu et.al. 2408.01890 null Kimi
1959 2024-08-01 Intermittent Semi-working Mask: A New Masking Paradigm for LLMs Mingcong Lu et.al. 2408.00539 null Kimi
1960 2024-08-13 Finch: Prompt-guided Key-Value Cache Compression Giulio Corallo et.al. 2408.00167 null Kimi
1961 2024-07-31 EdgeLLM: A Highly Efficient CPU-FPGA Heterogeneous Edge Accelerator for Large Language Models Mingqiang Huang et.al. 2407.21325 null Kimi
1962 2024-07-30 Palu: Compressing KV-Cache with Low-Rank Projection Chi-Chih Chang et.al. 2407.21118 link Kimi
1963 2024-07-30 ThinK: Thinner Key Cache by Query-Driven Pruning Yuhui Xu et.al. 2407.21018 null Kimi
1964 2024-07-31 A2SF: Accumulative Attention Scoring with Forgetting Factor for Token Pruning in Transformer Decoder Hyun-rae Jo et.al. 2407.20485 null Kimi
1965 2024-07-25 An Efficient Inference Framework for Early-exit Large Language Models Ruijie Miao et.al. 2407.20272 null Kimi
1966 2024-07-29 When to Stop? Towards Efficient Code Generation in LLMs with Excess Token Prevention Lianghong Guo et.al. 2407.20042 link Kimi
1967 2024-07-29 Inference acceleration for large language models using “stairs” assisted greedy generation Domas Grigaliūnas et.al. 2407.19947 null Kimi
1968 2024-07-29 Rina: Enhancing Ring-AllReduce with In-network Aggregation in Distributed Model Training Zixuan Chen et.al. 2407.19721 null Kimi
1969 2024-07-25 Efficient Inference of Vision Instruction-Following Models with Elastic Cache Zuyan Liu et.al. 2407.18121 link Kimi
1970 2024-07-28 Keep the Cost Down: A Review on Methods to Optimize LLM’s KV-Cache Consumption Luohe Shi et.al. 2407.18003 null Kimi
1971 2024-07-25 Efficient LLM Training and Serving with Heterogeneous Context Sharding among Attention Heads Xihui Lin et.al. 2407.17678 null Kimi
1972 2024-07-23 A deeper look at depth pruning of LLMs Shoaib Ahmed Siddiqui et.al. 2407.16286 link Kimi
1973 2024-07-22 RazorAttention: Efficient KV Cache Compression Through Retrieval Heads Hanlin Tang et.al. 2407.15891 null Kimi
1974 2024-07-22 AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description Junyu Xie et.al. 2407.15850 link Kimi
1975 2024-07-22 LLMmap: Fingerprinting For Large Language Models Dario Pasquini et.al. 2407.15847 link Kimi
1976 2024-07-22 CarFormer: Self-Driving with Learned Object-Centric Representations Shadi Hamdan et.al. 2407.15843 null Kimi
1977 2024-07-22 SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models Mingze Xu et.al. 2407.15841 link Kimi
1978 2024-07-22 MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity Yangzhou Liu et.al. 2407.15838 link Kimi
1979 2024-07-22 dMel: Speech Tokenization made Simple He Bai et.al. 2407.15835 null Kimi
1980 2024-07-22 Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight Ziyuan Huang et.al. 2407.15819 null Kimi
1981 2024-07-23 A simple and fast C++ thread pool implementation capable of running task graphs Dmytro Puyda et.al. 2407.15805 link Kimi
1982 2024-07-22 Robust Facial Reactions Generation: An Emotion-Aware Framework with Modality Compensation Guanyu Hu et.al. 2407.15798 null Kimi
1983 2024-07-22 Extracting Structured Insights from Financial News: An Augmented LLM Driven Approach Rian Dolphin et.al. 2407.15788 null Kimi
1984 2024-07-22 Parallel Split Learning with Global Sampling Mohammad Kohankhaki et.al. 2407.15738 link Kimi
1985 2024-07-22 vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving Jiale Xu et.al. 2407.15309 link Kimi
1986 2024-07-19 Performance Modeling and Workload Analysis of Distributed Large Language Model Training and Inference Joyjit Kundu et.al. 2407.14645 null Kimi
1987 2024-07-19 Internal Consistency and Self-Feedback in Large Language Models: A Survey Xun Liang et.al. 2407.14507 link Kimi
1988 2024-07-19 On Pre-training of Multimodal Language Models Customized for Chart Understanding Wan-Cyuan Fan et.al. 2407.14506 null Kimi
1989 2024-07-19 PD-TPE: Parallel Decoder with Text-guided Position Encoding for 3D Visual Grounding Chenshu Hou et.al. 2407.14491 null Kimi
1990 2024-07-19 Evaluating the Reliability of Self-Explanations in Large Language Models Korbinian Randl et.al. 2407.14487 link Kimi
1991 2024-07-19 Contrastive Learning with Counterfactual Explanations for Radiology Report Generation Mingjie Li et.al. 2407.14474 null Kimi
1992 2024-07-19 Check-Eval: A Checklist-based Approach for Evaluating Text Quality Jayr Pereira et.al. 2407.14467 null Kimi
1993 2024-07-19 AttentNet: Fully Convolutional 3D Attention for Lung Nodule Detection Majedaldein Almahasneh et.al. 2407.14464 null Kimi
1994 2024-07-19 PolyFormer: Scalable Node-wise Filters via Polynomial Graph Transformer Jiahong Ma et.al. 2407.14459 link Kimi
1995 2024-07-19 Undermining Mental Proof: How AI Can Make Cooperation Harder by Making Thinking Easier Zachary Wojtowicz et.al. 2407.14452 null Kimi
1996 2024-07-19 From Instruction to Insight: Exploring the Functional and Semantic Roles of Text in Interactive Dashboards Nicole Sultanum et.al. 2407.14451 null Kimi
1997 2024-07-19 LoAS: Fully Temporal-Parallel Datatflow for Dual-Sparse Spiking Neural Networks Ruokai Yin et.al. 2407.14073 link Kimi
1998 2024-07-19 LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference Qichen Fu et.al. 2407.14057 null Kimi
1999 2024-07-18 SegPoint: Segment Any Point Cloud via Large Language Model Shuting He et.al. 2407.13761 null Kimi
2000 2024-07-18 Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models Zhuo Chen et.al. 2407.13757 null Kimi
2001 2024-07-18 CellularLint: A Systematic Approach to Identify Inconsistent Behavior in Cellular Network Specifications Mirza Masfiqur Rahman et.al. 2407.13742 null Kimi
2002 2024-07-18 Baba Is AI: Break the Rules to Beat the Benchmark Nathan Cloos et.al. 2407.13729 null Kimi
2003 2024-07-18 Compressing Structured Tensor Algebra Mahdi Ghorbani et.al. 2407.13726 null Kimi
2004 2024-07-18 CoDefeater: Using LLMs To Find Defeaters in Assurance Cases Usman Gohar et.al. 2407.13717 link Kimi
2005 2024-07-18 Attention Based Simple Primitives for Open World Compositional Zero-Shot Learning Ans Munir et.al. 2407.13715 link Kimi
2006 2024-07-18 Understanding Reference Policies in Direct Preference Optimization Yixin Liu et.al. 2407.13709 link Kimi
2007 2024-07-18 ANHALTEN: Cross-Lingual Transfer for German Token-Level Reference-Free Hallucination Detection Janek Herrlein et.al. 2407.13702 link Kimi
2008 2024-07-18 Cross-Task Attack: A Self-Supervision Generative Framework Based on Attention Shift Qingyuan Zeng et.al. 2407.13700 null Kimi
2009 2024-07-17 Analysis of Crab X-ray Polarization using Deeper IXPE Observations Josephine Wong et.al. 2407.12779 null Kimi
2010 2024-07-17 The BRST quantisation of chiral BMS-like field theories José Figueroa-O’Farrill et.al. 2407.12778 null Kimi
2011 2024-07-17 Jigsaw Game: Federated Clustering Jinxuan Xu et.al. 2407.12764 null Kimi
2012 2024-07-17 LookupViT: Compressing visual information to a limited number of tokens Rajat Koner et.al. 2407.12753 null Kimi
2013 2024-07-17 CHOSEN: Compilation to Hardware Optimization Stack for Efficient Vision Transformer Inference Mohammad Erfan Sadeghi et.al. 2407.12736 null Kimi
2014 2024-07-17 EchoSight: Advancing Visual-Language Models with Wiki Knowledge Yibin Yan et.al. 2407.12735 null Kimi
2015 2024-07-17 FlexFL: Heterogeneous Federated Learning via APoZ-Guided Flexible Pruning in Uncertain Scenarios Zekai Chen et.al. 2407.12729 null Kimi
2016 2024-07-17 Exploring the interplay of individual traits and interaction dynamics in preschool social networks Gülşah Akçakır et.al. 2407.12728 null Kimi
2017 2024-07-17 NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model Zhongqun Zhang et.al. 2407.12727 null Kimi
2018 2024-07-17 Is Sarcasm Detection A Step-by-Step Reasoning Process in Large Language Models? Ben Yao et.al. 2407.12725 null Kimi
2019 2024-07-16 GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression Daniel Goldstein et.al. 2407.12077 link Kimi
2020 2024-07-16 Hydra: Brokering Cloud and HPC Resources to Support the Execution of Heterogeneous Workloads at Scale Aymen Alsaadi et.al. 2407.11967 null Kimi
2021 2024-07-16 UrbanWorld: An Urban World Model for 3D City Generation Yu Shang et.al. 2407.11965 link Kimi
2022 2024-07-16 NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window? Mo Li et.al. 2407.11963 link Kimi
2023 2024-07-17 Hierarchical Separable Video Transformer for Snapshot Compressive Imaging Ping Wang et.al. 2407.11946 link Kimi
2024 2024-07-16 Min-max theory and existence of H-spheres with arbitrary codimensions Rui Gao et.al. 2407.11945 null Kimi
2025 2024-07-16 Beyond Spatial Explanations: Explainable Face Recognition in the Frequency Domain Marco Huber et.al. 2407.11941 null Kimi
2026 2024-07-16 Generalized Difference-in-Differences Yiqing Xu et.al. 2407.11937 null Kimi
2027 2024-07-16 Learning Multi-view Anomaly Detection Haoyang He et.al. 2407.11935 null Kimi
2028 2024-07-16 Code Documentation and Analysis to Secure Software Development Paul Attie et.al. 2407.11934 null Kimi
2029 2024-07-16 What’s Wrong? Refining Meeting Summaries with LLM Feedback Frederic Kirstein et.al. 2407.11919 null Kimi
2030 2024-07-16 PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation Branden Butler et.al. 2407.11798 null Kimi
2031 2024-07-21 Ada-KV: Optimizing KV Cache Eviction by Adaptive Budget Allocation for Efficient LLM Inference Yuan Feng et.al. 2407.11550 link Kimi
2032 2024-07-15 VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation Bocheng Zou et.al. 2407.10972 link Kimi
2033 2024-07-15 Q-Sparse: All Large Language Models can be Fully Sparsely-Activated Hongyu Wang et.al. 2407.10969 null Kimi
2034 2024-07-15 Induction of non-Fermi liquids by critical cavity photons at the onset of superradiance Ipsita Mandal et.al. 2407.10963 null Kimi
2035 2024-07-15 Fast Matrix Multiplications for Lookup Table-Quantized LLMs Han Guo et.al. 2407.10960 link Kimi
2036 2024-07-15 InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models Nirat Saini et.al. 2407.10958 null Kimi
2037 2024-07-15 MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language Models Chengguang Gan et.al. 2407.10953 null Kimi
2038 2024-07-15 The infamous 95 GeV $\rm b \bar b$ excess at LEP: Two b or not two b? Patrick Janot et.al. 2407.10948 null Kimi
2039 2024-07-15 Can Textual Semantics Mitigate Sounding Object Segmentation Preference? Yaoting Wang et.al. 2407.10947 link Kimi
2040 2024-07-15 GRUtopia: Dream General Robots in a City at Scale Hanqing Wang et.al. 2407.10943 link Kimi
2041 2024-07-15 IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation Yuanhao Zhai et.al. 2407.10937 link Kimi
2042 2024-07-12 FairyLandAI: Personalized Fairy Tales utilizing ChatGPT and DALLE-3 Georgios Makridis et.al. 2407.09467 null Kimi
2043 2024-07-12 Human-like Episodic Memory for Infinite Context LLMs Zafeirios Fountas et.al. 2407.09450 link Kimi
2044 2024-07-12 ASTPrompter: Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts Amelia F. Hardy et.al. 2407.09447 link Kimi
2045 2024-07-12 MUSCLE: A Model Update Strategy for Compatible LLM Evolution Jessica Echterhoff et.al. 2407.09435 null Kimi
2046 2024-07-12 Open (Clinical) LLMs are Sensitive to Instruction Phrasings Alberto Mario Ceballos Arroyo et.al. 2407.09429 link Kimi
2047 2024-07-12 TelecomGPT: A Framework to Build Telecom-Specfic Large Language Models Hang Zou et.al. 2407.09424 null Kimi
2048 2024-07-12 Mitigating Entity-Level Hallucination in Large Language Models Weihang Su et.al. 2407.09417 link Kimi
2049 2024-07-12 SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers Shraman Pramanick et.al. 2407.09413 link Kimi
2050 2024-07-12 Thunderbolt: Causal Concurrent Consensus and Execution Junchao Chen et.al. 2407.09409 null Kimi
2051 2024-07-12 PersonaRAG: Enhancing Retrieval-Augmented Generation Systems with User-Centric Agents Saber Zerhoudi et.al. 2407.09394 link Kimi
2052 2024-07-11 MAVIS: Mathematical Visual Instruction Tuning Renrui Zhang et.al. 2407.08739 link Kimi
2053 2024-07-11 Real-Time Anomaly Detection and Reactive Planning with Large Language Models Rohan Sinha et.al. 2407.08735 null Kimi
2054 2024-07-11 Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist Zihao Zhou et.al. 2407.08733 null Kimi
2055 2024-07-11 Planar decomposition of the HOMFLY polynomial for bipartite knots and links A. Anokhina et.al. 2407.08724 null Kimi
2056 2024-07-11 A Taxonomy for Data Contamination in Large Language Models Medha Palavalli et.al. 2407.08716 null Kimi
2057 2024-07-11 GTA: A Benchmark for General Tool Agents Jize Wang et.al. 2407.08713 link Kimi
2058 2024-07-11 Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models Zhening Xing et.al. 2407.08701 null Kimi
2059 2024-07-11 Flex-TPU: A Flexible TPU with Runtime Reconfigurable Dataflow Architecture Mohammed Elbtity et.al. 2407.08700 null Kimi
2060 2024-07-11 Mitigating Catastrophic Forgetting in Language Transfer via Model Merging Anton Alexandrov et.al. 2407.08699 null Kimi
2061 2024-07-11 Patterns of link reciprocity in directed, signed networks Anna Gallo et.al. 2407.08697 null Kimi
2062 2024-07-10 Training on the Test Task Confounds Evaluation and Emergence Ricardo Dominguez-Olmedo et.al. 2407.07890 link Kimi
2063 2024-07-10 Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization Junkang Wu et.al. 2407.07880 link Kimi
2064 2024-07-10 Bound States in Continuum via Singular Transfer Matrices Ovidiu-Zeno Lipan et.al. 2407.07879 null Kimi
2065 2024-07-10 FACTS About Building Retrieval Augmented Generation-based Chatbots Rama Akkiraju et.al. 2407.07858 null Kimi
2066 2024-07-10 OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training Sami Jaghouar et.al. 2407.07852 link Kimi
2067 2024-07-10 Harnessing Integrated CPU-GPU System Memory for HPC: a first look into Grace Hopper Gabin Schieffer et.al. 2407.07850 null Kimi
2068 2024-07-10 Natural Language Mechanisms via Self-Resolution with Foundation Models Nicolas Della Penna et.al. 2407.07845 null Kimi
2069 2024-07-10 Study on Aspect Ratio Variability toward Robustness of Vision Transformer-based Vehicle Re-identification Mei Qiu et.al. 2407.07842 null Kimi
2070 2024-07-10 Transformer Alignment in Large Language Models Murdock Aubry et.al. 2407.07810 null Kimi
2071 2024-07-10 Attribute or Abstain: Large Language Models as Long Document Assistants Jan Buchmann et.al. 2407.07799 link Kimi
2072 2024-07-09 AnyTaskTune: Advanced Domain-Specific Solutions through Task-Fine-Tuning Jiaxi Cui et.al. 2407.07094 link Kimi
2073 2024-07-09 FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation Liqun Ma et.al. 2407.07093 link Kimi
2074 2024-07-09 Fine-Tuning Linear Layers Only Is a Simple yet Effective Way for Task Arithmetic Ruochen Jin et.al. 2407.07089 link Kimi
2075 2024-07-09 Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models Logan Cross et.al. 2407.07086 link Kimi
2076 2024-07-09 Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities Shaltiel Shmidman et.al. 2407.07080 null Kimi
2077 2024-07-09 ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction Shaozhe Hao et.al. 2407.07077 link Kimi
2078 2024-07-09 Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps Yung-Sung Chuang et.al. 2407.07071 link Kimi
2079 2024-07-09 Prompting Techniques for Secure Code Generation: A Systematic Investigation Catherine Tony et.al. 2407.07064 null Kimi
2080 2024-07-09 Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence Weize Chen et.al. 2407.07061 link Kimi
2081 2024-07-09 CAPformer: Compression-Aware Pre-trained Transformer for Low-Light Image Enhancement Wang Wei et.al. 2407.07056 null Kimi
2082 2024-07-08 Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision Orr Zohar et.al. 2407.06189 link Kimi
2083 2024-07-08 CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation Xinying Guo et.al. 2407.06188 null Kimi
2084 2024-07-08 Left-Linear Rewriting in Adhesive Categories Paolo Baldan et.al. 2407.06181 null Kimi
2085 2024-07-08 The Tug-of-War Between Deepfake Generation and Detection Hannah Lee et.al. 2407.06174 null Kimi
2086 2024-07-08 On Speeding Up Language Model Evaluation Jin Peng Zhou et.al. 2407.06172 null Kimi
2087 2024-07-08 Inevitable Endgame of Comet Tsuchinshan-ATLAS (C/2023 A3) Zdenek Sekanina et.al. 2407.06166 null Kimi
2088 2024-07-08 What’s Wrong with Your Code Generated by Large Language Models? An Extensive Study Shihan Dou et.al. 2407.06153 null Kimi
2089 2024-07-08 WIBACong: An Argument-centric Framework for Understanding US Congressional Hearings Arman Irani et.al. 2407.06149 null Kimi
2090 2024-07-08 Using Grammar Masking to Ensure Syntactic Validity in LLM-based Modeling Tasks Lukas Netz et.al. 2407.06146 null Kimi
2091 2024-07-08 ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation Ethan Chern et.al. 2407.06135 link Kimi
2092 2024-07-05 LaRa: Efficient Large-Baseline Radiance Fields Anpei Chen et.al. 2407.04699 null Kimi
2093 2024-07-05 Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs Rudolf Laine et.al. 2407.04694 link Kimi
2094 2024-07-05 ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models Yuzhe Gu et.al. 2407.04693 link Kimi
2095 2024-07-05 Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge Yuanze Lin et.al. 2407.04681 null Kimi
2096 2024-07-05 Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition Ye Bai et.al. 2407.04675 null Kimi
2097 2024-07-05 Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement Yongji Wu et.al. 2407.04656 null Kimi
2098 2024-07-05 Entity Decomposition with Filtering: A Zero-Shot Clinical Named Entity Recognition Framework Reza Averly et.al. 2407.04629 null Kimi
2099 2024-07-05 On scalable oversight with weak LLMs judging strong LLMs Zachary Kenton et.al. 2407.04622 null Kimi
2100 2024-07-08 OneRestore: A Universal Restoration Framework for Composite Degradation Yu Guo et.al. 2407.04621 link Kimi
2101 2024-07-05 Learning to (Learn at Test Time): RNNs with Expressive Hidden States Yu Sun et.al. 2407.04620 link Kimi
2102 2024-07-03 Universal Length Generalization with Turing Programs Kaiying Hou et.al. 2407.03310 null Kimi
2103 2024-07-03 Eyes on the Game: Deciphering Implicit Human Signals to Infer Human Proficiency, Trust, and Intent Nikhil Hulle et.al. 2407.03298 null Kimi
2104 2024-07-03 Large Language Models for JSON Schema Discovery Michael J. Mior et.al. 2407.03286 null Kimi
2105 2024-07-03 LLM Internal States Reveal Hallucination Risk Faced With a Query Ziwei Ji et.al. 2407.03282 link Kimi
2106 2024-07-03 Cooperative Multi-Agent Deep Reinforcement Learning Methods for UAV-aided Mobile Edge Computing Networks Mintae Kim et.al. 2407.03280 null Kimi
2107 2024-07-03 Nesterov’s Accelerated Jacobi-Type Methods for Large-scale Symmetric Positive Semidefinite Linear Systems Ling Liang et.al. 2407.03272 null Kimi
2108 2024-07-03 STF: Sentence Transformer Fine-Tuning For Topic Categorization With Limited Data Kheir Eddine Daouadi et.al. 2407.03253 null Kimi
2109 2024-07-03 ACTRESS: Active Retraining for Semi-supervised Visual Grounding Weitai Kang et.al. 2407.03251 null Kimi
2110 2024-07-04 When big data actually are low-rank, or entrywise approximation of certain function-generated matrices Stanislav Budzinskiy et.al. 2407.03250 link Kimi
2111 2024-07-03 Bridging Model Heterogeneity in Federated Learning via Uncertainty-based Asymmetrical Reciprocity Learning Jiaqi Wang et.al. 2407.03247 link Kimi
2112 2024-07-02 MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention Huiqiang Jiang et.al. 2407.02490 link Kimi
2113 2024-07-02 Neurocache: Efficient Vector Retrieval for Long-range Language Modeling Ali Safaya et.al. 2407.02486 link Kimi
2114 2024-07-02 RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs Yue Yu et.al. 2407.02485 null Kimi
2115 2024-07-02 Characterizing the Interpretability of Attention Maps in Digital Pathology Tomé Albuquerque et.al. 2407.02484 null Kimi
2116 2024-07-02 MMedAgent: Learning to Use Medical Tools with Multi-modal Agent Binxu Li et.al. 2407.02483 link Kimi
2117 2024-07-02 Understanding Alignment in Multimodal LLMs: A Comprehensive Study Elmira Amirloo et.al. 2407.02477 null Kimi
2118 2024-07-02 Open Scene Graphs for Open World Object-Goal Navigation Joel Loo et.al. 2407.02473 null Kimi
2119 2024-07-02 Reliable Confidence Intervals for Information Retrieval Evaluation Using Generative A.I. Harrie Oosterhuis et.al. 2407.02464 null Kimi
2120 2024-07-02 Decentralized Intelligence Network (DIN) Abraham Nash et.al. 2407.02461 null Kimi
2121 2024-07-02 Revisión de Métodos de Planificación de Camino de Cobertura para Entornos Agrícolas Ismael Ait et.al. 2407.02449 null Kimi

Early Stopping

ID Publish Date Title Authors PDF Code Kimi
1 2024-12-12 InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption Tiehan Fan et.al. 2412.09283 null Kimi
2 2024-12-11 GradStop: Exploring Training Dynamics in Unsupervised Outlier Detection through Gradient Cohesion Yuang Zhang et.al. 2412.08501 link Kimi
3 2024-12-11 Collaborative Inference for Large Models with Task Offloading and Early Exiting Zuan Xie et.al. 2412.08284 null Kimi
4 2024-12-11 Diff-GO$^\text{n}$: Enhancing Diffusion Models for Goal-Oriented Communications Suchinthaka Wanninayaka et.al. 2412.06980 null Kimi
5 2024-12-06 Sparse autoencoders reveal selective remapping of visual concepts during adaptation Hyesu Lim et.al. 2412.05276 link Kimi
6 2024-12-06 BEExformer: A Fast Inferencing Transformer Architecture via Binarization with Multiple Early Exits Wazib Ansar et.al. 2412.05225 null Kimi
7 2024-12-05 A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs Wangbo Zhao et.al. 2412.03324 link Kimi
8 2024-12-03 Time-Series-Informed Closed-loop Learning for Sequential Decision Making and Control Sebastian Hirt et.al. 2412.02423 null Kimi
9 2024-12-02 Early Exit Is a Natural Capability in Transformer-based Models: An Empirical Study on Early Exit without Joint Optimization Weiqiao Shan et.al. 2412.01455 null Kimi
10 2024-12-02 EdgeOAR: Real-time Online Action Recognition On Edge Devices Wei Luo et.al. 2412.01267 null Kimi
11 2024-12-02 Reliable and scalable variable importance estimation via warm-start and early stopping Zexuan Sun et.al. 2412.01120 link Kimi
12 2024-11-28 Deep Neural Network-Based Prediction of B-Cell Epitopes for SARS-CoV and SARS-CoV-2: Enhancing Vaccine Design through Machine Learning Xinyu Shi et.al. 2412.00109 null Kimi
13 2024-11-26 Selfish Evolution: Making Discoveries in Extreme Label Noise with the Help of Overfitting Dynamics Nima Sedaghat et.al. 2412.00077 null Kimi
14 2024-11-28 DIESEL – Dynamic Inference-Guidance via Evasion of Semantic Embeddings in LLMs Ben Ganon et.al. 2411.19038 null Kimi
15 2024-11-27 One-Step Early Stopping Strategy using Neural Tangent Kernel Theory and Rademacher Complexity Daniel Martin Xavier et.al. 2411.18806 null Kimi
16 2024-11-27 HEMGS: A Hybrid Entropy Model for 3D Gaussian Splatting Data Compression Lei Liu et.al. 2411.18473 null Kimi
17 2024-11-26 Instance-Aware Graph Prompt Learning Jiazheng Li et.al. 2411.17676 null Kimi
18 2024-11-22 Instance-Aware Generalized Referring Expression Segmentation E-Ro Nguyen et.al. 2411.15087 null Kimi
19 2024-11-19 Deep Learning-Driven Heat Map Analysis for Evaluating thickness of Wounded Skin Layers Devakumar GR et.al. 2411.12678 null Kimi
20 2024-11-15 Exploiting Negative Curvature in Conjunction with Adaptive Sampling: Theoretical Results and a Practical Algorithm Albert S. Berahas et.al. 2411.10378 null Kimi
21 2024-11-13 Voxeland: Probabilistic Instance-Aware Semantic Mapping with Evidence-based Uncertainty Quantification Jose-Luis Matez-Bandera et.al. 2411.08727 link Kimi
22 2024-11-11 The Unreasonable Effectiveness of Monte Carlo Simulations in A/B Testing Márton Trencséni et.al. 2411.06701 link Kimi
23 2024-11-07 Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale Flavio Di Palo et.al. 2411.05045 null Kimi
24 2024-11-07 LoFi: Scalable Local Image Reconstruction with Implicit Neural Representation AmirEhsan Khorashadizadeh et.al. 2411.04995 link Kimi
25 2024-11-05 SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents Dawei Li et.al. 2411.03284 link Kimi
26 2024-11-06 Gradient Descent Finds Over-Parameterized Neural Networks with Sharp Generalization for Nonparametric Regression: A Distribution-Free Analysis Yingzhen Yang et.al. 2411.02904 null Kimi
27 2024-11-05 Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery Bowei Du et.al. 2411.02861 null Kimi
28 2024-11-05 CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge Collaboration Hongpeng Jin et.al. 2411.02829 null Kimi
29 2024-11-06 Energy-Aware Dynamic Neural Inference Marcello Bullo et.al. 2411.02471 null Kimi
30 2024-11-04 DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution Yang Yue et.al. 2411.02359 link Kimi
31 2024-11-02 Bi-Level Graph Structure Learning for Next POI Recommendation Liang Wang et.al. 2411.01169 null Kimi
32 2024-10-30 Accelerated AI Inference via Dynamic Execution Methods Haim Barad et.al. 2411.00853 null Kimi
33 2024-11-01 Preventing Model Collapse in Deep Canonical Correlation Analysis by Noise Regularization Junlin He et.al. 2411.00383 null Kimi
34 2024-10-29 Power side-channel leakage localization through adversarial training of deep neural networks Jimmy Gammell et.al. 2410.22425 link Kimi
35 2024-10-27 Branch-and-bound algorithm for efficient reliability analysis of general coherent systems Ji-Eun Byun et.al. 2410.22363 null Kimi
36 2024-10-28 Agreement Tasks in Fault-Prone Synchronous Networks of Arbitrary Structure Pierre Fraigniaud et.al. 2410.21538 null Kimi
37 2024-10-28 Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA Sangmin Bae et.al. 2410.20672 null Kimi
38 2024-10-27 Sequential Large Language Model-Based Hyper-Parameter Optimization Kanan Mahammadli et.al. 2410.20302 link Kimi
39 2024-10-26 Looking Beyond The Top-1: Transformers Determine Top Tokens In Order Daria Lioubashevski et.al. 2410.20210 link Kimi
40 2024-10-26 Dynamic layer selection in decoder-only transformers Theodore Glavas et.al. 2410.20022 link Kimi
41 2024-10-25 COMSPLIT: A Communication-Aware Split Learning Design for Heterogeneous IoT Platforms Vukan Ninkovic et.al. 2410.19375 null Kimi
42 2024-10-30 Dynamic Vocabulary Pruning in Early-Exit LLMs Jort Vincenti et.al. 2410.18952 link Kimi
43 2024-10-24 AdaEDL: Early Draft Stopping for Speculative Decoding of Large Language Models via an Entropy-based Lower Bound on Token Acceptance Probability Sudhanshu Agrawal et.al. 2410.18351 null Kimi
44 2024-10-23 Inferring stability properties of chaotic systems on autoencoders’ latent spaces Elise Özalp et.al. 2410.18003 link Kimi
45 2024-10-23 Diffusion Priors for Variational Likelihood Estimation and Image Denoising Jun Cheng et.al. 2410.17521 link Kimi
46 2024-10-21 Federated Learning with MMD-based Early Stopping for Adaptive GNSS Interference Classification Nishant S. Gaikwad et.al. 2410.15681 null Kimi
47 2024-10-24 BoostAdapter: Improving Vision-Language Test-Time Adaptation via Regional Bootstrapping Taolin Zhang et.al. 2410.15430 link Kimi
48 2024-10-16 FiRST: Finetuning Router-Selective Transformers for Input-Adaptive Latency Reduction Akriti Jain et.al. 2410.12513 null Kimi
49 2024-10-15 Juggernaut: Efficient Crypto-Agnostic Byzantine Agreement Daniel Collins et.al. 2410.12121 null Kimi
50 2024-10-14 Focused ReAct: Improving ReAct through Reiterate and Early Stop Shuoqiu Li et.al. 2410.10779 null Kimi
51 2024-10-14 big.LITTLE Vision Transformer for Efficient Visual Recognition He Guo et.al. 2410.10267 null Kimi
52 2024-10-12 DuoDiff: Accelerating Diffusion Models with a Dual-Backbone Approach Daniel Gallo Fernández et.al. 2410.09633 link Kimi
53 2024-10-11 Scaling Gaussian Processes for Learning Curve Prediction via Latent Kronecker Structure Jihao Andreas Lin et.al. 2410.09239 null Kimi
54 2024-10-08 Benchmarking of a new data splitting method on volcanic eruption data Simona Reale et.al. 2410.06306 null Kimi
55 2024-10-08 MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More Wei Huang et.al. 2410.06270 link Kimi
56 2024-10-08 Mini-Batch Kernel $k$-means Ben Jourdan et.al. 2410.05902 null Kimi
57 2024-10-06 Distributed Inference on Mobile Edge and Cloud: An Early Exit based Clustering Approach Divya Jyoti Bajpai et.al. 2410.05338 null Kimi
58 2024-10-07 L-C4: Language-Based Video Colorization for Creative and Consistent Color Zheng Chang et.al. 2410.04972 null Kimi
59 2024-10-06 CAPEEN: Image Captioning with Early Exits and Knowledge Distillation Divya Jyoti Bajpai et.al. 2410.04433 link Kimi
60 2024-10-06 DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs Divya Jyoti Bajpai et.al. 2410.04424 link Kimi
61 2024-10-03 Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis Zikun Zhang et.al. 2410.02321 null Kimi
62 2024-10-03 Global dynamical structures from infinitesimal data Benjamin McInroe et.al. 2410.02111 null Kimi
63 2024-10-02 CHASE-SQL: Multi-Path Reasoning and Preference Optimized Candidate Selection in Text-to-SQL Mohammadreza Pourreza et.al. 2410.01943 null Kimi
64 2024-10-02 Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension Zaiquan Yang et.al. 2410.01544 null Kimi
65 2024-10-01 Timber! Poisoning Decision Trees Stefano Calzavara et.al. 2410.00862 null Kimi
66 2024-09-30 Inference of water waves surface elevation from horizontal velocity components using physics informed neural networks (PINN) Omar Sallam et.al. 2409.19851 null Kimi
67 2024-09-27 Improving Visual Object Tracking through Visual Prompting Shih-Fang Chen et.al. 2409.18901 link Kimi
68 2024-09-24 Reinforcement Leaning for Infinite-Dimensional Systems Wei Zhang et.al. 2409.15737 null Kimi
69 2024-10-03 Normalized Narrow Jump To Conclusions: Normalized Narrow Shortcuts for Parameter Efficient Early Exit Transformer Prediction Amrit Diggavi Seshadri et.al. 2409.14091 null Kimi
70 2024-09-21 Multiple-Exit Tuning: Towards Inference-Efficient Adaptation for Vision Transformer Zheng Liu et.al. 2409.13999 null Kimi
71 2024-09-18 Particle-based Instance-aware Semantic Occupancy Mapping in Dynamic Environments Gang Chen et.al. 2409.11975 link Kimi
72 2024-09-17 UniLCD: Unified Local-Cloud Decision-Making via Reinforcement Learning Kathakoli Sengupta et.al. 2409.11403 null Kimi
73 2024-09-16 Improving Multi-candidate Speculative Decoding Xiaofan Lu et.al. 2409.10644 link Kimi
74 2024-09-14 Group Sequential Testing of a Treatment Effect Using a Surrogate Marker Layla Parast et.al. 2409.09440 link Kimi
75 2024-09-13 Exploring System-Heterogeneous Federated Learning with Dynamic Model Selection Dixi Yao et.al. 2409.08858 null Kimi
76 2024-09-11 AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge Han Wang et.al. 2409.07394 link Kimi
77 2024-09-11 From optimal score matching to optimal sampling Zehao Dou et.al. 2409.07032 null Kimi
78 2024-09-10 Noisy Early Stopping for Noisy Labels William Toner et.al. 2409.06830 null Kimi
79 2024-09-10 Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds Mu Cai et.al. 2409.06827 link Kimi
80 2024-08-26 Optimizing STAR Aligner for High Throughput Computing in the Cloud Piotr Kica et.al. 2409.05886 null Kimi
81 2024-09-09 Early-exit Convolutional Neural Networks Edanur Demir et.al. 2409.05336 link Kimi
82 2024-09-08 Attention-Based Efficient Breath Sound Removal in Studio Audio Recordings Nidula Elgiriyewithana et.al. 2409.04949 null Kimi
83 2024-09-16 RTop-K: Ultra-Fast Row-Wise Top-K Algorithm and GPU Implementation for Neural Networks Xi Xie et.al. 2409.00822 null Kimi
84 2024-08-30 Dynamic Self-Consistency: Leveraging Reasoning Paths for Efficient LLM Sampling Guangya Wan et.al. 2408.17017 null Kimi
85 2024-08-24 Inferring the shape of a solid inside a draining tank from its liquid level dynamics Gbenga Fabusola et.al. 2408.14503 null Kimi
86 2024-08-26 Re-Mix: Optimizing Data Mixtures for Large Scale Imitation Learning Joey Hejna et.al. 2408.14037 link Kimi
87 2024-08-24 Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning Xinglin Wang et.al. 2408.13457 null Kimi
88 2024-08-24 Face Clustering via Early Stopping and Edge Recall Junjie Liu et.al. 2408.13431 link Kimi
89 2024-08-21 Critique-out-Loud Reward Models Zachary Ankner et.al. 2408.11791 link Kimi
90 2024-08-21 EEG-Defender: Defending against Jailbreak through Early Exit Generation of Large Language Models Chongwen Zhao et.al. 2408.11308 null Kimi
91 2024-08-20 Inferring Underwater Topography with FINN Coşku Can Horuz et.al. 2408.10649 null Kimi
92 2024-08-15 An Efficient Continuous Control Perspective for Reinforcement-Learning-based Sequential Recommendation Jun Wang et.al. 2408.08047 null Kimi
93 2024-08-14 Rethinking the Key Factors for the Generalization of Remote Sensing Stereo Matching Networks Liting Jiang et.al. 2408.07613 null Kimi
94 2024-08-12 HeLiMOS: A Dataset for Moving Object Segmentation in 3D Point Clouds From Heterogeneous LiDAR Sensors Hyungtae Lim et.al. 2408.06328 null Kimi
95 2024-08-12 Transfer learning of state-based potential games for process optimization in decentralized manufacturing systems Steve Yuwono et.al. 2408.05992 null Kimi
96 2024-08-12 A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models Taehong Moon et.al. 2408.05927 link Kimi
97 2024-08-08 Early-Exit meets Model-Distributed Inference at Edge Networks Marco Colocrese et.al. 2408.05247 null Kimi
98 2024-08-09 PriPHiT: Privacy-Preserving Hierarchical Training of Deep Neural Networks Yamin Sepehri et.al. 2408.05092 null Kimi
99 2024-08-09 Early Exit Strategies for Approximate k-NN Search in Dense Retrieval Francesco Busolin et.al. 2408.04981 null Kimi
100 2024-08-07 Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling Zilyu Ye et.al. 2408.03695 link Kimi
101 2024-08-03 Advancing Green AI: Efficient and Accurate Lightweight CNNs for Rice Leaf Disease Identification Khairun Saddami et.al. 2408.01752 null Kimi
102 2024-08-01 Early Stopping Based on Repeated Significance Eric Bax et.al. 2408.00908 null Kimi
103 2024-07-31 Automated Sperm Morphology Analysis Based on Instance-Aware Part Segmentation Wenyuan Chen et.al. 2408.00112 null Kimi
104 2024-07-30 Accelerating Large Language Model Inference with Self-Supervised Early Exits Florian Valade et.al. 2407.21082 null Kimi
105 2024-07-25 An Efficient Inference Framework for Early-exit Large Language Models Ruijie Miao et.al. 2407.20272 null Kimi
106 2024-07-26 Topology Optimization of Random Memristors for Input-Aware Dynamic SNN Bo Wang et.al. 2407.18625 link Kimi
107 2024-07-25 Superior Scoring Rules for Probabilistic Evaluation of Single-Label Multi-Class Classification Tasks Rouhollah Ahmadian et.al. 2407.17697 null Kimi
108 2024-07-23 Can Large Language Models Automatically Jailbreak GPT-4V? Yuanwei Wu et.al. 2407.16686 null Kimi
109 2024-07-22 WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding Quan Kong et.al. 2407.15350 null Kimi
110 2024-07-19 Joint or Disjoint: Mixing Training Regimes for Early-Exit Models Bartłomiej Krzepkowski et.al. 2407.14320 link Kimi
111 2024-07-19 BERTer: The Efficient One Pradyumna Saligram et.al. 2407.14039 null Kimi
112 2024-07-18 On the consistency of rotation curves and spatially integrated HI flux profiles Tariq Yasin et.al. 2407.13754 null Kimi
113 2024-07-19 Revisiting Adaptive Cellular Recognition Under Domain Shifts: A Contextual Correspondence View Jianan Fan et.al. 2407.12870 link Kimi
114 2024-07-17 Hallucination Index: An Image Quality Metric for Generative Reconstruction Models Matthew Tivnan et.al. 2407.12780 null Kimi
115 2024-07-16 Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning Yanting Miao et.al. 2407.12164 link Kimi
116 2024-07-16 Enhancing Split Computing and Early Exit Applications through Predefined Sparsity Luigi Capogrosso et.al. 2407.11763 link Kimi
117 2024-07-16 Preconditioned Gradient Descent Finds Over-Parameterized Neural Networks with Sharp Generalization for Nonparametric Regression Yingzhen Yang et.al. 2407.11353 null Kimi
118 2024-07-10 Exploring the Boundaries of On-Device Inference: When Tiny Falls Short, Go Hierarchical Adarsh Prasad Behera et.al. 2407.11061 null Kimi
119 2024-07-15 Multilingual Contrastive Decoding via Language-Agnostic Layers Skipping Wenhao Zhu et.al. 2407.10795 link Kimi
120 2024-07-13 Towards understanding epoch-wise double descent in two-layer linear neural networks Amanda Olmin et.al. 2407.09845 null Kimi
121 2024-07-11 Sensor-Aware Classifiers for Energy-Efficient Time Series Applications on IoT Devices Dina Hussein et.al. 2407.08715 null Kimi
122 2024-07-07 Learning Motion Blur Robust Vision Transformers with Dynamic Early Exit for Real-Time UAV Tracking You Wu et.al. 2407.05383 null Kimi
123 2024-07-04 Unsupervised speech enhancement with spectral kurtosis and double deep priors Hien Ohnaka et.al. 2407.03887 null Kimi
124 2024-07-02 Advancing Compressed Video Action Recognition through Progressive Knowledge Distillation Efstathia Soufleri et.al. 2407.02713 link Kimi
125 2024-07-02 Zero-shot Video Restoration and Enhancement Using Pre-Trained Image Diffusion Model Cong Cao et.al. 2407.01960 null Kimi
126 2024-07-01 Exact statistical analysis for response-adaptive clinical trials: a general and computationally tractable approach Stef Baas et.al. 2407.01055 null Kimi
127 2024-07-01 SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection Dingkang Liang et.al. 2407.01016 null Kimi
128 2024-06-27 Adaptive Stochastic Weight Averaging Caglar Demir et.al. 2406.19092 link Kimi
129 2024-06-26 An Order Theory Framework of Recurrence Equations for Static Cost Analysis $-$ Dynamic Inference of Non-Linear Inequality Invariants Louis Rustenholz et.al. 2406.18260 null Kimi
130 2024-06-24 SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments Neng Wang et.al. 2406.16279 link Kimi
131 2024-06-21 Micro-power spoken keyword spotting on Xylo Audio 2 Hannah Bos et.al. 2406.15112 null Kimi
132 2024-06-21 Early stopping for conjugate gradients in statistical inverse problems Laura Hucker et.al. 2406.15001 null Kimi
133 2024-06-21 Cost-Effective RF Fingerprinting Based on Hybrid CVNN-RF Classifier with Automated Multi-Dimensional Early-Exit Strategy Jiayan Gan et.al. 2406.14869 null Kimi
134 2024-06-20 On Layer-wise Representation Similarity: Application for Multi-Exit Models with a Single Classifier Jiachen Jiang et.al. 2406.14479 null Kimi