Usage instructions: here
Other links:
LLM
ID | Publish Date | Title | Authors | Code | Kimi | |
---|---|---|---|---|---|---|
1 | 2025-05-22 | GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning | Chengqi Duan et.al. | 2505.17022 | null | Kimi |
2 | 2025-05-22 | CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms | Shilin Yan et.al. | 2505.17020 | null | Kimi |
3 | 2025-05-22 | Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO | Chengzhuo Tong et.al. | 2505.17017 | null | Kimi |
4 | 2025-05-22 | Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models | Runsen Xu et.al. | 2505.17015 | null | Kimi |
5 | 2025-05-22 | SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding | Haoning Wu et.al. | 2505.17012 | link | Kimi |
6 | 2025-05-22 | R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning | Huatong Song et.al. | 2505.17005 | null | Kimi |
7 | 2025-05-22 | Do Large Language Models Excel in Complex Logical Reasoning with Formal Language? | Jin Jiang et.al. | 2505.16998 | null | Kimi |
8 | 2025-05-22 | X-MAS: Towards Building Multi-Agent Systems with Heterogeneous LLMs | Rui Ye et.al. | 2505.16997 | null | Kimi |
9 | 2025-05-22 | $\text{R}^2\text{ec}$ : Towards Large Recommender Models with Reasoning | Runyang You et.al. | 2505.16994 | null | Kimi |
10 | 2025-05-22 | Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding | Runpeng Yu et.al. | 2505.16990 | null | Kimi |
11 | 2025-05-22 | T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning | Amartya Chakraborty et.al. | 2505.16986 | null | Kimi |
12 | 2025-05-22 | Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine | Adib Bazgir et.al. | 2505.16982 | null | Kimi |
13 | 2025-05-22 | Bottlenecked Transformers: Periodic KV Cache Abstraction for Generalised Reasoning | Adnan Oomerjee et.al. | 2505.16950 | null | Kimi |
14 | 2025-05-22 | MixAT: Combining Continuous and Discrete Adversarial Training for LLMs | Csaba Dékány et.al. | 2505.16947 | null | Kimi |
15 | 2025-05-22 | AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios | Yunjia Qi et.al. | 2505.16944 | null | Kimi |
16 | 2025-05-22 | NovelSeek: When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification | NovelSeek Team et.al. | 2505.16938 | null | Kimi |
17 | 2025-05-22 | In-Context Watermarks for Large Language Models | Yepeng Liu et.al. | 2505.16934 | null | Kimi |
18 | 2025-05-22 | Beyond Needle(s) in the Embodied Haystack: Environment, Architecture, and Training Considerations for Long Context Reasoning | Bosung Kim et.al. | 2505.16928 | null | Kimi |
19 | 2025-05-22 | Don’t “Overthink” Passage Reranking: Is Reasoning Truly Necessary? | Nour Jedidi et.al. | 2505.16886 | null | Kimi |
20 | 2025-05-22 | CASTILLO: Characterizing Response Length Distributions of Large Language Models | Daniel F. Perez-Ramirez et.al. | 2505.16881 | null | Kimi |
21 | 2025-05-22 | LaViDa: A Large Diffusion Language Model for Multimodal Understanding | Shufan Li et.al. | 2505.16839 | null | Kimi |
22 | 2025-05-22 | R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search | Yibo Wang et.al. | 2505.16838 | null | Kimi |
23 | 2025-05-22 | Fact-R1: Towards Explainable Video Misinformation Detection with Deep Reasoning | Fanrui Zhang et.al. | 2505.16836 | null | Kimi |
24 | 2025-05-22 | SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis | Shuang Sun et.al. | 2505.16834 | null | Kimi |
25 | 2025-05-22 | From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization | Haonian Ji et.al. | 2505.16832 | null | Kimi |
26 | 2025-05-22 | Unlearning Isn’t Deletion: Investigating Reversibility of Machine Unlearning in LLMs | Xiaoyu Xu et.al. | 2505.16831 | null | Kimi |
27 | 2025-05-22 | KTAE: A Model-Free Algorithm to Key-Tokens Advantage Estimation in Mathematical Reasoning | Wei Sun et.al. | 2505.16826 | null | Kimi |
28 | 2025-05-22 | REPA Works Until It Doesn’t: Early-Stopped, Holistic Alignment Supercharges Diffusion Training | Ziqiao Wang et.al. | 2505.16792 | null | Kimi |
29 | 2025-05-22 | CoTSRF: Utilize Chain of Thought as Stealthy and Robust Fingerprint of Large Language Models | Zhenzhen Ren et.al. | 2505.16785 | null | Kimi |
30 | 2025-05-22 | Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning | Xinghao Chen et.al. | 2505.16782 | null | Kimi |
31 | 2025-05-22 | R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO | Huanjin Yao et.al. | 2505.16673 | null | Kimi |
32 | 2025-05-22 | Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding | Feilong Tang et.al. | 2505.16652 | null | Kimi |
33 | 2025-05-22 | Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains | Wenhui Tan et.al. | 2505.16552 | null | Kimi |
34 | 2025-05-22 | LLaMAs Have Feelings Too: Unveiling Sentiment and Emotion Representations in LLaMA Models Through Probing | Dario Di Palma et.al. | 2505.16491 | null | Kimi |
35 | 2025-05-22 | WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning | Zhepei Wei et.al. | 2505.16421 | null | Kimi |
36 | 2025-05-22 | DriveMoE: Mixture-of-Experts for Vision-Language-Action Model in End-to-End Autonomous Driving | Zhenjie Yang et.al. | 2505.16278 | null | Kimi |
37 | 2025-05-22 | LIFEBench: Evaluating Length Instruction Following in Large Language Models | Wei Zhang et.al. | 2505.16234 | null | Kimi |
38 | 2025-05-22 | NQKV: A KV Cache Quantization Scheme Based on Normal Distribution Characteristics | Zhihang Cai et.al. | 2505.16210 | null | Kimi |
39 | 2025-05-22 | QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design | Benjamin Schneider et.al. | 2505.16175 | null | Kimi |
40 | 2025-05-22 | KNN-SSD: Enabling Dynamic Self-Speculative Decoding via Nearest Neighbor Layer Set Optimization | Mingbo Song et.al. | 2505.16162 | null | Kimi |
41 | 2025-05-22 | Training-Free Reasoning and Reflection in MLLMs | Hongchen Wei et.al. | 2505.16151 | null | Kimi |
42 | 2025-05-22 | Steering LVLMs via Sparse Autoencoder for Hallucination Mitigation | Zhenglin Hua et.al. | 2505.16146 | null | Kimi |
43 | 2025-05-22 | Date Fragments: A Hidden Bottleneck of Tokenization for Temporal Reasoning | Gagan Bhatia et.al. | 2505.16088 | null | Kimi |
44 | 2025-05-22 | Optimizing LLM-Based Multi-Agent System with Textual Feedback: A Case Study on Software Development | Ming Shen et.al. | 2505.16086 | null | Kimi |
45 | 2025-05-21 | Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models | Jingcong Liang et.al. | 2505.16056 | null | Kimi |
46 | 2025-05-21 | Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning | Alex Su et.al. | 2505.15966 | null | Kimi |
47 | 2025-05-21 | Extracting Probabilistic Knowledge from Large Language Models for Bayesian Network Parameterization | Aliakbar Nafar et.al. | 2505.15918 | null | Kimi |
48 | 2025-05-21 | dKV-Cache: The Cache for Diffusion Language Models | Xinyin Ma et.al. | 2505.15781 | link | Kimi |
49 | 2025-05-21 | Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space | Zhen Zhang et.al. | 2505.15778 | link | Kimi |
50 | 2025-05-21 | Beyond Hard and Soft: Hybrid Context Compression for Balancing Local and Global Information Retention | Huanxuan Liao et.al. | 2505.15774 | null | Kimi |
51 | 2025-05-21 | ThinkLess: A Training-Free Inference-Efficient Method for Reducing Reasoning Redundancy | Gengyang Li et.al. | 2505.15684 | null | Kimi |
52 | 2025-05-21 | A Federated Splitting Framework for LLMs: Security, Efficiency, and Adaptability | Zishuai Zhang et.al. | 2505.15683 | link | Kimi |
53 | 2025-05-21 | Feature Extraction and Steering for Enhanced Chain-of-Thought Reasoning in Language Models | Zihao Li et.al. | 2505.15634 | null | Kimi |
54 | 2025-05-21 | Learn to Reason Efficiently with Adaptive Length-based Reward Shaping | Wei Liu et.al. | 2505.15612 | link | Kimi |
55 | 2025-05-21 | Multilingual Test-Time Scaling via Initial Thought Transfer | Prasoon Bajpai et.al. | 2505.15508 | null | Kimi |
56 | 2025-05-21 | Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought | Ao Liu et.al. | 2505.15431 | null | Kimi |
57 | 2025-05-21 | FlowKV: Enhancing Multi-Turn Conversational Coherence in LLMs via Isolated Key-Value Cache Management | Xiang Liu et.al. | 2505.15347 | null | Kimi |
58 | 2025-05-21 | Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Prefilling Attack | Silvia Cappelletti et.al. | 2505.15323 | null | Kimi |
59 | 2025-05-21 | Hallucinate at the Last in Long Response Generation: A Case Study on Long Document Summarization | Joonho Yang et.al. | 2505.15291 | null | Kimi |
60 | 2025-05-21 | LiveVLM: Efficient Online Video Understanding via Streaming-Oriented KV Cache and Retrieval | Zhenyu Ning et.al. | 2505.15269 | null | Kimi |
61 | 2025-05-21 | Towards Explainable Temporal Reasoning in Large Language Models: A Structure-Aware Generative Framework | Zihao Jiang et.al. | 2505.15245 | link | Kimi |
62 | 2025-05-21 | Prolonged Reasoning Is Not All You Need: Certainty-Based Adaptive Routing for Efficient LLM/MLLM Reasoning | Jinghui Lu et.al. | 2505.15154 | null | Kimi |
63 | 2025-05-21 | BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms | Yunlong Hou et.al. | 2505.15141 | null | Kimi |
64 | 2025-05-21 | The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning | Shivam Agarwal et.al. | 2505.15134 | null | Kimi |
65 | 2025-05-21 | An Empirical Study on Reinforcement Learning for Reasoning-Search Interleaved LLM Agents | Bowen Jin et.al. | 2505.15117 | link | Kimi |
66 | 2025-05-21 | RoT: Enhancing Table Reasoning with Iterative Row-Wise Traversals | Xuanliang Zhang et.al. | 2505.15110 | null | Kimi |
67 | 2025-05-21 | Traveling Across Languages: Benchmarking Cross-Lingual Consistency in Multimodal LLMs | Hao Wang et.al. | 2505.15075 | link | Kimi |
68 | 2025-05-21 | Learning to Rank Chain-of-Thought: An Energy-Based Approach with Outcome Supervision | Eric Hanchen Jiang et.al. | 2505.14999 | null | Kimi |
69 | 2025-05-20 | STree: Speculative Tree Decoding for Hybrid State-Space Models | Yangchao Wu et.al. | 2505.14969 | null | Kimi |
70 | 2025-05-20 | Too Long, Didn’t Model: Decomposing LLM Long-Context Understanding With Novels | Sil Hamilton et.al. | 2505.14925 | null | Kimi |
71 | 2025-05-20 | Scaling Laws for State Dynamics in Large Language Models | Jacob X Li et.al. | 2505.14892 | null | Kimi |
72 | 2025-05-20 | Balanced and Elastic End-to-end Training of Dynamic LLMs | Mohamed Wahib et.al. | 2505.14864 | null | Kimi |
73 | 2025-05-20 | Text Generation Beyond Discrete Token Sampling | Yufan Zhuang et.al. | 2505.14827 | null | Kimi |
74 | 2025-05-21 | Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning | Haolei Xu et.al. | 2505.14684 | null | Kimi |
75 | 2025-05-20 | Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training | Mengru Wang et.al. | 2505.14681 | null | Kimi |
76 | 2025-05-20 | Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning | Jiaer Xia et.al. | 2505.14677 | null | Kimi |
77 | 2025-05-20 | SAFEPATH: Preventing Harmful Reasoning in Chain-of-Thought via Early Alignment | Wonje Jeung et.al. | 2505.14667 | null | Kimi |
78 | 2025-05-20 | Beyond Words: Multimodal LLM Knows When to Speak | Zikai Liao et.al. | 2505.14654 | null | Kimi |
79 | 2025-05-20 | KERL: Knowledge-Enhanced Personalized Recipe Recommendation using Large Language Models | Fnu Mohbat et.al. | 2505.14629 | link | Kimi |
80 | 2025-05-20 | Enhancing Learned Knowledge in LoRA Adapters Through Efficient Contrastive Decoding on Ascend NPUs | Morgan Lindsay Heisler et.al. | 2505.14620 | null | Kimi |
81 | 2025-05-20 | Can Pruning Improve Reasoning? Revisiting Long-CoT Compression with Capability in Mind for Better Reasoning | Shangziqi Zhao et.al. | 2505.14582 | null | Kimi |
82 | 2025-05-20 | Reasoning Models Better Express Their Confidence | Dongkeun Yoon et.al. | 2505.14489 | link | Kimi |
83 | 2025-05-20 | Log-Augmented Generation: Scaling Test-Time Reasoning with Reusable Computation | Peter Baile Chen et.al. | 2505.14398 | null | Kimi |
84 | 2025-05-20 | Scaling and Enhancing LLM-based AVSR: A Sparse Mixture of Projectors Approach | Umberto Cappellazzo et.al. | 2505.14336 | null | Kimi |
85 | 2025-05-20 | Speculative Decoding Reimagined for Multimodal Large Language Models | Luxi Lin et.al. | 2505.14260 | null | Kimi |
86 | 2025-05-20 | FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation | Shaolin Zhu et.al. | 2505.14256 | null | Kimi |
87 | 2025-05-20 | Tokenization Constraints in LLMs: A Study of Symbolic and Arithmetic Reasoning Limits | Xiang Zhang et.al. | 2505.14178 | null | Kimi |
88 | 2025-05-20 | RL of Thoughts: Navigating LLM Reasoning with Inference-time Reinforcement Learning | Qianyue Hao et.al. | 2505.14140 | null | Kimi |
89 | 2025-05-20 | DiagnosisArena: Benchmarking Diagnostic Reasoning for Large Language Models | Yakun Zhu et.al. | 2505.14107 | link | Kimi |
90 | 2025-05-20 | Toward Effective Reinforcement Learning Fine-Tuning for Medical VQA in Vision-Language Models | Wenhui Zhu et.al. | 2505.13973 | null | Kimi |
91 | 2025-05-20 | FlashThink: An Early Exit Method For Efficient Reasoning | Guochao Jiang et.al. | 2505.13949 | null | Kimi |
92 | 2025-05-20 | EEG-to-Text Translation: A Model for Deciphering Human Brain Activity | Saydul Akbar Murad et.al. | 2505.13936 | link | Kimi |
93 | 2025-05-20 | Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning | Jiwon Song et.al. | 2505.13866 | null | Kimi |
94 | 2025-05-20 | EfficientLLM: Efficiency in Large Language Models | Zhengqing Yuan et.al. | 2505.13840 | null | Kimi |
95 | 2025-05-20 | Structured Agent Distillation for Large Language Model | Jun Liu et.al. | 2505.13820 | null | Kimi |
96 | 2025-05-19 | Ice Cream Doesn’t Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Inference | Jin Du et.al. | 2505.13770 | null | Kimi |
97 | 2025-05-19 | Causal Head Gating: A Framework for Interpreting Roles of Attention Heads in Transformers | Andrew Nam et.al. | 2505.13737 | null | Kimi |
98 | 2025-05-19 | RL in Name Only? Analyzing the Structural Assumptions in RL post-training for LLMs | Soumya Rani Samineni et.al. | 2505.13697 | null | Kimi |
99 | 2025-05-19 | Optimizing Anytime Reasoning via Budget Relative Policy Optimization | Penghui Qi et.al. | 2505.13438 | link | Kimi |
100 | 2025-05-19 | CoT-Kinetics: A Theoretical Modeling Assessing LRM Reasoning Process | Jinhe Bi et.al. | 2505.13408 | null | Kimi |
101 | 2025-05-19 | Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training and Inference | Shuqing Luo et.al. | 2505.13345 | link | Kimi |
102 | 2025-05-19 | Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space | Hengli Li et.al. | 2505.13308 | null | Kimi |
103 | 2025-05-19 | RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning | Qiguang Chen et.al. | 2505.13307 | link | Kimi |
104 | 2025-05-19 | Effective and Transparent RAG: Adaptive-Reward Reinforcement Learning for Decision Traceability | Jingyi Ren et.al. | 2505.13258 | null | Kimi |
105 | 2025-05-19 | HeteroSpec: Leveraging Contextual Heterogeneity for Efficient Speculative Decoding | Siran Liu et.al. | 2505.13254 | null | Kimi |
106 | 2025-05-19 | Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification | Jikai Wang et.al. | 2505.13204 | null | Kimi |
107 | 2025-05-19 | Adversarial Testing in LLMs: Insights into Decision-Making Vulnerabilities | Lili Zhang et.al. | 2505.13195 | null | Kimi |
108 | 2025-05-19 | ModernGBERT: German-only 1B Encoder Model Trained from Scratch | Anton Ehrmanntraut et.al. | 2505.13136 | null | Kimi |
109 | 2025-05-19 | Benchmarking and Confidence Evaluation of LALMs For Temporal Reasoning | Debarpan Bhattacharya et.al. | 2505.13115 | null | Kimi |
110 | 2025-05-19 | FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference | Guangda Liu et.al. | 2505.13109 | null | Kimi |
111 | 2025-05-19 | Walking the Tightrope: Disentangling Beneficial and Detrimental Drifts in Non-Stationary Custom-Tuning | Xiaoyu Yang et.al. | 2505.13081 | null | Kimi |
112 | 2025-05-19 | MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO | Yicheng Xiao et.al. | 2505.13031 | link | Kimi |
113 | 2025-05-19 | Fractured Chain-of-Thought Reasoning | Baohao Liao et.al. | 2505.12992 | null | Kimi |
114 | 2025-05-19 | A3 : an Analytical Low-Rank Approximation Framework for Attention | Jeffrey T. H. Wong et.al. | 2505.12942 | null | Kimi |
115 | 2025-05-19 | Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs | Zhihe Yang et.al. | 2505.12929 | link | Kimi |
116 | 2025-05-19 | The Traitors: Deception and Trust in Multi-Agent Language Model Simulations | Pedro M. P. Curvo et.al. | 2505.12923 | null | Kimi |
117 | 2025-05-19 | LEXam: Benchmarking Legal Reasoning on 340 Law Exams | Yu Fan et.al. | 2505.12864 | null | Kimi |
118 | 2025-05-19 | Reasoning BO: Enhancing Bayesian Optimization with Long-Context Reasoning Power of LLMs | Zhuo Yang et.al. | 2505.12833 | null | Kimi |
119 | 2025-05-19 | SynDec: A Synthesize-then-Decode Approach for Arbitrary Textual Style Transfer via Large Language Models | Han Sun et.al. | 2505.12821 | null | Kimi |
120 | 2025-05-19 | Accelerating Adaptive Retrieval Augmented Generation via Instruction-Driven Representation Reduction of Retrieval Overlaps | Jie Ou et.al. | 2505.12731 | null | Kimi |
121 | 2025-05-19 | FLASH: Latent-Aware Semi-Autoregressive Speculative Decoding for Multimodal Tasks | Zihua Wang et.al. | 2505.12728 | null | Kimi |
122 | 2025-05-19 | ToTRL: Unlock LLM Tree-of-Thoughts Reasoning Potential through Puzzles Solving | Haoyuan Wu et.al. | 2505.12717 | null | Kimi |
123 | 2025-05-19 | Shadow-FT: Tuning Instruct via Base | Taiqiang Wu et.al. | 2505.12716 | link | Kimi |
124 | 2025-05-19 | Ineq-Comp: Benchmarking Human-Intuitive Compositional Reasoning in Automated Theorem Proving on Inequalities | Haoyu Zhao et.al. | 2505.12680 | link | Kimi |
125 | 2025-05-19 | HydraInfer: Hybrid Disaggregated Scheduling for Multimodal Large Language Model Serving | Xianzhe Dong et.al. | 2505.12658 | null | Kimi |
126 | 2025-05-19 | Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents | Yunseok Jang et.al. | 2505.12632 | null | Kimi |
127 | 2025-05-19 | Enhancing Latent Computation in Transformers with Latent Tokens | Yuchang Sun et.al. | 2505.12629 | null | Kimi |
128 | 2025-05-18 | A Survey of Attacks on Large Language Models | Wenrui Xu et.al. | 2505.12567 | null | Kimi |
129 | 2025-05-15 | 3D-Fixup: Advancing Photo Editing with 3D Priors | Yen-Chi Cheng et.al. | 2505.10566 | null | Kimi |
130 | 2025-05-15 | End-to-End Vision Tokenizer Tuning | Wenxuan Wang et.al. | 2505.10562 | null | Kimi |
131 | 2025-05-15 | Neural Thermodynamic Laws for Large Language Model Training | Ziming Liu et.al. | 2505.10559 | null | Kimi |
132 | 2025-05-15 | MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning | Ke Wang et.al. | 2505.10557 | link | Kimi |
133 | 2025-05-15 | Beyond ‘Aha!’: Toward Systematic Meta-Abilities Alignment in Large Reasoning Models | Zhiyuan Hu et.al. | 2505.10554 | link | Kimi |
134 | 2025-05-15 | Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data | Yiwen Liu et.al. | 2505.10551 | link | Kimi |
135 | 2025-05-15 | Real-Time Out-of-Distribution Failure Prevention via Multi-Modal Reasoning | Milan Ganai et.al. | 2505.10547 | null | Kimi |
136 | 2025-05-15 | Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models | Annie Wong et.al. | 2505.10543 | link | Kimi |
137 | 2025-05-15 | Exploring Implicit Visual Misunderstandings in Multimodal Large Language Models through Attention Analysis | Pengfei Wang et.al. | 2505.10541 | link | Kimi |
138 | 2025-05-15 | Enhancing Multi-Image Question Answering via Submodular Subset Selection | Aaryan Sharma et.al. | 2505.10533 | null | Kimi |
139 | 2025-05-15 | MASSV: Multimodal Adaptation and Self-Data Distillation for Speculative Decoding of Vision-Language Models | Mugilan Ganesan et.al. | 2505.10526 | null | Kimi |
140 | 2025-05-15 | Knowledge capture, adaptation and composition (KCAC): A framework for cross-task curriculum learning in robotic manipulation | Xinrui Wang et.al. | 2505.10522 | null | Kimi |
141 | 2025-05-15 | Multi-Token Prediction Needs Registers | Anastasios Gerontopoulos et.al. | 2505.10518 | link | Kimi |
142 | 2025-05-15 | The Devil Is in the Word Alignment Details: On Translation-Based Cross-Lingual Transfer for Token Classification Tasks | Benedikt Ebing et.al. | 2505.10507 | null | Kimi |
143 | 2025-05-15 | RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs | Vibha Belavadi et.al. | 2505.10495 | null | Kimi |
144 | 2025-05-15 | Can You Really Trust Code Copilots? Evaluating Large Language Models from a Code Security Perspective | Yutao Mou et.al. | 2505.10494 | link | Kimi |
145 | 2025-05-15 | CL-RAG: Bridging the Gap in Retrieval-Augmented Generation with Curriculum Learning | Shaohan Wang et.al. | 2505.10493 | null | Kimi |
146 | 2025-05-15 | UniEval: Unified Holistic Evaluation for Unified Multimodal Understanding and Generation | Yi Li et.al. | 2505.10483 | null | Kimi |
147 | 2025-05-15 | Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps | Ningyuan Yang et.al. | 2505.10482 | null | Kimi |
148 | 2025-05-15 | Parallel Scaling Law for Language Models | Mouxiang Chen et.al. | 2505.10475 | link | Kimi |
149 | 2025-05-15 | AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenge | Ranjan Sapkota et.al. | 2505.10468 | null | Kimi |
150 | 2025-05-15 | Superposition Yields Robust Neural Scaling | Yizhou liu et.al. | 2505.10465 | link | Kimi |
151 | 2025-05-15 | Vision language models have difficulty recognizing virtual objects | Tyler Tran et.al. | 2505.10453 | null | Kimi |
152 | 2025-05-15 | Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models | Zemin Huang et.al. | 2505.10446 | null | Kimi |
153 | 2025-05-15 | Are Large Language Models Robust in Understanding Code Against Semantics-Preserving Mutations? | Pedro Orvalho et.al. | 2505.10443 | null | Kimi |
154 | 2025-05-15 | Hierarchical Document Refinement for Long-context Retrieval-augmented Generation | Jiajie Jin et.al. | 2505.10413 | link | Kimi |
155 | 2025-05-15 | Are LLM-generated plain language summaries truly understandable? A large-scale crowdsourced evaluation | Yue Guo et.al. | 2505.10409 | null | Kimi |
156 | 2025-05-15 | Visual Fidelity Index for Generative Semantic Communications with Critical Information Embedding | Jianhao Huang et.al. | 2505.10405 | null | Kimi |
157 | 2025-05-15 | Rethinking Repetition Problems of LLMs in Code Generation | Yihong Dong et.al. | 2505.10402 | link | Kimi |
158 | 2025-05-15 | Evaluating Model Explanations without Ground Truth | Kaivalya Rawal et.al. | 2505.10399 | link | Kimi |
159 | 2025-05-15 | J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning | Chenxi Whitehouse et.al. | 2505.10320 | null | Kimi |
160 | 2025-05-15 | StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation | Daniel A. P. Oliveira et.al. | 2505.10292 | link | Kimi |
161 | 2025-05-15 | The Evolving Landscape of Generative Large Language Models and Traditional Natural Language Processing in Medicine | Rui Yang et.al. | 2505.10261 | null | Kimi |
162 | 2025-05-15 | Comparing LLM Text Annotation Skills: A Study on Human Rights Violations in Social Media Data | Poli Apollinaire Nemkova et.al. | 2505.10260 | link | Kimi |
163 | 2025-05-15 | On the Interplay of Human-AI Alignment,Fairness, and Performance Trade-offs in Medical Imaging | Haozhe Luo et.al. | 2505.10231 | link | Kimi |
164 | 2025-05-15 | ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention | Jintian Shao et.al. | 2505.10222 | null | Kimi |
165 | 2025-05-15 | The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think | Seongyun Lee et.al. | 2505.10185 | null | Kimi |
166 | 2025-05-15 | GE-Chat: A Graph Enhanced RAG Framework for Evidential Response Generation of LLMs | Longchao Da et.al. | 2505.10143 | null | Kimi |
167 | 2025-05-15 | From Text to Network: Constructing a Knowledge Graph of Taiwan-Based China Studies Using Generative AI | Hsuan-Lei Shao et.al. | 2505.10093 | null | Kimi |
168 | 2025-05-15 | CAFE: Retrieval Head-based Coarse-to-Fine Information Seeking to Enhance Multi-Document QA Capability | Han Peng et.al. | 2505.10063 | null | Kimi |
169 | 2025-05-15 | PsOCR: Benchmarking Large Multimodal Models for Optical Character Recognition in Low-resource Pashto Language | Ijazul Haq et.al. | 2505.10055 | null | Kimi |
170 | 2025-05-15 | ServeGen: Workload Characterization and Generation of Large Language Model Serving in Production | Yuxing Xiang et.al. | 2505.09999 | null | Kimi |
171 | 2025-05-15 | Analysing Safety Risks in LLMs Fine-Tuned with Pseudo-Malicious Cyber Security Data | Adel ElZemity et.al. | 2505.09974 | null | Kimi |
172 | 2025-05-15 | Pre-Act: Multi-Step Planning and Reasoning Improves Acting in LLM Agents | Mrinal Rawat et.al. | 2505.09970 | null | Kimi |
173 | 2025-05-15 | Personalizing Large Language Models using Retrieval Augmented Generation and Knowledge Graph | Deeksha Prahlad et.al. | 2505.09945 | link | Kimi |
174 | 2025-05-15 | Comparing Exploration-Exploitation Strategies of LLMs and Humans: Insights from Standard Multi-armed Bandit Tasks | Ziyuan Zhang et.al. | 2505.09901 | link | Kimi |
175 | 2025-05-14 | Do Large Language Models Know Conflict? Investigating Parametric vs. Non-Parametric Knowledge of LLMs for Conflict Forecasting | Apollinaire Poli Nemkova et.al. | 2505.09852 | null | Kimi |
176 | 2025-05-14 | Contextual Phenotyping of Pediatric Sepsis Cohort Using Large Language Models | Aditya Nagori et.al. | 2505.09805 | null | Kimi |
177 | 2025-05-14 | Trustless Autonomy: Understanding Motivations, Benefits and Governance Dilemma in Self-Sovereign Decentralized AI Agents | Botao Amber Hu et.al. | 2505.09757 | null | Kimi |
178 | 2025-05-14 | System Prompt Optimization with Meta-Learning | Yumin Choi et.al. | 2505.09666 | null | Kimi |
179 | 2025-05-14 | Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists? | Anthony GX-Chen et.al. | 2505.09614 | null | Kimi |
180 | 2025-05-14 | Customizing a Large Language Model for VHDL Design of High-Performance Microprocessors | Nicolas Dupuis et.al. | 2505.09610 | null | Kimi |
181 | 2025-05-14 | WorldView-Bench: A Benchmark for Evaluating Global Cultural Perspectives in Large Language Models | Abdullah Mushtaq et.al. | 2505.09595 | null | Kimi |
182 | 2025-05-14 | PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt Tuning | Zongqian Li et.al. | 2505.09519 | link | Kimi |
183 | 2025-05-14 | CXMArena: Unified Dataset to benchmark performance in realistic CXM Scenarios | Raghav Garg et.al. | 2505.09436 | link | Kimi |
184 | 2025-05-14 | Endo-CLIP: Progressive Self-Supervised Pre-training on Raw Colonoscopy Records | Yili He et.al. | 2505.09435 | null | Kimi |
185 | 2025-05-14 | Multilingual Machine Translation with Quantum Encoder Decoder Attention-based Convolutional Variational Circuits | Subrit Dikshit et.al. | 2505.09407 | null | Kimi |
186 | 2025-05-14 | The Influence of Human-inspired Agentic Sophistication in LLM-driven Strategic Reasoners | Vince Trencsenyi et.al. | 2505.09396 | null | Kimi |
187 | 2025-05-14 | Qwen3 Technical Report | An Yang et.al. | 2505.09388 | link | Kimi |
188 | 2025-05-14 | Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures | Chenggang Zhao et.al. | 2505.09343 | null | Kimi |
189 | 2025-05-14 | Llama See, Llama Do: A Mechanistic Perspective on Contextual Entrainment and Distraction in LLMs | Jingcheng Niu et.al. | 2505.09338 | null | Kimi |
190 | 2025-05-14 | Scent of Knowledge: Optimizing Search-Enhanced Reasoning with Information Foraging | Hongjin Qian et.al. | 2505.09316 | null | Kimi |
191 | 2025-05-14 | Reproducibility Study of “Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents” | Pedro M. P. Curvo et.al. | 2505.09289 | link | Kimi |
192 | 2025-05-14 | Learning to Detect Multi-class Anomalies with Just One Normal Image Prompt | Bin-Bin Gao et.al. | 2505.09264 | link | Kimi |
193 | 2025-05-14 | ELIS: Efficient LLM Iterative Scheduling System with Response Length Predictor | Seungbeom Choi et.al. | 2505.09142 | null | Kimi |
194 | 2025-05-14 | CEC-Zero: Chinese Error Correction Solution Based on LLM | Sophie Zhang et.al. | 2505.09082 | null | Kimi |
195 | 2025-05-14 | A Comprehensive Analysis of Large Language Model Outputs: Similarity, Diversity, and Bias | Brandon Smith et.al. | 2505.09056 | null | Kimi |
196 | 2025-05-13 | Improving the Reliability of LLMs: Combining CoT, RAG, Self-Consistency, and Self-Verification | Adarsh Kumar et.al. | 2505.09031 | null | Kimi |
197 | 2025-05-13 | Automated Meta Prompt Engineering for Alignment with the Theory of Mind | Aaron Baughman et.al. | 2505.09024 | null | Kimi |
198 | 2025-05-13 | Prioritizing Image-Related Tokens Enhances Vision-Language Pre-Training | Yangyi Chen et.al. | 2505.08971 | link | Kimi |
199 | 2025-05-13 | Toward Cost-Efficient Serving of Mixture-of-Experts with Asynchrony | Shaoyu Wang et.al. | 2505.08944 | null | Kimi |
200 | 2025-05-13 | Performance Gains of LLMs With Humans in a World of LLMs Versus Humans | Lucas McCullum et.al. | 2505.08902 | null | Kimi |
201 | 2025-05-13 | Generative AI for Autonomous Driving: Frontiers and Opportunities | Yuping Wang et.al. | 2505.08854 | link | Kimi |
202 | 2025-05-13 | CodePDE: An Inference Framework for LLM-driven PDE Solver Generation | Shanda Li et.al. | 2505.08783 | link | Kimi |
203 | 2025-05-14 | Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology | Yatai Ji et.al. | 2505.08765 | null | Kimi |
204 | 2025-05-13 | DeepMath-Creative: A Benchmark for Evaluating Mathematical Creativity of Large Language Models | Xiaoyang Chen et.al. | 2505.08744 | link | Kimi |
205 | 2025-05-13 | Probability Consistency in Large Language Models: Theoretical Foundations Meet Empirical Discrepancies | Xiaoliang Luo et.al. | 2505.08739 | link | Kimi |
206 | 2025-05-13 | NurValues: Real-World Nursing Values Evaluation for Large Language Models in Clinical Context | Ben Yao et.al. | 2505.08734 | null | Kimi |
207 | 2025-05-13 | PWC-MoE: Privacy-Aware Wireless Collaborative Mixture of Experts | Yang Su et.al. | 2505.08719 | null | Kimi |
208 | 2025-05-13 | LLM-based Prompt Ensemble for Reliable Medical Entity Recognition from EHRs | K M Sajjadul Islam et.al. | 2505.08704 | null | Kimi |
209 | 2025-05-13 | TRAIL: Trace Reasoning and Agentic Issue Localization | Darshan Deshpande et.al. | 2505.08638 | null | Kimi |
210 | 2025-05-13 | Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models | Donghoon Kim et.al. | 2505.08622 | null | Kimi |
211 | 2025-05-13 | Automatic Task Detection and Heterogeneous LLM Speculative Decoding | Danying Ge et.al. | 2505.08600 | null | Kimi |
212 | 2025-05-13 | Small but Significant: On the Promise of Small Language Models for Accessible AIED | Yumou Wei et.al. | 2505.08588 | null | Kimi |
213 | 2025-05-13 | The Truth Becomes Clearer Through Debate! Multi-Agent Systems with Large Language Models Unmask Fake News | Yuhan Liu et.al. | 2505.08532 | null | Kimi |
214 | 2025-05-13 | LCES: Zero-shot Automated Essay Scoring via Pairwise Comparisons Using Large Language Models | Takumi Shibata et.al. | 2505.08498 | null | Kimi |
215 | 2025-05-13 | RepCali: High Efficient Fine-tuning Via Representation Calibration in Latent Space for Pre-trained Language Models | Fujun Zhang et.al. | 2505.08463 | null | Kimi |
216 | 2025-05-13 | Accelerating Chain-of-Thought Reasoning: When Goal-Gradient Importance Meets Dynamic Skipping | Ren Zhuang et.al. | 2505.08392 | null | Kimi |
217 | 2025-05-13 | Benchmarking AI scientists in omics data-driven biological research | Erpai Luo et.al. | 2505.08341 | link | Kimi |
218 | 2025-05-13 | AM-Thinking-v1: Advancing the Frontier of Reasoning at 32B Scale | Yunjie Ji et.al. | 2505.08311 | null | Kimi |
219 | 2025-05-13 | Evaluating the Effectiveness of Black-Box Prompt Optimization as the Scale of LLMs Continues to Grow | Ziyu Zhou et.al. | 2505.08303 | null | Kimi |
220 | 2025-05-13 | Enhancing Cache-Augmented Generation (CAG) with Adaptive Contextual Compression for Scalable Knowledge Integration | Rishabh Agrawal et.al. | 2505.08261 | null | Kimi |
221 | 2025-05-13 | Evaluating LLM Metrics Through Real-World Capabilities | Justin K Miller et.al. | 2505.08253 | null | Kimi |
222 | 2025-05-13 | Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement | Haoran Ye et.al. | 2505.08245 | link | Kimi |
223 | 2025-05-13 | A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM Outputs | Artem Shelmanov et.al. | 2505.08200 | null | Kimi |
224 | 2025-05-13 | Fusing Bidirectional Chains of Thought and Reward Mechanisms A Method for Enhancing Question-Answering Capabilities of Large Language Models for Chinese Intangible Cultural Heritage | Ruilin Liu et.al. | 2505.08167 | null | Kimi |
225 | 2025-05-13 | Decoding Neighborhood Environments with Large Language Models | Andrew Cart et.al. | 2505.08163 | null | Kimi |
226 | 2025-05-13 | Lost in Transmission: When and Why LLMs Fail to Reason Globally | Tobias Schnabel et.al. | 2505.08140 | null | Kimi |
227 | 2025-05-13 | ALOHA: Empowering Multilingual Agent for University Orientation with Hierarchical Retrieval | Mingxu Tao et.al. | 2505.08130 | null | Kimi |
228 | 2025-05-12 | Are LLMs complicated ethical dilemma analyzers? | Jiashen et.al. | 2505.08106 | link | Kimi |
229 | 2025-05-12 | Beyond Input Activations: Identifying Influential Latents by Gradient Sparse Autoencoders | Dong Shu et.al. | 2505.08080 | null | Kimi |
230 | 2025-05-12 | FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning | Zhehao Zhang et.al. | 2505.08054 | null | Kimi |
231 | 2025-05-12 | Learning from Peers in Reasoning Models | Tongxu Luo et.al. | 2505.07787 | null | Kimi |
232 | 2025-05-12 | S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models | Muzhi Dai et.al. | 2505.07686 | null | Kimi |
233 | 2025-05-12 | SpecRouter: Adaptive Routing for Multi-Level Speculative Decoding in Large Language Models | Hang Wu et.al. | 2505.07680 | null | Kimi |
234 | 2025-05-13 | OnPrem.LLM: A Privacy-Conscious Document Intelligence Toolkit | Arun S. Maiya et.al. | 2505.07672 | link | Kimi |
235 | 2025-05-12 | Benchmarking Retrieval-Augmented Generation for Chemistry | Xianrui Zhong et.al. | 2505.07671 | null | Kimi |
236 | 2025-05-12 | Concept-Level Explainability for Auditing & Steering LLM Responses | Kenza Amara et.al. | 2505.07610 | link | Kimi |
237 | 2025-05-12 | MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining | Xiaomi LLM-Core Team et.al. | 2505.07608 | link | Kimi |
238 | 2025-05-12 | Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent | Ziyang Huang et.al. | 2505.07596 | null | Kimi |
239 | 2025-05-12 | A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models | Junjie Ye et.al. | 2505.07591 | link | Kimi |
240 | 2025-05-12 | ToolACE-DEV: Self-Improving Tool Learning via Decomposition and EVolution | Xu Huang et.al. | 2505.07512 | null | Kimi |
241 | 2025-05-12 | A Survey on Collaborative Mechanisms Between Large and Small Language Models | Yi Chen et.al. | 2505.07460 | null | Kimi |
242 | 2025-05-12 | How well do LLMs reason over tabular data, really? | Cornelius Wolff et.al. | 2505.07453 | null | Kimi |
243 | 2025-05-12 | Synthetic Code Surgery: Repairing Bugs and Vulnerabilities with LLMs and Synthetic Data | David de-Fitero-Dominguez et.al. | 2505.07372 | null | Kimi |
244 | 2025-05-12 | QUPID: Quantified Understanding for Enhanced Performance, Insights, and Decisions in Korean Search Engines | Ohjoon Kwon et.al. | 2505.07345 | null | Kimi |
245 | 2025-05-12 | Generative Pre-trained Autoregressive Diffusion Transformer | Yuan Zhang et.al. | 2505.07344 | null | Kimi |
246 | 2025-05-12 | Towards Multi-Agent Reasoning Systems for Collaborative Expertise Delegation: An Exploratory Design Study | Baixuan Xu et.al. | 2505.07313 | null | Kimi |
247 | 2025-05-12 | Semantic Retention and Extreme Compression in LLMs: Can We Have Both? | Stanislas Laborde et.al. | 2505.07289 | null | Kimi |
248 | 2025-05-12 | UMoE: Unifying Attention and FFN with Shared Experts | Yuanhang Yang et.al. | 2505.07260 | null | Kimi |
249 | 2025-05-12 | SAS-Bench: A Fine-Grained Benchmark for Evaluating Short Answer Scoring with Large Language Models | Peichao Lai et.al. | 2505.07247 | link | Kimi |
250 | 2025-05-12 | Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity | Guang Yan et.al. | 2505.07239 | null | Kimi |
251 | 2025-05-12 | DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation | Jiashuo Sun et.al. | 2505.07233 | link | Kimi |
252 | 2025-05-12 | Measuring General Intelligence with Generated Games | Vivek Verma et.al. | 2505.07215 | link | Kimi |
253 | 2025-05-12 | Benchmarking Ethical and Safety Risks of Healthcare LLMs in China-Toward Systemic Governance under Healthy China 2030 | Mouxiao Bian et.al. | 2505.07205 | null | Kimi |
254 | 2025-05-12 | PrefillOnly: An Inference Engine for Prefill-only Workloads in Large Language Model Applications | Kuntai Du et.al. | 2505.07203 | null | Kimi |
255 | 2025-05-12 | One Trigger Token Is Enough: A Defense Strategy for Balancing Safety and Usability in Large Language Models | Haoran Gu et.al. | 2505.07167 | null | Kimi |
256 | 2025-05-12 | Pre-training vs. Fine-tuning: A Reproducibility Study on Dense Retrieval Knowledge Acquisition | Zheng Yao et.al. | 2505.07166 | link | Kimi |
257 | 2025-05-11 | RefPentester: A Knowledge-Informed Self-Reflective Penetration Testing Framework Based on Large Language Models | Hanzheng Dai et.al. | 2505.07089 | null | Kimi |
258 | 2025-05-11 | Architectural Precedents for General Agents using Large Language Models | Robert E. Wray et.al. | 2505.07087 | null | Kimi |
259 | 2025-05-11 | DialogueReason: Rule-Based RL Sparks Dialogue Reasoning in LLMs | Yubo Shu et.al. | 2505.07049 | null | Kimi |
260 | 2025-05-11 | LLM-Augmented Chemical Synthesis and Design Decision Programs | Haorui Wang et.al. | 2505.07027 | null | Kimi |
261 | 2025-05-08 | DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion | Qitao Zhao et.al. | 2505.05473 | null | Kimi |
262 | 2025-05-08 | Flow-GRPO: Training Flow Matching Models via Online RL | Jie Liu et.al. | 2505.05470 | link | Kimi |
263 | 2025-05-08 | Generating Physically Stable and Buildable LEGO Designs from Text | Ava Pun et.al. | 2505.05469 | link | Kimi |
264 | 2025-05-08 | StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant | Haibo Wang et.al. | 2505.05467 | null | Kimi |
265 | 2025-05-08 | ComPO: Preference Alignment via Comparison Oracles | Peter Chen et.al. | 2505.05465 | null | Kimi |
266 | 2025-05-08 | Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging | Shiqi Chen et.al. | 2505.05464 | link | Kimi |
267 | 2025-05-08 | UKElectionNarratives: A Dataset of Misleading Narratives Surrounding Recent UK General Elections | Fatima Haouari et.al. | 2505.05459 | null | Kimi |
268 | 2025-05-08 | SITE: towards Spatial Intelligence Thorough Evaluation | Wenqi Wang et.al. | 2505.05456 | null | Kimi |
269 | 2025-05-08 | Conversational Process Model Redesign | Nataliia Klievtsova et.al. | 2505.05453 | null | Kimi |
270 | 2025-05-08 | Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding | Han Xiao et.al. | 2505.05446 | link | Kimi |
271 | 2025-05-08 | clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations | Chalamalasetti Kranti et.al. | 2505.05445 | null | Kimi |
272 | 2025-05-08 | EcoAgent: An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation | Biao Yi et.al. | 2505.05440 | null | Kimi |
273 | 2025-05-08 | Empowering Scientific Workflows with Federated Agents | J. Gregory Pauloski et.al. | 2505.05428 | link | Kimi |
274 | 2025-05-08 | Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data | Yudong Wang et.al. | 2505.05427 | null | Kimi |
275 | 2025-05-08 | TransProQA: an LLM-based literary Translation evaluation metric with Professional Question Answering | Ran Zhang et.al. | 2505.05423 | link | Kimi |
276 | 2025-05-08 | TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation | Haokun Lin et.al. | 2505.05422 | link | Kimi |
277 | 2025-05-08 | Reasoning Models Don’t Always Say What They Think | Yanda Chen et.al. | 2505.05410 | null | Kimi |
278 | 2025-05-08 | Crosslingual Reasoning through Test-Time Scaling | Zheng-Xin Yong et.al. | 2505.05408 | link | Kimi |
279 | 2025-05-08 | Frame In, Frame Out: Do LLMs Generate More Biased News Headlines than Humans? | Valeria Pastorino et.al. | 2505.05406 | null | Kimi |
280 | 2025-05-08 | CART-ELC: Oblique Decision Tree Induction via Exhaustive Search | Andrew D. Laack et.al. | 2505.05402 | link | Kimi |
281 | 2025-05-08 | PillarMamba: Learning Local-Global Context for Roadside Point Cloud via Hybrid State Space Model | Zhang Zhang et.al. | 2505.05397 | null | Kimi |
282 | 2025-05-08 | EDmamba: A Simple yet Effective Event Denoising Method with State Space Model | Ciyu Ruan et.al. | 2505.05391 | null | Kimi |
283 | 2025-05-08 | Walrus: An Efficient Decentralized Storage Network | George Danezis et.al. | 2505.05370 | null | Kimi |
284 | 2025-05-08 | High-fidelity Grain Growth Modeling: Leveraging Deep Learning for Fast Computations | Pungponhavoan Tep et.al. | 2505.05354 | null | Kimi |
285 | 2025-05-08 | Hearing and Seeing Through CLIP: A Framework for Self-Supervised Sound Source Localization | Sooyoung Park et.al. | 2505.05343 | link | Kimi |
286 | 2025-05-08 | Progressive Inertial Poser: Progressive Real-Time Kinematic Chain Estimation for 3D Full-Body Pose from Three IMU Sensors | Zunjie Zhu et.al. | 2505.05336 | null | Kimi |
287 | 2025-05-08 | ICon: In-Context Contribution for Automatic Data Selection | Yixin Yang et.al. | 2505.05327 | null | Kimi |
288 | 2025-05-08 | Scalable Chain of Thoughts via Elastic Reasoning | Yuhui Xu et.al. | 2505.05315 | null | Kimi |
289 | 2025-05-08 | T-T: Table Transformer for Tagging-based Aspect Sentiment Triplet Extraction | Kun Peng et.al. | 2505.05271 | null | Kimi |
290 | 2025-05-08 | Revealing Weaknesses in Text Watermarking Through Self-Information Rewrite Attacks | Yixin Cheng et.al. | 2505.05190 | link | Kimi |
291 | 2025-05-08 | Biomed-DPT: Dual Modality Prompt Tuning for Biomedical Vision-Language Models | Wei Peng et.al. | 2505.05189 | null | Kimi |
292 | 2025-05-08 | MARK: Memory Augmented Refinement of Knowledge | Anish Ganguli et.al. | 2505.05177 | null | Kimi |
293 | 2025-05-08 | X-Driver: Explainable Autonomous Driving with Vision-Language Models | Wei Liu et.al. | 2505.05098 | null | Kimi |
294 | 2025-05-08 | Latent Preference Coding: Aligning Large Language Models via Discrete Latent Codes | Zhuocheng Gong et.al. | 2505.04993 | null | Kimi |
295 | 2025-05-08 | Chain-of-Thought Tokens are Computer Program Variables | Fangwei Zhu et.al. | 2505.04955 | link | Kimi |
296 | 2025-05-08 | Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models | Yunxin Li et.al. | 2505.04921 | link | Kimi |
297 | 2025-05-08 | An Open-Source Dual-Loss Embedding Model for Semantic Retrieval in Higher Education | Ramteja Sajja et.al. | 2505.04916 | null | Kimi |
298 | 2025-05-08 | Enigme: Generative Text Puzzles for Evaluating Reasoning in Language Models | John Hawkins et.al. | 2505.04914 | link | Kimi |
299 | 2025-05-08 | SpatialPrompting: Keyframe-driven Zero-Shot Spatial Reasoning with Off-the-Shelf Multimodal Large Language Models | Shun Taguchi et.al. | 2505.04911 | null | Kimi |
300 | 2025-05-08 | ConCISE: Confidence-guided Compression in Step-by-step Efficient Reasoning | Ziqing Qiao et.al. | 2505.04881 | null | Kimi |
301 | 2025-05-08 | GroverGPT-2: Simulating Grover’s Algorithm via Chain-of-Thought Reasoning and Quantum-Native Tokenization | Min Chen et.al. | 2505.04880 | null | Kimi |
302 | 2025-05-07 | CRAFT: Cultural Russian-Oriented Dataset Adaptation for Focused Text-to-Image Generation | Viacheslav Vasilev et.al. | 2505.04851 | null | Kimi |
303 | 2025-05-07 | Benchmarking LLM Faithfulness in RAG with Evolving Leaderboards | Manveer Singh Tamber et.al. | 2505.04847 | null | Kimi |
304 | 2025-05-07 | Large Language Models are Autonomous Cyber Defenders | Sebastián R. Castro et.al. | 2505.04843 | link | Kimi |
305 | 2025-05-07 | ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling | Xiao Wang et.al. | 2505.04802 | null | Kimi |
306 | 2025-05-07 | The Promise and Limits of LLMs in Constructing Proofs and Hints for Logic Problems in Intelligent Tutoring Systems | Sutapa Dey Tithi et.al. | 2505.04736 | null | Kimi |
307 | 2025-05-07 | SOAEsV2-7B/72B: Full-Pipeline Optimization for State-Owned Enterprise LLMs via Continual Pre-Training, Domain-Progressive SFT and Distillation-Enhanced Speculative Decoding | Jingyang Deng et.al. | 2505.04723 | null | Kimi |
308 | 2025-05-07 | EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning | Zhenghao Xing et.al. | 2505.04623 | link | Kimi |
309 | 2025-05-07 | ZeroSearch: Incentivize the Search Capability of LLMs without Searching | Hao Sun et.al. | 2505.04588 | null | Kimi |
310 | 2025-05-07 | Overcoming Data Scarcity in Generative Language Modelling for Low-Resource Languages: A Systematic Review | Josh McGiff et.al. | 2505.04531 | null | Kimi |
311 | 2025-05-07 | Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs | Yehui Tang et.al. | 2505.04519 | null | Kimi |
312 | 2025-05-07 | CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation | Jiahao Li et.al. | 2505.04481 | null | Kimi |
313 | 2025-05-07 | OBLIVIATE: Robust and Practical Machine Unlearning for Large Language Models | Xiaoyu Xu et.al. | 2505.04416 | null | Kimi |
314 | 2025-05-07 | YABLoCo: Yet Another Benchmark for Long Context Code Generation | Aidar Valeev et.al. | 2505.04406 | null | Kimi |
315 | 2025-05-07 | The Aloe Family Recipe for Open and Specialized Healthcare LLMs | Dario Garcia-Gasulla et.al. | 2505.04388 | null | Kimi |
316 | 2025-05-07 | Benchmarking LLMs’ Swarm intelligence | Kai Ruan et.al. | 2505.04364 | link | Kimi |
317 | 2025-05-07 | GASCADE: Grouped Summarization of Adverse Drug Event for Enhanced Cancer Pharmacovigilance | Sofia Jamil et.al. | 2505.04284 | link | Kimi |
318 | 2025-05-07 | SToLa: Self-Adaptive Touch-Language Framework with Tactile Commonsense Reasoning in Open-Ended Scenarios | Ning Cheng et.al. | 2505.04201 | null | Kimi |
319 | 2025-05-07 | VideoPath-LLaVA: Pathology Diagnostic Reasoning Through Video Instruction Tuning | Trinh T. L. Vuong et.al. | 2505.04192 | link | Kimi |
320 | 2025-05-07 | S3D: Sketch-Driven 3D Model Generation | Hail Song et.al. | 2505.04185 | link | Kimi |
321 | 2025-05-07 | Large Language Models are often politically extreme, usually ideologically inconsistent, and persuasive even in informational contexts | Nouar Aldahoul et.al. | 2505.04171 | null | Kimi |
322 | 2025-05-07 | Unmasking the Canvas: A Dynamic Benchmark for Image Generation Jailbreaking and LLM Content Safety | Variath Madhupal Gautham Nair et.al. | 2505.04146 | null | Kimi |
323 | 2025-05-07 | Enhancing Granular Sentiment Classification with Chain-of-Thought Prompting in Large Language Models | Vihaan Miriyala et.al. | 2505.04135 | null | Kimi |
324 | 2025-05-07 | LLM-e Guess: Can LLMs Capabilities Advance Without Hardware Progress? | Teddy Foley et.al. | 2505.04075 | link | Kimi |
325 | 2025-05-07 | Advancing and Benchmarking Personalized Tool Invocation for LLMs | Xu Huang et.al. | 2505.04072 | link | Kimi |
326 | 2025-05-06 | Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving | Shan Yu et.al. | 2505.04021 | null | Kimi |
327 | 2025-05-06 | SLOT: Structuring the Output of Large Language Models | Darren Yow-Bang Wang et.al. | 2505.04016 | null | Kimi |
328 | 2025-05-06 | Can Large Language Models Predict Parallel Code Performance? | Gregory Bolet et.al. | 2505.03988 | null | Kimi |
329 | 2025-05-06 | X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains | Qianchu Liu et.al. | 2505.03981 | null | Kimi |
330 | 2025-05-06 | The Power of Stories: Narrative Priming Shapes How LLM Agents Collaborate and Compete | Gerrit Großmann et.al. | 2505.03961 | link | Kimi |
331 | 2025-05-06 | Frog Soup: Zero-Shot, In-Context, and Sample-Efficient Frogger Agents | Xiang Li et.al. | 2505.03947 | link | Kimi |
332 | 2025-05-06 | MARCO: A Multi-Agent System for Optimizing HPC Code Generation Using Large Language Models | Asif Rahman et.al. | 2505.03906 | null | Kimi |
333 | 2025-05-06 | Novel Extraction of Discriminative Fine-Grained Feature to Improve Retinal Vessel Segmentation | Shuang Zeng et.al. | 2505.03896 | link | Kimi |
334 | 2025-05-06 | VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model | Zuwei Long et.al. | 2505.03739 | link | Kimi |
335 | 2025-05-06 | WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch | Zimu Lu et.al. | 2505.03733 | link | Kimi |
336 | 2025-05-06 | Distribution-Conditional Generation: From Class Distribution to Creative Generation | Fu Feng et.al. | 2505.03667 | null | Kimi |
337 | 2025-05-06 | ReGraP-LLaVA: Reasoning enabled Graph-based Personalized Large Language and Vision Assistant | Yifan Xiang et.al. | 2505.03654 | null | Kimi |
338 | 2025-05-06 | A Hashgraph-Inspired Consensus Mechanism for Reliable Multi-Model Reasoning | Kolawole E. Ogunsina et.al. | 2505.03553 | null | Kimi |
339 | 2025-05-06 | Faster MoE LLM Inference for Extremely Large Models | Haoqi Yang et.al. | 2505.03531 | null | Kimi |
340 | 2025-05-06 | Long-Short Chain-of-Thought Mixture Supervised Fine-Tuning Eliciting Efficient Reasoning in Large Language Models | Bin Yu et.al. | 2505.03469 | link | Kimi |
341 | 2025-05-06 | The Steganographic Potentials of Language Models | Artem Karpov et.al. | 2505.03439 | null | Kimi |
342 | 2025-05-06 | Procedural Memory Is Not All You Need: Bridging Cognitive Gaps in LLM-Based Agents | Schaun Wheeler et.al. | 2505.03434 | null | Kimi |
343 | 2025-05-06 | MedArabiQ: Benchmarking Large Language Models on Arabic Medical Tasks | Mouath Abu Daoud et.al. | 2505.03427 | link | Kimi |
344 | 2025-05-06 | Lightweight Clinical Decision Support System using QLoRA-Fine-Tuned LLMs and Retrieval-Augmented Generation | Mohammad Shoaib Ansari et.al. | 2505.03406 | null | Kimi |
345 | 2025-05-06 | Absolute Zero: Reinforced Self-play Reasoning with Zero Data | Andrew Zhao et.al. | 2505.03335 | link | Kimi |
346 | 2025-05-06 | AI-Driven Scholarly Peer Review via Persistent Workflow Prompting, Meta-Prompting, and Meta-Reasoning | Evgeny Markhasin et.al. | 2505.03332 | null | Kimi |
347 | 2025-05-06 | Recall with Reasoning: Chain-of-Thought Distillation for Mamba’s Long-Context Memory and Extrapolation | Junyu Ma et.al. | 2505.03320 | null | Kimi |
348 | 2025-05-06 | SepALM: Audio Language Models Are Error Correctors for Robust Speech Separation | Zhaoxi Mu et.al. | 2505.03273 | null | Kimi |
349 | 2025-05-06 | RAVU: Retrieval Augmented Video Understanding with Compositional Reasoning over Graph | Sameer Malik et.al. | 2505.03173 | null | Kimi |
350 | 2025-05-06 | Assessing and Enhancing the Robustness of LLM-based Multi-Agent Systems Through Chaos Engineering | Joshua Owotogbe et.al. | 2505.03096 | null | Kimi |
351 | 2025-05-05 | Developing A Framework to Support Human Evaluation of Bias in Generated Free Response Text | Jennifer Healey et.al. | 2505.03053 | null | Kimi |
352 | 2025-05-05 | A Typology of Synthetic Datasets for Dialogue Processing in Clinical Contexts | Steven Bedrick et.al. | 2505.03025 | null | Kimi |
353 | 2025-05-05 | Memorization or Interpolation ? Detecting LLM Memorization through Input Perturbation Analysis | Albérick Euraste Djiré et.al. | 2505.03019 | null | Kimi |
354 | 2025-05-05 | RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale | Daniel Goldstein et.al. | 2505.03005 | link | Kimi |
355 | 2025-05-05 | Generating Narrated Lecture Videos from Slides with Synchronized Highlights | Alexander Holmberg et.al. | 2505.02966 | null | Kimi |
356 | 2025-05-05 | When Your Own Output Becomes Your Training Data: Noise-to-Meaning Loops and a Formal RSI Trigger | Rintaro Ando et.al. | 2505.02888 | link | Kimi |
357 | 2025-05-05 | AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation | Qingqiu Li et.al. | 2505.02830 | null | Kimi |
358 | 2025-05-05 | AutoLibra: Agent Metric Induction from Open-Ended Feedback | Hao Zhu et.al. | 2505.02820 | link | Kimi |
359 | 2025-05-05 | Knowing You Don’t Know: Learning When to Continue Search in Multi-round RAG through Self-Practicing | Diji Yang et.al. | 2505.02811 | link | Kimi |
360 | 2025-05-05 | HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning Framework for Large Language Models | Zheng Lin et.al. | 2505.02795 | null | Kimi |
361 | 2025-05-05 | Bye-bye, Bluebook? Automating Legal Procedure with Large Language Models | Matthew Dahl et.al. | 2505.02763 | null | Kimi |
362 | 2025-05-05 | Using Knowledge Graphs to harvest datasets for efficient CLIP model training | Simon Ging et.al. | 2505.02746 | link | Kimi |
363 | 2025-05-05 | FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models | Zhouliang Yu et.al. | 2505.02735 | link | Kimi |
364 | 2025-05-05 | Enhancing LLMs’ Clinical Reasoning with Real-World Data from a Nationwide Sepsis Registry | Junu Kim et.al. | 2505.02722 | link | Kimi |
365 | 2025-05-05 | Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play | Yemin Shi et.al. | 2505.02707 | link | Kimi |
366 | 2025-05-05 | Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models | Xiaobao Wu et.al. | 2505.02686 | link | Kimi |
367 | 2025-05-05 | A Survey of Slow Thinking-based Reasoning LLMs using Reinforced Learning and Inference-time Scaling Law | Qianjun Pan et.al. | 2505.02665 | null | Kimi |
368 | 2025-05-05 | Enhancing Chemical Reaction and Retrosynthesis Prediction with Large Language Model and Dual-task Learning | Xuan Lin et.al. | 2505.02639 | null | Kimi |
369 | 2025-05-05 | LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis | Qingkai Fang et.al. | 2505.02625 | link | Kimi |
370 | 2025-05-05 | EMORL: Ensemble Multi-Objective Reinforcement Learning for Efficient and Flexible LLM Fine-Tuning | Lingxiao Kong et.al. | 2505.02579 | link | Kimi |
371 | 2025-05-05 | Bielik v3 Small: Technical Report | Krzysztof Ociepa et.al. | 2505.02550 | null | Kimi |
372 | 2025-05-05 | Large Language Model Partitioning for Low-Latency Inference at the Edge | Dimitrios Kafetzis et.al. | 2505.02533 | null | Kimi |
373 | 2025-05-05 | Beyond the model: Key differentiators in large language models and multi-agent services | Muskaan Goyal et.al. | 2505.02489 | null | Kimi |
374 | 2025-05-05 | Incentivizing Inclusive Contributions in Model Sharing Markets | Enpei Zhang et.al. | 2505.02462 | null | Kimi |
375 | 2025-05-05 | Colombian Waitresses y Jueces canadienses: Gender and Country Biases in Occupation Recommendations from LLMs | Elisa Forcada Rodríguez et.al. | 2505.02456 | null | Kimi |
376 | 2025-05-05 | Bielik 11B v2 Technical Report | Krzysztof Ociepa et.al. | 2505.02410 | null | Kimi |
377 | 2025-05-05 | Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL | Jiarui Yao et.al. | 2505.02391 | link | Kimi |
378 | 2025-05-05 | RM-R1: Reward Modeling as Reasoning | Xiusi Chen et.al. | 2505.02387 | link | Kimi |
379 | 2025-05-05 | JTCSE: Joint Tensor-Modulus Constraints and Cross-Attention for Unsupervised Contrastive Learning of Sentence Embeddings | Tianyu Zong et.al. | 2505.02366 | link | Kimi |
380 | 2025-05-05 | Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression Techniques | Sanjay Surendranath Girija et.al. | 2505.02309 | null | Kimi |
381 | 2025-05-05 | Generative Sign-description Prompts with Multi-positive Contrastive Learning for Sign Language Recognition | Siyu Liang et.al. | 2505.02304 | null | Kimi |
382 | 2025-05-04 | Parameter-Efficient Transformer Embeddings | Henry Ndubuaku et.al. | 2505.02266 | link | Kimi |
383 | 2025-05-04 | SEval-Ex: A Statement-Level Framework for Explainable Summarization Evaluation | Tanguy Herserant et.al. | 2505.02235 | null | Kimi |
384 | 2025-05-04 | Interpretable Emergent Language Using Inter-Agent Transformers | Mannan Bhardwaj et.al. | 2505.02215 | link | Kimi |
385 | 2025-05-04 | Leveraging LLMs to Automate Energy-Aware Refactoring of Parallel Scientific Codes | Matthew T. Dearing et.al. | 2505.02184 | null | Kimi |
386 | 2025-05-04 | Measuring Hong Kong Massive Multi-Task Language Understanding | Chuxue Cao et.al. | 2505.02177 | null | Kimi |
387 | 2025-05-04 | A New HOPE: Domain-agnostic Automatic Evaluation of Text Chunking | Henrik Brådland et.al. | 2505.02171 | null | Kimi |
388 | 2025-05-04 | Think on your Feet: Adaptive Thinking via Reinforcement Learning for Social Agents | Minzheng Wang et.al. | 2505.02156 | link | Kimi |
389 | 2025-05-01 | T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT | Dongzhi Jiang et.al. | 2505.00703 | link | Kimi |
390 | 2025-05-01 | RayZer: A Self-supervised Large View Synthesis Model | Hanwen Jiang et.al. | 2505.00702 | null | Kimi |
391 | 2025-05-01 | Robotic Visual Instruction | Yanbang Li et.al. | 2505.00693 | null | Kimi |
392 | 2025-05-01 | Towards Autonomous Micromobility through Scalable Urban Simulation | Wayne Wu et.al. | 2505.00690 | null | Kimi |
393 | 2025-05-01 | GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution | Aditya Arora et.al. | 2505.00687 | null | Kimi |
394 | 2025-05-01 | Visual Test-time Scaling for GUI Agent Grounding | Tiange Luo et.al. | 2505.00684 | link | Kimi |
395 | 2025-05-01 | MINERVA: Evaluating Complex Video Reasoning | Arsha Nagrani et.al. | 2505.00681 | link | Kimi |
396 | 2025-05-01 | Steering Large Language Models with Register Analysis for Arbitrary Style Transfer | Xinchen Yang et.al. | 2505.00679 | null | Kimi |
397 | 2025-05-01 | Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions | Yiming Du et.al. | 2505.00675 | link | Kimi |
398 | 2025-05-01 | DeepCritic: Deliberate Critique with Large Language Models | Wenkai Yang et.al. | 2505.00662 | link | Kimi |
399 | 2025-05-01 | On the generalization of language models from in-context learning and finetuning: a controlled study | Andrew K. Lampinen et.al. | 2505.00661 | null | Kimi |
400 | 2025-05-01 | Large Language Models Understanding: an Inherent Ambiguity Barrier | Daniel N. Nissani et.al. | 2505.00654 | null | Kimi |
401 | 2025-05-01 | Open-Source LLM-Driven Federated Transformer for Predictive IoV Management | Yazan Otoum et.al. | 2505.00651 | null | Kimi |
402 | 2025-05-01 | OmicsCL: Unsupervised Contrastive Learning for Cancer Subtype Discovery and Survival Stratification | Atahan Karagoz et.al. | 2505.00650 | link | Kimi |
403 | 2025-05-01 | Investigating Task Arithmetic for Zero-Shot Information Retrieval | Marco Braga et.al. | 2505.00649 | link | Kimi |
404 | 2025-05-01 | Deep Learning Assisted Outer Volume Removal for Highly-Accelerated Real-Time Dynamic MRI | Merve Gülle et.al. | 2505.00643 | null | Kimi |
405 | 2025-05-01 | Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook | Muyi Bao et.al. | 2505.00630 | link | Kimi |
406 | 2025-05-01 | The Illusion of Role Separation: Hidden Shortcuts in LLM Role Learning (and How to Fix Them) | Zihao Wang et.al. | 2505.00626 | null | Kimi |
407 | 2025-05-01 | FineScope : Precision Pruning for Domain-Specialized Large Language Models Using SAE-Guided Self-Data Cultivation | Chaitali Bhattacharyya et.al. | 2505.00624 | null | Kimi |
408 | 2025-05-01 | Pixel3DMM: Versatile Screen-Space Priors for Single-Image 3D Face Reconstruction | Simon Giebenhain et.al. | 2505.00615 | null | Kimi |
409 | 2025-05-01 | Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation | D. Sculley et.al. | 2505.00612 | null | Kimi |
410 | 2025-05-01 | Combining LLMs with Logic-Based Framework to Explain MCTS | Ziyan An et.al. | 2505.00610 | null | Kimi |
411 | 2025-05-01 | Can LLMs Help Improve Analogical Reasoning For Strategic Decisions? Experimental Evidence from Humans and GPT-4 | Phanish Puranam et.al. | 2505.00603 | null | Kimi |
412 | 2025-05-01 | Fast and Low-Cost Genomic Foundation Models via Outlier Removal | Haozheng Luo et.al. | 2505.00598 | link | Kimi |
413 | 2025-05-01 | A Finite-State Controller Based Offline Solver for Deterministic POMDPs | Alex Schutz et.al. | 2505.00596 | link | Kimi |
414 | 2025-05-01 | Uncertainty-Aware Multi-Expert Knowledge Distillation for Imbalanced Disease Grading | Shuo Tong et.al. | 2505.00592 | null | Kimi |
415 | 2025-05-01 | FreqKV: Frequency Domain Key-Value Compression for Efficient Context Window Extension | Jushi Kai et.al. | 2505.00570 | null | Kimi |
416 | 2025-05-01 | Triggering Hallucinations in LLMs: A Quantitative Study of Prompt-Induced Hallucination in Large Language Models | Makoto Sato et.al. | 2505.00557 | null | Kimi |
417 | 2025-05-01 | 100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models | Chong Zhang et.al. | 2505.00551 | null | Kimi |
418 | 2025-05-01 | HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection | Deanna Emery et.al. | 2505.00506 | null | Kimi |
419 | 2025-05-01 | UserCentrix: An Agentic Memory-augmented AI Framework for Smart Spaces | Alaa Saleh et.al. | 2505.00472 | null | Kimi |
420 | 2025-05-01 | Red Teaming Large Language Models for Healthcare | Vahid Balazadeh et.al. | 2505.00467 | null | Kimi |
421 | 2025-05-01 | Data Therapist: Eliciting Domain Knowledge from Subject Matter Experts Using Large Language Models | Sungbok Shin et.al. | 2505.00455 | null | Kimi |
422 | 2025-05-01 | KoACD: The First Korean Adolescent Dataset for Cognitive Distortion Analysis | JunSeo Kim et.al. | 2505.00367 | null | Kimi |
423 | 2025-05-01 | Enhancing AI-Driven Education: Integrating Cognitive Frameworks, Linguistic Feedback Analysis, and Ethical Considerations for Improved Content Generation | Antoun Yaacoub et.al. | 2505.00339 | null | Kimi |
424 | 2025-05-01 | Mixture of Sparse Attention: Content-Based Learnable Sparse Attention via Expert-Choice Routing | Piotr Piękos et.al. | 2505.00315 | link | Kimi |
425 | 2025-05-01 | Fine-grained spatial-temporal perception for gas leak segmentation | Xinlong Zhao et.al. | 2505.00295 | null | Kimi |
426 | 2025-05-01 | Empowering Agentic Video Analytics Systems with Video Language Models | Yuxuan Yan et.al. | 2505.00254 | null | Kimi |
427 | 2025-04-30 | Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems | Shaokun Zhang et.al. | 2505.00212 | link | Kimi |
428 | 2025-04-30 | Detecting and Mitigating Hateful Content in Multimodal Memes with Vision-Language Models | Minh-Hao Van et.al. | 2505.00150 | null | Kimi |
429 | 2025-04-30 | AdaptMI: Adaptive Skill-based In-context Math Instruction for Small Language Models | Yinghui He et.al. | 2505.00147 | null | Kimi |
430 | 2025-04-30 | Between Underthinking and Overthinking: An Empirical Study of Reasoning Length and correctness in LLMs | Jinyan Su et.al. | 2505.00127 | null | Kimi |
431 | 2025-04-30 | Fine-Tuning LLMs for Low-Resource Dialect Translation: The Case of Lebanese | Silvana Yakhni et.al. | 2505.00114 | link | Kimi |
432 | 2025-04-30 | GDI-Bench: A Benchmark for General Document Intelligence with Vision and Reasoning Decoupling | Siqi Li et.al. | 2505.00063 | null | Kimi |
433 | 2025-04-30 | TRUST: An LLM-Based Dialogue System for Trauma Understanding and Structured Assessments | Sichang Tu et.al. | 2504.21851 | null | Kimi |
434 | 2025-04-30 | Early Exit and Multi Stage Knowledge Distillation in VLMs for Video Summarization | Anas Anwarul Haq Khan et.al. | 2504.21831 | null | Kimi |
435 | 2025-04-30 | DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition | Z. Z. Ren et.al. | 2504.21801 | null | Kimi |
436 | 2025-04-30 | WebThinker: Empowering Large Reasoning Models with Deep Research Capability | Xiaoxi Li et.al. | 2504.21776 | link | Kimi |
437 | 2025-04-30 | MAC-Tuning: LLM Multi-Compositional Problem Reasoning with Enhanced Knowledge Boundary Awareness | Junsheng Huang et.al. | 2504.21773 | null | Kimi |
438 | 2025-04-30 | AdaR1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization | Haotian Luo et.al. | 2504.21659 | link | Kimi |
439 | 2025-04-30 | Sadeed: Advancing Arabic Diacritization Through Small Language Model | Zeina Aldallal et.al. | 2504.21635 | null | Kimi |
440 | 2025-04-30 | Meeseeks: An Iterative Benchmark Evaluating LLMs Multi-Turn Instruction-Following Ability | Jiaming Wang et.al. | 2504.21625 | null | Kimi |
441 | 2025-04-30 | RDF-Based Structured Quality Assessment Representation of Multilingual LLM Evaluations | Jonas Gwozdz et.al. | 2504.21605 | null | Kimi |
442 | 2025-04-30 | DNB-AI-Project at SemEval-2025 Task 5: An LLM-Ensemble Approach for Automated Subject Indexing | Lisa Kluge et.al. | 2504.21589 | link | Kimi |
443 | 2025-04-30 | Precision Where It Matters: A Novel Spike Aware Mixed-Precision Quantization Strategy for LLaMA-based Language Models | Lucas Maisonnave et.al. | 2504.21553 | null | Kimi |
444 | 2025-04-30 | RWKV-X: A Linear Complexity Hybrid Language Model | Haowen Hou et.al. | 2504.21463 | link | Kimi |
445 | 2025-04-30 | SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding | Chenkai Zhang et.al. | 2504.21435 | link | Kimi |
446 | 2025-04-30 | Retrieval-Enhanced Few-Shot Prompting for Speech Event Extraction | Máté Gedeon et.al. | 2504.21372 | null | Kimi |
447 | 2025-04-30 | ShorterBetter: Guiding Reasoning Models to Find Optimal Inference Length for Efficient Reasoning | Jingyang Yi et.al. | 2504.21370 | null | Kimi |
448 | 2025-04-30 | Revisiting Diffusion Autoencoder Training for Image Reconstruction Quality | Pramook Khungurn et.al. | 2504.21368 | null | Kimi |
449 | 2025-04-30 | Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing | Hong Zhang et.al. | 2504.21356 | link | Kimi |
450 | 2025-04-30 | Phi-4-reasoning Technical Report | Marah Abdin et.al. | 2504.21318 | null | Kimi |
451 | 2025-04-30 | BiasGuard: A Reasoning-enhanced Bias Detection Tool For Large Language Models | Zhiting Fan et.al. | 2504.21299 | null | Kimi |
452 | 2025-04-30 | Reinforced MLLM: A Survey on RL-Based Reasoning in Multimodal Large Language Models | Guanghao Zhou et.al. | 2504.21277 | null | Kimi |
453 | 2025-04-30 | Talk Before You Retrieve: Agent-Led Discussions for Better RAG in Medical QA | Xuanzhao Dong et.al. | 2504.21252 | link | Kimi |
454 | 2025-04-30 | Memorization and Knowledge Injection in Gated LLMs | Xu Pan et.al. | 2504.21239 | null | Kimi |
455 | 2025-04-30 | Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math | Haoran Xu et.al. | 2504.21233 | null | Kimi |
456 | 2025-04-29 | CachePrune: Neural-Based Attribution Defense Against Indirect Prompt Injection Attacks | Rui Wang et.al. | 2504.21228 | null | Kimi |
457 | 2025-04-29 | Automatic Legal Writing Evaluation of LLMs | Ramon Pires et.al. | 2504.21202 | link | Kimi |
458 | 2025-04-29 | Small or Large? Zero-Shot or Finetuned? Guiding Language Model Choice for Specialized Applications in Healthcare | Lovedeep Gondara et.al. | 2504.21191 | null | Kimi |
459 | 2025-04-29 | OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification | Shangyu Li et.al. | 2504.20964 | link | Kimi |
460 | 2025-04-29 | Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models | Maryna Vyshnyvetska et.al. | 2504.20951 | null | Kimi |
461 | 2025-04-29 | Trace-of-Thought: Enhanced Arithmetic Problem Solving via Reasoning Distillation From Large to Small Language Models | Tyler McDonald et.al. | 2504.20946 | null | Kimi |
462 | 2025-04-29 | ChestX-Reasoner: Advancing Radiology Foundation Models with Reasoning through Step-by-Step Verification | Ziqing Fan et.al. | 2504.20930 | link | Kimi |
463 | 2025-04-29 | DYNAMAX: Dynamic computing for Transformers and Mamba based architectures | Miguel Nogales et.al. | 2504.20922 | null | Kimi |
464 | 2025-04-29 | Using LLMs in Generating Design Rationale for Software Architecture Decisions | Xiyu Zhou et.al. | 2504.20781 | null | Kimi |
465 | 2025-04-29 | JTreeformer: Graph-Transformer via Latent-Diffusion Model for Molecular Generation | Ji Shi et.al. | 2504.20770 | null | Kimi |
466 | 2025-04-29 | Chain-of-Defensive-Thought: Structured Reasoning Elicits Robustness in Large Language Models against Reference Corruption | Wenxiao Wang et.al. | 2504.20769 | null | Kimi |
467 | 2025-04-29 | Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think | Hasan Abed Al Kader Hammoud et.al. | 2504.20708 | null | Kimi |
468 | 2025-04-29 | Cooking Up Creativity: A Cognitively-Inspired Approach for Enhancing LLM Creativity through Structured Representations | Moran Mizrahi et.al. | 2504.20643 | link | Kimi |
469 | 2025-04-29 | The Hidden Risks of LLM-Generated Web Application Code: A Security-Centric Evaluation of Code Generation Capabilities in Large Language Models | Swaroop Dora et.al. | 2504.20612 | null | Kimi |
470 | 2025-04-29 | Reinforcement Learning for Reasoning in Large Language Models with One Training Example | Yiping Wang et.al. | 2504.20571 | link | Kimi |
471 | 2025-04-29 | UniDetox: Universal Detoxification of Large Language Models via Dataset Distillation | Huimin Lu et.al. | 2504.20500 | link | Kimi |
472 | 2025-04-29 | Token-Efficient Prompt Injection Attack: Provoking Cessation in LLM Reasoning via Adaptive Token Compression | Yu Cui et.al. | 2504.20493 | null | Kimi |
473 | 2025-04-29 | A Summary on GUI Agents with Foundation Models Enhanced by Reinforcement Learning | Jiahao Li et.al. | 2504.20464 | null | Kimi |
474 | 2025-04-29 | Reviving Any-Subset Autoregressive Models with Principled Parallel Sampling and Speculative Decoding | Gabe Guo et.al. | 2504.20456 | link | Kimi |
475 | 2025-04-29 | GaLore 2: Large-Scale LLM Pre-Training by Gradient Low-Rank Projection | DiJia Su et.al. | 2504.20437 | null | Kimi |
476 | 2025-04-29 | FiLA-Video: Spatio-Temporal Compression for Fine-Grained Long Video Understanding | Yanan Guo et.al. | 2504.20384 | null | Kimi |
477 | 2025-04-29 | Local Prompt Optimization | Yash Jain et.al. | 2504.20355 | null | Kimi |
478 | 2025-04-29 | MicarVLMoE: A Modern Gated Cross-Aligned Vision-Language Mixture of Experts Model for Medical Image Captioning and Report Generation | Amaan Izhar et.al. | 2504.20343 | link | Kimi |
479 | 2025-04-28 | Enhancing Systematic Reviews with Large Language Models: Using GPT-4 and Kimi | Dandan Chen Kaptur et.al. | 2504.20276 | null | Kimi |
480 | 2025-04-28 | Can Large Language Models Learn Formal Logic? A Data-Driven Training and Evaluation Framework | Yuan Xia et.al. | 2504.20213 | null | Kimi |
481 | 2025-04-28 | Weaving Context Across Images: Improving Vision-Language Models through Focus-Centric Visual Chains | Juntian Zhang et.al. | 2504.20199 | null | Kimi |
482 | 2025-04-28 | MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools | Nishant Subramani et.al. | 2504.20168 | null | Kimi |
483 | 2025-04-28 | AutoJudge: Judge Decoding Without Manual Annotation | Roman Garipov et.al. | 2504.20039 | null | Kimi |
484 | 2025-04-28 | Towards Automated Scoping of AI for Social Good Projects | Jacob Emmerson et.al. | 2504.20010 | null | Kimi |
485 | 2025-04-28 | TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons | Emre Can Acikgoz et.al. | 2504.19982 | null | Kimi |
486 | 2025-04-28 | Accelerating Mixture-of-Experts Training with Adaptive Expert Replication | Athinagoras Skiadopoulos et.al. | 2504.19925 | null | Kimi |
487 | 2025-04-28 | Enhancing Surgical Documentation through Multimodal Visual-Temporal Transformers and Generative AI | Hugo Georgenthum et.al. | 2504.19918 | null | Kimi |
488 | 2025-04-28 | Can AI Agents Design and Implement Drug Discovery Pipelines? | Khachik Smbatyan et.al. | 2504.19912 | null | Kimi |
489 | 2025-04-28 | GenCLS++: Pushing the Boundaries of Generative Classification in LLMs Through Comprehensive SFT and RL Studies Across Diverse Datasets | Mingqian He et.al. | 2504.19898 | null | Kimi |
490 | 2025-04-28 | semi-PD: Towards Efficient LLM Serving via Phase-Wise Disaggregated Computation and Unified Storage | Ke Hong et.al. | 2504.19867 | null | Kimi |
491 | 2025-04-28 | Can a Crow Hatch a Falcon? Lineage Matters in Predicting Large Language Model Performance | Takuya Tamura et.al. | 2504.19811 | null | Kimi |
492 | 2025-04-28 | Moral Reasoning Across Languages: The Critical Role of Low-Resource Languages in LLMs | Huichi Zhou et.al. | 2504.19759 | null | Kimi |
493 | 2025-04-28 | Reconstructing Context: Evaluating Advanced Chunking Strategies for Retrieval-Augmented Generation | Carlo Merola et.al. | 2504.19754 | link | Kimi |
494 | 2025-04-28 | LLM-Assisted Automated Deductive Coding of Dialogue Data: Leveraging Dialogue-Specific Characteristics to Enhance Contextual Understanding | Ying Na et.al. | 2504.19734 | null | Kimi |
495 | 2025-04-28 | Taming the Titans: A Survey of Efficient LLM Inference Serving | Ranran Zhen et.al. | 2504.19720 | link | Kimi |
496 | 2025-04-28 | From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review | Mohamed Amine Ferrag et.al. | 2504.19678 | null | Kimi |
497 | 2025-04-28 | Annif at SemEval-2025 Task 5: Traditional XMTC augmented by LLMs | Osma Suominen et.al. | 2504.19675 | link | Kimi |
498 | 2025-04-28 | VCM: Vision Concept Modeling Based on Implicit Contrastive Learning with Vision-Language Instruction Fine-Tuning | Run Luo et.al. | 2504.19627 | null | Kimi |
499 | 2025-04-28 | m-KAILIN: Knowledge-Driven Agentic Scientific Corpus Distillation Framework for Biomedical Large Language Models Training | Meng Xiao et.al. | 2504.19565 | null | Kimi |
500 | 2025-04-28 | DEEMO: De-identity Multimodal Emotion Recognition and Reasoning | Deng Li et.al. | 2504.19549 | null | Kimi |
501 | 2025-04-28 | Bullet: Boosting GPU Utilization for LLM Serving via Dynamic Spatial-Temporal Orchestration | Zejia Lin et.al. | 2504.19516 | null | Kimi |
502 | 2025-04-28 | Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding | Yan Wang et.al. | 2504.19500 | null | Kimi |
503 | 2025-04-28 | Improving Reasoning Performance in Large Language Models via Representation Engineering | Bertram Højer et.al. | 2504.19483 | null | Kimi |
504 | 2025-04-28 | BRIDGE: Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text | Jiageng Wu et.al. | 2504.19467 | link | Kimi |
505 | 2025-04-28 | Towards Long Context Hallucination Detection | Siyi Liu et.al. | 2504.19457 | null | Kimi |
506 | 2025-04-28 | Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory | Prateek Chhikara et.al. | 2504.19413 | null | Kimi |
507 | 2025-04-28 | ICL CIPHERS: Quantifying “Learning’’ in In-Context Learning via Substitution Ciphers | Zhouxiang Fang et.al. | 2504.19395 | null | Kimi |
508 | 2025-04-27 | LLMs for Engineering: Teaching Models to Design High Powered Rockets | Toby Simonds et.al. | 2504.19394 | null | Kimi |
509 | 2025-04-27 | Unified Multi-Task Learning & Model Fusion for Efficient Language Model Guardrailing | James O’ Neill et.al. | 2504.19333 | null | Kimi |
510 | 2025-04-27 | Platonic Grounding for Efficient Multimodal Language Models | Moulik Choraria et.al. | 2504.19327 | null | Kimi |
511 | 2025-04-27 | BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese | Peilin Zhou et.al. | 2504.19314 | link | Kimi |
512 | 2025-04-27 | AndroidGen: Building an Android Language Agent under Data Scarcity | Hanyu Lai et.al. | 2504.19298 | null | Kimi |
513 | 2025-04-24 | Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models | Xu Ma et.al. | 2504.17789 | null | Kimi |
514 | 2025-04-24 | The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs | Piotr Nawrot et.al. | 2504.17768 | null | Kimi |
515 | 2025-04-24 | Step1X-Edit: A Practical Framework for General Image Editing | Shiyu Liu et.al. | 2504.17761 | link | Kimi |
516 | 2025-04-24 | Conversational Assistants to support Heart Failure Patients: comparing a Neurosymbolic Architecture with ChatGPT | Anuja Tayal et.al. | 2504.17753 | null | Kimi |
517 | 2025-04-24 | CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos | Shucheng Gong et.al. | 2504.17728 | link | Kimi |
518 | 2025-04-24 | Multilingual Performance Biases of Large Language Models in Education | Vansh Gupta et.al. | 2504.17720 | null | Kimi |
519 | 2025-04-24 | Early Detection of Multidrug Resistance Using Multivariate Time Series Analysis and Interpretable Patient-Similarity Representations | Óscar Escudero-Arnanz et.al. | 2504.17717 | null | Kimi |
520 | 2025-04-24 | Generative Fields: Uncovering Hierarchical Feature Control for StyleGAN via Inverted Receptive Fields | Zhuo He et.al. | 2504.17712 | null | Kimi |
521 | 2025-04-24 | Plasma State Monitoring and Disruption Characterization using Multimodal VAEs | Yoeri Poels et.al. | 2504.17710 | null | Kimi |
522 | 2025-04-24 | Safety in Large Reasoning Models: A Survey | Cheng Wang et.al. | 2504.17704 | null | Kimi |
523 | 2025-04-24 | Federated Learning: A Survey on Privacy-Preserving Collaborative Intelligence | Edward Collins et.al. | 2504.17703 | null | Kimi |
524 | 2025-04-24 | Hierarchical and Multimodal Data for Daily Activity Understanding | Ghazal Kaviani et.al. | 2504.17696 | link | Kimi |
525 | 2025-04-24 | BIM-Constrained Optimization for Accurate Localization and Deviation Correction in Construction Monitoring | Asier Bikandi et.al. | 2504.17693 | null | Kimi |
526 | 2025-04-24 | Ensemble Bayesian Inference: Leveraging Small Language Models to Achieve LLM-level Accuracy in Profile Matching Tasks | Haru-Tada Sato et.al. | 2504.17685 | null | Kimi |
527 | 2025-04-24 | INSIGHT: Bridging the Student-Teacher Gap in Times of Large Language Models | Jarne Thys et.al. | 2504.17677 | null | Kimi |
528 | 2025-04-24 | Energy Considerations of Large Language Model Inference and Efficiency Optimizations | Jared Fernandez et.al. | 2504.17674 | null | Kimi |
529 | 2025-04-24 | Cross-region Model Training with Communication-Computation Overlapping and Delay Compensation | Ying Zhu et.al. | 2504.17672 | null | Kimi |
530 | 2025-04-24 | Data-Driven Calibration of Prediction Sets in Large Vision-Language Models Based on Inductive Conformal Prediction | Yuanchang Ye et.al. | 2504.17671 | null | Kimi |
531 | 2025-04-24 | DiMeR: Disentangled Mesh Reconstruction Model | Lutao Jiang et.al. | 2504.17670 | null | Kimi |
532 | 2025-04-24 | Towards a HIPAA Compliant Agentic AI System in Healthcare | Subash Neupane et.al. | 2504.17669 | null | Kimi |
533 | 2025-04-24 | Evaluating Grounded Reasoning by Code-Assisted Large Language Models for Mathematics | Zena Al-Khalili et.al. | 2504.17665 | null | Kimi |
534 | 2025-04-24 | Aerial Image Classification in Scarce and Unconstrained Environments via Conformal Prediction | Farhad Pourkamali-Anaraki et.al. | 2504.17655 | null | Kimi |
535 | 2025-04-24 | DeepDistill: Enhancing LLM Reasoning Capabilities via Large-Scale Difficulty-Graded Data Training | Xiaoyu Tian et.al. | 2504.17565 | null | Kimi |
536 | 2025-04-24 | HalluLens: LLM Hallucination Benchmark | Yejin Bang et.al. | 2504.17550 | null | Kimi |
537 | 2025-04-24 | A Comprehensive Survey of Knowledge-Based Vision Question Answering Systems: The Lifecycle of Knowledge in Visual Reasoning Task | Jiaqi Deng et.al. | 2504.17547 | null | Kimi |
538 | 2025-04-24 | Auditing the Ethical Logic of Generative AI Models | W. Russell Neuman et.al. | 2504.17544 | null | Kimi |
539 | 2025-04-24 | Unified Attacks to Large Language Model Watermarks: Spoofing and Scrubbing in Unauthorized Knowledge Distillation | Xin Yi et.al. | 2504.17480 | null | Kimi |
540 | 2025-04-24 | FRAG: Frame Selection Augmented Generation for Long Video and Long Document Understanding | De-An Huang et.al. | 2504.17447 | link | Kimi |
541 | 2025-04-24 | Assessing the Capability of Large Language Models for Domain-Specific Ontology Generation | Anna Sofia Lippolis et.al. | 2504.17402 | null | Kimi |
542 | 2025-04-24 | LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live Streams | Yongxuan Wu et.al. | 2504.17366 | link | Kimi |
543 | 2025-04-24 | TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation | Ling You et.al. | 2504.17365 | null | Kimi |
544 | 2025-04-24 | FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation | Yulia Otmakhova et.al. | 2504.17311 | null | Kimi |
545 | 2025-04-24 | JurisCTC: Enhancing Legal Judgment Prediction via Cross-Domain Transfer and Contrastive Learning | Zhaolu Kang et.al. | 2504.17264 | null | Kimi |
546 | 2025-04-24 | MCAF: Efficient Agent-based Video Understanding Framework through Multimodal Coarse-to-Fine Attention Focusing | Shiwen Cao et.al. | 2504.17213 | null | Kimi |
547 | 2025-04-24 | A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation | Yangxinyu Xie et.al. | 2504.17200 | null | Kimi |
548 | 2025-04-24 | Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning | Minju Seo et.al. | 2504.17192 | link | Kimi |
549 | 2025-04-23 | MIRAGE: A Metric-Intensive Benchmark for Retrieval-Augmented Generation Evaluation | Chanhee Park et.al. | 2504.17137 | null | Kimi |
550 | 2025-04-23 | Steering the CensorShip: Uncovering Representation Vectors for LLM “Thought” Control | Hannah Cyberey et.al. | 2504.17130 | link | Kimi |
551 | 2025-04-23 | The Rise of Small Language Models in Healthcare: A Comprehensive Survey | Muskan Garg et.al. | 2504.17119 | null | Kimi |
552 | 2025-04-23 | Leveraging LLMs as Meta-Judges: A Multi-Agent Framework for Evaluating LLM Judgments | Yuran Li et.al. | 2504.17087 | null | Kimi |
553 | 2025-04-23 | DyMU: Dynamic Merging and Virtual Unmerging for Efficient VLMs | Zhenhailong Wang et.al. | 2504.17040 | null | Kimi |
554 | 2025-04-23 | (Im)possibility of Automated Hallucination Detection in Large Language Models | Amin Karbasi et.al. | 2504.17004 | null | Kimi |
555 | 2025-04-23 | Tracing Thought: Using Chain-of-Thought Reasoning to Identify the LLM Behind AI-Generated Text | Shifali Agrahari et.al. | 2504.16913 | null | Kimi |
556 | 2025-04-23 | Do Large Language Models know who did what to whom? | Joseph M. Denning et.al. | 2504.16884 | null | Kimi |
557 | 2025-04-23 | Monte Carlo Planning with Large Language Model for Text-Based Game Agents | Zijing Shi et.al. | 2504.16855 | null | Kimi |
558 | 2025-04-23 | GreenMind: A Next-Generation Vietnamese Large Language Model for Structured and Logical Reasoning | Luu Quy Tung et.al. | 2504.16832 | null | Kimi |
559 | 2025-04-23 | Process Reward Models That Think | Muhammad Khalifa et.al. | 2504.16828 | link | Kimi |
560 | 2025-04-23 | Random Long-Context Access for Mamba via Hardware-aligned Hierarchical Sparse Attention | Xiang Hu et.al. | 2504.16795 | null | Kimi |
561 | 2025-04-23 | Towards Explainable AI: Multi-Modal Transformer for Video-based Image Description Generation | Lakshita Agarwal et.al. | 2504.16788 | null | Kimi |
562 | 2025-04-23 | MOOSComp: Improving Lightweight Long-Context Compressor via Mitigating Over-Smoothing and Incorporating Outlier Scores | Fengwei Zhou et.al. | 2504.16786 | null | Kimi |
563 | 2025-04-23 | How Effective are Generative Large Language Models in Performing Requirements Classification? | Waad Alhoshan et.al. | 2504.16768 | null | Kimi |
564 | 2025-04-23 | Lightweight Latent Verifiers for Efficient Meta-Generation Strategies | Bartosz Piotrowski et.al. | 2504.16760 | null | Kimi |
565 | 2025-04-23 | HEMA : A Hippocampus-Inspired Extended Memory Architecture for Long-Context AI Conversations | Kwangseob Ahn et.al. | 2504.16754 | null | Kimi |
566 | 2025-04-23 | IRIS: Interactive Research Ideation System for Accelerating Scientific Discovery | Aniketh Garikaparthi et.al. | 2504.16728 | null | Kimi |
567 | 2025-04-23 | Debunking with Dialogue? Exploring AI-Generated Counterspeech to Challenge Conspiracy Theories | Mareike Lisker et.al. | 2504.16604 | null | Kimi |
568 | 2025-04-23 | Comparing Large Language Models and Traditional Machine Translation Tools for Translating Medical Consultation Summaries: A Pilot Study | Andy Li et.al. | 2504.16601 | null | Kimi |
569 | 2025-04-23 | PIS: Linking Importance Sampling and Attention Mechanisms for Efficient Prompt Compression | Lizhe Chen et.al. | 2504.16574 | null | Kimi |
570 | 2025-04-23 | Amplified Vulnerabilities: Structured Jailbreak Attacks on LLM-based Multi-Agent Debate | Senmao Qi et.al. | 2504.16489 | null | Kimi |
571 | 2025-04-23 | Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark | Hanlei Zhang et.al. | 2504.16427 | link | Kimi |
572 | 2025-04-23 | Evaluating Multi-Hop Reasoning in Large Language Models: A Chemistry-Centric Case Study | Mohammad Khodadad et.al. | 2504.16414 | null | Kimi |
573 | 2025-04-23 | ConTextual: Improving Clinical Text Summarization in LLMs with Context-preserving Token Filtering and Knowledge Graphs | Fahmida Liza Piya et.al. | 2504.16394 | link | Kimi |
574 | 2025-04-23 | SplitReason: Learning To Offload Reasoning | Yash Akhauri et.al. | 2504.16379 | null | Kimi |
575 | 2025-04-23 | Text-to-TrajVis: Enabling Trajectory Data Visualizations from Natural Language Questions | Tian Bai et.al. | 2504.16358 | null | Kimi |
576 | 2025-04-23 | DP2FL: Dual Prompt Personalized Federated Learning in Foundation Models | Ying Chang et.al. | 2504.16357 | null | Kimi |
577 | 2025-04-22 | The Paradox of Poetic Intent in Back-Translation: Evaluating the Quality of Large Language Models in Chinese Translation | Li Weigang et.al. | 2504.16286 | null | Kimi |
578 | 2025-04-22 | FinNLI: Novel Dataset for Multi-Genre Financial Natural Language Inference Benchmarking | Jabez Magomere et.al. | 2504.16188 | null | Kimi |
579 | 2025-04-22 | MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention | Yucheng Li et.al. | 2504.16083 | null | Kimi |
580 | 2025-04-22 | MR. Video: “MapReduce” is the Principle for Long Video Understanding | Ziqi Pang et.al. | 2504.16082 | null | Kimi |
581 | 2025-04-22 | LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities | Thomas Schmied et.al. | 2504.16078 | null | Kimi |
582 | 2025-04-22 | LongMamba: Enhancing Mamba’s Long Context Capabilities via Training-Free Receptive Field Enlargement | Zhifan Ye et.al. | 2504.16053 | link | Kimi |
583 | 2025-04-22 | Benchmarking LLM for Code Smells Detection: OpenAI GPT-4.0 vs DeepSeek-V3 | Ahmed R. Sadik et.al. | 2504.16027 | null | Kimi |
584 | 2025-04-23 | CAPO: Cost-Aware Prompt Optimization | Tom Zehle et.al. | 2504.16005 | link | Kimi |
585 | 2025-04-22 | FairTranslate: An English-French Dataset for Gender Bias Evaluation in Machine Translation by Overcoming Gender Binarity | Fanny Jourdan et.al. | 2504.15941 | link | Kimi |
586 | 2025-04-22 | Impact of Noise on LLM-Models Performance in Abstraction and Reasoning Corpus (ARC) Tasks with Model Temperature Considerations | Nikhil Khandalkar et.al. | 2504.15903 | null | Kimi |
587 | 2025-04-22 | SARI: Structured Audio Reasoning via Curriculum-Guided Reinforcement Learning | Cheng Wen et.al. | 2504.15900 | null | Kimi |
588 | 2025-04-22 | Dynamic Early Exit in Reasoning Models | Chenxu Yang et.al. | 2504.15895 | null | Kimi |
589 | 2025-04-22 | What’s the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token Patterns | Michael A. Hedderich et.al. | 2504.15815 | null | Kimi |
590 | 2025-04-22 | A closer look at how large language models trust humans: patterns and biases | Valeria Lerman et.al. | 2504.15801 | null | Kimi |
591 | 2025-04-22 | Automated Creativity Evaluation for Large Language Models: A Reference-Based Approach | Ruizhe Li et.al. | 2504.15784 | null | Kimi |
592 | 2025-04-22 | TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving | Daocheng Fu et.al. | 2504.15780 | null | Kimi |
593 | 2025-04-22 | DianJin-R1: Evaluating and Enhancing Financial Reasoning in Large Language Models | Jie Zhu et.al. | 2504.15716 | null | Kimi |
594 | 2025-04-22 | Cost-Effective Text Clustering with Large Language Models | Hongtao Wang et.al. | 2504.15640 | null | Kimi |
595 | 2025-04-22 | DR.FIX: Automatically Fixing Data Races at Industry Scale | Farnaz Behrang et.al. | 2504.15637 | null | Kimi |
596 | 2025-04-22 | Exploiting Contextual Knowledge in LLMs through V-usable Information based Layer Enhancement | Xiaowei Yuan et.al. | 2504.15630 | null | Kimi |
597 | 2025-04-22 | A Multi-Agent Framework for Automated Qinqiang Opera Script Generation Using Large Language Models | Gengxian Cao et.al. | 2504.15552 | null | Kimi |
598 | 2025-04-22 | llm-jp-modernbert: A ModernBERT Model Trained on a Large-Scale Japanese Corpus with Long Context Length | Issa Sugiura et.al. | 2504.15544 | null | Kimi |
599 | 2025-04-22 | Compass-V2 Technical Report | Sophia Maria et.al. | 2504.15527 | null | Kimi |
600 | 2025-04-21 | CAPTURe: Evaluating Spatial Reasoning in Vision Language Models via Occluded Object Counting | Atin Pothiraj et.al. | 2504.15485 | null | Kimi |
601 | 2025-04-21 | Speculative Sampling via Exponential Races | Szymon Kobus et.al. | 2504.15475 | null | Kimi |
602 | 2025-04-21 | Trillion 7B Technical Report | Sungjun Han et.al. | 2504.15431 | null | Kimi |
603 | 2025-04-21 | LLM-Assisted Translation of Legacy FORTRAN Codes to C++: A Cross-Platform Study | Nishath Rajiv Ranasinghe et.al. | 2504.15424 | null | Kimi |
604 | 2025-04-21 | IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs | David Ma et.al. | 2504.15415 | link | Kimi |
605 | 2025-04-21 | Tell Me What You Know About Sexism: Expert-LLM Interaction Strategies and Co-Created Definitions for Zero-Shot Sexism Detection | Myrthe Reuver et.al. | 2504.15392 | null | Kimi |
606 | 2025-04-21 | Towards Understanding Camera Motions in Any Video | Zhiqiu Lin et.al. | 2504.15376 | null | Kimi |
607 | 2025-04-21 | KeDiff: Key Similarity-Based KV Cache Eviction for Long-Context LLM Inference in Resource-Constrained Environments | Junyoung Park et.al. | 2504.15364 | null | Kimi |
608 | 2025-04-21 | Exploring Compositional Generalization (in ReCOGS_pos) by Transformers using Restricted Access Sequence Processing (RASP) | William Bruns et.al. | 2504.15349 | null | Kimi |
609 | 2025-04-21 | Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs | Chun-Hsiao Yeh et.al. | 2504.15280 | link | Kimi |
610 | 2025-04-21 | FlowReasoner: Reinforcing Query-Level Meta-Agents | Hongcheng Gao et.al. | 2504.15257 | link | Kimi |
611 | 2025-04-21 | Support Evaluation for the TREC 2024 RAG Track: Comparing Human versus LLM Judges | Nandan Thakur et.al. | 2504.15205 | null | Kimi |
612 | 2025-04-21 | The Synthetic Imputation Approach: Generating Optimal Synthetic Texts For Underrepresented Categories In Supervised Classification Tasks | Joan C. Timoneda et.al. | 2504.15160 | null | Kimi |
613 | 2025-04-21 | EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models | Ziwen Xu et.al. | 2504.15133 | link | Kimi |
614 | 2025-04-21 | Kuwain 1.5B: An Arabic SLM via Language Injection | Khalil Hennara et.al. | 2504.15120 | null | Kimi |
615 | 2025-04-21 | A triple-branch network for latent fingerprint enhancement guided by orientation fields and minutiae | Yurun Wang et.al. | 2504.15105 | null | Kimi |
616 | 2025-04-21 | Rethinking the Potential of Multimodality in Collaborative Problem Solving Diagnosis with Large Language Models | K. Wong et.al. | 2504.15093 | null | Kimi |
617 | 2025-04-21 | DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation | Weijie He et.al. | 2504.15032 | null | Kimi |
618 | 2025-04-21 | Efficient Pretraining Length Scaling | Bohong Wu et.al. | 2504.14992 | null | Kimi |
619 | 2025-04-21 | Speaker Fuzzy Fingerprints: Benchmarking Text-Based Identification in Multiparty Dialogues | Rui Ribeiro et.al. | 2504.14963 | null | Kimi |
620 | 2025-04-21 | MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core | Dennis Liu et.al. | 2504.14960 | null | Kimi |
621 | 2025-04-21 | EducationQ: Evaluating LLMs’ Teaching Capabilities Through Multi-Agent Dialogue Framework | Yao Shi et.al. | 2504.14928 | null | Kimi |
622 | 2025-04-21 | CRAVE: A Conflicting Reasoning Approach for Explainable Claim Verification Using LLMs | Yingming Zheng et.al. | 2504.14905 | link | Kimi |
623 | 2025-04-21 | Latent Bayesian Optimization via Autoregressive Normalizing Flows | Seunghun Lee et.al. | 2504.14889 | null | Kimi |
624 | 2025-04-21 | Natural Fingerprints of Large Language Models | Teppei Suzuki et.al. | 2504.14871 | null | Kimi |
625 | 2025-04-21 | OTC: Optimal Tool Calls via Reinforcement Learning | Hongru Wang et.al. | 2504.14870 | null | Kimi |
626 | 2025-04-21 | ECViT: Efficient Convolutional Vision Transformer with Local-Attention and Multi-scale Stages | Zhoujie Qian et.al. | 2504.14825 | null | Kimi |
627 | 2025-04-21 | On Self-improving Token Embeddings | Mario M. Kubek et.al. | 2504.14808 | null | Kimi |
628 | 2025-04-21 | Automatic Evaluation Metrics for Document-level Translation: Overview, Challenges and Trends | Jiaxin GUO et.al. | 2504.14804 | null | Kimi |
629 | 2025-04-21 | gLLM: Global Balanced Pipeline Parallelism System for Distributed LLM Serving with Token Throttling | Tianyu Guo et.al. | 2504.14775 | link | Kimi |
630 | 2025-04-21 | PLANET: A Collection of Benchmarks for Evaluating LLMs’ Planning Capabilities | Haoming Li et.al. | 2504.14773 | null | Kimi |
631 | 2025-04-20 | Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions | Luyang Fang et.al. | 2504.14772 | null | Kimi |
632 | 2025-04-20 | SWE-Synth: Synthesizing Verifiable Bug-Fix Data to Enable Large Language Models in Resolving Real-World Bugs | Minh V. T. Pham et.al. | 2504.14757 | null | Kimi |
633 | 2025-04-20 | PROMPTEVALS: A Dataset of Assertions and Guardrails for Custom Production Large Language Model Pipelines | Reya Vir et.al. | 2504.14738 | null | Kimi |
634 | 2025-04-20 | AI with Emotions: Exploring Emotional Expressions in Large Language Models | Shin-nosuke Ishikawa et.al. | 2504.14706 | null | Kimi |
635 | 2025-04-20 | Video-MMLU: A Massive Multi-Discipline Lecture Understanding Benchmark | Enxin Song et.al. | 2504.14693 | link | Kimi |
636 | 2025-04-20 | FarsEval-PKBETS: A new diverse benchmark for evaluating Persian large language models | Mehrnoush Shamsfard et.al. | 2504.14690 | null | Kimi |
637 | 2025-04-20 | Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens | Kaihang Pan et.al. | 2504.14666 | null | Kimi |
638 | 2025-04-20 | A Case Study Exploring the Current Landscape of Synthetic Medical Record Generation with Commercial LLMs | Yihan Lin et.al. | 2504.14657 | null | Kimi |
639 | 2025-04-17 | PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding | Jang Hyun Cho et.al. | 2504.13180 | link | Kimi |
640 | 2025-04-17 | Single-Shot Shape and Reflectance with Spatial Polarization Multiplexing | Tomoki Ichikawa et.al. | 2504.13177 | null | Kimi |
641 | 2025-04-17 | It’s All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization | Ali Behrouz et.al. | 2504.13173 | null | Kimi |
642 | 2025-04-17 | Sleep-time Compute: Beyond Inference Scaling at Test-time | Kevin Lin et.al. | 2504.13171 | link | Kimi |
643 | 2025-04-17 | Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling | Tsung-Han Wu et.al. | 2504.13169 | link | Kimi |
644 | 2025-04-17 | CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training | Shizhe Diao et.al. | 2504.13161 | null | Kimi |
645 | 2025-04-17 | MIB: A Mechanistic Interpretability Benchmark | Aaron Mueller et.al. | 2504.13151 | link | Kimi |
646 | 2025-04-17 | Readable Twins of Unreadable Models | Krzysztof Pancerz et.al. | 2504.13150 | link | Kimi |
647 | 2025-04-17 | Antidistillation Sampling | Yash Savani et.al. | 2504.13146 | null | Kimi |
648 | 2025-04-17 | Exploring Expert Failures Improves LLM Agent Tuning | Li-Cheng Lan et.al. | 2504.13145 | null | Kimi |
649 | 2025-04-17 | $\texttt{Complex-Edit}$ : CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark | Siwei Yang et.al. | 2504.13143 | null | Kimi |
650 | 2025-04-17 | Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo | João Loula et.al. | 2504.13139 | null | Kimi |
651 | 2025-04-17 | Energy-Based Reward Models for Robust Language Model Alignment | Anamika Lochab et.al. | 2504.13134 | link | Kimi |
652 | 2025-04-17 | Science-T2I: Addressing Scientific Illusions in Image Synthesis | Jialuo Li et.al. | 2504.13129 | null | Kimi |
653 | 2025-04-17 | LLMs Meet Finance: Fine-Tuning Foundation Models for the Open FinLLM Leaderboard | Varun Rao et.al. | 2504.13125 | null | Kimi |
654 | 2025-04-17 | Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training | Xinsong Zhang et.al. | 2504.13123 | null | Kimi |
655 | 2025-04-17 | VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models | Haojian Huang et.al. | 2504.13122 | link | Kimi |
656 | 2025-04-17 | Probing and Inducing Combinational Creativity in Vision-Language Models | Yongqian Peng et.al. | 2504.13120 | null | Kimi |
657 | 2025-04-17 | EventVAD: Training-Free Event-Aware Video Anomaly Detection | Yihua Shao et.al. | 2504.13092 | null | Kimi |
658 | 2025-04-17 | Retrieval-Augmented Generation with Conflicting Evidence | Han Wang et.al. | 2504.13079 | link | Kimi |
659 | 2025-04-17 | Enhancing Person-to-Person Virtual Try-On with Multi-Garment Virtual Try-Off | Riza Velioglu et.al. | 2504.13078 | link | Kimi |
660 | 2025-04-17 | SkyReels-V2: Infinite-length Film Generative Model | Guibin Chen et.al. | 2504.13074 | link | Kimi |
661 | 2025-04-17 | Accuracy is Not Agreement: Expert-Aligned Evaluation of Crash Narrative Classification Models | Sudesh Ramesh Bhagat et.al. | 2504.13068 | null | Kimi |
662 | 2025-04-17 | RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins | Yao Mu et.al. | 2504.13059 | null | Kimi |
663 | 2025-04-17 | Aspect-Based Summarization with Self-Aspect Retrieval Enhanced Generation | Yichao Feng et.al. | 2504.13054 | null | Kimi |
664 | 2025-04-17 | How Large Language Models Are Changing MOOC Essay Answers: A Comparison of Pre- and Post-LLM Responses | Leo Leppänen et.al. | 2504.13038 | null | Kimi |
665 | 2025-04-17 | Towards Cardiac MRI Foundation Models: Comprehensive Visual-Tabular Representations for Whole-Heart Assessment and Beyond | Yundi Zhang et.al. | 2504.13037 | null | Kimi |
666 | 2025-04-17 | InstructRAG: Leveraging Retrieval-Augmented Generation on Instruction Graphs for LLM-Based Task Planning | Zheng Wang et.al. | 2504.13032 | null | Kimi |
667 | 2025-04-17 | ChatEXAONEPath: An Expert-level Multimodal Large Language Model for Histopathology Using Whole Slide Images | Sangwook Kim et.al. | 2504.13023 | null | Kimi |
668 | 2025-04-17 | Pose and Facial Expression Transfer by using StyleGAN | Petr Jahoda et.al. | 2504.13021 | null | Kimi |
669 | 2025-04-17 | SHA256 at SemEval-2025 Task 4: Selective Amnesia – Constrained Unlearning for Large Language Models via Knowledge Isolation | Saransh Agrawal et.al. | 2504.12996 | link | Kimi |
670 | 2025-04-17 | Are Retrials All You Need? Enhancing Large Language Model Reasoning Without Verbalized Feedback | Nearchos Potamitis et.al. | 2504.12951 | null | Kimi |
671 | 2025-04-17 | Information Gain-Guided Causal Intervention for Autonomous Debiasing Large Language Models | Zhouhao Sun et.al. | 2504.12898 | null | Kimi |
672 | 2025-04-17 | EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting | Guanrou Yang et.al. | 2504.12867 | null | Kimi |
673 | 2025-04-17 | Can LLMs reason over extended multilingual contexts? Towards long-context evaluation beyond retrieval and haystacks | Amey Hengle et.al. | 2504.12845 | link | Kimi |
674 | 2025-04-17 | Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration | Yicheng Pan et.al. | 2504.12773 | link | Kimi |
675 | 2025-04-17 | Pandora: A Code-Driven Large Language Model Agent for Unified Reasoning Across Diverse Structured Knowledge | Yongrui Chen et.al. | 2504.12734 | null | Kimi |
676 | 2025-04-17 | Why and How LLMs Hallucinate: Connecting the Dots with Subsequence Associations | Yiyou Sun et.al. | 2504.12691 | link | Kimi |
677 | 2025-04-17 | Data-efficient LLM Fine-tuning for Code Generation | Weijie Lv et.al. | 2504.12687 | link | Kimi |
678 | 2025-04-17 | Scaling Instruction-Tuned LLMs to Million-Token Contexts via Hierarchical Synthetic Data Generation | Linda He et.al. | 2504.12637 | null | Kimi |
679 | 2025-04-17 | Identifying and Mitigating the Influence of the Prior Distribution in Large Language Models | Liyi Zhang et.al. | 2504.12585 | link | Kimi |
680 | 2025-04-17 | MetaSynth: Meta-Prompting-Driven Agentic Scaffolds for Diverse Synthetic Data Generation | Haris Riaz et.al. | 2504.12563 | null | Kimi |
681 | 2025-04-17 | ZeroSumEval: Scaling LLM Evaluation with Inter-Model Competition | Haidar Khan et.al. | 2504.12562 | link | Kimi |
682 | 2025-04-17 | Memorization: A Close Look at Books | Iris Ma et.al. | 2504.12549 | null | Kimi |
683 | 2025-04-16 | MOM: Memory-Efficient Offloaded Mini-Sequence Inference for Long Context Language Models | Junyang Zhang et.al. | 2504.12526 | null | Kimi |
684 | 2025-04-16 | Memorization vs. Reasoning: Updating LLMs with New Knowledge | Aochong Oliver Li et.al. | 2504.12523 | null | Kimi |
685 | 2025-04-16 | Towards Conversational AI for Human-Machine Collaborative MLOps | George Fatouros et.al. | 2504.12477 | null | Kimi |
686 | 2025-04-16 | Integrating Structural and Semantic Signals in Text-Attributed Graphs with BiGTex | Azadeh Beiranvand et.al. | 2504.12474 | link | Kimi |
687 | 2025-04-16 | Dense Backpropagation Improves Training for Sparse Mixture-of-Experts | Ashwinee Panda et.al. | 2504.12463 | link | Kimi |
688 | 2025-04-16 | Activated LoRA: Fine-tuned LLMs for Intrinsics | Kristjan Greenewald et.al. | 2504.12397 | null | Kimi |
689 | 2025-04-16 | BitNet b1.58 2B4T Technical Report | Shuming Ma et.al. | 2504.12285 | null | Kimi |
690 | 2025-04-16 | How Do I Do That? Synthesizing 3D Hand Motion and Contacts for Everyday Interactions | Aditya Prakash et.al. | 2504.12284 | null | Kimi |
691 | 2025-04-16 | FLIP Reasoning Challenge | Andreas Plesner et.al. | 2504.12256 | link | Kimi |
692 | 2025-04-16 | What Do Large Language Models Know? Tacit Knowledge as a Potential Causal-Explanatory Structure | Céline Budding et.al. | 2504.12187 | null | Kimi |
693 | 2025-04-16 | SALAD: Improving Robustness and Generalization through Contrastive Learning with Structure-Aware and LLM-Driven Augmented Data | Suyoung Bae et.al. | 2504.12185 | null | Kimi |
694 | 2025-04-16 | Efficient Contrastive Decoding with Probabilistic Hallucination Detection - Mitigating Hallucinations in Large Vision Language Models - | Laura Fieback et.al. | 2504.12137 | null | Kimi |
695 | 2025-04-16 | Reasoning-Based AI for Startup Evaluation (R.A.I.S.E.): A Memory-Augmented, Multi-Step Decision Framework | Jack Preuveneers et.al. | 2504.12090 | null | Kimi |
696 | 2025-04-16 | Purposefully Induced Psychosis (PIP): Embracing Hallucination as Imagination in Large Language Models | Kris Pilcher et.al. | 2504.12012 | null | Kimi |
697 | 2025-04-16 | Generative Recommendation with Continuous-Token Diffusion | Haohao Qu et.al. | 2504.12007 | null | Kimi |
698 | 2025-04-16 | Language Models as Quasi-Crystalline Thought: Structure, Constraint, and Emergence in Generative Systems | Jose Manuel Guevara-Vela et.al. | 2504.11986 | null | Kimi |
699 | 2025-04-16 | ADAT: Time-Series-Aware Adaptive Transformer Architecture for Sign Language Translation | Nada Shahin et.al. | 2504.11942 | null | Kimi |
700 | 2025-04-16 | Rethinking the Generation of High-Quality CoT Data from the Perspective of LLM-Adaptive Question Difficulty Grading | Qianjin Yu et.al. | 2504.11919 | null | Kimi |
701 | 2025-04-16 | Evaluating the Goal-Directedness of Large Language Models | Tom Everitt et.al. | 2504.11844 | link | Kimi |
702 | 2025-04-16 | FiSMiness: A Finite State Machine Based Paradigm for Emotional Support Conversations | Yue Zhao et.al. | 2504.11837 | null | Kimi |
703 | 2025-04-16 | Déjà Vu: Multilingual LLM Evaluation through the Lens of Machine Translation Evaluation | Julia Kreutzer et.al. | 2504.11829 | null | Kimi |
704 | 2025-04-16 | Cost-Efficient LLM Serving in the Cloud: VM Selection with KV Cache Offloading | Kihyun Kim et.al. | 2504.11816 | link | Kimi |
705 | 2025-04-16 | Selective Attention Federated Learning: Improving Privacy and Efficiency for Clinical Text Classification | Yue Li et.al. | 2504.11793 | null | Kimi |
706 | 2025-04-16 | Enhancing Web Agents with Explicit Rollback Mechanisms | Zhisong Zhang et.al. | 2504.11788 | null | Kimi |
707 | 2025-04-16 | Shared Disk KV Cache Management for Efficient Multi-Instance Inference in RAG-Powered LLMs | Hyungwoo Lee et.al. | 2504.11765 | null | Kimi |
708 | 2025-04-16 | Characterizing and Optimizing LLM Inference Workloads on CPU-GPU Coupled Architectures | Prabhu Vellaisamy et.al. | 2504.11750 | null | Kimi |
709 | 2025-04-16 | Can GPT tell us why these images are synthesized? Empowering Multimodal Large Language Models for Forensics | Yiran He et.al. | 2504.11686 | null | Kimi |
710 | 2025-04-16 | Steering Prosocial AI Agents: Computational Basis of LLM’s Decision Making in Social Simulation | Ji Ma et.al. | 2504.11671 | null | Kimi |
711 | 2025-04-15 | GraphicBench: A Planning Benchmark for Graphic Design with Language Agents | Dayeon Ki et.al. | 2504.11571 | null | Kimi |
712 | 2025-04-15 | ReTool: Reinforcement Learning for Strategic Tool Use in LLMs | Jiazhan Feng et.al. | 2504.11536 | link | Kimi |
713 | 2025-04-15 | HypoBench: Towards Systematic and Principled Benchmarking for Hypothesis Generation | Haokun Liu et.al. | 2504.11524 | null | Kimi |
714 | 2025-04-15 | TextArena | Leon Guertler et.al. | 2504.11442 | link | Kimi |
715 | 2025-04-15 | A Dual-Space Framework for General Knowledge Distillation of Large Language Models | Xue Zhang et.al. | 2504.11426 | null | Kimi |
716 | 2025-04-15 | A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce | Wei Xiong et.al. | 2504.11343 | link | Kimi |
717 | 2025-04-15 | Transformer-Based Model for Cold Start Mitigation in FaaS Architecture | Alexandre Savi Fayam Mbala Mouen et.al. | 2504.11338 | null | Kimi |
718 | 2025-04-15 | Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints | Ruicheng Ao et.al. | 2504.11320 | link | Kimi |
719 | 2025-04-15 | Nondeterministic Polynomial-time Problem Challenge: An Ever-Scaling Reasoning Benchmark for LLMs | Chang Yang et.al. | 2504.11239 | link | Kimi |
720 | 2025-04-15 | Video Summarization with Large Language Models | Min Jung Lee et.al. | 2504.11199 | null | Kimi |
721 | 2025-04-15 | Benchmarking Next-Generation Reasoning-Focused Large Language Models in Ophthalmology: A Head-to-Head Evaluation on 5,888 Items | Minjie Zou et.al. | 2504.11186 | null | Kimi |
722 | 2025-04-15 | DeepMLF: Multimodal language model with learnable tokens for deep fusion in sentiment analysis | Efthymios Georgiou et.al. | 2504.11082 | null | Kimi |
723 | 2025-04-15 | Dynamic Compressing Prompts for Efficient Inference of Large Language Models | Jinwu Hu et.al. | 2504.11004 | null | Kimi |
724 | 2025-04-15 | Efficient Reasoning Models: A Survey | Sicheng Feng et.al. | 2504.10903 | link | Kimi |
725 | 2025-04-15 | ARise: Towards Knowledge-Augmented Reasoning via Risk-Adaptive Search | Yize Zhang et.al. | 2504.10893 | null | Kimi |
726 | 2025-04-15 | Large Language Model-Informed Feature Discovery Improves Prediction and Interpretation of Credibility Perceptions of Visual Content | Yilang Peng et.al. | 2504.10878 | null | Kimi |
727 | 2025-04-15 | Moving Beyond Next-Token Prediction: Transformers are Context-Sensitive Language Generators | Phill Kyu Rhee et.al. | 2504.10845 | null | Kimi |
728 | 2025-04-15 | LayoutCoT: Unleashing the Deep Reasoning Potential of Large Language Models for Layout Generation | Hengyu Shi et.al. | 2504.10829 | null | Kimi |
729 | 2025-04-15 | CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives | Ayoung Lee et.al. | 2504.10823 | null | Kimi |
730 | 2025-04-14 | How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients | Ming Li et.al. | 2504.10766 | link | Kimi |
731 | 2025-04-14 | ReasonDrive: Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models | Amirhosein Chahe et.al. | 2504.10757 | link | Kimi |
732 | 2025-04-14 | CleanMAP: Distilling Multimodal LLMs for Confidence-Driven Crowdsourced HD Map Updates | Ankit Kumar Shaw et.al. | 2504.10738 | null | Kimi |
733 | 2025-04-14 | HELIOS: Adaptive Model And Early-Exit Selection for Efficient LLM Inference Serving | Avinash Kumar et.al. | 2504.10724 | null | Kimi |
734 | 2025-04-14 | Weight-of-Thought Reasoning: Exploring Neural Network Weights for Enhanced LLM Reasoning | Saif Punjwani et.al. | 2504.10646 | link | Kimi |
735 | 2025-04-14 | Beyond Chains of Thought: Benchmarking Latent-Space Reasoning Abilities in Large Language Models | Thilo Hagendorff et.al. | 2504.10615 | null | Kimi |
736 | 2025-04-15 | GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents | Xiaobo Xia et.al. | 2504.10458 | null | Kimi |
737 | 2025-04-14 | RealWebAssist: A Benchmark for Long-Horizon Web Assistance with Real-World Users | Suyu Ye et.al. | 2504.10445 | link | Kimi |
738 | 2025-04-14 | Multimodal Long Video Modeling Based on Temporal Dynamic Context | Haoran Hao et.al. | 2504.10443 | link | Kimi |
739 | 2025-04-14 | LLM Can be a Dangerous Persuader: Empirical Study of Persuasion Safety in Large Language Models | Minqian Liu et.al. | 2504.10430 | null | Kimi |
740 | 2025-04-14 | LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models | Parshin Shojaee et.al. | 2504.10415 | link | Kimi |
741 | 2025-04-14 | Performance of Large Language Models in Supporting Medical Diagnosis and Treatment | Diogo Sousa et.al. | 2504.10405 | null | Kimi |
742 | 2025-04-14 | Forecasting from Clinical Textual Time Series: Adaptations of the Encoder and Decoder Language Model Families | Shahriar Noroozizadeh et.al. | 2504.10340 | null | Kimi |
743 | 2025-04-14 | Heimdall: test-time scaling on the generative verification | Wenlei Shi et.al. | 2504.10337 | null | Kimi |
744 | 2025-04-14 | AlayaDB: The Data Foundation for Efficient and Effective Long-context LLM Inference | Yangshen Deng et.al. | 2504.10326 | null | Kimi |
745 | 2025-04-14 | Deep Reasoning Translation via Reinforcement Learning | Jiaan Wang et.al. | 2504.10187 | link | Kimi |
746 | 2025-04-14 | HalluSearch at SemEval-2025 Task 3: A Search-Enhanced RAG Pipeline for Hallucination Detection | Mohamed A. Abdallah et.al. | 2504.10168 | null | Kimi |
747 | 2025-04-14 | Breaking the Data Barrier – Building GUI Agents Through Task Generalization | Junlei Zhang et.al. | 2504.10127 | link | Kimi |
748 | 2025-04-14 | CameraBench: Benchmarking Visual Reasoning in MLLMs via Photography | I-Sheng Fang et.al. | 2504.10090 | null | Kimi |
749 | 2025-04-14 | RealSafe-R1: Safety-Aligned DeepSeek-R1 without Compromising Reasoning Capability | Yichi Zhang et.al. | 2504.10081 | null | Kimi |
750 | 2025-04-14 | Mavors: Multi-granularity Video Representation for Multimodal Large Language Model | Yang Shi et.al. | 2504.10068 | null | Kimi |
751 | 2025-04-14 | Hallucination Detection in LLMs via Topological Divergence on Attention Graphs | Alexandra Bazarova et.al. | 2504.10063 | null | Kimi |
752 | 2025-04-14 | DataMosaic: Explainable and Verifiable Multi-Modal Data Analytics through Extract-Reason-Verify | Zhengxuan Zhang et.al. | 2504.10036 | null | Kimi |
753 | 2025-04-14 | The Mirage of Performance Gains: Why Contrastive Decoding Fails to Address Multimodal Hallucination | Hao Yin et.al. | 2504.10020 | null | Kimi |
754 | 2025-04-14 | Do We Really Need Curated Malicious Data for Safety Alignment in Multi-modal Large Language Models? | Yanbo Wang et.al. | 2504.10000 | null | Kimi |
755 | 2025-04-14 | KeepKV: Eliminating Output Perturbation in KV Cache Compression for Efficient LLMs Inference | Yuxuan Tian et.al. | 2504.09936 | null | Kimi |
756 | 2025-04-14 | FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding | Zheng Liu et.al. | 2504.09925 | link | Kimi |
757 | 2025-04-14 | Reasoning Models Can Be Effective Without Thinking | Wenjie Ma et.al. | 2504.09858 | null | Kimi |
758 | 2025-04-14 | A Survey of Large Language Model-Powered Spatial Intelligence Across Scales: Advances in Embodied Agents, Smart Cities, and Earth Science | Jie Feng et.al. | 2504.09848 | null | Kimi |
759 | 2025-04-14 | OVERLORD: Ultimate Scaling of DataLoader for Multi-Source Large Foundation Model Training | Juntao Zhao et.al. | 2504.09844 | null | Kimi |
760 | 2025-04-14 | Training Small Reasoning LLMs with Cognitive Preference Alignment | Wenrui Cai et.al. | 2504.09802 | null | Kimi |
761 | 2025-04-14 | VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents | Ryota Tanaka et.al. | 2504.09795 | null | Kimi |
762 | 2025-04-14 | Reasoning Court: Combining Reasoning, Action, and Judgment for Multi-Hop Reasoning | Jingtian Wu et.al. | 2504.09781 | null | Kimi |
763 | 2025-04-14 | Understanding and Optimizing Multi-Stage AI Inference Pipelines | Abhimanyu Rajeshkumar Bambhaniya et.al. | 2504.09775 | null | Kimi |
764 | 2025-04-14 | Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning | Can Jin et.al. | 2504.09772 | link | Kimi |
765 | 2025-04-13 | Leveraging Reasoning Model Answers to Enhance Non-Reasoning Model Capability | Haotian Wang et.al. | 2504.09639 | null | Kimi |
766 | 2025-04-13 | Metropolis-Hastings Captioning Game: Knowledge Fusion of Vision Language Models via Decentralized Bayesian Inference | Yuta Matsui et.al. | 2504.09620 | null | Kimi |
767 | 2025-04-10 | Cat, Rat, Meow: On the Alignment of Language Model and Human Term-Similarity Judgments | Lorenz Linhardt et.al. | 2504.07965 | null | Kimi |
768 | 2025-04-10 | PixelFlow: Pixel-Space Generative Models with Flow | Shoufa Chen et.al. | 2504.07963 | link | Kimi |
769 | 2025-04-10 | GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Lang Lin et.al. | 2504.07962 | null | Kimi |
770 | 2025-04-10 | Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction | Zeren Jiang et.al. | 2504.07961 | link | Kimi |
771 | 2025-04-10 | CCMNet: Leveraging Calibrated Color Correction Matrices for Cross-Camera Color Constancy | Dongyoung Kim et.al. | 2504.07959 | null | Kimi |
772 | 2025-04-10 | MM-IFEngine: Towards Multimodal Instruction Following | Shengyuan Ding et.al. | 2504.07957 | link | Kimi |
773 | 2025-04-10 | VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning | Yukun Qi et.al. | 2504.07956 | null | Kimi |
774 | 2025-04-10 | Perception-R1: Pioneering Perception Policy with Reinforcement Learning | En Yu et.al. | 2504.07954 | link | Kimi |
775 | 2025-04-10 | Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models | Mustafa Shukor et.al. | 2504.07951 | null | Kimi |
776 | 2025-04-10 | InteractAvatar: Modeling Hand-Face Interaction in Photorealistic Avatars with Deformable Gaussians | Kefan Chen et.al. | 2504.07949 | null | Kimi |
777 | 2025-04-10 | GenEAva: Generating Cartoon Avatars with Fine-Grained Facial Expressions from Realistic Diffusion-based Faces | Hao Yu et.al. | 2504.07945 | null | Kimi |
778 | 2025-04-10 | HoloPart: Generative 3D Part Amodal Segmentation | Yunhan Yang et.al. | 2504.07943 | null | Kimi |
779 | 2025-04-10 | SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement | Xiyao Wang et.al. | 2504.07934 | link | Kimi |
780 | 2025-04-10 | The Urban Impact of AI: Modeling Feedback Loops in Next-Venue Recommendation | Giovanni Mauro et.al. | 2504.07911 | link | Kimi |
781 | 2025-04-10 | The Efficacy of Semantics-Preserving Transformations in Self-Supervised Learning for Medical Ultrasound | Blake VanBerlo et.al. | 2504.07904 | null | Kimi |
782 | 2025-04-10 | Redefining Machine Translation on Social Network Services with Large Language Models | Hongcheng Guo et.al. | 2504.07901 | link | Kimi |
783 | 2025-04-10 | How do Large Language Models Understand Relevance? A Mechanistic Interpretability Perspective | Qi Liu et.al. | 2504.07898 | link | Kimi |
784 | 2025-04-10 | Fast Adaptation with Behavioral Foundation Models | Harshit Sikchi et.al. | 2504.07896 | null | Kimi |
785 | 2025-04-10 | SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning | Rui Pan et.al. | 2504.07891 | link | Kimi |
786 | 2025-04-10 | Benchmarking Adversarial Robustness to Bias Elicitation in Large Language Models: Scalable Automated Assessment with LLM-as-a-Judge | Riccardo Cantini et.al. | 2504.07887 | link | Kimi |
787 | 2025-04-10 | Token Level Routing Inference System for Edge Devices | Jianshu She et.al. | 2504.07878 | null | Kimi |
788 | 2025-04-10 | Dual Engines of Thoughts: A Depth-Breadth Integration Framework for Open-Ended Analysis | Fei-Hsuan Yu et.al. | 2504.07872 | null | Kimi |
789 | 2025-04-10 | SAMJAM: Zero-Shot Video Scene Graph Generation for Egocentric Kitchen Videos | Joshua Li et.al. | 2504.07867 | null | Kimi |
790 | 2025-04-10 | Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs | Yichun Yin et.al. | 2504.07866 | null | Kimi |
791 | 2025-04-10 | 2D-Curri-DPO: Two-Dimensional Curriculum Learning for Direct Preference Optimization | Mengyang Li et.al. | 2504.07856 | null | Kimi |
792 | 2025-04-10 | The KL3M Data Project: Copyright-Clean Training Resources for Large Language Models | Michael J Bommarito II et.al. | 2504.07854 | link | Kimi |
793 | 2025-04-10 | V2V3D: View-to-View Denoised 3D Reconstruction for Light-Field Microscopy | Jiayin Zhao et.al. | 2504.07853 | null | Kimi |
794 | 2025-04-10 | Anytime Single-Step MAPF Planning with Anytime PIBT | Nayesha Gandotra et.al. | 2504.07841 | null | Kimi |
795 | 2025-04-10 | Understanding Learner-LLM Chatbot Interactions and the Impact of Prompting Guidelines | Cansu Koyuturk et.al. | 2504.07840 | null | Kimi |
796 | 2025-04-10 | Deceptive Automated Interpretability: Language Models Coordinating to Fool Oversight Systems | Simon Lermen et.al. | 2504.07831 | null | Kimi |
797 | 2025-04-10 | MOSAIC: Modeling Social AI for Content Dissemination and Regulation in Multi-Agent Simulations | Genglin Liu et.al. | 2504.07830 | link | Kimi |
798 | 2025-04-10 | Cluster-Driven Expert Pruning for Mixture-of-Experts Large Language Models | Hongcheng Guo et.al. | 2504.07807 | link | Kimi |
799 | 2025-04-10 | On the Temporal Question-Answering Capabilities of Large Language Models Over Anonymized Data | Alfredo Garrachón Ruiz et.al. | 2504.07646 | null | Kimi |
800 | 2025-04-10 | ConceptFormer: Towards Efficient Use of Knowledge-Graph Embeddings in Large Language Models | Joel Barmettler et.al. | 2504.07624 | null | Kimi |
801 | 2025-04-10 | VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model | Haozhan Shen et.al. | 2504.07615 | link | Kimi |
802 | 2025-04-10 | Boosting Universal LLM Reward Design through the Heuristic Reward Observation Space Evolution | Zen Kit Heng et.al. | 2504.07596 | null | Kimi |
803 | 2025-04-10 | AI-Slop to AI-Polish? Aligning Language Models through Edit-Based Writing Rewards and Test-time Computation | Tuhin Chakrabarty et.al. | 2504.07532 | link | Kimi |
804 | 2025-04-10 | Supervised Optimism Correction: Be Confident When LLMs Are Sure | Junjie Zhang et.al. | 2504.07527 | null | Kimi |
805 | 2025-04-10 | VideoExpert: Augmented LLM for Temporal-Sensitive Video Understanding | Henghao Zhao et.al. | 2504.07519 | null | Kimi |
806 | 2025-04-10 | GPT Carry-On: Training Foundation Model for Customization Could Be Simple, Scalable and Affordable | Jianqiao Wangni et.al. | 2504.07513 | null | Kimi |
807 | 2025-04-10 | Kimi-VL Technical Report | Kimi Team et.al. | 2504.07491 | link | Kimi |
808 | 2025-04-10 | Beyond LLMs: A Linguistic Approach to Causal Graph Generation from Narrative Texts | Zehan Li et.al. | 2504.07459 | null | Kimi |
809 | 2025-04-10 | From Token to Line: Enhancing Code Generation with a Long-Term Perspective | Tingwei Lu et.al. | 2504.07433 | null | Kimi |
810 | 2025-04-10 | TALE: A Tool-Augmented Framework for Reference-Free Evaluation of Large Language Models | Sher Badshah et.al. | 2504.07385 | null | Kimi |
811 | 2025-04-10 | Enhancing Time Series Forecasting via Multi-Level Text Alignment with LLMs | Taibiao Zhao et.al. | 2504.07360 | link | Kimi |
812 | 2025-04-10 | Revisiting Prompt Optimization with Large Reasoning Models-A Case Study on Event Extraction | Saurabh Srivastava et.al. | 2504.07357 | null | Kimi |
813 | 2025-04-09 | Modeling Response Consistency in Multi-Agent LLM Systems: A Comparative Analysis of Shared and Separate Context Approaches | Tooraj Helmi et.al. | 2504.07303 | null | Kimi |
814 | 2025-04-09 | SemEval-2025 Task 5: LLMs4Subjects – LLM-based Automated Subject Tagging for a National Technical Library’s Open-Access Catalog | Jennifer D’Souza et.al. | 2504.07199 | link | Kimi |
815 | 2025-04-09 | HypoEval: Hypothesis-Guided Evaluation for Natural Language Generation | Mingxuan Li et.al. | 2504.07174 | link | Kimi |
816 | 2025-04-09 | Sculpting Subspaces: Constrained Full Fine-Tuning in LLMs for Continual Learning | Nikhil Shivakumar Nayak et.al. | 2504.07097 | link | Kimi |
817 | 2025-04-09 | OmniCaptioner: One Captioner to Rule Them All | Yiting Lu et.al. | 2504.07089 | link | Kimi |
818 | 2025-04-09 | KG-LLM-Bench: A Scalable Benchmark for Evaluating LLM Reasoning on Textualized Knowledge Graphs | Elan Markowitz et.al. | 2504.07087 | null | Kimi |
819 | 2025-04-09 | DeduCE: Deductive Consistency as a Framework to Evaluate LLM Reasoning | Atharva Pandey et.al. | 2504.07080 | null | Kimi |
820 | 2025-04-09 | SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills | Boyuan Zheng et.al. | 2504.07079 | null | Kimi |
821 | 2025-04-09 | HalluciNot: Hallucination Detection Through Context and Common Knowledge Verification | Bibek Paudel et.al. | 2504.07069 | null | Kimi |
822 | 2025-04-09 | Review of Case-Based Reasoning for LLM Agents: Theoretical Foundations, Architectural Components, and Cognitive Integration | Kostas Hatalis et.al. | 2504.06943 | null | Kimi |
823 | 2025-04-09 | Are Vision-Language Models Ready for Dietary Assessment? Exploring the Next Frontier in AI-Powered Food Image Recognition | Sergio Romero-Tapiador et.al. | 2504.06925 | null | Kimi |
824 | 2025-04-09 | Integrating Cognitive Processing Signals into Language Models: A Review of Advances, Applications and Future Directions | Angela Lopez-Cardona et.al. | 2504.06843 | null | Kimi |
825 | 2025-04-09 | LVC: A Lightweight Compression Framework for Enhancing VLMs in Long Video Understanding | Ziyi Wang et.al. | 2504.06835 | null | Kimi |
826 | 2025-04-09 | Domain-Specific Pruning of Large Mixture-of-Experts Models with Few-shot Demonstrations | Zican Dong et.al. | 2504.06792 | null | Kimi |
827 | 2025-04-09 | Zero-Shot Image-Based Large Language Model Approach to Road Pavement Monitoring | Shuoshuo Xu et.al. | 2504.06785 | null | Kimi |
828 | 2025-04-09 | FamilyTool: A Multi-hop Personalized Tool Use Benchmark | Yuxin Wang et.al. | 2504.06766 | link | Kimi |
829 | 2025-04-09 | EDIT: Enhancing Vision Transformers by Mitigating Attention Sink through an Encoder-Decoder Architecture | Wenfeng Feng et.al. | 2504.06738 | null | Kimi |
830 | 2025-04-09 | A Neuro-inspired Interpretation of Unlearning in Large Language Models through Sample-level Unlearning Difficulty | Xiaohua Feng et.al. | 2504.06658 | null | Kimi |
831 | 2025-04-09 | Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program | Minghe Gao et.al. | 2504.06606 | null | Kimi |
832 | 2025-04-09 | Automated Business Process Analysis: An LLM-Based Approach to Value Assessment | William De Michele et.al. | 2504.06600 | link | Kimi |
833 | 2025-04-09 | Right Prediction, Wrong Reasoning: Uncovering LLM Misalignment in RA Disease Diagnosis | Umakanta Maharana et.al. | 2504.06581 | link | Kimi |
834 | 2025-04-09 | NeedleInATable: Exploring Long-Context Capability of Large Language Models towards Long-Structured Tables | Lanrui Wang et.al. | 2504.06560 | null | Kimi |
835 | 2025-04-09 | Lugha-Llama: Adapting Large Language Models for African Languages | Happy Buzaaba et.al. | 2504.06536 | null | Kimi |
836 | 2025-04-08 | Don’t Let It Hallucinate: Premise Verification via Retrieval-Augmented Logical Reasoning | Yuehan Qin et.al. | 2504.06438 | null | Kimi |
837 | 2025-04-08 | S’MoRE: Structural Mixture of Residual Experts for LLM Fine-tuning | Hanqing Zeng et.al. | 2504.06426 | null | Kimi |
838 | 2025-04-08 | Understanding Machine Unlearning Through the Lens of Mode Connectivity | Jiali Cheng et.al. | 2504.06407 | null | Kimi |
839 | 2025-04-08 | GOLLuM: Gaussian Process Optimized LLMs – Reframing LLM Finetuning through Bayesian Optimization | Bojana Ranković et.al. | 2504.06265 | link | Kimi |
840 | 2025-04-09 | Hogwild! Inference: Parallel LLM Generation via Concurrent Attention | Gleb Rodionov et.al. | 2504.06261 | null | Kimi |
841 | 2025-04-08 | FEABench: Evaluating Language Models on Multiphysics Reasoning Ability | Nayantara Mudur et.al. | 2504.06260 | link | Kimi |
842 | 2025-04-08 | Encoder-Decoder Gemma: Improving the Quality-Efficiency Trade-Off via Adaptation | Biao Zhang et.al. | 2504.06225 | null | Kimi |
843 | 2025-04-08 | From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models | Chejian Xu et.al. | 2504.06214 | null | Kimi |
844 | 2025-04-08 | TxGemma: Efficient and Agentic LLMs for Therapeutics | Eric Wang et.al. | 2504.06196 | null | Kimi |
845 | 2025-04-08 | Navigating the Rabbit Hole: Emergent Biases in LLM-Generated Attack Narratives Targeting Mental Health Groups | Rijul Magu et.al. | 2504.06160 | null | Kimi |
846 | 2025-04-08 | QGen Studio: An Adaptive Question-Answer Generation, Training and Evaluation Platform | Movina Moses et.al. | 2504.06136 | null | Kimi |
847 | 2025-04-08 | Multi-Sense Embeddings for Language Models and Knowledge Distillation | Qitong Wang et.al. | 2504.06036 | null | Kimi |
848 | 2025-04-08 | NativQA Framework: Enabling LLMs with Native, Local, and Everyday Knowledge | Firoj Alam et.al. | 2504.05995 | null | Kimi |
849 | 2025-04-08 | PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario | Sriram Mandalika et.al. | 2504.05908 | null | Kimi |
850 | 2025-04-08 | HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference | Shuzhang Zhong et.al. | 2504.05897 | link | Kimi |
851 | 2025-04-08 | Agent Guide: A Simple Agent Behavioral Watermarking Framework | Kaibo Huang et.al. | 2504.05871 | null | Kimi |
852 | 2025-04-08 | Are Generative AI Agents Effective Personalized Financial Advisors? | Takehiro Takayanagi et.al. | 2504.05862 | link | Kimi |
853 | 2025-04-08 | How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM | Jirong Zha et.al. | 2504.05786 | null | Kimi |
854 | 2025-04-08 | DDT: Decoupled Diffusion Transformer | Shuai Wang et.al. | 2504.05741 | null | Kimi |
855 | 2025-04-08 | Rank-Then-Score: Enhancing Large Language Models for Automated Essay Scoring | Yida Cai et.al. | 2504.05736 | null | Kimi |
856 | 2025-04-08 | STRIVE: A Think & Improve Approach with Iterative Refinement for Enhancing Question Quality Estimation | Aniket Deroy et.al. | 2504.05693 | null | Kimi |
857 | 2025-04-08 | Towards Smarter Hiring: Are Zero-Shot and Few-Shot Pre-trained LLMs Ready for HR Spoken Interview Transcript Analysis? | Subhankar Maity et.al. | 2504.05683 | null | Kimi |
858 | 2025-04-08 | Sugar-Coated Poison: Benign Generation Unlocks LLM Jailbreaking | Yu-Hang Wu et.al. | 2504.05652 | link | Kimi |
859 | 2025-04-08 | TAGC: Optimizing Gradient Communication in Distributed Transformer Training | Igor Polyakov et.al. | 2504.05638 | link | Kimi |
860 | 2025-04-08 | FactGuard: Leveraging Multi-Agent Systems to Generate Answerable and Unanswerable Questions for Enhanced Long-Context LLM Extraction | Qian-Wen Zhang et.al. | 2504.05607 | null | Kimi |
861 | 2025-04-08 | ShadowCoT: Cognitive Hijacking for Stealthy Reasoning Backdoors in LLMs | Gejian Zhao et.al. | 2504.05605 | null | Kimi |
862 | 2025-04-08 | Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought | Yi Peng et.al. | 2504.05599 | null | Kimi |
863 | 2025-04-08 | DEL: Context-Aware Dynamic Exit Layer for Efficient Self-Speculative Decoding | Hossein Entezari Zarch et.al. | 2504.05598 | null | Kimi |
864 | 2025-04-08 | Knowledge-Instruct: Effective Continual Pre-training from Limited Data using Instructions | Oded Ovadia et.al. | 2504.05571 | null | Kimi |
865 | 2025-04-07 | Bridging Industrial Expertise and XR with LLM-Powered Conversational Agents | Despina Tomkou et.al. | 2504.05527 | null | Kimi |
866 | 2025-04-07 | Fast Controlled Generation from Language Models with Adaptive Weighted Rejection Sampling | Benjamin Lipkin et.al. | 2504.05410 | null | Kimi |
867 | 2025-04-07 | LiveVQA: Live Visual Knowledge Seeking | Mingyang Fu et.al. | 2504.05288 | null | Kimi |
868 | 2025-04-07 | Learning to Reason Over Time: Timeline Self-Reflection for Improved Temporal Reasoning in Language Models | Adrián Bazaga et.al. | 2504.05258 | null | Kimi |
869 | 2025-04-07 | Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling | Hengran Zhang et.al. | 2504.05216 | null | Kimi |
870 | 2025-04-07 | Post-Training Language Models for Continual Relation Extraction | Sefika Efeoglu et.al. | 2504.05214 | null | Kimi |
871 | 2025-04-07 | Concise Reasoning via Reinforcement Learning | Mehdi Fatemi et.al. | 2504.05185 | link | Kimi |
872 | 2025-04-07 | VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks | YuYue et.al. | 2504.05118 | null | Kimi |
873 | 2025-04-07 | AI for Climate Finance: Agentic Retrieval and Multi-Step Reasoning for Early Warning System Investments | Saeid Ario Vaghefi et.al. | 2504.05104 | null | Kimi |
874 | 2025-04-07 | The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning | Tianshi Zheng et.al. | 2504.05081 | null | Kimi |
875 | 2025-04-07 | Revealing the Intrinsic Ethical Vulnerability of Aligned Large Language Models | Jiawei Lian et.al. | 2504.05050 | null | Kimi |
876 | 2025-04-07 | Debate Only When Necessary: Adaptive Multiagent Collaboration for Efficient LLM Reasoning | Sugyeong Eo et.al. | 2504.05047 | null | Kimi |
877 | 2025-04-07 | Following the Whispers of Values: Unraveling Neural Mechanisms Behind Value-Oriented Behaviors in LLMs | Ling Hu et.al. | 2504.04994 | null | Kimi |
878 | 2025-04-07 | Towards Visual Text Grounding of Multimodal Large Language Model | Ming Li et.al. | 2504.04974 | null | Kimi |
879 | 2025-04-07 | M-Prometheus: A Suite of Open Multilingual LLM Judges | José Pombal et.al. | 2504.04953 | null | Kimi |
880 | 2025-04-07 | A Llama walks into the ‘Bar’: Efficient Supervised Fine-Tuning for Legal Reasoning in the Multi-state Bar Exam | Rean Fernandes et.al. | 2504.04945 | null | Kimi |
881 | 2025-04-07 | Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration | Ran Xu et.al. | 2504.04915 | link | Kimi |
882 | 2025-04-07 | Leveraging Large Language Models for Cost-Effective, Multilingual Depression Detection and Severity Assessment | Longdi Xian et.al. | 2504.04891 | null | Kimi |
883 | 2025-04-07 | Uni4D: A Unified Self-Supervised Learning Framework for Point Cloud Videos | Zhi Zuo et.al. | 2504.04837 | null | Kimi |
884 | 2025-04-07 | Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models | Ruikang Liu et.al. | 2504.04823 | link | Kimi |
885 | 2025-04-07 | Can LLMs Interpret and Leverage Structured Linguistic Representations? A Case Study with AMRs | Ankush Raut et.al. | 2504.04745 | null | Kimi |
886 | 2025-04-07 | TathyaNyaya and FactLegalLlama: Advancing Factual Judgment Prediction and Explanation in the Indian Legal Context | Shubham Kumar Nigam et.al. | 2504.04737 | null | Kimi |
887 | 2025-04-07 | Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use | Anna Goldie et.al. | 2504.04736 | null | Kimi |
888 | 2025-04-07 | Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models | Yubo Li et.al. | 2504.04717 | link | Kimi |
889 | 2025-04-07 | Sequential-NIAH: A Needle-In-A-Haystack Benchmark for Extracting Sequential Needles from Long Contexts | Yifei Yu et.al. | 2504.04713 | null | Kimi |
890 | 2025-04-07 | LagKV: Lag-Relative Information of the KV Cache Tells Which Tokens Are Important | Manlai Liang et.al. | 2504.04704 | link | Kimi |
891 | 2025-04-07 | R2Vul: Learning to Reason about Software Vulnerabilities with Reinforcement Learning and Structured Reasoning Distillation | Martin Weyssow et.al. | 2504.04699 | link | Kimi |
892 | 2025-04-07 | LEO-MINI: An Efficient Multimodal Large Language Model using Conditional Token Reduction and Mixture of Multi-Modal Experts | Yimu Wang et.al. | 2504.04653 | null | Kimi |
893 | 2025-04-06 | Splits! A Flexible Dataset for Evaluating a Model’s Demographic Social Inference | Eylon Caplan et.al. | 2504.04640 | link | Kimi |
894 | 2025-04-06 | SECQUE: A Benchmark for Evaluating Real-World Financial Analysis Capabilities | Noga Ben Yoash et.al. | 2504.04596 | null | Kimi |
895 | 2025-04-06 | The Point, the Vision and the Text: Does Point Cloud Boost Spatial Reasoning of Large Language Models? | Weichen Zhang et.al. | 2504.04540 | null | Kimi |
896 | 2025-04-06 | An Empirical Comparison of Text Summarization: A Multi-Dimensional Evaluation of Large Language Models | Anantharaman Janakiraman et.al. | 2504.04534 | null | Kimi |
897 | 2025-04-03 | Concept Lancet: Image Editing with Compositional Representation Transplant | Jinqi Luo et.al. | 2504.02828 | null | Kimi |
898 | 2025-04-03 | On Vanishing Variance in Transformer Length Generalization | Ruining Li et.al. | 2504.02827 | null | Kimi |
899 | 2025-04-03 | Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing | Xiangyu Zhao et.al. | 2504.02826 | link | Kimi |
900 | 2025-04-03 | Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models | Mateusz Pach et.al. | 2504.02821 | link | Kimi |
901 | 2025-04-03 | GMR-Conv: An Efficient Rotation and Reflection Equivariant Convolution Kernel Using Gaussian Mixture Rings | Yuexi Du et.al. | 2504.02819 | link | Kimi |
902 | 2025-04-03 | Efficient Autoregressive Shape Generation via Octree-Based Adaptive Tokenization | Kangle Deng et.al. | 2504.02817 | null | Kimi |
903 | 2025-04-03 | Generative Evaluation of Complex Reasoning in Large Language Models | Haowei Lin et.al. | 2504.02810 | link | Kimi |
904 | 2025-04-03 | MegaMath: Pushing the Limits of Open Math Corpora | Fan Zhou et.al. | 2504.02807 | link | Kimi |
905 | 2025-04-03 | A Survey of Large Language Models in Mental Health Disorder Detection on Social Media | Zhuohan Ge et.al. | 2504.02800 | null | Kimi |
906 | 2025-04-03 | Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence | Anita Rau et.al. | 2504.02799 | null | Kimi |
907 | 2025-04-03 | Spline-based Transformers | Prashanth Chandran et.al. | 2504.02797 | null | Kimi |
908 | 2025-04-03 | A Framework for Situating Innovations, Opportunities, and Challenges in Advancing Vertical Systems with Large AI Models | Gaurav Verma et.al. | 2504.02793 | null | Kimi |
909 | 2025-04-03 | Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets | Chuning Zhu et.al. | 2504.02792 | null | Kimi |
910 | 2025-04-03 | A Framework for Robust Cognitive Evaluation of LLMs | Karin de Langis et.al. | 2504.02789 | null | Kimi |
911 | 2025-04-03 | GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation | Zhiyuan Yan et.al. | 2504.02782 | link | Kimi |
912 | 2025-04-03 | From Consumption to Collaboration: Measuring Interaction Patterns to Augment Human Cognition in Open-Ended Tasks | Joshua Holstein et.al. | 2504.02780 | null | Kimi |
913 | 2025-04-03 | Multi-Head Adaptive Graph Convolution Network for Sparse Point Cloud-Based Human Activity Recognition | Vincent Gbouna Zakka et.al. | 2504.02778 | link | Kimi |
914 | 2025-04-03 | MultiBLiMP 1.0: A Massively Multilingual Benchmark of Linguistic Minimal Pairs | Jaap Jumelet et.al. | 2504.02768 | null | Kimi |
915 | 2025-04-03 | How Deep Do Large Language Models Internalize Scientific Literature and Citation Practices? | Andres Algaba et.al. | 2504.02767 | link | Kimi |
916 | 2025-04-03 | Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model | Shengjun Zhang et.al. | 2504.02764 | null | Kimi |
917 | 2025-04-03 | CanonNet: Canonical Ordering and Curvature Learning for Point Cloud Analysis | Benjy Friedmann et.al. | 2504.02763 | null | Kimi |
918 | 2025-04-03 | RBR4DNN: Requirements-based Testing of Neural Networks | Nusrat Jahan Mozumder et.al. | 2504.02737 | link | Kimi |
919 | 2025-04-03 | Enhancing LLM Robustness to Perturbed Instructions: An Empirical Study | Aryan Agrawal et.al. | 2504.02733 | link | Kimi |
920 | 2025-04-03 | Why do LLMs attend to the first token? | Federico Barbero et.al. | 2504.02732 | null | Kimi |
921 | 2025-04-03 | HQViT: Hybrid Quantum Vision Transformer for Image Classification | Hui Zhang et.al. | 2504.02730 | null | Kimi |
922 | 2025-04-03 | ERPO: Advancing Safety Alignment via Ex-Ante Reasoning Preference Optimization | Kehua Feng et.al. | 2504.02725 | null | Kimi |
923 | 2025-04-03 | Autonomous Human-Robot Interaction via Operator Imitation | Sammy Christen et.al. | 2504.02724 | null | Kimi |
924 | 2025-04-03 | The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context | Nikhil Verma et.al. | 2504.02708 | null | Kimi |
925 | 2025-04-03 | Responsible Development of Offensive AI | Ryan Marinelli et.al. | 2504.02701 | link | Kimi |
926 | 2025-04-03 | Learning Phase Distortion with Selective State Space Models for Video Turbulence Mitigation | Xingguang Zhang et.al. | 2504.02697 | link | Kimi |
927 | 2025-04-03 | Affordable AI Assistants with Knowledge Graph of Thoughts | Maciej Besta et.al. | 2504.02670 | null | Kimi |
928 | 2025-04-03 | Inference-Time Scaling for Generalist Reward Modeling | Zijun Liu et.al. | 2504.02495 | null | Kimi |
929 | 2025-04-03 | Cognitive Memory in Large Language Models | Lianlei Shan et.al. | 2504.02441 | null | Kimi |
930 | 2025-04-03 | Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation | Chuanqi Cheng et.al. | 2504.02438 | null | Kimi |
931 | 2025-04-03 | AnesBench: Multi-Dimensional Evaluation of LLM Reasoning in Anesthesiology | Xiang Feng et.al. | 2504.02404 | link | Kimi |
932 | 2025-04-03 | CoTAL: Human-in-the-Loop Prompt Engineering, Chain-of-Thought Reasoning, and Active Learning for Generalizable Formative Assessment Scoring | Clayton Cohn et.al. | 2504.02323 | null | Kimi |
933 | 2025-04-03 | MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism | Ruidong Zhu et.al. | 2504.02263 | null | Kimi |
934 | 2025-04-03 | LLMs as Deceptive Agents: How Role-Based Prompting Induces Semantic Ambiguity in Puzzle Tasks | Seunghyun Yoo et.al. | 2504.02254 | null | Kimi |
935 | 2025-04-03 | FT-Transformer: Resilient and Reliable Transformer with End-to-End Fault Tolerant Attention | Huangliang Dai et.al. | 2504.02211 | null | Kimi |
936 | 2025-04-03 | More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment | Yifan Wang et.al. | 2504.02193 | null | Kimi |
937 | 2025-04-02 | A Survey of Scaling in Large Language Model Reasoning | Zihan Chen et.al. | 2504.02181 | null | Kimi |
938 | 2025-04-02 | OmniCellTOSG: The First Cell Text-Omic Signaling Graphs Dataset for Joint LLM and GNN Modeling | Heming Zhang et.al. | 2504.02148 | link | Kimi |
939 | 2025-04-02 | On Simulation-Guided LLM-based Code Generation for Safe Autonomous Driving Software | Ali Nouri et.al. | 2504.02141 | null | Kimi |
940 | 2025-04-02 | Achieving Unanimous Consensus in Decision Making Using Multi-Agents | Apurba Pokharel et.al. | 2504.02128 | null | Kimi |
941 | 2025-04-02 | Exploring LLM Reasoning Through Controlled Prompt Variations | Giannis Chatziveroglou et.al. | 2504.02111 | link | Kimi |
942 | 2025-04-02 | The LLM Wears Prada: Analysing Gender Bias and Stereotypes through Online Shopping Data | Massimiliano Luca et.al. | 2504.01951 | null | Kimi |
943 | 2025-04-02 | OpenCodeReasoning: Advancing Data Distillation for Competitive Coding | Wasi Uddin Ahmad et.al. | 2504.01943 | null | Kimi |
944 | 2025-04-02 | Critical Thinking: Which Kinds of Complexity Govern Optimal Reasoning Length? | Celine Lee et.al. | 2504.01935 | link | Kimi |
945 | 2025-04-02 | A thorough benchmark of automatic text classification: From traditional approaches to large language models | Washington Cunha et.al. | 2504.01930 | link | Kimi |
946 | 2025-04-03 | Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation | Baban Gain et.al. | 2504.01919 | null | Kimi |
947 | 2025-04-02 | FineLIP: Extending CLIP’s Reach via Fine-Grained Alignment with Longer Text Inputs | Mothilal Asokan et.al. | 2504.01916 | null | Kimi |
948 | 2025-04-02 | Advancing AI-Scientist Understanding: Making LLM Think Like a Physicist with Interpretable Reasoning | Yinggan Xu et.al. | 2504.01911 | null | Kimi |
949 | 2025-04-02 | STAR-1: Safer Alignment of Reasoning LLMs with 1K Data | Zijun Wang et.al. | 2504.01903 | null | Kimi |
950 | 2025-04-02 | TransientTables: Evaluating LLMs’ Reasoning on Temporally Evolving Semi-structured Tables | Abhilash Shankarampeta et.al. | 2504.01879 | null | Kimi |
951 | 2025-04-02 | Cross-Lingual Consistency: A Novel Inference Framework for Advancing Reasoning in Large Language Models | Zhiwei Yu et.al. | 2504.01857 | null | Kimi |
952 | 2025-04-02 | InfiniteICL: Breaking the Limit of Context Window Size via Long Short-term Memory Transformation | Bowen Cao et.al. | 2504.01707 | null | Kimi |
953 | 2025-04-02 | ToM-RL: Reinforcement Learning Unlocks Theory of Mind in Small LLMs | Yi-Long Lu et.al. | 2504.01698 | link | Kimi |
954 | 2025-04-02 | Testing Low-Resource Language Support in LLMs Using Language Proficiency Exams: the Case of Luxembourgish | Cedric Lothritz et.al. | 2504.01667 | null | Kimi |
955 | 2025-04-02 | Enabling Systematic Generalization in Abstract Spatial Reasoning through Meta-Learning for Compositionality | Philipp Mondorf et.al. | 2504.01445 | link | Kimi |
956 | 2025-04-02 | FAIRE: Assessing Racial and Gender Bias in AI-Driven Resume Evaluations | Athena Wen et.al. | 2504.01420 | link | Kimi |
957 | 2025-04-02 | An Illusion of Progress? Assessing the Current State of Web Agents | Tianci Xue et.al. | 2504.01382 | link | Kimi |
958 | 2025-04-02 | Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert Parallelism Design | Mohan Zhang et.al. | 2504.01337 | null | Kimi |
959 | 2025-04-02 | Slow-Fast Architecture for Video Multi-Modal Large Language Models | Min Shi et.al. | 2504.01328 | link | Kimi |
960 | 2025-04-02 | On Data Synthesis and Post-training for Visual Abstract Reasoning | Ke Zhu et.al. | 2504.01324 | null | Kimi |
961 | 2025-04-02 | Adaptive Rectification Sampling for Test-Time Compute Scaling | Zhendong Tan et.al. | 2504.01317 | link | Kimi |
962 | 2025-04-02 | ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning | Bairu Hou et.al. | 2504.01296 | link | Kimi |
963 | 2025-04-02 | Scaling Test-Time Inference with Policy-Optimized, Dynamic Retrieval-Augmented Generation via KV Caching and Decoding | Sakhinana Sagar Srinivas et.al. | 2504.01281 | null | Kimi |
964 | 2025-04-01 | Automated Factual Benchmarking for In-Car Conversational Systems using Large Language Models | Rafael Giebisch et.al. | 2504.01248 | null | Kimi |
965 | 2025-04-01 | Detecting PTSD in Clinical Interviews: A Comparative Analysis of NLP Methods and Large Language Models | Feng Chen et.al. | 2504.01216 | null | Kimi |
966 | 2025-04-01 | $μ$ KE: Matryoshka Unstructured Knowledge Editing of Large Language Models | Zian Su et.al. | 2504.01196 | null | Kimi |
967 | 2025-04-01 | When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning | Nishad Singhi et.al. | 2504.01005 | null | Kimi |
968 | 2025-04-01 | Token embeddings violate the manifold hypothesis | Michael Robinson et.al. | 2504.01002 | null | Kimi |
969 | 2025-04-01 | MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization | Siyuan Li et.al. | 2504.00999 | null | Kimi |
970 | 2025-04-01 | MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs | Juncheng Wu et.al. | 2504.00993 | link | Kimi |
971 | 2025-04-01 | SentenceKV: Efficient LLM Inference via Sentence-Level Semantic KV Caching | Yuxuan Zhu et.al. | 2504.00970 | null | Kimi |
972 | 2025-04-01 | Multi-Token Attention | Olga Golovneva et.al. | 2504.00927 | null | Kimi |
973 | 2025-04-01 | Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents | Saaket Agashe et.al. | 2504.00906 | link | Kimi |
974 | 2025-03-31 | Easi3R: Estimating Disentangled Motion from DUSt3R Without Training | Xingyu Chen et.al. | 2503.24391 | link | Kimi |
975 | 2025-03-31 | RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy | Zhonghan Zhao et.al. | 2503.24388 | null | Kimi |
976 | 2025-03-31 | Consistent Subject Generation via Contrastive Instantiated Concepts | Lee Hsin-Ying et.al. | 2503.24387 | null | Kimi |
977 | 2025-03-31 | Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation | Shengqiong Wu et.al. | 2503.24379 | null | Kimi |
978 | 2025-03-31 | ACPBench Hard: Unrestrained Reasoning about Action, Change, and Planning | Harsha Kokel et.al. | 2503.24378 | null | Kimi |
979 | 2025-03-31 | Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models | Rui Wang et.al. | 2503.24377 | link | Kimi |
980 | 2025-03-31 | Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1 | Yi Chen et.al. | 2503.24376 | link | Kimi |
981 | 2025-03-31 | ERUPT: Efficient Rendering with Unposed Patch Transformer | Maxim V. Shugaev et.al. | 2503.24374 | null | Kimi |
982 | 2025-03-31 | Effectively Controlling Reasoning Models through Thinking Intervention | Tong Wu et.al. | 2503.24370 | null | Kimi |
983 | 2025-03-31 | Adapting Vision Foundation Models for Real-time Ultrasound Image Segmentation | Xiaoran Zhang et.al. | 2503.24368 | null | Kimi |
984 | 2025-03-31 | Query and Conquer: Execution-Guided SQL Generation | Łukasz Borchmann et.al. | 2503.24364 | null | Kimi |
985 | 2025-03-31 | SQuat: Subspace-orthogonal KV Cache Quantization | Hao Wang et.al. | 2503.24358 | null | Kimi |
986 | 2025-03-31 | ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion | Rana Muhammad Shahroz Khan et.al. | 2503.24354 | null | Kimi |
987 | 2025-03-31 | Can Test-Time Scaling Improve World Foundation Model? | Wenyan Cong et.al. | 2503.24320 | link | Kimi |
988 | 2025-03-31 | BEATS: Bias Evaluation and Assessment Test Suite for Large Language Models | Alok Abhishek et.al. | 2503.24310 | null | Kimi |
989 | 2025-03-31 | A Systematic Evaluation of LLM Strategies for Mental Health Text Analysis: Fine-tuning vs. Prompt Engineering vs. RAG | Arshia Kermani et.al. | 2503.24307 | null | Kimi |
990 | 2025-03-31 | Order Matters: On Parameter-Efficient Image-to-Video Probing for Recognizing Nearly Symmetric Actions | Thinesh Thiyakesan Ponbagavathi et.al. | 2503.24298 | null | Kimi |
991 | 2025-03-31 | Is analogy enough to draw novel adjective-noun inferences? | Hayley Ross et.al. | 2503.24293 | link | Kimi |
992 | 2025-03-31 | Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model | Jingcheng Hu et.al. | 2503.24290 | null | Kimi |
993 | 2025-03-31 | Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning | Jiacheng Lin et.al. | 2503.24289 | link | Kimi |
994 | 2025-03-31 | Style Quantization for Data-Efficient GAN Training | Jian Wang et.al. | 2503.24282 | null | Kimi |
995 | 2025-03-31 | Evaluating and Designing Sparse Autoencoders by Approximating Quasi-Orthogonality | Sewoong Lee et.al. | 2503.24277 | link | Kimi |
996 | 2025-03-31 | FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics | Yixuan Li et.al. | 2503.24267 | null | Kimi |
997 | 2025-03-31 | Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation | Dun Yuan et.al. | 2503.24245 | null | Kimi |
998 | 2025-03-31 | Spatio-temporal Prediction of Fine-Grained Origin-Destination Matrices with Applications in Ridesharing | Run Yang et.al. | 2503.24237 | null | Kimi |
999 | 2025-03-31 | What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models | Qiyuan Zhang et.al. | 2503.24235 | link | Kimi |
1000 | 2025-03-31 | PAARS: Persona Aligned Agentic Retail Shoppers | Saab Mansour et.al. | 2503.24228 | null | Kimi |
1001 | 2025-03-31 | MB-ORES: A Multi-Branch Object Reasoner for Visual Grounding in Remote Sensing | Karim Radouane et.al. | 2503.24219 | link | Kimi |
1002 | 2025-03-31 | All You Need is Sally-Anne: ToM in AI Strongly Supported After Surpassing Tests for 3-Year-Olds | Nitay Alon et.al. | 2503.24215 | null | Kimi |
1003 | 2025-03-31 | Synthetic News Generation for Fake News Classification | Abdul Sittar et.al. | 2503.24206 | null | Kimi |
1004 | 2025-03-31 | TwT: Thinking without Tokens by Habitual Reasoning Distillation with Multi-Teachers’ Guidance | Jingxian Xu et.al. | 2503.24198 | null | Kimi |
1005 | 2025-03-31 | Output Constraints as Attack Surface: Exploiting Structured Generation to Bypass LLM Safety Mechanisms | Shuoming Zhang et.al. | 2503.24191 | null | Kimi |
1006 | 2025-03-31 | Grounding Agent Reasoning in Image Schemas: A Neurosymbolic Approach to Embodied Cognition | François Olivier et.al. | 2503.24110 | null | Kimi |
1007 | 2025-03-31 | Artificial Conversations, Real Results: Fostering Language Detection with Synthetic Data | Fatemeh Mohammadi et.al. | 2503.24062 | null | Kimi |
1008 | 2025-03-31 | AirCache: Activating Inter-modal Relevancy KV Cache Compression for Efficient Large Vision-Language Model Inference | Kai Huang et.al. | 2503.23956 | null | Kimi |
1009 | 2025-03-31 | Model Hemorrhage and the Robustness Limits of Large Language Models | Ziyang Ma et.al. | 2503.23924 | null | Kimi |
1010 | 2025-03-31 | OrchMLLM: Orchestrate Multimodal Data with Batch Post-Balancing to Accelerate Multimodal Large Language Model Training | Yijie Zheng et.al. | 2503.23830 | null | Kimi |
1011 | 2025-03-31 | Expanding RL with Verifiable Rewards Across Diverse Domains | Yi Su et.al. | 2503.23829 | null | Kimi |
1012 | 2025-03-31 | Thinking Longer, Not Larger: Enhancing Software Engineering Agents via Scaling Test-Time Compute | Yingwei Ma et.al. | 2503.23803 | link | Kimi |
1013 | 2025-03-31 | Adaptive Layer-skipping in Pre-trained LLMs | Xuan Luo et.al. | 2503.23798 | null | Kimi |
1014 | 2025-03-31 | WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization | Ine Gevers et.al. | 2503.23779 | null | Kimi |
1015 | 2025-03-31 | Short-video Propagation Influence Rating: A New Real-world Dataset and A New Large Graph Model | Dizhan Xue et.al. | 2503.23746 | link | Kimi |
1016 | 2025-03-31 | LANID: LLM-assisted New Intent Discovery | Lu Fan et.al. | 2503.23740 | link | Kimi |
1017 | 2025-03-31 | AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization | Yiyang Du et.al. | 2503.23733 | link | Kimi |
1018 | 2025-03-30 | Question-Aware Knowledge Graph Prompting for Enhancing Large Language Models | Haochen Liu et.al. | 2503.23523 | link | Kimi |
1019 | 2025-03-30 | If an LLM Were a Character, Would It Know Its Own Story? Evaluating Lifelong Learning in LLMs | Siqi Fan et.al. | 2503.23514 | null | Kimi |
1020 | 2025-03-30 | RARE: Retrieval-Augmented Reasoning Modeling | Zhengren Wang et.al. | 2503.23513 | link | Kimi |
1021 | 2025-03-30 | Benchmarking Systematic Relational Reasoning with Large Language and Reasoning Models | Irtaza Khalid et.al. | 2503.23487 | null | Kimi |
1022 | 2025-03-30 | Order Independence With Finetuning | Katrina Brown et.al. | 2503.23483 | null | Kimi |
1023 | 2025-03-27 | Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model | Abdelrahman Shaker et.al. | 2503.21782 | link | Kimi |
1024 | 2025-03-27 | X $^{2}$ -Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction | Weihao Yu et.al. | 2503.21779 | null | Kimi |
1025 | 2025-03-27 | Video-R1: Reinforcing Video Reasoning in MLLMs | Kaituo Feng et.al. | 2503.21776 | link | Kimi |
1026 | 2025-03-27 | StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion | Ziyu Guo et.al. | 2503.21775 | null | Kimi |
1027 | 2025-03-27 | MemInsight: Autonomous Memory Augmentation for LLM Agents | Rana Salama et.al. | 2503.21760 | null | Kimi |
1028 | 2025-03-27 | Fwd2Bot: LVLM Visual Token Compression with Double Forward Bottleneck | Adrian Bulat et.al. | 2503.21757 | null | Kimi |
1029 | 2025-03-27 | LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis | Shitian Zhao et.al. | 2503.21749 | null | Kimi |
1030 | 2025-03-27 | GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics | Arsham Gholamzadeh Khoee et.al. | 2503.21735 | null | Kimi |
1031 | 2025-03-27 | Effective Skill Unlearning through Intervention and Abstention | Yongce Li et.al. | 2503.21730 | link | Kimi |
1032 | 2025-03-27 | ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation | Zhicheng Lee et.al. | 2503.21729 | null | Kimi |
1033 | 2025-03-27 | OccRobNet : Occlusion Robust Network for Accurate 3D Interacting Hand-Object Pose Estimation | Mallika Garg et.al. | 2503.21723 | null | Kimi |
1034 | 2025-03-27 | Collab: Controlled Decoding using Mixture of Agents for LLM Alignment | Souradip Chakraborty et.al. | 2503.21720 | null | Kimi |
1035 | 2025-03-27 | Outlier dimensions favor frequent tokens in language model | Iuri Macocco et.al. | 2503.21718 | null | Kimi |
1036 | 2025-03-27 | CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers? | Jiefu Ou et.al. | 2503.21717 | link | Kimi |
1037 | 2025-03-27 | Elementwise Layer Normalization | Felix Stollenwerk et.al. | 2503.21708 | link | Kimi |
1038 | 2025-03-27 | MAVERIX: Multimodal Audio-Visual Evaluation Reasoning IndeX | Liuyue Xie et.al. | 2503.21699 | null | Kimi |
1039 | 2025-03-27 | Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks | Wenqi Zhang et.al. | 2503.21696 | link | Kimi |
1040 | 2025-03-27 | AMA-SAM: Adversarial Multi-Domain Alignment of Segment Anything Model for High-Fidelity Histology Nuclei Segmentation | Jiahe Qian et.al. | 2503.21695 | null | Kimi |
1041 | 2025-03-27 | Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data | Zhiyuan Ma et.al. | 2503.21694 | link | Kimi |
1042 | 2025-03-27 | LLM-Gomoku: A Large Language Model-Based System for Strategic Gomoku with Self-Play and Reinforcement Learning | Hui Wang et.al. | 2503.21683 | null | Kimi |
1043 | 2025-03-27 | JiraiBench: A Bilingual Benchmark for Evaluating Large Language Models’ Detection of Human Self-Destructive Behavior Content in Jirai Community | Yunze Xiao et.al. | 2503.21679 | null | Kimi |
1044 | 2025-03-27 | How do language models learn facts? Dynamics, curricula and hallucinations | Nicolas Zucchet et.al. | 2503.21676 | null | Kimi |
1045 | 2025-03-27 | COMI-LINGUA: Expert Annotated Large-Scale Dataset for Multitask NLP in Hindi-English Code-Mixing | Rajvee Sheth et.al. | 2503.21670 | null | Kimi |
1046 | 2025-03-27 | Cognitive Science-Inspired Evaluation of Core Capabilities for Object Understanding in AI | Danaja Rutar et.al. | 2503.21668 | null | Kimi |
1047 | 2025-03-27 | UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning | Zhengxi Lu et.al. | 2503.21620 | link | Kimi |
1048 | 2025-03-27 | A Measure Based Generalizable Approach to Understandability | Vikas Kushwaha et.al. | 2503.21615 | null | Kimi |
1049 | 2025-03-27 | A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond | Xiaoye Qu et.al. | 2503.21614 | link | Kimi |
1050 | 2025-03-27 | Evaluating book summaries from internal knowledge in Large Language Models: a cross-model and semantic consistency approach | Javier Coronado-Blázquez et.al. | 2503.21613 | null | Kimi |
1051 | 2025-03-27 | GenEdit: Compounding Operators and Continuous Improvement to Tackle Text-to-SQL in the Enterprise | Karime Maamari et.al. | 2503.21602 | null | Kimi |
1052 | 2025-03-27 | Prompt, Divide, and Conquer: Bypassing Large Language Model Safety Filters via Segmented and Distributed Prompt Processing | Johan Wahréus et.al. | 2503.21598 | null | Kimi |
1053 | 2025-03-27 | debug-gym: A Text-Based Environment for Interactive Debugging | Xingdi Yuan et.al. | 2503.21557 | null | Kimi |
1054 | 2025-03-27 | SWI: Speaking with Intent in Large Language Models | Yuwei Yin et.al. | 2503.21544 | link | Kimi |
1055 | 2025-03-27 | Keyword-Oriented Multimodal Modeling for Euphemism Identification | Yuxue Hu et.al. | 2503.21504 | link | Kimi |
1056 | 2025-03-27 | Harnessing Chain-of-Thought Metadata for Task Routing and Adversarial Prompt Detection | Ryan Marinelli et.al. | 2503.21464 | link | Kimi |
1057 | 2025-03-27 | An evaluation of LLMs and Google Translate for translation of selected Indian languages via sentiment and semantic analyses | Rohitash Chandra et.al. | 2503.21393 | null | Kimi |
1058 | 2025-03-27 | Controlling Large Language Model with Latent Actions | Chengxing Jia et.al. | 2503.21383 | link | Kimi |
1059 | 2025-03-27 | Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models | Haoxiang Sun et.al. | 2503.21380 | link | Kimi |
1060 | 2025-03-27 | ReFeed: Multi-dimensional Summarization Refinement with Reflective Reasoning on Feedback | Taewon Yun et.al. | 2503.21332 | null | Kimi |
1061 | 2025-03-27 | InternVL-X: Advancing and Accelerating InternVL Series with Efficient Visual Token Compression | Dongchen Lu et.al. | 2503.21307 | link | Kimi |
1062 | 2025-03-27 | ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition | Yujie Liu et.al. | 2503.21248 | null | Kimi |
1063 | 2025-03-27 | Bias-Aware Agent: Enhancing Fairness in AI-Driven Knowledge Retrieval | Karanbir Singh et.al. | 2503.21237 | link | Kimi |
1064 | 2025-03-27 | LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models | Hengyuan Zhao et.al. | 2503.21227 | null | Kimi |
1065 | 2025-03-27 | ZJUKLAB at SemEval-2025 Task 4: Unlearning via Model Merging | Haoming Xu et.al. | 2503.21088 | link | Kimi |
1066 | 2025-03-27 | EQ-Negotiator: An Emotion-Reasoning LLM Agent in Credit Dialogues | Yuhan Liu et.al. | 2503.21080 | null | Kimi |
1067 | 2025-03-27 | Rerouting Connection: Hybrid Computer Vision Analysis Reveals Visual Similarity Between Indus and Tibetan-Yi Corridor Writing Systems | Ooha Lakkadi Reddy et.al. | 2503.21074 | link | Kimi |
1068 | 2025-03-26 | Can Large Language Models Predict Associations Among Human Attitudes? | Ana Ma et.al. | 2503.21011 | null | Kimi |
1069 | 2025-03-26 | VinaBench: Benchmark for Faithful and Consistent Visual Narratives | Silin Gao et.al. | 2503.20871 | null | Kimi |
1070 | 2025-03-26 | Understanding R1-Zero-Like Training: A Critical Perspective | Zichen Liu et.al. | 2503.20783 | link | Kimi |
1071 | 2025-03-27 | Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning | Huajie Tan et.al. | 2503.20752 | null | Kimi |
1072 | 2025-03-26 | Optimal Scaling Laws for Efficiency Gains in a Theoretical Transformer-Augmented Sectional MoE Framework | Soham Sane et.al. | 2503.20750 | null | Kimi |
1073 | 2025-03-27 | Beyond Believability: Accurate Human Behavior Simulation with Fine-Tuned LLMs | Yuxuan Lu et.al. | 2503.20749 | null | Kimi |
1074 | 2025-03-26 | Vision as LoRA | Han Wang et.al. | 2503.20680 | link | Kimi |
1075 | 2025-03-26 | TAMA: A Human-AI Collaborative Thematic Analysis Framework Using Multi-Agent LLMs for Clinical Interviews | Huimin Xu et.al. | 2503.20666 | null | Kimi |
1076 | 2025-03-26 | Collaborative Storytelling and LLM: A Linguistic Analysis of Automatically-Generated Role-Playing Game Sessions | Alessandro Maisto et.al. | 2503.20623 | null | Kimi |
1077 | 2025-03-26 | Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention Disaggregation | Yunkai Liang et.al. | 2503.20552 | link | Kimi |
1078 | 2025-03-26 | Accelerate Parallelizable Reasoning via Parallel Decoding within One Sequence | Yijiong Yu et.al. | 2503.20533 | link | Kimi |
1079 | 2025-03-26 | StableToolBench-MirrorAPI: Modeling Tool Environments as Mirrors of 7,000+ Real-World APIs | Zhicheng Guo et.al. | 2503.20527 | link | Kimi |
1080 | 2025-03-26 | From Trial to Triumph: Advancing Long Video Understanding via Visual Context Sample Scaling and Self-reward Alignment | Yucheng Suo et.al. | 2503.20472 | null | Kimi |
1081 | 2025-03-26 | MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation | Rongyu Zhang et.al. | 2503.20384 | null | Kimi |
1082 | 2025-03-26 | VideoGEM: Training-free Action Grounding in Videos | Felix Vogel et.al. | 2503.20348 | null | Kimi |
1083 | 2025-03-26 | Iterative Prompting with Persuasion Skills in Jailbreaking Large Language Models | Shih-Wen Ke et.al. | 2503.20320 | null | Kimi |
1084 | 2025-03-26 | QualiSpeech: A Speech Quality Assessment Dataset with Natural Language Reasoning and Descriptions | Siyin Wang et.al. | 2503.20290 | null | Kimi |
1085 | 2025-03-26 | sudo rm -rf agentic_security | Sejin Lee et.al. | 2503.20279 | link | Kimi |
1086 | 2025-03-26 | ViLBench: A Suite for Vision-Language Process Reward Modeling | Haoqin Tu et.al. | 2503.20271 | null | Kimi |
1087 | 2025-03-26 | Qwen2.5-Omni Technical Report | Jin Xu et.al. | 2503.20215 | null | Kimi |
1088 | 2025-03-26 | SARGes: Semantically Aligned Reliable Gesture Generation via Intent Chain | Nan Gao et.al. | 2503.20202 | null | Kimi |
1089 | 2025-03-26 | Open Deep Search: Democratizing Search with Open-source Reasoning Agents | Salaheddin Alzubi et.al. | 2503.20201 | link | Kimi |
1090 | 2025-03-25 | Can Multi-modal (reasoning) LLMs work as deepfake detectors? | Simiao Ren et.al. | 2503.20084 | null | Kimi |
1091 | 2025-03-25 | Cross-Tokenizer Distillation via Approximate Likelihood Matching | Benjamin Minixhofer et.al. | 2503.20083 | link | Kimi |
1092 | 2025-03-25 | OmniNova:A General Multimodal Agent Framework | Pengfei Du et.al. | 2503.20028 | null | Kimi |
1093 | 2025-03-25 | ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback | Bohan Zhai et.al. | 2503.19988 | link | Kimi |
1094 | 2025-03-25 | LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation | Han Chen et.al. | 2503.19950 | link | Kimi |
1095 | 2025-03-25 | CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning | Hao Yu et.al. | 2503.19900 | link | Kimi |
1096 | 2025-03-25 | Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking | Xiaoyu Tian et.al. | 2503.19855 | null | Kimi |
1097 | 2025-03-25 | FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs | Carlos Plou et.al. | 2503.19850 | null | Kimi |
1098 | 2025-03-25 | A Comparative Analysis of Word Segmentation, Part-of-Speech Tagging, and Named Entity Recognition for Historical Chinese Sources, 1900-1950 | Zhao Fang et.al. | 2503.19844 | null | Kimi |
1099 | 2025-03-25 | PAVE: Patching and Adapting Video Large Language Models | Zhuoming Liu et.al. | 2503.19794 | link | Kimi |
1100 | 2025-03-25 | Gemma 3 Technical Report | Gemma Team et.al. | 2503.19786 | null | Kimi |
1101 | 2025-03-25 | AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation | Itay Nakash et.al. | 2503.19693 | link | Kimi |
1102 | 2025-03-25 | 1.4 Million Open-Source Distilled Reasoning Dataset to Empower Large Language Model Training | Han Zhao et.al. | 2503.19633 | null | Kimi |
1103 | 2025-03-25 | Innate Reasoning is Not Enough: In-Context Learning Enhances Reasoning Large Language Models with Less Overthinking | Yuyao Ge et.al. | 2503.19602 | null | Kimi |
1104 | 2025-03-25 | Scaling Laws of Synthetic Data for Language Models | Zeyu Qin et.al. | 2503.19551 | null | Kimi |
1105 | 2025-03-25 | FLEX: A Benchmark for Evaluating Robustness of Fairness in Large Language Models | Dahyun Jung et.al. | 2503.19540 | link | Kimi |
1106 | 2025-03-25 | ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning | Mingyang Chen et.al. | 2503.19470 | null | Kimi |
1107 | 2025-03-25 | DeCAP: Context-Adaptive Prompt Generation for Debiasing Zero-shot Question Answering in Large Language Models | Suyoung Bae et.al. | 2503.19426 | null | Kimi |
1108 | 2025-03-25 | Process or Result? Manipulated Ending Tokens Can Mislead Reasoning LLMs to Ignore the Correct Reasoning Steps | Yu Cui et.al. | 2503.19326 | null | Kimi |
1109 | 2025-03-25 | Long-Context Autoregressive Video Modeling with Next-Frame Prediction | Yuchao Gu et.al. | 2503.19325 | link | Kimi |
1110 | 2025-03-25 | Context-Aware Semantic Segmentation: Enhancing Pixel-Level Understanding with Large Language Models for Advanced Vision Applications | Ben Rahman et.al. | 2503.19276 | null | Kimi |
1111 | 2025-03-25 | MARS: Memory-Enhanced Agents with Reflective Self-improvement | Xuechen Liang et.al. | 2503.19271 | null | Kimi |
1112 | 2025-03-25 | Linguistic Blind Spots of Large Language Models | Jiali Cheng et.al. | 2503.19260 | null | Kimi |
1113 | 2025-03-25 | SCI-IDEA: Context-Aware Scientific Ideation Using Token and Sentence Embeddings | Farhana Keya et.al. | 2503.19257 | null | Kimi |
1114 | 2025-03-24 | A Survey of Large Language Model Agents for Question Answering | Murong Yue et.al. | 2503.19213 | null | Kimi |
1115 | 2025-03-24 | Overtrained Language Models Are Harder to Fine-Tune | Jacob Mitchell Springer et.al. | 2503.19206 | null | Kimi |
1116 | 2025-03-24 | Language Model Uncertainty Quantification with Attention Chain | Yinghao Li et.al. | 2503.19168 | link | Kimi |
1117 | 2025-03-24 | LLM-Based Insight Extraction for Contact Center Analytics and Cost-Efficient Deployment | Varsha Embar et.al. | 2503.19090 | null | Kimi |
1118 | 2025-03-24 | Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization | Zhanda Zhu et.al. | 2503.19050 | link | Kimi |
1119 | 2025-03-24 | LookAhead Tuning: Safer Language Models via Partial Answer Previews | Kangwei Liu et.al. | 2503.19041 | link | Kimi |
1120 | 2025-03-24 | Exploring Training and Inference Scaling Laws in Generative Retrieval | Hongru Cai et.al. | 2503.18941 | link | Kimi |
1121 | 2025-03-24 | xKV: Cross-Layer SVD for KV-Cache Compression | Chi-Chih Chang et.al. | 2503.18893 | link | Kimi |
1122 | 2025-03-24 | SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild | Weihao Zeng et.al. | 2503.18892 | null | Kimi |
1123 | 2025-03-24 | AgentDropout: Dynamic Agent Elimination for Token-Efficient and High-Performance LLM-Based Multi-Agent Collaboration | Zhexuan Wang et.al. | 2503.18891 | link | Kimi |
1124 | 2025-03-24 | I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders | Andrey Galichin et.al. | 2503.18878 | link | Kimi |
1125 | 2025-03-24 | EconEvals: Benchmarks and Litmus Tests for LLM Agents in Unknown Environments | Sara Fish et.al. | 2503.18825 | null | Kimi |
1126 | 2025-03-24 | REALM: A Dataset of Real-World LLM Use Cases | Jingwen Cheng et.al. | 2503.18792 | null | Kimi |
1127 | 2025-03-24 | BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV Cache | Dayou Du et.al. | 2503.18773 | link | Kimi |
1128 | 2025-03-24 | AlphaSpace: Enabling Robotic Actions through Semantic Tokenization and Symbolic Reasoning | Alan Dao et.al. | 2503.18769 | null | Kimi |
1129 | 2025-03-24 | Commander-GPT: Fully Unleashing the Sarcasm Detection Capability of Multi-Modal Large Language Models | Yazhou Zhang et.al. | 2503.18681 | null | Kimi |
1130 | 2025-03-24 | Distil-xLSTM: Learning Attention Mechanisms through Recurrent Structures | Abdoul Majid O. Thiombiano et.al. | 2503.18565 | null | Kimi |
1131 | 2025-03-24 | Self-Reported Confidence of Large Language Models in Gastroenterology: Analysis of Commercial, Open-Source, and Quantized Models | Nariman Naderi et.al. | 2503.18562 | null | Kimi |
1132 | 2025-03-24 | Instruction-Aligned Visual Attention for Mitigating Hallucinations in Large Vision-Language Models | Bin Li et.al. | 2503.18556 | null | Kimi |
1133 | 2025-03-24 | SciClaims: An End-to-End Generative System for Biomedical Claim Analysis | Raúl Ortega et.al. | 2503.18526 | null | Kimi |
1134 | 2025-03-24 | Verbal Process Supervision Elicits Better Coding Agents | Hao-Yuan Chen et.al. | 2503.18494 | null | Kimi |
1135 | 2025-03-24 | Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding | Xiangrui Liu et.al. | 2503.18478 | null | Kimi |
1136 | 2025-03-24 | A Simple yet Effective Layout Token in Large Language Models for Document Understanding | Zhaoqing Zhu et.al. | 2503.18434 | null | Kimi |
1137 | 2025-03-24 | Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning | Junsong Li et.al. | 2503.18432 | null | Kimi |
1138 | 2025-03-24 | Breaking the Encoder Barrier for Seamless Video-Language Understanding | Handong Li et.al. | 2503.18422 | null | Kimi |
1139 | 2025-03-24 | J&H: Evaluating the Robustness of Large Language Models Under Knowledge-Injection Attacks in Legal Domain | Yiran Hu et.al. | 2503.18360 | link | Kimi |
1140 | 2025-03-24 | Bridging Writing Manner Gap in Visual Instruction Tuning by Creating LLM-aligned Instructions | Dong Jing et.al. | 2503.18320 | null | Kimi |
1141 | 2025-03-24 | Jenga: Effective Memory Management for Serving LLM with Heterogeneity | Chen Zhang et.al. | 2503.18292 | null | Kimi |
1142 | 2025-03-24 | Sun-Shine: A Large Language Model for Tibetan Culture | Cheng Huang et.al. | 2503.18288 | link | Kimi |
1143 | 2025-03-24 | TopV: Compatible Token Pruning with Inference Time Optimization for Fast and Low-Memory Multimodal Vision Language Model | Cheng Yang et.al. | 2503.18278 | null | Kimi |
1144 | 2025-03-24 | Bridging Emotions and Architecture: Sentiment Analysis in Modern Distributed Systems | Mahak Shah et.al. | 2503.18260 | null | Kimi |
1145 | 2025-03-23 | ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices | Aneesh Vathul et.al. | 2503.18242 | null | Kimi |
1146 | 2025-03-23 | Unmasking Deceptive Visuals: Benchmarking Multimodal Large Language Models on Misleading Chart Question Answering | Zixin Chen et.al. | 2503.18172 | null | Kimi |
1147 | 2025-03-23 | LocDiffusion: Identifying Locations on Earth by Diffusing in the Hilbert Space | Zhangyu Wang et.al. | 2503.18142 | null | Kimi |
1148 | 2025-03-23 | AGIR: Assessing 3D Gait Impairment with Reasoning based on LLMs | Diwei Wang et.al. | 2503.18141 | null | Kimi |
1149 | 2025-03-23 | GeoBenchX: Benchmarking LLMs for Multistep Geospatial Tasks | Varvara Krechetova et.al. | 2503.18129 | link | Kimi |
1150 | 2025-03-20 | Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation | Yuqing Wang et.al. | 2503.16430 | null | Kimi |
1151 | 2025-03-20 | XAttention: Block Sparse Attention with Antidiagonal Scoring | Ruyi Xu et.al. | 2503.16428 | link | Kimi |
1152 | 2025-03-20 | DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding | Keyan Chen et.al. | 2503.16426 | link | Kimi |
1153 | 2025-03-20 | Tokenize Image as a Set | Zigang Geng et.al. | 2503.16425 | link | Kimi |
1154 | 2025-03-20 | 1000+ FPS 4D Gaussian Splatting for Dynamic Scene Rendering | Yuheng Yuan et.al. | 2503.16422 | null | Kimi |
1155 | 2025-03-20 | Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models | Yang Sui et.al. | 2503.16419 | link | Kimi |
1156 | 2025-03-20 | Survey on Evaluation of LLM-based Agents | Asaf Yehudai et.al. | 2503.16416 | null | Kimi |
1157 | 2025-03-20 | M3: 3D-Spatial MultiModal Memory | Xueyan Zou et.al. | 2503.16413 | link | Kimi |
1158 | 2025-03-20 | RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints | Yiran Qin et.al. | 2503.16408 | null | Kimi |
1159 | 2025-03-20 | The Emperor’s New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination | Yifan Sun et.al. | 2503.16402 | link | Kimi |
1160 | 2025-03-20 | SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation | Chun-Han Yao et.al. | 2503.16396 | null | Kimi |
1161 | 2025-03-20 | Do Visual Imaginations Improve Vision-and-Language Navigation Agents? | Akhil Perincherry et.al. | 2503.16394 | null | Kimi |
1162 | 2025-03-20 | Attentional Triple-Encoder Network in Spatiospectral Domains for Medical Image Segmentation | Kristin Qi et.al. | 2503.16389 | null | Kimi |
1163 | 2025-03-20 | Deconstructing Long Chain-of-Thought: A Structured Reasoning Optimization Framework for Long CoT Distillation | Yijia Luo et.al. | 2503.16385 | link | Kimi |
1164 | 2025-03-20 | LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images | Leyang Wang et.al. | 2503.16376 | null | Kimi |
1165 | 2025-03-20 | NuiScene: Exploring Efficient Generation of Unbounded Outdoor Scenes | Han-Hung Lee et.al. | 2503.16375 | link | Kimi |
1166 | 2025-03-20 | JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse | Muyao Li et.al. | 2503.16365 | null | Kimi |
1167 | 2025-03-20 | Neural Networks: According to the Principles of Grassmann Algebra | Z. Zarezadeh et.al. | 2503.16364 | null | Kimi |
1168 | 2025-03-20 | CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners | Yunzhi Yao et.al. | 2503.16356 | link | Kimi |
1169 | 2025-03-20 | Enhancing Software Quality Assurance with an Adaptive Differential Evolution based Quantum Variational Autoencoder-Transformer Model | Seshu Babu Barma et.al. | 2503.16335 | null | Kimi |
1170 | 2025-03-20 | LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates | Ying Shen et.al. | 2503.16334 | null | Kimi |
1171 | 2025-03-20 | OmniGeo: Towards a Multimodal Large Language Models for Geospatial Artificial Intelligence | Long Yuan et.al. | 2503.16326 | null | Kimi |
1172 | 2025-03-20 | Bridging Technology and Humanities: Evaluating the Impact of Large Language Models on Social Sciences Research with DeepSeek-R1 | Peiran Gu et.al. | 2503.16304 | null | Kimi |
1173 | 2025-03-20 | Unleashing Vecset Diffusion Model for Fast Shape Generation | Zeqiang Lai et.al. | 2503.16302 | link | Kimi |
1174 | 2025-03-20 | PSA-MIL: A Probabilistic Spatial Attention-Based Multiple Instance Learning for Whole Slide Image Classification | Sharon Peled et.al. | 2503.16284 | link | Kimi |
1175 | 2025-03-20 | Chain of Functions: A Programmatic Pipeline for Fine-Grained Chart Reasoning Data | Zijian Li et.al. | 2503.16260 | null | Kimi |
1176 | 2025-03-20 | Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models | Keda Tao et.al. | 2503.16257 | null | Kimi |
1177 | 2025-03-20 | M2N2V2: Multi-Modal Unsupervised and Training-free Interactive Segmentation | Markus Karmann et.al. | 2503.16254 | null | Kimi |
1178 | 2025-03-20 | Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning | Zhaowei Liu et.al. | 2503.16252 | link | Kimi |
1179 | 2025-03-20 | AI Agents in Cryptoland: Practical Attacks and No Silver Bullet | Atharv Singh Patlan et.al. | 2503.16248 | null | Kimi |
1180 | 2025-03-20 | Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t | Quy-Anh Dang et.al. | 2503.16219 | link | Kimi |
1181 | 2025-03-20 | Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation | Andrea Maracani et.al. | 2503.16184 | null | Kimi |
1182 | 2025-03-20 | SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs | Shibo Jie et.al. | 2503.16163 | null | Kimi |
1183 | 2025-03-20 | Tuning LLMs by RAG Principles: Towards LLM-native Memory | Jiale Wei et.al. | 2503.16071 | link | Kimi |
1184 | 2025-03-20 | PromptHash: Affinity-Prompted Collaborative Cross-Modal Learning for Adaptive Hashing Retrieval | Qiang Zou et.al. | 2503.16064 | link | Kimi |
1185 | 2025-03-20 | Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts | Yike Yuan et.al. | 2503.16057 | null | Kimi |
1186 | 2025-03-20 | Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond | Yaoyao Yu et.al. | 2503.16040 | null | Kimi |
1187 | 2025-03-20 | Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models | Zhihang Liu et.al. | 2503.16036 | link | Kimi |
1188 | 2025-03-20 | The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement | Ruihan Yang et.al. | 2503.16024 | null | Kimi |
1189 | 2025-03-20 | Autonomous AI imitators increase diversity in homogeneous information ecosystems | Emil Bakkensen Johansen et.al. | 2503.16021 | null | Kimi |
1190 | 2025-03-20 | GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions | Xiaomeng Chu et.al. | 2503.16013 | null | Kimi |
1191 | 2025-03-20 | Adaptive Group Policy Optimization: Towards Stable Training and Token-Efficient Reasoning | Chen Li et.al. | 2503.15952 | null | Kimi |
1192 | 2025-03-20 | Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment | Gaole Dai et.al. | 2503.15937 | null | Kimi |
1193 | 2025-03-20 | SPIN: Accelerating Large Language Model Inference with Heterogeneous Speculative Models | Fahao Chen et.al. | 2503.15921 | null | Kimi |
1194 | 2025-03-20 | DeepPsy-Agent: A Stage-Aware and Deep-Thinking Emotional Support Agent System | Kai Chen et.al. | 2503.15876 | null | Kimi |
1195 | 2025-03-20 | MASH-VLM: Mitigating Action-Scene Hallucination in Video-LLMs through Disentangled Spatial-Temporal Representations | Kyungho Bae et.al. | 2503.15871 | null | Kimi |
1196 | 2025-03-20 | Uncertainty Quantification and Confidence Calibration in Large Language Models: A Survey | Xiaoou Liu et.al. | 2503.15850 | null | Kimi |
1197 | 2025-03-20 | Entropy-based Exploration Conduction for Multi-step Reasoning | Jinghan Zhang et.al. | 2503.15848 | null | Kimi |
1198 | 2025-03-20 | Grammar and Gameplay-aligned RL for Game Description Generation with LLMs | Tsunehiko Tanaka et.al. | 2503.15783 | null | Kimi |
1199 | 2025-03-19 | UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction | Shravan Nayak et.al. | 2503.15661 | null | Kimi |
1200 | 2025-03-19 | LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning | Federico Cocchi et.al. | 2503.15621 | link | Kimi |
1201 | 2025-03-19 | Dynamic Bi-Elman Attention Networks (DBEAN): Dual-Directional Context-Aware Representation Learning for Enhanced Text Classification | ZhengLin Lai et.al. | 2503.15469 | link | Kimi |
1202 | 2025-03-19 | SemEval-2025 Task 1: AdMIRe – Advancing Multimodal Idiomaticity Representation | Thomas Pickard et.al. | 2503.15358 | null | Kimi |
1203 | 2025-03-19 | MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration | David Wan et.al. | 2503.15272 | null | Kimi |
1204 | 2025-03-19 | Do Chains-of-Thoughts of Large Language Models Suffer from Hallucinations, Cognitive Biases, or Phobias in Bayesian Reasoning? | Roberto Araya et.al. | 2503.15268 | null | Kimi |
1205 | 2025-03-19 | Efficient allocation of image recognition and LLM tasks on multi-GPU system | Marcin Lawenda et.al. | 2503.15252 | null | Kimi |
1206 | 2025-03-19 | Automated Non-Functional Requirements Generation in Software Engineering with Large Language Models: A Comparative Study | Jomar Thomas Almonte et.al. | 2503.15248 | null | Kimi |
1207 | 2025-03-19 | BigO(Bench) – Can LLMs Generate Code with Controlled Time and Space Complexity? | Pierre Chambon et.al. | 2503.15242 | link | Kimi |
1208 | 2025-03-19 | Exploring Large Language Models for Word Games:Who is the Spy? | Chentian Wei et.al. | 2503.15235 | link | Kimi |
1209 | 2025-03-19 | CoE: Chain-of-Explanation via Automatic Visual Concept Circuit Description and Polysemanticity Quantification | Wenlong Yu et.al. | 2503.15234 | link | Kimi |
1210 | 2025-03-19 | A Review on Large Language Models for Visual Analytics | Navya Sonal Agarwal et.al. | 2503.15176 | null | Kimi |
1211 | 2025-03-19 | Machine Unlearning in Hyperbolic vs. Euclidean Multimodal Contrastive Learning: Adapting Alignment Calibration to MERU | Àlex Pujol Vidal et.al. | 2503.15166 | null | Kimi |
1212 | 2025-03-19 | VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making | Mohamed Salim Aissi et.al. | 2503.15108 | null | Kimi |
1213 | 2025-03-19 | Towards Understanding the Safety Boundaries of DeepSeek Models: Evaluation and Findings | Zonghao Ying et.al. | 2503.15092 | link | Kimi |
1214 | 2025-03-19 | Prada: Black-Box LLM Adaptation with Private Data on Resource-Constrained Devices | Ziyao Wang et.al. | 2503.14932 | null | Kimi |
1215 | 2025-03-19 | MASS: Mathematical Data Selection via Skill Graphs for Pretraining Large Language Models | Jiazheng Li et.al. | 2503.14917 | null | Kimi |
1216 | 2025-03-19 | Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations | Shuo Li et.al. | 2503.14895 | null | Kimi |
1217 | 2025-03-19 | MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer | Honglin Lin et.al. | 2503.14891 | link | Kimi |
1218 | 2025-03-19 | Communication-Efficient Distributed On-Device LLM Inference Over Wireless Networks | Kai Zhang et.al. | 2503.14882 | null | Kimi |
1219 | 2025-03-19 | Exploring the Limits of KV Cache Compression in Visual Autoregressive Transformers | Bo Chen et.al. | 2503.14881 | null | Kimi |
1220 | 2025-03-19 | LogLLaMA: Transformer-based log anomaly detection with LLaMA | Zhuoyi Yang et.al. | 2503.14849 | null | Kimi |
1221 | 2025-03-18 | RAGO: Systematic Performance Optimization for Retrieval-Augmented Generation Serving | Wenqi Jiang et.al. | 2503.14649 | null | Kimi |
1222 | 2025-03-18 | Dynamic Accumulated Attention Map for Interpreting Evolution of Decision-Making in Vision Transformer | Yi Liao et.al. | 2503.14640 | null | Kimi |
1223 | 2025-03-18 | Assessing Large Language Models for Automated Feedback Generation in Learning Programming Problem Solving | Priscylla Silva et.al. | 2503.14630 | link | Kimi |
1224 | 2025-03-18 | Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives | Sara Sarto et.al. | 2503.14604 | null | Kimi |
1225 | 2025-03-19 | State Space Model Meets Transformer: A New Paradigm for 3D Object Detection | Chuxin Wang et.al. | 2503.14493 | null | Kimi |
1226 | 2025-03-18 | DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers | Minglei Shi et.al. | 2503.14487 | null | Kimi |
1227 | 2025-03-18 | Gricean Norms as a Basis for Effective Collaboration | Fardin Saad et.al. | 2503.14484 | link | Kimi |
1228 | 2025-03-18 | LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers | Nikhil Abhyankar et.al. | 2503.14434 | link | Kimi |
1229 | 2025-03-18 | PLAY2PROMPT: Zero-shot Tool Instruction Optimization for LLM Agents via Tool Play | Wei Fang et.al. | 2503.14432 | null | Kimi |
1230 | 2025-03-18 | VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation | Shoubin Yu et.al. | 2503.14350 | null | Kimi |
1231 | 2025-03-18 | DualToken: Towards Unifying Visual Understanding and Generation with Dual Visual Vocabularies | Wei Song et.al. | 2503.14324 | link | Kimi |
1232 | 2025-03-18 | DARS: Dynamic Action Re-Sampling to Enhance Coding Agent Performance by Adaptive Tree Traversal | Vaibhav Aggarwal et.al. | 2503.14269 | link | Kimi |
1233 | 2025-03-18 | Speculative Decoding for Verilog: Speed and Quality, All in One | Changran Xu et.al. | 2503.14153 | null | Kimi |
1234 | 2025-03-18 | Inference-Time Intervention in Large Language Models for Reliable Requirement Verification | Paul Darm et.al. | 2503.14130 | null | Kimi |
1235 | 2025-03-18 | Growing a Twig to Accelerate Large Vision-Language Models | Zhenwei Shao et.al. | 2503.14075 | null | Kimi |
1236 | 2025-03-18 | Fast Autoregressive Video Generation with Diagonal Decoding | Yang Ye et.al. | 2503.14070 | null | Kimi |
1237 | 2025-03-18 | Empowering Smaller Models: Tuning LLaMA and Gemma with Chain-of-Thought for Ukrainian Exam Tasks | Mykyta Syromiatnikov et.al. | 2503.13988 | link | Kimi |
1238 | 2025-03-18 | Improving LLM Video Understanding with 16 Frames Per Second | Yixuan Li et.al. | 2503.13956 | null | Kimi |
1239 | 2025-03-18 | ConSCompF: Consistency-focused Similarity Comparison Framework for Generative Large Language Models | Alexey Karev et.al. | 2503.13923 | null | Kimi |
1240 | 2025-03-18 | Automatic MILP Model Construction for Multi-Robot Task Allocation and Scheduling Based on Large Language Models | Mingming Peng et.al. | 2503.13813 | null | Kimi |
1241 | 2025-03-18 | LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation | Yang Zhou et.al. | 2503.13794 | null | Kimi |
1242 | 2025-03-17 | Mitigating KV Cache Competition to Enhance User Experience in LLM Inference | Haiying Shen et.al. | 2503.13773 | null | Kimi |
1243 | 2025-03-17 | Do Large Language Models Understand Performance Optimization? | Bowen Cui et.al. | 2503.13772 | null | Kimi |
1244 | 2025-03-17 | MetaScale: Test-Time Scaling with Evolving Meta-Thoughts | Qin Liu et.al. | 2503.13447 | null | Kimi |
1245 | 2025-03-17 | VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning | Ye Liu et.al. | 2503.13444 | link | Kimi |
1246 | 2025-03-17 | xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference | Maximilian Beck et.al. | 2503.13427 | link | Kimi |
1247 | 2025-03-17 | MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research | James Burgess et.al. | 2503.13399 | link | Kimi |
1248 | 2025-03-17 | Cream of the Crop: Harvesting Rich, Scalable and Transferable Multi-Modal Data for Instruction Fine-Tuning | Mengyao Lyu et.al. | 2503.13383 | null | Kimi |
1249 | 2025-03-17 | TimeZero: Temporal Video Grounding with Reasoning-Guided LVLM | Ye Wang et.al. | 2503.13377 | link | Kimi |
1250 | 2025-03-17 | Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning | Hai-Long Sun et.al. | 2503.13360 | null | Kimi |
1251 | 2025-03-17 | Computation Mechanism Behind LLM Position Generalization | Chi Han et.al. | 2503.13305 | null | Kimi |
1252 | 2025-03-17 | A Survey on Transformer Context Extension: Approaches and Evaluation | Yijun Liu et.al. | 2503.13299 | null | Kimi |
1253 | 2025-03-17 | $φ$ -Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation | Fangzhi Xu et.al. | 2503.13288 | link | Kimi |
1254 | 2025-03-17 | Knowledge-Aware Iterative Retrieval for Multi-Agent Systems | Seyoung Song et.al. | 2503.13275 | null | Kimi |
1255 | 2025-03-17 | Can Language Models Follow Multiple Turns of Entangled Instructions? | Chi Han et.al. | 2503.13222 | link | Kimi |
1256 | 2025-03-17 | Improving Complex Reasoning with Dynamic Prompt Corruption: A soft prompt Optimization Approach | Sinan Fan et.al. | 2503.13208 | null | Kimi |
1257 | 2025-03-17 | MAP: Evaluation and Multi-Agent Enhancement of Large Language Models for Inpatient Pathways | Zhen Chen et.al. | 2503.13205 | null | Kimi |
1258 | 2025-03-17 | Are LLMs (Really) Ideological? An IRT-based Analysis and Alignment Tool for Perceived Socio-Economic Bias in LLMs | Jasmin Wachter et.al. | 2503.13149 | null | Kimi |
1259 | 2025-03-17 | Logic-in-Frames: Dynamic Keyframe Search via Visual Semantic-Logical Verification for Long Video Understanding | Weiyu Guo et.al. | 2503.13139 | null | Kimi |
1260 | 2025-03-17 | Lifting the Veil on Visual Information Flow in MLLMs: Unlocking Pathways to Faster Inference | Hao Yin et.al. | 2503.13108 | link | Kimi |
1261 | 2025-03-17 | ClearSight: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Models | Hao Yin et.al. | 2503.13107 | link | Kimi |
1262 | 2025-03-17 | A Multi-Stage Framework with Taxonomy-Guided Reasoning for Occupation Classification Using Large Language Models | Palakorn Achananuparp et.al. | 2503.12989 | null | Kimi |
1263 | 2025-03-17 | ROMA: a Read-Only-Memory-based Accelerator for QLoRA-based On-Device LLM | Wenqiang Wang et.al. | 2503.12988 | null | Kimi |
1264 | 2025-03-17 | R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization | Jingyi Zhang et.al. | 2503.12937 | link | Kimi |
1265 | 2025-03-17 | HICD: Hallucination-Inducing via Attention Dispersion for Contrastive Decoding to Mitigate Hallucinations in Large Language Models | Xinyan Jiang et.al. | 2503.12908 | null | Kimi |
1266 | 2025-03-17 | VITED: Video Temporal Evidence Distillation | Yujie Lu et.al. | 2503.12855 | null | Kimi |
1267 | 2025-03-17 | ACT360: An Efficient 360-Degree Action Detection and Summarization Framework for Mission-Critical Training and Debriefing | Aditi Tiwari et.al. | 2503.12852 | null | Kimi |
1268 | 2025-03-17 | Grounded Chain-of-Thought for Multimodal Large Language Models | Qiong Wu et.al. | 2503.12799 | link | Kimi |
1269 | 2025-03-17 | DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding | Xinyu Ma et.al. | 2503.12797 | link | Kimi |
1270 | 2025-03-17 | Identifying Cooperative Personalities in Multi-agent Contexts through Personality Steering with Representation Engineering | Kenneth J. K. Ong et.al. | 2503.12722 | null | Kimi |
1271 | 2025-03-17 | Can Reasoning Models Reason about Hardware? An Agentic HLS Perspective | Luca Collini et.al. | 2503.12721 | null | Kimi |
1272 | 2025-03-16 | Plausibility Vaccine: Injecting LLM Knowledge for Event Plausibility | Jacob Chmura et.al. | 2503.12667 | null | Kimi |
1273 | 2025-03-16 | VeriLA: A Human-Centered Evaluation Framework for Interpretable Verification of LLM Agent Failures | Yoo Yeon Sung et.al. | 2503.12651 | null | Kimi |
1274 | 2025-03-16 | MAVEN: Multi-modal Attention for Valence-Arousal Emotion Network | Vrushank Ahire et.al. | 2503.12623 | link | Kimi |
1275 | 2025-03-16 | MoECollab: Democratizing LLM Development Through Collaborative Mixture of Experts | Harshit et.al. | 2503.12592 | null | Kimi |
1276 | 2025-03-16 | AdaReTaKe: Adaptive Redundancy Reduction to Perceive Longer for Video-language Understanding | Xiao Wang et.al. | 2503.12559 | link | Kimi |
1277 | 2025-03-14 | TreeMeshGPT: Artistic Mesh Generation with Autoregressive Tree Sequencing | Stefan Lionar et.al. | 2503.11629 | link | Kimi |
1278 | 2025-03-14 | ASMA-Tune: Unlocking LLMs’ Assembly Code Comprehension via Structural-Semantic Instruction Tuning | Xinyi Wang et.al. | 2503.11617 | link | Kimi |
1279 | 2025-03-14 | Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs using Semantic Space | Zhiliang Chen et.al. | 2503.11586 | link | Kimi |
1280 | 2025-03-14 | Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers | Weiming Ren et.al. | 2503.11579 | null | Kimi |
1281 | 2025-03-14 | Implicit Bias-Like Patterns in Reasoning Models | Messi H. J. Lee et.al. | 2503.11572 | null | Kimi |
1282 | 2025-03-14 | Similarity-Aware Token Pruning: Your VLM but Faster | Ahmadreza Jeddi et.al. | 2503.11549 | link | Kimi |
1283 | 2025-03-14 | HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models | Ziqin Zhou et.al. | 2503.11513 | null | Kimi |
1284 | 2025-03-14 | V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning | Zixu Cheng et.al. | 2503.11495 | null | Kimi |
1285 | 2025-03-14 | Integrating LLMs in Gamified Systems | Carlos J. Costa et.al. | 2503.11458 | null | Kimi |
1286 | 2025-03-14 | Cerebrum (AIOS SDK): A Platform for Agent Development, Deployment, Distribution, and Discovery | Balaji Rama et.al. | 2503.11444 | link | Kimi |
1287 | 2025-03-14 | Text Compression for Efficient Language Generation | David Gu et.al. | 2503.11426 | null | Kimi |
1288 | 2025-03-14 | Optimizing Large Language Models for Detecting Symptoms of Comorbid Depression or Anxiety in Chronic Diseases: Insights from Patient Messages | Jiyeong Kim et.al. | 2503.11384 | null | Kimi |
1289 | 2025-03-14 | Annotating Scientific Uncertainty: A comprehensive model using linguistic patterns and comparison with existing approaches | Panggih Kusuma Ningrum et.al. | 2503.11376 | null | Kimi |
1290 | 2025-03-14 | AIstorian lets AI be a historian: A KG-powered multi-agent system for accurate biography generation | Fengyu Li et.al. | 2503.11346 | link | Kimi |
1291 | 2025-03-14 | Rule-Guided Feedback: Enhancing Reasoning by Enforcing Rule Adherence in Large Language Models | Aissatou Diallo et.al. | 2503.11336 | null | Kimi |
1292 | 2025-03-14 | Safe-VAR: Safe Visual Autoregressive Model for Text-to-Image Generative Watermarking | Ziyi Wang et.al. | 2503.11324 | null | Kimi |
1293 | 2025-03-14 | MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens | Jeong Hun Yeo et.al. | 2503.11315 | link | Kimi |
1294 | 2025-03-14 | Unlocking General Long Chain-of-Thought Reasoning Capabilities of Large Language Models via Representation Engineering | Xinyu Tang et.al. | 2503.11314 | link | Kimi |
1295 | 2025-03-14 | BriLLM: Brain-inspired Large Language Model | Hai Zhao et.al. | 2503.11299 | null | Kimi |
1296 | 2025-03-14 | Line of Duty: Evaluating LLM Self-Knowledge via Consistency in Feasibility Boundaries | Sahil Kale et.al. | 2503.11256 | link | Kimi |
1297 | 2025-03-14 | Reasoning-Grounded Natural Language Explanations for Language Models | Vojtech Cahlik et.al. | 2503.11248 | link | Kimi |
1298 | 2025-03-14 | Can Large Reasoning Models do Analogical Reasoning under Perceptual Uncertainty? | Giacomo Camposampiero et.al. | 2503.11207 | link | Kimi |
1299 | 2025-03-14 | LLaVA-MLB: Mitigating and Leveraging Attention Bias for Training-Free Video LLMs | Leqi Shen et.al. | 2503.11205 | null | Kimi |
1300 | 2025-03-14 | Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering | Gang Li et.al. | 2503.11197 | link | Kimi |
1301 | 2025-03-14 | FastVID: Dynamic Density Pruning for Fast Video Large Language Models | Leqi Shen et.al. | 2503.11187 | link | Kimi |
1302 | 2025-03-14 | Towards Extreme Pruning of LLMs with Plug-and-Play Mixed Sparsity | Chi Xu et.al. | 2503.11164 | null | Kimi |
1303 | 2025-03-14 | Don’t Take Things Out of Context: Attention Intervention for Enhancing Chain-of-Thought Reasoning in Large Language Models | Shaotian Yan et.al. | 2503.11154 | null | Kimi |
1304 | 2025-03-14 | MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling | Rachel S. Y. Teo et.al. | 2503.11144 | link | Kimi |
1305 | 2025-03-14 | X-EcoMLA: Upcycling Pre-Trained Attention into MLA for Efficient and Extreme KV Compression | Guihong Li et.al. | 2503.11132 | null | Kimi |
1306 | 2025-03-14 | Direction-Aware Diagonal Autoregressive Image Generation | Yijia Xu et.al. | 2503.11129 | null | Kimi |
1307 | 2025-03-13 | GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing | Rongyao Fang et.al. | 2503.10639 | link | Kimi |
1308 | 2025-03-13 | Kolmogorov-Arnold Attention: Is Learnable Attention Better For Vision Transformers? | Subhajit Maity et.al. | 2503.10632 | null | Kimi |
1309 | 2025-03-13 | SciVerse: Unveiling the Knowledge Comprehension and Visual Reasoning of LMMs on Multi-modal Scientific Problems | Ziyu Guo et.al. | 2503.10627 | null | Kimi |
1310 | 2025-03-13 | Transformers without Normalization | Jiachen Zhu et.al. | 2503.10622 | null | Kimi |
1311 | 2025-03-13 | Siege: Autonomous Multi-Turn Jailbreaking of Large Language Models with Tree Search | Andy Zhou et.al. | 2503.10619 | null | Kimi |
1312 | 2025-03-13 | Compositional Subspace Representation Fine-tuning for Adaptive Large Language Models | Andy Zhou et.al. | 2503.10617 | null | Kimi |
1313 | 2025-03-13 | TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention | Jinhao Duan et.al. | 2503.10602 | link | Kimi |
1314 | 2025-03-13 | Long Context Tuning for Video Generation | Yuwei Guo et.al. | 2503.10589 | null | Kimi |
1315 | 2025-03-13 | Autoregressive Image Generation with Randomized Parallel Decoding | Haopeng Li et.al. | 2503.10568 | link | Kimi |
1316 | 2025-03-13 | AudioX: Diffusion Transformer for Anything-to-Audio Generation | Zeyue Tian et.al. | 2503.10522 | null | Kimi |
1317 | 2025-03-13 | TokenCarve: Information-Preserving Visual Token Compression in Multimodal Large Language Models | Xudong Tan et.al. | 2503.10501 | link | Kimi |
1318 | 2025-03-13 | MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation | Weihao Xuan et.al. | 2503.10497 | null | Kimi |
1319 | 2025-03-13 | Source-primed Multi-turn Conversation Helps Large Language Models Translate Documents | Hanxu Hu et.al. | 2503.10494 | link | Kimi |
1320 | 2025-03-13 | LLMs in Disease Diagnosis: A Comparative Study of DeepSeek-R1 and O3 Mini Across Chronic Health Conditions | Gaurav Kumar Gupta et.al. | 2503.10486 | null | Kimi |
1321 | 2025-03-13 | DynaCode: A Dynamic Complexity-Aware Code Benchmark for Evaluating Large Language Models in Code Generation | Wenhao Hu et.al. | 2503.10452 | null | Kimi |
1322 | 2025-03-13 | 4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models | Wanhua Li et.al. | 2503.10437 | link | Kimi |
1323 | 2025-03-13 | BeamLLM: Vision-Empowered mmWave Beam Prediction with Large Language Models | Can Zheng et.al. | 2503.10432 | null | Kimi |
1324 | 2025-03-13 | Understanding the Logical Capabilities of Large Language Models via Out-of-Context Representation Learning | Jonathan Shaki et.al. | 2503.10408 | null | Kimi |
1325 | 2025-03-13 | SPPO:Efficient Long-sequence LLM Training via Adaptive Sequence Pipeline Parallel Offloading | Qiaoling Chen et.al. | 2503.10377 | null | Kimi |
1326 | 2025-03-13 | G-Boost: Boosting Private SLMs with General LLMs | Yijiang Fan et.al. | 2503.10367 | null | Kimi |
1327 | 2025-03-13 | KV-Distill: Nearly Lossless Learnable Context Compression for LLMs | Vivek Chari et.al. | 2503.10337 | null | Kimi |
1328 | 2025-03-13 | Collaborative Speculative Inference for Efficient LLM Inference Serving | Luyao Gao et.al. | 2503.10325 | null | Kimi |
1329 | 2025-03-13 | VisualPRM: An Effective Process Reward Model for Multimodal Reasoning | Weiyun Wang et.al. | 2503.10291 | null | Kimi |
1330 | 2025-03-13 | Efficient Federated Fine-Tuning of Large Language Models with Layer Dropout | Shilong Wang et.al. | 2503.10217 | null | Kimi |
1331 | 2025-03-13 | LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents | Boyu Chen et.al. | 2503.10200 | null | Kimi |
1332 | 2025-03-13 | Robustness Tokens: Towards Adversarial Robustness of Transformers | Brian Pulfer et.al. | 2503.10191 | link | Kimi |
1333 | 2025-03-13 | Through the Magnifying Glass: Adaptive Perception Magnification for Hallucination-Free VLM Decoding | Shunqi Mao et.al. | 2503.10183 | null | Kimi |
1334 | 2025-03-13 | “Well, Keep Thinking”: Enhancing LLM Reasoning with Adaptive Injection Decoding | Hyunbin Jin et.al. | 2503.10167 | null | Kimi |
1335 | 2025-03-13 | ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning | Pengfei Luo et.al. | 2503.10166 | link | Kimi |
1336 | 2025-03-13 | Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding | Jinze Li et.al. | 2503.10135 | null | Kimi |
1337 | 2025-03-11 | QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension | Yongdong Luo et.al. | 2503.08689 | link | Kimi |
1338 | 2025-03-11 | CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous Driving | Changxing Liu et.al. | 2503.08683 | link | Kimi |
1339 | 2025-03-11 | Chain-of-Thought Reasoning In The Wild Is Not Always Faithful | Iván Arcuschin et.al. | 2503.08679 | link | Kimi |
1340 | 2025-03-11 | REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder | Yitian Zhang et.al. | 2503.08665 | null | Kimi |
1341 | 2025-03-11 | MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention | Yuhan Wang et.al. | 2503.08664 | link | Kimi |
1342 | 2025-03-11 | Exploring the Word Sense Disambiguation Capabilities of Large Language Models | Pierpaolo Basile et.al. | 2503.08662 | null | Kimi |
1343 | 2025-03-11 | Efficient Many-Shot In-Context Learning with Dynamic Block-Sparse Attention | Emily Xiao et.al. | 2503.08640 | link | Kimi |
1344 | 2025-03-11 | HiP-AD: Hierarchical and Multi-Granularity Planning with Deformable Attention for Autonomous Driving in a Single Decoder | Yingqi Tang et.al. | 2503.08612 | link | Kimi |
1345 | 2025-03-11 | Vision Transformer for Intracranial Hemorrhage Classification in CT Scans Using an Entropy-Aware Fuzzy Integral Strategy for Adaptive Scan-Level Decision Fusion | Mehdi Hosseini Chagahi et.al. | 2503.08609 | null | Kimi |
1346 | 2025-03-11 | Tuning-Free Multi-Event Long Video Generation via Synchronized Coupled Sampling | Subin Kim et.al. | 2503.08605 | null | Kimi |
1347 | 2025-03-11 | RAG-Adapter: A Plug-and-Play RAG-enhanced Framework for Long Video Understanding | Xichen Tan et.al. | 2503.08576 | null | Kimi |
1348 | 2025-03-11 | DeepReview: Improving LLM-based Paper Review with Human-like Deep Thinking Process | Minjun Zhu et.al. | 2503.08569 | null | Kimi |
1349 | 2025-03-11 | MoE-Loco: Mixture of Experts for Multitask Locomotion | Runhan Huang et.al. | 2503.08564 | null | Kimi |
1350 | 2025-03-11 | Reasoning and Sampling-Augmented MCQ Difficulty Prediction via LLMs | Wanyong Feng et.al. | 2503.08551 | null | Kimi |
1351 | 2025-03-11 | Graph of AI Ideas: Leveraging Knowledge Graphs and LLMs for AI Research Idea Generation | Xian Gao et.al. | 2503.08549 | null | Kimi |
1352 | 2025-03-11 | DAFE: LLM-Based Evaluation Through Dynamic Arbitration for Free-Form Question-Answering | Sher Badshah et.al. | 2503.08542 | null | Kimi |
1353 | 2025-03-11 | Mellow: a small audio language model for reasoning | Soham Deshmukh et.al. | 2503.08540 | link | Kimi |
1354 | 2025-03-11 | Chemical reasoning in LLMs unlocks steerable synthesis planning and reaction mechanism elucidation | Andres M Bran et.al. | 2503.08537 | link | Kimi |
1355 | 2025-03-11 | ChromaFormer: A Scalable and Accurate Transformer Architecture for Land Cover Classification | Mingshi Li et.al. | 2503.08534 | null | Kimi |
1356 | 2025-03-11 | Visual Attention Graph | Kai-Fu Yang et.al. | 2503.08531 | null | Kimi |
1357 | 2025-03-11 | Position-Aware Depth Decay Decoding ( $D^3$ ): Boosting Large Language Model Inference Efficiency | Siqi Fan et.al. | 2503.08524 | null | Kimi |
1358 | 2025-03-11 | Enhancing Multi-Hop Fact Verification with Structured Knowledge-Augmented Large Language Models | Han Cao et.al. | 2503.08495 | null | Kimi |
1359 | 2025-03-11 | Accelerating MoE Model Inference with Expert Sharding | Oana Balmau et.al. | 2503.08467 | null | Kimi |
1360 | 2025-03-11 | FastCache: Optimizing Multimodal LLM Serving through Lightweight KV-Cache Compression Framework | Jianian Zhu et.al. | 2503.08461 | null | Kimi |
1361 | 2025-03-11 | Controlling Latent Diffusion Using Latent CLIP | Jason Becker et.al. | 2503.08455 | link | Kimi |
1362 | 2025-03-11 | TokenSim: Enabling Hardware and Software Exploration for Large Language Model Inference Systems | Feiyang Wu et.al. | 2503.08415 | link | Kimi |
1363 | 2025-03-11 | Fact-checking with Generative AI: A Systematic Cross-Topic Examination of LLMs Capacity to Detect Veracity of Political Information | Elizaveta Kuznetsova et.al. | 2503.08404 | null | Kimi |
1364 | 2025-03-11 | Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens | Qingsong Xie et.al. | 2503.08377 | null | Kimi |
1365 | 2025-03-11 | Robust Latent Matters: Boosting Image Generation with Sampling Error | Kai Qiu et.al. | 2503.08354 | link | Kimi |
1366 | 2025-03-11 | Attention Reallocation: Towards Zero-cost and Controllable Hallucination Mitigation of MLLMs | Chongjun Tu et.al. | 2503.08342 | null | Kimi |
1367 | 2025-03-10 | Securing External Deeper-than-black-box GPAI Evaluations | Alejandro Tlaie et.al. | 2503.07496 | null | Kimi |
1368 | 2025-03-10 | V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation | Guiwei Zhang et.al. | 2503.07493 | link | Kimi |
1369 | 2025-03-10 | Destination Calculus: A Linear λ-Calculus for Purely Functional Memory Writes | Thomas Bagrel et.al. | 2503.07489 | link | Kimi |
1370 | 2025-03-10 | LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition? | Bangyan Li et.al. | 2503.07487 | null | Kimi |
1371 | 2025-03-10 | Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction | Zongzheng Zhang et.al. | 2503.07485 | link | Kimi |
1372 | 2025-03-10 | VLRMBench: A Comprehensive and Challenging Benchmark for Vision-Language Reward Models | Jiacheng Ruan et.al. | 2503.07478 | link | Kimi |
1373 | 2025-03-10 | Petri Net Modeling of Root Hair Response to Phosphate Starvation in Arabidopsis Thaliana | Amber H. B. Fijn et.al. | 2503.07477 | null | Kimi |
1374 | 2025-03-10 | MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning | Xiangru Tang et.al. | 2503.07459 | link | Kimi |
1375 | 2025-03-10 | Open-Set Gait Recognition from Sparse mmWave Radar Point Clouds | Riccardo Mazzieri et.al. | 2503.07435 | link | Kimi |
1376 | 2025-03-10 | DRESS: Diffusion Reasoning-based Reward Shaping Scheme For Intelligent Networks | Feiran You et.al. | 2503.07433 | link | Kimi |
1377 | 2025-03-10 | CATPlan: Loss-based Collision Prediction in End-to-End Autonomous Driving | Ziliang Xiong et.al. | 2503.07425 | null | Kimi |
1378 | 2025-03-10 | Inorganic Catalyst Efficiency Prediction Based on EAPCR Model: A Deep Learning Solution for Multi-Source Heterogeneous Data | Zhangdi Liu et.al. | 2503.07424 | null | Kimi |
1379 | 2025-03-10 | AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion | Mingzhen Sun et.al. | 2503.07418 | null | Kimi |
1380 | 2025-03-07 | Task-oriented Uncertainty Collaborative Learning for Label-Efficient Brain Tumor Segmentation | Zhenxuan Zhang et.al. | 2503.05682 | link | Kimi |
1381 | 2025-03-07 | The latent variable proximal point algorithm for variational problems with inequality constraints | Jørgen S. Dokken et.al. | 2503.05672 | link | Kimi |
1382 | 2025-03-07 | Kinodynamic Model Predictive Control for Energy Efficient Locomotion of Legged Robots with Parallel Elasticity | Yulun Zhuang et.al. | 2503.05666 | null | Kimi |
1383 | 2025-03-07 | A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval | Yu Zhang et.al. | 2503.05659 | link | Kimi |
1384 | 2025-03-07 | Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning | Justin Chih-Yao Chen et.al. | 2503.05641 | null | Kimi |
1385 | 2025-03-07 | Exploring FMCW Radars and Feature Maps for Activity Recognition: A Benchmark Study | Ali Samimi Fard et.al. | 2503.05629 | null | Kimi |
1386 | 2025-03-07 | FMT:A Multimodal Pneumonia Detection Model Based on Stacking MOE Framework | Jingyu Xu et.al. | 2503.05626 | null | Kimi |
1387 | 2025-03-07 | A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models | Dong Shu et.al. | 2503.05613 | null | Kimi |
1388 | 2025-03-07 | D2GV: Deformable 2D Gaussian Splatting for Video Representation in 400FPS | Mufan Liu et.al. | 2503.05600 | link | Kimi |
1389 | 2025-03-07 | R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning | Huatong Song et.al. | 2503.05592 | null | Kimi |
1390 | 2025-03-06 | L $^2$ M: Mutual Information Scaling Law for Long-Context Language Modeling | Zhuo Chen et.al. | 2503.04725 | link | Kimi |
1391 | 2025-03-07 | Shifting Long-Context LLMs Research from Input to Output | Yuhao Wu et.al. | 2503.04723 | null | Kimi |
1392 | 2025-03-06 | Enough Coin Flips Can Make LLMs Act Bayesian | Ritwik Gupta et.al. | 2503.04722 | null | Kimi |
1393 | 2025-03-06 | L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning | Pranjal Aggarwal et.al. | 2503.04697 | null | Kimi |
1394 | 2025-03-06 | UIPE: Enhancing LLM Unlearning by Removing Knowledge Related to Forgetting Targets | Wenyu Wang et.al. | 2503.04693 | null | Kimi |
1395 | 2025-03-06 | The Influence of Prior Discourse on Conversational Agent-Driven Decision-Making | Stephen Pilli et.al. | 2503.04692 | null | Kimi |
1396 | 2025-03-06 | Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases | Pengcheng Qiu et.al. | 2503.04691 | null | Kimi |
1397 | 2025-03-07 | DIMSUM: Discourse in Mathematical Reasoning as a Supervision Module | Krish Sharma et.al. | 2503.04685 | null | Kimi |
1398 | 2025-03-06 | Matrix Factorization for Inferring Associations and Missing Links | Ryan Barron et.al. | 2503.04680 | null | Kimi |
1399 | 2025-03-06 | LLM-guided Plan and Retrieval: A Strategic Alignment for Interpretable User Satisfaction Estimation in Dialogue | Sangyeop Kim et.al. | 2503.04675 | null | Kimi |
1400 | 2025-03-05 | PacketCLIP: Multi-Modal Embedding of Network Traffic and Language for Cybersecurity Reasoning | Ryozo Masukawa et.al. | 2503.03747 | null | Kimi |
1401 | 2025-03-05 | Process-based Self-Rewarding Language Models | Shimao Zhang et.al. | 2503.03746 | null | Kimi |
1402 | 2025-03-05 | Rethinking Deep Clustering Paradigms: Self-Supervision Is All You Need | Amal Shaheena et.al. | 2503.03733 | null | Kimi |
1403 | 2025-03-05 | Towards Understanding Distilled Reasoning Models: A Representational Approach | David D. Baek et.al. | 2503.03730 | null | Kimi |
1404 | 2025-03-05 | When Radiation Meets Linux: Analyzing Soft Errors in Linux on COTS SoCs under Proton Irradiation | Saad Memon et.al. | 2503.03722 | null | Kimi |
1405 | 2025-03-05 | Improving LLM Safety Alignment with Dual-Objective Optimization | Xuandong Zhao et.al. | 2503.03710 | link | Kimi |
1406 | 2025-03-05 | Rethinking Video Tokenization: A Conditioned Diffusion-based Approach | Nianzu Yang et.al. | 2503.03708 | link | Kimi |
1407 | 2025-03-05 | A Practical Memory Injection Attack against LLM Agents | Shen Dong et.al. | 2503.03704 | null | Kimi |
1408 | 2025-03-05 | ILLC: Iterative Layer-by-Layer Compression for Enhancing Structural Faithfulness in SpArX | Ungsik Kim et.al. | 2503.03693 | null | Kimi |
1409 | 2025-03-05 | DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance | Zhao Yang et.al. | 2503.03689 | link | Kimi |
1410 | 2025-03-04 | Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation | Han Xue et.al. | 2503.02881 | link | Kimi |
1411 | 2025-03-04 | Language Models can Self-Improve at State-Value Estimation for Better Search | Ethan Mendes et.al. | 2503.02878 | link | Kimi |
1412 | 2025-03-04 | Weak-to-Strong Generalization Even in Random Feature Networks, Provably | Marko Medvedev et.al. | 2503.02877 | null | Kimi |
1413 | 2025-03-04 | SPIDER: A Comprehensive Multi-Organ Supervised Pathology Dataset and Baseline Models | Dmitry Nechaev et.al. | 2503.02876 | link | Kimi |
1414 | 2025-03-04 | The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models | Ke Ji et.al. | 2503.02875 | null | Kimi |
1415 | 2025-03-04 | Calibrating LLM Confidence with Semantic Steering: A Multi-Prompt Aggregation Framework | Ziang Zhou et.al. | 2503.02863 | null | Kimi |
1416 | 2025-03-04 | PileUp Mitigation at the HL-LHC Using Attention for Event-Wide Context | Luke Vaughan et.al. | 2503.02860 | null | Kimi |
1417 | 2025-03-04 | Unsupervised Attributed Dynamic Network Embedding with Stability Guarantees | Emma Ceccherini et.al. | 2503.02859 | null | Kimi |
1418 | 2025-03-04 | Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs’ Decoding Layers | Zicong He et.al. | 2503.02851 | link | Kimi |
1419 | 2025-03-04 | Multimodal Deep Learning for Subtype Classification in Breast Cancer Using Histopathological Images and Gene Expression Data | Amin Honarmandi Shandiz et.al. | 2503.02849 | link | Kimi |
1420 | 2025-02-28 | LLM Post-Training: A Deep Dive into Reasoning Large Language Models | Komal Kumar et.al. | 2502.21321 | link | Kimi |
1421 | 2025-02-28 | Doping dependence of 2-spinon excitations in the doped 1D cuprate Ba $2$CuO${3+δ}$ | Jiarui Li et.al. | 2502.21316 | null | Kimi |
1422 | 2025-02-28 | Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos | Zhiyu Tan et.al. | 2502.21314 | null | Kimi |
1423 | 2025-02-28 | FANformer: Improving Large Language Models Through Effective Periodicity Modeling | Yihong Dong et.al. | 2502.21309 | null | Kimi |
1424 | 2025-02-28 | Persuasion Should be Double-Blind: A Multi-Domain Dialogue Dataset With Faithfulness Based on Causal Theory of Mind | Dingyi Zhang et.al. | 2502.21297 | null | Kimi |
1425 | 2025-02-28 | Bilevel Optimized Implicit Neural Representation for Scan-Specific Accelerated MRI Reconstruction | Hongze Yu et.al. | 2502.21292 | null | Kimi |
1426 | 2025-02-28 | Contextualizing biological perturbation experiments through language | Menghua Wu et.al. | 2502.21290 | link | Kimi |
1427 | 2025-02-28 | Boosting Prediction with Data Missing Not at Random | Yuan Bian et.al. | 2502.21276 | null | Kimi |
1428 | 2025-02-28 | Adaptive Keyframe Sampling for Long Video Understanding | Xi Tang et.al. | 2502.21271 | null | Kimi |
1429 | 2025-02-28 | Dynamical Decoupling of Generalization and Overfitting in Large Two-Layer Networks | Andrea Montanari et.al. | 2502.21269 | null | Kimi |
1430 | 2025-02-27 | R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts | Zhongyang Li et.al. | 2502.20395 | link | Kimi |
1431 | 2025-02-27 | LIFT-GS: Cross-Scene Render-Supervised Distillation for 3D Language Grounding | Ang Cao et.al. | 2502.20389 | null | Kimi |
1432 | 2025-02-27 | InsTaG: Learning Personalized 3D Talking Head from Few-Second Video | Jiahe Li et.al. | 2502.20387 | link | Kimi |
1433 | 2025-02-27 | ATLAS Navigator: Active Task-driven LAnguage-embedded Gaussian Splatting | Dexter Ong et.al. | 2502.20386 | null | Kimi |
1434 | 2025-02-27 | rSPDE: tools for statistical modeling using fractional SPDEs | David Bolin et.al. | 2502.20385 | null | Kimi |
1435 | 2025-02-27 | PhantomWiki: On-Demand Datasets for Reasoning and Retrieval Evaluation | Albert Gong et.al. | 2502.20377 | link | Kimi |
1436 | 2025-02-27 | Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization | Ryan C. Barron et.al. | 2502.20364 | link | Kimi |
1437 | 2025-02-27 | Bridging the Creativity Understanding Gap: Small-Scale Human Alignment Enables Expert-Level Humor Ranking in LLMs | Kuan Lok Zhou et.al. | 2502.20356 | null | Kimi |
1438 | 2025-02-27 | Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners | Daniele Paliotta et.al. | 2502.20339 | null | Kimi |
1439 | 2025-02-27 | KeBaB: $k$ -mer based breaking for finding super-maximal exact matches | Nathaniel K. Brown et.al. | 2502.20338 | null | Kimi |
1440 | 2025-02-26 | Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models | Lucy Xiaoyang Shi et.al. | 2502.19417 | null | Kimi |
1441 | 2025-02-26 | Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation | Shiven Sinha et.al. | 2502.19414 | link | Kimi |
1442 | 2025-02-26 | The Mighty ToRR: A Benchmark for Table Reasoning and Robustness | Shir Ashury-Tahan et.al. | 2502.19412 | link | Kimi |
1443 | 2025-02-26 | Code to Think, Think to Code: A Survey on Code-Enhanced Reasoning and Reasoning-Driven Code Intelligence in LLMs | Dayu Yang et.al. | 2502.19411 | link | Kimi |
1444 | 2025-02-26 | ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models | Danae Sánchez Villegas et.al. | 2502.19409 | null | Kimi |
1445 | 2025-02-26 | Learning Code-Edit Embedding to Model Student Debugging Behavior | Hasnain Heickal et.al. | 2502.19407 | null | Kimi |
1446 | 2025-02-26 | Single-shot and two-shot decoding with generalized bicycle codes | Hsiang-Ku Lin et.al. | 2502.19406 | null | Kimi |
1447 | 2025-02-26 | General Reasoning Requires Learning to Reason from the Get-go | Seungwook Han et.al. | 2502.19402 | null | Kimi |
1448 | 2025-02-26 | TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding | Max Ku et.al. | 2502.19400 | null | Kimi |
1449 | 2025-02-26 | The End of Easy Phenomenology for CMB Experiments: A Case Study in the Dark Sector | Cynthia Trendafilova et.al. | 2502.19383 | null | Kimi |
1450 | 2025-02-25 | K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs | Ziheng Ouyang et.al. | 2502.18461 | null | Kimi |
1451 | 2025-02-25 | DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers | Xueguang Ma et.al. | 2502.18460 | link | Kimi |
1452 | 2025-02-25 | GHOST 2.0: generative high-fidelity one shot transfer of heads | Alexander Groshev et.al. | 2502.18417 | null | Kimi |
1453 | 2025-02-25 | Comparative Analysis of MDL-VAE vs. Standard VAE on 202 Years of Gynecological Data | Paula Santos et.al. | 2502.18412 | null | Kimi |
1454 | 2025-02-25 | The FFT Strikes Back: An Efficient Alternative to Self-Attention | Jacob Fein-Ashley et.al. | 2502.18394 | link | Kimi |
1455 | 2025-02-25 | ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation | Yifan Pu et.al. | 2502.18364 | null | Kimi |
1456 | 2025-02-25 | Graph Inference with Effective Resistance Queries | Huck Bennett et.al. | 2502.18350 | null | Kimi |
1457 | 2025-02-25 | Mapping of Subjective Accounts into Interpreted Clusters (MOSAIC): Topic Modelling and LLM applied to Stroboscopic Phenomenology | Romy Beauté et.al. | 2502.18318 | null | Kimi |
1458 | 2025-02-25 | RefuteBench 2.0 – Agentic Benchmark for Dynamic Evaluation of LLM Responses to Refutation Instruction | Jianhao Yan et.al. | 2502.18308 | null | Kimi |
1459 | 2025-02-25 | DeepCircuitX: A Comprehensive Repository-Level Dataset for RTL Code Understanding, Generation, and PPA Analysis | Zeju Li et.al. | 2502.18297 | null | Kimi |
1460 | 2025-02-24 | S4S: Solving for a Diffusion Model Solver | Eric Frankel et.al. | 2502.17423 | null | Kimi |
1461 | 2025-02-24 | MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs | Jiarui Zhang et.al. | 2502.17422 | link | Kimi |
1462 | 2025-02-24 | LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification | Penghui Yang et.al. | 2502.17421 | link | Kimi |
1463 | 2025-02-24 | Reasoning with Latent Thoughts: On the Power of Looped Transformers | Nikunj Saunshi et.al. | 2502.17416 | null | Kimi |
1464 | 2025-02-24 | X-Dancer: Expressive Music to Human Dance Video Generation | Zeyuan Chen et.al. | 2502.17414 | null | Kimi |
1465 | 2025-02-24 | Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning | Guijin Son et.al. | 2502.17407 | link | Kimi |
1466 | 2025-02-24 | Advances in multiparameter quantum sensing and metrology | Luca Pezzè et.al. | 2502.17396 | null | Kimi |
1467 | 2025-02-24 | The Empirical Impact of Reducing Symmetries on the Performance of Deep Ensembles and MoE | Andrei Chernov et.al. | 2502.17391 | null | Kimi |
1468 | 2025-02-24 | A Concise Lyapunov Analysis of Nesterov’s Accelerated Gradient Method | Jun Liu et.al. | 2502.17373 | null | Kimi |
1469 | 2025-02-24 | KV-Edit: Training-Free Image Editing for Precise Background Preservation | Tianrui Zhu et.al. | 2502.17363 | link | Kimi |
1470 | 2025-02-21 | Sparks of cognitive flexibility: self-guided context inference for flexible stimulus-response mapping by attentional routing | Rowan Sommers et.al. | 2502.15634 | null | Kimi |
1471 | 2025-02-21 | LaTIM: Measuring Latent Token-to-Token Interactions in Mamba Models | Hugo Pitorro et.al. | 2502.15612 | null | Kimi |
1472 | 2025-02-21 | Generalizing From Short to Long: Effective Data Synthesis for Long-Context Instruction Tuning | Wenhao Zhu et.al. | 2502.15592 | link | Kimi |
1473 | 2025-02-21 | LightThinker: Thinking Step-by-Step Compression | Jintian Zhang et.al. | 2502.15589 | null | Kimi |
1474 | 2025-02-21 | Adaptive Expansion for Hypergraph Learning | Tianyi Ma et.al. | 2502.15564 | null | Kimi |
1475 | 2025-02-21 | Estimating Vehicle Speed on Roadways Using RNNs and Transformers: A Video-based Approach | Sai Krishna Reddy Mareddy et.al. | 2502.15545 | null | Kimi |
1476 | 2025-02-21 | Generalization Guarantees for Representation Learning via Data-Dependent Gaussian Mixture Priors | Milad Sefidgaran et.al. | 2502.15540 | link | Kimi |
1477 | 2025-02-21 | Towards Swift Serverless LLM Cold Starts with ParaServe | Chiheng Lou et.al. | 2502.15524 | null | Kimi |
1478 | 2025-02-21 | Solving Inverse Problems with Deep Linear Neural Networks: Global Convergence Guarantees for Gradient Descent with Weight Decay | Hannah Laus et.al. | 2502.15522 | null | Kimi |
1479 | 2025-02-21 | Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection | Yue Sun et.al. | 2502.15516 | null | Kimi |
1480 | 2025-02-20 | LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention | Shang Yang et.al. | 2502.14866 | link | Kimi |
1481 | 2025-02-20 | CLIPPER: Compression enables long-context synthetic data generation | Chau Minh Pham et.al. | 2502.14854 | link | Kimi |
1482 | 2025-02-20 | Revealing and Mitigating Over-Attention in Knowledge Editing | Pinzheng Wang et.al. | 2502.14838 | link | Kimi |
1483 | 2025-02-20 | Towards Economical Inference: Enabling DeepSeek’s Multi-Head Latent Attention in Any Transformer-based LLMs | Tao Ji et.al. | 2502.14837 | link | Kimi |
1484 | 2025-02-20 | Improving the Diffusability of Autoencoders | Ivan Skorokhodov et.al. | 2502.14831 | null | Kimi |
1485 | 2025-02-20 | Measuring Faithfulness of Chains of Thought by Unlearning Reasoning Steps | Martin Tutek et.al. | 2502.14829 | link | Kimi |
1486 | 2025-02-20 | Turning on the Light: Polymorphism-Induced Photoluminescence in Cysteine Crystals | Debarshi Banerjee et.al. | 2502.14826 | null | Kimi |
1487 | 2025-02-20 | Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models | Vlad Sobal et.al. | 2502.14819 | null | Kimi |
1488 | 2025-02-20 | RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird’s Eye View Segmentation | Henrique Piñeiro Monteagudo et.al. | 2502.14792 | null | Kimi |
1489 | 2025-02-20 | Ray-Tracing for Conditionally Activated Neural Networks | Claudio Gallicchio et.al. | 2502.14788 | null | Kimi |
1490 | 2025-02-20 | LIFT: Improving Long Context Understanding of Large Language Models through Long Input Fine-Tuning | Yansheng Mao et.al. | 2502.14644 | null | Kimi |
1491 | 2025-02-20 | PEARL: Towards Permutation-Resilient LLMs | Liang Chen et.al. | 2502.14628 | link | Kimi |
1492 | 2025-02-20 | PLPHP: Per-Layer Per-Head Vision Token Pruning for Efficient Large Vision-Language Models | Yu Meng et.al. | 2502.14504 | null | Kimi |
1493 | 2025-02-20 | Unshackling Context Length: An Efficient Selective Attention Approach through Query-Key Compression | Haoyu Wang et.al. | 2502.14477 | null | Kimi |
1494 | 2025-02-20 | Early-Exit and Instant Confidence Translation Quality Estimation | Vilém Zouhar et.al. | 2502.14429 | link | Kimi |
1495 | 2025-02-19 | MuDAF: Long-Context Multi-Document Attention Focusing through Contrastive Learning on Attention Heads | Weihao Liu et.al. | 2502.13963 | link | Kimi |
1496 | 2025-02-19 | A Chain-of-Thought Subspace Meta-Learning for Few-shot Image Captioning with Large Vision and Language Models | Hao Huang et.al. | 2502.13942 | null | Kimi |
1497 | 2025-02-19 | Qwen2.5-VL Technical Report | Shuai Bai et.al. | 2502.13923 | null | Kimi |
1498 | 2025-02-19 | LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization | Guanzheng Chen et.al. | 2502.13922 | link | Kimi |
1499 | 2025-02-19 | A measurement-based approach to analyze the power consumption of the softwarized 5G core | Arturo Bellin et.al. | 2502.13879 | null | Kimi |
1500 | 2025-02-19 | SPEX: Scaling Feature Interaction Explanations for LLMs | Justin Singh Kang et.al. | 2502.13870 | link | Kimi |
1501 | 2025-02-19 | Enhancing LLM-Based Recommendations Through Personalized Reasoning | Jiahao Liu et.al. | 2502.13845 | link | Kimi |
1502 | 2025-02-19 | SCALAR: Scientific Citation-based Live Assessment of Long-context Academic Reasoning | Renxi Wang et.al. | 2502.13753 | link | Kimi |
1503 | 2025-02-19 | MoM: Linear Sequence Modeling with Mixture-of-Memories | Jusen Du et.al. | 2502.13685 | link | Kimi |
1504 | 2025-02-19 | PeerQA: A Scientific Question Answering Dataset from Peer Reviews | Tim Baumgärtner et.al. | 2502.13668 | link | Kimi |
1505 | 2025-02-18 | Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning | Jingyang Lin et.al. | 2502.13127 | null | Kimi |
1506 | 2025-02-18 | Eager Updates For Overlapped Communication and Computation in DiLoCo | Satyen Kale et.al. | 2502.12996 | null | Kimi |
1507 | 2025-02-18 | Infinite Retrieval: Attention Enhanced LLMs in Long-Context Processing | Xiaoju Ye et.al. | 2502.12962 | null | Kimi |
1508 | 2025-02-18 | Every Expert Matters: Towards Effective Knowledge Distillation for Mixture-of-Experts Language Models | Gyeongman Kim et.al. | 2502.12947 | null | Kimi |
1509 | 2025-02-18 | S $^2$ R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning | Ruotian Ma et.al. | 2502.12853 | link | Kimi |
1510 | 2025-02-18 | A $^2$ ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization | Junhui He et.al. | 2502.12665 | null | Kimi |
1511 | 2025-02-18 | MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation | Sihyun Yu et.al. | 2502.12632 | null | Kimi |
1512 | 2025-02-18 | Improving Chain-of-Thought Reasoning via Quasi-Symbolic Abstractions | Leonardo Ranaldi et.al. | 2502.12616 | null | Kimi |
1513 | 2025-02-18 | LongFaith: Enhancing Long-Context Reasoning in LLMs with Faithful Synthetic Data | Cehao Yang et.al. | 2502.12583 | link | Kimi |
1514 | 2025-02-18 | HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading | Cheng Luo et.al. | 2502.12574 | link | Kimi |
1515 | 2025-02-17 | Small Models Struggle to Learn from Strong Reasoners | Yuetai Li et.al. | 2502.12143 | null | Kimi |
1516 | 2025-02-17 | SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs | Yige Xu et.al. | 2502.12134 | null | Kimi |
1517 | 2025-02-17 | APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs | Yuxiang Huang et.al. | 2502.12085 | link | Kimi |
1518 | 2025-02-17 | AdaSplash: Adaptive Sparse Flash Attention | Nuno Gonçalves et.al. | 2502.12082 | link | Kimi |
1519 | 2025-02-17 | TokenSkip: Controllable Chain-of-Thought Compression in LLMs | Heming Xia et.al. | 2502.12067 | link | Kimi |
1520 | 2025-02-17 | SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities | Fengqing Jiang et.al. | 2502.12025 | null | Kimi |
1521 | 2025-02-17 | Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving | Xin Xu et.al. | 2502.12022 | null | Kimi |
1522 | 2025-02-17 | Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL | Hanbing Liu et.al. | 2502.11656 | link | Kimi |
1523 | 2025-02-17 | SurgPose: a Dataset for Articulated Robotic Surgical Tool Pose Estimation and Tracking | Zijian Wu et.al. | 2502.11534 | null | Kimi |
1524 | 2025-02-17 | AURORA:Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification | Xiaoyu Tan et.al. | 2502.11520 | null | Kimi |
1525 | 2025-02-14 | Are Large Language Models the future crowd workers of Linguistics? | Iris Ferrazzo et.al. | 2502.10266 | null | Kimi |
1526 | 2025-02-14 | LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs - No Silver Bullet for LC or RAG Routing | Kuan Li et.al. | 2502.09977 | null | Kimi |
1527 | 2025-02-14 | MIR-Bench: Benchmarking LLM’s Long-Context Intelligence via Many-Shot In-Context Inductive Reasoning | Kai Yan et.al. | 2502.09933 | null | Kimi |
1528 | 2025-02-14 | INF^2: High-Throughput Generative Inference of Large Language Models using Near-Storage Processing | Hongsun Jang et.al. | 2502.09921 | null | Kimi |
1529 | 2025-02-13 | ATM-Net: Adaptive Termination and Multi-Precision Neural Networks for Energy-Harvested Edge Intelligence | Neeraj Solanki et.al. | 2502.09822 | null | Kimi |
1530 | 2025-02-13 | NestQuant: Nested Lattice Quantization for Matrix Products and LLMs | Semyon Savkin et.al. | 2502.09720 | null | Kimi |
1531 | 2025-02-13 | MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency | Dongzhi Jiang et.al. | 2502.09621 | null | Kimi |
1532 | 2025-02-13 | CoT-Valve: Length-Compressible Chain-of-Thought Tuning | Xinyin Ma et.al. | 2502.09601 | link | Kimi |
1533 | 2025-02-13 | Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs | Siyan Zhao et.al. | 2502.09597 | link | Kimi |
1534 | 2025-02-13 | SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models | Daniel Fleischer et.al. | 2502.09390 | link | Kimi |
1535 | 2025-02-13 | Generalizability through Explainability: Countering Overfitting with Counterfactual Examples | Flavio Giorgi et.al. | 2502.09193 | null | Kimi |
1536 | 2025-02-13 | Bridging the Gap Between LLMs and Human Intentions: Progresses and Challenges in Instruction Understanding, Intention Reasoning, and Reliable Generation | Zongyu Chang et.al. | 2502.09101 | null | Kimi |
1537 | 2025-02-13 | Unleashing the Power of Large Language Model for Denoising Recommendation | Shuyao Wang et.al. | 2502.09058 | null | Kimi |
1538 | 2025-02-13 | Diversity Enhances an LLM’s Performance in RAG and Long-context Task | Zhchao Wang et.al. | 2502.09017 | null | Kimi |
1539 | 2025-02-13 | RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models | Quan Wei et.al. | 2502.09003 | null | Kimi |
1540 | 2025-02-13 | Task Generalization With AutoRegressive Compositional Structure: Can Learning From $\d$ Tasks Generalize to $\d^{T}$ Tasks? | Amirhesam Abedsoltan et.al. | 2502.08991 | null | Kimi |
1541 | 2025-02-12 | Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning | Qifan Yu et.al. | 2502.08482 | null | Kimi |
1542 | 2025-02-12 | The MoE-Empowered Edge LLMs Deployment: Architecture, Challenges, and Opportunities | Ning Li et.al. | 2502.08381 | null | Kimi |
1543 | 2025-02-12 | Inference-time sparse attention with asymmetric indexing | Pierre-Emmanuel Mazaré et.al. | 2502.08246 | null | Kimi |
1544 | 2025-02-12 | Learning Human Skill Generators at Key-Step Levels | Yilu Wu et.al. | 2502.08234 | null | Kimi |
1545 | 2025-02-12 | Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance | Lingfei Qian et.al. | 2502.08127 | link | Kimi |
1546 | 2025-02-12 | GCoT: Chain-of-Thought Prompt Learning for Graphs | Xingtong Yu et.al. | 2502.08092 | null | Kimi |
1547 | 2025-02-12 | Mixture of Decoupled Message Passing Experts with Entropy Constraint for General Node Classification | Xuanze Chen et.al. | 2502.08083 | null | Kimi |
1548 | 2025-02-11 | Training Sparse Mixture Of Experts Text Embedding Models | Zach Nussbaum et.al. | 2502.07972 | link | Kimi |
1549 | 2025-02-11 | HexGen-2: Disaggregated Generative Inference of LLMs in Heterogeneous Environment | Youhe Jiang et.al. | 2502.07903 | null | Kimi |
1550 | 2025-02-11 | TransMLA: Multi-head Latent Attention Is All You Need | Fanxu Meng et.al. | 2502.07864 | link | Kimi |
1551 | 2025-02-11 | Magic 1-For-1: Generating One Minute Video Clips within One Minute | Hongwei Yi et.al. | 2502.07701 | link | Kimi |
1552 | 2025-02-11 | LASP-2: Rethinking Sequence Parallelism for Linear Attention and Its Hybrid | Weigao Sun et.al. | 2502.07563 | link | Kimi |
1553 | 2025-02-11 | Early Stopping Against Label Noise Without Validation Data | Suqin Yuan et.al. | 2502.07551 | link | Kimi |
1554 | 2025-02-11 | Instance-dependent Early Stopping | Suqin Yuan et.al. | 2502.07547 | link | Kimi |
1555 | 2025-02-11 | Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More | Xialie Zhuang et.al. | 2502.07490 | link | Kimi |
1556 | 2025-02-11 | LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters! | Dacheng Li et.al. | 2502.07374 | link | Kimi |
1557 | 2025-02-11 | LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation | Zican Dong et.al. | 2502.07365 | null | Kimi |
1558 | 2025-02-11 | BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models | Xu Huang et.al. | 2502.07346 | link | Kimi |
1559 | 2025-02-11 | CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction | Junlong Li et.al. | 2502.07316 | link | Kimi |
1560 | 2025-02-11 | OpenGrok: Enhancing SNS Data Processing with Distilled Knowledge and Mask-like Mechanisms | Lumen AI et.al. | 2502.07312 | link | Kimi |
1561 | 2025-02-10 | On the Emergence of Thinking in LLMs I: Searching for the Right Intuition | Guanghao Ye et.al. | 2502.06773 | link | Kimi |
1562 | 2025-02-10 | ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates | Ling Yang et.al. | 2502.06772 | link | Kimi |
1563 | 2025-02-10 | Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs | Ryan Synk et.al. | 2502.06766 | link | Kimi |
1564 | 2025-02-10 | History-Guided Video Diffusion | Kiwhan Song et.al. | 2502.06764 | null | Kimi |
1565 | 2025-02-10 | Rationalization Models for Text-to-SQL | Gaetano Rossiello et.al. | 2502.06759 | null | Kimi |
1566 | 2025-02-10 | MoETuner: Optimized Mixture of Expert Serving with Balanced Expert Placement and Token Routing | Seokjin Go et.al. | 2502.06643 | null | Kimi |
1567 | 2025-02-10 | Scaling Multi-Document Event Summarization: Evaluating Compression vs. Full-Text Approaches | Adithya Pratapa et.al. | 2502.06617 | link | Kimi |
1568 | 2025-02-10 | Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation | Chengwen Qi et.al. | 2502.06563 | link | Kimi |
1569 | 2025-02-10 | CoS: Chain-of-Shot Prompting for Long Video Understanding | Jian Hu et.al. | 2502.06428 | null | Kimi |
1570 | 2025-02-10 | Expect the Unexpected: FailSafe Long Context QA for Finance | Kiran Kamble et.al. | 2502.06329 | null | Kimi |
1571 | 2025-02-07 | Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuray | Yunhang Shen et.al. | 2502.05177 | link | Kimi |
1572 | 2025-02-07 | VideoRoPE: What Makes for Good Video Rotary Position Embedding? | Xilin Wei et.al. | 2502.05173 | link | Kimi |
1573 | 2025-02-07 | Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient | Jan Ludziejewski et.al. | 2502.05172 | null | Kimi |
1574 | 2025-02-07 | NoLiMa: Long-Context Evaluation Beyond Literal Matching | Ali Modarressi et.al. | 2502.05167 | link | Kimi |
1575 | 2025-02-07 | Data-Parallel Neural Network Training via Nonlinearly Preconditioned Trust-Region Method | Samuel A. Cruz Alegría et.al. | 2502.05133 | null | Kimi |
1576 | 2025-02-07 | Adaptive Graph of Thoughts: Test-Time Adaptive Reasoning Unifying Chain, Tree, and Graph Structures | Tushar Pandey et.al. | 2502.05078 | link | Kimi |
1577 | 2025-02-07 | S $^2$ -MAD: Breaking the Token Barrier to Enhance Multi-Agent Debate Efficiency | Yuting Zeng et.al. | 2502.04790 | null | Kimi |
1578 | 2025-02-07 | Early Stopping for Regression Trees | Ratmir Miftachov et.al. | 2502.04709 | null | Kimi |
1579 | 2025-02-07 | ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and Reasoning | Yuwei Yin et.al. | 2502.04689 | link | Kimi |
1580 | 2025-02-07 | Unveiling the Mechanisms of Explicit CoT Training: How Chain-of-Thought Enhances Reasoning Generalization | Xinhao Yao et.al. | 2502.04667 | link | Kimi |
1581 | 2025-02-06 | Exploring operation parallelism vs. ion movement in ion-trapped QCCD architectures | Anabel Ovide et.al. | 2502.04181 | null | Kimi |
1582 | 2025-02-06 | HD-EPIC: A Highly-Detailed Egocentric Video Dataset | Toby Perrett et.al. | 2502.04144 | null | Kimi |
1583 | 2025-02-06 | AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference | Qingyue Yang et.al. | 2502.04077 | link | Kimi |
1584 | 2025-02-06 | RWKV-UI: UI Understanding with Enhanced Perception and Reasoning | Jiaxi Yang et.al. | 2502.03971 | null | Kimi |
1585 | 2025-02-06 | InfinitePOD: Building Datacenter-Scale High-Bandwidth Domain for LLM with Optical Circuit Switching Transceivers | Chenchen Shou et.al. | 2502.03885 | null | Kimi |
1586 | 2025-02-06 | Rank Also Matters: Hierarchical Configuration for Mixture of Adapter Experts in LLM Fine-Tuning | Peizhuang Cong et.al. | 2502.03884 | null | Kimi |
1587 | 2025-02-06 | Identify Critical KV Cache in LLM Inference from an Output Perturbation Perspective | Yuan Feng et.al. | 2502.03805 | link | Kimi |
1588 | 2025-02-05 | (GG) MoE vs. MLP on Tabular Data | Andrei Chernov et.al. | 2502.03608 | null | Kimi |
1589 | 2025-02-05 | HACK: Homomorphic Acceleration via Compression of the Key-Value Cache for Disaggregated LLM Inference | Zeyu Zhang et.al. | 2502.03589 | null | Kimi |
1590 | 2025-02-05 | Demystifying Long Chain-of-Thought Reasoning in LLMs | Edward Yeo et.al. | 2502.03373 | link | Kimi |
1591 | 2025-02-05 | ECM: A Unified Electronic Circuit Model for Explaining the Emergence of In-Context Learning and Chain-of-Thought in Large Language Model | Qiguang Chen et.al. | 2502.03325 | null | Kimi |
1592 | 2025-02-05 | Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning | DiJia Su et.al. | 2502.03275 | null | Kimi |
1593 | 2025-02-05 | MaxInfo: A Training-Free Key-Frame Selection Method Using Maximum Volume for Enhanced Video Understanding | Pengyi Li et.al. | 2502.03183 | null | Kimi |
1594 | 2025-02-05 | Structured Token Retention and Computational Memory Paths in Large Language Models | Jonathan Delena et.al. | 2502.03102 | null | Kimi |
1595 | 2025-02-05 | IAO Prompting: Making Knowledge Flow Explicit in LLMs through Structured Reasoning Templates | Aissatou Diallo et.al. | 2502.03080 | null | Kimi |
1596 | 2025-02-05 | Scaling Laws for Upcycling Mixture-of-Experts Language Models | Seng Pei Liew et.al. | 2502.03009 | null | Kimi |
1597 | 2025-02-05 | LLM-KT: Aligning Large Language Models with Knowledge Tracing using a Plug-and-Play Instruction | Ziwei Wang et.al. | 2502.02945 | null | Kimi |
1598 | 2025-02-05 | Early Stopping in Contextual Bandits and Inferences | Zihan Cui et.al. | 2502.02793 | null | Kimi |
1599 | 2025-02-04 | Twilight: Adaptive Attention Sparsity with Hierarchical Top- $p$ Pruning | Chaofan Lin et.al. | 2502.02770 | null | Kimi |
1600 | 2025-02-04 | Hecate: Unlocking Efficient Sparse Model Training via Fully Sharded Sparse Data Parallelism | Yuhao Qing et.al. | 2502.02581 | null | Kimi |
1601 | 2025-02-04 | Brief analysis of DeepSeek R1 and it’s implications for Generative AI | Sarah Mercer et.al. | 2502.02523 | null | Kimi |
1602 | 2025-02-04 | EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization | Yize Wu et.al. | 2502.02493 | null | Kimi |
1603 | 2025-02-04 | Lower Bounds for Chain-of-Thought Reasoning in Hard-Attention Transformers | Alireza Amiri et.al. | 2502.02393 | null | Kimi |
1604 | 2025-02-04 | STAIR: Improving Safety Alignment with Introspective Reasoning | Yichi Zhang et.al. | 2502.02384 | link | Kimi |
1605 | 2025-02-04 | Premise-Augmented Reasoning Chains Improve Error Identification in Math reasoning with LLMs | Sagnik Mukherjee et.al. | 2502.02362 | null | Kimi |
1606 | 2025-02-04 | VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation | Siyu Xu et.al. | 2502.02175 | null | Kimi |
1607 | 2025-02-04 | M2R2: Mixture of Multi-Rate Residuals for Efficient Transformer Inference | Nikhil Bhendawade et.al. | 2502.02040 | null | Kimi |
1608 | 2025-02-04 | Wavelet-based Positional Representation for Long Context | Yui Oka et.al. | 2502.02004 | null | Kimi |
1609 | 2025-02-04 | MPIC: Position-Independent Multimodal Context Caching System for Efficient MLLM Serving | Shiju Zhao et.al. | 2502.01960 | null | Kimi |
1610 | 2025-01-31 | Scalable-Softmax Is Superior for Attention | Ken M. Nakanishi et.al. | 2501.19399 | null | Kimi |
1611 | 2025-01-31 | Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models | Alina Shutova et.al. | 2501.19392 | link | Kimi |
1612 | 2025-01-31 | Efficient Reasoning with Hidden Thinking | Xuan Shen et.al. | 2501.19201 | link | Kimi |
1613 | 2025-01-31 | Rethinking Early Stopping: Refine, Then Calibrate | Eugène Berta et.al. | 2501.19195 | link | Kimi |
1614 | 2025-01-31 | A theoretical framework for overfitting in energy-based modeling | Giovanni Catania et.al. | 2501.19158 | null | Kimi |
1615 | 2025-01-31 | $\infty$ -Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation | Saul Santos et.al. | 2501.19098 | link | Kimi |
1616 | 2025-01-30 | Rope to Nope and Back Again: A New Hybrid Attention Strategy | Bowen Yang et.al. | 2501.18795 | null | Kimi |
1617 | 2025-01-30 | Zero-shot Large Language Models for Long Clinical Text Summarization with Temporal Reasoning | Maya Kruse et.al. | 2501.18724 | null | Kimi |
1618 | 2025-01-30 | Rethinking Bottlenecks in Safety Fine-Tuning of Vision Language Models | Yi Ding et.al. | 2501.18533 | null | Kimi |
1619 | 2025-01-30 | State Stream Transformer (SST) : Emergent Metacognitive Behaviours Through Latent State Persistence | Thea Aviss et.al. | 2501.18356 | null | Kimi |
1620 | 2025-01-30 | Learning to Plan & Reason for Evaluation with Thinking-LLM-as-a-Judge | Swarnadeep Saha et.al. | 2501.18099 | null | Kimi |
1621 | 2025-01-29 | Physics-Grounded Differentiable Simulation for Soft Growing Robots | Lucas Chen et.al. | 2501.17963 | link | Kimi |
1622 | 2025-01-29 | Free Agent in Agent-Based Mixture-of-Experts Generative AI Framework | Jung-Hua Liu et.al. | 2501.17903 | null | Kimi |
1623 | 2025-01-29 | Formally Verified Binary-level Pointer Analysis | Freek Verbeek et.al. | 2501.17766 | null | Kimi |
1624 | 2025-01-29 | CSEval: Towards Automated, Multi-Dimensional, and Reference-Free Counterspeech Evaluation using Auto-Calibrated LLMs | Amey Hengle et.al. | 2501.17581 | null | Kimi |
1625 | 2025-01-29 | Heuristic-Informed Mixture of Experts for Link Prediction in Multilayer Networks | Lucio La Cava et.al. | 2501.17557 | null | Kimi |
1626 | 2025-01-29 | DINT Transformer | Yueyang Cang et.al. | 2501.17486 | null | Kimi |
1627 | 2025-01-28 | TORCHLIGHT: Shedding LIGHT on Real-World Attacks on Cloudless IoT Devices Concealed within the Tor Network | Yumingzhi Pan et.al. | 2501.16784 | null | Kimi |
1628 | 2025-01-28 | 3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow | Yueen Ma et.al. | 2501.16698 | null | Kimi |
1629 | 2025-01-28 | MCTS-SQL: An Effective Framework for Text-to-SQL with Monte Carlo Tree Search | Shuozhi Yuan et.al. | 2501.16607 | null | Kimi |
1630 | 2025-01-27 | Searching for GEMS: Discovery and Characterization of Two Brown Dwarfs Around M Dwarfs | Alexander Larsen et.al. | 2501.16554 | null | Kimi |
1631 | 2025-01-27 | MoEVD: Enhancing Vulnerability Detection by Mixture-of-Experts (MoE) | Xu Yang et.al. | 2501.16454 | null | Kimi |
1632 | 2025-01-27 | The Effect of Optimal Self-Distillation in Noisy Gaussian Mixture Model | Kaito Takanami et.al. | 2501.16226 | null | Kimi |
1633 | 2025-01-27 | Provence: efficient and robust context pruning for retrieval-augmented generation | Nadezhda Chirkova et.al. | 2501.16214 | null | Kimi |
1634 | 2025-01-27 | Options-Aware Dense Retrieval for Multiple-Choice query Answering | Manish Singh et.al. | 2501.16111 | null | Kimi |
1635 | 2025-01-27 | Static Batching of Irregular Workloads on GPUs: Framework and Application to Efficient MoE Model Inference | Yinghan Li et.al. | 2501.16103 | null | Kimi |
1636 | 2025-01-27 | Understanding Long Videos via LLM-Powered Entity Relation Graphs | Meng Chu et.al. | 2501.15953 | null | Kimi |
1637 | 2025-01-27 | Memorization and Regularization in Generative Diffusion Models | Ricardo Baptista et.al. | 2501.15785 | link | Kimi |
1638 | 2025-01-27 | Renewable Energy Prediction: A Comparative Study of Deep Learning Models for Complex Dataset Analysis | Haibo Wang et.al. | 2501.15731 | null | Kimi |
1639 | 2025-01-26 | A Benchmarking Platform for DDR4 Memory Performance in Data-Center-Class FPGAs | Andrea Galimberti et.al. | 2501.15582 | null | Kimi |
1640 | 2025-01-26 | Qwen2.5-1M Technical Report | An Yang et.al. | 2501.15383 | null | Kimi |
1641 | 2025-01-25 | ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning | Shangqian Gao et.al. | 2501.15316 | null | Kimi |
1642 | 2025-01-24 | Mean-field limit from general mixtures of experts to quantum neural networks | Anderson Melchor Hernandez et.al. | 2501.14660 | null | Kimi |
1643 | 2025-01-24 | Experimentally Evaluating the Resource Efficiency of Big Data Autoscaling | Jonathan Will et.al. | 2501.14456 | link | Kimi |
1644 | 2025-01-24 | Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains | Xu Chu et.al. | 2501.14431 | null | Kimi |
1645 | 2025-01-24 | GraphBC: Improving LLMs for Better Graph Data Processing | Xu Chu et.al. | 2501.14427 | null | Kimi |
1646 | 2025-01-24 | Hierarchical Time-Aware Mixture of Experts for Multi-Modal Sequential Recommendation | Shengzhe Zhang et.al. | 2501.14269 | link | Kimi |
1647 | 2025-01-24 | Serving Long-Context LLMs at the Mobile Edge: Test-Time Reinforcement Learning-based Model Caching and Inference Offloading | Minrui Xu et.al. | 2501.14205 | null | Kimi |
1648 | 2025-01-23 | Can We Generate Images with CoT? Let’s Verify and Reinforce Image Generation Step by Step | Ziyu Guo et.al. | 2501.13926 | link | Kimi |
1649 | 2025-01-23 | The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities | Chan-Jan Hsu et.al. | 2501.13921 | link | Kimi |
1650 | 2025-01-23 | PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection | Peiyuan Zhang et.al. | 2501.13898 | link | Kimi |
1651 | 2025-01-23 | Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models | Zhenghao Lin et.al. | 2501.13629 | null | Kimi |
1652 | 2025-01-23 | Coarse-to-Fine Process Reward Modeling for Enhanced Mathematical Reasoning | Yulan Hu et.al. | 2501.13622 | null | Kimi |
1653 | 2025-01-23 | Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge | Haomiao Xiong et.al. | 2501.13468 | link | Kimi |
1654 | 2025-01-23 | Contrast: A Hybrid Architecture of Transformers and State Space Models for Low-Level Vision | Aman Urumbekov et.al. | 2501.13353 | null | Kimi |
1655 | 2025-01-23 | Qrazor: Reliable and effortless 4-bit llm quantization by significant data razoring | Dongyoung Lee et.al. | 2501.13331 | null | Kimi |
1656 | 2025-01-22 | Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment | Melissa Kazemi Rad et.al. | 2501.13080 | null | Kimi |
1657 | 2025-01-22 | Autonomy-of-Experts Models | Ang Lv et.al. | 2501.13074 | null | Kimi |
1658 | 2025-01-22 | Ehrenfeucht-Haussler Rank and Chain of Thought | Pablo Barceló et.al. | 2501.12997 | null | Kimi |
1659 | 2025-01-22 | LLM4WM: Adapting LLM for Wireless Multi-Tasking | Xuanyu Liu et.al. | 2501.12983 | null | Kimi |
1660 | 2025-01-22 | Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference | Weizhi Fei et.al. | 2501.12959 | null | Kimi |
1661 | 2025-01-22 | Late Breaking Result: FPGA-Based Emulation and Fault Injection for CNN Inference Accelerators | Filip Masar et.al. | 2501.12818 | link | Kimi |
1662 | 2025-01-22 | NExtLong: Toward Effective Long-Context Training without Long Documents | Chaochen Gao et.al. | 2501.12766 | link | Kimi |
1663 | 2025-01-22 | BLR-MoE: Boosted Language-Routing Mixture of Experts for Domain-Robust Multilingual E2E ASR | Guodong Ma et.al. | 2501.12602 | null | Kimi |
1664 | 2025-01-22 | Kimi k1.5: Scaling Reinforcement Learning with LLMs | Kimi Team et.al. | 2501.12599 | null | Kimi |
1665 | 2025-01-21 | Slot-BERT: Self-supervised Object Discovery in Surgical Video | Guiqiu Liao et.al. | 2501.12477 | null | Kimi |
1666 | 2025-01-21 | Video Depth Anything: Consistent Depth Estimation for Super-Long Videos | Sili Chen et.al. | 2501.12375 | null | Kimi |
1667 | 2025-01-21 | Is Long Context All You Need? Leveraging LLM’s Extended Context for NL2SQL | Yeounoh Chung et.al. | 2501.12372 | link | Kimi |
1668 | 2025-01-21 | Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models | Samira Abnar et.al. | 2501.12370 | null | Kimi |
1669 | 2025-01-21 | CDW-CoT: Clustered Distance-Weighted Chain-of-Thoughts Reasoning | Yuanheng Fang et.al. | 2501.12226 | null | Kimi |
1670 | 2025-01-21 | Muon-specific two-Higgs-doublet model for $(g-2)_μ$ anomaly, $W$ -boson mass-shift, and Zee model | I. A. Yafi et.al. | 2501.12181 | null | Kimi |
1671 | 2025-01-21 | Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models | Zihan Qiu et.al. | 2501.11873 | null | Kimi |
1672 | 2025-01-20 | Characterization of GPU TEE Overheads in Distributed Data Parallel ML Training | Jonghytun Lee et.al. | 2501.11771 | null | Kimi |
1673 | 2025-01-20 | Early Stopping Bayesian Optimization for Controller Tuning | David Stenger et.al. | 2501.11532 | link | Kimi |
1674 | 2025-01-20 | CatV2TON: Taming Diffusion Transformers for Vision-Based Virtual Try-On with Temporal Concatenation | Zheng Chong et.al. | 2501.11325 | link | Kimi |
1675 | 2025-01-20 | RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems? | Haotian Xu et.al. | 2501.11284 | null | Kimi |
1676 | 2025-01-17 | AraXL: A Physically Scalable, Ultra-Wide RISC-V Vector Processor Design for Fast and Efficient Computation on Long Vectors | Navaneeth Kunhi Purayil et.al. | 2501.10301 | null | Kimi |
1677 | 2025-01-17 | ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario | Lucen Zhong et.al. | 2501.10132 | link | Kimi |
1678 | 2025-01-17 | Multi-Dimensional Vector ISA Extension for Mobile In-Cache Computing | Alireza Khadem et.al. | 2501.09902 | link | Kimi |
1679 | 2025-01-16 | Coded Deep Learning: Framework and Algorithm | En-hui Yang et.al. | 2501.09849 | null | Kimi |
1680 | 2025-01-15 | LeMo: Enabling LEss Token Involvement for MOre Context Fine-tuning | Tuowei Wang et.al. | 2501.09767 | null | Kimi |
1681 | 2025-01-16 | AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation | Junjie He et.al. | 2501.09503 | link | Kimi |
1682 | 2025-01-16 | PICE: A Semantic-Driven Progressive Inference System for LLM Serving in Cloud-Edge Networks | Huiyou Zhan et.al. | 2501.09367 | null | Kimi |
1683 | 2025-01-15 | Doc-Guided Sent2Sent++: A Sent2Sent++ Agent with Doc-Guided memory for Document-level Machine Translation | Jiaxin Guo et.al. | 2501.08523 | null | Kimi |
1684 | 2025-01-14 | Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models | Yifu Qiu et.al. | 2501.08248 | null | Kimi |
1685 | 2025-01-14 | PRESERVE: Prefetching Model Weights and KV-Cache in Distributed LLM Serving | Ahmet Caner Yüzügüler et.al. | 2501.08192 | null | Kimi |
1686 | 2025-01-13 | A Survey of Early Exit Deep Neural Networks in NLP | Divya Jyoti Bajpai et.al. | 2501.07670 | null | Kimi |
1687 | 2025-01-14 | Monotone Curve Estimation via Convex Duality | Tongseok Lim et.al. | 2501.06975 | null | Kimi |
1688 | 2025-01-12 | MPCache: MPC-Friendly KV Cache Eviction for Efficient Private Large Language Model Inference | Wenxuan Zeng et.al. | 2501.06807 | null | Kimi |
1689 | 2025-01-12 | Mell: Memory-Efficient Large Language Model Serving via Multi-GPU KV Cache Management | Liu Qianli et.al. | 2501.06709 | null | Kimi |
1690 | 2025-01-11 | SafeSplit: A Novel Defense Against Client-Side Backdoor Attacks in Split Learning | Phillip Rieger et.al. | 2501.06650 | null | Kimi |
1691 | 2025-01-11 | Guided Code Generation with LLMs: A Multi-Agent Framework for Complex Code Tasks | Amr Almorsi et.al. | 2501.06625 | null | Kimi |
1692 | 2025-01-11 | Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping | Muru Zhang et.al. | 2501.06589 | link | Kimi |
1693 | 2025-01-11 | Tensor Product Attention Is All You Need | Yifan Zhang et.al. | 2501.06425 | link | Kimi |
1694 | 2025-01-10 | Scale-up Unlearnable Examples Learning with High-Performance Computing | Yanfan Zhu et.al. | 2501.06080 | link | Kimi |
1695 | 2025-01-09 | Prediction-Assisted Online Distributed Deep Learning Workload Scheduling in GPU Clusters | Ziyue Luo et.al. | 2501.05563 | null | Kimi |
1696 | 2025-01-09 | LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation | Xi Ye et.al. | 2501.05414 | null | Kimi |
1697 | 2025-01-09 | Euclid: Detecting Solar System objects in Euclid images and classifying them using Kohonen self-organising maps | A. A. Nucita et.al. | 2501.05023 | null | Kimi |
1698 | 2025-01-09 | SyNPar: Synthetic Null Data Parallelism for High-Power False Discovery Rate Control in High-Dimensional Variable Selection | Changhu Wang et.al. | 2501.05012 | null | Kimi |
1699 | 2025-01-09 | TreeKV: Smooth Key-Value Cache Compression with Tree Structures | Ziwei He et.al. | 2501.04987 | null | Kimi |
1700 | 2025-01-08 | Collaborative Inference Acceleration with Non-Penetrative Tensor Partitioning | Zhibang Liu et.al. | 2501.04489 | null | Kimi |
1701 | 2025-01-06 | The Power of Negative Zero: Datatype Customization for Quantized Large Language Models | Yuzong Chen et.al. | 2501.04052 | link | Kimi |
1702 | 2025-01-07 | CoReQA: Uncovering Potentials of Language Models in Code Repository Question Answering | Jialiang Chen et.al. | 2501.03447 | null | Kimi |
1703 | 2025-01-05 | PTEENet: Post-Trained Early-Exit Neural Networks Augmentation for Inference Cost Optimization | Assaf Lahiany et.al. | 2501.02508 | null | Kimi |
1704 | 2025-01-07 | ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling | Chaojie Mao et.al. | 2501.02487 | null | Kimi |
1705 | 2025-01-04 | AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference | Zhuomin He et.al. | 2501.02336 | link | Kimi |
1706 | 2025-01-04 | The Efficiency vs. Accuracy Trade-off: Optimizing RAG-Enhanced LLM Recommender Systems Using Multi-Head Early Exit | Huixue Zhou et.al. | 2501.02173 | null | Kimi |
1707 | 2025-01-03 | Efficient LLM Inference with Activation Checkpointing and Hybrid Caching | Sanghyeon Lee et.al. | 2501.01792 | null | Kimi |
1708 | 2025-01-03 | Data Parallel Visualization and Rendering on the RAMSES Supercomputer with ANARI | Stefan Zellmann et.al. | 2501.01628 | null | Kimi |
1709 | 2025-01-02 | TreeLUT: An Efficient Alternative to Deep Neural Networks for Inference Acceleration Using Gradient Boosted Decision Trees | Alireza Khataei et.al. | 2501.01511 | link | Kimi |
1710 | 2025-01-02 | FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving | Zihao Ye et.al. | 2501.01005 | link | Kimi |
1711 | 2025-01-01 | Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding | Jiajun Zhu et.al. | 2501.00712 | link | Kimi |
1712 | 2025-01-01 | Adjoint sharding for very long context training of state space models | Xingzi Xu et.al. | 2501.00692 | null | Kimi |
1713 | 2024-12-31 | Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing | Peihao Wang et.al. | 2501.00658 | link | Kimi |
1714 | 2024-12-31 | A Study on Context Length and Efficient Transformers for Biomedical Image Analysis | Sarah M. Hooper et.al. | 2501.00619 | null | Kimi |
1715 | 2024-12-31 | VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling | Xinhao Li et.al. | 2501.00574 | link | Kimi |
1716 | 2024-12-30 | CaseSumm: A Large-Scale Dataset for Long-Context Summarization from U.S. Supreme Court Opinions | Mourad Heddaya et.al. | 2501.00097 | null | Kimi |
1717 | 2024-12-30 | Adaptive Batch Size Schedules for Distributed Training of Language Models with Data and Model Parallelism | Tim Tsz-Kit Lau et.al. | 2412.21124 | null | Kimi |
1718 | 2024-12-30 | Align Attention Heads Before Merging Them: An Effective Way for Converting MHA to GQA | Qingyun Jin et.al. | 2412.20677 | null | Kimi |
1719 | 2024-12-29 | ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding | Xiao Wang et.al. | 2412.20504 | link | Kimi |
1720 | 2024-12-29 | TokenRing: An Efficient Parallelism Framework for Infinite-Context LLMs via Bidirectional Communication | Zongwu Wang et.al. | 2412.20501 | link | Kimi |
1721 | 2024-12-29 | NeutronTP: Load-Balanced Distributed Full-Graph GNN Training with Tensor Parallelism | Xin Ai et.al. | 2412.20379 | null | Kimi |
1722 | 2024-12-28 | LoL-PIM: Long-Context LLM Decoding with Scalable DRAM-PIM System | Hyucksung Kwon et.al. | 2412.20166 | null | Kimi |
1723 | 2024-12-28 | ST $^3$ : Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming | Jiedong Zhuang et.al. | 2412.20105 | null | Kimi |
1724 | 2024-12-27 | Goal-oriented Communications based on Recursive Early Exit Neural Networks | Jary Pomponi et.al. | 2412.19587 | null | Kimi |
1725 | 2024-12-27 | StyleRWKV: High-Quality and High-Efficiency Style Transfer with RWKV-like Architecture | Miaomiao Dai et.al. | 2412.19535 | null | Kimi |
1726 | 2025-01-02 | A Survey on Large Language Model Acceleration based on KV Cache Management | Haoyang Li et.al. | 2412.19442 | link | Kimi |
1727 | 2024-12-26 | Performance Control in Early Exiting to Deploy Large Models at the Same Cost of Smaller Ones | Mehrnaz Mofakhami et.al. | 2412.19325 | null | Kimi |
1728 | 2024-12-26 | Multi-matrix Factorization Attention | Jingcheng Hu et.al. | 2412.19255 | null | Kimi |
1729 | 2024-12-26 | Repository Structure-Aware Training Makes SLMs Better Issue Resolver | Zexiong Ma et.al. | 2412.19031 | null | Kimi |
1730 | 2024-12-25 | Long-Range Tasks Using Short-Context LLMs: Incremental Reasoning With Structured Memories | Dulhan Jayalath et.al. | 2412.18914 | null | Kimi |
1731 | 2024-12-25 | Bootstrap Your Own Context Length | Liang Wang et.al. | 2412.18860 | null | Kimi |
1732 | 2024-12-25 | DCIS: Efficient Length Extrapolation of LLMs via Divide-and-Conquer Scaling Factor Search | Lei Yang et.al. | 2412.18811 | link | Kimi |
1733 | 2024-12-24 | Efficient Long Context Language Model Retrieval with Compression | Minju Seo et.al. | 2412.18232 | null | Kimi |
1734 | 2024-12-24 | Adapter Merging with Centroid Prototype Mapping for Scalable Class-Incremental Learning | Takuma Fukuda et.al. | 2412.18219 | link | Kimi |
1735 | 2024-12-24 | KunServe: Elastic and Efficient Large Language Model Serving with Parameter-centric Memory Management | Rongxin Cheng et.al. | 2412.18169 | null | Kimi |
1736 | 2024-12-24 | Beyond Gradient Averaging in Parallel Optimization: Improved Robustness through Gradient Agreement Filtering | Francois Chaubard et.al. | 2412.18052 | link | Kimi |
1737 | 2024-12-23 | Theoretical Constraints on the Expressive Power of $\mathsf{RoPE}$ -based Tensor Attention Transformers | Xiaoyu Li et.al. | 2412.18040 | null | Kimi |
1738 | 2024-12-23 | Deliberation in Latent Space via Differentiable Cache Augmentation | Luyang Liu et.al. | 2412.17747 | null | Kimi |
1739 | 2024-12-24 | YuLan-Mini: An Open Data-efficient Language Model | Yiwen Hu et.al. | 2412.17743 | link | Kimi |
1740 | 2024-12-23 | Improved Cotton Leaf Disease Classification Using Parameter-Efficient Deep Learning Framework | Aswini Kumar Patra et.al. | 2412.17587 | null | Kimi |
1741 | 2024-12-23 | Optimal Convergence Rates for Neural Operators | Mike Nguyen et.al. | 2412.17518 | null | Kimi |
1742 | 2024-12-23 | A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression | Chenlong Deng et.al. | 2412.17483 | null | Kimi |
1743 | 2024-12-23 | MineAgent: Towards Remote-Sensing Mineral Exploration with Multimodal Large Language Models | Beibei Yu et.al. | 2412.17339 | null | Kimi |
1744 | 2024-12-22 | Revisiting In-Context Learning with Long Context Language Models | Jinheon Baek et.al. | 2412.16926 | null | Kimi |
1745 | 2024-12-20 | A survey on FPGA-based accelerator for ML models | Feng Yan et.al. | 2412.15666 | null | Kimi |
1746 | 2024-12-20 | Don’t Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks | Brian J Chan et.al. | 2412.15605 | link | Kimi |
1747 | 2024-12-19 | Systematic Evaluation of Long-Context LLMs on Financial Concepts | Lavanya Gupta et.al. | 2412.15386 | null | Kimi |
1748 | 2024-12-19 | LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks | Yushi Bai et.al. | 2412.15204 | link | Kimi |
1749 | 2024-12-19 | Minimizing speculation overhead in a parallel recognizer for regular texts | Angelo Borsotti et.al. | 2412.14975 | null | Kimi |
1750 | 2024-12-19 | DynamicKV: Task-Aware Adaptive KV Cache Compression for Long Context LLMs | Xiabin Zhou et.al. | 2412.14838 | null | Kimi |
1751 | 2024-12-19 | Sliding Windows Are Not the End: Exploring Full Ranking with Long-Context Large Language Models | Wenhan Liu et.al. | 2412.14574 | link | Kimi |
1752 | 2024-12-19 | HashAttention: Semantic Sparsity for Faster Inference | Aditya Desai et.al. | 2412.14468 | null | Kimi |
1753 | 2024-12-18 | Scaling Deep Learning Training with MPMD Pipeline Parallelism | Anxhelo Xhebraj et.al. | 2412.14374 | null | Kimi |
1754 | 2024-12-18 | ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals | Utkarsh Saxena et.al. | 2412.14363 | link | Kimi |
1755 | 2024-12-18 | State Space Models are Strong Text Rerankers | Zhichao Xu et.al. | 2412.14354 | null | Kimi |
1756 | 2024-12-19 | Online MDP with Transition Prototypes: A Robust Adaptive Approach | Shuo Sun et.al. | 2412.14075 | null | Kimi |
1757 | 2024-12-19 | Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference | Benjamin Warner et.al. | 2412.13663 | link | Kimi |
1758 | 2024-12-18 | SCOPE: Optimizing Key-Value Cache Compression in Long-context Generation | Jialong Wu et.al. | 2412.13649 | link | Kimi |
1759 | 2024-12-18 | LIFT: Improving Long Context Understanding Through Long Input Fine-Tuning | Yansheng Mao et.al. | 2412.13626 | null | Kimi |
1760 | 2024-12-18 | Attention-aware convolutional neural networks for identification of magnetic islands in the tearing mode on EAST tokamak | Feifei Long et.al. | 2412.13498 | null | Kimi |
1761 | 2024-12-18 | Deploying Foundation Model Powered Agent Services: A Survey | Wenchao Xu et.al. | 2412.13437 | null | Kimi |
1762 | 2024-12-17 | COSEE: Consistency-Oriented Signal-Based Early Exiting via Calibrated Sample Weighting Mechanism | Jianing He et.al. | 2412.13236 | link | Kimi |
1763 | 2024-12-17 | GIRAFFE: Design Choices for Extending the Context Length of Visual Language Models | Mukai Li et.al. | 2412.12735 | link | Kimi |
1764 | 2024-12-17 | More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression | Jiebin Zhang et.al. | 2412.12706 | null | Kimi |
1765 | 2024-12-17 | LLMs are Also Effective Embedding Models: An In-depth Overview | Chongyang Tao et.al. | 2412.12591 | null | Kimi |
1766 | 2024-12-17 | PerSphere: A Comprehensive Framework for Multi-Faceted Perspective Retrieval and Summarization | Yun Luo et.al. | 2412.12588 | link | Kimi |
1767 | 2024-12-17 | ITP: Instance-Aware Test Pruning for Out-of-Distribution Detection | Haonan Xu et.al. | 2412.12566 | link | Kimi |
1768 | 2024-12-17 | A System for Microserving of LLMs | Hongyi Jin et.al. | 2412.12488 | null | Kimi |
1769 | 2024-12-17 | Boosting Long-Context Information Seeking via Query-Guided Activation Refilling | Hongjin Qian et.al. | 2412.12486 | link | Kimi |
1770 | 2024-12-17 | Core Context Aware Attention for Long Context Language Modeling | Yaofo Chen et.al. | 2412.12465 | null | Kimi |
1771 | 2024-12-17 | SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator | Guoxuan Chen et.al. | 2412.12094 | link | Kimi |
1772 | 2024-12-16 | SpeechPrune: Context-aware Token Pruning for Speech Information Retrieval | Yueqian Lin et.al. | 2412.12009 | link | Kimi |
1773 | 2024-12-16 | EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents | Mengna Zhu et.al. | 2412.11814 | null | Kimi |
1774 | 2024-12-16 | CSR:Achieving 1 Bit Key-Value Cache via Sparse Representation | Hongxuan Zhang et.al. | 2412.11741 | null | Kimi |
1775 | 2024-12-16 | Ultra-High-Definition Dynamic Multi-Exposure Image Fusion via Infinite Pixel Learning | Xingchi Chen et.al. | 2412.11685 | null | Kimi |
1776 | 2024-12-16 | On the SDP Relaxation of Direct Torque Finite Control Set Model Predictive Control | Luca M. Hartmann et.al. | 2412.11666 | null | Kimi |
1777 | 2024-12-16 | FinLoRA: Finetuning Quantized Financial Large Language Models Using Low-Rank Adaptation | Dannong Wang et.al. | 2412.11378 | link | Kimi |
1778 | 2024-12-15 | Timing of Seven Isolated Pulsars in the Globular Cluster Terzan 1 | Justine Singleton et.al. | 2412.11271 | null | Kimi |
1779 | 2024-12-15 | Wasserstein Bounds for generative diffusion models with Gaussian tail targets | Xixian Wang et.al. | 2412.11251 | null | Kimi |
1780 | 2024-12-15 | ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction | Yi Feng et.al. | 2412.11210 | link | Kimi |
1781 | 2024-12-13 | SCBench: A KV Cache-Centric Analysis of Long-Context Methods | Yucheng Li et.al. | 2412.10319 | null | Kimi |
1782 | 2024-12-13 | Lost in the Middle, and In-Between: Enhancing Language Models’ Ability to Reason Over Long Contexts in Multi-Hop QA | George Arthur Baker et.al. | 2412.10079 | link | Kimi |
1783 | 2024-12-13 | Benchmarking Table Comprehension In The Wild | Yikang Pan et.al. | 2412.09884 | null | Kimi |
1784 | 2024-12-13 | V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding | Junqi Ge et.al. | 2412.09616 | link | Kimi |
1785 | 2024-12-12 | InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions | Pan Zhang et.al. | 2412.09596 | link | Kimi |
1786 | 2024-12-12 | InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption | Tiehan Fan et.al. | 2412.09283 | null | Kimi |
1787 | 2024-12-12 | ZigZagkv: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty | Meizhi Zhong et.al. | 2412.09036 | null | Kimi |
1788 | 2024-12-12 | RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios | Ruiwen Zhou et.al. | 2412.08972 | link | Kimi |
1789 | 2024-12-12 | Lexico: Extreme KV Cache Compression via Sparse Coding over Universal Dictionaries | Junhyuck Kim et.al. | 2412.08890 | link | Kimi |
1790 | 2024-12-11 | TURBOATTENTION: Efficient Attention Approximation For High Throughputs LLMs | Hao Kang et.al. | 2412.08585 | null | Kimi |
1791 | 2024-12-11 | EMS: Adaptive Evict-then-Merge Strategy for Head-wise KV Cache Compression Based on Global-Local Importance | Yingxin Li et.al. | 2412.08521 | null | Kimi |
1792 | 2024-12-10 | From Slow Bidirectional to Fast Causal Video Generators | Tianwei Yin et.al. | 2412.07772 | null | Kimi |
1793 | 2024-12-10 | ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer | Jinyi Hu et.al. | 2412.07720 | link | Kimi |
1794 | 2024-12-09 | FP=xINT:A Low-Bit Series Expansion Algorithm for Post-Training Quantization | Boyang Zhang et.al. | 2412.06865 | null | Kimi |
1795 | 2024-12-09 | Pruning All-Rounder: Rethinking and Improving Inference Efficiency for Large Vision Language Models | Wei Suo et.al. | 2412.06458 | null | Kimi |
1796 | 2024-12-08 | BiDM: Pushing the Limit of Quantization for Diffusion Models | Xingyu Zheng et.al. | 2412.05926 | link | Kimi |
1797 | 2024-12-08 | XKV: Personalized KV Cache Memory Reduction for Long-Context LLM Inference | Weizhuo Li et.al. | 2412.05896 | null | Kimi |
1798 | 2024-12-07 | Batch-Max: Higher LLM Throughput using Larger Batch Sizes and KV Cache Compression | Michael R. Metel et.al. | 2412.05693 | null | Kimi |
1799 | 2024-12-11 | Flash Communication: Reducing Tensor Parallelization Bottleneck for Fast Large Language Model Inference | Qingyuan Li et.al. | 2412.04964 | null | Kimi |
1800 | 2024-12-06 | GUIDE: A Global Unified Inference Engine for Deploying Large Language Models in Heterogeneous Environments | Yanyu Chen et.al. | 2412.04788 | null | Kimi |
1801 | 2024-12-05 | Cross-Self KV Cache Pruning for Efficient Vision-Language Inference | Xiaohuan Pei et.al. | 2412.04652 | link | Kimi |
1802 | 2024-12-05 | votess: A multi-target, GPU-capable, parallel Voronoi tessellator | C. Byrohl et.al. | 2412.04514 | link | Kimi |
1803 | 2024-12-05 | p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay | Jun Zhang et.al. | 2412.04449 | link | Kimi |
1804 | 2024-12-07 | PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation | Ao Wang et.al. | 2412.03409 | link | Kimi |
1805 | 2024-12-04 | ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression | Guangda Liu et.al. | 2412.03213 | null | Kimi |
1806 | 2024-12-04 | Unifying KV Cache Compression for Large Language Models with LeanKV | Yanqi Zhang et.al. | 2412.03131 | null | Kimi |
1807 | 2024-12-04 | Lightweight Multiplane Images Network for Real-Time Stereoscopic Conversion from Planar Video | Shanding Diao et.al. | 2412.03102 | null | Kimi |
1808 | 2024-12-03 | Resource-Adaptive Successive Doubling for Hyperparameter Optimization with Large Datasets on High-Performance Computing Systems | Marcel Aach et.al. | 2412.02729 | link | Kimi |
1809 | 2024-12-03 | Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity | Da Ma et.al. | 2412.02252 | null | Kimi |
1810 | 2024-12-02 | RandAR: Decoder-only Autoregressive Visual Generation in Random Orders | Ziqi Pang et.al. | 2412.01827 | null | Kimi |
1811 | 2024-12-05 | Yi-Lightning Technical Report | 01. AI et.al. | 2412.01253 | null | Kimi |
1812 | 2024-12-02 | INTELLECT-1 Technical Report | Sami Jaghouar et.al. | 2412.01152 | link | Kimi |
1813 | 2024-12-03 | Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification | Wenxuan Huang et.al. | 2412.00876 | link | Kimi |
1814 | 2024-12-01 | MERLIN: Multi-stagE query performance prediction for dynamic paRallel oLap pIpeliNe | Kaixin Zhang et.al. | 2412.00749 | null | Kimi |
1815 | 2024-11-29 | DeMo: Decoupled Momentum Optimization | Bowen Peng et.al. | 2411.19870 | link | Kimi |
1816 | 2024-11-27 | FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model Serving | Ao Shen et.al. | 2411.18424 | null | Kimi |
1817 | 2024-11-28 | MiniKV: Pushing the Limits of LLM Inference via 2-Bit Layer-Discriminative KV Cache | Akshat Sharma et.al. | 2411.18077 | null | Kimi |
1818 | 2024-11-27 | Addressing Architectural Obstacles for Overlay with Stream Network Abstraction | Chengyue Wang et.al. | 2411.17966 | null | Kimi |
1819 | 2024-11-26 | Attamba: Attending To Multi-Token States | Yash Akhauri et.al. | 2411.17685 | link | Kimi |
1820 | 2024-11-26 | Toward High-Performance LLM Serving: A Simulation-Based Approach for Identifying Optimal Parallelism | Yi-Chien Lin et.al. | 2411.17651 | link | Kimi |
1821 | 2024-11-26 | Efficient LLM Inference with I/O-Aware Partial KV Cache Recomputation | Chaoyi Jiang et.al. | 2411.17089 | null | Kimi |
1822 | 2024-11-25 | Lion Cub: Minimizing Communication Overhead in Distributed Lion | Satoki Ishikawa et.al. | 2411.16462 | null | Kimi |
1823 | 2024-11-24 | Hiding Communication Cost in Distributed LLM Training via Micro-batch Co-execution | Haiquan Wang et.al. | 2411.15871 | null | Kimi |
1824 | 2024-11-27 | A Method for Building Large Language Models with Predefined KV Cache Capacity | Zhonghua Yi et.al. | 2411.15785 | null | Kimi |
1825 | 2024-11-22 | DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models | Keda Tao et.al. | 2411.15024 | link | Kimi |
1826 | 2024-11-21 | Functional Array Programming in an Extended Pi-Calculus | Hans Hüttel et.al. | 2411.14579 | null | Kimi |
1827 | 2024-11-22 | Quantization without Tears | Minghao Fu et.al. | 2411.13918 | null | Kimi |
1828 | 2024-11-19 | Faster Multi-GPU Training with PPLL: A Pipeline Parallelism Framework Leveraging Local Learning | Xiuyuan Guo et.al. | 2411.12780 | null | Kimi |
1829 | 2024-11-18 | Parsing Millions of DNS Records per Second | Jeroen Koekkoek et.al. | 2411.12035 | link | Kimi |
1830 | 2024-11-17 | SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration | Jintao Zhang et.al. | 2411.10958 | link | Kimi |
1831 | 2024-11-16 | Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model | Ting Liu et.al. | 2411.10803 | link | Kimi |
1832 | 2024-11-15 | SmoothCache: A Universal Inference Acceleration Technique for Diffusion Transformers | Joseph Liu et.al. | 2411.10510 | link | Kimi |
1833 | 2024-11-14 | Squeezed Attention: Accelerating Long Context Length LLM Inference | Coleman Hooper et.al. | 2411.09688 | link | Kimi |
1834 | 2024-11-15 | Communication Compression for Tensor Parallel LLM Inference | Jan Hansen-Palmus et.al. | 2411.09510 | null | Kimi |
1835 | 2024-11-12 | Towards Low-bit Communication for Tensor Parallel LLM Inference | Harry Dong et.al. | 2411.07942 | null | Kimi |
1836 | 2024-11-11 | Anchor Attention, Small Cache: Code Generation with Large Language Models | Xiangyu Zhang et.al. | 2411.06680 | link | Kimi |
1837 | 2024-11-10 | Accelerating Large Language Model Training with 4D Parallelism and Memory Consumption Estimator | Kazuki Fujii et.al. | 2411.06465 | null | Kimi |
1838 | 2024-11-08 | Balancing Pipeline Parallelism with Vocabulary Parallelism | Man Tsung Yeung et.al. | 2411.05288 | link | Kimi |
1839 | 2024-11-07 | BitNet a4.8: 4-bit Activations for 1-bit LLMs | Hongyu Wang et.al. | 2411.04965 | null | Kimi |
1840 | 2024-11-06 | Stepping Forward on the Last Mile | Chen Feng et.al. | 2411.04036 | null | Kimi |
1841 | 2024-11-05 | TokenSelect: Efficient Long-Context Inference and Length Extrapolation for LLMs via Dynamic Token-Level KV Cache Selection | Wei Wu et.al. | 2411.02886 | null | Kimi |
1842 | 2024-11-05 | DroidSpeak: Enhancing Cross-LLM Communication | Yuhan Liu et.al. | 2411.02820 | null | Kimi |
1843 | 2024-11-04 | “Give Me BF16 or Give Me Death”? Accuracy-Performance Trade-Offs in LLM Quantization | Eldar Kurtic et.al. | 2411.02355 | null | Kimi |
1844 | 2024-11-04 | Real-time and Downtime-tolerant Fault Diagnosis for Railway Turnout Machines (RTMs) Empowered with Cloud-Edge Pipeline Parallelism | Fan Wu et.al. | 2411.02086 | null | Kimi |
1845 | 2024-11-04 | xDiT: an Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism | Jiarui Fang et.al. | 2411.01738 | link | Kimi |
1846 | 2024-11-02 | NEO: Saving GPU Memory Crisis with CPU Offloading for Online LLM Inference | Xuanlin Jiang et.al. | 2411.01142 | null | Kimi |
1847 | 2024-11-01 | MoNTA: Accelerating Mixture-of-Experts Training with Network-Traffc-Aware Parallel Optimization | Jingming Guo et.al. | 2411.00662 | link | Kimi |
1848 | 2024-11-01 | Constrained Diffusion Implicit Models | Vivek Jayaram et.al. | 2411.00359 | null | Kimi |
1849 | 2024-11-05 | SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compile | Ruisi Zhang et.al. | 2411.00284 | null | Kimi |
1850 | 2024-10-31 | Neurobench: DCASE 2020 Acoustic Scene Classification benchmark on XyloAudio 2 | Weijie Ke et.al. | 2410.23776 | null | Kimi |
1851 | 2024-10-31 | ALISE: Accelerating Large Language Model Serving with Speculative Scheduling | Youpeng Zhao et.al. | 2410.23537 | null | Kimi |
1852 | 2024-10-29 | VL-Cache: Sparsity and Modality-Aware KV Cache Compression for Vision-Language Model Inference Acceleration | Dezhan Tu et.al. | 2410.23317 | null | Kimi |
1853 | 2024-10-30 | BUZZ: Beehive-structured Sparse KV Cache with Segmented Heavy Hitters for Efficient LLM Inference | Junqi Zhao et.al. | 2410.23079 | link | Kimi |
1854 | 2024-10-29 | The Impact of Inference Acceleration Strategies on Bias of LLMs | Elisabeth Kirsten et.al. | 2410.22118 | link | Kimi |
1855 | 2024-10-29 | How Does Critical Batch Size Scale in Pre-training? | Hanlin Zhang et.al. | 2410.21676 | link | Kimi |
1856 | 2024-10-28 | ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference | Hanshi Sun et.al. | 2410.21465 | link | Kimi |
1857 | 2024-10-28 | Meta-Learning for Speeding Up Large Model Inference in Decentralized Environments | Yuzhe Yang et.al. | 2410.21340 | null | Kimi |
1858 | 2024-10-28 | Beyond Autoregression: Fast LLMs via Self-Distillation Through Time | Justin Deschenaux et.al. | 2410.21035 | link | Kimi |
1859 | 2024-10-26 | DQRM: Deep Quantized Recommendation Models | Yang Zhou et.al. | 2410.20046 | link | Kimi |
1860 | 2024-10-25 | RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction | Tanqiu Jiang et.al. | 2410.19937 | null | Kimi |
1861 | 2024-10-25 | BitPipe: Bidirectional Interleaved Pipeline Parallelism for Accelerating Large Models Training | Houming Wu et.al. | 2410.19367 | link | Kimi |
1862 | 2024-10-28 | Not All Heads Matter: A Head-Level KV Cache Compression Method with Integrated Retrieval and Reasoning | Yu Fu et.al. | 2410.19258 | link | Kimi |
1863 | 2024-10-24 | KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing | Yifei Yang et.al. | 2410.18517 | link | Kimi |
1864 | 2024-10-24 | The Nature of Mathematical Modeling and Probabilistic Optimization Engineering in Generative AI | Fulu Li et.al. | 2410.18441 | null | Kimi |
1865 | 2024-10-25 | Fast Inference for Augmented Large Language Models | Rana Shahout et.al. | 2410.18248 | null | Kimi |
1866 | 2024-10-23 | Value Residual Learning For Alleviating Attention Concentration In Transformers | Zhanchao Zhou et.al. | 2410.17897 | link | Kimi |
1867 | 2024-10-23 | Markov Chain of Thought for Efficient Mathematical Reasoning | Wen Yang et.al. | 2410.17635 | null | Kimi |
1868 | 2024-10-22 | PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction | Long Xing et.al. | 2410.17247 | link | Kimi |
1869 | 2024-10-21 | MagicPIG: LSH Sampling for Efficient LLM Generation | Zhuoming Chen et.al. | 2410.16179 | link | Kimi |
1870 | 2024-10-21 | Residual vector quantization for KV cache compression in large language model | Ankur Kumar et.al. | 2410.15704 | link | Kimi |
1871 | 2024-10-20 | SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training | Jinda Jia et.al. | 2410.15526 | link | Kimi |
1872 | 2024-10-20 | EPIC: Efficient Position-Independent Context Caching for Serving Large Language Models | Junhao Hu et.al. | 2410.15332 | null | Kimi |
1873 | 2024-10-20 | Lossless KV Cache Compression to 2% | Zhen Yang et.al. | 2410.15252 | null | Kimi |
1874 | 2024-10-19 | Pipeline Gradient-based Model Training on Analog In-memory Accelerators | Zhaoxian Wu et.al. | 2410.15155 | link | Kimi |
1875 | 2024-10-18 | A Systematic Study of Cross-Layer KV Sharing for Efficient LLM Inference | You Wu et.al. | 2410.14442 | link | Kimi |
1876 | 2024-10-23 | TiMePReSt: Time and Memory Efficient Pipeline Parallel DNN Training with Removed Staleness | Ankita Dutta et.al. | 2410.14312 | null | Kimi |
1877 | 2024-10-17 | SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction | Xuan Zhang et.al. | 2410.13846 | link | Kimi |
1878 | 2024-10-17 | AsymKV: Enabling 1-Bit Quantization of KV Cache with Layer-Wise Asymmetric Quantization Configurations | Qian Tao et.al. | 2410.13212 | null | Kimi |
1879 | 2024-10-19 | In-context KV-Cache Eviction for LLMs via Attention-Gate | Zihao Zeng et.al. | 2410.12876 | null | Kimi |
1880 | 2024-10-16 | FiRST: Finetuning Router-Selective Transformers for Input-Adaptive Latency Reduction | Akriti Jain et.al. | 2410.12513 | null | Kimi |
1881 | 2024-10-16 | COMET: Towards Partical W4A4KV4 LLMs Serving | Lian Liu et.al. | 2410.12168 | null | Kimi |
1882 | 2024-10-15 | From promise to practice: realizing high-performance decentralized training | Zesen Wang et.al. | 2410.11998 | null | Kimi |
1883 | 2024-10-15 | QSpec: Speculative Decoding with Complementary Quantization Schemes | Juntao Zhao et.al. | 2410.11305 | null | Kimi |
1884 | 2024-10-14 | DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads | Guangxuan Xiao et.al. | 2410.10819 | link | Kimi |
1885 | 2024-10-14 | When Attention Sink Emerges in Language Models: An Empirical View | Xiangming Gu et.al. | 2410.10781 | link | Kimi |
1886 | 2024-10-14 | Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling | Wenze Liu et.al. | 2410.10511 | link | Kimi |
1887 | 2024-10-15 | EasyRAG: Efficient Retrieval-Augmented Generation Framework for Automated Network Operations | Zhangchi Feng et.al. | 2410.10315 | link | Kimi |
1888 | 2024-10-11 | ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression | Yefei He et.al. | 2410.08584 | null | Kimi |
1889 | 2024-10-10 | KV Prediction for Improved Time to First Token | Maxwell Horton et.al. | 2410.08391 | link | Kimi |
1890 | 2024-10-10 | TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text | Songshuo Lu et.al. | 2410.07590 | link | Kimi |
1891 | 2024-10-09 | SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration | Heming Xia et.al. | 2410.06916 | link | Kimi |
1892 | 2024-10-07 | PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs | Mengzhao Chen et.al. | 2410.05265 | link | Kimi |
1893 | 2024-10-07 | Presto! Distilling Steps and Layers for Accelerating Music Generation | Zachary Novack et.al. | 2410.05167 | null | Kimi |
1894 | 2024-10-07 | TidalDecode: Fast and Accurate LLM Decoding with Position Persistent Sparse Attention | Lijie Yang et.al. | 2410.05076 | link | Kimi |
1895 | 2024-10-07 | Fast State Restoration in LLM Serving with HCache | Shiwei Gao et.al. | 2410.05004 | null | Kimi |
1896 | 2024-10-06 | Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective | Jinhao Li et.al. | 2410.04466 | null | Kimi |
1897 | 2024-10-04 | SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation | Aurick Qiao et.al. | 2410.03960 | null | Kimi |
1898 | 2024-10-04 | LoRC: Low-Rank Compression for LLMs KV Cache with a Progressive Compression Strategy | Rongzhi Zhang et.al. | 2410.03111 | null | Kimi |
1899 | 2024-10-04 | UNComp: Uncertainty-Aware Long-Context Compressor for Efficient Large Language Model Inference | Jing Xiong et.al. | 2410.03090 | null | Kimi |
1900 | 2024-10-09 | LEGO: QEC Decoding System Architecture for Dynamic Circuits | Yue Wu et.al. | 2410.03073 | null | Kimi |
1901 | 2024-10-04 | Compute Or Load KV Cache? Why Not Both? | Shuowei Jin et.al. | 2410.03065 | null | Kimi |
1902 | 2024-10-03 | EinDecomp: Decomposition of Declaratively-Specified Machine Learning and Numerical Computations for Parallel Execution | Daniel Bourgeois et.al. | 2410.02682 | null | Kimi |
1903 | 2024-10-03 | SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration | Jintao Zhang et.al. | 2410.02367 | link | Kimi |
1904 | 2024-10-02 | Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads | Yuxiang Huang et.al. | 2410.01805 | link | Kimi |
1905 | 2024-10-02 | InfiniPot: Infinite Context Processing on Memory-Constrained LLMs | Minsoo Kim et.al. | 2410.01518 | null | Kimi |
1906 | 2024-10-02 | A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts | Suyu Ge et.al. | 2410.01485 | null | Kimi |
1907 | 2024-10-01 | Developing a BLAS library for the AMD AI Engine | Tristan Laan et.al. | 2410.00825 | null | Kimi |
1908 | 2024-10-01 | TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices | Zonghang Li et.al. | 2410.00531 | link | Kimi |
1909 | 2024-10-01 | LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management | Yi Xiong et.al. | 2410.00428 | null | Kimi |
1910 | 2024-09-30 | KV-Compress: Paged KV-Cache Compression with Variable Compression Rates per Attention Head | Isaac Rehg et.al. | 2410.00161 | link | Kimi |
1911 | 2024-09-30 | The Early Bird Catches the Leak: Unveiling Timing Side Channels in LLM Serving Systems | Linke Song et.al. | 2409.20002 | null | Kimi |
1912 | 2024-09-27 | Toward Greener Matrix Operations by Lossless Compressed Formats | Francesco Tosoni et.al. | 2409.18620 | link | Kimi |
1913 | 2024-09-26 | Efficient Arbitrary Precision Acceleration for Large Language Models on GPU Tensor Cores | Shaobo Ma et.al. | 2409.17870 | null | Kimi |
1914 | 2024-09-25 | Search for Efficient Large Language Models | Xuan Shen et.al. | 2409.17372 | link | Kimi |
1915 | 2024-09-25 | Mnemosyne: Parallelization Strategies for Efficiently Serving Multi-Million Context Length LLM Inference Requests Without Approximations | Amey Agrawal et.al. | 2409.17264 | null | Kimi |
1916 | 2024-09-25 | AlignedKV: Reducing Memory Access of KV-Cache with Precision-Aligned Quantization | Yifan Tan et.al. | 2409.16546 | link | Kimi |
1917 | 2024-09-25 | A QoE-Aware Split Inference Accelerating Algorithm for NOMA-based Edge Intelligence | Xin Yuan et.al. | 2409.16537 | null | Kimi |
1918 | 2024-09-23 | CSPS: A Communication-Efficient Sequence-Parallelism based Serving System for Transformer based Models with Long Prompts | Zeyu Zhang et.al. | 2409.15104 | null | Kimi |
1919 | 2024-09-23 | Inference-Friendly Models With MixAttention | Shashank Rajput et.al. | 2409.15012 | null | Kimi |
1920 | 2024-09-23 | Mutation-Based Deep Learning Framework Testing Method in JavaScript Environment | Yinglong Zou et.al. | 2409.14968 | null | Kimi |
1921 | 2024-09-16 | Do Large Language Models Need a Content Delivery Network? | Yihua Cheng et.al. | 2409.13761 | link | Kimi |
1922 | 2024-09-20 | Time Distributed Deep Learning models for Purely Exogenous Forecasting. Application to Water Table Depth Prediction using Weather Image Time Series | Matteo Salis et.al. | 2409.13284 | null | Kimi |
1923 | 2024-09-23 | CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs | Junlin Lv et.al. | 2409.12490 | link | Kimi |
1924 | 2024-09-04 | ISO: Overlap of Computation and Communication within Seqenence For LLM Inference | Bin Xiao et.al. | 2409.11155 | null | Kimi |
1925 | 2024-09-17 | KVPruner: Structural Pruning for Faster and Memory-Efficient Large Language Models | Bo Lv et.al. | 2409.11057 | null | Kimi |
1926 | 2024-09-21 | CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios | Luning Wang et.al. | 2409.10593 | link | Kimi |
1927 | 2024-09-14 | A Dynamic Weighting Strategy to Mitigate Worker Node Failure in Distributed Deep Learning | Yuesheng Xu et.al. | 2409.09242 | null | Kimi |
1928 | 2024-09-11 | Inf-MLLM: Efficient Streaming Inference of Multimodal Large Language Models on a Single GPU | Zhenyu Ning et.al. | 2409.09086 | null | Kimi |
1929 | 2024-09-13 | SGFormer: Single-Layer Graph Transformers with Approximation-Free Linear Complexity | Qitian Wu et.al. | 2409.09007 | link | Kimi |
1930 | 2024-09-11 | Learning to Compress Contexts for Efficient Knowledge-based Visual Question Answering | Weixi Weng et.al. | 2409.07331 | null | Kimi |
1931 | 2024-09-11 | FreeRide: Harvesting Bubbles in Pipeline Parallelism | Jiashu Zhang et.al. | 2409.06941 | null | Kimi |
1932 | 2024-09-09 | DFabric: Scaling Out Data Parallel Applications with CXL-Ethernet Hybrid Interconnects | Xu Zhang et.al. | 2409.05404 | null | Kimi |
1933 | 2024-09-08 | InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference | Xiurui Pan et.al. | 2409.04992 | null | Kimi |
1934 | 2024-09-04 | Accelerating Large Language Model Training with Hybrid GPU-based Compression | Lang Xu et.al. | 2409.02423 | null | Kimi |
1935 | 2024-09-03 | Contemporary Model Compression on Large Language Models Inference | Dong Liu et.al. | 2409.01990 | link | Kimi |
1936 | 2024-09-03 | On-chain Validation of Tracking Data Messages (TDM) Using Distributed Deep Learning on a Proof of Stake (PoS) Blockchain | Yasir Latif et.al. | 2409.01614 | null | Kimi |
1937 | 2024-09-02 | LuWu: An End-to-End In-Network Out-of-Core Optimizer for 100B-Scale Model-in-Network Data-Parallel Training on Distributed GPUs | Mo Sun et.al. | 2409.00918 | null | Kimi |
1938 | 2024-08-26 | Model Parallel Training and Transfer Learning for Convolutional Neural Networks by Domain Decomposition | Axel Klawonn et.al. | 2408.14442 | null | Kimi |
1939 | 2024-08-23 | Network-Offloaded Bandwidth-Optimal Broadcast and Allgather for Distributed AI | Mikhail Khalilov et.al. | 2408.13356 | null | Kimi |
1940 | 2024-08-22 | LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with Inference Acceleration via Latent Consistency Distillation | Shihao Chen et.al. | 2408.12354 | null | Kimi |
1941 | 2024-08-23 | MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding | Jian Chen et.al. | 2408.11049 | link | Kimi |
1942 | 2024-08-20 | Security Assessment of Hierarchical Federated Deep Learning | D Alqattan et.al. | 2408.10752 | link | Kimi |
1943 | 2024-08-20 | Pluto and Charon: A Time and Memory Efficient Collaborative Edge AI Framework for Personal LLMs Fine-Tuning | Bei Ouyang et.al. | 2408.10746 | null | Kimi |
1944 | 2024-08-21 | LongVILA: Scaling Long-Context Visual Language Models for Long Videos | Fuzhao Xue et.al. | 2408.10188 | link | Kimi |
1945 | 2024-08-17 | RepControlNet: ControlNet Reparameterization | Zhaoli Deng et.al. | 2408.09240 | null | Kimi |
1946 | 2024-08-17 | Atlas: Hierarchical Partitioning for Quantum Circuit Simulation on GPUs (Extended Version) | Mingkuan Xu et.al. | 2408.09055 | null | Kimi |
1947 | 2024-08-23 | ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models | Chao Zeng et.al. | 2408.08554 | link | Kimi |
1948 | 2024-08-16 | Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models | Jerry Huang et.al. | 2408.08470 | null | Kimi |
1949 | 2024-08-15 | Asteroid: Resource-Efficient Hybrid Pipeline Parallelism for Collaborative DNN Training on Heterogeneous Edge Devices | Shengyuan Ye et.al. | 2408.08015 | null | Kimi |
1950 | 2024-08-17 | Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference | Rohan Baskar Prabhakar et.al. | 2408.07802 | null | Kimi |
1951 | 2024-08-18 | Post-Training Sparse Attention with Double Sparsity | Shuo Yang et.al. | 2408.07092 | link | Kimi |
1952 | 2024-08-12 | LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration | Zhiwen Mo et.al. | 2408.06003 | null | Kimi |
1953 | 2024-08-10 | Eigen Attention: Attention in Low-Rank Space for KV Cache Compression | Utkarsh Saxena et.al. | 2408.05646 | link | Kimi |
1954 | 2024-08-05 | SLO-aware GPU Frequency Scaling for Energy Efficient LLM Inference Serving | Andreas Kosmas Kakolyris et.al. | 2408.05235 | null | Kimi |
1955 | 2024-08-08 | Partial Experts Checkpoint: Efficient Fault Tolerance for Sparse Mixture-of-Experts Model Training | Weilin Cai et.al. | 2408.04307 | null | Kimi |
1956 | 2024-08-07 | Zero-Delay QKV Compression for Mitigating KV Cache and Network Bottlenecks in LLM Inference | Zeyu Zhang et.al. | 2408.04107 | null | Kimi |
1957 | 2024-08-08 | NACL: A General and Effective KV Cache Eviction Framework for LLMs at Inference Time | Yilong Chen et.al. | 2408.03675 | link | Kimi |
1958 | 2024-08-04 | Cross-layer Attention Sharing for Large Language Models | Yongyu Mu et.al. | 2408.01890 | null | Kimi |
1959 | 2024-08-01 | Intermittent Semi-working Mask: A New Masking Paradigm for LLMs | Mingcong Lu et.al. | 2408.00539 | null | Kimi |
1960 | 2024-08-13 | Finch: Prompt-guided Key-Value Cache Compression | Giulio Corallo et.al. | 2408.00167 | null | Kimi |
1961 | 2024-07-31 | EdgeLLM: A Highly Efficient CPU-FPGA Heterogeneous Edge Accelerator for Large Language Models | Mingqiang Huang et.al. | 2407.21325 | null | Kimi |
1962 | 2024-07-30 | Palu: Compressing KV-Cache with Low-Rank Projection | Chi-Chih Chang et.al. | 2407.21118 | link | Kimi |
1963 | 2024-07-30 | ThinK: Thinner Key Cache by Query-Driven Pruning | Yuhui Xu et.al. | 2407.21018 | null | Kimi |
1964 | 2024-07-31 | A2SF: Accumulative Attention Scoring with Forgetting Factor for Token Pruning in Transformer Decoder | Hyun-rae Jo et.al. | 2407.20485 | null | Kimi |
1965 | 2024-07-25 | An Efficient Inference Framework for Early-exit Large Language Models | Ruijie Miao et.al. | 2407.20272 | null | Kimi |
1966 | 2024-07-29 | When to Stop? Towards Efficient Code Generation in LLMs with Excess Token Prevention | Lianghong Guo et.al. | 2407.20042 | link | Kimi |
1967 | 2024-07-29 | Inference acceleration for large language models using “stairs” assisted greedy generation | Domas Grigaliūnas et.al. | 2407.19947 | null | Kimi |
1968 | 2024-07-29 | Rina: Enhancing Ring-AllReduce with In-network Aggregation in Distributed Model Training | Zixuan Chen et.al. | 2407.19721 | null | Kimi |
1969 | 2024-07-25 | Efficient Inference of Vision Instruction-Following Models with Elastic Cache | Zuyan Liu et.al. | 2407.18121 | link | Kimi |
1970 | 2024-07-28 | Keep the Cost Down: A Review on Methods to Optimize LLM’ s KV-Cache Consumption | Luohe Shi et.al. | 2407.18003 | null | Kimi |
1971 | 2024-07-25 | Efficient LLM Training and Serving with Heterogeneous Context Sharding among Attention Heads | Xihui Lin et.al. | 2407.17678 | null | Kimi |
1972 | 2024-07-23 | A deeper look at depth pruning of LLMs | Shoaib Ahmed Siddiqui et.al. | 2407.16286 | link | Kimi |
1973 | 2024-07-22 | RazorAttention: Efficient KV Cache Compression Through Retrieval Heads | Hanlin Tang et.al. | 2407.15891 | null | Kimi |
1974 | 2024-07-22 | AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description | Junyu Xie et.al. | 2407.15850 | link | Kimi |
1975 | 2024-07-22 | LLMmap: Fingerprinting For Large Language Models | Dario Pasquini et.al. | 2407.15847 | link | Kimi |
1976 | 2024-07-22 | CarFormer: Self-Driving with Learned Object-Centric Representations | Shadi Hamdan et.al. | 2407.15843 | null | Kimi |
1977 | 2024-07-22 | SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models | Mingze Xu et.al. | 2407.15841 | link | Kimi |
1978 | 2024-07-22 | MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity | Yangzhou Liu et.al. | 2407.15838 | link | Kimi |
1979 | 2024-07-22 | dMel: Speech Tokenization made Simple | He Bai et.al. | 2407.15835 | null | Kimi |
1980 | 2024-07-22 | Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight | Ziyuan Huang et.al. | 2407.15819 | null | Kimi |
1981 | 2024-07-23 | A simple and fast C++ thread pool implementation capable of running task graphs | Dmytro Puyda et.al. | 2407.15805 | link | Kimi |
1982 | 2024-07-22 | Robust Facial Reactions Generation: An Emotion-Aware Framework with Modality Compensation | Guanyu Hu et.al. | 2407.15798 | null | Kimi |
1983 | 2024-07-22 | Extracting Structured Insights from Financial News: An Augmented LLM Driven Approach | Rian Dolphin et.al. | 2407.15788 | null | Kimi |
1984 | 2024-07-22 | Parallel Split Learning with Global Sampling | Mohammad Kohankhaki et.al. | 2407.15738 | link | Kimi |
1985 | 2024-07-22 | vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving | Jiale Xu et.al. | 2407.15309 | link | Kimi |
1986 | 2024-07-19 | Performance Modeling and Workload Analysis of Distributed Large Language Model Training and Inference | Joyjit Kundu et.al. | 2407.14645 | null | Kimi |
1987 | 2024-07-19 | Internal Consistency and Self-Feedback in Large Language Models: A Survey | Xun Liang et.al. | 2407.14507 | link | Kimi |
1988 | 2024-07-19 | On Pre-training of Multimodal Language Models Customized for Chart Understanding | Wan-Cyuan Fan et.al. | 2407.14506 | null | Kimi |
1989 | 2024-07-19 | PD-TPE: Parallel Decoder with Text-guided Position Encoding for 3D Visual Grounding | Chenshu Hou et.al. | 2407.14491 | null | Kimi |
1990 | 2024-07-19 | Evaluating the Reliability of Self-Explanations in Large Language Models | Korbinian Randl et.al. | 2407.14487 | link | Kimi |
1991 | 2024-07-19 | Contrastive Learning with Counterfactual Explanations for Radiology Report Generation | Mingjie Li et.al. | 2407.14474 | null | Kimi |
1992 | 2024-07-19 | Check-Eval: A Checklist-based Approach for Evaluating Text Quality | Jayr Pereira et.al. | 2407.14467 | null | Kimi |
1993 | 2024-07-19 | AttentNet: Fully Convolutional 3D Attention for Lung Nodule Detection | Majedaldein Almahasneh et.al. | 2407.14464 | null | Kimi |
1994 | 2024-07-19 | PolyFormer: Scalable Node-wise Filters via Polynomial Graph Transformer | Jiahong Ma et.al. | 2407.14459 | link | Kimi |
1995 | 2024-07-19 | Undermining Mental Proof: How AI Can Make Cooperation Harder by Making Thinking Easier | Zachary Wojtowicz et.al. | 2407.14452 | null | Kimi |
1996 | 2024-07-19 | From Instruction to Insight: Exploring the Functional and Semantic Roles of Text in Interactive Dashboards | Nicole Sultanum et.al. | 2407.14451 | null | Kimi |
1997 | 2024-07-19 | LoAS: Fully Temporal-Parallel Datatflow for Dual-Sparse Spiking Neural Networks | Ruokai Yin et.al. | 2407.14073 | link | Kimi |
1998 | 2024-07-19 | LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference | Qichen Fu et.al. | 2407.14057 | null | Kimi |
1999 | 2024-07-18 | SegPoint: Segment Any Point Cloud via Large Language Model | Shuting He et.al. | 2407.13761 | null | Kimi |
2000 | 2024-07-18 | Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models | Zhuo Chen et.al. | 2407.13757 | null | Kimi |
2001 | 2024-07-18 | CellularLint: A Systematic Approach to Identify Inconsistent Behavior in Cellular Network Specifications | Mirza Masfiqur Rahman et.al. | 2407.13742 | null | Kimi |
2002 | 2024-07-18 | Baba Is AI: Break the Rules to Beat the Benchmark | Nathan Cloos et.al. | 2407.13729 | null | Kimi |
2003 | 2024-07-18 | Compressing Structured Tensor Algebra | Mahdi Ghorbani et.al. | 2407.13726 | null | Kimi |
2004 | 2024-07-18 | CoDefeater: Using LLMs To Find Defeaters in Assurance Cases | Usman Gohar et.al. | 2407.13717 | link | Kimi |
2005 | 2024-07-18 | Attention Based Simple Primitives for Open World Compositional Zero-Shot Learning | Ans Munir et.al. | 2407.13715 | link | Kimi |
2006 | 2024-07-18 | Understanding Reference Policies in Direct Preference Optimization | Yixin Liu et.al. | 2407.13709 | link | Kimi |
2007 | 2024-07-18 | ANHALTEN: Cross-Lingual Transfer for German Token-Level Reference-Free Hallucination Detection | Janek Herrlein et.al. | 2407.13702 | link | Kimi |
2008 | 2024-07-18 | Cross-Task Attack: A Self-Supervision Generative Framework Based on Attention Shift | Qingyuan Zeng et.al. | 2407.13700 | null | Kimi |
2009 | 2024-07-17 | Analysis of Crab X-ray Polarization using Deeper IXPE Observations | Josephine Wong et.al. | 2407.12779 | null | Kimi |
2010 | 2024-07-17 | The BRST quantisation of chiral BMS-like field theories | José Figueroa-O’Farrill et.al. | 2407.12778 | null | Kimi |
2011 | 2024-07-17 | Jigsaw Game: Federated Clustering | Jinxuan Xu et.al. | 2407.12764 | null | Kimi |
2012 | 2024-07-17 | LookupViT: Compressing visual information to a limited number of tokens | Rajat Koner et.al. | 2407.12753 | null | Kimi |
2013 | 2024-07-17 | CHOSEN: Compilation to Hardware Optimization Stack for Efficient Vision Transformer Inference | Mohammad Erfan Sadeghi et.al. | 2407.12736 | null | Kimi |
2014 | 2024-07-17 | EchoSight: Advancing Visual-Language Models with Wiki Knowledge | Yibin Yan et.al. | 2407.12735 | null | Kimi |
2015 | 2024-07-17 | FlexFL: Heterogeneous Federated Learning via APoZ-Guided Flexible Pruning in Uncertain Scenarios | Zekai Chen et.al. | 2407.12729 | null | Kimi |
2016 | 2024-07-17 | Exploring the interplay of individual traits and interaction dynamics in preschool social networks | Gülşah Akçakır et.al. | 2407.12728 | null | Kimi |
2017 | 2024-07-17 | NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model | Zhongqun Zhang et.al. | 2407.12727 | null | Kimi |
2018 | 2024-07-17 | Is Sarcasm Detection A Step-by-Step Reasoning Process in Large Language Models? | Ben Yao et.al. | 2407.12725 | null | Kimi |
2019 | 2024-07-16 | GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression | Daniel Goldstein et.al. | 2407.12077 | link | Kimi |
2020 | 2024-07-16 | Hydra: Brokering Cloud and HPC Resources to Support the Execution of Heterogeneous Workloads at Scale | Aymen Alsaadi et.al. | 2407.11967 | null | Kimi |
2021 | 2024-07-16 | UrbanWorld: An Urban World Model for 3D City Generation | Yu Shang et.al. | 2407.11965 | link | Kimi |
2022 | 2024-07-16 | NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window? | Mo Li et.al. | 2407.11963 | link | Kimi |
2023 | 2024-07-17 | Hierarchical Separable Video Transformer for Snapshot Compressive Imaging | Ping Wang et.al. | 2407.11946 | link | Kimi |
2024 | 2024-07-16 | Min-max theory and existence of H-spheres with arbitrary codimensions | Rui Gao et.al. | 2407.11945 | null | Kimi |
2025 | 2024-07-16 | Beyond Spatial Explanations: Explainable Face Recognition in the Frequency Domain | Marco Huber et.al. | 2407.11941 | null | Kimi |
2026 | 2024-07-16 | Generalized Difference-in-Differences | Yiqing Xu et.al. | 2407.11937 | null | Kimi |
2027 | 2024-07-16 | Learning Multi-view Anomaly Detection | Haoyang He et.al. | 2407.11935 | null | Kimi |
2028 | 2024-07-16 | Code Documentation and Analysis to Secure Software Development | Paul Attie et.al. | 2407.11934 | null | Kimi |
2029 | 2024-07-16 | What’s Wrong? Refining Meeting Summaries with LLM Feedback | Frederic Kirstein et.al. | 2407.11919 | null | Kimi |
2030 | 2024-07-16 | PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation | Branden Butler et.al. | 2407.11798 | null | Kimi |
2031 | 2024-07-21 | Ada-KV: Optimizing KV Cache Eviction by Adaptive Budget Allocation for Efficient LLM Inference | Yuan Feng et.al. | 2407.11550 | link | Kimi |
2032 | 2024-07-15 | VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation | Bocheng Zou et.al. | 2407.10972 | link | Kimi |
2033 | 2024-07-15 | Q-Sparse: All Large Language Models can be Fully Sparsely-Activated | Hongyu Wang et.al. | 2407.10969 | null | Kimi |
2034 | 2024-07-15 | Induction of non-Fermi liquids by critical cavity photons at the onset of superradiance | Ipsita Mandal et.al. | 2407.10963 | null | Kimi |
2035 | 2024-07-15 | Fast Matrix Multiplications for Lookup Table-Quantized LLMs | Han Guo et.al. | 2407.10960 | link | Kimi |
2036 | 2024-07-15 | InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models | Nirat Saini et.al. | 2407.10958 | null | Kimi |
2037 | 2024-07-15 | MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language Models | Chengguang Gan et.al. | 2407.10953 | null | Kimi |
2038 | 2024-07-15 | The infamous 95 GeV $\rm b \bar b$ excess at LEP: Two b or not two b? | Patrick Janot et.al. | 2407.10948 | null | Kimi |
2039 | 2024-07-15 | Can Textual Semantics Mitigate Sounding Object Segmentation Preference? | Yaoting Wang et.al. | 2407.10947 | link | Kimi |
2040 | 2024-07-15 | GRUtopia: Dream General Robots in a City at Scale | Hanqing Wang et.al. | 2407.10943 | link | Kimi |
2041 | 2024-07-15 | IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation | Yuanhao Zhai et.al. | 2407.10937 | link | Kimi |
2042 | 2024-07-12 | FairyLandAI: Personalized Fairy Tales utilizing ChatGPT and DALLE-3 | Georgios Makridis et.al. | 2407.09467 | null | Kimi |
2043 | 2024-07-12 | Human-like Episodic Memory for Infinite Context LLMs | Zafeirios Fountas et.al. | 2407.09450 | link | Kimi |
2044 | 2024-07-12 | ASTPrompter: Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts | Amelia F. Hardy et.al. | 2407.09447 | link | Kimi |
2045 | 2024-07-12 | MUSCLE: A Model Update Strategy for Compatible LLM Evolution | Jessica Echterhoff et.al. | 2407.09435 | null | Kimi |
2046 | 2024-07-12 | Open (Clinical) LLMs are Sensitive to Instruction Phrasings | Alberto Mario Ceballos Arroyo et.al. | 2407.09429 | link | Kimi |
2047 | 2024-07-12 | TelecomGPT: A Framework to Build Telecom-Specfic Large Language Models | Hang Zou et.al. | 2407.09424 | null | Kimi |
2048 | 2024-07-12 | Mitigating Entity-Level Hallucination in Large Language Models | Weihang Su et.al. | 2407.09417 | link | Kimi |
2049 | 2024-07-12 | SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers | Shraman Pramanick et.al. | 2407.09413 | link | Kimi |
2050 | 2024-07-12 | Thunderbolt: Causal Concurrent Consensus and Execution | Junchao Chen et.al. | 2407.09409 | null | Kimi |
2051 | 2024-07-12 | PersonaRAG: Enhancing Retrieval-Augmented Generation Systems with User-Centric Agents | Saber Zerhoudi et.al. | 2407.09394 | link | Kimi |
2052 | 2024-07-11 | MAVIS: Mathematical Visual Instruction Tuning | Renrui Zhang et.al. | 2407.08739 | link | Kimi |
2053 | 2024-07-11 | Real-Time Anomaly Detection and Reactive Planning with Large Language Models | Rohan Sinha et.al. | 2407.08735 | null | Kimi |
2054 | 2024-07-11 | Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist | Zihao Zhou et.al. | 2407.08733 | null | Kimi |
2055 | 2024-07-11 | Planar decomposition of the HOMFLY polynomial for bipartite knots and links | A. Anokhina et.al. | 2407.08724 | null | Kimi |
2056 | 2024-07-11 | A Taxonomy for Data Contamination in Large Language Models | Medha Palavalli et.al. | 2407.08716 | null | Kimi |
2057 | 2024-07-11 | GTA: A Benchmark for General Tool Agents | Jize Wang et.al. | 2407.08713 | link | Kimi |
2058 | 2024-07-11 | Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models | Zhening Xing et.al. | 2407.08701 | null | Kimi |
2059 | 2024-07-11 | Flex-TPU: A Flexible TPU with Runtime Reconfigurable Dataflow Architecture | Mohammed Elbtity et.al. | 2407.08700 | null | Kimi |
2060 | 2024-07-11 | Mitigating Catastrophic Forgetting in Language Transfer via Model Merging | Anton Alexandrov et.al. | 2407.08699 | null | Kimi |
2061 | 2024-07-11 | Patterns of link reciprocity in directed, signed networks | Anna Gallo et.al. | 2407.08697 | null | Kimi |
2062 | 2024-07-10 | Training on the Test Task Confounds Evaluation and Emergence | Ricardo Dominguez-Olmedo et.al. | 2407.07890 | link | Kimi |
2063 | 2024-07-10 | Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization | Junkang Wu et.al. | 2407.07880 | link | Kimi |
2064 | 2024-07-10 | Bound States in Continuum via Singular Transfer Matrices | Ovidiu-Zeno Lipan et.al. | 2407.07879 | null | Kimi |
2065 | 2024-07-10 | FACTS About Building Retrieval Augmented Generation-based Chatbots | Rama Akkiraju et.al. | 2407.07858 | null | Kimi |
2066 | 2024-07-10 | OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training | Sami Jaghouar et.al. | 2407.07852 | link | Kimi |
2067 | 2024-07-10 | Harnessing Integrated CPU-GPU System Memory for HPC: a first look into Grace Hopper | Gabin Schieffer et.al. | 2407.07850 | null | Kimi |
2068 | 2024-07-10 | Natural Language Mechanisms via Self-Resolution with Foundation Models | Nicolas Della Penna et.al. | 2407.07845 | null | Kimi |
2069 | 2024-07-10 | Study on Aspect Ratio Variability toward Robustness of Vision Transformer-based Vehicle Re-identification | Mei Qiu et.al. | 2407.07842 | null | Kimi |
2070 | 2024-07-10 | Transformer Alignment in Large Language Models | Murdock Aubry et.al. | 2407.07810 | null | Kimi |
2071 | 2024-07-10 | Attribute or Abstain: Large Language Models as Long Document Assistants | Jan Buchmann et.al. | 2407.07799 | link | Kimi |
2072 | 2024-07-09 | AnyTaskTune: Advanced Domain-Specific Solutions through Task-Fine-Tuning | Jiaxi Cui et.al. | 2407.07094 | link | Kimi |
2073 | 2024-07-09 | FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation | Liqun Ma et.al. | 2407.07093 | link | Kimi |
2074 | 2024-07-09 | Fine-Tuning Linear Layers Only Is a Simple yet Effective Way for Task Arithmetic | Ruochen Jin et.al. | 2407.07089 | link | Kimi |
2075 | 2024-07-09 | Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models | Logan Cross et.al. | 2407.07086 | link | Kimi |
2076 | 2024-07-09 | Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities | Shaltiel Shmidman et.al. | 2407.07080 | null | Kimi |
2077 | 2024-07-09 | ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction | Shaozhe Hao et.al. | 2407.07077 | link | Kimi |
2078 | 2024-07-09 | Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps | Yung-Sung Chuang et.al. | 2407.07071 | link | Kimi |
2079 | 2024-07-09 | Prompting Techniques for Secure Code Generation: A Systematic Investigation | Catherine Tony et.al. | 2407.07064 | null | Kimi |
2080 | 2024-07-09 | Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence | Weize Chen et.al. | 2407.07061 | link | Kimi |
2081 | 2024-07-09 | CAPformer: Compression-Aware Pre-trained Transformer for Low-Light Image Enhancement | Wang Wei et.al. | 2407.07056 | null | Kimi |
2082 | 2024-07-08 | Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision | Orr Zohar et.al. | 2407.06189 | link | Kimi |
2083 | 2024-07-08 | CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation | Xinying Guo et.al. | 2407.06188 | null | Kimi |
2084 | 2024-07-08 | Left-Linear Rewriting in Adhesive Categories | Paolo Baldan et.al. | 2407.06181 | null | Kimi |
2085 | 2024-07-08 | The Tug-of-War Between Deepfake Generation and Detection | Hannah Lee et.al. | 2407.06174 | null | Kimi |
2086 | 2024-07-08 | On Speeding Up Language Model Evaluation | Jin Peng Zhou et.al. | 2407.06172 | null | Kimi |
2087 | 2024-07-08 | Inevitable Endgame of Comet Tsuchinshan-ATLAS (C/2023 A3) | Zdenek Sekanina et.al. | 2407.06166 | null | Kimi |
2088 | 2024-07-08 | What’s Wrong with Your Code Generated by Large Language Models? An Extensive Study | Shihan Dou et.al. | 2407.06153 | null | Kimi |
2089 | 2024-07-08 | WIBACong: An Argument-centric Framework for Understanding US Congressional Hearings | Arman Irani et.al. | 2407.06149 | null | Kimi |
2090 | 2024-07-08 | Using Grammar Masking to Ensure Syntactic Validity in LLM-based Modeling Tasks | Lukas Netz et.al. | 2407.06146 | null | Kimi |
2091 | 2024-07-08 | ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation | Ethan Chern et.al. | 2407.06135 | link | Kimi |
2092 | 2024-07-05 | LaRa: Efficient Large-Baseline Radiance Fields | Anpei Chen et.al. | 2407.04699 | null | Kimi |
2093 | 2024-07-05 | Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs | Rudolf Laine et.al. | 2407.04694 | link | Kimi |
2094 | 2024-07-05 | ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models | Yuzhe Gu et.al. | 2407.04693 | link | Kimi |
2095 | 2024-07-05 | Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge | Yuanze Lin et.al. | 2407.04681 | null | Kimi |
2096 | 2024-07-05 | Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition | Ye Bai et.al. | 2407.04675 | null | Kimi |
2097 | 2024-07-05 | Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement | Yongji Wu et.al. | 2407.04656 | null | Kimi |
2098 | 2024-07-05 | Entity Decomposition with Filtering: A Zero-Shot Clinical Named Entity Recognition Framework | Reza Averly et.al. | 2407.04629 | null | Kimi |
2099 | 2024-07-05 | On scalable oversight with weak LLMs judging strong LLMs | Zachary Kenton et.al. | 2407.04622 | null | Kimi |
2100 | 2024-07-08 | OneRestore: A Universal Restoration Framework for Composite Degradation | Yu Guo et.al. | 2407.04621 | link | Kimi |
2101 | 2024-07-05 | Learning to (Learn at Test Time): RNNs with Expressive Hidden States | Yu Sun et.al. | 2407.04620 | link | Kimi |
2102 | 2024-07-03 | Universal Length Generalization with Turing Programs | Kaiying Hou et.al. | 2407.03310 | null | Kimi |
2103 | 2024-07-03 | Eyes on the Game: Deciphering Implicit Human Signals to Infer Human Proficiency, Trust, and Intent | Nikhil Hulle et.al. | 2407.03298 | null | Kimi |
2104 | 2024-07-03 | Large Language Models for JSON Schema Discovery | Michael J. Mior et.al. | 2407.03286 | null | Kimi |
2105 | 2024-07-03 | LLM Internal States Reveal Hallucination Risk Faced With a Query | Ziwei Ji et.al. | 2407.03282 | link | Kimi |
2106 | 2024-07-03 | Cooperative Multi-Agent Deep Reinforcement Learning Methods for UAV-aided Mobile Edge Computing Networks | Mintae Kim et.al. | 2407.03280 | null | Kimi |
2107 | 2024-07-03 | Nesterov’s Accelerated Jacobi-Type Methods for Large-scale Symmetric Positive Semidefinite Linear Systems | Ling Liang et.al. | 2407.03272 | null | Kimi |
2108 | 2024-07-03 | STF: Sentence Transformer Fine-Tuning For Topic Categorization With Limited Data | Kheir Eddine Daouadi et.al. | 2407.03253 | null | Kimi |
2109 | 2024-07-03 | ACTRESS: Active Retraining for Semi-supervised Visual Grounding | Weitai Kang et.al. | 2407.03251 | null | Kimi |
2110 | 2024-07-04 | When big data actually are low-rank, or entrywise approximation of certain function-generated matrices | Stanislav Budzinskiy et.al. | 2407.03250 | link | Kimi |
2111 | 2024-07-03 | Bridging Model Heterogeneity in Federated Learning via Uncertainty-based Asymmetrical Reciprocity Learning | Jiaqi Wang et.al. | 2407.03247 | link | Kimi |
2112 | 2024-07-02 | MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention | Huiqiang Jiang et.al. | 2407.02490 | link | Kimi |
2113 | 2024-07-02 | Neurocache: Efficient Vector Retrieval for Long-range Language Modeling | Ali Safaya et.al. | 2407.02486 | link | Kimi |
2114 | 2024-07-02 | RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs | Yue Yu et.al. | 2407.02485 | null | Kimi |
2115 | 2024-07-02 | Characterizing the Interpretability of Attention Maps in Digital Pathology | Tomé Albuquerque et.al. | 2407.02484 | null | Kimi |
2116 | 2024-07-02 | MMedAgent: Learning to Use Medical Tools with Multi-modal Agent | Binxu Li et.al. | 2407.02483 | link | Kimi |
2117 | 2024-07-02 | Understanding Alignment in Multimodal LLMs: A Comprehensive Study | Elmira Amirloo et.al. | 2407.02477 | null | Kimi |
2118 | 2024-07-02 | Open Scene Graphs for Open World Object-Goal Navigation | Joel Loo et.al. | 2407.02473 | null | Kimi |
2119 | 2024-07-02 | Reliable Confidence Intervals for Information Retrieval Evaluation Using Generative A.I | Harrie Oosterhuis et.al. | 2407.02464 | null | Kimi |
2120 | 2024-07-02 | Decentralized Intelligence Network (DIN) | Abraham Nash et.al. | 2407.02461 | null | Kimi |
2121 | 2024-07-02 | Revisión de Métodos de Planificación de Camino de Cobertura para Entornos Agrícolas | Ismael Ait et.al. | 2407.02449 | null | Kimi |
ID | Publish Date | Title | Authors | Code | Kimi | |
---|---|---|---|---|---|---|
1 | 2024-12-12 | InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption | Tiehan Fan et.al. | 2412.09283 | null | Kimi |
2 | 2024-12-11 | GradStop: Exploring Training Dynamics in Unsupervised Outlier Detection through Gradient Cohesion | Yuang Zhang et.al. | 2412.08501 | link | Kimi |
3 | 2024-12-11 | Collaborative Inference for Large Models with Task Offloading and Early Exiting | Zuan Xie et.al. | 2412.08284 | null | Kimi |
4 | 2024-12-11 | Diff-GO $^\text{n}$ : Enhancing Diffusion Models for Goal-Oriented Communications | Suchinthaka Wanninayaka et.al. | 2412.06980 | null | Kimi |
5 | 2024-12-06 | Sparse autoencoders reveal selective remapping of visual concepts during adaptation | Hyesu Lim et.al. | 2412.05276 | link | Kimi |
6 | 2024-12-06 | BEExformer: A Fast Inferencing Transformer Architecture via Binarization with Multiple Early Exits | Wazib Ansar et.al. | 2412.05225 | null | Kimi |
7 | 2024-12-05 | A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs | Wangbo Zhao et.al. | 2412.03324 | link | Kimi |
8 | 2024-12-03 | Time-Series-Informed Closed-loop Learning for Sequential Decision Making and Control | Sebastian Hirt et.al. | 2412.02423 | null | Kimi |
9 | 2024-12-02 | Early Exit Is a Natural Capability in Transformer-based Models: An Empirical Study on Early Exit without Joint Optimization | Weiqiao Shan et.al. | 2412.01455 | null | Kimi |
10 | 2024-12-02 | EdgeOAR: Real-time Online Action Recognition On Edge Devices | Wei Luo et.al. | 2412.01267 | null | Kimi |
11 | 2024-12-02 | Reliable and scalable variable importance estimation via warm-start and early stopping | Zexuan Sun et.al. | 2412.01120 | link | Kimi |
12 | 2024-11-28 | Deep Neural Network-Based Prediction of B-Cell Epitopes for SARS-CoV and SARS-CoV-2: Enhancing Vaccine Design through Machine Learning | Xinyu Shi et.al. | 2412.00109 | null | Kimi |
13 | 2024-11-26 | Selfish Evolution: Making Discoveries in Extreme Label Noise with the Help of Overfitting Dynamics | Nima Sedaghat et.al. | 2412.00077 | null | Kimi |
14 | 2024-11-28 | DIESEL – Dynamic Inference-Guidance via Evasion of Semantic Embeddings in LLMs | Ben Ganon et.al. | 2411.19038 | null | Kimi |
15 | 2024-11-27 | One-Step Early Stopping Strategy using Neural Tangent Kernel Theory and Rademacher Complexity | Daniel Martin Xavier et.al. | 2411.18806 | null | Kimi |
16 | 2024-11-27 | HEMGS: A Hybrid Entropy Model for 3D Gaussian Splatting Data Compression | Lei Liu et.al. | 2411.18473 | null | Kimi |
17 | 2024-11-26 | Instance-Aware Graph Prompt Learning | Jiazheng Li et.al. | 2411.17676 | null | Kimi |
18 | 2024-11-22 | Instance-Aware Generalized Referring Expression Segmentation | E-Ro Nguyen et.al. | 2411.15087 | null | Kimi |
19 | 2024-11-19 | Deep Learning-Driven Heat Map Analysis for Evaluating thickness of Wounded Skin Layers | Devakumar GR et.al. | 2411.12678 | null | Kimi |
20 | 2024-11-15 | Exploiting Negative Curvature in Conjunction with Adaptive Sampling: Theoretical Results and a Practical Algorithm | Albert S. Berahas et.al. | 2411.10378 | null | Kimi |
21 | 2024-11-13 | Voxeland: Probabilistic Instance-Aware Semantic Mapping with Evidence-based Uncertainty Quantification | Jose-Luis Matez-Bandera et.al. | 2411.08727 | link | Kimi |
22 | 2024-11-11 | The Unreasonable Effectiveness of Monte Carlo Simulations in A/B Testing | Márton Trencséni et.al. | 2411.06701 | link | Kimi |
23 | 2024-11-07 | Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale | Flavio Di Palo et.al. | 2411.05045 | null | Kimi |
24 | 2024-11-07 | LoFi: Scalable Local Image Reconstruction with Implicit Neural Representation | AmirEhsan Khorashadizadeh et.al. | 2411.04995 | link | Kimi |
25 | 2024-11-05 | SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents | Dawei Li et.al. | 2411.03284 | link | Kimi |
26 | 2024-11-06 | Gradient Descent Finds Over-Parameterized Neural Networks with Sharp Generalization for Nonparametric Regression: A Distribution-Free Analysis | Yingzhen Yang et.al. | 2411.02904 | null | Kimi |
27 | 2024-11-05 | Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery | Bowei Du et.al. | 2411.02861 | null | Kimi |
28 | 2024-11-05 | CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge Collaboration | Hongpeng Jin et.al. | 2411.02829 | null | Kimi |
29 | 2024-11-06 | Energy-Aware Dynamic Neural Inference | Marcello Bullo et.al. | 2411.02471 | null | Kimi |
30 | 2024-11-04 | DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution | Yang Yue et.al. | 2411.02359 | link | Kimi |
31 | 2024-11-02 | Bi-Level Graph Structure Learning for Next POI Recommendation | Liang Wang et.al. | 2411.01169 | null | Kimi |
32 | 2024-10-30 | Accelerated AI Inference via Dynamic Execution Methods | Haim Barad et.al. | 2411.00853 | null | Kimi |
33 | 2024-11-01 | Preventing Model Collapse in Deep Canonical Correlation Analysis by Noise Regularization | Junlin He et.al. | 2411.00383 | null | Kimi |
34 | 2024-10-29 | Power side-channel leakage localization through adversarial training of deep neural networks | Jimmy Gammell et.al. | 2410.22425 | link | Kimi |
35 | 2024-10-27 | Branch-and-bound algorithm for efficient reliability analysis of general coherent systems | Ji-Eun Byun et.al. | 2410.22363 | null | Kimi |
36 | 2024-10-28 | Agreement Tasks in Fault-Prone Synchronous Networks of Arbitrary Structure | Pierre Fraigniaud et.al. | 2410.21538 | null | Kimi |
37 | 2024-10-28 | Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA | Sangmin Bae et.al. | 2410.20672 | null | Kimi |
38 | 2024-10-27 | Sequential Large Language Model-Based Hyper-Parameter Optimization | Kanan Mahammadli et.al. | 2410.20302 | link | Kimi |
39 | 2024-10-26 | Looking Beyond The Top-1: Transformers Determine Top Tokens In Order | Daria Lioubashevski et.al. | 2410.20210 | link | Kimi |
40 | 2024-10-26 | Dynamic layer selection in decoder-only transformers | Theodore Glavas et.al. | 2410.20022 | link | Kimi |
41 | 2024-10-25 | COMSPLIT: A Communication-Aware Split Learning Design for Heterogeneous IoT Platforms | Vukan Ninkovic et.al. | 2410.19375 | null | Kimi |
42 | 2024-10-30 | Dynamic Vocabulary Pruning in Early-Exit LLMs | Jort Vincenti et.al. | 2410.18952 | link | Kimi |
43 | 2024-10-24 | AdaEDL: Early Draft Stopping for Speculative Decoding of Large Language Models via an Entropy-based Lower Bound on Token Acceptance Probability | Sudhanshu Agrawal et.al. | 2410.18351 | null | Kimi |
44 | 2024-10-23 | Inferring stability properties of chaotic systems on autoencoders’ latent spaces | Elise Özalp et.al. | 2410.18003 | link | Kimi |
45 | 2024-10-23 | Diffusion Priors for Variational Likelihood Estimation and Image Denoising | Jun Cheng et.al. | 2410.17521 | link | Kimi |
46 | 2024-10-21 | Federated Learning with MMD-based Early Stopping for Adaptive GNSS Interference Classification | Nishant S. Gaikwad et.al. | 2410.15681 | null | Kimi |
47 | 2024-10-24 | BoostAdapter: Improving Vision-Language Test-Time Adaptation via Regional Bootstrapping | Taolin Zhang et.al. | 2410.15430 | link | Kimi |
48 | 2024-10-16 | FiRST: Finetuning Router-Selective Transformers for Input-Adaptive Latency Reduction | Akriti Jain et.al. | 2410.12513 | null | Kimi |
49 | 2024-10-15 | Juggernaut: Efficient Crypto-Agnostic Byzantine Agreement | Daniel Collins et.al. | 2410.12121 | null | Kimi |
50 | 2024-10-14 | Focused ReAct: Improving ReAct through Reiterate and Early Stop | Shuoqiu Li et.al. | 2410.10779 | null | Kimi |
51 | 2024-10-14 | big.LITTLE Vision Transformer for Efficient Visual Recognition | He Guo et.al. | 2410.10267 | null | Kimi |
52 | 2024-10-12 | DuoDiff: Accelerating Diffusion Models with a Dual-Backbone Approach | Daniel Gallo Fernández et.al. | 2410.09633 | link | Kimi |
53 | 2024-10-11 | Scaling Gaussian Processes for Learning Curve Prediction via Latent Kronecker Structure | Jihao Andreas Lin et.al. | 2410.09239 | null | Kimi |
54 | 2024-10-08 | Benchmarking of a new data splitting method on volcanic eruption data | Simona Reale et.al. | 2410.06306 | null | Kimi |
55 | 2024-10-08 | MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More | Wei Huang et.al. | 2410.06270 | link | Kimi |
56 | 2024-10-08 | Mini-Batch Kernel $k$ -means | Ben Jourdan et.al. | 2410.05902 | null | Kimi |
57 | 2024-10-06 | Distributed Inference on Mobile Edge and Cloud: An Early Exit based Clustering Approach | Divya Jyoti Bajpai et.al. | 2410.05338 | null | Kimi |
58 | 2024-10-07 | L-C4: Language-Based Video Colorization for Creative and Consistent Color | Zheng Chang et.al. | 2410.04972 | null | Kimi |
59 | 2024-10-06 | CAPEEN: Image Captioning with Early Exits and Knowledge Distillation | Divya Jyoti Bajpai et.al. | 2410.04433 | link | Kimi |
60 | 2024-10-06 | DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs | Divya Jyoti Bajpai et.al. | 2410.04424 | link | Kimi |
61 | 2024-10-03 | Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis | Zikun Zhang et.al. | 2410.02321 | null | Kimi |
62 | 2024-10-03 | Global dynamical structures from infinitesimal data | Benjamin McInroe et.al. | 2410.02111 | null | Kimi |
63 | 2024-10-02 | CHASE-SQL: Multi-Path Reasoning and Preference Optimized Candidate Selection in Text-to-SQL | Mohammadreza Pourreza et.al. | 2410.01943 | null | Kimi |
64 | 2024-10-02 | Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension | Zaiquan Yang et.al. | 2410.01544 | null | Kimi |
65 | 2024-10-01 | Timber! Poisoning Decision Trees | Stefano Calzavara et.al. | 2410.00862 | null | Kimi |
66 | 2024-09-30 | Inference of water waves surface elevation from horizontal velocity components using physics informed neural networks (PINN) | Omar Sallam et.al. | 2409.19851 | null | Kimi |
67 | 2024-09-27 | Improving Visual Object Tracking through Visual Prompting | Shih-Fang Chen et.al. | 2409.18901 | link | Kimi |
68 | 2024-09-24 | Reinforcement Leaning for Infinite-Dimensional Systems | Wei Zhang et.al. | 2409.15737 | null | Kimi |
69 | 2024-10-03 | Normalized Narrow Jump To Conclusions: Normalized Narrow Shortcuts for Parameter Efficient Early Exit Transformer Prediction | Amrit Diggavi Seshadri et.al. | 2409.14091 | null | Kimi |
70 | 2024-09-21 | Multiple-Exit Tuning: Towards Inference-Efficient Adaptation for Vision Transformer | Zheng Liu et.al. | 2409.13999 | null | Kimi |
71 | 2024-09-18 | Particle-based Instance-aware Semantic Occupancy Mapping in Dynamic Environments | Gang Chen et.al. | 2409.11975 | link | Kimi |
72 | 2024-09-17 | UniLCD: Unified Local-Cloud Decision-Making via Reinforcement Learning | Kathakoli Sengupta et.al. | 2409.11403 | null | Kimi |
73 | 2024-09-16 | Improving Multi-candidate Speculative Decoding | Xiaofan Lu et.al. | 2409.10644 | link | Kimi |
74 | 2024-09-14 | Group Sequential Testing of a Treatment Effect Using a Surrogate Marker | Layla Parast et.al. | 2409.09440 | link | Kimi |
75 | 2024-09-13 | Exploring System-Heterogeneous Federated Learning with Dynamic Model Selection | Dixi Yao et.al. | 2409.08858 | null | Kimi |
76 | 2024-09-11 | AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge | Han Wang et.al. | 2409.07394 | link | Kimi |
77 | 2024-09-11 | From optimal score matching to optimal sampling | Zehao Dou et.al. | 2409.07032 | null | Kimi |
78 | 2024-09-10 | Noisy Early Stopping for Noisy Labels | William Toner et.al. | 2409.06830 | null | Kimi |
79 | 2024-09-10 | Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds | Mu Cai et.al. | 2409.06827 | link | Kimi |
80 | 2024-08-26 | Optimizing STAR Aligner for High Throughput Computing in the Cloud | Piotr Kica et.al. | 2409.05886 | null | Kimi |
81 | 2024-09-09 | Early-exit Convolutional Neural Networks | Edanur Demir et.al. | 2409.05336 | link | Kimi |
82 | 2024-09-08 | Attention-Based Efficient Breath Sound Removal in Studio Audio Recordings | Nidula Elgiriyewithana et.al. | 2409.04949 | null | Kimi |
83 | 2024-09-16 | RTop-K: Ultra-Fast Row-Wise Top-K Algorithm and GPU Implementation for Neural Networks | Xi Xie et.al. | 2409.00822 | null | Kimi |
84 | 2024-08-30 | Dynamic Self-Consistency: Leveraging Reasoning Paths for Efficient LLM Sampling | Guangya Wan et.al. | 2408.17017 | null | Kimi |
85 | 2024-08-24 | Inferring the shape of a solid inside a draining tank from its liquid level dynamics | Gbenga Fabusola et.al. | 2408.14503 | null | Kimi |
86 | 2024-08-26 | Re-Mix: Optimizing Data Mixtures for Large Scale Imitation Learning | Joey Hejna et.al. | 2408.14037 | link | Kimi |
87 | 2024-08-24 | Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning | Xinglin Wang et.al. | 2408.13457 | null | Kimi |
88 | 2024-08-24 | Face Clustering via Early Stopping and Edge Recall | Junjie Liu et.al. | 2408.13431 | link | Kimi |
89 | 2024-08-21 | Critique-out-Loud Reward Models | Zachary Ankner et.al. | 2408.11791 | link | Kimi |
90 | 2024-08-21 | EEG-Defender: Defending against Jailbreak through Early Exit Generation of Large Language Models | Chongwen Zhao et.al. | 2408.11308 | null | Kimi |
91 | 2024-08-20 | Inferring Underwater Topography with FINN | Coşku Can Horuz et.al. | 2408.10649 | null | Kimi |
92 | 2024-08-15 | An Efficient Continuous Control Perspective for Reinforcement-Learning-based Sequential Recommendation | Jun Wang et.al. | 2408.08047 | null | Kimi |
93 | 2024-08-14 | Rethinking the Key Factors for the Generalization of Remote Sensing Stereo Matching Networks | Liting Jiang et.al. | 2408.07613 | null | Kimi |
94 | 2024-08-12 | HeLiMOS: A Dataset for Moving Object Segmentation in 3D Point Clouds From Heterogeneous LiDAR Sensors | Hyungtae Lim et.al. | 2408.06328 | null | Kimi |
95 | 2024-08-12 | Transfer learning of state-based potential games for process optimization in decentralized manufacturing systems | Steve Yuwono et.al. | 2408.05992 | null | Kimi |
96 | 2024-08-12 | A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models | Taehong Moon et.al. | 2408.05927 | link | Kimi |
97 | 2024-08-08 | Early-Exit meets Model-Distributed Inference at Edge Networks | Marco Colocrese et.al. | 2408.05247 | null | Kimi |
98 | 2024-08-09 | PriPHiT: Privacy-Preserving Hierarchical Training of Deep Neural Networks | Yamin Sepehri et.al. | 2408.05092 | null | Kimi |
99 | 2024-08-09 | Early Exit Strategies for Approximate k-NN Search in Dense Retrieval | Francesco Busolin et.al. | 2408.04981 | null | Kimi |
100 | 2024-08-07 | Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling | Zilyu Ye et.al. | 2408.03695 | link | Kimi |
101 | 2024-08-03 | Advancing Green AI: Efficient and Accurate Lightweight CNNs for Rice Leaf Disease Identification | Khairun Saddami et.al. | 2408.01752 | null | Kimi |
102 | 2024-08-01 | Early Stopping Based on Repeated Significance | Eric Bax et.al. | 2408.00908 | null | Kimi |
103 | 2024-07-31 | Automated Sperm Morphology Analysis Based on Instance-Aware Part Segmentation | Wenyuan Chen et.al. | 2408.00112 | null | Kimi |
104 | 2024-07-30 | Accelerating Large Language Model Inference with Self-Supervised Early Exits | Florian Valade et.al. | 2407.21082 | null | Kimi |
105 | 2024-07-25 | An Efficient Inference Framework for Early-exit Large Language Models | Ruijie Miao et.al. | 2407.20272 | null | Kimi |
106 | 2024-07-26 | Topology Optimization of Random Memristors for Input-Aware Dynamic SNN | Bo Wang et.al. | 2407.18625 | link | Kimi |
107 | 2024-07-25 | Superior Scoring Rules for Probabilistic Evaluation of Single-Label Multi-Class Classification Tasks | Rouhollah Ahmadian et.al. | 2407.17697 | null | Kimi |
108 | 2024-07-23 | Can Large Language Models Automatically Jailbreak GPT-4V? | Yuanwei Wu et.al. | 2407.16686 | null | Kimi |
109 | 2024-07-22 | WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding | Quan Kong et.al. | 2407.15350 | null | Kimi |
110 | 2024-07-19 | Joint or Disjoint: Mixing Training Regimes for Early-Exit Models | Bartłomiej Krzepkowski et.al. | 2407.14320 | link | Kimi |
111 | 2024-07-19 | BERTer: The Efficient One | Pradyumna Saligram et.al. | 2407.14039 | null | Kimi |
112 | 2024-07-18 | On the consistency of rotation curves and spatially integrated HI flux profiles | Tariq Yasin et.al. | 2407.13754 | null | Kimi |
113 | 2024-07-19 | Revisiting Adaptive Cellular Recognition Under Domain Shifts: A Contextual Correspondence View | Jianan Fan et.al. | 2407.12870 | link | Kimi |
114 | 2024-07-17 | Hallucination Index: An Image Quality Metric for Generative Reconstruction Models | Matthew Tivnan et.al. | 2407.12780 | null | Kimi |
115 | 2024-07-16 | Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning | Yanting Miao et.al. | 2407.12164 | link | Kimi |
116 | 2024-07-16 | Enhancing Split Computing and Early Exit Applications through Predefined Sparsity | Luigi Capogrosso et.al. | 2407.11763 | link | Kimi |
117 | 2024-07-16 | Preconditioned Gradient Descent Finds Over-Parameterized Neural Networks with Sharp Generalization for Nonparametric Regression | Yingzhen Yang et.al. | 2407.11353 | null | Kimi |
118 | 2024-07-10 | Exploring the Boundaries of On-Device Inference: When Tiny Falls Short, Go Hierarchical | Adarsh Prasad Behera et.al. | 2407.11061 | null | Kimi |
119 | 2024-07-15 | Multilingual Contrastive Decoding via Language-Agnostic Layers Skipping | Wenhao Zhu et.al. | 2407.10795 | link | Kimi |
120 | 2024-07-13 | Towards understanding epoch-wise double descent in two-layer linear neural networks | Amanda Olmin et.al. | 2407.09845 | null | Kimi |
121 | 2024-07-11 | Sensor-Aware Classifiers for Energy-Efficient Time Series Applications on IoT Devices | Dina Hussein et.al. | 2407.08715 | null | Kimi |
122 | 2024-07-07 | Learning Motion Blur Robust Vision Transformers with Dynamic Early Exit for Real-Time UAV Tracking | You Wu et.al. | 2407.05383 | null | Kimi |
123 | 2024-07-04 | Unsupervised speech enhancement with spectral kurtosis and double deep priors | Hien Ohnaka et.al. | 2407.03887 | null | Kimi |
124 | 2024-07-02 | Advancing Compressed Video Action Recognition through Progressive Knowledge Distillation | Efstathia Soufleri et.al. | 2407.02713 | link | Kimi |
125 | 2024-07-02 | Zero-shot Video Restoration and Enhancement Using Pre-Trained Image Diffusion Model | Cong Cao et.al. | 2407.01960 | null | Kimi |
126 | 2024-07-01 | Exact statistical analysis for response-adaptive clinical trials: a general and computationally tractable approach | Stef Baas et.al. | 2407.01055 | null | Kimi |
127 | 2024-07-01 | SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection | Dingkang Liang et.al. | 2407.01016 | null | Kimi |
128 | 2024-06-27 | Adaptive Stochastic Weight Averaging | Caglar Demir et.al. | 2406.19092 | link | Kimi |
129 | 2024-06-26 | An Order Theory Framework of Recurrence Equations for Static Cost Analysis $-$ Dynamic Inference of Non-Linear Inequality Invariants | Louis Rustenholz et.al. | 2406.18260 | null | Kimi |
130 | 2024-06-24 | SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments | Neng Wang et.al. | 2406.16279 | link | Kimi |
131 | 2024-06-21 | Micro-power spoken keyword spotting on Xylo Audio 2 | Hannah Bos et.al. | 2406.15112 | null | Kimi |
132 | 2024-06-21 | Early stopping for conjugate gradients in statistical inverse problems | Laura Hucker et.al. | 2406.15001 | null | Kimi |
133 | 2024-06-21 | Cost-Effective RF Fingerprinting Based on Hybrid CVNN-RF Classifier with Automated Multi-Dimensional Early-Exit Strategy | Jiayan Gan et.al. | 2406.14869 | null | Kimi |
134 | 2024-06-20 | On Layer-wise Representation Similarity: Application for Multi-Exit Models with a Single Classifier | Jiachen Jiang et.al. | 2406.14479 | null | Kimi |