Usage instructions: here
Other links:
ID | Publish Date | Title | Authors | Code | Kimi | |
---|---|---|---|---|---|---|
1 | 2025-05-22 | CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms | Shilin Yan et.al. | 2505.17020 | null | Kimi |
2 | 2025-05-22 | Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO | Chengzhuo Tong et.al. | 2505.17017 | null | Kimi |
3 | 2025-05-22 | Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding | Runpeng Yu et.al. | 2505.16990 | null | Kimi |
4 | 2025-05-22 | Bottlenecked Transformers: Periodic KV Cache Abstraction for Generalised Reasoning | Adnan Oomerjee et.al. | 2505.16950 | null | Kimi |
5 | 2025-05-22 | CASTILLO: Characterizing Response Length Distributions of Large Language Models | Daniel F. Perez-Ramirez et.al. | 2505.16881 | null | Kimi |
6 | 2025-05-22 | R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search | Yibo Wang et.al. | 2505.16838 | null | Kimi |
7 | 2025-05-22 | KTAE: A Model-Free Algorithm to Key-Tokens Advantage Estimation in Mathematical Reasoning | Wei Sun et.al. | 2505.16826 | null | Kimi |
8 | 2025-05-22 | Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning | Xinghao Chen et.al. | 2505.16782 | null | Kimi |
9 | 2025-05-22 | R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO | Huanjin Yao et.al. | 2505.16673 | null | Kimi |
10 | 2025-05-22 | Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding | Feilong Tang et.al. | 2505.16652 | null | Kimi |
11 | 2025-05-22 | Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains | Wenhui Tan et.al. | 2505.16552 | null | Kimi |
12 | 2025-05-22 | LLaMAs Have Feelings Too: Unveiling Sentiment and Emotion Representations in LLaMA Models Through Probing | Dario Di Palma et.al. | 2505.16491 | null | Kimi |
13 | 2025-05-22 | WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning | Zhepei Wei et.al. | 2505.16421 | null | Kimi |
14 | 2025-05-22 | DriveMoE: Mixture-of-Experts for Vision-Language-Action Model in End-to-End Autonomous Driving | Zhenjie Yang et.al. | 2505.16278 | null | Kimi |
15 | 2025-05-22 | LIFEBench: Evaluating Length Instruction Following in Large Language Models | Wei Zhang et.al. | 2505.16234 | null | Kimi |
16 | 2025-05-22 | NQKV: A KV Cache Quantization Scheme Based on Normal Distribution Characteristics | Zhihang Cai et.al. | 2505.16210 | null | Kimi |
17 | 2025-05-22 | QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design | Benjamin Schneider et.al. | 2505.16175 | null | Kimi |
18 | 2025-05-22 | KNN-SSD: Enabling Dynamic Self-Speculative Decoding via Nearest Neighbor Layer Set Optimization | Mingbo Song et.al. | 2505.16162 | null | Kimi |
19 | 2025-05-22 | Training-Free Reasoning and Reflection in MLLMs | Hongchen Wei et.al. | 2505.16151 | null | Kimi |
20 | 2025-05-22 | Steering LVLMs via Sparse Autoencoder for Hallucination Mitigation | Zhenglin Hua et.al. | 2505.16146 | null | Kimi |
21 | 2025-05-22 | Date Fragments: A Hidden Bottleneck of Tokenization for Temporal Reasoning | Gagan Bhatia et.al. | 2505.16088 | null | Kimi |
22 | 2025-05-22 | Optimizing LLM-Based Multi-Agent System with Textual Feedback: A Case Study on Software Development | Ming Shen et.al. | 2505.16086 | null | Kimi |
23 | 2025-05-21 | Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models | Jingcong Liang et.al. | 2505.16056 | null | Kimi |
24 | 2025-05-21 | Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning | Alex Su et.al. | 2505.15966 | null | Kimi |
25 | 2025-05-21 | Extracting Probabilistic Knowledge from Large Language Models for Bayesian Network Parameterization | Aliakbar Nafar et.al. | 2505.15918 | null | Kimi |
26 | 2025-05-21 | dKV-Cache: The Cache for Diffusion Language Models | Xinyin Ma et.al. | 2505.15781 | link | Kimi |
27 | 2025-05-21 | Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space | Zhen Zhang et.al. | 2505.15778 | link | Kimi |
28 | 2025-05-21 | Beyond Hard and Soft: Hybrid Context Compression for Balancing Local and Global Information Retention | Huanxuan Liao et.al. | 2505.15774 | null | Kimi |
29 | 2025-05-21 | ThinkLess: A Training-Free Inference-Efficient Method for Reducing Reasoning Redundancy | Gengyang Li et.al. | 2505.15684 | null | Kimi |
30 | 2025-05-21 | A Federated Splitting Framework for LLMs: Security, Efficiency, and Adaptability | Zishuai Zhang et.al. | 2505.15683 | link | Kimi |
31 | 2025-05-21 | Feature Extraction and Steering for Enhanced Chain-of-Thought Reasoning in Language Models | Zihao Li et.al. | 2505.15634 | null | Kimi |
32 | 2025-05-21 | Learn to Reason Efficiently with Adaptive Length-based Reward Shaping | Wei Liu et.al. | 2505.15612 | link | Kimi |
33 | 2025-05-21 | Multilingual Test-Time Scaling via Initial Thought Transfer | Prasoon Bajpai et.al. | 2505.15508 | null | Kimi |
34 | 2025-05-21 | Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought | Ao Liu et.al. | 2505.15431 | null | Kimi |
35 | 2025-05-21 | FlowKV: Enhancing Multi-Turn Conversational Coherence in LLMs via Isolated Key-Value Cache Management | Xiang Liu et.al. | 2505.15347 | null | Kimi |
36 | 2025-05-21 | Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Prefilling Attack | Silvia Cappelletti et.al. | 2505.15323 | null | Kimi |
37 | 2025-05-21 | Hallucinate at the Last in Long Response Generation: A Case Study on Long Document Summarization | Joonho Yang et.al. | 2505.15291 | null | Kimi |
38 | 2025-05-21 | LiveVLM: Efficient Online Video Understanding via Streaming-Oriented KV Cache and Retrieval | Zhenyu Ning et.al. | 2505.15269 | null | Kimi |
39 | 2025-05-21 | Towards Explainable Temporal Reasoning in Large Language Models: A Structure-Aware Generative Framework | Zihao Jiang et.al. | 2505.15245 | link | Kimi |
40 | 2025-05-21 | Prolonged Reasoning Is Not All You Need: Certainty-Based Adaptive Routing for Efficient LLM/MLLM Reasoning | Jinghui Lu et.al. | 2505.15154 | null | Kimi |
41 | 2025-05-21 | BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms | Yunlong Hou et.al. | 2505.15141 | null | Kimi |
42 | 2025-05-21 | The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning | Shivam Agarwal et.al. | 2505.15134 | null | Kimi |
43 | 2025-05-21 | An Empirical Study on Reinforcement Learning for Reasoning-Search Interleaved LLM Agents | Bowen Jin et.al. | 2505.15117 | link | Kimi |
44 | 2025-05-21 | RoT: Enhancing Table Reasoning with Iterative Row-Wise Traversals | Xuanliang Zhang et.al. | 2505.15110 | null | Kimi |
45 | 2025-05-21 | Traveling Across Languages: Benchmarking Cross-Lingual Consistency in Multimodal LLMs | Hao Wang et.al. | 2505.15075 | link | Kimi |
46 | 2025-05-21 | Learning to Rank Chain-of-Thought: An Energy-Based Approach with Outcome Supervision | Eric Hanchen Jiang et.al. | 2505.14999 | null | Kimi |
47 | 2025-05-20 | STree: Speculative Tree Decoding for Hybrid State-Space Models | Yangchao Wu et.al. | 2505.14969 | null | Kimi |
48 | 2025-05-20 | Too Long, Didn’t Model: Decomposing LLM Long-Context Understanding With Novels | Sil Hamilton et.al. | 2505.14925 | null | Kimi |
49 | 2025-05-20 | Scaling Laws for State Dynamics in Large Language Models | Jacob X Li et.al. | 2505.14892 | null | Kimi |
50 | 2025-05-20 | Balanced and Elastic End-to-end Training of Dynamic LLMs | Mohamed Wahib et.al. | 2505.14864 | null | Kimi |
51 | 2025-05-20 | Text Generation Beyond Discrete Token Sampling | Yufan Zhuang et.al. | 2505.14827 | null | Kimi |
52 | 2025-05-21 | Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning | Haolei Xu et.al. | 2505.14684 | null | Kimi |
53 | 2025-05-20 | Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training | Mengru Wang et.al. | 2505.14681 | null | Kimi |
54 | 2025-05-20 | Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning | Jiaer Xia et.al. | 2505.14677 | null | Kimi |
55 | 2025-05-20 | SAFEPATH: Preventing Harmful Reasoning in Chain-of-Thought via Early Alignment | Wonje Jeung et.al. | 2505.14667 | null | Kimi |
56 | 2025-05-20 | Beyond Words: Multimodal LLM Knows When to Speak | Zikai Liao et.al. | 2505.14654 | null | Kimi |
57 | 2025-05-20 | KERL: Knowledge-Enhanced Personalized Recipe Recommendation using Large Language Models | Fnu Mohbat et.al. | 2505.14629 | link | Kimi |
58 | 2025-05-20 | Enhancing Learned Knowledge in LoRA Adapters Through Efficient Contrastive Decoding on Ascend NPUs | Morgan Lindsay Heisler et.al. | 2505.14620 | null | Kimi |
59 | 2025-05-20 | Can Pruning Improve Reasoning? Revisiting Long-CoT Compression with Capability in Mind for Better Reasoning | Shangziqi Zhao et.al. | 2505.14582 | null | Kimi |
60 | 2025-05-20 | Reasoning Models Better Express Their Confidence | Dongkeun Yoon et.al. | 2505.14489 | link | Kimi |
61 | 2025-05-20 | Log-Augmented Generation: Scaling Test-Time Reasoning with Reusable Computation | Peter Baile Chen et.al. | 2505.14398 | null | Kimi |
62 | 2025-05-20 | Scaling and Enhancing LLM-based AVSR: A Sparse Mixture of Projectors Approach | Umberto Cappellazzo et.al. | 2505.14336 | null | Kimi |
63 | 2025-05-20 | Speculative Decoding Reimagined for Multimodal Large Language Models | Luxi Lin et.al. | 2505.14260 | null | Kimi |
64 | 2025-05-20 | FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation | Shaolin Zhu et.al. | 2505.14256 | null | Kimi |
65 | 2025-05-20 | Tokenization Constraints in LLMs: A Study of Symbolic and Arithmetic Reasoning Limits | Xiang Zhang et.al. | 2505.14178 | null | Kimi |
66 | 2025-05-20 | RL of Thoughts: Navigating LLM Reasoning with Inference-time Reinforcement Learning | Qianyue Hao et.al. | 2505.14140 | null | Kimi |
67 | 2025-05-20 | DiagnosisArena: Benchmarking Diagnostic Reasoning for Large Language Models | Yakun Zhu et.al. | 2505.14107 | link | Kimi |
68 | 2025-05-20 | Toward Effective Reinforcement Learning Fine-Tuning for Medical VQA in Vision-Language Models | Wenhui Zhu et.al. | 2505.13973 | null | Kimi |
69 | 2025-05-20 | FlashThink: An Early Exit Method For Efficient Reasoning | Guochao Jiang et.al. | 2505.13949 | null | Kimi |
70 | 2025-05-20 | EEG-to-Text Translation: A Model for Deciphering Human Brain Activity | Saydul Akbar Murad et.al. | 2505.13936 | link | Kimi |
71 | 2025-05-20 | Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning | Jiwon Song et.al. | 2505.13866 | null | Kimi |
72 | 2025-05-20 | EfficientLLM: Efficiency in Large Language Models | Zhengqing Yuan et.al. | 2505.13840 | null | Kimi |
73 | 2025-05-20 | Structured Agent Distillation for Large Language Model | Jun Liu et.al. | 2505.13820 | null | Kimi |
74 | 2025-05-19 | Ice Cream Doesn’t Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Inference | Jin Du et.al. | 2505.13770 | null | Kimi |
75 | 2025-05-19 | Causal Head Gating: A Framework for Interpreting Roles of Attention Heads in Transformers | Andrew Nam et.al. | 2505.13737 | null | Kimi |
76 | 2025-05-19 | RL in Name Only? Analyzing the Structural Assumptions in RL post-training for LLMs | Soumya Rani Samineni et.al. | 2505.13697 | null | Kimi |
77 | 2025-05-19 | Optimizing Anytime Reasoning via Budget Relative Policy Optimization | Penghui Qi et.al. | 2505.13438 | link | Kimi |
78 | 2025-05-19 | CoT-Kinetics: A Theoretical Modeling Assessing LRM Reasoning Process | Jinhe Bi et.al. | 2505.13408 | null | Kimi |
79 | 2025-05-19 | Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training and Inference | Shuqing Luo et.al. | 2505.13345 | link | Kimi |
80 | 2025-05-19 | Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space | Hengli Li et.al. | 2505.13308 | null | Kimi |
81 | 2025-05-19 | RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning | Qiguang Chen et.al. | 2505.13307 | link | Kimi |
82 | 2025-05-19 | Effective and Transparent RAG: Adaptive-Reward Reinforcement Learning for Decision Traceability | Jingyi Ren et.al. | 2505.13258 | null | Kimi |
83 | 2025-05-19 | HeteroSpec: Leveraging Contextual Heterogeneity for Efficient Speculative Decoding | Siran Liu et.al. | 2505.13254 | null | Kimi |
84 | 2025-05-19 | Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification | Jikai Wang et.al. | 2505.13204 | null | Kimi |
85 | 2025-05-19 | Adversarial Testing in LLMs: Insights into Decision-Making Vulnerabilities | Lili Zhang et.al. | 2505.13195 | null | Kimi |
86 | 2025-05-19 | ModernGBERT: German-only 1B Encoder Model Trained from Scratch | Anton Ehrmanntraut et.al. | 2505.13136 | null | Kimi |
87 | 2025-05-19 | Benchmarking and Confidence Evaluation of LALMs For Temporal Reasoning | Debarpan Bhattacharya et.al. | 2505.13115 | null | Kimi |
88 | 2025-05-19 | FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference | Guangda Liu et.al. | 2505.13109 | null | Kimi |
89 | 2025-05-19 | Walking the Tightrope: Disentangling Beneficial and Detrimental Drifts in Non-Stationary Custom-Tuning | Xiaoyu Yang et.al. | 2505.13081 | null | Kimi |
90 | 2025-05-19 | MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO | Yicheng Xiao et.al. | 2505.13031 | link | Kimi |
91 | 2025-05-19 | Fractured Chain-of-Thought Reasoning | Baohao Liao et.al. | 2505.12992 | null | Kimi |
92 | 2025-05-19 | A3 : an Analytical Low-Rank Approximation Framework for Attention | Jeffrey T. H. Wong et.al. | 2505.12942 | null | Kimi |
93 | 2025-05-19 | Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs | Zhihe Yang et.al. | 2505.12929 | link | Kimi |
94 | 2025-05-19 | The Traitors: Deception and Trust in Multi-Agent Language Model Simulations | Pedro M. P. Curvo et.al. | 2505.12923 | null | Kimi |
95 | 2025-05-19 | LEXam: Benchmarking Legal Reasoning on 340 Law Exams | Yu Fan et.al. | 2505.12864 | null | Kimi |
96 | 2025-05-19 | Reasoning BO: Enhancing Bayesian Optimization with Long-Context Reasoning Power of LLMs | Zhuo Yang et.al. | 2505.12833 | null | Kimi |
97 | 2025-05-19 | SynDec: A Synthesize-then-Decode Approach for Arbitrary Textual Style Transfer via Large Language Models | Han Sun et.al. | 2505.12821 | null | Kimi |
98 | 2025-05-19 | Accelerating Adaptive Retrieval Augmented Generation via Instruction-Driven Representation Reduction of Retrieval Overlaps | Jie Ou et.al. | 2505.12731 | null | Kimi |
99 | 2025-05-19 | FLASH: Latent-Aware Semi-Autoregressive Speculative Decoding for Multimodal Tasks | Zihua Wang et.al. | 2505.12728 | null | Kimi |
100 | 2025-05-19 | ToTRL: Unlock LLM Tree-of-Thoughts Reasoning Potential through Puzzles Solving | Haoyuan Wu et.al. | 2505.12717 | null | Kimi |
ID | Publish Date | Title | Authors | Code | Kimi | |
---|---|---|---|---|---|---|
1 | 2024-12-12 | InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption | Tiehan Fan et.al. | 2412.09283 | null | Kimi |
2 | 2024-12-11 | GradStop: Exploring Training Dynamics in Unsupervised Outlier Detection through Gradient Cohesion | Yuang Zhang et.al. | 2412.08501 | link | Kimi |
3 | 2024-12-11 | Collaborative Inference for Large Models with Task Offloading and Early Exiting | Zuan Xie et.al. | 2412.08284 | null | Kimi |
4 | 2024-12-11 | Diff-GO $^\text{n}$ : Enhancing Diffusion Models for Goal-Oriented Communications | Suchinthaka Wanninayaka et.al. | 2412.06980 | null | Kimi |
5 | 2024-12-06 | Sparse autoencoders reveal selective remapping of visual concepts during adaptation | Hyesu Lim et.al. | 2412.05276 | link | Kimi |
6 | 2024-12-06 | BEExformer: A Fast Inferencing Transformer Architecture via Binarization with Multiple Early Exits | Wazib Ansar et.al. | 2412.05225 | null | Kimi |
7 | 2024-12-05 | A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs | Wangbo Zhao et.al. | 2412.03324 | link | Kimi |
8 | 2024-12-03 | Time-Series-Informed Closed-loop Learning for Sequential Decision Making and Control | Sebastian Hirt et.al. | 2412.02423 | null | Kimi |
9 | 2024-12-02 | Early Exit Is a Natural Capability in Transformer-based Models: An Empirical Study on Early Exit without Joint Optimization | Weiqiao Shan et.al. | 2412.01455 | null | Kimi |
10 | 2024-12-02 | EdgeOAR: Real-time Online Action Recognition On Edge Devices | Wei Luo et.al. | 2412.01267 | null | Kimi |
11 | 2024-12-02 | Reliable and scalable variable importance estimation via warm-start and early stopping | Zexuan Sun et.al. | 2412.01120 | link | Kimi |
12 | 2024-11-28 | Deep Neural Network-Based Prediction of B-Cell Epitopes for SARS-CoV and SARS-CoV-2: Enhancing Vaccine Design through Machine Learning | Xinyu Shi et.al. | 2412.00109 | null | Kimi |
13 | 2024-11-26 | Selfish Evolution: Making Discoveries in Extreme Label Noise with the Help of Overfitting Dynamics | Nima Sedaghat et.al. | 2412.00077 | null | Kimi |
14 | 2024-11-28 | DIESEL – Dynamic Inference-Guidance via Evasion of Semantic Embeddings in LLMs | Ben Ganon et.al. | 2411.19038 | null | Kimi |
15 | 2024-11-27 | One-Step Early Stopping Strategy using Neural Tangent Kernel Theory and Rademacher Complexity | Daniel Martin Xavier et.al. | 2411.18806 | null | Kimi |
16 | 2024-11-27 | HEMGS: A Hybrid Entropy Model for 3D Gaussian Splatting Data Compression | Lei Liu et.al. | 2411.18473 | null | Kimi |
17 | 2024-11-26 | Instance-Aware Graph Prompt Learning | Jiazheng Li et.al. | 2411.17676 | null | Kimi |
18 | 2024-11-22 | Instance-Aware Generalized Referring Expression Segmentation | E-Ro Nguyen et.al. | 2411.15087 | null | Kimi |
19 | 2024-11-19 | Deep Learning-Driven Heat Map Analysis for Evaluating thickness of Wounded Skin Layers | Devakumar GR et.al. | 2411.12678 | null | Kimi |
20 | 2024-11-15 | Exploiting Negative Curvature in Conjunction with Adaptive Sampling: Theoretical Results and a Practical Algorithm | Albert S. Berahas et.al. | 2411.10378 | null | Kimi |
21 | 2024-11-13 | Voxeland: Probabilistic Instance-Aware Semantic Mapping with Evidence-based Uncertainty Quantification | Jose-Luis Matez-Bandera et.al. | 2411.08727 | link | Kimi |
22 | 2024-11-11 | The Unreasonable Effectiveness of Monte Carlo Simulations in A/B Testing | Márton Trencséni et.al. | 2411.06701 | link | Kimi |
23 | 2024-11-07 | Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale | Flavio Di Palo et.al. | 2411.05045 | null | Kimi |
24 | 2024-11-07 | LoFi: Scalable Local Image Reconstruction with Implicit Neural Representation | AmirEhsan Khorashadizadeh et.al. | 2411.04995 | link | Kimi |
25 | 2024-11-05 | SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents | Dawei Li et.al. | 2411.03284 | link | Kimi |
26 | 2024-11-06 | Gradient Descent Finds Over-Parameterized Neural Networks with Sharp Generalization for Nonparametric Regression: A Distribution-Free Analysis | Yingzhen Yang et.al. | 2411.02904 | null | Kimi |
27 | 2024-11-05 | Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery | Bowei Du et.al. | 2411.02861 | null | Kimi |
28 | 2024-11-05 | CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge Collaboration | Hongpeng Jin et.al. | 2411.02829 | null | Kimi |
29 | 2024-11-06 | Energy-Aware Dynamic Neural Inference | Marcello Bullo et.al. | 2411.02471 | null | Kimi |
30 | 2024-11-04 | DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution | Yang Yue et.al. | 2411.02359 | link | Kimi |
31 | 2024-11-02 | Bi-Level Graph Structure Learning for Next POI Recommendation | Liang Wang et.al. | 2411.01169 | null | Kimi |
32 | 2024-10-30 | Accelerated AI Inference via Dynamic Execution Methods | Haim Barad et.al. | 2411.00853 | null | Kimi |
33 | 2024-11-01 | Preventing Model Collapse in Deep Canonical Correlation Analysis by Noise Regularization | Junlin He et.al. | 2411.00383 | null | Kimi |
34 | 2024-10-29 | Power side-channel leakage localization through adversarial training of deep neural networks | Jimmy Gammell et.al. | 2410.22425 | link | Kimi |
35 | 2024-10-27 | Branch-and-bound algorithm for efficient reliability analysis of general coherent systems | Ji-Eun Byun et.al. | 2410.22363 | null | Kimi |
36 | 2024-10-28 | Agreement Tasks in Fault-Prone Synchronous Networks of Arbitrary Structure | Pierre Fraigniaud et.al. | 2410.21538 | null | Kimi |
37 | 2024-10-28 | Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA | Sangmin Bae et.al. | 2410.20672 | null | Kimi |
38 | 2024-10-27 | Sequential Large Language Model-Based Hyper-Parameter Optimization | Kanan Mahammadli et.al. | 2410.20302 | link | Kimi |
39 | 2024-10-26 | Looking Beyond The Top-1: Transformers Determine Top Tokens In Order | Daria Lioubashevski et.al. | 2410.20210 | link | Kimi |
40 | 2024-10-26 | Dynamic layer selection in decoder-only transformers | Theodore Glavas et.al. | 2410.20022 | link | Kimi |
41 | 2024-10-25 | COMSPLIT: A Communication-Aware Split Learning Design for Heterogeneous IoT Platforms | Vukan Ninkovic et.al. | 2410.19375 | null | Kimi |
42 | 2024-10-30 | Dynamic Vocabulary Pruning in Early-Exit LLMs | Jort Vincenti et.al. | 2410.18952 | link | Kimi |
43 | 2024-10-24 | AdaEDL: Early Draft Stopping for Speculative Decoding of Large Language Models via an Entropy-based Lower Bound on Token Acceptance Probability | Sudhanshu Agrawal et.al. | 2410.18351 | null | Kimi |
44 | 2024-10-23 | Inferring stability properties of chaotic systems on autoencoders’ latent spaces | Elise Özalp et.al. | 2410.18003 | link | Kimi |
45 | 2024-10-23 | Diffusion Priors for Variational Likelihood Estimation and Image Denoising | Jun Cheng et.al. | 2410.17521 | link | Kimi |
46 | 2024-10-21 | Federated Learning with MMD-based Early Stopping for Adaptive GNSS Interference Classification | Nishant S. Gaikwad et.al. | 2410.15681 | null | Kimi |
47 | 2024-10-24 | BoostAdapter: Improving Vision-Language Test-Time Adaptation via Regional Bootstrapping | Taolin Zhang et.al. | 2410.15430 | link | Kimi |
48 | 2024-10-16 | FiRST: Finetuning Router-Selective Transformers for Input-Adaptive Latency Reduction | Akriti Jain et.al. | 2410.12513 | null | Kimi |
49 | 2024-10-15 | Juggernaut: Efficient Crypto-Agnostic Byzantine Agreement | Daniel Collins et.al. | 2410.12121 | null | Kimi |
50 | 2024-10-14 | Focused ReAct: Improving ReAct through Reiterate and Early Stop | Shuoqiu Li et.al. | 2410.10779 | null | Kimi |
51 | 2024-10-14 | big.LITTLE Vision Transformer for Efficient Visual Recognition | He Guo et.al. | 2410.10267 | null | Kimi |
52 | 2024-10-12 | DuoDiff: Accelerating Diffusion Models with a Dual-Backbone Approach | Daniel Gallo Fernández et.al. | 2410.09633 | link | Kimi |
53 | 2024-10-11 | Scaling Gaussian Processes for Learning Curve Prediction via Latent Kronecker Structure | Jihao Andreas Lin et.al. | 2410.09239 | null | Kimi |
54 | 2024-10-08 | Benchmarking of a new data splitting method on volcanic eruption data | Simona Reale et.al. | 2410.06306 | null | Kimi |
55 | 2024-10-08 | MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More | Wei Huang et.al. | 2410.06270 | link | Kimi |
56 | 2024-10-08 | Mini-Batch Kernel $k$ -means | Ben Jourdan et.al. | 2410.05902 | null | Kimi |
57 | 2024-10-06 | Distributed Inference on Mobile Edge and Cloud: An Early Exit based Clustering Approach | Divya Jyoti Bajpai et.al. | 2410.05338 | null | Kimi |
58 | 2024-10-07 | L-C4: Language-Based Video Colorization for Creative and Consistent Color | Zheng Chang et.al. | 2410.04972 | null | Kimi |
59 | 2024-10-06 | CAPEEN: Image Captioning with Early Exits and Knowledge Distillation | Divya Jyoti Bajpai et.al. | 2410.04433 | link | Kimi |
60 | 2024-10-06 | DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs | Divya Jyoti Bajpai et.al. | 2410.04424 | link | Kimi |
61 | 2024-10-03 | Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis | Zikun Zhang et.al. | 2410.02321 | null | Kimi |
62 | 2024-10-03 | Global dynamical structures from infinitesimal data | Benjamin McInroe et.al. | 2410.02111 | null | Kimi |
63 | 2024-10-02 | CHASE-SQL: Multi-Path Reasoning and Preference Optimized Candidate Selection in Text-to-SQL | Mohammadreza Pourreza et.al. | 2410.01943 | null | Kimi |
64 | 2024-10-02 | Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension | Zaiquan Yang et.al. | 2410.01544 | null | Kimi |
65 | 2024-10-01 | Timber! Poisoning Decision Trees | Stefano Calzavara et.al. | 2410.00862 | null | Kimi |
66 | 2024-09-30 | Inference of water waves surface elevation from horizontal velocity components using physics informed neural networks (PINN) | Omar Sallam et.al. | 2409.19851 | null | Kimi |
67 | 2024-09-27 | Improving Visual Object Tracking through Visual Prompting | Shih-Fang Chen et.al. | 2409.18901 | link | Kimi |
68 | 2024-09-24 | Reinforcement Leaning for Infinite-Dimensional Systems | Wei Zhang et.al. | 2409.15737 | null | Kimi |
69 | 2024-10-03 | Normalized Narrow Jump To Conclusions: Normalized Narrow Shortcuts for Parameter Efficient Early Exit Transformer Prediction | Amrit Diggavi Seshadri et.al. | 2409.14091 | null | Kimi |
70 | 2024-09-21 | Multiple-Exit Tuning: Towards Inference-Efficient Adaptation for Vision Transformer | Zheng Liu et.al. | 2409.13999 | null | Kimi |
71 | 2024-09-18 | Particle-based Instance-aware Semantic Occupancy Mapping in Dynamic Environments | Gang Chen et.al. | 2409.11975 | link | Kimi |
72 | 2024-09-17 | UniLCD: Unified Local-Cloud Decision-Making via Reinforcement Learning | Kathakoli Sengupta et.al. | 2409.11403 | null | Kimi |
73 | 2024-09-16 | Improving Multi-candidate Speculative Decoding | Xiaofan Lu et.al. | 2409.10644 | link | Kimi |
74 | 2024-09-14 | Group Sequential Testing of a Treatment Effect Using a Surrogate Marker | Layla Parast et.al. | 2409.09440 | link | Kimi |
75 | 2024-09-13 | Exploring System-Heterogeneous Federated Learning with Dynamic Model Selection | Dixi Yao et.al. | 2409.08858 | null | Kimi |
76 | 2024-09-11 | AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge | Han Wang et.al. | 2409.07394 | link | Kimi |
77 | 2024-09-11 | From optimal score matching to optimal sampling | Zehao Dou et.al. | 2409.07032 | null | Kimi |
78 | 2024-09-10 | Noisy Early Stopping for Noisy Labels | William Toner et.al. | 2409.06830 | null | Kimi |
79 | 2024-09-10 | Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds | Mu Cai et.al. | 2409.06827 | link | Kimi |
80 | 2024-08-26 | Optimizing STAR Aligner for High Throughput Computing in the Cloud | Piotr Kica et.al. | 2409.05886 | null | Kimi |
81 | 2024-09-09 | Early-exit Convolutional Neural Networks | Edanur Demir et.al. | 2409.05336 | link | Kimi |
82 | 2024-09-08 | Attention-Based Efficient Breath Sound Removal in Studio Audio Recordings | Nidula Elgiriyewithana et.al. | 2409.04949 | null | Kimi |
83 | 2024-09-16 | RTop-K: Ultra-Fast Row-Wise Top-K Algorithm and GPU Implementation for Neural Networks | Xi Xie et.al. | 2409.00822 | null | Kimi |
84 | 2024-08-30 | Dynamic Self-Consistency: Leveraging Reasoning Paths for Efficient LLM Sampling | Guangya Wan et.al. | 2408.17017 | null | Kimi |
85 | 2024-08-24 | Inferring the shape of a solid inside a draining tank from its liquid level dynamics | Gbenga Fabusola et.al. | 2408.14503 | null | Kimi |
86 | 2024-08-26 | Re-Mix: Optimizing Data Mixtures for Large Scale Imitation Learning | Joey Hejna et.al. | 2408.14037 | link | Kimi |
87 | 2024-08-24 | Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning | Xinglin Wang et.al. | 2408.13457 | null | Kimi |
88 | 2024-08-24 | Face Clustering via Early Stopping and Edge Recall | Junjie Liu et.al. | 2408.13431 | link | Kimi |
89 | 2024-08-21 | Critique-out-Loud Reward Models | Zachary Ankner et.al. | 2408.11791 | link | Kimi |
90 | 2024-08-21 | EEG-Defender: Defending against Jailbreak through Early Exit Generation of Large Language Models | Chongwen Zhao et.al. | 2408.11308 | null | Kimi |
91 | 2024-08-20 | Inferring Underwater Topography with FINN | Coşku Can Horuz et.al. | 2408.10649 | null | Kimi |
92 | 2024-08-15 | An Efficient Continuous Control Perspective for Reinforcement-Learning-based Sequential Recommendation | Jun Wang et.al. | 2408.08047 | null | Kimi |
93 | 2024-08-14 | Rethinking the Key Factors for the Generalization of Remote Sensing Stereo Matching Networks | Liting Jiang et.al. | 2408.07613 | null | Kimi |
94 | 2024-08-12 | HeLiMOS: A Dataset for Moving Object Segmentation in 3D Point Clouds From Heterogeneous LiDAR Sensors | Hyungtae Lim et.al. | 2408.06328 | null | Kimi |
95 | 2024-08-12 | Transfer learning of state-based potential games for process optimization in decentralized manufacturing systems | Steve Yuwono et.al. | 2408.05992 | null | Kimi |
96 | 2024-08-12 | A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models | Taehong Moon et.al. | 2408.05927 | link | Kimi |
97 | 2024-08-08 | Early-Exit meets Model-Distributed Inference at Edge Networks | Marco Colocrese et.al. | 2408.05247 | null | Kimi |
98 | 2024-08-09 | PriPHiT: Privacy-Preserving Hierarchical Training of Deep Neural Networks | Yamin Sepehri et.al. | 2408.05092 | null | Kimi |
99 | 2024-08-09 | Early Exit Strategies for Approximate k-NN Search in Dense Retrieval | Francesco Busolin et.al. | 2408.04981 | null | Kimi |
100 | 2024-08-07 | Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling | Zilyu Ye et.al. | 2408.03695 | link | Kimi |