Skip to content

🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)

License

Notifications You must be signed in to change notification settings

Ther-nullptr/circult-eda-mlsys-tinyml-arxiv-daily

 
 

Repository files navigation

Contributors Forks Stargazers Issues

Updated on 2025.01.03

Usage instructions: here

Table of Contents
  1. Quantization
  2. Pruning
  3. Hardware-Software Co-Design
  4. TinyML
  5. Domain Specific Accelerator
  6. Low-Rank Adaptation
  7. Model Compression

Quantization

Publish Date Title Authors PDF Code
2024-12-30 Improving Acoustic Scene Classification in Low-Resource Conditions Zhi Chen et.al. 2412.20722 null
2024-12-29 PTQ4VM: Post-Training Quantization for Visual Mamba Younghyun Cho et.al. 2412.20386 null
2024-12-28 IMSSA: Deploying modern state-space models on memristive in-memory compute hardware Sebastian Siegel et.al. 2412.20215 null
2024-12-27 Data-Free Group-Wise Fully Quantized Winograd Convolution via Learnable Scales Shuokai Pan et.al. 2412.19867 null
2024-12-27 MBQ: Modality-Balanced Quantization for Large Vision-Language Models Shiyao Li et.al. 2412.19509 link
2024-12-24 Unified Stochastic Framework for Neural Network Quantization and Pruning Haoyu Zhang et.al. 2412.18184 null
2024-12-21 TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models Haocheng Huang et.al. 2412.16700 null
2024-12-20 Improving Quantization-aware Training of Low-Precision Network via Block Replacement on Full-Precision Counterpart Chengting Yu et.al. 2412.15846 null
2024-12-19 Progressive Fine-to-Coarse Reconstruction for Accurate Low-Bit Post-Training Quantization in Vision Transformers Rui Ding et.al. 2412.14633 null
2024-12-19 Qua $^2$ SeDiMo: Quantifiable Quantization Sensitivity of Diffusion Models Keith G. Mills et.al. 2412.14628 null
2024-12-18 ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals Utkarsh Saxena et.al. 2412.14363 link
2024-12-15 Efficient Quantization-Aware Training on Segment Anything Model in Medical Images and Its Deployment Haisheng Lu et.al. 2412.11186 link
2024-12-13 TTAQ: Towards Stable Post-training Quantization in Continuous Domain Adaptation Junrui Xiao et.al. 2412.09899 null
2024-12-12 CRVQ: Channel-relaxed Vector Quantization for Extreme Compression of LLMs Yuzhuang Xu et.al. 2412.09282 null
2024-12-10 Post-Training Non-Uniform Quantization for Convolutional Neural Networks Ahmed Luqman et.al. 2412.07391 null
2024-12-09 FP=xINT:A Low-Bit Series Expansion Algorithm for Post-Training Quantization Boyang Zhang et.al. 2412.06865 null
2024-12-09 Efficiency Meets Fidelity: A Novel Quantization Framework for Stable Diffusion Shuaiting Li et.al. 2412.06661 null
2024-12-07 GAQAT: gradient-adaptive quantization-aware training for domain generalization Jiacheng Jiang et.al. 2412.05551 null
2024-12-07 SKIM: Any-bit Quantization Pushing The Limits of Post-Training Quantization Runsheng Bai et.al. 2412.04180 null
2024-12-05 Quantized and Interpretable Learning Scheme for Deep Neural Networks in Classification Task Alireza Maleki et.al. 2412.03915 null
2024-12-03 CPTQuant - A Novel Mixed Precision Post-Training Quantization Techniques for Large Language Models Amitash Nanda et.al. 2412.03599 null
2024-11-26 Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving Jon Gutiérrez-Zaballa et.al. 2411.17543 null
2024-12-03 PassionSR: Post-Training Quantization with Adaptive Scale in One-Step Diffusion based Image Super-Resolution Libo Zhu et.al. 2411.17106 link
2024-11-23 freePruner: A Training-free Approach for Large Multimodal Model Acceleration Bingxin Xu et.al. 2411.15446 null
2024-11-22 FLARE: FP-Less PTQ and Low-ENOB ADC Based AMS-PiM for Error-Resilient, Fast, and Efficient Transformer Acceleration Donghyeon Yi et.al. 2411.14733 null
2024-11-17 EfQAT: An Efficient Framework for Quantization-Aware Training Saleh Ashkboos et.al. 2411.11038 null
2024-11-12 ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization Weibo Zhao et.al. 2411.07762 null
2024-11-09 Optimizing Large Language Models through Quantization: A Comparative Analysis of PTQ and QAT Techniques Jahid Hasan et.al. 2411.06084 null
2024-11-08 SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models Muyang Li et.al. 2411.05007 link
2024-11-30 Scaling Laws for Precision Tanishq Kumar et.al. 2411.04330 null
2024-11-06 Interactions Across Blocks in Post-Training Quantization of Large Language Models Khasmamad Shabanovi et.al. 2411.03934 null
2024-11-06 An Edge Computing-Based Solution for Real-Time Leaf Disease Classification using Thermal Imaging Públio Elon Correa da Silva et.al. 2411.03835 link
2024-11-06 TATAA: Programmable Mixed-Precision Transformer Acceleration with a Transformable Arithmetic Architecture Jiajun Wu et.al. 2411.03697 null
2024-10-29 Data Generation for Hardware-Friendly Post-Training Quantization Lior Dikstein et.al. 2410.22110 link
2024-10-30 IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models Hang Guo et.al. 2410.21759 link
2024-10-26 DQRM: Deep Quantized Recommendation Models Yang Zhou et.al. 2410.20046 link
2024-10-14 Real-Time Stress Detection via Photoplethysmogram Signals: Implementation of a Combined Continuous Wavelet Transform and Convolutional Neural Network on Resource-Constrained Microcontrollers Yasin Hasanpoor et.al. 2410.19776 null
2024-10-24 TesseraQ: Ultra Low-Bit LLM Post-Training Quantization with Block Reconstruction Yuhang Li et.al. 2410.19103 null
2024-10-18 Understanding the difficulty of low-precision post-training quantization of large language models Zifei Xu et.al. 2410.14570 null
2024-10-17 Quamba: A Post-Training Quantization Recipe for Selective State Space Models Hung-Yueh Chiang et.al. 2410.13229 link
2024-10-17 Scaling laws for post-training quantized large language models Zifei Xu et.al. 2410.12119 null
2024-10-15 Error Diffusion: Post Training Quantization with Block-Scaled Number Formats for Neural Networks Alireza Khodamoradi et.al. 2410.11203 link
2024-10-06 Continuous Approximations for Improving Quantization Aware Training of LLMs He Li et.al. 2410.10849 null
2024-10-12 SLiM: One-shot Quantized Sparse Plus Low-rank Approximation of LLMs Mohammad Mozaffari et.al. 2410.09615 link
2024-10-12 FlatQuant: Flatness Matters for LLM Quantization Yuxuan Sun et.al. 2410.09426 link
2024-10-10 Q-VLM: Post-training Quantization for Large Vision-Language Models Changyuan Wang et.al. 2410.08119 link
2024-10-10 Post-Training Quantization in Brain-Computer Interfaces based on Event-Related Potential Detection Hubert Cecotti et.al. 2410.07920 null
2024-10-10 CrossQuant: A Post-Training Quantization Method with Smaller Quantization Kernel for Precise Large Language Model Compression Wenyuan Liu et.al. 2410.07505 null
2024-10-09 Scaling Laws for Mixed quantization in Large Language Models Zeyu Cao et.al. 2410.06722 null
2024-10-08 QERA: an Analytical Framework for Quantization Error Reconstruction Cheng Zhang et.al. 2410.06040 null
2024-10-08 QT-DoG: Quantization-aware Training for Domain Generalization Saqib Javed et.al. 2410.06020 link
2024-10-10 ARB-LLM: Alternating Refined Binarizations for Large Language Models Zhiteng Li et.al. 2410.03129 link
2024-10-03 Lightweight Diffusion Models for Resource-Constrained Semantic Communication Giovanni Pignata et.al. 2410.02491 link
2024-10-01 Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging Ismail Erbas et.al. 2410.00948 null
2024-09-30 Constraint Guided Model Quantization of Neural Networks Quinten Van Baelen et.al. 2409.20138 null
2024-09-26 P4Q: Learning to Prompt for Quantization in Visual-language Models Huixin Sun et.al. 2409.17634 null
2024-09-25 Accumulator-Aware Post-Training Quantization Ian Colbert et.al. 2409.17092 null
2024-09-25 VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models Yifei Liu et.al. 2409.17066 link
2024-09-25 PTQ4RIS: Post-Training Quantization for Referring Image Segmentation Xiaoyan Jiang et.al. 2409.17020 link
2024-09-26 INT-FlashAttention: Enabling Flash Attention for INT8 Quantization Shimao Chen et.al. 2409.16997 link
2024-09-20 PTQ4ADM: Post-Training Quantization for Efficient Text Conditional Audio Diffusion Models Jayneel Vora et.al. 2409.13894 null
2024-09-18 Art and Science of Quantizing Large-Scale Models: A Comprehensive Overview Yanshu Wang et.al. 2409.11650 null
2024-09-12 LlamaF: An Efficient Llama2 Architecture Accelerator on Embedded FPGAs Han Xu et.al. 2409.11424 null
2024-09-12 DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing Zhenyuan Dong et.al. 2409.07756 link
2024-08-31 Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization Vage Egiazarian et.al. 2409.00492 null
2024-08-29 A machine learning approach for computing solar flare locations in X-rays on-board Solar Orbiter/STIX Paolo Massa et.al. 2408.16642 link
2024-08-29 On-device AI: Quantization-aware Training of Transformers in Time-Series Tianheng Ling et.al. 2408.16495 null
2024-08-27 The Uniqueness of LLaMA3-70B with Per-Channel Quantization: An Empirical Study Minghai Qin et.al. 2408.15301 null
2024-08-25 MobileQuant: Mobile-friendly Quantization for On-device Language Models Fuwen Tan et.al. 2408.13933 link
2024-08-25 Infrared Domain Adaptation with Zero-Shot Quantization Burak Sevsay et.al. 2408.13925 null
2024-08-23 ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models Chao Zeng et.al. 2408.08554 link
2024-08-14 Analog Spiking Neuron in CMOS 28 nm Towards Large-Scale Neuromorphic Processors Marwan Besrour et.al. 2408.07734 null
2024-08-13 Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models Cheng Chen et.al. 2408.06995 null
2024-08-11 RTF-Q: Unsupervised domain adaptation based retraining-free quantization network Nanyang Du et.al. 2408.05752 null
2024-08-16 DopQ-ViT: Towards Distribution-Friendly and Outlier-Aware Post-Training Quantization for Vision Transformers Lianwei Yang et.al. 2408.03291 null
2024-08-05 HQOD: Harmonious Quantization for Object Detection Long Huang et.al. 2408.02561 link
2024-08-01 Reclaiming Residual Knowledge: A Novel Paradigm to Low-Bit Quantization Róisín Luo et.al. 2408.00923 null
2024-08-07 Temporal Feature Matters: A Framework for Diffusion Model Quantization Yushi Huang et.al. 2407.19547 null
2024-07-25 Unlocking Tokens as Data Points for Generalization Bounds on Larger Language Models Sanae Lotfi et.al. 2407.18158 null
2024-07-27 MetaAug: Meta-Data Augmentation for Post-Training Quantization Cuong Pham et.al. 2407.14726 link
2024-07-17 AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer Zhuguanyu Wu et.al. 2407.12951 link
2024-07-17 Mamba-PTQ: Outlier Channels in Recurrent Large Language Models Alessandro Pierro et.al. 2407.12397 null
2024-07-17 StoX-Net: Stochastic Processing of Partial Sums for Efficient In-Memory Computing DNN Accelerators Ethan G Rogers et.al. 2407.12378 null
2024-07-17 Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models Ayush Kaushal et.al. 2407.12327 link
2024-07-17 QVD: Post-training Quantization for Video Diffusion Models Shilong Tian et.al. 2407.11585 null
2024-07-16 LRQ: Optimizing Post-Training Quantization for Large Language Models by Learning Low-Rank Weight-Scaling Matrices Jung Hyun Lee et.al. 2407.11534 link
2024-07-11 Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients Zhenyu Zhang et.al. 2407.08296 link
2024-07-10 RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization Xijie Huang et.al. 2407.08044 link

(back to top)

Pruning

Publish Date Title Authors PDF Code
2024-12-24 SlimGPT: Layer-wise Structured Pruning for Large Language Models Gui Ling et.al. 2412.18110 null
2024-12-23 GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference Chao Zeng et.al. 2412.17560 null
2024-12-28 Lillama: Large Language Models Compression via Low-Rank Feature Distillation Yaya Sy et.al. 2412.16719 null
2024-12-21 V"Mean"ba: Visual State Space Models only need 1 hidden dimension Tien-Yu Chi et.al. 2412.16602 null
2024-12-20 Less is More: Towards Green Code Large Language Models via Unified Structural Pruning Guang Yang et.al. 2412.15921 null
2024-12-20 All-in-One Tuning and Structural Pruning for Domain-Specific LLMs Lei Lu et.al. 2412.14426 null
2024-12-17 Learning Coarse-to-Fine Pruning of Graph Convolutional Networks for Skeleton-based Recognition Hichem Sahbi et.al. 2412.12887 null
2024-12-17 A Comparative Study of Pruning Methods in Transformer-based Time Series Forecasting Nicholas Kiefer et.al. 2412.12883 null
2024-12-17 Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation Dongyue Wu et.al. 2412.12672 link
2024-12-19 RemoteTrimmer: Adaptive Structural Pruning for Remote Sensing Image Classification Guangwenjie Zou et.al. 2412.12603 link
2024-12-16 Designing Semi-Structured Pruning of Graph Convolutional Networks for Skeleton-based Recognition Hichem Sahbi et.al. 2412.11813 null
2024-12-16 QPruner: Probabilistic Decision Quantization for Structured Pruning in Large Language Models Changhai Zhou et.al. 2412.11629 null
2024-12-09 LLM-BIP: Structured Pruning for Large Language Models with Block-Wise Forward Importance Propagation Haihang Wu et.al. 2412.06419 null
2024-12-03 Effortless Efficiency: Low-Cost Pruning of Diffusion Models Yang Zhang et.al. 2412.02852 null
2024-11-25 Deep Convolutional Neural Networks Structured Pruning via Gravity Regularization Abdesselam Ferdi et.al. 2411.16901 null
2024-11-21 FuseGPT: Learnable Layers Fusion of Generative Pre-trained Transformers Zehua Pei et.al. 2411.14507 null
2024-11-21 Layer Pruning with Consensus: A Triple-Win Solution Leandro Giusti Mugnaini et.al. 2411.14345 link
2024-11-21 DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization Hexuan Deng et.al. 2411.14055 link
2024-11-19 FGP: Feature-Gradient-Prune for Efficient Convolutional Layer Pruning Qingsong Lv et.al. 2411.12781 link
2024-11-17 Electrostatic Force Regularization for Neural Structured Pruning Abdesselam Ferdi et.al. 2411.11079 null
2024-11-15 Systolic Arrays and Structured Pruning Co-design for Efficient Transformers in Edge Systems Pedro Palacios et.al. 2411.10285 null
2024-12-16 P $^2$ Law: Scaling Law for Post-Training After Model Pruning Xiaodong Chen et.al. 2411.10272 null
2024-11-10 RL-Pruner: Structured Pruning Using Reinforcement Learning for CNN Compression and Acceleration Boyao Wang et.al. 2411.06463 link
2024-11-05 Layer-Adaptive State Pruning for Deep State Space Models Minseon Gwak et.al. 2411.02824 link
2024-11-04 Automatic Structured Pruning for Efficient Architecture in Federated Learning Thai Vu Nguyen et.al. 2411.01759 link
2024-10-31 Mutual Information Preserving Neural Network Pruning Charles Westphal et.al. 2411.00147 null
2024-10-24 Tailored-LLaMA: Optimizing Few-Shot Learning in Pruned LLaMA Models with Task-Specific Prompts Danyal Aftab et.al. 2410.19185 null
2024-10-18 EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search Oliver Sieberling et.al. 2410.14649 link
2024-11-04 DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models Shangqian Gao et.al. 2410.11988 null
2024-11-12 Self-Data Distillation for Recovering Quality in Pruned Large Language Models Vithursan Thangarasa et.al. 2410.09982 null
2024-10-11 Unity is Power: Semi-Asynchronous Collaborative Training of Large-Scale Models with Structured Pruning in Resource-Limited Clients Yan Li et.al. 2410.08457 null
2024-10-11 Chip-Tuning: Classify Before Language Models Say Fangwei Zhu et.al. 2410.06541 link
2024-11-04 Large Language Model Compression with Neural Architecture Search Rhea Sanjay Sukthanker et.al. 2410.06479 null
2024-09-29 Investigating the Effect of Network Pruning on Performance and Interpretability Jonathan von Rad et.al. 2409.19727 null
2024-10-30 Search for Efficient Large Language Models Xuan Shen et.al. 2409.17372 link
2024-09-22 SPAQ-DL-SLAM: Towards Optimizing Deep Learning-based SLAM for Resource-Constrained Embedded Platforms Niraj Pudasaini et.al. 2409.14515 null
2024-09-20 CFSP: An Efficient Structured Pruning Framework for LLMs with Coarse-to-Fine Activation Information Yuxin Wang et.al. 2409.13199 link
2024-09-17 KVPruner: Structural Pruning for Faster and Memory-Efficient Large Language Models Bo Lv et.al. 2409.11057 null
2024-09-11 HESSO: Towards Automatic Efficient and User Friendly Any Neural Network Training and Pruning Tianyi Chen et.al. 2409.09085 link
2024-09-12 Structured Pruning for Efficient Visual Place Recognition Oliver Grainge et.al. 2409.07834 null
2024-09-10 STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning Jaeseong Lee et.al. 2409.06211 null
2024-09-05 TropNNC: Structured Neural Network Compression Using Tropical Geometry Konstantinos Fotopoulos et.al. 2409.03945 null
2024-09-02 Edge AI: Evaluation of Model Compression Techniques for Convolutional Neural Networks Samer Francy et.al. 2409.02134 null
2024-08-27 PAT: Pruning-Aware Tuning for Large Language Models Yijiang Liu et.al. 2408.14721 link
2024-08-15 PQV-Mobile: A Combined Pruning and Quantization Toolkit to Optimize Vision Transformers for Mobile Applications Kshitij Bhardwaj et.al. 2408.08437 link
2024-08-13 Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models Chenqian Yan et.al. 2408.06646 null
2024-08-06 Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression Jonas Schmitt et.al. 2408.03046 link
2024-08-02 Sustainable Diffusion-based Incentive Mechanism for Generative AI-driven Digital Twins in Industrial Cyber-Physical Systems Jinbo Wen et.al. 2408.01173 null
2024-08-22 Diff-Cleanse: Identifying and Mitigating Backdoor Attacks in Diffusion Models Jiang Hao et.al. 2407.21316 link
2024-07-26 Greedy Output Approximation: Towards Efficient Structured Pruning for LLMs Without Retraining Jianwei Li et.al. 2407.19126 null
2024-07-17 MCU-MixQ: A HW/SW Co-optimized Mixed-precision Neural Network Design Framework for MCUs Junfeng Gong et.al. 2407.18267 null
2024-07-24 (PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork Tianjin Huang et.al. 2407.17412 null
2024-07-22 Comprehensive Study on Performance Evaluation and Optimization of Model Compression: Bridging Traditional Deep Learning and Large Language Models Aayush Saxena et.al. 2407.15904 null
2024-07-19 Shapley Pruning for Neural Network Compression Kamil Adamczewski et.al. 2407.15875 null
2024-07-22 A Pairwise Comparison Relation-assisted Multi-objective Evolutionary Neural Architecture Search Method with Multi-population Mechanism Yu Xue et.al. 2407.15600 null
2024-07-19 Straightforward Layer-wise Pruning for More Efficient Visual Adaptation Ruizi Han et.al. 2407.14330 null
2024-07-18 Data-Algorithm-Architecture Co-Optimization for Fair Neural Networks on Skin Lesion Dataset Yi Sheng et.al. 2407.13896 null
2024-07-18 Reconstruct the Pruned Model without Any Retraining Pingjie Wang et.al. 2407.13331 null
2024-07-18 MO-EMT-NAS: Multi-Objective Continuous Transfer of Architectural Knowledge Between Tasks from Different Datasets Peng Liao et.al. 2407.13122 null
2024-07-16 MINI-LLM: Memory-Efficient Structured Pruning for Large Language Models Hongrong Cheng et.al. 2407.11681 null
2024-07-15 DDFAD: Dataset Distillation Framework for Audio Data Wenbo Jiang et.al. 2407.10446 null

(back to top)

Hardware-Software Co-Design

Publish Date Title Authors PDF Code
2024-12-29 A Novel FPGA-based CNN Hardware Accelerator: Optimization for Convolutional Layers using Karatsuba Ofman Multiplier Amit Sarkar et.al. 2412.20393 null
2024-12-29 Open-Source Heterogeneous SoCs for AI: The PULP Platform Experience Francesco Conti et.al. 2412.20391 null
2024-12-27 HADES: Hardware Accelerated Decoding for Efficient Speculation in Large Language Models Ze Yang et.al. 2412.19925 null
2024-12-26 Evolution, Challenges, and Optimization in Computer Architecture: The Role of Reconfigurable Systems Jefferson Ederhion et.al. 2412.19234 null
2024-12-24 GCN-ABFT: Low-Cost Online Error Checking for Graph Convolutional Networks Christodoulos Peltekis et.al. 2412.18534 null
2024-12-23 Advantages of density in tensor network geometries for gradient based training Sergi Masot-Llima et.al. 2412.17497 null
2024-12-20 Chorba: A novel CRC32 implementation Sam Russell et.al. 2412.16398 null
2024-12-20 Designing Visual Explanations and Learner Controls to Engage Adolescents in AI-Supported Exercise Selection Jeroen Ooge et.al. 2412.16034 null
2024-12-20 A survey on FPGA-based accelerator for ML models Feng Yan et.al. 2412.15666 null
2024-12-19 LiDAR-RT: Gaussian-based Ray Tracing for Dynamic LiDAR Re-simulation Chenxu Zhou et.al. 2412.15199 null
2024-12-18 Pattern Matching in AI Compilers and its Formalization (Extended Version) Joseph W. Cutler et.al. 2412.13398 null
2024-12-17 if-ZKP: Intel FPGA-Based Acceleration of Zero Knowledge Proofs Shahzad Ahmad Butt et.al. 2412.12481 null
2024-12-13 Strong Structural Bounds for MaxSAT: The Fine Details of Using Neuromorphic and Quantum Hardware Accelerators Max Bannach et.al. 2412.10289 null
2024-12-16 MVQ:Towards Efficient DNN Compression and Acceleration with Masked Vector Quantization Shuaiting Li et.al. 2412.10261 null
2024-12-12 MPAX: Mathematical Programming in JAX Haihao Lu et.al. 2412.09734 link
2024-12-12 Evaluating the Potential of In-Memory Processing to Accelerate Homomorphic Encryption Mpoki Mwaisela et.al. 2412.09144 null
2024-12-12 Analyzing Practical Policies for Multiresource Job Scheduling Zhongrui Chen et.al. 2412.08915 null
2024-12-09 LLM-BIP: Structured Pruning for Large Language Models with Block-Wise Forward Importance Propagation Haihang Wu et.al. 2412.06419 null
2024-12-03 Demonstrating the Advantages of Analog Wafer-Scale Neuromorphic Hardware Hartmut Schmidt et.al. 2412.02619 null
2024-12-03 Multi-timescale synaptic plasticity on analog neuromorphic hardware Amani Atoui et.al. 2412.02515 null
2024-11-27 Deterministic and Probabilistic Rounding Error Analysis for Mixed-Precision Arithmetic on Modern Computing Units Sahil Bhola et.al. 2411.18747 null
2024-11-26 Scalable iterative pruning of large language and vision models using block coordinate descent Gili Rosenberg et.al. 2411.17796 null
2024-11-25 Limitations of tensor network approaches for optimization and sampling: A comparison against quantum and classical Ising machines Anna Maria Dziubyna et.al. 2411.16431 null
2024-11-25 MixPE: Quantization and Hardware Co-design for Efficient LLM Inference Yu Zhang et.al. 2411.16158 null
2024-11-20 Hardware Accelerators for Artificial Intelligence S M Mojahidul Ahsan et.al. 2411.13717 null
2024-11-20 Hardware Scaling Trends and Diminishing Returns in Large-Scale Distributed Training Jared Fernandez et.al. 2411.13055 null
2024-11-19 FGP: Feature-Gradient-Prune for Efficient Convolutional Layer Pruning Qingsong Lv et.al. 2411.12781 link
2024-11-19 Design of an FPGA-Based Neutral Atom Rearrangement Accelerator for Quantum Computing Xiaorang Guo et.al. 2411.12401 null
2024-11-18 SILVIA: Automated Superword-Level Parallelism Exploitation via HLS-Specific LLVM Passes for Compute-Intensive FPGA Accelerators Giovanni Brignone et.al. 2411.11384 link
2024-12-01 InvestESG: A multi-agent reinforcement learning benchmark for studying climate investment as a social dilemma Xiaoxuan Hou et.al. 2411.09856 link
2024-11-21 OpenGeMM: A High-Utilization GeMM Accelerator Generator with Lightweight RISC-V Control and Tight Memory Coupling Xiaoling Yi et.al. 2411.09543 null
2024-11-15 Communication Compression for Tensor Parallel LLM Inference Jan Hansen-Palmus et.al. 2411.09510 null
2024-11-18 RPCAcc: A High-Performance and Reconfigurable PCIe-attached RPC Accelerator Jie Zhang et.al. 2411.07632 null
2024-11-11 Spiking Transformer Hardware Accelerators in 3D Integration Boxun Xu et.al. 2411.07397 null
2024-11-10 AMAZE: Accelerated MiMC Hardware Architecture for Zero-Knowledge Applications on the Edge Anees Ahmed et.al. 2411.06350 link
2024-11-03 Stochastic Communication Avoidance for Recommendation Systems Lutfi Eren Erdogan et.al. 2411.01611 null
2024-11-01 Inducing Semi-Structured Sparsity by Masking for Efficient Model Inference in Convolutional Networks David A. Danhofer et.al. 2411.00288 null
2024-10-31 LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators Krishna Teja Chitty-Venkata et.al. 2411.00136 link
2024-10-30 Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks Michael Matthews et.al. 2410.23208 link
2024-10-24 Watermarking Large Language Models and the Generated Content: Opportunities and Challenges Ruisi Zhang et.al. 2410.19096 null
2024-10-21 Hacking the Fabric: Targeting Partial Reconfiguration for Fault Injection in FPGA Fabrics Jayeeta Chaudhuri et.al. 2410.16497 null
2024-10-21 Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure Juan Cruz-Martinez et.al. 2410.16248 null
2024-10-20 A Remedy to Compute-in-Memory with Dynamic Random Access Memory: 1FeFET-1C Technology for Neuro-Symbolic AI Xunzhao Yin et.al. 2410.15296 null
2024-10-18 Self-Satisfied: An end-to-end framework for SAT generation and prediction Christopher R. Serrano et.al. 2410.14888 null
2024-10-17 Quamba: A Post-Training Quantization Recipe for Selective State Space Models Hung-Yueh Chiang et.al. 2410.13229 link
2024-10-16 Mixed-precision finite element kernels and assembly: Rounding error analysis and hardware acceleration M. Croci et.al. 2410.12614 link
2024-10-15 Fast Local Neural Regression for Low-Cost, Path Traced Lambertian Global Illumination Arturo Salmi et.al. 2410.11625 null
2024-10-15 Efficiera Residual Networks: Hardware-Friendly Fully Binary Weight with 2-bit Activation Model Achieves Practical ImageNet Accuracy Shuntaro Takahashi et.al. 2410.11553 link
2024-10-14 Differentiable Weightless Neural Networks Alan T. L. Bacellar et.al. 2410.11112 link
2024-10-14 SLaNC: Static LayerNorm Calibration Mahsa Salmani et.al. 2410.10553 null
2024-10-11 MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices Mohamed Amine Hamdi et.al. 2410.08855 link
2024-10-09 Optimized Spatial Architecture Mapping Flow for Transformer Accelerators Haocheng Xu et.al. 2410.07407 null
2024-10-09 Unlocking Real-Time Fluorescence Lifetime Imaging: Multi-Pixel Parallelism for FPGA-Accelerated Processing Ismail Erbas et.al. 2410.07364 null
2024-10-03 CAX: Cellular Automata Accelerated in JAX Maxence Faldor et.al. 2410.02651 link
2024-10-03 Extracting the Potential of Emerging Hardware Accelerators for Symmetric Eigenvalue Decomposition Hansheng Wang et.al. 2410.02170 null
2024-10-01 Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging Ismail Erbas et.al. 2410.00948 null
2024-09-26 Leader Selection and Follower Association for UE-centric Distributed Learning in Future Wireless Networks Saeedeh Parsaeefard et.al. 2409.18268 null
2024-09-26 A 5T-2MTJ STT-assisted Spin Orbit Torque based Ternary Content Addressable Memory for Hardware Accelerators Siri Narla et.al. 2409.17863 null
2024-09-24 Microsecond-Latency Feedback at a Particle Accelerator by Online Reinforcement Learning on Hardware Luca Scomparin et.al. 2409.16177 null
2024-09-25 Ultra-low latency quantum-inspired machine learning predictors implemented on FPGA Lorenzo Borella et.al. 2409.16075 null
2024-09-19 Enhancing Performance and Scalability of Large-Scale Recommendation Systems with Jagged Flash Attention Rengan Xu et.al. 2409.15373 null
2024-09-23 Efficient Tabular Data Preprocessing of ML Pipelines Yu Zhu et.al. 2409.14912 null
2024-09-21 FAMOUS: Flexible Accelerator for the Attention Mechanism of Transformer on UltraScale+ FPGAs Ehsan Kabir et.al. 2409.14023 null
2024-09-21 ProTEA: Programmable Transformer Encoder Acceleration on FPGA Ehsan Kabir et.al. 2409.13975 null
2024-09-23 Towards Efficient Neuro-Symbolic AI: From Workload Characterization to Hardware Architecture Zishen Wan et.al. 2409.13153 null
2024-09-20 Learning to Compare Hardware Designs for High-Level Synthesis Yunsheng Bai et.al. 2409.13138 null
2024-09-19 Performance and Power: Systematic Evaluation of AI Workloads on Accelerators with CARAML Chelsea Maria John et.al. 2409.12994 link
2024-09-19 CrossRT: A cross platform programming technology for hardware-accelerated ray tracing in CG and CV applications Vladimir Frolov et.al. 2409.12617 null
2024-09-15 Pack my weights and run! Minimizing overheads for in-memory computing accelerators Pouya Houshmand et.al. 2409.11437 null
2024-09-11 Next-generation Probabilistic Computing Hardware with 3D MOSAICs, Illusion Scale-up, and Co-design Tathagata Srimani et.al. 2409.11422 null
2024-09-09 Hardware Acceleration of Kolmogorov-Arnold Network (KAN) for Lightweight Edge Inference Wei-Hsing Huang et.al. 2409.11418 null
2024-09-17 Dynamic Range Reduction via Branch-and-Bound Thore Gerlach et.al. 2409.10863 null
2024-09-16 Count2Multiply: Reliable In-memory High-Radix Counting João Paulo Cardoso de Lima et.al. 2409.10136 null
2024-09-16 Hardware-Accelerated Ray Tracing for Discrete and Continuous Collision Detection on GPUs Sizhe Sui et.al. 2409.09918 null
2024-09-13 Distributed Binary Optimization with In-Memory Computing: An Application for the SAT Problem Xiangyi Zhang et.al. 2409.09152 null
2024-09-13 Automatic Generation of Fast and Accurate Performance Models for Deep Neural Network Accelerators Konstantin Lübeck et.al. 2409.08595 null
2024-09-17 Foragax: An Agent-Based Modelling Framework Based on JAX Siddharth Chaturvedi et.al. 2409.06345 link
2024-09-10 PIM-MMU: A Memory Management Unit for Accelerating Data Transfers in Commercial PIM Systems Dongjae Lee et.al. 2409.06204 null
2024-09-06 Towards Narrowing the Generalization Gap in Deep Boolean Networks Youngsung Kim et.al. 2409.05905 null
2024-09-09 Supervised Learning for Stochastic Optimal Control Vince Kurtz et.al. 2409.05792 null
2024-09-08 BBS: Bi-directional Bit-level Sparsity for Deep Learning Acceleration Yuzong Chen et.al. 2409.05227 link
2024-09-05 Libra: Architectural Support For Principled, Secure And Efficient Balanced Execution On High-End Processors (Extended Version) Hans Winderix et.al. 2409.03743 null
2024-09-05 Hardware Acceleration of LLMs: A comprehensive survey and comparison Nikoletta Koilia et.al. 2409.03384 null
2024-09-05 Towards training digitally-tied analog blocks via hybrid gradient computation Timothy Nest et.al. 2409.03306 null
2024-08-30 The picasso gas model: Painting intracluster gas on gravity-only simulations F. Kéruzoré et.al. 2408.17445 link
2024-08-29 Serial and Parallel Two-Column Probing for Mixed-Integer Programming Yongzheng Dai et.al. 2408.16927 link
2024-08-29 On-device AI: Quantization-aware Training of Transformers in Time-Series Tianheng Ling et.al. 2408.16495 null
2024-08-29 Accelerating Image-based Pest Detection on a Heterogeneous Multi-core Microcontroller Luca Bompani et.al. 2408.15911 link
2024-08-28 FireFly-S: Exploiting Dual-Side Sparsity for Spiking Neural Networks Acceleration with Reconfigurable Spatial Architecture Tenglong Li et.al. 2408.15578 null
2024-08-29 CGRA4ML: A Framework to Implement Modern Neural Networks for Scientific Edge Computing G Abarajithan et.al. 2408.15561 null
2024-08-27 SCAN-Edge: Finding MobileNet-speed Hybrid Networks for Diverse Edge Devices via Hardware-Aware Evolutionary Search Hung-Yueh Chiang et.al. 2408.15395 null
2024-08-27 SiHGNN: Leveraging Properties of Semantic Graphs for Efficient HGNN Acceleration Runzhen Xue et.al. 2408.15089 null
2024-08-26 On-Chip Learning with Memristor-Based Neural Networks: Assessing Accuracy and Efficiency Under Device Variations, Conductance Errors, and Input Noise M. Reza Eslami et.al. 2408.14680 null
2024-08-26 HAPM -- Hardware Aware Pruning Method for CNN hardware accelerators in resource constrained devices Federico Nicolas Peccia et.al. 2408.14055 null
2024-08-22 Hardware Acceleration for Knowledge Graph Processing: Challenges & Recent Developments Maciej Besta et.al. 2408.12173 null
2024-08-21 Floating-Point Multiply-Add with Approximate Normalization for Low-Cost Matrix Engines Kosmas Alexandridis et.al. 2408.11997 null
2024-08-21 Cage: Hardware-Accelerated Safe WebAssembly Martin Fink et.al. 2408.11456 null
2024-08-20 Tapping in a Remote Vehicle's onboard LLM to Complement the Ego Vehicle's Field-of-View Malsha Ashani Mahawatta Dona et.al. 2408.10794 null
2024-08-16 Xpikeformer: Hybrid Analog-Digital Hardware Acceleration for Spiking Transformers Zihang Song et.al. 2408.08794 null
2024-08-16 Cross-Chip Partial Reconfiguration for the Initialisation of Modular and Scalable Heterogeneous Systems Marvin Fuchs et.al. 2408.08626 null
2024-08-13 HLSPilot: LLM-based High-Level Synthesis Chenwei Xiong et.al. 2408.06810 link
2024-08-12 Hardware Architecture Design of Model-Based Image Reconstruction Towards Palm-size Photoacoustic Tomography Yuwei Zheng et.al. 2408.06049 null
2024-08-12 SZKP: A Scalable Accelerator Architecture for Zero-Knowledge Proofs Alhad Daftardar et.al. 2408.05890 null
2024-08-10 LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale Jaehong Cho et.al. 2408.05499 link
2024-08-08 Noise-augmented Chaotic Ising Machines for Combinatorial Optimization and Sampling Kyle Lee et.al. 2408.04744 null
2024-08-07 Hardware-Assisted Virtualization of Neural Processing Units for Cloud Platforms Yuqi Xue et.al. 2408.04104 null
2024-08-07 Real-time Event Recognition of Long-distance Distributed Vibration Sensing with Knowledge Distillation and Hardware Acceleration Zhongyao Luo et.al. 2408.03647 link
2024-08-06 LLM-Aided Compilation for Tensor Accelerators Charles Hong et.al. 2408.03408 null
2024-08-06 HeTraX: Energy Efficient 3D Heterogeneous Manycore Architecture for Transformer Acceleration Pratyush Dhingra et.al. 2408.03397 null
2024-08-05 PENDRAM: Enabling High-Performance and Energy-Efficient Processing of Deep Neural Networks through a Generalized DRAM Data Mapping Policy Rachmad Vidya Wicaksana Putra et.al. 2408.02412 null
2024-08-02 Digitized Phase Change Material Heterostack for Diffractive Optical Neural Network Ruiyang Chen et.al. 2408.01404 null
2024-08-02 Search-in-Memory (SiM): Reliable, Versatile, and Efficient Data Matching in SSD's NAND Flash Memory Chip for Data Indexing Acceleration Yun-Chih Chen et.al. 2408.00327 null
2024-08-07 Temporal Feature Matters: A Framework for Diffusion Model Quantization Yushi Huang et.al. 2407.19547 null
2024-07-16 Latency optimized Deep Neural Networks (DNNs): An Artificial Intelligence approach at the Edge using Multiprocessor System on Chip (MPSoC) Seyed Nima Omidsajedi et.al. 2407.18264 null
2024-07-22 KWT-Tiny: RISC-V Accelerated, Embedded Keyword Spotting Transformer Aness Al-Qawlaq et.al. 2407.16026 null
2024-07-18 Integrated Hardware Architecture and Device Placement Search Irene Wang et.al. 2407.13143 link
2024-07-17 ARTEMIS: A Mixed Analog-Stochastic In-DRAM Accelerator for Transformer Neural Networks Salma Afifi et.al. 2407.12638 null
2024-07-17 StoX-Net: Stochastic Processing of Partial Sums for Efficient In-Memory Computing DNN Accelerators Ethan G Rogers et.al. 2407.12378 null
2024-07-16 Co-Designing Binarized Transformer and Hardware Accelerator for Efficient End-to-End Edge Deployment Yuhao Ji et.al. 2407.12070 null
2024-07-16 Ascend-CC: Confidential Computing on Heterogeneous NPU for Emerging Generative AI Workloads Aritra Dhar et.al. 2407.11888 null
2024-07-15 Hierarchical search method for gravitational waves from stellar-mass binary black holes in noisy space-based detector data Yao Fu et.al. 2407.10797 null
2024-07-14 Accelerator-as-a-Service in Public Clouds: An Intra-Host Traffic Management View for Performance Isolation in the Wild Jiechen Zhao et.al. 2407.10098 null
2024-07-12 68-Channel Highly-Integrated Neural Signal Processing PSoC with On-Chip Feature Extraction, Compression, and Hardware Accelerators for Neuroprosthetics in 22nm FDSOI Liyuan Guo et.al. 2407.09166 null
2024-07-12 Hybrid Temporal Computing for Lower Power Hardware Accelerators Maliha Tasnim et.al. 2407.08975 null

(back to top)

TinyML

Publish Date Title Authors PDF Code
2024-12-25 Tempus Core: Area-Power Efficient Temporal-Unary Convolution Core for Low-Precision Edge DLAs Prabhu Vellaisamy et.al. 2412.19002 null
2024-12-23 Edge-AI for Agriculture: Lightweight Vision Models for Disease Detection in Resource-Limited Settings Harsh Joshi et.al. 2412.18635 null
2024-12-23 tuGEMM: Area-Power-Efficient Temporal Unary GEMM Architecture for Low-Precision Edge AI Harideep Nair et.al. 2412.17966 null
2024-12-22 Fatigue Monitoring Using Wearables and AI: Trends, Challenges, and Future Opportunities Kourosh Kakhi et.al. 2412.16847 null
2024-12-19 ElectraSight: Smart Glasses with Fully Onboard Non-Invasive Eye Tracking Using Hybrid Contact and Contactless EOG Nicolas Schärer et.al. 2412.14848 null
2024-12-17 Design of an AI-Enhanced Digital Stethoscope: Advancing Cardiovascular Diagnostics Through Smart Auscultation Abraham G. Taye et.al. 2412.14206 null
2024-12-16 Flex-PE: Flexible and SIMD Multi-Precision Processing Element for AI Workloads Mukul Lokhande et.al. 2412.11702 link
2024-12-13 Edge AI-based Radio Frequency Fingerprinting for IoT Networks Ahmed Mohamed Hussain et.al. 2412.10553 null
2024-12-13 EI-Drive: A Platform for Cooperative Perception with Realistic Communication Models Hanchu Zhou et.al. 2412.09782 null
2024-12-12 Optimising TinyML with Quantization and Distillation of Transformer and Mamba Models for Indoor Localisation on Edge Devices Thanaphon Suwannaphong et.al. 2412.09289 null
2024-12-10 Performance Evaluation of ROS2-DDS middleware implementations facilitating Cooperative Driving in Autonomous Vehicle Sumit Paul et.al. 2412.07485 null
2024-12-07 Innovative Sentiment Analysis and Prediction of Stock Price Using FinBERT, GPT-4 and Logistic Regression: A Data-Driven Approach Olamilekan Shobayo et.al. 2412.06837 null
2024-12-09 DEX: Data Channel Extension for Efficient CNN Inference on Tiny AI Accelerators Taesik Gong et.al. 2412.06566 link
2024-12-09 Sequential Printed MLP Circuits for Super TinyML Multi-Sensory Applications Gurol Saglam et.al. 2412.06542 null
2024-12-02 Optimizing LoRa for Edge Computing with TinyML Pipeline for Channel Hopping Marla Grunewald et.al. 2412.01609 null
2024-12-01 Toward Real-Time Edge AI: Model-Agnostic Task-Oriented Communication with Visual Feature Alignment Songjie Xie et.al. 2412.00862 link
2024-11-28 Co-Learning: Towards Semi-Supervised Object Detection with Road-side Cameras Jicheng Yuan et.al. 2411.19143 null
2024-11-28 Towards an Implementation of the Knowledge-Based Control Plane for Intelligent Swarm Networks Xuanchi Guo et.al. 2411.19068 null
2024-11-24 Space-ground Fluid AI for 6G Edge Intelligence Qian Chen et.al. 2411.15845 null
2024-11-20 Federated Continual Learning for Edge-AI: A Comprehensive Survey Zi Wang et.al. 2411.13740 null
2024-11-16 Enhanced FIWARE-Based Architecture for Cyberphysical Systems With Tiny Machine Learning and Machine Learning Operations: A Case Study on Urban Mobility Systems Javier Conde et.al. 2411.13583 null
2024-11-19 Signformer is all you need: Towards Edge AI for Sign Language Eta Yang et.al. 2411.12901 link
2024-11-16 DEBUG-HD: Debugging TinyML models on-device using Hyper-Dimensional computing Nikhil P Ghanathe et.al. 2411.10692 null
2024-11-14 ABCI 3.0: Evolution of the leading AI infrastructure in Japan Ryousei Takano et.al. 2411.09134 null
2024-11-13 A Cost-effective, Stand-alone, and Real-time TinyML-Based Gait Diagnosis Unit Aimed at Lower-limb Robotic Prostheses and Exoskeletons Zarin Anjum Madhiha et.al. 2411.08474 null
2024-11-12 Towards Vision Mixture of Experts for Wildlife Monitoring on the Edge Emmanuel Azuh Mensah et.al. 2411.07834 null
2024-11-16 Enhancing Predictive Maintenance in Mining Mobile Machinery through a TinyML-enabled Hierarchical Inference Network Raúl de la Fuente et.al. 2411.07168 null
2024-11-11 A Primer on Word Embeddings: AI Techniques for Text Analysis in Social Work Brian E. Perron et.al. 2411.07156 null
2024-11-11 TinyML Security: Exploring Vulnerabilities in Resource-Constrained Machine Learning Systems Jacob Huckelberry et.al. 2411.07114 null
2024-11-10 Activation Map Compression through Tensor Decomposition for Deep Learning Le-Trung Nguyen et.al. 2411.06346 link
2024-11-09 TinyML NLP Approach for Semantic Wireless Sentiment Classification Ahmed Y. Radwan et.al. 2411.06291 null
2024-11-03 Energy-Aware FPGA Implementation of Spiking Neural Network with LIF Neurons Asmer Hamid Ali et.al. 2411.01628 null
2024-11-01 On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance Jaskirat Singh et.al. 2411.00907 null
2024-10-30 Profiling AI Models: Towards Efficient Computation Offloading in Heterogeneous Edge AI Systems Juan Marcelo Parra-Ullauri et.al. 2411.00859 null
2024-11-01 GPT for Games: An Updated Scoping Review (2020-2024) Daijin Yang et.al. 2411.00308 null
2024-10-31 Cough-E: A multimodal, privacy-preserving cough detection algorithm for the edge Stefano Albini et.al. 2410.24066 link
2024-10-28 FusedInf: Efficient Swapping of DNN Models for On-Demand Serverless Inference Services on the Edge Sifat Ut Taki et.al. 2410.21120 link
2024-10-28 Edge Perception: Intelligent Wireless Sensing at Network Edge Yuanhao Cui et.al. 2410.21017 null
2024-10-25 Neuromorphic IoT Architecture for Efficient Water Management: A Smart Village Case Study Mugdim Bublin et.al. 2410.19562 null
2024-10-17 SouLLMate: An Application Enhancing Diverse Mental Health Support with Adaptive LLMs, Prompt Engineering, and RAG Techniques Qiming Guo et.al. 2410.16322 null
2024-10-21 P-YOLOv8: Efficient and Accurate Real-Time Detection of Distracted Driving Mohamed R. Elshamy et.al. 2410.15602 null
2024-10-15 SHAKTI: A 2.5 Billion Parameter Small Language Model Optimized for Edge AI and Low-Resource Environments Syed Abdul Gaffar Shakhadri et.al. 2410.11331 null
2024-10-14 ABBA-VSM: Time Series Classification using Symbolic Representation on the Edge Meerzhan Kanatbekova et.al. 2410.10285 null
2024-10-12 Token Pruning using a Lightweight Background Aware Vision Transformer Sudhakar Sah et.al. 2410.09324 null
2024-10-11 MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices Mohamed Amine Hamdi et.al. 2410.08855 link
2024-10-11 Edge AI Collaborative Learning: Bayesian Approaches to Uncertainty Estimation Gleb Radchenko et.al. 2410.08651 null
2024-10-10 Neural Architecture Search of Hybrid Models for NPU-CIM Heterogeneous AR/VR Devices Yiwei Zhao et.al. 2410.08326 null
2024-10-10 L-VITeX: Light-weight Visual Intuition for Terrain Exploration Antar Mazumder et.al. 2410.07872 null
2024-10-10 Towards Robust IoT Defense: Comparative Statistics of Attack Detection in Resource-Constrained Scenarios Zainab Alwaisi et.al. 2410.07810 null
2024-10-10 vCLIC: Towards Fast Interrupt Handling in Virtualized RISC-V Mixed-criticality Systems Enrico Zelioli et.al. 2410.07798 null
2024-10-07 SoK: Towards Security and Safety of Edge AI Tatjana Wingarz et.al. 2410.05349 null
2024-10-10 SONAR: A Synthetic AI-Audio Detection Framework and Benchmark Xiang Li et.al. 2410.04324 link
2024-09-28 MicroFlow: An Efficient Rust-Based Inference Engine for TinyML Matteo Carnelos et.al. 2409.19432 link
2024-09-27 Analog fast Fourier transforms for scalable and efficient signal processing T. Patrick Xiao et.al. 2409.19071 null
2024-09-26 Development of an Edge Resilient ML Ensemble to Tolerate ICS Adversarial Attacks Likai Yao et.al. 2409.18244 null
2024-09-25 Susceptibility Formulation of Density Matrix Perturbation Theory Anders M. N. Niklasson et.al. 2409.17033 null
2024-09-25 Ethical and Scalable Automation: A Governance and Compliance Framework for Business Applications Haocheng Lin et.al. 2409.16872 null
2024-09-25 Accelerating TinyML Inference on Microcontrollers through Approximate Kernels Giorgos Armeniakos et.al. 2409.16815 link
2024-09-23 Benchmarking Edge AI Platforms for High-Performance ML Inference Rakshith Jayanth et.al. 2409.14803 null
2024-09-24 CamelEval: Advancing Culturally Aligned Arabic Language Models and Benchmarks Zhaozhi Qian et.al. 2409.12623 null
2024-09-17 AI Suggestions Homogenize Writing Toward Western Styles and Diminish Cultural Nuances Dhruv Agarwal et.al. 2409.11360 null
2024-09-17 Optimizing TinyML: The Impact of Reduced Data Acquisition Rates for Time Series Classification on Microcontrollers Riya Samanta et.al. 2409.10942 null
2024-09-13 Pushing the boundaries of event subsampling in event-based video classification using CNNs Hesam Araghi et.al. 2409.08953 link
2024-09-12 E-QUARTIC: Energy Efficient Edge Ensemble of Convolutional Neural Networks for Resource-Optimized Learning Le Zhang et.al. 2409.08369 null
2024-09-12 DiReDi: Distillation and Reverse Distillation for AIoT Applications Chen Sun et.al. 2409.08308 null
2024-09-11 A Continual and Incremental Learning Approach for TinyML On-device Training Using Dataset Distillation and Model Size Adaption Marcus Rüb et.al. 2409.07114 null
2024-09-08 Transformer with Leveraged Masked Autoencoder for video-based Pain Assessment Minh-Duc Nguyen et.al. 2409.05088 null
2024-09-02 Edge AI: Evaluation of Model Compression Techniques for Convolutional Neural Networks Samer Francy et.al. 2409.02134 null
2024-09-01 Research on LLM Acceleration Using the High-Performance RISC-V Processor "Xiangshan" (Nanhu Version) Based on the Open-Source Matrix Instruction Set Extension (Vector Dot Product) Xu-Hao Chen et.al. 2409.00661 null
2024-08-26 Towards Sustainable Personalized On-Device Human Activity Recognition with TinyML and Cloud-Enabled Auto Deployment Bidyut Saha et.al. 2409.00093 null
2024-08-29 TinyTNAS: GPU-Free, Time-Bound, Hardware-Aware Neural Architecture Search for TinyML Time Series Classification Bidyut Saha et.al. 2408.16535 link
2024-08-08 An Edge AI System Based on FPGA Platform for Railway Fault Detection Jiale Li et.al. 2408.15245 null
2024-08-23 S3Simulator: A benchmarking Side Scan Sonar Simulator dataset for Underwater Image Analysis Kamal Basha S et.al. 2408.12833 link
2024-08-20 Pluto and Charon: A Time and Memory Efficient Collaborative Edge AI Framework for Personal LLMs Fine-Tuning Bei Ouyang et.al. 2408.10746 null
2024-08-21 Challenges and Responses in the Practice of Large Language Models Hongyin Zhu et.al. 2408.09416 null
2024-08-15 Moving Healthcare AI-Support Systems for Visually Detectable Diseases onto Constrained Devices Tess Watt et.al. 2408.08215 null
2024-08-14 Efficient Edge AI: Deploying Convolutional Neural Networks on FPGA with the Gemmini Accelerator Federico Nicolas Peccia et.al. 2408.07404 null
2024-08-13 Harnessing Earnings Reports for Stock Predictions: A QLoRA-Enhanced LLM Approach Haowei Ni et.al. 2408.06634 null
2024-08-06 Training on the Fly: On-device Self-supervised Learning aboard Nano-drones within 20 mW Elia Cereda et.al. 2408.03168 null
2024-08-05 Toward Attention-based TinyML: A Heterogeneous Accelerated Architecture and Automated Deployment Flow Philip Wiese et.al. 2408.02473 null
2024-08-05 PENDRAM: Enabling High-Performance and Energy-Efficient Processing of Deep Neural Networks through a Generalized DRAM Data Mapping Policy Rachmad Vidya Wicaksana Putra et.al. 2408.02412 null
2024-08-02 A Tiny Supervised ODL Core with Auto Data Pruning for Human Activity Recognition Hiroki Matsutani et.al. 2408.01283 null
2024-07-29 HOAA: Hybrid Overestimating Approximate Adder for Enhanced Performance Processing Engine Omkar Kokane et.al. 2408.00806 link
2024-07-31 TinyChirp: Bird Song Recognition Using TinyML Models on Low-power Wireless Acoustic Sensors Zhaolan Huang et.al. 2407.21453 link
2024-07-31 SHA-CNN: Scalable Hierarchical Aware Convolutional Neural Network for Edge AI Narendra Singh Dhakad et.al. 2407.21370 null
2024-07-30 On-the-fly Communication-and-Computing to Enable Representation Learning for Distributed Point Clouds Xu Chen et.al. 2407.20710 null
2024-07-29 Model Agnostic Hybrid Sharding For Heterogeneous Distributed Inference Claudio Angione et.al. 2407.19775 null
2024-07-25 A Sensitivity Analysis of Cellular Automata and Heterogeneous Topology Networks: Partially-Local Cellular Automata and Homogeneous Homogeneous Random Boolean Networks Tom Eivind Glover et.al. 2407.18017 null
2024-07-22 StreamTinyNet: video streaming analysis with spatial-temporal TinyML Hazem Hesham Yousef Shalby et.al. 2407.17524 null
2024-07-22 KWT-Tiny: RISC-V Accelerated, Embedded Keyword Spotting Transformer Aness Al-Qawlaq et.al. 2407.16026 null
2024-07-18 Automated and Holistic Co-design of Neural Networks and ASICs for Enabling In-Pixel Intelligence Shubha R. Kharel et.al. 2407.14560 null
2024-07-18 Ultra-Low-Latency Edge Inference for Distributed Sensing Zhanwei Wang et.al. 2407.13360 null
2024-07-17 Computing: Looking Back and Moving Forward Muhammed Golec et.al. 2407.12558 null
2024-07-16 XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach Truong Thanh Hung Nguyen et.al. 2407.11771 link
2024-07-18 Enhancing TinyML Security: Study of Adversarial Attack Transferability Parin Shah et.al. 2407.11599 null
2024-07-13 Characterizing Disparity Between Edge Models and High-Accuracy Base Models for Vision Tasks Zhenyu Wang et.al. 2407.10016 null
2024-07-11 Towards Efficient Deployment of Hybrid SNNs on Neuromorphic and Edge AI Hardware James Seekings et.al. 2407.08704 null

(back to top)

Domain Specific Accelerator

Publish Date Title Authors PDF Code
2024-12-21 Leveraging Highly Approximated Multipliers in DNN Inference Georgios Zervakis et.al. 2412.16757 null
2024-12-13 Panacea: Novel DNN Accelerator using Accuracy-Preserving Asymmetric Quantization and Energy-Saving Bit-Slice Sparsity Dongyun Kam et.al. 2412.10059 null
2024-12-06 HiVeGen -- Hierarchical LLM-based Verilog Generation for Scalable Chip Design Jinwei Tang et.al. 2412.05393 null
2024-12-06 MC3: Memory Contention based Covert Channel Communication on Shared DRAM System-on-Chips Ismet Dagli et.al. 2412.05228 null
2024-11-28 PREBA: A Hardware/Software Co-Design for Multi-Instance GPU based AI Inference Servers Gwangoo Yeo et.al. 2411.19114 null
2024-12-06 FAMES: Fast Approximate Multiplier Substitution for Mixed-Precision Quantized DNNs--Down to 2 Bits! Yi Ren et.al. 2411.18055 null
2024-11-19 Travel Time Based Task Mapping for NoC-Based DNN Accelerator Yizhi Chen et.al. 2411.12710 null
2024-10-29 Systolic Array Data Flows for Efficient Matrix Multiplication in Deep Neural Networks Tejas Raja et.al. 2410.22595 null
2024-10-21 Adventures with Grace Hopper AI Super Chip and the National Research Platform J. Alex Hurt et.al. 2410.16487 null
2024-10-17 Shavette: Low Power Neural Network Acceleration via Algorithm-level Error Detection and Undervolting Mikael Rinkinen et.al. 2410.13415 null
2024-10-11 MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices Mohamed Amine Hamdi et.al. 2410.08855 link
2024-09-23 MESC: Re-thinking Algorithmic Priority and/or Criticality Inversions for Heterogeneous MCSs Jiapeng Guan et.al. 2409.14837 null
2024-10-14 LoopTree: Exploring the Fused-layer Dataflow Accelerator Design Space Michael Gilbert et.al. 2409.13625 link
2024-09-13 Automatic Generation of Fast and Accurate Performance Models for Deep Neural Network Accelerators Konstantin Lübeck et.al. 2409.08595 null
2024-09-08 BBS: Bi-directional Bit-level Sparsity for Deep Learning Acceleration Yuzong Chen et.al. 2409.05227 link
2024-09-08 HYDRA: Hybrid Data Multiplexing and Run-time Layer Configurable DNN Accelerator Sonu Kumar et.al. 2409.04976 null
2024-08-27 SiHGNN: Leveraging Properties of Semantic Graphs for Efficient HGNN Acceleration Runzhen Xue et.al. 2408.15089 null
2024-08-24 SiTe CiM: Signed Ternary Computing-in-Memory for Ultra-Low Precision Deep Neural Networks Niharika Thakuria et.al. 2408.13617 null
2024-08-13 Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture Yu Feng et.al. 2408.06608 null
2024-09-24 Scaling Deep Learning Computation over the Inter-Core Connected Intelligence Processor with T10 Yiqi Liu et.al. 2408.04808 null
2024-07-30 Optical Computing for Deep Neural Network Acceleration: Foundations, Recent Developments, and Emerging Directions Sudeep Pasricha et.al. 2407.21184 null
2024-07-29 Realizing Unaligned Block-wise Pruning for DNN Acceleration on Mobile Devices Hayun Lee et.al. 2407.19644 null
2024-07-24 The Magnificent Seven Challenges and Opportunities in Domain-Specific Accelerator Design for Autonomous Systems Sabrina M. Neuman et.al. 2407.17311 null
2024-07-17 StoX-Net: Stochastic Processing of Partial Sums for Efficient In-Memory Computing DNN Accelerators Ethan G Rogers et.al. 2407.12378 null
2024-07-11 NinjaLLM: Fast, Scalable and Cost-effective RAG using Amazon SageMaker and AWS Trainium and Inferentia2 Tengfei Xue et.al. 2407.12057 null
2024-07-22 ARCO:Adaptive Multi-Agent Reinforcement Learning-Based Hardware/Software Co-Optimization Compiler for Improved Performance in DNN Accelerator Design Arya Fayyazi et.al. 2407.08192 null
2024-06-20 SWANN: Shuffling Weights in Crossbar Arrays for Enhanced DNN Accuracy in Deeply Scaled Technologies Jeffry Victor et.al. 2406.14706 null
2024-06-14 CMDS: Cross-layer Dataflow Optimization for DNN Accelerators Exploiting Multi-bank Memories Man Shi et.al. 2406.14574 null
2024-06-15 Memory Faults in Activation-sparse Quantized Deep Neural Networks: Analysis and Mitigation using Sharpness-aware Training Akul Malhotra et.al. 2406.10528 null
2024-07-17 Cross-Modality Program Representation Learning for Electronic Design Automation with High-Level Synthesis Zongyue Qin et.al. 2406.09606 null
2024-06-05 HASS: Hardware-Aware Sparsity Search for Dataflow DNN Accelerator Zhewen Yu et.al. 2406.03088 link
2024-06-03 A 0.96pJ/SOP, 30.23K-neuron/mm^2 Heterogeneous Neuromorphic Chip With Fullerene-like Interconnection Topology for Edge-AI Computing P. J. Zhou et.al. 2406.01151 null

(back to top)

Low-Rank Adaptation

Publish Date Title Authors PDF Code
2024-12-30 Adversarial Attack and Defense for LoRa Device Identification and Authentication via Deep Learning Yalin E. Sagduyu et.al. 2412.21164 null
2024-12-30 Efficient Multi-Task Inferencing with a Shared Backbone and Lightweight Task-Specific Adapters for Automatic Scoring Ehsan Latif et.al. 2412.21065 null
2024-12-30 DoTA: Weight-Decomposed Tensor Adaptation for Large Language Models Xiaolin Hu et.al. 2412.20891 null
2024-12-30 Dual-Space Augmented Intrinsic-LoRA for Wind Turbine Segmentation Shubh Singhal et.al. 2412.20838 null
2024-12-30 VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control Shaojin Wu et.al. 2412.20800 link
2025-01-02 EraseAnything: Enabling Concept Erasure in Rectified Flow Transformers Daiheng Gao et.al. 2412.20413 null
2024-12-28 Multi-Modality Driven LoRA for Adverse Condition Depth Estimation Guanglei Yang et.al. 2412.20162 null
2024-12-28 VELoRA: A Low-Rank Adaptation Approach for Efficient RGB-Event based Recognition Lan Chen et.al. 2412.20064 link
2024-12-28 Adaptive Parameter-Efficient Federated Fine-Tuning on Heterogeneous Devices Jun Liu et.al. 2412.20004 null
2024-12-27 Gradient Weight-normalized Low-rank Projection for Efficient LLM Training Jia-Hong Huang et.al. 2412.19616 link
2024-12-27 Performance Evaluation of IoT LoRa Networks on Mars Through ns-3 Simulations Manuele Favero et.al. 2412.19549 link
2024-12-27 KALAHash: Knowledge-Anchored Low-Resource Adaptation for Deep Hashing Shu Zhao et.al. 2412.19417 null
2024-12-25 Optimizing Large Language Models with an Enhanced LoRA Fine-Tuning Algorithm for Efficiency and Robustness in NLP Tasks Jiacheng Hu et.al. 2412.18729 null
2024-12-24 Research on the Proximity Relationships of Psychosomatic Disease Knowledge Graph Modules Extracted by Large Language Models Zihan Zhou et.al. 2412.18419 null
2024-12-18 Enhancing Knowledge Distillation for LLMs with Response-Priming Prompting Vijay Goyal et.al. 2412.17846 link
2024-12-25 DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder Ente Lin et.al. 2412.17644 null
2024-12-23 Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing Prakash Aryan et.al. 2412.17548 link
2024-12-21 Label Privacy in Split Learning for Large Models with Parameter-Efficient Training Philip Zmushko et.al. 2412.16669 link
2024-12-20 Adaptable and Precise: Enterprise-Scenario LLM Function-Calling Capability Training Pipeline Guancheng Zeng et.al. 2412.15660 null
2024-12-23 CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training Xiuli Bi et.al. 2412.15646 link
2024-12-20 AutoRank: MCDA Based Rank Personalization for LoRA-Enabled Distributed Learning Shuaijun Chen et.al. 2412.15553 null
2024-12-19 Knowledge Injection via Prompt Distillation Kalle Kujanpää et.al. 2412.14964 null
2024-12-20 All-in-One Tuning and Structural Pruning for Domain-Specific LLMs Lei Lu et.al. 2412.14426 null
2024-12-18 CoRa: A Collision-Resistant LoRa Symbol Detector of Low Complexity José Álamos et.al. 2412.13930 null
2024-12-18 A Comprehensive Evaluation of Parameter-Efficient Fine-Tuning on Method-Level Code Smell Detection Beiqi Zhang et.al. 2412.13801 link
2024-12-18 Large Language Model Federated Learning with Blockchain and Unlearning for Cross-Organizational Collaboration Xuhan Zuo et.al. 2412.13551 null
2024-12-18 Refining Salience-Aware Sparse Fine-Tuning Strategies for Language Models Xinxin Liu et.al. 2412.13488 null
2024-12-18 Transducer Tuning: Efficient Model Adaptation for Software Tasks Using Code Property Graphs Imam Nur Bani Yusuf et.al. 2412.13467 link
2024-12-17 Expansion Span: Combining Fading Memory and Retrieval in Hybrid State Space Models Elvis Nunez et.al. 2412.13328 null
2024-12-17 FineGates: LLMs Finetuning with Compression using Stochastic Gates Jonathan Svirsky et.al. 2412.12951 null
2024-12-17 Enhancing Naturalness in LLM-Generated Utterances through Disfluency Insertion Syed Zohaib Hassan et.al. 2412.12710 null
2024-12-17 Train More Parameters But Mind Their Placement: Insights into Language Adaptation with PEFT Jenny Kunz et.al. 2412.12674 link
2024-12-17 NLSR: Neuron-Level Safety Realignment of Large Language Models Against Harmful Fine-Tuning Xin Yi et.al. 2412.12497 link
2024-12-16 Visual Instruction Tuning with 500x Fewer Parameters through Modality Linear Representation-Steering Jinhe Bi et.al. 2412.12359 link
2024-12-16 Can video generation replace cinematographers? Research on the cinematic language of generated video Xiaozhe Li et.al. 2412.12223 null
2024-12-16 A LoRA is Worth a Thousand Pictures Chenxi Liu et.al. 2412.12048 null
2024-12-16 The Open Source Advantage in Large Language Models (LLMs) Jiya Manchanda et.al. 2412.12004 null
2024-12-17 No More Adam: Learning Rate Scaling at Initialization is All You Need Minghao Xu et.al. 2412.11768 link
2024-12-16 IDEA-Bench: How Far are Generative Models from Professional Designing? Chen Liang et.al. 2412.11767 link
2024-12-16 Adapting Segment Anything Model (SAM) to Experimental Datasets via Fine-Tuning on GAN-based Simulation: A Case Study in Additive Manufacturing Anika Tabassum et.al. 2412.11381 link
2024-12-16 FinLoRA: Finetuning Quantized Financial Large Language Models Using Low-Rank Adaptation Dannong Wang et.al. 2412.11378 null
2024-12-15 Separate the Wheat from the Chaff: A Post-Hoc Approach to Safety Re-Alignment for Fine-Tuned Language Models Di Wu et.al. 2412.11041 null
2024-12-15 SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation Hang Zhang et.al. 2412.11026 null
2024-12-14 Efficient Adaptation of Multilingual Models for Japanese ASR Mark Bajo et.al. 2412.10705 link
2024-12-13 SafetyDPO: Scalable Safety Alignment for Text-to-Image Generation Runtao Liu et.al. 2412.10493 null
2024-12-13 OP-LoRA: The Blessing of Dimensionality Piotr Teterwak et.al. 2412.10362 null
2024-12-16 ASLoRA: Adaptive Sharing Low-Rank Adaptation Across Layers Junyan Hu et.al. 2412.10135 null
2024-12-13 CaLoRAify: Calorie Estimation with Visual-Text Pairing and LoRA-Driven Visual Language Models Dongyu Yao et.al. 2412.09936 link
2024-12-13 Low-Rank Adaptation with Task-Relevant Feature Enhancement for Fine-tuning Language Models Changqun Li et.al. 2412.09827 null
2024-12-12 LoRACLR: Contrastive Adaptation for Customization of Diffusion Models Enis Simsar et.al. 2412.09622 null
2024-12-12 EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM Zhuofan Zong et.al. 2412.09618 null
2024-12-12 Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition Zhisheng Zhong et.al. 2412.09501 link
2024-12-15 GeLoRA: Geometric Adaptive Ranks For Efficient LoRA Fine-tuning Abdessalam Ed-dib et.al. 2412.09250 null
2024-12-12 RAD: Region-Aware Diffusion Models for Image Inpainting Sora Kim et.al. 2412.09191 null
2024-12-12 DECOR:Decomposition and Projection of Text Embeddings for Text-to-Image Customization Geonhui Jang et.al. 2412.09169 null
2024-12-12 MoSLD: An Extremely Parameter-Efficient Mixture-of-Shared LoRAs for Multi-Task Learning Lulu Zhao et.al. 2412.08946 null
2024-12-11 DMin: Scalable Training Data Influence Estimation for Diffusion Models Huawei Lin et.al. 2412.08637 link
2024-12-10 Accretion onto WD 2226 $-$ 210, the central star of the Helix Nebula S. Estrada-Dorado et.al. 2412.07863 null
2024-12-10 PETALface: Parameter Efficient Transfer Learning for Low-resolution Face Recognition Kartik Narayan et.al. 2412.07771 null
2024-12-10 LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models Ziqi Lu et.al. 2412.07746 null
2024-12-10 ChocoLlama: Lessons Learned From Teaching Llamas Dutch Matthieu Meeus et.al. 2412.07633 null
2024-12-10 MoDULA: Mixture of Domain-Specific and Universal LoRA for Multi-Task Learning Yufei Ma et.al. 2412.07405 null
2024-12-10 Attention Head Purification: A New Perspective to Harness CLIP for Domain Generalization Yingfan Wang et.al. 2412.07226 null
2024-12-09 Optimal Routing and Link Configuration for Covert Heterogeneous Wireless Networks Amna Gillani et.al. 2412.07059 null
2024-12-09 Sequential Compression Layers for Efficient Federated Learning in Foundational Models Navyansh Mahla et.al. 2412.07021 null
2024-12-09 BoRA: Bi-dimensional Weight-Decomposed Low-Rank Adaptation Qiushi Wang et.al. 2412.06441 null
2024-12-10 S $^{2}$ FT: Efficient, Scalable and Generalizable LLM Fine-tuning by Structured Sparsity Xinyu Yang et.al. 2412.06289 null
2024-12-08 Enhanced Computationally Efficient Long LoRA Inspired Perceiver Architectures for Auto-Regressive Language Modeling Kaleel Mahmood et.al. 2412.06106 null
2024-12-08 KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models Fan Wang et.al. 2412.06071 link
2024-12-07 Training-Free Bayesianization for Low-Rank Adapters of Large Language Models Haizhou Shi et.al. 2412.05723 link
2024-12-07 Plasmonic Electro-Optic Modulators based on Epsilon-Near-Zero Materials: Comparing the Classical Drift-Diffusion and Schrödinger-Poisson Coupling Models Masoud Shabaninezhad et.al. 2412.05690 null
2024-12-06 QueEn: A Large Language Model for Quechua-English Translation Junhao Chen et.al. 2412.05184 null
2024-12-06 LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation Donald Shenaj et.al. 2412.05148 null
2024-12-05 Performance Evaluation of LoRa Technology for Rural Connectivity: An Experimental Analysis in Nepal Atit Pokharel et.al. 2412.04563 null
2024-12-04 Prompting Large Language Models for Clinical Temporal Relation Extraction Jianping He et.al. 2412.04512 null
2024-12-05 UnZipLoRA: Separating Content and Style from a Single Image Chang Liu et.al. 2412.04465 null
2024-12-08 Discriminative Fine-tuning of LVLMs Yassine Ouali et.al. 2412.04378 null
2024-12-05 Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts Chenyang Zhu et.al. 2412.04220 null
2024-12-05 SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning Seokju Yun et.al. 2412.04077 link
2024-12-04 Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis Davide Bucciarelli et.al. 2412.03665 null
2024-12-04 Imagine360: Immersive 360 Video Generation from Perspective Anchor Jing Tan et.al. 2412.03552 null
2024-12-04 DIVE: Taming DINO for Subject-Driven Video Editing Yi Huang et.al. 2412.03347 null
2024-12-04 Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach Lingchen Sun et.al. 2412.03017 link
2024-12-03 EvRT-DETR: The Surprising Effectiveness of DETR-based Detection for Event Cameras Dmitrii Torbunov et.al. 2412.02890 null
2024-12-03 Explainable CTR Prediction via LLM Reasoning Xiaohan Yu et.al. 2412.02588 null
2024-12-03 LoRA Diffusion: Zero-Shot LoRA Synthesis for Diffusion Model Personalization Ethan Smith et.al. 2412.02352 null
2024-12-03 SimuScope: Realistic Endoscopic Synthetic Dataset Generation through Surgical Simulation and Diffusion Models Sabina Martyniak et.al. 2412.02332 link
2024-12-03 Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs Zixuan Hu et.al. 2412.02220 null
2024-12-02 Optimizing LoRa for Edge Computing with TinyML Pipeline for Channel Hopping Marla Grunewald et.al. 2412.01609 null
2024-12-02 CellSeg1: Robust Cell Segmentation with One Training Image Peilin Zhou et.al. 2412.01410 link
2024-12-02 Efficient LLM Inference using Dynamic Input Pruning and Cache-Aware Masking Marco Federici et.al. 2412.01380 null
2024-12-02 MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost Sen Xing et.al. 2412.01271 null
2024-12-02 RILQ: Rank-Insensitive LoRA-based Quantization Error Compensation for Boosting 2-bit Large Language Model Accuracy Geonho Lee et.al. 2412.01129 null
2024-12-03 Adaptive Rank, Reduced Forgetting: Knowledge Retention in Continual Learning Vision-Language Models with Dynamic Rank-Selective LoRA Haodong Lu et.al. 2412.01004 null
2024-11-29 SURE-VQA: Systematic Understanding of Robustness Evaluation in Medical VQA Tasks Kim-Celine Kahl et.al. 2411.19688 link
2024-11-29 Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning Kaustubh Ponkshe et.al. 2411.19557 link
2024-11-28 PEFT-as-an-Attack! Jailbreaking Language Models during Federated Parameter-Efficient Fine-Tuning Shenghui Li et.al. 2411.19335 null
2024-11-28 Enhancing Parameter-Efficient Fine-Tuning of Vision Transformers through Frequency-Based Adaptation Son Thai Ly et.al. 2411.19297 link
2024-11-28 LoRA of Change: Learning to Generate LoRA for the Editing Instruction from A Single Before-After Image Pair Xue Song et.al. 2411.19156 null
2024-11-28 DESIRE: Dynamic Knowledge Consolidation for Rehearsal-Free Continual Learning Haiyang Guo et.al. 2411.19154 null
2024-11-28 Personalized Federated Fine-Tuning for LLMs via Data-Driven Heterogeneous Model Architectures Yicheng Zhang et.al. 2411.19128 link
2024-11-27 Challenges in Adapting Multilingual LLMs to Low-Resource Languages using LoRA PEFT Tuning Omkar Khade et.al. 2411.18571 null
2024-11-27 Emergence of Self-Identity in AI: A Mathematical Framework and Empirical Study with Generative Large Language Models Minhyeok Lee et.al. 2411.18530 link
2024-11-27 Adaptive Blind All-in-One Image Restoration David Serrano-Lozano et.al. 2411.18412 link
2024-11-27 Thai Financial Domain Adaptation of THaLLE -- Technical Report KBTG Labs et.al. 2411.18242 null
2024-11-27 ROICtrl: Boosting Instance Control for Visual Generation Yuchao Gu et.al. 2411.17949 null
2024-11-26 Pretrained LLM Adapted with LoRA as a Decision Transformer for Offline RL in Quantitative Trading Suyeol Yun et.al. 2411.17900 link
2024-11-26 Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation Sudarshan Rajagopalan et.al. 2411.17814 null
2024-11-26 PEFTGuard: Detecting Backdoor Attacks Against Parameter-Efficient Fine-Tuning Zhen Sun et.al. 2411.17453 null
2024-11-26 CLOVER: Constrained Learning with Orthonormal Vectors for Eliminating Redundancy Fanxu Meng et.al. 2411.17426 null
2024-11-26 Efficient Deployment of Transformer Models in Analog In-Memory Computing Hardware Chen Li et.al. 2411.17367 link
2024-11-26 ThreatModeling-LLM: Automating Threat Modeling using Large Language Models for Banking System Shuiqiao Yang et.al. 2411.17058 null
2024-11-26 PersonalVideo: High ID-Fidelity Video Customization without Dynamic and Semantic Degradation Hengjia Li et.al. 2411.17048 null
2024-11-25 RECAST: Reparameterized, Compact weight Adaptation for Sequential Tasks Nazia Tasnim et.al. 2411.16870 null
2024-11-25 Parameter Efficient Instruction Tuning: An Empirical Study Pengfei He et.al. 2411.16775 null
2024-11-23 LoBAM: LoRA-Based Backdoor Attack on Model Merging Ming Yin et.al. 2411.16746 null
2024-11-24 Modality Alignment Meets Federated Broadcasting Yuting Ma et.al. 2411.15837 null
2024-11-24 LoRA-Mini : Adaptation Matrices Decomposition and Selective Training Ayush Singh et.al. 2411.15804 null
2024-11-23 Reassessing Layer Pruning in LLMs: New Insights and Methods Yao Lu et.al. 2411.15558 link
2024-11-23 Gradient dynamics for low-rank fine-tuning beyond kernels Arif Kerem Dayi et.al. 2411.15385 null
2024-11-22 On the Impact of Fine-Tuning on Chain-of-Thought Reasoning Elita Lobo et.al. 2411.15382 null
2024-11-22 ElastiFormer: Learned Redundancy Reduction in Transformer via Self-Distillation Junzhang Liu et.al. 2411.15281 null
2024-11-21 IterIS: Iterative Inference-Solving Alignment for LoRA Merging Hongxu Chen et.al. 2411.15231 null
2024-11-22 Exploring Foundation Models Fine-Tuning for Cytology Classification Manon Dausort et.al. 2411.14975 link
2024-11-22 LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement Jieming Bian et.al. 2411.14961 null
2024-11-21 Interpreting seasonal and interannual Hadley cell descending edge migrations via the cell-mean Rossby number Spencer A Hill et.al. 2411.14544 null
2024-11-21 Multi LoRA Meets Vision: Merging multiple adapters to create a multi task model Ege Kesim et.al. 2411.14064 null
2024-11-21 Separable Mixture of Low-Rank Adaptation for Continual Visual Instruction Tuning Ziqi Wang et.al. 2411.13949 null
2024-11-21 Dressing the Imagination: A Dataset for AI-Powered Translation of Text into Fashion Outfits and A Novel KAN Adapter for Enhanced Feature Adaptation Gayatri Deshmukh et.al. 2411.13901 null
2024-11-21 AutoMixQ: Self-Adjusting Quantization for High Performance Memory-Efficient Fine-Tuning Changhai Zhou et.al. 2411.13814 null
2024-11-20 Unleashing the Power of Large Language Models for Group POI Recommendations Jing Long et.al. 2411.13415 null
2024-11-20 On the Way to LLM Personalization: Learning to Remember User Conversations Lucie Charlotte Magister et.al. 2411.13405 null
2024-11-19 Visual Cue Enhancement and Dual Low-Rank Adaptation for Efficient Visual Instruction Fine-Tuning Pengkun Jiao et.al. 2411.12787 null
2024-11-16 LoRA Unlearns More and Retains More (Student Abstract) Atharv Mittal et.al. 2411.11907 link
2024-11-18 SeqProFT: Applying LoRA Finetuning for Sequence-only Protein Property Predictions Shuo Zhang et.al. 2411.11530 null
2024-11-16 Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts Jinqiang Long et.al. 2411.10669 link
2024-11-15 AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment Yonggan Fu et.al. 2411.10606 link
2024-11-15 Towards Multi-View Consistent Style Transfer with One-Step Diffusion via Vision Conditioning Yushen Zuo et.al. 2411.10130 null
2024-11-15 LoRA-LiteE: A Computationally Efficient Framework for Chatbot Preference-Tuning Yahe Yang et.al. 2411.09947 null
2024-11-12 Structured Pattern Expansion with Diffusion Models Marzia Riso et.al. 2411.08930 null
2024-11-13 Dynamic Subset Tuning: Expanding the Operational Range of Parameter-Efficient Training for Large Language Models Felix Stahlberg et.al. 2411.08610 null
2024-11-13 Machine Unlearning on Pre-trained Models by Residual Feature Alignment Using LoRA Laiqiao Qin et.al. 2411.08443 null
2024-11-11 LoRA-BERT: a Natural Language Processing Model for Robust and Accurate Prediction of long non-coding RNAs Nicholas Jeon et.al. 2411.08073 null
2024-11-12 FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable Training Philip Zmushko et.al. 2411.07837 link
2024-11-12 Efficient Federated Finetuning of Tiny Transformers with Resource-Constrained Devices Kilian Pfeiffer et.al. 2411.07826 null
2024-11-12 Federated Low-Rank Adaptation with Differential Privacy over Wireless Networks Tianqu Kang et.al. 2411.07806 null
2024-11-12 ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization Weibo Zhao et.al. 2411.07762 null
2024-11-11 DeepONet as a Multi-Operator Extrapolation Model: Distributed Pretraining with Physics-Informed Fine-Tuning Zecheng Zhang et.al. 2411.07239 null
2024-11-11 Invar-RAG: Invariant LLM-aligned Retrieval for Better Generation Ziwei Liu et.al. 2411.07021 null
2024-11-11 MapSAM: Adapting Segment Anything Model for Automated Feature Detection in Historical Maps Xue Xia et.al. 2411.06971 null
2024-11-11 LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models Runming Yang et.al. 2411.06839 null
2024-11-10 Federated LLMs Fine-tuned with Adaptive Importance-Aware LoRA Yang Su et.al. 2411.06581 null
2024-11-10 Prompt-Efficient Fine-Tuning for GPT-like Deep Models to Reduce Hallucination and to Improve Reproducibility in Scientific Text Generation Using Stochastic Optimisation Techniques Daniil Sulimov et.al. 2411.06445 null
2024-11-08 Energy Efficient Protein Language Models: Leveraging Small Language Models with LoRA for Controllable Protein Generation Aayush Shah et.al. 2411.05966 null
2024-11-08 Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation Xiwen Wei et.al. 2411.05663 link
2024-11-08 SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models Muyang Li et.al. 2411.05007 link
2024-11-07 DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion Wenqiang Sun et.al. 2411.04928 null
2024-11-07 StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration Panwen Hu et.al. 2411.04925 null
2024-11-07 LLM-R: A Framework for Domain-Adaptive Maintenance Scheme Generation Combining Hierarchical Agents and RAG Laifa Tao et.al. 2411.04476 null
2024-11-09 Variational Low-Rank Adaptation Using IVON Bai Cong et.al. 2411.04421 link
2024-11-08 Robust and Efficient Fine-tuning of LLMs with Bayesian Reparameterization of Low-Rank Adaptation Ayan Sengupta et.al. 2411.04358 link
2024-11-06 PyroGuardian: An IoT-Enabled System for Health and Location Monitoring in High-Risk Firefighting Environments Berkay Kaplan et.al. 2411.03654 null
2024-11-05 LLM-based Framework for Bearing Fault Diagnosis Laifa Tao et.al. 2411.02718 null
2024-11-04 TeleOracle: Fine-Tuned Retrieval-Augmented Generation with Long-Context Support for Network Nouf Alabbasi et.al. 2411.02617 link
2024-11-04 Parameter-Efficient Fine-Tuning of Large Language Models for Unit Test Generation: An Empirical Study André Storhaug et.al. 2411.02462 null
2024-11-04 Expanding Sparse Tuning for Low Memory Usage Shufan Shen et.al. 2411.01800 link
2024-11-02 PMoL: Parameter Efficient MoE for Preference Mixing of LLM Alignment Dongxu Liu et.al. 2411.01245 null
2024-11-02 One Arrow, Many Targets: Probing LLMs for Multi-Attribute Controllable Text Summarization Tathagato Roy et.al. 2411.01213 null
2024-11-02 Hollowed Net for On-Device Personalization of Text-to-Image Diffusion Models Wonguk Cho et.al. 2411.01179 null
2024-11-02 LoRA-Contextualizing Adaptation of Large Multimodal Models for Long Document Understanding Jian Chen et.al. 2411.01106 null
2024-11-01 V-LoRA: An Efficient and Flexible System Boosts Vision Applications with LoRA LMM Liang Mi et.al. 2411.00915 null
2024-11-01 Dual Low-Rank Adaptation for Continual Learning with Pre-Trained Models Huancheng Chen et.al. 2411.00623 null
2024-10-31 DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion Weicai Ye et.al. 2410.24203 link
2024-11-05 In-Context LoRA for Diffusion Transformers Lianghua Huang et.al. 2410.23775 link
2024-10-30 Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise Matrix Estimation Stefan Stojanovic et.al. 2410.23434 null
2024-10-31 SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation Yining Hong et.al. 2410.23277 null
2024-10-31 Why Gradient Subspace? Identifying and Mitigating LoRA's Bottlenecks in Federated Fine-Tuning of Large Language Models Navyansh Mahla et.al. 2410.23111 null
2024-10-30 Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation Wei Dong et.al. 2410.22952 null
2024-10-30 CopRA: A Progressive LoRA Training Strategy Zhan Zhuang et.al. 2410.22911 null
2024-10-30 Towards Robust and Efficient Federated Low-Rank Adaptation with Heterogeneous Clients Jabin Koo et.al. 2410.22815 null
2024-10-30 MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning Xujia Wang et.al. 2410.22782 null
2024-10-29 Meta-Learning Adaptable Foundation Models Jacob L. Block et.al. 2410.22264 null
2024-10-30 IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models Hang Guo et.al. 2410.21759 link
2024-10-28 LoRA vs Full Fine-tuning: An Illusion of Equivalence Reece Shuttleworth et.al. 2410.21228 null
2024-10-28 Skip2-LoRA: A Lightweight On-device DNN Fine-tuning Method for Low-cost Edge Devices Hiroki Matsutani et.al. 2410.21073 null
2024-10-28 KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge Distillation Rambod Azimi et.al. 2410.20777 link
2024-10-28 Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA Sangmin Bae et.al. 2410.20672 null
2024-10-28 PepDoRA: A Unified Peptide Language Model via Weight-Decomposed Low-Rank Adaptation Leyao Wang et.al. 2410.20667 null
2024-10-28 Collaborative Knowledge Fusion: A Novel Approach for Multi-task Recommender Systems via LLMs Chuang Zhao et.al. 2410.20642 null
2024-10-27 LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA Optimization Jui-Nan Yen et.al. 2410.20625 null
2024-10-27 FoldMark: Protecting Protein Generative Models with Watermarking Zaixi Zhang et.al. 2410.20354 link
2024-10-26 An Efficient Watermarking Method for Latent Diffusion Models via Low-Rank Adaptation Dongdong Lin et.al. 2410.20202 null
2024-10-25 Model merging with SVD to tie the Knots George Stoica et.al. 2410.19735 link
2024-10-25 Less is More: Extreme Gradient Boost Rank-1 Adaption for Efficient Finetuning of LLMs Yifei Zhang et.al. 2410.19694 null
2024-10-25 GeoLLaVA: Efficient Fine-Tuned Vision-Language Models for Temporal Change Detection in Remote Sensing Hosam Elgendy et.al. 2410.19552 link
2024-10-24 Tailored-LLaMA: Optimizing Few-Shot Learning in Pruned LLaMA Models with Task-Specific Prompts Danyal Aftab et.al. 2410.19185 null
2024-10-24 On the Crucial Role of Initialization for Matrix Factorization Bingcong Li et.al. 2410.18965 null
2024-10-24 PSY: Posterior Sampling Based Privacy Enhancer in Large Language Models Yulian Sun et.al. 2410.18824 null
2024-10-24 GeoLoRA: Geometric integration for parameter efficient fine-tuning Steffen Schotthöfer et.al. 2410.18720 null
2024-10-24 Ali-AUG: Innovative Approaches to Labeled Data Augmentation using One-Step Diffusion Model Ali Hamza et.al. 2410.18678 null
2024-10-23 CLEAR: Character Unlearning in Textual and Visual Modalities Alexey Dontsov et.al. 2410.18057 null
2024-10-23 MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning Jingfan Zhang et.al. 2410.18035 null
2024-10-23 Closed-form merging of parameter-efficient modules for Federated Continual Learning Riccardo Salami et.al. 2410.17961 null
2024-10-23 AdaRankGrad: Adaptive Gradient-Rank and Moments for Memory-Efficient LLMs Training and Fine-Tuning Yehonathan Refael et.al. 2410.17881 null
2024-10-23 Understanding Layer Significance in LLM Alignment Guangyuan Shi et.al. 2410.17875 null
2024-10-23 VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning Yifan Peng et.al. 2410.17485 null
2024-10-22 FairLoRA: Unpacking Bias Mitigation in Vision Models with Fairness-Driven Low-Rank Adaptation Rohan Sukumaran et.al. 2410.17358 null
2024-10-22 Insights on Disagreement Patterns in Multimodal Safety Perception across Diverse Rater Groups Charvi Rastogi et.al. 2410.17032 null
2024-10-23 GeoCode-GPT: A Large Language Model for Geospatial Code Generation Tasks Shuyang Hou et.al. 2410.17031 null
2024-10-22 LoRA-C: Parameter-Efficient Fine-Tuning of Robust CNN for IoT Devices Chuntao Ding et.al. 2410.16954 link
2024-10-22 Can Large Language Models Act as Ensembler for Multi-GNNs? Hanqi Duan et.al. 2410.16822 null
2024-10-22 Controlled Low-Rank Adaptation with Subspace Regularization for Continued Training on Large Language Models Yuheng Lu et.al. 2410.16801 null
2024-10-22 MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report Samrajya Thapa et.al. 2410.16239 link
2024-10-21 Beyond 2:4: exploring V:N:M sparsity for efficient transformer inference on GPUs Kang Zhao et.al. 2410.16135 null
2024-10-21 Natural GaLore: Accelerating GaLore for memory-efficient LLM Training and Fine-tuning Arijit Das et.al. 2410.16029 link
2024-10-21 How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making? Zuojin Tang et.al. 2410.15885 null
2024-10-21 The effect of fine-tuning on language model toxicity Will Hawkins et.al. 2410.15821 link
2024-10-21 Habaek: High-performance water segmentation through dataset expansion and inductive bias optimization Hanseon Joo et.al. 2410.15794 link
2024-10-21 Students Rather Than Experts: A New AI For Education Pipeline To Model More Human-Like And Personalised Early Adolescences Yiping Ma et.al. 2410.15701 null
2024-10-20 MIRA: A Method of Federated MultI-Task Learning for LaRge LAnguage Models Ahmed Elbakary et.al. 2410.15524 null
2024-10-20 EVA: An Embodied World Model for Future Video Anticipation Xiaowei Chi et.al. 2410.15461 null
2024-10-20 LoRA-IR: Taming Low-Rank Experts for Efficient All-in-One Image Restoration Yuang Ai et.al. 2410.15385 link
2024-10-18 Fine-Tuning DeepONets to Enhance Physics-informed Neural Networks for solving Partial Differential Equations Sidi Wu et.al. 2410.14134 null
2024-10-17 FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model ZiDong Wang et.al. 2410.13925 link
2024-10-17 Improving Multi-modal Large Language Model through Boosting Vision Capabilities Yanpeng Sun et.al. 2410.13733 null
2024-10-17 LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning Yiming Shi et.al. 2410.13618 link
2024-10-18 MoR: Mixture of Ranks for Low-Rank Adaptation Tuning Chuanyu Tang et.al. 2410.13408 null
2024-10-17 FAMSeC: A Few-shot-sample-based General AI-generated Image Detection Method Juncong Xu et.al. 2410.13156 null
2024-10-16 LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks Akshara Prabhakar et.al. 2410.13025 link
2024-10-16 DEeR: Deviation Eliminating and Noise Regulating for Privacy-preserving Federated Low-rank Adaptation Meilu Zhu et.al. 2410.12926 link
2024-10-15 In-context KV-Cache Eviction for LLMs via Attention-Gate Zihao Zeng et.al. 2410.12876 null
2024-10-16 FiRST: Finetuning Router-Selective Transformers for Input-Adaptive Latency Reduction Akriti Jain et.al. 2410.12513 null
2024-10-15 LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models Hossein Abdi et.al. 2410.11551 null
2024-10-15 Transfer Learning with Foundational Models for Time Series Forecasting using Low-Rank Adaptations M. Germán-Morales et.al. 2410.11539 null
2024-10-15 Energy Efficient Transmission Parameters Selection Method Using Reinforcement Learning in Distributed LoRa Networks Ryotai Airiyoshi et.al. 2410.11270 null
2024-10-14 Improving the Language Understanding Capabilities of Large Language Models Using Reinforcement Learning Bokai Hu et.al. 2410.11020 null
2024-10-14 LoLCATs: On Low-Rank Linearizing of Large Language Models Michael Zhang et.al. 2410.10254 link
2024-10-14 Fed-piLot: Optimizing LoRA Assignment for Efficient Federated Foundation Model Fine-Tuning Zikai Zhang et.al. 2410.10200 null
2024-10-14 Scalable Multi-Domain Adaptation of Language Models using Modular Experts Peter Schafhalter et.al. 2410.10181 null
2024-10-14 Is Parameter Collision Hindering Continual Learning in LLMs? Shuo Yang et.al. 2410.10179 link
2024-10-14 AlphaLoRA: Assigning LoRA Experts Based on Layer Training Quality Peijun Qing et.al. 2410.10054 link
2024-10-13 Retrieval Instead of Fine-tuning: A Retrieval-based Parameter Ensemble for Zero-shot Learning Pengfei Jin et.al. 2410.09908 null
2024-10-13 A Quantum Circuit-Based Compression Perspective for Parameter-Efficient Learning Chen-Yu Liu et.al. 2410.09846 null
2024-10-13 Understanding Robustness of Parameter-Efficient Tuning for Image Classification Jiacheng Ruan et.al. 2410.09845 link
2024-10-13 BiDoRA: Bi-level Optimization-Based Weight-Decomposed Low-Rank Adaptation Peijia Qin et.al. 2410.09758 null
2024-10-13 AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model Yuchen Li et.al. 2410.09714 null
2024-10-11 Parameter-Efficient Fine-Tuning of State Space Models Kevin Galim et.al. 2410.09016 link
2024-10-10 Randomized Asymmetric Chain of LoRA: The First Meaningful Theoretical Framework for Low-Rank Adaptation Grigory Malinovsky et.al. 2410.08305 null
2024-10-10 SLIM: Let LLM Learn More and Forget Less with Soft LoRA and Identity Mixture Jiayi Han et.al. 2410.07739 null
2024-10-10 MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion Onkar Susladkar et.al. 2410.07659 null
2024-10-09 SparseGrad: A Selective Method for Efficient Fine-tuning of MLP Layers Viktoriia Chekalina et.al. 2410.07383 link
2024-10-09 One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation Fabian Paischer et.al. 2410.07170 link
2024-10-09 Industrial complexity and the evolution of formal employment in developing cities Neave O'Clery et.al. 2410.06971 null
2024-10-11 Enhancing Multimodal LLM for Detailed and Accurate Video Captioning using Multi-Round Preference Optimization Changli Tang et.al. 2410.06682 null
2024-10-08 Systematic 2.5 D resistive MHD simulations with ambipolar diffusion and Hall effect for fast magnetic reconnection Gabriela Landinez et.al. 2410.06391 null
2024-10-08 HyperDet: Generalizable Detection of Synthesized Images by Generating and Merging A Mixture of Hyper LoRAs Huangsen Cao et.al. 2410.06044 null
2024-10-08 QERA: an Analytical Framework for Quantization Error Reconstruction Cheng Zhang et.al. 2410.06040 null
2024-10-08 Hyper Adversarial Tuning for Boosting Adversarial Robustness of Pretrained Large Vision Models Kangtao Lv et.al. 2410.05951 null
2024-10-07 GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting Yukang Cao et.al. 2410.05259 null
2024-10-08 PAMLR: A Passive-Active Multi-Armed Bandit-Based Solution for LoRa Channel Allocation Jihoon Yun et.al. 2410.05147 null
2024-10-07 HyperINF: Unleashing the HyperPower of the Schulz's Method for Data Influence Estimation Xinyu Zhou et.al. 2410.05090 link
2024-10-07 Low-Rank Continual Pyramid Vision Transformer: Incrementally Segment Whole-Body Organs in CT with Light-Weighted Adaptation Vince Zhu et.al. 2410.04689 null
2024-10-06 Learning De-Biased Representations for Remote-Sensing Imagery Zichen Tian et.al. 2410.04546 link
2024-10-05 Learning on LoRAs: GL-Equivariant Processing of Low-Rank Weight Spaces for Large Finetuned Models Theo et.al. 2410.04207 null
2024-10-05 LoRTA: Low Rank Tensor Adaptation of Large Language Models Ignacio Hounie et.al. 2410.04060 null
2024-10-05 Hyperbolic Fine-tuning for Large Language Models Menglin Yang et.al. 2410.04010 link
2024-10-04 AutoLoRA: AutoGuidance Meets Low-Rank Adaptation for Diffusion Models Artur Kasymov et.al. 2410.03941 link
2024-10-04 Collaborative and Efficient Personalization with Mixtures of Adaptors Abdulla Jasem Almansoori et.al. 2410.03497 null
2024-10-03 Neutral residues: revisiting adapters for model extension Franck Signe Talla et.al. 2410.02744 null
2024-10-03 Encryption-Friendly LLM Architecture Donghwan Rho et.al. 2410.02486 null
2024-10-02 NEAT: Nonlinear Parameter-efficient Adaptation of Pre-trained Models Yibo Zhong et.al. 2410.01870 null
2024-10-02 Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint? Xi Chen et.al. 2410.01623 link
2024-10-02 DLP-LoRA: Efficient Task-Specific LoRA Fusion with a Dynamic, Lightweight Plugin for Large Language Models Yuxuan Zhang et.al. 2410.01497 link
2024-10-04 Selective Aggregation for Low-Rank Adaptation in Federated Learning Pengxin Guo et.al. 2410.01463 link
2024-10-02 FlashMask: Efficient and Rich Mask Extension of FlashAttention Guoxia Wang et.al. 2410.01359 link
2024-10-01 MoS: Unleashing Parameter Efficiency of Low-Rank Adaptation with Mixture of Shards Sheng Wang et.al. 2410.00938 null
2024-10-02 Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models Saurav Jha et.al. 2410.00700 null
2024-10-01 PrivTuner with Homomorphic Encryption and LoRA: A P3EFT Scheme for Privacy-Preserving Parameter-Efficient Fine-Tuning of AI Foundation Models Yang Li et.al. 2410.00433 null
2024-09-30 Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models Ji Liu et.al. 2410.00131 null
2024-09-30 UIR-LoRA: Achieving Universal Image Restoration through Multiple Low-Rank Adaptation Cheng Zhang et.al. 2409.20197 link
2024-09-30 BSharedRAG: Backbone Shared Retrieval-Augmented Generation for the E-commerce Domain Kaisi Guan et.al. 2409.20075 null
2024-09-30 HDMoLE: Mixture of LoRA Experts with Hierarchical Routing and Dynamic Thresholds for Fine-Tuning LLM-based ASR Models Bingshen Mu et.al. 2409.19878 null
2024-09-29 Learning Attentional Mixture of LoRAs for Language Model Continual Learning Jialin Liu et.al. 2409.19611 null
2024-09-29 Abstractive Summarization of Low resourced Nepali language using Multilingual Transformers Prakash Dhakal et.al. 2409.19566 null
2024-09-27 HM3: Heterogeneous Multi-Class Model Merging Stefan Hackmann et.al. 2409.19173 null
2024-09-26 MARS: Multi-radio Architecture with Radio Selection using Decision Trees for emerging mesoscale CPS/IoT applications Jothi Prasanna Shanmuga Sundaram et.al. 2409.18043 null
2024-09-26 PEDRO: Parameter-Efficient Fine-tuning with Prompt DEpenDent Representation MOdification Tianfang Xie et.al. 2409.17834 null
2024-09-30 Efficient In-Domain Question Answering for Resource-Constrained Environments Isaac Chung et.al. 2409.17648 null
2024-09-26 On the Implicit Relation Between Low-Rank Adaptation and Differential Privacy Saber Malekmohammadi et.al. 2409.17538 null
2024-09-26 A Time Series is Worth Five Experts: Heterogeneous Mixture of Experts for Traffic Flow Prediction Guangyu Wang et.al. 2409.17440 link
2024-09-25 Parameter-efficient Bayesian Neural Networks for Uncertainty-aware Depth Estimation Richard D. Paul et.al. 2409.17085 null
2024-09-25 Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors Aiping Zhang et.al. 2409.17058 link
2024-09-25 PMSS: Pretrained Matrices Skeleton Selection for LLM Fine-tuning Qibin Wang et.al. 2409.16722 null
2024-09-25 GraphLoRA: Structure-Aware Contrastive Low-Rank Adaptation for Cross-Graph Transfer Learning Zhe-Rui Yang et.al. 2409.16670 null
2024-09-25 Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models Deepak Sridhar et.al. 2409.16535 link
2024-09-24 Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering Ziyu Zhao et.al. 2409.16167 null
2024-09-24 Evaluation of state-of-the-art ASR Models in Child-Adult Interactions Aditya Ashvin et.al. 2409.16135 null
2024-09-24 Bridging Speech and Text: Enhancing ASR with Pinyin-to-Character Pre-training in LLMs Yang Yuhang et.al. 2409.16005 null
2024-09-24 Boosting Code-Switching ASR with Mixture of Experts Enhanced Speech-Conditioned LLM Fengrun Zhang et.al. 2409.15905 null
2024-09-24 Aided design of bridge aesthetics based on Stable Diffusion fine-tuning Leye Zhang et.al. 2409.15812 link
2024-09-17 Chain-of-Thought Prompting for Speech Translation Ke Hu et.al. 2409.11538 null
2024-09-17 Beyond LoRA: Exploring Efficient Fine-Tuning Techniques for Time Series Foundational Models Divij Gupta et.al. 2409.11302 null
2024-09-17 LoRa Communication for Agriculture 4.0: Opportunities, Challenges, and Future Directions Lameya Aldhaheri et.al. 2409.11200 null
2024-09-17 Few-Shot Domain Adaptation for Learned Image Compression Tianyu Zhang et.al. 2409.11111 null
2024-09-17 KVPruner: Structural Pruning for Faster and Memory-Efficient Large Language Models Bo Lv et.al. 2409.11057 null
2024-09-18 Propulsion: Steering LLM with Tiny Fine-Tuning Md Kowsher et.al. 2409.10927 link
2024-09-16 A Bayesian Interpretation of Adaptive Low-Rank Adaptation Haolin Chen et.al. 2409.10673 link
2024-09-16 From Text to Emoji: How PEFT-Driven Personality Manipulation Unleashes the Emoji Potential in LLMs Navya Jain et.al. 2409.10245 null
2024-09-16 Robust Bird's Eye View Segmentation by Adapting DINOv2 Merve Rabia Barın et.al. 2409.10228 null
2024-09-19 jina-embeddings-v3: Multilingual Embeddings With Task LoRA Saba Sturua et.al. 2409.10173 null
2024-09-16 Rapid Adaptation of Earth Observation Foundation Models for Segmentation Karthick Panner Selvam et.al. 2409.09907 null
2024-09-15 AlpaPICO: Extraction of PICO Frames from Clinical Trial Documents Using LLMs Madhusudan Ghosh et.al. 2409.09704 link
2024-09-14 COMFORT: A Continual Fine-Tuning Framework for Foundation Models Targeted at Consumer Healthcare Chia-Hao Li et.al. 2409.09549 null
2024-09-14 SAM-OCTA2: Layer Sequence OCTA Segmentation with Fine-tuned Segment Anything Model 2 Xinrun Chen et.al. 2409.09286 link
2024-09-13 Data Efficient Child-Adult Speaker Diarization with Simulated Conversations Anfeng Xu et.al. 2409.08881 link
2024-09-13 Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions Lingwei Meng et.al. 2409.08596 null
2024-09-13 ATFLRec: A Multimodal Recommender System with Audio-Text Fusion and Low-Rank Adaptation via Instruction-Tuned Large Language Model Zezheng Qin et.al. 2409.08543 null
2024-09-13 Risks When Sharing LoRA Fine-Tuned Diffusion Model Weights Dixi Yao et.al. 2409.08482 null
2024-09-13 Toward satisfactory public accessibility: A crowdsourcing approach through online reviews to inclusive urban design Lingyao Li et.al. 2409.08459 null
2024-09-12 AudioBERT: Audio Knowledge Augmented Language Model Hyunjong Ok et.al. 2409.08199 link
2024-09-12 Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy Bojian Li et.al. 2409.07723 null
2024-09-11 Efficient Localized Adaptation of Neural Weather Forecasting: A Case Study in the MENA Region Muhammad Akhtar Munir et.al. 2409.07585 link
2024-09-11 Improving Anomalous Sound Detection via Low-Rank Adaptation Fine-Tuning of Pre-Trained Audio Models Xinhu Zheng et.al. 2409.07016 null
2024-09-10 SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation Teng Hu et.al. 2409.06633 null
2024-09-09 Elucidating Optimal Reward-Diversity Tradeoffs in Text-to-Image Diffusion Models Rohit Jena et.al. 2409.06493 null
2024-09-10 HexaCoder: Secure Code Generation via Oracle-Guided Synthetic Training Data Hossein Hajipour et.al. 2409.06446 link
2024-09-10 VE: Modeling Multivariate Time Series Correlation with Variate Embedding Shangjiong Wang et.al. 2409.06169 link
2024-09-09 FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank Adaptations Ziyao Wang et.al. 2409.05976 link
2024-09-09 SVFit: Parameter-Efficient Fine-Tuning of Large Pre-Trained Models Using Singular Values Chengwei Sun et.al. 2409.05926 null
2024-09-09 TriplePlay: Enhancing Federated Learning with CLIP for Non-IID Data and Resource Efficiency Ahmed Imteaj et.al. 2409.05347 null
2024-09-08 Exploring Intrinsic Language-specific Subspaces in Fine-tuning Multilingual Neural Machine Translation Zhe Cao et.al. 2409.05224 link
2024-09-06 Customizing Large Language Model Generation Style using Parameter-Efficient Finetuning Xinyue Liu et.al. 2409.04574 null
2024-09-06 Fast Forwarding Low-Rank Training Adir Rahamim et.al. 2409.04206 null
2024-09-05 Continual Skill and Task Learning via Dialogue Weiwei Gu et.al. 2409.03166 null
2024-09-04 Non-Orthogonal Multiple-Access Strategies for Direct-to-Satellite IoT Networks Felipe Augusto Tondo et.al. 2409.02748 null
2024-09-04 Robust Federated Finetuning of Foundation Models via Alternating Minimization of LoRA Shuangyi Chen et.al. 2409.02346 null
2024-08-31 CoRA: Optimizing Low-Rank Adaptation with Common Subspace of Large Language Models Xiaojun Xiao et.al. 2409.02119 null
2024-09-02 LoGex: Improved tail detection of extremely rare histopathology classes via guided diffusion Maximilian Mueller et.al. 2409.01317 link
2024-09-02 Unleashing the Power of Task-Specific Directions in Parameter Efficient Fine-tuning Chongjie Si et.al. 2409.01035 link
2024-09-02 Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language Jeong Hun Yeo et.al. 2409.00986 link
2024-08-30 Enhancing Event Reasoning in Large Language Models through Instruction Fine-Tuning with Semantic Causal Graphs Mazal Bethany et.al. 2409.00209 null
2024-08-30 DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model Mona Sheikh Zeinoddin et.al. 2408.17433 link
2024-08-30 MoRe Fine-Tuning with 10x Fewer Parameters Wenxuan Tan et.al. 2408.17383 link
2024-08-30 Wireless Integrated Authenticated Communication System (WIA-Comm) Amith N Bharadwaj et.al. 2408.17112 null
2024-09-02 Instant Adversarial Purification with Adversarial Consistency Distillation Chun Tong Lei et.al. 2408.17064 null
2024-08-30 Efficient Image Restoration through Low-Rank Adaptation and Stable Diffusion XL Haiyang Zhao et.al. 2408.17060 null
2024-08-29 LoraMap: Harnessing the Power of LoRA Connections Hyeryun Park et.al. 2408.16264 null
2024-08-28 LeMON: Learning to Learn Multi-Operator Networks Jingmin Sun et.al. 2408.16168 link
2024-08-28 Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models Yuncheng Yang et.al. 2408.15915 link
2024-08-28 StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of Style Elements Jillian Fisher et.al. 2408.15666 link
2024-08-28 TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation Junbao Zhou et.al. 2408.15657 link
2024-08-28 Whisper-PMFA: Partial Multi-Scale Feature Aggregation for Speaker Verification using Whisper Models Yiyang Zhao et.al. 2408.15585 null
2024-08-28 VoiceTailor: Lightweight Plug-In Adapter for Diffusion-Based Personalized Text-to-Speech Heeseung Kim et.al. 2408.14739 null
2024-08-27 PAT: Pruning-Aware Tuning for Large Language Models Yijiang Liu et.al. 2408.14721 link
2024-08-27 StyleSpeech: Parameter-efficient Fine Tuning for Pre-trained Controllable Text-to-Speech Haowei Lou et.al. 2408.14713 link
2024-08-26 CURLoRA: Stable LLM Continual Fine-Tuning and Catastrophic Forgetting Mitigation Muhammad Fawi et.al. 2408.14572 link
2024-08-27 Step-by-Step Unmasking for Parameter-Efficient Fine-tuning of Large Language Models Aradhye Agarwal et.al. 2408.14470 link
2024-08-26 Reprogramming Foundational Large Language Models(LLMs) for Enterprise Adoption for Spatio-Temporal Forecasting Applications: Unveiling a New Era in Copilot-Guided Cross-Modal Time Series Representation Learning Sakhinana Sagar Srinivas et.al. 2408.14387 null
2024-08-27 SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher Trung Dao et.al. 2408.14176 link
2024-08-25 TalkLoRA: Low-Rank Adaptation for Speech-Driven Animation Jack Saunders et.al. 2408.13714 null
2024-08-24 Can Visual Foundation Models Achieve Long-term Point Tracking? Görkay Aydemir et.al. 2408.13575 null
2024-08-23 The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs: An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities Venkatesh Balavadhani Parthasarathy et.al. 2408.13296 null
2024-08-23 CLLMFS: A Contrastive Learning enhanced Large Language Model Framework for Few-Shot Named Entity Recognition Yafeng Zhang et.al. 2408.12834 null
2024-08-23 Investigating LLM Applications in E-Commerce Chester Palen-Michel et.al. 2408.12779 null
2024-08-22 EvalYaks: Instruction Tuning Datasets and LoRA Fine-tuned Models for Automated Scoring of CEFR B2 Speaking Assessment Transcripts Nicy Scaria et.al. 2408.12226 link
2024-08-21 Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP Standards Omar Erak et.al. 2408.11775 link
2024-08-21 EAGLE: Elevating Geometric Reasoning through LLM-empowered Visual Instruction Tuning Zhihao Li et.al. 2408.11397 null
2024-08-20 EELE: Exploring Efficient and Extensible LoRA Integration in Emotional Text-to-Speech Xin Qi et.al. 2408.10852 null
2024-08-21 Flexora: Flexible Low Rank Adaptation for Large Language Models Chenxing Wei et.al. 2408.10774 null
2024-08-20 Large Language Models for Multimodal Deformable Image Registration Mingrui Ma et.al. 2408.10703 link
2024-08-20 Towards Rehearsal-Free Multilingual ASR: A LoRA-based Case Study on Whisper Tianyi Xu et.al. 2408.10680 null
2024-08-20 CoRA: Collaborative Information Perception by Large Language Model's Weights for Recommendation Yuting Liu et.al. 2408.10645 null
2024-08-18 NoRA: Nested Low-Rank Adaptation for Efficient Fine-Tuning Large Models Cheng Lin et.al. 2408.10280 null
2024-08-19 SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models Anke Tang et.al. 2408.10174 link
2024-08-19 Customizing Language Models with Instance-wise LoRA for Sequential Recommendation Xiaoyu Kong et.al. 2408.10159 link
2024-08-19 TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition Tianwei Lin et.al. 2408.09856 link
2024-08-18 Infinite Scrolling, Finite Satisfaction: Exploring User Behavior and Satisfaction on Social Media in Bangladesh Sanzana Karim Lora et.al. 2408.09601 null
2024-08-17 ConVerSum: A Contrastive Learning based Approach for Data-Scarce Solution of Cross-Lingual Summarization Beyond Direct Equivalents Sanzana Karim Lora et.al. 2408.09273 null
2024-08-17 An Exploratory Study on Fine-Tuning Large Language Models for Secure Code Generation Junjie Li et.al. 2408.09078 link
2024-08-17 MoRA: LoRA Guided Multi-Modal Disease Diagnosis with Missing Modality Zhiyi Shi et.al. 2408.09064 null
2024-08-16 AdaRank: Disagreement Based Module Rank Prediction for Low-rank Adaptation Yihe Dong et.al. 2408.09015 link
2024-08-16 ML Study of MaliciousTransactions in Ethereum Natan Katz et.al. 2408.08749 null
2024-08-16 RBLA: Rank-Based-LoRA-Aggregation for Fine-tuning Heterogeneous Models in FLaaS Shuaijun Chen et.al. 2408.08699 null
2024-08-16 LLM-PCGC: Large Language Model-based Point Cloud Geometry Compression Yuqi Ye et.al. 2408.08682 null
2024-08-16 Adaptive Layer Selection for Efficient Vision Transformer Fine-Tuning Alessio Devoto et.al. 2408.08670 null
2024-08-16 A New Chinese Landscape Paintings Generation Model based on Stable Diffusion using DreamBooth Yujia Gu et.al. 2408.08561 null
2024-08-15 Heavy Labels Out! Dataset Distillation with Label Space Lightening Ruonan Yu et.al. 2408.08201 null
2024-08-15 When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding Pingping Zhang et.al. 2408.08093 null
2024-08-14 Domain-invariant Representation Learning via Segment Anything Model for Blood Cell Classification Yongcheng Li et.al. 2408.07467 link
2024-08-13 SeLoRA: Self-Expanding Low-Rank Adaptation of Latent Diffusion Model for Medical Image Synthesis Yuchen Mao et.al. 2408.07196 null
2024-08-13 Imagen 3 Imagen-Team-Google et.al. 2408.07009 null
2024-08-13 New refinements of Narayana polynomials and Motzkin polynomials Janet J. W. Dong et.al. 2408.06912 null
2024-08-13 LoRA $^2$ : Multi-Scale Low-Rank Approximations for Fine-Tuning Large Language Models Jia-Chen Zhang et.al. 2408.06854 null
2024-08-13 DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion Yujia Wu et.al. 2408.06740 null
2024-08-13 Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Segment Anything Model Yongcheng Li et.al. 2408.06716 link
2024-08-13 Harnessing Earnings Reports for Stock Predictions: A QLoRA-Enhanced LLM Approach Haowei Ni et.al. 2408.06634 null
2024-08-13 Towards Robust and Cost-Efficient Knowledge Unlearning for Large Language Models Sungmin Cha et.al. 2408.06621 null
2024-08-15 ControlNeXt: Powerful and Efficient Control for Image and Video Generation Bohao Peng et.al. 2408.06070 link
2024-08-11 Hotfixing Large Language Models for Cod Zhou Yang et.al. 2408.05727 null
2024-08-09 TaSL: Task Skill Localization and Consolidation for Language Model Continual Learning Yujie Feng et.al. 2408.05200 link
2024-08-09 LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description Yizhang Jin et.al. 2408.04957 link
2024-08-09 Energy performance of LR-FHSS: analysis and evaluation Roger Sanchez-Vital et.al. 2408.04908 null
2024-08-08 Bias-Aware Low-Rank Adaptation: Mitigating Catastrophic Inheritance of Large Language Models Yupeng Chang et.al. 2408.04556 link
2024-08-08 UNLEARN Efficient Removal of Knowledge in Large Language Models Tyler Lizzo et.al. 2408.04140 null
2024-08-07 Image-to-LaTeX Converter for Mathematical Formulas and Text Daniil Gurgurov et.al. 2408.04015 link
2024-08-07 Speaker Adaptation for Quantised End-to-End ASR Models Qiuming Zhao et.al. 2408.03979 null
2024-08-07 A Comparison of LLM Finetuning Methods & Evaluation Metrics with Travel Chatbot Use Case Sonia Meyer et.al. 2408.03562 null
2024-08-11 Lifelong Personalized Low-Rank Adaptation of Large Language Models for Recommendation Jiachen Zhu et.al. 2408.03533 null
2024-08-06 FastEdit: Fast Text-Guided Single-Image Editing via Semantic-Aware Diffusion Fine-Tuning Zhi Chen et.al. 2408.03355 null
2024-08-06 SARA: Singular-Value Based Adaptive Low-Rank Adaption Jihao Gu et.al. 2408.03290 null
2024-08-06 Leveraging Parameter Efficient Training Methods for Low Resource Text Classification: A Case Study in Marathi Pranita Deshmukh et.al. 2408.03172 null
2024-08-06 L3iTC at the FinLLM Challenge Task: Quantization for Financial Text Classification & Summarization Elvys Linhares Pontes et.al. 2408.03033 null
2024-08-06 Towards Smart Microfarming in an Urban Computing Continuum Marla Grunewald et.al. 2408.02992 null
2024-08-05 StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion Zhichao Wang et.al. 2408.02178 null
2024-08-04 SR-CIS: Self-Reflective Incremental System with Decoupled Memory and Reasoning Biqing Qi et.al. 2408.01970 null
2024-08-03 Music2P: A Multi-Modal AI-Driven Tool for Simplifying Album Cover Design Joong Ho Choi et.al. 2408.01651 link
2024-08-02 MoDE: Effective Multi-task Parameter Efficient Fine-Tuning with a Mixture of Dyadic Experts Lin Ning et.al. 2408.01505 null
2024-08-02 Conditional LoRA Parameter Generation Xiaolong Jin et.al. 2408.01415 null
2024-08-02 Pre-trained Language Models Improve the Few-shot Prompt Ability of Decision Transformer Yu Yang et.al. 2408.01402 null
2024-08-02 Contribution-based Low-Rank Adaptation with Pre-training Model for Real Image Restoration Donwon Park et.al. 2408.01099 null
2024-08-02 Tensor Train Low-rank Approximation (TT-LoRA): Democratizing AI with Accelerated LLMs Afia Anjum et.al. 2408.01008 null
2024-08-02 PERSOMA: PERsonalized SOft ProMpt Adapter Architecture for Personalized Language Prompting Liam Hebert et.al. 2408.00960 null
2024-08-01 Reclaiming Residual Knowledge: A Novel Paradigm to Low-Bit Quantization Róisín Luo et.al. 2408.00923 null
2024-07-31 Ge-based Clinopyroxene series: first principles and experimental local probe study Ricardo P. Moreira et.al. 2407.21749 null
2024-07-31 A Federated Learning-Friendly Approach for Parameter-Efficient Fine-Tuning of SAM in 3D Segmentation Mothilal Asokan et.al. 2407.21739 null
2024-07-31 Zero-Shot Cross-Domain Dialogue State Tracking via Dual Low-Rank Adaptation Xiang Luo et.al. 2407.21633 link
2024-07-30 CELLM: An Efficient Communication in Large Language Models Training for Federated Learning Raja Vavekanand et.al. 2407.20557 null
2024-07-29 Generative Diffusion Model Bootstraps Zero-shot Classification of Fetal Ultrasound Images In Underrepresented African Populations Fangyijie Wang et.al. 2407.20072 link
2024-07-28 Memory-efficient Training of LLMs with Larger Mini-batches Dang Nguyen et.al. 2407.19580 null
2024-07-27 Parameter-Efficient Fine-Tuning via Circular Convolution Aochuan Chen et.al. 2407.19342 null
2024-07-27 The Impact of LoRA Adapters for LLMs on Clinical NLP Classification Under Data Limitations Thanh-Dung Le et.al. 2407.19299 null
2024-07-26 VIMs: Virtual Immunohistochemistry Multiplex staining via Text-to-Stain Diffusion Trained on Uniplex Stains Shikha Dubey et.al. 2407.19113 null
2024-07-25 Stay Tuned: An Empirical Study of the Impact of Hyperparameters on LLM Tuning in Real-World Applications Alon Halfon et.al. 2407.18990 null
2024-07-25 LoRA-Pro: Are Low-Rank Adapters Properly Optimized? Zhengbo Wang et.al. 2407.18242 link
2024-07-25 DINOv2 Rocks Geological Image Analysis: Classification, Segmentation, and Interpretability Florent Brondolo et.al. 2407.18100 link
2024-07-24 Channel-Aware Low-Rank Adaptation in Time Series Forecasting Tong Nie et.al. 2407.17246 link
2024-07-24 Accurate and Efficient Fine-Tuning of Quantized Large Language Models Through Optimal Balance Ao Shen et.al. 2407.17029 link
2024-07-22 Rapid Switching and Multi-Adapter Fusion via Sparse High Rank Adapters Kartikeya Bhardwaj et.al. 2407.16712 null
2024-07-23 DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models Zhenyu Xie et.al. 2407.16511 null
2024-07-23 Harmonizing Visual Text Comprehension and Generation Zhen Zhao et.al. 2407.16364 link
2024-07-23 FoRA: Low-Rank Adaptation Model beyond Multimodal Siamese Network Weiying Xie et.al. 2407.16129 link
2024-07-22 Test-Time Low Rank Adaptation via Confidence Maximization for Zero-Shot Generalization of Vision-Language Models Raza Imam et.al. 2407.15913 link
2024-07-22 Zero-Shot Embeddings Inform Learning and Forgetting with Vision-Language Encoders Laura Niss et.al. 2407.15731 null
2024-07-22 LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models Xi Chen et.al. 2407.15415 link
2024-07-21 Learn to Preserve and Diversify: Parameter-Efficient Group with Orthogonal Regularization for Domain Generalization Jiajun Hu et.al. 2407.15085 null
2024-07-21 MedSAGa: Few-shot Memory Efficient Medical Image Segmentation using Gradient Low-Rank Projection in SAM Navyansh Mahla et.al. 2407.15042 null

(back to top)

Model Compression

Publish Date Title Authors PDF Code
2024-12-30 Improving Acoustic Scene Classification in Low-Resource Conditions Zhi Chen et.al. 2412.20722 null
2024-12-28 Injecting Explainability and Lightweight Design into Weakly Supervised Video Anomaly Detection Systems Wen-Dong Jiang et.al. 2412.20201 null
2024-12-28 SimLTD: Simple Supervised and Semi-Supervised Long-Tailed Object Detection Phi Vu Tran et.al. 2412.20047 null
2024-12-28 Invariant debiasing learning for recommendation via biased imputation Ting Bai et.al. 2412.20036 link
2024-12-28 Learning Adaptive and View-Invariant Vision Transformer with Multi-Teacher Knowledge Distillation for Real-Time UAV Tracking You Wu et.al. 2412.20002 link
2024-12-27 Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis Jiaqi Wang et.al. 2412.19654 link
2024-12-27 Feature Alignment-Based Knowledge Distillation for Efficient Compression of Large Language Models Shuo Wang et.al. 2412.19449 null
2024-12-26 SpectralKD: Understanding and Optimizing Vision Transformer Distillation through Spectral Analysis Huiyuan Tian et.al. 2412.19055 null
2024-12-25 Optimization and Scalability of Collaborative Filtering Algorithms in Large Language Models Haowei Yang et.al. 2412.18715 null
2024-12-23 Edge-AI for Agriculture: Lightweight Vision Models for Disease Detection in Resource-Limited Settings Harsh Joshi et.al. 2412.18635 null
2024-12-24 HTR-JAND: Handwritten Text Recognition with Joint Attention Network and Knowledge Distillation Mohammed Hamdan et.al. 2412.18524 null
2024-12-24 Understanding Artificial Neural Network's Behavior from Neuron Activation Perspective Yizhou Zhang et.al. 2412.18073 null
2024-12-23 CoSurfGS:Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction Yuanyuan Gao et.al. 2412.17612 null
2024-12-23 GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference Chao Zeng et.al. 2412.17560 null
2024-12-24 Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement Hyeonjin Kim et.al. 2412.17387 link
2024-12-23 Better Knowledge Enhancement for Privacy-Preserving Cross-Project Defect Prediction Yuying Wang et.al. 2412.17317 null
2024-12-23 LMD-PGN: Cross-Modal Knowledge Distillation from First-Person-View Images to Third-Person-View BEV Maps for Universal Point Goal Navigation Riku Uemura et.al. 2412.17282 null
2024-12-22 Lightweight Design and Optimization methods for DCNNs: Progress and Futures Hanhua Long et.al. 2412.16886 null
2024-12-21 Large Language Models Compression via Low-Rank Feature Distillation Yaya Sy et.al. 2412.16719 null
2024-12-21 CyberSentinel: Efficient Anomaly Detection in Programmable Switch using Knowledge Distillation Sankalp Mittal et.al. 2412.16693 null
2024-12-21 Semantics Prompting Data-Free Quantization for Low-Bit Vision Transformers Yunshan Zhong et.al. 2412.16553 null
2024-12-21 STKDRec: Spatial-Temporal Knowledge Distillation for Takeaway Recommendation Shuyuan Zhao et.al. 2412.16502 null
2024-12-20 BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models Patrick Haller et.al. 2412.15978 null
2024-12-20 A New Method to Capturing Compositional Knowledge in Linguistic Space Jiahe Wan et.al. 2412.15632 null
2024-12-19 Uncertainty-Guided Cross Attention Ensemble Mean Teacher for Semi-supervised Medical Image Segmentation Meghana Karri et.al. 2412.15380 null
2024-12-19 Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models Reza Shirkavand et.al. 2412.15341 null
2024-12-19 Self-Evolution Knowledge Distillation for LLM-based Machine Translation Yuncheng Song et.al. 2412.15303 null
2024-12-19 Adaptive Pruning for Large Language Models with Structural Importance Awareness Haotian Zheng et.al. 2412.15127 null
2024-12-19 SCKD: Semi-Supervised Cross-Modality Knowledge Distillation for 4D Radar Object Detection Ruoyu Xu et.al. 2412.14571 null
2024-12-19 Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models Xiao Cui et.al. 2412.14528 null
2024-12-19 Knowledge Distillation in RNN-Attention Models for Early Prediction of Student Performance Sukrit Leelaluk et.al. 2412.14526 link
2024-12-18 A Survey on Inference Optimization Techniques for Mixture of Experts Models Jiacheng Liu et.al. 2412.14219 link
2024-12-18 Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective Zhiyuan Zeng et.al. 2412.14135 null
2024-12-18 On Explaining Knowledge Distillation: Measuring and Visualising the Knowledge Transfer Process Gereziher Adhane et.al. 2412.13943 null
2024-12-18 Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN Pengxiang Li et.al. 2412.13795 link
2024-12-18 Learnable Prompting SAM-induced Knowledge Distillation for Semi-supervised Medical Image Segmentation Kaiwen Huang et.al. 2412.13742 link
2024-12-18 On the Compression of Language Models for Code: An Empirical Study on CodeBERT Giordano d'Aloisio et.al. 2412.13737 null
2024-12-18 Hybrid Data-Free Knowledge Distillation Jialiang Tang et.al. 2412.13525 link
2024-12-18 Deploying Foundation Model Powered Agent Services: A Survey Wenchao Xu et.al. 2412.13437 null
2024-12-17 In-Context Learning Distillation for Efficient Few-Shot Fine-Tuning Yifei Duan et.al. 2412.13243 null
2024-12-17 Modality-Inconsistent Continual Learning of Multimodal Large Language Models Weiguo Pian et.al. 2412.13050 null
2024-12-17 Efficient Speech Command Recognition Leveraging Spiking Neural Network and Curriculum Learning-based Knowledge Distillation Jiaqi Wang et.al. 2412.12858 null
2024-12-17 RemoteTrimmer: Adaptive Structural Pruning for Remote Sensing Image Classification Guanwenjie Zou et.al. 2412.12603 link
2024-12-17 PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts Kun Guo et.al. 2412.12460 link
2024-12-16 Neural Collapse Inspired Knowledge Distillation Shuoxi Zhang et.al. 2412.11788 null
2024-12-16 Relation-Guided Adversarial Learning for Data-free Knowledge Transfer Yingping Liang et.al. 2412.11380 link
2024-12-16 BiM-VFI: directional Motion Field-Guided Frame Interpolation for Video with Non-uniform Motions Wonyong Seo et.al. 2412.11365 null
2024-12-15 Wearable Accelerometer Foundation Models for Health via Knowledge Distillation Salar Abbaspourazad et.al. 2412.11276 null
2024-12-15 TrimLLM: Progressive Layer Dropping for Domain-Specific LLMs Lanxiang Hu et.al. 2412.11242 null
2024-12-15 ProFe: Communication-Efficient Decentralized Federated Learning via Distillation and Prototypes Pedro Miguel Sánchez Sánchez et.al. 2412.11207 null
2024-12-15 Leveraging Large Language Models for Active Merchant Non-player Characters Byungjun Kim et.al. 2412.11189 null
2024-12-15 Knowledge Migration Framework for Smart Contract Vulnerability Detection Luqi Wang et.al. 2412.11175 null
2024-12-15 Redefining Normal: A Novel Object-Level Approach for Multi-Object Novelty Detection Mohammadreza Salehi et.al. 2412.11148 link
2024-12-17 On Distilling the Displacement Knowledge for Few-Shot Class-Incremental Learning Pengfei Fang et.al. 2412.11017 null
2024-12-13 Can Students Beyond The Teacher? Distilling Knowledge from Teacher's Bias Jianhua Zhang et.al. 2412.09874 null
2024-12-13 ScaleOT: Privacy-utility-scalable Offsite-tuning with Dynamic LayerReplace and Selective Rank Compression Kai Yao et.al. 2412.09812 null
2024-12-13 LLM Distillation for Efficient Few-Shot Multiple Choice Question Answering Patrick Sutanto et.al. 2412.09807 null
2024-12-12 SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training Dongting Hu et.al. 2412.09619 null
2024-12-12 A Theoretical Analysis of Soft-Label vs Hard-Label Training in Neural Networks Saptarshi Mandal et.al. 2412.09579 null
2024-12-12 All You Need in Knowledge Distillation Is a Tailored Coordinate System Junjie Zhou et.al. 2412.09388 null
2024-12-12 Optimising TinyML with Quantization and Distillation of Transformer and Mamba Models for Indoor Localisation on Edge Devices Thanaphon Suwannaphong et.al. 2412.09289 null
2024-12-15 DASK: Distribution Rehearsing via Adaptive Style Kernel Learning for Exemplar-Free Lifelong Person Re-Identification Kunlun Xu et.al. 2412.09224 link
2024-12-12 Multimodal Industrial Anomaly Detection by Crossmodal Reverse Distillation Xinyue Liu et.al. 2412.08949 link
2024-12-12 Dynamic Contrastive Knowledge Distillation for Efficient Image Restoration Yunshuai Zhou et.al. 2412.08939 null
2024-12-11 Efficient Gravitational Wave Parameter Estimation via Knowledge Distillation: A ResNet1D-IAF Approach Xihua Zhu et.al. 2412.08672 null
2024-12-11 Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation Jiaming Lv et.al. 2412.08139 null
2024-12-11 DAKD: Data Augmentation and Knowledge Distillation using Diffusion Models for SAR Oil Spill Segmentation Jaeho Moon et.al. 2412.08116 null
2024-12-10 Low-Rank Correction for Quantized LLMs Meyer Scetbon et.al. 2412.07902 null
2024-12-10 Unlocking the Potential of Reverse Distillation for Anomaly Detection Xinyue Liu et.al. 2412.07579 link
2024-12-10 TT-MPD: Test Time Model Pruning and Distillation Haihang Wu et.al. 2412.07114 null
2024-12-09 FM2DS: Few-Shot Multimodal Multihop Data Synthesis with Knowledge Distillation for Question Answering Amirhossein Abaskohi et.al. 2412.07030 link
2024-12-09 VQ4ALL: Efficient Neural Network Representation via a Universal Codebook Juncan Deng et.al. 2412.06875 null
2024-12-09 Compression for Better: A General and Stable Lossless Compression Framework Boyang Zhang et.al. 2412.06868 null
2024-12-09 Lossless Model Compression via Joint Low-Rank Factorization Optimization Boyang Zhang et.al. 2412.06867 null
2024-12-08 GL-Fusion: Rethinking the Combination of Graph Neural Network and Large Language model Haotong Yang et.al. 2412.06849 null
2024-12-10 Federated Split Learning with Model Pruning and Gradient Quantization in Wireless Networks Junhe Zhang et.al. 2412.06414 null
2024-12-09 U-Know-DiffPAN: An Uncertainty-aware Knowledge Distillation Diffusion Framework with Details Enhancement for PAN-Sharpening Sungpyo Kim et.al. 2412.06243 null
2024-12-08 Enhancing Content Representation for AR Image Quality Assessment Using Knowledge Distillation Aymen Sekhri et.al. 2412.06003 null
2024-12-07 Neighborhood Commonality-aware Evolution Network for Continuous Generalized Category Discovery Ye Wang et.al. 2412.05573 null
2024-12-07 Trimming Down Large Spiking Vision Transformers via Heterogeneous Quantization Search Boxun Xu et.al. 2412.05505 null
2024-12-06 BEExformer: A Fast Inferencing Transformer Architecture via Binarization with Multiple Early Exits Wazib Ansar et.al. 2412.05225 null
2024-12-06 One-shot Federated Learning via Synthetic Distiller-Distillate Communication Junyuan Zhang et.al. 2412.05186 link
2024-12-06 CCS: Continuous Learning for Customized Incremental Wireless Sensing Services Qunhang Fu et.al. 2412.04821 null
2024-12-05 Diffusion-Augmented Coreset Expansion for Scalable Dataset Distillation Ali Abbasi et.al. 2412.04668 null
2024-12-05 FedDW: Distilling Weights through Consistency Optimization in Heterogeneous Federated Learning Jiayu Liu et.al. 2412.04521 link
2024-12-05 Expanding Deep Learning-based Sensing Systems with Multi-Source Knowledge Transfer Gaole Dai et.al. 2412.04060 null
2024-12-04 Designing DNNs for a trade-off between robustness and processing performance in embedded devices Jon Gutiérrez-Zaballa et.al. 2412.03682 null
2024-12-04 Evaluating Single Event Upsets in Deep Neural Networks for Semantic Segmentation: an embedded system perspective Jon Gutiérrez-Zaballa et.al. 2412.03630 link
2024-12-03 CPTQuant -- A Novel Mixed Precision Post-Training Quantization Techniques for Large Language Models Amitash Nanda et.al. 2412.03599 null
2024-12-07 Enhancing CLIP Conceptual Embedding through Knowledge Distillation Kuei-Chun Kao et.al. 2412.03513 null
2024-12-04 Distillation of Diffusion Features for Semantic Correspondence Frank Fundel et.al. 2412.03512 null
2024-12-03 Efficient Model Compression Techniques with FishLeg Jamie McGowan et.al. 2412.02328 null
2024-12-02 Mutli-View 3D Reconstruction using Knowledge Distillation Aditya Dutt et.al. 2412.02039 link
2024-12-02 Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model Qianhan Feng et.al. 2412.01282 link
2024-12-02 Reducing Inference Energy Consumption Using Dual Complementary CNNs Michail Kinnas et.al. 2412.01039 link
2024-12-01 QABISAR: Query-Article Bipartite Interactions for Statutory Article Retrieval T. Y. S. S. Santosh et.al. 2412.00934 null
2024-12-01 Local vs. Global: Local Land-Use and Land-Cover Models Deliver Higher Quality Maps Girmaw Abebe Tadesse et.al. 2412.00777 null
2024-11-30 Continuous Concepts Removal in Text-to-image Diffusion Models Tingxu Han et.al. 2412.00580 null
2024-11-30 Pruned Convolutional Attention Network Based Wideband Spectrum Sensing with Sub-Nyquist Sampling Peihao Dong et.al. 2412.00562 link
2024-11-30 Toward Fair Graph Neural Networks Via Dual-Teacher Knowledge Distillation Chengyu Li et.al. 2412.00382 null
2024-11-29 Reverse Thinking Makes LLMs Stronger Reasoners Justin Chih-Yao Chen et.al. 2411.19865 null
2024-11-28 Pre-Training Graph Contrastive Masked Autoencoders are Strong Distillers for EEG Xinxu Wei et.al. 2411.19230 null
2024-12-03 Puzzle: Distillation-Based NAS for Inference-Optimized LLMs Akhiad Bercovich et.al. 2411.19146 null
2024-11-28 Headache to Overstock? Promoting Long-tail Items through Debiased Product Bundling Shuo Xu et.al. 2411.19107 null
2024-11-28 Zero-shot Slot Filling in the Age of LLMs for Dialogue Systems Mansi Rana et.al. 2411.18980 null
2024-11-27 Active Data Curation Effectively Distills Large-Scale Multimodal Models Vishaal Udandarao et.al. 2411.18674 null
2024-11-27 Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models Yiming Wu et.al. 2411.18375 null
2024-11-27 Vision Mamba Distillation for Low-resolution Fine-grained Image Classification Yao Chen et.al. 2411.17980 link
2024-11-27 Improved implicit diffusion model with knowledge distillation to estimate the spatial distribution density of carbon stock in remote sensing imagery Zhenyu Yu et.al. 2411.17973 null
2024-11-26 Attamba: Attending To Multi-Token States Yash Akhauri et.al. 2411.17685 link
2024-11-26 Large-Scale Data-Free Knowledge Distillation for ImageNet via Multi-Resolution Data Generation Minh-Tuan Tran et.al. 2411.17046 null
2024-11-26 Words Matter: Leveraging Individual Text Embeddings for Code Generation in CLIP Test-Time Adaptation Shambhavi Mishra et.al. 2411.17002 link
2024-11-25 Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models Yao Fu et.al. 2411.16991 null
2024-11-25 Leveraging Foundation Models To learn the shape of semi-fluid deformable objects Omar El Assal et.al. 2411.16802 null
2024-11-25 O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? Zhen Huang et.al. 2411.16489 link
2024-11-25 When Babies Teach Babies: Can student knowledge sharing outperform Teacher-Guided Distillation on small datasets? Srikrishna Iyer et.al. 2411.16487 link
2024-11-25 Learn from Foundation Model: Fruit Detection Model without Manual Annotation Yanan Wang et.al. 2411.16196 link
2024-11-25 Beyond Task Vectors: Selective Task Arithmetic Based on Importance Metrics Tian Bowen et.al. 2411.16139 null
2024-11-25 Ensemble Learning via Knowledge Transfer for CTR Prediction Honghao Li et.al. 2411.16122 link
2024-11-23 Botfip-LLM: An Enhanced Multimodal Scientific Computing Framework Leveraging Knowledge Distillation from Large Language Models Tianhao Chen et.al. 2411.15525 null
2024-11-23 Efficient Ternary Weight Embedding Model: Bridging Scalability and Performance Jiayi Chen et.al. 2411.15438 link
2024-11-23 Partial Knowledge Distillation for Alleviating the Inherent Inter-Class Discrepancy in Federated Learning Xiaoyu Gan et.al. 2411.15403 null
2024-11-22 Efficient Pruning of Text-to-Image Models: Insights from Pruning Stable Diffusion Samarth N Ramesh et.al. 2411.15113 null
2024-11-22 RankByGene: Gene-Guided Histopathology Representation Learning Through Cross-Modal Ranking Consistency Wentao Huang et.al. 2411.15076 null
2024-11-22 Adaptive Group Robust Ensemble Knowledge Distillation Patrik Kenfack et.al. 2411.14984 null
2024-11-25 Information Extraction from Heterogeneous Documents without Ground Truth Labels using Synthetic Label Generation and Knowledge Distillation Aniket Bhattacharyya et.al. 2411.14957 null
2024-11-22 Simplifying CLIP: Unleashing the Power of Large-Scale Models on Consumer-level Computers Hongbo Liu et.al. 2411.14789 null
2024-11-22 Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation Xunyu Zhu et.al. 2411.14698 null
2024-11-21 TaQ-DiT: Time-aware Quantization for Diffusion Transformers Xinyan Liu et.al. 2411.14172 null
2024-11-21 DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization Hexuan Deng et.al. 2411.14055 link
2024-11-21 Teaching MLPs to Master Heterogeneous Graph-Structured Knowledge for Efficient and Accurate Inference Yunhui Liu et.al. 2411.14035 link
2024-11-21 CLFace: A Scalable and Resource-Efficient Continual Learning Framework for Lifelong Face Recognition Md Mahedi Hasan et.al. 2411.13886 null
2024-11-20 RTSR: A Real-Time Super-Resolution Model for AV1 Compressed Content Yuxuan Jiang et.al. 2411.13362 null
2024-11-20 FASTNav: Fine-tuned Adaptive Small-language-models Trained for Multi-point Robot Navigation Yuxuan Chen et.al. 2411.13262 null
2024-11-20 Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning Gang Zhao et.al. 2411.13045 null
2024-11-19 Puppet-CNN: Input-Adaptive Convolutional Neural Networks with Model Compression using Ordinary Differential Equation Yucheng Xing et.al. 2411.12876 null
2024-11-19 Reward Modeling with Ordinal Feedback: Wisdom of the Crowd Shang Liu et.al. 2411.12843 null
2024-11-19 What Makes a Good Dataset for Knowledge Distillation? Logan Frank et.al. 2411.12817 null
2024-11-19 FGP: Feature-Gradient-Prune for Efficient Convolutional Layer Pruning Qingsong Lv et.al. 2411.12781 link
2024-11-19 KDC-MAE: Knowledge Distilled Contrastive Mask Auto-Encoder Maheswar Bora et.al. 2411.12270 null
2024-11-19 Just KIDDIN: Knowledge Infusion and Distillation for Detection of INdecent Memes Rahul Garg et.al. 2411.12174 null
2024-11-18 Federated Incremental Named Entity Recognition Duzhen Zhang et.al. 2411.11623 null
2024-11-18 Bridging the Resource Gap: Deploying Advanced Imitation Learning Models onto Affordable Embedded Platforms Haizhou Ge et.al. 2411.11406 null
2024-11-17 Map-Free Trajectory Prediction with Map Distillation and Hierarchical Encoding Xiaodong Liu et.al. 2411.10961 null
2024-11-16 Hybrid Attention Model Using Feature Decomposition and Knowledge Distillation for Glucose Forecasting Ebrahim Farahmand et.al. 2411.10703 null
2024-11-16 Multi-perspective Contrastive Logit Distillation Qi Wang et.al. 2411.10693 null
2024-11-16 Exploring Feature-based Knowledge Distillation For Recommender System: A Frequency Perspective Zhangchi Zhu et.al. 2411.10676 null
2024-11-15 Scaling Law for Post-training after Model Pruning Xiaodong Chen et.al. 2411.10272 null
2024-11-15 Evidential Federated Learning for Skin Lesion Image Classification Rutger Hendrix et.al. 2411.10071 null
2024-11-14 VPBSD:Vessel-Pattern-Based Semi-Supervised Distillation for Efficient 3D Microscopic Cerebrovascular Segmentation Xi Lin et.al. 2411.09567 null
2024-11-14 Re-Parameterization of Lightweight Transformer for On-Device Speech Emotion Recognition Zixing Zhang et.al. 2411.09339 null
2024-11-14 Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching Yuran Wang et.al. 2411.09151 null
2024-11-14 Toward Democratized Generative AI in Next-Generation Mobile Edge Networks Ruichen Zhang et.al. 2411.09148 null
2024-11-13 Dual-Head Knowledge Distillation: Enhancing Logits Utilization with an Auxiliary Head Penghui Yang et.al. 2411.08937 null
2024-11-13 UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation Chengyuan Zhang et.al. 2411.08569 null
2024-11-13 Federated Graph Learning with Graphless Clients Xingbo Fu et.al. 2411.08374 null
2024-11-12 Joint Diffusion models in Continual Learning Paweł Skierś et.al. 2411.08224 null
2024-11-12 Learning with Less: Knowledge Distillation from Large Language Models via Unlabeled Data Juanhui Li et.al. 2411.08028 null
2024-11-13 Query Optimization for Parametric Knowledge Refinement in Retrieval-Augmented Large Language Models Youan Cong et.al. 2411.07820 null
2024-11-12 ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization Weibo Zhao et.al. 2411.07762 null
2024-11-12 Optimizing Traffic Signal Control using High-Dimensional State Representation and Efficient Deep Reinforcement Learning Lawrence Francis et.al. 2411.07759 null
2024-11-12 ALANINE: A Novel Decentralized Personalized Federated Learning For Heterogeneous LEO Satellite Constellation Liang Zhao et.al. 2411.07752 null
2024-11-12 OWLed: Outlier-weighed Layerwise Pruning for Efficient Autonomous Driving Framework Jiaxi Li et.al. 2411.07711 link
2024-11-13 Feature Interaction Fusion Self-Distillation Network For CTR Prediction Lei Sang et.al. 2411.07508 null
2024-11-12 Quantifying Knowledge Distillation Using Partial Information Decomposition Pasan Dissanayake et.al. 2411.07483 null
2024-11-11 SAMPart3D: Segment Any Part in 3D Objects Yunhan Yang et.al. 2411.07184 link
2024-11-11 LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models Runming Yang et.al. 2411.06839 null
2024-11-11 ScaleKD: Strong Vision Transformers Could Be Excellent Teachers Jiawei Fan et.al. 2411.06786 link
2024-11-11 An Efficient Memory Module for Graph Few-Shot Class-Incremental Learning Dong Li et.al. 2411.06659 link
2024-11-10 CULL-MT: Compression Using Language and Layer pruning for Machine Translation Pedram Rostami et.al. 2411.06506 null
2024-11-10 Over-parameterized Student Model via Tensor Decomposition Boosted Knowledge Distillation Yu-Liang Zhan et.al. 2411.06448 link
2024-11-09 Dynamic Textual Prompt For Rehearsal-free Lifelong Person Re-identification Hongyu Chen et.al. 2411.06023 null
2024-11-09 Multi-hop RIS-aided Learning Model Sharing for Urban Air Mobility Kai Xiong et.al. 2411.06015 null
2024-11-08 Mitigating Hallucination with ZeroG: An Advanced Knowledge Management Engine Anantha Sharma et.al. 2411.05936 null
2024-11-08 Asterisk: Keep it Simple* Andrew Semenov et.al. 2411.05691 null
2024-11-08 Knowledge Distillation Neural Network for Predicting Car-following Behaviour of Human-driven and Autonomous Vehicles Ayobami Adewale et.al. 2411.05618 null
2024-11-08 Towards Lifelong Few-Shot Customization of Text-to-Image Diffusion Nan Song et.al. 2411.05544 null
2024-11-07 ZipNN: Lossless Compression for AI Models Moshik Hershcovitch et.al. 2411.05239 link
2024-11-07 Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale Flavio Di Palo et.al. 2411.05045 null
2024-11-06 From Word Vectors to Multimodal Embeddings: Techniques, Applications, and Future Directions For Large Language Models Charles Zhang et.al. 2411.05036 null
2024-11-07 Towards Competitive Search Relevance For Inference-Free Learned Sparse Retrievers Zhichao Geng et.al. 2411.04403 null
2024-11-07 GazeGen: Gaze-Driven User Interaction for Visual Content Generation He-Yen Hsieh et.al. 2411.04335 null
2024-11-06 Towards Personalized Federated Learning via Comprehensive Knowledge Distillation Pengju Wang et.al. 2411.03569 null
2024-11-05 Change Is the Only Constant: Dynamic LLM Slicing based on Layer Redundancy Razvan-Gabriel Dumitru et.al. 2411.03513 link
2024-11-05 Transformer-Based Fault-Tolerant Control for Fixed-Wing UAVs Using Knowledge Distillation and In-Context Adaptation Francisco Giral et.al. 2411.02975 null
2024-11-05 Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery Bowei Du et.al. 2411.02861 null
2024-11-05 Brewing Vodka: Distilling Pure Knowledge for Lightweight Threat Detection in Audit Logs Weiheng Wu et.al. 2411.02775 null
2024-11-05 Multimodal Commonsense Knowledge Distillation for Visual Question Answering Shuo Yang et.al. 2411.02722 null
2024-11-04 Information plane and compression-gnostic feedback in quantum machine learning Nathan Haboury et.al. 2411.02313 null
2024-11-04 Training on the Test Model: Contamination in Ranking Distillation Vishakha Suresh Kalal et.al. 2411.02284 link
2024-11-03 Decoupling Dark Knowledge via Block-wise Logit Distillation for Feature-level Alignment Chengting Yu et.al. 2411.01547 null
2024-11-01 On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance Jaskirat Singh et.al. 2411.00907 null
2024-11-01 Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation Bohan Lyu et.al. 2411.00412 null
2024-11-01 Towards Building Secure UAV Navigation with FHE-aware Knowledge Distillation Arjun Ramesh Kaushik et.al. 2411.00403 null
2024-11-01 Efficient Model Compression for Bayesian Neural Networks Diptarka Saha et.al. 2411.00273 null
2024-10-31 Semantic Knowledge Distillation for Onboard Satellite Earth Observation Image Classification Thanh-Dung Le et.al. 2411.00209 link
2024-10-31 Mutual Information Preserving Neural Network Pruning Charles Westphal et.al. 2411.00147 null
2024-10-30 Larger models yield better results? Streamlined severity classification of ADHD-related concerns using BERT-based knowledge distillation Ahmed Akib Jawad Karim et.al. 2411.00052 null
2024-10-30 IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking Run Luo et.al. 2410.23907 null
2024-10-29 ML Research Benchmark Matthew Kenney et.al. 2410.22553 link
2024-11-01 Leveraging Recurrent Neural Networks for Predicting Motor Movements from Primate Motor Cortex Neural Recordings Yuanxi Wang et.al. 2410.22283 null
2024-10-28 Unveiling Context-Aware Criteria in Self-Assessing LLMs Taneesh Gupta et.al. 2410.21545 null
2024-10-28 Knowledge Distillation for Real-Time Classification of Early Media in Voice Communications Kemal Altwlkany et.al. 2410.21478 null
2024-10-31 LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment Ge Yang et.al. 2410.21352 link
2024-10-28 EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation Shih-Yang Liu et.al. 2410.21271 null
2024-10-28 Deep Learning for Medical Text Processing: BERT Model Fine-Tuning and Comparative Study Jiacheng Hu et.al. 2410.20792 null
2024-10-28 KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge Distillation Rambod Azimi et.al. 2410.20777 link
2024-10-28 Data-Efficient Low-Complexity Acoustic Scene Classification via Distilling and Progressive Pruning Bing Han et.al. 2410.20775 null
2024-10-28 Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA Sangmin Bae et.al. 2410.20672 null
2024-10-27 Uncovering Capabilities of Model Pruning in Graph Contrastive Learning Wu Junran et.al. 2410.20356 null
2024-10-25 A Survey of Small Language Models Chien Van Nguyen et.al. 2410.20011 null
2024-10-25 GeoLLaVA: Efficient Fine-Tuned Vision-Language Models for Temporal Change Detection in Remote Sensing Hosam Elgendy et.al. 2410.19552 link
2024-10-25 SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models Jahyun Koo et.al. 2410.19503 null
2024-10-24 Tailored-LLaMA: Optimizing Few-Shot Learning in Pruned LLaMA Models with Task-Specific Prompts Danyal Aftab et.al. 2410.19185 null
2024-10-24 AlignCap: Aligning Speech Emotion Captioning to Human Preferences Ziqi Liang et.al. 2410.19134 null
2024-10-24 High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws M. Emrullah Ildiz et.al. 2410.18837 null
2024-10-24 Knowledge Distillation Using Frontier Open-source LLMs: Generalizability and the Role of Synthetic Data Anup Shirgaonkar et.al. 2410.18588 null
2024-10-24 SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoning Shivam Adarsh et.al. 2410.18574 link
2024-10-23 ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams Srija Anand et.al. 2410.17901 null
2024-10-23 Beware of Calibration Data for Pruning Large Language Models Yixin Ji et.al. 2410.17711 null
2024-10-23 Towards Active Participant-Centric Vertical Federated Learning: Some Representations May Be All You Need Jon Irureta et.al. 2410.17648 null
2024-10-23 Towards Effective Data-Free Knowledge Distillation via Diverse Diffusion Augmentation Muquan Li et.al. 2410.17606 link
2024-10-23 Multimodal Information Bottleneck for Deep Reinforcement Learning with Multiple Sensors Bang You et.al. 2410.17551 null
2024-10-23 Physics-driven AI for Channel Estimation in Cellular Network Xiaoqian Qi et.al. 2410.17525 null
2024-10-22 MiniPLM: Knowledge Distillation for Pre-Training Language Models Yuxian Gu et.al. 2410.17215 link
2024-10-22 Self-calibration for Language Model Quantization and Pruning Miles Williams et.al. 2410.17170 null
2024-10-22 DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization Haowei Zhu et.al. 2410.16942 null
2024-10-22 Mitigating Vanishing Activations in Deep CapsNets Using Channel Pruning Siddharth Sahu et.al. 2410.16908 link
2024-10-22 CK4Gen: A Knowledge Distillation Framework for Generating High-Utility Synthetic Survival Datasets in Healthcare Nicholas I-Hsien Kuo et.al. 2410.16872 null
2024-10-22 AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot Nuclei Detection via Visual-Language Pre-trained Models Yongjian Wu et.al. 2410.16820 link
2024-10-22 SafetyAnalyst: Interpretable, transparent, and steerable LLM safety moderation Jing-Jing Li et.al. 2410.16665 null
2024-10-21 Pre-training Distillation for Large Language Models: A Design Space Exploration Hao Peng et.al. 2410.16215 null
2024-10-18 Interpreting Microbiome Relative Abundance Data Using Symbolic Regression Swagatam Haldar et.al. 2410.16109 link
2024-10-21 Model Mimic Attack: Knowledge Distillation for Provably Transferable Adversarial Examples Kirill Lukyanov et.al. 2410.15889 null
2024-10-20 GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning Haiwen Diao et.al. 2410.15266 link
2024-10-19 LLaVA-Ultra: Large Chinese Language and Vision Assistant for Ultrasound Xuechen Guo et.al. 2410.15074 null
2024-10-19 Improving Pronunciation and Accent Conversion through Knowledge Distillation And Synthetic Ground-Truth from Native TTS Tuan Nam Nguyen et.al. 2410.14997 null
2024-10-18 EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search Oliver Sieberling et.al. 2410.14649 link
2024-10-18 Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation Shuai Zhao et.al. 2410.14425 link
2024-10-18 Preview-based Category Contrastive Learning for Knowledge Distillation Muhe Ding et.al. 2410.14143 null
2024-10-17 Leveraging Fine-Tuned Language Models for Efficient and Accurate Smart Contract Auditing Zhiyuan Wei et.al. 2410.13918 link
2024-10-17 An Active Learning Framework for Inclusive Generation by Large Language Models Sabit Hassan et.al. 2410.13641 null
2024-10-18 Towards Satellite Non-IID Imagery: A Spectral Clustering-Assisted Federated Learning Approach Luyao Zou et.al. 2410.13602 null
2024-10-18 Cyber Attacks Prevention Towards Prosumer-based EV Charging Stations: An Edge-assisted Federated Prototype Knowledge Distillation Approach Luyao Zou et.al. 2410.13260 null
2024-10-16 TAS: Distilling Arbitrary Teacher and Student via a Hybrid Assistant Guopeng Li et.al. 2410.12342 null
2024-10-16 Optimizing YOLOv5s Object Detection through Knowledge Distillation algorithm Guanming Huang et.al. 2410.12259 null
2024-10-16 TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration Yiwei Guo et.al. 2410.12183 link
2024-10-17 SAM-Guided Masked Token Prediction for 3D Scene Understanding Zhimin Chen et.al. 2410.12158 null
2024-10-15 MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router Yanyue Xie et.al. 2410.12013 null
2024-10-15 Breaking Modality Gap in RGBT Tracking: Coupled Knowledge Distillation Andong Lu et.al. 2410.11586 link
2024-10-15 Learning from Imperfect Data: Towards Efficient Knowledge Distillation of Autoregressive Language Models for Text-to-SQL Qihuang Zhong et.al. 2410.11371 null
2024-10-15 Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling Wenda Xu et.al. 2410.11325 null
2024-10-14 ROSAR: An Adversarial Re-Training Framework for Robust Side-Scan Sonar Object Detection Martin Aubard et.al. 2410.10554 link
2024-10-14 QIANets: Quantum-Integrated Adaptive Networks for Reduced Latency and Improved Inference Times in CNN Models Zhumazhan Balapanov et.al. 2410.10318 link
2024-10-14 Temperature-Centric Investigation of Speculative Decoding with Knowledge Distillation Siru Ouyang et.al. 2410.10141 null
2024-10-15 Edge Unlearning is Not "on Edge"! An Adaptive Exact Unlearning System on Resource-Constrained Devices Xiaoyu Xia et.al. 2410.10128 link
2024-10-14 REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation Zhiyun Song et.al. 2410.10097 null
2024-10-12 SLiM: One-shot Quantized Sparse Plus Low-rank Approximation of LLMs Mohammad Mozaffari et.al. 2410.09615 link
2024-10-12 Distilling Invariant Representations with Dual Augmentation Nikolaos Giakoumoglou et.al. 2410.09474 null
2024-10-12 Declarative Knowledge Distillation from Large Language Models for Visual Question Answering Datasets Thomas Eiter et.al. 2410.09428 link
2024-10-15 Transforming In-Vehicle Network Intrusion Detection: VAE-based Knowledge Distillation Meets Explainable AI Muhammet Anil Yagiz et.al. 2410.09043 null
2024-10-11 Mentor-KD: Making Small Language Models Better Multi-step Reasoners Hojae Lee et.al. 2410.09037 link
2024-10-11 Contrastive Knowledge Distillation for Robust Multimodal Sentiment Analysis Zhongyi Sang et.al. 2410.08692 null
2024-10-11 GAI-Enabled Explainable Personalized Federated Semi-Supervised Learning Yubo Peng et.al. 2410.08634 null
2024-10-11 Simultaneous Reward Distillation and Preference Learning: Get You a Language Model Who Can Do Both Abhijnan Nath et.al. 2410.08458 null
2024-10-10 What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias Aida Mohammadshahi et.al. 2410.08407 null
2024-10-10 Non-transferable Pruning Ruyi Ding et.al. 2410.08015 null
2024-10-10 A Lightweight Target-Driven Network of Stereo Matching for Inland Waterways Jing Su et.al. 2410.07915 null
2024-10-10 SNN-PAR: Energy Efficient Pedestrian Attribute Recognition via Spiking Neural Networks Haiyang Wang et.al. 2410.07857 link
2024-10-12 Relational Diffusion Distillation for Efficient Image Generation Weilun Feng et.al. 2410.07679 link
2024-10-10 CrossQuant: A Post-Training Quantization Method with Smaller Quantization Kernel for Precise Large Language Model Compression Wenyuan Liu et.al. 2410.07505 null
2024-10-09 Unlocking Real-Time Fluorescence Lifetime Imaging: Multi-Pixel Parallelism for FPGA-Accelerated Processing Ismail Erbas et.al. 2410.07364 null
2024-10-09 S2HPruner: Soft-to-Hard Distillation Bridges the Discretization Gap in Pruning Weihao Lin et.al. 2410.07046 null
2024-10-09 Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation Runze Chen et.al. 2410.06982 null
2024-10-09 Efficient and Robust Knowledge Distillation from A Stronger Teacher Based on Correlation Matching Wenqi Niu et.al. 2410.06561 null
2024-10-08 SpaLLM: Unified Compressive Adaptation of Large Language Models with Sketching Tianyi Zhang et.al. 2410.06364 null
2024-10-08 QT-DoG: Quantization-aware Training for Domain Generalization Saqib Javed et.al. 2410.06020 link
2024-10-10 KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server Wenhao Wang et.al. 2410.05725 link
2024-10-07 Progressive distillation induces an implicit curriculum Abhishek Panigrahi et.al. 2410.05464 null
2024-10-07 ESPACE: Dimensionality Reduction of Activations for Model Compression Charbel Sakr et.al. 2410.05437 null
2024-10-07 ReasoningRank: Teaching Student Models to Rank through Reasoning-Based Knowledge Distillation Yuelyu Ji et.al. 2410.05168 null
2024-10-06 CAPEEN: Image Captioning with Early Exits and Knowledge Distillation Divya Jyoti Bajpai et.al. 2410.04433 link
2024-10-06 DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs Divya Jyoti Bajpai et.al. 2410.04424 link
2024-10-05 Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution Jianze Li et.al. 2410.04224 link
2024-10-05 Accelerating Diffusion Models with One-to-Many Knowledge Distillation Linfeng Zhang et.al. 2410.04191 null
2024-10-05 DiDOTS: Knowledge Distillation from Large-Language-Models for Dementia Obfuscation in Transcribed Speech Dominika Woszczyk et.al. 2410.04188 null
2024-10-05 Gap Preserving Distillation by Building Bidirectional Mappings with A Dynamic Teacher Yong Guo et.al. 2410.04140 null
2024-10-04 Enhance Reasoning by Learning from Mistakes: Peer-Review Knowledge Distillation from Multiple Large Language Models Zhuochun Li et.al. 2410.03663 null
2024-10-04 DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models Sungnyun Kim et.al. 2410.03061 null
2024-10-03 Geometry is All You Need: A Unified Taxonomy of Matrix and Tensor Factorization for Compression of Generative Language Models Mingxue Xu et.al. 2410.03040 null
2024-10-03 Dataset Distillation via Knowledge Distillation: Towards Efficient Self-Supervised Pre-Training of Deep Networks Siddharth Joshi et.al. 2410.02116 null
2024-10-02 Review Non-convex Optimization Method for Machine Learning Greg B Fotopoulos et.al. 2410.02017 null
2024-10-02 PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation Mike Ranzinger et.al. 2410.01680 null
2024-10-04 HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models Seanie Lee et.al. 2410.01524 link
2024-10-02 Foldable SuperNets: Scalable Merging of Transformers with Different Initializations and Tasks Edan Kinderman et.al. 2410.01483 link
2024-10-02 PairDistill: Pairwise Relevance Distillation for Dense Retrieval Chao-Wei Huang et.al. 2410.01383 link
2024-10-02 "No Matter What You Do!": Mitigating Backdoor Attacks in Graph Neural Networks Jiale Zhang et.al. 2410.01272 link
2024-10-01 Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging Ismail Erbas et.al. 2410.00948 null
2024-10-01 Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading Mostafa Hajighasemloua et.al. 2410.00779 null
2024-10-01 Efficient Technical Term Translation: A Knowledge Distillation Approach for Parenthetical Terminology Translation Jiyoon Myung et.al. 2410.00683 null
2024-10-01 AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation Ziyang Luo et.al. 2410.00558 link
2024-10-01 Self-Updatable Large Language Models with Parameter Integration Yu Wang et.al. 2410.00487 null
2024-09-30 Enhancing Romanian Offensive Language Detection through Knowledge Distillation, Multi-Task Learning, and Data Augmentation Vlad-Cristian Matei et.al. 2409.20498 null
2024-10-02 Linear Projections of Teacher Embeddings for Few-Class Distillation Noel Loo et.al. 2409.20449 null
2024-09-30 Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies Shalini Sarode et.al. 2409.20237 null
2024-09-30 Aggressive Post-Training Compression on Extremely Large Language Models Zining Zhang et.al. 2409.20094 null
2024-10-01 HYDRA-FL: Hybrid Knowledge Distillation for Robust and Accurate Federated Learning Momin Ahmad Khan et.al. 2409.19912 null
2024-09-29 Tailored Federated Learning: Leveraging Direction Regulation & Knowledge Distillation Huidong Tang et.al. 2409.19741 null
2024-09-29 InfantCryNet: A Data-driven Framework for Intelligent Analysis of Infant Cries Mengze Hong et.al. 2409.19689 null
2024-09-28 Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse Training Pihe Hu et.al. 2409.19391 null
2024-09-28 Mind the Gap: Promoting Missing Modality Brain Tumor Segmentation with Alignment Tianyi Liu et.al. 2409.19366 null
2024-09-27 Semi-Supervised Bone Marrow Lesion Detection from Knee MRI Segmentation Using Mask Inpainting Models Shihua Qin et.al. 2409.19185 null
2024-09-27 MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge Distillation Junyou Zhu et.al. 2409.18800 null
2024-09-27 Student-Oriented Teacher Knowledge Refinement for Knowledge Distillation Chaomin Shen et.al. 2409.18785 null
2024-09-27 Harmonizing knowledge Transfer in Neural Network with Unified Distillation Yaomin Huang et.al. 2409.18565 null
2024-09-27 Towards Diverse Device Heterogeneous Federated Learning via Task Arithmetic Knowledge Integration Mahdi Morafah et.al. 2409.18461 link
2024-09-26 EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation Jiaxiang Tang et.al. 2409.18114 null
2024-09-26 Weak-To-Strong Backdoor Attacks for LLMs with Contrastive Knowledge Distillation Shuai Zhao et.al. 2409.17946 null
2024-09-26 Kendall's $τ$ Coefficient for Logits Distillation Yuchen Guan et.al. 2409.17823 null
2024-09-26 General Compression Framework for Efficient Transformer Object Tracking Lingyi Hong et.al. 2409.17564 null
2024-09-26 Shape-intensity knowledge distillation for robust medical image segmentation Wenhui Dong et.al. 2409.17503 link
2024-09-25 Search for Efficient Large Language Models Xuan Shen et.al. 2409.17372 link
2024-09-25 MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events Xiaoyu Yang et.al. 2409.17010 null
2024-09-25 Adverse Weather Optical Flow: Cumulative Homogeneous-Heterogeneous Adaptation Hanyu Zhou et.al. 2409.17001 null
2024-09-25 SelectiveKD: A semi-supervised framework for cancer detection in DBT through Knowledge Distillation and Pseudo-labeling Laurent Dillard et.al. 2409.16581 null
2024-09-24 AIM 2024 Challenge on UHD Blind Photo Quality Assessment Vlad Hosu et.al. 2409.16271 null
2024-09-25 Privacy Evaluation Benchmarks for NLP Models Wei Huang et.al. 2409.15868 link
2024-09-24 Twin Network Augmentation: A Novel Training Strategy for Improved Spiking Neural Networks and Efficient Weight Quantization Lucas Deckers et.al. 2409.15849 null
2024-09-23 TS-TCD: Triplet-Level Cross-Modal Distillation for Time-Series Forecasting Using Large Language Models Pengfei Wang et.al. 2409.14978 null
2024-09-23 DSG-KD: Knowledge Distillation from Domain-Specific to General Language Models Sangyeon Cho et.al. 2409.14904 link
2024-09-23 Pre-trained Language Model and Knowledge Distillation for Lightweight Sequential Recommendation Li Li et.al. 2409.14810 null
2024-09-23 An Adverse Weather-Immune Scheme with Unfolded Regularization and Foundation Model Knowledge Distillation for Street Scene Understanding Wei-Bin Kou et.al. 2409.14737 null
2024-09-18 Applications of Knowledge Distillation in Remote Sensing: A Survey Yassine Himeur et.al. 2409.12111 null
2024-09-18 Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction Jin Jie Sean Yeo et.al. 2409.11964 null
2024-09-18 Distillation-free Scaling of Large SSMs for Images and Videos Hamid Suleman et.al. 2409.11867 null
2024-09-18 EFCM: Efficient Fine-tuning on Compressed Models for deployment of large models in medical image analysis Shaojie Li et.al. 2409.11817 null
2024-09-18 RUIE: Retrieval-based Unified Information Extraction using Large Language Model Xincheng Liao et.al. 2409.11673 null
2024-09-17 Time-Series Forecasting, Knowledge Distillation, and Refinement within a Multimodal PDE Foundation Model Derek Jollie et.al. 2409.11609 link
2024-09-17 Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation Rui Yu et.al. 2409.11018 null
2024-09-17 Single-stage TTS with Masked Audio Token Modeling and Semantic Knowledge Distillation Gerard I. Gállego et.al. 2409.11003 null
2024-09-16 Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning Amin Karimi Monsefi et.al. 2409.10362 null
2024-09-16 Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference Huy-Dung Nguyen et.al. 2409.10095 null
2024-09-15 ELSA: Exploiting Layer-wise N:M Sparsity for Vision Transformer Acceleration Ning-Chi Huang et.al. 2409.09708 null
2024-09-14 Effective Pre-Training of Audio Transformers for Sound Event Detection Florian Schmid et.al. 2409.09546 link
2024-09-14 Integrated Multi-Level Knowledge Distillation for Enhanced Speaker Verification Wenhao Yang et.al. 2409.09389 null
2024-09-14 Joint Semantic Knowledge Distillation and Masked Acoustic Modeling for Full-band Speech Restoration with Improved Intelligibility Xiaoyu Liu et.al. 2409.09357 null
2024-09-13 Exploring System-Heterogeneous Federated Learning with Dynamic Model Selection Dixi Yao et.al. 2409.08858 null
2024-09-13 An Efficient Privacy-aware Split Learning Framework for Satellite Communications Jianfei Sun et.al. 2409.08538 null
2024-09-13 AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation Zechao Sun et.al. 2409.08516 null
2024-09-12 DiReDi: Distillation and Reverse Distillation for AIoT Applications Chen Sun et.al. 2409.08308 null
2024-09-12 Ruri: Japanese General Text Embeddings Hayato Tsukagoshi et.al. 2409.07737 link
2024-09-12 Learn from Balance: Rectifying Knowledge Transfer for Long-Tailed Scenarios Xinlei Huang et.al. 2409.07694 null
2024-09-11 DS-ViT: Dual-Stream Vision Transformer for Cross-Task Distillation in Alzheimer's Early Diagnosis Ke Chen et.al. 2409.07584 null
2024-09-11 EchoDFKD: Data-Free Knowledge Distillation for Cardiac Ultrasound Segmentation using Synthetic Data Grégoire Petit et.al. 2409.07566 null
2024-09-11 NVRC: Neural Video Representation Compression Ho Man Kwan et.al. 2409.07414 null
2024-09-11 Enhancing CTC-Based Visual Speech Recognition Hendrik Laux et.al. 2409.07210 null
2024-09-11 A Continual and Incremental Learning Approach for TinyML On-device Training Using Dataset Distillation and Model Size Adaption Marcus Rüb et.al. 2409.07114 null
2024-09-11 Privacy-Preserving Federated Learning with Consistency via Knowledge Distillation Using Conditional Generator Kangyang Luo et.al. 2409.06955 null
2024-09-10 Applied Federated Model Personalisation in the Industrial Domain: A Comparative Study Ilias Siniosoglou et.al. 2409.06904 null
2024-09-10 EasyST: A Simple Framework for Spatio-Temporal Prediction Jiabin Tang et.al. 2409.06748 link
2024-09-10 SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation Teng Hu et.al. 2409.06633 null
2024-09-10 Knowledge Distillation via Query Selection for Detection Transformer Yi Liu et.al. 2409.06443 null
2024-09-10 Distilling Generative-Discriminative Representations for Very Low-Resolution Face Recognition Junzheng Zhang et.al. 2409.06371 null
2024-09-10 Enhancing Long Video Understanding via Hierarchical Event-Based Memory Dingxin Cheng et.al. 2409.06299 null
2024-09-09 Joint Input and Output Coordination for Class-Incremental Learning Shuai Wang et.al. 2409.05620 null
2024-09-09 LEROjD: Lidar Extended Radar-Only Object Detection Patrick Palmer et.al. 2409.05564 link
2024-09-09 Federated Transfer Learning Based Cooperative Wideband Spectrum Sensing with Model Pruning Jibin Jia et.al. 2409.05462 null
2024-09-09 Look One and More: Distilling Hybrid Order Relational Knowledge for Cross-Resolution Image Recognition Shiming Ge et.al. 2409.05384 null
2024-09-09 Application Specific Compression of Deep Learning Models Rohit Raj Rai et.al. 2409.05368 link
2024-09-09 FedBrain-Distill: Communication-Efficient Federated Brain Tumor Classification Using Ensemble Knowledge Distillation on Non-IID Data Rasoul Jafari Gohari et.al. 2409.05359 link
2024-09-08 Ultron: Enabling Temporal Geometry Compression of 3D Mesh Sequences using Temporal Correspondence and Mesh Deformation Haichao Zhu et.al. 2409.05151 null
2024-09-07 LoCa: Logit Calibration for Knowledge Distillation Runming Yang et.al. 2409.04778 null
2024-09-06 SCARF: Scalable Continual Learning Framework for Memory-efficient Multiple Neural Radiance Fields Yuze Wang et.al. 2409.04482 null
2024-09-05 Experimentation in Content Moderation using RWKV Umut Yildirim et.al. 2409.03939 null
2024-09-05 DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture Qianlong Xiang et.al. 2409.03550 null
2024-09-05 Data-free Distillation with Degradation-prompt Diffusion for Multi-weather Image Restoration Pei Wang et.al. 2409.03455 null
2024-09-05 Efficient Image Compression Using Advanced State Space Models Bouzid Arezki et.al. 2409.02743 null
2024-09-04 CLDA: Collaborative Learning for Enhanced Unsupervised Domain Adaptation Minhee Cho et.al. 2409.02699 null
2024-09-04 Low-Resolution Object Recognition with Cross-Resolution Relational Contrastive Distillation Kangkai Zhang et.al. 2409.02555 null
2024-09-04 A design of magnetic tunnel junctions for the deployment of neuromorphic hardware for edge computing Davi Rodrigues et.al. 2409.02528 null
2024-09-04 Non-target Divergence Hypothesis: Toward Understanding Domain Gaps in Cross-Modal Knowledge Distillation Yilong Chen et.al. 2409.02438 null
2024-09-03 Low-Resolution Face Recognition via Adaptable Instance-Relation Distillation Ruixin Shi et.al. 2409.02049 null
2024-09-03 Foundations of Large Language Model Compression -- Part 1: Weight Quantization Sean I. Young et.al. 2409.02026 link
2024-09-03 Efficient Point Cloud Classification via Offline Distillation Framework and Negative-Weight Self-Distillation Technique Qiang Zheng et.al. 2409.02020 null
2024-09-03 Contemporary Model Compression on Large Language Models Inference Dong Liu et.al. 2409.01990 null
2024-09-03 Adaptive Explicit Knowledge Transfer for Knowledge Distillation Hyungkeun Park et.al. 2409.01679 null
2024-08-30 How Knowledge Distillation Mitigates the Synthetic Gap in Fair Face Recognition Pedro C. Neto et.al. 2408.17399 link
2024-08-30 HiTSR: A Hierarchical Transformer for Reference-based Super-Resolution Masoomeh Aslahishahri et.al. 2408.16959 link
2024-08-29 VLM-KD: Knowledge Distillation from VLM for Long-Tail Visual Recognition Zaiwei Zhang et.al. 2408.16930 null
2024-08-29 Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling Hritik Bansal et.al. 2408.16737 null
2024-08-29 MST-KD: Multiple Specialized Teachers Knowledge Distillation for Fair Face Recognition Eduarda Caldeira et.al. 2408.16563 link
2024-08-29 Convolutional Neural Network Compression Based on Low-Rank Decomposition Yaping He et.al. 2408.16289 null
2024-08-28 LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation Fangxun Shu et.al. 2408.15881 link
2024-08-28 ModalityMirror: Improving Audio Classification in Modality Heterogeneity Federated Learning with Multimodal Distillation Tiantian Feng et.al. 2408.15803 null
2024-08-28 Online pre-training with long-form videos Itsuki Kato et.al. 2408.15651 null
2024-08-28 Boosting Lossless Speculative Decoding via Feature Sampling and Partial Alignment Distillation Lujun Gui et.al. 2408.15562 null
2024-08-27 Leveraging Self-supervised Audio Representations for Data-Efficient Acoustic Scene Classification Yiqiang Cai et.al. 2408.14862 link
2024-08-27 Learning effective pruning at initialization from iterative pruning Shengkai Liu et.al. 2408.14757 link
2024-08-26 Bridging the Gap: Unpacking the Hidden Challenges in Knowledge Distillation for Online Ranking Systems Nikhil Khani et.al. 2408.14678 null
2024-08-25 Variational autoencoder-based neural network model compression Liang Cheng et.al. 2408.14513 null
2024-08-26 TSAK: Two-Stage Semantic-Aware Knowledge Distillation for Efficient Wearable Modality and Model Optimization in Manufacturing Lines Hymalai Bello et.al. 2408.14146 null
2024-08-27 GenFormer -- Generated Images are All You Need to Improve Robustness of Transformers on Small Datasets Sven Oehri et.al. 2408.14131 link
2024-08-26 Let Video Teaches You More: Video-to-Image Knowledge Distillation using DEtection TRansformer for Medical Video Lesion Detection Yuncheng Jiang et.al. 2408.14051 null
2024-08-25 Condensed Sample-Guided Model Inversion for Knowledge Distillation Kuluhan Binici et.al. 2408.13850 null
2024-08-25 Bring the Power of Diffusion Model to Defect Detection Xuyi Yu et.al. 2408.13845 null
2024-08-24 Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic Yifei He et.al. 2408.13656 link
2024-08-24 MPruner: Optimizing Neural Network Size with CKA-Based Mutual Information Pruning Seungbeom Hu et.al. 2408.13482 null
2024-08-23 Growing Deep Neural Network Considering with Similarity between Neurons Taigo Sakai et.al. 2408.13291 null
2024-08-23 Foundational Model for Electron Micrograph Analysis: Instruction-Tuning Small-Scale Language-and-Vision Assistant for Enterprise Adoption Sakhinana Sagar Srinivas et.al. 2408.13248 null
2024-08-23 A Web-Based Solution for Federated Learning with LLM-Based Automation Chamith Mawela et.al. 2408.13010 null
2024-08-23 A Survey on Drowsiness Detection -- Modern Applications and Methods Biying Fu et.al. 2408.12990 null
2024-08-22 Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers Sayed Mohammad Vakilzadeh Hatefi et.al. 2408.12568 link
2024-08-22 Interactive DualChecker for Mitigating Hallucinations in Distilling Large Language Models Meiyun Wang et.al. 2408.12326 link
2024-08-22 Rebalancing Multi-Label Class-Incremental Learning Kaile Du et.al. 2408.12161 null
2024-08-22 Vision-Based Detection of Uncooperative Targets and Components on Small Satellites Hannah Grauer et.al. 2408.12084 null
2024-08-22 Aligning (Medical) LLMs for (Counterfactual) Fairness Raphael Poulain et.al. 2408.12055 link
2024-08-22 LAKD-Activation Mapping Distillation Based on Local Learning Yaoze Zhang et.al. 2408.11478 null
2024-08-21 A Practical Trigger-Free Backdoor Attack on Neural Networks Jiahao Wang et.al. 2408.11444 null
2024-08-21 Pano2Room: Novel View Synthesis from a Single Indoor Panorama Guo Pu et.al. 2408.11413 link
2024-08-21 Domain-invariant Progressive Knowledge Distillation for UAV-based Object Detection Liang Yao et.al. 2408.11407 null
2024-08-21 A Unified Framework for Continual Learning and Machine Unlearning Romit Chatterjee et.al. 2408.11374 null
2024-08-20 SAM-COD: SAM-guided Unified Framework for Weakly-Supervised Camouflaged Object Detection Huafeng Chen et.al. 2408.10760 null
2024-08-20 Generating Synthetic Fair Syntax-agnostic Data by Learning and Distilling Fair Representation Md Fahim Sikder et.al. 2408.10755 null
2024-08-20 Fine-Tuning and Deploying Large Language Models Over Edges: Issues and Approaches Yanjie Dong et.al. 2408.10691 null
2024-08-20 LLM-Barber: Block-Aware Rebuilder for Sparsity Mask in One-Shot for Large Language Models Yupeng Su et.al. 2408.10631 link
2024-08-20 Adaptive Knowledge Distillation for Classification of Hand Images using Explainable Vision Transformers Thanh Thi Nguyen et.al. 2408.10503 null
2024-08-19 Transferring Backdoors between Large Language Models by Knowledge Distillation Pengzhou Cheng et.al. 2408.09878 link
2024-08-20 MoDeGPT: Modular Decomposition for Large Language Model Compression Chi-Heng Lin et.al. 2408.09632 null
2024-08-18 MedMAP: Promoting Incomplete Multi-modal Brain Tumor Segmentation with Alignment Tianyi Liu et.al. 2408.09465 null
2024-08-18 CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination Kaicheng Yang et.al. 2408.09441 null
2024-08-18 OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras Muhammad Rameez Ur Rahman et.al. 2408.09424 link
2024-08-17 RepControlNet: ControlNet Reparameterization Zhaoli Deng et.al. 2408.09240 null
2024-08-16 Multi Teacher Privileged Knowledge Distillation for Multimodal Expression Recognition Muhammad Haseeb Aslam et.al. 2408.09035 link
2024-08-16 Research on Personalized Compression Algorithm for Pre-trained Models Based on Homomorphic Entropy Increase Yicong Li et.al. 2408.08684 null
2024-08-16 ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models Chao Zeng et.al. 2408.08554 link
2024-08-15 Computer Vision Model Compression Techniques for Embedded Systems: A Survey Alexandre Lopes et.al. 2408.08250 link
2024-08-15 MIDAS: Multi-level Intent, Domain, And Slot Knowledge Distillation for Multi-turn NLU Yan Li et.al. 2408.08144 null
2024-08-19 Knowledge Distillation with Refined Logits Wujie Sun et.al. 2408.07703 link
2024-08-14 FedQUIT: On-Device Federated Unlearning via a Quasi-Competent Virtual Teacher Alessio Mora et.al. 2408.07587 null
2024-08-14 Towards Real-time Video Compressive Sensing on Mobile Devices Miao Cao et.al. 2408.07530 link
2024-08-14 One Step Diffusion-based Super-Resolution with Time-Aware Distillation Xiao He et.al. 2408.07476 link
2024-08-14 Infra-YOLO: Efficient Neural Network Structure with Model Compression for Real-Time Infrared Small Object Detection Zhonglin Chen et.al. 2408.07455 null
2024-08-13 Using Advanced LLMs to Enhance Smaller LLMs: An Interpretable Knowledge Distillation Approach Tong Wang et.al. 2408.07238 null
2024-08-15 An Event Structure-aware Generative Model for Biomedical Event Extraction Haohan Yuan et.al. 2408.06583 null
2024-08-12 Optimizing Vision Transformers with Data-Free Knowledge Transfer Gousia Habib et.al. 2408.05952 null
2024-08-11 Low-Dimensional Federated Knowledge Graph Embedding via Knowledge Distillation Xiaoxiong Zhang et.al. 2408.05748 null
2024-08-11 Efficient Federated Learning Using Dynamic Update and Adaptive Pruning with Momentum on Shared Server Data Ji Liu et.al. 2408.05678 null
2024-08-08 LaDiMo: Layer-wise Distillation Inspired MoEfier Sungyoon Kim et.al. 2408.04278 null
2024-08-08 Distil-DCCRN: A Small-footprint DCCRN Leveraging Feature-based Knowledge Distillation in Speech Enhancement Runduo Han et.al. 2408.04267 null
2024-08-14 ComKD-CLIP: Comprehensive Knowledge Distillation for Contrastive Language-Image Pre-traning Model Yifan Chen et.al. 2408.04145 null
2024-08-07 AdapMTL: Adaptive Pruning Framework for Multitask Learning Model Mingcan Xiang et.al. 2408.03913 null
2024-08-07 Dual-Modeling Decouple Distillation for Unsupervised Anomaly Detection Xinyue Liu et.al. 2408.03888 null
2024-08-07 Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields Joo Chan Lee et.al. 2408.03822 null
2024-08-07 Iterative Knowledge Distillation through Feedback-Driven Learning Cycles Yujia Chen et.al. 2408.03680 null
2024-08-07 Real-time Event Recognition of Long-distance Distributed Vibration Sensing with Knowledge Distillation and Hardware Acceleration Zhongyao Luo et.al. 2408.03647 link
2024-08-07 Distillation Learning Guided by Image Reconstruction for One-Shot Medical Image Segmentation Feng Zhou et.al. 2408.03616 link
2024-08-06 EEGMobile: Enhancing Speed and Accuracy in EEG-Based Gaze Prediction with Advanced Mobile Architectures Teng Liang et.al. 2408.03449 link
2024-08-06 DopQ-ViT: Towards Distribution-Friendly and Outlier-Aware Post-Training Quantization for Vision Transformers Lianwei Yang et.al. 2408.03291 null
2024-08-06 Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments Angie Boggust et.al. 2408.03274 null
2024-08-06 Leveraging Entity Information for Cross-Modality Correlation Learning: The Entity-Guided Multimodal Summarization Yanghai Zhang et.al. 2408.03149 link
2024-08-06 Inference Optimizations for Large Language Models: Effects, Challenges, and Practical Considerations Leo Donisch et.al. 2408.03130 null
2024-08-06 Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression Jonas Schmitt et.al. 2408.03046 link
2024-08-06 VizECGNet: Visual ECG Image Network for Cardiovascular Diseases Classification with Multi-Modal Training and Knowledge Distillation Ju-Hyeon Nam et.al. 2408.02888 null
2024-08-05 An approach to optimize inference of the DIART speaker diarization pipeline Roman Aperdannier et.al. 2408.02341 null
2024-08-05 Low-Cost Self-Ensembles Based on Multi-Branch Transformation and Grouped Convolution Hojung Lee et.al. 2408.02307 link
2024-08-05 Unsupervised Domain Adaption Harnessing Vision-Language Pre-training Wenlve Zhou et.al. 2408.02192 link
2024-08-03 Joint Model Pruning and Resource Allocation for Wireless Time-triggered Federated Learning Xinlu Zhang et.al. 2408.01765 null
2024-08-02 An Adaptive Tensor-Train Decomposition Approach for Efficient Deep Neural Network Compression Shiyi Luo et.al. 2408.01534 null
2024-08-02 Exploiting the Semantic Knowledge of Pre-trained Text-Encoders for Continual Learning Lu Yu et.al. 2408.01076 link
2024-08-02 Tensor Train Low-rank Approximation (TT-LoRA): Democratizing AI with Accelerated LLMs Afia Anjum et.al. 2408.01008 null
2024-08-01 DistillGrasp: Integrating Features Correlation with Knowledge Distillation for Depth Completion of Transparent Objects Yiheng Huang et.al. 2408.00337 null
2024-08-01 Clover-2: Accurate Inference for Regressive Lightweight Speculative Decoding Bin Xiao et.al. 2408.00264 null
2024-08-01 Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation Kohei Matsuura et.al. 2408.00205 null
2024-07-31 StyleRF-VolVis: Style Transfer of Neural Radiance Fields for Expressive Volume Visualization Kaiyuan Tang et.al. 2408.00150 null
2024-08-02 Gemma 2: Improving Open Language Models at a Practical Size Gemma Team et.al. 2408.00118 null
2024-07-31 Dynamic Object Queries for Transformer-based Incremental Object Detection Jichuan Zhang et.al. 2407.21687 null
2024-07-31 Learning Effective Representations for Retrieval Using Self-Distillation with Adaptive Relevance Margins Lukas Gienapp et.al. 2407.21515 null
2024-07-31 VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Lifelong Learning Yuhang Ming et.al. 2407.21416 null
2024-07-31 Lifelong Person Search Jae-Won Yang et.al. 2407.21252 null
2024-07-29 SalNAS: Efficient Saliency-prediction Neural Architecture Search with self-knowledge distillation Chakkrit Termritthikun et.al. 2407.20062 link
2024-07-29 ActivityCLIP: Enhancing Group Activity Recognition by Mining Complementary Information from Text to Supplement Image Modality Guoliang Xu et.al. 2407.19820 null
2024-07-29 Realizing Unaligned Block-wise Pruning for DNN Acceleration on Mobile Devices Hayun Lee et.al. 2407.19644 null
2024-07-28 Mixture of Modular Experts: Distilling Knowledge from a Multilingual Teacher into Specialized Modular Language Models Mohammed Al-Maamari et.al. 2407.19610 link
2024-07-28 Overcoming Uncertain Incompleteness for Robust Multimodal Sequential Diagnosis Prediction via Knowledge Distillation and Random Data Erasing Heejoon Koo et.al. 2407.19540 null
2024-07-28 LLAVADI: What Matters For Multimodal Large Language Models Distillation Shilin Xu et.al. 2407.19409 null
2024-07-28 Logic Distillation: Learning from Code Function by Function for Planning and Decision-making Dong Chen et.al. 2407.19405 null
2024-07-27 Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network Gang Pan et.al. 2407.19271 null
2024-07-26 Automatic Detection of Moral Values in Music Lyrics Vjosa Preniqi et.al. 2407.18787 link
2024-07-26 Boosting Cross-Domain Point Classification via Distilling Relational Priors from 2D Transformers Longkun Zou et.al. 2407.18534 link
2024-07-26 FedUD: Exploiting Unaligned Data for Cross-Platform Federated Click-Through Rate Prediction Wentao Ouyang et.al. 2407.18472 null
2024-07-26 Towards A Generalizable Pathology Foundation Model via Unified Knowledge Distillation Jiabo Ma et.al. 2407.18449 null
2024-07-25 Leveraging Foundation Models via Knowledge Distillation in Multi-Object Tracking: Distilling DINOv2 Features to FairMOT Niels G. Faber et.al. 2407.18288 link
2024-07-25 Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning Tianduo Wang et.al. 2407.18248 link
2024-07-25 How to Train the Teacher Model for Effective Knowledge Distillation Shayan Mohajer Hamidi et.al. 2407.18041 link
2024-07-25 Peak-Controlled Logits Poisoning Attack in Federated Distillation Yuhan Tang et.al. 2407.18039 null
2024-07-25 Separating Novel Features for Logical Anomaly Detection: A Straightforward yet Effective Approach Kangil Lee et.al. 2407.17909 null
2024-07-25 NC-NCD: Novel Class Discovery for Node Classification Yue Hou et.al. 2407.17816 link
2024-07-24 CoMoTo: Unpaired Cross-Modal Lesion Distillation Improves Breast Lesion Detection in Tomosynthesis Muhammad Alberb et.al. 2407.17620 link
2024-07-24 (PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork Tianjin Huang et.al. 2407.17412 null
2024-07-23 Strike a Balance in Continual Panoptic Segmentation Jinpeng Chen et.al. 2407.16354 link
2024-07-23 OriGen:Enhancing RTL Code Generation with Code-to-Code Augmentation and Self-Reflection Fan Cui et.al. 2407.16237 link
2024-07-23 DDK: Distilling Domain Knowledge for Efficient Large Language Models Jiaheng Liu et.al. 2407.16154 null

(back to top)

About

🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%