Usage instructions: here
Table of Contents
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-30 | Improving Acoustic Scene Classification in Low-Resource Conditions | Zhi Chen et.al. | 2412.20722 | null |
2024-12-29 | PTQ4VM: Post-Training Quantization for Visual Mamba | Younghyun Cho et.al. | 2412.20386 | null |
2024-12-28 | IMSSA: Deploying modern state-space models on memristive in-memory compute hardware | Sebastian Siegel et.al. | 2412.20215 | null |
2024-12-27 | Data-Free Group-Wise Fully Quantized Winograd Convolution via Learnable Scales | Shuokai Pan et.al. | 2412.19867 | null |
2024-12-27 | MBQ: Modality-Balanced Quantization for Large Vision-Language Models | Shiyao Li et.al. | 2412.19509 | link |
2024-12-24 | Unified Stochastic Framework for Neural Network Quantization and Pruning | Haoyu Zhang et.al. | 2412.18184 | null |
2024-12-21 | TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models | Haocheng Huang et.al. | 2412.16700 | null |
2024-12-20 | Improving Quantization-aware Training of Low-Precision Network via Block Replacement on Full-Precision Counterpart | Chengting Yu et.al. | 2412.15846 | null |
2024-12-19 | Progressive Fine-to-Coarse Reconstruction for Accurate Low-Bit Post-Training Quantization in Vision Transformers | Rui Ding et.al. | 2412.14633 | null |
2024-12-19 | Qua |
Keith G. Mills et.al. | 2412.14628 | null |
2024-12-18 | ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals | Utkarsh Saxena et.al. | 2412.14363 | link |
2024-12-15 | Efficient Quantization-Aware Training on Segment Anything Model in Medical Images and Its Deployment | Haisheng Lu et.al. | 2412.11186 | link |
2024-12-13 | TTAQ: Towards Stable Post-training Quantization in Continuous Domain Adaptation | Junrui Xiao et.al. | 2412.09899 | null |
2024-12-12 | CRVQ: Channel-relaxed Vector Quantization for Extreme Compression of LLMs | Yuzhuang Xu et.al. | 2412.09282 | null |
2024-12-10 | Post-Training Non-Uniform Quantization for Convolutional Neural Networks | Ahmed Luqman et.al. | 2412.07391 | null |
2024-12-09 | FP=xINT:A Low-Bit Series Expansion Algorithm for Post-Training Quantization | Boyang Zhang et.al. | 2412.06865 | null |
2024-12-09 | Efficiency Meets Fidelity: A Novel Quantization Framework for Stable Diffusion | Shuaiting Li et.al. | 2412.06661 | null |
2024-12-07 | GAQAT: gradient-adaptive quantization-aware training for domain generalization | Jiacheng Jiang et.al. | 2412.05551 | null |
2024-12-07 | SKIM: Any-bit Quantization Pushing The Limits of Post-Training Quantization | Runsheng Bai et.al. | 2412.04180 | null |
2024-12-05 | Quantized and Interpretable Learning Scheme for Deep Neural Networks in Classification Task | Alireza Maleki et.al. | 2412.03915 | null |
2024-12-03 | CPTQuant - A Novel Mixed Precision Post-Training Quantization Techniques for Large Language Models | Amitash Nanda et.al. | 2412.03599 | null |
2024-11-26 | Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving | Jon Gutiérrez-Zaballa et.al. | 2411.17543 | null |
2024-12-03 | PassionSR: Post-Training Quantization with Adaptive Scale in One-Step Diffusion based Image Super-Resolution | Libo Zhu et.al. | 2411.17106 | link |
2024-11-23 | freePruner: A Training-free Approach for Large Multimodal Model Acceleration | Bingxin Xu et.al. | 2411.15446 | null |
2024-11-22 | FLARE: FP-Less PTQ and Low-ENOB ADC Based AMS-PiM for Error-Resilient, Fast, and Efficient Transformer Acceleration | Donghyeon Yi et.al. | 2411.14733 | null |
2024-11-17 | EfQAT: An Efficient Framework for Quantization-Aware Training | Saleh Ashkboos et.al. | 2411.11038 | null |
2024-11-12 | ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization | Weibo Zhao et.al. | 2411.07762 | null |
2024-11-09 | Optimizing Large Language Models through Quantization: A Comparative Analysis of PTQ and QAT Techniques | Jahid Hasan et.al. | 2411.06084 | null |
2024-11-08 | SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models | Muyang Li et.al. | 2411.05007 | link |
2024-11-30 | Scaling Laws for Precision | Tanishq Kumar et.al. | 2411.04330 | null |
2024-11-06 | Interactions Across Blocks in Post-Training Quantization of Large Language Models | Khasmamad Shabanovi et.al. | 2411.03934 | null |
2024-11-06 | An Edge Computing-Based Solution for Real-Time Leaf Disease Classification using Thermal Imaging | Públio Elon Correa da Silva et.al. | 2411.03835 | link |
2024-11-06 | TATAA: Programmable Mixed-Precision Transformer Acceleration with a Transformable Arithmetic Architecture | Jiajun Wu et.al. | 2411.03697 | null |
2024-10-29 | Data Generation for Hardware-Friendly Post-Training Quantization | Lior Dikstein et.al. | 2410.22110 | link |
2024-10-30 | IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models | Hang Guo et.al. | 2410.21759 | link |
2024-10-26 | DQRM: Deep Quantized Recommendation Models | Yang Zhou et.al. | 2410.20046 | link |
2024-10-14 | Real-Time Stress Detection via Photoplethysmogram Signals: Implementation of a Combined Continuous Wavelet Transform and Convolutional Neural Network on Resource-Constrained Microcontrollers | Yasin Hasanpoor et.al. | 2410.19776 | null |
2024-10-24 | TesseraQ: Ultra Low-Bit LLM Post-Training Quantization with Block Reconstruction | Yuhang Li et.al. | 2410.19103 | null |
2024-10-18 | Understanding the difficulty of low-precision post-training quantization of large language models | Zifei Xu et.al. | 2410.14570 | null |
2024-10-17 | Quamba: A Post-Training Quantization Recipe for Selective State Space Models | Hung-Yueh Chiang et.al. | 2410.13229 | link |
2024-10-17 | Scaling laws for post-training quantized large language models | Zifei Xu et.al. | 2410.12119 | null |
2024-10-15 | Error Diffusion: Post Training Quantization with Block-Scaled Number Formats for Neural Networks | Alireza Khodamoradi et.al. | 2410.11203 | link |
2024-10-06 | Continuous Approximations for Improving Quantization Aware Training of LLMs | He Li et.al. | 2410.10849 | null |
2024-10-12 | SLiM: One-shot Quantized Sparse Plus Low-rank Approximation of LLMs | Mohammad Mozaffari et.al. | 2410.09615 | link |
2024-10-12 | FlatQuant: Flatness Matters for LLM Quantization | Yuxuan Sun et.al. | 2410.09426 | link |
2024-10-10 | Q-VLM: Post-training Quantization for Large Vision-Language Models | Changyuan Wang et.al. | 2410.08119 | link |
2024-10-10 | Post-Training Quantization in Brain-Computer Interfaces based on Event-Related Potential Detection | Hubert Cecotti et.al. | 2410.07920 | null |
2024-10-10 | CrossQuant: A Post-Training Quantization Method with Smaller Quantization Kernel for Precise Large Language Model Compression | Wenyuan Liu et.al. | 2410.07505 | null |
2024-10-09 | Scaling Laws for Mixed quantization in Large Language Models | Zeyu Cao et.al. | 2410.06722 | null |
2024-10-08 | QERA: an Analytical Framework for Quantization Error Reconstruction | Cheng Zhang et.al. | 2410.06040 | null |
2024-10-08 | QT-DoG: Quantization-aware Training for Domain Generalization | Saqib Javed et.al. | 2410.06020 | link |
2024-10-10 | ARB-LLM: Alternating Refined Binarizations for Large Language Models | Zhiteng Li et.al. | 2410.03129 | link |
2024-10-03 | Lightweight Diffusion Models for Resource-Constrained Semantic Communication | Giovanni Pignata et.al. | 2410.02491 | link |
2024-10-01 | Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging | Ismail Erbas et.al. | 2410.00948 | null |
2024-09-30 | Constraint Guided Model Quantization of Neural Networks | Quinten Van Baelen et.al. | 2409.20138 | null |
2024-09-26 | P4Q: Learning to Prompt for Quantization in Visual-language Models | Huixin Sun et.al. | 2409.17634 | null |
2024-09-25 | Accumulator-Aware Post-Training Quantization | Ian Colbert et.al. | 2409.17092 | null |
2024-09-25 | VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models | Yifei Liu et.al. | 2409.17066 | link |
2024-09-25 | PTQ4RIS: Post-Training Quantization for Referring Image Segmentation | Xiaoyan Jiang et.al. | 2409.17020 | link |
2024-09-26 | INT-FlashAttention: Enabling Flash Attention for INT8 Quantization | Shimao Chen et.al. | 2409.16997 | link |
2024-09-20 | PTQ4ADM: Post-Training Quantization for Efficient Text Conditional Audio Diffusion Models | Jayneel Vora et.al. | 2409.13894 | null |
2024-09-18 | Art and Science of Quantizing Large-Scale Models: A Comprehensive Overview | Yanshu Wang et.al. | 2409.11650 | null |
2024-09-12 | LlamaF: An Efficient Llama2 Architecture Accelerator on Embedded FPGAs | Han Xu et.al. | 2409.11424 | null |
2024-09-12 | DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing | Zhenyuan Dong et.al. | 2409.07756 | link |
2024-08-31 | Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization | Vage Egiazarian et.al. | 2409.00492 | null |
2024-08-29 | A machine learning approach for computing solar flare locations in X-rays on-board Solar Orbiter/STIX | Paolo Massa et.al. | 2408.16642 | link |
2024-08-29 | On-device AI: Quantization-aware Training of Transformers in Time-Series | Tianheng Ling et.al. | 2408.16495 | null |
2024-08-27 | The Uniqueness of LLaMA3-70B with Per-Channel Quantization: An Empirical Study | Minghai Qin et.al. | 2408.15301 | null |
2024-08-25 | MobileQuant: Mobile-friendly Quantization for On-device Language Models | Fuwen Tan et.al. | 2408.13933 | link |
2024-08-25 | Infrared Domain Adaptation with Zero-Shot Quantization | Burak Sevsay et.al. | 2408.13925 | null |
2024-08-23 | ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models | Chao Zeng et.al. | 2408.08554 | link |
2024-08-14 | Analog Spiking Neuron in CMOS 28 nm Towards Large-Scale Neuromorphic Processors | Marwan Besrour et.al. | 2408.07734 | null |
2024-08-13 | Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models | Cheng Chen et.al. | 2408.06995 | null |
2024-08-11 | RTF-Q: Unsupervised domain adaptation based retraining-free quantization network | Nanyang Du et.al. | 2408.05752 | null |
2024-08-16 | DopQ-ViT: Towards Distribution-Friendly and Outlier-Aware Post-Training Quantization for Vision Transformers | Lianwei Yang et.al. | 2408.03291 | null |
2024-08-05 | HQOD: Harmonious Quantization for Object Detection | Long Huang et.al. | 2408.02561 | link |
2024-08-01 | Reclaiming Residual Knowledge: A Novel Paradigm to Low-Bit Quantization | Róisín Luo et.al. | 2408.00923 | null |
2024-08-07 | Temporal Feature Matters: A Framework for Diffusion Model Quantization | Yushi Huang et.al. | 2407.19547 | null |
2024-07-25 | Unlocking Tokens as Data Points for Generalization Bounds on Larger Language Models | Sanae Lotfi et.al. | 2407.18158 | null |
2024-07-27 | MetaAug: Meta-Data Augmentation for Post-Training Quantization | Cuong Pham et.al. | 2407.14726 | link |
2024-07-17 | AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer | Zhuguanyu Wu et.al. | 2407.12951 | link |
2024-07-17 | Mamba-PTQ: Outlier Channels in Recurrent Large Language Models | Alessandro Pierro et.al. | 2407.12397 | null |
2024-07-17 | StoX-Net: Stochastic Processing of Partial Sums for Efficient In-Memory Computing DNN Accelerators | Ethan G Rogers et.al. | 2407.12378 | null |
2024-07-17 | Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models | Ayush Kaushal et.al. | 2407.12327 | link |
2024-07-17 | QVD: Post-training Quantization for Video Diffusion Models | Shilong Tian et.al. | 2407.11585 | null |
2024-07-16 | LRQ: Optimizing Post-Training Quantization for Large Language Models by Learning Low-Rank Weight-Scaling Matrices | Jung Hyun Lee et.al. | 2407.11534 | link |
2024-07-11 | Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients | Zhenyu Zhang et.al. | 2407.08296 | link |
2024-07-10 | RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization | Xijie Huang et.al. | 2407.08044 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-24 | SlimGPT: Layer-wise Structured Pruning for Large Language Models | Gui Ling et.al. | 2412.18110 | null |
2024-12-23 | GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference | Chao Zeng et.al. | 2412.17560 | null |
2024-12-28 | Lillama: Large Language Models Compression via Low-Rank Feature Distillation | Yaya Sy et.al. | 2412.16719 | null |
2024-12-21 | V"Mean"ba: Visual State Space Models only need 1 hidden dimension | Tien-Yu Chi et.al. | 2412.16602 | null |
2024-12-20 | Less is More: Towards Green Code Large Language Models via Unified Structural Pruning | Guang Yang et.al. | 2412.15921 | null |
2024-12-20 | All-in-One Tuning and Structural Pruning for Domain-Specific LLMs | Lei Lu et.al. | 2412.14426 | null |
2024-12-17 | Learning Coarse-to-Fine Pruning of Graph Convolutional Networks for Skeleton-based Recognition | Hichem Sahbi et.al. | 2412.12887 | null |
2024-12-17 | A Comparative Study of Pruning Methods in Transformer-based Time Series Forecasting | Nicholas Kiefer et.al. | 2412.12883 | null |
2024-12-17 | Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation | Dongyue Wu et.al. | 2412.12672 | link |
2024-12-19 | RemoteTrimmer: Adaptive Structural Pruning for Remote Sensing Image Classification | Guangwenjie Zou et.al. | 2412.12603 | link |
2024-12-16 | Designing Semi-Structured Pruning of Graph Convolutional Networks for Skeleton-based Recognition | Hichem Sahbi et.al. | 2412.11813 | null |
2024-12-16 | QPruner: Probabilistic Decision Quantization for Structured Pruning in Large Language Models | Changhai Zhou et.al. | 2412.11629 | null |
2024-12-09 | LLM-BIP: Structured Pruning for Large Language Models with Block-Wise Forward Importance Propagation | Haihang Wu et.al. | 2412.06419 | null |
2024-12-03 | Effortless Efficiency: Low-Cost Pruning of Diffusion Models | Yang Zhang et.al. | 2412.02852 | null |
2024-11-25 | Deep Convolutional Neural Networks Structured Pruning via Gravity Regularization | Abdesselam Ferdi et.al. | 2411.16901 | null |
2024-11-21 | FuseGPT: Learnable Layers Fusion of Generative Pre-trained Transformers | Zehua Pei et.al. | 2411.14507 | null |
2024-11-21 | Layer Pruning with Consensus: A Triple-Win Solution | Leandro Giusti Mugnaini et.al. | 2411.14345 | link |
2024-11-21 | DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization | Hexuan Deng et.al. | 2411.14055 | link |
2024-11-19 | FGP: Feature-Gradient-Prune for Efficient Convolutional Layer Pruning | Qingsong Lv et.al. | 2411.12781 | link |
2024-11-17 | Electrostatic Force Regularization for Neural Structured Pruning | Abdesselam Ferdi et.al. | 2411.11079 | null |
2024-11-15 | Systolic Arrays and Structured Pruning Co-design for Efficient Transformers in Edge Systems | Pedro Palacios et.al. | 2411.10285 | null |
2024-12-16 | P |
Xiaodong Chen et.al. | 2411.10272 | null |
2024-11-10 | RL-Pruner: Structured Pruning Using Reinforcement Learning for CNN Compression and Acceleration | Boyao Wang et.al. | 2411.06463 | link |
2024-11-05 | Layer-Adaptive State Pruning for Deep State Space Models | Minseon Gwak et.al. | 2411.02824 | link |
2024-11-04 | Automatic Structured Pruning for Efficient Architecture in Federated Learning | Thai Vu Nguyen et.al. | 2411.01759 | link |
2024-10-31 | Mutual Information Preserving Neural Network Pruning | Charles Westphal et.al. | 2411.00147 | null |
2024-10-24 | Tailored-LLaMA: Optimizing Few-Shot Learning in Pruned LLaMA Models with Task-Specific Prompts | Danyal Aftab et.al. | 2410.19185 | null |
2024-10-18 | EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search | Oliver Sieberling et.al. | 2410.14649 | link |
2024-11-04 | DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models | Shangqian Gao et.al. | 2410.11988 | null |
2024-11-12 | Self-Data Distillation for Recovering Quality in Pruned Large Language Models | Vithursan Thangarasa et.al. | 2410.09982 | null |
2024-10-11 | Unity is Power: Semi-Asynchronous Collaborative Training of Large-Scale Models with Structured Pruning in Resource-Limited Clients | Yan Li et.al. | 2410.08457 | null |
2024-10-11 | Chip-Tuning: Classify Before Language Models Say | Fangwei Zhu et.al. | 2410.06541 | link |
2024-11-04 | Large Language Model Compression with Neural Architecture Search | Rhea Sanjay Sukthanker et.al. | 2410.06479 | null |
2024-09-29 | Investigating the Effect of Network Pruning on Performance and Interpretability | Jonathan von Rad et.al. | 2409.19727 | null |
2024-10-30 | Search for Efficient Large Language Models | Xuan Shen et.al. | 2409.17372 | link |
2024-09-22 | SPAQ-DL-SLAM: Towards Optimizing Deep Learning-based SLAM for Resource-Constrained Embedded Platforms | Niraj Pudasaini et.al. | 2409.14515 | null |
2024-09-20 | CFSP: An Efficient Structured Pruning Framework for LLMs with Coarse-to-Fine Activation Information | Yuxin Wang et.al. | 2409.13199 | link |
2024-09-17 | KVPruner: Structural Pruning for Faster and Memory-Efficient Large Language Models | Bo Lv et.al. | 2409.11057 | null |
2024-09-11 | HESSO: Towards Automatic Efficient and User Friendly Any Neural Network Training and Pruning | Tianyi Chen et.al. | 2409.09085 | link |
2024-09-12 | Structured Pruning for Efficient Visual Place Recognition | Oliver Grainge et.al. | 2409.07834 | null |
2024-09-10 | STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning | Jaeseong Lee et.al. | 2409.06211 | null |
2024-09-05 | TropNNC: Structured Neural Network Compression Using Tropical Geometry | Konstantinos Fotopoulos et.al. | 2409.03945 | null |
2024-09-02 | Edge AI: Evaluation of Model Compression Techniques for Convolutional Neural Networks | Samer Francy et.al. | 2409.02134 | null |
2024-08-27 | PAT: Pruning-Aware Tuning for Large Language Models | Yijiang Liu et.al. | 2408.14721 | link |
2024-08-15 | PQV-Mobile: A Combined Pruning and Quantization Toolkit to Optimize Vision Transformers for Mobile Applications | Kshitij Bhardwaj et.al. | 2408.08437 | link |
2024-08-13 | Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models | Chenqian Yan et.al. | 2408.06646 | null |
2024-08-06 | Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression | Jonas Schmitt et.al. | 2408.03046 | link |
2024-08-02 | Sustainable Diffusion-based Incentive Mechanism for Generative AI-driven Digital Twins in Industrial Cyber-Physical Systems | Jinbo Wen et.al. | 2408.01173 | null |
2024-08-22 | Diff-Cleanse: Identifying and Mitigating Backdoor Attacks in Diffusion Models | Jiang Hao et.al. | 2407.21316 | link |
2024-07-26 | Greedy Output Approximation: Towards Efficient Structured Pruning for LLMs Without Retraining | Jianwei Li et.al. | 2407.19126 | null |
2024-07-17 | MCU-MixQ: A HW/SW Co-optimized Mixed-precision Neural Network Design Framework for MCUs | Junfeng Gong et.al. | 2407.18267 | null |
2024-07-24 | (PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork | Tianjin Huang et.al. | 2407.17412 | null |
2024-07-22 | Comprehensive Study on Performance Evaluation and Optimization of Model Compression: Bridging Traditional Deep Learning and Large Language Models | Aayush Saxena et.al. | 2407.15904 | null |
2024-07-19 | Shapley Pruning for Neural Network Compression | Kamil Adamczewski et.al. | 2407.15875 | null |
2024-07-22 | A Pairwise Comparison Relation-assisted Multi-objective Evolutionary Neural Architecture Search Method with Multi-population Mechanism | Yu Xue et.al. | 2407.15600 | null |
2024-07-19 | Straightforward Layer-wise Pruning for More Efficient Visual Adaptation | Ruizi Han et.al. | 2407.14330 | null |
2024-07-18 | Data-Algorithm-Architecture Co-Optimization for Fair Neural Networks on Skin Lesion Dataset | Yi Sheng et.al. | 2407.13896 | null |
2024-07-18 | Reconstruct the Pruned Model without Any Retraining | Pingjie Wang et.al. | 2407.13331 | null |
2024-07-18 | MO-EMT-NAS: Multi-Objective Continuous Transfer of Architectural Knowledge Between Tasks from Different Datasets | Peng Liao et.al. | 2407.13122 | null |
2024-07-16 | MINI-LLM: Memory-Efficient Structured Pruning for Large Language Models | Hongrong Cheng et.al. | 2407.11681 | null |
2024-07-15 | DDFAD: Dataset Distillation Framework for Audio Data | Wenbo Jiang et.al. | 2407.10446 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-29 | A Novel FPGA-based CNN Hardware Accelerator: Optimization for Convolutional Layers using Karatsuba Ofman Multiplier | Amit Sarkar et.al. | 2412.20393 | null |
2024-12-29 | Open-Source Heterogeneous SoCs for AI: The PULP Platform Experience | Francesco Conti et.al. | 2412.20391 | null |
2024-12-27 | HADES: Hardware Accelerated Decoding for Efficient Speculation in Large Language Models | Ze Yang et.al. | 2412.19925 | null |
2024-12-26 | Evolution, Challenges, and Optimization in Computer Architecture: The Role of Reconfigurable Systems | Jefferson Ederhion et.al. | 2412.19234 | null |
2024-12-24 | GCN-ABFT: Low-Cost Online Error Checking for Graph Convolutional Networks | Christodoulos Peltekis et.al. | 2412.18534 | null |
2024-12-23 | Advantages of density in tensor network geometries for gradient based training | Sergi Masot-Llima et.al. | 2412.17497 | null |
2024-12-20 | Chorba: A novel CRC32 implementation | Sam Russell et.al. | 2412.16398 | null |
2024-12-20 | Designing Visual Explanations and Learner Controls to Engage Adolescents in AI-Supported Exercise Selection | Jeroen Ooge et.al. | 2412.16034 | null |
2024-12-20 | A survey on FPGA-based accelerator for ML models | Feng Yan et.al. | 2412.15666 | null |
2024-12-19 | LiDAR-RT: Gaussian-based Ray Tracing for Dynamic LiDAR Re-simulation | Chenxu Zhou et.al. | 2412.15199 | null |
2024-12-18 | Pattern Matching in AI Compilers and its Formalization (Extended Version) | Joseph W. Cutler et.al. | 2412.13398 | null |
2024-12-17 | if-ZKP: Intel FPGA-Based Acceleration of Zero Knowledge Proofs | Shahzad Ahmad Butt et.al. | 2412.12481 | null |
2024-12-13 | Strong Structural Bounds for MaxSAT: The Fine Details of Using Neuromorphic and Quantum Hardware Accelerators | Max Bannach et.al. | 2412.10289 | null |
2024-12-16 | MVQ:Towards Efficient DNN Compression and Acceleration with Masked Vector Quantization | Shuaiting Li et.al. | 2412.10261 | null |
2024-12-12 | MPAX: Mathematical Programming in JAX | Haihao Lu et.al. | 2412.09734 | link |
2024-12-12 | Evaluating the Potential of In-Memory Processing to Accelerate Homomorphic Encryption | Mpoki Mwaisela et.al. | 2412.09144 | null |
2024-12-12 | Analyzing Practical Policies for Multiresource Job Scheduling | Zhongrui Chen et.al. | 2412.08915 | null |
2024-12-09 | LLM-BIP: Structured Pruning for Large Language Models with Block-Wise Forward Importance Propagation | Haihang Wu et.al. | 2412.06419 | null |
2024-12-03 | Demonstrating the Advantages of Analog Wafer-Scale Neuromorphic Hardware | Hartmut Schmidt et.al. | 2412.02619 | null |
2024-12-03 | Multi-timescale synaptic plasticity on analog neuromorphic hardware | Amani Atoui et.al. | 2412.02515 | null |
2024-11-27 | Deterministic and Probabilistic Rounding Error Analysis for Mixed-Precision Arithmetic on Modern Computing Units | Sahil Bhola et.al. | 2411.18747 | null |
2024-11-26 | Scalable iterative pruning of large language and vision models using block coordinate descent | Gili Rosenberg et.al. | 2411.17796 | null |
2024-11-25 | Limitations of tensor network approaches for optimization and sampling: A comparison against quantum and classical Ising machines | Anna Maria Dziubyna et.al. | 2411.16431 | null |
2024-11-25 | MixPE: Quantization and Hardware Co-design for Efficient LLM Inference | Yu Zhang et.al. | 2411.16158 | null |
2024-11-20 | Hardware Accelerators for Artificial Intelligence | S M Mojahidul Ahsan et.al. | 2411.13717 | null |
2024-11-20 | Hardware Scaling Trends and Diminishing Returns in Large-Scale Distributed Training | Jared Fernandez et.al. | 2411.13055 | null |
2024-11-19 | FGP: Feature-Gradient-Prune for Efficient Convolutional Layer Pruning | Qingsong Lv et.al. | 2411.12781 | link |
2024-11-19 | Design of an FPGA-Based Neutral Atom Rearrangement Accelerator for Quantum Computing | Xiaorang Guo et.al. | 2411.12401 | null |
2024-11-18 | SILVIA: Automated Superword-Level Parallelism Exploitation via HLS-Specific LLVM Passes for Compute-Intensive FPGA Accelerators | Giovanni Brignone et.al. | 2411.11384 | link |
2024-12-01 | InvestESG: A multi-agent reinforcement learning benchmark for studying climate investment as a social dilemma | Xiaoxuan Hou et.al. | 2411.09856 | link |
2024-11-21 | OpenGeMM: A High-Utilization GeMM Accelerator Generator with Lightweight RISC-V Control and Tight Memory Coupling | Xiaoling Yi et.al. | 2411.09543 | null |
2024-11-15 | Communication Compression for Tensor Parallel LLM Inference | Jan Hansen-Palmus et.al. | 2411.09510 | null |
2024-11-18 | RPCAcc: A High-Performance and Reconfigurable PCIe-attached RPC Accelerator | Jie Zhang et.al. | 2411.07632 | null |
2024-11-11 | Spiking Transformer Hardware Accelerators in 3D Integration | Boxun Xu et.al. | 2411.07397 | null |
2024-11-10 | AMAZE: Accelerated MiMC Hardware Architecture for Zero-Knowledge Applications on the Edge | Anees Ahmed et.al. | 2411.06350 | link |
2024-11-03 | Stochastic Communication Avoidance for Recommendation Systems | Lutfi Eren Erdogan et.al. | 2411.01611 | null |
2024-11-01 | Inducing Semi-Structured Sparsity by Masking for Efficient Model Inference in Convolutional Networks | David A. Danhofer et.al. | 2411.00288 | null |
2024-10-31 | LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators | Krishna Teja Chitty-Venkata et.al. | 2411.00136 | link |
2024-10-30 | Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks | Michael Matthews et.al. | 2410.23208 | link |
2024-10-24 | Watermarking Large Language Models and the Generated Content: Opportunities and Challenges | Ruisi Zhang et.al. | 2410.19096 | null |
2024-10-21 | Hacking the Fabric: Targeting Partial Reconfiguration for Fault Injection in FPGA Fabrics | Jayeeta Chaudhuri et.al. | 2410.16497 | null |
2024-10-21 | Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure | Juan Cruz-Martinez et.al. | 2410.16248 | null |
2024-10-20 | A Remedy to Compute-in-Memory with Dynamic Random Access Memory: 1FeFET-1C Technology for Neuro-Symbolic AI | Xunzhao Yin et.al. | 2410.15296 | null |
2024-10-18 | Self-Satisfied: An end-to-end framework for SAT generation and prediction | Christopher R. Serrano et.al. | 2410.14888 | null |
2024-10-17 | Quamba: A Post-Training Quantization Recipe for Selective State Space Models | Hung-Yueh Chiang et.al. | 2410.13229 | link |
2024-10-16 | Mixed-precision finite element kernels and assembly: Rounding error analysis and hardware acceleration | M. Croci et.al. | 2410.12614 | link |
2024-10-15 | Fast Local Neural Regression for Low-Cost, Path Traced Lambertian Global Illumination | Arturo Salmi et.al. | 2410.11625 | null |
2024-10-15 | Efficiera Residual Networks: Hardware-Friendly Fully Binary Weight with 2-bit Activation Model Achieves Practical ImageNet Accuracy | Shuntaro Takahashi et.al. | 2410.11553 | link |
2024-10-14 | Differentiable Weightless Neural Networks | Alan T. L. Bacellar et.al. | 2410.11112 | link |
2024-10-14 | SLaNC: Static LayerNorm Calibration | Mahsa Salmani et.al. | 2410.10553 | null |
2024-10-11 | MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices | Mohamed Amine Hamdi et.al. | 2410.08855 | link |
2024-10-09 | Optimized Spatial Architecture Mapping Flow for Transformer Accelerators | Haocheng Xu et.al. | 2410.07407 | null |
2024-10-09 | Unlocking Real-Time Fluorescence Lifetime Imaging: Multi-Pixel Parallelism for FPGA-Accelerated Processing | Ismail Erbas et.al. | 2410.07364 | null |
2024-10-03 | CAX: Cellular Automata Accelerated in JAX | Maxence Faldor et.al. | 2410.02651 | link |
2024-10-03 | Extracting the Potential of Emerging Hardware Accelerators for Symmetric Eigenvalue Decomposition | Hansheng Wang et.al. | 2410.02170 | null |
2024-10-01 | Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging | Ismail Erbas et.al. | 2410.00948 | null |
2024-09-26 | Leader Selection and Follower Association for UE-centric Distributed Learning in Future Wireless Networks | Saeedeh Parsaeefard et.al. | 2409.18268 | null |
2024-09-26 | A 5T-2MTJ STT-assisted Spin Orbit Torque based Ternary Content Addressable Memory for Hardware Accelerators | Siri Narla et.al. | 2409.17863 | null |
2024-09-24 | Microsecond-Latency Feedback at a Particle Accelerator by Online Reinforcement Learning on Hardware | Luca Scomparin et.al. | 2409.16177 | null |
2024-09-25 | Ultra-low latency quantum-inspired machine learning predictors implemented on FPGA | Lorenzo Borella et.al. | 2409.16075 | null |
2024-09-19 | Enhancing Performance and Scalability of Large-Scale Recommendation Systems with Jagged Flash Attention | Rengan Xu et.al. | 2409.15373 | null |
2024-09-23 | Efficient Tabular Data Preprocessing of ML Pipelines | Yu Zhu et.al. | 2409.14912 | null |
2024-09-21 | FAMOUS: Flexible Accelerator for the Attention Mechanism of Transformer on UltraScale+ FPGAs | Ehsan Kabir et.al. | 2409.14023 | null |
2024-09-21 | ProTEA: Programmable Transformer Encoder Acceleration on FPGA | Ehsan Kabir et.al. | 2409.13975 | null |
2024-09-23 | Towards Efficient Neuro-Symbolic AI: From Workload Characterization to Hardware Architecture | Zishen Wan et.al. | 2409.13153 | null |
2024-09-20 | Learning to Compare Hardware Designs for High-Level Synthesis | Yunsheng Bai et.al. | 2409.13138 | null |
2024-09-19 | Performance and Power: Systematic Evaluation of AI Workloads on Accelerators with CARAML | Chelsea Maria John et.al. | 2409.12994 | link |
2024-09-19 | CrossRT: A cross platform programming technology for hardware-accelerated ray tracing in CG and CV applications | Vladimir Frolov et.al. | 2409.12617 | null |
2024-09-15 | Pack my weights and run! Minimizing overheads for in-memory computing accelerators | Pouya Houshmand et.al. | 2409.11437 | null |
2024-09-11 | Next-generation Probabilistic Computing Hardware with 3D MOSAICs, Illusion Scale-up, and Co-design | Tathagata Srimani et.al. | 2409.11422 | null |
2024-09-09 | Hardware Acceleration of Kolmogorov-Arnold Network (KAN) for Lightweight Edge Inference | Wei-Hsing Huang et.al. | 2409.11418 | null |
2024-09-17 | Dynamic Range Reduction via Branch-and-Bound | Thore Gerlach et.al. | 2409.10863 | null |
2024-09-16 | Count2Multiply: Reliable In-memory High-Radix Counting | João Paulo Cardoso de Lima et.al. | 2409.10136 | null |
2024-09-16 | Hardware-Accelerated Ray Tracing for Discrete and Continuous Collision Detection on GPUs | Sizhe Sui et.al. | 2409.09918 | null |
2024-09-13 | Distributed Binary Optimization with In-Memory Computing: An Application for the SAT Problem | Xiangyi Zhang et.al. | 2409.09152 | null |
2024-09-13 | Automatic Generation of Fast and Accurate Performance Models for Deep Neural Network Accelerators | Konstantin Lübeck et.al. | 2409.08595 | null |
2024-09-17 | Foragax: An Agent-Based Modelling Framework Based on JAX | Siddharth Chaturvedi et.al. | 2409.06345 | link |
2024-09-10 | PIM-MMU: A Memory Management Unit for Accelerating Data Transfers in Commercial PIM Systems | Dongjae Lee et.al. | 2409.06204 | null |
2024-09-06 | Towards Narrowing the Generalization Gap in Deep Boolean Networks | Youngsung Kim et.al. | 2409.05905 | null |
2024-09-09 | Supervised Learning for Stochastic Optimal Control | Vince Kurtz et.al. | 2409.05792 | null |
2024-09-08 | BBS: Bi-directional Bit-level Sparsity for Deep Learning Acceleration | Yuzong Chen et.al. | 2409.05227 | link |
2024-09-05 | Libra: Architectural Support For Principled, Secure And Efficient Balanced Execution On High-End Processors (Extended Version) | Hans Winderix et.al. | 2409.03743 | null |
2024-09-05 | Hardware Acceleration of LLMs: A comprehensive survey and comparison | Nikoletta Koilia et.al. | 2409.03384 | null |
2024-09-05 | Towards training digitally-tied analog blocks via hybrid gradient computation | Timothy Nest et.al. | 2409.03306 | null |
2024-08-30 | The picasso gas model: Painting intracluster gas on gravity-only simulations | F. Kéruzoré et.al. | 2408.17445 | link |
2024-08-29 | Serial and Parallel Two-Column Probing for Mixed-Integer Programming | Yongzheng Dai et.al. | 2408.16927 | link |
2024-08-29 | On-device AI: Quantization-aware Training of Transformers in Time-Series | Tianheng Ling et.al. | 2408.16495 | null |
2024-08-29 | Accelerating Image-based Pest Detection on a Heterogeneous Multi-core Microcontroller | Luca Bompani et.al. | 2408.15911 | link |
2024-08-28 | FireFly-S: Exploiting Dual-Side Sparsity for Spiking Neural Networks Acceleration with Reconfigurable Spatial Architecture | Tenglong Li et.al. | 2408.15578 | null |
2024-08-29 | CGRA4ML: A Framework to Implement Modern Neural Networks for Scientific Edge Computing | G Abarajithan et.al. | 2408.15561 | null |
2024-08-27 | SCAN-Edge: Finding MobileNet-speed Hybrid Networks for Diverse Edge Devices via Hardware-Aware Evolutionary Search | Hung-Yueh Chiang et.al. | 2408.15395 | null |
2024-08-27 | SiHGNN: Leveraging Properties of Semantic Graphs for Efficient HGNN Acceleration | Runzhen Xue et.al. | 2408.15089 | null |
2024-08-26 | On-Chip Learning with Memristor-Based Neural Networks: Assessing Accuracy and Efficiency Under Device Variations, Conductance Errors, and Input Noise | M. Reza Eslami et.al. | 2408.14680 | null |
2024-08-26 | HAPM -- Hardware Aware Pruning Method for CNN hardware accelerators in resource constrained devices | Federico Nicolas Peccia et.al. | 2408.14055 | null |
2024-08-22 | Hardware Acceleration for Knowledge Graph Processing: Challenges & Recent Developments | Maciej Besta et.al. | 2408.12173 | null |
2024-08-21 | Floating-Point Multiply-Add with Approximate Normalization for Low-Cost Matrix Engines | Kosmas Alexandridis et.al. | 2408.11997 | null |
2024-08-21 | Cage: Hardware-Accelerated Safe WebAssembly | Martin Fink et.al. | 2408.11456 | null |
2024-08-20 | Tapping in a Remote Vehicle's onboard LLM to Complement the Ego Vehicle's Field-of-View | Malsha Ashani Mahawatta Dona et.al. | 2408.10794 | null |
2024-08-16 | Xpikeformer: Hybrid Analog-Digital Hardware Acceleration for Spiking Transformers | Zihang Song et.al. | 2408.08794 | null |
2024-08-16 | Cross-Chip Partial Reconfiguration for the Initialisation of Modular and Scalable Heterogeneous Systems | Marvin Fuchs et.al. | 2408.08626 | null |
2024-08-13 | HLSPilot: LLM-based High-Level Synthesis | Chenwei Xiong et.al. | 2408.06810 | link |
2024-08-12 | Hardware Architecture Design of Model-Based Image Reconstruction Towards Palm-size Photoacoustic Tomography | Yuwei Zheng et.al. | 2408.06049 | null |
2024-08-12 | SZKP: A Scalable Accelerator Architecture for Zero-Knowledge Proofs | Alhad Daftardar et.al. | 2408.05890 | null |
2024-08-10 | LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale | Jaehong Cho et.al. | 2408.05499 | link |
2024-08-08 | Noise-augmented Chaotic Ising Machines for Combinatorial Optimization and Sampling | Kyle Lee et.al. | 2408.04744 | null |
2024-08-07 | Hardware-Assisted Virtualization of Neural Processing Units for Cloud Platforms | Yuqi Xue et.al. | 2408.04104 | null |
2024-08-07 | Real-time Event Recognition of Long-distance Distributed Vibration Sensing with Knowledge Distillation and Hardware Acceleration | Zhongyao Luo et.al. | 2408.03647 | link |
2024-08-06 | LLM-Aided Compilation for Tensor Accelerators | Charles Hong et.al. | 2408.03408 | null |
2024-08-06 | HeTraX: Energy Efficient 3D Heterogeneous Manycore Architecture for Transformer Acceleration | Pratyush Dhingra et.al. | 2408.03397 | null |
2024-08-05 | PENDRAM: Enabling High-Performance and Energy-Efficient Processing of Deep Neural Networks through a Generalized DRAM Data Mapping Policy | Rachmad Vidya Wicaksana Putra et.al. | 2408.02412 | null |
2024-08-02 | Digitized Phase Change Material Heterostack for Diffractive Optical Neural Network | Ruiyang Chen et.al. | 2408.01404 | null |
2024-08-02 | Search-in-Memory (SiM): Reliable, Versatile, and Efficient Data Matching in SSD's NAND Flash Memory Chip for Data Indexing Acceleration | Yun-Chih Chen et.al. | 2408.00327 | null |
2024-08-07 | Temporal Feature Matters: A Framework for Diffusion Model Quantization | Yushi Huang et.al. | 2407.19547 | null |
2024-07-16 | Latency optimized Deep Neural Networks (DNNs): An Artificial Intelligence approach at the Edge using Multiprocessor System on Chip (MPSoC) | Seyed Nima Omidsajedi et.al. | 2407.18264 | null |
2024-07-22 | KWT-Tiny: RISC-V Accelerated, Embedded Keyword Spotting Transformer | Aness Al-Qawlaq et.al. | 2407.16026 | null |
2024-07-18 | Integrated Hardware Architecture and Device Placement Search | Irene Wang et.al. | 2407.13143 | link |
2024-07-17 | ARTEMIS: A Mixed Analog-Stochastic In-DRAM Accelerator for Transformer Neural Networks | Salma Afifi et.al. | 2407.12638 | null |
2024-07-17 | StoX-Net: Stochastic Processing of Partial Sums for Efficient In-Memory Computing DNN Accelerators | Ethan G Rogers et.al. | 2407.12378 | null |
2024-07-16 | Co-Designing Binarized Transformer and Hardware Accelerator for Efficient End-to-End Edge Deployment | Yuhao Ji et.al. | 2407.12070 | null |
2024-07-16 | Ascend-CC: Confidential Computing on Heterogeneous NPU for Emerging Generative AI Workloads | Aritra Dhar et.al. | 2407.11888 | null |
2024-07-15 | Hierarchical search method for gravitational waves from stellar-mass binary black holes in noisy space-based detector data | Yao Fu et.al. | 2407.10797 | null |
2024-07-14 | Accelerator-as-a-Service in Public Clouds: An Intra-Host Traffic Management View for Performance Isolation in the Wild | Jiechen Zhao et.al. | 2407.10098 | null |
2024-07-12 | 68-Channel Highly-Integrated Neural Signal Processing PSoC with On-Chip Feature Extraction, Compression, and Hardware Accelerators for Neuroprosthetics in 22nm FDSOI | Liyuan Guo et.al. | 2407.09166 | null |
2024-07-12 | Hybrid Temporal Computing for Lower Power Hardware Accelerators | Maliha Tasnim et.al. | 2407.08975 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-25 | Tempus Core: Area-Power Efficient Temporal-Unary Convolution Core for Low-Precision Edge DLAs | Prabhu Vellaisamy et.al. | 2412.19002 | null |
2024-12-23 | Edge-AI for Agriculture: Lightweight Vision Models for Disease Detection in Resource-Limited Settings | Harsh Joshi et.al. | 2412.18635 | null |
2024-12-23 | tuGEMM: Area-Power-Efficient Temporal Unary GEMM Architecture for Low-Precision Edge AI | Harideep Nair et.al. | 2412.17966 | null |
2024-12-22 | Fatigue Monitoring Using Wearables and AI: Trends, Challenges, and Future Opportunities | Kourosh Kakhi et.al. | 2412.16847 | null |
2024-12-19 | ElectraSight: Smart Glasses with Fully Onboard Non-Invasive Eye Tracking Using Hybrid Contact and Contactless EOG | Nicolas Schärer et.al. | 2412.14848 | null |
2024-12-17 | Design of an AI-Enhanced Digital Stethoscope: Advancing Cardiovascular Diagnostics Through Smart Auscultation | Abraham G. Taye et.al. | 2412.14206 | null |
2024-12-16 | Flex-PE: Flexible and SIMD Multi-Precision Processing Element for AI Workloads | Mukul Lokhande et.al. | 2412.11702 | link |
2024-12-13 | Edge AI-based Radio Frequency Fingerprinting for IoT Networks | Ahmed Mohamed Hussain et.al. | 2412.10553 | null |
2024-12-13 | EI-Drive: A Platform for Cooperative Perception with Realistic Communication Models | Hanchu Zhou et.al. | 2412.09782 | null |
2024-12-12 | Optimising TinyML with Quantization and Distillation of Transformer and Mamba Models for Indoor Localisation on Edge Devices | Thanaphon Suwannaphong et.al. | 2412.09289 | null |
2024-12-10 | Performance Evaluation of ROS2-DDS middleware implementations facilitating Cooperative Driving in Autonomous Vehicle | Sumit Paul et.al. | 2412.07485 | null |
2024-12-07 | Innovative Sentiment Analysis and Prediction of Stock Price Using FinBERT, GPT-4 and Logistic Regression: A Data-Driven Approach | Olamilekan Shobayo et.al. | 2412.06837 | null |
2024-12-09 | DEX: Data Channel Extension for Efficient CNN Inference on Tiny AI Accelerators | Taesik Gong et.al. | 2412.06566 | link |
2024-12-09 | Sequential Printed MLP Circuits for Super TinyML Multi-Sensory Applications | Gurol Saglam et.al. | 2412.06542 | null |
2024-12-02 | Optimizing LoRa for Edge Computing with TinyML Pipeline for Channel Hopping | Marla Grunewald et.al. | 2412.01609 | null |
2024-12-01 | Toward Real-Time Edge AI: Model-Agnostic Task-Oriented Communication with Visual Feature Alignment | Songjie Xie et.al. | 2412.00862 | link |
2024-11-28 | Co-Learning: Towards Semi-Supervised Object Detection with Road-side Cameras | Jicheng Yuan et.al. | 2411.19143 | null |
2024-11-28 | Towards an Implementation of the Knowledge-Based Control Plane for Intelligent Swarm Networks | Xuanchi Guo et.al. | 2411.19068 | null |
2024-11-24 | Space-ground Fluid AI for 6G Edge Intelligence | Qian Chen et.al. | 2411.15845 | null |
2024-11-20 | Federated Continual Learning for Edge-AI: A Comprehensive Survey | Zi Wang et.al. | 2411.13740 | null |
2024-11-16 | Enhanced FIWARE-Based Architecture for Cyberphysical Systems With Tiny Machine Learning and Machine Learning Operations: A Case Study on Urban Mobility Systems | Javier Conde et.al. | 2411.13583 | null |
2024-11-19 | Signformer is all you need: Towards Edge AI for Sign Language | Eta Yang et.al. | 2411.12901 | link |
2024-11-16 | DEBUG-HD: Debugging TinyML models on-device using Hyper-Dimensional computing | Nikhil P Ghanathe et.al. | 2411.10692 | null |
2024-11-14 | ABCI 3.0: Evolution of the leading AI infrastructure in Japan | Ryousei Takano et.al. | 2411.09134 | null |
2024-11-13 | A Cost-effective, Stand-alone, and Real-time TinyML-Based Gait Diagnosis Unit Aimed at Lower-limb Robotic Prostheses and Exoskeletons | Zarin Anjum Madhiha et.al. | 2411.08474 | null |
2024-11-12 | Towards Vision Mixture of Experts for Wildlife Monitoring on the Edge | Emmanuel Azuh Mensah et.al. | 2411.07834 | null |
2024-11-16 | Enhancing Predictive Maintenance in Mining Mobile Machinery through a TinyML-enabled Hierarchical Inference Network | Raúl de la Fuente et.al. | 2411.07168 | null |
2024-11-11 | A Primer on Word Embeddings: AI Techniques for Text Analysis in Social Work | Brian E. Perron et.al. | 2411.07156 | null |
2024-11-11 | TinyML Security: Exploring Vulnerabilities in Resource-Constrained Machine Learning Systems | Jacob Huckelberry et.al. | 2411.07114 | null |
2024-11-10 | Activation Map Compression through Tensor Decomposition for Deep Learning | Le-Trung Nguyen et.al. | 2411.06346 | link |
2024-11-09 | TinyML NLP Approach for Semantic Wireless Sentiment Classification | Ahmed Y. Radwan et.al. | 2411.06291 | null |
2024-11-03 | Energy-Aware FPGA Implementation of Spiking Neural Network with LIF Neurons | Asmer Hamid Ali et.al. | 2411.01628 | null |
2024-11-01 | On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance | Jaskirat Singh et.al. | 2411.00907 | null |
2024-10-30 | Profiling AI Models: Towards Efficient Computation Offloading in Heterogeneous Edge AI Systems | Juan Marcelo Parra-Ullauri et.al. | 2411.00859 | null |
2024-11-01 | GPT for Games: An Updated Scoping Review (2020-2024) | Daijin Yang et.al. | 2411.00308 | null |
2024-10-31 | Cough-E: A multimodal, privacy-preserving cough detection algorithm for the edge | Stefano Albini et.al. | 2410.24066 | link |
2024-10-28 | FusedInf: Efficient Swapping of DNN Models for On-Demand Serverless Inference Services on the Edge | Sifat Ut Taki et.al. | 2410.21120 | link |
2024-10-28 | Edge Perception: Intelligent Wireless Sensing at Network Edge | Yuanhao Cui et.al. | 2410.21017 | null |
2024-10-25 | Neuromorphic IoT Architecture for Efficient Water Management: A Smart Village Case Study | Mugdim Bublin et.al. | 2410.19562 | null |
2024-10-17 | SouLLMate: An Application Enhancing Diverse Mental Health Support with Adaptive LLMs, Prompt Engineering, and RAG Techniques | Qiming Guo et.al. | 2410.16322 | null |
2024-10-21 | P-YOLOv8: Efficient and Accurate Real-Time Detection of Distracted Driving | Mohamed R. Elshamy et.al. | 2410.15602 | null |
2024-10-15 | SHAKTI: A 2.5 Billion Parameter Small Language Model Optimized for Edge AI and Low-Resource Environments | Syed Abdul Gaffar Shakhadri et.al. | 2410.11331 | null |
2024-10-14 | ABBA-VSM: Time Series Classification using Symbolic Representation on the Edge | Meerzhan Kanatbekova et.al. | 2410.10285 | null |
2024-10-12 | Token Pruning using a Lightweight Background Aware Vision Transformer | Sudhakar Sah et.al. | 2410.09324 | null |
2024-10-11 | MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices | Mohamed Amine Hamdi et.al. | 2410.08855 | link |
2024-10-11 | Edge AI Collaborative Learning: Bayesian Approaches to Uncertainty Estimation | Gleb Radchenko et.al. | 2410.08651 | null |
2024-10-10 | Neural Architecture Search of Hybrid Models for NPU-CIM Heterogeneous AR/VR Devices | Yiwei Zhao et.al. | 2410.08326 | null |
2024-10-10 | L-VITeX: Light-weight Visual Intuition for Terrain Exploration | Antar Mazumder et.al. | 2410.07872 | null |
2024-10-10 | Towards Robust IoT Defense: Comparative Statistics of Attack Detection in Resource-Constrained Scenarios | Zainab Alwaisi et.al. | 2410.07810 | null |
2024-10-10 | vCLIC: Towards Fast Interrupt Handling in Virtualized RISC-V Mixed-criticality Systems | Enrico Zelioli et.al. | 2410.07798 | null |
2024-10-07 | SoK: Towards Security and Safety of Edge AI | Tatjana Wingarz et.al. | 2410.05349 | null |
2024-10-10 | SONAR: A Synthetic AI-Audio Detection Framework and Benchmark | Xiang Li et.al. | 2410.04324 | link |
2024-09-28 | MicroFlow: An Efficient Rust-Based Inference Engine for TinyML | Matteo Carnelos et.al. | 2409.19432 | link |
2024-09-27 | Analog fast Fourier transforms for scalable and efficient signal processing | T. Patrick Xiao et.al. | 2409.19071 | null |
2024-09-26 | Development of an Edge Resilient ML Ensemble to Tolerate ICS Adversarial Attacks | Likai Yao et.al. | 2409.18244 | null |
2024-09-25 | Susceptibility Formulation of Density Matrix Perturbation Theory | Anders M. N. Niklasson et.al. | 2409.17033 | null |
2024-09-25 | Ethical and Scalable Automation: A Governance and Compliance Framework for Business Applications | Haocheng Lin et.al. | 2409.16872 | null |
2024-09-25 | Accelerating TinyML Inference on Microcontrollers through Approximate Kernels | Giorgos Armeniakos et.al. | 2409.16815 | link |
2024-09-23 | Benchmarking Edge AI Platforms for High-Performance ML Inference | Rakshith Jayanth et.al. | 2409.14803 | null |
2024-09-24 | CamelEval: Advancing Culturally Aligned Arabic Language Models and Benchmarks | Zhaozhi Qian et.al. | 2409.12623 | null |
2024-09-17 | AI Suggestions Homogenize Writing Toward Western Styles and Diminish Cultural Nuances | Dhruv Agarwal et.al. | 2409.11360 | null |
2024-09-17 | Optimizing TinyML: The Impact of Reduced Data Acquisition Rates for Time Series Classification on Microcontrollers | Riya Samanta et.al. | 2409.10942 | null |
2024-09-13 | Pushing the boundaries of event subsampling in event-based video classification using CNNs | Hesam Araghi et.al. | 2409.08953 | link |
2024-09-12 | E-QUARTIC: Energy Efficient Edge Ensemble of Convolutional Neural Networks for Resource-Optimized Learning | Le Zhang et.al. | 2409.08369 | null |
2024-09-12 | DiReDi: Distillation and Reverse Distillation for AIoT Applications | Chen Sun et.al. | 2409.08308 | null |
2024-09-11 | A Continual and Incremental Learning Approach for TinyML On-device Training Using Dataset Distillation and Model Size Adaption | Marcus Rüb et.al. | 2409.07114 | null |
2024-09-08 | Transformer with Leveraged Masked Autoencoder for video-based Pain Assessment | Minh-Duc Nguyen et.al. | 2409.05088 | null |
2024-09-02 | Edge AI: Evaluation of Model Compression Techniques for Convolutional Neural Networks | Samer Francy et.al. | 2409.02134 | null |
2024-09-01 | Research on LLM Acceleration Using the High-Performance RISC-V Processor "Xiangshan" (Nanhu Version) Based on the Open-Source Matrix Instruction Set Extension (Vector Dot Product) | Xu-Hao Chen et.al. | 2409.00661 | null |
2024-08-26 | Towards Sustainable Personalized On-Device Human Activity Recognition with TinyML and Cloud-Enabled Auto Deployment | Bidyut Saha et.al. | 2409.00093 | null |
2024-08-29 | TinyTNAS: GPU-Free, Time-Bound, Hardware-Aware Neural Architecture Search for TinyML Time Series Classification | Bidyut Saha et.al. | 2408.16535 | link |
2024-08-08 | An Edge AI System Based on FPGA Platform for Railway Fault Detection | Jiale Li et.al. | 2408.15245 | null |
2024-08-23 | S3Simulator: A benchmarking Side Scan Sonar Simulator dataset for Underwater Image Analysis | Kamal Basha S et.al. | 2408.12833 | link |
2024-08-20 | Pluto and Charon: A Time and Memory Efficient Collaborative Edge AI Framework for Personal LLMs Fine-Tuning | Bei Ouyang et.al. | 2408.10746 | null |
2024-08-21 | Challenges and Responses in the Practice of Large Language Models | Hongyin Zhu et.al. | 2408.09416 | null |
2024-08-15 | Moving Healthcare AI-Support Systems for Visually Detectable Diseases onto Constrained Devices | Tess Watt et.al. | 2408.08215 | null |
2024-08-14 | Efficient Edge AI: Deploying Convolutional Neural Networks on FPGA with the Gemmini Accelerator | Federico Nicolas Peccia et.al. | 2408.07404 | null |
2024-08-13 | Harnessing Earnings Reports for Stock Predictions: A QLoRA-Enhanced LLM Approach | Haowei Ni et.al. | 2408.06634 | null |
2024-08-06 | Training on the Fly: On-device Self-supervised Learning aboard Nano-drones within 20 mW | Elia Cereda et.al. | 2408.03168 | null |
2024-08-05 | Toward Attention-based TinyML: A Heterogeneous Accelerated Architecture and Automated Deployment Flow | Philip Wiese et.al. | 2408.02473 | null |
2024-08-05 | PENDRAM: Enabling High-Performance and Energy-Efficient Processing of Deep Neural Networks through a Generalized DRAM Data Mapping Policy | Rachmad Vidya Wicaksana Putra et.al. | 2408.02412 | null |
2024-08-02 | A Tiny Supervised ODL Core with Auto Data Pruning for Human Activity Recognition | Hiroki Matsutani et.al. | 2408.01283 | null |
2024-07-29 | HOAA: Hybrid Overestimating Approximate Adder for Enhanced Performance Processing Engine | Omkar Kokane et.al. | 2408.00806 | link |
2024-07-31 | TinyChirp: Bird Song Recognition Using TinyML Models on Low-power Wireless Acoustic Sensors | Zhaolan Huang et.al. | 2407.21453 | link |
2024-07-31 | SHA-CNN: Scalable Hierarchical Aware Convolutional Neural Network for Edge AI | Narendra Singh Dhakad et.al. | 2407.21370 | null |
2024-07-30 | On-the-fly Communication-and-Computing to Enable Representation Learning for Distributed Point Clouds | Xu Chen et.al. | 2407.20710 | null |
2024-07-29 | Model Agnostic Hybrid Sharding For Heterogeneous Distributed Inference | Claudio Angione et.al. | 2407.19775 | null |
2024-07-25 | A Sensitivity Analysis of Cellular Automata and Heterogeneous Topology Networks: Partially-Local Cellular Automata and Homogeneous Homogeneous Random Boolean Networks | Tom Eivind Glover et.al. | 2407.18017 | null |
2024-07-22 | StreamTinyNet: video streaming analysis with spatial-temporal TinyML | Hazem Hesham Yousef Shalby et.al. | 2407.17524 | null |
2024-07-22 | KWT-Tiny: RISC-V Accelerated, Embedded Keyword Spotting Transformer | Aness Al-Qawlaq et.al. | 2407.16026 | null |
2024-07-18 | Automated and Holistic Co-design of Neural Networks and ASICs for Enabling In-Pixel Intelligence | Shubha R. Kharel et.al. | 2407.14560 | null |
2024-07-18 | Ultra-Low-Latency Edge Inference for Distributed Sensing | Zhanwei Wang et.al. | 2407.13360 | null |
2024-07-17 | Computing: Looking Back and Moving Forward | Muhammed Golec et.al. | 2407.12558 | null |
2024-07-16 | XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach | Truong Thanh Hung Nguyen et.al. | 2407.11771 | link |
2024-07-18 | Enhancing TinyML Security: Study of Adversarial Attack Transferability | Parin Shah et.al. | 2407.11599 | null |
2024-07-13 | Characterizing Disparity Between Edge Models and High-Accuracy Base Models for Vision Tasks | Zhenyu Wang et.al. | 2407.10016 | null |
2024-07-11 | Towards Efficient Deployment of Hybrid SNNs on Neuromorphic and Edge AI Hardware | James Seekings et.al. | 2407.08704 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-21 | Leveraging Highly Approximated Multipliers in DNN Inference | Georgios Zervakis et.al. | 2412.16757 | null |
2024-12-13 | Panacea: Novel DNN Accelerator using Accuracy-Preserving Asymmetric Quantization and Energy-Saving Bit-Slice Sparsity | Dongyun Kam et.al. | 2412.10059 | null |
2024-12-06 | HiVeGen -- Hierarchical LLM-based Verilog Generation for Scalable Chip Design | Jinwei Tang et.al. | 2412.05393 | null |
2024-12-06 | MC3: Memory Contention based Covert Channel Communication on Shared DRAM System-on-Chips | Ismet Dagli et.al. | 2412.05228 | null |
2024-11-28 | PREBA: A Hardware/Software Co-Design for Multi-Instance GPU based AI Inference Servers | Gwangoo Yeo et.al. | 2411.19114 | null |
2024-12-06 | FAMES: Fast Approximate Multiplier Substitution for Mixed-Precision Quantized DNNs--Down to 2 Bits! | Yi Ren et.al. | 2411.18055 | null |
2024-11-19 | Travel Time Based Task Mapping for NoC-Based DNN Accelerator | Yizhi Chen et.al. | 2411.12710 | null |
2024-10-29 | Systolic Array Data Flows for Efficient Matrix Multiplication in Deep Neural Networks | Tejas Raja et.al. | 2410.22595 | null |
2024-10-21 | Adventures with Grace Hopper AI Super Chip and the National Research Platform | J. Alex Hurt et.al. | 2410.16487 | null |
2024-10-17 | Shavette: Low Power Neural Network Acceleration via Algorithm-level Error Detection and Undervolting | Mikael Rinkinen et.al. | 2410.13415 | null |
2024-10-11 | MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices | Mohamed Amine Hamdi et.al. | 2410.08855 | link |
2024-09-23 | MESC: Re-thinking Algorithmic Priority and/or Criticality Inversions for Heterogeneous MCSs | Jiapeng Guan et.al. | 2409.14837 | null |
2024-10-14 | LoopTree: Exploring the Fused-layer Dataflow Accelerator Design Space | Michael Gilbert et.al. | 2409.13625 | link |
2024-09-13 | Automatic Generation of Fast and Accurate Performance Models for Deep Neural Network Accelerators | Konstantin Lübeck et.al. | 2409.08595 | null |
2024-09-08 | BBS: Bi-directional Bit-level Sparsity for Deep Learning Acceleration | Yuzong Chen et.al. | 2409.05227 | link |
2024-09-08 | HYDRA: Hybrid Data Multiplexing and Run-time Layer Configurable DNN Accelerator | Sonu Kumar et.al. | 2409.04976 | null |
2024-08-27 | SiHGNN: Leveraging Properties of Semantic Graphs for Efficient HGNN Acceleration | Runzhen Xue et.al. | 2408.15089 | null |
2024-08-24 | SiTe CiM: Signed Ternary Computing-in-Memory for Ultra-Low Precision Deep Neural Networks | Niharika Thakuria et.al. | 2408.13617 | null |
2024-08-13 | Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture | Yu Feng et.al. | 2408.06608 | null |
2024-09-24 | Scaling Deep Learning Computation over the Inter-Core Connected Intelligence Processor with T10 | Yiqi Liu et.al. | 2408.04808 | null |
2024-07-30 | Optical Computing for Deep Neural Network Acceleration: Foundations, Recent Developments, and Emerging Directions | Sudeep Pasricha et.al. | 2407.21184 | null |
2024-07-29 | Realizing Unaligned Block-wise Pruning for DNN Acceleration on Mobile Devices | Hayun Lee et.al. | 2407.19644 | null |
2024-07-24 | The Magnificent Seven Challenges and Opportunities in Domain-Specific Accelerator Design for Autonomous Systems | Sabrina M. Neuman et.al. | 2407.17311 | null |
2024-07-17 | StoX-Net: Stochastic Processing of Partial Sums for Efficient In-Memory Computing DNN Accelerators | Ethan G Rogers et.al. | 2407.12378 | null |
2024-07-11 | NinjaLLM: Fast, Scalable and Cost-effective RAG using Amazon SageMaker and AWS Trainium and Inferentia2 | Tengfei Xue et.al. | 2407.12057 | null |
2024-07-22 | ARCO:Adaptive Multi-Agent Reinforcement Learning-Based Hardware/Software Co-Optimization Compiler for Improved Performance in DNN Accelerator Design | Arya Fayyazi et.al. | 2407.08192 | null |
2024-06-20 | SWANN: Shuffling Weights in Crossbar Arrays for Enhanced DNN Accuracy in Deeply Scaled Technologies | Jeffry Victor et.al. | 2406.14706 | null |
2024-06-14 | CMDS: Cross-layer Dataflow Optimization for DNN Accelerators Exploiting Multi-bank Memories | Man Shi et.al. | 2406.14574 | null |
2024-06-15 | Memory Faults in Activation-sparse Quantized Deep Neural Networks: Analysis and Mitigation using Sharpness-aware Training | Akul Malhotra et.al. | 2406.10528 | null |
2024-07-17 | Cross-Modality Program Representation Learning for Electronic Design Automation with High-Level Synthesis | Zongyue Qin et.al. | 2406.09606 | null |
2024-06-05 | HASS: Hardware-Aware Sparsity Search for Dataflow DNN Accelerator | Zhewen Yu et.al. | 2406.03088 | link |
2024-06-03 | A 0.96pJ/SOP, 30.23K-neuron/mm^2 Heterogeneous Neuromorphic Chip With Fullerene-like Interconnection Topology for Edge-AI Computing | P. J. Zhou et.al. | 2406.01151 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-30 | Adversarial Attack and Defense for LoRa Device Identification and Authentication via Deep Learning | Yalin E. Sagduyu et.al. | 2412.21164 | null |
2024-12-30 | Efficient Multi-Task Inferencing with a Shared Backbone and Lightweight Task-Specific Adapters for Automatic Scoring | Ehsan Latif et.al. | 2412.21065 | null |
2024-12-30 | DoTA: Weight-Decomposed Tensor Adaptation for Large Language Models | Xiaolin Hu et.al. | 2412.20891 | null |
2024-12-30 | Dual-Space Augmented Intrinsic-LoRA for Wind Turbine Segmentation | Shubh Singhal et.al. | 2412.20838 | null |
2024-12-30 | VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control | Shaojin Wu et.al. | 2412.20800 | link |
2025-01-02 | EraseAnything: Enabling Concept Erasure in Rectified Flow Transformers | Daiheng Gao et.al. | 2412.20413 | null |
2024-12-28 | Multi-Modality Driven LoRA for Adverse Condition Depth Estimation | Guanglei Yang et.al. | 2412.20162 | null |
2024-12-28 | VELoRA: A Low-Rank Adaptation Approach for Efficient RGB-Event based Recognition | Lan Chen et.al. | 2412.20064 | link |
2024-12-28 | Adaptive Parameter-Efficient Federated Fine-Tuning on Heterogeneous Devices | Jun Liu et.al. | 2412.20004 | null |
2024-12-27 | Gradient Weight-normalized Low-rank Projection for Efficient LLM Training | Jia-Hong Huang et.al. | 2412.19616 | link |
2024-12-27 | Performance Evaluation of IoT LoRa Networks on Mars Through ns-3 Simulations | Manuele Favero et.al. | 2412.19549 | link |
2024-12-27 | KALAHash: Knowledge-Anchored Low-Resource Adaptation for Deep Hashing | Shu Zhao et.al. | 2412.19417 | null |
2024-12-25 | Optimizing Large Language Models with an Enhanced LoRA Fine-Tuning Algorithm for Efficiency and Robustness in NLP Tasks | Jiacheng Hu et.al. | 2412.18729 | null |
2024-12-24 | Research on the Proximity Relationships of Psychosomatic Disease Knowledge Graph Modules Extracted by Large Language Models | Zihan Zhou et.al. | 2412.18419 | null |
2024-12-18 | Enhancing Knowledge Distillation for LLMs with Response-Priming Prompting | Vijay Goyal et.al. | 2412.17846 | link |
2024-12-25 | DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder | Ente Lin et.al. | 2412.17644 | null |
2024-12-23 | Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing | Prakash Aryan et.al. | 2412.17548 | link |
2024-12-21 | Label Privacy in Split Learning for Large Models with Parameter-Efficient Training | Philip Zmushko et.al. | 2412.16669 | link |
2024-12-20 | Adaptable and Precise: Enterprise-Scenario LLM Function-Calling Capability Training Pipeline | Guancheng Zeng et.al. | 2412.15660 | null |
2024-12-23 | CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training | Xiuli Bi et.al. | 2412.15646 | link |
2024-12-20 | AutoRank: MCDA Based Rank Personalization for LoRA-Enabled Distributed Learning | Shuaijun Chen et.al. | 2412.15553 | null |
2024-12-19 | Knowledge Injection via Prompt Distillation | Kalle Kujanpää et.al. | 2412.14964 | null |
2024-12-20 | All-in-One Tuning and Structural Pruning for Domain-Specific LLMs | Lei Lu et.al. | 2412.14426 | null |
2024-12-18 | CoRa: A Collision-Resistant LoRa Symbol Detector of Low Complexity | José Álamos et.al. | 2412.13930 | null |
2024-12-18 | A Comprehensive Evaluation of Parameter-Efficient Fine-Tuning on Method-Level Code Smell Detection | Beiqi Zhang et.al. | 2412.13801 | link |
2024-12-18 | Large Language Model Federated Learning with Blockchain and Unlearning for Cross-Organizational Collaboration | Xuhan Zuo et.al. | 2412.13551 | null |
2024-12-18 | Refining Salience-Aware Sparse Fine-Tuning Strategies for Language Models | Xinxin Liu et.al. | 2412.13488 | null |
2024-12-18 | Transducer Tuning: Efficient Model Adaptation for Software Tasks Using Code Property Graphs | Imam Nur Bani Yusuf et.al. | 2412.13467 | link |
2024-12-17 | Expansion Span: Combining Fading Memory and Retrieval in Hybrid State Space Models | Elvis Nunez et.al. | 2412.13328 | null |
2024-12-17 | FineGates: LLMs Finetuning with Compression using Stochastic Gates | Jonathan Svirsky et.al. | 2412.12951 | null |
2024-12-17 | Enhancing Naturalness in LLM-Generated Utterances through Disfluency Insertion | Syed Zohaib Hassan et.al. | 2412.12710 | null |
2024-12-17 | Train More Parameters But Mind Their Placement: Insights into Language Adaptation with PEFT | Jenny Kunz et.al. | 2412.12674 | link |
2024-12-17 | NLSR: Neuron-Level Safety Realignment of Large Language Models Against Harmful Fine-Tuning | Xin Yi et.al. | 2412.12497 | link |
2024-12-16 | Visual Instruction Tuning with 500x Fewer Parameters through Modality Linear Representation-Steering | Jinhe Bi et.al. | 2412.12359 | link |
2024-12-16 | Can video generation replace cinematographers? Research on the cinematic language of generated video | Xiaozhe Li et.al. | 2412.12223 | null |
2024-12-16 | A LoRA is Worth a Thousand Pictures | Chenxi Liu et.al. | 2412.12048 | null |
2024-12-16 | The Open Source Advantage in Large Language Models (LLMs) | Jiya Manchanda et.al. | 2412.12004 | null |
2024-12-17 | No More Adam: Learning Rate Scaling at Initialization is All You Need | Minghao Xu et.al. | 2412.11768 | link |
2024-12-16 | IDEA-Bench: How Far are Generative Models from Professional Designing? | Chen Liang et.al. | 2412.11767 | link |
2024-12-16 | Adapting Segment Anything Model (SAM) to Experimental Datasets via Fine-Tuning on GAN-based Simulation: A Case Study in Additive Manufacturing | Anika Tabassum et.al. | 2412.11381 | link |
2024-12-16 | FinLoRA: Finetuning Quantized Financial Large Language Models Using Low-Rank Adaptation | Dannong Wang et.al. | 2412.11378 | null |
2024-12-15 | Separate the Wheat from the Chaff: A Post-Hoc Approach to Safety Re-Alignment for Fine-Tuned Language Models | Di Wu et.al. | 2412.11041 | null |
2024-12-15 | SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation | Hang Zhang et.al. | 2412.11026 | null |
2024-12-14 | Efficient Adaptation of Multilingual Models for Japanese ASR | Mark Bajo et.al. | 2412.10705 | link |
2024-12-13 | SafetyDPO: Scalable Safety Alignment for Text-to-Image Generation | Runtao Liu et.al. | 2412.10493 | null |
2024-12-13 | OP-LoRA: The Blessing of Dimensionality | Piotr Teterwak et.al. | 2412.10362 | null |
2024-12-16 | ASLoRA: Adaptive Sharing Low-Rank Adaptation Across Layers | Junyan Hu et.al. | 2412.10135 | null |
2024-12-13 | CaLoRAify: Calorie Estimation with Visual-Text Pairing and LoRA-Driven Visual Language Models | Dongyu Yao et.al. | 2412.09936 | link |
2024-12-13 | Low-Rank Adaptation with Task-Relevant Feature Enhancement for Fine-tuning Language Models | Changqun Li et.al. | 2412.09827 | null |
2024-12-12 | LoRACLR: Contrastive Adaptation for Customization of Diffusion Models | Enis Simsar et.al. | 2412.09622 | null |
2024-12-12 | EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM | Zhuofan Zong et.al. | 2412.09618 | null |
2024-12-12 | Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition | Zhisheng Zhong et.al. | 2412.09501 | link |
2024-12-15 | GeLoRA: Geometric Adaptive Ranks For Efficient LoRA Fine-tuning | Abdessalam Ed-dib et.al. | 2412.09250 | null |
2024-12-12 | RAD: Region-Aware Diffusion Models for Image Inpainting | Sora Kim et.al. | 2412.09191 | null |
2024-12-12 | DECOR:Decomposition and Projection of Text Embeddings for Text-to-Image Customization | Geonhui Jang et.al. | 2412.09169 | null |
2024-12-12 | MoSLD: An Extremely Parameter-Efficient Mixture-of-Shared LoRAs for Multi-Task Learning | Lulu Zhao et.al. | 2412.08946 | null |
2024-12-11 | DMin: Scalable Training Data Influence Estimation for Diffusion Models | Huawei Lin et.al. | 2412.08637 | link |
2024-12-10 | Accretion onto WD 2226 |
S. Estrada-Dorado et.al. | 2412.07863 | null |
2024-12-10 | PETALface: Parameter Efficient Transfer Learning for Low-resolution Face Recognition | Kartik Narayan et.al. | 2412.07771 | null |
2024-12-10 | LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models | Ziqi Lu et.al. | 2412.07746 | null |
2024-12-10 | ChocoLlama: Lessons Learned From Teaching Llamas Dutch | Matthieu Meeus et.al. | 2412.07633 | null |
2024-12-10 | MoDULA: Mixture of Domain-Specific and Universal LoRA for Multi-Task Learning | Yufei Ma et.al. | 2412.07405 | null |
2024-12-10 | Attention Head Purification: A New Perspective to Harness CLIP for Domain Generalization | Yingfan Wang et.al. | 2412.07226 | null |
2024-12-09 | Optimal Routing and Link Configuration for Covert Heterogeneous Wireless Networks | Amna Gillani et.al. | 2412.07059 | null |
2024-12-09 | Sequential Compression Layers for Efficient Federated Learning in Foundational Models | Navyansh Mahla et.al. | 2412.07021 | null |
2024-12-09 | BoRA: Bi-dimensional Weight-Decomposed Low-Rank Adaptation | Qiushi Wang et.al. | 2412.06441 | null |
2024-12-10 | S |
Xinyu Yang et.al. | 2412.06289 | null |
2024-12-08 | Enhanced Computationally Efficient Long LoRA Inspired Perceiver Architectures for Auto-Regressive Language Modeling | Kaleel Mahmood et.al. | 2412.06106 | null |
2024-12-08 | KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models | Fan Wang et.al. | 2412.06071 | link |
2024-12-07 | Training-Free Bayesianization for Low-Rank Adapters of Large Language Models | Haizhou Shi et.al. | 2412.05723 | link |
2024-12-07 | Plasmonic Electro-Optic Modulators based on Epsilon-Near-Zero Materials: Comparing the Classical Drift-Diffusion and Schrödinger-Poisson Coupling Models | Masoud Shabaninezhad et.al. | 2412.05690 | null |
2024-12-06 | QueEn: A Large Language Model for Quechua-English Translation | Junhao Chen et.al. | 2412.05184 | null |
2024-12-06 | LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation | Donald Shenaj et.al. | 2412.05148 | null |
2024-12-05 | Performance Evaluation of LoRa Technology for Rural Connectivity: An Experimental Analysis in Nepal | Atit Pokharel et.al. | 2412.04563 | null |
2024-12-04 | Prompting Large Language Models for Clinical Temporal Relation Extraction | Jianping He et.al. | 2412.04512 | null |
2024-12-05 | UnZipLoRA: Separating Content and Style from a Single Image | Chang Liu et.al. | 2412.04465 | null |
2024-12-08 | Discriminative Fine-tuning of LVLMs | Yassine Ouali et.al. | 2412.04378 | null |
2024-12-05 | Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts | Chenyang Zhu et.al. | 2412.04220 | null |
2024-12-05 | SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning | Seokju Yun et.al. | 2412.04077 | link |
2024-12-04 | Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis | Davide Bucciarelli et.al. | 2412.03665 | null |
2024-12-04 | Imagine360: Immersive 360 Video Generation from Perspective Anchor | Jing Tan et.al. | 2412.03552 | null |
2024-12-04 | DIVE: Taming DINO for Subject-Driven Video Editing | Yi Huang et.al. | 2412.03347 | null |
2024-12-04 | Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach | Lingchen Sun et.al. | 2412.03017 | link |
2024-12-03 | EvRT-DETR: The Surprising Effectiveness of DETR-based Detection for Event Cameras | Dmitrii Torbunov et.al. | 2412.02890 | null |
2024-12-03 | Explainable CTR Prediction via LLM Reasoning | Xiaohan Yu et.al. | 2412.02588 | null |
2024-12-03 | LoRA Diffusion: Zero-Shot LoRA Synthesis for Diffusion Model Personalization | Ethan Smith et.al. | 2412.02352 | null |
2024-12-03 | SimuScope: Realistic Endoscopic Synthetic Dataset Generation through Surgical Simulation and Diffusion Models | Sabina Martyniak et.al. | 2412.02332 | link |
2024-12-03 | Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs | Zixuan Hu et.al. | 2412.02220 | null |
2024-12-02 | Optimizing LoRa for Edge Computing with TinyML Pipeline for Channel Hopping | Marla Grunewald et.al. | 2412.01609 | null |
2024-12-02 | CellSeg1: Robust Cell Segmentation with One Training Image | Peilin Zhou et.al. | 2412.01410 | link |
2024-12-02 | Efficient LLM Inference using Dynamic Input Pruning and Cache-Aware Masking | Marco Federici et.al. | 2412.01380 | null |
2024-12-02 | MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost | Sen Xing et.al. | 2412.01271 | null |
2024-12-02 | RILQ: Rank-Insensitive LoRA-based Quantization Error Compensation for Boosting 2-bit Large Language Model Accuracy | Geonho Lee et.al. | 2412.01129 | null |
2024-12-03 | Adaptive Rank, Reduced Forgetting: Knowledge Retention in Continual Learning Vision-Language Models with Dynamic Rank-Selective LoRA | Haodong Lu et.al. | 2412.01004 | null |
2024-11-29 | SURE-VQA: Systematic Understanding of Robustness Evaluation in Medical VQA Tasks | Kim-Celine Kahl et.al. | 2411.19688 | link |
2024-11-29 | Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning | Kaustubh Ponkshe et.al. | 2411.19557 | link |
2024-11-28 | PEFT-as-an-Attack! Jailbreaking Language Models during Federated Parameter-Efficient Fine-Tuning | Shenghui Li et.al. | 2411.19335 | null |
2024-11-28 | Enhancing Parameter-Efficient Fine-Tuning of Vision Transformers through Frequency-Based Adaptation | Son Thai Ly et.al. | 2411.19297 | link |
2024-11-28 | LoRA of Change: Learning to Generate LoRA for the Editing Instruction from A Single Before-After Image Pair | Xue Song et.al. | 2411.19156 | null |
2024-11-28 | DESIRE: Dynamic Knowledge Consolidation for Rehearsal-Free Continual Learning | Haiyang Guo et.al. | 2411.19154 | null |
2024-11-28 | Personalized Federated Fine-Tuning for LLMs via Data-Driven Heterogeneous Model Architectures | Yicheng Zhang et.al. | 2411.19128 | link |
2024-11-27 | Challenges in Adapting Multilingual LLMs to Low-Resource Languages using LoRA PEFT Tuning | Omkar Khade et.al. | 2411.18571 | null |
2024-11-27 | Emergence of Self-Identity in AI: A Mathematical Framework and Empirical Study with Generative Large Language Models | Minhyeok Lee et.al. | 2411.18530 | link |
2024-11-27 | Adaptive Blind All-in-One Image Restoration | David Serrano-Lozano et.al. | 2411.18412 | link |
2024-11-27 | Thai Financial Domain Adaptation of THaLLE -- Technical Report | KBTG Labs et.al. | 2411.18242 | null |
2024-11-27 | ROICtrl: Boosting Instance Control for Visual Generation | Yuchao Gu et.al. | 2411.17949 | null |
2024-11-26 | Pretrained LLM Adapted with LoRA as a Decision Transformer for Offline RL in Quantitative Trading | Suyeol Yun et.al. | 2411.17900 | link |
2024-11-26 | Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation | Sudarshan Rajagopalan et.al. | 2411.17814 | null |
2024-11-26 | PEFTGuard: Detecting Backdoor Attacks Against Parameter-Efficient Fine-Tuning | Zhen Sun et.al. | 2411.17453 | null |
2024-11-26 | CLOVER: Constrained Learning with Orthonormal Vectors for Eliminating Redundancy | Fanxu Meng et.al. | 2411.17426 | null |
2024-11-26 | Efficient Deployment of Transformer Models in Analog In-Memory Computing Hardware | Chen Li et.al. | 2411.17367 | link |
2024-11-26 | ThreatModeling-LLM: Automating Threat Modeling using Large Language Models for Banking System | Shuiqiao Yang et.al. | 2411.17058 | null |
2024-11-26 | PersonalVideo: High ID-Fidelity Video Customization without Dynamic and Semantic Degradation | Hengjia Li et.al. | 2411.17048 | null |
2024-11-25 | RECAST: Reparameterized, Compact weight Adaptation for Sequential Tasks | Nazia Tasnim et.al. | 2411.16870 | null |
2024-11-25 | Parameter Efficient Instruction Tuning: An Empirical Study | Pengfei He et.al. | 2411.16775 | null |
2024-11-23 | LoBAM: LoRA-Based Backdoor Attack on Model Merging | Ming Yin et.al. | 2411.16746 | null |
2024-11-24 | Modality Alignment Meets Federated Broadcasting | Yuting Ma et.al. | 2411.15837 | null |
2024-11-24 | LoRA-Mini : Adaptation Matrices Decomposition and Selective Training | Ayush Singh et.al. | 2411.15804 | null |
2024-11-23 | Reassessing Layer Pruning in LLMs: New Insights and Methods | Yao Lu et.al. | 2411.15558 | link |
2024-11-23 | Gradient dynamics for low-rank fine-tuning beyond kernels | Arif Kerem Dayi et.al. | 2411.15385 | null |
2024-11-22 | On the Impact of Fine-Tuning on Chain-of-Thought Reasoning | Elita Lobo et.al. | 2411.15382 | null |
2024-11-22 | ElastiFormer: Learned Redundancy Reduction in Transformer via Self-Distillation | Junzhang Liu et.al. | 2411.15281 | null |
2024-11-21 | IterIS: Iterative Inference-Solving Alignment for LoRA Merging | Hongxu Chen et.al. | 2411.15231 | null |
2024-11-22 | Exploring Foundation Models Fine-Tuning for Cytology Classification | Manon Dausort et.al. | 2411.14975 | link |
2024-11-22 | LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement | Jieming Bian et.al. | 2411.14961 | null |
2024-11-21 | Interpreting seasonal and interannual Hadley cell descending edge migrations via the cell-mean Rossby number | Spencer A Hill et.al. | 2411.14544 | null |
2024-11-21 | Multi LoRA Meets Vision: Merging multiple adapters to create a multi task model | Ege Kesim et.al. | 2411.14064 | null |
2024-11-21 | Separable Mixture of Low-Rank Adaptation for Continual Visual Instruction Tuning | Ziqi Wang et.al. | 2411.13949 | null |
2024-11-21 | Dressing the Imagination: A Dataset for AI-Powered Translation of Text into Fashion Outfits and A Novel KAN Adapter for Enhanced Feature Adaptation | Gayatri Deshmukh et.al. | 2411.13901 | null |
2024-11-21 | AutoMixQ: Self-Adjusting Quantization for High Performance Memory-Efficient Fine-Tuning | Changhai Zhou et.al. | 2411.13814 | null |
2024-11-20 | Unleashing the Power of Large Language Models for Group POI Recommendations | Jing Long et.al. | 2411.13415 | null |
2024-11-20 | On the Way to LLM Personalization: Learning to Remember User Conversations | Lucie Charlotte Magister et.al. | 2411.13405 | null |
2024-11-19 | Visual Cue Enhancement and Dual Low-Rank Adaptation for Efficient Visual Instruction Fine-Tuning | Pengkun Jiao et.al. | 2411.12787 | null |
2024-11-16 | LoRA Unlearns More and Retains More (Student Abstract) | Atharv Mittal et.al. | 2411.11907 | link |
2024-11-18 | SeqProFT: Applying LoRA Finetuning for Sequence-only Protein Property Predictions | Shuo Zhang et.al. | 2411.11530 | null |
2024-11-16 | Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts | Jinqiang Long et.al. | 2411.10669 | link |
2024-11-15 | AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment | Yonggan Fu et.al. | 2411.10606 | link |
2024-11-15 | Towards Multi-View Consistent Style Transfer with One-Step Diffusion via Vision Conditioning | Yushen Zuo et.al. | 2411.10130 | null |
2024-11-15 | LoRA-LiteE: A Computationally Efficient Framework for Chatbot Preference-Tuning | Yahe Yang et.al. | 2411.09947 | null |
2024-11-12 | Structured Pattern Expansion with Diffusion Models | Marzia Riso et.al. | 2411.08930 | null |
2024-11-13 | Dynamic Subset Tuning: Expanding the Operational Range of Parameter-Efficient Training for Large Language Models | Felix Stahlberg et.al. | 2411.08610 | null |
2024-11-13 | Machine Unlearning on Pre-trained Models by Residual Feature Alignment Using LoRA | Laiqiao Qin et.al. | 2411.08443 | null |
2024-11-11 | LoRA-BERT: a Natural Language Processing Model for Robust and Accurate Prediction of long non-coding RNAs | Nicholas Jeon et.al. | 2411.08073 | null |
2024-11-12 | FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable Training | Philip Zmushko et.al. | 2411.07837 | link |
2024-11-12 | Efficient Federated Finetuning of Tiny Transformers with Resource-Constrained Devices | Kilian Pfeiffer et.al. | 2411.07826 | null |
2024-11-12 | Federated Low-Rank Adaptation with Differential Privacy over Wireless Networks | Tianqu Kang et.al. | 2411.07806 | null |
2024-11-12 | ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization | Weibo Zhao et.al. | 2411.07762 | null |
2024-11-11 | DeepONet as a Multi-Operator Extrapolation Model: Distributed Pretraining with Physics-Informed Fine-Tuning | Zecheng Zhang et.al. | 2411.07239 | null |
2024-11-11 | Invar-RAG: Invariant LLM-aligned Retrieval for Better Generation | Ziwei Liu et.al. | 2411.07021 | null |
2024-11-11 | MapSAM: Adapting Segment Anything Model for Automated Feature Detection in Historical Maps | Xue Xia et.al. | 2411.06971 | null |
2024-11-11 | LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models | Runming Yang et.al. | 2411.06839 | null |
2024-11-10 | Federated LLMs Fine-tuned with Adaptive Importance-Aware LoRA | Yang Su et.al. | 2411.06581 | null |
2024-11-10 | Prompt-Efficient Fine-Tuning for GPT-like Deep Models to Reduce Hallucination and to Improve Reproducibility in Scientific Text Generation Using Stochastic Optimisation Techniques | Daniil Sulimov et.al. | 2411.06445 | null |
2024-11-08 | Energy Efficient Protein Language Models: Leveraging Small Language Models with LoRA for Controllable Protein Generation | Aayush Shah et.al. | 2411.05966 | null |
2024-11-08 | Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation | Xiwen Wei et.al. | 2411.05663 | link |
2024-11-08 | SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models | Muyang Li et.al. | 2411.05007 | link |
2024-11-07 | DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion | Wenqiang Sun et.al. | 2411.04928 | null |
2024-11-07 | StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration | Panwen Hu et.al. | 2411.04925 | null |
2024-11-07 | LLM-R: A Framework for Domain-Adaptive Maintenance Scheme Generation Combining Hierarchical Agents and RAG | Laifa Tao et.al. | 2411.04476 | null |
2024-11-09 | Variational Low-Rank Adaptation Using IVON | Bai Cong et.al. | 2411.04421 | link |
2024-11-08 | Robust and Efficient Fine-tuning of LLMs with Bayesian Reparameterization of Low-Rank Adaptation | Ayan Sengupta et.al. | 2411.04358 | link |
2024-11-06 | PyroGuardian: An IoT-Enabled System for Health and Location Monitoring in High-Risk Firefighting Environments | Berkay Kaplan et.al. | 2411.03654 | null |
2024-11-05 | LLM-based Framework for Bearing Fault Diagnosis | Laifa Tao et.al. | 2411.02718 | null |
2024-11-04 | TeleOracle: Fine-Tuned Retrieval-Augmented Generation with Long-Context Support for Network | Nouf Alabbasi et.al. | 2411.02617 | link |
2024-11-04 | Parameter-Efficient Fine-Tuning of Large Language Models for Unit Test Generation: An Empirical Study | André Storhaug et.al. | 2411.02462 | null |
2024-11-04 | Expanding Sparse Tuning for Low Memory Usage | Shufan Shen et.al. | 2411.01800 | link |
2024-11-02 | PMoL: Parameter Efficient MoE for Preference Mixing of LLM Alignment | Dongxu Liu et.al. | 2411.01245 | null |
2024-11-02 | One Arrow, Many Targets: Probing LLMs for Multi-Attribute Controllable Text Summarization | Tathagato Roy et.al. | 2411.01213 | null |
2024-11-02 | Hollowed Net for On-Device Personalization of Text-to-Image Diffusion Models | Wonguk Cho et.al. | 2411.01179 | null |
2024-11-02 | LoRA-Contextualizing Adaptation of Large Multimodal Models for Long Document Understanding | Jian Chen et.al. | 2411.01106 | null |
2024-11-01 | V-LoRA: An Efficient and Flexible System Boosts Vision Applications with LoRA LMM | Liang Mi et.al. | 2411.00915 | null |
2024-11-01 | Dual Low-Rank Adaptation for Continual Learning with Pre-Trained Models | Huancheng Chen et.al. | 2411.00623 | null |
2024-10-31 | DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion | Weicai Ye et.al. | 2410.24203 | link |
2024-11-05 | In-Context LoRA for Diffusion Transformers | Lianghua Huang et.al. | 2410.23775 | link |
2024-10-30 | Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise Matrix Estimation | Stefan Stojanovic et.al. | 2410.23434 | null |
2024-10-31 | SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation | Yining Hong et.al. | 2410.23277 | null |
2024-10-31 | Why Gradient Subspace? Identifying and Mitigating LoRA's Bottlenecks in Federated Fine-Tuning of Large Language Models | Navyansh Mahla et.al. | 2410.23111 | null |
2024-10-30 | Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation | Wei Dong et.al. | 2410.22952 | null |
2024-10-30 | CopRA: A Progressive LoRA Training Strategy | Zhan Zhuang et.al. | 2410.22911 | null |
2024-10-30 | Towards Robust and Efficient Federated Low-Rank Adaptation with Heterogeneous Clients | Jabin Koo et.al. | 2410.22815 | null |
2024-10-30 | MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning | Xujia Wang et.al. | 2410.22782 | null |
2024-10-29 | Meta-Learning Adaptable Foundation Models | Jacob L. Block et.al. | 2410.22264 | null |
2024-10-30 | IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models | Hang Guo et.al. | 2410.21759 | link |
2024-10-28 | LoRA vs Full Fine-tuning: An Illusion of Equivalence | Reece Shuttleworth et.al. | 2410.21228 | null |
2024-10-28 | Skip2-LoRA: A Lightweight On-device DNN Fine-tuning Method for Low-cost Edge Devices | Hiroki Matsutani et.al. | 2410.21073 | null |
2024-10-28 | KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge Distillation | Rambod Azimi et.al. | 2410.20777 | link |
2024-10-28 | Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA | Sangmin Bae et.al. | 2410.20672 | null |
2024-10-28 | PepDoRA: A Unified Peptide Language Model via Weight-Decomposed Low-Rank Adaptation | Leyao Wang et.al. | 2410.20667 | null |
2024-10-28 | Collaborative Knowledge Fusion: A Novel Approach for Multi-task Recommender Systems via LLMs | Chuang Zhao et.al. | 2410.20642 | null |
2024-10-27 | LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA Optimization | Jui-Nan Yen et.al. | 2410.20625 | null |
2024-10-27 | FoldMark: Protecting Protein Generative Models with Watermarking | Zaixi Zhang et.al. | 2410.20354 | link |
2024-10-26 | An Efficient Watermarking Method for Latent Diffusion Models via Low-Rank Adaptation | Dongdong Lin et.al. | 2410.20202 | null |
2024-10-25 | Model merging with SVD to tie the Knots | George Stoica et.al. | 2410.19735 | link |
2024-10-25 | Less is More: Extreme Gradient Boost Rank-1 Adaption for Efficient Finetuning of LLMs | Yifei Zhang et.al. | 2410.19694 | null |
2024-10-25 | GeoLLaVA: Efficient Fine-Tuned Vision-Language Models for Temporal Change Detection in Remote Sensing | Hosam Elgendy et.al. | 2410.19552 | link |
2024-10-24 | Tailored-LLaMA: Optimizing Few-Shot Learning in Pruned LLaMA Models with Task-Specific Prompts | Danyal Aftab et.al. | 2410.19185 | null |
2024-10-24 | On the Crucial Role of Initialization for Matrix Factorization | Bingcong Li et.al. | 2410.18965 | null |
2024-10-24 | PSY: Posterior Sampling Based Privacy Enhancer in Large Language Models | Yulian Sun et.al. | 2410.18824 | null |
2024-10-24 | GeoLoRA: Geometric integration for parameter efficient fine-tuning | Steffen Schotthöfer et.al. | 2410.18720 | null |
2024-10-24 | Ali-AUG: Innovative Approaches to Labeled Data Augmentation using One-Step Diffusion Model | Ali Hamza et.al. | 2410.18678 | null |
2024-10-23 | CLEAR: Character Unlearning in Textual and Visual Modalities | Alexey Dontsov et.al. | 2410.18057 | null |
2024-10-23 | MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning | Jingfan Zhang et.al. | 2410.18035 | null |
2024-10-23 | Closed-form merging of parameter-efficient modules for Federated Continual Learning | Riccardo Salami et.al. | 2410.17961 | null |
2024-10-23 | AdaRankGrad: Adaptive Gradient-Rank and Moments for Memory-Efficient LLMs Training and Fine-Tuning | Yehonathan Refael et.al. | 2410.17881 | null |
2024-10-23 | Understanding Layer Significance in LLM Alignment | Guangyuan Shi et.al. | 2410.17875 | null |
2024-10-23 | VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning | Yifan Peng et.al. | 2410.17485 | null |
2024-10-22 | FairLoRA: Unpacking Bias Mitigation in Vision Models with Fairness-Driven Low-Rank Adaptation | Rohan Sukumaran et.al. | 2410.17358 | null |
2024-10-22 | Insights on Disagreement Patterns in Multimodal Safety Perception across Diverse Rater Groups | Charvi Rastogi et.al. | 2410.17032 | null |
2024-10-23 | GeoCode-GPT: A Large Language Model for Geospatial Code Generation Tasks | Shuyang Hou et.al. | 2410.17031 | null |
2024-10-22 | LoRA-C: Parameter-Efficient Fine-Tuning of Robust CNN for IoT Devices | Chuntao Ding et.al. | 2410.16954 | link |
2024-10-22 | Can Large Language Models Act as Ensembler for Multi-GNNs? | Hanqi Duan et.al. | 2410.16822 | null |
2024-10-22 | Controlled Low-Rank Adaptation with Subspace Regularization for Continued Training on Large Language Models | Yuheng Lu et.al. | 2410.16801 | null |
2024-10-22 | MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report | Samrajya Thapa et.al. | 2410.16239 | link |
2024-10-21 | Beyond 2:4: exploring V:N:M sparsity for efficient transformer inference on GPUs | Kang Zhao et.al. | 2410.16135 | null |
2024-10-21 | Natural GaLore: Accelerating GaLore for memory-efficient LLM Training and Fine-tuning | Arijit Das et.al. | 2410.16029 | link |
2024-10-21 | How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making? | Zuojin Tang et.al. | 2410.15885 | null |
2024-10-21 | The effect of fine-tuning on language model toxicity | Will Hawkins et.al. | 2410.15821 | link |
2024-10-21 | Habaek: High-performance water segmentation through dataset expansion and inductive bias optimization | Hanseon Joo et.al. | 2410.15794 | link |
2024-10-21 | Students Rather Than Experts: A New AI For Education Pipeline To Model More Human-Like And Personalised Early Adolescences | Yiping Ma et.al. | 2410.15701 | null |
2024-10-20 | MIRA: A Method of Federated MultI-Task Learning for LaRge LAnguage Models | Ahmed Elbakary et.al. | 2410.15524 | null |
2024-10-20 | EVA: An Embodied World Model for Future Video Anticipation | Xiaowei Chi et.al. | 2410.15461 | null |
2024-10-20 | LoRA-IR: Taming Low-Rank Experts for Efficient All-in-One Image Restoration | Yuang Ai et.al. | 2410.15385 | link |
2024-10-18 | Fine-Tuning DeepONets to Enhance Physics-informed Neural Networks for solving Partial Differential Equations | Sidi Wu et.al. | 2410.14134 | null |
2024-10-17 | FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model | ZiDong Wang et.al. | 2410.13925 | link |
2024-10-17 | Improving Multi-modal Large Language Model through Boosting Vision Capabilities | Yanpeng Sun et.al. | 2410.13733 | null |
2024-10-17 | LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning | Yiming Shi et.al. | 2410.13618 | link |
2024-10-18 | MoR: Mixture of Ranks for Low-Rank Adaptation Tuning | Chuanyu Tang et.al. | 2410.13408 | null |
2024-10-17 | FAMSeC: A Few-shot-sample-based General AI-generated Image Detection Method | Juncong Xu et.al. | 2410.13156 | null |
2024-10-16 | LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks | Akshara Prabhakar et.al. | 2410.13025 | link |
2024-10-16 | DEeR: Deviation Eliminating and Noise Regulating for Privacy-preserving Federated Low-rank Adaptation | Meilu Zhu et.al. | 2410.12926 | link |
2024-10-15 | In-context KV-Cache Eviction for LLMs via Attention-Gate | Zihao Zeng et.al. | 2410.12876 | null |
2024-10-16 | FiRST: Finetuning Router-Selective Transformers for Input-Adaptive Latency Reduction | Akriti Jain et.al. | 2410.12513 | null |
2024-10-15 | LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models | Hossein Abdi et.al. | 2410.11551 | null |
2024-10-15 | Transfer Learning with Foundational Models for Time Series Forecasting using Low-Rank Adaptations | M. Germán-Morales et.al. | 2410.11539 | null |
2024-10-15 | Energy Efficient Transmission Parameters Selection Method Using Reinforcement Learning in Distributed LoRa Networks | Ryotai Airiyoshi et.al. | 2410.11270 | null |
2024-10-14 | Improving the Language Understanding Capabilities of Large Language Models Using Reinforcement Learning | Bokai Hu et.al. | 2410.11020 | null |
2024-10-14 | LoLCATs: On Low-Rank Linearizing of Large Language Models | Michael Zhang et.al. | 2410.10254 | link |
2024-10-14 | Fed-piLot: Optimizing LoRA Assignment for Efficient Federated Foundation Model Fine-Tuning | Zikai Zhang et.al. | 2410.10200 | null |
2024-10-14 | Scalable Multi-Domain Adaptation of Language Models using Modular Experts | Peter Schafhalter et.al. | 2410.10181 | null |
2024-10-14 | Is Parameter Collision Hindering Continual Learning in LLMs? | Shuo Yang et.al. | 2410.10179 | link |
2024-10-14 | AlphaLoRA: Assigning LoRA Experts Based on Layer Training Quality | Peijun Qing et.al. | 2410.10054 | link |
2024-10-13 | Retrieval Instead of Fine-tuning: A Retrieval-based Parameter Ensemble for Zero-shot Learning | Pengfei Jin et.al. | 2410.09908 | null |
2024-10-13 | A Quantum Circuit-Based Compression Perspective for Parameter-Efficient Learning | Chen-Yu Liu et.al. | 2410.09846 | null |
2024-10-13 | Understanding Robustness of Parameter-Efficient Tuning for Image Classification | Jiacheng Ruan et.al. | 2410.09845 | link |
2024-10-13 | BiDoRA: Bi-level Optimization-Based Weight-Decomposed Low-Rank Adaptation | Peijia Qin et.al. | 2410.09758 | null |
2024-10-13 | AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model | Yuchen Li et.al. | 2410.09714 | null |
2024-10-11 | Parameter-Efficient Fine-Tuning of State Space Models | Kevin Galim et.al. | 2410.09016 | link |
2024-10-10 | Randomized Asymmetric Chain of LoRA: The First Meaningful Theoretical Framework for Low-Rank Adaptation | Grigory Malinovsky et.al. | 2410.08305 | null |
2024-10-10 | SLIM: Let LLM Learn More and Forget Less with Soft LoRA and Identity Mixture | Jiayi Han et.al. | 2410.07739 | null |
2024-10-10 | MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion | Onkar Susladkar et.al. | 2410.07659 | null |
2024-10-09 | SparseGrad: A Selective Method for Efficient Fine-tuning of MLP Layers | Viktoriia Chekalina et.al. | 2410.07383 | link |
2024-10-09 | One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation | Fabian Paischer et.al. | 2410.07170 | link |
2024-10-09 | Industrial complexity and the evolution of formal employment in developing cities | Neave O'Clery et.al. | 2410.06971 | null |
2024-10-11 | Enhancing Multimodal LLM for Detailed and Accurate Video Captioning using Multi-Round Preference Optimization | Changli Tang et.al. | 2410.06682 | null |
2024-10-08 | Systematic 2.5 D resistive MHD simulations with ambipolar diffusion and Hall effect for fast magnetic reconnection | Gabriela Landinez et.al. | 2410.06391 | null |
2024-10-08 | HyperDet: Generalizable Detection of Synthesized Images by Generating and Merging A Mixture of Hyper LoRAs | Huangsen Cao et.al. | 2410.06044 | null |
2024-10-08 | QERA: an Analytical Framework for Quantization Error Reconstruction | Cheng Zhang et.al. | 2410.06040 | null |
2024-10-08 | Hyper Adversarial Tuning for Boosting Adversarial Robustness of Pretrained Large Vision Models | Kangtao Lv et.al. | 2410.05951 | null |
2024-10-07 | GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting | Yukang Cao et.al. | 2410.05259 | null |
2024-10-08 | PAMLR: A Passive-Active Multi-Armed Bandit-Based Solution for LoRa Channel Allocation | Jihoon Yun et.al. | 2410.05147 | null |
2024-10-07 | HyperINF: Unleashing the HyperPower of the Schulz's Method for Data Influence Estimation | Xinyu Zhou et.al. | 2410.05090 | link |
2024-10-07 | Low-Rank Continual Pyramid Vision Transformer: Incrementally Segment Whole-Body Organs in CT with Light-Weighted Adaptation | Vince Zhu et.al. | 2410.04689 | null |
2024-10-06 | Learning De-Biased Representations for Remote-Sensing Imagery | Zichen Tian et.al. | 2410.04546 | link |
2024-10-05 | Learning on LoRAs: GL-Equivariant Processing of Low-Rank Weight Spaces for Large Finetuned Models | Theo et.al. | 2410.04207 | null |
2024-10-05 | LoRTA: Low Rank Tensor Adaptation of Large Language Models | Ignacio Hounie et.al. | 2410.04060 | null |
2024-10-05 | Hyperbolic Fine-tuning for Large Language Models | Menglin Yang et.al. | 2410.04010 | link |
2024-10-04 | AutoLoRA: AutoGuidance Meets Low-Rank Adaptation for Diffusion Models | Artur Kasymov et.al. | 2410.03941 | link |
2024-10-04 | Collaborative and Efficient Personalization with Mixtures of Adaptors | Abdulla Jasem Almansoori et.al. | 2410.03497 | null |
2024-10-03 | Neutral residues: revisiting adapters for model extension | Franck Signe Talla et.al. | 2410.02744 | null |
2024-10-03 | Encryption-Friendly LLM Architecture | Donghwan Rho et.al. | 2410.02486 | null |
2024-10-02 | NEAT: Nonlinear Parameter-efficient Adaptation of Pre-trained Models | Yibo Zhong et.al. | 2410.01870 | null |
2024-10-02 | Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint? | Xi Chen et.al. | 2410.01623 | link |
2024-10-02 | DLP-LoRA: Efficient Task-Specific LoRA Fusion with a Dynamic, Lightweight Plugin for Large Language Models | Yuxuan Zhang et.al. | 2410.01497 | link |
2024-10-04 | Selective Aggregation for Low-Rank Adaptation in Federated Learning | Pengxin Guo et.al. | 2410.01463 | link |
2024-10-02 | FlashMask: Efficient and Rich Mask Extension of FlashAttention | Guoxia Wang et.al. | 2410.01359 | link |
2024-10-01 | MoS: Unleashing Parameter Efficiency of Low-Rank Adaptation with Mixture of Shards | Sheng Wang et.al. | 2410.00938 | null |
2024-10-02 | Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models | Saurav Jha et.al. | 2410.00700 | null |
2024-10-01 | PrivTuner with Homomorphic Encryption and LoRA: A P3EFT Scheme for Privacy-Preserving Parameter-Efficient Fine-Tuning of AI Foundation Models | Yang Li et.al. | 2410.00433 | null |
2024-09-30 | Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models | Ji Liu et.al. | 2410.00131 | null |
2024-09-30 | UIR-LoRA: Achieving Universal Image Restoration through Multiple Low-Rank Adaptation | Cheng Zhang et.al. | 2409.20197 | link |
2024-09-30 | BSharedRAG: Backbone Shared Retrieval-Augmented Generation for the E-commerce Domain | Kaisi Guan et.al. | 2409.20075 | null |
2024-09-30 | HDMoLE: Mixture of LoRA Experts with Hierarchical Routing and Dynamic Thresholds for Fine-Tuning LLM-based ASR Models | Bingshen Mu et.al. | 2409.19878 | null |
2024-09-29 | Learning Attentional Mixture of LoRAs for Language Model Continual Learning | Jialin Liu et.al. | 2409.19611 | null |
2024-09-29 | Abstractive Summarization of Low resourced Nepali language using Multilingual Transformers | Prakash Dhakal et.al. | 2409.19566 | null |
2024-09-27 | HM3: Heterogeneous Multi-Class Model Merging | Stefan Hackmann et.al. | 2409.19173 | null |
2024-09-26 | MARS: Multi-radio Architecture with Radio Selection using Decision Trees for emerging mesoscale CPS/IoT applications | Jothi Prasanna Shanmuga Sundaram et.al. | 2409.18043 | null |
2024-09-26 | PEDRO: Parameter-Efficient Fine-tuning with Prompt DEpenDent Representation MOdification | Tianfang Xie et.al. | 2409.17834 | null |
2024-09-30 | Efficient In-Domain Question Answering for Resource-Constrained Environments | Isaac Chung et.al. | 2409.17648 | null |
2024-09-26 | On the Implicit Relation Between Low-Rank Adaptation and Differential Privacy | Saber Malekmohammadi et.al. | 2409.17538 | null |
2024-09-26 | A Time Series is Worth Five Experts: Heterogeneous Mixture of Experts for Traffic Flow Prediction | Guangyu Wang et.al. | 2409.17440 | link |
2024-09-25 | Parameter-efficient Bayesian Neural Networks for Uncertainty-aware Depth Estimation | Richard D. Paul et.al. | 2409.17085 | null |
2024-09-25 | Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors | Aiping Zhang et.al. | 2409.17058 | link |
2024-09-25 | PMSS: Pretrained Matrices Skeleton Selection for LLM Fine-tuning | Qibin Wang et.al. | 2409.16722 | null |
2024-09-25 | GraphLoRA: Structure-Aware Contrastive Low-Rank Adaptation for Cross-Graph Transfer Learning | Zhe-Rui Yang et.al. | 2409.16670 | null |
2024-09-25 | Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models | Deepak Sridhar et.al. | 2409.16535 | link |
2024-09-24 | Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering | Ziyu Zhao et.al. | 2409.16167 | null |
2024-09-24 | Evaluation of state-of-the-art ASR Models in Child-Adult Interactions | Aditya Ashvin et.al. | 2409.16135 | null |
2024-09-24 | Bridging Speech and Text: Enhancing ASR with Pinyin-to-Character Pre-training in LLMs | Yang Yuhang et.al. | 2409.16005 | null |
2024-09-24 | Boosting Code-Switching ASR with Mixture of Experts Enhanced Speech-Conditioned LLM | Fengrun Zhang et.al. | 2409.15905 | null |
2024-09-24 | Aided design of bridge aesthetics based on Stable Diffusion fine-tuning | Leye Zhang et.al. | 2409.15812 | link |
2024-09-17 | Chain-of-Thought Prompting for Speech Translation | Ke Hu et.al. | 2409.11538 | null |
2024-09-17 | Beyond LoRA: Exploring Efficient Fine-Tuning Techniques for Time Series Foundational Models | Divij Gupta et.al. | 2409.11302 | null |
2024-09-17 | LoRa Communication for Agriculture 4.0: Opportunities, Challenges, and Future Directions | Lameya Aldhaheri et.al. | 2409.11200 | null |
2024-09-17 | Few-Shot Domain Adaptation for Learned Image Compression | Tianyu Zhang et.al. | 2409.11111 | null |
2024-09-17 | KVPruner: Structural Pruning for Faster and Memory-Efficient Large Language Models | Bo Lv et.al. | 2409.11057 | null |
2024-09-18 | Propulsion: Steering LLM with Tiny Fine-Tuning | Md Kowsher et.al. | 2409.10927 | link |
2024-09-16 | A Bayesian Interpretation of Adaptive Low-Rank Adaptation | Haolin Chen et.al. | 2409.10673 | link |
2024-09-16 | From Text to Emoji: How PEFT-Driven Personality Manipulation Unleashes the Emoji Potential in LLMs | Navya Jain et.al. | 2409.10245 | null |
2024-09-16 | Robust Bird's Eye View Segmentation by Adapting DINOv2 | Merve Rabia Barın et.al. | 2409.10228 | null |
2024-09-19 | jina-embeddings-v3: Multilingual Embeddings With Task LoRA | Saba Sturua et.al. | 2409.10173 | null |
2024-09-16 | Rapid Adaptation of Earth Observation Foundation Models for Segmentation | Karthick Panner Selvam et.al. | 2409.09907 | null |
2024-09-15 | AlpaPICO: Extraction of PICO Frames from Clinical Trial Documents Using LLMs | Madhusudan Ghosh et.al. | 2409.09704 | link |
2024-09-14 | COMFORT: A Continual Fine-Tuning Framework for Foundation Models Targeted at Consumer Healthcare | Chia-Hao Li et.al. | 2409.09549 | null |
2024-09-14 | SAM-OCTA2: Layer Sequence OCTA Segmentation with Fine-tuned Segment Anything Model 2 | Xinrun Chen et.al. | 2409.09286 | link |
2024-09-13 | Data Efficient Child-Adult Speaker Diarization with Simulated Conversations | Anfeng Xu et.al. | 2409.08881 | link |
2024-09-13 | Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions | Lingwei Meng et.al. | 2409.08596 | null |
2024-09-13 | ATFLRec: A Multimodal Recommender System with Audio-Text Fusion and Low-Rank Adaptation via Instruction-Tuned Large Language Model | Zezheng Qin et.al. | 2409.08543 | null |
2024-09-13 | Risks When Sharing LoRA Fine-Tuned Diffusion Model Weights | Dixi Yao et.al. | 2409.08482 | null |
2024-09-13 | Toward satisfactory public accessibility: A crowdsourcing approach through online reviews to inclusive urban design | Lingyao Li et.al. | 2409.08459 | null |
2024-09-12 | AudioBERT: Audio Knowledge Augmented Language Model | Hyunjong Ok et.al. | 2409.08199 | link |
2024-09-12 | Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy | Bojian Li et.al. | 2409.07723 | null |
2024-09-11 | Efficient Localized Adaptation of Neural Weather Forecasting: A Case Study in the MENA Region | Muhammad Akhtar Munir et.al. | 2409.07585 | link |
2024-09-11 | Improving Anomalous Sound Detection via Low-Rank Adaptation Fine-Tuning of Pre-Trained Audio Models | Xinhu Zheng et.al. | 2409.07016 | null |
2024-09-10 | SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation | Teng Hu et.al. | 2409.06633 | null |
2024-09-09 | Elucidating Optimal Reward-Diversity Tradeoffs in Text-to-Image Diffusion Models | Rohit Jena et.al. | 2409.06493 | null |
2024-09-10 | HexaCoder: Secure Code Generation via Oracle-Guided Synthetic Training Data | Hossein Hajipour et.al. | 2409.06446 | link |
2024-09-10 | VE: Modeling Multivariate Time Series Correlation with Variate Embedding | Shangjiong Wang et.al. | 2409.06169 | link |
2024-09-09 | FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank Adaptations | Ziyao Wang et.al. | 2409.05976 | link |
2024-09-09 | SVFit: Parameter-Efficient Fine-Tuning of Large Pre-Trained Models Using Singular Values | Chengwei Sun et.al. | 2409.05926 | null |
2024-09-09 | TriplePlay: Enhancing Federated Learning with CLIP for Non-IID Data and Resource Efficiency | Ahmed Imteaj et.al. | 2409.05347 | null |
2024-09-08 | Exploring Intrinsic Language-specific Subspaces in Fine-tuning Multilingual Neural Machine Translation | Zhe Cao et.al. | 2409.05224 | link |
2024-09-06 | Customizing Large Language Model Generation Style using Parameter-Efficient Finetuning | Xinyue Liu et.al. | 2409.04574 | null |
2024-09-06 | Fast Forwarding Low-Rank Training | Adir Rahamim et.al. | 2409.04206 | null |
2024-09-05 | Continual Skill and Task Learning via Dialogue | Weiwei Gu et.al. | 2409.03166 | null |
2024-09-04 | Non-Orthogonal Multiple-Access Strategies for Direct-to-Satellite IoT Networks | Felipe Augusto Tondo et.al. | 2409.02748 | null |
2024-09-04 | Robust Federated Finetuning of Foundation Models via Alternating Minimization of LoRA | Shuangyi Chen et.al. | 2409.02346 | null |
2024-08-31 | CoRA: Optimizing Low-Rank Adaptation with Common Subspace of Large Language Models | Xiaojun Xiao et.al. | 2409.02119 | null |
2024-09-02 | LoGex: Improved tail detection of extremely rare histopathology classes via guided diffusion | Maximilian Mueller et.al. | 2409.01317 | link |
2024-09-02 | Unleashing the Power of Task-Specific Directions in Parameter Efficient Fine-tuning | Chongjie Si et.al. | 2409.01035 | link |
2024-09-02 | Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language | Jeong Hun Yeo et.al. | 2409.00986 | link |
2024-08-30 | Enhancing Event Reasoning in Large Language Models through Instruction Fine-Tuning with Semantic Causal Graphs | Mazal Bethany et.al. | 2409.00209 | null |
2024-08-30 | DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model | Mona Sheikh Zeinoddin et.al. | 2408.17433 | link |
2024-08-30 | MoRe Fine-Tuning with 10x Fewer Parameters | Wenxuan Tan et.al. | 2408.17383 | link |
2024-08-30 | Wireless Integrated Authenticated Communication System (WIA-Comm) | Amith N Bharadwaj et.al. | 2408.17112 | null |
2024-09-02 | Instant Adversarial Purification with Adversarial Consistency Distillation | Chun Tong Lei et.al. | 2408.17064 | null |
2024-08-30 | Efficient Image Restoration through Low-Rank Adaptation and Stable Diffusion XL | Haiyang Zhao et.al. | 2408.17060 | null |
2024-08-29 | LoraMap: Harnessing the Power of LoRA Connections | Hyeryun Park et.al. | 2408.16264 | null |
2024-08-28 | LeMON: Learning to Learn Multi-Operator Networks | Jingmin Sun et.al. | 2408.16168 | link |
2024-08-28 | Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models | Yuncheng Yang et.al. | 2408.15915 | link |
2024-08-28 | StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of Style Elements | Jillian Fisher et.al. | 2408.15666 | link |
2024-08-28 | TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation | Junbao Zhou et.al. | 2408.15657 | link |
2024-08-28 | Whisper-PMFA: Partial Multi-Scale Feature Aggregation for Speaker Verification using Whisper Models | Yiyang Zhao et.al. | 2408.15585 | null |
2024-08-28 | VoiceTailor: Lightweight Plug-In Adapter for Diffusion-Based Personalized Text-to-Speech | Heeseung Kim et.al. | 2408.14739 | null |
2024-08-27 | PAT: Pruning-Aware Tuning for Large Language Models | Yijiang Liu et.al. | 2408.14721 | link |
2024-08-27 | StyleSpeech: Parameter-efficient Fine Tuning for Pre-trained Controllable Text-to-Speech | Haowei Lou et.al. | 2408.14713 | link |
2024-08-26 | CURLoRA: Stable LLM Continual Fine-Tuning and Catastrophic Forgetting Mitigation | Muhammad Fawi et.al. | 2408.14572 | link |
2024-08-27 | Step-by-Step Unmasking for Parameter-Efficient Fine-tuning of Large Language Models | Aradhye Agarwal et.al. | 2408.14470 | link |
2024-08-26 | Reprogramming Foundational Large Language Models(LLMs) for Enterprise Adoption for Spatio-Temporal Forecasting Applications: Unveiling a New Era in Copilot-Guided Cross-Modal Time Series Representation Learning | Sakhinana Sagar Srinivas et.al. | 2408.14387 | null |
2024-08-27 | SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher | Trung Dao et.al. | 2408.14176 | link |
2024-08-25 | TalkLoRA: Low-Rank Adaptation for Speech-Driven Animation | Jack Saunders et.al. | 2408.13714 | null |
2024-08-24 | Can Visual Foundation Models Achieve Long-term Point Tracking? | Görkay Aydemir et.al. | 2408.13575 | null |
2024-08-23 | The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs: An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities | Venkatesh Balavadhani Parthasarathy et.al. | 2408.13296 | null |
2024-08-23 | CLLMFS: A Contrastive Learning enhanced Large Language Model Framework for Few-Shot Named Entity Recognition | Yafeng Zhang et.al. | 2408.12834 | null |
2024-08-23 | Investigating LLM Applications in E-Commerce | Chester Palen-Michel et.al. | 2408.12779 | null |
2024-08-22 | EvalYaks: Instruction Tuning Datasets and LoRA Fine-tuned Models for Automated Scoring of CEFR B2 Speaking Assessment Transcripts | Nicy Scaria et.al. | 2408.12226 | link |
2024-08-21 | Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP Standards | Omar Erak et.al. | 2408.11775 | link |
2024-08-21 | EAGLE: Elevating Geometric Reasoning through LLM-empowered Visual Instruction Tuning | Zhihao Li et.al. | 2408.11397 | null |
2024-08-20 | EELE: Exploring Efficient and Extensible LoRA Integration in Emotional Text-to-Speech | Xin Qi et.al. | 2408.10852 | null |
2024-08-21 | Flexora: Flexible Low Rank Adaptation for Large Language Models | Chenxing Wei et.al. | 2408.10774 | null |
2024-08-20 | Large Language Models for Multimodal Deformable Image Registration | Mingrui Ma et.al. | 2408.10703 | link |
2024-08-20 | Towards Rehearsal-Free Multilingual ASR: A LoRA-based Case Study on Whisper | Tianyi Xu et.al. | 2408.10680 | null |
2024-08-20 | CoRA: Collaborative Information Perception by Large Language Model's Weights for Recommendation | Yuting Liu et.al. | 2408.10645 | null |
2024-08-18 | NoRA: Nested Low-Rank Adaptation for Efficient Fine-Tuning Large Models | Cheng Lin et.al. | 2408.10280 | null |
2024-08-19 | SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models | Anke Tang et.al. | 2408.10174 | link |
2024-08-19 | Customizing Language Models with Instance-wise LoRA for Sequential Recommendation | Xiaoyu Kong et.al. | 2408.10159 | link |
2024-08-19 | TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition | Tianwei Lin et.al. | 2408.09856 | link |
2024-08-18 | Infinite Scrolling, Finite Satisfaction: Exploring User Behavior and Satisfaction on Social Media in Bangladesh | Sanzana Karim Lora et.al. | 2408.09601 | null |
2024-08-17 | ConVerSum: A Contrastive Learning based Approach for Data-Scarce Solution of Cross-Lingual Summarization Beyond Direct Equivalents | Sanzana Karim Lora et.al. | 2408.09273 | null |
2024-08-17 | An Exploratory Study on Fine-Tuning Large Language Models for Secure Code Generation | Junjie Li et.al. | 2408.09078 | link |
2024-08-17 | MoRA: LoRA Guided Multi-Modal Disease Diagnosis with Missing Modality | Zhiyi Shi et.al. | 2408.09064 | null |
2024-08-16 | AdaRank: Disagreement Based Module Rank Prediction for Low-rank Adaptation | Yihe Dong et.al. | 2408.09015 | link |
2024-08-16 | ML Study of MaliciousTransactions in Ethereum | Natan Katz et.al. | 2408.08749 | null |
2024-08-16 | RBLA: Rank-Based-LoRA-Aggregation for Fine-tuning Heterogeneous Models in FLaaS | Shuaijun Chen et.al. | 2408.08699 | null |
2024-08-16 | LLM-PCGC: Large Language Model-based Point Cloud Geometry Compression | Yuqi Ye et.al. | 2408.08682 | null |
2024-08-16 | Adaptive Layer Selection for Efficient Vision Transformer Fine-Tuning | Alessio Devoto et.al. | 2408.08670 | null |
2024-08-16 | A New Chinese Landscape Paintings Generation Model based on Stable Diffusion using DreamBooth | Yujia Gu et.al. | 2408.08561 | null |
2024-08-15 | Heavy Labels Out! Dataset Distillation with Label Space Lightening | Ruonan Yu et.al. | 2408.08201 | null |
2024-08-15 | When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding | Pingping Zhang et.al. | 2408.08093 | null |
2024-08-14 | Domain-invariant Representation Learning via Segment Anything Model for Blood Cell Classification | Yongcheng Li et.al. | 2408.07467 | link |
2024-08-13 | SeLoRA: Self-Expanding Low-Rank Adaptation of Latent Diffusion Model for Medical Image Synthesis | Yuchen Mao et.al. | 2408.07196 | null |
2024-08-13 | Imagen 3 | Imagen-Team-Google et.al. | 2408.07009 | null |
2024-08-13 | New refinements of Narayana polynomials and Motzkin polynomials | Janet J. W. Dong et.al. | 2408.06912 | null |
2024-08-13 | LoRA |
Jia-Chen Zhang et.al. | 2408.06854 | null |
2024-08-13 | DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion | Yujia Wu et.al. | 2408.06740 | null |
2024-08-13 | Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Segment Anything Model | Yongcheng Li et.al. | 2408.06716 | link |
2024-08-13 | Harnessing Earnings Reports for Stock Predictions: A QLoRA-Enhanced LLM Approach | Haowei Ni et.al. | 2408.06634 | null |
2024-08-13 | Towards Robust and Cost-Efficient Knowledge Unlearning for Large Language Models | Sungmin Cha et.al. | 2408.06621 | null |
2024-08-15 | ControlNeXt: Powerful and Efficient Control for Image and Video Generation | Bohao Peng et.al. | 2408.06070 | link |
2024-08-11 | Hotfixing Large Language Models for Cod | Zhou Yang et.al. | 2408.05727 | null |
2024-08-09 | TaSL: Task Skill Localization and Consolidation for Language Model Continual Learning | Yujie Feng et.al. | 2408.05200 | link |
2024-08-09 | LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description | Yizhang Jin et.al. | 2408.04957 | link |
2024-08-09 | Energy performance of LR-FHSS: analysis and evaluation | Roger Sanchez-Vital et.al. | 2408.04908 | null |
2024-08-08 | Bias-Aware Low-Rank Adaptation: Mitigating Catastrophic Inheritance of Large Language Models | Yupeng Chang et.al. | 2408.04556 | link |
2024-08-08 | UNLEARN Efficient Removal of Knowledge in Large Language Models | Tyler Lizzo et.al. | 2408.04140 | null |
2024-08-07 | Image-to-LaTeX Converter for Mathematical Formulas and Text | Daniil Gurgurov et.al. | 2408.04015 | link |
2024-08-07 | Speaker Adaptation for Quantised End-to-End ASR Models | Qiuming Zhao et.al. | 2408.03979 | null |
2024-08-07 | A Comparison of LLM Finetuning Methods & Evaluation Metrics with Travel Chatbot Use Case | Sonia Meyer et.al. | 2408.03562 | null |
2024-08-11 | Lifelong Personalized Low-Rank Adaptation of Large Language Models for Recommendation | Jiachen Zhu et.al. | 2408.03533 | null |
2024-08-06 | FastEdit: Fast Text-Guided Single-Image Editing via Semantic-Aware Diffusion Fine-Tuning | Zhi Chen et.al. | 2408.03355 | null |
2024-08-06 | SARA: Singular-Value Based Adaptive Low-Rank Adaption | Jihao Gu et.al. | 2408.03290 | null |
2024-08-06 | Leveraging Parameter Efficient Training Methods for Low Resource Text Classification: A Case Study in Marathi | Pranita Deshmukh et.al. | 2408.03172 | null |
2024-08-06 | L3iTC at the FinLLM Challenge Task: Quantization for Financial Text Classification & Summarization | Elvys Linhares Pontes et.al. | 2408.03033 | null |
2024-08-06 | Towards Smart Microfarming in an Urban Computing Continuum | Marla Grunewald et.al. | 2408.02992 | null |
2024-08-05 | StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion | Zhichao Wang et.al. | 2408.02178 | null |
2024-08-04 | SR-CIS: Self-Reflective Incremental System with Decoupled Memory and Reasoning | Biqing Qi et.al. | 2408.01970 | null |
2024-08-03 | Music2P: A Multi-Modal AI-Driven Tool for Simplifying Album Cover Design | Joong Ho Choi et.al. | 2408.01651 | link |
2024-08-02 | MoDE: Effective Multi-task Parameter Efficient Fine-Tuning with a Mixture of Dyadic Experts | Lin Ning et.al. | 2408.01505 | null |
2024-08-02 | Conditional LoRA Parameter Generation | Xiaolong Jin et.al. | 2408.01415 | null |
2024-08-02 | Pre-trained Language Models Improve the Few-shot Prompt Ability of Decision Transformer | Yu Yang et.al. | 2408.01402 | null |
2024-08-02 | Contribution-based Low-Rank Adaptation with Pre-training Model for Real Image Restoration | Donwon Park et.al. | 2408.01099 | null |
2024-08-02 | Tensor Train Low-rank Approximation (TT-LoRA): Democratizing AI with Accelerated LLMs | Afia Anjum et.al. | 2408.01008 | null |
2024-08-02 | PERSOMA: PERsonalized SOft ProMpt Adapter Architecture for Personalized Language Prompting | Liam Hebert et.al. | 2408.00960 | null |
2024-08-01 | Reclaiming Residual Knowledge: A Novel Paradigm to Low-Bit Quantization | Róisín Luo et.al. | 2408.00923 | null |
2024-07-31 | Ge-based Clinopyroxene series: first principles and experimental local probe study | Ricardo P. Moreira et.al. | 2407.21749 | null |
2024-07-31 | A Federated Learning-Friendly Approach for Parameter-Efficient Fine-Tuning of SAM in 3D Segmentation | Mothilal Asokan et.al. | 2407.21739 | null |
2024-07-31 | Zero-Shot Cross-Domain Dialogue State Tracking via Dual Low-Rank Adaptation | Xiang Luo et.al. | 2407.21633 | link |
2024-07-30 | CELLM: An Efficient Communication in Large Language Models Training for Federated Learning | Raja Vavekanand et.al. | 2407.20557 | null |
2024-07-29 | Generative Diffusion Model Bootstraps Zero-shot Classification of Fetal Ultrasound Images In Underrepresented African Populations | Fangyijie Wang et.al. | 2407.20072 | link |
2024-07-28 | Memory-efficient Training of LLMs with Larger Mini-batches | Dang Nguyen et.al. | 2407.19580 | null |
2024-07-27 | Parameter-Efficient Fine-Tuning via Circular Convolution | Aochuan Chen et.al. | 2407.19342 | null |
2024-07-27 | The Impact of LoRA Adapters for LLMs on Clinical NLP Classification Under Data Limitations | Thanh-Dung Le et.al. | 2407.19299 | null |
2024-07-26 | VIMs: Virtual Immunohistochemistry Multiplex staining via Text-to-Stain Diffusion Trained on Uniplex Stains | Shikha Dubey et.al. | 2407.19113 | null |
2024-07-25 | Stay Tuned: An Empirical Study of the Impact of Hyperparameters on LLM Tuning in Real-World Applications | Alon Halfon et.al. | 2407.18990 | null |
2024-07-25 | LoRA-Pro: Are Low-Rank Adapters Properly Optimized? | Zhengbo Wang et.al. | 2407.18242 | link |
2024-07-25 | DINOv2 Rocks Geological Image Analysis: Classification, Segmentation, and Interpretability | Florent Brondolo et.al. | 2407.18100 | link |
2024-07-24 | Channel-Aware Low-Rank Adaptation in Time Series Forecasting | Tong Nie et.al. | 2407.17246 | link |
2024-07-24 | Accurate and Efficient Fine-Tuning of Quantized Large Language Models Through Optimal Balance | Ao Shen et.al. | 2407.17029 | link |
2024-07-22 | Rapid Switching and Multi-Adapter Fusion via Sparse High Rank Adapters | Kartikeya Bhardwaj et.al. | 2407.16712 | null |
2024-07-23 | DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models | Zhenyu Xie et.al. | 2407.16511 | null |
2024-07-23 | Harmonizing Visual Text Comprehension and Generation | Zhen Zhao et.al. | 2407.16364 | link |
2024-07-23 | FoRA: Low-Rank Adaptation Model beyond Multimodal Siamese Network | Weiying Xie et.al. | 2407.16129 | link |
2024-07-22 | Test-Time Low Rank Adaptation via Confidence Maximization for Zero-Shot Generalization of Vision-Language Models | Raza Imam et.al. | 2407.15913 | link |
2024-07-22 | Zero-Shot Embeddings Inform Learning and Forgetting with Vision-Language Encoders | Laura Niss et.al. | 2407.15731 | null |
2024-07-22 | LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models | Xi Chen et.al. | 2407.15415 | link |
2024-07-21 | Learn to Preserve and Diversify: Parameter-Efficient Group with Orthogonal Regularization for Domain Generalization | Jiajun Hu et.al. | 2407.15085 | null |
2024-07-21 | MedSAGa: Few-shot Memory Efficient Medical Image Segmentation using Gradient Low-Rank Projection in SAM | Navyansh Mahla et.al. | 2407.15042 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-30 | Improving Acoustic Scene Classification in Low-Resource Conditions | Zhi Chen et.al. | 2412.20722 | null |
2024-12-28 | Injecting Explainability and Lightweight Design into Weakly Supervised Video Anomaly Detection Systems | Wen-Dong Jiang et.al. | 2412.20201 | null |
2024-12-28 | SimLTD: Simple Supervised and Semi-Supervised Long-Tailed Object Detection | Phi Vu Tran et.al. | 2412.20047 | null |
2024-12-28 | Invariant debiasing learning for recommendation via biased imputation | Ting Bai et.al. | 2412.20036 | link |
2024-12-28 | Learning Adaptive and View-Invariant Vision Transformer with Multi-Teacher Knowledge Distillation for Real-Time UAV Tracking | You Wu et.al. | 2412.20002 | link |
2024-12-27 | Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis | Jiaqi Wang et.al. | 2412.19654 | link |
2024-12-27 | Feature Alignment-Based Knowledge Distillation for Efficient Compression of Large Language Models | Shuo Wang et.al. | 2412.19449 | null |
2024-12-26 | SpectralKD: Understanding and Optimizing Vision Transformer Distillation through Spectral Analysis | Huiyuan Tian et.al. | 2412.19055 | null |
2024-12-25 | Optimization and Scalability of Collaborative Filtering Algorithms in Large Language Models | Haowei Yang et.al. | 2412.18715 | null |
2024-12-23 | Edge-AI for Agriculture: Lightweight Vision Models for Disease Detection in Resource-Limited Settings | Harsh Joshi et.al. | 2412.18635 | null |
2024-12-24 | HTR-JAND: Handwritten Text Recognition with Joint Attention Network and Knowledge Distillation | Mohammed Hamdan et.al. | 2412.18524 | null |
2024-12-24 | Understanding Artificial Neural Network's Behavior from Neuron Activation Perspective | Yizhou Zhang et.al. | 2412.18073 | null |
2024-12-23 | CoSurfGS:Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction | Yuanyuan Gao et.al. | 2412.17612 | null |
2024-12-23 | GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference | Chao Zeng et.al. | 2412.17560 | null |
2024-12-24 | Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement | Hyeonjin Kim et.al. | 2412.17387 | link |
2024-12-23 | Better Knowledge Enhancement for Privacy-Preserving Cross-Project Defect Prediction | Yuying Wang et.al. | 2412.17317 | null |
2024-12-23 | LMD-PGN: Cross-Modal Knowledge Distillation from First-Person-View Images to Third-Person-View BEV Maps for Universal Point Goal Navigation | Riku Uemura et.al. | 2412.17282 | null |
2024-12-22 | Lightweight Design and Optimization methods for DCNNs: Progress and Futures | Hanhua Long et.al. | 2412.16886 | null |
2024-12-21 | Large Language Models Compression via Low-Rank Feature Distillation | Yaya Sy et.al. | 2412.16719 | null |
2024-12-21 | CyberSentinel: Efficient Anomaly Detection in Programmable Switch using Knowledge Distillation | Sankalp Mittal et.al. | 2412.16693 | null |
2024-12-21 | Semantics Prompting Data-Free Quantization for Low-Bit Vision Transformers | Yunshan Zhong et.al. | 2412.16553 | null |
2024-12-21 | STKDRec: Spatial-Temporal Knowledge Distillation for Takeaway Recommendation | Shuyuan Zhao et.al. | 2412.16502 | null |
2024-12-20 | BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models | Patrick Haller et.al. | 2412.15978 | null |
2024-12-20 | A New Method to Capturing Compositional Knowledge in Linguistic Space | Jiahe Wan et.al. | 2412.15632 | null |
2024-12-19 | Uncertainty-Guided Cross Attention Ensemble Mean Teacher for Semi-supervised Medical Image Segmentation | Meghana Karri et.al. | 2412.15380 | null |
2024-12-19 | Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models | Reza Shirkavand et.al. | 2412.15341 | null |
2024-12-19 | Self-Evolution Knowledge Distillation for LLM-based Machine Translation | Yuncheng Song et.al. | 2412.15303 | null |
2024-12-19 | Adaptive Pruning for Large Language Models with Structural Importance Awareness | Haotian Zheng et.al. | 2412.15127 | null |
2024-12-19 | SCKD: Semi-Supervised Cross-Modality Knowledge Distillation for 4D Radar Object Detection | Ruoyu Xu et.al. | 2412.14571 | null |
2024-12-19 | Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models | Xiao Cui et.al. | 2412.14528 | null |
2024-12-19 | Knowledge Distillation in RNN-Attention Models for Early Prediction of Student Performance | Sukrit Leelaluk et.al. | 2412.14526 | link |
2024-12-18 | A Survey on Inference Optimization Techniques for Mixture of Experts Models | Jiacheng Liu et.al. | 2412.14219 | link |
2024-12-18 | Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective | Zhiyuan Zeng et.al. | 2412.14135 | null |
2024-12-18 | On Explaining Knowledge Distillation: Measuring and Visualising the Knowledge Transfer Process | Gereziher Adhane et.al. | 2412.13943 | null |
2024-12-18 | Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN | Pengxiang Li et.al. | 2412.13795 | link |
2024-12-18 | Learnable Prompting SAM-induced Knowledge Distillation for Semi-supervised Medical Image Segmentation | Kaiwen Huang et.al. | 2412.13742 | link |
2024-12-18 | On the Compression of Language Models for Code: An Empirical Study on CodeBERT | Giordano d'Aloisio et.al. | 2412.13737 | null |
2024-12-18 | Hybrid Data-Free Knowledge Distillation | Jialiang Tang et.al. | 2412.13525 | link |
2024-12-18 | Deploying Foundation Model Powered Agent Services: A Survey | Wenchao Xu et.al. | 2412.13437 | null |
2024-12-17 | In-Context Learning Distillation for Efficient Few-Shot Fine-Tuning | Yifei Duan et.al. | 2412.13243 | null |
2024-12-17 | Modality-Inconsistent Continual Learning of Multimodal Large Language Models | Weiguo Pian et.al. | 2412.13050 | null |
2024-12-17 | Efficient Speech Command Recognition Leveraging Spiking Neural Network and Curriculum Learning-based Knowledge Distillation | Jiaqi Wang et.al. | 2412.12858 | null |
2024-12-17 | RemoteTrimmer: Adaptive Structural Pruning for Remote Sensing Image Classification | Guanwenjie Zou et.al. | 2412.12603 | link |
2024-12-17 | PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts | Kun Guo et.al. | 2412.12460 | link |
2024-12-16 | Neural Collapse Inspired Knowledge Distillation | Shuoxi Zhang et.al. | 2412.11788 | null |
2024-12-16 | Relation-Guided Adversarial Learning for Data-free Knowledge Transfer | Yingping Liang et.al. | 2412.11380 | link |
2024-12-16 | BiM-VFI: directional Motion Field-Guided Frame Interpolation for Video with Non-uniform Motions | Wonyong Seo et.al. | 2412.11365 | null |
2024-12-15 | Wearable Accelerometer Foundation Models for Health via Knowledge Distillation | Salar Abbaspourazad et.al. | 2412.11276 | null |
2024-12-15 | TrimLLM: Progressive Layer Dropping for Domain-Specific LLMs | Lanxiang Hu et.al. | 2412.11242 | null |
2024-12-15 | ProFe: Communication-Efficient Decentralized Federated Learning via Distillation and Prototypes | Pedro Miguel Sánchez Sánchez et.al. | 2412.11207 | null |
2024-12-15 | Leveraging Large Language Models for Active Merchant Non-player Characters | Byungjun Kim et.al. | 2412.11189 | null |
2024-12-15 | Knowledge Migration Framework for Smart Contract Vulnerability Detection | Luqi Wang et.al. | 2412.11175 | null |
2024-12-15 | Redefining Normal: A Novel Object-Level Approach for Multi-Object Novelty Detection | Mohammadreza Salehi et.al. | 2412.11148 | link |
2024-12-17 | On Distilling the Displacement Knowledge for Few-Shot Class-Incremental Learning | Pengfei Fang et.al. | 2412.11017 | null |
2024-12-13 | Can Students Beyond The Teacher? Distilling Knowledge from Teacher's Bias | Jianhua Zhang et.al. | 2412.09874 | null |
2024-12-13 | ScaleOT: Privacy-utility-scalable Offsite-tuning with Dynamic LayerReplace and Selective Rank Compression | Kai Yao et.al. | 2412.09812 | null |
2024-12-13 | LLM Distillation for Efficient Few-Shot Multiple Choice Question Answering | Patrick Sutanto et.al. | 2412.09807 | null |
2024-12-12 | SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training | Dongting Hu et.al. | 2412.09619 | null |
2024-12-12 | A Theoretical Analysis of Soft-Label vs Hard-Label Training in Neural Networks | Saptarshi Mandal et.al. | 2412.09579 | null |
2024-12-12 | All You Need in Knowledge Distillation Is a Tailored Coordinate System | Junjie Zhou et.al. | 2412.09388 | null |
2024-12-12 | Optimising TinyML with Quantization and Distillation of Transformer and Mamba Models for Indoor Localisation on Edge Devices | Thanaphon Suwannaphong et.al. | 2412.09289 | null |
2024-12-15 | DASK: Distribution Rehearsing via Adaptive Style Kernel Learning for Exemplar-Free Lifelong Person Re-Identification | Kunlun Xu et.al. | 2412.09224 | link |
2024-12-12 | Multimodal Industrial Anomaly Detection by Crossmodal Reverse Distillation | Xinyue Liu et.al. | 2412.08949 | link |
2024-12-12 | Dynamic Contrastive Knowledge Distillation for Efficient Image Restoration | Yunshuai Zhou et.al. | 2412.08939 | null |
2024-12-11 | Efficient Gravitational Wave Parameter Estimation via Knowledge Distillation: A ResNet1D-IAF Approach | Xihua Zhu et.al. | 2412.08672 | null |
2024-12-11 | Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation | Jiaming Lv et.al. | 2412.08139 | null |
2024-12-11 | DAKD: Data Augmentation and Knowledge Distillation using Diffusion Models for SAR Oil Spill Segmentation | Jaeho Moon et.al. | 2412.08116 | null |
2024-12-10 | Low-Rank Correction for Quantized LLMs | Meyer Scetbon et.al. | 2412.07902 | null |
2024-12-10 | Unlocking the Potential of Reverse Distillation for Anomaly Detection | Xinyue Liu et.al. | 2412.07579 | link |
2024-12-10 | TT-MPD: Test Time Model Pruning and Distillation | Haihang Wu et.al. | 2412.07114 | null |
2024-12-09 | FM2DS: Few-Shot Multimodal Multihop Data Synthesis with Knowledge Distillation for Question Answering | Amirhossein Abaskohi et.al. | 2412.07030 | link |
2024-12-09 | VQ4ALL: Efficient Neural Network Representation via a Universal Codebook | Juncan Deng et.al. | 2412.06875 | null |
2024-12-09 | Compression for Better: A General and Stable Lossless Compression Framework | Boyang Zhang et.al. | 2412.06868 | null |
2024-12-09 | Lossless Model Compression via Joint Low-Rank Factorization Optimization | Boyang Zhang et.al. | 2412.06867 | null |
2024-12-08 | GL-Fusion: Rethinking the Combination of Graph Neural Network and Large Language model | Haotong Yang et.al. | 2412.06849 | null |
2024-12-10 | Federated Split Learning with Model Pruning and Gradient Quantization in Wireless Networks | Junhe Zhang et.al. | 2412.06414 | null |
2024-12-09 | U-Know-DiffPAN: An Uncertainty-aware Knowledge Distillation Diffusion Framework with Details Enhancement for PAN-Sharpening | Sungpyo Kim et.al. | 2412.06243 | null |
2024-12-08 | Enhancing Content Representation for AR Image Quality Assessment Using Knowledge Distillation | Aymen Sekhri et.al. | 2412.06003 | null |
2024-12-07 | Neighborhood Commonality-aware Evolution Network for Continuous Generalized Category Discovery | Ye Wang et.al. | 2412.05573 | null |
2024-12-07 | Trimming Down Large Spiking Vision Transformers via Heterogeneous Quantization Search | Boxun Xu et.al. | 2412.05505 | null |
2024-12-06 | BEExformer: A Fast Inferencing Transformer Architecture via Binarization with Multiple Early Exits | Wazib Ansar et.al. | 2412.05225 | null |
2024-12-06 | One-shot Federated Learning via Synthetic Distiller-Distillate Communication | Junyuan Zhang et.al. | 2412.05186 | link |
2024-12-06 | CCS: Continuous Learning for Customized Incremental Wireless Sensing Services | Qunhang Fu et.al. | 2412.04821 | null |
2024-12-05 | Diffusion-Augmented Coreset Expansion for Scalable Dataset Distillation | Ali Abbasi et.al. | 2412.04668 | null |
2024-12-05 | FedDW: Distilling Weights through Consistency Optimization in Heterogeneous Federated Learning | Jiayu Liu et.al. | 2412.04521 | link |
2024-12-05 | Expanding Deep Learning-based Sensing Systems with Multi-Source Knowledge Transfer | Gaole Dai et.al. | 2412.04060 | null |
2024-12-04 | Designing DNNs for a trade-off between robustness and processing performance in embedded devices | Jon Gutiérrez-Zaballa et.al. | 2412.03682 | null |
2024-12-04 | Evaluating Single Event Upsets in Deep Neural Networks for Semantic Segmentation: an embedded system perspective | Jon Gutiérrez-Zaballa et.al. | 2412.03630 | link |
2024-12-03 | CPTQuant -- A Novel Mixed Precision Post-Training Quantization Techniques for Large Language Models | Amitash Nanda et.al. | 2412.03599 | null |
2024-12-07 | Enhancing CLIP Conceptual Embedding through Knowledge Distillation | Kuei-Chun Kao et.al. | 2412.03513 | null |
2024-12-04 | Distillation of Diffusion Features for Semantic Correspondence | Frank Fundel et.al. | 2412.03512 | null |
2024-12-03 | Efficient Model Compression Techniques with FishLeg | Jamie McGowan et.al. | 2412.02328 | null |
2024-12-02 | Mutli-View 3D Reconstruction using Knowledge Distillation | Aditya Dutt et.al. | 2412.02039 | link |
2024-12-02 | Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model | Qianhan Feng et.al. | 2412.01282 | link |
2024-12-02 | Reducing Inference Energy Consumption Using Dual Complementary CNNs | Michail Kinnas et.al. | 2412.01039 | link |
2024-12-01 | QABISAR: Query-Article Bipartite Interactions for Statutory Article Retrieval | T. Y. S. S. Santosh et.al. | 2412.00934 | null |
2024-12-01 | Local vs. Global: Local Land-Use and Land-Cover Models Deliver Higher Quality Maps | Girmaw Abebe Tadesse et.al. | 2412.00777 | null |
2024-11-30 | Continuous Concepts Removal in Text-to-image Diffusion Models | Tingxu Han et.al. | 2412.00580 | null |
2024-11-30 | Pruned Convolutional Attention Network Based Wideband Spectrum Sensing with Sub-Nyquist Sampling | Peihao Dong et.al. | 2412.00562 | link |
2024-11-30 | Toward Fair Graph Neural Networks Via Dual-Teacher Knowledge Distillation | Chengyu Li et.al. | 2412.00382 | null |
2024-11-29 | Reverse Thinking Makes LLMs Stronger Reasoners | Justin Chih-Yao Chen et.al. | 2411.19865 | null |
2024-11-28 | Pre-Training Graph Contrastive Masked Autoencoders are Strong Distillers for EEG | Xinxu Wei et.al. | 2411.19230 | null |
2024-12-03 | Puzzle: Distillation-Based NAS for Inference-Optimized LLMs | Akhiad Bercovich et.al. | 2411.19146 | null |
2024-11-28 | Headache to Overstock? Promoting Long-tail Items through Debiased Product Bundling | Shuo Xu et.al. | 2411.19107 | null |
2024-11-28 | Zero-shot Slot Filling in the Age of LLMs for Dialogue Systems | Mansi Rana et.al. | 2411.18980 | null |
2024-11-27 | Active Data Curation Effectively Distills Large-Scale Multimodal Models | Vishaal Udandarao et.al. | 2411.18674 | null |
2024-11-27 | Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models | Yiming Wu et.al. | 2411.18375 | null |
2024-11-27 | Vision Mamba Distillation for Low-resolution Fine-grained Image Classification | Yao Chen et.al. | 2411.17980 | link |
2024-11-27 | Improved implicit diffusion model with knowledge distillation to estimate the spatial distribution density of carbon stock in remote sensing imagery | Zhenyu Yu et.al. | 2411.17973 | null |
2024-11-26 | Attamba: Attending To Multi-Token States | Yash Akhauri et.al. | 2411.17685 | link |
2024-11-26 | Large-Scale Data-Free Knowledge Distillation for ImageNet via Multi-Resolution Data Generation | Minh-Tuan Tran et.al. | 2411.17046 | null |
2024-11-26 | Words Matter: Leveraging Individual Text Embeddings for Code Generation in CLIP Test-Time Adaptation | Shambhavi Mishra et.al. | 2411.17002 | link |
2024-11-25 | Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models | Yao Fu et.al. | 2411.16991 | null |
2024-11-25 | Leveraging Foundation Models To learn the shape of semi-fluid deformable objects | Omar El Assal et.al. | 2411.16802 | null |
2024-11-25 | O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? | Zhen Huang et.al. | 2411.16489 | link |
2024-11-25 | When Babies Teach Babies: Can student knowledge sharing outperform Teacher-Guided Distillation on small datasets? | Srikrishna Iyer et.al. | 2411.16487 | link |
2024-11-25 | Learn from Foundation Model: Fruit Detection Model without Manual Annotation | Yanan Wang et.al. | 2411.16196 | link |
2024-11-25 | Beyond Task Vectors: Selective Task Arithmetic Based on Importance Metrics | Tian Bowen et.al. | 2411.16139 | null |
2024-11-25 | Ensemble Learning via Knowledge Transfer for CTR Prediction | Honghao Li et.al. | 2411.16122 | link |
2024-11-23 | Botfip-LLM: An Enhanced Multimodal Scientific Computing Framework Leveraging Knowledge Distillation from Large Language Models | Tianhao Chen et.al. | 2411.15525 | null |
2024-11-23 | Efficient Ternary Weight Embedding Model: Bridging Scalability and Performance | Jiayi Chen et.al. | 2411.15438 | link |
2024-11-23 | Partial Knowledge Distillation for Alleviating the Inherent Inter-Class Discrepancy in Federated Learning | Xiaoyu Gan et.al. | 2411.15403 | null |
2024-11-22 | Efficient Pruning of Text-to-Image Models: Insights from Pruning Stable Diffusion | Samarth N Ramesh et.al. | 2411.15113 | null |
2024-11-22 | RankByGene: Gene-Guided Histopathology Representation Learning Through Cross-Modal Ranking Consistency | Wentao Huang et.al. | 2411.15076 | null |
2024-11-22 | Adaptive Group Robust Ensemble Knowledge Distillation | Patrik Kenfack et.al. | 2411.14984 | null |
2024-11-25 | Information Extraction from Heterogeneous Documents without Ground Truth Labels using Synthetic Label Generation and Knowledge Distillation | Aniket Bhattacharyya et.al. | 2411.14957 | null |
2024-11-22 | Simplifying CLIP: Unleashing the Power of Large-Scale Models on Consumer-level Computers | Hongbo Liu et.al. | 2411.14789 | null |
2024-11-22 | Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation | Xunyu Zhu et.al. | 2411.14698 | null |
2024-11-21 | TaQ-DiT: Time-aware Quantization for Diffusion Transformers | Xinyan Liu et.al. | 2411.14172 | null |
2024-11-21 | DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization | Hexuan Deng et.al. | 2411.14055 | link |
2024-11-21 | Teaching MLPs to Master Heterogeneous Graph-Structured Knowledge for Efficient and Accurate Inference | Yunhui Liu et.al. | 2411.14035 | link |
2024-11-21 | CLFace: A Scalable and Resource-Efficient Continual Learning Framework for Lifelong Face Recognition | Md Mahedi Hasan et.al. | 2411.13886 | null |
2024-11-20 | RTSR: A Real-Time Super-Resolution Model for AV1 Compressed Content | Yuxuan Jiang et.al. | 2411.13362 | null |
2024-11-20 | FASTNav: Fine-tuned Adaptive Small-language-models Trained for Multi-point Robot Navigation | Yuxuan Chen et.al. | 2411.13262 | null |
2024-11-20 | Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning | Gang Zhao et.al. | 2411.13045 | null |
2024-11-19 | Puppet-CNN: Input-Adaptive Convolutional Neural Networks with Model Compression using Ordinary Differential Equation | Yucheng Xing et.al. | 2411.12876 | null |
2024-11-19 | Reward Modeling with Ordinal Feedback: Wisdom of the Crowd | Shang Liu et.al. | 2411.12843 | null |
2024-11-19 | What Makes a Good Dataset for Knowledge Distillation? | Logan Frank et.al. | 2411.12817 | null |
2024-11-19 | FGP: Feature-Gradient-Prune for Efficient Convolutional Layer Pruning | Qingsong Lv et.al. | 2411.12781 | link |
2024-11-19 | KDC-MAE: Knowledge Distilled Contrastive Mask Auto-Encoder | Maheswar Bora et.al. | 2411.12270 | null |
2024-11-19 | Just KIDDIN: Knowledge Infusion and Distillation for Detection of INdecent Memes | Rahul Garg et.al. | 2411.12174 | null |
2024-11-18 | Federated Incremental Named Entity Recognition | Duzhen Zhang et.al. | 2411.11623 | null |
2024-11-18 | Bridging the Resource Gap: Deploying Advanced Imitation Learning Models onto Affordable Embedded Platforms | Haizhou Ge et.al. | 2411.11406 | null |
2024-11-17 | Map-Free Trajectory Prediction with Map Distillation and Hierarchical Encoding | Xiaodong Liu et.al. | 2411.10961 | null |
2024-11-16 | Hybrid Attention Model Using Feature Decomposition and Knowledge Distillation for Glucose Forecasting | Ebrahim Farahmand et.al. | 2411.10703 | null |
2024-11-16 | Multi-perspective Contrastive Logit Distillation | Qi Wang et.al. | 2411.10693 | null |
2024-11-16 | Exploring Feature-based Knowledge Distillation For Recommender System: A Frequency Perspective | Zhangchi Zhu et.al. | 2411.10676 | null |
2024-11-15 | Scaling Law for Post-training after Model Pruning | Xiaodong Chen et.al. | 2411.10272 | null |
2024-11-15 | Evidential Federated Learning for Skin Lesion Image Classification | Rutger Hendrix et.al. | 2411.10071 | null |
2024-11-14 | VPBSD:Vessel-Pattern-Based Semi-Supervised Distillation for Efficient 3D Microscopic Cerebrovascular Segmentation | Xi Lin et.al. | 2411.09567 | null |
2024-11-14 | Re-Parameterization of Lightweight Transformer for On-Device Speech Emotion Recognition | Zixing Zhang et.al. | 2411.09339 | null |
2024-11-14 | Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching | Yuran Wang et.al. | 2411.09151 | null |
2024-11-14 | Toward Democratized Generative AI in Next-Generation Mobile Edge Networks | Ruichen Zhang et.al. | 2411.09148 | null |
2024-11-13 | Dual-Head Knowledge Distillation: Enhancing Logits Utilization with an Auxiliary Head | Penghui Yang et.al. | 2411.08937 | null |
2024-11-13 | UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation | Chengyuan Zhang et.al. | 2411.08569 | null |
2024-11-13 | Federated Graph Learning with Graphless Clients | Xingbo Fu et.al. | 2411.08374 | null |
2024-11-12 | Joint Diffusion models in Continual Learning | Paweł Skierś et.al. | 2411.08224 | null |
2024-11-12 | Learning with Less: Knowledge Distillation from Large Language Models via Unlabeled Data | Juanhui Li et.al. | 2411.08028 | null |
2024-11-13 | Query Optimization for Parametric Knowledge Refinement in Retrieval-Augmented Large Language Models | Youan Cong et.al. | 2411.07820 | null |
2024-11-12 | ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization | Weibo Zhao et.al. | 2411.07762 | null |
2024-11-12 | Optimizing Traffic Signal Control using High-Dimensional State Representation and Efficient Deep Reinforcement Learning | Lawrence Francis et.al. | 2411.07759 | null |
2024-11-12 | ALANINE: A Novel Decentralized Personalized Federated Learning For Heterogeneous LEO Satellite Constellation | Liang Zhao et.al. | 2411.07752 | null |
2024-11-12 | OWLed: Outlier-weighed Layerwise Pruning for Efficient Autonomous Driving Framework | Jiaxi Li et.al. | 2411.07711 | link |
2024-11-13 | Feature Interaction Fusion Self-Distillation Network For CTR Prediction | Lei Sang et.al. | 2411.07508 | null |
2024-11-12 | Quantifying Knowledge Distillation Using Partial Information Decomposition | Pasan Dissanayake et.al. | 2411.07483 | null |
2024-11-11 | SAMPart3D: Segment Any Part in 3D Objects | Yunhan Yang et.al. | 2411.07184 | link |
2024-11-11 | LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models | Runming Yang et.al. | 2411.06839 | null |
2024-11-11 | ScaleKD: Strong Vision Transformers Could Be Excellent Teachers | Jiawei Fan et.al. | 2411.06786 | link |
2024-11-11 | An Efficient Memory Module for Graph Few-Shot Class-Incremental Learning | Dong Li et.al. | 2411.06659 | link |
2024-11-10 | CULL-MT: Compression Using Language and Layer pruning for Machine Translation | Pedram Rostami et.al. | 2411.06506 | null |
2024-11-10 | Over-parameterized Student Model via Tensor Decomposition Boosted Knowledge Distillation | Yu-Liang Zhan et.al. | 2411.06448 | link |
2024-11-09 | Dynamic Textual Prompt For Rehearsal-free Lifelong Person Re-identification | Hongyu Chen et.al. | 2411.06023 | null |
2024-11-09 | Multi-hop RIS-aided Learning Model Sharing for Urban Air Mobility | Kai Xiong et.al. | 2411.06015 | null |
2024-11-08 | Mitigating Hallucination with ZeroG: An Advanced Knowledge Management Engine | Anantha Sharma et.al. | 2411.05936 | null |
2024-11-08 | Asterisk: Keep it Simple* | Andrew Semenov et.al. | 2411.05691 | null |
2024-11-08 | Knowledge Distillation Neural Network for Predicting Car-following Behaviour of Human-driven and Autonomous Vehicles | Ayobami Adewale et.al. | 2411.05618 | null |
2024-11-08 | Towards Lifelong Few-Shot Customization of Text-to-Image Diffusion | Nan Song et.al. | 2411.05544 | null |
2024-11-07 | ZipNN: Lossless Compression for AI Models | Moshik Hershcovitch et.al. | 2411.05239 | link |
2024-11-07 | Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale | Flavio Di Palo et.al. | 2411.05045 | null |
2024-11-06 | From Word Vectors to Multimodal Embeddings: Techniques, Applications, and Future Directions For Large Language Models | Charles Zhang et.al. | 2411.05036 | null |
2024-11-07 | Towards Competitive Search Relevance For Inference-Free Learned Sparse Retrievers | Zhichao Geng et.al. | 2411.04403 | null |
2024-11-07 | GazeGen: Gaze-Driven User Interaction for Visual Content Generation | He-Yen Hsieh et.al. | 2411.04335 | null |
2024-11-06 | Towards Personalized Federated Learning via Comprehensive Knowledge Distillation | Pengju Wang et.al. | 2411.03569 | null |
2024-11-05 | Change Is the Only Constant: Dynamic LLM Slicing based on Layer Redundancy | Razvan-Gabriel Dumitru et.al. | 2411.03513 | link |
2024-11-05 | Transformer-Based Fault-Tolerant Control for Fixed-Wing UAVs Using Knowledge Distillation and In-Context Adaptation | Francisco Giral et.al. | 2411.02975 | null |
2024-11-05 | Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery | Bowei Du et.al. | 2411.02861 | null |
2024-11-05 | Brewing Vodka: Distilling Pure Knowledge for Lightweight Threat Detection in Audit Logs | Weiheng Wu et.al. | 2411.02775 | null |
2024-11-05 | Multimodal Commonsense Knowledge Distillation for Visual Question Answering | Shuo Yang et.al. | 2411.02722 | null |
2024-11-04 | Information plane and compression-gnostic feedback in quantum machine learning | Nathan Haboury et.al. | 2411.02313 | null |
2024-11-04 | Training on the Test Model: Contamination in Ranking Distillation | Vishakha Suresh Kalal et.al. | 2411.02284 | link |
2024-11-03 | Decoupling Dark Knowledge via Block-wise Logit Distillation for Feature-level Alignment | Chengting Yu et.al. | 2411.01547 | null |
2024-11-01 | On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance | Jaskirat Singh et.al. | 2411.00907 | null |
2024-11-01 | Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation | Bohan Lyu et.al. | 2411.00412 | null |
2024-11-01 | Towards Building Secure UAV Navigation with FHE-aware Knowledge Distillation | Arjun Ramesh Kaushik et.al. | 2411.00403 | null |
2024-11-01 | Efficient Model Compression for Bayesian Neural Networks | Diptarka Saha et.al. | 2411.00273 | null |
2024-10-31 | Semantic Knowledge Distillation for Onboard Satellite Earth Observation Image Classification | Thanh-Dung Le et.al. | 2411.00209 | link |
2024-10-31 | Mutual Information Preserving Neural Network Pruning | Charles Westphal et.al. | 2411.00147 | null |
2024-10-30 | Larger models yield better results? Streamlined severity classification of ADHD-related concerns using BERT-based knowledge distillation | Ahmed Akib Jawad Karim et.al. | 2411.00052 | null |
2024-10-30 | IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking | Run Luo et.al. | 2410.23907 | null |
2024-10-29 | ML Research Benchmark | Matthew Kenney et.al. | 2410.22553 | link |
2024-11-01 | Leveraging Recurrent Neural Networks for Predicting Motor Movements from Primate Motor Cortex Neural Recordings | Yuanxi Wang et.al. | 2410.22283 | null |
2024-10-28 | Unveiling Context-Aware Criteria in Self-Assessing LLMs | Taneesh Gupta et.al. | 2410.21545 | null |
2024-10-28 | Knowledge Distillation for Real-Time Classification of Early Media in Voice Communications | Kemal Altwlkany et.al. | 2410.21478 | null |
2024-10-31 | LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment | Ge Yang et.al. | 2410.21352 | link |
2024-10-28 | EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation | Shih-Yang Liu et.al. | 2410.21271 | null |
2024-10-28 | Deep Learning for Medical Text Processing: BERT Model Fine-Tuning and Comparative Study | Jiacheng Hu et.al. | 2410.20792 | null |
2024-10-28 | KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge Distillation | Rambod Azimi et.al. | 2410.20777 | link |
2024-10-28 | Data-Efficient Low-Complexity Acoustic Scene Classification via Distilling and Progressive Pruning | Bing Han et.al. | 2410.20775 | null |
2024-10-28 | Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA | Sangmin Bae et.al. | 2410.20672 | null |
2024-10-27 | Uncovering Capabilities of Model Pruning in Graph Contrastive Learning | Wu Junran et.al. | 2410.20356 | null |
2024-10-25 | A Survey of Small Language Models | Chien Van Nguyen et.al. | 2410.20011 | null |
2024-10-25 | GeoLLaVA: Efficient Fine-Tuned Vision-Language Models for Temporal Change Detection in Remote Sensing | Hosam Elgendy et.al. | 2410.19552 | link |
2024-10-25 | SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models | Jahyun Koo et.al. | 2410.19503 | null |
2024-10-24 | Tailored-LLaMA: Optimizing Few-Shot Learning in Pruned LLaMA Models with Task-Specific Prompts | Danyal Aftab et.al. | 2410.19185 | null |
2024-10-24 | AlignCap: Aligning Speech Emotion Captioning to Human Preferences | Ziqi Liang et.al. | 2410.19134 | null |
2024-10-24 | High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws | M. Emrullah Ildiz et.al. | 2410.18837 | null |
2024-10-24 | Knowledge Distillation Using Frontier Open-source LLMs: Generalizability and the Role of Synthetic Data | Anup Shirgaonkar et.al. | 2410.18588 | null |
2024-10-24 | SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoning | Shivam Adarsh et.al. | 2410.18574 | link |
2024-10-23 | ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams | Srija Anand et.al. | 2410.17901 | null |
2024-10-23 | Beware of Calibration Data for Pruning Large Language Models | Yixin Ji et.al. | 2410.17711 | null |
2024-10-23 | Towards Active Participant-Centric Vertical Federated Learning: Some Representations May Be All You Need | Jon Irureta et.al. | 2410.17648 | null |
2024-10-23 | Towards Effective Data-Free Knowledge Distillation via Diverse Diffusion Augmentation | Muquan Li et.al. | 2410.17606 | link |
2024-10-23 | Multimodal Information Bottleneck for Deep Reinforcement Learning with Multiple Sensors | Bang You et.al. | 2410.17551 | null |
2024-10-23 | Physics-driven AI for Channel Estimation in Cellular Network | Xiaoqian Qi et.al. | 2410.17525 | null |
2024-10-22 | MiniPLM: Knowledge Distillation for Pre-Training Language Models | Yuxian Gu et.al. | 2410.17215 | link |
2024-10-22 | Self-calibration for Language Model Quantization and Pruning | Miles Williams et.al. | 2410.17170 | null |
2024-10-22 | DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization | Haowei Zhu et.al. | 2410.16942 | null |
2024-10-22 | Mitigating Vanishing Activations in Deep CapsNets Using Channel Pruning | Siddharth Sahu et.al. | 2410.16908 | link |
2024-10-22 | CK4Gen: A Knowledge Distillation Framework for Generating High-Utility Synthetic Survival Datasets in Healthcare | Nicholas I-Hsien Kuo et.al. | 2410.16872 | null |
2024-10-22 | AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot Nuclei Detection via Visual-Language Pre-trained Models | Yongjian Wu et.al. | 2410.16820 | link |
2024-10-22 | SafetyAnalyst: Interpretable, transparent, and steerable LLM safety moderation | Jing-Jing Li et.al. | 2410.16665 | null |
2024-10-21 | Pre-training Distillation for Large Language Models: A Design Space Exploration | Hao Peng et.al. | 2410.16215 | null |
2024-10-18 | Interpreting Microbiome Relative Abundance Data Using Symbolic Regression | Swagatam Haldar et.al. | 2410.16109 | link |
2024-10-21 | Model Mimic Attack: Knowledge Distillation for Provably Transferable Adversarial Examples | Kirill Lukyanov et.al. | 2410.15889 | null |
2024-10-20 | GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning | Haiwen Diao et.al. | 2410.15266 | link |
2024-10-19 | LLaVA-Ultra: Large Chinese Language and Vision Assistant for Ultrasound | Xuechen Guo et.al. | 2410.15074 | null |
2024-10-19 | Improving Pronunciation and Accent Conversion through Knowledge Distillation And Synthetic Ground-Truth from Native TTS | Tuan Nam Nguyen et.al. | 2410.14997 | null |
2024-10-18 | EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search | Oliver Sieberling et.al. | 2410.14649 | link |
2024-10-18 | Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation | Shuai Zhao et.al. | 2410.14425 | link |
2024-10-18 | Preview-based Category Contrastive Learning for Knowledge Distillation | Muhe Ding et.al. | 2410.14143 | null |
2024-10-17 | Leveraging Fine-Tuned Language Models for Efficient and Accurate Smart Contract Auditing | Zhiyuan Wei et.al. | 2410.13918 | link |
2024-10-17 | An Active Learning Framework for Inclusive Generation by Large Language Models | Sabit Hassan et.al. | 2410.13641 | null |
2024-10-18 | Towards Satellite Non-IID Imagery: A Spectral Clustering-Assisted Federated Learning Approach | Luyao Zou et.al. | 2410.13602 | null |
2024-10-18 | Cyber Attacks Prevention Towards Prosumer-based EV Charging Stations: An Edge-assisted Federated Prototype Knowledge Distillation Approach | Luyao Zou et.al. | 2410.13260 | null |
2024-10-16 | TAS: Distilling Arbitrary Teacher and Student via a Hybrid Assistant | Guopeng Li et.al. | 2410.12342 | null |
2024-10-16 | Optimizing YOLOv5s Object Detection through Knowledge Distillation algorithm | Guanming Huang et.al. | 2410.12259 | null |
2024-10-16 | TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration | Yiwei Guo et.al. | 2410.12183 | link |
2024-10-17 | SAM-Guided Masked Token Prediction for 3D Scene Understanding | Zhimin Chen et.al. | 2410.12158 | null |
2024-10-15 | MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router | Yanyue Xie et.al. | 2410.12013 | null |
2024-10-15 | Breaking Modality Gap in RGBT Tracking: Coupled Knowledge Distillation | Andong Lu et.al. | 2410.11586 | link |
2024-10-15 | Learning from Imperfect Data: Towards Efficient Knowledge Distillation of Autoregressive Language Models for Text-to-SQL | Qihuang Zhong et.al. | 2410.11371 | null |
2024-10-15 | Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling | Wenda Xu et.al. | 2410.11325 | null |
2024-10-14 | ROSAR: An Adversarial Re-Training Framework for Robust Side-Scan Sonar Object Detection | Martin Aubard et.al. | 2410.10554 | link |
2024-10-14 | QIANets: Quantum-Integrated Adaptive Networks for Reduced Latency and Improved Inference Times in CNN Models | Zhumazhan Balapanov et.al. | 2410.10318 | link |
2024-10-14 | Temperature-Centric Investigation of Speculative Decoding with Knowledge Distillation | Siru Ouyang et.al. | 2410.10141 | null |
2024-10-15 | Edge Unlearning is Not "on Edge"! An Adaptive Exact Unlearning System on Resource-Constrained Devices | Xiaoyu Xia et.al. | 2410.10128 | link |
2024-10-14 | REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation | Zhiyun Song et.al. | 2410.10097 | null |
2024-10-12 | SLiM: One-shot Quantized Sparse Plus Low-rank Approximation of LLMs | Mohammad Mozaffari et.al. | 2410.09615 | link |
2024-10-12 | Distilling Invariant Representations with Dual Augmentation | Nikolaos Giakoumoglou et.al. | 2410.09474 | null |
2024-10-12 | Declarative Knowledge Distillation from Large Language Models for Visual Question Answering Datasets | Thomas Eiter et.al. | 2410.09428 | link |
2024-10-15 | Transforming In-Vehicle Network Intrusion Detection: VAE-based Knowledge Distillation Meets Explainable AI | Muhammet Anil Yagiz et.al. | 2410.09043 | null |
2024-10-11 | Mentor-KD: Making Small Language Models Better Multi-step Reasoners | Hojae Lee et.al. | 2410.09037 | link |
2024-10-11 | Contrastive Knowledge Distillation for Robust Multimodal Sentiment Analysis | Zhongyi Sang et.al. | 2410.08692 | null |
2024-10-11 | GAI-Enabled Explainable Personalized Federated Semi-Supervised Learning | Yubo Peng et.al. | 2410.08634 | null |
2024-10-11 | Simultaneous Reward Distillation and Preference Learning: Get You a Language Model Who Can Do Both | Abhijnan Nath et.al. | 2410.08458 | null |
2024-10-10 | What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias | Aida Mohammadshahi et.al. | 2410.08407 | null |
2024-10-10 | Non-transferable Pruning | Ruyi Ding et.al. | 2410.08015 | null |
2024-10-10 | A Lightweight Target-Driven Network of Stereo Matching for Inland Waterways | Jing Su et.al. | 2410.07915 | null |
2024-10-10 | SNN-PAR: Energy Efficient Pedestrian Attribute Recognition via Spiking Neural Networks | Haiyang Wang et.al. | 2410.07857 | link |
2024-10-12 | Relational Diffusion Distillation for Efficient Image Generation | Weilun Feng et.al. | 2410.07679 | link |
2024-10-10 | CrossQuant: A Post-Training Quantization Method with Smaller Quantization Kernel for Precise Large Language Model Compression | Wenyuan Liu et.al. | 2410.07505 | null |
2024-10-09 | Unlocking Real-Time Fluorescence Lifetime Imaging: Multi-Pixel Parallelism for FPGA-Accelerated Processing | Ismail Erbas et.al. | 2410.07364 | null |
2024-10-09 | S2HPruner: Soft-to-Hard Distillation Bridges the Discretization Gap in Pruning | Weihao Lin et.al. | 2410.07046 | null |
2024-10-09 | Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation | Runze Chen et.al. | 2410.06982 | null |
2024-10-09 | Efficient and Robust Knowledge Distillation from A Stronger Teacher Based on Correlation Matching | Wenqi Niu et.al. | 2410.06561 | null |
2024-10-08 | SpaLLM: Unified Compressive Adaptation of Large Language Models with Sketching | Tianyi Zhang et.al. | 2410.06364 | null |
2024-10-08 | QT-DoG: Quantization-aware Training for Domain Generalization | Saqib Javed et.al. | 2410.06020 | link |
2024-10-10 | KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server | Wenhao Wang et.al. | 2410.05725 | link |
2024-10-07 | Progressive distillation induces an implicit curriculum | Abhishek Panigrahi et.al. | 2410.05464 | null |
2024-10-07 | ESPACE: Dimensionality Reduction of Activations for Model Compression | Charbel Sakr et.al. | 2410.05437 | null |
2024-10-07 | ReasoningRank: Teaching Student Models to Rank through Reasoning-Based Knowledge Distillation | Yuelyu Ji et.al. | 2410.05168 | null |
2024-10-06 | CAPEEN: Image Captioning with Early Exits and Knowledge Distillation | Divya Jyoti Bajpai et.al. | 2410.04433 | link |
2024-10-06 | DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs | Divya Jyoti Bajpai et.al. | 2410.04424 | link |
2024-10-05 | Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution | Jianze Li et.al. | 2410.04224 | link |
2024-10-05 | Accelerating Diffusion Models with One-to-Many Knowledge Distillation | Linfeng Zhang et.al. | 2410.04191 | null |
2024-10-05 | DiDOTS: Knowledge Distillation from Large-Language-Models for Dementia Obfuscation in Transcribed Speech | Dominika Woszczyk et.al. | 2410.04188 | null |
2024-10-05 | Gap Preserving Distillation by Building Bidirectional Mappings with A Dynamic Teacher | Yong Guo et.al. | 2410.04140 | null |
2024-10-04 | Enhance Reasoning by Learning from Mistakes: Peer-Review Knowledge Distillation from Multiple Large Language Models | Zhuochun Li et.al. | 2410.03663 | null |
2024-10-04 | DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models | Sungnyun Kim et.al. | 2410.03061 | null |
2024-10-03 | Geometry is All You Need: A Unified Taxonomy of Matrix and Tensor Factorization for Compression of Generative Language Models | Mingxue Xu et.al. | 2410.03040 | null |
2024-10-03 | Dataset Distillation via Knowledge Distillation: Towards Efficient Self-Supervised Pre-Training of Deep Networks | Siddharth Joshi et.al. | 2410.02116 | null |
2024-10-02 | Review Non-convex Optimization Method for Machine Learning | Greg B Fotopoulos et.al. | 2410.02017 | null |
2024-10-02 | PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation | Mike Ranzinger et.al. | 2410.01680 | null |
2024-10-04 | HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models | Seanie Lee et.al. | 2410.01524 | link |
2024-10-02 | Foldable SuperNets: Scalable Merging of Transformers with Different Initializations and Tasks | Edan Kinderman et.al. | 2410.01483 | link |
2024-10-02 | PairDistill: Pairwise Relevance Distillation for Dense Retrieval | Chao-Wei Huang et.al. | 2410.01383 | link |
2024-10-02 | "No Matter What You Do!": Mitigating Backdoor Attacks in Graph Neural Networks | Jiale Zhang et.al. | 2410.01272 | link |
2024-10-01 | Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging | Ismail Erbas et.al. | 2410.00948 | null |
2024-10-01 | Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading | Mostafa Hajighasemloua et.al. | 2410.00779 | null |
2024-10-01 | Efficient Technical Term Translation: A Knowledge Distillation Approach for Parenthetical Terminology Translation | Jiyoon Myung et.al. | 2410.00683 | null |
2024-10-01 | AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation | Ziyang Luo et.al. | 2410.00558 | link |
2024-10-01 | Self-Updatable Large Language Models with Parameter Integration | Yu Wang et.al. | 2410.00487 | null |
2024-09-30 | Enhancing Romanian Offensive Language Detection through Knowledge Distillation, Multi-Task Learning, and Data Augmentation | Vlad-Cristian Matei et.al. | 2409.20498 | null |
2024-10-02 | Linear Projections of Teacher Embeddings for Few-Class Distillation | Noel Loo et.al. | 2409.20449 | null |
2024-09-30 | Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies | Shalini Sarode et.al. | 2409.20237 | null |
2024-09-30 | Aggressive Post-Training Compression on Extremely Large Language Models | Zining Zhang et.al. | 2409.20094 | null |
2024-10-01 | HYDRA-FL: Hybrid Knowledge Distillation for Robust and Accurate Federated Learning | Momin Ahmad Khan et.al. | 2409.19912 | null |
2024-09-29 | Tailored Federated Learning: Leveraging Direction Regulation & Knowledge Distillation | Huidong Tang et.al. | 2409.19741 | null |
2024-09-29 | InfantCryNet: A Data-driven Framework for Intelligent Analysis of Infant Cries | Mengze Hong et.al. | 2409.19689 | null |
2024-09-28 | Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse Training | Pihe Hu et.al. | 2409.19391 | null |
2024-09-28 | Mind the Gap: Promoting Missing Modality Brain Tumor Segmentation with Alignment | Tianyi Liu et.al. | 2409.19366 | null |
2024-09-27 | Semi-Supervised Bone Marrow Lesion Detection from Knee MRI Segmentation Using Mask Inpainting Models | Shihua Qin et.al. | 2409.19185 | null |
2024-09-27 | MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge Distillation | Junyou Zhu et.al. | 2409.18800 | null |
2024-09-27 | Student-Oriented Teacher Knowledge Refinement for Knowledge Distillation | Chaomin Shen et.al. | 2409.18785 | null |
2024-09-27 | Harmonizing knowledge Transfer in Neural Network with Unified Distillation | Yaomin Huang et.al. | 2409.18565 | null |
2024-09-27 | Towards Diverse Device Heterogeneous Federated Learning via Task Arithmetic Knowledge Integration | Mahdi Morafah et.al. | 2409.18461 | link |
2024-09-26 | EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation | Jiaxiang Tang et.al. | 2409.18114 | null |
2024-09-26 | Weak-To-Strong Backdoor Attacks for LLMs with Contrastive Knowledge Distillation | Shuai Zhao et.al. | 2409.17946 | null |
2024-09-26 | Kendall's |
Yuchen Guan et.al. | 2409.17823 | null |
2024-09-26 | General Compression Framework for Efficient Transformer Object Tracking | Lingyi Hong et.al. | 2409.17564 | null |
2024-09-26 | Shape-intensity knowledge distillation for robust medical image segmentation | Wenhui Dong et.al. | 2409.17503 | link |
2024-09-25 | Search for Efficient Large Language Models | Xuan Shen et.al. | 2409.17372 | link |
2024-09-25 | MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events | Xiaoyu Yang et.al. | 2409.17010 | null |
2024-09-25 | Adverse Weather Optical Flow: Cumulative Homogeneous-Heterogeneous Adaptation | Hanyu Zhou et.al. | 2409.17001 | null |
2024-09-25 | SelectiveKD: A semi-supervised framework for cancer detection in DBT through Knowledge Distillation and Pseudo-labeling | Laurent Dillard et.al. | 2409.16581 | null |
2024-09-24 | AIM 2024 Challenge on UHD Blind Photo Quality Assessment | Vlad Hosu et.al. | 2409.16271 | null |
2024-09-25 | Privacy Evaluation Benchmarks for NLP Models | Wei Huang et.al. | 2409.15868 | link |
2024-09-24 | Twin Network Augmentation: A Novel Training Strategy for Improved Spiking Neural Networks and Efficient Weight Quantization | Lucas Deckers et.al. | 2409.15849 | null |
2024-09-23 | TS-TCD: Triplet-Level Cross-Modal Distillation for Time-Series Forecasting Using Large Language Models | Pengfei Wang et.al. | 2409.14978 | null |
2024-09-23 | DSG-KD: Knowledge Distillation from Domain-Specific to General Language Models | Sangyeon Cho et.al. | 2409.14904 | link |
2024-09-23 | Pre-trained Language Model and Knowledge Distillation for Lightweight Sequential Recommendation | Li Li et.al. | 2409.14810 | null |
2024-09-23 | An Adverse Weather-Immune Scheme with Unfolded Regularization and Foundation Model Knowledge Distillation for Street Scene Understanding | Wei-Bin Kou et.al. | 2409.14737 | null |
2024-09-18 | Applications of Knowledge Distillation in Remote Sensing: A Survey | Yassine Himeur et.al. | 2409.12111 | null |
2024-09-18 | Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction | Jin Jie Sean Yeo et.al. | 2409.11964 | null |
2024-09-18 | Distillation-free Scaling of Large SSMs for Images and Videos | Hamid Suleman et.al. | 2409.11867 | null |
2024-09-18 | EFCM: Efficient Fine-tuning on Compressed Models for deployment of large models in medical image analysis | Shaojie Li et.al. | 2409.11817 | null |
2024-09-18 | RUIE: Retrieval-based Unified Information Extraction using Large Language Model | Xincheng Liao et.al. | 2409.11673 | null |
2024-09-17 | Time-Series Forecasting, Knowledge Distillation, and Refinement within a Multimodal PDE Foundation Model | Derek Jollie et.al. | 2409.11609 | link |
2024-09-17 | Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation | Rui Yu et.al. | 2409.11018 | null |
2024-09-17 | Single-stage TTS with Masked Audio Token Modeling and Semantic Knowledge Distillation | Gerard I. Gállego et.al. | 2409.11003 | null |
2024-09-16 | Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning | Amin Karimi Monsefi et.al. | 2409.10362 | null |
2024-09-16 | Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference | Huy-Dung Nguyen et.al. | 2409.10095 | null |
2024-09-15 | ELSA: Exploiting Layer-wise N:M Sparsity for Vision Transformer Acceleration | Ning-Chi Huang et.al. | 2409.09708 | null |
2024-09-14 | Effective Pre-Training of Audio Transformers for Sound Event Detection | Florian Schmid et.al. | 2409.09546 | link |
2024-09-14 | Integrated Multi-Level Knowledge Distillation for Enhanced Speaker Verification | Wenhao Yang et.al. | 2409.09389 | null |
2024-09-14 | Joint Semantic Knowledge Distillation and Masked Acoustic Modeling for Full-band Speech Restoration with Improved Intelligibility | Xiaoyu Liu et.al. | 2409.09357 | null |
2024-09-13 | Exploring System-Heterogeneous Federated Learning with Dynamic Model Selection | Dixi Yao et.al. | 2409.08858 | null |
2024-09-13 | An Efficient Privacy-aware Split Learning Framework for Satellite Communications | Jianfei Sun et.al. | 2409.08538 | null |
2024-09-13 | AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation | Zechao Sun et.al. | 2409.08516 | null |
2024-09-12 | DiReDi: Distillation and Reverse Distillation for AIoT Applications | Chen Sun et.al. | 2409.08308 | null |
2024-09-12 | Ruri: Japanese General Text Embeddings | Hayato Tsukagoshi et.al. | 2409.07737 | link |
2024-09-12 | Learn from Balance: Rectifying Knowledge Transfer for Long-Tailed Scenarios | Xinlei Huang et.al. | 2409.07694 | null |
2024-09-11 | DS-ViT: Dual-Stream Vision Transformer for Cross-Task Distillation in Alzheimer's Early Diagnosis | Ke Chen et.al. | 2409.07584 | null |
2024-09-11 | EchoDFKD: Data-Free Knowledge Distillation for Cardiac Ultrasound Segmentation using Synthetic Data | Grégoire Petit et.al. | 2409.07566 | null |
2024-09-11 | NVRC: Neural Video Representation Compression | Ho Man Kwan et.al. | 2409.07414 | null |
2024-09-11 | Enhancing CTC-Based Visual Speech Recognition | Hendrik Laux et.al. | 2409.07210 | null |
2024-09-11 | A Continual and Incremental Learning Approach for TinyML On-device Training Using Dataset Distillation and Model Size Adaption | Marcus Rüb et.al. | 2409.07114 | null |
2024-09-11 | Privacy-Preserving Federated Learning with Consistency via Knowledge Distillation Using Conditional Generator | Kangyang Luo et.al. | 2409.06955 | null |
2024-09-10 | Applied Federated Model Personalisation in the Industrial Domain: A Comparative Study | Ilias Siniosoglou et.al. | 2409.06904 | null |
2024-09-10 | EasyST: A Simple Framework for Spatio-Temporal Prediction | Jiabin Tang et.al. | 2409.06748 | link |
2024-09-10 | SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation | Teng Hu et.al. | 2409.06633 | null |
2024-09-10 | Knowledge Distillation via Query Selection for Detection Transformer | Yi Liu et.al. | 2409.06443 | null |
2024-09-10 | Distilling Generative-Discriminative Representations for Very Low-Resolution Face Recognition | Junzheng Zhang et.al. | 2409.06371 | null |
2024-09-10 | Enhancing Long Video Understanding via Hierarchical Event-Based Memory | Dingxin Cheng et.al. | 2409.06299 | null |
2024-09-09 | Joint Input and Output Coordination for Class-Incremental Learning | Shuai Wang et.al. | 2409.05620 | null |
2024-09-09 | LEROjD: Lidar Extended Radar-Only Object Detection | Patrick Palmer et.al. | 2409.05564 | link |
2024-09-09 | Federated Transfer Learning Based Cooperative Wideband Spectrum Sensing with Model Pruning | Jibin Jia et.al. | 2409.05462 | null |
2024-09-09 | Look One and More: Distilling Hybrid Order Relational Knowledge for Cross-Resolution Image Recognition | Shiming Ge et.al. | 2409.05384 | null |
2024-09-09 | Application Specific Compression of Deep Learning Models | Rohit Raj Rai et.al. | 2409.05368 | link |
2024-09-09 | FedBrain-Distill: Communication-Efficient Federated Brain Tumor Classification Using Ensemble Knowledge Distillation on Non-IID Data | Rasoul Jafari Gohari et.al. | 2409.05359 | link |
2024-09-08 | Ultron: Enabling Temporal Geometry Compression of 3D Mesh Sequences using Temporal Correspondence and Mesh Deformation | Haichao Zhu et.al. | 2409.05151 | null |
2024-09-07 | LoCa: Logit Calibration for Knowledge Distillation | Runming Yang et.al. | 2409.04778 | null |
2024-09-06 | SCARF: Scalable Continual Learning Framework for Memory-efficient Multiple Neural Radiance Fields | Yuze Wang et.al. | 2409.04482 | null |
2024-09-05 | Experimentation in Content Moderation using RWKV | Umut Yildirim et.al. | 2409.03939 | null |
2024-09-05 | DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture | Qianlong Xiang et.al. | 2409.03550 | null |
2024-09-05 | Data-free Distillation with Degradation-prompt Diffusion for Multi-weather Image Restoration | Pei Wang et.al. | 2409.03455 | null |
2024-09-05 | Efficient Image Compression Using Advanced State Space Models | Bouzid Arezki et.al. | 2409.02743 | null |
2024-09-04 | CLDA: Collaborative Learning for Enhanced Unsupervised Domain Adaptation | Minhee Cho et.al. | 2409.02699 | null |
2024-09-04 | Low-Resolution Object Recognition with Cross-Resolution Relational Contrastive Distillation | Kangkai Zhang et.al. | 2409.02555 | null |
2024-09-04 | A design of magnetic tunnel junctions for the deployment of neuromorphic hardware for edge computing | Davi Rodrigues et.al. | 2409.02528 | null |
2024-09-04 | Non-target Divergence Hypothesis: Toward Understanding Domain Gaps in Cross-Modal Knowledge Distillation | Yilong Chen et.al. | 2409.02438 | null |
2024-09-03 | Low-Resolution Face Recognition via Adaptable Instance-Relation Distillation | Ruixin Shi et.al. | 2409.02049 | null |
2024-09-03 | Foundations of Large Language Model Compression -- Part 1: Weight Quantization | Sean I. Young et.al. | 2409.02026 | link |
2024-09-03 | Efficient Point Cloud Classification via Offline Distillation Framework and Negative-Weight Self-Distillation Technique | Qiang Zheng et.al. | 2409.02020 | null |
2024-09-03 | Contemporary Model Compression on Large Language Models Inference | Dong Liu et.al. | 2409.01990 | null |
2024-09-03 | Adaptive Explicit Knowledge Transfer for Knowledge Distillation | Hyungkeun Park et.al. | 2409.01679 | null |
2024-08-30 | How Knowledge Distillation Mitigates the Synthetic Gap in Fair Face Recognition | Pedro C. Neto et.al. | 2408.17399 | link |
2024-08-30 | HiTSR: A Hierarchical Transformer for Reference-based Super-Resolution | Masoomeh Aslahishahri et.al. | 2408.16959 | link |
2024-08-29 | VLM-KD: Knowledge Distillation from VLM for Long-Tail Visual Recognition | Zaiwei Zhang et.al. | 2408.16930 | null |
2024-08-29 | Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling | Hritik Bansal et.al. | 2408.16737 | null |
2024-08-29 | MST-KD: Multiple Specialized Teachers Knowledge Distillation for Fair Face Recognition | Eduarda Caldeira et.al. | 2408.16563 | link |
2024-08-29 | Convolutional Neural Network Compression Based on Low-Rank Decomposition | Yaping He et.al. | 2408.16289 | null |
2024-08-28 | LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation | Fangxun Shu et.al. | 2408.15881 | link |
2024-08-28 | ModalityMirror: Improving Audio Classification in Modality Heterogeneity Federated Learning with Multimodal Distillation | Tiantian Feng et.al. | 2408.15803 | null |
2024-08-28 | Online pre-training with long-form videos | Itsuki Kato et.al. | 2408.15651 | null |
2024-08-28 | Boosting Lossless Speculative Decoding via Feature Sampling and Partial Alignment Distillation | Lujun Gui et.al. | 2408.15562 | null |
2024-08-27 | Leveraging Self-supervised Audio Representations for Data-Efficient Acoustic Scene Classification | Yiqiang Cai et.al. | 2408.14862 | link |
2024-08-27 | Learning effective pruning at initialization from iterative pruning | Shengkai Liu et.al. | 2408.14757 | link |
2024-08-26 | Bridging the Gap: Unpacking the Hidden Challenges in Knowledge Distillation for Online Ranking Systems | Nikhil Khani et.al. | 2408.14678 | null |
2024-08-25 | Variational autoencoder-based neural network model compression | Liang Cheng et.al. | 2408.14513 | null |
2024-08-26 | TSAK: Two-Stage Semantic-Aware Knowledge Distillation for Efficient Wearable Modality and Model Optimization in Manufacturing Lines | Hymalai Bello et.al. | 2408.14146 | null |
2024-08-27 | GenFormer -- Generated Images are All You Need to Improve Robustness of Transformers on Small Datasets | Sven Oehri et.al. | 2408.14131 | link |
2024-08-26 | Let Video Teaches You More: Video-to-Image Knowledge Distillation using DEtection TRansformer for Medical Video Lesion Detection | Yuncheng Jiang et.al. | 2408.14051 | null |
2024-08-25 | Condensed Sample-Guided Model Inversion for Knowledge Distillation | Kuluhan Binici et.al. | 2408.13850 | null |
2024-08-25 | Bring the Power of Diffusion Model to Defect Detection | Xuyi Yu et.al. | 2408.13845 | null |
2024-08-24 | Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic | Yifei He et.al. | 2408.13656 | link |
2024-08-24 | MPruner: Optimizing Neural Network Size with CKA-Based Mutual Information Pruning | Seungbeom Hu et.al. | 2408.13482 | null |
2024-08-23 | Growing Deep Neural Network Considering with Similarity between Neurons | Taigo Sakai et.al. | 2408.13291 | null |
2024-08-23 | Foundational Model for Electron Micrograph Analysis: Instruction-Tuning Small-Scale Language-and-Vision Assistant for Enterprise Adoption | Sakhinana Sagar Srinivas et.al. | 2408.13248 | null |
2024-08-23 | A Web-Based Solution for Federated Learning with LLM-Based Automation | Chamith Mawela et.al. | 2408.13010 | null |
2024-08-23 | A Survey on Drowsiness Detection -- Modern Applications and Methods | Biying Fu et.al. | 2408.12990 | null |
2024-08-22 | Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers | Sayed Mohammad Vakilzadeh Hatefi et.al. | 2408.12568 | link |
2024-08-22 | Interactive DualChecker for Mitigating Hallucinations in Distilling Large Language Models | Meiyun Wang et.al. | 2408.12326 | link |
2024-08-22 | Rebalancing Multi-Label Class-Incremental Learning | Kaile Du et.al. | 2408.12161 | null |
2024-08-22 | Vision-Based Detection of Uncooperative Targets and Components on Small Satellites | Hannah Grauer et.al. | 2408.12084 | null |
2024-08-22 | Aligning (Medical) LLMs for (Counterfactual) Fairness | Raphael Poulain et.al. | 2408.12055 | link |
2024-08-22 | LAKD-Activation Mapping Distillation Based on Local Learning | Yaoze Zhang et.al. | 2408.11478 | null |
2024-08-21 | A Practical Trigger-Free Backdoor Attack on Neural Networks | Jiahao Wang et.al. | 2408.11444 | null |
2024-08-21 | Pano2Room: Novel View Synthesis from a Single Indoor Panorama | Guo Pu et.al. | 2408.11413 | link |
2024-08-21 | Domain-invariant Progressive Knowledge Distillation for UAV-based Object Detection | Liang Yao et.al. | 2408.11407 | null |
2024-08-21 | A Unified Framework for Continual Learning and Machine Unlearning | Romit Chatterjee et.al. | 2408.11374 | null |
2024-08-20 | SAM-COD: SAM-guided Unified Framework for Weakly-Supervised Camouflaged Object Detection | Huafeng Chen et.al. | 2408.10760 | null |
2024-08-20 | Generating Synthetic Fair Syntax-agnostic Data by Learning and Distilling Fair Representation | Md Fahim Sikder et.al. | 2408.10755 | null |
2024-08-20 | Fine-Tuning and Deploying Large Language Models Over Edges: Issues and Approaches | Yanjie Dong et.al. | 2408.10691 | null |
2024-08-20 | LLM-Barber: Block-Aware Rebuilder for Sparsity Mask in One-Shot for Large Language Models | Yupeng Su et.al. | 2408.10631 | link |
2024-08-20 | Adaptive Knowledge Distillation for Classification of Hand Images using Explainable Vision Transformers | Thanh Thi Nguyen et.al. | 2408.10503 | null |
2024-08-19 | Transferring Backdoors between Large Language Models by Knowledge Distillation | Pengzhou Cheng et.al. | 2408.09878 | link |
2024-08-20 | MoDeGPT: Modular Decomposition for Large Language Model Compression | Chi-Heng Lin et.al. | 2408.09632 | null |
2024-08-18 | MedMAP: Promoting Incomplete Multi-modal Brain Tumor Segmentation with Alignment | Tianyi Liu et.al. | 2408.09465 | null |
2024-08-18 | CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination | Kaicheng Yang et.al. | 2408.09441 | null |
2024-08-18 | OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras | Muhammad Rameez Ur Rahman et.al. | 2408.09424 | link |
2024-08-17 | RepControlNet: ControlNet Reparameterization | Zhaoli Deng et.al. | 2408.09240 | null |
2024-08-16 | Multi Teacher Privileged Knowledge Distillation for Multimodal Expression Recognition | Muhammad Haseeb Aslam et.al. | 2408.09035 | link |
2024-08-16 | Research on Personalized Compression Algorithm for Pre-trained Models Based on Homomorphic Entropy Increase | Yicong Li et.al. | 2408.08684 | null |
2024-08-16 | ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models | Chao Zeng et.al. | 2408.08554 | link |
2024-08-15 | Computer Vision Model Compression Techniques for Embedded Systems: A Survey | Alexandre Lopes et.al. | 2408.08250 | link |
2024-08-15 | MIDAS: Multi-level Intent, Domain, And Slot Knowledge Distillation for Multi-turn NLU | Yan Li et.al. | 2408.08144 | null |
2024-08-19 | Knowledge Distillation with Refined Logits | Wujie Sun et.al. | 2408.07703 | link |
2024-08-14 | FedQUIT: On-Device Federated Unlearning via a Quasi-Competent Virtual Teacher | Alessio Mora et.al. | 2408.07587 | null |
2024-08-14 | Towards Real-time Video Compressive Sensing on Mobile Devices | Miao Cao et.al. | 2408.07530 | link |
2024-08-14 | One Step Diffusion-based Super-Resolution with Time-Aware Distillation | Xiao He et.al. | 2408.07476 | link |
2024-08-14 | Infra-YOLO: Efficient Neural Network Structure with Model Compression for Real-Time Infrared Small Object Detection | Zhonglin Chen et.al. | 2408.07455 | null |
2024-08-13 | Using Advanced LLMs to Enhance Smaller LLMs: An Interpretable Knowledge Distillation Approach | Tong Wang et.al. | 2408.07238 | null |
2024-08-15 | An Event Structure-aware Generative Model for Biomedical Event Extraction | Haohan Yuan et.al. | 2408.06583 | null |
2024-08-12 | Optimizing Vision Transformers with Data-Free Knowledge Transfer | Gousia Habib et.al. | 2408.05952 | null |
2024-08-11 | Low-Dimensional Federated Knowledge Graph Embedding via Knowledge Distillation | Xiaoxiong Zhang et.al. | 2408.05748 | null |
2024-08-11 | Efficient Federated Learning Using Dynamic Update and Adaptive Pruning with Momentum on Shared Server Data | Ji Liu et.al. | 2408.05678 | null |
2024-08-08 | LaDiMo: Layer-wise Distillation Inspired MoEfier | Sungyoon Kim et.al. | 2408.04278 | null |
2024-08-08 | Distil-DCCRN: A Small-footprint DCCRN Leveraging Feature-based Knowledge Distillation in Speech Enhancement | Runduo Han et.al. | 2408.04267 | null |
2024-08-14 | ComKD-CLIP: Comprehensive Knowledge Distillation for Contrastive Language-Image Pre-traning Model | Yifan Chen et.al. | 2408.04145 | null |
2024-08-07 | AdapMTL: Adaptive Pruning Framework for Multitask Learning Model | Mingcan Xiang et.al. | 2408.03913 | null |
2024-08-07 | Dual-Modeling Decouple Distillation for Unsupervised Anomaly Detection | Xinyue Liu et.al. | 2408.03888 | null |
2024-08-07 | Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields | Joo Chan Lee et.al. | 2408.03822 | null |
2024-08-07 | Iterative Knowledge Distillation through Feedback-Driven Learning Cycles | Yujia Chen et.al. | 2408.03680 | null |
2024-08-07 | Real-time Event Recognition of Long-distance Distributed Vibration Sensing with Knowledge Distillation and Hardware Acceleration | Zhongyao Luo et.al. | 2408.03647 | link |
2024-08-07 | Distillation Learning Guided by Image Reconstruction for One-Shot Medical Image Segmentation | Feng Zhou et.al. | 2408.03616 | link |
2024-08-06 | EEGMobile: Enhancing Speed and Accuracy in EEG-Based Gaze Prediction with Advanced Mobile Architectures | Teng Liang et.al. | 2408.03449 | link |
2024-08-06 | DopQ-ViT: Towards Distribution-Friendly and Outlier-Aware Post-Training Quantization for Vision Transformers | Lianwei Yang et.al. | 2408.03291 | null |
2024-08-06 | Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments | Angie Boggust et.al. | 2408.03274 | null |
2024-08-06 | Leveraging Entity Information for Cross-Modality Correlation Learning: The Entity-Guided Multimodal Summarization | Yanghai Zhang et.al. | 2408.03149 | link |
2024-08-06 | Inference Optimizations for Large Language Models: Effects, Challenges, and Practical Considerations | Leo Donisch et.al. | 2408.03130 | null |
2024-08-06 | Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression | Jonas Schmitt et.al. | 2408.03046 | link |
2024-08-06 | VizECGNet: Visual ECG Image Network for Cardiovascular Diseases Classification with Multi-Modal Training and Knowledge Distillation | Ju-Hyeon Nam et.al. | 2408.02888 | null |
2024-08-05 | An approach to optimize inference of the DIART speaker diarization pipeline | Roman Aperdannier et.al. | 2408.02341 | null |
2024-08-05 | Low-Cost Self-Ensembles Based on Multi-Branch Transformation and Grouped Convolution | Hojung Lee et.al. | 2408.02307 | link |
2024-08-05 | Unsupervised Domain Adaption Harnessing Vision-Language Pre-training | Wenlve Zhou et.al. | 2408.02192 | link |
2024-08-03 | Joint Model Pruning and Resource Allocation for Wireless Time-triggered Federated Learning | Xinlu Zhang et.al. | 2408.01765 | null |
2024-08-02 | An Adaptive Tensor-Train Decomposition Approach for Efficient Deep Neural Network Compression | Shiyi Luo et.al. | 2408.01534 | null |
2024-08-02 | Exploiting the Semantic Knowledge of Pre-trained Text-Encoders for Continual Learning | Lu Yu et.al. | 2408.01076 | link |
2024-08-02 | Tensor Train Low-rank Approximation (TT-LoRA): Democratizing AI with Accelerated LLMs | Afia Anjum et.al. | 2408.01008 | null |
2024-08-01 | DistillGrasp: Integrating Features Correlation with Knowledge Distillation for Depth Completion of Transparent Objects | Yiheng Huang et.al. | 2408.00337 | null |
2024-08-01 | Clover-2: Accurate Inference for Regressive Lightweight Speculative Decoding | Bin Xiao et.al. | 2408.00264 | null |
2024-08-01 | Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation | Kohei Matsuura et.al. | 2408.00205 | null |
2024-07-31 | StyleRF-VolVis: Style Transfer of Neural Radiance Fields for Expressive Volume Visualization | Kaiyuan Tang et.al. | 2408.00150 | null |
2024-08-02 | Gemma 2: Improving Open Language Models at a Practical Size | Gemma Team et.al. | 2408.00118 | null |
2024-07-31 | Dynamic Object Queries for Transformer-based Incremental Object Detection | Jichuan Zhang et.al. | 2407.21687 | null |
2024-07-31 | Learning Effective Representations for Retrieval Using Self-Distillation with Adaptive Relevance Margins | Lukas Gienapp et.al. | 2407.21515 | null |
2024-07-31 | VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Lifelong Learning | Yuhang Ming et.al. | 2407.21416 | null |
2024-07-31 | Lifelong Person Search | Jae-Won Yang et.al. | 2407.21252 | null |
2024-07-29 | SalNAS: Efficient Saliency-prediction Neural Architecture Search with self-knowledge distillation | Chakkrit Termritthikun et.al. | 2407.20062 | link |
2024-07-29 | ActivityCLIP: Enhancing Group Activity Recognition by Mining Complementary Information from Text to Supplement Image Modality | Guoliang Xu et.al. | 2407.19820 | null |
2024-07-29 | Realizing Unaligned Block-wise Pruning for DNN Acceleration on Mobile Devices | Hayun Lee et.al. | 2407.19644 | null |
2024-07-28 | Mixture of Modular Experts: Distilling Knowledge from a Multilingual Teacher into Specialized Modular Language Models | Mohammed Al-Maamari et.al. | 2407.19610 | link |
2024-07-28 | Overcoming Uncertain Incompleteness for Robust Multimodal Sequential Diagnosis Prediction via Knowledge Distillation and Random Data Erasing | Heejoon Koo et.al. | 2407.19540 | null |
2024-07-28 | LLAVADI: What Matters For Multimodal Large Language Models Distillation | Shilin Xu et.al. | 2407.19409 | null |
2024-07-28 | Logic Distillation: Learning from Code Function by Function for Planning and Decision-making | Dong Chen et.al. | 2407.19405 | null |
2024-07-27 | Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network | Gang Pan et.al. | 2407.19271 | null |
2024-07-26 | Automatic Detection of Moral Values in Music Lyrics | Vjosa Preniqi et.al. | 2407.18787 | link |
2024-07-26 | Boosting Cross-Domain Point Classification via Distilling Relational Priors from 2D Transformers | Longkun Zou et.al. | 2407.18534 | link |
2024-07-26 | FedUD: Exploiting Unaligned Data for Cross-Platform Federated Click-Through Rate Prediction | Wentao Ouyang et.al. | 2407.18472 | null |
2024-07-26 | Towards A Generalizable Pathology Foundation Model via Unified Knowledge Distillation | Jiabo Ma et.al. | 2407.18449 | null |
2024-07-25 | Leveraging Foundation Models via Knowledge Distillation in Multi-Object Tracking: Distilling DINOv2 Features to FairMOT | Niels G. Faber et.al. | 2407.18288 | link |
2024-07-25 | Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning | Tianduo Wang et.al. | 2407.18248 | link |
2024-07-25 | How to Train the Teacher Model for Effective Knowledge Distillation | Shayan Mohajer Hamidi et.al. | 2407.18041 | link |
2024-07-25 | Peak-Controlled Logits Poisoning Attack in Federated Distillation | Yuhan Tang et.al. | 2407.18039 | null |
2024-07-25 | Separating Novel Features for Logical Anomaly Detection: A Straightforward yet Effective Approach | Kangil Lee et.al. | 2407.17909 | null |
2024-07-25 | NC-NCD: Novel Class Discovery for Node Classification | Yue Hou et.al. | 2407.17816 | link |
2024-07-24 | CoMoTo: Unpaired Cross-Modal Lesion Distillation Improves Breast Lesion Detection in Tomosynthesis | Muhammad Alberb et.al. | 2407.17620 | link |
2024-07-24 | (PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork | Tianjin Huang et.al. | 2407.17412 | null |
2024-07-23 | Strike a Balance in Continual Panoptic Segmentation | Jinpeng Chen et.al. | 2407.16354 | link |
2024-07-23 | OriGen:Enhancing RTL Code Generation with Code-to-Code Augmentation and Self-Reflection | Fan Cui et.al. | 2407.16237 | link |
2024-07-23 | DDK: Distilling Domain Knowledge for Efficient Large Language Models | Jiaheng Liu et.al. | 2407.16154 | null |