Usage instructions: here
Table of Contents
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-09-22 | One Model for Two Tasks: Cooperatively Recognizing and Recovering Low-Resolution Scene Text Images by Iterative Mutual Guidance | Minyi Zhao et.al. | 2409.14483 | null |
2024-08-04 | LEGO: Self-Supervised Representation Learning for Scene Text Images | Yujin Ren et.al. | 2408.02036 | null |
2024-07-28 | WeCromCL: Weakly Supervised Cross-Modality Contrastive Learning for Transcription-only Supervised Text Spotting | Jingjing Wu et.al. | 2407.19507 | null |
2024-07-23 | CLII: Visual-Text Inpainting via Cross-Modal Predictive Interaction | Liang Zhao et.al. | 2407.16204 | null |
2024-09-15 | Layout Agnostic Scene Text Image Synthesis with Diffusion Models | Qilong Zhangli et.al. | 2406.01062 | null |
2024-05-19 | The First Swahili Language Scene Text Detection and Recognition Dataset | Fadila Wendigoundi Douamba et.al. | 2405.11437 | link |
2024-05-07 | Choose What You Need: Disentangled Representation Learning for Scene Text Recognition, Removal and Editing | Boqiang Zhang et.al. | 2405.04377 | null |
2024-03-20 | Efficient scene text image super-resolution with semantic guidance | LeoWu TomyEnrique et.al. | 2403.13330 | link |
2024-08-01 | Text Image Inpainting via Global Structure-Guided Diffusion Models | Shipeng Zhu et.al. | 2401.14832 | link |
2023-12-25 | Word length-aware text spotting: Enhancing detection and recognition in dense text image | Hao Wang et.al. | 2312.15690 | null |
2023-12-19 | Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model | Lingjun Zhang et.al. | 2312.12232 | link |
2024-01-05 | Research on Multilingual Natural Scene Text Detection Algorithm | Tao Wang et.al. | 2312.11153 | null |
2024-07-23 | PEAN: A Diffusion-Based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution | Zuoyan Zhao et.al. | 2311.17955 | link |
2023-11-22 | Recognition-Guided Diffusion Model for Scene Text Image Super-Resolution | Yuxuan Zhou et.al. | 2311.13317 | null |
2023-12-22 | Scene Text Image Super-resolution based on Text-conditional Diffusion Models | Chihiro Noguchi et.al. | 2311.09759 | link |
2023-09-21 | SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap | Daehee Kim et.al. | 2309.12382 | null |
2023-03-26 | Learning Generative Structure Prior for Blind Text Image Super-resolution | Xiaoming Li et.al. | 2303.14726 | link |
2023-08-18 | Self-supervised Character-to-Character Distillation for Text Recognition | Tongkun Guan et.al. | 2211.00288 | link |
2022-10-13 | Scene Text Image Super-Resolution via Content Perceptual Loss and Criss-Cross Transformer Blocks | Rui Qin et.al. | 2210.06924 | null |
2020-08-02 | Scene Text Image Super-Resolution in the Wild | Wenjia Wang et.al. | 2005.03341 | link |
2019-10-20 | TextSR: Content-Aware Text Super-Resolution Guided by Recognition | Wenjia Wang et.al. | 1909.07113 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-11-28 | Stochastic Frequency Fluctuation Super-Resolution Imaging | Yifan Chen et.al. | 2411.19369 | null |
2024-11-27 | FaithDiff: Unleashing Diffusion Priors for Faithful Image Super-resolution | Junyang Chen et.al. | 2411.18824 | null |
2024-11-27 | HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior | Li-Yuan Tsao et.al. | 2411.18662 | link |
2024-11-27 | Uncertainty-driven Sampling for Efficient Pairwise Comparison Subjective Assessment | Shima Mohammadi et.al. | 2411.18372 | link |
2024-11-27 | TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution | Linwei Dong et.al. | 2411.18263 | null |
2024-12-01 | HAAT: Hybrid Attention Aggregation Transformer for Image Super-Resolution | Song-Jiang Lai et.al. | 2411.18003 | null |
2024-11-27 | Vision Mamba Distillation for Low-resolution Fine-grained Image Classification | Yao Chen et.al. | 2411.17980 | link |
2024-11-26 | Perceptually Optimized Super Resolution | Volodymyr Karpenko et.al. | 2411.17513 | null |
2024-11-26 | MAT: Multi-Range Attention Transformer for Efficient Image Super-Resolution | Chengxing Xie et.al. | 2411.17214 | null |
2024-11-30 | PassionSR: Post-Training Quantization with Adaptive Scale in One-Step Diffusion based Image Super-Resolution | Libo Zhu et.al. | 2411.17106 | link |
2024-11-26 | ΩSFormer: Dual-Modal Ω-like Super-Resolution Transformer Network for Cross-scale and High-accuracy Terraced Field Vectorization Extraction | Chang Li et.al. | 2411.17088 | null |
2024-11-25 | ZoomLDM: Latent Diffusion Model for multi-scale image generation | Srikar Yellapragada et.al. | 2411.16969 | null |
2024-11-25 | From Diffusion to Resolution: Leveraging 2D Diffusion Models for 3D Super-Resolution Task | Bohao Chen et.al. | 2411.16792 | null |
2024-11-25 | EPS: Efficient Patch Sampling for Video Overfitting in Deep Super-Resolution Model Training | Yiying Wei et.al. | 2411.16312 | null |
2024-11-25 | High-Resolution Be Aware! Improving the Self-Supervised Real-World Super-Resolution | Yuehan Zhang et.al. | 2411.16175 | null |
2024-11-23 | FFT-Enhanced Low-Complexity Near-Field Super-Resolution Sensing | Yuxiao Wu et.al. | 2411.15532 | null |
2024-11-21 | UPdec-Webb: A Dataset for Coaddition of JWST NIRCam Images | Lei Wang et.al. | 2411.13891 | null |
2024-11-20 | HF-Diff: High-Frequency Perceptual Loss and Distribution Matching for One-Step Diffusion-Based Image Super-Resolution | Shoaib Meraj Sami et.al. | 2411.13548 | null |
2024-11-20 | Adversarial Diffusion Compression for Real-World Image Super-Resolution | Bin Chen et.al. | 2411.13383 | null |
2024-11-20 | RTSR: A Real-Time Super-Resolution Model for AV1 Compressed Content | Yuxuan Jiang et.al. | 2411.13362 | null |
2024-11-19 | Efficient Medicinal Image Transmission and Resolution Enhancement via GAN | Rishabh Kumar Sharma et.al. | 2411.12833 | null |
2024-11-19 | ISAC Super-Resolution Receivers: The Effect of Different Dictionary Matrices | Iman Valiulahi et.al. | 2411.12672 | null |
2024-11-19 | Contourlet Refinement Gate Framework for Thermal Spectrum Distribution Regularized Infrared Image Super-Resolution | Yang Zou et.al. | 2411.12530 | link |
2024-11-18 | Zoomed In, Diffused Out: Towards Local Degradation-Aware Multi-Diffusion for Extreme Image Super-Resolution | Brian B. Moser et.al. | 2411.12072 | link |
2024-11-16 | Peizhe Xia et.al. | 2411.11906 | null | |
2024-11-17 | Low-Complexity Algorithms for Multichannel Spectral Super-Resolution | Xunmeng Wu et.al. | 2411.10938 | null |
2024-11-21 | Unveiling Hidden Details: A RAW Data-Enhanced Paradigm for Real-World Super-Resolution | Long Peng et.al. | 2411.10798 | null |
2024-11-15 | Experimental demonstration of Tessellation Structured Illumination Microscopy | Doron Shterman et.al. | 2411.10405 | null |
2024-11-15 | A Low-Resolution Image is Worth 1x1 Words: Enabling Fine Image Super-Resolution with Transformers and TaylorShift | Sanath Budakegowdanadoddi Nagaraju et.al. | 2411.10231 | null |
2024-11-15 | DiffFNO: Diffusion Fourier Neural Operator | Xiaoyi Liu et.al. | 2411.09911 | null |
2024-11-15 | Enhancing Diffusion Posterior Sampling for Inverse Problems by Integrating Crafted Measurements | Shijie Zhou et.al. | 2411.09850 | null |
2024-11-14 | OneNet: A Channel-Wise 1D Convolutional U-Net | Sanghyun Byun et.al. | 2411.09838 | link |
2024-11-14 | GAN-Based Architecture for Low-dose Computed Tomography Imaging Denoising | Yunuo Wang et.al. | 2411.09512 | null |
2024-11-14 | ISAC Super-Resolution Receiver via Lifted Atomic Norm Minimization | Iman Valiulahi et.al. | 2411.09495 | null |
2024-11-14 | Evaluation of RIS-Enabled B5G/6G Indoor Positioning and Mapping using Ray Tracing Models | Dimitris Kompostiotis et.al. | 2411.09440 | null |
2024-11-14 | LLV-FSR: Exploiting Large Language-Vision Prior for Face Super-resolution | Chenyang Wang et.al. | 2411.09293 | null |
2024-11-14 | Performance Boundaries and Tradeoffs in Super-Resolution Imaging Technologies for Space Targets | XiaoLe He et.al. | 2411.09155 | null |
2024-11-12 | On Adapting Randomized Nyström Preconditioners to Accelerate Variational Image Reconstruction | Tao Hong et.al. | 2411.08178 | null |
2024-11-12 | ALANINE: A Novel Decentralized Personalized Federated Learning For Heterogeneous LEO Satellite Constellation | Liang Zhao et.al. | 2411.07752 | null |
2024-11-12 | LapGSR: Laplacian Reconstructive Network for Guided Thermal Super-Resolution | Aditya Kasliwal et.al. | 2411.07750 | null |
2024-11-12 | Numerical Homogenization by Continuous Super-Resolution | Zhi-Song Liu et.al. | 2411.07576 | null |
2024-11-11 | Evaluating Detection Thresholds: The Impact of False Positives and Negatives on Super-Resolution Ultrasound Localization Microscopy | Sepideh K. Gharamaleki et.al. | 2411.07426 | null |
2024-11-11 | Ensemble Learning for Microbubble Localization in Super-Resolution Ultrasound | Sepideh K. Gharamaleki et.al. | 2411.07376 | null |
2024-11-11 | AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space models | Wallace Abreu et.al. | 2411.07364 | null |
2024-11-13 | General Geospatial Inference with a Population Dynamics Foundation Model | Mohit Agarwal et.al. | 2411.07207 | null |
2024-11-11 | 360-Degree Video Super Resolution and Quality Enhancement Challenge: Methods and Results | Ahmed Telili et.al. | 2411.06738 | null |
2024-11-11 | Expansion microscopy reveals neural circuit organization in genetic animal models | Shakila Behzadi et.al. | 2411.06676 | null |
2024-11-10 | Local Implicit Wavelet Transformer for Arbitrary-Scale Super-Resolution | Minghong Duan et.al. | 2411.06442 | link |
2024-11-10 | SuperResolution Radar Gesture Recognitio | Netanel Blumenfeld et.al. | 2411.06410 | null |
2024-11-09 | Quasi-Newton OMP Approach for Super-Resolution Channel Estimation and Extrapolation | Yi Zeng et.al. | 2411.06082 | null |
2024-11-09 | Predicting band structures for 2D Photonic Crystals via Deep Learning | Yueqi Wang et.al. | 2411.06063 | null |
2024-11-08 | A Modular Conditional Diffusion Framework for Image Reconstruction | Magauiya Zhussip et.al. | 2411.05993 | null |
2024-11-08 | WeatherGFM: Learning A Weather Generalist Foundation Model via In-context Learning | Xiangyu Zhao et.al. | 2411.05420 | null |
2024-11-08 | Electro-diffusive modeling and the role of spine geometry on action potential propagation in neurons | Rahul Gulati et.al. | 2411.05329 | null |
2024-11-07 | Reducing data resolution for better super-resolution: Reconstructing turbulent flows from noisy observation | Kyongmin Yeo et.al. | 2411.05240 | null |
2024-11-07 | ESC-MISR: Enhancing Spatial Correlations for Multi-Image Super-Resolution in Remote Sensing | Zhihui Zhang et.al. | 2411.04706 | null |
2024-11-06 | "Super-resolution" holographic optical tweezers array | Keisuke Nishimura et.al. | 2411.03564 | null |
2024-11-05 | SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture | Andrew Heschl et.al. | 2411.03505 | link |
2024-11-05 | Decoupling Fine Detail and Global Geometry for Compressed Depth Map Super-Resolution | Huan Zheng et.al. | 2411.03239 | null |
2024-11-05 | Applications of Automatic Differentiation in Image Registration | Warin Watson et.al. | 2411.02806 | link |
2024-11-05 | Super-resolution generalized eigenvalue method with truly sub-Nyquist sampling | Baoguo Liu et.al. | 2411.02700 | null |
2024-11-01 | Strongly Topology-preserving GNNs for Brain Graph Super-resolution | Pragya Singh et.al. | 2411.02525 | null |
2024-11-03 | Super-Resolution without High-Resolution Labels for Black Hole Simulations | Thomas Helfer et.al. | 2411.02453 | link |
2024-11-04 | MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D | Wei Cheng et.al. | 2411.02336 | null |
2024-11-01 | A Robust Super-Resolution Classifier by Nonlinear Optics | Ishan Darji et.al. | 2411.00953 | link |
2024-10-31 | Blind Time-of-Flight Imaging: Sparse Deconvolution on the Continuum with Unknown Kernels | Ruiming Guo et.al. | 2411.00893 | null |
2024-11-01 | Constrained Diffusion Implicit Models | Vivek Jayaram et.al. | 2411.00359 | null |
2024-10-31 | DiffPAD: Denoising Diffusion-based Adversarial Patch Decontamination | Jia Fu et.al. | 2410.24006 | link |
2024-10-29 | Temporal and Spatial Super Resolution with Latent Diffusion Model in Medical MRI images | Vishal Dubey et.al. | 2410.23898 | null |
2024-10-30 | Enhancing Image Resolution: A Simulation Study and Sensitivity Analysis of System Parameters for Resourcesat-3S/3SA | Ankur Garg et.al. | 2410.23319 | null |
2024-10-30 | EnsIR: An Ensemble Algorithm for Image Restoration via Gaussian Mixture Models | Shangquan Sun et.al. | 2410.22959 | link |
2024-10-30 | Latent Diffusion, Implicit Amplification: Efficient Continuous-Scale Super-Resolution for Remote Sensing Images | Hanlin Wu et.al. | 2410.22830 | null |
2024-10-29 | Volumetric Conditioning Module to Control Pretrained Diffusion Models for 3D Medical Images | Suhyun Ahn et.al. | 2410.21826 | link |
2024-10-29 | Fingerprints of Super Resolution Networks | Jeremy Vonderfecht et.al. | 2410.21653 | null |
2024-10-30 | Super-resolution in disordered media using neural networks | Alexander Christie et.al. | 2410.21556 | null |
2024-10-28 | Super-resolution with dynamics in the loss | Jacob Page et.al. | 2410.20884 | null |
2024-10-27 | Sebica: Lightweight Spatial and Efficient Bidirectional Channel Attention Super Resolution Network | Chongxiao Liu et.al. | 2410.20546 | link |
2024-10-27 | Guidance Disentanglement Network for Optics-Guided Thermal UAV Image Super-Resolution | Zhicheng Zhao et.al. | 2410.20466 | link |
2024-10-26 | Super-resolved virtual staining of label-free tissue using diffusion models | Yijie Zhang et.al. | 2410.20073 | null |
2024-10-25 | A Flow-based Truncated Denoising Diffusion Model for Super-resolution Magnetic Resonance Spectroscopic Imaging | Siyuan Dong et.al. | 2410.19288 | null |
2024-10-24 | A Spectral-based Physics-informed Finite Operator Learning for Prediction of Mechanical Behavior of Microstructures | Ali Harandi et.al. | 2410.19027 | null |
2024-10-25 | Transferring Knowledge from High-Quality to Low-Quality MRI for Adult Glioma Diagnosis | Yanguang Zhao et.al. | 2410.18698 | null |
2024-10-24 | Hyperspectral Spatial Super-Resolution using Keystone Error | Ankur Garg et.al. | 2410.18691 | null |
2024-10-24 | Advancements in Image Resolution: Super-Resolution Algorithm for Enhanced EOS-06 OCM-3 Data | Ankur Garg et.al. | 2410.18690 | null |
2024-10-22 | Advancing Super-Resolution in Neural Radiance Fields via Variational Diffusion Strategies | Shrey Vishen et.al. | 2410.18137 | link |
2024-10-23 | FIPER: Generalizable Factorized Fields for Joint Image Compression and Super-Resolution | Yang-Che Sun et.al. | 2410.18083 | null |
2024-10-23 | A Wavelet Diffusion GAN for Image Super-Resolution | Lorenzo Aloisi et.al. | 2410.17966 | null |
2024-10-23 | Truly Sub-Nyquist Method Based Matrix Pencil and CRT with Super Resolution | Huiguang Zhang et.al. | 2410.17841 | null |
2024-10-23 | AdaDiffSR: Adaptive Region-aware Dynamic Acceleration Diffusion Model for Real-World Image Super-Resolution | Yuanting Fan et.al. | 2410.17752 | null |
2024-10-23 | Generalizable Motion Planning via Operator Learning | Sharath Matada et.al. | 2410.17547 | null |
2024-10-22 | Multi Kernel Estimation based Object Segmentation | Haim Goldfisher et.al. | 2410.17064 | link |
2024-10-22 | Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models | Giannis Daras et.al. | 2410.16152 | null |
2024-10-21 | MINFLUX -- molecular resolution with minimal photons | Lukas Scheiderer et.al. | 2410.15902 | null |
2024-10-18 | Ultrasound matrix imaging for transcranial in-vivo localization microscopy | Flavien Bureau et.al. | 2410.14499 | null |
2024-10-18 | Advanced Underwater Image Quality Enhancement via Hybrid Super-Resolution Convolutional Neural Networks and Multi-Scale Retinex-Based Defogging Techniques | Yugandhar Reddy Gogireddy et.al. | 2410.14285 | null |
2024-10-18 | ClearSR: Latent Low-Resolution Image Embeddings Help Diffusion-Based Real-World Super Resolution Models See Clearer | Yuhao Wan et.al. | 2410.14279 | null |
2024-10-17 | MMAD-Purify: A Precision-Optimized Framework for Efficient and Scalable Multi-Modal Attacks | Xinxin Liu et.al. | 2410.14089 | null |
2024-10-17 | ConsisSR: Delving Deep into Consistency in Diffusion-based Image Super-Resolution | Junhao Gu et.al. | 2410.13807 | null |
2024-10-17 | Unsupervised Skull Segmentation via Contrastive MR-to-CT Modality Translation | Kamil Kwarciak et.al. | 2410.13427 | null |
2024-10-16 | Super-resolving Real-world Image Illumination Enhancement: A New Dataset and A Conditional Diffusion Model | Yang Liu et.al. | 2410.12961 | null |
2024-10-16 | Transformer based super-resolution downscaling for regional reanalysis: Full domain vs tiling approaches | Antonio Pérez et.al. | 2410.12728 | null |
2024-10-16 | Approximations of MINFLUX Localization Precision with Background | Zach Marin et.al. | 2410.12427 | null |
2024-10-16 | Superoscillation focusing of high-order cylindrical-vector beams | Zhongwei Jin et.al. | 2410.12335 | null |
2024-10-15 | Temporal resolution enhancement in Structured Illumination Microscopy using cascaded reconstruction | Doron Shterman et.al. | 2410.11770 | null |
2024-10-15 | Degradation Oriented and Regularized Network for Real-World Depth Super-Resolution | Zhengxue Wang et.al. | 2410.11666 | link |
2024-10-15 | Spatio-Temporal Distortion Aware Omnidirectional Video Super-Resolution | Hongyu An et.al. | 2410.11506 | link |
2024-10-14 | Hi-Mamba: Hierarchical Mamba for Efficient Image Super-Resolution | Junbo Qiao et.al. | 2410.10140 | null |
2024-10-14 | Optimizing Fingerprint-Spectrum-Based Synchronization in Integrated Sensing and Communications | Xiao-Yang Wang et.al. | 2410.10134 | null |
2024-10-14 | REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation | Zhiyun Song et.al. | 2410.10097 | null |
2024-10-13 | Conditioning 3D Diffusion Models with 2D Images: Towards Standardized OCT Volumes through En Face-Informed Super-Resolution | Coen de Vente et.al. | 2410.09862 | null |
2024-10-13 | HASN: Hybrid Attention Separable Network for Efficient Image Super-resolution | Weifeng Cao et.al. | 2410.09844 | link |
2024-10-11 | Riemannian Gradient Descent Method to Joint Blind Super-Resolution and Demixing in ISAC | Zeyu Xiang et.al. | 2410.08607 | null |
2024-10-11 | Quality Prediction of AI Generated Images and Videos: Emerging Trends and Opportunities | Abhijay Ghildyal et.al. | 2410.08534 | null |
2024-10-10 | TDDSR: Single-Step Diffusion with Two Discriminators for Super Resolution | Sohwi Kim et.al. | 2410.07663 | null |
2024-10-09 | HFH-Font: Few-shot Chinese Font Synthesis with Higher Quality, Faster Speed, and Higher Resolution | Hua Li et.al. | 2410.06488 | link |
2024-10-09 | MaskBlur: Spatial and Angular Data Augmentation for Light Field Image Super-Resolution | Wentao Chao et.al. | 2410.06478 | link |
2024-10-12 | SeeClear: Semantic Distillation Enhances Pixel Condensation for Video Super-Resolution | Qi Tang et.al. | 2410.05799 | link |
2024-10-07 | Enhanced Super-Resolution Training via Mimicked Alignment for Real-World Scenes | Omar Elezabi et.al. | 2410.05410 | null |
2024-10-07 | Near-Field ISAC in 6G: Addressing Phase Nonlinearity via Lifted Super-Resolution | Sajad Daei et.al. | 2410.04930 | null |
2024-10-05 | AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results | Ivan Molodetskikh et.al. | 2410.04225 | null |
2024-10-10 | Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution | Jianze Li et.al. | 2410.04224 | link |
2024-10-05 | Exploring Strengths and Weaknesses of Super-Resolution Attack in Deepfake Detection | Davide Alessandro Coccomini et.al. | 2410.04205 | null |
2024-10-05 | TV-based Deep 3D Self Super-Resolution for fMRI | Fernando Pérez-Bueno et.al. | 2410.04097 | null |
2024-10-04 | Learning Truncated Causal History Model for Video Restoration | Amirhosein Ghasemabadi et.al. | 2410.03936 | link |
2024-10-04 | Point-Spread-Function Engineering in MINFLUX: Optimality of Donut and Half-Moon Excitation Patterns | Yan Liu et.al. | 2410.03349 | null |
2024-10-04 | Atom Camera: Super-resolution scanning microscope of a light pattern with a single ultracold atom | Takafumi Tomita et.al. | 2410.03241 | null |
2024-10-03 | PixelShuffler: A Simple Image Translation Through Pixel Rearrangement | Omar Zamzam et.al. | 2410.03021 | link |
2024-10-07 | SuperGS: Super-Resolution 3D Gaussian Splatting via Latent Feature Field and Gradient-guided Splitting | Shiyun Xie et.al. | 2410.02571 | null |
2024-10-03 | PnP-Flow: Plug-and-Play Image Restoration with Flow Matching | Ségolène Martin et.al. | 2410.02423 | link |
2024-10-03 | Ultrathin BIC metasurfaces based on ultralow-loss Sb2Se3 phase-change material | Zhaoyang Xie et.al. | 2410.02413 | null |
2024-10-02 | Stochastic Deep Restoration Priors for Imaging Inverse Problems | Yuyang Hu et.al. | 2410.02057 | null |
2024-10-01 | Optimizing Drug Delivery in Smart Pharmacies: A Novel Framework of Multi-Stage Grasping Network Combined with Adaptive Robotics Mechanism | Rui Tang et.al. | 2410.00753 | null |
2024-10-01 | Enhancing Sentinel-2 Image Resolution: Evaluating Advanced Techniques based on Convolutional and Generative Neural Networks | Patrick Kramer et.al. | 2410.00516 | null |
2024-09-29 | Effective Diffusion Transformer Architecture for Image Super-Resolution | Kun Cheng et.al. | 2409.19589 | link |
2024-09-27 | A Generalized Tensor Formulation for Hyperspectral Image Super-Resolution Under General Spatial Blurring | Yinjian Wang et.al. | 2409.18731 | null |
2024-09-27 | Simpler Gradient Methods for Blind Super-Resolution with Lower Iteration Complexity | Jinsheng Li et.al. | 2409.18387 | link |
2024-09-26 | Toward Efficient Deep Blind RAW Image Restoration | Marcos V. Conde et.al. | 2409.18204 | link |
2024-09-30 | DiffSSC: Semantic LiDAR Scan Completion using Denoising Diffusion Probabilistic Models | Helin Cao et.al. | 2409.18092 | null |
2024-09-26 | Taming Diffusion Prior for Image Super-Resolution with Domain Shift SDEs | Qinpeng Cui et.al. | 2409.17778 | link |
2024-09-26 | LGFN: Lightweight Light Field Image Super-Resolution using Local Convolution Modulation and Global Attention Feature Extraction | Zhongxin Yu et.al. | 2409.17759 | null |
2024-09-26 | Unifying Dimensions: A Linear Adaptive Approach to Lightweight Image Super-Resolution | Zhenyu Hu et.al. | 2409.17597 | link |
2024-09-26 | Study of Subjective and Objective Quality in Super-Resolution Enhanced Broadcast Images on a Novel SR-IQA Dataset | Yongrok Kim et.al. | 2409.17451 | null |
2024-09-25 | PSWF-Radon approach to reconstruction from band-limited Hankel transform | Fedor Goncharov et.al. | 2409.17409 | link |
2024-09-25 | Implicit Neural Representations for Simultaneous Reduction and Continuous Reconstruction of Multi-Altitude Climate Data | Alif Bin Abdul Qayyum et.al. | 2409.17367 | link |
2024-09-25 | AIM 2024 Challenge on Efficient Video Super-Resolution for AV1 Compressed Content | Marcos V Conde et.al. | 2409.17256 | null |
2024-09-25 | Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors | Aiping Zhang et.al. | 2409.17058 | link |
2024-09-25 | NTIRE 2024 Challenge on Stereo Image Super-Resolution: Methods and Results | Longguang Wang et.al. | 2409.16947 | null |
2024-09-24 | Diffusion Models to Enhance the Resolution of Microscopy Images: A Tutorial | Harshith Bachimanchi et.al. | 2409.16488 | null |
2024-09-24 | Compressed Depth Map Super-Resolution and Restoration: AIM 2024 Challenge Results | Marcos V. Conde et.al. | 2409.16277 | null |
2024-09-24 | Super-resolution positron emission tomography by intensity modulation: Proof of concept | Youdong Lang et.al. | 2409.16085 | null |
2024-09-24 | Denoising Graph Super-Resolution towards Improved Collider Event Reconstruction | Nilotpal Kakati et.al. | 2409.16052 | null |
2024-09-24 | Stochastically Structured Illumination Microscopy scan less super resolution imaging | Denzel Fusco et.al. | 2409.16006 | null |
2024-09-24 | Dual-Comb Photothermal Microscopy | Peter Chang et.al. | 2409.15685 | null |
2024-09-21 | BurstM: Deep Burst Multi-scale SR using Fourier Space with Optical Flow | EungGu Kang et.al. | 2409.15384 | link |
2024-09-22 | One Model for Two Tasks: Cooperatively Recognizing and Recovering Low-Resolution Scene Text Images by Iterative Mutual Guidance | Minyi Zhao et.al. | 2409.14483 | null |
2024-09-22 | Prior Knowledge Distillation Network for Face Super-Resolution | Qiu Yang et.al. | 2409.14385 | null |
2024-09-22 | Thinking in Granularity: Dynamic Quantization for Image Super-Resolution by Intriguing Multi-Granularity Clues | Mingshen Wang et.al. | 2409.14330 | null |
2024-09-21 | A Sinkhorn Regularized Adversarial Network for Image Guided DEM Super-resolution using Frequency Selective Hybrid Graph Transformer | Subhajit Paul et.al. | 2409.14198 | null |
2024-09-17 | NSSR-DIL: Null-Shot Image Super-Resolution Using Deep Identity Learning | Sree Rama Vamsidhar S et.al. | 2409.12165 | null |
2024-09-18 | Quantum-like nonlinear interferometry with frequency-engineered classical light | Romain Dalidet et.al. | 2409.12049 | null |
2024-09-19 | Adaptive Selection of Sampling-Reconstruction in Fourier Compressed Sensing | Seongmin Hong et.al. | 2409.11738 | null |
2024-09-17 | Enhancing the Reliability of LiDAR Point Cloud Sampling: A Colorization and Super-Resolution Approach Based on LiDAR-Generated Images | Sier Ha et.al. | 2409.11532 | null |
2024-09-19 | Super Resolution On Global Weather Forecasts | Lawrence Zhang et.al. | 2409.11502 | null |
2024-09-17 | Online 4D Ultrasound-Guided Robotic Tracking Enables 3D Ultrasound Localisation Microscopy with Large Tissue Displacements | Jipeng Yan et.al. | 2409.11391 | null |
2024-09-18 | Single-Layer Learnable Activation for Implicit Neural Representation (SL |
Moein Heidari et.al. | 2409.10836 | null |
2024-09-16 | WaveMixSR-V2: Enhancing Super-resolution with Higher Efficiency | Pranav Jeevan et.al. | 2409.10582 | link |
2024-09-16 | Adaptive Segmentation-Based Initialization for Steered Mixture of Experts Image Regression | Yi-Hsin Li et.al. | 2409.10101 | null |
2024-09-15 | Learning Two-factor Representation for Magnetic Resonance Image Super-resolution | Weifeng Wei et.al. | 2409.09731 | null |
2024-09-14 | Adversarial Deep-Unfolding Network for MA-XRF Super-Resolution on Old Master Paintings Using Minimal Training Data | Herman Verinaz-Jadan et.al. | 2409.09483 | null |
2024-09-17 | Wave-U-Mamba: An End-To-End Framework For High-Quality And Efficient Speech Super Resolution | Yongjoon Lee et.al. | 2409.09337 | null |
2024-09-13 | FB-HyDON: Parameter-Efficient Physics-Informed Operator Learning of Complex PDEs via Hypernetwork and Finite Basis Domain Decomposition | Milad Ramezankhani et.al. | 2409.09207 | null |
2024-09-13 | Optically-Validated Microvascular Phantom for Super-Resolution Ultrasound Imaging | Jaime Parra Raad et.al. | 2409.09031 | null |
2024-09-13 | Test-time Training for Hyperspectral Image Super-resolution | Ke Li et.al. | 2409.08667 | null |
2024-09-13 | Low Complexity DoA-ToA Signature Estimation for Multi-Antenna Multi-Carrier Systems | Chandrashekhar Rai et.al. | 2409.08650 | null |
2024-09-13 | Think Twice Before You Act: Improving Inverse Problem Solving With MCMC | Yaxuan Zhu et.al. | 2409.08551 | null |
2024-09-12 | Learned Compression for Images and Point Clouds | Mateen Ulhaq et.al. | 2409.08376 | link |
2024-09-12 | Mapping the nanoscale optical topological textures with a fiber-integrated plasmonic probe | Yunkun Wu et.al. | 2409.07894 | null |
2024-09-12 | Mesh-based Super-Resolution of Fluid Flows with Multiscale Graph Neural Networks | Shivam Barwey et.al. | 2409.07769 | null |
2024-09-11 | Dual scale Residual-Network for turbulent flow sub grid scale resolving: A prior analysis | Omar Sallam et.al. | 2409.07605 | null |
2024-09-11 | Three-Dimensional, Multimodal Synchrotron Data for Machine Learning Applications | Calum Green et.al. | 2409.07322 | link |
2024-09-11 | CWT-Net: Super-resolution of Histopathology Images Using a Cross-scale Wavelet-based Transformer | Feiyang Jia et.al. | 2409.07092 | null |
2024-09-10 | Lightweight Multiscale Feature Fusion Super-Resolution Network Based on Two-branch Convolution and Transformer | Li Ke et.al. | 2409.06590 | null |
2024-09-10 | Distilling Generative-Discriminative Representations for Very Low-Resolution Face Recognition | Junzheng Zhang et.al. | 2409.06371 | null |
2024-09-10 | EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation | Nischal Khanal et.al. | 2409.06183 | link |
2024-09-07 | Single-snapshot machine learning for turbulence super resolution | Kai Fukami et.al. | 2409.04923 | null |
2024-09-06 | Empirical Bayesian image restoration by Langevin sampling with a denoising diffusion implicit prior | Charlesquin Kemajou Mbakam et.al. | 2409.04384 | null |
2024-09-06 | Adaptive Super-Resolution Imaging Without Prior Knowledge Using a Programmable Spatial-Mode Sorter | Itay Ozer et.al. | 2409.04323 | null |
2024-09-06 | EigenSR: Eigenimage-Bridged Pre-Trained RGB Learners for Single Hyperspectral Image Super-Resolution | Xi Su et.al. | 2409.04050 | null |
2024-09-05 | Use of triplet loss for facial restoration in low-resolution images | Sebastian Pulgar et.al. | 2409.03530 | null |
2024-09-05 | LMLT: Low-to-high Multi-Level Vision Transformer for Image Super-Resolution | Jeongsoo Kim et.al. | 2409.03516 | link |
2024-09-07 | Real-time Speech Enhancement on Raw Signals with Deep State-space Modeling | Yan Ru Pei et.al. | 2409.03377 | link |
2024-09-05 | Enhancing digital core image resolution using optimal upscaling algorithm: with application to paired SEM images | Shaohua You et.al. | 2409.03265 | null |
2024-09-05 | Perceptual-Distortion Balanced Image Super-Resolution is a Multi-Objective Optimization Problem | Qiwen Zhu et.al. | 2409.03179 | link |
2024-09-04 | Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models | Zhibin Liu et.al. | 2409.02851 | link |
2024-09-04 | Solving Video Inverse Problems Using Image Diffusion Models | Taesung Kwon et.al. | 2409.02574 | null |
2024-09-02 | EarthGen: Generating the World from Top-Down Views | Ansh Sharma et.al. | 2409.01491 | link |
2024-09-02 | DiffEyeSyn: Diffusion-based User-specific Eye Movement Synthesis | Chuhan Jiao et.al. | 2409.01240 | null |
2024-09-02 | Single-photon super-resolved spectroscopy from spatial-mode demultiplexing | Luigi Santamaria Amato et.al. | 2409.01190 | null |
2024-09-02 | SeCo-INR: Semantically Conditioned Implicit Neural Representations for Improved Medical Image Super-Resolution | Mevan Ekanayake et.al. | 2409.01013 | null |
2024-09-01 | DMRA: An Adaptive Line Spectrum Estimation Method through Dynamical Multi-Resolution of Atoms | Mingguang Han et.al. | 2409.00799 | null |
2024-09-01 | Rethinking Image Super-Resolution from Training Data Perspectives | Go Ohtani et.al. | 2409.00768 | link |
2024-09-01 | Attention-Guided Multi-scale Interaction Network for Face Super-Resolution | Xujie Wan et.al. | 2409.00591 | null |
2024-08-30 | HiTSR: A Hierarchical Transformer for Reference-based Super-Resolution | Masoomeh Aslahishahri et.al. | 2408.16959 | link |
2024-08-29 | GameIR: A Large-Scale Synthesized Ground-Truth Dataset for Image Restoration over Gaming Content | Lebin Zhou et.al. | 2408.16866 | null |
2024-08-30 | Beyond MR Image Harmonization: Resolution Matters Too | Savannah P. Hays et.al. | 2408.16562 | null |
2024-08-29 | Super-Resolution works for coastal simulations | Zhi-Song Liu et.al. | 2408.16553 | null |
2024-08-29 | Enhanced Control for Diffusion Bridge in Image Restoration | Conghan Yue et.al. | 2408.16303 | link |
2024-08-28 | ChartEye: A Deep Learning Framework for Chart Information Extraction | Osama Mustafa et.al. | 2408.16123 | null |
2024-08-27 | Multi-Feature Aggregation in Diffusion Models for Enhanced Face Super-Resolution | Marcelo dos Santos et.al. | 2408.15386 | link |
2024-08-27 | Histo-Diffusion: A Diffusion Super-Resolution Method for Digital Pathology with Comprehensive Quality Assessment | Xuan Xu et.al. | 2408.15218 | null |
2024-08-27 | A Preliminary Exploration Towards General Image Restoration | Xiangtao Kong et.al. | 2408.15143 | null |
2024-08-27 | Enhancing License Plate Super-Resolution: A Layout-Aware and Character-Driven Approach | Valfride Nascimento et.al. | 2408.15103 | link |
2024-08-26 | Cascaded Temporal Updating Network for Efficient Video Super-Resolution | Hao Li et.al. | 2408.14244 | null |
2024-08-26 | Efficient Active Flow Control Strategy for Confined Square Cylinder Wake Using Deep Learning-Based Surrogate Model and Reinforcement Learning | Meng Zhang et.al. | 2408.14232 | null |
2024-08-25 | Particle-Filtering-based Latent Diffusion for Inverse Problems | Amir Nazemi et.al. | 2408.13868 | null |
2024-08-25 | FreqINR: Frequency Consistency for Implicit Neural Representation with Adaptive DCT Frequency Loss | Meiyi Wei et.al. | 2408.13716 | null |
2024-08-23 | ResSR: A Residual Approach to Super-Resolving Multispectral Images | Haley Duba-Sullivan et.al. | 2408.13225 | null |
2024-08-23 | SIMPLE: Simultaneous Multi-Plane Self-Supervised Learning for Isotropic MRI Restoration from Anisotropic Data | Rotem Benisty et.al. | 2408.13065 | null |
2024-08-22 | A Unified Plug-and-Play Algorithm with Projected Landweber Operator for Split Convex Feasibility Problems | Shuchang Zhang et.al. | 2408.12100 | null |
2024-08-21 | MambaCSR: Dual-Interleaved Scanning for Compressed Image Super-Resolution With SSMs | Yulin Ren et.al. | 2408.11758 | link |
2024-08-21 | Quantum super-resolution microscopy by photon statistics and structured light | Fabio Picariello et.al. | 2408.11654 | null |
2024-08-20 | MambaDS: Near-Surface Meteorological Field Downscaling with Topography Constrained Selective State Space Modeling | Zili Liu et.al. | 2408.10854 | null |
2024-08-19 | Webcam-based Pupil Diameter Prediction Benefits from Upscaling | Vijul Shah et.al. | 2408.10397 | null |
2024-08-19 | ML-CrAIST: Multi-scale Low-high Frequency Information-based Cross black Attention with Image Super-resolving Transformer | Alik Pramanick et.al. | 2408.09940 | link |
2024-08-19 | Harnessing Multi-resolution and Multi-scale Attention for Underwater Image Restoration | Alik Pramanick et.al. | 2408.09912 | link |
2024-08-19 | Predicting Long-term Dynamics of Complex Networks via Identifying Skeleton in Hyperbolic Space | Ruikun Li et.al. | 2408.09845 | link |
2024-08-19 | Implicit Grid Convolution for Multi-Scale Image Super-Resolution | Dongheon Lee et.al. | 2408.09674 | link |
2024-08-18 | Angle of Arrival Estimation with Transformer: A Sparse and Gridless Method with Zero-Shot Capability | Zhaoxuan Zhu et.al. | 2408.09362 | null |
2024-08-17 | Discovery of Limb-Brightening in the Parsec-Scale Jet of NGC 315 through Global VLBI Observations and Its Implications for Jet Models | Jongho Park et.al. | 2408.09069 | null |
2024-08-16 | AI-assisted super-resolution cosmological simulations IV: An emulator for deterministic realizations | Xiaowen Zhang et.al. | 2408.09051 | link |
2024-08-16 | Task-Aware Dynamic Transformer for Efficient Arbitrary-Scale Image Super-Resolution | Tianyi Xu et.al. | 2408.08736 | link |
2024-08-16 | QMambaBSR: Burst Image Super-Resolution with Query State Space Model | Xin Di et.al. | 2408.08665 | null |
2024-08-16 | Reference-free Axial Super-resolution of 3D Microscopy Images using Implicit Neural Representation with a 2D Diffusion Prior | Kyungryun Lee et.al. | 2408.08616 | link |
2024-08-16 | Enhancing Events in Neutrino Telescopes through Deep Learning-Driven Super-Resolution | Felix J. Yu et.al. | 2408.08474 | null |
2024-08-15 | SuperNANO: Enabling Nano-Scale Laser an-ti-counterfeiting Marking and Precision Cutting with Super-Resolution Imaging | Yiduo Chen et.al. | 2408.08455 | null |
2024-08-14 | Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving | Yuqing Wen et.al. | 2408.07605 | null |
2024-08-15 | DIffSteISR: Harnessing Diffusion Prior for Superior Real-world Stereo Image Super-Resolution | Yuanbo Zhou et.al. | 2408.07516 | null |
2024-08-14 | GRFormer: Grouped Residual Self-Attention for Lightweight Single Image Super-Resolution | Yuzhen Li et.al. | 2408.07484 | link |
2024-08-14 | One Step Diffusion-based Super-Resolution with Time-Aware Distillation | Xiao He et.al. | 2408.07476 | link |
2024-08-17 | Deep-sub-cycle attosecond optical pulses | Hongliang Dang et.al. | 2408.07306 | null |
2024-08-13 | Event-Stream Super Resolution using Sigma-Delta Neural Network | Waseem Shariff et.al. | 2408.06968 | null |
2024-08-12 | Palantir: Towards Efficient Super Resolution for Ultra-high-definition Live Streaming | Xinqi Jin et.al. | 2408.06152 | link |
2024-08-12 | Efficient and Scalable Point Cloud Generation with Sparse Point-Voxel Diffusion Models | Ioannis Romanelis et.al. | 2408.06145 | link |
2024-08-11 | SSL: A Self-similarity Loss for Improving Generative Image Super-resolution | Du Chen et.al. | 2408.05713 | link |
2024-08-10 | Content-decoupled Contrastive Learning-based Implicit Degradation Modeling for Blind Image Super-Resolution | Jiang Yuan et.al. | 2408.05440 | null |
2024-08-09 | Kalman-Inspired Feature Propagation for Video Face Super-Resolution | Ruicheng Feng et.al. | 2408.05205 | null |
2024-08-08 | Efficient Single Image Super-Resolution with Entropy Attention and Receptive Field Augmentation | Xiaole Zhao et.al. | 2408.04158 | null |
2024-08-07 | Underwater litter monitoring using consumer-grade aerial-aquatic speedy scanner (AASS) and deep learning based super-resolution reconstruction and detection network | Fan Zhao et.al. | 2408.03564 | null |
2024-08-07 | Monitoring of Hermit Crabs Using drone-captured imagery and Deep Learning based Super-Resolution Reconstruction and Improved YOLOv8 | Fan Zhao et.al. | 2408.03559 | null |
2024-08-06 | SGSR: Structure-Guided Multi-Contrast MRI Super-Resolution via Spatio-Frequency Co-Query Attention | Shaoming Zheng et.al. | 2408.03194 | null |
2024-08-03 | Supervised Image Translation from Visible to Infrared Domain for Object Detection | Prahlad Anand et.al. | 2408.01843 | null |
2024-08-03 | Transformer for seismic image super-resolution | Shiqi Dong et.al. | 2408.01695 | null |
2024-08-03 | Flow Reconstruction Using Spatially Restricted Domains Based on Enhanced Super-Resolution Generative Adversarial Networks | Mustafa Z. Yousif et.al. | 2408.01658 | null |
2024-08-02 | PINNs for Medical Image Analysis: A Survey | Chayan Banerjee et.al. | 2408.01026 | null |
2024-08-01 | Stop-and-go waves reconstruction via iterative refinement | Junyi Ji et.al. | 2408.00941 | null |
2024-08-01 | Exceptional points in SSH-like models with hopping amplitude gradient | David S. Simon et.al. | 2408.00879 | null |
2024-08-01 | Image Super-Resolution with Taylor Expansion Approximation and Large Field Reception | Jiancong Feng et.al. | 2408.00470 | null |
2024-07-31 | Accelerating Image Super-Resolution Networks with Pixel-Level Classification | Jinho Jeong et.al. | 2407.21448 | null |
2024-07-27 | Inverse Problems with Diffusion Models: A MAP Estimation Perspective | Sai bharath chandra Gutha et.al. | 2407.20784 | null |
2024-08-01 | What makes for good morphology representations for spatial omics? | Eduard Chelebian et.al. | 2407.20660 | null |
2024-07-30 | Efficient Channel Estimation for Millimeter Wave and Terahertz Systems Enabled by Integrated Super-resolution Sensing and Communication | Jingran Xu et.al. | 2407.20607 | null |
2024-07-29 | Spatial sub-Rayleigh imaging via structured speckle illumination | Liming Li et.al. | 2407.20460 | null |
2024-08-02 | Deep Learning for Super-resolution Ultrasound Imaging with Spatiotemporal Data | Arthur David Redfern et.al. | 2407.20407 | null |
2024-07-30 | Efficient Face Super-Resolution via Wavelet-based Feature Enhancement Network | Wenjie Li et.al. | 2407.19768 | link |
2024-07-28 | Giant Purcell broadening and Lamb shift for DNA-assembled near-infrared quantum emitters | Sachin Verlekar et.al. | 2407.19513 | null |
2024-07-28 | Perfect Hyperlens | Tao Hou et.al. | 2407.19506 | null |
2024-07-28 | Model-based Super-resolution: Towards a Unified Framework for Super-resolution | Zetao Fei et.al. | 2407.19480 | null |
2024-07-28 | Competition-based Adaptive ReLU for Deep Neural Networks | Junjia Chen et.al. | 2407.19441 | null |
2024-07-27 | Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network | Gang Pan et.al. | 2407.19271 | null |
2024-07-26 | Super Resolution for Renewable Energy Resource Data With Wind From Reanalysis Data (Sup3rWind) and Application to Ukraine | Brandon N. Benton et.al. | 2407.19086 | null |
2024-07-25 | GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution | Jintong Hu et.al. | 2407.18046 | null |
2024-07-24 | Cuboid-Net: A Multi-Branch Convolutional Neural Network for Joint Space-Time Video Super Resolution | Congrui Fu et.al. | 2407.16986 | null |
2024-07-24 | 3DAttGAN: A 3D Attention-based Generative Adversarial Network for Joint Space-Time Video Super-Resolution | Congrui Fu et.al. | 2407.16965 | link |
2024-07-23 | Channel-Partitioned Windowed Attention And Frequency Learning for Single Image Super-Resolution | Dinh Phu Tran et.al. | 2407.16232 | null |
2024-07-23 | Topological Dark Spots of Electric Near Field in Metal Structures | Tong Fu et.al. | 2407.16213 | null |
2024-07-23 | Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems | Sojin Lee et.al. | 2407.16125 | link |
2024-07-22 | High-flexibility reconstruction of small-scale motions in wall turbulence using a generalized zero-shot learning | Haokai Wu et.al. | 2407.15604 | null |
2024-07-22 | Attention Beats Linear for Fast Implicit Neural Representation Generation | Shuyi Zhang et.al. | 2407.15355 | link |
2024-07-22 | ThermalNeRF: Thermal Radiance Fields | Yvette Y. Lin et.al. | 2407.15337 | null |
2024-07-22 | Efficient Multi-disparity Transformer for Light Field Image Super-resolution | Zeke Zexi Hu et.al. | 2407.15329 | null |
2024-07-20 | A New Dataset and Framework for Real-World Blurred Images Super-Resolution | Rui Qin et.al. | 2407.14880 | link |
2024-07-19 | Large Kernel Distillation Network for Efficient Single Image Super-Resolution | Chengxing Xie et.al. | 2407.14340 | link |
2024-07-19 | RealViformer: Investigating Attention for Real-World Video Super-Resolution | Yuehan Zhang et.al. | 2407.13987 | link |
2024-07-18 | MaRINeR: Enhancing Novel Views by Matching Rendered Images with Nearby References | Lukas Bösiger et.al. | 2407.13745 | link |
2024-07-18 | Research on Image Super-Resolution Reconstruction Mechanism based on Convolutional Neural Network | Hao Yan et.al. | 2407.13211 | null |
2024-07-18 | UCIP: A Universal Framework for Compressed Image Super-Resolution using Dynamic Prompt | Xin Li et.al. | 2407.13108 | null |
2024-07-17 | Toward INT4 Fixed-Point Training via Exploring Quantization Error for Gradients | Dohyung Kim et.al. | 2407.12637 | null |
2024-07-16 | Speckle-based 3D sub-diffraction imaging through a multimode fiber | Zhouping Lyu et.al. | 2407.11796 | null |
2024-07-16 | Deconvolution with a Box | Pedro Felzenszwalb et.al. | 2407.11685 | null |
2024-07-16 | Leveraging Segment Anything Model in Identifying Buildings within Refugee Camps (SAM4Refugee) from Satellite Imagery for Humanitarian Operations | Yunya Gao et.al. | 2407.11381 | link |
2024-07-16 | Zero-Shot Adaptation for Approximate Posterior Sampling of Diffusion Models in Inverse Problems | Yaşar Utku Alçalar et.al. | 2407.11288 | null |
2024-07-14 | Restore-RWKV: Efficient and Effective Medical Image Restoration with RWKV | Zhiwen Yang et.al. | 2407.11087 | link |
2024-07-15 | Spectral Properties of Infinitely Smooth Kernel Matrices in the Single Cluster Limit, with Applications to Multivariate Super-Resolution | Nuha Diab et.al. | 2407.10600 | null |
2024-07-15 | Backdoor Attacks against Image-to-Image Networks | Wenbo Jiang et.al. | 2407.10445 | null |
2024-07-13 | Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors | Wei Shang et.al. | 2407.09919 | link |
2024-07-13 | Fast and Provable Simultaneous Blind Super-Resolution and Demixing for Point Source Signals: Scaled Gradient Descent without Regularization | Jinchi Chen et.al. | 2407.09900 | link |
2024-07-12 | Region Attention Transformer for Medical Image Restoration | Zhiwen Yang et.al. | 2407.09268 | link |
2024-07-12 | Task-driven single-image super-resolution reconstruction of document scans | Maciej Zyrek et.al. | 2407.08993 | null |
2024-07-11 | Global Spatial-Temporal Information-based Residual ConvLSTM for Video Space-Time Super-Resolution | Congrui Fu et.al. | 2407.08466 | null |
2024-07-11 | Wind Power Assessment based on Super-Resolution and Downscaling -- A Comparison of Deep Learning Methods | Luca Schmidt et.al. | 2407.08259 | null |
2024-07-11 | Spatially-Variant Degradation Model for Dataset-free Super-resolution | Shaojie Guo et.al. | 2407.08252 | null |
2024-07-10 | VEnhancer: Generative Space-Time Enhancement for Video Generation | Jingwen He et.al. | 2407.07667 | null |
2024-07-10 | Aging-Resistant Wideband Precoding in 5G and Beyond Using 3D Convolutional Neural Networks | Alejandro Villena-Rodriguez et.al. | 2407.07434 | null |
2024-07-10 | Pairwise Distance Distillation for Unsupervised Real-World Image Super-Resolution | Yuehan Zhang et.al. | 2407.07302 | link |
2024-07-09 | UnmixingSR: Material-aware Network with Unsupervised Unmixing as Auxiliary Task for Hyperspectral Image Super-resolution | Yang Yu et.al. | 2407.06525 | null |
2024-07-08 | Enhancing super-resolution ultrasound localisation through multi-frame deconvolution exploiting spatiotemporal coherence | Su Yan et.al. | 2407.06373 | null |
2024-07-08 | Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis | Emaad Khwaja et.al. | 2407.06079 | null |
2024-07-08 | Self-Prior Guided Mamba-UNet Networks for Medical Image Super-Resolution | Zexin Ji et.al. | 2407.05993 | null |
2024-07-08 | Deform-Mamba Network for MRI Super-Resolution | Zexin Ji et.al. | 2407.05969 | null |
2024-07-08 | HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution | Xiang Zhang et.al. | 2407.05878 | null |
2024-07-08 | Neuromorphic Imaging with Super-Resolution | Pei Zhang et.al. | 2407.05764 | null |
2024-07-07 | Edge-guided and Cross-scale Feature Fusion Network for Efficient Multi-contrast MRI Super-Resolution | Zhiyuan Yang et.al. | 2407.05307 | link |
2024-07-07 | A Hybrid Registration and Fusion Method for Hyperspectral Super-resolution | Kunjing Yang et.al. | 2407.05279 | null |
2024-07-07 | RIS-assisted Coverage Enhancement in mmWave Integrated Sensing and Communication Networks | Xu Gan et.al. | 2407.05249 | null |
2024-07-05 | NSD-DIL: Null-Shot Deblurring Using Deep Identity Learning | Sree Rama Vamsidhar S et.al. | 2407.04815 | null |
2024-07-08 | Super-resolution imaging of nanoscale inhomogeneities in hBN-covered and encapsulated few-layer graphene | Lina Jäckering et.al. | 2407.04565 | null |
2024-07-05 | AnySR: Realizing Image Super-Resolution as Any-Scale, Any-Resource | Wengyi Zhan et.al. | 2407.04241 | link |
2024-07-04 | M^3:Manipulation Mask Manufacturer for Arbitrary-Scale Super-Resolution Mask | Xinyu Yang et.al. | 2407.03695 | null |
2024-07-04 | ASteISR: Adapting Single Image Super-resolution Pre-trained Model for Efficient Stereo Image Super-resolution | Yuanbo Zhou et.al. | 2407.03598 | null |
2024-07-04 | Spatio-Temporal Adaptive Diffusion Models for EEG Super-Resolution in Epilepsy Diagnosis | Tong Zhou et.al. | 2407.03089 | null |
2024-07-03 | Data Overfitting for On-Device Super-Resolution with Dynamic Algorithm and Compiler Co-Design | Gen Li et.al. | 2407.02813 | link |
2024-07-02 | Adversarial Magnification to Deceive Deepfake Detection through Super Resolution | Davide Alessandro Coccomini et.al. | 2407.02670 | link |
2024-07-01 | Broadband planar electromagnetic hyper-lens with uniform magnification in air | Ran Sun et.al. | 2407.02532 | null |
2024-07-04 | Real HSI-MSI-PAN image dataset for the hyperspectral/multi-spectral/panchromatic image fusion and super-resolution fields | Shuangliang Li et.al. | 2407.02387 | link |
2024-07-02 | Efficient Stochastic Differential Equation for DEM Super Resolution with Void Filling | Tongtong Zhang et.al. | 2407.01908 | null |
2024-07-01 | DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models | Chang-Han Yeh et.al. | 2407.01519 | link |
2024-07-02 | Preserving Full Degradation Details for Blind Image Super-Resolution | Hongda Liu et.al. | 2407.01299 | link |
2024-07-01 | DaBiT: Depth and Blur informed Transformer for Joint Refocusing and Super-Resolution | Crispian Morris et.al. | 2407.01230 | null |
2024-06-28 | ASSR-NeRF: Arbitrary-Scale Super-Resolution on Voxel Grid for High-Quality Radiance Fields Reconstruction | Ding-Jiun Huang et.al. | 2406.20066 | null |
2024-06-28 | Neural Differentiable Modeling with Diffusion-Based Super-resolution for Two-Dimensional Spatiotemporal Turbulence | Xiantao Fan et.al. | 2406.20047 | null |
2024-06-28 | CSAKD: Knowledge Distillation with Cross Self-Attention for Hyperspectral and Multispectral Image Fusion | Chih-Chung Hsu et.al. | 2406.19666 | link |
2024-06-28 | Efficient Event Stream Super-Resolution with Recursive Multi-Branch Fusion | Quanmin Liang et.al. | 2406.19640 | link |
2024-06-27 | Shoulder of Dust Rings Formed by Planet-disk Interactions | Jiaqing Bi et.al. | 2406.19438 | null |
2024-06-27 | Super-resolution imaging using super-oscillatory diffractive neural networks | Hang Chen et.al. | 2406.19126 | null |
2024-06-26 | Spatial-temporal Hierarchical Reinforcement Learning for Interpretable Pathology Image Super-Resolution | Wenting Chen et.al. | 2406.18310 | link |
2024-06-30 | V2X Sidelink Positioning in FR1: From Ray-Tracing and Channel Estimation to Bayesian Tracking | Yu Ge et.al. | 2406.17950 | null |
2024-06-25 | Burst Image Super-Resolution with Base Frame Selection | Sanghyun Kim et.al. | 2406.17869 | null |
2024-06-25 | A Near-Field Super-Resolution Network for Accelerating Antenna Characterization | Yuchen Gu et.al. | 2406.17244 | null |
2024-06-24 | DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution | Aiwen Jiang et.al. | 2406.16477 | link |
2024-06-24 | Suppressing Uncertainties in Degradation Estimation for Blind Super-Resolution | Junxiong Lin et.al. | 2406.16459 | null |
2024-06-24 | Improving Generative Adversarial Networks for Video Super-Resolution | Daniel Wen et.al. | 2406.16359 | null |
2024-06-23 | Mamba-based Light Field Super-Resolution with Efficient Subspace Scanning | Ruisheng Gao et.al. | 2406.16083 | null |
2024-06-23 | Gridless Parameter Estimation in Partly Calibrated Rectangular Arrays | Tianyi Liu et.al. | 2406.16041 | null |
2024-06-23 | Learning Accurate and Enriched Features for Stereo Image Super-Resolution | Hu Gao et.al. | 2406.16001 | link |
2024-06-21 | A Generative Machine Learning Approach for Improving Precipitation from Earth System Models | Philipp Hess et.al. | 2406.15026 | null |
2024-06-20 | Zero-Shot Image Denoising for High-Resolution Electron Microscopy | Xuanyu Tian et.al. | 2406.14264 | link |
2024-06-19 | IG-CFAT: An Improved GAN-Based Framework for Effectively Exploiting Transformers in Real-World Image Super-Resolution | Alireza Aghelan et.al. | 2406.13815 | link |
2024-06-19 | Enhance the Image: Super Resolution using Artificial Intelligence in MRI | Ziyu Li et.al. | 2406.13625 | null |
2024-06-19 | EvTexture: Event-driven Texture Enhancement for Video Super-Resolution | Dachun Kai et.al. | 2406.13457 | link |
2024-06-19 | Super-resolution 3D tomography of vector near-fields in dielectric resonators | Bingbing Zhu et.al. | 2406.13171 | null |
2024-06-18 | Structured Detection for Simultaneous Super-Resolution and Optical Sectioning in Laser Scanning Microscopy | Alessandro Zunino et.al. | 2406.12542 | link |
2024-06-18 | LFMamba: Light Field Image Super-Resolution with State Space Model | Wang xia et.al. | 2406.12463 | null |
2024-06-17 | A Dictionary Based Approach for Removing Out-of-Focus Blur | Uditangshu Aurangabadkar et.al. | 2406.11330 | link |
2024-06-16 | Geometric Distortion Guided Transformer for Omnidirectional Image Super-Resolution | Cuixin Yang et.al. | 2406.10869 | null |
2024-06-14 | SatDiffMoE: A Mixture of Estimation Method for Satellite Image Super-resolution with Latent Diffusion Models | Zhaoxu Luo et.al. | 2406.10225 | null |
2024-06-14 | GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors | Xiqian Yu et.al. | 2406.10111 | null |
2024-06-14 | Exact Sparse Representation Recovery in Signal Demixing and Group BLASSO | Marcello Carioni et.al. | 2406.09922 | null |
2024-06-14 | Bayesian Conditioned Diffusion Models for Inverse Problems | Alper Güngör et.al. | 2406.09768 | null |
2024-06-13 | Near-Field Multiuser Communications based on Sparse Arrays | Kangjian Chen et.al. | 2406.09238 | null |
2024-06-13 | SR-CACO-2: A Dataset for Confocal Fluorescence Microscopy Image Super-Resolution | Soufiane Belharbi et.al. | 2406.09168 | link |
2024-06-13 | Microparticle-assisted 2D super resolution virtual image modeling | Arlen Bekirov et.al. | 2406.09060 | null |
2024-06-13 | Blind Super-Resolution via Meta-learning and Markov Chain Monte Carlo Simulation | Jingyuan Xia et.al. | 2406.08896 | link |
2024-06-12 | Pranath Reddy et.al. | 2406.08442 | null | |
2024-06-12 | DDR: Exploiting Deep Degradation Response as Flexible Image Descriptor | Juncheng Wu et.al. | 2406.08377 | link |
2024-06-14 | One-Step Effective Diffusion Network for Real-World Image Super-Resolution | Rongyuan Wu et.al. | 2406.08177 | link |
2024-06-11 | Image Neural Field Diffusion Models | Yinbo Chen et.al. | 2406.07480 | null |
2024-06-11 | Redefining Automotive Radar Imaging: A Domain-Informed 1D Deep Learning Approach for High-Resolution and Efficient Performance | Ruxin Zheng et.al. | 2406.07399 | null |
2024-06-12 | Towards Realistic Data Generation for Real-World Super-Resolution | Long Peng et.al. | 2406.07255 | null |
2024-06-10 | 2DQuant: Low-bit Post-Training Quantization for Image Super-Resolution | Kai Liu et.al. | 2406.06649 | link |
2024-06-10 | Inter-slice Super-resolution of Magnetic Resonance Images by Pre-training and Self-supervised Fine-tuning | Xin Wang et.al. | 2406.05974 | null |
2024-06-09 | Binarized Diffusion Model for Image Super-Resolution | Zheng Chen et.al. | 2406.05723 | link |
2024-06-07 | M2NO: Multiresolution Operator Learning with Multiwavelet-based Algebraic Multigrid Method | Zhihao Li et.al. | 2406.04822 | null |
2024-06-06 | M&M VTO: Multi-Garment Virtual Try-On and Editing | Luyang Zhu et.al. | 2406.04542 | link |
2024-06-06 | Enhancing Weather Predictions: Super-Resolution via Deep Diffusion Models | Jan Martinů et.al. | 2406.04099 | null |
2024-06-06 | Vectorized Conditional Neural Fields: A Framework for Solving Time-dependent Parametric Partial Differential Equations | Jan Hagnberger et.al. | 2406.03919 | link |
2024-06-07 | Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge | Nan Zhang et.al. | 2406.03799 | link |
2024-06-05 | SuperFormer: Volumetric Transformer Architectures for MRI Super-Resolution | Cristhian Forigua et.al. | 2406.03359 | link |
2024-06-04 | ReLUs Are Sufficient for Learning Implicit Neural Representations | Joseph Shenouda et.al. | 2406.02529 | link |
2024-06-05 | Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation | Clement Chadebec et.al. | 2406.02347 | link |
2024-06-03 | L-MAGIC: Language Model Assisted Generation of Images with Coherence | Zhipeng Cai et.al. | 2406.01843 | link |
2024-06-03 | PolyCLEAN: When Högbom meets Bayes -- Fast Super-Resolution Imaging with Bayesian MAP Estimation | Adrian Jarret et.al. | 2406.01342 | link |
2024-06-03 | Arctic Sea Ice Image Super-Resolution Based on Multi-Scale Convolution and Dual-Gating Mechanism | Zhaomin Fang et.al. | 2406.01240 | null |
2024-06-02 | Stealing Image-to-Image Translation Models With a Single Query | Nurit Spingarn-Eliezer et.al. | 2406.00828 | null |
2024-06-02 | Multidimensional optical singularities and their applications | Soon Wei Daniel Lim et.al. | 2406.00784 | null |
2024-06-02 | W-Net: A Facial Feature-Guided Face Super-Resolution Network | Hao Liu et.al. | 2406.00676 | null |
2024-06-04 | SuperGaussian: Repurposing Video Models for 3D Super Resolution | Yuan Shen et.al. | 2406.00609 | null |
2024-06-01 | GLCAN: Global-Local Collaborative Auxiliary Network for Local Learning | Feiyu Zhu et.al. | 2406.00446 | null |
2024-05-31 | Climate Variable Downscaling with Conditional Normalizing Flows | Christina Winkler et.al. | 2405.20719 | null |
2024-05-30 | Can No-Reference Quality-Assessment Methods Serve as Perceptual Losses for Super-Resolution? | Egor Kashkarov et.al. | 2405.20392 | null |
2024-05-30 | All-In-One Medical Image Restoration via Task-Adaptive Routing | Zhiwen Yang et.al. | 2405.19769 | link |
2024-05-30 | MAE-GAN: A Novel Strategy for Simultaneous Super-resolution Reconstruction and Denoising of Post-stack Seismic Profile | Wenshuo Yu et.al. | 2405.19767 | null |
2024-05-29 | Reconstructing Interpretable Features in Computational Super-Resolution microscopy via Regularized Latent Search | Marzieh Gheisari et.al. | 2405.19112 | null |
2024-05-29 | Single image super-resolution based on trainable feature matching attention network | Qizhou Chen et.al. | 2405.18872 | link |
2024-05-29 | Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching | Yasi Zhang et.al. | 2405.18816 | link |
2024-05-28 | Towards a Sampling Theory for Implicit Neural Representations | Mahrokh Najaf et.al. | 2405.18410 | null |
2024-05-28 | Hyperspectral and multispectral image fusion with arbitrary resolution through self-supervised representations | Ting Wang et.al. | 2405.17818 | null |
2024-05-27 | Fast Samplers for Inverse Problems in Iterative Refinement Models | Kushagra Pandey et.al. | 2405.17673 | link |
2024-05-27 | Does Diffusion Beat GAN in Image Super Resolution? | Denis Kuznedelev et.al. | 2405.17261 | link |
2024-05-27 | PatchScaler: An Efficient Patch-independent Diffusion Model for Super-Resolution | Yong Liu et.al. | 2405.17158 | link |
2024-05-27 | Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models | Cristina N. Vasconcelos et.al. | 2405.16759 | null |
2024-05-26 | Looks Too Good To Be True: An Information-Theoretic Analysis of Hallucinations in Generative Restoration Models | Regev Cohen et.al. | 2405.16475 | null |
2024-05-25 | BOLD: Boolean Logic Deep Learning | Van Minh Nguyen et.al. | 2405.16339 | null |
2024-05-24 | Visible-frequency hyperbolic plasmon polaritons in a natural van der Waals crystal | Giacomo Venturi et.al. | 2405.15420 | null |
2024-05-29 | Stochastic super-resolution for Gaussian microtextures | Emile Pierret et.al. | 2405.15399 | null |
2024-05-24 | Blaze3DM: Marry Triplane Representation with Diffusion for 3D Medical Inverse Problem Solving | Jia He et.al. | 2405.15241 | null |
2024-05-23 | Universal Robustness via Median Randomized Smoothing for Real-World Super-Resolution | Zakariya Chaouai et.al. | 2405.14934 | null |
2024-05-24 | Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation | Hongxu Jiang et.al. | 2405.14802 | link |
2024-05-23 | Stimulated Raman-induced Beam Focusing | Minhaeng Cho et.al. | 2405.14240 | null |
2024-05-22 | Perceptual Fairness in Image Restoration | Guy Ohayon et.al. | 2405.13805 | null |
2024-05-22 | HR-INR: Continuous Space-Time Video Super-Resolution via Event Camera | Yunfan Lu et.al. | 2405.13389 | null |
2024-05-20 | Hierarchical Neural Operator Transformer with Learnable Frequency-aware Loss Prior for Arbitrary-scale Super-resolution | Xihaier Luo et.al. | 2405.12202 | null |
2024-05-18 | HR Human: Modeling Human Avatars with Triangular Mesh and High-Resolution Textures from Videos | Qifeng Chen et.al. | 2405.11270 | null |
2024-05-17 | AdaWaveNet: Adaptive Wavelet Network for Time Series Analysis | Han Yu et.al. | 2405.11124 | null |
2024-05-20 | Infrared Image Super-Resolution via Lightweight Information Split Network | Shijie Liu et.al. | 2405.10561 | null |
2024-05-16 | RGB Guided ToF Imaging System: A Survey of Deep Learning-based Methods | Xin Qiao et.al. | 2405.10357 | null |
2024-05-16 | Bilateral Event Mining and Complementary for Event Stream Super-Resolution | Zhilin Huang et.al. | 2405.10037 | link |
2024-05-16 | Frequency-Domain Refinement with Multiscale Diffusion for Super Resolution | Xingjian Wang et.al. | 2405.10014 | null |
2024-05-16 | IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation Model | Yongsong Huang et.al. | 2405.09873 | link |
2024-05-15 | Perception- and Fidelity-aware Reduced-Reference Super-Resolution Image Quality Assessment | Xinying Lin et.al. | 2405.09472 | null |
2024-05-15 | Low-Complexity Joint Azimuth-Range-Velocity Estimation for Integrated Sensing and Communication with OFDM Waveform | Jun Zhang et.al. | 2405.09443 | null |
2024-05-15 | Large coordinate kernel attention network for lightweight image super-resolution | Fangwei Hao et.al. | 2405.09353 | null |
2024-05-14 | NAFRSSR: a Lightweight Recursive Network for Efficient Stereo Image Super-Resolution | Yihong Chen et.al. | 2405.08423 | link |
2024-05-17 | Exploring the Low-Pass Filtering Behavior in Image Super-Resolution | Haoyu Deng et.al. | 2405.07919 | link |
2024-05-13 | CDFormer:When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution | Qingguo Liu et.al. | 2405.07648 | link |
2024-05-11 | Semantic Guided Large Scale Factor Remote Sensing Image Super-resolution with Generative Diffusion Prior | Ce Wang et.al. | 2405.07044 | link |
2024-05-11 | Efficient Real-world Image Super-Resolution Via Adaptive Directional Gradient Convolution | Long Peng et.al. | 2405.07023 | link |
2024-05-11 | Incorporating Degradation Estimation in Light Field Spatial Super-Resolution | Zeyu Xiao et.al. | 2405.07012 | null |
2024-05-11 | Super-Resolving Blurry Images with Events | Chi Zhang et.al. | 2405.06918 | null |
2024-05-10 | Machine learning for reconstruction of polarity inversion lines from solar filaments | V. Kisielius et.al. | 2405.06293 | link |
2024-05-07 | Single-antenna 3D localization with nonseparable toroidal pulses | Ren Wang et.al. | 2405.05979 | null |
2024-05-09 | Diag2Diag: Multi modal super resolution for physics discovery with application to fusion | Azarakhsh Jalalvand et.al. | 2405.05908 | null |
2024-05-09 | Multi-Level Feature Fusion Network for Lightweight Stereo Image Super-Resolution | Yunxiang Li et.al. | 2405.05497 | link |
2024-05-08 | HMANet: Hybrid Multi-Axis Aggregation Network for Image Super-Resolution | Shu-Chuan Chu et.al. | 2405.05001 | link |
2024-05-08 | Frequency-Assisted Mamba for Remote Sensing Image Super-Resolution | Yi Xiao et.al. | 2405.04964 | link |
2024-05-08 | Teacher-Student Network for Real-World Face Super-Resolution with Progressive Embedding of Edge Information | Zhilei Liu et.al. | 2405.04778 | null |
2024-05-07 | An Advanced Features Extraction Module for Remote Sensing Image Super-Resolution | Naveed Sultan et.al. | 2405.04595 | null |
2024-05-07 | CloudDiff: Super-resolution ensemble retrieval of cloud properties for all day using the generative diffusion model | Haixia Xiao et.al. | 2405.04483 | null |
2024-05-08 | Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer | Zhuoyi Yang et.al. | 2405.04312 | link |
2024-05-06 | All-in-One Deep Learning Framework for MR Image Reconstruction | Geunu Jeong et.al. | 2405.03684 | null |
2024-05-05 | DVMSR: Distillated Vision Mamba for Efficient Super-Resolution | Xiaoyan Lei et.al. | 2405.03008 | link |
2024-05-05 | I |
Haofei Song et.al. | 2405.02857 | null |
2024-05-05 | Antenna Failure Resilience: Deep Learning-Enabled Robust DOA Estimation with Single Snapshot Sparse Arrays | Ruxin Zheng et.al. | 2405.02788 | link |
2024-05-03 | Self-Supervised Learning for Real-World Super-Resolution from Dual and Multiple Zoomed Observations | Zhilu Zhang et.al. | 2405.02171 | link |
2024-05-03 | Optical skyrmions from metafibers | Tiantian He et.al. | 2405.01962 | null |
2024-05-05 | TRAMBA: A Hybrid Transformer and Mamba Architecture for Practical Audio and Bone Conduction Speech Super Resolution and Enhancement on Mobile and Wearable Platforms | Yueyuan Sui et.al. | 2405.01242 | null |
2024-05-02 | Single Image Super-Resolution Based on Global-Local Information Synergy | Nianzu Qiao et.al. | 2405.01085 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-04-24 | Few-shot point cloud reconstruction and denoising via learned Guassian splats renderings and fine-tuned diffusion features | Pietro Bonazzi et.al. | 2404.01112 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-11-21 | InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation | Marziyeh Bamdad et.al. | 2411.14358 | null |
2024-11-20 | Robust Monocular Visual Odometry using Curriculum Learning | Assaf Lahiany et.al. | 2411.13438 | null |
2024-11-15 | BEV-ODOM: Reducing Scale Drift in Monocular Visual Odometry with BEV Representation | Yufei Wei et.al. | 2411.10195 | null |
2024-11-24 | Enhanced Monocular Visual Odometry with AR Poses and Integrated INS-GPS for Robust Localization in Urban Environments | Ankit Shaw et.al. | 2411.08231 | null |
2024-11-07 | MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation | Sayan Paul et.al. | 2411.04796 | null |
2024-10-30 | LGU-SLAM: Learnable Gaussian Uncertainty Matching with Deformable Correlation Sampling for Deep Visual SLAM | Yucheng Huang et.al. | 2410.23231 | link |
2024-10-29 | LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues | Hanqing Jiang et.al. | 2410.22213 | null |
2024-10-12 | ESVO2: Direct Visual-Inertial Odometry with Stereo Event Cameras | Junkai Niu et.al. | 2410.09374 | link |
2024-10-18 | IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera | Jian Huang et.al. | 2410.08107 | link |
2024-09-20 | Learning Visual Information Utility with PIXER | Yash Turkar et.al. | 2409.13151 | null |
2024-10-19 | ORB-SfMLearner: ORB-Guided Self-supervised Visual Odometry with Selective Online Adaptation | Yanlin Jin et.al. | 2409.11692 | null |
2024-09-14 | MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry | Yuheng Qiu et.al. | 2409.09479 | null |
2024-09-14 | GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians | Dasong Gao et.al. | 2409.09295 | null |
2024-09-14 | Panoramic Direct LiDAR-assisted Visual Odometry | Zikang Yuan et.al. | 2409.09287 | link |
2024-09-02 | Robust Vehicle Localization and Tracking in Rain using Street Maps | Yu Xiang Tan et.al. | 2409.01038 | link |
2024-08-30 | Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning | Shuyang Zhang et.al. | 2408.17005 | link |
2024-08-29 | Creating a Segmented Pointcloud of Grapevines by Combining Multiple Viewpoints Through Visual Odometry | Michael Adlerstein et.al. | 2408.16472 | null |
2024-08-28 | Single-Photon 3D Imaging with Equi-Depth Photon Histograms | Kaustubh Sadekar et.al. | 2408.16150 | null |
2024-08-28 | ES-PTAM: Event-based Stereo Parallel Tracking and Mapping | Suman Ghosh et.al. | 2408.15605 | link |
2024-08-28 | FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry | Chunran Zheng et.al. | 2408.14035 | link |
2024-08-03 | Deep Patch Visual SLAM | Lahav Lipson et.al. | 2408.01654 | link |
2024-07-25 | CodedVO: Coded Visual Odometry | Sachin Shah et.al. | 2407.18240 | null |
2024-07-22 | Reinforcement Learning Meets Visual Odometry | Nico Messikommer et.al. | 2407.15626 | link |
2024-07-21 | Semi-Supervised Pipe Video Temporal Defect Interval Localization | Zhu Huang et.al. | 2407.15170 | null |
2024-07-18 | Attenuation-Aware Weighted Optical Flow with Medium Transmission Map for Learning-based Visual Odometry in Underwater terrain | Bach Nguyen Gia et.al. | 2407.13159 | link |
2024-07-17 | Is That Rain? Understanding Effects on Visual Odometry Performance for Autonomous UAVs and Efficient DNN-based Rain Classification at the Edge | Andrea Albanese et.al. | 2407.12663 | null |
2024-07-01 | Preserving Relative Localization of FoV-Limited Drone Swarm via Active Mutual Observation | Lianjie Guo et.al. | 2407.01292 | link |
2024-08-07 | Imperative Learning: A Self-supervised Neural-Symbolic Learning Framework for Robot Autonomy | Chen Wang et.al. | 2406.16087 | null |
2024-06-16 | Self-supervised Pretraining and Finetuning for Monocular Depth and Visual Odometry | Boris Chidlovskii et.al. | 2406.11019 | null |
2024-06-12 | From Variance to Veracity: Unbundling and Mitigating Gradient Variance in Differentiable Bundle Adjustment Layers | Swaminathan Gurumurthy et.al. | 2406.07785 | link |
2024-06-03 | The Empirical Impact of Forgetting and Transfer in Continual Visual Odometry | Paolo Cudrano et.al. | 2406.01797 | null |
2024-06-03 | Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry | Takayuki Kanai et.al. | 2406.00929 | null |
2024-05-30 | TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM | Peifeng Jiang et.al. | 2405.19614 | null |
2024-06-20 | Advancements in Translation Accuracy for Stereo Visual-Inertial Initialization | Han Song et.al. | 2405.15082 | null |
2024-06-08 | EdgeLoc: A Communication-Adaptive Parallel System for Real-Time Localization in Infrastructure-Assisted Autonomous Driving | Boyi Liu et.al. | 2405.12120 | null |
2024-05-10 | MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization | Pengcheng Zhu et.al. | 2405.06241 | null |
2024-05-07 | Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map | Yuxuan Xia et.al. | 2405.04290 | null |
2024-05-07 | IMU-Aided Event-based Stereo Visual Odometry | Junkai Niu et.al. | 2405.04071 | link |
2024-04-27 | An Attention-Based Deep Learning Architecture for Real-Time Monocular Visual Odometry: Applications to GPS-free Drone Navigation | Olivier Brochu Dufour et.al. | 2404.17745 | null |
2024-04-26 | Camera Motion Estimation from RGB-D-Inertial Scene Flow | Samuel Cerezo et.al. | 2404.17251 | null |
2024-04-23 | Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization | Lahav Lipson et.al. | 2404.15263 | link |
2024-04-18 | SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints | Spencer Carmichael et.al. | 2404.12339 | null |
2024-04-17 | VBR: A Vision Benchmark in Rome | Leonardo Brizi et.al. | 2404.11322 | link |
2024-04-14 | Increasing SLAM Pose Accuracy by Ground-to-Satellite Image Registration | Yanhao Zhang et.al. | 2404.09169 | link |
2024-04-06 | Salient Sparse Visual Odometry With Pose-Only Supervision | Siyu Chen et.al. | 2404.04677 | null |
2024-03-25 | A Comparative Analysis of Visual Odometry in Virtual and Real-World Railways Environments | Gianluca D'Amico et.al. | 2403.17084 | null |
2024-03-19 | On Designing Consistent Covariance Recovery from a Deep Learning Visual Odometry Engine | Jagatpreet Singh Nir et.al. | 2403.13170 | null |
2024-03-18 | The POLAR Traverse Dataset: A Dataset of Stereo Camera Images Simulating Traverses across Lunar Polar Terrain under Extreme Lighting Conditions | Margaret Hansen et.al. | 2403.12194 | null |
2024-03-18 | An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation | Zewen Xu et.al. | 2403.11639 | null |
2024-03-16 | Efficient Domain Adaptation for Endoscopic Visual Odometry | Junyang Wu et.al. | 2403.10860 | null |
2024-03-14 | Visual Inertial Odometry using Focal Plane Binary Features (BIT-VIO) | Matthew Lisondra et.al. | 2403.09882 | null |
2024-03-02 | Grid-based Fast and Structural Visual Odometry | Zhang Zhihe et.al. | 2403.01110 | null |
2024-02-25 | VOLoc: Visual Place Recognition by Querying Compressed Lidar Map | Xudong Cai et.al. | 2402.15961 | link |
2024-02-22 | Secure Navigation using Landmark-based Localization in a GPS-denied Environment | Ganesh Sapkota et.al. | 2402.14280 | null |
2024-02-19 | Landmark-based Localization using Stereo Vision and Deep Learning in GPS-Denied Battlefield Environment | Ganesh Sapkota et.al. | 2402.12551 | null |
2024-02-07 | Online and Certifiably Correct Visual Odometry and Mapping | Devansh R Agrawal et.al. | 2402.05254 | null |
2024-02-06 | YOLOPoint Joint Keypoint and Object Detection | Anton Backhaus et.al. | 2402.03989 | link |
2024-01-19 | Motion Consistency Loss for Monocular Visual Odometry with Attention-Based Deep Learning | André O. Françani et.al. | 2401.10857 | null |
2024-01-17 | Event-Based Visual Odometry on Non-Holonomic Ground Vehicles | Wanting Xu et.al. | 2401.09331 | link |
2024-01-11 | On State Estimation in Multi-Sensor Fusion Navigation: Optimization and Filtering | Feng Zhu et.al. | 2401.05836 | null |
2023-12-19 | Loss it right: Euclidean and Riemannian Metrics in Learning-based Visual Odometry | Olaya Álvarez-Tuñón et.al. | 2401.05396 | link |
2024-01-07 | Amirkabir campus dataset: Real-world challenges and scenarios of Visual Inertial Odometry (VIO) for visually impaired people | Ali Samadzadeh et.al. | 2401.03604 | link |
2024-01-03 | LEAP-VO: Long-term Effective Any Point Tracking for Visual Odometry | Weirong Chen et.al. | 2401.01887 | null |
2023-12-28 | SR-LIVO: LiDAR-Inertial-Visual Odometry and Mapping with Sweep Reconstruction | Zikang Yuan et.al. | 2312.16800 | link |
2023-12-20 | NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields | Jens Naumann et.al. | 2312.13471 | null |
2023-12-22 | Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM | Junru Lin et.al. | 2312.13332 | null |
2023-12-20 | Brain-Inspired Visual Odometry: Balancing Speed and Interpretability through a System of Systems Approach | Habib Boloorchi Tabrizi et.al. | 2312.13162 | link |
2023-12-20 | Trajectory Approximation of Video Based on Phase Correlation for Forward Facing Camera | Abdulkadhem A. Abdulkadhem et.al. | 2312.12680 | null |
2023-12-15 | Deep Event Visual Odometry | Simon Klenk et.al. | 2312.09800 | link |
2023-12-10 | SuperPrimitive: Scene Reconstruction at a Primitive Level | Kirill Mazur et.al. | 2312.05889 | null |
2023-12-04 | iMatching: Imperative Correspondence Learning | Zitong Zhan et.al. | 2312.02141 | link |
2023-11-30 | Event-based Visual Inertial Velometer | Xiuyuan Lu et.al. | 2311.18189 | null |
2023-11-21 | CoVOR-SLAM: Cooperative SLAM using Visual Odometry and Ranges for Multi-Robot Systems | Young-Hee Lee et.al. | 2311.12580 | null |
2023-11-10 | Dense Visual Odometry Using Genetic Algorithm | Slimane Djema et.al. | 2311.06149 | null |
2023-11-07 | Inertial Guided Uncertainty Estimation of Feature Correspondence in Visual-Inertial Odometry/SLAM | Seongwook Yoon et.al. | 2311.03722 | null |
2023-10-23 | Converting Depth Images and Point Clouds for Feature-based Pose Estimation | Robert Lösch et.al. | 2310.14924 | link |
2023-10-17 | Open-Structure: a Structural Benchmark Dataset for SLAM Algorithms | Yanyan Li et.al. | 2310.10931 | link |
2023-10-12 | Jointly Optimized Global-Local Visual Localization of UAVs | Haoling Li et.al. | 2310.08082 | null |
2023-10-10 | l-dyno: framework to learn consistent visual features using robot's motion | Kartikeya Singh et.al. | 2310.06249 | link |
2023-10-08 | XVO: Generalized Visual Odometry via Cross-Modal Self-Training | Lei Lai et.al. | 2309.16772 | null |
2023-10-22 | ObVi-SLAM: Long-Term Object-Visual SLAM | Amanda Adkins et.al. | 2309.15268 | link |
2023-09-23 | Tag-based Visual Odometry Estimation for Indoor UAVs Localization | Massimiliano Bertoni et.al. | 2309.13311 | null |
2023-09-22 | Exposing the Unseen: Exposure Time Emulation for Offline Benchmarking of Vision Algorithms | Olivier Gamache et.al. | 2309.13139 | link |
2023-09-20 | Conformalized Multimodal Uncertainty Regression and Reasoning | Domenico Parente et.al. | 2309.11018 | null |
2023-09-20 | OCC-VO: Dense Mapping via 3D Occupancy-Based Visual Odometry for Autonomous Driving | Heng Li et.al. | 2309.11011 | link |
2023-09-19 | LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation | Haizhou Zhang et.al. | 2309.10436 | link |
2023-09-21 | Dive Deeper into Rectifying Homography for Stereo Camera Online Self-Calibration | Hongbo Zhao et.al. | 2309.10314 | null |
2023-09-18 | End-to-End Learned Event- and Image-based Visual Odometry | Roberto Pellerito et.al. | 2309.09947 | link |
2023-09-14 | An Explicit Method for Fast Monocular Depth Recovery in Corridor Environments | Yehao Liu et.al. | 2309.07408 | null |
2023-09-11 | Evaluating Visual Odometry Methods for Autonomous Driving in Rain | Yu Xiang Tan et.al. | 2309.05249 | null |
2023-09-08 | Robot Localization and Mapping Final Report -- Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry | Akankshya Kar et.al. | 2309.04147 | null |
2023-09-04 | EMR-MSF: Self-Supervised Recurrent Monocular Scene Flow Exploiting Ego-Motion Rigidity | Zijie Jiang et.al. | 2309.01296 | null |
2023-08-27 | Deep Learning for Visual Localization and Mapping: A Survey | Changhao Chen et.al. | 2308.14039 | null |
2023-08-19 | Enhancing State Estimation in Robots: A Data-Driven Approach with Differentiable Ensemble Kalman Filters | Xiao Liu et.al. | 2308.09870 | link |
2023-08-12 | 4DRVO-Net: Deep 4D Radar-Visual Odometry Using Multi-Modal and Multi-Scale Adaptive Fusion | Guirong Zhuo et.al. | 2308.06573 | null |
2023-08-10 | Mono-hydra: Real-time 3D scene graph construction from monocular camera input with IMU | U. V. B. L. Udugama et.al. | 2308.05515 | null |
2023-08-02 | A Small Form Factor Aerial Research Vehicle for Pick-and-Place Tasks with Onboard Real-Time Object Detection and Visual Odometry | Cora A. Dimmig et.al. | 2308.01398 | null |
2023-08-02 | Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network | Shenbagaraj Kannapiran et.al. | 2308.01125 | null |
2023-08-02 | Preliminary Design of the Dragonfly Navigation Filter | Ben Schilling et.al. | 2307.13513 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-11-29 | A Visual-inertial Localization Algorithm using Opportunistic Visual Beacons and Dead-Reckoning for GNSS-Denied Large-scale Applications | Liqiang Zhang Ye Tian Dongyan Wei et.al. | 2411.19845 | null |
2024-11-27 | Optimizing Image Retrieval with an Extended b-Metric Space | Abdelkader Belhenniche et.al. | 2411.18800 | null |
2024-11-26 | Learning Visual Hierarchies with Hyperbolic Embeddings | Ziwei Wang et.al. | 2411.17490 | null |
2024-12-02 | Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy | You Li et.al. | 2411.16752 | null |
2024-12-02 | AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks | You Li et.al. | 2411.16749 | null |
2024-11-25 | Image Generation Diversity Issues and How to Tame Them | Mischa Dombrowski et.al. | 2411.16171 | link |
2024-11-24 | PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments | Haoang Li et.al. | 2411.15800 | null |
2024-11-22 | Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval | Zengbao Sun et.al. | 2411.14704 | null |
2024-11-20 | Globally Correlation-Aware Hard Negative Generation | Wenjie Peng et.al. | 2411.13145 | link |
2024-11-18 | Exploring Emerging Trends and Research Opportunities in Visual Place Recognition | Antonios Gasteratos et.al. | 2411.11481 | null |
2024-11-13 | OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances | Youqi Liao et.al. | 2411.08665 | link |
2024-11-13 | Hopfield-Fenchel-Young Networks: A Unified Framework for Associative Memory Retrieval | Saul Santos et.al. | 2411.08590 | link |
2024-11-22 | Saliency Map-based Image Retrieval using Invariant Krawtchouk Moments | Ashkan Nejad et.al. | 2411.08567 | link |
2024-11-13 | MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation | Peng Wang et.al. | 2411.08279 | link |
2024-11-05 | From Pixels to Prose: Advancing Multi-Modal Language Models for Remote Sensing | Xintian Sun et.al. | 2411.05826 | null |
2024-11-04 | TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives | Maitreya Patel et.al. | 2411.02545 | null |
2024-11-11 | INQUIRE: A Natural World Text-to-Image Retrieval Benchmark | Edward Vendrow et.al. | 2411.02537 | link |
2024-11-20 | Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models | Sharat Agarwal et.al. | 2411.01925 | null |
2024-11-04 | Semantic Masking and Visual Feature Matching for Robust Localization | Luisa Mao et.al. | 2411.01804 | null |
2024-11-03 | Efficient Medical Image Retrieval Using DenseNet and FAISS for BIRADS Classification | MD Shaikh Rahman et.al. | 2411.01473 | null |
2024-11-01 | Identifying Implicit Social Biases in Vision-Language Models | Kimia Hamidieh et.al. | 2411.00997 | null |
2024-10-31 | Nearest Neighbor Normalization Improves Multimodal Retrieval | Neil Chowdhury et.al. | 2410.24114 | link |
2024-10-31 | MoTaDual: Modality-Task Dual Alignment for Enhanced Zero-shot Composed Image Retrieval | Haiwen Li et.al. | 2410.23736 | null |
2024-10-30 | Decoupling Semantic Similarity from Spatial Alignment for Neural Networks | Tassilo Wald et.al. | 2410.23107 | link |
2024-10-29 | Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications | Monica Riedler et.al. | 2410.21943 | link |
2024-10-28 | NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments | Taiyi Pan et.al. | 2410.21615 | link |
2024-10-25 | Context-Based Visual-Language Place Recognition | Soojin Woo et.al. | 2410.19341 | link |
2024-10-24 | ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval | Zijia Zhao et.al. | 2410.18715 | link |
2024-10-25 | On Model-Free Re-ranking for Visual Place Recognition with Deep Learned Local Features | Tomáš Pivoňka et.al. | 2410.18573 | null |
2024-10-22 | Denoise-I2W: Mapping Images to Denoising Words for Accurate Zero-Shot Composed Image Retrieval | Yuanmin Tang et.al. | 2410.17393 | null |
2024-10-20 | GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning | Haiwen Diao et.al. | 2410.15266 | link |
2024-10-19 | Visual Navigation of Digital Libraries: Retrieval and Classification of Images in the National Library of Norway's Digitised Book Collection | Marie Roald et.al. | 2410.14969 | link |
2024-10-16 | Development of Image Collection Method Using YOLO and Siamese Network | Chan Young Shin et.al. | 2410.12561 | null |
2024-10-16 | LoD-Loc: Aerial Visual Localization using LoD 3D Map with Neural Wireframe Alignment | Juelin Zhu et.al. | 2410.12269 | link |
2024-10-16 | Leveraging Spatial Attention and Edge Context for Optimized Feature Selection in Visual Localization | Nanda Febri Istighfarin et.al. | 2410.12240 | null |
2024-10-15 | LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images | Yuzhou Cheng et.al. | 2410.11505 | null |
2024-10-15 | Multiview Scene Graph | Juexiao Zhang et.al. | 2410.11187 | link |
2024-10-12 | Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence | Felipe Cadar et.al. | 2410.09533 | link |
2024-10-11 | Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System | Zheng Liu et.al. | 2410.08935 | link |
2024-10-16 | Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP | Eunji Kim et.al. | 2410.08469 | null |
2024-10-11 | A Unified Deep Semantic Expansion Framework for Domain-Generalized Person Re-identification | Eugene P. W. Ang et.al. | 2410.08456 | null |
2024-10-10 | A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks | Hoin Jung et.al. | 2410.07593 | link |
2024-10-09 | Exploiting Distribution Constraints for Scalable and Efficient Image Retrieval | Mohammad Omama et.al. | 2410.07022 | null |
2024-10-09 | Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers | Stephen Hausler et.al. | 2410.06614 | null |
2024-10-09 | MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging | Noel C. F. Codella et.al. | 2410.06542 | null |
2024-10-08 | Temporal Image Caption Retrieval Competition -- Description and Results | Jakub Pokrywka et.al. | 2410.06314 | null |
2024-10-08 | Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching | Gongxin Yao et.al. | 2410.06285 | null |
2024-10-08 | GSLoc: Visual Localization with 3D Gaussian Splatting | Kazii Botashev et.al. | 2410.06165 | null |
2024-10-08 | Beyond Captioning: Task-Specific Prompting for Improved VLM Performance in Mathematical Reasoning | Ayush Singh et.al. | 2410.05928 | null |
2024-10-08 | RNR-Nav: A Real-World Visual Navigation System Using Renderable Neural Radiance Maps | Minsoo Kim et.al. | 2410.05621 | null |
2024-10-09 | LoTLIP: Improving Language-Image Pre-training for Long Text Understanding | Wei Wu et.al. | 2410.05249 | null |
2024-10-06 | LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation | Jianhao Jiao et.al. | 2410.04419 | null |
2024-10-02 | Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension | Zaiquan Yang et.al. | 2410.01544 | null |
2024-10-03 | EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections | Francesc Net et.al. | 2410.01536 | link |
2024-10-04 | CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment | Safouane El Ghazouali et.al. | 2410.01411 | link |
2024-09-30 | Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation | Aleyna Kütük et.al. | 2410.00266 | null |
2024-09-29 | CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation | Yifan Duan et.al. | 2409.19597 | null |
2024-09-28 | VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition | Ahmad Khaliq et.al. | 2409.19293 | link |
2024-09-27 | MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion | Bardienus Duisterhof et.al. | 2409.19152 | null |
2024-09-26 | Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval | Mankeerat Sidhu et.al. | 2409.18733 | null |
2024-09-26 | Revisit Anything: Visual Place Recognition via Image Segment Retrieval | Kartik Garg et.al. | 2409.18049 | link |
2024-09-24 | GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization | Gennady Sidorov et.al. | 2409.16502 | link |
2024-09-23 | CamLoPA: A Hidden Wireless Camera Localization Framework via Signal Propagation Path Analysis | Xiang Zhang et.al. | 2409.15169 | null |
2024-09-21 | Combining Absolute and Semi-Generalized Relative Poses for Visual Localization | Vojtech Panek et.al. | 2409.14269 | null |
2024-09-21 | SplatLoc: 3D Gaussian Splatting-based Visual Localization for Augmented Reality | Hongjia Zhai et.al. | 2409.14067 | null |
2024-09-20 | Efficient and Discriminative Image Feature Extraction for Universal Image Retrieval | Morris Florek et.al. | 2409.13513 | link |
2024-09-18 | Towards Global Localization using Multi-Modal Object-Instance Re-Identification | Aneesh Chavan et.al. | 2409.12002 | link |
2024-09-17 | Open-Set Semantic Uncertainty Aware Metric-Semantic Graph Matching | Kurran Singh et.al. | 2409.11555 | null |
2024-09-17 | Obfuscation Based Privacy Preserving Representations are Recoverable Using Neighborhood Information | Kunal Chelani et.al. | 2409.11536 | null |
2024-09-17 | Improving the Efficiency of Visually Augmented Language Models | Paula Ontalvilla et.al. | 2409.11148 | null |
2024-09-21 | HGSLoc: 3DGS-based Heuristic Camera Pose Refinement | Zhongyan Niu et.al. | 2409.10925 | null |
2024-09-16 | SOLVR: Submap Oriented LiDAR-Visual Re-Localisation | Joshua Knights et.al. | 2409.10247 | null |
2024-09-16 | Garment Attribute Manipulation with Multi-level Attention | Vittorio Casula et.al. | 2409.10206 | null |
2024-09-14 | Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval | Amirreza Mahbod et.al. | 2409.09430 | link |
2024-09-12 | Structured Pruning for Efficient Visual Place Recognition | Oliver Grainge et.al. | 2409.07834 | null |
2024-09-10 | GeoCalib: Learning Single-image Calibration with Geometric Optimization | Alexander Veicht et.al. | 2409.06704 | link |
2024-09-10 | Weakly-supervised Camera Localization by Ground-to-satellite Image Registration | Yujiao Shi et.al. | 2409.06471 | link |
2024-09-10 | A Cross-Font Image Retrieval Network for Recognizing Undeciphered Oracle Bone Inscriptions | Zhicong Wu et.al. | 2409.06381 | null |
2024-09-09 | Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding | Bram Willemsen et.al. | 2409.05721 | link |
2024-09-09 | Open-World Dynamic Prompt and Continual Visual Representation Learning | Youngeun Kim et.al. | 2409.05312 | null |
2024-09-12 | Training-free ZS-CIR via Weighted Modality Fusion and Similarity | Ren-Di Wu et.al. | 2409.04918 | link |
2024-09-12 | Zero-Shot Whole Slide Image Retrieval in Histopathology Using Embeddings of Foundation Models | Saghir Alfasly et.al. | 2409.04631 | null |
2024-09-06 | Reprojection Errors as Prompts for Efficient Scene Coordinate Regression | Ting-Ru Liu et.al. | 2409.04178 | null |
2024-09-06 | Matched Filtering based LiDAR Place Recognition for Urban and Natural Environments | Therese Joseph et.al. | 2409.03998 | null |
2024-09-04 | Design and Evaluation of Camera-Centric Mobile Crowdsourcing Applications | Abby Stylianou et.al. | 2409.03012 | null |
2024-09-04 | NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval | Sepanta Zeighami et.al. | 2409.02343 | link |
2024-09-03 | Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding Alignment | Konstantin Schall et.al. | 2409.01936 | link |
2024-09-02 | A Review of Image Retrieval Techniques: Data Augmentation and Adversarial Learning Approaches | Kim Jinwoo et.al. | 2409.01219 | null |
2024-09-02 | Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection | Manon Kok et.al. | 2409.01091 | null |
2024-09-02 | Evidential Transformers for Improved Image Retrieval | Danilo Dordevic et.al. | 2409.01082 | null |
2024-09-05 | EgoHDM: An Online Egocentric-Inertial Human Motion Capture, Localization, and Dense Mapping System | Bonan Liu et.al. | 2409.00343 | null |
2024-09-04 | Augmented Reality without Borders: Achieving Precise Localization Without Maps | Albert Gassol Puigjaner et.al. | 2408.17373 | null |
2024-09-02 | RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance | Avideep Mukherjee et.al. | 2408.17095 | null |
2024-08-29 | A compact neuromorphic system for ultra energy-efficient, on-device robot localization | Adam D. Hines et.al. | 2408.16754 | link |
2024-08-29 | Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models | Kengo Nakata et.al. | 2408.16296 | null |
2024-08-28 | Temporal Attention for Cross-View Sequential Image Localization | Dong Yuan et.al. | 2408.15569 | link |
2024-08-27 | Snap and Diagnose: An Advanced Multimodal Retrieval System for Identifying Plant Diseases in the Wild | Tianqi Wei et.al. | 2408.14723 | null |
2024-08-25 | LowCLIP: Adapting the CLIP Model Architecture for Low-Resource Languages in Multimodal Image Retrieval Task | Ali Asgarov et.al. | 2408.13909 | link |
2024-08-15 | Cross-Modal Denoising: A Novel Training Paradigm for Enhancing Speech-Image Retrieval | Lifeng Zhou et.al. | 2408.13705 | null |
2024-08-15 | Coarse-to-fine Alignment Makes Better Speech-image Retrieval | Lifeng Zhou et.al. | 2408.13119 | null |
2024-08-21 | FUSELOC: Fusing Global and Local Descriptors to Disambiguate 2D-3D Matching in Visual Localization | Son Tung Nguyen et.al. | 2408.12037 | link |
2024-08-21 | Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations | Lintong Zhang et.al. | 2408.11966 | null |
2024-08-21 | UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation | Xiangyu Zhao et.al. | 2408.11305 | link |
2024-08-20 | GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting | Changkun Liu et.al. | 2408.11085 | null |
2024-08-19 | BrewCLIP: A Bifurcated Representation Learning Framework for Audio-Visual Retrieval | Zhenyu Lu et.al. | 2408.10383 | null |
2024-08-23 | Fashion Image-to-Image Translation for Complementary Item Retrieval | Matteo Attimonelli et.al. | 2408.09847 | link |
2024-08-20 | MambaLoc: Efficient Camera Localisation via State Space Model | Jialu Wang et.al. | 2408.09680 | null |
2024-08-15 | DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions | Ryosuke Korekata et.al. | 2408.07910 | null |
2024-08-13 | A Miniature Vision-Based Localization System for Indoor Blimps | Shicong Ma et.al. | 2408.06648 | null |
2024-08-10 | Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network | Junyan Ye et.al. | 2408.05475 | link |
2024-08-09 | Spherical World-Locking for Audio-Visual Localization in Egocentric Videos | Heeseung Yun et.al. | 2408.05364 | null |
2024-08-06 | AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval | Pavel Suma et.al. | 2408.03282 | link |
2024-08-05 | CMR-Agent: Learning a Cross-Modal Agent for Iterative Image-to-Point Cloud Registration | Gongxin Yao et.al. | 2408.02394 | null |
2024-08-09 | BEVPlace++: Fast, Robust, and Lightweight LiDAR Global Localization for Unmanned Ground Vehicles | Lun Luo et.al. | 2408.01841 | link |
2024-08-02 | On Validation of Search & Retrieval of Tissue Images in Digital Pathology | H. R. Tizhoosh et.al. | 2408.01570 | null |
2024-07-31 | VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Lifelong Learning | Yuhang Ming et.al. | 2407.21416 | null |
2024-07-31 | SuperVINS: A visual-inertial SLAM framework integrated deep learning features | Hongkun Luo et.al. | 2407.21348 | link |
2024-07-30 | Re-localization acceleration with Medoid Silhouette Clustering | Hongyi Zhang et.al. | 2407.20749 | null |
2024-07-29 | A flexible framework for accurate LiDAR odometry, map manipulation, and localization | José Luis Blanco-Claraco et.al. | 2407.20465 | link |
2024-07-26 | From 2D to 3D: AISG-SLA Visual Localization Challenge | Jialin Gao et.al. | 2407.18590 | null |
2024-07-24 | Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation | Yongqi Li et.al. | 2407.17274 | null |
2024-07-24 | Active Loop Closure for OSM-guided Robotic Mapping in Large-Scale Urban Environments | Wei Gao et.al. | 2407.17078 | null |
2024-07-24 | Pose Estimation from Camera Images for Underwater Inspection | Luyuan Peng et.al. | 2407.16961 | null |
2024-07-22 | Memory Management for Real-Time Appearance-Based Loop Closure Detection | Mathieu Labbé et.al. | 2407.15890 | null |
2024-07-22 | RADA: Robust and Accurate Feature Learning with Domain Adaptation | Jingtai He et.al. | 2407.15791 | null |
2024-07-22 | Online Global Loop Closure Detection for Large-Scale Multi-Session Graph-Based SLAM | Mathieu Labbe et.al. | 2407.15305 | null |
2024-07-22 | Appearance-Based Loop Closure Detection for Online Large-Scale and Long-Term Operation | Mathieu Labbé et.al. | 2407.15304 | null |
2024-07-19 | Double-Layer Soft Data Fusion for Indoor Robot WiFi-Visual Localization | Yuehua Ding et.al. | 2407.14643 | null |
2024-07-18 | Visual Haystacks: Answering Harder Questions About Sets of Images | Tsung-Han Wu et.al. | 2407.13766 | link |
2024-07-17 | Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM | Markus Weißflog et.al. | 2407.12408 | null |
2024-07-17 | GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection | Jingwen Yu et.al. | 2407.11736 | link |
2024-07-16 | EndoFinder: Online Image Retrieval for Explainable Colorectal Polyp Diagnosis | Ruijie Yang et.al. | 2407.11401 | null |
2024-07-15 | No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations | Walter Simoncini et.al. | 2407.10964 | link |
2024-07-15 | DINO Pre-training for Vision-based End-to-end Autonomous Driving | Shubham Juneja et.al. | 2407.10803 | null |
2024-07-15 | Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval | Youngsun Lim et.al. | 2407.10683 | null |
2024-07-15 | An evaluation of CNN models and data augmentation techniques in hierarchical localization of mobile robots | J. J. Cabrera et.al. | 2407.10596 | link |
2024-07-15 | An experimental evaluation of Siamese Neural Networks for robot localization using omnidirectional imaging in indoor environments | J. J. Cabrera et.al. | 2407.10536 | null |
2024-07-12 | Are They the Same Picture? Adapting Concept Bottleneck Models for Human-AI Collaboration in Image Retrieval | Vaibhav Balloli et.al. | 2407.08908 | link |
2024-07-11 | Improving Visual Place Recognition Based Robot Navigation Through Verification of Localization Estimates | Owen Claxton et.al. | 2407.08162 | link |
2024-07-12 | Lifelong Histopathology Whole Slide Image Retrieval via Distance Consistency Rehearsal | Xinyu Zhu et.al. | 2407.08153 | link |
2024-07-11 | SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM | Neng Wang et.al. | 2407.08106 | link |
2024-07-09 | LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition | Teng Wang et.al. | 2407.06730 | null |
2024-07-09 | CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding | Wenhao Xu et.al. | 2407.06611 | null |
2024-07-08 | Pseudo-triplet Guided Few-shot Composed Image Retrieval | Bohan Hou et.al. | 2407.06001 | null |
2024-07-09 | HyCIR: Boosting Zero-Shot Composed Image Retrieval with Synthetic Labels | Yingying Jiang et.al. | 2407.05795 | null |
2024-07-05 | Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning | Mainak Singha et.al. | 2407.04207 | link |
2024-07-04 | Visualizing Dialogues: Enhancing Image Selection through Dialogue Understanding with Large Language Models | Chang-Sheng Kao et.al. | 2407.03615 | link |
2024-07-03 | Celeb-FBI: A Benchmark Dataset on Human Full Body Images and Age, Gender, Height and Weight Estimation using Deep Learning Approach | Pronay Debnath et.al. | 2407.03486 | null |
2024-07-02 | Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition | Sergio Izquierdo et.al. | 2407.02422 | link |
2024-07-01 | Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval | Aneeshan Sain et.al. | 2407.01810 | null |
2024-07-01 | Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval | Hanwen Su et.al. | 2407.00979 | null |
2024-07-01 | Dynamically Modulating Visual Place Recognition Sequence Length For Minimum Acceptable Performance Scenarios | Connor Malone et.al. | 2407.00863 | null |
2024-06-27 | PathAlign: A vision-language model for whole slide images in histopathology | Faruk Ahmed et.al. | 2406.19578 | null |
2024-07-05 | 360 in the Wild: Dataset for Depth Prediction and View Synthesis | Kibaek Park et.al. | 2406.18898 | null |
2024-06-27 | Zero-shot Composed Image Retrieval Considering Query-target Relationship Leveraging Masked Image-text Pairs | Huaying Zhang et.al. | 2406.18836 | null |
2024-06-26 | WV-Net: A foundation model for SAR WV-mode satellite imagery trained using contrastive self-supervised learning on 10 million images | Yannik Glaser et.al. | 2406.18765 | null |
2024-06-26 | View-Invariant Pixelwise Anomaly Detection in Multi-object Scenes with Adaptive View Synthesis | Subin Varghese et.al. | 2406.18012 | null |
2024-06-25 | Tell Me Where You Are: Multimodal LLMs Meet Place Recognition | Zonglin Lyu et.al. | 2406.17520 | null |
2024-06-25 | SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation | Xu Liu et.al. | 2406.17249 | null |
2024-06-23 | Breaking the Frame: Image Retrieval by Visual Overlap Prediction | Tong Wei et.al. | 2406.16204 | link |
2024-06-19 | Towards a multimodal framework for remote sensing image change retrieval and captioning | Roger Ferrod et.al. | 2406.13424 | link |
2024-06-19 | CLIP-Branches: Interactive Fine-Tuning for Text-Image Retrieval | Christian Lülf et.al. | 2406.13322 | link |
2024-06-17 | Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization | Huaiji Zhou et.al. | 2406.11766 | null |
2024-06-22 | Simple Yet Efficient: Towards Self-Supervised FG-SBIR with Unified Sample Feature Alignment | Jianan Jiang et.al. | 2406.11551 | link |
2024-06-17 | They're All Doctors: Synthesizing Diverse Counterfactuals to Mitigate Associative Bias | Salma Abdel Magid et.al. | 2406.11331 | null |
2024-06-17 | Accurate and Fast Pixel Retrieval with Spatial and Uncertainty Aware Hypergraph Diffusion | Guoyuan An et.al. | 2406.11242 | null |
2024-06-14 | Annotation Cost-Efficient Active Learning for Deep Metric Learning Driven Remote Sensing Image Retrieval | Genc Hoxha et.al. | 2406.10107 | null |
2024-06-14 | BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval | Imanol Miranda et.al. | 2406.09952 | link |
2024-06-13 | Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases | Meng Wang et.al. | 2406.09317 | link |
2024-06-13 | Reducing Task Discrepancy of Text Encoders for Zero-Shot Composed Image Retrieval | Jaeseok Byun et.al. | 2406.09188 | null |
2024-06-13 | DenoiseReID: Denoising Model for Representation Learning of Person Re-Identification | Zhengrui Xu et.al. | 2406.08773 | null |
2024-06-12 | Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement | Maxime Pietrantoni et.al. | 2406.08463 | null |
2024-06-12 | ConceptHash: Interpretable Fine-Grained Hashing via Concept Discovery | Kam Woh Ng et.al. | 2406.08457 | link |
2024-06-11 | Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Renjie Pi et.al. | 2406.07502 | link |
2024-06-11 | Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning | Shuvendu Roy et.al. | 2406.07450 | link |
2024-06-11 | Fetch-A-Set: A Large-Scale OCR-Free Benchmark for Historical Document Retrieval | Adrià Molina et.al. | 2406.07315 | null |
2024-06-10 | Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation | Shenghao Li et.al. | 2406.06374 | link |
2024-06-09 | Unified Text-to-Image Generation and Retrieval | Leigang Qu et.al. | 2406.05814 | null |
2024-06-07 | The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better | Scott Geng et.al. | 2406.05184 | link |
2024-06-07 | PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction | Eduard Poesina et.al. | 2406.04746 | link |
2024-06-06 | GLACE: Global Local Accelerated Coordinate Encoding | Fangjinhua Wang et.al. | 2406.04340 | link |
2024-06-06 | Monocular Localization with Semantics Map for Autonomous Vehicles | Jixiang Wan et.al. | 2406.03835 | null |
2024-06-05 | Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach | Saehyung Lee et.al. | 2406.03411 | link |
2024-06-04 | MeshVPR: Citywide Visual Place Recognition Using 3D Meshes | Gabriele Berton et.al. | 2406.02776 | null |
2024-06-04 | Can CLIP help CLIP in learning 3D? | Cristian Sbrolli et.al. | 2406.02202 | null |
2024-06-03 | Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP | Sriram Balasubramanian et.al. | 2406.01583 | link |
2024-06-03 | Scale-Free Image Keypoints Using Differentiable Persistent Homology | Giovanni Barbarani et.al. | 2406.01315 | link |
2024-06-02 | Visual place recognition for aerial imagery: A survey | Ivan Moskalenko et.al. | 2406.00885 | link |
2024-06-01 | NuRF: Nudging the Particle Filter in Radiance Fields for Robot Visual Localization | Wugang Meng et.al. | 2406.00312 | null |
2024-05-31 | DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models | Linli Yao et.al. | 2405.20985 | link |
2024-05-29 | Multi-Modal Generative Embedding Model | Feipeng Ma et.al. | 2405.19333 | null |
2024-05-29 | ContextBLIP: Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions | Honglin Lin et.al. | 2405.19226 | null |
2024-05-30 | CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval | Xintong Jiang et.al. | 2405.19149 | link |
2024-05-29 | SketchTriplet: Self-Supervised Scenarized Sketch-Text-Image Triplet Generation | Zhenbei Wu et.al. | 2405.18801 | null |
2024-05-29 | Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs | Jialiang Xu et.al. | 2405.18740 | link |
2024-05-28 | EffoVPR: Effective Foundation Model Utilization for Visual Place Recognition | Issar Tzachor et.al. | 2405.18065 | null |
2024-05-28 | AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval | Sihe Zhang et.al. | 2405.17718 | null |
2024-05-26 | MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups | Yusen Xie et.al. | 2405.16599 | null |
2024-05-29 | Composed Image Retrieval for Remote Sensing | Bill Psomas et.al. | 2405.15587 | link |
2024-05-24 | Self-distilled Dynamic Fusion Network for Language-based Fashion Retrieval | Yiming Wu et.al. | 2405.15451 | null |
2024-05-20 | UAV-VisLoc: A Large-scale Dataset for UAV Visual Localization | Wenjia Xu et.al. | 2405.11936 | link |
2024-05-19 | Register assisted aggregation for Visual Place Recognition | Xuan Yu et.al. | 2405.11526 | null |
2024-05-26 | CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion | Gang Wang et.al. | 2405.10793 | null |
2024-05-16 | FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models | Adrian Bulat et.al. | 2405.10286 | null |
2024-05-15 | Content-Based Image Retrieval for Multi-Class Volumetric Radiology Images: A Benchmark Study | Farnaz Khun Jush et.al. | 2405.09334 | null |
2024-05-14 | BEVRender: Vision-based Cross-view Vehicle Registration in Off-road GNSS-denied Environment | Lihong Jin et.al. | 2405.09001 | null |
2024-05-14 | TP3M: Transformer-based Pseudo 3D Image Matching with Reference | Liming Han et.al. | 2405.08434 | null |
2024-05-13 | OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition | Qiuchi Xiang et.al. | 2405.07966 | link |
2024-05-14 | HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval | Chao He et.al. | 2405.07524 | link |
2024-05-13 | JointLoc: A Real-time Visual Localization Framework for Planetary UAVs Based on Joint Relative and Absolute Pose Estimation | Xubo Luo et.al. | 2405.07429 | link |
2024-05-12 | BoQ: A Place is Worth a Bag of Learnable Queries | Amar Ali-bey et.al. | 2405.07364 | link |
2024-05-07 | Breast Histopathology Image Retrieval by Attention-based Adversarially Regularized Variational Graph Autoencoder with Contrastive Learning-Based Feature Extraction | Nematollah Saeidi et.al. | 2405.04211 | null |
2024-05-06 | A New Robust Partial |
Sharath Raghvendra et.al. | 2405.03664 | null |
2024-05-06 | Knowledge-aware Text-Image Retrieval for Remote Sensing Images | Li Mi et.al. | 2405.03373 | null |
2024-05-06 | Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval | Jiacheng Cheng et.al. | 2405.03190 | null |
2024-05-05 | iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image Retrieval | Lorenzo Agnolucci et.al. | 2405.02951 | link |
2024-05-01 | Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval | Young Kyun Jang et.al. | 2405.00571 | null |
2024-04-30 | Large Language Model Informed Patent Image Retrieval | Hao-Cheng Lo et.al. | 2404.19360 | null |
2024-04-30 | XFeat: Accelerated Features for Lightweight Image Matching | Guilherme Potje et.al. | 2404.19174 | null |
2024-04-29 | Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models | Hongyi Zhu et.al. | 2404.18746 | null |
2024-04-29 | Dual-Modal Prompting for Sketch-Based Image Retrieval | Liying Gao et.al. | 2404.18695 | null |
2024-05-01 | Semantic Line Combination Detector | Jinwon Ko et.al. | 2404.18399 | link |
2024-04-26 | Learning text-to-video retrieval from image captioning | Lucas Ventura et.al. | 2404.17498 | null |
2024-04-25 | CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching | Samia Shafique et.al. | 2404.16972 | link |
2024-04-29 | Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval | Ryoya Nara et.al. | 2404.16398 | null |
2024-04-24 | Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval | Haokun Wen et.al. | 2404.15875 | link |
2024-04-24 | DVF: Advancing Robust and Accurate Fine-Grained Image Retrieval with Retrieval Guidelines | Xin Jiang et.al. | 2404.15771 | null |
2024-04-23 | Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval | Young Kyun Jang et.al. | 2404.15516 | null |
2024-04-22 | EcoPull: Sustainable IoT Image Retrieval Empowered by TinyML Models | Mathias Thorsager et.al. | 2404.14236 | null |
2024-04-22 | Hierarchical localization with panoramic views and triplet loss functions | Marcos Alfaro et.al. | 2404.14117 | link |
2024-04-20 | High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces | Baoru Huang et.al. | 2404.13437 | null |
2024-04-20 | Collaborative Visual Place Recognition through Federated Learning | Mattia Dutto et.al. | 2404.13324 | null |
2024-04-18 | SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints | Spencer Carmichael et.al. | 2404.12339 | null |
2024-04-17 | Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives | Zhangchi Feng et.al. | 2404.11317 | link |
2024-04-17 | Spatial-Aware Image Retrieval: A Hyperdimensional Computing Approach for Efficient Similarity Hashing | Sanggeon Yun et.al. | 2404.11025 | null |
2024-04-16 | SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments | Niklas Gard et.al. | 2404.10527 | link |
2024-04-20 | CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning | Haojian Huang et.al. | 2404.09640 | link |
2024-04-11 | PRAM: Place Recognition Anywhere Model for Efficient Visual Localization | Fei Xue et.al. | 2404.07785 | null |
2024-04-16 | 2DLIW-SLAM:2D LiDAR-Inertial-Wheel Odometry with Real-Time Loop Closure | Bin Zhang et.al. | 2404.07644 | link |
2024-04-11 | Semantically-correlated memories in a dense associative model | Thomas F Burns et.al. | 2404.07123 | link |
2024-04-09 | Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation | Luca Barsellotti et.al. | 2404.06542 | null |
2024-04-09 | Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping | Anas Gouda et.al. | 2404.06277 | link |
2024-04-07 | Weakly Supervised Deep Hyperspherical Quantization for Image Retrieval | Jinpeng Wang et.al. | 2404.04998 | link |
2024-04-06 | Soft-Prompting with Graph-of-Thought for Multi-modal Representation Learning | Juncheng Yang et.al. | 2404.04538 | link |
2024-04-05 | Towards introspective loop closure in 4D radar SLAM | Maximilian Hilger et.al. | 2404.03940 | null |
2024-04-02 | TSCM: A Teacher-Student Model for Vision Place Recognition Using Cross-Metric Knowledge Distillation | Yehui Shen et.al. | 2404.01587 | link |
2024-04-01 | On Train-Test Class Overlap and Detection for Image Retrieval | Chull Hwan Song et.al. | 2404.01524 | link |
2024-04-01 | NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification | Juyeop Han et.al. | 2404.01400 | null |
2024-03-31 | On the Estimation of Image-matching Uncertainty in Visual Place Recognition | Mubariz Zaffar et.al. | 2404.00546 | null |
2024-03-31 | NYC-Indoor-VPR: A Long-Term Indoor Visual Place Recognition Dataset with Semi-Automatic Annotation | Diwei Sheng et.al. | 2404.00504 | null |
2024-03-30 | SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs | Yang Miao et.al. | 2404.00469 | null |
2024-03-30 | Do Vision-Language Models Understand Compound Nouns? | Sonal Kumar et.al. | 2404.00419 | link |
2024-04-05 | FairRAG: Fair Human Generation via Fair Retrieval Augmentation | Robik Shrestha et.al. | 2403.19964 | null |
2024-03-28 | JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition | Gabriele Berton et.al. | 2403.19787 | link |
2024-03-28 | MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions | Kai Zhang et.al. | 2403.19651 | link |
2024-03-27 | AIR-HLoc: Adaptive Image Retrieval for Efficient Visual Localisation | Changkun Liu et.al. | 2403.18281 | null |
2024-03-26 | Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge | Dongjin Kim et.al. | 2403.17420 | link |
2024-03-25 | Enhancing Visual Place Recognition via Fast and Slow Adaptive Biasing in Event Cameras | Gokul B. Nair et.al. | 2403.16425 | link |
2024-03-24 | Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval | Yucheng Suo et.al. | 2403.16005 | link |
2024-03-24 | BIMCV-R: A Landmark Dataset for 3D CT Text-Image Retrieval | Yinda Chen et.al. | 2403.15992 | null |
2024-03-22 | Long-CLIP: Unlocking the Long-Text Capability of CLIP | Beichen Zhang et.al. | 2403.15378 | link |
2024-03-22 | A Multimodal Approach for Cross-Domain Image Retrieval | Lucas Iijima et.al. | 2403.15152 | null |
2024-03-22 | Piecewise-Linear Manifolds for Deep Metric Learning | Shubhang Bhatnagar et.al. | 2403.14977 | null |
2024-03-21 | Enhancing Historical Image Retrieval with Compositional Cues | Tingyu Lin et.al. | 2403.14287 | link |
2024-03-20 | Leveraging High-Resolution Features for Improved Deep Hashing-based Image Retrieval | Aymene Berriche et.al. | 2403.13747 | null |
2024-03-20 | Flickr30K-CFQ: A Compact and Fragmented Query Dataset for Text-image Retrieval | Haoyu Liu et.al. | 2403.13317 | null |
2024-03-19 | Learning Neural Volumetric Pose Features for Camera Localization | Jingyu Lin et.al. | 2403.12800 | null |
2024-03-19 | Quantixar: High-performance Vector Data Management System | Gulshan Yadav et.al. | 2403.12583 | null |
2024-03-17 | 3DGS-ReLoc: 3D Gaussian Splatting for Map Representation and Visual ReLocalization | Peng Jiang et.al. | 2403.11367 | null |
2024-03-17 | MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data | Paul S. Scotti et.al. | 2403.11207 | link |
2024-03-16 | Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval | Shunsuke Tsubaki et.al. | 2403.10756 | null |
2024-03-16 | Vector search with small radiuses | Gergely Szilvasy et.al. | 2403.10746 | null |
2024-03-13 | Training Self-localization Models for Unseen Unfamiliar Places via Teacher-to-Student Data-Free Knowledge Transfer | Kenta Tsukahara et.al. | 2403.10552 | null |
2024-03-20 | Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression | Huy-Hoang Bui et.al. | 2403.10297 | link |
2024-03-15 | Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline | Fangming Yuan et.al. | 2403.10283 | null |
2024-03-14 | The NeRFect Match: Exploring NeRF Features for Visual Localization | Qunjie Zhou et.al. | 2403.09577 | null |
2024-03-14 | VDNA-PR: Using General Dataset Representations for Robust Sequential Visual Place Recognition | Benjamin Ramtoula et.al. | 2403.09025 | null |
2024-03-13 | PAPERCLIP: Associating Astronomical Observations and Natural Language with Multi-Modal Models | Siddharth Mishra-Sharma et.al. | 2403.08851 | link |
2024-03-13 | NeRF-Supervised Feature Point Detection and Description | Ali Youssef et.al. | 2403.08156 | link |
2024-03-12 | It's All About Your Sketch: Democratising Sketch Control in Diffusion Models | Subhadeep Koley et.al. | 2403.07234 | link |
2024-03-12 | You'll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval | Subhadeep Koley et.al. | 2403.07222 | null |
2024-03-12 | Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers | Subhadeep Koley et.al. | 2403.07214 | null |
2024-03-11 | How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval? | Subhadeep Koley et.al. | 2403.07203 | null |
2024-03-11 | EarthLoc: Astronaut Photography Localization by Indexing Earth from Space | Gabriele Berton et.al. | 2403.06758 | link |
2024-03-11 | BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues | Fudong Ge et.al. | 2403.06600 | link |
2024-03-11 | Leveraging Foundation Models for Content-Based Medical Image Retrieval in Radiology | Stefan Denner et.al. | 2403.06567 | link |
2024-03-10 | RTAB-Map as an Open-Source Lidar and Visual SLAM Library for Large-Scale and Long-Term Online Operation | Mathieu Labbé et.al. | 2403.06341 | null |
2024-03-10 | Texture image retrieval using a classification and contourlet-based features | Asal Rouhafzay et.al. | 2403.06048 | null |
2024-03-11 | LHMap-loc: Cross-Modal Monocular Localization Using LiDAR Point Cloud Heat Map | Xinrui Wu et.al. | 2403.05002 | link |
2024-03-11 | Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed | Yifan Wang et.al. | 2403.04765 | null |
2024-03-07 | mmPlace: Robust Place Recognition with Intermediate Frequency Signal of Low-cost Single-chip Millimeter Wave Radar | Chengzhen Meng et.al. | 2403.04703 | null |
2024-03-06 | Self-supervised Photographic Image Layout Representation Learning | Zhaoran Zhao et.al. | 2403.03740 | link |
2024-03-04 | Multi-Spectral Remote Sensing Image Retrieval Using Geospatial Foundation Models | Benedikt Blumenstiel et.al. | 2403.02059 | link |
2024-03-03 | Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval | Yongchao Du et.al. | 2403.01431 | null |
2024-03-01 | Asymmetric Feature Fusion for Image Retrieval | Hui Wu et.al. | 2403.00671 | null |
2024-03-01 | Structure Similarity Preservation Learning for Asymmetric Image Retrieval | Hui Wu et.al. | 2403.00648 | link |
2024-02-29 | CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition | Feng Lu et.al. | 2402.19231 | link |
2024-02-28 | Unsupervised Cross-Domain Image Retrieval via Prototypical Optimal Transport | Bin Li et.al. | 2402.18411 | link |
2024-02-28 | Balanced Similarity with Auxiliary Prompts: Towards Alleviating Text-to-Image Retrieval Bias for CLIP in Zero-shot Learning | Hanyao Wang et.al. | 2402.18400 | null |
2024-02-28 | Representing 3D sparse map points and lines for camera relocalization | Bach-Thuan Bui et.al. | 2402.18011 | link |
2024-02-27 | Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control | Thong Nguyen et.al. | 2402.17535 | link |
2024-02-29 | Active propulsion noise shaping for multi-rotor aircraft localization | Gabriele Serussi et.al. | 2402.17289 | link |
2024-02-27 | NocPlace: Nocturnal Visual Place Recognition Using Generative and Inherited Knowledge Transfer | Bingxi Liu et.al. | 2402.17159 | link |
2024-02-25 | Deep Homography Estimation for Visual Place Recognition | Feng Lu et.al. | 2402.16086 | link |
2024-02-25 | VOLoc: Visual Place Recognition by Querying Compressed Lidar Map | Xudong Cai et.al. | 2402.15961 | link |
2024-02-28 | Text2Pic Swift: Enhancing Long-Text to Image Retrieval for Large-Scale Libraries | Zijun Long et.al. | 2402.15276 | null |
2024-02-23 | Fine-tuning CLIP Text Encoders with Two-step Paraphrasing | Hyunjae Kim et.al. | 2402.15120 | null |
2024-02-22 | Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition | Feng Lu et.al. | 2402.14505 | link |
2024-02-16 | Spike-EVPR: Deep Spiking Residual Network with Cross-Representation Aggregation for Event-Based Visual Place Recognition | Chenming Hu et.al. | 2402.10476 | null |
2024-02-15 | Self-Supervised Learning of Visual Robot Localization Using LED State Prediction as a Pretext Task | Mirko Nava et.al. | 2402.09886 | link |
2024-02-14 | Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency | Yannis Kalantidis et.al. | 2402.09237 | null |
2024-02-13 | Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast | Xiangming Gu et.al. | 2402.08567 | link |
2024-02-13 | Learning to Produce Semi-dense Correspondences for Visual Localization | Khang Truong Giang et.al. | 2402.08359 | link |
2024-02-10 | Semantic Object-level Modeling for Robust Visual Camera Relocalization | Yifan Zhu et.al. | 2402.06951 | null |
2024-02-09 | Large Language Models for Captioning and Retrieving Remote Sensing Images | João Daniel Silva et.al. | 2402.06475 | null |
2024-02-09 | PAS-SLAM: A Visual SLAM System for Planar Ambiguous Scenes | Xinggang Hu et.al. | 2402.06131 | null |
2024-02-21 | MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction | Heng Zhou et.al. | 2402.03762 | null |
2024-02-04 | Region-Based Representations Revisited | Michal Shlapentokh-Rothman et.al. | 2402.02352 | link |
2024-02-03 | Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization | Bo Yang et.al. | 2402.02141 | link |
2024-02-01 | BrainSLAM: SLAM on Neural Population Activity Data | Kipp Freud et.al. | 2402.00588 | null |
2024-02-01 | Night-Rider: Nocturnal Vision-aided Localization in Streetlight Maps Using Invariant Extended Kalman Filtering | Tianxiao Gao et.al. | 2402.00330 | link |
2024-01-31 | Improved Scene Landmark Detection for Camera Localization | Tien Do et.al. | 2401.18083 | link |
2024-01-31 | Local Feature Matching Using Deep Learning: A Survey | Shibiao Xu et.al. | 2401.17592 | link |
2024-01-29 | Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors | Shiyin Dong et.al. | 2401.16459 | null |
2024-01-29 | Cross-Modal Coordination Across a Diverse Set of Input Modalities | Jorge Sánchez et.al. | 2401.16347 | null |
2024-01-29 | Regressing Transformers for Data-efficient Visual Place Recognition | María Leyva-Vallina et.al. | 2401.16304 | null |
2024-01-27 | Transformer-based Clipped Contrastive Quantization Learning for Unsupervised Image Retrieval | Ayush Dubey et.al. | 2401.15362 | null |
2024-01-24 | Enhancing Image Retrieval : A Comprehensive Study on Photo Search using the CLIP Mode | Naresh Kumar Lahajal et.al. | 2401.13613 | null |
2024-01-23 | PlaceFormer: Transformer-based Visual Place Recognition using Multi-Scale Patch Selection and Fusion | Shyam Sundar Kannan et.al. | 2401.13082 | null |
2024-01-23 | SemanticSLAM: Learning based Semantic Map Construction and Robust Camera Localization | Mingyang Li et.al. | 2401.13076 | link |
2024-01-25 | CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios | Xiangshuo Qiao et.al. | 2401.10475 | link |
2024-01-19 | PhotoScout: Synthesis-Powered Multi-Modal Image Search | Celeste Barnaby et.al. | 2401.10464 | null |
2024-01-19 | Cross-Modality Perturbation Synergy Attack for Person Re-identification | Yunpeng Gong et.al. | 2401.10090 | null |
2024-01-16 | Siamese Content-based Search Engine for a More Transparent Skin and Breast Cancer Diagnosis through Histological Imaging | Zahra Tabatabaei et.al. | 2401.08272 | null |
2024-01-16 | Multi-Technique Sequential Information Consistency For Dynamic Visual Place Recognition In Changing Environments | Bruno Arcanjo et.al. | 2401.08263 | null |
2024-01-15 | Exploring Masked Autoencoders for Sensor-Agnostic Image Retrieval in Remote Sensing | Jakob Hackstein et.al. | 2401.07782 | link |
2024-01-14 | HiHPQ: Hierarchical Hyperbolic Product Quantization for Unsupervised Image Retrieval | Zexuan Qiu et.al. | 2401.07212 | link |
2024-01-11 | UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization | Rouwan Wu et.al. | 2401.05971 | link |
2024-01-10 | Modality-Aware Representation Learning for Zero-shot Sketch-based Image Retrieval | Eunyi Lyou et.al. | 2401.04860 | link |
2024-01-05 | Benchmarking PathCLIP for Pathology Image Analysis | Sunyi Zheng et.al. | 2401.02651 | null |
2024-01-03 | DDN-SLAM: Real-time Dense Dynamic Neural Implicit SLAM with Joint Semantic Encoding | Mingrui Li et.al. | 2401.01545 | null |
2024-01-02 | BEV-CLIP: Multi-modal BEV Retrieval Methodology for Complex Scene in Autonomous Driving | Dafeng Wei et.al. | 2401.01065 | null |
2023-12-31 | Multi-Granularity Representation Learning for Sketch-based Dynamic Face Image Retrieval | Liang Wang et.al. | 2401.00371 | link |
2023-12-29 | Bayesian Recursive Information Optical Imaging: A Ghost Imaging Scheme Based on Bayesian Filtering | Long-Kun Du et.al. | 2401.00032 | null |
2023-12-27 | LIP-Loc: LiDAR Image Pretraining for Cross-Modal Localization | Sai Shubodh Puligilla et.al. | 2312.16648 | null |
2023-12-26 | Recursive Distillation for Open-Set Distributed Robot Localization | Kenta Tsukahara et.al. | 2312.15897 | null |
2023-12-24 | Residual Learning for Image Point Descriptors | Rashik Shrestha et.al. | 2312.15471 | null |
2023-12-23 | CaLDiff: Camera Localization in NeRF via Pose Diffusion | Rashik Shrestha et.al. | 2312.15242 | null |
2023-12-20 | Aggregating Multiple Bio-Inspired Image Region Classifiers For Effective And Lightweight Visual Place Recognition | Bruno Arcanjo et.al. | 2312.12995 | null |
2023-12-19 | VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering | Chun-Mei Feng et.al. | 2312.12273 | link |
2023-12-18 | Advancing Image Retrieval with Few-Shot Learning and Relevance Feedback | Boaz Lerner et.al. | 2312.11078 | link |
2023-12-17 | PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields | Boming Zhao et.al. | 2312.10649 | null |
2023-12-17 | DistilVPR: Cross-Modal Knowledge Distillation for Visual Place Recognition | Sijie Wang et.al. | 2312.10616 | link |
2023-12-16 | Symmetrical Bidirectional Knowledge Alignment for Zero-Shot Sketch-Based Image Retrieval | Decheng Liu et.al. | 2312.10320 | link |
2023-12-15 | Data-Efficient Multimodal Fusion on a Single GPU | Noël Vouitsis et.al. | 2312.10144 | link |
2023-12-13 | Advancements in Content-Based Image Retrieval: A Comprehensive Survey of Relevance Feedback Techniques | Hamed Qazanfari et.al. | 2312.10089 | null |
2023-12-15 | Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval | Zhe Ma et.al. | 2312.09716 | link |
2023-12-14 | Design Space Exploration of Low-Bit Quantized Neural Networks for Visual Place Recognition | Oliver Grainge et.al. | 2312.09028 | null |
2023-12-14 | Training-free Zero-shot Composed Image Retrieval with Local Concept Reranking | Shitong Sun et.al. | 2312.08924 | null |
2023-12-13 | C-BEV: Contrastive Bird's Eye View Training for Cross-View Image Retrieval and 3-DoF Pose Estimation | Florian Fervers et.al. | 2312.08060 | null |
2023-12-12 | Contextually Affinitive Neighborhood Refinery for Deep Clustering | Chunlin Yu et.al. | 2312.07806 | link |
2023-12-12 | Collapse-Oriented Adversarial Training with Triplet Decoupling for Robust Image Retrieval | Qiwei Tian et.al. | 2312.07364 | link |
2023-12-12 | Attacking the Loop: Adversarial Attacks on Graph-based Loop Closure Detection | Jonathan J. Y. Kim et.al. | 2312.06991 | null |
2023-12-11 | Dynamic Weighted Combiner for Mixed-Modal Image Retrieval | Fuxiang Huang et.al. | 2312.06179 | link |
2023-12-06 | Lite-Mind: Towards Efficient and Versatile Brain Representation Network | Zixuan Gong et.al. | 2312.03781 | link |
2023-12-08 | FreestyleRet: Retrieving Images from Style-Diversified Queries | Hao Li et.al. | 2312.02428 | link |
2023-12-04 | Implicit Learning of Scene Geometry from Poses for Global Localization | Mohammad Altillawi et.al. | 2312.02029 | null |
2023-12-04 | Language-only Efficient Training of Zero-shot Composed Image Retrieval | Geonmo Gu et.al. | 2312.01998 | link |
2023-12-03 | G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training | Che Liu et.al. | 2312.01522 | link |
2023-12-01 | Improve Supervised Representation Learning with Masked Image Modeling | Kaifeng Chen et.al. | 2312.00950 | null |
2023-12-05 | Grounding Everything: Emerging Localization Properties in Vision-Language Transformers | Walid Bousselham et.al. | 2312.00878 | link |
2023-12-01 | Global Localization: Utilizing Relative Spatio-Temporal Geometric Constraints from Adjacent and Distant Cameras | Mohammad Altillawi et.al. | 2312.00500 | null |
2023-11-30 | HKUST at SemEval-2023 Task 1: Visual Word Sense Disambiguation with Context Augmentation and Visual Assistance | Zhuohao Yin et.al. | 2311.18273 | link |
2023-11-30 | Label-efficient Training of Small Task-specific Models by Leveraging Vision Foundation Models | Raviteja Vemulapalli et.al. | 2311.18237 | link |
2023-11-29 | Transformer-empowered Multi-modal Item Embedding for Enhanced Image Search in E-Commerce | Chang Liu et.al. | 2311.17954 | null |
2023-11-28 | Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames | Chao Chen et.al. | 2311.17940 | null |
2023-11-29 | 360Loc: A Dataset and Benchmark for Omnidirectional Visual Localization with Cross-device Queries | Huajian Huang et.al. | 2311.17389 | link |
2023-11-27 | Removing NSFW Concepts from Vision-and-Language Models for Text-to-Image Retrieval and Generation | Samuele Poppi et.al. | 2311.16254 | link |
2023-11-27 | Optimal Transport Aggregation for Visual Place Recognition | Sergio Izquierdo et.al. | 2311.15937 | link |
2023-11-27 | AI-Generated Images Introduce Invisible Relevance Bias to Text-Image Retrieval | Shicheng Xu et.al. | 2311.14084 | link |
2023-11-23 | 3D-MIR: A Benchmark and Empirical Study on 3D Medical Image Retrieval in Radiology | Asma Ben Abacha et.al. | 2311.13752 | link |
2023-11-22 | Medical Image Retrieval Using Pretrained Embeddings | Farnaz Khun Jush et.al. | 2311.13547 | null |
2023-11-22 | Applications of Spiking Neural Networks in Visual Place Recognition | Somayeh Hussaini et.al. | 2311.13186 | link |
2023-11-21 | Attribute-Aware Deep Hashing with Self-Consistency for Large-Scale Fine-Grained Image Retrieval | Xiu-Shen Wei et.al. | 2311.12894 | null |
2023-11-21 | Towards Accurate Loop Closure Detection in Semantic SLAM with 3D Semantic Covisibility Graphs | Zhentian Qian et.al. | 2311.12245 | null |
2023-11-19 | From Categories to Classifier: Name-Only Continual Learning by Exploring the Web | Ameya Prabhu et.al. | 2311.11293 | null |
2023-11-18 | Lesion Search with Self-supervised Learning | Kristin Qi et.al. | 2311.11014 | null |
2023-11-15 | Flow reconstruction and particle characterization from inertial Lagrangian tracks | Ke Zhou et.al. | 2311.09076 | null |
2023-11-15 | Pretrain like Your Inference: Masked Tuning Improves Zero-Shot Composed Image Retrieval | Junyang Chen et.al. | 2311.07622 | null |
2023-11-13 | VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search | Shuting He et.al. | 2311.07514 | null |
2023-11-10 | Attributes Grouping and Mining Hashing for Fine-Grained Image Retrieval | Xin Lu et.al. | 2311.06067 | null |
2023-11-08 | Energy-efficient Wireless Image Retrieval for IoT Devices by Transmitting a TinyML Model | Junya Shiraishi et.al. | 2311.04788 | null |
2023-11-08 | Training CLIP models on Data from Scientific Papers | Calvin Metzger et.al. | 2311.04711 | link |
2023-11-07 | DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding | Kehinde Ajayi et.al. | 2311.04098 | link |
2023-11-06 | Long-Term Invariant Local Features via Implicit Cross-Domain Correspondences | Zador Pataki et.al. | 2311.03345 | null |
2023-11-06 | FocusTune: Tuning Visual Localization through Focus-Guided Sampling | Son Tung Nguyen et.al. | 2311.02872 | link |
2023-11-01 | DINO-Mix: Enhancing Visual Place Recognition with Foundational Vision Model and Feature Mixing | Gaoshuang Huang et.al. | 2311.00230 | link |
2023-10-29 | Identifiable Contrastive Learning with Automatic Feature Importance Discovery | Qi Zhang et.al. | 2310.18904 | link |
2023-10-27 | LipSim: A Provably Robust Perceptual Similarity Metric | Sara Ghazanfari et.al. | 2310.18274 | link |
2023-10-27 | Split Covariance Intersection Filter Based Visual Localization With Accurate AprilTag Map For Warehouse Robot Navigation | Susu Fang et.al. | 2310.17879 | null |
2023-10-25 | FoundLoc: Vision-based Onboard Aerial Localization in the Wild | Yao He et.al. | 2310.16299 | null |
2023-10-24 | Cross-view Self-localization from Synthesized Scene-graphs | Ryogo Yamamoto et.al. | 2310.15504 | null |
2023-10-23 | Semantic-Aware Adversarial Training for Reliable Deep Hashing Retrieval | Xu Yuan et.al. | 2310.14637 | link |
2023-10-21 | Large Language Models and Multimodal Retrieval for Visual Word Sense Disambiguation | Anastasia Kritharoula et.al. | 2310.14025 | link |
2023-10-20 | FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer | Xinyu Zhang et.al. | 2310.13605 | null |
2023-10-20 | CylinderTag: An Accurate and Flexible Marker for Cylinder-Shape Objects Pose Estimation Based on Projective Invariants | Shaoan Wang et.al. | 2310.13320 | link |
2023-10-27 | Representation Learning via Consistent Assignment of Views over Random Partitions | Thalles Silva et.al. | 2310.12692 | link |
2023-10-18 | Evaluating the Fairness of Discriminative Foundation Models in Computer Vision | Junaid Ali et.al. | 2310.11867 | null |
2023-10-17 | Learning Comprehensive Representations with Richer Self for Text-to-Image Person Re-Identification | Shuanglin Yan et.al. | 2310.11210 | null |
2023-10-16 | Autonomous Mapping and Navigation using Fiducial Markers and Pan-Tilt Camera for Assisting Indoor Mobility of Blind and Visually Impaired People | Dharmateja Adapa et.al. | 2310.10290 | null |
2023-10-16 | EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge | Tom Bryan et.al. | 2310.10050 | null |
2023-10-15 | CAPro: Webly Supervised Learning with Cross-Modality Aligned Prototypes | Yulei Qin et.al. | 2310.09761 | link |
2023-10-13 | Pairwise Similarity Learning is SimPLE | Yandong Wen et.al. | 2310.09449 | link |
2023-10-13 | Vision-by-Language for Training-Free Compositional Image Retrieval | Shyamgopal Karthik et.al. | 2310.09291 | link |
2023-10-12 | Hyp-UML: Hyperbolic Image Retrieval with Uncertainty-aware Metric Learning | Shiyang Yan et.al. | 2310.08390 | null |
2023-10-12 | Jointly Optimized Global-Local Visual Localization of UAVs | Haoling Li et.al. | 2310.08082 | null |
2023-10-10 | Leveraging Neural Radiance Fields for Uncertainty-Aware Visual Localization | Le Chen et.al. | 2310.06984 | null |
2023-10-10 | Distillation Improves Visual Place Recognition for Low-Quality Queries | Anbang Yang et.al. | 2310.06906 | link |
2023-10-10 | Efficient Retrieval of Images with Irregular Patterns using Morphological Image Analysis: Applications to Industrial and Healthcare datasets | Jiajun Zhang et.al. | 2310.06566 | null |
2023-10-10 | Topological RANSAC for instance verification and retrieval without fine-tuning | Guoyuan An et.al. | 2310.06486 | null |
2023-10-10 | 3DS-SLAM: A 3D Object Detection based Semantic SLAM towards Dynamic Indoor Environments | Ghanta Sai Krishna et.al. | 2310.06385 | null |
2023-10-09 | Collaborative Visual Place Recognition | Yiming Li et.al. | 2310.05541 | null |
2023-10-09 | Sentence-level Prompts Benefit Composed Image Retrieval | Yang Bai et.al. | 2310.05473 | link |
2023-10-08 | AANet: Aggregation and Alignment Network with Semi-hard Positive Sample Mining for Hierarchical Place Recognition | Feng Lu et.al. | 2310.05184 | link |
2023-10-08 | LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization | Artem Nenashev et.al. | 2310.05134 | null |
2023-10-12 | ClusVPR: Efficient Visual Place Recognition with Clustering-based Weighted Transformer | Yifan Xu et.al. | 2310.04099 | null |
2023-10-06 | Sub-token ViT Embedding via Stochastic Resonance Transformers | Dong Lao et.al. | 2310.03967 | link |
2023-10-04 | Active Visual Localization for Multi-Agent Collaboration: A Data-Driven Approach | Matthew Hanlon et.al. | 2310.02650 | null |
2023-10-02 | NEUCORE: Neural Concept Reasoning for Composed Image Retrieval | Shu Zhao et.al. | 2310.01358 | null |
2023-10-02 | Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images | Georg Bökman et.al. | 2310.01092 | null |
2023-10-05 | PlaceNav: Topological Navigation through Place Recognition | Lauri Suomela et.al. | 2309.17260 | null |
2023-09-29 | Segment Anything Model is a Good Teacher for Local Feature Learning | Jingqian Wu et.al. | 2309.16992 | link |
2023-09-28 | Dark Side Augmentation: Generating Diverse Night Examples for Metric Learning | Albert Mohwald et.al. | 2309.16351 | link |
2023-09-28 | FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding | Pengxiang Wu et.al. | 2309.16249 | link |
2023-09-28 | Context-I2W: Mapping Images to Context-dependent Words for Accurate Zero-Shot Composed Image Retrieval | Yuanmin Tang et.al. | 2309.16137 | link |
2023-09-27 | GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization | Vicente Vivanco Cepeda et.al. | 2309.16020 | link |
2023-09-27 | Learning Dense Flow Field for Highly-accurate Cross-view Camera Localization | Zhenbo Song et.al. | 2309.15556 | null |
2023-09-26 | Object-Centric Open-Vocabulary Image-Retrieval with Aggregated Features | Hila Levi et.al. | 2309.14999 | null |
2023-09-23 | Resolving References in Visually-Grounded Dialogue via Text Generation | Bram Willemsen et.al. | 2309.13430 | link |
2023-09-21 | Face Identity-Aware Disentanglement in StyleGAN | Adrian Suwała et.al. | 2309.12033 | null |
2023-09-21 | On-the-Fly SfM: What you capture is What you get | Zongqian Zhan et.al. | 2309.11883 | link |
2023-09-20 | 2D-3D Pose Tracking with Multi-View Constraints | Huai Yu et.al. | 2309.11335 | null |
2023-09-19 | VPRTempo: A Fast Temporally Encoded Spiking Neural Network for Visual Place Recognition | Adam D. Hines et.al. | 2309.10225 | link |
2023-09-18 | DynaPix SLAM: A Pixel-Based Dynamic SLAM Approach | Chenghao Xu et.al. | 2309.09879 | null |
2023-09-18 | Decompose Semantic Shifts for Composed Image Retrieval | Xingyu Yang et.al. | 2309.09531 | null |
2023-09-16 | Efficient Object Rearrangement via Multi-view Fusion | Dehao Huang et.al. | 2309.08994 | null |
2023-09-16 | DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF | Mert Asim Karaoglu et.al. | 2309.08927 | null |
2023-09-16 | Outram: One-shot Global Localization via Triangulated Scene Graph and Global Outlier Pruning | Pengyu Yin et.al. | 2309.08914 | link |
2023-09-15 | Active Learning for Fine-Grained Sketch-Based Image Retrieval | Himanshu Thakur et.al. | 2309.08743 | null |
2023-09-15 | Optimization of Rank Losses for Image Retrieval | Elias Ramzi et.al. | 2309.08250 | link |
2023-09-18 | Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer | Yaoting Wang et.al. | 2309.07929 | link |
2023-09-14 | EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization | Minjung Kim et.al. | 2309.07471 | link |
2023-09-13 | RadarLCD: Learnable Radar-based Loop Closure Detection Pipeline | Mirko Usuelli et.al. | 2309.07094 | null |
2023-09-11 | Towards Content-based Pixel Retrieval in Revisited Oxford and Paris | Guoyuan An et.al. | 2309.05438 | link |
2023-09-08 | Representation Synthesis by Probabilistic Many-Valued Logic Operation in Self-Supervised Learning | Hiroki Nakamura et.al. | 2309.04148 | null |
2023-09-05 | Magnetic Navigation using Attitude-Invariant Magnetic Field Information for Loop Closure Detection | Natalia Pavlasek et.al. | 2309.02394 | null |
2023-09-05 | Dual Relation Alignment for Composed Image Retrieval | Xintong Jiang et.al. | 2309.02169 | null |
2023-09-04 | NLLB-CLIP -- train performant multilingual image retrieval model on a budget | Alexander Visheratin et.al. | 2309.01859 | null |
2023-09-04 | Target-Guided Composed Image Retrieval | Haokun Wen et.al. | 2309.01366 | null |
2023-09-02 | Deep supervised hashing for fast retrieval of radio image cubes | Steven Ndung'u et.al. | 2309.00932 | null |
2023-08-31 | Learning with Multi-modal Gradient Attention for Explainable Composed Image Retrieval | Prateksha Udhayanan et.al. | 2308.16649 | null |
2023-08-28 | Extending Cross-Modal Retrieval with Interactive Learning to Improve Image Retrieval Performance in Forensics | Nils Böhne et.al. | 2308.14786 | null |
2023-08-28 | CoVR: Learning Composed Video Retrieval from Web Video Captions | Lucas Ventura et.al. | 2308.14746 | link |
2023-08-27 | Deep Learning for Visual Localization and Mapping: A Survey | Changhao Chen et.al. | 2308.14039 | null |
2023-08-26 | Learning Efficient Representations for Image-Based Patent Retrieval | Hongsong Wang et.al. | 2308.13749 | null |
2023-08-25 | Enhancing Landmark Detection in Cluttered Real-World Scenarios with Vision Transformers | Mohammad Javad Rajabi et.al. | 2308.13671 | null |
2023-08-24 | Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities | Jinze Bai et.al. | 2308.12966 | link |
2023-08-23 | Progressive Feature Mining and External Knowledge-Assisted Text-Pedestrian Image Retrieval | Huafeng Li et.al. | 2308.11994 | null |
2023-08-23 | OFVL-MS: Once for Visual Localization across Multiple Indoor Scenes | Tao Xie et.al. | 2308.11928 | link |
2023-08-22 | Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features | Alberto Baldrati et.al. | 2308.11485 | link |
2023-08-22 | GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training | Xinchi Deng et.al. | 2308.11331 | null |
2023-08-22 | LDP-Feat: Image Features with Local Differential Privacy | Francesco Pittaluga et.al. | 2308.11223 | null |
2023-08-21 | EigenPlaces: Training Viewpoint Robust Models for Visual Place Recognition | Gabriele Berton et.al. | 2308.10832 | link |
2023-08-20 | FashionNTM: Multi-turn Fashion Image Retrieval via Cascaded Memory | Anwesan Pal et.al. | 2308.10170 | null |
2023-08-18 | 3D Model-free Visual localization System from Essential Matrix under Local Planar Motion | Yanmei Jiao et.al. | 2308.09566 | null |
2023-08-17 | FashionLOGO: Prompting Multimodal Large Language Models for Fashion Logo Embeddings | Yulin Su et.al. | 2308.09012 | link |
2023-08-16 | Integrating Visual and Semantic Similarity Using Hierarchies for Image Retrieval | Aishwarya Venkataramanan et.al. | 2308.08431 | link |
2023-08-16 | Ranking-aware Uncertainty for Text-guided Image Retrieval | Junyang Chen et.al. | 2308.08131 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-11-23 | OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs | Chen Xin et.al. | 2411.15653 | link |
2024-11-19 | IoT-Based 3D Pose Estimation and Motion Optimization for Athletes: Application of C3D and OpenPose | Fei Ren et.al. | 2411.12676 | null |
2024-11-04 | Silver medal Solution for Image Matching Challenge 2024 | Yian Wang et.al. | 2411.01851 | null |
2024-11-04 | KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension | Jie Yang et.al. | 2411.01846 | null |
2024-10-31 | From Web Data to Real Fields: Low-Cost Unsupervised Domain Adaptation for Agricultural Robots | Vasileios Tzouras et.al. | 2410.23906 | null |
2024-10-04 | Self-Supervised Keypoint Detection with Distilled Depth Keypoint Representation | Aman Anand et.al. | 2410.14700 | null |
2024-11-27 | Sim2real Cattle Joint Estimation in 3D point clouds | Mohammad Okour et.al. | 2410.14419 | null |
2024-10-16 | PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network | Asish Bera et.al. | 2410.12742 | null |
2024-10-16 | RAFA-Net: Region Attention Network For Food Items And Agricultural Stress Recognition | Asish Bera et.al. | 2410.12718 | null |
2024-10-01 | A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference | Yuan Li et.al. | 2410.11848 | null |
2024-10-11 | Facial Chick Sexing: An Automated Chick Sexing System From Chick Facial Image | Marta Veganzones Rodriguez et.al. | 2410.09155 | null |
2024-10-08 | Unsupervised Model Diagnosis | Yinong Oliver Wang et.al. | 2410.06243 | null |
2024-10-08 | Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration | Xueyang Kang et.al. | 2410.05729 | link |
2024-10-16 | Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features | Chengkai Hou et.al. | 2410.02237 | null |
2024-10-02 | Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection | Hongru Yan et.al. | 2410.01404 | null |
2024-09-30 | OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection | Changsheng Lu et.al. | 2409.19899 | link |
2024-10-07 | SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation | Xin Li et.al. | 2409.18082 | null |
2024-09-24 | GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization | Gennady Sidorov et.al. | 2409.16502 | link |
2024-09-20 | Keypoint Detection Technique for Image-Based Visual Servoing of Manipulators | Niloufar Amiri et.al. | 2409.13668 | null |
2024-09-25 | Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding | Rania Hossam et.al. | 2409.08695 | link |
2024-09-06 | D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection | Kentaro Hirahara et.al. | 2409.04060 | null |
2024-10-01 | Towards Practical Human Motion Prediction with LiDAR Point Clouds | Xiao Han et.al. | 2408.08202 | null |
2024-07-31 | Certifying Robustness of Learning-Based Keypoint Detection and Pose Estimation Methods | Xusheng Luo et.al. | 2408.00117 | null |
2024-07-26 | SHIC: Shape-Image Correspondences with no Keypoint Supervision | Aleksandar Shtedritski et.al. | 2407.18907 | null |
2024-07-25 | LION: Linear Group RNN for 3D Object Detection in Point Clouds | Zhe Liu et.al. | 2407.18232 | link |
2024-07-22 | RADA: Robust and Accurate Feature Learning with Domain Adaptation | Jingtai He et.al. | 2407.15791 | null |
2024-07-09 | LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition | Teng Wang et.al. | 2407.06730 | null |
2024-07-04 | PFGS: High Fidelity Point Cloud Rendering via Feature Splatting | Jiaxu Wang et.al. | 2407.03857 | link |
2024-07-03 | A Radiometric Correction based Optical Modeling Approach to Removing Reflection Noise in TLS Point Clouds of Urban Scenes | Li Fang et.al. | 2407.02830 | link |
2024-07-02 | Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning | Chengchao Shen et.al. | 2407.02014 | link |
2024-06-28 | Beyond First-Order: A Multi-Scale Approach to Finger Knuckle Print Biometrics | Chengrui Gao et.al. | 2406.19672 | null |
2024-07-23 | A Certifiable Algorithm for Simultaneous Shape Estimation and Object Tracking | Lorenzo Shaikewitz et.al. | 2406.16837 | link |
2024-06-03 | Scale-Free Image Keypoints Using Differentiable Persistent Homology | Giovanni Barbarani et.al. | 2406.01315 | link |
2024-06-23 | W-Net: A Facial Feature-Guided Face Super-Resolution Network | Hao Liu et.al. | 2406.00676 | null |
2024-05-25 | Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration | Junjie Gao et.al. | 2405.16085 | null |
2024-06-01 | Benchmarking Fish Dataset and Evaluation Metric in Keypoint Detection -- Towards Precise Fish Morphological Assessment in Aquaculture Breeding | Weizhen Liu et.al. | 2405.12476 | link |
2024-05-14 | TP3M: Transformer-based Pseudo 3D Image Matching with Reference | Liming Han et.al. | 2405.08434 | null |
2024-05-15 | Vector-Symbolic Architecture for Event-Based Optical Flow | Hongzhi You et.al. | 2405.08300 | null |
2024-05-13 | RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration | Congjia Chen et.al. | 2405.07594 | null |
2024-05-08 | Unsupervised Skin Feature Tracking with Deep Neural Networks | Jose Chang et.al. | 2405.04943 | null |
2024-05-07 | A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images | László Kopácsi et.al. | 2405.04650 | null |
2024-04-30 | A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images | Wang Zhang et.al. | 2404.19311 | null |
2024-04-25 | Adaptive Local Binary Pattern: A Novel Feature Descriptor for Enhanced Analysis of Kidney Abnormalities in CT Scan Images using ensemble based Machine Learning Approach | Tahmim Hossain et.al. | 2404.14560 | null |
2024-04-19 | SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers | Vandad Davoodnia et.al. | 2404.12625 | null |
2024-04-17 | Pixel-Wise Symbol Spotting via Progressive Points Location for Parsing CAD Images | Junbiao Pang et.al. | 2404.10985 | null |
2024-03-28 | Towards Long Term SLAM on Thermal Imagery | Colin Keil et.al. | 2403.19885 | link |
2024-03-28 | Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation | Xiao Lin et.al. | 2403.19527 | link |
2024-03-27 | RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation | Yang Tian et.al. | 2403.18259 | null |
2024-03-18 | FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events | Xiangyuan Wang et.al. | 2403.11662 | link |
2024-03-05 | Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion | Meng Zheng et.al. | 2403.03217 | null |
2024-02-22 | A Self-supervised Pressure Map human keypoint Detection Approch: Optimizing Generalization and Computational Efficiency Across Datasets | Chengzhang Yu et.al. | 2402.14241 | null |
2024-02-25 | A Feature Matching Method Based on Multi-Level Refinement Strategy | Shaojie Zhang et.al. | 2402.13488 | null |
2024-03-05 | 3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data | Zhi-Yi Lin et.al. | 2402.13172 | null |
2024-02-25 | Region Feature Descriptor Adapted to High Affine Transformations | Shaojie Zhang et.al. | 2402.09724 | null |
2024-01-29 | Reconstructing Close Human Interactions from Multiple Views | Qing Shuai et.al. | 2401.16173 | link |
2024-01-17 | To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection | Luyi Han et.al. | 2401.09336 | link |
2024-01-08 | Flowmind2Digital: The First Comprehensive Flowmind Recognition and Conversion Approach | Huanyu Liu et.al. | 2401.03742 | link |
2024-03-22 | 6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation | Li Xu et.al. | 2401.00029 | null |
2023-12-27 | Bezier-based Regression Feature Descriptor for Deformable Linear Objects | Fangqing Chen et.al. | 2312.16502 | null |
2023-12-24 | Residual Learning for Image Point Descriptors | Rashik Shrestha et.al. | 2312.15471 | null |
2023-12-22 | BonnBeetClouds3D: A Dataset Towards Point Cloud-based Organ-level Phenotyping of Sugar Beet Plants under Field Conditions | Elias Marks et.al. | 2312.14706 | null |
2023-12-19 | Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation | Jiaming Liu et.al. | 2312.12480 | null |
2023-12-19 | An effective image copy-move forgery detection using entropy image | Zhaowei Lu et.al. | 2312.11793 | link |
2023-12-11 | VoxelKP: A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data | Jian Shi et.al. | 2312.08871 | link |
2023-12-11 | Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach | Travis Driver et.al. | 2312.06865 | link |
2023-12-01 | Tracking Object Positions in Reinforcement Learning: A Metric for Keypoint Detection (extended version) | Emma Cramer et.al. | 2312.00592 | link |
2023-11-30 | Utilizing Radiomic Feature Analysis For Automated MRI Keypoint Detection: Enhancing Graph Applications | Sahar Almahfouz Nasser et.al. | 2311.18281 | null |
2023-11-29 | Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features | Thomas Wimmer et.al. | 2311.18113 | link |
2023-11-28 | Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features | Niladri Shekhar Dutt et.al. | 2311.17024 | link |
2023-11-28 | Riemannian Self-Attention Mechanism for SPD Networks | Rui Wang et.al. | 2311.16738 | null |
2023-11-27 | A manometric feature descriptor with linear-SVM to distinguish esophageal contraction vigor | Jialin Liu et.al. | 2311.15609 | null |
2023-11-21 | Instance-aware 3D Semantic Segmentation powered by Shape Generators and Classifiers | Bo Sun et.al. | 2311.12291 | null |
2023-11-20 | CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement | Boni Hu et.al. | 2311.11604 | link |
2023-11-17 | Video-based Sequential Bayesian Homography Estimation for Soccer Field Registration | Paul J. Claasen et.al. | 2311.10361 | link |
2023-11-13 | Processing and Segmentation of Human Teeth from 2D Images using Weakly Supervised Learning | Tomáš Kunzo et.al. | 2311.07398 | null |
2023-11-11 | CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer | Haoyu Ma et.al. | 2311.06443 | link |
2023-11-08 | 3D Pose Estimation of Tomato Peduncle Nodes using Deep Keypoint Detection and Point Cloud | Jianchao Ci et.al. | 2311.04699 | null |
2023-11-06 | TAMPAR: Visual Tampering Detection for Parcel Logistics in Postal Supply Chains | Alexander Naumann et.al. | 2311.03124 | link |
2023-11-06 | An invariant feature extraction for multi-modal images matching | Chenzhong Gao et.al. | 2311.02842 | null |
2023-10-20 | Feature Selection and Hyperparameter Fine-tuning in Artificial Neural Networks for Wood Quality Classification | Mateus Roder et.al. | 2310.13490 | null |
2023-10-12 | UniPose: Detecting Any Keypoints | Jie Yang et.al. | 2310.08530 | link |
2023-10-10 | l-dyno: framework to learn consistent visual features using robot's motion | Kartikeya Singh et.al. | 2310.06249 | link |
2023-10-10 | Language-driven Open-Vocabulary Keypoint Detection for Animal Body and Face | Hao Zhang et.al. | 2310.05056 | link |
2023-10-13 | H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation | Yanjie Ze et.al. | 2310.01404 | link |
2023-10-04 | Self-supervised Learning of Contextualized Local Visual Embeddings | Thalles Santos Silva et.al. | 2310.00527 | link |
2023-10-22 | ObVi-SLAM: Long-Term Object-Visual SLAM | Amanda Adkins et.al. | 2309.15268 | link |
2023-09-19 | LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation | Haizhou Zhang et.al. | 2309.10436 | link |
2023-09-18 | RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint Detection and Invariant Description for Endoscopy | Mert Asim Karaoglu et.al. | 2309.09563 | null |
2023-09-17 | CryoAlign: feature-based method for global and local 3D alignment of EM density maps | Bintao He et.al. | 2309.09217 | null |
2023-09-14 | EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization | Minjung Kim et.al. | 2309.07471 | link |
2023-09-09 | Mirror-Aware Neural Humans | Daniel Ajisafe et.al. | 2309.04750 | link |
2023-09-07 | InstructDiffusion: A Generalist Modeling Interface for Vision Tasks | Zigang Geng et.al. | 2309.03895 | null |
2023-09-04 | SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras | Himanshu Pahadia et.al. | 2309.01324 | null |
2023-09-12 | Improving the matching of deformable objects by learning to detect keypoints | Felipe Cadar et.al. | 2309.00434 | link |
2023-08-31 | SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation | Jiaben Chen et.al. | 2308.16876 | null |
2023-08-30 | Learning Structure-from-Motion with Graph Attention Networks | Lucas Brynte et.al. | 2308.15984 | link |
2023-08-29 | A lightweight 3D dense facial landmark estimation model from position map data | Shubhajit Basak et.al. | 2308.15170 | link |
2023-08-27 | Automatic coarse co-registration of point clouds from diverse scan geometries: a test of detectors and descriptors | Francesco Pirotti et.al. | 2308.14047 | null |
2023-08-24 | VNI-Net: Vector Neurons-based Rotation-Invariant Descriptor for LiDAR Place Recognition | Gengxuan Tian et.al. | 2308.12870 | null |
2023-08-22 | LDP-Feat: Image Features with Local Differential Privacy | Francesco Pittaluga et.al. | 2308.11223 | null |
2023-08-20 | Neural Interactive Keypoint Detection | Jie Yang et.al. | 2308.10174 | link |
2023-08-19 | ClothesNet: An Information-Rich 3D Garment Model Repository with Simulated Clothes Environment | Bingyang Zhou et.al. | 2308.09987 | null |
2023-09-03 | DeDoDe: Detect, Don't Describe -- Describe, Don't Detect for Local Feature Matching | Johan Edstedt et.al. | 2308.08479 | link |
2023-08-15 | CoDeF: Content Deformation Fields for Temporally Consistent Video Processing | Hao Ouyang et.al. | 2308.07926 | link |
2023-08-15 | ChartDETR: A Multi-shape Detection Network for Visual Chart Recognition | Wenyuan Xue et.al. | 2308.07743 | null |
2023-08-14 | DELO: Deep Evidential LiDAR Odometry using Partial Optimal Transport | Sk Aziz Ali et.al. | 2308.07153 | null |
2023-08-14 | 2D3D-MATR: 2D-3D Matching Transformer for Detection-free Registration between Images and Point Clouds | Minhao Li et.al. | 2308.05667 | link |
2023-08-02 | Automated Hit-frame Detection for Badminton Match Analysis | Yu-Hang Chien et.al. | 2307.16000 | link |
2023-07-25 | Mini-PointNetPlus: a local feature descriptor in deep learning model for 3d environment perception | Chuanyu Luo et.al. | 2307.13300 | null |
2023-07-21 | Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data | Sahar Almahfouz Nasser et.al. | 2307.10698 | link |
2023-07-19 | SAMConvex: Fast Discrete Optimization for CT Registration using Self-supervised Anatomical Embedding and Correlation Pyramid | Zi Li et.al. | 2307.09727 | link |
2023-07-01 | SyMFM6D: Symmetry-aware Multi-directional Fusion for Multi-View 6D Object Pose Estimation | Fabian Duffhauss et.al. | 2307.00306 | link |
2023-06-27 | Detector-Free Structure from Motion | Xingyi He et.al. | 2306.15669 | link |
2023-06-26 | CLERA: A Unified Model for Joint Cognitive Load and Eye Region Analysis in the Wild | Li Ding et.al. | 2306.15073 | null |
2023-06-28 | Topology Repairing of Disconnected Pulmonary Airways and Vessels: Baselines and a Dataset | Ziqiao Weng et.al. | 2306.07089 | link |
2023-06-07 | Learning Probabilistic Coordinate Fields for Robust Correspondences | Weiyue Zhao et.al. | 2306.04231 | null |
2023-06-03 | LDEB -- Label Digitization with Emotion Binarization and Machine Learning for Emotion Recognition in Conversational Dialogues | Amitabha Dey et.al. | 2306.02193 | null |
2023-06-02 | Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images | Marcela Mera-Trujillo et.al. | 2306.01938 | null |
2023-06-01 | A Probabilistic Relaxation of the Two-Stage Object Pose Estimation Paradigm | Onur Beker et.al. | 2306.00892 | null |
2023-05-30 | Align, Perturb and Decouple: Toward Better Leverage of Difference Information for RSI Change Detection | Supeng Wang et.al. | 2305.18714 | link |
2023-05-23 | Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence | Grace Luo et.al. | 2305.14334 | null |
2023-05-15 | Non-Separable Multi-Dimensional Network Flows for Visual Computing | Viktoria Ehm et.al. | 2305.08628 | null |
2023-05-13 | Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance | Xinyu Lin et.al. | 2305.07943 | link |
2023-05-05 | HD2Reg: Hierarchical Descriptors and Detectors for Point Cloud Registration | Canhui Tang et.al. | 2305.03487 | link |
2023-04-17 | Human Pose Estimation in Monocular Omnidirectional Top-View Images | Jingrui Yu et.al. | 2304.08186 | null |
2023-04-14 | CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression | Mubariz Zaffar et.al. | 2304.07426 | null |
2023-04-12 | SiLK -- Simple Learned Keypoints | Pierre Gleize et.al. | 2304.06194 | link |
2023-04-06 | From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot Keypoint Detection | Changsheng Lu et.al. | 2304.03140 | null |
2023-03-29 | NerVE: Neural Volumetric Edges for Parametric Curve Extraction from Point Cloud | Xiangyu Zhu et.al. | 2303.16465 | null |
2023-03-24 | PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View | Ze Shi et.al. | 2303.14095 | link |
2023-03-23 | Semantic Image Attack for Visual Model Diagnosis | Jinqi Luo et.al. | 2303.13010 | null |
2023-03-22 | Object Pose Estimation with Statistical Guarantees: Conformal Keypoint Detection and Geometric Uncertainty Propagation | Heng Yang et.al. | 2303.12246 | link |
2023-03-21 | RN-Net: Reservoir Nodes-Enabled Neuromorphic Vision Sensing Network | Sangmin Yoo et.al. | 2303.10770 | null |
2023-03-17 | ShaRPy: Shape Reconstruction and Hand Pose Estimation from RGB-D with Uncertainty | Vanessa Wirth et.al. | 2303.10042 | null |
2023-03-15 | Descriptor Distillation for Efficient Multi-Robot SLAM | Xiyue Guo et.al. | 2303.08420 | null |
2023-03-15 | From Local Binary Patterns to Pixel Difference Networks for Efficient Visual Representation Learning | Zhuo Su et.al. | 2303.08414 | null |
2023-03-16 | KGNv2: Separating Scale and Pose Prediction for Keypoint-based 6-DoF Grasp Synthesis on RGB-D input | Yiye Chen et.al. | 2303.05617 | link |
2023-03-07 | External Camera-based Mobile Robot Pose Estimation for Collaborative Perception with Smart Edge Sensors | Simon Bultmann et.al. | 2303.03797 | null |
2023-02-26 | PaRK-Detect: Towards Efficient Multi-Task Satellite Imagery Road Extraction via Patch-Wise Keypoints Detection | Shenwei Xie et.al. | 2302.13263 | null |
2023-02-24 | Hybrid machine-learned homogenization: Bayesian data mining and convolutional neural networks | Julian Lißner et.al. | 2302.12545 | null |
2023-02-21 | Deep Reinforcement Learning Based on Local GNN for Goal-conditioned Deformable Object Rearranging | Yuhong Deng et.al. | 2302.10446 | null |
2023-02-12 | A Correct-and-Certify Approach to Self-Supervise Object Pose Estimators via Ensemble Self-Training | Jingnan Shi et.al. | 2302.06019 | null |
2023-02-11 | Rethinking Vision Transformer and Masked Autoencoder in Multimodal Face Anti-Spoofing | Zitong Yu et.al. | 2302.05744 | null |
2023-02-09 | MAPS: A Noise-Robust Progressive Learning Approach for Source-Free Domain Adaptive Keypoint Detection | Yuhe Ding et.al. | 2302.04589 | link |
2023-02-03 | Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation | Jie Yang et.al. | 2302.01593 | link |
2023-02-03 | Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization | Yingying Zhu et.al. | 2302.01572 | link |
2023-01-21 | Vision Aided Environment Semantics Extraction and Its Application in mmWave Beam Selection | Feiyang Wen et.al. | 2301.08973 | null |
2023-01-18 | OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models | Xingyi He et.al. | 2301.07673 | null |
2023-01-12 | Towards High Performance One-Stage Human Pose Estimation | Ling Li et.al. | 2301.04842 | null |
2022-12-31 | Rethinking Rotation Invariance with Point Cloud Registration | Jianhui Yu et.al. | 2301.00149 | null |
2023-02-06 | Fruit Ripeness Classification: a Survey | Matteo Rizzo et.al. | 2212.14441 | null |
2022-12-28 | NeMo: 3D Neural Motion Fields from Multiple Video Instances of the Same Action | Kuan-Chieh Wang et.al. | 2212.13660 | link |
2022-12-24 | HandsOff: Labeled Dataset Generation With No Additional Human Annotations | Austin Xu et.al. | 2212.12645 | null |
2022-12-13 | Learning to Detect Good Keypoints to Match Non-Rigid Objects in RGB Images | Welerson Melo et.al. | 2212.09589 | link |
2022-12-15 | Learning Markerless Robot-Depth Camera Calibration and End-Effector Pose Estimation | Bugra C. Sefercik et.al. | 2212.07567 | null |
2023-02-01 | DDM-NET: End-to-end learning of keypoint feature Detection, Description and Matching for 3D localization | Xiangyu Xu et.al. | 2212.04575 | null |
2022-12-07 | ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation | Yufei Xu et.al. | 2212.04246 | link |
2022-12-15 | Designing Feature Vector Representations: A case study from Chemistry | Signe Sidwall Thygesen et.al. | 2212.03731 | null |
2022-12-09 | DiffuPose: Monocular 3D Human Pose Estimation via Denoising Diffusion Probabilistic Model | Jeongjun Choi et.al. | 2212.02796 | link |
2022-12-05 | Images Speak in Images: A Generalist Painter for In-Context Visual Learning | Xinlong Wang et.al. | 2212.02499 | link |
2022-12-06 | R2FD2: Fast and Robust Matching of Multimodal Remote Sensing Image via Repeatable Feature Detector and Rotation-invariant Feature Descriptor | Bai Zhu et.al. | 2212.02277 | null |
2022-11-28 | FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network | Xinjiang Wang et.al. | 2211.15069 | link |
2022-11-29 | BALF: Simple and Efficient Blur Aware Local Feature Detector | Zhenjun Zhao et.al. | 2211.14731 | null |
2022-11-21 | Conjugate Product Graphs for Globally Optimal 2D-3D Shape Matching | Paul Roetzer et.al. | 2211.11589 | link |
2022-11-07 | Learning Feature Descriptors for Pre- and Intra-operative Point Cloud Matching for Laparoscopic Liver Registration | Zixin Yang et.al. | 2211.03688 | null |
2022-10-31 | Tree Detection and Diameter Estimation Based on Deep Learning | Vincent Grondin et.al. | 2210.17424 | link |
2022-10-26 | Learning a Task-specific Descriptor for Robust Matching of 3D Point Clouds | Zhiyuan Zhang et.al. | 2210.14899 | null |
2022-10-23 | Few-Shot Meta Learning for Recognizing Facial Phenotypes of Genetic Disorders | Ömer Sümer et.al. | 2210.12705 | null |
2022-10-21 | Real-time Detection of 2D Tool Landmarks with Synthetic Training Data | Bram Vanherle et.al. | 2210.11991 | null |
2022-10-09 | Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning | Ali Safa et.al. | 2210.04236 | null |
2022-10-04 | Centroid Distance Keypoint Detector for Colored Point Clouds | Hanzhe Teng et.al. | 2210.01298 | link |
2022-09-28 | Category-Level Global Camera Pose Estimation with Multi-Hypothesis Point Cloud Correspondences | Jun-Jee Chao et.al. | 2209.14419 | null |
2022-09-28 | USEEK: Unsupervised SE(3)-Equivariant 3D Keypoints for Generalizable Manipulation | Zhengrong Xue et.al. | 2209.13864 | null |
2022-10-16 | Suture Thread Spline Reconstruction from Endoscopic Images for Robotic Surgery with Reliability-driven Keypoint Detection | Neelay Joglekar et.al. | 2209.13657 | link |
2022-09-27 | Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors | Hao Dong et.al. | 2209.13586 | link |
2022-09-26 | Performance Evaluation of 3D Keypoint Detectors and Descriptors on Coloured Point Clouds in Subsea Environments | Kyungmin Jung et.al. | 2209.12881 | null |
2022-10-07 | Long-Lived Accurate Keypoints in Event Streams | Philippe Chiberre et.al. | 2209.10385 | null |
2022-09-20 | Integrative Feature and Cost Aggregation with Transformers for Dense Correspondence | Sunghwan Hong et.al. | 2209.08742 | null |
2022-09-15 | Online Marker-free Extrinsic Camera Calibration using Person Keypoint Detections | Bastian Pätzold et.al. | 2209.07393 | link |
2022-09-07 | Deep Learning-Based Automatic Diagnosis System for Developmental Dysplasia of the Hip | Yang Li et.al. | 2209.03440 | null |
2022-08-27 | Learning to SLAM on the Fly in Unknown Environments: A Continual Learning Approach for Drones in Visually Ambiguous Scenes | Ali Safa et.al. | 2208.12997 | null |
2022-08-24 | Self-Supervised Endoscopic Image Key-Points Matching | Manel Farhat et.al. | 2208.11424 | link |
2022-08-19 | Blind-Spot Collision Detection System for Commercial Vehicles Using Multi Deep CNN Architecture | Muhammad Muzammel et.al. | 2208.08224 | null |
2022-08-08 | MetaGraspNet: A Large-Scale Benchmark Dataset for Scene-Aware Ambidextrous Bin Picking via Physics-based Metaverse Synthesis | Maximilian Gilles et.al. | 2208.03963 | null |
2022-08-07 | CVLNet: Cross-View Semantic Correspondence Learning for Video-based Camera Localization | Yujiao Shi et.al. | 2208.03660 | null |
2022-07-29 | Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation | Qihao Liu et.al. | 2208.00090 | null |
2022-07-25 | Translating a Visual LEGO Manual to a Machine-Executable Plan | Ruocheng Wang et.al. | 2207.12572 | null |
2022-07-21 | Multi-modal Retinal Image Registration Using a Keypoint-Based Vessel Structure Aligning Network | Aline Sindel et.al. | 2207.10506 | null |
2022-07-15 | Human keypoint detection for close proximity human-robot interaction | Jan Docekal et.al. | 2207.07742 | null |
2022-07-15 | Adversarial Focal Loss: Asking Your Discriminator for Hard Examples | Chen Liu et.al. | 2207.07739 | null |
2022-07-13 | Rapid Person Re-Identification via Sub-space Consistency Regularization | Qingze Yin et.al. | 2207.05933 | null |
2022-07-07 | RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments | Qihao Peng et.al. | 2207.03539 | null |
2022-08-15 | Semi-supervised Human Pose Estimation in Art-historical Images | Matthias Springstein et.al. | 2207.02976 | link |
2022-07-01 | Weakly-supervised High-fidelity Ultrasound Video Synthesis with Feature Decoupling | Jiamin Liang et.al. | 2207.00474 | null |
2022-06-24 | Motion Estimation for Large Displacements and Deformations | Qiao Chen et.al. | 2206.12464 | null |
2022-06-24 | Deep embedded clustering algorithm for clustering PACS repositories | Teo Manojlović et.al. | 2206.12417 | null |
2022-06-21 | KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences | Xuanhan Wang et.al. | 2206.10090 | link |
2022-06-20 | Self-Supervised Consistent Quantization for Fully Unsupervised Image Retrieval | Guile Wu et.al. | 2206.09806 | null |
2022-06-15 | A Unified Sequence Interface for Vision Tasks | Ting Chen et.al. | 2206.07669 | link |
2022-06-09 | Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields | Mingtong Zhang et.al. | 2206.04669 | null |
2022-06-03 | SNAKE: Shape-aware Neural 3D Keypoint Field | Chengliang Zhong et.al. | 2206.01724 | link |
2022-05-17 | MulT: An End-to-End Multitask Learning Transformer | Deblina Bhattacharjee et.al. | 2205.08303 | null |
2022-05-10 | ConfLab: A Rich Multimodal Multisensor Dataset of Free-Standing Social Interactions In-the-Wild | Chirag Raman et.al. | 2205.05177 | link |
2022-04-28 | Polarimetric imaging for the detection of synthetic models of SARS-CoV-2: a proof of concept | Emilio Gomez-Gonzalez et.al. | 2204.14050 | null |
2022-05-02 | GRIT: General Robust Image Task Benchmark | Tanmay Gupta et.al. | 2204.13653 | link |
2022-05-24 | ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation | Yufei Xu et.al. | 2204.12484 | link |
2022-04-26 | Unified GCNs: Towards Connecting GCNs with CNNs | Ziyan Zhang et.al. | 2204.12300 | null |
2022-04-19 | Self-Supervised Equivariant Learning for Oriented Keypoint Detection | Jongmin Lee et.al. | 2204.08613 | link |
2022-04-17 | The Z-axis, X-axis, Weight and Disambiguation Methods for Constructing Local Reference Frame in 3D Registration: An Evaluation | Bao Zhao et.al. | 2204.08024 | null |
2022-04-15 | 2D Human Pose Estimation: A Survey | Haoming Chen et.al. | 2204.07370 | null |
2022-04-11 | Towards Homogeneous Modality Learning and Multi-Granularity Information Exploration for Visible-Infrared Person Re-Identification | Haojie Liu et.al. | 2204.04842 | null |
2022-04-07 | Cloning Outfits from Real-World Images to 3D Characters for Generalizable Person Re-Identification | Yanan Wang et.al. | 2204.02611 | link |
2022-04-02 | SkeleVision: Towards Adversarial Resiliency of Person Tracking with Multi-Task Learning | Nilaksh Das et.al. | 2204.00734 | link |
2022-04-01 | MS-HLMO: Multi-scale Histogram of Local Main Orientation for Remote Sensing Image Registration | Chenzhong Gao et.al. | 2204.00260 | null |
2022-03-29 | Assessing Evolutionary Terrain Generation Methods for Curriculum Reinforcement Learning | David Howard et.al. | 2203.15172 | null |
2022-03-28 | REGTR: End-to-end Point Cloud Correspondences with Transformers | Zi Jian Yew et.al. | 2203.14517 | link |
2022-03-27 | UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection | Ye Liu et.al. | 2203.12745 | link |
2022-03-21 | MatchFormer: Interleaving Attention in Transformers for Feature Matching | Qing Wang et.al. | 2203.09645 | link |
2022-03-16 | PosePipe: Open-Source Human Pose Estimation Pipeline for Clinical Research | R. James Cotton et.al. | 2203.08792 | link |
2022-03-11 | DRTAM: Dual Rank-1 Tensor Attention Module | Hanxing Chi et.al. | 2203.05893 | null |
2022-03-07 | Weakly Supervised Learning of Keypoints for 6D Object Pose Estimation | Meng Tian et.al. | 2203.03498 | null |
2022-02-10 | Motion-Aware Transformer For Occluded Person Re-identification | Mi Zhou et.al. | 2202.04243 | null |
2022-02-03 | Sim2Real Object-Centric Keypoint Detection and Description | Chengliang Zhong et.al. | 2202.00448 | null |
2022-01-16 | Cross-Centroid Ripple Pattern for Facial Expression Recognition | Monu Verma et.al. | 2201.05958 | null |
2022-01-14 | Reproducing BowNet: Learning Representations by Predicting Bags of Visual Words | Harry Nguyen et.al. | 2201.03556 | link |
2022-01-10 | TFS Recognition: Investigating MPH]{Thai Finger Spelling Recognition: Investigating MediaPipe Hands Potentials | Jinnavat Sanalohit et.al. | 2201.03170 | null |
2022-01-06 | A Keypoint Detection and Description Network Based on the Vessel Structure for Multi-Modal Retinal Image Registration | Aline Sindel et.al. | 2201.02242 | null |
2021-12-28 | Skin feature point tracking using deep feature encodings | Jose Ramon Chang et.al. | 2112.14159 | null |
2021-12-23 | Data-efficient learning for 3D mirror symmetry detection | Yancong Lin et.al. | 2112.12579 | null |
2021-12-22 | Improved 2D Keypoint Detection in Out-of-Balance and Fall Situations -- combining input rotations and a kinematic model | Michael Zwölfer et.al. | 2112.12193 | null |
2021-12-22 | Looking Beyond Corners: Contrastive Learning of Visual Representations for Keypoint Detection and Description Extraction | Henrique Siqueira et.al. | 2112.12002 | link |
2021-12-19 | Parallel Multi-Scale Networks with Deep Supervision for Hand Keypoint Detection | Renjie Li et.al. | 2112.10275 | null |
2021-12-19 | GPU optimization of the 3D Scale-invariant Feature Transform Algorithm and a Novel BRIEF-inspired 3D Fast Descriptor | Jean-Baptiste Carluer et.al. | 2112.10258 | link |
2021-12-16 | Masked Feature Prediction for Self-Supervised Visual Pre-Training | Chen Wei et.al. | 2112.09133 | link |
2021-12-13 | DenseGAP: Graph-Structured Dense Correspondence Learning with Anchor Points | Zhengfei Kuang et.al. | 2112.06910 | null |
2021-12-12 | Few-shot Keypoint Detection with Uncertainty Learning for Unseen Species | Changsheng Lu et.al. | 2112.06183 | link |
2021-12-13 | Few-Shot Keypoint Detection as Task Adaptation via Latent Embeddings | Mel Vecerik et.al. | 2112.04910 | null |
2021-12-06 | ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction | Xiaoming Zhao et.al. | 2112.02906 | link |
2021-11-25 | Attend to Who You Are: Supervising Self-Attention for Keypoint Detection and Instance-Aware Association | Sen Yang et.al. | 2111.12892 | link |
2021-11-08 | Template NeRF: Towards Modeling Dense Shape Correspondences from Category-Specific Object Images | Jianfei Guo et.al. | 2111.04237 | null |
2021-11-04 | Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image | Feng Liu et.al. | 2111.03098 | null |
2021-11-01 | Learning Event-based Spatio-Temporal Feature Descriptors via Local Synaptic Plasticity: A Biologically-realistic Perspective of Computer Vision | Ali Safa et.al. | 2111.00791 | null |
2021-10-30 | Geometry-Aware Hierarchical Bayesian Learning on Manifolds | Yonghui Fan et.al. | 2111.00184 | null |
2021-10-26 | CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration | Hao Yu et.al. | 2110.14076 | link |
2021-10-23 | HWTool: Fully Automatic Mapping of an Extensible C++ Image Processing Language to Hardware | James Hegarty et.al. | 2110.12106 | null |
2021-10-18 | Keypoint-Based Bimanual Shaping of Deformable Linear Objects under Environmental Constraints using Hierarchical Action Planning | Shengzeng Huo et.al. | 2110.08962 | null |
2021-10-11 | High-order Tensor Pooling with Attention for Action Recognition | Piotr Koniusz et.al. | 2110.05216 | null |
2021-10-10 | Digging Into Self-Supervised Learning of Feature Descriptors | Iaroslav Melekhov et.al. | 2110.04773 | null |
2021-10-04 | BPFNet: A Unified Framework for Bimodal Palmprint Alignment and Fusion | Zhaoqun Li et.al. | 2110.01179 | link |
2021-10-01 | Machine learning aided noise filtration and signal classification for CREDO experiment | Łukasz Bibrzycki et.al. | 2110.00297 | null |
2021-09-28 | PDC-Net+: Enhanced Probabilistic Dense Correspondence Network | Prune Truong et.al. | 2109.13912 | link |
2021-09-27 | HarrisZ |
Fabio Bellavia et.al. | 2109.12925 | null |
2021-09-24 | Catadioptric Stereo on a Smartphone | Kristijan Bartol et.al. | 2109.11872 | null |
2021-09-20 | Semi-supervised Dense Keypointsusing Unlabeled Multiview Images | Zhixuan Yu et.al. | 2109.09299 | null |
2021-08-31 | A Novel Dataset for Keypoint Detection of quadruped Animals from Images | Prianka Banik et.al. | 2108.13958 | link |
2021-08-27 | A Matching Algorithm based on Image Attribute Transfer and Local Features for Underwater Acoustic and Optical Images | Xiaoteng Zhou et.al. | 2108.12151 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-11-20 | DT-LSD: Deformable Transformer-based Line Segment Detection | Sebastian Janampa et.al. | 2411.13005 | link |
2024-11-15 | Image Matching Filtering and Refinement by Planes and Beyond | Fabio Bellavia et.al. | 2411.09484 | link |
2024-11-11 | XPoint: A Self-Supervised Visual-State-Space based Architecture for Multispectral Image Registration | Ismail Can Yagmur et.al. | 2411.07430 | link |
2024-11-07 | The Impact of Semi-Supervised Learning on Line Segment Detection | Johanna Engman et.al. | 2411.04596 | link |
2024-11-04 | Silver medal Solution for Image Matching Challenge 2024 | Yian Wang et.al. | 2411.01851 | null |
2024-10-30 | Variable Resolution Sampling and Deep Learning Image Recovery for Accelerated Multi-Spectral MRI Near Metal Implants | Azadeh Sharafi et.al. | 2410.23329 | null |
2024-11-05 | RelationBooth: Towards Relation-Aware Customized Object Generation | Qingyu Shi et.al. | 2410.23280 | null |
2024-10-31 | ETO:Efficient Transformer-based Local Feature Matching by Organizing Multiple Homography Hypotheses | Junjie Ni et.al. | 2410.22733 | null |
2024-10-30 | LoFLAT: Local Feature Matching using Focused Linear Attention Transformer | Naijian Cao et.al. | 2410.22710 | null |
2024-10-26 | Generative Adversarial Patches for Physical Attacks on Cross-Modal Pedestrian Re-Identification | Yue Su et.al. | 2410.20097 | null |
2024-10-01 | A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference | Yuan Li et.al. | 2410.11848 | null |
2024-10-15 | LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images | Yuzhou Cheng et.al. | 2410.11505 | null |
2024-10-12 | Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence | Felipe Cadar et.al. | 2410.09533 | link |
2024-09-27 | Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras | Yipeng Lu et.al. | 2409.18673 | null |
2024-09-25 | Game4Loc: A UAV Geo-Localization Benchmark from Game Data | Yuxiang Ji et.al. | 2409.16925 | link |
2024-09-24 | Automatic Registration of SHG and H&E Images with Feature-based Initial Alignment and Intensity-based Instance Optimization: Contribution to the COMULIS Challenge | Marek Wodzinski et.al. | 2409.15931 | null |
2024-09-10 | Weakly-supervised Camera Localization by Ground-to-satellite Image Registration | Yujiao Shi et.al. | 2409.06471 | link |
2024-09-05 | Enabling Practical and Privacy-Preserving Image Processing | Chao Wang et.al. | 2409.03568 | null |
2024-09-20 | A General Albedo Recovery Approach for Aerial Photogrammetric Images through Inverse Rendering | Shuang Song et.al. | 2409.03032 | link |
2024-08-29 | Super-Resolution works for coastal simulations | Zhi-Song Liu et.al. | 2408.16553 | null |
2024-09-15 | Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks | Sierra Bonilla et.al. | 2408.16445 | link |
2024-08-26 | Affine steerers for structured keypoint description | Georg Bökman et.al. | 2408.14186 | link |
2024-08-25 | TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers | Chuanrui Zhang et.al. | 2408.13770 | null |
2024-09-11 | Coarse-to-fine Alignment Makes Better Speech-image Retrieval | Lifeng Zhou et.al. | 2408.13119 | null |
2024-08-19 | BrewCLIP: A Bifurcated Representation Learning Framework for Audio-Visual Retrieval | Zhenyu Lu et.al. | 2408.10383 | null |
2024-08-14 | RSD-DOG : A New Image Descriptor based on Second Order Derivatives | Darshan Venkatrayappa et.al. | 2408.07687 | null |
2024-08-09 | One Shot is Enough for Sequential Infrared Small Target Segmentation | Bingbing Dan et.al. | 2408.04823 | link |
2024-08-07 | PRISM: PRogressive dependency maxImization for Scale-invariant image Matching | Xudong Cai et.al. | 2408.03598 | null |
2024-08-05 | ConDL: Detector-Free Dense Image Matching | Monika Kwiatkowski et.al. | 2408.02766 | null |
2024-08-04 | Improving Neural Surface Reconstruction with Feature Priors from Multi-View Image | Xinlin Ren et.al. | 2408.02079 | link |
2024-07-29 | Image-text matching for large-scale book collections | Artemis Llabrés et.al. | 2407.19812 | link |
2024-07-26 | PIV3CAMS: a multi-camera dataset for multiple computer vision problems and its application to novel view-point synthesis | Sohyeong Kim et.al. | 2407.18695 | null |
2024-07-22 | RADA: Robust and Accurate Feature Learning with Domain Adaptation | Jingtai He et.al. | 2407.15791 | null |
2024-07-17 | GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection | Jingwen Yu et.al. | 2407.11736 | link |
2024-07-16 | REMM:Rotation-Equivariant Framework for End-to-End Multimodal Image Matching | Han Nie et.al. | 2407.11637 | link |
2024-07-16 | A Self-Correcting Strategy of the Digital Volume Correlation Displacement Field Based on Image Matching: Application to Poor Speckles Quality and Complex-Large Deformation | Chengsheng Li et.al. | 2407.11287 | null |
2024-07-14 | Raising the Ceiling: Conflict-Free Local Feature Matching with Dynamic View Switching | Xiaoyong Lu et.al. | 2407.07789 | null |
2024-07-10 | Mutual Information calculation on different appearances | Jiecheng Liao et.al. | 2407.07410 | null |
2024-07-15 | SfM on-the-fly: Get better 3D from What You Capture | Zongqian Zhan et.al. | 2407.03939 | null |
2024-07-03 | IMC 2024 Methods & Solutions Review | Shyam Gupta et.al. | 2407.03172 | null |
2024-06-21 | High Resolution Surface Reconstruction of Cultural Heritage Objects Using Shape from Polarization Method | F. S. Mortazavi et.al. | 2406.15121 | null |
2024-06-16 | Light Up the Shadows: Enhance Long-Tailed Entity Grounding with Concept-Guided Vision-Language Models | Yikai Zhang et.al. | 2406.10902 | link |
2024-06-14 | Grounding Image Matching in 3D with MASt3R | Vincent Leroy et.al. | 2406.09756 | link |
2024-06-05 | A Self-Supervised Denoising Strategy for Underwater Acoustic Camera Imageries | Xiaoteng Zhou et.al. | 2406.02914 | null |
2024-05-22 | Affine-based Deformable Attention and Selective Fusion for Semi-dense Matching | Hongkai Chen et.al. | 2405.13874 | null |
2024-05-21 | OmniGlue: Generalizable Feature Matching with Foundation Model Guidance | Hanwen Jiang et.al. | 2405.12979 | link |
2024-05-14 | Shape-aware synthesis of pathological lung CT scans using CycleGAN for enhanced semi-supervised lung segmentation | Rezkellah Noureddine Khiati et.al. | 2405.08556 | link |
2024-05-14 | TP3M: Transformer-based Pseudo 3D Image Matching with Reference | Liming Han et.al. | 2405.08434 | null |
2024-05-13 | Authentic Hand Avatar from a Phone Scan via Universal Hand Model | Gyeongsik Moon et.al. | 2405.07933 | null |
2024-04-30 | A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images | Wang Zhang et.al. | 2404.19311 | null |
2024-04-30 | XFeat: Accelerated Features for Lightweight Image Matching | Guilherme Potje et.al. | 2404.19174 | null |
2024-06-10 | MinBackProp -- Backpropagating through Minimal Solvers | Diana Sungatullina et.al. | 2404.17993 | link |
2024-04-25 | Transformer-Based Local Feature Matching for Multimodal Image Registration | Remi Delaunay et.al. | 2404.16802 | null |
2024-04-23 | FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction | Hang Hua et.al. | 2404.14715 | null |
2024-04-22 | Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer | Eric Brachmann et.al. | 2404.14351 | null |
2024-04-17 | A Semantic Segmentation-guided Approach for Ground-to-Aerial Image Matching | Francesco Pro et.al. | 2404.11302 | link |
2024-04-16 | Exploring selective image matching methods for zero-shot and few-sample unsupervised domain adaptation of urban canopy prediction | John Francis et.al. | 2404.10626 | null |
2024-04-15 | XoFTR: Cross-modal Feature Matching Transformer | Önder Tuzcuoğlu et.al. | 2404.09692 | link |
2024-04-13 | DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint Detector | Johan Edstedt et.al. | 2404.08928 | link |
2024-04-09 | Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences | Axel Barroso-Laguna et.al. | 2404.06337 | link |
2024-04-01 | Marrying NeRF with Feature Matching for One-step Pose Estimation | Ronghan Chen et.al. | 2404.00891 | null |
2024-04-01 | 3MOS: Multi-sources, Multi-resolutions, and Multi-scenes dataset for Optical-SAR image matching | Yibin Ye et.al. | 2404.00838 | null |
2024-03-31 | On the Estimation of Image-matching Uncertainty in Visual Place Recognition | Mubariz Zaffar et.al. | 2404.00546 | null |
2024-03-30 | Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation | Yuan Wang et.al. | 2404.00262 | null |
2024-03-26 | Staircase Localization for Autonomous Exploration in Urban Environments | Jinrae Kim et.al. | 2403.17330 | null |
2024-03-23 | MatchSeg: Towards Better Segmentation via Reference Image Matching | Ruiqiang Xiao et.al. | 2403.15901 | link |
2024-03-20 | Unifying Local and Global Multimodal Features for Place Recognition in Aliased and Low-Texture Environments | Alberto García-Hernández et.al. | 2403.13395 | link |
2024-03-19 | HCPM: Hierarchical Candidates Pruning for Efficient Detector-Free Matching | Ying Chen et.al. | 2403.12543 | null |
2024-03-16 | Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval | Shunsuke Tsubaki et.al. | 2403.10756 | null |
2024-03-16 | Vector search with small radiuses | Gergely Szilvasy et.al. | 2403.10746 | null |
2024-03-15 | Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline | Fangming Yuan et.al. | 2403.10283 | null |
2024-03-15 | Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning | Meixuan Li et.al. | 2403.10252 | null |
2024-03-14 | Virtual birefringence imaging and histological staining of amyloid deposits in label-free tissue using autofluorescence microscopy and deep learning | Xilin Yang et.al. | 2403.09100 | null |
2024-03-18 | Matching Non-Identical Objects | Yusuke Marumo et.al. | 2403.08227 | null |
2024-03-11 | Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed | Yifan Wang et.al. | 2403.04765 | null |
2024-03-07 | Scene Depth Estimation from Traditional Oriental Landscape Paintings | Sungho Kang et.al. | 2403.03408 | null |
2024-02-21 | Visual Style Prompting with Swapping Self-Attention | Jaeseok Jeong et.al. | 2402.12974 | link |
2024-02-16 | GIM: Learning Generalizable Image Matcher From Internet Videos | Xuelun Shen et.al. | 2402.11095 | link |
2024-02-13 | Are Semi-Dense Detector-Free Methods Good at Matching Local Features? | Matthieu Vilain et.al. | 2402.08671 | null |
2024-02-13 | Learning to Produce Semi-dense Correspondences for Visual Localization | Khang Truong Giang et.al. | 2402.08359 | link |
2024-01-31 | Improved Scene Landmark Detection for Camera Localization | Tien Do et.al. | 2401.18083 | link |
2024-03-11 | Local Feature Matching Using Deep Learning: A Survey | Shibiao Xu et.al. | 2401.17592 | link |
2024-01-24 | Linear Relative Pose Estimation Founded on Pose-only Imaging Geometry | Qi Cai et.al. | 2401.13357 | null |
2024-01-19 | SCENES: Subpixel Correspondence Estimation With Epipolar Supervision | Dominik A. Kloepfer et.al. | 2401.10886 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-11-29 | Prajwal Singh et.al. | 2411.19903 | null | |
2024-11-29 | Gaussian Splashing: Direct Volumetric Rendering Underwater | Nir Mualem et.al. | 2411.19588 | null |
2024-11-29 | ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration | Chaojun Ni et.al. | 2411.19548 | null |
2024-11-29 | LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis | Tianqi Li et.al. | 2411.19525 | null |
2024-11-28 | SAMa: Material-aware 3D Selection and Segmentation | Michael Fischer et.al. | 2411.19322 | null |
2024-11-27 | Surf-NeRF: Surface Regularised Neural Radiance Fields | Jack Naylor et.al. | 2411.18652 | null |
2024-11-26 | MLI-NeRF: Multi-Light Intrinsic-Aware Neural Radiance Fields | Yixiong Yang et.al. | 2411.17235 | link |
2024-11-25 | The Radiance of Neural Fields: Democratizing Photorealistic and Dynamic Robotic Simulation | Georgina Nuthall et.al. | 2411.16940 | null |
2024-11-27 | SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous Driving | Georg Hess et.al. | 2411.16816 | link |
2024-11-25 | Quadratic Gaussian Splatting for Efficient and Detailed Surface Reconstruction | Ziyu Zhang et.al. | 2411.16392 | null |
2024-11-25 | U2NeRF: Unsupervised Underwater Image Restoration and Neural Radiance Fields | Vinayak Gupta et.al. | 2411.16172 | null |
2024-11-24 | ZeroGS: Training 3D Gaussian Splatting from Unposed Images | Yu Chen et.al. | 2411.15779 | null |
2024-11-24 | GSurf: 3D Reconstruction via Signed Distance Fields with Direct Gaussian Supervision | Xu Baixin et.al. | 2411.15723 | link |
2024-11-23 | NeRF Inpainting with Geometric Diffusion Prior and Balanced Score Distillation | Menglin Zhang et.al. | 2411.15551 | null |
2024-11-23 | SplatSDF: Boosting Neural Implicit SDF via Gaussian Splatting Fusion | Runfa Blark Li et.al. | 2411.15468 | null |
2024-11-20 | Sparse Input View Synthesis: 3D Representations and Reliable Priors | Nagabhushan Somraj et.al. | 2411.13631 | null |
2024-11-20 | Robust SG-NeRF: Robust Scene Graph Aided Neural Surface Reconstruction | Yi Gu et.al. | 2411.13620 | null |
2024-11-20 | GazeGaussian: High-Fidelity Gaze Redirection with 3D Gaussian Splatting | Xiaobao Wei et.al. | 2411.12981 | null |
2024-11-25 | SCIGS: 3D Gaussians Splatting from a Snapshot Compressive Image | Zixu Wang et.al. | 2411.12471 | null |
2024-11-19 | GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving | Shaoqing Xu et.al. | 2411.12452 | link |
2024-11-18 | Towards Degradation-Robust Reconstruction in Generalizable NeRF | Chan Ho Park et.al. | 2411.11691 | null |
2024-11-18 | LeC |
Zhenxing Mi et.al. | 2411.11374 | null |
2024-11-15 | The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods | Yifu Tao et.al. | 2411.10546 | null |
2024-11-15 | USP-Gaussian: Unifying Spike-based Image Reconstruction, Pose Correction and Gaussian Splatting | Kang Chen et.al. | 2411.10504 | link |
2024-11-15 | GSEditPro: 3D Gaussian Splatting Editing with Attention-based Progressive Localization | Yanhao Sun et.al. | 2411.10033 | null |
2024-11-22 | BillBoard Splatting (BBSplat): Learnable Textured Primitives for Novel View Synthesis | David Svitov et.al. | 2411.08508 | null |
2024-11-13 | Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model | Yutao Shen et.al. | 2411.08453 | null |
2024-11-13 | MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation | Peng Wang et.al. | 2411.08279 | link |
2024-11-12 | TomoGRAF: A Robust and Generalizable Reconstruction Network for Single-View Computed Tomography | Di Xu et.al. | 2411.08158 | null |
2024-11-12 | Material Transforms from Disentangled NeRF Representations | Ivan Lopes et.al. | 2411.08037 | link |
2024-11-11 | LuSh-NeRF: Lighting up and Sharpening NeRFs for Low-light Scenes | Zefan Qu et.al. | 2411.06757 | null |
2024-11-10 | Through the Curved Cover: Synthesizing Cover Aberrated Scenes with Refractive Field | Liuyue Xie et.al. | 2411.06365 | null |
2024-11-09 | AI-Driven Stylization of 3D Environments | Yuanbo Chen et.al. | 2411.06067 | null |
2024-11-08 | A Nerf-Based Color Consistency Method for Remote Sensing Images | Zongcheng Zuo et.al. | 2411.05557 | null |
2024-11-08 | Rate-aware Compression for NeRF-based Volumetric Video | Zhiyu Zhang et.al. | 2411.05322 | null |
2024-11-07 | Planar Reflection-Aware Neural Radiance Fields | Chen Gao et.al. | 2411.04984 | null |
2024-11-07 | GANESH: Generalizable NeRF for Lensless Imaging | Rakesh Raj Madavan et.al. | 2411.04810 | null |
2024-11-08 | SuperQ-GRASP: Superquadrics-based Grasp Pose Estimation on Larger Objects for Mobile-Manipulation | Xun Tu et.al. | 2411.04386 | null |
2024-11-06 | Structure Consistent Gaussian Splatting with Matching Prior for Few-shot Novel View Synthesis | Rui Peng et.al. | 2411.03637 | link |
2024-11-05 | Enhancing Exploratory Capability of Visual Navigation Using Uncertainty of Implicit Scene Representation | Yichen Wang et.al. | 2411.03487 | link |
2024-11-05 | CAD-NeRF: Learning NeRFs from Uncalibrated Few-view Images by CAD Model Retrieval | Xin Wen et.al. | 2411.02979 | null |
2024-11-05 | Exploring Seasonal Variability in the Context of Neural Radiance Fields for 3D Reconstruction on Satellite Imagery | Liv Kåreborn et.al. | 2411.02972 | null |
2024-11-05 | Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation | Xavier Timoneda et.al. | 2411.02969 | null |
2024-11-04 | NeRF-Aug: Data Augmentation for Robotics with Neural Radiance Fields | Eric Zhu et.al. | 2411.02482 | null |
2024-11-05 | FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training | Ruihong Yin et.al. | 2411.02229 | null |
2024-11-06 | GVKF: Gaussian Voxel Kernel Functions for Highly Efficient Surface Reconstruction in Open Scenes | Gaochao Song et.al. | 2411.01853 | null |
2024-11-04 | A Probabilistic Formulation of LiDAR Mapping with Neural Radiance Fields | Matthew McDermott et.al. | 2411.01725 | link |
2024-11-01 | ZIM: Zero-Shot Image Matting for Anything | Beomyoung Kim et.al. | 2411.00626 | link |
2024-10-31 | Scaled Inverse Graphics: Efficiently Learning Large Sets of 3D Scenes | Karim Kassab et.al. | 2410.23742 | null |
2024-10-31 | Get a Grip: Multi-Finger Grasp Evaluation at Scale Enables Robust Sim-to-Real Transfer | Tyler Ga Wei Lum et.al. | 2410.23701 | null |
2024-10-31 | XRDSLAM: A Flexible and Modular Framework for Deep Learning based SLAM | Xiaomeng Wang et.al. | 2410.23690 | link |
2024-10-30 | Bringing NeRFs to the Latent Space: Inverse Graphics Autoencoder | Antoine Schnepf et.al. | 2410.22936 | null |
2024-10-28 | MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps | Yating Xu et.al. | 2410.21566 | link |
2024-10-29 | EEG-Driven 3D Object Reconstruction with Color Consistency and Diffusion Prior | Xin Xiang et.al. | 2410.20981 | null |
2024-10-28 | ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings | Suyoung Lee et.al. | 2410.20686 | null |
2024-10-27 | GUMBEL-NERF: Representing Unseen Objects as Part-Compositional Neural Radiance Fields | Yusuke Sekikawa et.al. | 2410.20306 | null |
2024-10-25 | Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization | Weihang Liu et.al. | 2410.19483 | link |
2024-10-25 | Evaluation of strategies for efficient rate-distortion NeRF streaming | Pedro Martin et.al. | 2410.19459 | null |
2024-10-27 | Binocular-Guided 3D Gaussian Splatting with View Consistency for Sparse View Synthesis | Liang Han et.al. | 2410.18822 | null |
2024-10-24 | Real-time 3D-aware Portrait Video Relighting | Ziqi Cai et.al. | 2410.18355 | link |
2024-10-22 | Advancing Super-Resolution in Neural Radiance Fields via Variational Diffusion Strategies | Shrey Vishen et.al. | 2410.18137 | link |
2024-10-23 | VR-Splatting: Foveated Radiance Field Rendering via 3D Gaussian Splatting and Neural Points | Linus Franke et.al. | 2410.17932 | null |
2024-10-23 | Few-shot NeRF by Adaptive Rendering Loss Regularization | Qingshan Xu et.al. | 2410.17839 | null |
2024-10-23 | Efficient Neural Implicit Representation for 3D Human Reconstruction | Zexu Huang et.al. | 2410.17741 | link |
2024-10-23 | PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting | Yu Wang et.al. | 2410.17505 | null |
2024-10-22 | LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias | Haian Jin et.al. | 2410.17242 | null |
2024-10-18 | GS-LIVM: Real-Time Photo-Realistic LiDAR-Inertial-Visual Mapping with Gaussian Splatting | Yusen Xie et.al. | 2410.17084 | null |
2024-10-22 | E-3DGS: Gaussian Splatting with Exposure and Motion Events | Xiaoting Yin et.al. | 2410.16995 | link |
2024-10-21 | Joker: Conditional 3D Head Synthesis with Extreme Facial Expressions | Malte Prinzler et.al. | 2410.16395 | null |
2024-10-21 | FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors | Chin-Yang Lin et.al. | 2410.16271 | null |
2024-10-22 | EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting | Bohao Liao et.al. | 2410.15392 | null |
2024-10-19 | Neural Radiance Field Image Refinement through End-to-End Sampling Point Optimization | Kazuhiro Ohta et.al. | 2410.14958 | null |
2024-10-18 | Learning autonomous driving from aerial imagery | Varun Murali et.al. | 2410.14177 | null |
2024-10-18 | DaRePlane: Direction-aware Representations for Dynamic Scene Reconstruction | Ange Lou et.al. | 2410.14169 | null |
2024-10-17 | DN-4DGS: Denoised Deformable Network with Temporal-Spatial Aggregation for Dynamic Scene Rendering | Jiahao Lu et.al. | 2410.13607 | link |
2024-10-21 | DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation | Guosheng Zhao et.al. | 2410.13571 | null |
2024-10-17 | Object Pose Estimation Using Implicit Representation For Transparent Objects | Varun Burde et.al. | 2410.13465 | null |
2024-10-17 | GlossyGS: Inverse Rendering of Glossy Objects with 3D Gaussian Splatting | Shuichang Lai et.al. | 2410.13349 | null |
2024-10-16 | 3D Gaussian Splatting in Robotics: A Survey | Siting Zhu et.al. | 2410.12262 | null |
2024-10-16 | EG-HumanNeRF: Efficient Generalizable Human NeRF Utilizing Human Prior for Sparse View | Zhaorong Wang et.al. | 2410.12242 | null |
2024-10-14 | 3DArticCyclists: Generating Simulated Dynamic 3D Cyclists for Human-Object Interaction (HOI) and Autonomous Driving Applications | Eduardo R. Corral-Soto et.al. | 2410.10782 | null |
2024-10-14 | NeRF-enabled Analysis-Through-Synthesis for ISAR Imaging of Small Everyday Objects with Sparse and Noisy UWB Radar Data | Md Farhan Tasnim Oshim et.al. | 2410.10085 | null |
2024-10-13 | Magnituder Layers for Implicit Neural Representations in 3D | Sang Min Kim et.al. | 2410.09771 | null |
2024-10-12 | Improving 3D Finger Traits Recognition via Generalizable Neural Rendering | Hongbin Xu et.al. | 2410.09582 | null |
2024-10-11 | SceneCraft: Layout-Guided 3D Scene Generation | Xiuyu Yang et.al. | 2410.09049 | link |
2024-10-11 | MeshGS: Adaptive Mesh-Aligned Gaussian Splatting for High-Quality Rendering | Jaehoon Choi et.al. | 2410.08941 | null |
2024-10-11 | Optimizing NeRF-based SLAM with Trajectory Smoothness Constraints | Yicheng He et.al. | 2410.08780 | null |
2024-10-10 | RGM: Reconstructing High-fidelity 3D Car Assets with Relightable 3D-GS Generative Model from a Single Image | Xiaoxue Chen et.al. | 2410.08181 | null |
2024-10-10 | IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera | Jian Huang et.al. | 2410.08107 | link |
2024-10-11 | NeRF-Accelerated Ecological Monitoring in Mixed-Evergreen Redwood Forest | Adam Korycki et.al. | 2410.07418 | link |
2024-10-09 | DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation | Zhiqi Li et.al. | 2410.06756 | null |
2024-10-09 | MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes | Zhenhui Ye et.al. | 2410.06734 | null |
2024-10-09 | 3D Representation Methods: A Survey | Zhengren Wang et.al. | 2410.06475 | null |
2024-10-08 | Comparative Analysis of Novel View Synthesis and Photogrammetry for 3D Forest Stand Reconstruction and extraction of individual tree parameters | Guoji Tian et.al. | 2410.05772 | null |
2024-10-07 | Toward General Object-level Mapping from Sparse Views with 3D Diffusion Priors | Ziwei Liao et.al. | 2410.05514 | link |
2024-10-07 | PH-Dropout: Prctical Epistemic Uncertainty Quantification for View Synthesis | Chuanhao Sun et.al. | 2410.05468 | link |
2024-10-07 | LiDAR-GS:Real-time LiDAR Re-Simulation using Gaussian Splatting | Qifeng Chen et.al. | 2410.05111 | null |
2024-10-07 | 6DGS: Enhanced Direction-Aware Gaussian Splatting for Volumetric Rendering | Zhongpai Gao et.al. | 2410.04974 | null |
2024-10-07 | TeX-NeRF: Neural Radiance Fields from Pseudo-TeX Vision | Chonghao Zhong et.al. | 2410.04873 | null |
2024-10-06 | Deformable NeRF using Recursively Subdivided Tetrahedra | Zherui Qiu et.al. | 2410.04402 | null |
2024-10-05 | Hybrid NeRF-Stereo Vision: Pioneering Depth Estimation and 3D Reconstruction in Endoscopy | Pengcheng Chen et.al. | 2410.04041 | null |
2024-10-02 | MVGS: Multi-view-regulated Gaussian Splatting for Novel View Synthesis | Xiaobiao Du et.al. | 2410.02103 | link |
2024-10-03 | EVER: Exact Volumetric Ellipsoid Rendering for Real-time View Synthesis | Alexander Mai et.al. | 2410.01804 | null |
2024-10-02 | 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection | Yang Cao et.al. | 2410.01647 | link |
2024-10-02 | Gaussian Splatting in Mirrors: Reflection-Aware Rendering via Virtual Camera Optimization | Zihan Wang et.al. | 2410.01614 | null |
2024-10-02 | Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection | Hongru Yan et.al. | 2410.01404 | null |
2024-10-01 | GMT: Enhancing Generalizable Neural Rendering via Geometry-Driven Multi-Reference Texture Transfer | Youngho Yoon et.al. | 2410.00672 | link |
2024-09-30 | Distributed NeRF Learning for Collaborative Multi-Robot Perception | Hongrui Zhao et.al. | 2409.20289 | null |
2024-09-30 | Active Neural Mapping at Scale | Zijia Kuang et.al. | 2409.20276 | null |
2024-09-30 | OPONeRF: One-Point-One NeRF for Robust Neural Rendering | Yu Zheng et.al. | 2409.20043 | link |
2024-09-28 | G3R: Gradient Guided Generalizable Reconstruction | Yun Chen et.al. | 2409.19405 | null |
2024-09-26 | LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field | Huan Wang et.al. | 2409.18057 | link |
2024-09-26 | Deblur e-NeRF: NeRF from Motion-Blurred Events under High-speed or Low-light Conditions | Weng Fei Low et.al. | 2409.17988 | null |
2024-09-26 | Neural Implicit Representation for Highly Dynamic LiDAR Mapping and Odometry | Qi Zhang et.al. | 2409.17729 | null |
2024-09-26 | TFS-NeRF: Template-Free NeRF for Semantic 3D Reconstruction of Dynamic Scene | Sandika Biswas et.al. | 2409.17459 | null |
2024-09-25 | SeaSplat: Representing Underwater Scenes with 3D Gaussian Splatting and a Physically Grounded Image Formation Model | Daniel Yang et.al. | 2409.17345 | null |
2024-09-25 | TalkinNeRF: Animatable Neural Fields for Full-Body Talking Humans | Aggelina Chatziagapi et.al. | 2409.16666 | null |
2024-09-26 | Gaussian Deja-vu: Creating Controllable 3D Gaussian Head-Avatars with Enhanced Generalization and Personalization Abilities | Peizhi Yan et.al. | 2409.16147 | link |
2024-09-24 | Disentangled Generation and Aggregation for Robust Radiance Fields | Shihe Shen et.al. | 2409.15715 | null |
2024-09-24 | Plenoptic PNG: Real-Time Neural Radiance Fields in 150 KB | Jae Yong Lee et.al. | 2409.15689 | null |
2024-09-23 | AgriNeRF: Neural Radiance Fields for Agriculture in Challenging Lighting Conditions | Samarth Chopra et.al. | 2409.15487 | null |
2024-09-22 | MVPGS: Excavating Multi-view Priors for Gaussian Splatting from Sparse Input Views | Wangze Xu et.al. | 2409.14316 | null |
2024-09-21 | MOSE: Monocular Semantic Reconstruction Using NeRF-Lifted Noisy Priors | Zhenhua Du et.al. | 2409.14019 | null |
2024-09-19 | CrossRT: A cross platform programming technology for hardware-accelerated ray tracing in CG and CV applications | Vladimir Frolov et.al. | 2409.12617 | null |
2024-09-18 | JEAN: Joint Expression and Audio-guided NeRF-based Talking Face Generation | Sai Tanmay Reddy Chakkera et.al. | 2409.12156 | null |
2024-09-25 | BRDF-NeRF: Neural Radiance Fields with Optical Satellite Images and BRDF Modelling | Lulin Zhang et.al. | 2409.12014 | link |
2024-09-17 | RenderWorld: World Model with Self-Supervised 3D Label | Ziyang Yan et.al. | 2409.11356 | null |
2024-09-21 | HGSLoc: 3DGS-based Heuristic Camera Pose Refinement | Zhongyan Niu et.al. | 2409.10925 | null |
2024-09-16 | Baking Relightable NeRF for Real-time Direct/Indirect Illumination Rendering | Euntae Choi et.al. | 2409.10327 | null |
2024-09-16 | DENSER: 3D Gaussians Splatting for Scene Reconstruction of Dynamic Urban Environments | Mahmud A. Mohamad et.al. | 2409.10041 | link |
2024-09-15 | NARF24: Estimating Articulated Object Structure for Implicit Rendering | Stanley Lewis et.al. | 2409.09829 | null |
2024-09-12 | DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors | Thomas Hanwen Zhu et.al. | 2409.08278 | null |
2024-09-11 | DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation | Haibo Yang et.al. | 2409.07454 | null |
2024-09-11 | ThermalGaussian: Thermal 3D Gaussian Splatting | Rongfeng Lu et.al. | 2409.07200 | null |
2024-09-10 | LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation | Archana Swaminathan et.al. | 2409.06703 | null |
2024-09-10 | Sources of Uncertainty in 3D Scene Reconstruction | Marcus Klasson et.al. | 2409.06407 | link |
2024-09-09 | LSE-NeRF: Learning Sensor Modeling Errors for Deblured Neural Radiance Fields with RGB-Event Stereo | Wei Zhi Tang et.al. | 2409.06104 | link |
2024-09-09 | G-NeLF: Memory- and Data-Efficient Hybrid Neural Light Field for Novel View Synthesis | Lutao Jiang et.al. | 2409.05617 | null |
2024-09-09 | From Words to Poses: Enhancing Novel Object Pose Estimation with Vision Language Models | Tessa Pulli et.al. | 2409.05413 | null |
2024-09-09 | KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction | Davide Di Nucci et.al. | 2409.05407 | null |
2024-09-09 | Lagrangian Hashing for Compressed Neural Field Representations | Shrisudhan Govindarajan et.al. | 2409.05334 | null |
2024-09-09 | Neural Surface Reconstruction and Rendering for LiDAR-Visual Systems | Jianheng Liu et.al. | 2409.05310 | null |
2024-09-06 | SCARF: Scalable Continual Learning Framework for Memory-efficient Multiple Neural Radiance Fields | Yuze Wang et.al. | 2409.04482 | null |
2024-09-05 | Weight Conditioning for Smooth Optimization of Neural Networks | Hemanth Saratchandran et.al. | 2409.03424 | null |
2024-09-05 | Optimizing 3D Gaussian Splatting for Sparse Viewpoint Scene Reconstruction | Shen Chen et.al. | 2409.03213 | null |
2024-09-04 | UC-NeRF: Uncertainty-aware Conditional Neural Radiance Fields from Endoscopic Sparse Views | Jiaxin Guo et.al. | 2409.02917 | link |
2024-09-03 | GraspSplats: Efficient Manipulation with 3D Feature Splatting | Mazeyu Ji et.al. | 2409.02084 | null |
2024-09-03 | Bokang Zhang et.al. | 2409.01661 | link | |
2024-08-30 | ConDense: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View Images | Xiaoshuai Zhang et.al. | 2408.17027 | null |
2024-08-29 | GameIR: A Large-Scale Synthesized Ground-Truth Dataset for Image Restoration over Gaming Content | Lebin Zhou et.al. | 2408.16866 | null |
2024-09-01 | Generic Objects as Pose Probes for Few-Shot View Synthesis | Zhirui Gao et.al. | 2408.16690 | null |
2024-08-29 | Spurfies: Sparse Surface Reconstruction using Local Geometry Priors | Kevin Raj et.al. | 2408.16544 | null |
2024-08-29 | NeRF-CA: Dynamic Reconstruction of X-ray Coronary Angiography with Extremely Sparse-views | Kirsten W. H. Maas et.al. | 2408.16355 | link |
2024-08-28 | Towards Realistic Example-based Modeling via 3D Gaussian Stitching | Xinyu Gao et.al. | 2408.15708 | null |
2024-08-27 | Learning-based Multi-View Stereo: A Survey | Fangjinhua Wang et.al. | 2408.15235 | null |
2024-08-27 | GeoTransfer : Generalizable Few-Shot Multi-View Reconstruction via Transfer Learning | Shubhendu Jena et.al. | 2408.14724 | null |
2024-08-28 | FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry | Chunran Zheng et.al. | 2408.14035 | link |
2024-08-25 | TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers | Chuanrui Zhang et.al. | 2408.13770 | null |
2024-08-24 | G3DST: Generalizing 3D Style Transfer with Neural Radiance Fields across Scenes and Styles | Adil Meric et.al. | 2408.13508 | null |
2024-08-23 | SIn-NeRF2NeRF: Editing 3D Scenes with Instructions through Segmentation and Inpainting | Jiseung Hong et.al. | 2408.13285 | link |
2024-08-21 | Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations | Lintong Zhang et.al. | 2408.11966 | null |
2024-08-21 | Irregularity Inspection using Neural Radiance Field | Tianqi Ding et.al. | 2408.11251 | null |
2024-08-20 | GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting | Changkun Liu et.al. | 2408.11085 | null |
2024-08-20 | Learning Part-aware 3D Representations by Fusing 2D Gaussians and Superquadrics | Zhirui Gao et.al. | 2408.10789 | null |
2024-08-20 | TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks | Jinjie Mai et.al. | 2408.10739 | null |
2024-08-19 | Haoyang Wang et.al. | 2408.10135 | null | |
2024-08-19 | DiscoNeRF: Class-Agnostic Object Field for 3D Object Discovery | Corentin Dumery et.al. | 2408.09928 | null |
2024-08-20 | CHASE: 3D-Consistent Human Avatars with Sparse Inputs via Gaussian Splatting and Contrastive Learning | Haoyu Zhao et.al. | 2408.09663 | null |
2024-08-18 | S^3D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High Fidelity Talking Head Synthesis | Dongze Li et.al. | 2408.09347 | null |
2024-08-17 | SSNeRF: Sparse View Semi-supervised Neural Radiance Fields with Augmentation | Xiao Cao et.al. | 2408.09144 | null |
2024-08-17 | HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction | Xiao Zhao et.al. | 2408.09104 | null |
2024-08-16 | VF-NeRF: Learning Neural Vector Fields for Indoor Scene Reconstruction | Albert Gassol Puigjaner et.al. | 2408.08766 | link |
2024-08-15 | WaterSplatting: Fast Underwater 3D Scene Reconstruction Using Gaussian Splatting | Huapeng Li et.al. | 2408.08206 | null |
2024-08-18 | Rethinking Open-Vocabulary Segmentation of Radiance Fields in 3D Space | Hyunjee Lee et.al. | 2408.07416 | null |
2024-08-13 | Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture | Yu Feng et.al. | 2408.06608 | null |
2024-08-13 | ActiveNeRF: Learning Accurate 3D Geometry by Active Pattern Projection | Jianyu Tao et.al. | 2408.06592 | link |
2024-08-13 | HDRGS: High Dynamic Range Gaussian Splatting | Jiahao Wu et.al. | 2408.06543 | link |
2024-08-12 | Mipmap-GS: Let Gaussians Deform with Scale-specific Mipmap for Anti-aliasing Rendering | Jiameng Li et.al. | 2408.06286 | link |
2024-08-12 | 3D Reconstruction of Protein Structures from Multi-view AFM Images using Neural Radiance Fields (NeRFs) | Jaydeep Rade et.al. | 2408.06244 | null |
2024-08-10 | Radiance Field Learners As UAV First-Person Viewers | Liqi Yan et.al. | 2408.05533 | null |
2024-08-09 | DreamCouple: Exploring High Quality Text-to-3D Generation Via Rectified Flow | Hangyu Li et.al. | 2408.05008 | null |
2024-08-09 | FewShotNeRF: Meta-Learning-based Novel View Synthesis for Rapid Scene-Specific Adaptation | Piraveen Sivakumar et.al. | 2408.04803 | null |
2024-08-06 | LumiGauss: High-Fidelity Outdoor Relighting with 2D Gaussian Splatting | Joanna Kaleta et.al. | 2408.04474 | link |
2024-08-08 | A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery | Mengya Xu et.al. | 2408.04426 | link |
2024-08-08 | Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods | Yiming Zhou et.al. | 2408.04268 | null |
2024-08-07 | Goal-oriented Semantic Communication for the Metaverse Application | Zhe Wang et.al. | 2408.03646 | null |
2024-08-06 | RayGauss: Volumetric Gaussian-Based Ray Casting for Photorealistic Novel View Synthesis | Hugo Blanc et.al. | 2408.03356 | null |
2024-08-06 | Efficient NeRF Optimization -- Not All Samples Remain Equally Hard | Juuso Korhonen et.al. | 2408.03193 | null |
2024-08-06 | MGFs: Masked Gaussian Fields for Meshing Building based on Multi-View Images | Tengfei Wang et.al. | 2408.03060 | null |
2024-08-04 | PanicleNeRF: low-cost, high-precision in-field phenotypingof rice panicles with smartphone | Xin Yang et.al. | 2408.02053 | null |
2024-08-03 | FBINeRF: Feature-Based Integrated Recurrent Network for Pinhole and Fisheye Neural Radiance Fields | Yifan Wu et.al. | 2408.01878 | null |
2024-08-03 | E |
Yunshan Qi et.al. | 2408.01840 | null |
2024-08-02 | NeRFoot: Robot-Footprint Estimation for Image-Based Visual Servoing | Daoxin Zhong et.al. | 2408.01251 | null |
2024-08-05 | UlRe-NeRF: 3D Ultrasound Imaging through Neural Rendering with Ultrasound Reflection Direction Parameterization | Ziwen Guo et.al. | 2408.00860 | null |
2024-07-31 | StyleRF-VolVis: Style Transfer of Neural Radiance Fields for Expressive Volume Visualization | Kaiyuan Tang et.al. | 2408.00150 | null |
2024-07-22 | PAV: Personalized Head Avatar from Unstructured Video Collection | Akin Caliskan et.al. | 2407.21047 | null |
2024-07-30 | Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering | Yanpeng Zhao et.al. | 2407.20908 | link |
2024-07-29 | Radiance Fields for Robotic Teleoperation | Maximum Wilder-Smith et.al. | 2407.20194 | link |
2024-07-29 | Garment Animation NeRF with Color Editing | Renke Wang et.al. | 2407.19774 | link |
2024-07-27 | Revisit Self-supervised Depth Estimation with Local Structure-from-Motion | Shengjie Zhu et.al. | 2407.19166 | null |
2024-07-26 | IOVS4NeRF:Incremental Optimal View Selection for Large-Scale NeRFs | Jingpeng Xie et.al. | 2407.18611 | null |
2024-07-24 | SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency | Yiming Xie et.al. | 2407.17470 | null |
2024-07-23 | HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction from Raw Images | Shreyas Singh et.al. | 2407.16503 | link |
2024-07-23 | DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors | Zizheng Yan et.al. | 2407.16260 | null |
2024-07-22 | BoostMVSNeRFs: Boosting MVS-based NeRFs to Generalizable View Synthesis in Large-scale Scenes | Chih-Hai Su et.al. | 2407.15848 | null |
2024-07-22 | Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures | Ruizhe Wang et.al. | 2407.15435 | null |
2024-07-19 | HOTS3D: Hyper-Spherical Optimal Transport for Semantic Alignment of Text-to-3D Generation | Zezeng Li et.al. | 2407.14419 | null |
2024-07-19 | DirectL: Efficient Radiance Fields Rendering for 3D Light Field Displays | Zongyuan Yang et.al. | 2407.14053 | null |
2024-07-19 | Semantic Communications for 3D Human Face Transmission with Neural Radiance Fields | Guanlin Wu et.al. | 2407.13992 | null |
2024-07-18 | EaDeblur-GS: Event assisted 3D Deblur Reconstruction with Gaussian Splatting | Yuchen Weng et.al. | 2407.13520 | null |
2024-07-18 | GeometrySticker: Enabling Ownership Claim of Recolorized Neural Radiance Fields | Xiufeng Huang et.al. | 2407.13390 | null |
2024-07-18 | KFD-NeRF: Rethinking Dynamic NeRF with Kalman Filter | Yifan Zhan et.al. | 2407.13185 | null |
2024-07-17 | Generalizable Human Gaussians for Sparse View Synthesis | Youngjoong Kwon et.al. | 2407.12777 | link |
2024-07-17 | SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization | Yiyang Chen et.al. | 2407.12667 | link |
2024-07-17 | InfoNorm: Mutual Information Shaping of Normals for Sparse-View Reconstruction | Xulong Wang et.al. | 2407.12661 | link |
2024-07-17 | Invertible Neural Warp for NeRF | Shin-Fang Chng et.al. | 2407.12354 | null |
2024-07-17 | Splatfacto-W: A Nerfstudio Implementation of Gaussian Splatting for Unconstrained Photo Collections | Congrong Xu et.al. | 2407.12306 | null |
2024-07-18 | Motion-Oriented Compositional Neural Radiance Fields for Monocular Dynamic Human Modeling | Jaehyeok Kim et.al. | 2407.11962 | null |
2024-07-18 | IPA-NeRF: Illusory Poisoning Attack Against Neural Radiance Fields | Wenxiang Jiang et.al. | 2407.11921 | link |
2024-07-16 | DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation | Jiwook Kim et.al. | 2407.11394 | link |
2024-07-15 | Evaluating geometric accuracy of NeRF reconstructions compared to SLAM method | Adam Korycki et.al. | 2407.11238 | null |
2024-07-15 | AirNeRF: 3D Reconstruction of Human with Drone and NeRF for Future Communication Systems | Alexey Kotcov et.al. | 2407.10865 | null |
2024-07-15 | Domain Generalization for 6D Pose Estimation Through NeRF-based Image Synthesis | Antoine Legrand et.al. | 2407.10762 | null |
2024-07-15 | IE-NeRF: Inpainting Enhanced Neural Radiance Fields in the Wild | Shuaixian Wang et.al. | 2407.10695 | null |
2024-07-15 | NGP-RT: Fusing Multi-Level Hash Features with Lightweight Attention for Real-Time Novel View Synthesis | Yubin Hu et.al. | 2407.10482 | null |
2024-07-15 | Boost Your NeRF: A Model-Agnostic Mixture of Experts Framework for High Quality and Efficient Rendering | Francesco Di Sario et.al. | 2407.10389 | null |
2024-07-14 | RS-NeRF: Neural Radiance Fields from Rolling Shutter Images | Muyao Niu et.al. | 2407.10267 | link |
2024-07-14 | SpikeGS: 3D Gaussian Splatting from Spike Streams with High-Speed Camera Motion | Jiyuan Zhang et.al. | 2407.10062 | null |
2024-07-12 | Physics-Informed Learning of Characteristic Trajectories for Smoke Reconstruction | Yiming Wang et.al. | 2407.09679 | link |
2024-07-12 | Radiance Fields from Photons | Sacha Jungerman et.al. | 2407.09386 | null |
2024-07-12 | HPC: Hierarchical Progressive Coding Framework for Volumetric Video | Zihan Zheng et.al. | 2407.09026 | null |
2024-07-11 | Feasibility of Neural Radiance Fields for Crime Scene Video Reconstruction | Shariq Nadeem Malik et.al. | 2407.08795 | null |
2024-07-11 | WildGaussians: 3D Gaussian Splatting in the Wild | Jonas Kulhanek et.al. | 2407.08447 | link |
2024-07-11 | MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos | Yushuo Chen et.al. | 2407.08414 | link |
2024-07-11 | Explicit_NeRF_QA: A Quality Assessment Database for Explicit NeRF Model Compression | Yuke Xing et.al. | 2407.08165 | null |
2024-07-11 | Bayesian uncertainty analysis for underwater 3D reconstruction with neural radiance fields | Haojie Lian et.al. | 2407.08154 | null |
2024-07-11 | Survey on Fundamental Deep Learning 3D Reconstruction Techniques | Yonge Bai et.al. | 2407.08137 | null |
2024-07-10 | Protecting NeRFs' Copyright via Plug-And-Play Watermarking Base Model | Qi Song et.al. | 2407.07735 | null |
2024-07-10 | Drantal-NeRF: Diffusion-Based Restoration for Anti-aliasing Neural Radiance Field | Ganlin Yang et.al. | 2407.07461 | null |
2024-07-09 | Reference-based Controllable Scene Stylization with Gaussian Splatting | Yiqun Mei et.al. | 2407.07220 | null |
2024-07-09 | Sparse-DeRF: Deblurred Neural Radiance Fields from Sparse View | Dogyoon Lee et.al. | 2407.06613 | null |
2024-07-08 | RRM: Relightable assets using Radiance guided Material extraction | Diego Gomez et.al. | 2407.06397 | null |
2024-07-08 | PanDORA: Casual HDR Radiance Acquisition for Indoor Scenes | Mohammad Reza Karimi Dastjerdi et.al. | 2407.06150 | null |
2024-07-08 | Enhancing Neural Radiance Fields with Depth and Normal Completion Priors from Sparse Views | Jiawei Guo et.al. | 2407.05666 | null |
2024-07-08 | GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields | Weiyi Xue et.al. | 2407.05597 | null |
2024-07-08 | Dynamic Neural Radiance Field From Defocused Monocular Video | Xianrui Luo et.al. | 2407.05586 | null |
2024-07-07 | GaussReg: Fast 3D Registration with Gaussian Splatting | Jiahao Chang et.al. | 2407.05254 | null |
2024-07-06 | SurgicalGaussian: Deformable 3D Gaussians for High-Fidelity Surgical Scene Reconstruction | Weixing Xie et.al. | 2407.05023 | null |
2024-07-04 | CRiM-GS: Continuous Rigid Motion-Aware Gaussian Splatting from Motion Blur Images | Junghe Lee et.al. | 2407.03923 | null |
2024-07-02 | MomentsNeRF: Leveraging Orthogonal Moments for Few-Shot Neural Rendering | Ahmad AlMughrabi et.al. | 2407.02668 | null |
2024-07-03 | BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream | Wenpu Li et.al. | 2407.02174 | link |
2024-07-01 | Active Human Pose Estimation via an Autonomous UAV Agent | Jingxi Chen et.al. | 2407.01811 | null |
2024-07-01 | DRAGON: Drone and Ground Gaussian Splatting for 3D Building Reconstruction | Yujin Ham et.al. | 2407.01761 | null |
2024-07-01 | Fast and Efficient: Mask Neural Fields for 3D Scene Segmentation | Zihan Gao et.al. | 2407.01220 | null |
2024-06-29 | Intrinsic PAPR for Point-level 3D Scene Albedo and Shading Editing | Alireza Moazeni et.al. | 2407.00500 | null |
2024-06-28 | ASSR-NeRF: Arbitrary-Scale Super-Resolution on Voxel Grid for High-Quality Radiance Fields Reconstruction | Ding-Jiun Huang et.al. | 2406.20066 | null |
2024-06-28 | EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting | Daiwei Zhang et.al. | 2406.19811 | null |
2024-06-27 | Shorter SPECT Scans Using Self-supervised Coordinate Learning to Synthesize Skipped Projection Views | Zongyu Li et.al. | 2406.18840 | null |
2024-06-25 | Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes | Qi Ma et.al. | 2406.17438 | link |
2024-06-25 | NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods | Jonas Kulhanek et.al. | 2406.17345 | null |
2024-06-24 | From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking | Xiaohao Xu et.al. | 2406.16850 | link |
2024-06-24 | Articulate your NeRF: Unsupervised articulated object modeling via conditional view synthesis | Jianning Deng et.al. | 2406.16623 | null |
2024-06-24 | Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction | Tong Qin et.al. | 2406.16289 | null |
2024-06-23 | Towards Real-Time Neural Volumetric Rendering on Mobile Devices: A Measurement Study | Zhe Wang et.al. | 2406.16068 | null |
2024-06-23 | Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction | Yangdi Lu et.al. | 2406.15982 | null |
2024-06-22 | psPRF:Pansharpening Planar Neural Radiance Field for Generalized 3D Reconstruction Satellite Imagery | Tongtong Zhang et.al. | 2406.15707 | null |
2024-06-21 | A3D: Does Diffusion Dream about 3D Alignment? | Savva Ignatyev et.al. | 2406.15020 | null |
2024-06-21 | E2GS: Event Enhanced Gaussian Splatting | Hiroyuki Deguchi et.al. | 2406.14978 | link |
2024-06-21 | Relighting Scenes with Object Insertions in Neural Radiance Fields | Xuening Zhu et.al. | 2406.14806 | null |
2024-06-20 | Deblurring Neural Radiance Fields with Event-driven Bundle Adjustment | Yunshan Qi et.al. | 2406.14360 | null |
2024-06-19 | NeRF-Feat: 6D Object Pose Estimation using Feature Rendering | Shishir Reddy Vutukur et.al. | 2406.13796 | null |
2024-06-19 | Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images | Haruo Fujiwara et.al. | 2406.13393 | null |
2024-06-19 | Freq-Mip-AA : Frequency Mip Representation for Anti-Aliasing Neural Radiance Fields | Youngin Park et.al. | 2406.13251 | link |
2024-06-18 | Sampling 3D Gaussian Scenes in Seconds with Latent Diffusion Models | Paul Henderson et.al. | 2406.13099 | null |
2024-06-18 | Head Pose Estimation and 3D Neural Surface Reconstruction via Monocular Camera in situ for Navigation and Safe Insertion into Natural Openings | Ruijie Tang et.al. | 2406.13048 | null |
2024-06-18 | Fast Global Localization on Neural Radiance Field | Mangyu Kong et.al. | 2406.12202 | null |
2024-06-20 | TutteNet: Injective 3D Deformations by Composition of 2D Mesh Deformations | Bo Sun et.al. | 2406.12121 | null |
2024-06-17 | DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features | Letian Wang et.al. | 2406.12095 | null |
2024-06-17 | Uncertainty modeling for fine-tuned implicit functions | Anna Susmelj et.al. | 2406.12082 | null |
2024-06-17 | LLaNA: Large Language and NeRF Assistant | Andrea Amaduzzi et.al. | 2406.11840 | null |
2024-06-17 | Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization | Huaiji Zhou et.al. | 2406.11766 | null |
2024-06-17 | InterNeRF: Scaling Radiance Fields via Parameter Interpolation | Clinton Wang et.al. | 2406.11737 | null |
2024-06-17 | NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation | Niu Guanchen et.al. | 2406.11259 | null |
2024-06-15 | NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows | Zhenggang Tang et.al. | 2406.10543 | link |
2024-06-15 | Federated Neural Radiance Field for Distributed Intelligence | Yintian Zhang et.al. | 2406.10474 | null |
2024-06-14 | Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections | Jiacong Xu et.al. | 2406.10373 | null |
2024-06-14 | PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting | Alex Hanson et.al. | 2406.10219 | link |
2024-06-14 | GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors | Xiqian Yu et.al. | 2406.10111 | null |
2024-06-14 | OrientDream: Streamlining Text-to-3D Generation with Explicit Orientation Control | Yuzhong Huang et.al. | 2406.10000 | null |
2024-06-14 | dGrasp: NeRF-Informed Implicit Grasp Policies with Supervised Optimization Slopes | Gergely Sóti et.al. | 2406.09939 | null |
2024-06-14 | RaNeuS: Ray-adaptive Neural Surface Reconstruction | Yida Wang et.al. | 2406.09801 | link |
2024-06-13 | Rethinking Score Distillation as a Bridge Between Image Distributions | David McAllister et.al. | 2406.09417 | null |
2024-06-13 | Preserving Identity with Variational Score for General-purpose 3D Editing | Duong H. Le et.al. | 2406.08953 | null |
2024-06-13 | Neural NeRF Compression | Tuan Pham et.al. | 2406.08943 | null |
2024-06-14 | AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis | Swapnil Bhosale et.al. | 2406.08920 | null |
2024-06-13 | NeRF Director: Revisiting View Selection in Neural Volume Rendering | Wenhui Xiao et.al. | 2406.08839 | null |
2024-06-12 | ICE-G: Image Conditional Editing of 3D Gaussian Splats | Vishnu Jaganathan et.al. | 2406.08488 | null |
2024-06-12 | OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding | Yinan Deng et.al. | 2406.08009 | link |
2024-06-12 | Spatial Annealing Smoothing for Efficient Few-shot Neural Rendering | Yuru Xiao et.al. | 2406.07828 | link |
2024-06-11 | C3DAG: Controlled 3D Animal Generation using 3D pose guidance | Sandeep Mishra et.al. | 2406.07742 | null |
2024-06-11 | M-LRM: Multi-view Large Reconstruction Model | Mengfei Li et.al. | 2406.07648 | null |
2024-06-11 | Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments | Christopher D. Hsu et.al. | 2406.07431 | null |
2024-06-11 | Generative Lifting of Multiview to 3D from Unknown Pose: Wrapping NeRF inside Diffusion | Xin Yuan et.al. | 2406.06972 | null |
2024-06-11 | Neural Visibility Field for Uncertainty-Driven Active Mapping | Shangjie Xue et.al. | 2406.06948 | null |
2024-06-10 | IllumiNeRF: 3D Relighting without Inverse Rendering | Xiaoming Zhao et.al. | 2406.06527 | null |
2024-06-10 | GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation | Haozhe Xie et.al. | 2406.06526 | link |
2024-06-10 | PGSR: Planar-based Gaussian Splatting for Efficient and High-Fidelity Surface Reconstruction | Danpeng Chen et.al. | 2406.06521 | null |
2024-06-10 | Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis | Xin Jin et.al. | 2406.06216 | link |
2024-06-10 | ExtraNeRF: Visibility-Aware View Extrapolation of Neural Radiance Fields with Diffusion Models | Meng-Li Shih et.al. | 2406.06133 | null |
2024-06-09 | GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement | Peiye Zhuang et.al. | 2406.05649 | null |
2024-06-07 | Multiplane Prior Guided Few-Shot Aerial Scene Rendering | Zihan Gao et.al. | 2406.04961 | null |
2024-06-07 | Multi-style Neural Radiance Field with AdaIN | Yu-Wen Pao et.al. | 2406.04960 | link |
2024-06-06 | Improving Physics-Augmented Continuum Neural Radiance Field-Based Geometry-Agnostic System Identification with Lagrangian Particle Optimization | Takuhiro Kaneko et.al. | 2406.04155 | null |
2024-06-06 | How Far Can We Compress Instant-NGP-Based NeRF? | Yihang Chen et.al. | 2406.04101 | link |
2024-06-06 | Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Temporal Sampling | Xinhang Liu et.al. | 2406.03723 | null |
2024-06-06 | Superpoint Gaussian Splatting for Real-Time High-Fidelity Dynamic Scene Reconstruction | Diwen Wan et.al. | 2406.03697 | link |
2024-06-04 | 3D-HGS: 3D Half-Gaussian Splatting | Haolin Li et.al. | 2406.02720 | link |
2024-06-06 | Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting | Inkyu Shin et.al. | 2406.02541 | null |
2024-06-04 | Query-based Semantic Gaussian Field for Scene Representation in Reinforcement Learning | Jiaxu Wang et.al. | 2406.02370 | null |
2024-06-03 | Reconstructing and Simulating Dynamic 3D Objects with Mesh-adsorbed Gaussian Splatting | Shaojie Ma et.al. | 2406.01593 | null |
2024-06-03 | Tetrahedron Splatting for 3D Generation | Chun Gu et.al. | 2406.01579 | link |
2024-06-03 | Self-Calibrating 4D Novel View Synthesis from Monocular Videos Using Gaussian Splatting | Fang Li et.al. | 2406.01042 | link |
2024-06-02 | PruNeRF: Segment-Centric Dataset Pruning via 3D Spatial Consistency | Yeonsung Jung et.al. | 2406.00798 | null |
2024-06-02 | Representing Animatable Avatar via Factorized Neural Fields | Chunjin Song et.al. | 2406.00637 | null |
2024-06-04 | SuperGaussian: Repurposing Video Models for 3D Super Resolution | Yuan Shen et.al. | 2406.00609 | null |
2024-06-02 | Efficient Neural Light Fields (ENeLF) for Mobile Devices | Austin Peng et.al. | 2406.00598 | null |
2024-06-01 | Bilateral Guided Radiance Field Processing | Yuehao Wang et.al. | 2406.00448 | null |
2024-05-31 | R |
Ruyi Zha et.al. | 2405.20693 | link |
2024-05-31 | 4Diffusion: Multi-view Video Diffusion Model for 4D Generation | Haiyu Zhang et.al. | 2405.20674 | null |
2024-05-30 | Nan Huang et.al. | 2405.20323 | link | |
2024-05-30 | TetSphere Splatting: Representing High-Quality Geometry with Lagrangian Volumetric Meshes | Minghao Guo et.al. | 2405.20283 | null |
2024-05-31 | NeRF View Synthesis: Subjective Quality Assessment and Objective Metrics Evaluation | Pedro Martin et.al. | 2405.20078 | null |
2024-05-30 | IReNe: Instant Recoloring in Neural Radiance Fields | Alessio Mazzucchelli et.al. | 2405.19876 | null |
2024-05-30 | HINT: Learning Complete Human Neural Representations from Limited Viewpoints | Alessandro Sanvito et.al. | 2405.19712 | null |
2024-05-30 | View-Consistent Hierarchical 3D SegmentationUsing Ultrametric Feature Fields | Haodi He et.al. | 2405.19678 | link |
2024-05-29 | Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy | Zijie Jiang et.al. | 2405.18863 | null |
2024-06-02 | NeRF On-the-go: Exploiting Uncertainty for Distractor-free NeRFs in the Wild | Weining Ren et.al. | 2405.18715 | link |
2024-05-28 | Self-supervised Pre-training for Transferable Multi-modal Perception | Xiaohao Xu et.al. | 2405.17942 | link |
2024-05-28 | A Refined 3D Gaussian Representation for High-Quality Dynamic Scene Reconstruction | Bin Zhang et.al. | 2405.17891 | null |
2024-05-29 | HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene Reconstruction | Haoyu Zhao et.al. | 2405.17872 | link |
2024-05-28 | Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh | Xiangjun Gao et.al. | 2405.17811 | null |
2024-05-28 | F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting | Xiangyu Sun et.al. | 2405.17083 | null |
2024-05-29 | PyGS: Large-scale Scene Representation with Pyramidal 3D Gaussian Splatting | Zipeng Wang et.al. | 2405.16829 | null |
2024-05-26 | Sp2360: Sparse-view 360 Scene Reconstruction using Cascaded 2D Diffusion Priors | Soumava Paul et.al. | 2405.16517 | null |
2024-05-24 | Neural Elevation Models for Terrain Mapping and Path Planning | Adam Dai et.al. | 2405.15227 | link |
2024-05-27 | HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting | Yuanhao Cai et.al. | 2405.15125 | link |
2024-05-24 | GS-Hider: Hiding Messages into 3D Gaussian Splatting | Xuanyu Zhang et.al. | 2405.15118 | null |
2024-05-23 | NeRF-Casting: Improved View-Dependent Appearance with Consistent Reflections | Dor Verbin et.al. | 2405.14871 | null |
2024-05-23 | Neural Directional Encoding for Efficient and Accurate View-Dependent Appearance Modeling | Liwen Wu et.al. | 2405.14847 | null |
2024-05-23 | Camera Relocalization in Shadow-free Neural Radiance Fields | Shiyao Xu et.al. | 2405.14824 | link |
2024-05-23 | LDM: Large Tensorial SDF Model for Textured Mesh Generation | Rengan Xie et.al. | 2405.14580 | link |
2024-05-23 | JointRF: End-to-End Joint Optimization for Dynamic Neural Radiance Field Representation and Compression | Zihan Zheng et.al. | 2405.14452 | null |
2024-05-22 | DoGaussian: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus | Yu Chen et.al. | 2405.13943 | link |
2024-05-22 | Gaussian Time Machine: A Real-Time Rendering Methodology for Time-Variant Appearances | Licheng Shen et.al. | 2405.13694 | null |
2024-05-21 | MOSS: Motion-based 3D Clothed Human Synthesis from Monocular Video | Hongsheng Wang et.al. | 2405.12806 | null |
2024-05-21 | Leveraging Neural Radiance Fields for Pose Estimation of an Unknown Space Object during Proximity Operations | Antoine Legrand et.al. | 2405.12728 | null |
2024-05-20 | Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo | Tianqi Liu et.al. | 2405.12218 | link |
2024-05-20 | Embracing Radiance Field Rendering in 6G: Over-the-Air Training and Inference with 3D Contents | Guanlin Wu et.al. | 2405.12155 | null |
2024-05-20 | NPLMV-PS: Neural Point-Light Multi-View Photometric Stereo | Fotios Logothetis et.al. | 2405.12057 | null |
2024-05-19 | Searching Realistic-Looking Adversarial Objects For Autonomous Driving Systems | Shengxiang Sun et.al. | 2405.11629 | null |
2024-05-19 | R-NeRF: Neural Radiance Fields for Modeling RIS-enabled Wireless Environments | Huiying Yang et.al. | 2405.11541 | link |
2024-05-18 | MotionGS : Compact Gaussian Splatting SLAM by Motion Filter | Xinli Guo et.al. | 2405.11129 | link |
2024-05-16 | When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models | Xianzheng Ma et.al. | 2405.10255 | link |
2024-05-15 | From NeRFs to Gaussian Splats, and Back | Siming He et.al. | 2405.09717 | link |
2024-05-14 | Dynamic NeRF: A Review | Jinwei Lin et.al. | 2405.08609 | null |
2024-05-13 | Synergistic Integration of Coordinate Network and Tensorial Feature for Improving Neural Radiance Fields from Sparse Inputs | Mingyu Kim et.al. | 2405.07857 | link |
2024-05-12 | Point Resampling and Ray Transformation Aid to Editable NeRF Models | Zhenyang Li et.al. | 2405.07306 | null |
2024-05-12 | Hologram: Realtime Holographic Overlays via LiDAR Augmented Reconstruction | Ekansh Agrawal et.al. | 2405.07178 | null |
2024-05-11 | TD-NeRF: Novel Truncated Depth Prior for Joint Camera Pose and Neural Radiance Field Optimization | Zhen Tan et.al. | 2405.07027 | link |
2024-05-10 | LIVE: LaTex Interactive Visual Editing | Jinwei Lin et.al. | 2405.06762 | null |
2024-05-14 | SketchDream: Sketch-based Text-to-3D Generation and Editing | Feng-Lin Liu et.al. | 2405.06461 | null |
2024-05-10 | Aerial-NeRF: Adaptive Spatial Partitioning and Sampling for Large-Scale Aerial Rendering | Xiaohan Zhang et.al. | 2405.06214 | null |
2024-05-10 | Residual-NeRF: Learning Residual NeRFs for Transparent Object Manipulation | Bardienus P. Duisterhof et.al. | 2405.06181 | null |
2024-05-09 | DragGaussian: Enabling Drag-style Manipulation on 3D Gaussian Representation | Sitian Shen et.al. | 2405.05800 | null |
2024-05-10 | NeRFFaceSpeech: One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior | Gihoon Kim et.al. | 2405.05749 | null |
2024-05-09 | RPBG: Towards Robust Neural Point-based Graphics in the Wild | Qingtian Zhu et.al. | 2405.05663 | link |
2024-05-09 | Benchmarking Neural Radiance Fields for Autonomous Robots: An Overview | Yuhang Ming et.al. | 2405.05526 | null |
2024-05-08 | Ning Wang et.al. | 2405.05010 | null | |
2024-05-08 | DistGrid: Scalable Scene Reconstruction with Distributed Multi-resolution Hash Grid | Sidun Liu et.al. | 2405.04416 | null |
2024-05-07 | Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications | Markus Hillemann et.al. | 2405.04345 | null |
2024-05-05 | Blending Distributed NeRFs with Tri-stage Robust Pose Optimization | Baijun Ye et.al. | 2405.02880 | null |
2024-05-05 | MVIP-NeRF: Multi-view 3D Inpainting on NeRF Scenes via Diffusion Prior | Honghua Chen et.al. | 2405.02859 | null |
2024-05-04 | TK-Planes: Tiered K-Planes with High Dimensional Feature Vectors for Dynamic UAV-based Scenes | Christopher Maxey et.al. | 2405.02762 | null |
2024-05-04 | ActiveNeuS: Active 3D Reconstruction using Neural Implicit Surface Uncertainty | Hyunseo Kim et.al. | 2405.02568 | null |
2024-05-03 | Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning | Dhruva Tirumala et.al. | 2405.02425 | null |
2024-05-03 | Rip-NeRF: Anti-aliasing Radiance Fields with Ripmap-Encoded Platonic Solids | Junchen Liu et.al. | 2405.02386 | link |
2024-05-03 | WateRF: Robust Watermarks in Radiance Fields for Protection of Copyrights | Youngdong Jang et.al. | 2405.02066 | null |
2024-05-02 | NeRF in Robotics: A Survey | Guangming Wang et.al. | 2405.01333 | null |
2024-05-04 | LidaRF: Delving into Lidar for Neural Radiance Field on Street Scenes | Shanlin Sun et.al. | 2405.00900 | null |
2024-05-01 | Depth Priors in Removal Neural Radiance Fields | Zhihao Guo et.al. | 2405.00630 | null |
2024-05-01 | NeRF-Guided Unsupervised Learning of RGB-D Registration | Zhinan Yu et.al. | 2405.00507 | null |
2024-05-01 | RTG-SLAM: Real-time 3D Reconstruction at Scale using Gaussian Splatting | Zhexi Peng et.al. | 2404.19706 | null |
2024-04-30 | NeRF-Insert: 3D Local Editing with Multimodal Control Signals | Benet Oriol Sabat et.al. | 2404.19204 | null |
2024-04-29 | SAGS: Structure-Aware 3D Gaussian Splatting | Evangelos Ververas et.al. | 2404.19149 | null |
2024-04-29 | GSTalker: Real-time Audio-Driven Talking Face Generation via Deformable Gaussian Splatting | Bo Chen et.al. | 2404.19040 | null |
2024-04-29 | Embedded Representation Learning Network for Animating Styled Video Portrait | Tianyong Wang et.al. | 2404.19038 | null |
2024-04-29 | Simple-RF: Regularizing Sparse Input Radiance Fields with Simpler Solutions | Nagabhushan Somraj et.al. | 2404.19015 | null |
2024-04-28 | S3-SLAM: Sparse Tri-plane Encoding for Neural Implicit SLAM | Zhiyao Zhang et.al. | 2404.18284 | null |
2024-04-27 | DPER: Diffusion Prior Driven Neural Representation for Limited Angle and Sparse View CT Reconstruction | Chenhe Du et.al. | 2404.17890 | null |
2024-04-26 | Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields | Tianqi Liu et.al. | 2404.17528 | link |
2024-04-25 | Depth Supervised Neural Surface Reconstruction from Airborne Imagery | Vincent Hackstein et.al. | 2404.16429 | null |
2024-04-24 | NeRF-XL: Scaling NeRFs with Multiple GPUs | Ruilong Li et.al. | 2404.16221 | null |
2024-04-24 | ESR-NeRF: Emissive Source Reconstruction Using LDR Multi-view Images | Jinseo Jeong et.al. | 2404.15707 | null |
2024-04-23 | DreamCraft: Text-Guided Generation of Functional 3D Environments in Minecraft | Sam Earle et.al. | 2404.15538 | null |
2024-04-28 | GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting | Hongyun Yu et.al. | 2404.14037 | null |
2024-04-22 | NeRF-DetS: Enhancing Multi-View 3D Object Detection with Sampling-adaptive Network of Continuous NeRF-based Representation | Chi Huang et.al. | 2404.13921 | null |
2024-04-23 | CT-NeRF: Incremental Optimizing Neural Radiance Field and Poses with Complex Trajectory | Yunlong Ran et.al. | 2404.13896 | null |
2024-04-26 | Neural Radiance Field in Autonomous Driving: A Survey | Lei He et.al. | 2404.13816 | null |
2024-04-26 | ArtNeRF: A Stylized Neural Field for 3D-Aware Cartoonized Face Synthesis | Zichen Tang et.al. | 2404.13711 | link |
2024-04-21 | Generalizable Novel-View Synthesis using a Stereo Camera | Haechan Lee et.al. | 2404.13541 | null |
2024-04-20 | High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces | Baoru Huang et.al. | 2404.13437 | null |
2024-04-20 | EC-SLAM: Real-time Dense Neural RGB-D SLAM System with Effectively Constrained Global Bundle Adjustment | Guanghao Li et.al. | 2404.13346 | link |
2024-04-19 | FlyNeRF: NeRF-Based Aerial Mapping for High-Quality 3D Scene Reconstruction | Maria Dronova et.al. | 2404.12970 | null |
2024-04-22 | Does Gaussian Splatting need SFM Initialization? | Yalda Foroutan et.al. | 2404.12547 | null |
2024-04-18 | MeshLRM: Large Reconstruction Model for High-Quality Mesh | Xinyue Wei et.al. | 2404.12385 | null |
2024-04-18 | AG-NeRF: Attention-guided Neural Radiance Fields for Multi-height Large-scale Outdoor Scene Rendering | Jingfeng Guo et.al. | 2404.11897 | link |
2024-04-18 | Cicero: Addressing Algorithmic and Architectural Bottlenecks in Neural Rendering by Radiance Warping and Memory Optimizations | Yu Feng et.al. | 2404.11852 | null |
2024-04-17 | SLAIM: Robust Dense Neural SLAM for Online Tracking and Mapping | Vincent Cartillier et.al. | 2404.11419 | null |
2024-04-16 | Gaussian Splatting Decoder for 3D-aware Generative Adversarial Networks | Florian Barthel et.al. | 2404.10625 | null |
2024-04-16 | Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences | Seungwook Kim et.al. | 2404.10603 | null |
2024-04-16 | 1st Place Solution for ICCV 2023 OmniObject3D Challenge: Sparse-View Reconstruction | Hang Du et.al. | 2404.10441 | null |
2024-04-16 | SRGS: Super-Resolution 3D Gaussian Splatting | Xiang Feng et.al. | 2404.10318 | link |
2024-04-16 | Plug-and-Play Acceleration of Occupancy Grid-based NeRF Rendering using VDB Grid and Hierarchical Ray Traversal | Yoshio Kato et.al. | 2404.10272 | link |
2024-04-15 | Taming Latent Diffusion Model for Neural Radiance Field Inpainting | Chieh Hubert Lin et.al. | 2404.09995 | null |
2024-04-15 | Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video | Hongchi Xia et.al. | 2404.09833 | null |
2024-04-15 | DeferredGS: Decoupled and Editable Gaussian Splatting with Deferred Shading | Tong Wu et.al. | 2404.09412 | null |
2024-04-14 | VRS-NeRF: Visual Relocalization with Sparse Neural Radiance Field | Fei Xue et.al. | 2404.09271 | link |
2024-04-15 | OccGaussian: 3D Gaussian Splatting for Occluded Human Rendering | Jingrui Ye et.al. | 2404.08449 | null |
2024-04-12 | GPN: Generative Point-based NeRF | Haipeng Wang et.al. | 2404.08312 | link |
2024-04-12 | MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance | Yuqun Wu et.al. | 2404.08252 | null |
2024-04-11 | Connecting NeRFs, Images, and Text | Francesco Ballerini et.al. | 2404.07993 | null |
2024-04-11 | Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation | Keonhee Han et.al. | 2404.07933 | link |
2024-04-12 | NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving | William Ljungbergh et.al. | 2404.07762 | link |
2024-04-11 | G-NeRF: Geometry-enhanced Novel View Synthesis from Single-View Images | Zixiong Huang et.al. | 2404.07474 | link |
2024-04-10 | SplatPose & Detect: Pose-Agnostic 3D Anomaly Detection | Mathis Kruse et.al. | 2404.06832 | link |
2024-04-10 | MonoSelfRecon: Purely Self-Supervised Explicit Generalizable 3D Reconstruction of Indoor Scenes from Monocular RGB Views | Runfa Li et.al. | 2404.06753 | null |
2024-04-10 | Bayesian NeRF: Quantifying Uncertainty with Volume Density in Neural Radiance Fields | Sibeak Lee et.al. | 2404.06727 | link |
2024-04-11 | SpikeNVS: Enhancing Novel View Synthesis from Blurry Images via Spike Camera | Gaole Dai et.al. | 2404.06710 | null |
2024-04-09 | Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion | Fan Yang et.al. | 2404.06429 | null |
2024-04-09 | 3D Geometry-aware Deformable Gaussian Splatting for Dynamic View Synthesis | Zhicheng Lu et.al. | 2404.06270 | null |
2024-04-09 | GHNeRF: Learning Generalizable Human Features with Efficient Neural Radiance Fields | Arnab Dey et.al. | 2404.06246 | null |
2024-04-09 | HFNeRF: Learning Human Biomechanic Features with Neural Radiance Fields | Arnab Dey et.al. | 2404.06152 | null |
2024-04-08 | Stylizing Sparse-View 3D Scenes with Hierarchical Neural Representation | Y. Wang et.al. | 2404.05236 | null |
2024-04-08 | StylizedGS: Controllable Stylization for 3D Gaussian Splatting | Dingxi Zhang et.al. | 2404.05220 | null |
2024-04-08 | Semantic Flow: Learning Semantic Field of Dynamic Scenes from Monocular Videos | Fengrui Tian et.al. | 2404.05163 | link |
2024-04-07 | CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis | Gyeongjin Kang et.al. | 2404.04913 | null |
2024-04-07 | GauU-Scene V2: Expanse Lidar Image Dataset Shows Unreliable Geometric Reconstruction Using Gaussian Splatting and NeRF | Butian Xiong et.al. | 2404.04880 | null |
2024-04-07 | NeRF2Points: Large-Scale Point Cloud Generation From Street Views' Radiance Field Optimization | Peng Tu et.al. | 2404.04875 | null |
2024-04-06 | DATENeRF: Depth-Aware Text-based Editing of NeRFs | Sara Rojas et.al. | 2404.04526 | null |
2024-04-05 | Robust Gaussian Splatting | François Darmon et.al. | 2404.04211 | null |
2024-04-04 | SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer | Zijie Wu et.al. | 2404.03736 | link |
2024-04-07 | RaFE: Generative Radiance Fields Restoration | Zhongkai Wu et.al. | 2404.03654 | null |
2024-04-04 | OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views | Francis Engelmann et.al. | 2404.03650 | null |
2024-04-04 | VF-NeRF: Viewshed Fields for Rigid NeRF Registration | Leo Segre et.al. | 2404.03349 | null |
2024-04-03 | GenN2N: Generative NeRF2NeRF Translation | Xiangyue Liu et.al. | 2404.02788 | null |
2024-04-03 | LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis | Zehan Zheng et.al. | 2404.02742 | link |
2024-04-03 | Neural Radiance Fields with Torch Units | Bingnan Ni et.al. | 2404.02617 | null |
2024-04-03 | Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition | Yisheng He et.al. | 2404.02514 | null |
2024-04-02 | NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation | Sicheng Li et.al. | 2404.02185 | null |
2024-04-02 | Alpha Invariance: On Inverse Scaling Between Distance and Volume Density in Neural Radiance Fields | Joshua Ahn et.al. | 2404.02155 | null |
2024-04-02 | Uncertainty-aware Active Learning of NeRF-based Object Models for Robot Manipulators using Visual and Re-orientation Actions | Saptarshi Dasgupta et.al. | 2404.01812 | null |
2024-04-01 | NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification | Juyeop Han et.al. | 2404.01400 | null |
2024-04-01 | NeRF-MAE : Masked AutoEncoders for Self Supervised 3D representation Learning for Neural Radiance Fields | Muhammad Zubair Irshad et.al. | 2404.01300 | link |
2024-04-01 | MagicMirror: Fast and High-Quality Avatar Generation with a Constrained Search Space | Armand Comas-Massagué et.al. | 2404.01296 | null |
2024-04-02 | StructLDM: Structured Latent Diffusion for 3D Human Generation | Tao Hu et.al. | 2404.01241 | null |
2024-04-01 | Mirror-3DGS: Incorporating Mirror Reflections into 3D Gaussian Splatting | Jiarui Meng et.al. | 2404.01168 | null |
2024-04-01 | SGCNeRF: Few-Shot Neural Rendering via Sparse Geometric Consistency Guidance | Yuru Xiao et.al. | 2404.00992 | null |
2024-04-01 | FlexiDreamer: Single Image-to-3D Generation with FlexiCubes | Ruowen Zhao et.al. | 2404.00987 | link |
2024-04-01 | Marrying NeRF with Feature Matching for One-step Pose Estimation | Ronghan Chen et.al. | 2404.00891 | null |
2024-03-29 | HGS-Mapping: Online Dense Mapping Using Hybrid Gaussian Representation in Urban Scenes | Ke Wu et.al. | 2403.20159 | null |
2024-03-29 | Talk3D: High-Fidelity Talking Portrait Synthesis via Personalized 3D Generative Prior | Jaehoon Ko et.al. | 2403.20153 | link |
2024-03-29 | SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior | Zhongrui Yu et.al. | 2403.20079 | null |
2024-03-29 | NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising | Tianchen Deng et.al. | 2403.20034 | link |
2024-03-29 | SCINeRF: Neural Radiance Fields from a Snapshot Compressive Image | Yunhao Li et.al. | 2403.20018 | link |
2024-03-29 | DerainNeRF: 3D Scene Estimation with Adhesive Waterdrop Removal | Yunhao Li et.al. | 2403.20013 | link |
2024-03-29 | Stable Surface Regularization for Fast Few-Shot NeRF | Byeongin Joung et.al. | 2403.19985 | null |
2024-03-29 | MI-NeRF: Learning a Single Face NeRF from Multiple Identities | Aggelina Chatziagapi et.al. | 2403.19920 | null |
2024-03-28 | Mitigating Motion Blur in Neural Radiance Fields with Events and Frames | Marco Cannici et.al. | 2403.19780 | link |
2024-03-28 | SAID-NeRF: Segmentation-AIDed NeRF for Depth Completion of Transparent Objects | Avinash Ummadisingu et.al. | 2403.19607 | null |
2024-03-28 | CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians | Avinash Paliwal et.al. | 2403.19495 | null |
2024-03-28 | Mesh2NeRF: Direct Mesh Supervision for Neural Radiance Field Representation and Generation | Yujin Chen et.al. | 2403.19319 | null |
2024-03-28 | Sine Activated Low-Rank Matrices for Parameter Efficient Learning | Yiping Ji et.al. | 2403.19243 | null |
2024-03-29 | Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction | Qiuhong Shen et.al. | 2403.18795 | link |
2024-03-27 | SAT-NGP : Unleashing Neural Graphics Primitives for Fast Relightable Transient-Free 3D reconstruction from Satellite Imagery | Camille Billouard et.al. | 2403.18711 | link |
2024-03-27 | Modeling uncertainty for Gaussian Splatting | Luca Savant et.al. | 2403.18476 | null |
2024-03-26 | Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D Gaussians | Kerui Ren et.al. | 2403.17898 | link |
2024-03-26 | NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation | Jiahao Chen et.al. | 2403.17537 | null |
2024-03-25 | VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation | Yang Chen et.al. | 2403.17001 | null |
2024-03-25 | CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs | Yingji Zhong et.al. | 2403.16885 | null |
2024-03-25 | Spike-NeRF: Neural Radiance Field Based On Spike Camera | Yijia Guo et.al. | 2403.16410 | null |
2024-03-24 | Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields | Haoyuan Wang et.al. | 2403.16224 | null |
2024-03-24 | Entity-NeRF: Detecting and Removing Moving Entities in Urban Scenes | Takashi Otonari et.al. | 2403.16141 | null |
2024-03-24 | CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field | Jiarui Hu et.al. | 2403.16095 | null |
2024-03-24 | Are NeRFs ready for autonomous driving? Towards closing the real-to-simulation gap | Carl Lindström et.al. | 2403.16092 | null |
2024-03-26 | PKU-DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human Modeling | Xiaoyun Zheng et.al. | 2403.16080 | link |
2024-03-24 | Semantic Is Enough: Only Semantic Information For NeRF Reconstruction | Ruibo Wang et.al. | 2403.16043 | null |
2024-03-24 | Exploring Accurate 3D Phenotyping in Greenhouse through Neural Radiance Fields | unhong Zhao et.al. | 2403.15981 | null |
2024-03-23 | DriveEnv-NeRF: Exploration of A NeRF-Based Autonomous Driving Environment for Real-World Performance Validation | Mu-Yi Shen et.al. | 2403.15791 | link |
2024-03-23 | UPNeRF: A Unified Framework for Monocular 3D Object Reconstruction and Pose Estimation | Yuliang Guo et.al. | 2403.15705 | link |
2024-03-22 | WSCLoc: Weakly-Supervised Sparse-View Camera Relocalization | Jialu Wang et.al. | 2403.15272 | null |
2024-03-21 | Hyperspectral Neural Radiance Fields | Gerry Chen et.al. | 2403.14839 | null |
2024-03-21 | ClusteringSDF: Self-Organized Neural Implicit Surfaces for 3D Decomposition | Tianhao Wu et.al. | 2403.14619 | null |
2024-03-21 | CombiNeRF: A Combination of Regularization Techniques for Few-Shot Neural Radiance Field View Synthesis | Matteo Bonotto et.al. | 2403.14412 | link |
2024-03-21 | InfNeRF: Towards Infinite Scale NeRF Rendering with O(log n) Space Complexity | Jiabin Liang et.al. | 2403.14376 | null |
2024-03-21 | Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions | Jiacong Xu et.al. | 2403.14053 | link |
2024-03-20 | MULAN-WC: Multi-Robot Localization Uncertainty-aware Active NeRF with Wireless Coordination | Weiying Wang et.al. | 2403.13348 | null |
2024-03-19 | Depth-guided NeRF Training via Earth Mover's Distance | Anita Rau et.al. | 2403.13206 | null |
2024-03-19 | DecentNeRFs: Decentralized Neural Radiance Fields from Crowdsourced Images | Zaid Tasneem et.al. | 2403.13199 | null |
2024-03-19 | Global-guided Focal Neural Radiance Field for Large-scale Scene Rendering | Mingqi Shao et.al. | 2403.12839 | null |
2024-03-19 | Learning Neural Volumetric Pose Features for Camera Localization | Jingyu Lin et.al. | 2403.12800 | null |
2024-03-19 | IFFNeRF: Initialisation Free and Fast 6DoF pose estimation from a single image and a NeRF model | Matteo Bortolon et.al. | 2403.12682 | null |
2024-03-18 | FLex: Joint Pose and Dynamic Radiance Fields Optimization for Stereo Endoscopic Videos | Florian Philipp Stilz et.al. | 2403.12198 | null |
2024-03-18 | ThermoNeRF: Multimodal Neural Radiance Fields for Thermal Novel View Synthesis | Mariam Hassan et.al. | 2403.12154 | link |
2024-03-18 | RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF | Sibi Catley-Chandar et.al. | 2403.11909 | null |
2024-03-18 | GNeRP: Gaussian-guided Neural Reconstruction of Reflective Objects with Noisy Polarization Priors | LI Yang et.al. | 2403.11899 | null |
2024-03-18 | Exploring Multi-modal Neural Scene Representations With Applications on Thermal Imaging | Mert Özer et.al. | 2403.11865 | null |
2024-03-19 | BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting | Lingzhe Zhao et.al. | 2403.11831 | link |
2024-03-18 | Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery | Yuqi Zhang et.al. | 2403.11812 | link |
2024-03-18 | DVN-SLAM: Dynamic Visual Neural SLAM Based on Local-Global Encoding | Wenhua Wu et.al. | 2403.11776 | null |
2024-03-18 | Exploring 3D-aware Latent Spaces for Efficiently Learning Numerous Scenes | Antoine Schnepf et.al. | 2403.11678 | null |
2024-03-18 | UV Gaussians: Joint Learning of Mesh Deformation and Gaussian Textures for Human Avatar Modeling | Yujiao Jiang et.al. | 2403.11589 | null |
2024-03-18 | Just Add $100 More: Augmenting NeRF-based Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem | Mincheol Chang et.al. | 2403.11573 | null |
2024-03-17 | Creating Seamless 3D Maps Using Radiance Fields | Sai Tarun Sathyan et.al. | 2403.11364 | null |
2024-03-17 | SpikeNeRF: Learning Neural Radiance Fields from Continuous Spike Stream | Lin Zhu et.al. | 2403.11222 | link |
2024-03-17 | Recent Advances in 3D Gaussian Splatting | Tong Wu et.al. | 2403.11134 | null |
2024-03-17 | Omni-Recon: Towards General-Purpose Neural Radiance Fields for Versatile 3D Applications | Yonggan Fu et.al. | 2403.11131 | link |
2024-03-16 | Fast Sparse View Guided NeRF Update for Object Reconfigurations | Ziqi Lu et.al. | 2403.11024 | null |
2024-03-16 | HourglassNeRF: Casting an Hourglass as a Bundle of Rays for Few-shot Neural Rendering | Seunghyeon Seo et.al. | 2403.10906 | null |
2024-03-15 | FeatUp: A Model-Agnostic Framework for Features at Any Resolution | Stephanie Fu et.al. | 2403.10516 | link |
2024-03-15 | Thermal-NeRF: Neural Radiance Fields from an Infrared Camera | Tianxiang Ye et.al. | 2403.10340 | link |
2024-03-15 | Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression | Huy-Hoang Bui et.al. | 2403.10297 | link |
2024-03-15 | GGRt: Towards Generalizable 3D Gaussians without Pose Priors in Real-Time | Hao Li et.al. | 2403.10147 | null |
2024-03-15 | URS-NeRF: Unordered Rolling Shutter Bundle Adjustment for Neural Radiance Fields | Bo Xu et.al. | 2403.10119 | null |
2024-03-15 | DyBluRF: Dynamic Neural Radiance Fields from Blurry Monocular Video | Huiqiang Sun et.al. | 2403.10103 | null |
2024-03-15 | Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience | Xiaohang Yu et.al. | 2403.09973 | null |
2024-03-14 | GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping | Yuhang Zheng et.al. | 2403.09637 | link |
2024-03-14 | The NeRFect Match: Exploring NeRF Features for Visual Localization | Qunjie Zhou et.al. | 2403.09577 | null |
2024-03-14 | VIRUS-NeRF -- Vision, InfraRed and UltraSonic based Neural Radiance Fields | Nicolaj Schmid et.al. | 2403.09477 | link |
2024-03-14 | 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation | Frank Zhang et.al. | 2403.09439 | null |
2024-03-14 | RoDUS: Robust Decomposition of Static and Dynamic Elements in Urban Scenes | Thang-Anh-Quan Nguyen et.al. | 2403.09419 | null |
2024-03-14 | PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors | Tianyuan Yuan et.al. | 2403.09079 | link |
2024-03-13 | Gaussian Splatting in Style | Abhishek Saroha et.al. | 2403.08498 | null |
2024-03-13 | StyleDyRF: Zero-shot 4D Style Transfer for Dynamic Neural Radiance Fields | Hongbin Xu et.al. | 2403.08310 | link |
2024-03-13 | NeRF-Supervised Feature Point Detection and Description | Ali Youssef et.al. | 2403.08156 | link |
2024-03-12 | Q-SLAM: Quadric Representations for Monocular SLAM | Chensheng Peng et.al. | 2403.08125 | null |
2024-03-12 | SMURF: Continuous Dynamics for Motion-Deblurring Radiance Fields | Jungho Lee et.al. | 2403.07547 | link |
2024-03-11 | SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection | Yifu Tao et.al. | 2403.06877 | null |
2024-03-11 | Vosh: Voxel-Mesh Hybrid Representation for Real-Time View Synthesis | Chenhao Zhang et.al. | 2403.06505 | null |
2024-03-13 | FSViewFusion: Few-Shots View Generation of Novel Objects | Rukhshanda Hussain et.al. | 2403.06394 | null |
2024-03-10 | Is Vanilla MLP in Neural Radiance Field Enough for Few-shot View Synthesis? | Hanxin Zhu et.al. | 2403.06092 | null |
2024-03-09 | Lightning NeRF: Efficient Hybrid Scene Representation for Autonomous Driving | Junyi Cao et.al. | 2403.05907 | link |
2024-03-09 | Large Generative Model Assisted 3D Semantic Communication | Feibo Jiang et.al. | 2403.05783 | null |
2024-03-08 | GSEdit: Efficient Text-Guided Editing of 3D Objects via Gaussian Splatting | Francesco Palandra et.al. | 2403.05154 | null |
2024-03-08 | Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces | Evangelos Skartados et.al. | 2403.04508 | null |
2024-03-07 | Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis | Yuanhao Cai et.al. | 2403.04116 | link |
2024-03-08 | DNAct: Diffusion Guided Multi-Task 3D Policy Learning | Ge Yan et.al. | 2403.04115 | null |
2024-03-07 | Closing the Visual Sim-to-Real Gap with Object-Composable NeRFs | Nikhil Mishra et.al. | 2403.04114 | link |
2024-03-06 | GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding | Zi-Ting Chou et.al. | 2403.03608 | null |
2024-03-05 | A Deep Learning Framework for Wireless Radiation Field Reconstruction and Channel Prediction | Haofan Lu et.al. | 2403.03241 | null |
2024-03-05 | Splat-Nav: Safe Real-Time Robot Navigation in Gaussian Splatting Maps | Timothy Chen et.al. | 2403.02751 | null |
2024-03-04 | DaReNeRF: Direction-aware Representation for Dynamic Scenes | Ange Lou et.al. | 2403.02265 | null |
2024-03-04 | Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views | Shuai Guo et.al. | 2403.02063 | null |
2024-03-02 | NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning | Linsheng Chen et.al. | 2403.01325 | link |
2024-03-02 | Neural radiance fields-based holography [Invited] | Minsung Kang et.al. | 2403.01137 | null |
2024-03-02 | Neural Field Classifiers via Target Encoding and Classification Loss | Xindi Yang et.al. | 2403.01058 | null |
2024-03-01 | DISORF: A Distributed Online NeRF Training and Rendering Framework for Mobile Robots | Chunlin Li et.al. | 2403.00228 | link |
2024-02-28 | NToP: NeRF-Powered Large-scale Dataset Generation for 2D and 3D Human Pose Estimation in Top-View Fisheye Images | Jingrui Yu et.al. | 2402.18196 | link |
2024-02-26 | Neural Radiance Fields in Medical Imaging: Challenges and Next Steps | Xin Wang et.al. | 2402.17797 | null |
2024-02-27 | Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning | Xiaoyu Zhang et.al. | 2402.17768 | null |
2024-02-27 | VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction | Jiaqi Lin et.al. | 2402.17427 | null |
2024-02-27 | Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis | Zicheng Zhang et.al. | 2402.17364 | link |
2024-02-27 | DivAvatar: Diverse 3D Avatar Generation with a Single Prompt | Weijing Tao et.al. | 2402.17292 | null |
2024-02-27 | CharNeRF: 3D Character Generation from Concept Art | Eddy Chu et.al. | 2402.17115 | null |
2024-02-26 | Disentangled 3D Scene Generation with Layout Learning | Dave Epstein et.al. | 2402.16936 | null |
2024-02-26 | CMC: Few-shot Novel View Synthesis via Cross-view Multiplane Consistency | Hanxin Zhu et.al. | 2402.16407 | null |
2024-02-26 | SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field | Zetian Song et.al. | 2402.16366 | null |
2024-02-26 | DreamUp3D: Object-Centric Generative Models for Single-View 3D Scene Understanding and Real-to-Sim Transfer | Yizhe Wu et.al. | 2402.16308 | null |
2024-02-22 | Consolidating Attention Features for Multi-view Image Editing | Or Patashnik et.al. | 2402.14792 | null |
2024-02-26 | FrameNeRF: A Simple and Efficient Framework for Few-shot Novel View Synthesis | Yan Xing et.al. | 2402.14586 | null |
2024-02-22 | NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection | Chenxi Huang et.al. | 2402.14464 | link |
2024-02-22 | TaylorGrid: Towards Fast and High-Quality Implicit Field Learning via Direct Taylor-based Grid Optimization | Renyi Mao et.al. | 2402.14415 | null |
2024-02-22 | Mip-Grid: Anti-aliased Grid Representations for Neural Radiance Fields | Seungtae Nam et.al. | 2402.14196 | null |
2024-02-21 | Identifying Unnecessary 3D Gaussians using Clustering for Fast Rendering of 3D Gaussian Splatting | Joongho Jo et.al. | 2402.13827 | null |
2024-02-21 | SealD-NeRF: Interactive Pixel-Level Editing for Dynamic Scenes by Neural Radiance Fields | Zhentao Huang et.al. | 2402.13510 | null |
2024-02-20 | How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey | Fabio Tosi et.al. | 2402.13255 | link |
2024-02-20 | Improving Robustness for Joint Optimization of Camera Poses and Decomposed Low-Rank Tensorial Radiance Fields | Bo-Yu Cheng et.al. | 2402.13252 | link |
2024-02-20 | NeRF Solves Undersampled MRI Reconstruction | Tae Jun Jang et.al. | 2402.13226 | null |
2024-02-20 | OccFlowNet: Towards Self-supervised Occupancy Estimation via Differentiable Rendering and Occupancy Flow | Simon Boeder et.al. | 2402.12792 | null |
2024-02-19 | Binary Opacity Grids: Capturing Fine Geometric Detail for Mesh-Based View Synthesis | Christian Reiser et.al. | 2402.12377 | null |
2024-02-19 | Colorizing Monochromatic Radiance Fields | Yean Cheng et.al. | 2402.12184 | null |
2024-02-17 | Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review | Thang-Anh-Quan Nguyen et.al. | 2402.11141 | link |
2024-02-15 | Evaluating NeRFs for 3D Plant Geometry Reconstruction in Field Conditions | Muhammad Arbab Arshad et.al. | 2402.10344 | null |
2024-02-14 | PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving Environments | Xiuzhong Hu et.al. | 2402.09325 | link |
2024-02-13 | Preconditioners for the Stochastic Training of Implicit Neural Representations | Shin-Fang Chng et.al. | 2402.08784 | null |
2024-02-13 | NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs | Michael Fischer et.al. | 2402.08622 | null |
2024-02-13 | H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object Surface Fields | Minyoung Park et.al. | 2402.08138 | null |
2024-02-12 | DeformNet: Latent Space Modeling and Dynamics Prediction for Deformable Object Manipulation | Chenchang Li et.al. | 2402.07648 | null |
2024-02-11 | BioNeRF: Biologically Plausible Neural Radiance Fields for View Synthesis | Leandro A. Passos et.al. | 2402.07310 | link |
2024-02-11 | 3D Gaussian as a New Vision Era: A Survey | Ben Fei et.al. | 2402.07181 | null |
2024-02-09 | ImplicitDeepfake: Plausible Face-Swapping through Implicit Deepfake Generation using NeRF and Gaussian Splatting | Georgii Stanishevskii et.al. | 2402.06390 | link |
2024-02-07 | NeRF as Non-Distant Environment Emitter in Physics-based Inverse Rendering | Jingwang Ling et.al. | 2402.04829 | null |
2024-02-07 | OV-NeRF: Open-vocabulary Neural Radiance Fields with Vision and Language Foundation Models for 3D Semantic Understanding | Guibiao Liao et.al. | 2402.04648 | link |
2024-02-11 | BirdNeRF: Fast Neural Reconstruction of Large-Scale Scenes From Aerial Imagery | Huiqing Zhang et.al. | 2402.04554 | null |