Skip to content

Latest commit

 

History

History
175 lines (169 loc) · 16.7 KB

File metadata and controls

175 lines (169 loc) · 16.7 KB

awesome_deep_learning_interpretability

深度学习近年来关于模型解释性的相关论文。

按引用次数排序可见引用排序

159篇论文pdf(有2篇需要上scihub找)上传到腾讯微云

不定期更新。

Year Publication Paper Citation code
2020 ICLR Knowledge Isomorphism between Neural Networks 0
2020 ICLR Interpretable Complex-Valued Neural Networks for Privacy Protection 2
2019 AI Explanation in artificial intelligence: Insights from the social sciences 495
2019 NMI Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead 142
2019 NeurIPS This looks like that: deep learning for interpretable image recognition 49 Pytorch
2019 NeurIPS A benchmark for interpretability methods in deep neural networks(同arxiv:1806.10758) 6
2019 NeurIPS Full-gradient representation for neural network visualization 3
2019 NeurIPS On the (In) fidelity and Sensitivity of Explanations 3
2019 NeurIPS Towards Automatic Concept-based Explanations 6 Tensorflow
2019 NeurIPS CXPlain: Causal explanations for model interpretation under uncertainty 3
2019 CVPR Interpreting CNNs via Decision Trees 65
2019 CVPR From Recognition to Cognition: Visual Commonsense Reasoning 63 Pytorch
2019 CVPR Attention branch network: Learning of attention mechanism for visual explanation 22
2019 CVPR Interpretable and fine-grained visual explanations for convolutional neural networks 9
2019 CVPR Learning to Explain with Complemental Examples 9
2019 CVPR Revealing Scenes by Inverting Structure from Motion Reconstructions 8 Tensorflow
2019 CVPR Multimodal Explanations by Predicting Counterfactuality in Videos 3
2019 CVPR Visualizing the Resilience of Deep Convolutional Network Interpretations 1
2019 ICCV U-CAM: Visual Explanation using Uncertainty based Class Activation Maps 9
2019 ICCV Towards Interpretable Face Recognition 7
2019 ICCV Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded 10
2019 ICCV Understanding Deep Networks via Extremal Perturbations and Smooth Masks 10 Pytorch
2019 ICCV Explaining Neural Networks Semantically and Quantitatively 4
2019 ICLR Hierarchical interpretations for neural network predictions 22 Pytorch
2019 ICLR How Important Is a Neuron? 10
2019 ICLR Visual Explanation by Interpretation: Improving Visual Feedback Capabilities of Deep Neural Networks 9
2018 ICML Extracting Automata from Recurrent Neural Networks Using Queries and Counterexamples 60 Pytorch
2019 ICML Towards A Deep and Unified Understanding of Deep Neural Models in NLP 10 Pytorch
2019 ICAIS Interpreting black box predictions using fisher kernels 14
2019 ACMFAT Explaining explanations in AI 78
2019 AAAI Interpretation of neural networks is fragile 87 Tensorflow
2019 AAAI Classifier-agnostic saliency map extraction 7
2019 AAAI Can You Explain That? Lucid Explanations Help Human-AI Collaborative Image Retrieval 1
2019 AAAIW Unsupervised Learning of Neural Networks to Explain Neural Networks 9
2019 AAAIW Network Transplanting 4
2019 CSUR A Survey of Methods for Explaining Black Box Models 455
2019 JVCIR Interpretable convolutional neural networks via feedforward design 28 Keras
2019 ExplainAI The (Un)reliability of saliency methods(scihub) 115
2019 ACL Attention is not Explanation 75
2019 arxiv Attention Interpretability Across NLP Tasks 5
2019 arxiv Interpretable CNNs 2
2018 ICLR Towards better understanding of gradient-based attribution methods for deep neural networks 151
2018 ICLR Learning how to explain neural networks: PatternNet and PatternAttribution 111
2018 ICLR On the importance of single directions for generalization 102 Pytorch
2018 ICLR Detecting statistical interactions from neural network weights 42 Pytorch
2018 ICLR Interpretable counting for visual question answering 25 Pytorch
2018 CVPR Interpretable Convolutional Neural Networks 190
2018 CVPR Tell me where to look: Guided attention inference network 98 Chainer
2018 CVPR Multimodal Explanations: Justifying Decisions and Pointing to the Evidence 94 Caffe
2018 CVPR Transparency by design: Closing the gap between performance and interpretability in visual reasoning 66 Pytorch
2018 CVPR Net2vec: Quantifying and explaining how concepts are encoded by filters in deep neural networks 45
2018 CVPR What have we learned from deep representations for action recognition? 23
2018 CVPR Learning to Act Properly: Predicting and Explaining Affordances from Images 21
2018 CVPR Teaching Categories to Human Learners with Visual Explanations 17 Pytorch
2018 CVPR What do Deep Networks Like to See? 12
2018 CVPR Interpret Neural Networks by Identifying Critical Data Routing Paths 10 Tensorflow
2018 ECCV Deep clustering for unsupervised learning of visual features 245 Pytorch
2018 ECCV Explainable neural computation via stack neural module networks 43 Tensorflow
2018 ECCV Grounding visual explanations 34
2018 ECCV Textual explanations for self-driving vehicles 42
2018 ECCV Interpretable basis decomposition for visual explanation 32 Pytorch
2018 ECCV Convnets and imagenet beyond accuracy: Understanding mistakes and uncovering biases 20
2018 ECCV Vqa-e: Explaining, elaborating, and enhancing your answers for visual questions 13
2018 ECCV Choose Your Neuron: Incorporating Domain Knowledge through Neuron-Importance 13 Pytorch
2018 ECCV Diverse feature visualizations reveal invariances in early layers of deep neural networks 7 Tensorflow
2018 ECCV ExplainGAN: Model Explanation via Decision Boundary Crossing Transformations 1
2018 ICML Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav) 148 Tensorflow
2018 ICML Learning to explain: An information-theoretic perspective on model interpretation 88
2018 ACL Did the Model Understand the Question? 39 Tensorflow
2018 FITEE Visual interpretability for deep learning: a survey 183
2018 NeurIPS Sanity Checks for Saliency Maps 165
2018 NeurIPS Explanations based on the missing: Towards contrastive explanations with pertinent negatives 47 Tensorflow
2018 NeurIPS Towards robust interpretability with self-explaining neural networks 94 Pytorch
2018 NeurIPS Attacks meet interpretability: Attribute-steered detection of adversarial samples 35
2018 NeurIPS Workshop Interpretable Convolutional Filters with SincNet 27
2018 NeurIPS DeepPINK: reproducible feature selection in deep neural networks 21 Keras
2018 NeurIPS Representer point selection for explaining deep neural networks 15 Tensorflow
2018 AAAI Anchors: High-precision model-agnostic explanations 269
2018 AAAI Improving the adversarial robustness and interpretability of deep neural networks by regularizing their input gradients 138 Tensorflow
2018 AAAI Deep learning for case-based reasoning through prototypes: A neural network that explains its predictions 80 Tensorflow
2018 AAAI Interpreting CNN Knowledge via an Explanatory Graph 67 Matlab
2018 AAAI Examining CNN Representations with respect to Dataset Bias 32
2018 WACV Grad-cam++: Generalized gradient-based visual explanations for deep convolutional networks 113
2018 IJCV Top-down neural attention by excitation backprop 285
2018 TPAMI Interpreting deep visual representations via network dissection 69
2018 DSP Methods for interpreting and understanding deep neural networks(scihub) 568
2018 Access Peeking inside the black-box: A survey on Explainable Artificial Intelligence (XAI) 228
2018 JAIR Learning Explanatory Rules from Noisy Data 111 Tensorflow
2018 MIPRO Explainable artificial intelligence: A survey 82
2018 AIES Detecting Bias in Black-Box Models Using Transparent Model Distillation 28
2018 BMVC Rise: Randomized input sampling for explanation of black-box models 46
2018 arxiv Manipulating and measuring model interpretability 101
2018 arxiv How convolutional neural network see the world-A survey of convolutional neural network visualization methods 33
2018 arxiv Revisiting the importance of individual units in cnns via ablation 31
2018 arxiv Computationally Efficient Measures of Internal Neuron Importance 1
2017 ICML Understanding Black-box Predictions via Influence Functions 610 Pytorch
2017 ICML Axiomatic attribution for deep networks 561 Keras
2017 ICML Learning Important Features Through Propagating Activation Differences 489
2017 ICLR Visualizing deep neural network decisions: Prediction difference analysis 231 Caffe
2017 ICLR Exploring LOTS in Deep Neural Networks 26
2017 NeurIPS A Unified Approach to Interpreting Model Predictions 894
2017 NeurIPS Real time image saliency for black box classifiers 128 Pytorch
2017 NeurIPS SVCCA: Singular Vector Canonical Correlation Analysis for Deep Learning Dynamics and Interpretability 122
2017 CVPR Mining Object Parts from CNNs via Active Question-Answering 16
2017 CVPR Network dissection: Quantifying interpretability of deep visual representations 435
2017 CVPR Improving Interpretability of Deep Neural Networks with Semantic Information 52
2017 CVPR MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network 100 Torch
2017 CVPR Interpretable 3d human action analysis with temporal convolutional networks 133
2017 CVPR Making the V in VQA matter: Elevating the role of image understanding in Visual Question Answering 454
2017 CVPR Knowing when to look: Adaptive attention via a visual sentinel for image captioning 525 Torch
2017 ICCV Grad-cam: Visual explanations from deep networks via gradient-based localization 1755 Pytorch
2017 ICCV Interpretable Explanations of Black Boxes by Meaningful Perturbation 339 Pytorch
2017 ICCV Interpretable Learning for Self-Driving Cars by Visualizing Causal Attention 96
2017 ICCV Understanding and comparing deep neural networks for age and gender classification 48
2017 ICCV Learning to disambiguate by asking discriminative questions 11
2017 IJCAI Right for the right reasons: Training differentiable models by constraining their explanations 118
2017 IJCAI Understanding and improving convolutional neural networks via concatenated rectified linear units 237 Caffe
2017 AAAI Growing Interpretable Part Graphs on ConvNets via Multi-Shot Learning 28 Matlab
2017 ACL Visualizing and Understanding Neural Machine Translation 63
2017 EMNLP A causal framework for explaining the predictions of black-box sequence-to-sequence models 68
2017 CVPR Workshop Looking under the hood: Deep neural network visualization to interpret whole-slide image analysis outcomes for colorectal polyps 16
2017 survey Interpretability of deep learning models: a survey of results 63
2017 arxiv SmoothGrad: removing noise by adding noise 265
2017 arxiv Interpretable & explorable approximations of black box models 91
2017 arxiv Distilling a neural network into a soft decision tree 153 Pytorch
2017 arxiv Towards interpretable deep neural networks by leveraging adversarial examples 52
2017 arxiv Explainable artificial intelligence: Understanding, visualizing and interpreting deep learning models 294
2017 arxiv Contextual Explanation Networks 32 Pytorch
2017 arxiv Challenges for transparency 74
2017 ACMSOPP Deepxplore: Automated whitebox testing of deep learning systems 364
2017 CEURW What does explainable AI really mean? A new conceptualization of perspectives 83
2017 TVCG ActiVis: Visual Exploration of Industry-Scale Deep Neural Network Models 131
2016 NeurIPS Synthesizing the preferred inputs for neurons in neural networks via deep generator networks 277 Caffe
2016 NeurIPS Understanding the effective receptive field in deep convolutional neural networks 358
2016 CVPR Inverting Visual Representations with Convolutional Networks 295
2016 CVPR Visualizing and Understanding Deep Texture Representations 89
2016 CVPR Analyzing Classifiers: Fisher Vectors and Deep Neural Networks 94
2016 ECCV Generating Visual Explanations 256 Caffe
2016 ECCV Design of kernels in convolutional neural networks for image classification 13
2016 ICML Understanding and improving convolutional neural networks via concatenated rectified linear units 237
2016 ICML Visualizing and comparing AlexNet and VGG using deconvolutional layers 31
2016 EMNLP Rationalizing Neural Predictions 276 Pytorch
2016 IJCV Visualizing deep convolutional neural networks using natural pre-images 243 Matlab
2016 IJCV Visualizing Object Detection Features 23 Caffe
2016 KDD Why should i trust you?: Explaining the predictions of any classifier 2766
2016 TVCG Visualizing the hidden activity of artificial neural networks 143
2016 TVCG Towards better analysis of deep convolutional neural networks 210
2016 NAACL Visualizing and understanding neural models in nlp 292 Torch
2016 arxiv Understanding neural networks through representation erasure 150
2016 arxiv Grad-CAM: Why did you say that? 106
2016 arxiv Investigating the influence of noise and distractors on the interpretation of neural networks 29
2016 arxiv Attentive Explanations: Justifying Decisions and Pointing to the Evidence 46
2016 arxiv The Mythos of Model Interpretability 1108
2016 arxiv Multifaceted feature visualization: Uncovering the different types of features learned by each neuron in deep neural networks 140
2015 ICLR Striving for Simplicity: The All Convolutional Net 1998 Pytorch
2015 CVPR Understanding deep image representations by inverting them 1009 Matlab
2015 ICCV Understanding deep features with computer-generated imagery 100 Caffe
2015 ICML Workshop Understanding Neural Networks Through Deep Visualization 1072 Tensorflow
2015 AAS Interpretable classifiers using rules and Bayesian analysis: Building a better stroke prediction model 331
2014 ECCV Visualizing and Understanding Convolutional Networks 8856 Pytorch
2014 ICLR Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps 2277 Pytorch
  • 论文talk