Deep-Learning-with-Differential-Privacy

Objective

This project serves as an introduction to reading ML literature, and then applying this knowledge to deep learning and differential privacy concerns. The goal of this project is to understand deep learning models and how to protect the privacy of an individual’s data. Different algorithmic techniques for learning was implemented on medical image datasets and an analysis of privacy costs within the framework of differential privacy was be completed to evaluate the merits and room for improvement of different techniques. This project's deliverable was a research paper summarizing the results found throughout the fall and winter semesters. The paper can be found here.

Team Members

Nicole Streltsov (@NicoleStrel)
Ritvik Jayanthi (@RitvikJayanthi)
Alec Dong (@AlecDong)
Ria Upreti (@ria-upreti)
Akriti Sharma (@Akriti-Sharma1)
Bolade Amoussou (@cdw18)
Mikhael Orteza (@xPreliator)
Divya Gupta (@gdivyagupta)

Datasets:
- /chest-data/: gzip Numpy array files, from Chest Pneumonia X-ray images dataset
- /knee-data/: gzip Numpy array files, from Knee Osteoarthritis X-ray images dataset
Techniques:
- /DP-SGD/ (Tensorflow Objax): Differential Privacy with Stochiastic Gradient Descent, from the paper Abadi et al.
- /DP-SGD-JL/ (Tensorflow Keras): Differential Privacy with Stochastic Gradient Descent and JL Projections, from the paper Bu et al.
- /DP-SGD-FL/ (PyTorch): Differential Privacy with Stochastic Gradient Descent and Federated Learning, referencing the paper Wei et al.
- /PATE/ (PyTorch): Private Aggregation of Teacher Ensembles (PATE) algorithm, from the paper Uniyal et al.
Python Scripts:
- load_dataset_into_pickle.py: reads a directory of images, transforms the data into Numpy arrays, applies data segmentation and saves into gzip pickle files.
- visualize_dataset.py: reads a directory of images to create a scatter plot of image size and label distribution.
- metrics_calc_helper_functions.py: helper functions to calculate metrics for comparison, and to dump the data into text files.
- runtime_and_memory_graphs.py: generates graphs to compare memory/runtime of all techniques for the chest and knee datasets.
Metrics:
- /metrics/: stores text files of the metrics from our techniques for both the chest and knee datasets

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep-Learning-with-Differential-Privacy

Objective

Team Members

Contents

About

Releases

Packages

Contributors 8

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 118 Commits
DP-PATE		DP-PATE
DP-SGD-FL		DP-SGD-FL
DP-SGD-JL		DP-SGD-JL
DP-SGD		DP-SGD
chest-data		chest-data
knee-data		knee-data
metrics		metrics
.gitignore		.gitignore
README.md		README.md
load_dataset_into_pickle.py		load_dataset_into_pickle.py
metrics_calc_helper_functions.py		metrics_calc_helper_functions.py
runtime_and_memory_graphs.py		runtime_and_memory_graphs.py
visualize_dataset.py		visualize_dataset.py

NicoleStrel/Deep-Learning-with-Differential-Privacy

Folders and files

Latest commit

History

Repository files navigation

Deep-Learning-with-Differential-Privacy

Objective

Team Members

Contents

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 8

Languages

Packages