Skip to content

Latest commit

 

History

History
39 lines (30 loc) · 2.81 KB

README.md

File metadata and controls

39 lines (30 loc) · 2.81 KB

SIPvsML

Software Integrity Protection Versus Machine Learning attacks

This repository hosts implementation code for Master's Thesis. The work evaluates effectiveness of Obfuscation & Software Integrity Protection schemes against Machine Learning-based attacks.

The image below summarizes the results:

Project Structure

  • /sip_ml_pipeline - Contains entire ML pipeline from data generation to rendering result charts
  • /notebooks - Interactive notebooks for data examination
  • /code2vec - Reference to external code embedding component
  • /diagrams - Draw.io diagram xml file sources

Requirements

python3 -m venv venv &&
source venv/bin/activate &&
pip install -r requirements.txt

Training Data

The full training data, including features, splits and results is ~500GB. Raw Data only include source programs without preprocessing or feature extraction. Results Data only contains result .json files and model weights.