This repository contains all the examples needed for the MLOps Hackathon.
The goal of the hackathon is to train and deploy an ML model on different platforms, while using multiple MLOps tools to track and monitor the results.
To install the dependencies:

```bash
conda create -n mlopshackathon python=3.9
conda activate mlopshackathon
pip install -r requirements.txt
```
To train a model, execute the following script:

```bash
python training.py --max_epochs=5 --gpu=-1
```
To perform basic inference, execute the following script:

```bash
python inference.py --checkpoint files/weights/MNIST_classifier_mobilenetv3_rwepoch=4-val_loss=0.04.ckpt --image files/imgs_inference/MNIST_digit.png
```
These scripts should be enough to try out the various ML platforms, whether for training or for deployment.
The hackathon is split into three parts: training platforms, inference platforms, and MLOps solutions. These should be approached in parallel by multiple teams.
The training team should take the training code and run trainings on the different platforms. Notes should be taken on how each platform approaches training and what it offers for our use case. Training times, prices, and performance should be benchmarked as well (see the timing sketch after this list). Platforms to try out:
- Grid.ai
- Vertex AI
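
One way to collect comparable timings on every platform is a small wrapper around the training script, as in the sketch below. It only assumes `training.py` and the flags shown above; translating wall-clock time into price is left to each platform's pricing page.

```python
"""Hypothetical timing wrapper for benchmarking training runs."""
import subprocess
import time


def benchmark_training(max_epochs: int = 5, gpu: int = -1) -> float:
    """Run training.py once and return the wall-clock time in seconds."""
    start = time.perf_counter()
    subprocess.run(
        ["python", "training.py", f"--max_epochs={max_epochs}", f"--gpu={gpu}"],
        check=True,  # fail loudly if the training run errors out
    )
    return time.perf_counter() - start


if __name__ == "__main__":
    elapsed = benchmark_training()
    print(f"Training took {elapsed:.1f} s")
```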
The inference team should take the inference script with the provided weights and deploy it to different platforms. In parallel, they should develop a script to test the platforms at different loads (1-10000 simultaneous inferences); a minimal load tester is sketched after the list below. Notes should be taken on how easy each platform is to deploy to, as well as on the range of deployment services offered. Platforms to try out:
- Vertex AI
- ...
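
A minimal load-testing sketch is shown below, using `asyncio` and `aiohttp` (`pip install aiohttp`). The endpoint URL and the JSON payload format are assumptions that will differ per platform; adapt them to each deployment's actual API.

```python
"""Hypothetical load tester: fire N simultaneous inference requests."""
import argparse
import asyncio
import base64
import time

import aiohttp

# Placeholders: adapt to the platform's actual endpoint and request schema.
ENDPOINT_URL = "http://localhost:8080/predict"
IMAGE_PATH = "files/imgs_inference/MNIST_digit.png"


async def send_request(session: aiohttp.ClientSession, payload: dict) -> int:
    """POST one inference request and return the HTTP status code."""
    async with session.post(ENDPOINT_URL, json=payload) as resp:
        await resp.read()
        return resp.status


async def run_load_test(n_requests: int) -> None:
    """Send n_requests concurrently and report throughput."""
    with open(IMAGE_PATH, "rb") as f:
        payload = {"image": base64.b64encode(f.read()).decode()}
    start = time.perf_counter()
    async with aiohttp.ClientSession() as session:
        statuses = await asyncio.gather(
            *(send_request(session, payload) for _ in range(n_requests))
        )
    elapsed = time.perf_counter() - start
    ok = sum(status == 200 for status in statuses)
    print(f"{n_requests} requests in {elapsed:.2f} s ({ok} OK)")


if __name__ == "__main__":
    parser = argparse.ArgumentParser()
    parser.add_argument("--n", type=int, default=100, help="simultaneous requests")
    args = parser.parse_args()
    asyncio.run(run_load_test(args.n))
```

Run it at increasing loads (e.g. `--n 1`, `--n 100`, `--n 10000`) and record latency and error rates for each platform.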
The MLOps team should test the integration of different frameworks into the code, making sure the correct metrics are captured. The ideal scenario is that all the trainings and inferences performed by the two other teams can be centrally monitored, and the results compared across multiple platforms (see the MLflow sketch after this list). Frameworks to test:
- MLflow
- Kubeflow
- ClearML
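
As a starting point for MLflow, the sketch below shows how a run could be logged to a central tracking server. The tracking URI, experiment name, and metric values are placeholders; in practice the logging calls would be wired into `training.py`.

```python
"""Hypothetical MLflow integration for centrally tracked runs."""
import mlflow

mlflow.set_tracking_uri("http://localhost:5000")  # placeholder central server
mlflow.set_experiment("mlops-hackathon")

with mlflow.start_run(run_name="mnist-mobilenetv3"):
    mlflow.log_param("max_epochs", 5)
    mlflow.log_param("platform", "local")  # e.g. "grid.ai", "vertex-ai"
    # Dummy values for illustration; log the real val_loss from the training loop.
    for epoch, val_loss in enumerate([0.30, 0.15, 0.08, 0.05, 0.04]):
        mlflow.log_metric("val_loss", val_loss, step=epoch)
```

Kubeflow and ClearML expose their own tracking interfaces; the aim is the same in each case, one centrally browsable record of all runs.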