
End-to-end ML pipeline written exclusively with open-source tools

arghhjayy/EndToEndML

End-to-End ML Project

Project setup:

  1. Open this repository in VS Code
  2. Install the Dev Containers extension
  3. Press Cmd + Shift + P -> Dev Containers: Rebuild Container Without Cache
  4. Activate the conda virtual environment: `source activate endtoend`
  5. Inside the Dev Container, start the local MLflow and Prefect servers: `nohup bash ./start_backend.sh`
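The steps above assume a dev container definition lives in the repo. As a rough illustration only (the image, env file name, and fields here are assumptions, not copied from this repo), a minimal `.devcontainer/devcontainer.json` for a conda-based setup could look like:

```json
{
  "name": "endtoend",
  "image": "continuumio/miniconda3",
  "postCreateCommand": "conda env create -f environment.yml"
}
```

Rebuilding without cache forces `postCreateCommand` to run again, which is why step 3 recreates the conda environment from scratch.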

Model training:

Run: `python main.py`
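To give a feel for what a `main.py` entrypoint in this kind of stack does, here is a minimal train-and-evaluate sketch with scikit-learn. The function name, model choice, and toy dataset are hypothetical stand-ins, not the repo's actual code; in the real pipeline the metric would additionally be logged to MLflow.

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split


def train_flow():
    # Toy dataset as a stand-in for the project's real data source
    X, y = load_iris(return_X_y=True)
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.2, random_state=42
    )

    # Fit a baseline model
    model = RandomForestClassifier(n_estimators=100, random_state=42)
    model.fit(X_train, y_train)

    # Evaluate on the held-out split; the real flow would log this to MLflow
    accuracy = accuracy_score(y_test, model.predict(X_test))
    return model, accuracy


if __name__ == "__main__":
    model, accuracy = train_flow()
    print(f"test accuracy: {accuracy:.3f}")
```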

Model training using Docker:

Build: `docker build . -t endtoend:latest`

Run: `docker run endtoend:latest`
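The repo's actual Dockerfile may differ; a typical training image for a project like this (base image and file names assumed) is roughly:

```dockerfile
FROM python:3.10-slim
WORKDIR /app
# Install dependencies first so Docker layer caching skips this on code-only changes
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt
COPY . .
CMD ["python", "main.py"]
```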

Model serving (deployment):

For batch inference, do the following:

  1. Start the data generation worker process in one terminal: `make start_data_generator_worker`
  2. Start the batch inference worker process in another terminal: `make start_batch_inference_worker`
  3. Deploy the flows from steps 1 and 2: `prefect deploy --all`

To run the flows more frequently for debugging, adjust the cron expressions in `prefect.yaml`.
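For reference, each deployment's schedule in `prefect.yaml` is driven by a cron field. A sketch of the relevant section (deployment name and entrypoint are assumptions, not copied from this repo):

```yaml
deployments:
  - name: batch-inference
    entrypoint: batch_inference.py:batch_inference_flow
    schedule:
      cron: "*/5 * * * *"  # every 5 minutes while debugging
```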

Tools used

  • pandas for data processing/engineering
  • scikit-learn for feature engineering and model development
  • pytest for testing
  • MLflow for experiment tracking
  • Prefect for workflow management (done) and orchestration (TBD)
  • Black, isort, and Flake8 for code styling and linting
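Black and isort can disagree on import formatting unless isort is told to follow Black's style. A common `pyproject.toml` snippet for this (the exact config in this repo is an assumption):

```toml
[tool.black]
line-length = 88

[tool.isort]
profile = "black"
```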

TODO list:

  • ✅ Train and test flow
  • ✅ Log metrics and artifacts to MLflow
  • ✅ Prefect for workflows
  • ✅ Makefile
  • ✅ Basic tests
  • ⌛ Model monitoring + scheduling it
  • ⌛ Containerization
  • ✅ Use databases for input/output
  • 🔜 Feature store and vector store - TBD
  • 🔜 Streaming features
