MLProjectTemplate

A Data Science project template with MLOps frameworks

Setting Up the Project using this template:

Click on Use this template -> create a new repository
Put your repository name and description -> create repository
Clone the repository to your local machine
Rename/Update:
1. template.py: update the project name and add more files/folders [Optional]
2. the project folder (src/project_name) to (src/YourProjectName)
3. README
4. requirements.txt with your project dependencies
5. setup.py with your/your project details
Create a virtual environment using setup.sh:
```
bash setup.sh 
```
Activate the virtual environment (optional, if not done in step 6):
```
source activate ./venv
```

Workflow:

Update config: config/config.yaml
Update the entity: src/project_name/entity/config_entity.py
Update the configuration manager: src/project_name/config/configuration.py
Update the components: src/project_name/components
Update the pipeline: src/project_name/pipeline
Update entry point: main.py
Update application: app.py

Project Organization:

|── .dvc                                <- dvc cache, config, remote info
|── .github                             <- ci/cd workflows
|── artifacts                           <- o/p of pipe that are needed for execution of next pipeline
|── config                              <- configurations for all the components of the project
|── docs                                <- documentation of the project src code
|── flowcharts                          <- flowcharts of the project components, overall, etc.
|── logs                                <- log file generated while running the component/s
|── mlruns                              <- MLFlow: all ml experiments and their artifacts 
|── notebooks                           <- jupyter-notebooks
|    |__ data                           <- sample data to do experiments in the notebook
|
|── references                          <- quick reference for the project. e.g. research papers
|── reports                             <- reports generated in the project life-cycle
|	|__ figures                         
|
|── scripts                             <- shell scripts to invoke operations from the command line
|   |── setup.sh                        <- initial setup of the project
|   |__ test.sh                         <- run tests
|
|── src
|    |__ ProjectName
|        |── __init__.py
|        |── logger.py                  <- A custom logger to stream logs in a terminal/log in file
|        |── exception.py               <- A custom exception handler for the project
|        |── components                 <- components of the ml pipeline
|        |    |── __init__.py
|        |    |── data_ingestion.py
|        |    |── data_valaidation.py
|        |    |── data_preprocessing.py
|        |    |── data_transformation.py
|        |    |── model_training.py
|        |    |── model_evaluation.py
|        |    |__ prediction.py
|        |    
|        |── config                     <- configuration for the component
|        |    |── configration.py
|        |    |__ __init__.py
|        |
|        |── constants                  <- constants used in the project
|        |    |── __init__.py
|        |        |── PROJECT_ROOT
|        |
|        |── entity                     < datasclass for each component
|        |    |── __init__.py
|        |    |__ config_entity.py
|        |
|        |── pipeline                   <- all steps in the pipeline
|        |    |── __init__.py
|        |    |── data_ingestion.py
|        |    |── data_valaidation.py
|        |    |── data_preprocessing.py
|        |    |── data_transformation.py
|        |    |── model_training.py
|        |    |── model_evaluation.py
|        |    |__ predict.py
|        |   
|        |── utils                      <- all utilities used in the project
|             |── __init__.py
|             |__ common.py	        <- common utilities for the project
|
|── static                              <- assets/CSS/JS
|     |── assets
|         |__ img
|
|── templates                           <- HTML
|    |── index.html
|    |__ results.html
|
|── tests                               <- tests for the project
|    |── unit
|    |__ integration
|
|── .dvcignore
|── .gitignore
|── app.py                              <- expose application via API
|── DockerFile                          <- containerize the application
|── dvc.yaml                            <- DVC pipeline
|── LICENSE
|── main.py                             <- entry point for the project
|── README.md
|── requirements.txt
|── setup.py                            <- package project 
|__ template.py                         <- creates empty files/folders

Owner: Aashish Kumar

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MLProjectTemplate

Setting Up the Project using this template:

Workflow:

Project Organization:

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
.dvc		.dvc
.github/workflows		.github/workflows
artifacts		artifacts
config		config
docs		docs
flowcharts		flowcharts
logs		logs
mlruns		mlruns
notebooks		notebooks
references		references
reports		reports
scripts		scripts
src/project_name		src/project_name
static/assets/img		static/assets/img
templates		templates
tests		tests
.dvcignore		.dvcignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
app.py		app.py
dvc.yaml		dvc.yaml
main.py		main.py
requirements.txt		requirements.txt
setup.py		setup.py
template.py		template.py

License

heydido/MLProjectTemplate

Folders and files

Latest commit

History

Repository files navigation

MLProjectTemplate

Setting Up the Project using this template:

Workflow:

Project Organization:

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages