eyadsibai/machine-learning-docker-image

Machine Learning/Data Science Platform (Docker Image)

Requirements

  • docker
  • docker-machine (to deploy to Google Cloud)

How to use (in Google Cloud)

docker-machine create docker-dsp -d google --google-project={project_id} --google-machine-type n1-highmem-8 --google-disk-size "10" --google-disk-type "pd-standard" --google-preemptible --google-machine-image ubuntu-os-cloud/global/images/family/ubuntu-1404-lts --google-scopes "https://www.googleapis.com/auth/cloud-platform"
eval $(docker-machine env docker-dsp)
docker run -d -p 8080:8888 -e "PROJECT_ID={project_id}" eyadsibai/docker-dsp start.sh jupyter lab --NotebookApp.token=''
  • get the ip address of the machine
docker-machine ip docker-dsp
  • open http://{ip}:8080 in a browser, where {ip} is the address printed by docker-machine ip docker-dsp
  • to stop the machine
docker-machine stop docker-dsp
  • to start it again
docker-machine start docker-dsp
  • to delete the instance
docker-machine rm docker-dsp
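
The IP lookup and the URL step can be combined in a small shell sketch. The function name notebook_url is hypothetical (it is not part of this repository), and the default port 8080 is simply the one used in the URL above.

```shell
# Hypothetical helper: build the notebook URL from an IP address and a port.
# The port defaults to 8080, the port used in the URL step above.
notebook_url() {
  ip="$1"
  port="${2:-8080}"
  printf 'http://%s:%s\n' "$ip" "$port"
}

# Usage (requires a running docker-dsp machine):
#   notebook_url "$(docker-machine ip docker-dsp)"
```

Passing the IP as an argument keeps the helper usable without docker-machine, for example when the address is already known.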

Note

A stopped machine costs nothing except for its attached disk. On Google Cloud, a 10 GB disk costs about $0.40/month.
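
The figure in the note works out to roughly $0.04 per GB per month. The sketch below applies that rate to other disk sizes. The rate is inferred from the note's own numbers, not taken from a price list, and actual pricing varies by region and over time. The function name disk_cost is hypothetical.

```shell
# Assumed rate in USD per GB per month, inferred from "10GB ~ $0.4/month".
RATE_PER_GB=0.04

# Hypothetical helper: estimate the monthly cost of a disk of the given size in GB.
disk_cost() {
  awk -v gb="$1" -v rate="$RATE_PER_GB" 'BEGIN { printf "%.2f\n", gb * rate }'
}

disk_cost 10   # prints 0.40
```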

How to use (Locally)

docker run -d -p 8888:8888 -v <local path>:/home/jovyan/work eyadsibai/docker-dsp start.sh jupyter lab --NotebookApp.token=''
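
As a convenience, the command above can be generated for any host directory. This is only a sketch: the function name local_run_cmd is hypothetical, and defaulting to the current directory is an assumption. It prints the command instead of running it, so it can be inspected first.

```shell
# Hypothetical helper: print the local docker run command for a host directory.
# Defaults to the current working directory when no argument is given.
local_run_cmd() {
  host_dir="${1:-$PWD}"
  printf '%s\n' "docker run -d -p 8888:8888 -v ${host_dir}:/home/jovyan/work eyadsibai/docker-dsp start.sh jupyter lab --NotebookApp.token=''"
}

# Usage: print the command, then run it once it looks right.
#   local_run_cmd "$HOME/notebooks"
#   eval "$(local_run_cmd "$HOME/notebooks")"
```

Paths containing spaces would need extra quoting before being passed to eval.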

TODO

  • access local files (whether running locally or on a Google Cloud machine)

Gear included

  • Python 3.6
  • AWS packages and tools
  • data analysis, manipulation, and storage: pandas, fastparquet
  • distributed, out-of-core, and data workflow: numba, luigi, airflow
  • multipurpose machine learning libraries: scikit-learn, orange3
  • NLP: nltk
  • network analysis: networkx
  • visualization: matplotlib, seaborn, holoviews, ggplot, bokeh
