- docker
- docker-machine (to deploy to google cloud)
docker-machine create docker-dsp -d google --google-project={project_id} --google-machine-type n1-highmem-8 --google-disk-size "10" --google-disk-type "pd-standard" --google-preemptible --google-machine-image ubuntu-os-cloud/global/images/family/ubuntu-1404-lts --google-scopes "https://www.googleapis.com/auth/cloud-platform"
eval $(docker-machine env docker-dsp)
docker run -d -p 8888:8080 -e "PROJECT_ID={project_id}" eyadsibai/docker-dsp start.sh jupyter lab --NotebookApp.token=''
- get the ip address of the machine
docker-machine ip docker-dsp
- open the site http://{docker-machine ip docker-dsp}:8080
- to stop the machine
docker-machine stop docker-dsp
docker-machine start docker-dsp
- to delete the instance
docker-machine rm docker-dsp
when you stop the machine, it would cost you nothing except for the disk that you have it attached. For Google cloud (10GB of disk would cost ~0.4$/month)
docker run -d -p 8888:8888 -v <local path>:/home/jovyan/work eyadsibai/docker-dsp start.sh jupyter lab --NotebookApp.token=''
- access local files (whether running locally or on google machine)
- Python 3.6
- aws packages and tools
- data analysis and data manipulation/data storage pandas/fastparquet/
- distributed/out-of-core/data workflow numba/luigi/airflow
- multipurpose machine learning libraries scikit-learn/orange3
- nlp nltk
- network analysis networkx
- visualization matplotlib/seaborn/holoviews/ggplot/bokeh