This repo is the starter for ecs_s3_scraper. The idea being that if you clone this repo and follow the setup steps below, then it becomes super easy to follow along the hands-on coding in the ecs_s3_scraper.
Just follow the steps below, and if you notice any failures, then please log an issue
-
Note: v6 or v7 should be fine
-
Install yarn package manager
$ npm install -g yarn
-
Git clone this repo in a directory of your choice, if you haven't already.
$ git clone https://github.com/anishk123/ecs_s3_scraper_starter.git
-
Get to the working directory
$ cd ecs_s3_scraper_starter
-
Run yarn install
$ yarn install
Note: the above will install the libraries and dependencies required for the project
-
Run index.js
$ node index.js
Note: the above will open up a web browser that is written in electron, and it will search the keyword "code and coffee vancouver"
-
Download Docker - Docker CE is the free version
-
Download a nightmare docker image
$ docker pull ivanvanderbyl/docker-nightmare
Note: We will be adding our own web scraper code to this image.
-
Download a fake S3 docker image
$ docker pull lphoward/fake-s3
Note: This image will be used to run a mock S3 service that we can develop our code against, instead of using actual AWS S3
You are DONE! and ready to rock on over to ecs_s3_scraper