ocho_scraper

installation

clone repo

git clone [email protected]:pungprakearti/ocho_scraper.git

change directory into repo

cd ocho_scraper

create secret file

touch secret.py

edit secret file with URLs to scrape

Use whichever editor you prefer. The syntax inside the secret.py is:

url1 = "http://www.google.com"
url2 = "http://www.yahoo.com"

running the scraper

python3 ocho_scraper.py

remove local saved data and rescrape all

This is an optional step. Sometimes the instructors have typos or change code after you've already scraped it. Use this command to rescrape everything. If you want to scrape individual files, just delete the directory for the lecture/exercise and when you run ocho_scraper without -a, it will rescrape that specific folder.

python3 ocho_scraper.py -a

open the indexer

open index.html

Use command + f to search for keywords in the indexer and the links will take you right to where the information lives.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.gitignore		.gitignore
README.md		README.md
ocho_indexer.py		ocho_indexer.py
ocho_preview.gif		ocho_preview.gif
ocho_progress_bar.py		ocho_progress_bar.py
ocho_scraper.py		ocho_scraper.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ocho_scraper

installation

running the scraper

open the indexer

profit

COHORT OCHO FOREVER!!!

About

Releases

Packages

Languages

pungprakearti/ocho_scraper

Folders and files

Latest commit

History

Repository files navigation

ocho_scraper

installation

running the scraper

open the indexer

profit

COHORT OCHO FOREVER!!!

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages