pungprakearti/ocho_scraper

ocho_scraper

animated gif of ocho_scraper functionality

installation

clone repo

git clone git@github.com:pungprakearti/ocho_scraper.git

change directory into repo

cd ocho_scraper

create secret file

touch secret.py

edit secret file with URLs to scrape

Use whichever editor you prefer. Inside secret.py, assign each URL to its own variable:

url1 = "http://www.google.com"
url2 = "http://www.yahoo.com"
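For the curious, here is a minimal sketch of how a script could pick up every `urlN` variable from a module like secret.py. This is an assumption for illustration only, not necessarily how ocho_scraper reads the file; `collect_urls` is a hypothetical helper and the `secret` module is built in memory as a stand-in for `import secret`.

```python
import types


def collect_urls(module):
    """Return the values of module-level names starting with 'url'.

    Hypothetical helper: names are sorted alphabetically, so 'url10'
    would come before 'url2'.
    """
    names = sorted(name for name in vars(module) if name.startswith("url"))
    return [getattr(module, name) for name in names]


# Stand-in for `import secret`, so the sketch runs without a real file.
secret = types.ModuleType("secret")
secret.url1 = "http://www.google.com"
secret.url2 = "http://www.yahoo.com"

urls = collect_urls(secret)
print(urls)
```

With a real secret.py in the same directory, `import secret` would replace the stand-in module.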

running the scraper

python3 ocho_scraper.py

remove local saved data and rescrape all

This step is optional. Instructors sometimes fix typos or change code after you've already scraped it. Use this command to remove the locally saved data and rescrape everything. To rescrape a single lecture or exercise, delete its directory and run ocho_scraper without -a; only that folder is rescraped.

python3 ocho_scraper.py -a
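The behavior described above (skip folders that already exist, rescrape everything with -a) can be sketched roughly as follows. This is a guess at the logic, not the actual ocho_scraper code: `plan_scrape`, `parse_args`, and the folder names `lecture1` and `lecture2` are hypothetical, and only the `-a` flag comes from this README.

```python
import argparse
import shutil
import tempfile
from pathlib import Path


def parse_args(argv):
    """Parse the -a flag (assumed here to be a simple on/off switch)."""
    parser = argparse.ArgumentParser()
    parser.add_argument("-a", dest="rescrape_all", action="store_true")
    return parser.parse_args(argv)


def plan_scrape(names, data_dir, rescrape_all):
    """Return the folder names that still need scraping.

    With rescrape_all, existing folders are deleted first, so every
    name is returned. Otherwise only missing folders are returned.
    """
    data_dir = Path(data_dir)
    todo = []
    for name in names:
        target = data_dir / name
        if rescrape_all and target.exists():
            shutil.rmtree(target)  # drop the stale local copy
        if not target.exists():
            todo.append(name)
    return todo


# Demo in a throwaway directory: lecture1 is already "scraped".
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "lecture1").mkdir()
    names = ["lecture1", "lecture2"]
    without_a = plan_scrape(names, tmp, parse_args([]).rescrape_all)
    with_a = plan_scrape(names, tmp, parse_args(["-a"]).rescrape_all)

print(without_a)
print(with_a)
```

Deleting one folder by hand has the same effect as -a for just that folder: it no longer exists, so the next plain run picks it up again.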

open the indexer

open index.html

Use Command + F to search for keywords in the indexer; the links take you right to where the information lives.


profit

burning money

COHORT OCHO FOREVER!!!

About

Python web scraper
