Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Quality Control for AHP_parser #11

Open
krammy19 opened this issue Feb 24, 2021 · 5 comments
Open

Quality Control for AHP_parser #11

krammy19 opened this issue Feb 24, 2021 · 5 comments
Labels

Comments

@krammy19
Copy link
Collaborator

krammy19 commented Feb 24, 2021

It would be good to verify that the urls that we're pulling in are actually valid with no errors.

Can someone please do a simple loop on the AHP_parser to request the sites and pull the status codes? If we're getting anything besides 200 codes, then we have some problems.

@xconnieex
Copy link
Collaborator

Which URLs/sites are these? Is it this one: CA_city_websites_final.csv?

@xconnieex xconnieex added the good first issue Good for newcomers label Feb 24, 2021
@dineshkumar-23
Copy link

Hello,
Could you please specify which URLs to check the status for?

@dineshkumar-23
Copy link

Ok cool. Could you please specify the URLs to check the status? Is it one of the columns in this file 'CA_city_websites_final.csv'?

@krammy19
Copy link
Collaborator Author

krammy19 commented Mar 3, 2021

Sorry about the delay in responding! I'm talking about the urls returned by the html-request scraper.

I would encourage you to try running the scraper on your own to find any issues, but you can also find the output on this Google Sheet: https://docs.google.com/spreadsheets/d/11offSYz2irnjI-9tILkcI-ClclRUZ0pyhXtPy-G4i8g/edit?usp=sharing

All columns besides CITY and CITY_URL are what needs to be quality-checked.

@krammy19
Copy link
Collaborator Author

krammy19 commented Mar 9, 2021

Update: html-request scraper 2 has been renamed to AHP_parser

@krammy19 krammy19 changed the title Quality Control for html-request scraper2 Quality Control for AHP_parser Mar 9, 2021
@krammy19 krammy19 added On Hold and removed good first issue Good for newcomers labels Sep 6, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants