Feature request: have the ability to return URLs not in input data within a certain region #20
Labels
product: transit data quality
Items that are a part of the Transit Data Quality Product of which @evansiroky is the product owner
status: on hold
User Story (Cal-ITP)
As a research data analyst,
I want to know whether feed aggregator websites list more up-to-date GTFS URLs than the ones Cal-ITP has
so that I can maintain a database of the GTFS URLs of the CA transit agencies
and so that I can have additional sources of information indicating which GTFS URLs transit agencies have
User Story (Community User)
As a transit application developer,
I want to get a list of all GTFS URLs on all feed aggregator websites for a particular region
so that I can have a complete list of all GTFS URLs to download data from to power my transit application
Acceptance Criteria
Given
For transitland, it seems the agencies can be queried to determine where they operate, and the results compared with the feeds found from the input URLs. The command line arguments could look something like this:
--transit-land-adm1_iso=US-CA
For transitfeeds, the hardcoded location could be made configurable via a command line argument:
--transit-feeds-location=67-california-usa
Then the URLs found on the aggregator websites that were not in the input URL list should be output in a separate section of the output.
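A minimal sketch of how this could work, assuming Python and `argparse`. The two flags are the ones proposed above and do not exist yet; the helper `urls_not_in_input`, the `urls_not_in_input` output key, and the `example.com` URLs are hypothetical names chosen only for illustration.

```python
import argparse
import json


def build_parser() -> argparse.ArgumentParser:
    # Both flags are the proposals from this issue, not existing options.
    parser = argparse.ArgumentParser()
    parser.add_argument("--transit-land-adm1_iso", default=None)
    parser.add_argument("--transit-feeds-location", default=None)
    return parser


def urls_not_in_input(input_urls, aggregator_urls):
    """Return aggregator URLs absent from the input list, sorted for stable output."""
    return sorted(set(aggregator_urls) - set(input_urls))


# argparse maps "--transit-feeds-location" to the attribute transit_feeds_location.
args = build_parser().parse_args(["--transit-feeds-location", "67-california-usa"])

# Placeholder data standing in for the URLs scraped from an aggregator.
found = ["https://example.com/a.zip", "https://example.com/b.zip"]
extra = urls_not_in_input(["https://example.com/a.zip"], found)

# Hypothetical output shape: the extra URLs go in their own section.
print(json.dumps({"location": args.transit_feeds_location,
                  "urls_not_in_input": extra}, indent=2))
```

This compares URLs as exact strings, so variants such as `http` versus `https` or a trailing slash would be reported as separate URLs unless they are normalized first.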
Example:
When searching for all transitfeeds URLs in Saskatchewan, Canada, while also checking against a single input URL, the CLI input and result could be as follows:
CLI Input
JSON Output