sec_scraper is a web scraping tool built on the Scrapy framework. It crawls the Securities and Exchange Commission (SEC) website and extracts the 10-K (annual) financial documents for each company in the SEC's company directory. These documents are intended for later use in sentiment analysis as part of alpha research on Quantopian.
- Python 3.4+
- Scrapy 1.6
The program was developed on Windows 10 using the Anaconda distribution for Windows. To install Scrapy with Anaconda, run the following conda command:

    conda install -c conda-forge scrapy

Please refer to the Scrapy installation guide for further details.
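Once Scrapy is installed, a quick check can confirm that the environment meets the requirements listed above. The following is a minimal sketch rather than part of sec_scraper itself: the helper names `python_ok` and `scrapy_version` are hypothetical, and the version thresholds are taken from the requirements list.

```python
import importlib
import sys

MIN_PYTHON = (3, 4)      # minimum interpreter version from the requirements list
EXPECTED_SCRAPY = "1.6"  # Scrapy release series from the requirements list


def python_ok(version_info=sys.version_info):
    """Return True if the interpreter is at least Python 3.4."""
    return tuple(version_info[:2]) >= MIN_PYTHON


def scrapy_version():
    """Return the installed Scrapy version string, or None if it is missing."""
    try:
        module = importlib.import_module("scrapy")
    except ImportError:
        return None
    return getattr(module, "__version__", None)


if __name__ == "__main__":
    print("Python 3.4+:", python_ok())
    found = scrapy_version()
    if found is None:
        print("Scrapy is not installed in this environment")
    else:
        status = "matches" if found.startswith(EXPECTED_SCRAPY) else "differs from"
        print("Scrapy", found, status, "the expected", EXPECTED_SCRAPY)
```

Run the script from the same Anaconda environment in which Scrapy was installed; a missing import is reported rather than raised, so the check can also be used before installation.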