Skip to content

Merge several years of data from House of Representatives together

License

Notifications You must be signed in to change notification settings

shmcminn/house-disbursements

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

House disbursement reports

Every quarter, the House of Representatives releases a .csv of its spending here. The chamber started doing so in 2016, whereas before it would release a PDF file that the Sunlight Foundation crawled and converted to a csv.

This repo has the old Sunlight files as well as all 2016 data files included. To add new files from the House's quarterly releases, add their namesm as strings to the house_csv_files list.

The python script get_new_data.py combines them into a "master_spending.csv" file. get_new_data_filter.py does the same but allows filtering. create_file_subset_data.py is a type of filtering script as well, but only to view the first X number of lines in your file, and it assumes you already have your "master_spending.csv" file in the present working directory.

Lots of credit goes to StackOverflow commenters on these, with forum threads linked in the comments.

This is also my first github repo, so play nice as I figure out what I'm doing :)

-Sean McMinn

Data reporter, CQ Roll Call

@shmcminn

[email protected]

About

Merge several years of data from House of Representatives together

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages