The goals of this project are to:
- Clean up publicly available disbursement data so that reporters, elected officials, constituents, and anyone else can more easily understand how money is being spent.
  - Match and consolidate like vendors
  - Catch and fix misspellings
- Visualize data to provide insight into the breakdown of expenditures. Some simple calculations could include:
  - Spending by Member
  - Amount received by Vendor
  - Compensation by Staff Title
- A well-documented methodology for cleaning data
- A stable and reliable single .csv for further exploration
- A flexible data visualization tool
- A detailed explanation for all expenses by office and category
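
As a rough illustration of the "match and consolidate like vendors" and "amount received by vendor" goals above, here is a minimal sketch using only the Python standard library. It is not the project's actual cleaning code: the sample rows, the `SUFFIXES` list, the `normalize` and `consolidate` helpers, and the 0.9 similarity cutoff are all assumptions made for the example.

```python
import re
from collections import defaultdict
from difflib import get_close_matches

# Hypothetical rows: (vendor name as printed, amount in dollars).
ROWS = [
    ("ACME OFFICE SUPPLY INC", 120.00),
    ("Acme Office Supply, Inc.", 80.50),
    ("ACME OFICE SUPPLY INC", 45.25),  # misspelled vendor name
    ("CAPITOL PRINTING LLC", 300.00),
    ("Capitol Printing", 150.00),
]

# Legal suffixes dropped before comparison (an assumed, incomplete list).
SUFFIXES = {"INC", "LLC", "CORP", "CO", "LTD"}


def normalize(name):
    """Uppercase, strip punctuation, and drop trailing legal suffixes."""
    words = re.sub(r"[^A-Z0-9 ]", " ", name.upper()).split()
    while words and words[-1] in SUFFIXES:
        words.pop()
    return " ".join(words)


def consolidate(rows, cutoff=0.9):
    """Sum amounts per canonical vendor, folding near-duplicates together."""
    totals = defaultdict(float)
    for raw_name, amount in rows:
        key = normalize(raw_name)
        # Reuse an already-seen canonical name if one is similar enough.
        match = get_close_matches(key, list(totals), n=1, cutoff=cutoff)
        totals[match[0] if match else key] += amount
    return dict(totals)


totals = consolidate(ROWS)
# Print vendors from largest to smallest total.
for vendor, amount in sorted(totals.items(), key=lambda kv: -kv[1]):
    print(f"{vendor}: {amount:.2f}")
```

With these sample rows, the three spellings of the first vendor fold into one total of 245.75 and the two spellings of the second into 450.00. A cutoff this strict misses looser variants, and a lower one risks merging distinct vendors, so any automatic match is worth reviewing by hand.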
NOTE: This is an educational experiment designed to provide some beneficial insight.
D3 Visualization of Top 25 Vendors from 2017 Q1 Statement of Disbursements
- Python
- JavaScript
- pandas
- React.js
- D3.js
- Jupyter Notebook for data analysis (To see notebook, go here).
Data Source: http://disbursements.house.gov/archive.shtml
- Josh Tauberer and Daniel Schuman
- Congress Tech Spending by The OpenGov Foundation
- Processing House Disbursements by Sunlight Foundation, now ProPublica
- Legislative Branch Capacity Working Group by Lee Drutman and Kevin Kosar
- Congressional Operations Research by Congressional Management Foundation
- Campaign Expenditures by Colin Kearns
- For getting me up and running, Takayuki Ito