Zach’s CfA Summer: “Made With [EPA] Data”

When Code for America selected me as a Google Summer of Code Intern for the Government Gems and Eggs project this summer, I knew the next few months were going to be awesome. For the first time ever, I was able to meet and work with people truly committed to making government data more accessible.

Most developers don’t especially enjoy working with APIs. It normally consists of reading a large amount of documentation with few examples, followed by a fair amount of troubleshooting whenever things don’t work as expected. On top of that, visualizations and graphs aren’t generally made overnight — a fair amount of development is spent working with data formats you might not be especially fond of (I’m looking at you XML).

But, when data and APIs become accessible, developers can make visualizations and applications that allow multiple individuals to grasp the larger picture of what’s taking place around them (the immediate examples that come to mind are the NYC Graffiti Snapshot or Zeit’s visualization of mobile telecom data).

This summer I ended up writing 8 different Python API wrappers — all of which are available on Code for America’s Github page — along with a base module other developers can use going forward.

The big news from our work this summer, in my opinion, are matching Ruby and Python wrappers for the EPA’s API (an API so large and complex that no other wrappers had previously been written for it). A wealth of government data and information is now available to developers — and the documentation on the Github repositories features tons of working examples.

Throughout my last few weeks as an intern at Code for America, I worked on finishing madewithdata.com — a simple site that currently allows users to explore the radiation and water pollution information now made available through the EPA API wrappers. I actually learned quite a few things while developing the site:

  • The top 25 most-populated US cities are made up of 531 ZIP Codes.
  • 2190 facilities in those ZIP Codes have permits to pollute public water sources.
  • The ZIP Code 32218 in Jacksonville, Florida has the most polluters — 178 facilities have been granted EPA permits to pollute public water sources (some of the records date back to the 1970s).

Also, all the code I used to make the site (and find the number of polluters) is available on my personal Github page.

Original post

Leave a Comment

Leave a comment

Leave a Reply