Skip to content

Scraper and parser of Cleveland City Council's records and the produced text.

Notifications You must be signed in to change notification settings

ysakthi/drocer

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

64 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

drocer

Cleveland City Council's records are available as PDFs from 1996-present.

As PDFs, there's many things that you cannot to do with them: search in multiple files at one time; easily extract data from them for further analysis; browse legislation by a particular topic; and more.

In short, PDFs make it very difficult for programmers, journalists, researchers to use the data inside them.

What's here:

The year folders (e.g. 2017) consist of plain text files extracted from PDFs of The City Record, Cleveland City Council's legislative notes The files are not recommended for reading. There are additional small errors like misspellings and improper spacing especially for lists. Tables and graphics in the City Record are also missing from these plain text files. Check the original PDFs or download them by year in bulk to verify if any information is missing.

city-record-contents-formatting.md - boilerplate text that is used in the city record and what it means.

Learn more about our goals, project roadmap, and how you can help at:

http://www.opencleveland.org/projects/drocer/

https://hackpad.com/City-Council-Records-7c707KHL5Wq

About

Scraper and parser of Cleveland City Council's records and the produced text.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • HTML 95.9%
  • Python 2.4%
  • Java 1.7%