https://github.com/datamade/pupa-legistar

Pupa compliant python library for scraping Legistar sites -- legislation management sites hosted by by Granicus.

https://github.com/datamade/pupa-legistar

Science Score: 10.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    2 of 7 committers (28.6%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (6.6%) to scientific vocabulary

Keywords from Contributors

datamade de-duplicating dedupe dedupe-library entity-resolution record-linkage chicago councilmatic
Last synced: 10 months ago · JSON representation

Repository

Pupa compliant python library for scraping Legistar sites -- legislation management sites hosted by by Granicus.

Basic Info
  • Host: GitHub
  • Owner: datamade
  • License: mit
  • Language: Python
  • Default Branch: master
  • Homepage:
  • Size: 144 KB
Statistics
  • Stars: 0
  • Watchers: 4
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 12 years ago · Last pushed over 9 years ago
Metadata Files
Readme License

README.md

DEPRECATED, See https://github.com/opencivicdata/python-legistar-scraper/

Pupa Legistar

Pupa compliant python library for scraping Legistar sites -- legislation management sites hosted by by Granicus.

Legistar sites include - Chicago - Philadelphia - Oakland - and many other cities

Pupa framework for managing municipal data - standard - docs

Requirements

  • python 2.7
  • mongo

Installation

```console

pip install -r requirements.txt python setup.py install ```

Setup

Everything should be set up to run already with this repository, but if you want to change your mongo settings, edit pupa_settings.py

Usage

Make sure you have mongo installed and running.

Init the mongo collection. Default database is pupa console pupa init chicago

Run the scraper. This will generate a bunch of static json files in scraped_data and stuff them in to the pupa database. console pupa update chicago

Owner

  • Name: datamade
  • Login: datamade
  • Kind: organization
  • Email: info@datamade.us
  • Location: Chicago, IL

We build open source technology using open data to empower journalists, researchers, governments and advocacy organizations.

GitHub Events

Total
Last Year

Committers

Last synced: 12 months ago

All Time
  • Total Commits: 213
  • Total Committers: 7
  • Avg Commits per committer: 30.429
  • Development Distribution Score (DDS): 0.404
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Forest Gregg f****g@u****u 127
Mjumbe Wawatu Ukweli m****u@g****m 47
Derek Eder d****r@g****m 27
Chris Brown c****n@s****u 7
Eric van Zanten e****c@b****s 3
guelo u****r@g****m 1
Eric van Zanten e****n@g****m 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 12 months ago

All Time
  • Total issues: 0
  • Total pull requests: 1
  • Average time to close issues: N/A
  • Average time to close pull requests: almost 3 years
  • Total issue authors: 0
  • Total pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
  • fgregg (1)
Top Labels
Issue Labels
Pull Request Labels

Dependencies

requirements.txt pypi
  • coverage *
  • lxml *
  • nose *
  • pupa *
  • pymongo *
  • slate *