openstates-scrapers
source for Open States scrapers
Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: Found CITATION.cff file
- ✓ codemeta.json file: Found codemeta.json file
- ✓ .zenodo.json file: Found .zenodo.json file
- ○ DOI references
- ○ Academic publication links
- ✓ Committers with academic emails: 14 of 249 committers (5.6%) from academic institutions
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: Low similarity (2.3%) to scientific vocabulary
Keywords
government
hacktoberfest
python
scrapers
states
united-states
Keywords from Contributors
closember
reinforcement-learning
transformation
climate
Last synced: 6 months ago
Repository
Basic Info
- Host: GitHub
- Owner: openstates
- License: gpl-3.0
- Language: Python
- Default Branch: main
- Homepage: https://openstates.org
- Size: 30.9 MB
Statistics
- Stars: 876
- Watchers: 52
- Forks: 493
- Open Issues: 5
- Releases: 0
Topics
government
hacktoberfest
python
scrapers
states
united-states
Created almost 17 years ago · Last pushed 6 months ago
Metadata Files
Readme
License
Citation
Authors
README.md
Open States Scrapers
This repository contains the code responsible for scraping bills & votes for Open States.
Owner
- Name: Open States
- Login: openstates
- Kind: organization
- Email: contact@openstates.org
- Website: https://openstates.org
- Twitter: openstates
- Repositories: 26
- Profile: https://github.com/openstates
Citation (CITATION.cff)
# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!
cff-version: 1.2.0
title: Open States Scrapers
message: >-
  If you use this software, please cite it using the
  metadata from this file.
type: software
authors:
  - given-names: James
    family-names: Turk
    orcid: 'https://orcid.org/0000-0003-1762-1420'
  - given-names: Michael
    family-names: Stephens
  - given-names: Tim
    family-names: Showers
  - given-names: Thom
    family-names: Neale
  - given-names: Miles
    family-names: Watkins
  - given-names: Paul
    family-names: Tagliamonte
  - given-names: Rylie
    family-names: Johnson
  - given-names: Rachel
    family-names: Shorey
  - given-names: Dan
    family-names: Schneiderman
  - given-names: Josh
    family-names: Carp
repository-code: 'https://openstates.org'
abstract: >-
  Python web scrapers for all 50 state legislatures, DC, and
  Puerto Rico.
license: GPL-3.0
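As an illustration of how the citation metadata above might be used, the following minimal sketch turns the listed authors and title into a single citation string. The `format_citation` helper and the loosely APA-like layout are assumptions made for this example; they are not part of the CITATION.cff format or of the repository.

```python
# Author names, title, and URL copied from the CITATION.cff above.
AUTHORS = [
    ("James", "Turk"),
    ("Michael", "Stephens"),
    ("Tim", "Showers"),
    ("Thom", "Neale"),
    ("Miles", "Watkins"),
    ("Paul", "Tagliamonte"),
    ("Rylie", "Johnson"),
    ("Rachel", "Shorey"),
    ("Dan", "Schneiderman"),
    ("Josh", "Carp"),
]
TITLE = "Open States Scrapers"
URL = "https://openstates.org"  # the repository-code value in the file


def format_citation(authors, title, url):
    """Hypothetical helper: join 'Family, G.' name forms, then title and URL."""
    names = ", ".join(f"{family}, {given[0]}." for given, family in authors)
    return f"{names} {title} [Computer software]. {url}"


citation = format_citation(AUTHORS, TITLE, URL)
print(citation)
```

In practice the fields would be read from the file with a YAML parser rather than copied by hand; they are hard-coded here to keep the sketch free of third-party dependencies.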
Committers
Last synced: 8 months ago
Top Committers
| Name | Email | Commits |
|---|---|---|
| James Turk | j****k@g****m | 4,216 |
| Tim Showers | s****t@g****m | 1,638 |
| Michael Stephens | m****s@s****m | 1,569 |
| Thom Neale | t****e@g****m | 1,198 |
| Paul Tagliamonte | p****g@s****m | 855 |
| NewAgeAirbender | 3****r | 582 |
| Miles Watkins | m****s@g****m | 562 |
| Paul Tagliamonte | t****g@p****g | 436 |
| John Seekins | j****n@c****m | 390 |
| Jesse Mortenson | j****n@g****m | 340 |
| Rachel Shorey | r****y@g****m | 338 |
| Michelle Orden | m****n@M****l | 296 |
| schneidy | s****l@g****m | 271 |
| Joshua Carp | j****p@g****m | 254 |
| Andy Lo | a****o@s****m | 248 |
| Dan Schneiderman | s****y@c****u | 141 |
| Hitesh Garg | g****h@g****m | 134 |
| Christopher Yamas | c****s@b****u | 129 |
| user | u****r@a****x | 115 |
| braykuka | b****a@g****m | 114 |
| judgejudes | j****1@u****u | 107 |
| Gabriel J. Pérez | g****l@g****m | 106 |
| Chris Yamas | c****s@C****l | 93 |
| Amy Cesal | a****l@s****m | 91 |
| alexobaseki | a****x@p****m | 86 |
| Sruthi Vedantham | s****i@c****m | 81 |
| ehtisham | s****z@g****m | 74 |
| Bikram Bharti | b****5@i****n | 70 |
| Brandon Lewis | b****s@a****m | 69 |
| estaub | e****b@s****g | 63 |
| and 219 more... | | |
Committer Domains (Top 20 + Academic)
sunlightfoundation.com: 11
civiceagle.com: 6
patch.com: 4
pluralpolicy.com: 2
college.harvard.edu: 2
uchicago.edu: 2
man.com: 1
dunkel.(none): 1
bitmechanic.com: 1
google.com: 1
cs.jhu.edu: 1
annerajb-server.(none): 1
idealist.org: 1
wawd.com: 1
quorum.us: 1
abonilla5.americas.hpqcorp.net: 1
clyde.westell.com: 1
front4.(none): 1
michelles-mbp.home: 1
gregjd.com: 1
umich.edu: 1
columbia.edu: 1
csh.rit.edu: 1
berkeley.edu: 1
u.northwestern.edu: 1
itbhu.ac.in: 1
vt.edu: 1
usc.edu: 1
ufl.edu: 1
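The domain list above is consistent with the "14 of 249 committers (5.6%)" figure reported for academic emails. The sketch below checks that arithmetic. The `is_academic` rule (a `.edu` top-level domain or an `.ac.<country>` suffix) is an assumption made for this example; the page does not say how academic domains are actually detected.

```python
# Committer counts per email domain, copied from the list above.
DOMAIN_COUNTS = {
    "sunlightfoundation.com": 11, "civiceagle.com": 6, "patch.com": 4,
    "pluralpolicy.com": 2, "college.harvard.edu": 2, "uchicago.edu": 2,
    "man.com": 1, "dunkel.(none)": 1, "bitmechanic.com": 1, "google.com": 1,
    "cs.jhu.edu": 1, "annerajb-server.(none)": 1, "idealist.org": 1,
    "wawd.com": 1, "quorum.us": 1, "abonilla5.americas.hpqcorp.net": 1,
    "clyde.westell.com": 1, "front4.(none)": 1, "michelles-mbp.home": 1,
    "gregjd.com": 1, "umich.edu": 1, "columbia.edu": 1, "csh.rit.edu": 1,
    "berkeley.edu": 1, "u.northwestern.edu": 1, "itbhu.ac.in": 1,
    "vt.edu": 1, "usc.edu": 1, "ufl.edu": 1,
}
TOTAL_COMMITTERS = 249  # from the "14 of 249 committers" indicator


def is_academic(domain):
    """Assumed heuristic: '.edu' TLD or an '.ac.<cc>' second-level domain."""
    parts = domain.lower().split(".")
    return parts[-1] == "edu" or (len(parts) >= 2 and parts[-2] == "ac")


academic_total = sum(n for d, n in DOMAIN_COUNTS.items() if is_academic(d))
academic_share = 100 * academic_total / TOTAL_COMMITTERS

print(f"{academic_total} of {TOTAL_COMMITTERS} ({academic_share:.1f}%)")
```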
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 6
- Total pull requests: 1,288
- Average time to close issues: about 1 hour
- Average time to close pull requests: 8 days
- Total issue authors: 5
- Total pull request authors: 21
- Average comments per issue: 0.33
- Average comments per pull request: 0.32
- Merged pull requests: 1,179
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 3
- Pull requests: 745
- Average time to close issues: about 1 hour
- Average time to close pull requests: about 12 hours
- Issue authors: 2
- Pull request authors: 14
- Average comments per issue: 0.0
- Average comments per pull request: 0.28
- Merged pull requests: 672
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- jessemortenson (2)
- arpitjain099 (1)
- estaub (1)
- pgoldtho (1)
- braykuka (1)
Pull Request Authors
- showerst (414)
- jessemortenson (355)
- NewAgeAirbender (271)
- alexobaseki (102)
- braykuka (95)
- chrisyamas (17)
- austinhouck (4)
- jealob (4)
- nickrecchi (4)
- sroomf (2)
- sacerdoted (2)
- flooie (2)
- Desitrain22 (2)
- hasna-akbarali (2)
- elseagle (2)
Top Labels
Issue Labels
Pull Request Labels
Dependencies
poetry.lock
pypi
- black 22.3.0 develop
- mypy-extensions 0.4.3 develop
- pathspec 0.9.0 develop
- platformdirs 2.5.2 develop
- tomli 2.0.1 develop
- appnope 0.1.3
- argcomplete 1.10.3
- asgiref 3.5.0
- attrs 20.3.0
- backcall 0.2.0
- bcrypt 3.2.0
- beautifulsoup4 4.8.2
- boto3 1.21.46
- botocore 1.24.46
- certifi 2021.10.8
- cffi 1.15.0
- chardet 3.0.4
- charset-normalizer 2.0.12
- click 8.1.2
- cloudscraper 1.2.60
- colorama 0.4.4
- compressed-rtf 1.0.6
- cryptography 36.0.2
- cssselect 1.1.0
- decorator 5.1.1
- dj-database-url 0.5.0
- django 3.2.13
- docx2txt 0.8
- ebcdic 1.1.1
- et-xmlfile 1.1.0
- extract-msg 0.29.0
- feedparser 6.0.8
- greenlet 1.1.2
- idna 3.3
- imapclient 2.1.0
- ipython 7.34.0
- jedi 0.18.1
- jellyfish 0.6.1
- jmespath 1.0.0
- jsonschema 3.2.0
- lxml 4.8.0
- matplotlib-inline 0.1.3
- mysqlclient 1.4.6
- olefile 0.46
- openpyxl 3.0.9
- openstates 6.11.0
- paramiko 2.10.3
- parso 0.8.3
- pdfminer.six 20191110
- pexpect 4.8.0
- pickleshare 0.7.5
- pillow 9.1.0
- prompt-toolkit 3.0.29
- psycopg2-binary 2.9.3
- ptyprocess 0.7.0
- pycparser 2.21
- pycryptodome 3.14.1
- pydantic 1.9.0
- pygments 2.12.0
- pynacl 1.5.0
- pyparsing 3.0.8
- pyrsistent 0.18.1
- python-dateutil 2.8.2
- python-pptx 0.6.21
- pytz 2019.3
- pytz-deprecation-shim 0.1.0.post0
- pyyaml 5.4.1
- requests 2.28.1
- requests-toolbelt 0.9.1
- s3transfer 0.5.2
- scrapelib 2.0.7
- sgmllib3k 1.0.0
- six 1.12.0
- sortedcontainers 2.4.0
- soupsieve 2.3.2.post1
- spatula 0.8.10
- speechrecognition 3.8.1
- sqlalchemy 1.4.35
- sqlparse 0.4.2
- suds-py3 1.4.5.0
- textract 1.6.5
- traitlets 5.1.1
- typing-extensions 4.2.0
- tzdata 2022.1
- tzlocal 4.2
- urllib3 1.26.9
- us 2.0.2
- wcwidth 0.2.5
- xlrd 1.2.0
- xlsxwriter 3.0.3
pyproject.toml
pypi
- black ^22 develop
- SQLAlchemy ^1.3
- chardet ^3.0
- cloudscraper ^1.2.58
- feedparser ^6.0
- lxml ^4.4
- mysqlclient ^1.4.6
- openstates ^6.11.0
- paramiko ^2.9.2
- python ^3.9
- python-dateutil ^2.8
- pytz ^2019.3
- requests ^2.22
- spatula ^0.8
- suds-py3 ^1.3
- xlrd <2
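Most of the pyproject.toml entries above use Poetry's caret notation, which allows updates that leave the leftmost non-zero version component unchanged (for example, `^1.3` permits `>=1.3.0,<2.0.0` and `^0.8` permits `>=0.8.0,<0.9.0`). The sketch below expands a caret constraint and checks it against the versions pinned in poetry.lock. `caret_range` and `satisfies` are hypothetical helpers written for this example and handle only plain numeric versions; real tooling should rely on Poetry or the `packaging` library.

```python
def caret_range(spec):
    """Expand a caret constraint such as '^1.3' into (lower, upper) tuples."""
    parts = [int(p) for p in spec.lstrip("^").split(".")]
    lower = tuple(parts + [0] * (3 - len(parts)))
    # Bump the first non-zero component; if all are zero, bump the last given.
    idx = next((i for i, p in enumerate(parts) if p != 0), len(parts) - 1)
    upper = tuple(parts[:idx] + [parts[idx] + 1] + [0] * (2 - idx))
    return lower, upper


def satisfies(version, spec):
    """Return True if a dotted numeric version falls in the caret range."""
    v = tuple(int(p) for p in version.split("."))
    v = v + (0,) * (3 - len(v))  # pad short versions like '2019.3'
    lower, upper = caret_range(spec)
    return lower <= v < upper


# (package, constraint from pyproject.toml, version from poetry.lock)
PINS = [
    ("SQLAlchemy", "^1.3", "1.4.35"),
    ("cloudscraper", "^1.2.58", "1.2.60"),
    ("lxml", "^4.4", "4.8.0"),
    ("paramiko", "^2.9.2", "2.10.3"),
    ("pytz", "^2019.3", "2019.3"),
    ("requests", "^2.22", "2.28.1"),
    ("spatula", "^0.8", "0.8.10"),
    ("suds-py3", "^1.3", "1.4.5.0"),
]

for name, spec, locked in PINS:
    print(name, spec, locked, satisfies(locked, spec))
```

The `xlrd <2` entry is a plain upper bound rather than a caret constraint, so it is left out of the check.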
.github/workflows/ca-docker.yml
actions
- actions/checkout v3 composite
- docker/build-push-action v3 composite
- docker/login-action v2 composite
.github/workflows/docker.yml
actions
- actions/checkout v3 composite
- docker/build-push-action v3 composite
- docker/login-action v2 composite
.github/workflows/lint.yml
actions
- actions/checkout v3 composite
- actions/setup-python v4 composite
.github/workflows/scrape.yml
actions
- actions/cache v2 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- snok/install-poetry v1.3.3 composite
Dockerfile
docker
- python 3.9-slim build
docker-compose.yml
docker
- mariadb 10.5
- mdillon/postgis 11-alpine