Science Score: 31.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
○codemeta.json file
-
○.zenodo.json file
-
✓DOI references
Found 1 DOI reference(s) in README -
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (13.9%) to scientific vocabulary
Repository
Innovation across ages
Basic Info
- Host: GitHub
- Owner: cverluise
- License: mit
- Language: Python
- Default Branch: nightly
- Homepage: https://cverluise.github.io/patentcity/
- Size: 32.4 MB
Statistics
- Stars: 66
- Watchers: 4
- Forks: 3
- Open Issues: 4
- Releases: 0
Metadata Files
README.md
patentCity
🗃️ This repository is the codebase of the patentCity database.
📚 The patentCity database is a comprehensive database reporting patentees' data extracted from patent texts as well as enriched data since the 19th century in Germany (including East Germany), France, Great Britain and the USA.
💥 The goal of the database is to spur research on the history of innovation and to deepen the set of natural experiments from which historians, economists and scientists in general can learn to improve our understanding of innovation dynamics.
🌎 Explore and visualize the patentCity database online at patentcity.xyz (click on the map under the "Explore" section).
📥 Download the patentCity database here (CC-BY-4).
📝 The patentCity codebase and database are documented at doc.patentcity.xyz.
🤗 We open source our code (MIT) to support future extensions, and a collaborative way to create and continuously improve research databases.
💌 patentCity is due to expand and improve continuously in the coming years. Make sure to receive updates, join our newsletter and star the GitHub repository!
©️ patentCity is the backbone of Bergeaud and Verluise (2022). If you use the data or the codebase, make sure to cite the paper.
Developer guide
Installation and set-up
```shell script
Clone repo
git clone https://github.com/cverluise/patentcity.git
set up env/ install dependencies and CLI
cd patentcity/ poetry install ```
Install poetry if needed.
```shell script
get data & models (dev-only, access to gcp bucket needed)
dvc pull ```
Install dvc if needed.
API doc
The API doc is available at the documentation website under the API section.
Overall, the API is thought to be versatile and to adapt to new documents seamlessly. The main functions are covered by the patencity CLI
Owner
- Name: Cyril Verluise
- Login: cverluise
- Kind: user
- Location: Paris
- Company: Economics of Innovation Lab @ Collège de France
- Website: https://cverluise.github.io/
- Repositories: 12
- Profile: https://github.com/cverluise
Phd Student in Economics & Data scientist. Interested in knowledge creation and diffusion. Currently working on patent related projects.
Citation (CITATION.bib)
@unpublished{bergeaudVerluise2021,
title={A new Dataset to Study a Century of Innovation
in Europe and in the US},
author={Bergeaud, Antonin and Verluise, Cyril},
year={2022},
note = {Mimeo}
}
GitHub Events
Total
- Watch event: 4
- Issue comment event: 3
- Pull request review event: 1
- Pull request event: 1
- Fork event: 2
Last Year
- Watch event: 4
- Issue comment event: 3
- Pull request review event: 1
- Pull request event: 1
- Fork event: 2
Dependencies
- 152 dependencies
- gitpython ^3.1.12 develop
- mkdocs ^1.1.2 develop
- mkdocs-material ^7.1.3 develop
- mkdocs-minify-plugin ^0.3.0 develop
- mkdocstrings ^0.15.2 develop
- pymdown-extensions ^7.1 develop
- typer-cli ^0.0.11 develop
- Cython ^0.29.23
- bs4 ^0.0.1
- dvc ^2.0.18
- fuzzysearch ^0.7.3
- fuzzyset ^0.0.19
- google-cloud ^0.34.0
- google-cloud-bigquery ^1.27.2
- googlemaps ^4.4.2
- lxml ^4.6.2
- pandas ^1.0.5
- pre-commit ^2.6.0
- python ^3.8
- python-levenshtein ^0.12.2
- smart_open ^2.1.0
- spacy ^3.0.1
- spacy-lookups-data ^0.3.2
- spacy-transformers ^1.0.1
- tabulate ^0.8.7
- typer ^0.3.2
- aiohttp ==3.7.3 development
- aioredis ==1.3.1 development
- appdirs ==1.4.4 development
- async-timeout ==3.0.1 development
- attrs ==20.3.0 development
- beautifulsoup4 ==4.9.3 development
- blessings ==1.7 development
- blis ==0.7.4 development
- boto3 ==1.15.16 development
- botocore ==1.18.16 development
- bs4 ==0.0.1 development
- cachetools ==4.1.1 development
- catalogue ==2.0.1 development
- certifi ==2020.6.20 development
- cffi ==1.14.3 development
- cfgv ==3.2.0 development
- chardet ==3.0.4 development
- click ==7.1.2 development
- colorama ==0.4.3 development
- colorful ==0.5.4 development
- cymem ==2.0.5 development
- distlib ==0.3.1 development
- filelock ==3.0.12 development
- ftfy ==5.8 development
- future ==0.18.2 development
- fuzzysearch ==0.7.3 development
- gitdb ==4.0.5 development
- gitpython ==3.1.12 development
- google ==3.0.0 development
- google-api-core ==1.22.4 development
- google-auth ==1.22.1 development
- google-cloud ==0.34.0 development
- google-cloud-bigquery ==1.28.0 development
- google-cloud-core ==1.4.3 development
- google-crc32c ==1.0.0 development
- google-resumable-media ==1.1.0 development
- googleapis-common-protos ==1.52.0 development
- googlemaps ==4.4.2 development
- gpustat ==0.6.0 development
- grpcio ==1.35.0 development
- hiredis ==1.1.0 development
- htmlmin ==0.1.12 development
- identify ==1.5.6 development
- idna ==2.10 development
- importlib-metadata ==2.0.0 development
- jinja2 ==2.11.2 development
- jmespath ==0.10.0 development
- joblib ==0.17.0 development
- jsmin ==2.2.2 development
- jsonschema ==3.2.0 development
- livereload ==2.6.3 development
- lunr ==0.5.8 development
- lxml ==4.5.2 development
- markdown ==3.3.1 development
- markupsafe ==1.1.1 development
- mkdocs ==1.1.2 development
- mkdocs-material ==5.5.14 development
- mkdocs-material-extensions ==1.0.1 development
- mkdocs-minify-plugin ==0.3.0 development
- msgpack ==1.0.2 development
- multidict ==5.1.0 development
- murmurhash ==1.0.5 development
- nltk ==3.5 development
- nodeenv ==1.5.0 development
- numpy ==1.19.2 development
- nvidia-ml-py3 ==7.352.0 development
- opencensus ==0.7.12 development
- opencensus-context ==0.1.2 development
- packaging ==20.9 development
- pandas ==1.1.3 development
- pathy ==0.3.5 development
- pre-commit ==2.7.1 development
- preshed ==3.0.5 development
- prometheus-client ==0.9.0 development
- protobuf ==3.13.0 development
- psutil ==5.8.0 development
- py-spy ==0.3.4 development
- pyasn1 ==0.4.8 development
- pyasn1-modules ==0.2.8 development
- pycparser ==2.20 development
- pydantic ==1.7.3 development
- pygments ==2.7.1 development
- pymdown-extensions ==7.1 development
- pyparsing ==2.4.7 development
- pyrsistent ==0.17.3 development
- python-dateutil ==2.8.1 development
- pytz ==2020.1 development
- pyyaml ==5.3.1 development
- ray ==0.8.7 development
- redis ==3.4.1 development
- regex ==2020.10.11 development
- requests ==2.24.0 development
- rsa ==4.6 development
- s3transfer ==0.3.3 development
- sacremoses ==0.0.43 development
- shellingham ==1.3.2 development
- six ==1.15.0 development
- smart-open ==2.2.1 development
- smmap ==3.0.5 development
- soupsieve ==2.0.1 development
- spacy ==3.0.1 development
- spacy-alignments ==0.7.2 development
- spacy-legacy ==3.0.1 development
- spacy-lookups-data ==0.3.2 development
- spacy-ray ==0.1.0 development
- spacy-transformers ==1.0.1 development
- srsly ==2.4.0 development
- tabulate ==0.8.7 development
- thinc ==8.0.1 development
- tokenizers ==0.9.4 development
- toml ==0.10.1 development
- torch ==1.7.1 development
- torchcontrib ==0.0.2 development
- tornado ==6.0.4 development
- tqdm ==4.50.2 development
- transformers ==4.2.2 development
- typer ==0.3.2 development
- typer-cli ==0.0.11 development
- typing-extensions ==3.7.4.3 development
- urllib3 ==1.25.10 development
- virtualenv ==20.0.34 development
- wasabi ==0.8.2 development
- wcwidth ==0.2.5 development
- yarl ==1.6.3 development
- zipp ==3.3.0 development
- aiohttp ==3.7.3
- aioredis ==1.3.1
- appdirs ==1.4.4
- async-timeout ==3.0.1
- attrs ==20.3.0
- beautifulsoup4 ==4.9.3
- blessings ==1.7
- blis ==0.7.4
- boto3 ==1.15.16
- botocore ==1.18.16
- bs4 ==0.0.1
- cachetools ==4.1.1
- catalogue ==2.0.1
- certifi ==2020.6.20
- cffi ==1.14.3
- cfgv ==3.2.0
- chardet ==3.0.4
- click ==7.1.2
- colorama ==0.4.3
- colorful ==0.5.4
- cymem ==2.0.5
- distlib ==0.3.1
- filelock ==3.0.12
- ftfy ==5.8
- fuzzysearch ==0.7.3
- google ==3.0.0
- google-api-core ==1.22.4
- google-auth ==1.22.1
- google-cloud ==0.34.0
- google-cloud-bigquery ==1.28.0
- google-cloud-core ==1.4.3
- google-crc32c ==1.0.0
- google-resumable-media ==1.1.0
- googleapis-common-protos ==1.52.0
- googlemaps ==4.4.2
- gpustat ==0.6.0
- grpcio ==1.35.0
- hiredis ==1.1.0
- identify ==1.5.6
- idna ==2.10
- jinja2 ==2.11.2
- jmespath ==0.10.0
- joblib ==0.17.0
- jsonschema ==3.2.0
- lxml ==4.5.2
- markupsafe ==1.1.1
- msgpack ==1.0.2
- multidict ==5.1.0
- murmurhash ==1.0.5
- nodeenv ==1.5.0
- numpy ==1.19.2
- nvidia-ml-py3 ==7.352.0
- opencensus ==0.7.12
- opencensus-context ==0.1.2
- packaging ==20.9
- pandas ==1.1.3
- pathy ==0.3.5
- pre-commit ==2.7.1
- preshed ==3.0.5
- prometheus-client ==0.9.0
- protobuf ==3.13.0
- psutil ==5.8.0
- py-spy ==0.3.4
- pyasn1 ==0.4.8
- pyasn1-modules ==0.2.8
- pycparser ==2.20
- pydantic ==1.7.3
- pyparsing ==2.4.7
- pyrsistent ==0.17.3
- python-dateutil ==2.8.1
- pytz ==2020.1
- pyyaml ==5.3.1
- ray ==0.8.7
- redis ==3.4.1
- regex ==2020.10.11
- requests ==2.24.0
- rsa ==4.6
- s3transfer ==0.3.3
- sacremoses ==0.0.43
- shellingham ==1.3.2
- six ==1.15.0
- smart-open ==2.2.1
- soupsieve ==2.0.1
- spacy ==3.0.1
- spacy-alignments ==0.7.2
- spacy-legacy ==3.0.1
- spacy-lookups-data ==0.3.2
- spacy-ray ==0.1.0
- spacy-transformers ==1.0.1
- srsly ==2.4.0
- tabulate ==0.8.7
- thinc ==8.0.1
- tokenizers ==0.9.4
- toml ==0.10.1
- torch ==1.7.1
- torchcontrib ==0.0.2
- tqdm ==4.50.2
- transformers ==4.2.2
- typer ==0.3.2
- typing-extensions ==3.7.4.3
- urllib3 ==1.25.10
- virtualenv ==20.0.34
- wasabi ==0.8.2
- wcwidth ==0.2.5
- yarl ==1.6.3