patentcity

Innovation across ages

https://github.com/cverluise/patentcity

Science Score: 31.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (13.9%) to scientific vocabulary
Last synced: 10 months ago · JSON representation ·

Repository

Innovation across ages

Basic Info
Statistics
  • Stars: 66
  • Watchers: 4
  • Forks: 3
  • Open Issues: 4
  • Releases: 0
Created almost 6 years ago · Last pushed over 3 years ago
Metadata Files
Readme License Code of conduct Citation

README.md

patentCity


🗃️ This repository is the codebase of the patentCity database.

📚 The patentCity database is a comprehensive database reporting patentees' data extracted from patent texts as well as enriched data since the 19th century in Germany (including East Germany), France, Great Britain and the USA.

💥 The goal of the database is to spur research on the history of innovation and to deepen the set of natural experiments from which historians, economists and scientists in general can learn to improve our understanding of innovation dynamics.

🌎 Explore and visualize the patentCity database online at patentcity.xyz (click on the map under the "Explore" section).

📥 Download the patentCity database here (CC-BY-4).

📝 The patentCity codebase and database are documented at doc.patentcity.xyz.

🤗 We open source our code (MIT) to support future extensions, and a collaborative way to create and continuously improve research databases.

💌 patentCity is due to expand and improve continuously in the coming years. Make sure to receive updates, join our newsletter and star the GitHub repository!

©️ patentCity is the backbone of Bergeaud and Verluise (2022). If you use the data or the codebase, make sure to cite the paper.

Developer guide

Installation and set-up

```shell script

Clone repo

git clone https://github.com/cverluise/patentcity.git

set up env/ install dependencies and CLI

cd patentcity/ poetry install ```

Install poetry if needed.

```shell script

get data & models (dev-only, access to gcp bucket needed)

dvc pull ```

Install dvc if needed.

API doc

The API doc is available at the documentation website under the API section.

Overall, the API is thought to be versatile and to adapt to new documents seamlessly. The main functions are covered by the patencity CLI

Owner

  • Name: Cyril Verluise
  • Login: cverluise
  • Kind: user
  • Location: Paris
  • Company: Economics of Innovation Lab @ Collège de France

Phd Student in Economics & Data scientist. Interested in knowledge creation and diffusion. Currently working on patent related projects.

Citation (CITATION.bib)

@unpublished{bergeaudVerluise2021,
  title={A new Dataset to Study a Century of Innovation
in Europe and in the US},
  author={Bergeaud, Antonin and Verluise, Cyril},
  year={2022},
  note = {Mimeo}
}

GitHub Events

Total
  • Watch event: 4
  • Issue comment event: 3
  • Pull request review event: 1
  • Pull request event: 1
  • Fork event: 2
Last Year
  • Watch event: 4
  • Issue comment event: 3
  • Pull request review event: 1
  • Pull request event: 1
  • Fork event: 2

Dependencies

poetry.lock pypi
  • 152 dependencies
pyproject.toml pypi
  • gitpython ^3.1.12 develop
  • mkdocs ^1.1.2 develop
  • mkdocs-material ^7.1.3 develop
  • mkdocs-minify-plugin ^0.3.0 develop
  • mkdocstrings ^0.15.2 develop
  • pymdown-extensions ^7.1 develop
  • typer-cli ^0.0.11 develop
  • Cython ^0.29.23
  • bs4 ^0.0.1
  • dvc ^2.0.18
  • fuzzysearch ^0.7.3
  • fuzzyset ^0.0.19
  • google-cloud ^0.34.0
  • google-cloud-bigquery ^1.27.2
  • googlemaps ^4.4.2
  • lxml ^4.6.2
  • pandas ^1.0.5
  • pre-commit ^2.6.0
  • python ^3.8
  • python-levenshtein ^0.12.2
  • smart_open ^2.1.0
  • spacy ^3.0.1
  • spacy-lookups-data ^0.3.2
  • spacy-transformers ^1.0.1
  • tabulate ^0.8.7
  • typer ^0.3.2
requirements-dev.txt pypi
  • aiohttp ==3.7.3 development
  • aioredis ==1.3.1 development
  • appdirs ==1.4.4 development
  • async-timeout ==3.0.1 development
  • attrs ==20.3.0 development
  • beautifulsoup4 ==4.9.3 development
  • blessings ==1.7 development
  • blis ==0.7.4 development
  • boto3 ==1.15.16 development
  • botocore ==1.18.16 development
  • bs4 ==0.0.1 development
  • cachetools ==4.1.1 development
  • catalogue ==2.0.1 development
  • certifi ==2020.6.20 development
  • cffi ==1.14.3 development
  • cfgv ==3.2.0 development
  • chardet ==3.0.4 development
  • click ==7.1.2 development
  • colorama ==0.4.3 development
  • colorful ==0.5.4 development
  • cymem ==2.0.5 development
  • distlib ==0.3.1 development
  • filelock ==3.0.12 development
  • ftfy ==5.8 development
  • future ==0.18.2 development
  • fuzzysearch ==0.7.3 development
  • gitdb ==4.0.5 development
  • gitpython ==3.1.12 development
  • google ==3.0.0 development
  • google-api-core ==1.22.4 development
  • google-auth ==1.22.1 development
  • google-cloud ==0.34.0 development
  • google-cloud-bigquery ==1.28.0 development
  • google-cloud-core ==1.4.3 development
  • google-crc32c ==1.0.0 development
  • google-resumable-media ==1.1.0 development
  • googleapis-common-protos ==1.52.0 development
  • googlemaps ==4.4.2 development
  • gpustat ==0.6.0 development
  • grpcio ==1.35.0 development
  • hiredis ==1.1.0 development
  • htmlmin ==0.1.12 development
  • identify ==1.5.6 development
  • idna ==2.10 development
  • importlib-metadata ==2.0.0 development
  • jinja2 ==2.11.2 development
  • jmespath ==0.10.0 development
  • joblib ==0.17.0 development
  • jsmin ==2.2.2 development
  • jsonschema ==3.2.0 development
  • livereload ==2.6.3 development
  • lunr ==0.5.8 development
  • lxml ==4.5.2 development
  • markdown ==3.3.1 development
  • markupsafe ==1.1.1 development
  • mkdocs ==1.1.2 development
  • mkdocs-material ==5.5.14 development
  • mkdocs-material-extensions ==1.0.1 development
  • mkdocs-minify-plugin ==0.3.0 development
  • msgpack ==1.0.2 development
  • multidict ==5.1.0 development
  • murmurhash ==1.0.5 development
  • nltk ==3.5 development
  • nodeenv ==1.5.0 development
  • numpy ==1.19.2 development
  • nvidia-ml-py3 ==7.352.0 development
  • opencensus ==0.7.12 development
  • opencensus-context ==0.1.2 development
  • packaging ==20.9 development
  • pandas ==1.1.3 development
  • pathy ==0.3.5 development
  • pre-commit ==2.7.1 development
  • preshed ==3.0.5 development
  • prometheus-client ==0.9.0 development
  • protobuf ==3.13.0 development
  • psutil ==5.8.0 development
  • py-spy ==0.3.4 development
  • pyasn1 ==0.4.8 development
  • pyasn1-modules ==0.2.8 development
  • pycparser ==2.20 development
  • pydantic ==1.7.3 development
  • pygments ==2.7.1 development
  • pymdown-extensions ==7.1 development
  • pyparsing ==2.4.7 development
  • pyrsistent ==0.17.3 development
  • python-dateutil ==2.8.1 development
  • pytz ==2020.1 development
  • pyyaml ==5.3.1 development
  • ray ==0.8.7 development
  • redis ==3.4.1 development
  • regex ==2020.10.11 development
  • requests ==2.24.0 development
  • rsa ==4.6 development
  • s3transfer ==0.3.3 development
  • sacremoses ==0.0.43 development
  • shellingham ==1.3.2 development
  • six ==1.15.0 development
  • smart-open ==2.2.1 development
  • smmap ==3.0.5 development
  • soupsieve ==2.0.1 development
  • spacy ==3.0.1 development
  • spacy-alignments ==0.7.2 development
  • spacy-legacy ==3.0.1 development
  • spacy-lookups-data ==0.3.2 development
  • spacy-ray ==0.1.0 development
  • spacy-transformers ==1.0.1 development
  • srsly ==2.4.0 development
  • tabulate ==0.8.7 development
  • thinc ==8.0.1 development
  • tokenizers ==0.9.4 development
  • toml ==0.10.1 development
  • torch ==1.7.1 development
  • torchcontrib ==0.0.2 development
  • tornado ==6.0.4 development
  • tqdm ==4.50.2 development
  • transformers ==4.2.2 development
  • typer ==0.3.2 development
  • typer-cli ==0.0.11 development
  • typing-extensions ==3.7.4.3 development
  • urllib3 ==1.25.10 development
  • virtualenv ==20.0.34 development
  • wasabi ==0.8.2 development
  • wcwidth ==0.2.5 development
  • yarl ==1.6.3 development
  • zipp ==3.3.0 development
requirements.txt pypi
  • aiohttp ==3.7.3
  • aioredis ==1.3.1
  • appdirs ==1.4.4
  • async-timeout ==3.0.1
  • attrs ==20.3.0
  • beautifulsoup4 ==4.9.3
  • blessings ==1.7
  • blis ==0.7.4
  • boto3 ==1.15.16
  • botocore ==1.18.16
  • bs4 ==0.0.1
  • cachetools ==4.1.1
  • catalogue ==2.0.1
  • certifi ==2020.6.20
  • cffi ==1.14.3
  • cfgv ==3.2.0
  • chardet ==3.0.4
  • click ==7.1.2
  • colorama ==0.4.3
  • colorful ==0.5.4
  • cymem ==2.0.5
  • distlib ==0.3.1
  • filelock ==3.0.12
  • ftfy ==5.8
  • fuzzysearch ==0.7.3
  • google ==3.0.0
  • google-api-core ==1.22.4
  • google-auth ==1.22.1
  • google-cloud ==0.34.0
  • google-cloud-bigquery ==1.28.0
  • google-cloud-core ==1.4.3
  • google-crc32c ==1.0.0
  • google-resumable-media ==1.1.0
  • googleapis-common-protos ==1.52.0
  • googlemaps ==4.4.2
  • gpustat ==0.6.0
  • grpcio ==1.35.0
  • hiredis ==1.1.0
  • identify ==1.5.6
  • idna ==2.10
  • jinja2 ==2.11.2
  • jmespath ==0.10.0
  • joblib ==0.17.0
  • jsonschema ==3.2.0
  • lxml ==4.5.2
  • markupsafe ==1.1.1
  • msgpack ==1.0.2
  • multidict ==5.1.0
  • murmurhash ==1.0.5
  • nodeenv ==1.5.0
  • numpy ==1.19.2
  • nvidia-ml-py3 ==7.352.0
  • opencensus ==0.7.12
  • opencensus-context ==0.1.2
  • packaging ==20.9
  • pandas ==1.1.3
  • pathy ==0.3.5
  • pre-commit ==2.7.1
  • preshed ==3.0.5
  • prometheus-client ==0.9.0
  • protobuf ==3.13.0
  • psutil ==5.8.0
  • py-spy ==0.3.4
  • pyasn1 ==0.4.8
  • pyasn1-modules ==0.2.8
  • pycparser ==2.20
  • pydantic ==1.7.3
  • pyparsing ==2.4.7
  • pyrsistent ==0.17.3
  • python-dateutil ==2.8.1
  • pytz ==2020.1
  • pyyaml ==5.3.1
  • ray ==0.8.7
  • redis ==3.4.1
  • regex ==2020.10.11
  • requests ==2.24.0
  • rsa ==4.6
  • s3transfer ==0.3.3
  • sacremoses ==0.0.43
  • shellingham ==1.3.2
  • six ==1.15.0
  • smart-open ==2.2.1
  • soupsieve ==2.0.1
  • spacy ==3.0.1
  • spacy-alignments ==0.7.2
  • spacy-legacy ==3.0.1
  • spacy-lookups-data ==0.3.2
  • spacy-ray ==0.1.0
  • spacy-transformers ==1.0.1
  • srsly ==2.4.0
  • tabulate ==0.8.7
  • thinc ==8.0.1
  • tokenizers ==0.9.4
  • toml ==0.10.1
  • torch ==1.7.1
  • torchcontrib ==0.0.2
  • tqdm ==4.50.2
  • transformers ==4.2.2
  • typer ==0.3.2
  • typing-extensions ==3.7.4.3
  • urllib3 ==1.25.10
  • virtualenv ==20.0.34
  • wasabi ==0.8.2
  • wcwidth ==0.2.5
  • yarl ==1.6.3