https://github.com/climatecompatiblegrowth/research_index_backend
Provisioning the graph database which forms the backend of the research index
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ○ CITATION.cff file
- ✓ codemeta.json file (found codemeta.json file)
- ✓ .zenodo.json file (found .zenodo.json file)
- ○ DOI references
- ○ Academic publication links
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: low similarity (9.4%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: ClimateCompatibleGrowth
- License: MIT
- Language: Jupyter Notebook
- Default Branch: main
- Size: 329 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 17
- Releases: 0
Metadata Files
readme.md
Ingest and Natural Language Processing of Research Outputs
The package is not yet deployed to PyPI. Only an editable (development) install is possible.
- Provide a list of DOIs in a CSV file format: list_of_dois.csv
- Clone the repository: git clone https://github.com/ClimateCompatibleGrowth/research_index_backend.git
- Change directory: cd research_index_backend
- Install the package as an editable package (development install): pip install -e .
- Obtain an OpenAIRE Graph refresh token and create a .env file with the following parameters:
MG_HOST=
MG_PORT=
MG_PORT_ALT=
MG_USER=
MG_PASS=
ORCID_NAME_SIMILARITY_THRESHOLD=
NAME_SIMILARITY_THRESHOLD=
OPENAIRE_API="https://api.openaire.eu"
OPENAIRE_SERVICE="https://services.openaire.eu"
REFRESH_TOKEN=
Provision Memgraph graph database and set up environment variables
Once the VM is up and running, SSH into it, then download and install Memgraph:
$ curl -O https://download.memgraph.com/memgraph/v2.14.1/ubuntu-20.04/memgraph_2.14.1-1_amd64.deb
$ sudo dpkg -i ./memgraph_2.14.1-1_amd64.deb
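Before running the backend it can help to confirm that the .env file defines every parameter listed above. The following is a minimal sketch using only the standard library; `parse_env_file` and `missing_keys` are hypothetical helper names and are not part of the package, which may load its configuration differently.

```python
# Keys taken from the .env template in this README.
REQUIRED_KEYS = [
    "MG_HOST", "MG_PORT", "MG_PORT_ALT", "MG_USER", "MG_PASS",
    "ORCID_NAME_SIMILARITY_THRESHOLD", "NAME_SIMILARITY_THRESHOLD",
    "OPENAIRE_API", "OPENAIRE_SERVICE", "REFRESH_TOKEN",
]


def parse_env_file(path):
    """Parse simple KEY=VALUE lines, ignoring blank lines and # comments."""
    values = {}
    with open(path, encoding="utf-8") as handle:
        for raw in handle:
            line = raw.strip()
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            # Drop optional surrounding quotes, as used for the OpenAIRE URLs.
            values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(values):
    """Return the required keys that are absent or left empty."""
    return [key for key in REQUIRED_KEYS if not values.get(key)]
```

Calling `missing_keys(parse_env_file(".env"))` returns an empty list when every parameter has a value, and otherwise names the ones still to be filled in.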
Run the backend:
$ research_index --help
usage: research_index [-h] [-i] [-l LIMIT] [-u] list_of_dois

positional arguments:
  list_of_dois           Path to CSV file containing list of DOIs

options:
  -h, --help             Show this help message and exit
  -i, --initialise       Delete existing data and create new database
  -l, --limit N          Limit number of DOIs to process (default: 50)
  -u, --update-metadata  Update metadata for existing DOIs
  -w, --write-metadata   Save JSON responses to disk

Examples:
-> Process 10 DOIs from file:
   $ research_index list_of_dois.csv -l 10
-> Update metadata for existing DOIs and save metadata:
   $ research_index list_of_dois.csv --update-metadata --write-metadata
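To illustrate how the `list_of_dois` argument and the `--limit` option interact, here is a rough sketch of reading a DOI list and capping it at a limit. It assumes the DOIs sit in the first CSV column and that an optional header row may be present; the README does not specify the file layout, and `read_dois` is a hypothetical helper, not the package's own reader.

```python
import csv
import io


def read_dois(handle, limit=50):
    """Return up to `limit` unique DOIs from the first CSV column."""
    dois = []
    seen = set()
    for row in csv.reader(handle):
        if not row:
            continue
        doi = row[0].strip()
        # Strip a resolver prefix so bare and URL forms compare equal.
        for prefix in ("https://doi.org/", "http://doi.org/", "doi:"):
            if doi.lower().startswith(prefix):
                doi = doi[len(prefix):]
                break
        # Skip blanks and anything that does not look like a DOI (e.g. a header).
        if not doi.startswith("10.") or "/" not in doi:
            continue
        key = doi.lower()  # DOIs are case-insensitive
        if key in seen:
            continue
        seen.add(key)
        dois.append(doi)
        if len(dois) >= limit:
            break
    return dois


sample = io.StringIO("doi\n10.1000/abc\nhttps://doi.org/10.1000/ABC\n10.1000/xyz\n")
print(read_dois(sample, limit=10))  # ['10.1000/abc', '10.1000/xyz']
```

The default of 50 mirrors the documented default for `--limit`; deduplication and prefix stripping are design choices of this sketch rather than documented behaviour of the tool.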
Development
The package is maintained using hatch.
To run the tests, use:
hatch test
Owner
- Name: Climate Compatible Growth
- Login: ClimateCompatibleGrowth
- Kind: organization
- Location: United Kingdom
- Website: www.climatecompatiblegrowth.com
- Twitter: ResearchCcg
- Repositories: 41
- Profile: https://github.com/ClimateCompatibleGrowth
GitHub Events
Total
- Issues event: 17
- Delete event: 9
- Issue comment event: 13
- Push event: 47
- Pull request review comment event: 18
- Pull request review event: 18
- Pull request event: 22
- Create event: 11
Last Year
- Issues event: 17
- Delete event: 9
- Issue comment event: 13
- Push event: 47
- Pull request review comment event: 18
- Pull request review event: 18
- Pull request event: 22
- Create event: 11
Issues and Pull Requests
Last synced: 10 months ago
All Time
- Total issues: 8
- Total pull requests: 7
- Average time to close issues: 25 days
- Average time to close pull requests: 9 days
- Total issue authors: 2
- Total pull request authors: 2
- Average comments per issue: 0.13
- Average comments per pull request: 0.71
- Merged pull requests: 5
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 8
- Pull requests: 7
- Average time to close issues: 25 days
- Average time to close pull requests: 9 days
- Issue authors: 2
- Pull request authors: 2
- Average comments per issue: 0.13
- Average comments per pull request: 0.71
- Merged pull requests: 5
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- willu47 (13)
- FrancisTembo (5)
Pull Request Authors
- willu47 (8)
- FrancisTembo (5)
Dependencies
- gqlalchemy *
- pandas *
- rdflib *
- requests *
- tqdm *
- anyio ==3.6.2
- appnope ==0.1.3
- argon2-cffi ==21.3.0
- argon2-cffi-bindings ==21.2.0
- arrow ==1.2.3
- asttokens ==2.2.1
- attrs ==22.2.0
- backcall ==0.2.0
- beautifulsoup4 ==4.11.2
- bleach ==6.0.0
- certifi ==2022.12.7
- cffi ==1.15.1
- charset-normalizer ==3.0.1
- comm ==0.1.2
- contourpy ==1.0.7
- cycler ==0.11.0
- debugpy ==1.6.6
- decorator ==5.1.1
- defusedxml ==0.7.1
- et-xmlfile ==1.1.0
- executing ==1.2.0
- fastjsonschema ==2.16.2
- filelock ==3.9.0
- fonttools ==4.38.0
- fqdn ==1.5.1
- huggingface-hub ==0.12.0
- idna ==3.4
- ipykernel ==6.21.2
- ipython ==8.10.0
- ipython-genutils ==0.2.0
- ipywidgets ==8.0.4
- isoduration ==20.11.0
- jedi ==0.18.2
- jinja2 ==3.1.2
- jsonpointer ==2.3
- jsonschema ==4.17.3
- jupyter ==1.0.0
- jupyter-client ==8.0.3
- jupyter-console ==6.6.1
- jupyter-core ==5.2.0
- jupyter-events ==0.6.3
- jupyter-server ==2.3.0
- jupyter-server-terminals ==0.4.4
- jupyterlab-pygments ==0.2.2
- jupyterlab-widgets ==3.0.5
- kiwisolver ==1.4.4
- markupsafe ==2.1.2
- matplotlib ==3.7.0
- matplotlib-inline ==0.1.6
- mistune ==2.0.5
- nbclassic ==0.5.2
- nbclient ==0.7.2
- nbconvert ==7.2.9
- nbformat ==5.7.3
- nest-asyncio ==1.5.6
- notebook ==6.5.2
- notebook-shim ==0.2.2
- openpyxl ==3.1.0
- packaging ==23.0
- pandas ==1.5.3
- pandocfilters ==1.5.0
- parso ==0.8.3
- pexpect ==4.8.0
- pickleshare ==0.7.5
- platformdirs ==3.0.0
- prometheus-client ==0.16.0
- prompt-toolkit ==3.0.37
- psutil ==5.9.4
- ptyprocess ==0.7.0
- pure-eval ==0.2.2
- pycparser ==2.21
- pygments ==2.14.0
- pyparsing ==3.0.9
- pyrsistent ==0.19.3
- python-dateutil ==2.8.2
- python-json-logger ==2.0.7
- pytz ==2022.7.1
- pyyaml ==6.0
- pyzmq ==25.0.0
- qtconsole ==5.4.0
- qtpy ==2.3.0
- regex ==2022.10.31
- requests ==2.28.2
- rfc3339-validator ==0.1.4
- rfc3986-validator ==0.1.1
- seaborn ==0.12.2
- send2trash ==1.8.0
- sniffio ==1.3.0
- soupsieve ==2.4
- stack-data ==0.6.2
- terminado ==0.17.1
- tinycss2 ==1.2.1
- tokenizers ==0.13.2
- tornado ==6.2
- tqdm ==4.64.1
- traitlets ==5.9.0
- transformers ==4.26.1
- uri-template ==1.2.0
- urllib3 ==1.26.14
- wcwidth ==0.2.6
- webcolors ==1.12
- webencodings ==0.5.1
- websocket-client ==1.5.1
- widgetsnbextension ==4.0.5