openscienceindividual
Artificial Intelligence And Open Science In Research Software Engineering
Science Score: 49.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 3 DOI reference(s) in README -
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (15.9%) to scientific vocabulary
Repository
Artificial Intelligence And Open Science In Research Software Engineering
Basic Info
- Host: GitHub
- Owner: jorgesaenzdemiera
- License: mit
- Language: Python
- Default Branch: main
- Size: 64.1 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 2
Metadata Files
README.md
OpenScience
Description
Assesments for Artificial Intelligence And Open Science In Research Software Engineering * Assesment 1. Grobid Analysis with best practices: Create a Gobrid client which will perform an analysis over 10 open-access articles. The program will: - Draw a keyword cloud based on the words found in the abstract of your papers. - Create a visualization showing the number of gures per article. - Create a list of the links found in each paper.
Requirements
- Python 3 (See Python Downloads for instalation)
- Docker (See Docker Downloads for instalation)
- Grobid Server (See Grobid Documentation for more detailed instalation)
- Full image instalation:
docker pull grobid/grobid:0.8.0
- Full image instalation:
- Grobid Python Client (See Grobid Client Python Github for more detailed instalation)
git clone https://github.com/kermitt2/grobid_client_python cd grobid_client_python python3 setup.py install
Installation instructions
- Clone this repository
git clone https://github.com/jorgesaenzdemiera/OpenScience cd OpenScience - Create a Virtual Environment with all the required libraries ``` python -m venv grobid
# En Windows .\grobid\Scripts\activate # En macOS/Linux source grobid/bin/activate
pip install -r requirements.txt ```
Execution instructions
- Include the articles you want to analyze in the input folder
- Run the Grobid Server
docker run --rm --init --ulimit core=0 -p 8070:8070 grobid/grobid:0.8.0 - Run the Grobid Client
cd Assesment\ 1.\ Grobid\ Analysis/code/ python3 grobid.py - Keep the necessary information of the Analysis (Output file will be removed after the next execution of the Client)
Running example
See the example with 10 computer vision articles in the repository.
Output after execution include: * An image featuring the word cloud generated from the abstract of each article processed with Grobid. * An histogram showing the number of figures per article. * A text file with all the links that appear in each one of the articles.
Preferred citation
Use CITATION.cff
Owner
- Login: jorgesaenzdemiera
- Kind: user
- Repositories: 1
- Profile: https://github.com/jorgesaenzdemiera
CodeMeta (codemeta.json)
{
"@context": "https://doi.org/10.5063/schema/codemeta-2.0",
"@type": "SoftwareSourceCode",
"license": "https://spdx.org/licenses/MIT",
"codeRepository": "https://github.com/jorgesaenzdemiera/OpenScience",
"dateCreated": "2024-03-04",
"datePublished": "2024-03-05",
"dateModified": "2024-03-05",
"name": "Grobid PDF Analysis",
"version": "1.0.0",
"identifier": "10.5281/zenodo.10785363",
"applicationCategory": "Data Analysis",
"keywords": [
"open science",
"grobid",
"pdf analysis",
"python",
"github"
],
"programmingLanguage": [
"Python 3"
],
"author": [
{
"@type": "Person",
"givenName": "Jorge",
"familyName": "Senz de Miera",
"email": "jorgesaenzdemiera2@gmail.com"
}
]
}
GitHub Events
Total
Last Year
Dependencies
- python 3.11-buster build
- lfoppiano/grobid 0.8.0
- et-xmlfile ==1.1.0
- grobid_client ==0.8.5
- matplotlib ==3.7.1
- wordcloud ==1.9.3