openscienceindividual

Artificial Intelligence And Open Science In Research Software Engineering

https://github.com/jorgesaenzdemiera/openscienceindividual

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (15.9%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

Artificial Intelligence And Open Science In Research Software Engineering

Basic Info
  • Host: GitHub
  • Owner: jorgesaenzdemiera
  • License: mit
  • Language: Python
  • Default Branch: main
  • Size: 64.1 MB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 2
Created over 2 years ago · Last pushed over 2 years ago
Metadata Files
Readme License Citation Codemeta

README.md

OpenScience

DOI Documentation Status

Description

Assesments for Artificial Intelligence And Open Science In Research Software Engineering * Assesment 1. Grobid Analysis with best practices: Create a Gobrid client which will perform an analysis over 10 open-access articles. The program will: - Draw a keyword cloud based on the words found in the abstract of your papers. - Create a visualization showing the number of gures per article. - Create a list of the links found in each paper.

Requirements

  • Python 3 (See Python Downloads for instalation)
  • Docker (See Docker Downloads for instalation)
  • Grobid Server (See Grobid Documentation for more detailed instalation)
    • Full image instalation:
      docker pull grobid/grobid:0.8.0
  • Grobid Python Client (See Grobid Client Python Github for more detailed instalation) git clone https://github.com/kermitt2/grobid_client_python cd grobid_client_python python3 setup.py install

Installation instructions

  1. Clone this repository git clone https://github.com/jorgesaenzdemiera/OpenScience cd OpenScience
  2. Create a Virtual Environment with all the required libraries ``` python -m venv grobid

# En Windows .\grobid\Scripts\activate # En macOS/Linux source grobid/bin/activate

pip install -r requirements.txt ```

Execution instructions

  1. Include the articles you want to analyze in the input folder
  2. Run the Grobid Server docker run --rm --init --ulimit core=0 -p 8070:8070 grobid/grobid:0.8.0
  3. Run the Grobid Client cd Assesment\ 1.\ Grobid\ Analysis/code/ python3 grobid.py
  4. Keep the necessary information of the Analysis (Output file will be removed after the next execution of the Client)

Running example

See the example with 10 computer vision articles in the repository.

Output after execution include: * An image featuring the word cloud generated from the abstract of each article processed with Grobid. * An histogram showing the number of figures per article. * A text file with all the links that appear in each one of the articles.

Preferred citation

Use CITATION.cff

Owner

  • Login: jorgesaenzdemiera
  • Kind: user

CodeMeta (codemeta.json)

{
  "@context": "https://doi.org/10.5063/schema/codemeta-2.0",
  "@type": "SoftwareSourceCode",
  "license": "https://spdx.org/licenses/MIT",
  "codeRepository": "https://github.com/jorgesaenzdemiera/OpenScience",
  "dateCreated": "2024-03-04",
  "datePublished": "2024-03-05",
  "dateModified": "2024-03-05",
  "name": "Grobid PDF Analysis",
  "version": "1.0.0",
  "identifier": "10.5281/zenodo.10785363",
  "applicationCategory": "Data Analysis",
  "keywords": [
    "open science",
    "grobid",
    "pdf analysis",
    "python",
    "github"
  ],
  "programmingLanguage": [
    "Python 3"
  ],
  "author": [
    {
      "@type": "Person",
      "givenName": "Jorge",
      "familyName": "Senz de Miera",
      "email": "jorgesaenzdemiera2@gmail.com"
    }
  ]
}

GitHub Events

Total
Last Year

Dependencies

Dockerfile docker
  • python 3.11-buster build
docker-compose.yml docker
  • lfoppiano/grobid 0.8.0
docs/requirements.txt pypi
  • et-xmlfile ==1.1.0
  • grobid_client ==0.8.5
  • matplotlib ==3.7.1
  • wordcloud ==1.9.3