ai

This repository will help me understand artificial intelligence's subject as well as how to use GitHub and Zenodo

https://github.com/pokoyokjk/ai

Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (16.6%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

This repository will help me understand artificial intelligence's subject as well as how to use GitHub and Zenodo

Basic Info
  • Host: GitHub
  • Owner: Pokoyokjk
  • License: cc0-1.0
  • Language: Python
  • Default Branch: main
  • Size: 8.96 MB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 3
Created about 2 years ago · Last pushed almost 2 years ago
Metadata Files
Readme License Citation

README.md

Description

The project consists on three scripts which iterate over a list of PDF paths. For each PDF, the processes the article, extract the abstract, generate a word cloud, count figures, and extracts links. The results are printed to the console.

For a detailed code explanation consult the release section.

Requirements

Ubuntu 22.04

Python 3.10

Grobid 0.8.0

Grobidclientpython 0.0.8

Conda 23.11.0

matplotlib 3.4.3

wordcloud 1.8.1

beautifulsoup4 4.10.0

Installation and execution instructions (in Ubuntu)

1. Install Python

$ sudo apt update

$ sudo apt install python3

2. Create a safe environment to install packages without affecting the system-wide Python installation

Link for conda installation: https://docs.conda.io/projects/conda/en/latest/user-guide/install/index.html

Commands to create and activate the environment

$ conda activate

$ conda create -n python=3.10

$ conda activate

3. Make sure you have the following Python libraries installed. You can install them with the commands:

$ pip install wordcloud

$ pip install matplotlib

$ pip install beautifulsoup4

4. Install grobid

  • Ensure you have the JDK installed:

$ sudo apt-get update

$ sudo apt-get install default-jdk

  • Get Grobid's GitHub repository

$ wget --no-check-certificate https://github.com/kermitt2/grobid/archive/0.8.0.zip

  • Unzip

$ unzip 0.8.0.zip

($ sudo apt install unzip)

5. Install grobidclientpython

Link to the GitHub repository: https://github.com/kermitt2/grobidclientpython/blob/master/Readme.md

6. Create a directory

Inside, you will add the pdfs you want to proccess.

7. Download the code from this GitHub's repository

You can find it in the release section or in the code file.

Change the directory's path for your own (created in 6). You can see the preestablished name is "pdfs".

8. Run the code

Run Grobid (if the following command does not work, see other options in Grobid's GitHub repository).

$ cd grobid-0.8.0

$ ./gradlew run

You can check if it is working by accessing http://localhost:8070/

While running grobid in the first Linux window, change your directory to grobidclientpython in another window.

$ cd grobidclientpython

And then, run your chosen script

$ pyhton /path/of/the/script

Running examples

Consult the repository's tests file.

Citation

Consult the repository's citation file.

Where to get help

Grobid's GitHub repository: https://github.com/kermitt2/grobid/

Conda's web installation page: https://docs.conda.io/projects/conda/en/latest/user-guide/install/index.html

Contact: gloria.cumia@alumnos.upm.es

DOI

https://doi.org/10.5281/zenodo.10732799

Owner

  • Login: Pokoyokjk
  • Kind: user

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Gloria"
  given-names: "Cumia"
  orcid: "https://orcid.org/0000-0000-0000-0000"
title: "AI"
version: 2.3.3
doi: 10.5281/zenodo.1234
date-released: 2024-03-05
url: "https://github.com/Pokoyokjk/AI"

GitHub Events

Total
Last Year

Dependencies

docker/Dockerfile docker
  • python 3.10 build
docker/docker-compose.yml docker
dependencies/requirements.txt pypi
  • beautifulsoup4 ==4.10.0
  • matplotlib ==3.4.3
  • wordcloud ==1.8.1