ai
This repository will help me understand artificial intelligence's subject as well as how to use GitHub and Zenodo
Science Score: 57.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 2 DOI reference(s) in README -
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (16.6%) to scientific vocabulary
Repository
This repository will help me understand artificial intelligence's subject as well as how to use GitHub and Zenodo
Basic Info
- Host: GitHub
- Owner: Pokoyokjk
- License: cc0-1.0
- Language: Python
- Default Branch: main
- Size: 8.96 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 3
Metadata Files
README.md
Description
The project consists on three scripts which iterate over a list of PDF paths. For each PDF, the processes the article, extract the abstract, generate a word cloud, count figures, and extracts links. The results are printed to the console.
For a detailed code explanation consult the release section.
Requirements
Ubuntu 22.04
Python 3.10
Grobid 0.8.0
Grobidclientpython 0.0.8
Conda 23.11.0
matplotlib 3.4.3
wordcloud 1.8.1
beautifulsoup4 4.10.0
Installation and execution instructions (in Ubuntu)
1. Install Python
$ sudo apt update
$ sudo apt install python3
2. Create a safe environment to install packages without affecting the system-wide Python installation
Link for conda installation: https://docs.conda.io/projects/conda/en/latest/user-guide/install/index.html
Commands to create and activate the environment
$ conda activate
$ conda create -n
$ conda activate
3. Make sure you have the following Python libraries installed. You can install them with the commands:
$ pip install wordcloud
$ pip install matplotlib
$ pip install beautifulsoup4
4. Install grobid
- Ensure you have the JDK installed:
$ sudo apt-get update
$ sudo apt-get install default-jdk
- Get Grobid's GitHub repository
$ wget --no-check-certificate https://github.com/kermitt2/grobid/archive/0.8.0.zip
- Unzip
$ unzip 0.8.0.zip
($ sudo apt install unzip)
5. Install grobidclientpython
Link to the GitHub repository: https://github.com/kermitt2/grobidclientpython/blob/master/Readme.md
6. Create a directory
Inside, you will add the pdfs you want to proccess.
7. Download the code from this GitHub's repository
You can find it in the release section or in the code file.
Change the directory's path for your own (created in 6). You can see the preestablished name is "pdfs".
8. Run the code
Run Grobid (if the following command does not work, see other options in Grobid's GitHub repository).
$ cd grobid-0.8.0
$ ./gradlew run
You can check if it is working by accessing http://localhost:8070/
While running grobid in the first Linux window, change your directory to grobidclientpython in another window.
$ cd grobidclientpython
And then, run your chosen script
$ pyhton /path/of/the/script
Running examples
Consult the repository's tests file.
Citation
Consult the repository's citation file.
Where to get help
Grobid's GitHub repository: https://github.com/kermitt2/grobid/
Conda's web installation page: https://docs.conda.io/projects/conda/en/latest/user-guide/install/index.html
Contact: gloria.cumia@alumnos.upm.es
DOI
https://doi.org/10.5281/zenodo.10732799
Owner
- Login: Pokoyokjk
- Kind: user
- Repositories: 1
- Profile: https://github.com/Pokoyokjk
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - family-names: "Gloria" given-names: "Cumia" orcid: "https://orcid.org/0000-0000-0000-0000" title: "AI" version: 2.3.3 doi: 10.5281/zenodo.1234 date-released: 2024-03-05 url: "https://github.com/Pokoyokjk/AI"
GitHub Events
Total
Last Year
Dependencies
- python 3.10 build
- beautifulsoup4 ==4.10.0
- matplotlib ==3.4.3
- wordcloud ==1.8.1