refextract
Science Score: 31.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: Found CITATION.cff file
- ✓ codemeta.json file: Found codemeta.json file
- ○ .zenodo.json file
- ○ DOI references
- ○ Academic publication links
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: Low similarity (10.7%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: Refuburst
- Language: Jupyter Notebook
- Default Branch: main
- Size: 6.82 MB
Statistics
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Reference/Citation Buster
This Python script analyzes citations and references in PDF documents. It extracts text from a PDF file, identifies the citations and references in it, and checks them against criteria such as Harvard citation syntax and alphabetical reference order.
Installation
Prerequisites: Anaconda installed on your system
Steps:
Open Anaconda
Open Anaconda Terminal
Run the installdependancies.py script to install the required modules:
python installdependancies.py
Run the app.py script to start the application:
python app.py
Usage
Upload a PDF file using the provided interface.
The script extracts text from the PDF and analyzes its citations and references.
The results are displayed, including the presence of Harvard citations, citation counts, and reference order checks.
You can download CSV files containing detailed citation and reference data for further analysis.
Functionality
Extract Text from PDF: Utilizes various libraries to extract text from PDF files.
Citation Analysis: Analyzes citations in the extracted text, checks citation syntax, and counts citation occurrences.
Reference Order Check: Verifies whether the references in the PDF are in alphabetical order.
Harvard Citation Generation: Generates Harvard-style citations for references found in the PDF.
Downloadable Reports: Provides CSV files containing detailed citation and reference data for download.
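The citation analysis and reference order check described above can be illustrated with a minimal sketch. It is not the project's code: the regular expression, the function names (find_citations, count_citations, references_in_order), and the sample text are all assumptions made for illustration. It recognises only parenthetical citations of the form "(Author, Year)" or "(Author et al., Year)" and compares reference entries case-insensitively.

```python
import re
from collections import Counter

# Hypothetical pattern (not taken from the project) for parenthetical
# Harvard citations such as "(Verma, 2023)" or "(Galli et al., 2022)".
CITATION_RE = re.compile(
    r"\((?P<author>[A-Z][\w'-]*(?:\s+(?:and|&)\s+[A-Z][\w'-]*)?)"
    r"(?P<et_al>\s+et\s+al\.?)?,\s*(?P<year>\d{4}[a-z]?)\)"
)


def find_citations(text):
    """Return one dict per parenthetical citation found in text."""
    return [
        {
            "citation": match.group(0),
            "author": match.group("author"),
            "year": match.group("year"),
            "et_al": match.group("et_al") is not None,
        }
        for match in CITATION_RE.finditer(text)
    ]


def count_citations(text):
    """Count how often each distinct citation string occurs."""
    return Counter(item["citation"] for item in find_citations(text))


def references_in_order(references):
    """Return True if the non-empty entries are sorted alphabetically."""
    keys = [ref.strip().casefold() for ref in references if ref.strip()]
    return all(first <= second for first, second in zip(keys, keys[1:]))


if __name__ == "__main__":
    # Made-up sample sentence, used only to exercise the functions.
    sample = "Prior work (Verma, 2023) agrees with (Sharma, 2022)."
    print(find_citations(sample))
    print(count_citations(sample))
    print(references_in_order(["Bhandari, 2024", "Galli, 2022"]))
```

A real reference list would likely need a more careful sort key (for example, ignoring leading punctuation or handling accented names), so the plain casefold comparison here should be read as a simplification.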
Contributors
Glen Frank Nneka Jessie Rahul Shiva Sowmya Suresh Uday Sindhura Nikhil Harshini Vijay Rajesh
License
This project is licensed under the MIT License - see the LICENSE file for details.
Owner
- Login: Refuburst
- Kind: user
- Repositories: 1
- Profile: https://github.com/Refuburst
Citation (citation_data.csv)
Citations,Author,Year,Form,In Sentence Use,et_al syntax
"(Eddie_4072, 2024)",Eddie_4072 ,2024,end,✔,n/a
"(Verma, 2023)",Verma ,2023,end,✔,n/a
"(Sharma, 2022)",Sharma ,2022,end,✔,n/a
"(Galli, 2022)","Galli ",2022,end,✔,n/a
"(Bhandari, 2024)",Bhandari ,2024,end,✔,n/a
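For further analysis of the downloaded data, a file with this column layout can be read with Python's standard csv module. The sketch below is an assumption about how one might consume citation_data.csv, not part of the project: the helper name load_citation_rows is hypothetical, and the sample rows are copied from the data above. It strips the trailing spaces seen in the Author column and converts Year to an integer.

```python
import csv
import io

# Three rows copied from citation_data.csv, embedded so the sketch is
# self-contained. Note the trailing spaces in the Author column.
SAMPLE_CSV = '''Citations,Author,Year,Form,In Sentence Use,et_al syntax
"(Eddie_4072, 2024)",Eddie_4072 ,2024,end,✔,n/a
"(Verma, 2023)",Verma ,2023,end,✔,n/a
"(Galli, 2022)","Galli ",2022,end,✔,n/a
'''


def load_citation_rows(handle):
    """Read citation rows from an open text handle (hypothetical helper).

    Surrounding whitespace is stripped from every field and the Year
    column is converted to int.
    """
    rows = []
    for row in csv.DictReader(handle):
        cleaned = {key: (value or "").strip() for key, value in row.items()}
        cleaned["Year"] = int(cleaned["Year"])
        rows.append(cleaned)
    return rows


if __name__ == "__main__":
    for entry in load_citation_rows(io.StringIO(SAMPLE_CSV)):
        print(entry["Author"], entry["Year"], entry["Form"])
```

To read the downloaded file instead of the embedded sample, pass a handle opened with open("citation_data.csv", newline="", encoding="utf-8"); the UTF-8 encoding is an assumption based on the ✔ character in the data.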