dedup-asreview
Deduplication challenge for the AI-Aided Systematic Reviewing summer school course at Utrecht University.
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (14.4%) to scientific vocabulary
Repository
Deduplication challenge for the AI-Aided Systematic Reviewing summer school course at Utrecht University.
Basic Info
- Host: GitHub
- Owner: javimangal
- License: mit
- Default Branch: main
- Size: 69.3 KB
Statistics
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Deduplication of bibliographic records with ASySD in R
Deduplication challenge for the AI-Aided Systematic Reviewing summer school course at Utrecht University.
How to use
The suggested use of this repository starts with making sure that R and RStudio are installed in your computer: 1. Install R and RStudio on your computer if you haven't done so. (Note that these analyses were conducted under R version 4.4.0 and RStudio 2024.12.1). 2. Clone this repository. If you do not know how to do this, you can follow these instructions. Alternatively, you can download the ZIP file, unpack it, and place it in a folder in your computer. 3. You should now have all these files in your computer with an identical folder structure (described in the following section). 4. In the main directory, open the file named dedup-asreview.Rproj in RStudio. 5. You can navigate through the folders on the right-bottom panel of R Studio. Open the R folder. You should now see a series of files ending with .qmd. 6. Open one of the .qmd files. You can run every chunk of code sequentially to reproduce the analyses. Make sure to respect the order and if something fails, I recommend that you start running al chunks of code from the beginning. If you don't know how to run a chunk of code, you can imitate what this person is doing. If you get a message saying "Access denied", change from Visual to Source mode which can be done with the Ctrl+Shift+F4 command. 7. Please note that scripts are meant to be sourced into the flow of analyses in the main .qmd files. You may encounter problems if you attempt to run the scripts independently.
If you are not able to follow the prior steps, you may also consider reviewing the PDF reports documenting the analyses.
- deduplication.qmd. Main commented code file for deduplication with ASySD. PDF
Project Structure
The project structure distinguishes three kinds of folders: - read-only (RO): not edited by either code or researcher - human-writeable (HW): edited by the researcher only. - project-generated (PG): folders generated when running the code; these folders can be deleted or emptied and will be completely reconstituted as the project is run.
``` . ├── .gitignore ├── CITATION.cff ├── LICENSE ├── README.md ├── data <- All project data, ignored by git │ ├── processed <- The final, canonical data sets for modeling. (PG) │ ├── raw <- The original, immutable data dump. (RO) │ └── temp <- Intermediate data that has been transformed. (PG) ├── docs <- Documentation notebook for users (HW) │ ├── manuscript <- Manuscript source, e.g., LaTeX, Markdown, etc. (HW) │ └── reports <- Other project reports and notebooks (e.g. Jupyter, .Rmd) (HW) ├── results │ ├── figures <- Figures for the manuscript or reports (PG) │ └── output <- Other output for the manuscript or reports (PG) └── R <- Source code for this project (HW)
```
Contact
You can contact me or post a request in this repository in case you encounter any issues.
License
This project is licensed under the terms of the MIT License.
This project structure repository is adapted from the Utrecht University simple R project template, which builds upon the Good Enough Project Cookiecutter template by Barbara Vreede (2019).
Owner
- Name: Javier Mancilla Galindo
- Login: javimangal
- Kind: user
- Location: Utrecht, Netherlands
- Company: @UtrechtUniversity
- Twitter: Javimangal
- Repositories: 1
- Profile: https://github.com/javimangal
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
title: "Deduplication of bibliographic records with ASySD in R"
authors:
- given-names: Javier
family-names: Mancilla Galindo
email: j.mancillagalindo@uu.nl
affiliation: >-
Institute for Risk Assessment Sciences, Utrecht
University
orcid: 'https://orcid.org/0000-0002-0718-467X'
license: MIT
repository-code: "https://github.com/javimangal/dedup-asreview"
GitHub Events
Total
- Push event: 2
- Create event: 2
Last Year
- Push event: 2
- Create event: 2