text2sdg

Detect UN Sustainable Development Goals in Text

https://github.com/dwulff/text2sdg

Science Score: 46.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 5 DOI reference(s) in README
  • Academic publication links
    Links to: arxiv.org, zenodo.org
  • Committers with academic emails
    1 of 6 committers (16.7%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (18.3%) to scientific vocabulary

Keywords

natural-language-processing sustainability sustainable-development sustainable-development-goals
Last synced: 6 months ago · JSON representation

Repository

Detect UN Sustainable Development Goals in Text

Basic Info
  • Host: GitHub
  • Owner: dwulff
  • Language: R
  • Default Branch: main
  • Homepage: https://text2sdg.io
  • Size: 20.9 MB
Statistics
  • Stars: 21
  • Watchers: 2
  • Forks: 4
  • Open Issues: 3
  • Releases: 0
Topics
natural-language-processing sustainability sustainable-development sustainable-development-goals
Created almost 5 years ago · Last pushed over 1 year ago
Metadata Files
Readme Changelog

README.md

cran version DOI downloads

text2sdg

The United Nations’ Sustainable Development Goals (SDGs) have become an important guideline for higher-education and research institutions to monitor and plan their contributions to social, economic, and environmental transformations.

The text2sdg package is the first open-source, multi-system analysis package that identifies SDGs in text, opening up the opportunity to monitor any type of text-based data, including scientific output and corporate publications.

General Information

The text2sdg package is developed by Dirk U. Wulff and Dominik S. Meier, with contributions from Rui Mata and the Center for Cognitive and Decision Sciences. It is published under the GNU General Public License.

An overview of the package can be accessed online or from within R using ?text2sdg.

Installation

The current stable version is available on CRAN and can be installed via install.packages("text2sdg").

The latest development version on GitHub can be installed via devtools::install_github("dwulff/text2sdg"). Note that this requires prior installation of the devtools package.

Usage

To identify SDGs in a series of documents, the user can choose between two approaches, an individual systems approach implementing six individual query systems and an ensemble approach powered by machine learning that integrates these systems. It is recommended to use the more accurate and bias-free ensemble approach (see Wulff, Meier, & Mata, 2024).

```r

vector of texts

texts = c("This is text 1", "This is text 2")

individual systems approach

hits = detectsdgsystems(texts)

ensemble approach

hits = detect_sdg(texts) ```

For a complete tutorial on the use of the package, visit this page or call vignette("text2sdg") from within R.

Citation

If you use the text2sdg package for published work, we kindly ask that you cite the package as follows:

Meier, D. S., Mata, R., & Wulff, D. U. (2021). text2sdg: An open-source solution to monitoring sustainable development goals from text. arXiv. https://arxiv.org/abs/2110.05856

Depending on the use of the package, also consider referencing the following article:

Wulff, D. U., Meier, D. S., & Mata, R. (2024). Using novel data and ensemble models to improve automated labeling of Sustainable Development Goals. Sustainability Science. https://doi.org/10.1007/s11625-024-01516-3

Owner

  • Name: Dirk Wulff
  • Login: dwulff
  • Kind: user
  • Location: Basel, Switzerland
  • Company: Center for Cognitive Decision Science, University of Basel

Cognitive, decision, and data science at University of Basel and MPI for Human Development, Berlin.

GitHub Events

Total
  • Issues event: 2
  • Watch event: 3
Last Year
  • Issues event: 2
  • Watch event: 3

Committers

Last synced: 6 months ago

All Time
  • Total Commits: 271
  • Total Committers: 6
  • Avg Commits per committer: 45.167
  • Development Distribution Score (DDS): 0.568
Past Year
  • Commits: 2
  • Committers: 1
  • Avg Commits per committer: 2.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Dominik Meier d****r@o****m 117
Dirk Wulff d****f@u****m 108
Dominik Meier d****r@u****h 28
Dirk Wulff d****f@g****m 13
Dirk Wulff d****f@M****l 3
Dominik Meier 4****s@u****m 2
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 6
  • Total pull requests: 2
  • Average time to close issues: 24 minutes
  • Average time to close pull requests: 3 days
  • Total issue authors: 5
  • Total pull request authors: 1
  • Average comments per issue: 0.33
  • Average comments per pull request: 0.0
  • Merged pull requests: 2
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 3
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 3
  • Pull request authors: 0
  • Average comments per issue: 0.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • krlmlr (2)
  • psychobas (1)
  • DataStrategist (1)
  • robertrentzsch (1)
  • grlju (1)
Pull Request Authors
  • grlju (2)
Top Labels
Issue Labels
enhancement (1)
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • cran 1,105 last-month
  • Total docker downloads: 48
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 9
  • Total maintainers: 1
cran.r-project.org: text2sdg

Detecting UN Sustainable Development Goals in Text

  • Versions: 9
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 1,105 Last month
  • Docker Downloads: 48
Rankings
Forks count: 17.8%
Stargazers count: 18.7%
Downloads: 24.9%
Average: 25.3%
Dependent packages count: 29.8%
Dependent repos count: 35.5%
Maintainers (1)
Last synced: 6 months ago

Dependencies

DESCRIPTION cran
  • R >= 3.5.0 depends
  • corpustools >= 0.4.2 imports
  • dplyr * imports
  • ggplot2 * imports
  • lifecycle * imports
  • magrittr * imports
  • stringr * imports
  • tibble * imports
  • tidyr * imports
  • knitr * suggests
  • rmarkdown * suggests
  • testthat * suggests