wikimorphemes

Extracting Morphemes from Wiktionary Entries

https://github.com/macmillancontentscience/wikimorphemes

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.7%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Extracting Morphemes from Wiktionary Entries

Basic Info
  • Host: GitHub
  • Owner: macmillancontentscience
  • License: other
  • Language: R
  • Default Branch: main
  • Size: 224 KB
Statistics
  • Stars: 2
  • Watchers: 1
  • Forks: 1
  • Open Issues: 22
  • Releases: 0
Created about 5 years ago · Last pushed over 3 years ago
Metadata Files
Readme Contributing License Code of conduct Citation

README.Rmd

---
output: github_document
---



```{r, include = FALSE}
knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>",
  fig.path = "man/figures/README-",
  out.width = "100%"
)
```

# wikimorphemes


[![Lifecycle: experimental](https://img.shields.io/badge/lifecycle-experimental-orange.svg)](https://lifecycle.r-lib.org/articles/stages.html#experimental)


The goal of wikimorphemes is to provide tools for extracting morphemes from Wiktionary entries.
The wikimorphemes api is *not yet stable* and may change.

## Installation

You can install the released version of wikimorphemes from [CRAN](https://CRAN.R-project.org) with:

``` r
# No you can't.
#install.packages("wikimorphemes")
```

And the development version from [GitHub](https://github.com/) with:

``` r
# install.packages("devtools")
devtools::install_github("macmillancontentscience/wikimorphemes")
```

## Examples

The main function in wikimorphemes is `process_word`.

```{r process-word}
library(wikimorphemes)
process_word("unaffable")
process_word("understandable")
process_word("standing")
```

## Contributing to {wikimorphemes}

If you are going to work on this package, please begin by running `download_wikimorphemes_lookup()`. This will download the latest cached version of the lookup (10.7MB), which will then be used in tests.

## Code of Conduct
  
Please note that the wikimorphemes project is released with a [Contributor Code of Conduct](https://contributor-covenant.org/version/2/0/CODE_OF_CONDUCT.html). 
By contributing to this project, you agree to abide by its terms.

Owner

  • Name: Macmillan Learning Content Science
  • Login: macmillancontentscience
  • Kind: organization
  • Email: jon.harmon@macmillan.com
  • Location: Austin, TX

A team of data scientists researching and applying content analysis techniques

Citation (CITATION.cff)

cff-version: 1.2.0
title: wikimorphemes
message: >-
  Please cite this package using the metadata from this file.
type: software
authors:
  - given-names: Jon
    family-names: Harmon
    email: jonthegeek@gmail.com
    orcid: 'https://orcid.org/0000-0003-4781-4346'
    affiliation: Macmillan Learning
  - given-names: Jonathan
    family-names: Bratt
    email: jonathan.bratt@macmillan.com
    affiliation: Macmillan Learning
    orcid: 'https://orcid.org/0000-0003-2859-0076'
url: >-
  https://github.com/macmillancontentscience/wikimorphemes
abstract: >-
  Morphemes are the smallest linguistic units within
  words that can carry meaning. Wiktionary (the free
  dictionary) has a wealth of human-tagged
  information about morphemes. This package seeks to
  extract that information.
keywords:
  - morpheme
  - tokenizer
  - natural language processing
license: Apache-2.0
version: 0.3.0

GitHub Events

Total
Last Year

Dependencies

DESCRIPTION cran
  • R >= 2.10 depends
  • WikipediR >= 1.6.0 imports
  • dlr >= 0.0.0.9004 imports
  • dplyr >= 1.0.4 imports
  • fs >= 1.5.0 imports
  • lubridate * imports
  • magrittr >= 1.5 imports
  • memoise >= 1.1.0 imports
  • purrr >= 0.3.4 imports
  • rlang >= 0.4.10 imports
  • stringdist >= 0.9.6.3 imports
  • stringi >= 1.6.2 imports
  • stringr >= 1.4.0 imports
  • tibble >= 3.0.4 imports
  • utils * imports
  • xml2 * imports
  • knitr * suggests
  • progress * suggests
  • rmarkdown * suggests
  • testthat * suggests