wikimorphemes

Extracting Morphemes from Wiktionary Entries

https://github.com/macmillancontentscience/wikimorphemes

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (12.7%) to scientific vocabulary

Last synced: 10 months ago · JSON representation ·

Repository

Extracting Morphemes from Wiktionary Entries

Basic Info

Host: GitHub
Owner: macmillancontentscience
License: other
Language: R
Default Branch: main
Size: 224 KB

Statistics

Stars: 2
Watchers: 1
Forks: 1
Open Issues: 22
Releases: 0

Created over 5 years ago · Last pushed almost 4 years ago

Metadata Files

Readme Contributing License Code of conduct Citation

README.Rmd

---
output: github_document
---



```{r, include = FALSE}
knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>",
  fig.path = "man/figures/README-",
  out.width = "100%"
)
```

# wikimorphemes


[![Lifecycle: experimental](https://img.shields.io/badge/lifecycle-experimental-orange.svg)](https://lifecycle.r-lib.org/articles/stages.html#experimental)


The goal of wikimorphemes is to provide tools for extracting morphemes from Wiktionary entries.
The wikimorphemes api is *not yet stable* and may change.

## Installation

You can install the released version of wikimorphemes from [CRAN](https://CRAN.R-project.org) with:

``` r
# No you can't.
#install.packages("wikimorphemes")
```

And the development version from [GitHub](https://github.com/) with:

``` r
# install.packages("devtools")
devtools::install_github("macmillancontentscience/wikimorphemes")
```

## Examples

The main function in wikimorphemes is `process_word`.

```{r process-word}
library(wikimorphemes)
process_word("unaffable")
process_word("understandable")
process_word("standing")
```

## Contributing to {wikimorphemes}

If you are going to work on this package, please begin by running `download_wikimorphemes_lookup()`. This will download the latest cached version of the lookup (10.7MB), which will then be used in tests.

## Code of Conduct
  
Please note that the wikimorphemes project is released with a [Contributor Code of Conduct](https://contributor-covenant.org/version/2/0/CODE_OF_CONDUCT.html). 
By contributing to this project, you agree to abide by its terms.

Owner

Name: Macmillan Learning Content Science
Login: macmillancontentscience
Kind: organization
Email: jon.harmon@macmillan.com
Location: Austin, TX

Repositories: 10
Profile: https://github.com/macmillancontentscience

A team of data scientists researching and applying content analysis techniques

Citation (CITATION.cff)

cff-version: 1.2.0
title: wikimorphemes
message: >-
  Please cite this package using the metadata from this file.
type: software
authors:
  - given-names: Jon
    family-names: Harmon
    email: jonthegeek@gmail.com
    orcid: 'https://orcid.org/0000-0003-4781-4346'
    affiliation: Macmillan Learning
  - given-names: Jonathan
    family-names: Bratt
    email: jonathan.bratt@macmillan.com
    affiliation: Macmillan Learning
    orcid: 'https://orcid.org/0000-0003-2859-0076'
url: >-
  https://github.com/macmillancontentscience/wikimorphemes
abstract: >-
  Morphemes are the smallest linguistic units within
  words that can carry meaning. Wiktionary (the free
  dictionary) has a wealth of human-tagged
  information about morphemes. This package seeks to
  extract that information.
keywords:
  - morpheme
  - tokenizer
  - natural language processing
license: Apache-2.0
version: 0.3.0

GitHub Events

Total

Last Year

Dependencies

DESCRIPTION cran

R >= 2.10 depends
WikipediR >= 1.6.0 imports
dlr >= 0.0.0.9004 imports
dplyr >= 1.0.4 imports
fs >= 1.5.0 imports
lubridate * imports
magrittr >= 1.5 imports
memoise >= 1.1.0 imports
purrr >= 0.3.4 imports
rlang >= 0.4.10 imports
stringdist >= 0.9.6.3 imports
stringi >= 1.6.2 imports
stringr >= 1.4.0 imports
tibble >= 3.0.4 imports
utils * imports
xml2 * imports
knitr * suggests
progress * suggests
rmarkdown * suggests
testthat * suggests

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science