wikimorphemes
Extracting Morphemes from Wiktionary Entries
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: Found CITATION.cff file
- ✓ codemeta.json file: Found codemeta.json file
- ✓ .zenodo.json file: Found .zenodo.json file
- ○ DOI references
- ○ Academic publication links
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: Low similarity (12.7%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: macmillancontentscience
- License: other
- Language: R
- Default Branch: main
- Size: 224 KB
Statistics
- Stars: 2
- Watchers: 1
- Forks: 1
- Open Issues: 22
- Releases: 0
- Created: about 5 years ago
- Last pushed: over 3 years ago
Metadata Files
Readme
Contributing
License
Code of conduct
Citation
README.Rmd
---
output: github_document
---
```{r, include = FALSE}
knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>",
  fig.path = "man/figures/README-",
  out.width = "100%"
)
```
# wikimorphemes
[Lifecycle: experimental](https://lifecycle.r-lib.org/articles/stages.html#experimental)
The goal of wikimorphemes is to provide tools for extracting morphemes from Wiktionary entries.
The wikimorphemes API is *not yet stable* and may change.
## Installation
wikimorphemes has not yet been released on [CRAN](https://CRAN.R-project.org), so the following will not work yet:
``` r
# Not yet available on CRAN.
# install.packages("wikimorphemes")
```
You can install the development version from [GitHub](https://github.com/) with:
``` r
# install.packages("devtools")
devtools::install_github("macmillancontentscience/wikimorphemes")
```
## Examples
The main function in wikimorphemes is `process_word`.
```{r process-word}
library(wikimorphemes)
process_word("unaffable")
process_word("understandable")
process_word("standing")
```
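The examples above call `process_word` on one word at a time. To process several words in one go, a minimal sketch could look like the following. Note that `process_words` is a hypothetical helper written for this illustration and is not part of wikimorphemes. The sketch also assumes that `process_word` can be called with a single word as its only argument, as in the examples above, and that it may need the cached lookup or network access.

```r
# Hypothetical helper, not part of wikimorphemes: apply a single-word
# processor to each word and return the results as a list named by word.
process_words <- function(words, processor) {
  stopifnot(is.character(words), is.function(processor))
  results <- lapply(words, processor)
  names(results) <- words
  results
}

words <- c("unaffable", "understandable", "standing")

# Only call wikimorphemes if it is installed. Assumption: process_word()
# may need the cached lookup or network access to Wiktionary.
if (requireNamespace("wikimorphemes", quietly = TRUE)) {
  morphemes <- process_words(words, wikimorphemes::process_word)
  str(morphemes)
}
```

Passing the processor as an argument keeps the helper independent of the package, so it can be tried with any function that takes a single word.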
## Contributing to {wikimorphemes}
Before working on this package, please run `download_wikimorphemes_lookup()`. This downloads the latest cached version of the lookup (10.7 MB), which the tests then use.
## Code of Conduct
Please note that the wikimorphemes project is released with a [Contributor Code of Conduct](https://contributor-covenant.org/version/2/0/CODE_OF_CONDUCT.html).
By contributing to this project, you agree to abide by its terms.
Owner
- Name: Macmillan Learning Content Science
- Login: macmillancontentscience
- Kind: organization
- Email: jon.harmon@macmillan.com
- Location: Austin, TX
- Repositories: 10
- Profile: https://github.com/macmillancontentscience
A team of data scientists researching and applying content analysis techniques
Citation (CITATION.cff)
cff-version: 1.2.0
title: wikimorphemes
message: >-
  Please cite this package using the metadata from this file.
type: software
authors:
  - given-names: Jon
    family-names: Harmon
    email: jonthegeek@gmail.com
    orcid: 'https://orcid.org/0000-0003-4781-4346'
    affiliation: Macmillan Learning
  - given-names: Jonathan
    family-names: Bratt
    email: jonathan.bratt@macmillan.com
    affiliation: Macmillan Learning
    orcid: 'https://orcid.org/0000-0003-2859-0076'
url: >-
  https://github.com/macmillancontentscience/wikimorphemes
abstract: >-
  Morphemes are the smallest linguistic units within
  words that can carry meaning. Wiktionary (the free
  dictionary) has a wealth of human-tagged
  information about morphemes. This package seeks to
  extract that information.
keywords:
  - morpheme
  - tokenizer
  - natural language processing
license: Apache-2.0
version: 0.3.0
Dependencies
DESCRIPTION
cran
- R >= 2.10 depends
- WikipediR >= 1.6.0 imports
- dlr >= 0.0.0.9004 imports
- dplyr >= 1.0.4 imports
- fs >= 1.5.0 imports
- lubridate * imports
- magrittr >= 1.5 imports
- memoise >= 1.1.0 imports
- purrr >= 0.3.4 imports
- rlang >= 0.4.10 imports
- stringdist >= 0.9.6.3 imports
- stringi >= 1.6.2 imports
- stringr >= 1.4.0 imports
- tibble >= 3.0.4 imports
- utils * imports
- xml2 * imports
- knitr * suggests
- progress * suggests
- rmarkdown * suggests
- testthat * suggests