nametagger

Named Entity Recognition with the Nametag Maximum Entropy Markov model

https://github.com/bnosac/nametagger

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.0%) to scientific vocabulary

Keywords

ner r-package
Last synced: 6 months ago · JSON representation

Repository

Named Entity Recognition with the Nametag Maximum Entropy Markov model

Basic Info
  • Host: GitHub
  • Owner: bnosac
  • License: mpl-2.0
  • Language: C++
  • Default Branch: master
  • Size: 3.03 MB
Statistics
  • Stars: 12
  • Watchers: 1
  • Forks: 1
  • Open Issues: 0
  • Releases: 4
Topics
ner r-package
Created almost 6 years ago · Last pushed over 2 years ago
Metadata Files
Readme Changelog License

README.md

Named Entity Recognition with the Nametag Maximum Entropy Markov model

This repository contains an R package which wraps the NameTag C++ library (https://github.com/ufal/nametag), allowing the following:

  • Fit a Maximum Entropy Markov model machine learning model
  • Use the model to get predictions alongside the model on new data
  • The focus of the implementation is in the area of Natural Language Processing where this R package allows you to easily build and apply models for named entity recognition, text chunking, part of speech tagging, intent recognition or classification of any category you have in mind.

Example

  • Downloading a pretrained named entity recognition model.
  • Note: look to the help of function ?nametagger which allows you to train your own named entity recognition model on your own data

``` library(nametagger) model <- nametaggerdownloadmodel("english-conll-140408", modeldir = tempdir()) x <- data.frame(docid = c(1, 1, 2), sentence_id = c(1, 2, 1), text = c("I\nlive\nin\nNew\nYork\nand\nI\nwork\nfor\nApple\nInc.", "Why\ndon't\nyou\ncome\nvisit\nme", "Good\nnews\nfrom\nAmazon\nas\nJohn\nworks\nthere\n.")) predict(model, x)

docid sentenceid term_id term entity 1 1 1 I O 1 1 2 live O 1 1 3 in O 1 1 4 New B-LOC 1 1 5 York I-LOC 1 1 6 and O 1 1 7 I O 1 1 8 work O 1 1 9 for O 1 1 10 Apple B-ORG 1 1 11 Inc. I-ORG 1 2 1 Why O 1 2 2 don't O 1 2 3 you O 1 2 4 come O 1 2 5 visit O 1 2 6 me O 2 1 1 Good O 2 1 2 news O 2 1 3 from O 2 1 4 Amazon B-LOC 2 1 5 as O 2 1 6 John B-PER 2 1 7 works O 2 1 8 there O 2 1 9 . O ```

Installation

  • For regular users, install the package from your local CRAN mirror install.packages("nametagger")
  • For installing the development version of this package: remotes::install_github("bnosac/nametagger")

Look to the documentation of the functions.

help(package = "nametagger")

Support in text mining

Need support in text mining? Contact BNOSAC: http://www.bnosac.be

Owner

  • Name: bnosac
  • Login: bnosac
  • Kind: organization

open sourced projects

GitHub Events

Total
  • Watch event: 1
  • Push event: 2
Last Year
  • Watch event: 1
  • Push event: 2

Committers

Last synced: almost 3 years ago

All Time
  • Total Commits: 36
  • Total Committers: 1
  • Avg Commits per committer: 36.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Jan Wijffels j****s@b****e 36
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 2
  • Total pull requests: 1
  • Average time to close issues: 11 days
  • Average time to close pull requests: 1 day
  • Total issue authors: 2
  • Total pull request authors: 1
  • Average comments per issue: 1.0
  • Average comments per pull request: 3.0
  • Merged pull requests: 1
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • shahronak47 (1)
  • jwijffels (1)
Pull Request Authors
  • skvrnami (1)
Top Labels
Issue Labels
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • cran 245 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 5
  • Total maintainers: 1
cran.r-project.org: nametagger

Named Entity Recognition in Texts using 'NameTag'

  • Versions: 5
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 245 Last month
Rankings
Stargazers count: 19.8%
Forks count: 28.8%
Dependent packages count: 29.8%
Average: 32.8%
Dependent repos count: 35.5%
Downloads: 50.1%
Maintainers (1)
Last synced: 6 months ago

Dependencies

DESCRIPTION cran
  • R >= 2.10 depends
  • Rcpp >= 0.11.5 imports
  • utils * imports
  • udpipe >= 0.2 suggests
.github/workflows/R-CMD-check.yml actions
  • actions/checkout v3 composite
  • r-lib/actions/check-r-package v2 composite
  • r-lib/actions/setup-pandoc v2 composite
  • r-lib/actions/setup-r v2 composite
  • r-lib/actions/setup-r-dependencies v2 composite