Jabberwocky

Jabberwocky: an ontology-aware toolkit for manipulating text - Published in JOSS (2020)

https://github.com/sap218/jabberwocky

Science Score: 93.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 7 DOI reference(s) in README and JOSS metadata
  • Academic publication links
    Links to: joss.theoj.org
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
    Published in Journal of Open Source Software

Keywords

annotation grep jabberwocky ontology plotting python synonyms text-mining textual-data tfidf

Scientific Fields

Artificial Intelligence and Machine Learning Computer Science - 83% confidence
Last synced: 4 months ago · JSON representation

Repository

NLP toolkit for those nonsensical ontologies

Basic Info
Statistics
  • Stars: 16
  • Watchers: 1
  • Forks: 1
  • Open Issues: 0
  • Releases: 4
Topics
annotation grep jabberwocky ontology plotting python synonyms text-mining textual-data tfidf
Created about 6 years ago · Last pushed 8 months ago
Metadata Files
Readme Changelog Contributing License

README.md

Jabberwocky

DOI

Jabberwocky is a toolkit for NLP and ontologies. Since we all know ontologies are nonsense.

Functionality

Read the documentation for more detail.

script | description ------- | ----------- bandersnatch | extract metadata from ontology classes catch | annotate corpus with key terms & generate wordcloud bite | rank terms in order of importance & bar plot arise | update ontology with new metadata eyes | plot an ontology via web or tree format

When combining these Jabberwocky functions, users can create an NLP workflow.

workflow

Running

Within each directory, there is a file params_*.py which users can edit. Meaning users shouldn't need to edit the main/primary script.

Check the individual directory READMEs for parameter information.

Prerequisites

Check requirements.py for a list of packages and versions.

Changelog / Version

See the Changelog (ordered by newest first).

Contributing / Issues

Please read the Contributing Guidelines.

License

The license is MIT and so users only need to cite (below) if using.

Citing

@article{Pendleton2020, doi = {10.21105/joss.02168}, url = {https://doi.org/10.21105/joss.02168}, year = {2020}, publisher = {The Open Journal}, volume = {5}, number = {51}, pages = {2168}, author = {Samantha C. Pendleton and Georgios V. Gkoutos}, title = {Jabberwocky: an ontology-aware toolkit for manipulating text}, journal = {Journal of Open Source Software} }

The poem, Jabberwocky, written by Lewis Carrol, is described as a "nonsense" poem :dragon:


End of page

Owner

  • Name: Samantha C Pendleton
  • Login: sap218
  • Kind: user
  • Location: UK
  • Company: University of Oxford

Data Wrangler / Data Scientist | ML, NLP, ontologies | PhD

JOSS Publication

Jabberwocky: an ontology-aware toolkit for manipulating text
Published
July 01, 2020
Volume 5, Issue 51, Page 2168
Authors
Samantha C. Pendleton ORCID
Institute of Cancer and Genomic Sciences, University of Birmingham, UK, University Hospitals Birmingham NHS Foundation Trust, UK
Georgios V. Gkoutos
Institute of Cancer and Genomic Sciences, University of Birmingham, UK, University Hospitals Birmingham NHS Foundation Trust, UK
Editor
Mark A. Jensen ORCID
Tags
Ontologies Text

GitHub Events

Total
  • Release event: 1
  • Delete event: 1
  • Push event: 10
  • Pull request event: 4
  • Create event: 3
Last Year
  • Release event: 1
  • Delete event: 1
  • Push event: 10
  • Pull request event: 4
  • Create event: 3

Committers

Last synced: 5 months ago

All Time
  • Total Commits: 123
  • Total Committers: 3
  • Avg Commits per committer: 41.0
  • Development Distribution Score (DDS): 0.496
Past Year
  • Commits: 4
  • Committers: 1
  • Avg Commits per committer: 4.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Samantha C Pendleton s****c@g****m 62
Samantha s****e@o****m 60
Mark A. Jensen m****t@f****s 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 4 months ago

All Time
  • Total issues: 13
  • Total pull requests: 7
  • Average time to close issues: 15 days
  • Average time to close pull requests: about 5 hours
  • Total issue authors: 2
  • Total pull request authors: 2
  • Average comments per issue: 1.15
  • Average comments per pull request: 0.29
  • Merged pull requests: 7
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 4
  • Average time to close issues: N/A
  • Average time to close pull requests: 2 minutes
  • Issue authors: 0
  • Pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 4
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • sap218 (10)
  • balhoff (3)
Pull Request Authors
  • sap218 (7)
  • majensen (1)
Top Labels
Issue Labels
request (10) documentation (7) help (2) bug (1)
Pull Request Labels
request (4) documentation (2)

Packages

  • Total packages: 1
  • Total downloads: unknown
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 2
proxy.golang.org: github.com/sap218/jabberwocky
  • Versions: 2
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent packages count: 5.4%
Average: 5.6%
Dependent repos count: 5.8%
Last synced: 4 months ago

Dependencies

requirements.txt pypi
  • BeautifulSoup4 *
  • click *
  • lxml *
  • matplotlib *
  • pandas *
  • pytest *
  • scikit-learn *
  • spacy *
setup.py pypi