english-wordnet

The Open English WordNet

https://github.com/globalwordnet/english-wordnet

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (4.2%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

The Open English WordNet

Basic Info
  • Host: GitHub
  • Owner: globalwordnet
  • License: other
  • Language: Python
  • Default Branch: main
  • Homepage: https://en-word.net/
  • Size: 469 MB
Statistics
  • Stars: 606
  • Watchers: 23
  • Forks: 73
  • Open Issues: 34
  • Releases: 7
Created over 7 years ago · Last pushed 6 months ago
Metadata Files
Readme Contributing License Citation

README.md

Open English Wordnet

Open English Wordnet is a lexical network of the English language grouping words into synsets and linking them according to relationships such as hypernymy, antonymy and meronymy. It is intended to be used in natural language processing applications and provides deep lexical information about the English language as a graph.

Open English Wordnet is a fork of the Princeton WordNet developed under an open source methodology. The quality and veracity of the resource may differ from the Princeton Wordnet and we welcome contributions. Contributions to this wordnet may eventually be incorporated into future releases of Princeton WordNet. Correspondance to previous versions and wordnets in other language is provided through the Collaborative Interlingual Index (CILI). The Open English Wordnet is available as individual files in GWN-LMF format.

Releases

Open English Wordnet is released through the Open English Wordnet website. The versions released are

The size of each resource is as follows

| Edition | Words | Synsets | Relations | |---------|---------|---------|-----------| | 2024 | 161,705 | 120,630 | 419,168 | | 2023 | 161,338 | 120,135 | 415,905 | | 2022 | 161,221 | 120,068 | 386,437 | | 2021 | 163,161 | 120,039 | 384,505 | | 2020 | 163,079 | 120,052 | 385,211 | | 2019 | 160,051 | 117,791 | 378,201 | | Princeton 3.1 | 159,015 | 117,791 | 378,203 |

Usage

To compile these into a single file please use the following script(s)

python scripts/from_yaml.py

This will create a file at wn.xml that contains the complete wordnet.

Further conversions are available through the converter here.

Changes

We welcome changes, to make a change please read our contributing guidelines and make a pull request.

Open English Wordnet is a high-quality resource that acts as a gold-standard for natural language processing, as such we cannot accept any automatically generated results that have not been manually validated.

Please be aware that we use the Global WordNet Association LMF and please read the guidelines for using the format

License

Open English Wordnet is released under CC-BY 4.0

References

The canonical citation for English Wordnet is:

More recent papers describing it include:

It incorporates material from:

  • Christiane Fellbaum, editor (1998) WordNet: An Electronic Lexical Database. The MIT Press, Cambridge, MA.
  • Merrick Choo Yeu Herng and Francis Bond (2021) Taboo wordnet. In Proceedings of the 11th Global Wordnet Conference (GWC2021), University of South Africa (UNISA).

Contributors

  • John P. McCrae
  • Alexandre Rademaker
  • Ewa Rudnicka
  • Bernard Bou
  • Daiki Nomura
  • David Cillessen
  • Ciara O'Loughlin
  • Cathal McGovern
  • Francis Bond
  • Eric Kafe
  • Michael Wayne Goodman
  • Merrick Choo Yeu Herng
  • Enejda Nasaj

Owner

  • Name: Global WordNet Association
  • Login: globalwordnet
  • Kind: organization

Connecting wordnets for all languages in the world.

Citation (citation.bib)

@inproceedings{mccrae-etal-2019-english,
    title = "{E}nglish {W}ord{N}et 2019 {--} An Open-Source {W}ord{N}et for {E}nglish",
    author = "McCrae, John P.  and
      Rademaker, Alexandre  and
      Bond, Francis  and
      Rudnicka, Ewa  and
      Fellbaum, Christiane",
    booktitle = "Proceedings of the 10th Global Wordnet Conference",
    month = jul,
    year = "2019",
    address = "Wroclaw, Poland",
    publisher = "Global Wordnet Association",
    url = "https://aclanthology.org/2019.gwc-1.31",
    pages = "245--252",
    abstract = "We describe the release of a new wordnet for English based on the Princeton WordNet, but now developed under an open-source model. In particular, this version of WordNet, which we call English WordNet 2019, which has been developed by multiple people around the world through GitHub, fixes many errors in previous wordnets for English. We give some details of the changes that have been made in this version and give some perspectives about likely future changes that will be made as this project continues to evolve.",
}

GitHub Events

Total
  • Create event: 29
  • Release event: 1
  • Issues event: 65
  • Watch event: 115
  • Delete event: 31
  • Issue comment event: 87
  • Push event: 66
  • Pull request review comment event: 1
  • Pull request review event: 4
  • Pull request event: 82
  • Fork event: 12
Last Year
  • Create event: 29
  • Release event: 1
  • Issues event: 65
  • Watch event: 115
  • Delete event: 31
  • Issue comment event: 87
  • Push event: 66
  • Pull request review comment event: 1
  • Pull request review event: 4
  • Pull request event: 82
  • Fork event: 12

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 27
  • Total pull requests: 35
  • Average time to close issues: 3 months
  • Average time to close pull requests: 15 days
  • Total issue authors: 6
  • Total pull request authors: 5
  • Average comments per issue: 0.07
  • Average comments per pull request: 0.17
  • Merged pull requests: 20
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 27
  • Pull requests: 35
  • Average time to close issues: 3 months
  • Average time to close pull requests: 15 days
  • Issue authors: 6
  • Pull request authors: 5
  • Average comments per issue: 0.07
  • Average comments per pull request: 0.17
  • Merged pull requests: 20
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • jmccrae (48)
  • aclueless (13)
  • fcbond (7)
  • cmbant (5)
  • 1313ou (3)
  • rob-ross (2)
  • arademaker (2)
  • ekaf (2)
  • PeterLaska123 (1)
  • busterbeam (1)
  • notevenaperson (1)
  • balki (1)
  • vtempest (1)
  • bradleyszoke (1)
  • drewvid (1)
Pull Request Authors
  • jmccrae (79)
  • 1313ou (23)
  • zabalaamelie30-code (11)
  • x-englishwordnet (2)
  • goodmami (1)
  • fcbond (1)
  • ekaf (1)
  • vtempest (1)
Top Labels
Issue Labels
change relation (19) enhancement (10) definition (10) synset duplicate (9) new synset (9) delete synset (6) synset member (4) validation check (3) release format (3) add relation (2) bug (2) wontfix (1) synset split (1)
Pull Request Labels
change relation (2) validation check (1)

Dependencies

.github/workflows/main.yml actions
  • actions/checkout v2 composite