huspacy

HuSpaCy: industrial-strength Hungarian natural language processing

https://github.com/huspacy/huspacy

Science Score: 64.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: arxiv.org
  • Committers with academic emails
    4 of 9 committers (44.4%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (14.1%) to scientific vocabulary

Keywords

dependency-parsing hungarian hunlp huspacy information-extraction lemmatization machine-learning morphological-analysis named-entity-recognition natural-language-processing ner nlp pos-tagger python spacy spacy-models spacy-pipeline text-mining universal-dependencies

Keywords from Contributors

mesh spacy-extension sequences interactive hacking network-simulation
Last synced: 4 months ago

Repository

HuSpaCy: industrial-strength Hungarian natural language processing

Basic Info
  • Host: GitHub
  • Owner: huspacy
  • License: apache-2.0
  • Language: Python
  • Default Branch: master
  • Homepage: https://huspacy.github.io
  • Size: 2.11 MB
Statistics
  • Stars: 170
  • Watchers: 14
  • Forks: 16
  • Open Issues: 2
  • Releases: 14
Created over 8 years ago · Last pushed 5 months ago
Metadata Files
Readme Funding License Citation

README.md


HuSpaCy is a spaCy library providing industrial-strength Hungarian language processing facilities through spaCy models. The released pipelines consist of a tokenizer, sentence splitter, lemmatizer, tagger (which also predicts morphological features), dependency parser and a named entity recognition module. Word and phrase embeddings are also available through spaCy's API. All models have high throughput, decent memory usage and close to state-of-the-art accuracy. A live demo is available on Hugging Face Spaces, and model releases are published to Hugging Face Hub.

This repository contains material to build HuSpaCy and all of its models in a reproducible way.

Installation

To get started, first download one of the models. The easiest way is to install huspacy from PyPI and then fetch a model through its API.

```bash
pip install huspacy
```

```python
import huspacy

# Download the latest CPU optimized model
huspacy.download()
```
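If the download step runs in a script that may execute more than once, it can help to check first whether the model package is already importable. The following is a minimal sketch, not part of HuSpaCy: `is_installed` and `ensure_default_model` are hypothetical helper names, and the sketch assumes the default model is the `hu_core_news_lg` package.

```python
import importlib.util


def is_installed(package_name: str) -> bool:
    """Return True if a top-level package with this name can be imported."""
    return importlib.util.find_spec(package_name) is not None


def ensure_default_model() -> None:
    """Download the default model only when it is not installed yet."""
    # Assumption: the default model is distributed as the hu_core_news_lg package.
    if not is_installed("hu_core_news_lg"):
        import huspacy  # imported lazily so the check itself needs no extra package

        huspacy.download()
```

Calling `ensure_default_model()` at the start of a script then skips the download when the model is already present.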

Install the models directly

You can install the latest models directly from 🤗 Hugging Face Hub:

  • CPU optimized large model: pip install hu_core_news_lg@https://huggingface.co/huspacy/hu_core_news_lg/resolve/main/hu_core_news_lg-any-py3-none-any.whl
  • GPU optimized transformers model: pip install hu_core_news_trf@https://huggingface.co/huspacy/hu_core_news_trf/resolve/main/hu_core_news_trf-any-py3-none-any.whl
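Both install commands above share the same URL layout, so the requirement string can be assembled from the model name. This is only a sketch: `wheel_requirement` is a hypothetical helper, and the pattern is inferred from the two URLs shown here, so it may not hold for other models or versions.

```python
HUB_ORG_URL = "https://huggingface.co/huspacy"


def wheel_requirement(model_name: str) -> str:
    """Build a pip requirement string that points at the model's wheel."""
    # Assumption: every model follows the naming seen in the two examples above.
    wheel = f"{model_name}-any-py3-none-any.whl"
    return f"{model_name}@{HUB_ORG_URL}/{model_name}/resolve/main/{wheel}"


for name in ("hu_core_news_lg", "hu_core_news_trf"):
    print("pip install", wheel_requirement(name))
```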

To speed up inference on GPU, CUDA must be installed as described in https://spacy.io/usage.

Read more on the models here

Quickstart

HuSpaCy is fully compatible with spaCy's API, so newcomers can easily get started with the spaCy 101 guide.

Although HuSpaCy models can be loaded with spacy.load(...), the tool provides convenience methods to easily access downloaded models.

```python
# Load the model using spacy.load(...)
import spacy

nlp = spacy.load("hu_core_news_lg")
```

```python
# Load the default large model (if downloaded)
import huspacy

nlp = huspacy.load()
```

```python
# Load the model directly as a module
import hu_core_news_lg

nlp = hu_core_news_lg.load()
```

To process texts, you can simply call the loaded model (i.e. the nlp callable object):

```python
doc = nlp("Csiribiri csiribiri zabszalma - négy csillag közt alszom ma.")
```

As HuSpaCy is built on spaCy, the returned doc document contains all the annotations given by the pipeline components.
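The annotations can be read through spaCy's standard token and entity attributes. The sketch below assumes a model has already been downloaded; `token_rows`, `entity_rows` and `demo` are hypothetical helper names, while the attributes they read (`lemma_`, `pos_`, `morph`, `dep_`, `ents`, `label_`) are part of spaCy's API.

```python
def token_rows(doc):
    """Collect (text, lemma, POS tag, morphological features, dependency label) per token."""
    return [
        (token.text, token.lemma_, token.pos_, str(token.morph), token.dep_)
        for token in doc
    ]


def entity_rows(doc):
    """Collect (text, label) for every named entity found in the document."""
    return [(ent.text, ent.label_) for ent in doc.ents]


def demo():
    """Print the annotations of one sentence; requires a downloaded model."""
    import huspacy

    nlp = huspacy.load()
    doc = nlp("Csiribiri csiribiri zabszalma - négy csillag közt alszom ma.")
    for row in token_rows(doc):
        print(*row, sep="\t")
    print(entity_rows(doc))
```

Running `demo()` prints one tab-separated row per token, followed by the list of recognized entities.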

API documentation is available on our website.

Models overview

We provide several pretrained models:

  1. hu_core_news_lg is a CNN-based large model which achieves a good balance between accuracy and processing speed. This default model provides tokenization, sentence splitting, part-of-speech tagging (UD labels with detailed morphosyntactic features), lemmatization, dependency parsing and named entity recognition, and ships with pretrained word vectors.
  2. hu_core_news_trf is built on huBERT and provides the same functionality as the large model, except for the word vectors. It offers much higher accuracy at the price of increased computational resource usage. We suggest using it with GPU support.
  3. hu_core_news_md greatly improves on hu_core_news_lg's throughput by losing some accuracy. This model could be a good choice when processing speed is crucial.
  4. hu_core_news_trf_xl is an experimental model built on XLM-RoBERTa-large. It provides the same functionality as the hu_core_news_trf model, with slightly higher accuracy at the price of significantly increased computational resource usage. We suggest using it with GPU support.

HuSpaCy's model versions follow spaCy's versioning scheme.
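In spaCy's scheme a model version has the form a.b.c, where a.b matches the spaCy major and minor version the model was built for and c is the model's own revision. A minimal sketch of a compatibility check under that reading follows; `spacy_series` and `is_compatible` are hypothetical helpers, and the version strings are illustrative rather than actual releases.

```python
from typing import Tuple


def spacy_series(version: str) -> Tuple[int, int]:
    """Return the (major, minor) part of an a.b.c version string."""
    major, minor = version.split(".")[:2]
    return int(major), int(minor)


def is_compatible(model_version: str, spacy_version: str) -> bool:
    """A model matches a spaCy install when major and minor versions agree."""
    return spacy_series(model_version) == spacy_series(spacy_version)


# Illustrative version numbers only.
print(is_compatible("3.4.2", "3.4.3"))
print(is_compatible("3.4.2", "3.5.0"))
```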

A demo of the models is available at Hugging Face Spaces.

To read more about the models' architecture, we suggest reading the relevant sections of spaCy's documentation.

Comparison

| Models          | md          | lg          | trf                 | trf_xl                         |
|-----------------|-------------|-------------|---------------------|--------------------------------|
| Embeddings      | 100d floret | 300d floret | transformer: huBERT | transformer: XLM-RoBERTa-large |
| Target hardware | CPU         | CPU         | GPU                 | GPU                            |
| Accuracy        | ⭑⭑⭑⭒        | ⭑⭑⭑⭑        | ⭑⭑⭑⭑⭒               | ⭑⭑⭑⭑⭑                          |
| Resource usage  | ⭑⭑⭑⭑⭑       | ⭑⭑⭑⭑        | ⭑⭑                  | ⭒                              |
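The comparison can be turned into a small lookup for choosing a model at runtime. This is a sketch only: the `MODELS` dictionary restates the table above, and `suggest_model` and `models_for` are hypothetical helpers that encode one possible reading of the recommendations (the experimental trf_xl model is never suggested by default).

```python
MODELS = {
    "hu_core_news_md": {"embeddings": "100d floret", "hardware": "CPU"},
    "hu_core_news_lg": {"embeddings": "300d floret", "hardware": "CPU"},
    "hu_core_news_trf": {"embeddings": "huBERT", "hardware": "GPU"},
    "hu_core_news_trf_xl": {"embeddings": "XLM-RoBERTa-large", "hardware": "GPU"},
}


def models_for(hardware: str) -> list:
    """List the models whose target hardware matches, in table order."""
    return [name for name, info in MODELS.items() if info["hardware"] == hardware]


def suggest_model(has_gpu: bool, prefer_speed: bool = False) -> str:
    """Pick a model name from the available hardware and the speed preference."""
    if has_gpu:
        return "hu_core_news_trf"
    return "hu_core_news_md" if prefer_speed else "hu_core_news_lg"


print(suggest_model(has_gpu=False))
```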

Citation

If you use HuSpaCy or any of its models, please cite it as:

```bibtex
@InProceedings{HuSpaCy:2023,
    author    = {Orosz, Gy{\"o}rgy and Szab{\'o}, Gerg{\H{o}} and Berkecz, P{\'e}ter and Sz{\'a}nt{\'o}, Zsolt and Farkas, Rich{\'a}rd},
    editor    = {Ek{\v{s}}tein, Kamil and P{\'a}rtl, Franti{\v{s}}ek and Konop{\'i}k, Miloslav},
    title     = {{Advancing Hungarian Text Processing with HuSpaCy: Efficient and Accurate NLP Pipelines}},
    booktitle = {{Text, Speech, and Dialogue}},
    year      = {2023},
    publisher = {{Springer Nature Switzerland}},
    address   = {{Cham}},
    pages     = {58--69},
    isbn      = {978-3-031-40498-6}
}
```

```bibtex
@InProceedings{HuSpaCy:2021,
    title     = {{HuSpaCy: an industrial-strength Hungarian natural language processing toolkit}},
    booktitle = {{XVIII. Magyar Sz{\'a}m{\'\i}t{\'o}g{\'e}pes Nyelv{\'e}szeti Konferencia}},
    author    = {Orosz, Gy{\"o}rgy and Sz{\'a}nt{\'o}, Zsolt and Berkecz, P{\'e}ter and Szab{\'o}, Gerg{\H o} and Farkas, Rich{\'a}rd},
    location  = {{Szeged}},
    pages     = {59--73},
    year      = {2022},
}
```

Contact

For feature requests, issues and bugs please use the GitHub Issue Tracker. Otherwise, reach out to us in the Discussion Forum.

Authors

HuSpaCy is implemented by the SzegedAI team, coordinated by Orosz György, within the Hungarian AI National Laboratory, MILAB program.

License

This library is released under the Apache 2.0 License.

Trained models have their own license (CC BY-SA 4.0) as described on the models page.

Owner

  • Name: HuSpaCy
  • Login: huspacy
  • Kind: organization
  • Email: gyorgy@orosz.link
  • Location: Hungary


Citation (CITATION.cff)

cff-version: 1.2.0
message: 'If you use this software in your work, please cite both the article from preferred-citation and the software itself: https://huspacy.github.io/huspacy/publications/'
title: 'Advancing Hungarian Text Processing with HuSpaCy: Efficient and Accurate NLP Pipelines'
authors:
  - given-names: György
    family-names: Orosz
  - given-names: Zsolt
    family-names: Szántó
  - given-names: Péter
    family-names: Berkecz
  - given-names: Gergő
    family-names: Szabó
  - given-names: Richárd
    family-names: Farkas
repository-code: https://github.com/huspacy/huspacy
repository-artifact: https://huggingface.co/huspacy
keywords:
  - spaCy
  - Hungarian
  - NLP
type: software
url: https://huspacy.github.io
date-released: 2022-01-05
contact:
    - email: "gyorgy@orosz.link"
      family-names: Orosz
      given-names: György
preferred-citation:
    type: article
    title: 'Advancing Hungarian Text Processing with HuSpaCy: Efficient and Accurate NLP Pipelines'
    abstract: "This paper presents a set of industrial-grade text processing models for Hungarian that achieve near state-of-the-art performance while balancing resource efficiency and accuracy. Models have been implemented in the spaCy framework, extending the HuSpaCy toolkit with several improvements to its architecture. Compared to existing NLP tools for Hungarian, all of our pipelines feature all basic text processing steps including tokenization, sentence-boundary detection, part-of-speech tagging, morphological feature tagging, lemmatization, dependency parsing and named entity recognition with high accuracy and throughput. We thoroughly evaluated the proposed enhancements, compared the pipelines with state-of-the-art tools and demonstrated the competitive performance of the new models in all text preprocessing steps. All experiments are reproducible and the pipelines are freely available under a permissive license."
    doi: "10.1007/978-3-031-40498-6_6"
    identifiers:
    - type: other
      value: "arXiv:2308.12635"
      description: The ArXiv preprint of the paper
    authors:
      - given-names: György
        family-names: Orosz
      - given-names: Gergő
        family-names: Szabó
      - given-names: Péter
        family-names: Berkecz
      - given-names: Zsolt
        family-names: Szántó
      - given-names: Richárd
        family-names: Farkas
    year: "2023"
    start: 58
    end: 69
    conference:
        name: 'Text, Speech, and Dialogue 2023'
    editors-series:
        - family-names: Ekštein
          given-names: Kamil
        - family-names: Pártl
          given-names: František
        - family-names: Konopík
          given-names: Miloslav
    journal: "Text, Speech, and Dialogue: TSD 2023. Lecture Notes in Computer Science"
    issue-title: "TSD 2023: Text, Speech, and Dialogue"
    volume-title: "TSD 2023: Text, Speech, and Dialogue"
    volume: 14102
    isbn: 978-3-031-40498-6
    publisher:
        name: "Springer Nature Switzerland"
        city: Cham

GitHub Events

Total
  • Issues event: 4
  • Watch event: 16
  • Issue comment event: 4
  • Push event: 20
  • Fork event: 2
  • Create event: 1
Last Year
  • Issues event: 4
  • Watch event: 16
  • Issue comment event: 4
  • Push event: 20
  • Fork event: 2
  • Create event: 1

Committers

Last synced: 8 months ago

All Time
  • Total Commits: 725
  • Total Committers: 9
  • Avg Commits per committer: 80.556
  • Development Distribution Score (DDS): 0.19
Past Year
  • Commits: 46
  • Committers: 1
  • Avg Commits per committer: 46.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Gyorgy Orosz g****y@o****k 587
Peter Berkecz b****r@g****m 62
SzaboGergo01 g****o@g****m 38
Gyorgy Orosz g****z@l****m 18
Zsolt Szántó s****s@i****u 15
Szabó Gergő h****2@s****u 2
rfarkas r****s@i****u 1
dependabot[bot] 4****] 1
a100 a****0@a****1 1
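The all-time figures above can be reproduced from the per-committer counts. The sketch below assumes the Development Distribution Score is defined as one minus the top committer's share of all commits, which is consistent with the numbers shown; note that the two Gyorgy Orosz entries use different emails and are counted as separate committers.

```python
# Commit counts taken from the Top Committers list above.
commits = [587, 62, 38, 18, 15, 2, 1, 1, 1]

total_commits = sum(commits)
average = total_commits / len(commits)

# Assumed definition: DDS = 1 - (commits of the top committer / total commits).
dds = 1 - max(commits) / total_commits

print(total_commits)      # 725
print(round(average, 3))  # 80.556
print(round(dds, 2))      # 0.19
```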

Issues and Pull Requests

Last synced: 8 months ago

All Time
  • Total issues: 27
  • Total pull requests: 39
  • Average time to close issues: 5 months
  • Average time to close pull requests: about 1 month
  • Total issue authors: 23
  • Total pull request authors: 8
  • Average comments per issue: 2.0
  • Average comments per pull request: 0.56
  • Merged pull requests: 26
  • Bot issues: 0
  • Bot pull requests: 10
Past Year
  • Issues: 5
  • Pull requests: 0
  • Average time to close issues: 2 months
  • Average time to close pull requests: N/A
  • Issue authors: 5
  • Pull request authors: 0
  • Average comments per issue: 2.6
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • speter00 (3)
  • uveges (2)
  • laklaja (2)
  • skrbog (1)
  • mpjuhasz (1)
  • rudradeva-hub (1)
  • DaveFerger (1)
  • attilanagy234 (1)
  • bsipos-precognox (1)
  • lakatosd (1)
  • astuanax (1)
  • HMRLKE (1)
  • Dbgsaoge (1)
  • VamperAta (1)
  • gaborstats (1)
Pull Request Authors
  • dependabot[bot] (10)
  • qeterme (9)
  • oroszgy (8)
  • SzaboGergo01 (7)
  • zsozso21 (2)
  • balcsi32 (1)
  • dvarnai (1)
  • rfarkas (1)
Top Labels
Issue Labels
bug (15) enhancement (5) lemmatizer (1) parser (1) documentation (1) tokenizer (1) help wanted (1)
Pull Request Labels
dependencies (10) lemmatizer (6) improvement (4) enhancement (4) parser (1) documentation (1)

Dependencies

hu_core_news_lg/poetry.lock pypi
  • bump2version 1.0.1 develop
  • blis 0.7.7
  • catalogue 2.0.7
  • certifi 2021.10.8
  • charset-normalizer 2.0.12
  • click 7.1.2
  • colorama 0.4.4
  • configparser 5.2.0
  • conllu 4.4.2
  • contexttimer 0.3.3
  • cymem 2.0.6
  • docker-pycreds 0.4.0
  • filelock 3.6.0
  • gitdb 4.0.9
  • gitpython 3.1.27
  • huggingface-hub 0.0.12
  • idna 3.3
  • jinja2 3.1.2
  • langcodes 3.3.0
  • markupsafe 2.1.1
  • murmurhash 1.0.7
  • numpy 1.22.3
  • packaging 21.3
  • pathtools 0.1.2
  • pathy 0.6.1
  • preshed 3.0.6
  • promise 2.3
  • protobuf 3.20.1
  • psutil 5.9.0
  • pydantic 1.8.2
  • pyparsing 3.0.8
  • python-dateutil 2.8.2
  • pyyaml 6.0
  • requests 2.27.1
  • sentry-sdk 1.5.10
  • shortuuid 1.0.8
  • six 1.16.0
  • smart-open 5.2.1
  • smmap 5.0.0
  • spacy 3.3.0
  • spacy-conll 3.2.0
  • spacy-huggingface-hub 0.0.6
  • spacy-legacy 3.0.9
  • spacy-loggers 1.0.2
  • srsly 2.4.3
  • subprocess32 3.5.4
  • thinc 8.0.15
  • tqdm 4.64.0
  • typer 0.3.2
  • typing-extensions 4.2.0
  • urllib3 1.25.11
  • wandb 0.10.33
  • wasabi 0.9.1
hu_core_news_lg/pyproject.toml pypi
  • bump2version ^1.0.1 develop
  • conllu ^4.4
  • contexttimer ^0.3.3
  • python ^3.8
  • spacy ~3.3
  • spacy-huggingface-hub ^0.0.6
  • spacy_conll ^3.0
  • wandb ^0.10.23
  • wheel ^0.37.0
hu_core_news_trf/poetry.lock pypi
  • atomicwrites 1.4.0 develop
  • attrs 21.4.0 develop
  • flake8 4.0.1 develop
  • iniconfig 1.1.1 develop
  • mccabe 0.6.1 develop
  • pluggy 1.0.0 develop
  • py 1.11.0 develop
  • pycodestyle 2.8.0 develop
  • pyflakes 2.4.0 develop
  • pytest 6.2.5 develop
  • toml 0.10.2 develop
  • blis 0.7.7
  • bump2version 1.0.1
  • catalogue 2.0.7
  • certifi 2021.10.8
  • charset-normalizer 2.0.12
  • click 7.1.2
  • colorama 0.4.4
  • configparser 5.2.0
  • conllu 4.4.1
  • contexttimer 0.3.3
  • cymem 2.0.6
  • docker-pycreds 0.4.0
  • filelock 3.6.0
  • gitdb 4.0.9
  • gitpython 3.1.27
  • huggingface-hub 0.0.12
  • idna 3.3
  • jinja2 3.1.1
  • joblib 1.1.0
  • langcodes 3.3.0
  • markupsafe 2.1.1
  • murmurhash 1.0.6
  • numpy 1.22.3
  • packaging 21.3
  • pathtools 0.1.2
  • pathy 0.6.1
  • preshed 3.0.6
  • promise 2.3
  • protobuf 3.20.0
  • psutil 5.9.0
  • pydantic 1.8.2
  • pyparsing 3.0.7
  • python-dateutil 2.8.2
  • pyyaml 6.0
  • regex 2022.3.15
  • requests 2.27.1
  • sacremoses 0.0.49
  • sentry-sdk 1.5.8
  • shortuuid 1.0.8
  • six 1.16.0
  • smart-open 5.2.1
  • smmap 5.0.0
  • spacy 3.2.4
  • spacy-alignments 0.8.4
  • spacy-conll 3.1.0
  • spacy-experimental 0.4.0
  • spacy-huggingface-hub 0.0.6
  • spacy-legacy 3.0.9
  • spacy-loggers 1.0.2
  • spacy-transformers 1.1.5
  • srsly 2.4.2
  • subprocess32 3.5.4
  • thinc 8.0.15
  • tokenizers 0.10.3
  • torch 1.9.1
  • tqdm 4.63.1
  • transformers 4.10.3
  • typer 0.3.2
  • typing-extensions 4.1.1
  • urllib3 1.25.11
  • wandb 0.10.33
  • wasabi 0.9.1
hu_core_news_trf/pyproject.toml pypi
  • bump2version ^1.0.1 develop
  • conllu ^4.4.1
  • contexttimer ^0.3.3
  • python ^3.8
  • spacy ~3.2.3
  • spacy-experimental 0.4.0
  • spacy-huggingface-hub ^0.0.6
  • spacy-transformers ^1.1.4
  • spacy_conll ^3.0
  • torch ~1.9.0
  • wandb ^0.10.23
  • wheel ^0.37.0
huspacy/poetry.lock pypi
  • atomicwrites 1.4.0 develop
  • attrs 21.4.0 develop
  • bleach 4.1.0 develop
  • bump2version 1.0.1 develop
  • cffi 1.15.0 develop
  • commonmark 0.9.1 develop
  • cryptography 36.0.2 develop
  • docutils 0.18.1 develop
  • flake8 4.0.1 develop
  • importlib-metadata 4.11.3 develop
  • iniconfig 1.1.1 develop
  • jeepney 0.8.0 develop
  • keyring 23.5.0 develop
  • lazydocs 0.4.8 develop
  • mccabe 0.6.1 develop
  • pkginfo 1.8.2 develop
  • pluggy 1.0.0 develop
  • py 1.11.0 develop
  • pycodestyle 2.8.0 develop
  • pycparser 2.21 develop
  • pyflakes 2.4.0 develop
  • pygments 2.11.2 develop
  • pytest 7.1.1 develop
  • pytest-codeblocks 0.14.0 develop
  • pywin32-ctypes 0.2.0 develop
  • readme-renderer 34.0 develop
  • requests-toolbelt 0.9.1 develop
  • rfc3986 2.0.0 develop
  • rich 12.2.0 develop
  • secretstorage 3.3.1 develop
  • six 1.16.0 develop
  • tomli 2.0.1 develop
  • twine 4.0.0 develop
  • webencodings 0.5.1 develop
  • zipp 3.8.0 develop
  • blis 0.7.7
  • catalogue 2.0.7
  • certifi 2021.10.8
  • charset-normalizer 2.0.12
  • click 8.0.4
  • colorama 0.4.4
  • cymem 2.0.6
  • idna 3.3
  • jinja2 3.1.1
  • langcodes 3.3.0
  • markupsafe 2.1.1
  • murmurhash 1.0.6
  • numpy 1.22.3
  • packaging 21.3
  • pathy 0.6.1
  • preshed 3.0.6
  • pydantic 1.8.2
  • pyparsing 3.0.7
  • requests 2.27.1
  • smart-open 5.2.1
  • spacy 3.2.4
  • spacy-legacy 3.0.9
  • spacy-loggers 1.0.2
  • srsly 2.4.2
  • thinc 8.0.15
  • tqdm 4.64.0
  • typer 0.4.1
  • typing-extensions 4.1.1
  • urllib3 1.26.9
  • wasabi 0.9.1
huspacy/pyproject.toml pypi
  • bump2version ^1.0.1 develop
  • flake8 ^4.0.1 develop
  • lazydocs ^0.4.8 develop
  • pytest ^7.0.0 develop
  • pytest_codeblocks ^0.14.0 develop
  • twine ^4.0.0 develop
  • python ^3.8
  • spacy ~3.2.3
.github/workflows/build.yml actions
  • Gr1N/setup-poetry v7 composite
  • actions/cache v2 composite
  • actions/checkout v2 composite
  • actions/download-artifact v3 composite
  • actions/setup-python v4 composite
  • actions/upload-artifact v3 composite
.github/workflows/docs.yml actions
  • Gr1N/setup-poetry v7 composite
  • actions/cache v2 composite
  • actions/checkout v3 composite
  • actions/setup-python v4 composite
.github/workflows/test_models.yml actions
  • actions/checkout v2 composite
  • actions/setup-python v4 composite
hu_core_news_md/poetry.lock pypi
  • GitPython 3.1.29 develop
  • PyYAML 6.0 develop
  • bump2version 1.0.1 develop
  • conllu 4.5.2 develop
  • contexttimer 0.3.3 develop
  • docker-pycreds 0.4.0 develop
  • filelock 3.8.0 develop
  • gitdb 4.0.9 develop
  • huggingface-hub 0.0.12 develop
  • pathtools 0.1.2 develop
  • pip 22.3.1 develop
  • promise 2.3 develop
  • protobuf 4.21.9 develop
  • psutil 5.9.4 develop
  • sentry-sdk 1.10.1 develop
  • setproctitle 1.3.2 develop
  • shortuuid 1.0.10 develop
  • six 1.16.0 develop
  • smmap 5.0.0 develop
  • spacy-conll 3.2.0 develop
  • spacy-huggingface-hub 0.0.6 develop
  • wandb 0.13.5 develop
  • wheel 0.38.4 develop
  • Jinja2 3.1.2
  • MarkupSafe 2.1.1
  • blis 0.7.9
  • catalogue 2.0.8
  • certifi 2022.9.24
  • charset-normalizer 2.1.1
  • click 7.1.2
  • colorama 0.4.6
  • confection 0.0.3
  • cupy-cuda111 11.2.0
  • cymem 2.0.7
  • fastrlock 0.8.1
  • idna 3.4
  • langcodes 3.3.0
  • murmurhash 1.0.9
  • numpy 1.23.4
  • packaging 21.3
  • pathy 0.6.2
  • preshed 3.0.8
  • pydantic 1.10.2
  • pyparsing 3.0.9
  • requests 2.28.1
  • setuptools 65.5.1
  • smart-open 5.2.1
  • spacy 3.4.3
  • spacy-legacy 3.0.10
  • spacy-loggers 1.0.3
  • srsly 2.4.5
  • thinc 8.1.5
  • tqdm 4.64.1
  • typer 0.3.2
  • typing-extensions 4.4.0
  • urllib3 1.26.12
  • wasabi 0.10.1
hu_core_news_md/pyproject.toml pypi
  • bump2version ^1.0.1 develop
  • conllu ^4.4.1 develop
  • contexttimer ^0.3.3 develop
  • pip >=22.3.1 develop
  • setuptools >=65.4.1 develop
  • spacy-huggingface-hub 0.0.6 develop
  • spacy-loggers ~1.0.3 develop
  • spacy_conll ^3.0 develop
  • wandb ^0.13.3 develop
  • wheel >=0.37.0 develop
  • python ^3.8
  • spacy ~3.4.1
hu_core_news_trf_xl/pyproject.toml pypi
  • bump2version ^1.0.1 develop
  • conllu ^4.4.1
  • contexttimer ^0.3.3
  • python ^3.8
  • setuptools ^65.4.1
  • spacy ~3.4.1
  • spacy-experimental 0.6.0
  • spacy-huggingface-hub 0.0.6
  • spacy-loggers ~1.0.3
  • spacy-transformers 1.1.8
  • spacy_conll ^3.0
  • torch ~1.10.0+cu111
  • wandb ^0.13.3
  • wheel ^0.37.0
hu_vectors_web_lg/poetry.lock pypi
  • bump2version 1.0.1 develop
  • blis 0.7.7
  • catalogue 2.0.7
  • certifi 2021.10.8
  • charset-normalizer 2.0.12
  • click 7.1.2
  • colorama 0.4.4
  • configparser 5.2.0
  • contexttimer 0.3.3
  • cymem 2.0.6
  • docker-pycreds 0.4.0
  • filelock 3.6.0
  • gitdb 4.0.9
  • gitpython 3.1.27
  • huggingface-hub 0.0.12
  • idna 3.3
  • jinja2 3.1.1
  • langcodes 3.3.0
  • markupsafe 2.1.1
  • murmurhash 1.0.6
  • numpy 1.22.3
  • packaging 21.3
  • pathtools 0.1.2
  • pathy 0.6.1
  • preshed 3.0.6
  • promise 2.3
  • protobuf 3.20.0
  • psutil 5.9.0
  • pydantic 1.8.2
  • pyparsing 3.0.8
  • python-dateutil 2.8.2
  • pyyaml 6.0
  • requests 2.27.1
  • sentry-sdk 1.5.10
  • shortuuid 1.0.8
  • six 1.16.0
  • smart-open 5.2.1
  • smmap 5.0.0
  • spacy 3.2.4
  • spacy-huggingface-hub 0.0.6
  • spacy-legacy 3.0.9
  • spacy-loggers 1.0.2
  • srsly 2.4.3
  • subprocess32 3.5.4
  • thinc 8.0.15
  • tqdm 4.64.0
  • typer 0.3.2
  • typing-extensions 4.2.0
  • urllib3 1.25.11
  • wandb 0.10.33
  • wasabi 0.9.1
hu_vectors_web_lg/pyproject.toml pypi
  • bump2version ^1.0.1 develop
  • contexttimer ^0.3.3
  • python ^3.8
  • spacy ~3.2.3
  • spacy-huggingface-hub ^0.0.6
  • wandb ^0.10.23
  • wheel ^0.37.0
hu_vectors_web_md/poetry.lock pypi
  • bump2version 1.0.1 develop
  • blis 0.7.7
  • catalogue 2.0.7
  • certifi 2021.10.8
  • charset-normalizer 2.0.12
  • click 7.1.2
  • colorama 0.4.4
  • configparser 5.2.0
  • contexttimer 0.3.3
  • cymem 2.0.6
  • docker-pycreds 0.4.0
  • filelock 3.6.0
  • floret 0.10.2
  • gitdb 4.0.9
  • gitpython 3.1.27
  • huggingface-hub 0.0.12
  • idna 3.3
  • jinja2 3.1.1
  • langcodes 3.3.0
  • markupsafe 2.1.1
  • murmurhash 1.0.6
  • numpy 1.22.3
  • packaging 21.3
  • pathtools 0.1.2
  • pathy 0.6.1
  • preshed 3.0.6
  • promise 2.3
  • protobuf 3.20.0
  • psutil 5.9.0
  • pydantic 1.8.2
  • pyparsing 3.0.8
  • python-dateutil 2.8.2
  • pyyaml 6.0
  • requests 2.27.1
  • sentry-sdk 1.5.10
  • shortuuid 1.0.8
  • six 1.16.0
  • smart-open 5.2.1
  • smmap 5.0.0
  • spacy 3.2.4
  • spacy-huggingface-hub 0.0.6
  • spacy-legacy 3.0.9
  • spacy-loggers 1.0.2
  • srsly 2.4.3
  • subprocess32 3.5.4
  • thinc 8.0.15
  • tqdm 4.64.0
  • typer 0.3.2
  • typing-extensions 4.2.0
  • urllib3 1.25.11
  • wandb 0.10.33
  • wasabi 0.9.1
hu_vectors_web_md/pyproject.toml pypi
  • bump2version ^1.0.1 develop
  • contexttimer ^0.3.3
  • floret ^0.10.2
  • python ^3.8
  • spacy ~3.2.3
  • spacy-huggingface-hub ^0.0.6
  • wandb ^0.10.23
  • wheel ^0.37.0