biglittledecoder
[NeurIPS'23] Speculative Decoding with Big Little Decoder
Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
✓Academic publication links
Links to: arxiv.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.1%) to scientific vocabulary
Keywords
Repository
[NeurIPS'23] Speculative Decoding with Big Little Decoder
Basic Info
- Host: GitHub
- Owner: kssteven418
- License: apache-2.0
- Language: Python
- Default Branch: main
- Homepage: https://arxiv.org/pdf/2302.07863.pdf
- Size: 100 MB
Statistics
- Stars: 90
- Watchers: 5
- Forks: 10
- Open Issues: 3
- Releases: 0
Topics
Metadata Files
README.md
Speculative Decoding with Big Little Decoder (BiLD)
This repo implements Speculative Decoding with Big Little Decoder (BiLD) on top of the HuggingFace framework.
Check out the paper for more details.

What is Big Little Decoder?
Big Little Decoder is a simple framework that enables faster generative inference. It can dramatically accelerate text generation by ~2x, without compromising performance on a variety of text generation scenarios. Furthermore, it is a simple plug-and-play solution that requires no training or architecture redesign.
Here's the key underlying idea:
- BiLD offloads the majority of simple word decisions to a smaller model, and only switches the control back to the larger model when needed.
- The small model "fallbacks" to the large model, when it runs into a hard-to-predict word.
- In case the small model makes a misstep, the larger model can "rollback" the predictions to correct the error
- This collaborative text generation combines the small model's fast autoregressive execution with the large model's accurate and efficient non-autoregressive execution!
Running BiLD for Machine Translation
Prerequisite
You need to prepare your own large and small models. You can either use HuggingFace's pretrained models or finetune them on your target tasks. Please refer to the HuggingFace's official instructions for more detail on loading and/or finetuning pretrained models.
Evaluation
We provide a script that evaluates BiLD on machine translation tasks: examples/pytorch/run_bild_translation.py.
BiLD evaluation command:
CUDA_VISIBLE_DEVICES=0 python run_bild_translation.py --model bild --small [small_model_path] --large [large_model_path] \
--dataset_name iwslt2017 --dataset_config iwslt2017-de-en --source_lang de --target_lang en --bild_rollback [RB] --bild_fallback [FB]
* This command runs bild on the IWSLT 2017 De-En translation task.
* [small_model_path] and [large_model_path] are paths to the small and the large model, respectively (prepared as prerequisite).
* [RB] is the rollback threshold (normally 2~5 works fine). [FB] is the fallback threshold that can have a value from 0 to 1. For more details of these two hyperparameters, please refer to our paper.
We also provide a command for running the baseline model:
CUDA_VISIBLE_DEVICES=0 python run_bild_translation.py --model [model_path] \
--dataset_name iwslt2017 --dataset_config iwslt2017-de-en --source_lang de --target_lang en
* [model_path] is the path to the baseline model (e.g. [small_model_path] or [large_model_path])
Pretrained Checkpoints
We provide finetuned checkpoints that were used for the evaluations in our paper.
| Dataset | Model | Link | | -------- | -------- | -------- | | IWSLT-2017-De-En | mT5-small | link | | IWSLT-2017-De-En | mT5-small (aligned) | link | | IWSLT-2017-De-En | mT5-large | link | | WMT-2014-De-En | mT5-small | link | | WMT-2014-De-En | mT5-small (aligned) | link | | WMT-2014-De-En | mT5-large | link | | XSUM | T5-small | link | | XSUM | T5-small (aligned) | link | | XSUM | T5-large | link | | CNNDM | T5-small | link | | CNNDM | T5-small (aligned) | link | | CNNDM | T5-large | link |
Owner
- Name: Sehoon Kim
- Login: kssteven418
- Kind: user
- Location: Berkeley, United States
- Company: BAIR, UC Berkeley
- Website: sehoonkim.org
- Repositories: 29
- Profile: https://github.com/kssteven418
Citation (CITATION.cff)
cff-version: "1.2.0"
date-released: 2020-10
message: "If you use this software, please cite it using these metadata."
title: "Transformers: State-of-the-Art Natural Language Processing"
url: "https://github.com/huggingface/transformers"
authors:
- family-names: Wolf
given-names: Thomas
- family-names: Debut
given-names: Lysandre
- family-names: Sanh
given-names: Victor
- family-names: Chaumond
given-names: Julien
- family-names: Delangue
given-names: Clement
- family-names: Moi
given-names: Anthony
- family-names: Cistac
given-names: Perric
- family-names: Ma
given-names: Clara
- family-names: Jernite
given-names: Yacine
- family-names: Plu
given-names: Julien
- family-names: Xu
given-names: Canwen
- family-names: "Le Scao"
given-names: Teven
- family-names: Gugger
given-names: Sylvain
- family-names: Drame
given-names: Mariama
- family-names: Lhoest
given-names: Quentin
- family-names: Rush
given-names: "Alexander M."
preferred-citation:
type: conference-paper
authors:
- family-names: Wolf
given-names: Thomas
- family-names: Debut
given-names: Lysandre
- family-names: Sanh
given-names: Victor
- family-names: Chaumond
given-names: Julien
- family-names: Delangue
given-names: Clement
- family-names: Moi
given-names: Anthony
- family-names: Cistac
given-names: Perric
- family-names: Ma
given-names: Clara
- family-names: Jernite
given-names: Yacine
- family-names: Plu
given-names: Julien
- family-names: Xu
given-names: Canwen
- family-names: "Le Scao"
given-names: Teven
- family-names: Gugger
given-names: Sylvain
- family-names: Drame
given-names: Mariama
- family-names: Lhoest
given-names: Quentin
- family-names: Rush
given-names: "Alexander M."
booktitle: "Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations"
month: 10
start: 38
end: 45
title: "Transformers: State-of-the-Art Natural Language Processing"
year: 2020
publisher: "Association for Computational Linguistics"
url: "https://www.aclweb.org/anthology/2020.emnlp-demos.6"
address: "Online"
GitHub Events
Total
- Watch event: 9
- Issue comment event: 1
Last Year
- Watch event: 9
- Issue comment event: 1
Issues and Pull Requests
Last synced: about 1 year ago
All Time
- Total issues: 5
- Total pull requests: 4
- Average time to close issues: about 2 hours
- Average time to close pull requests: less than a minute
- Total issue authors: 4
- Total pull request authors: 2
- Average comments per issue: 0.6
- Average comments per pull request: 0.25
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 1
- Pull requests: 0
- Average time to close issues: 6 minutes
- Average time to close pull requests: N/A
- Issue authors: 1
- Pull request authors: 0
- Average comments per issue: 1.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- jaemin-han (1)
- SuhongMoon (1)
- sufeidechabei (1)
- manyuanbin (1)
Pull Request Authors
- kssteven418 (1)
- sidjha1 (1)
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- actions/cache v2 composite
- actions/checkout v2 composite
- actions/upload-artifact v2 composite
- actions/checkout v2 composite
- actions/checkout v3 composite
- docker/build-push-action v2 composite
- docker/login-action v1 composite
- docker/setup-buildx-action v1 composite
- actions/checkout v2 composite
- docker/build-push-action v2 composite
- docker/login-action v1 composite
- docker/setup-buildx-action v1 composite
- actions/checkout v2 composite
- actions/download-artifact v2 composite
- actions/checkout v2 composite
- actions/download-artifact v2 composite
- actions/upload-artifact v2 composite
- actions/cache v2 composite
- actions/checkout v2 composite
- actions/upload-artifact v2 composite
- actions/checkout v1 composite
- conda-incubator/setup-miniconda v2 composite
- actions/checkout v2 composite
- actions/download-artifact v2 composite
- actions/upload-artifact v2 composite
- actions/checkout v2 composite
- actions/download-artifact v2 composite
- actions/upload-artifact v2 composite
- actions/checkout v3 composite
- tj-actions/changed-files v22.2 composite
- actions/checkout v2 composite
- actions/download-artifact v2 composite
- actions/upload-artifact v2 composite
- actions/checkout v2 composite
- actions/download-artifact v2 composite
- actions/upload-artifact v2 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- actions/cache v2 composite
- actions/checkout v2 composite
- nvidia/cuda 11.2.2-cudnn8-devel-ubuntu20.04 build
- ubuntu 18.04 build
- python 3.8 build
- nvidia/cuda 10.2-cudnn7-devel-ubuntu18.04 build
- $BASE_DOCKER_IMAGE latest build
- ubuntu 18.04 build
- nvcr.io/nvidia/pytorch 21.03-py3 build
- nvcr.io/nvidia/pytorch 21.03-py3 build
- nvidia/cuda 11.2.2-cudnn8-devel-ubuntu20.04 build
- google/cloud-sdk slim build
- ubuntu 18.04 build
- nvidia/cuda 11.2.2-cudnn8-devel-ubuntu20.04 build
- nvcr.io/nvidia/pytorch 22.02-py3 build
- conllu * test
- datasets >=1.1.3 test
- evaluate >=0.2.0 test
- nltk * test
- pytest * test
- rouge-score * test
- seqeval * test
- tensorboard * test
- datasets >=1.1.3
- flax >=0.3.5
- jax >=0.2.8
- jaxlib >=0.1.59
- optax >=0.0.9
- datasets >=1.8.0
- flax >=0.3.5
- jax >=0.2.17
- jaxlib >=0.1.68
- optax >=0.0.8
- datasets >=1.1.3
- evaluate >=0.2.0
- flax >=0.3.5
- jax >=0.2.8
- jaxlib >=0.1.59
- optax >=0.0.8
- datasets >=1.1.3
- flax >=0.3.5
- jax >=0.2.8
- jaxlib >=0.1.59
- optax >=0.0.8
- datasets >=1.8.0
- flax >=0.3.5
- jax >=0.2.8
- jaxlib >=0.1.59
- optax >=0.0.8
- seqeval *
- flax >=0.3.5
- jax >=0.2.8
- jaxlib >=0.1.59
- optax >=0.0.8
- torch ==1.9.0
- torchvision ==0.10.0
- conllu *
- datasets >=1.1.3
- elasticsearch *
- faiss-cpu *
- fire *
- git-python ==1.0.3
- matplotlib *
- nltk *
- pandas *
- protobuf *
- psutil *
- pytest *
- ray *
- rouge-score *
- sacrebleu *
- scikit-learn *
- sentencepiece *
- seqeval *
- streamlit *
- tensorboard *
- tensorflow_datasets *
- conllu *
- datasets >=1.1.3
- elasticsearch *
- faiss-cpu *
- fire *
- git-python ==1.0.3
- matplotlib *
- nltk *
- pandas *
- protobuf *
- psutil *
- pytest *
- rouge-score *
- sacrebleu *
- scikit-learn *
- sentencepiece *
- seqeval *
- streamlit *
- tensorboard *
- tensorflow_datasets *
- accelerate main test
- conllu * test
- datasets >=1.13.3 test
- elasticsearch * test
- evaluate >=0.2.0 test
- faiss-cpu * test
- fire * test
- git-python ==1.0.3 test
- jiwer * test
- librosa * test
- matplotlib * test
- nltk * test
- pandas * test
- protobuf * test
- psutil * test
- pytest * test
- rouge-score * test
- sacrebleu >=1.4.12 test
- scikit-learn * test
- sentencepiece * test
- seqeval * test
- streamlit * test
- tensorboard * test
- tensorflow_datasets * test
- torchvision * test
- datasets >=1.14.0
- evaluate *
- librosa *
- torch >=1.6
- torchaudio *
- torch >=1.3
- datasets >=1.8.0
- torch >=1.5.0
- torchvision >=0.6.0
- datasets >=1.17.0
- evaluate *
- torch >=1.5.0
- torchvision >=0.6.0
- datasets >=1.8.0
- torch >=1.5.0
- torchvision >=0.6.0
- accelerate *
- datasets >=1.8.0
- evaluate *
- protobuf *
- sentencepiece *
- torch >=1.3
- accelerate *
- evaluate *
- protobuf *
- sentencepiece *
- torch >=1.3
- accelerate *
- datasets >=1.8.0
- evaluate *
- torch >=1.3.0
- datasets >=2.0.0
- evaluate *
- torch >=1.3
- accelerate >=0.5.0
- datasets >=1.12.0
- librosa *
- torch >=1.5
- torchaudio *
- datasets >=1.18.0
- evaluate *
- jiwer *
- librosa *
- torch >=1.5
- torchaudio *
- accelerate *
- datasets >=1.8.0
- evaluate *
- nltk *
- protobuf *
- py7zr *
- rouge-score *
- sentencepiece *
- torch >=1.3
- accelerate *
- datasets >=1.8.0
- evaluate *
- protobuf *
- scikit-learn *
- scipy *
- sentencepiece *
- torch >=1.3
- protobuf *
- sentencepiece *
- torch >=1.3
- accelerate *
- datasets >=1.8.0
- evaluate *
- seqeval *
- torch >=1.3
- accelerate *
- datasets >=1.8.0
- evaluate *
- protobuf *
- py7zr *
- sacrebleu >=1.4.12
- sentencepiece *
- torch >=1.3
- transformers ==3.5.1
- transformers ==3.5.1
- nltk *
- py-rouge *
- transformers ==3.5.1
- transformers ==3.5.1
- datasets ==2.3.2
- evaluate ==0.2.2
- scikit-learn ==1.1.2
- transformers ==4.21.1
- wandb ==0.13.1
- datasets ==1.16.0
- datasketch ==1.5.7
- dpu_utils *
- huggingface-hub ==0.1.0
- tensorboard ==2.6.0
- torch ==1.11.0
- transformers ==4.19.0
- wandb ==0.12.0
- APScheduler ==3.9.1
- Brotli ==1.0.9
- Cython ==0.29.28
- Deprecated ==1.2.13
- Flask ==2.0.3
- Flask-Compress ==1.11
- GitPython ==3.1.18
- Jinja2 ==2.11.3
- Keras-Preprocessing ==1.1.2
- Mako ==1.2.2
- Markdown ==3.3.6
- MarkupSafe ==1.1.1
- Pillow ==9.0.1
- Pint ==0.16.1
- PyYAML ==6.0
- Pygments ==2.11.2
- SQLAlchemy ==1.4.32
- SoundFile ==0.10.3.post1
- Werkzeug ==2.0.3
- absl-py ==1.0.0
- aiohttp ==3.8.1
- aiosignal ==1.2.0
- alembic ==1.7.7
- appdirs ==1.4.4
- arrow ==1.2.2
- asttokens ==2.0.5
- astunparse ==1.6.3
- async-timeout ==4.0.2
- attrs ==21.4.0
- audioread ==2.1.9
- autopage ==0.5.0
- backcall ==0.2.0
- backoff ==1.11.1
- backports.zoneinfo ==0.2.1
- binaryornot ==0.4.4
- black ==22.1.0
- boto3 ==1.16.34
- botocore ==1.19.63
- cachetools ==5.0.0
- certifi ==2021.10.8
- cffi ==1.15.0
- chardet ==4.0.0
- charset-normalizer ==2.0.12
- chex ==0.1.1
- click ==8.0.4
- cliff ==3.10.1
- clldutils ==3.11.1
- cloudpickle ==2.0.0
- cmaes ==0.8.2
- cmd2 ==2.4.0
- codecarbon ==1.2.0
- colorlog ==6.6.0
- cookiecutter ==2.1.1
- cryptography ==36.0.2
- csvw ==2.0.0
- cycler ==0.11.0
- dash ==2.3.0
- dash-bootstrap-components ==1.0.3
- dash-core-components ==2.0.0
- dash-html-components ==2.0.0
- dash-table ==5.0.0
- datasets ==2.0.0
- decorator ==5.1.1
- dill ==0.3.4
- dlinfo ==1.2.1
- dm-tree ==0.1.6
- docker ==4.4.4
- execnet ==1.9.0
- executing ==0.8.3
- faiss-cpu ==1.7.2
- fasteners ==0.17.3
- filelock ==3.6.0
- fire ==0.4.0
- flake8 ==4.0.1
- flatbuffers ==2.0
- flax ==0.4.0
- fonttools ==4.31.1
- frozenlist ==1.3.0
- fsspec ==2022.2.0
- fugashi ==1.1.2
- gast ==0.5.3
- gitdb ==4.0.9
- glfw ==2.5.1
- google-auth ==2.6.2
- google-auth-oauthlib ==0.4.6
- google-pasta ==0.2.0
- greenlet ==1.1.2
- grpcio ==1.44.0
- gym ==0.23.1
- gym-notices ==0.0.6
- h5py ==3.6.0
- huggingface-hub ==0.4.0
- hypothesis ==6.39.4
- idna ==3.3
- imageio ==2.16.1
- importlib-metadata ==4.11.3
- importlib-resources ==5.4.0
- iniconfig ==1.1.1
- ipadic ==1.0.0
- ipython ==8.1.1
- isodate ==0.6.1
- isort ==5.10.1
- itsdangerous ==2.1.1
- jax ==0.3.4
- jaxlib ==0.3.2
- jedi ==0.18.1
- jinja2-time ==0.2.0
- jmespath ==0.10.0
- joblib ==1.2.0
- jsonschema ==4.4.0
- keras ==2.8.0
- kiwisolver ==1.4.0
- kubernetes ==12.0.1
- libclang ==13.0.0
- librosa ==0.9.1
- llvmlite ==0.38.0
- matplotlib ==3.5.1
- matplotlib-inline ==0.1.3
- mccabe ==0.6.1
- msgpack ==1.0.3
- mujoco-py ==2.1.2.14
- multidict ==6.0.2
- multiprocess ==0.70.12.2
- mypy-extensions ==0.4.3
- nltk ==3.7
- numba ==0.55.1
- numpy ==1.22.3
- oauthlib ==3.2.1
- onnx ==1.11.0
- onnxconverter-common ==1.9.0
- opt-einsum ==3.3.0
- optax ==0.1.1
- optuna ==2.10.0
- packaging ==21.3
- pandas ==1.4.1
- parameterized ==0.8.1
- parso ==0.8.3
- pathspec ==0.9.0
- pbr ==5.8.1
- pexpect ==4.8.0
- phonemizer ==3.0.1
- pickleshare ==0.7.5
- plac ==1.3.4
- platformdirs ==2.5.1
- plotly ==5.6.0
- pluggy ==1.0.0
- pooch ==1.6.0
- portalocker ==2.0.0
- poyo ==0.5.0
- prettytable ==3.2.0
- prompt-toolkit ==3.0.28
- protobuf ==3.19.5
- psutil ==5.9.0
- ptyprocess ==0.7.0
- pure-eval ==0.2.2
- py ==1.11.0
- py-cpuinfo ==8.0.0
- pyOpenSSL ==22.0.0
- pyarrow ==7.0.0
- pyasn1 ==0.4.8
- pyasn1-modules ==0.2.8
- pycodestyle ==2.8.0
- pycparser ==2.21
- pyctcdecode ==0.3.0
- pyflakes ==2.4.0
- pygtrie ==2.4.2
- pynvml ==11.4.1
- pyparsing ==3.0.7
- pyperclip ==1.8.2
- pypng ==0.0.21
- pyrsistent ==0.18.1
- pytest ==7.1.1
- pytest-forked ==1.4.0
- pytest-timeout ==2.1.0
- pytest-xdist ==2.5.0
- python-dateutil ==2.8.2
- python-slugify ==6.1.1
- pytz ==2022.1
- pytz-deprecation-shim ==0.1.0.post0
- ray ==1.11.0
- redis ==4.1.4
- regex ==2022.3.15
- requests ==2.27.1
- requests-oauthlib ==1.3.1
- resampy ==0.2.2
- responses ==0.18.0
- rfc3986 ==1.5.0
- rouge-score ==0.0.4
- rsa ==4.8
- s3transfer ==0.3.7
- sacrebleu ==1.5.1
- sacremoses ==0.0.49
- scikit-learn ==1.0.2
- scipy ==1.8.0
- segments ==2.2.0
- sentencepiece ==0.1.96
- sigopt ==8.2.0
- six ==1.16.0
- smmap ==5.0.0
- sortedcontainers ==2.4.0
- stack-data ==0.2.0
- stevedore ==3.5.0
- tabulate ==0.8.9
- tenacity ==8.0.1
- tensorboard ==2.8.0
- tensorboard-data-server ==0.6.1
- tensorboard-plugin-wit ==1.8.1
- tensorboardX ==2.5
- tensorflow ==2.8.1
- tensorflow-io-gcs-filesystem ==0.24.0
- termcolor ==1.1.0
- text-unidecode ==1.3
- tf-estimator-nightly ==2.8.0.dev2021122109
- tf2onnx ==1.9.3
- threadpoolctl ==3.1.0
- timeout-decorator ==0.5.0
- timm ==0.5.4
- tokenizers ==0.11.6
- tomli ==2.0.1
- toolz ==0.11.2
- torch ==1.11.0
- torchaudio ==0.11.0
- torchvision ==0.12.0
- tqdm ==4.63.0
- traitlets ==5.1.1
- typing-extensions ==4.1.1
- tzdata ==2022.1
- tzlocal ==4.1
- unidic ==1.1.0
- unidic-lite ==1.0.8
- uritemplate ==4.1.1
- urllib3 ==1.26.9
- wasabi ==0.9.0
- wcwidth ==0.2.5
- websocket-client ==1.3.1
- wrapt ==1.14.0
- xxhash ==3.0.0
- yarl ==1.7.2
- zipp ==3.7.0
- transformers ==3.5.1
- gitpython ==3.0.2
- psutil ==5.6.6
- scipy >=1.4.1
- tensorboard >=1.14.0
- tensorboardX ==1.8
- transformers *
- transformers >=4.9.2
- torch >=1.9.0
- joblib >=0.13.2
- matplotlib *
- numpy >=1.17.2
- scipy *
- torch >=1.10.1
- transformers >=3.5
- datasets *
- flax *
- jsonlines *
- sentencepiece *
- wandb *
- flax >=0.3.5
- jax >=0.2.8
- jaxlib >=0.1.59
- optax >=0.0.8
- torch ==1.9.0
- torchvision ==0.10.0
- datasets *
- pillow *
- seqeval *
- datasets >=1.1.3
- elasticsearch *
- faiss-cpu *
- streamlit *
- CacheControl ==0.12.6
- Jinja2 >=2.11.3
- MarkupSafe ==1.1.1
- Pillow >=8.1.1
- PyYAML >=5.4
- Pygments >=2.7.4
- QtPy ==1.9.0
- Send2Trash ==1.5.0
- appdirs ==1.4.3
- argon2-cffi ==20.1.0
- async-generator ==1.10
- attrs ==20.2.0
- backcall ==0.2.0
- certifi ==2020.6.20
- cffi ==1.14.2
- chardet ==3.0.4
- click ==7.1.2
- colorama ==0.4.3
- contextlib2 ==0.6.0
- cycler ==0.10.0
- datasets ==1.0.0
- decorator ==4.4.2
- defusedxml ==0.6.0
- dill ==0.3.2
- distlib ==0.3.0
- distro ==1.4.0
- entrypoints ==0.3
- filelock ==3.0.12
- future ==0.18.2
- html5lib ==1.0.1
- idna ==2.8
- ipaddr ==2.2.0
- ipykernel ==5.3.4
- ipython *
- ipython-genutils ==0.2.0
- ipywidgets ==7.5.1
- jedi ==0.17.2
- joblib ==1.2.0
- jsonschema ==3.2.0
- jupyter ==1.0.0
- jupyter-client ==6.1.7
- jupyter-console ==6.2.0
- jupyter-core ==4.6.3
- jupyterlab-pygments ==0.1.1
- kiwisolver ==1.2.0
- lockfile ==0.12.2
- matplotlib ==3.3.1
- mistune ==2.0.3
- msgpack ==0.6.2
- nbclient ==0.5.0
- nbconvert ==6.5.1
- nbformat ==5.0.7
- nest-asyncio ==1.4.0
- notebook ==6.4.12
- numpy ==1.22.0
- opencv-python ==4.4.0.42
- packaging ==20.3
- pandas ==1.1.2
- pandocfilters ==1.4.2
- parso ==0.7.1
- pep517 ==0.8.2
- pexpect ==4.8.0
- pickleshare ==0.7.5
- progress ==1.5
- prometheus-client ==0.8.0
- prompt-toolkit ==3.0.7
- ptyprocess ==0.6.0
- pyaml ==20.4.0
- pyarrow ==1.0.1
- pycparser ==2.20
- pyparsing ==2.4.6
- pyrsistent ==0.16.0
- python-dateutil ==2.8.1
- pytoml ==0.1.21
- pytz ==2020.1
- pyzmq ==19.0.2
- qtconsole ==4.7.7
- regex ==2020.7.14
- requests ==2.22.0
- retrying ==1.3.3
- sacremoses ==0.0.43
- sentencepiece ==0.1.91
- six ==1.14.0
- terminado ==0.8.3
- testpath ==0.4.4
- tokenizers ==0.8.1rc2
- torch ==1.6.0
- torchvision ==0.7.0
- tornado ==6.0.4
- tqdm ==4.48.2
- traitlets *
- urllib3 ==1.26.5
- wcwidth ==0.2.5
- webencodings ==0.5.1
- wget ==3.2
- widgetsnbextension ==3.5.1
- xxhash ==2.0.0
- datasets >=1.1.3
- ltp *
- protobuf *
- sentencepiece *
- h5py >=2.10.0
- knockknock >=0.1.8.1
- numpy >=1.18.2
- scipy >=1.4.1
- torch >=1.4.0
- torch >=1.10
- conllu *
- datasets >=1.1.3
- elasticsearch *
- faiss-cpu *
- fire *
- git-python ==1.0.3
- matplotlib *
- nltk *
- pandas *
- protobuf *
- psutil *
- pytest *
- pytorch-lightning *
- rouge-score *
- sacrebleu *
- scikit-learn *
- sentencepiece *
- seqeval *
- streamlit *
- tensorboard *
- tensorflow_datasets *
- transformers ==3.5.1
- GitPython *
- datasets >=1.0.1
- faiss-cpu >=1.6.3
- psutil >=5.7.0
- pytorch-lightning >=1.5.10
- ray >=1.10.0
- torch >=1.4.0
- transformers *
- datasets *
- faiss-cpu >=1.7.2
- nvidia-ml-py3 ==7.352.0
- psutil >=5.9.1
- pytorch-lightning ==1.6.4
- ray >=1.13.0
- torch >=1.11.0
- accelerate *
- datasets >=1.8.0
- protobuf *
- scikit-learn *
- scipy *
- sentencepiece *
- torch >=1.3
- conllu *
- datasets >=1.1.3
- elasticsearch *
- faiss-cpu *
- fire *
- git-python ==1.0.3
- matplotlib *
- nltk *
- pandas *
- protobuf *
- psutil *
- pytest *
- pytorch-lightning *
- rouge-score *
- sacrebleu *
- scikit-learn *
- sentencepiece *
- streamlit *
- tensorboard *
- tensorflow_datasets *
- datasets *
- nltk *
- numpy *
- pandas *
- CacheControl ==0.12.6
- Jinja2 >=2.11.3
- MarkupSafe ==1.1.1
- Pillow >=8.1.1
- PyYAML >=5.4
- Pygments >=2.7.4
- QtPy ==1.9.0
- Send2Trash ==1.5.0
- appdirs ==1.4.3
- argon2-cffi ==20.1.0
- async-generator ==1.10
- attrs ==20.2.0
- backcall ==0.2.0
- certifi ==2020.6.20
- cffi ==1.14.2
- chardet ==3.0.4
- click ==7.1.2
- colorama ==0.4.3
- contextlib2 ==0.6.0
- cycler ==0.10.0
- datasets ==1.0.0
- decorator ==4.4.2
- defusedxml ==0.6.0
- dill ==0.3.2
- distlib ==0.3.0
- distro ==1.4.0
- entrypoints ==0.3
- filelock ==3.0.12
- future ==0.18.2
- html5lib ==1.0.1
- idna ==2.8
- ipaddr ==2.2.0
- ipykernel ==5.3.4
- ipython *
- ipython-genutils ==0.2.0
- ipywidgets ==7.5.1
- jedi ==0.17.2
- joblib ==1.2.0
- jsonschema ==3.2.0
- jupyter ==1.0.0
- jupyter-client ==6.1.7
- jupyter-console ==6.2.0
- jupyter-core ==4.6.3
- jupyterlab-pygments ==0.1.1
- kiwisolver ==1.2.0
- lockfile ==0.12.2
- matplotlib ==3.3.1
- mistune ==2.0.3
- msgpack ==0.6.2
- nbclient ==0.5.0
- nbconvert ==6.5.1
- nbformat ==5.0.7
- nest-asyncio ==1.4.0
- notebook ==6.4.12
- numpy ==1.22.0
- opencv-python ==4.4.0.42
- packaging ==20.3
- pandas ==1.1.2
- pandocfilters ==1.4.2
- parso ==0.7.1
- pep517 ==0.8.2
- pexpect ==4.8.0
- pickleshare ==0.7.5
- progress ==1.5
- prometheus-client ==0.8.0
- prompt-toolkit ==3.0.7
- ptyprocess ==0.6.0
- pyaml ==20.4.0
- pyarrow ==1.0.1
- pycparser ==2.20
- pyparsing ==2.4.6
- pyrsistent ==0.16.0
- python-dateutil ==2.8.1
- pytoml ==0.1.21
- pytz ==2020.1
- pyzmq ==19.0.2
- qtconsole ==4.7.7
- regex ==2020.7.14
- requests ==2.22.0
- retrying ==1.3.3
- sacremoses ==0.0.43
- sentencepiece ==0.1.91
- six ==1.14.0
- terminado ==0.8.3
- testpath ==0.4.4
- tokenizers ==0.8.1rc2
- torch ==1.6.0
- torchvision ==0.7.0
- tornado ==6.0.4
- tqdm ==4.48.2
- traitlets *
- urllib3 ==1.26.5
- wcwidth ==0.2.5
- webencodings ==0.5.1
- wget ==3.2
- widgetsnbextension ==3.5.1
- xxhash ==2.0.0
- datasets *
- jiwer ==2.2.0
- lang-trans ==0.6.0
- librosa ==0.8.0
- torch >=1.5.0
- torchaudio *
- transformers *
- datasets >=1.18.0
- jiwer *
- librosa *
- torch >=1.5
- torchaudio *
- accelerate main test
- conllu * test
- datasets >=1.13.3 test
- elasticsearch * test
- evaluate >=0.2.0 test
- faiss-cpu * test
- fire * test
- git-python ==1.0.3 test
- jiwer * test
- librosa * test
- matplotlib * test
- nltk * test
- pandas * test
- protobuf * test
- psutil * test
- pytest * test
- rouge-score * test
- sacrebleu >=1.4.12 test
- scikit-learn * test
- sentencepiece * test
- seqeval * test
- streamlit * test
- tensorboard * test
- tensorflow * test
- tensorflow_datasets * test
- tensorflow >=2.3
- datasets >=1.8.0
- sentencepiece *
- protobuf *
- sentencepiece *
- tensorflow >=2.3
- datasets >=1.4.0
- evaluate >=0.2.0
- tensorflow >=2.3.0
- datasets >=1.4.0
- evaluate >=0.2.0
- tensorflow >=2.3.0
- datasets >=1.1.3
- evaluate >=0.2.0
- protobuf *
- sentencepiece *
- tensorflow >=2.3
- datasets >=1.4.0
- evaluate >=0.2.0
- tensorflow >=2.3.0
- datasets >=1.4.0
- evaluate >=0.2.0
- tensorflow >=2.3.0
- deps *
- datasets ==1.8.0 test