stt

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

https://github.com/coqui-ai/stt

Science Score: 33.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: zenodo.org
  • Committers with academic emails
    10 of 183 committers (5.5%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (13.7%) to scientific vocabulary

Keywords

asr automatic-speech-recognition deep-learning speech-recognition speech-recognition-api speech-recognizer speech-to-text stt tensorflow voice-recognition

Keywords from Contributors

speaker-encoder speech speech-synthesis text-to-speech voice-conversion multi-speaker-tts melgan hifigan glow-tts tacotron
Last synced: 6 months ago · JSON representation

Repository

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

Basic Info
  • Host: GitHub
  • Owner: coqui-ai
  • License: mpl-2.0
  • Language: C++
  • Default Branch: main
  • Homepage: https://coqui.ai
  • Size: 53.4 MB
Statistics
  • Stars: 2,503
  • Watchers: 61
  • Forks: 298
  • Open Issues: 107
  • Releases: 32
Topics
asr automatic-speech-recognition deep-learning speech-recognition speech-recognition-api speech-recognizer speech-to-text stt tensorflow voice-recognition
Created almost 5 years ago · Last pushed almost 2 years ago
Metadata Files
Readme Contributing License Code of conduct

README.rst

.. note::
   **This project is no longer actively maintained**, and we have stopped hosting the online Model Zoo. We've seen focus shift towards newer STT models such as [Whisper](https://github.com/openai/whisper), and have ourselves focused on [Coqui TTS](https://github.com/coqui-ai/TTS) and [Coqui Studio](https://coqui.ai/).
   
   The models will remain available in [the releases of the coqui-ai/STT-models repo](https://github.com/coqui-ai/STT-models/releases).

.. image:: images/coqui-STT-logo-green.png
   :alt: Coqui STT logo


.. |doc-img| image:: https://readthedocs.org/projects/stt/badge/?version=latest
   :target: https://stt.readthedocs.io/?badge=latest
   :alt: Documentation

.. |covenant-img| image:: https://img.shields.io/badge/Contributor%20Covenant-2.0-4baaaa.svg
   :target: CODE_OF_CONDUCT.md
   :alt: Contributor Covenant

.. |gitter-img| image:: https://badges.gitter.im/coqui-ai/STT.svg
   :target: https://gitter.im/coqui-ai/STT?utm_source=badge&utm_medium=badge&utm_campaign=pr-badge
   :alt: Gitter Room

.. |doi| image:: https://zenodo.org/badge/344354127.svg
   :target: https://zenodo.org/badge/latestdoi/344354127

|doc-img| |covenant-img| |gitter-img| |doi|

`👉 Subscribe to 🐸Coqui's Newsletter `_

**Coqui STT** (🐸STT) is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. 🐸STT is battle tested in both production and research 🚀

🐸STT features
---------------

* High-quality pre-trained STT model.
* Efficient training pipeline with Multi-GPU support.
* Streaming inference.
* Multiple possible transcripts, each with an associated confidence score.
* Real-time inference.
* Small-footprint acoustic model.
* Bindings for various programming languages.

`Quickstart `_
================================================================

Where to Ask Questions
----------------------

.. list-table::
   :widths: 25 25
   :header-rows: 1

   * - Type
     - Link
   * - 🚨 **Bug Reports**
     - `Github Issue Tracker `_
   * - 🎁 **Feature Requests & Ideas**
     - `Github Issue Tracker `_
   * - ❔ **Questions**
     - `Github Discussions `_
   * - 💬 **General Discussion**
     - `Github Discussions `_ or `Gitter Room `_


Links & Resources
-----------------
.. list-table::
   :widths: 25 25
   :header-rows: 1

   * - Type
     - Link
   * - 📰 **Documentation**
     - `stt.readthedocs.io `_
   * - 🚀 **Latest release with pre-trained models**
     - `see the latest release on GitHub `_
   * - 🤝 **Contribution Guidelines**
     - `CONTRIBUTING.rst `_

Owner

  • Name: coqui
  • Login: coqui-ai
  • Kind: organization
  • Email: info@coqui.ai

Coqui, a startup providing open speech tech for everyone 🐸

GitHub Events

Total
  • Watch event: 246
  • Issue comment event: 2
  • Fork event: 21
Last Year
  • Watch event: 246
  • Issue comment event: 2
  • Fork event: 21

Committers

Last synced: 8 months ago

All Time
  • Total Commits: 2,786
  • Total Committers: 183
  • Avg Commits per committer: 15.224
  • Development Distribution Score (DDS): 0.595
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Reuben Morais r****s@g****m 1,129
Alexandre Lissy l****x@l****g 570
Kelly Davis k****s@m****m 138
Tilman Kamp t****p@p****e 134
Josh Meyer j****r@g****m 110
wasertech d****y@w****h 48
CatalinVoss c****n@c****u 45
Chris Lord c****t@g****m 41
Daniel d****l@m****e 39
Carlos Fonseca M c****1@h****m 26
dabinat d****t@e****m 25
Alessio Placitelli a****i@a****t 21
godeffroy g****t@a****m 19
josh j****r@m****m 15
Yi-Hua Chiu m****3@g****m 13
Kelly Davis k****s@c****i 13
Aya Jafari a****i@c****i 12
andi4191 a****1@g****m 11
Bernardo Henz b****z@g****m 10
Mike C. Fletcher m****h@v****m 10
Shubham Kumar s****3@g****m 10
Francis Tyers f****s 10
Alexandre Lissy a****b@m****m 9
Rob r****h@o****m 9
lissyx 1****x 9
Erik Ziegler e****r@s****e 9
Richard Hamnett r****t 8
Andre Natal a****l@g****m 7
Daniel Souza d****5@g****m 7
Tiago Morais Morgado e****8@g****m 7
and 153 more...

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 54
  • Total pull requests: 52
  • Average time to close issues: about 2 months
  • Average time to close pull requests: 22 days
  • Total issue authors: 40
  • Total pull request authors: 23
  • Average comments per issue: 5.15
  • Average comments per pull request: 2.62
  • Merged pull requests: 22
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • wasertech (8)
  • JRMeyer (3)
  • FrontierDK (3)
  • SuperKogito (2)
  • johns10 (2)
  • reuben (2)
  • Baerlie (1)
  • bernardohenz (1)
  • mo-g (1)
  • Anlubi (1)
  • tgwaste (1)
  • BjornTheProgrammer (1)
  • Patronics (1)
  • Himly1 (1)
  • Freemanlabs (1)
Pull Request Authors
  • wasertech (10)
  • dsouza95 (6)
  • Dexterp37 (5)
  • reuben (5)
  • FenNlay (3)
  • adeepH (3)
  • ZhengkunMei (2)
  • nan-wang (2)
  • Dmole (2)
  • NanoNabla (2)
  • mariano-balto (1)
  • ChamathKB (1)
  • ehwgal (1)
  • Benjamin-Loison (1)
  • BeauregardTA (1)
Top Labels
Issue Labels
bug (36) enhancement (16)
Pull Request Labels

Packages

  • Total packages: 12
  • Total downloads:
    • npm 721 last-month
    • pypi 2,483 last-month
  • Total docker downloads: 659
  • Total dependent packages: 2
    (may contain duplicates)
  • Total dependent repositories: 50
    (may contain duplicates)
  • Total versions: 442
  • Total maintainers: 4
pypi.org: stt

A library for doing speech recognition using a Coqui STT model

  • Versions: 35
  • Dependent Packages: 0
  • Dependent Repositories: 20
  • Downloads: 852 Last month
  • Docker Downloads: 0
Rankings
Stargazers count: 1.6%
Docker downloads count: 1.8%
Dependent repos count: 3.2%
Forks count: 3.4%
Average: 4.8%
Downloads: 8.4%
Dependent packages count: 10.1%
Maintainers (2)
Last synced: 6 months ago
proxy.golang.org: github.com/coqui-ai/stt
  • Versions: 152
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Stargazers count: 1.5%
Forks count: 1.9%
Average: 5.2%
Dependent packages count: 8.1%
Dependent repos count: 9.3%
Last synced: 6 months ago
npmjs.org: stt

A library for doing speech recognition using a Coqui STT model

  • Versions: 32
  • Dependent Packages: 1
  • Dependent Repositories: 25
  • Downloads: 702 Last month
  • Docker Downloads: 659
Rankings
Docker downloads count: 0.9%
Stargazers count: 2.0%
Forks count: 2.2%
Dependent repos count: 2.5%
Downloads: 4.5%
Average: 5.5%
Dependent packages count: 21.0%
Maintainers (2)
Last synced: 6 months ago
proxy.golang.org: github.com/coqui-ai/STT
  • Versions: 152
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent packages count: 6.5%
Average: 6.7%
Dependent repos count: 6.9%
Last synced: 6 months ago
pypi.org: stt-tflite

A library for doing speech recognition using a Coqui STT model

  • Versions: 6
  • Dependent Packages: 0
  • Dependent Repositories: 2
  • Downloads: 139 Last month
Rankings
Stargazers count: 1.6%
Forks count: 3.4%
Average: 8.7%
Dependent packages count: 10.1%
Dependent repos count: 11.6%
Downloads: 17.0%
Maintainers (1)
Last synced: 6 months ago
pypi.org: coqui-stt-training

Training code for Coqui STT

  • Versions: 33
  • Dependent Packages: 0
  • Dependent Repositories: 1
  • Downloads: 63 Last month
Rankings
Stargazers count: 1.6%
Forks count: 3.4%
Dependent packages count: 10.1%
Average: 10.6%
Downloads: 16.4%
Dependent repos count: 21.6%
Maintainers (1)
Last synced: 6 months ago
npmjs.org: stt-tflite

A library for doing speech recognition using a Coqui STT model

  • Versions: 3
  • Dependent Packages: 1
  • Dependent Repositories: 1
  • Downloads: 19 Last month
Rankings
Stargazers count: 2.0%
Forks count: 2.2%
Dependent repos count: 10.3%
Average: 15.7%
Dependent packages count: 21.0%
Downloads: 43.2%
Maintainers (1)
Last synced: 7 months ago
pypi.org: stt-gpu

A library for doing speech recognition using a Coqui STT model

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 1
  • Downloads: 5 Last month
Rankings
Stargazers count: 1.6%
Forks count: 3.4%
Dependent packages count: 10.1%
Average: 21.2%
Dependent repos count: 21.6%
Downloads: 69.4%
Maintainers (1)
Last synced: 6 months ago
npmjs.org: stt-gpu

A library for doing speech recognition using a Coqui STT model

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 0 Last month
Rankings
Stargazers count: 2.5%
Forks count: 2.8%
Average: 22.4%
Dependent repos count: 25.3%
Dependent packages count: 32.9%
Downloads: 48.3%
Maintainers (1)
Last synced: 6 months ago
pypi.org: iara-stt

A library for doing speech recognition using a Coqui STT model

  • Versions: 11
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 252 Last month
Rankings
Dependent packages count: 9.0%
Average: 30.0%
Dependent repos count: 50.9%
Maintainers (1)
Last synced: 6 months ago
pypi.org: iara-stt-training

Training code for Coqui STT

  • Versions: 14
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 1,151 Last month
Rankings
Dependent packages count: 9.0%
Average: 30.0%
Dependent repos count: 51.0%
Maintainers (1)
Last synced: 7 months ago
pypi.org: iarahealth-stt-training

Training code for Coqui STT

  • Versions: 2
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 21 Last month
Rankings
Dependent packages count: 10.1%
Average: 38.7%
Dependent repos count: 67.3%
Maintainers (1)
Last synced: 6 months ago

Dependencies

native_client/java/app/build.gradle maven
  • com.android.support.constraint:constraint-layout 1.1.3 implementation
  • com.android.support:appcompat-v7 27.1.1 implementation
  • junit:junit 4.12 testImplementation
native_client/java/libstt/build.gradle maven
  • junit:junit 4.12 testImplementation
.github/actions/check_artifact_exists/package-lock.json npm
  • @actions/core 1.2.6 development
  • @actions/exec 1.1.0 development
  • @actions/github 4.0.0 development
  • @actions/http-client 1.0.11 development
  • @actions/io 1.1.1 development
  • @octokit/auth-token 2.4.5 development
  • @octokit/core 3.4.0 development
  • @octokit/endpoint 6.0.11 development
  • @octokit/graphql 4.6.1 development
  • @octokit/openapi-types 6.0.0 development
  • @octokit/plugin-paginate-rest 2.13.3 development
  • @octokit/plugin-rest-endpoint-methods 4.15.0 development
  • @octokit/plugin-throttling 3.4.1 development
  • @octokit/request 5.4.15 development
  • @octokit/request-error 2.0.5 development
  • @octokit/types 6.13.0 development
  • @vercel/ncc 0.27.0 development
  • adm-zip 0.5.5 development
  • before-after-hook 2.2.1 development
  • bottleneck 2.19.5 development
  • deprecation 2.3.1 development
  • filesize 6.1.0 development
  • is-plain-object 5.0.0 development
  • node-fetch 2.6.1 development
  • once 1.4.0 development
  • tunnel 0.0.6 development
  • universal-user-agent 6.0.0 development
  • wrappy 1.0.2 development
.github/actions/check_artifact_exists/package.json npm
  • @actions/core ^1.2.6 development
  • @actions/exec ^1.1.0 development
  • @actions/github ^4.0.0 development
  • @octokit/plugin-throttling ^3.4.1 development
  • @vercel/ncc ^0.27.0 development
  • adm-zip ^0.5.2 development
  • filesize ^6.1.0 development
doc/requirements.txt pypi
  • MarkupSafe ==2.0.1
  • breathe ==4.27.0
  • docutils >=0.12,<=0.17.1
  • furo ==2021.2.28b28
  • pygments ==2.7.4
  • recommonmark ==0.7.1
  • semver ==2.8.1
  • sphinx ==3.5.2
native_client/ctcdecode/setup.py pypi
  • numpy *
native_client/python/setup.py pypi
  • numpy *
requirements_eval_tflite.txt pypi
  • absl-py ==0.9.0
  • attrdict ==2.0.1
  • numpy ==1.16.0
  • pandas ==0.25.3
  • progressbar2 ==3.47.0
  • python-utils ==2.3.0
  • six ==1.13.0
  • stt *
requirements_tests.txt pypi
  • absl-py *
  • argparse *
  • semver *
requirements_transcribe.txt pypi
  • webrtcvad *
.github/actions/check_artifact_exists/action.yml actions
  • dist/index.js node12 javascript
.github/workflows/build-and-test.yml actions
  • ./.github/actions/chroot-bind-mount * composite
  • ./.github/actions/install-python-upstream * composite
  • ./.github/actions/install-xldd * composite
  • ./.github/actions/multistrap * composite
  • ./.github/actions/node-build * composite
  • ./.github/actions/node-install * composite
  • ./.github/actions/numpy_vers * composite
  • ./.github/actions/python-build * composite
  • ./.github/actions/run-tests * composite
  • ./.github/actions/upload-release-asset * composite
  • ./.github/actions/win-install-sox * composite
  • ./.github/actions/win-node-build * composite
  • ./.github/actions/win-numpy-vers * composite
  • ./.github/actions/win-python-build * composite
  • ./.github/actions/win-run-tests * composite
  • actions/cache v2 composite
  • actions/checkout v2 composite
  • actions/download-artifact v2 composite
  • actions/setup-java v2 composite
  • actions/setup-node v2 composite
  • actions/setup-python v2 composite
  • actions/upload-artifact v2 composite
  • android-actions/setup-android v2 composite
  • docker/login-action f054a8b539a109f9f41c372932f1ae047eff08c9 composite
  • ilammy/msvc-dev-cmd v1 composite
  • msys2/setup-msys2 v2 composite
  • nttld/setup-ndk v1 composite
  • softprops/action-gh-release v1 composite
.github/workflows/lint.yml actions
  • actions/checkout v2 composite
  • actions/setup-python v2 composite
native_client/dotnet/STTConsoleNetCore/STTConsoleNetCore.csproj nuget
  • NAudio 2.1.0