html5-speech-to-text
This is an online speech recognition tool implemented entirely in pure JavaScript. It utilizes the browser's built-in Web Speech API to convert audio captured by the microphone into text and combines it with time segments to create subtitles.
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: Found CITATION.cff file
- ✓ codemeta.json file: Found codemeta.json file
- ✓ .zenodo.json file: Found .zenodo.json file
- ✓ DOI references: Found 5 DOI reference(s) in README
- ✓ Academic publication links: Links to zenodo.org
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: Low similarity (3.5%) to scientific vocabulary
Repository
This is an online speech recognition tool implemented entirely in pure JavaScript. It utilizes the browser's built-in Web Speech API to convert audio captured by the microphone into text and combines it with time segments to create subtitles.
Basic Info
- Host: GitHub
- Owner: pulipulichen
- License: MIT
- Language: JavaScript
- Default Branch: master
- Homepage: https://blog.pulipuli.info/2019/01/web-speech-to-text-speech-recognition.html
- Size: 22.7 MB
Statistics
- Stars: 60
- Watchers: 5
- Forks: 20
- Open Issues: 2
- Releases: 3
Metadata Files
README.md
HTML5-Speech-to-Text: Online Speech-to-Text Subtitles for Files
This is an online speech recognition tool implemented entirely in pure JavaScript. It utilizes the browser's built-in Web Speech API to convert audio captured by the microphone into text and combines it with time segments to create subtitles.
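The step of combining recognized text with time segments can be illustrated with a minimal sketch. It assumes the subtitles are written in SubRip (SRT) format and that each segment is an object of the form `{ startMs, endMs, text }`; neither the output format nor the helper names `formatSrtTime` and `buildSrt` come from this project, so treat them as hypothetical.

```javascript
// Hypothetical helper: format a millisecond offset as an SRT timestamp (HH:MM:SS,mmm).
function formatSrtTime(ms) {
  const total = Math.max(0, Math.round(ms));
  const pad = (n, width = 2) => String(n).padStart(width, '0');
  const hours = Math.floor(total / 3600000);
  const minutes = Math.floor((total % 3600000) / 60000);
  const seconds = Math.floor((total % 60000) / 1000);
  const millis = total % 1000;
  return `${pad(hours)}:${pad(minutes)}:${pad(seconds)},${pad(millis, 3)}`;
}

// Hypothetical helper: turn [{ startMs, endMs, text }] into SRT text.
// Segments with empty text are skipped so the cue numbers stay consecutive.
function buildSrt(segments) {
  return segments
    .filter((segment) => segment.text.trim() !== '')
    .map((segment, index) => {
      const range = `${formatSrtTime(segment.startMs)} --> ${formatSrtTime(segment.endMs)}`;
      return `${index + 1}\n${range}\n${segment.text.trim()}\n`;
    })
    .join('\n');
}

// Example input (made-up data, not output from the tool).
const demoSegments = [
  { startMs: 0, endMs: 2500, text: 'Hello world' },
  { startMs: 2500, endMs: 61234, text: 'This is a test' },
];
console.log(buildSrt(demoSegments));
```

Keeping the timestamp formatting separate from the cue assembly makes it straightforward to swap in another subtitle format, such as WebVTT, if that is what is needed.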

Online Demo
https://pulipulichen.github.io/HTML5-Speech-to-Text/
Techniques
- JavaScript
- Web Speech API: the browser's built-in speech recognition API.
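As a rough illustration of how the Web Speech API can yield timed text, the sketch below listens for `result` events and records each finalized transcript with a start and end offset. It is not taken from this project's source: the names `createSegmentCollector` and `startRecognition` are hypothetical, and the timing rule (each segment starts where the previous one ended) is an assumption made for simplicity.

```javascript
// Hypothetical collector: stores finalized transcripts with start/end offsets
// in milliseconds since the collector was created. The clock `now` is
// injectable so the logic can be exercised outside a browser.
function createSegmentCollector(now = () => Date.now()) {
  const startedAt = now();
  let segmentStart = 0; // assumption: a segment begins where the previous one ended
  const segments = [];
  return {
    segments,
    // Handler for SpeechRecognition "result" events.
    handleResult(event) {
      for (let i = event.resultIndex; i < event.results.length; i++) {
        const result = event.results[i];
        if (!result.isFinal) continue; // ignore interim hypotheses
        const endMs = now() - startedAt;
        segments.push({
          startMs: segmentStart,
          endMs,
          text: result[0].transcript.trim(), // best alternative
        });
        segmentStart = endMs;
      }
    },
  };
}

// Starts recognition if the browser exposes the API; returns null otherwise.
function startRecognition(lang = 'en-US') {
  const Recognition =
    globalThis.SpeechRecognition || globalThis.webkitSpeechRecognition;
  if (!Recognition) return null; // Web Speech API not available here
  const recognition = new Recognition();
  recognition.lang = lang;
  recognition.continuous = true; // keep listening across pauses
  recognition.interimResults = true; // interim results are filtered above
  const collector = createSegmentCollector();
  recognition.onresult = (event) => collector.handleResult(event);
  recognition.start();
  return { recognition, collector };
}
```

In a supporting browser, calling `startRecognition('zh-TW')` would prompt for microphone access, and `collector.segments` would fill up as phrases are finalized. Browser support for speech recognition varies, so checking the return value for `null` is advisable.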
Citation
Chen, Y.-T. (2024). HTML5-Speech-to-Text (20240518.182526) [Computer software]. Zenodo. https://doi.org/10.5281/ZENODO.11213438
Reference
- wavesurfer.js: https://wavesurfer-js.org/
- Test of Chinese as a Foreign Language key vocabulary, warm-up practice (華語文能力測驗關鍵詞彙:牛刀小試篇): https://www.youtube.com/watch?v=GE7sc_XvJ8w
- Semantic UI: https://semantic-ui.com/globals/site.html
- Semantic UI icons: https://semantic-ui.com/elements/icon.html
- Icon: http://www.iconninja.com/comment-chat-speak-talk-run-icon-509858
- Subtitle Edit Online: https://www.nikse.dk/subtitleedit/online#
- https://www.pkstep.com/archives/38130#%E4%B8%8B%E8%BC%89
Owner
- Name: Pulipuli Chen
- Login: pulipulichen
- Kind: user
- Location: Taiwan
- Website: http://blog.pulipuli.info
- Repositories: 431
- Profile: https://github.com/pulipulichen
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this software, please cite it using these metadata."
authors:
  - family-names: Chen
    given-names: Yung-Ting
    orcid: https://orcid.org/0000-0003-3751-610X
    email: https://blog.pulipuli.info
title: "HTML5-Speech-to-Text"
license: "MIT"
repository-code: "https://github.com/pulipulichen/HTML5-Speech-to-Text"
abstract: "This is an online speech recognition tool implemented entirely in pure JavaScript. It utilizes the browser's built-in Web Speech API to convert audio captured by the microphone into text and combines it with time segments to create subtitles."
identifiers:
  - type: doi
    value: 10.5281/zenodo.11213438
GitHub Events
Total
- Watch event: 3
Last Year
- Watch event: 3
Issues and Pull Requests
Last synced: about 1 year ago
All Time
- Total issues: 4
- Total pull requests: 1
- Average time to close issues: 4 months
- Average time to close pull requests: about 3 hours
- Total issue authors: 3
- Total pull request authors: 1
- Average comments per issue: 0.5
- Average comments per pull request: 0.0
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- pulipulichen (2)
- xin-h963 (1)
- kaichen0712 (1)
Pull Request Authors
- mlc130104 (1)