https://github.com/alan-turing-institute/netts
Toolbox for creating networks capturing semantic content of speech transcripts.
Science Score: 10.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
○DOI references
-
○Academic publication links
-
✓Committers with academic emails
3 of 7 committers (42.9%) from academic institutions -
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (15.4%) to scientific vocabulary
Keywords
Keywords from Contributors
Repository
Toolbox for creating networks capturing semantic content of speech transcripts.
Basic Info
- Host: GitHub
- Owner: alan-turing-institute
- License: gpl-3.0
- Language: Python
- Default Branch: main
- Homepage: https://alan-turing-institute.github.io/netts/
- Size: 145 MB
Statistics
- Stars: 8
- Watchers: 1
- Forks: 1
- Open Issues: 43
- Releases: 9
Topics
Metadata Files
README.md
netts - NETworks of Transcript Semantics
Toolbox for constructing semantic speech networks from speech transcripts.
About
The algorithms in this toolbox create a semantic speech graph from transcribed speech. Speech transcripts are short paragraphs of largely raw, uncleaned speech-like text. For example:
'I see a man and he is wearing a jacket. He is standing in the dark against a light post. On the picture there seems to be like a park and... Or trees but in those trees there are little balls of light reflections as well. I cannot see the... Anything else because it’s very dark. But the man on the picture seems to wear a hat and he seems to have a hoodie on as well. The picture is very mysterious, which I like about it, but for me I would like to understand more about the picture.' -- Example Transcript
Below is the semantic speech graph constructed from this text.
Figure 1. Semantic Speech Graph. Nodes represents an entity mentioned by the speaker (e.g. I, man, jacket). Edges represent relations between nodes mentioned by the speaker (e.g. see, has on).
Getting started
Read the full documentation here.
Where to get it
You can install the latest release from PyPi
bash
pip install netts
or get the latest development version from GitHub (not stable)
bash
pip install git+https://github.com/alan-turing-institute/netts
Additional dependencies
Netts requires the Java Runtime Environment. Instructions for downloading and installing for your operating system can be found here.
Netts also requires a few additional dependencies to work which you can download with the netts CLI that was installed by pip
bash
netts install
Basic usage
The quickest way to process a transcript is with the CLI.
bash
netts run transcript.txt outputs
where transcript.txt is a text file containing transcribed speech and outputs is the name of a directory to write the outputs to. Additional logging information can be found in netts_log.log.
Contributors
Netts was written by Caroline Nettekoven in collaboration with Sarah Morgan.
Netts was packaged in collaboration with Oscar Giles, Iain Stenson and Helen Duncan.
Owner
- Name: The Alan Turing Institute
- Login: alan-turing-institute
- Kind: organization
- Email: info@turing.ac.uk
- Website: https://turing.ac.uk
- Repositories: 477
- Profile: https://github.com/alan-turing-institute
The UK's national institute for data science and artificial intelligence.
GitHub Events
Total
- Issues event: 2
- Issue comment event: 1
Last Year
- Issues event: 2
- Issue comment event: 1
Committers
Last synced: about 2 years ago
Top Committers
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 36
- Total pull requests: 65
- Average time to close issues: about 2 months
- Average time to close pull requests: 11 days
- Total issue authors: 5
- Total pull request authors: 5
- Average comments per issue: 1.33
- Average comments per pull request: 0.45
- Merged pull requests: 43
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 1
- Pull requests: 0
- Average time to close issues: 15 days
- Average time to close pull requests: N/A
- Issue authors: 1
- Pull request authors: 0
- Average comments per issue: 1.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- OscartGiles (25)
- Iain-S (8)
- weiwei-wch (1)
- helendduncan (1)
- mhauru (1)
Pull Request Authors
- Iain-S (45)
- OscartGiles (14)
- rkdan (3)
- carobellum (2)
- helendduncan (1)
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 1
-
Total downloads:
- pypi 27 last-month
- Total dependent packages: 0
- Total dependent repositories: 1
- Total versions: 7
- Total maintainers: 4
pypi.org: netts
Toolbox for constructing NETworks of Transcript Semantics.
- Homepage: https://github.com/alan-turing-institute/netts
- Documentation: https://alan-turing-institute.github.io/netts/
- License: GPL-3.0-or-later3
-
Latest release: 0.3.2
published almost 3 years ago
Rankings
Maintainers (4)
Dependencies
- black ^22.3.0 develop
- devtools ^0.8.0 develop
- flake8 ^3.9.2 develop
- isort ^5.9.3 develop
- mkdocs-click ^0.7.0 develop
- mkdocs-material ^8.2.13 develop
- mypy ^0.910 develop
- pre-commit ^2.14.0 develop
- pylint ^2.9.6 develop
- pytest ^6.2.4 develop
- pytest-cov ^2.12.1 develop
- pytest-mock ^3.6.1 develop
- safety ^1.10.3 develop
- matplotlib ^3.4.3
- networkx ^2.6.2
- nltk ^3.6.2
- numpy ^1.22.2
- pandas ^1.3.2
- pydantic ^1.8.2
- pyopenie ^0.2.0
- python >=3.8,<3.11
- requests ^2.26.0
- rtoml ^0.7.0
- stanza ^1.2.3
- tqdm ^4.62.2
- typer ^0.4.1
- astroid 2.9.3 develop
- asttokens 2.0.5 develop
- atomicwrites 1.4.0 develop
- attrs 21.4.0 develop
- black 22.3.0 develop
- cfgv 3.3.1 develop
- coverage 6.3.2 develop
- devtools 0.8.0 develop
- distlib 0.3.4 develop
- dparse 0.5.1 develop
- executing 0.8.3 develop
- filelock 3.6.0 develop
- flake8 3.9.2 develop
- ghp-import 2.1.0 develop
- identify 2.4.11 develop
- importlib-metadata 4.11.3 develop
- iniconfig 1.1.1 develop
- isort 5.10.1 develop
- jinja2 3.1.2 develop
- lazy-object-proxy 1.7.1 develop
- markdown 3.3.6 develop
- markupsafe 2.1.1 develop
- mccabe 0.6.1 develop
- mergedeep 1.3.4 develop
- mkdocs 1.3.0 develop
- mkdocs-click 0.7.0 develop
- mkdocs-material 8.2.13 develop
- mkdocs-material-extensions 1.0.3 develop
- mypy 0.910 develop
- mypy-extensions 0.4.3 develop
- nodeenv 1.6.0 develop
- pathspec 0.9.0 develop
- platformdirs 2.5.1 develop
- pluggy 1.0.0 develop
- pre-commit 2.17.0 develop
- py 1.11.0 develop
- pycodestyle 2.7.0 develop
- pyflakes 2.3.1 develop
- pygments 2.12.0 develop
- pylint 2.12.2 develop
- pymdown-extensions 9.4 develop
- pytest 6.2.5 develop
- pytest-cov 2.12.1 develop
- pytest-mock 3.7.0 develop
- pyyaml 6.0 develop
- pyyaml-env-tag 0.1 develop
- safety 1.10.3 develop
- toml 0.10.2 develop
- virtualenv 20.13.3 develop
- watchdog 2.1.7 develop
- wrapt 1.13.3 develop
- zipp 3.8.0 develop
- certifi 2021.10.8
- charset-normalizer 2.0.12
- click 8.1.3
- colorama 0.4.4
- cycler 0.11.0
- emoji 1.7.0
- fonttools 4.30.0
- idna 3.3
- joblib 1.1.0
- kiwisolver 1.4.0
- matplotlib 3.5.1
- networkx 2.7.1
- nltk 3.7
- numpy 1.22.3
- packaging 21.3
- pandas 1.4.1
- pillow 9.0.1
- protobuf 3.19.4
- pydantic 1.9.0
- pyopenie 0.2.0
- pyparsing 3.0.7
- python-dateutil 2.8.2
- python-dotenv 0.19.2
- pytz 2021.3
- regex 2022.3.15
- requests 2.27.1
- rtoml 0.7.1
- setuptools-scm 6.4.2
- shellingham 1.4.0
- six 1.16.0
- stanza 1.3.0
- tomli 1.2.3
- torch 1.11.0
- tqdm 4.63.0
- typer 0.4.1
- typing-extensions 4.1.1
- urllib3 1.26.9