https://github.com/bootphon/abnet3

Siamese network for unsupervised speech representation learning

https://github.com/bootphon/abnet3

Science Score: 10.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    1 of 8 committers (12.5%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (14.0%) to scientific vocabulary

Keywords

artificial-neural-networks machine-learning speech-processing
Last synced: 5 months ago · JSON representation

Repository

Siamese network for unsupervised speech representation learning

Basic Info
  • Host: GitHub
  • Owner: bootphon
  • License: gpl-3.0
  • Language: Python
  • Default Branch: master
  • Homepage:
  • Size: 378 KB
Statistics
  • Stars: 11
  • Watchers: 15
  • Forks: 1
  • Open Issues: 2
  • Releases: 0
Topics
artificial-neural-networks machine-learning speech-processing
Created over 7 years ago · Last pushed over 7 years ago

https://github.com/bootphon/abnet3/blob/master/

# ABnet3

Representation learning package using side information, system for subword modeling for [Zeroresource challenge](http://sapience.dec.ens.fr/bootphon/2017/index.html).

### Overview



Build Representation for speech frames based on side information. Composed of different modules :

* `model.py`
* `loss.py`
* `sampler.py`
* `trainer.py`
* `embedder.py`
* `utils.py`
* `features.py`

### Installation of the package

#### Using conda

To install the ABnet3 package, you can use Anaconda, and either create a conda environment:

    conda env create --name abnet3 python=3.6 -f environment.yml

or use a conda environment you already have with python 3 :
    conda env update -f environment.yml

To install with GPU support (replace cuda75 with your version of cuda)

    conda install  pytorch=0.2 cuda75 -c pytorch

#### Using pip

- install the version 0.2.0 of pytorch for your hardware (http://pytorch.org/previous-versions/)

- install the pip packages : `pip install -r requirements.txt`
Once all the necessary packages are installed, simply launch:

#### Run abnet3 installation

    python setup.py build && python setup.py install

If you want to work on ABnet3 and develop your own modules, instead of:

    python setup.py install

you can launch:

    python setup.py develop

### Tensorboard vizualisation

The package tensorboardX needs to be installed to train the model: `pip install tensorboardX`.

The package will save train / dev loss during training. To vizualise them :

- Install tensorboard (`conda install tensorflow tensorflow-tensorboard`)

- run `tensorboard --logdir path/to/logdir`.
The default logdir is `./run` in the current directory.

### Documentation

You can see examples for running the gridsearch and replicating our results
in the repository https://github.com/Rachine/sampling_siamese2018

The cli documentation is here https://coml.lscp.ens.fr/git/Rachine/abnet3/src/master/gridsearch.md

### Tests

The package comes with a unit-tests suit. To run it, first install *pytest* on your Python environment:

    pip install pytest
    pytest test/

#### References

    .. [1] Riad, R., Dancette, C., Karadayi, J., Zeghidour, N., Schatz, T., Dupoux, E.
           *Sampling strategies in Siamese Networks for unsupervised speech representation learning.*
           In Nineteenth Annual Conference of the International Speech Communication Association

    .. [2] Thiolliere, R., Dunbar, E., Synnaeve, G., Versteegh, M., & Dupoux, E.
           *A hybrid dynamic time warping-deep neural network architecture for unsupervised acoustic modeling.*
           In Sixteenth Annual Conference of the International Speech Communication Association

    .. [3] Zeghidour, N., Synnaeve, G., Usunier, N. & Dupoux, E.
           *Joint Learning of Speaker and Phonetic Similarities with Siamese Networks.*
           In: INTERSPEECH-2016, (pp 1295-1299)



### Acknowledgments
A part of the code is inspired from the previous version in Theano of  [ABnet](https://github.com/bootphon/abnet2), and the [examples in Pytorch](https://github.com/pytorch/examples)

Owner

  • Name: CoML
  • Login: bootphon
  • Kind: organization
  • Email: syntheticlearner@gmail.com
  • Location: Paris, France

GitHub Events

Total
Last Year

Committers

Last synced: 9 months ago

All Time
  • Total Commits: 608
  • Total Committers: 8
  • Avg Commits per committer: 76.0
  • Development Distribution Score (DDS): 0.61
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
DiegoAndai d****i@u****l 237
Corentin Dancette c****e@g****m 209
rachine r****3@g****m 144
Rachid Riad r****e@M****l 10
karadayi j****i@g****m 5
rchaabouni c****a@g****m 1
rchaabouni r****i@o****r 1
Mathieu Bernard m****d@i****r 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 9 months ago

All Time
  • Total issues: 1
  • Total pull requests: 2
  • Average time to close issues: N/A
  • Average time to close pull requests: almost 6 years
  • Total issue authors: 1
  • Total pull request authors: 1
  • Average comments per issue: 0.0
  • Average comments per pull request: 0.0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • Rachine (1)
Pull Request Authors
  • cdancette (3)
Top Labels
Issue Labels
Pull Request Labels

Dependencies

environment.yml pypi
  • tensorboardX ==1.0
requirements.txt pypi
  • cython *
  • h5py >=2.3.0
  • numpy >=1.13
  • numpydoc *
  • pytest ==3.5
  • pyyaml *
  • scipy >=0.13.0
  • tensorboardX ==1.0