https://github.com/amir22010/pytext
A natural language modeling framework based on PyTorch
Science Score: 10.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
○DOI references
-
✓Academic publication links
Links to: arxiv.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (14.1%) to scientific vocabulary
Last synced: 10 months ago
·
JSON representation
Repository
A natural language modeling framework based on PyTorch
Basic Info
- Host: GitHub
- Owner: Amir22010
- License: other
- Language: Python
- Default Branch: master
- Homepage: https://pytext.readthedocs.io/en/master/
- Size: 2.17 MB
Statistics
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
- Releases: 0
Fork of facebookresearch/pytext
Created about 7 years ago
· Last pushed about 7 years ago
https://github.com/Amir22010/pytext/blob/master/
# Overview
[](https://circleci.com/gh/facebookresearch/pytext)
PyText is a deep-learning based NLP modeling framework built on PyTorch. PyText addresses the often-conflicting requirements of enabling rapid experimentation and of serving models at scale. It achieves this by providing simple and extensible interfaces and abstractions for model components, and by using PyTorchs capabilities of exporting models for inference via the optimized Caffe2 execution engine. We are using PyText in Facebook to iterate quickly on new modeling ideas and then seamlessly ship them at scale.
**Core PyText features:**
- Production ready models for various NLP/NLU tasks:
- Text classifiers
- [Yoon Kim (2014): Convolutional Neural Networks for Sentence Classification](https://arxiv.org/abs/1408.5882)
- [Lin et al. (2017): A Structured Self-attentive Sentence Embedding](https://arxiv.org/abs/1703.03130)
- Sequence taggers
- [Lample et al. (2016): Neural Architectures for Named Entity Recognition](https://www.aclweb.org/anthology/N16-1030)
- Joint intent-slot model
- [Zhang et al. (2016): A Joint Model of Intent Determination and Slot Filling for Spoken Language Understanding](https://www.ijcai.org/Proceedings/16/Papers/425.pdf)
- Contextual intent-slot models
- Distributed-training support built on the new C10d backend in PyTorch 1.0
- Extensible components that allows easy creation of new models and tasks
- Reference implementation and a pretrained model for the paper: [Gupta et al. (2018): Semantic Parsing for Task Oriented Dialog using Hierarchical Representations](http://aclweb.org/anthology/D18-1300)
- Ensemble training support
# Installing PyText
### PyText requires Python 3.6.1 or above.
*To get started on a Cloud VM, check out [our guide](https://pytext.readthedocs.io/en/master/installation.html#cloud-vm-setup)*.
We recommend using a virtualenv:
```
$ python3 -m venv pytext_venv
$ source pytext_venv/bin/activate
(pytext_venv) $ pip install pytext-nlp
```
Detailed instructions and more installation options can be found in our [Documentation](https://pytext.readthedocs.io/en/master/installation.html). If you encounter issues with missing dependencies during installation, please refer to [OS Dependencies](https://pytext.readthedocs.io/en/master/installation.html#os-dependencies).
# Train your first text classifier
For this first example, we'll train a CNN-based text-classifier that classifies text utterances, using the examples in `tests/data/train_data_tiny.tsv`. The data and configs files can be obtained either by cloning the repository or by downloading the files manually from GitHub.
```
(venv) $ pytext train < demo/configs/docnn.json
```
By default, the model is created in `/tmp/model.pt`
Now you can export your model as a caffe2 net:
```
(venv) $ pytext export < demo/configs/docnn.json
```
You can use the exported caffe2 model to predict the class of raw utterances like this:
```
(venv) $ pytext --config-file demo/configs/docnn.json predict <<< '{"raw_text": "create an alarm for 1:30 pm"}'
```
More examples and tutorials can be found in [Full Documentation](https://pytext.readthedocs.io/en/master/).
# Join the community
* Facebook group: https://www.facebook.com/groups/pytext/
# License
PyText is BSD-licensed, as found in the LICENSE file.
Owner
- Name: Amir Khan
- Login: Amir22010
- Kind: user
- Location: India
- Repositories: 3
- Profile: https://github.com/Amir22010
working on developing a state of art AI solutions mainly in computer vision, chat bots and nlp domain. building an awesome AI as a professional developer 😍.