textblob

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

https://github.com/sloria/textblob

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    1 of 37 committers (2.7%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.4%) to scientific vocabulary

Keywords

natural-language-processing nlp nltk pattern python python-3

Keywords from Contributors

distributed serializer parallel closember http-server http-client asyncio aiohttp templates data-mining
Last synced: 6 months ago · JSON representation

Repository

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

Basic Info
Statistics
  • Stars: 9,423
  • Watchers: 258
  • Forks: 1,169
  • Open Issues: 72
  • Releases: 3
Topics
natural-language-processing nlp nltk pattern python python-3
Created over 12 years ago · Last pushed 6 months ago
Metadata Files
Readme Changelog Contributing License Security Authors Notice

README.rst

TextBlob: Simplified Text Processing
====================================

.. image:: https://badgen.net/pypi/v/TextBlob
    :target: https://pypi.org/project/textblob/
    :alt: Latest version

.. image:: https://github.com/sloria/TextBlob/actions/workflows/build-release.yml/badge.svg
    :target: https://github.com/sloria/TextBlob/actions/workflows/build-release.yml
    :alt: Build status


Homepage: `https://textblob.readthedocs.io/ `_

`TextBlob` is a Python library for processing textual data. It provides a simple API for diving into common natural language processing (NLP) tasks such as part-of-speech tagging, noun phrase extraction, sentiment analysis, classification, and more.


.. code-block:: python

    from textblob import TextBlob

    text = """
    The titular threat of The Blob has always struck me as the ultimate movie
    monster: an insatiably hungry, amoeba-like mass able to penetrate
    virtually any safeguard, capable of--as a doomed doctor chillingly
    describes it--"assimilating flesh on contact.
    Snide comparisons to gelatin be damned, it's a concept with the most
    devastating of potential consequences, not unlike the grey goo scenario
    proposed by technological theorists fearful of
    artificial intelligence run rampant.
    """

    blob = TextBlob(text)
    blob.tags  # [('The', 'DT'), ('titular', 'JJ'),
    #  ('threat', 'NN'), ('of', 'IN'), ...]

    blob.noun_phrases  # WordList(['titular threat', 'blob',
    #            'ultimate movie monster',
    #            'amoeba-like mass', ...])

    for sentence in blob.sentences:
        print(sentence.sentiment.polarity)
    # 0.060
    # -0.341


TextBlob stands on the giant shoulders of `NLTK`_ and `pattern`_, and plays nicely with both.

Features
--------

- Noun phrase extraction
- Part-of-speech tagging
- Sentiment analysis
- Classification (Naive Bayes, Decision Tree)
- Tokenization (splitting text into words and sentences)
- Word and phrase frequencies
- Parsing
- `n`-grams
- Word inflection (pluralization and singularization) and lemmatization
- Spelling correction
- Add new models or languages through extensions
- WordNet integration

Get it now
----------
::

    $ pip install -U textblob
    $ python -m textblob.download_corpora

Examples
--------

See more examples at the `Quickstart guide`_.

.. _`Quickstart guide`: https://textblob.readthedocs.io/en/latest/quickstart.html#quickstart


Documentation
-------------

Full documentation is available at https://textblob.readthedocs.io/.

Project Links
-------------

- Docs: https://textblob.readthedocs.io/
- Changelog: https://textblob.readthedocs.io/en/latest/changelog.html
- PyPI: https://pypi.python.org/pypi/TextBlob
- Issues: https://github.com/sloria/TextBlob/issues

License
-------

MIT licensed. See the bundled `LICENSE `_ file for more details.

.. _pattern: https://github.com/clips/pattern/
.. _NLTK: http://nltk.org/

Owner

  • Name: Steven Loria
  • Login: sloria
  • Kind: user
  • Location: Brooklyn, NY

Always a student, forever a junior developer

GitHub Events

Total
  • Issues event: 36
  • Watch event: 282
  • Delete event: 28
  • Issue comment event: 50
  • Push event: 50
  • Pull request review event: 12
  • Pull request review comment event: 7
  • Pull request event: 65
  • Fork event: 30
  • Create event: 24
Last Year
  • Issues event: 36
  • Watch event: 282
  • Delete event: 28
  • Issue comment event: 50
  • Push event: 50
  • Pull request review event: 12
  • Pull request review comment event: 7
  • Pull request event: 65
  • Fork event: 30
  • Create event: 24

Committers

Last synced: 9 months ago

All Time
  • Total Commits: 562
  • Total Committers: 37
  • Avg Commits per committer: 15.189
  • Development Distribution Score (DDS): 0.331
Past Year
  • Commits: 44
  • Committers: 4
  • Avg Commits per committer: 11.0
  • Development Distribution Score (DDS): 0.409
Top Committers
Name Email Commits
Steven Loria s****1@g****m 376
pre-commit-ci[bot] 6****] 35
dependabot-preview[bot] 2****] 19
Jon Coe j****e@o****m 17
dependabot[bot] 4****] 15
dependabot[bot] s****t@d****m 14
Matthew Honnibal m****w@h****m 12
Roman Yankovsky r****n@y****e 11
Jeff Schnurr j****r@g****m 7
pyup-bot g****t@p****o 6
Adrián López a****z@s****m 5
Joseph Albert j****7@p****u 5
John Franey 1****y 5
Evan Dempsey e****y@g****m 5
Jamie Moschella 4****o 4
Roman Korolev r****v@t****i 2
Jeff Kolb j****b@g****m 2
Daniel Ong d****1@g****m 2
Jhon Eslava e****4@g****m 2
Adel Qalieh a****5@g****m 1
Adrián López Calvo a****o@g****m 1
David Karesh d****k@g****m 1
Evgeny Kemerov g****b@k****v 1
Karthikeyan Singaravelan t****i@g****m 1
Daniel Ong d****g@o****m 1
Adam Chainz a****m@a****u 1
Lage Ragnarsson l****n@o****m 1
Nitish Kulshrestha n****g@g****m 1
PAVEL\Pavel m****l@g****m 1
Pete Keen p****n@b****o 1
and 7 more...

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 102
  • Total pull requests: 189
  • Average time to close issues: about 4 years
  • Average time to close pull requests: 4 months
  • Total issue authors: 100
  • Total pull request authors: 22
  • Average comments per issue: 2.25
  • Average comments per pull request: 0.28
  • Merged pull requests: 129
  • Bot issues: 2
  • Bot pull requests: 143
Past Year
  • Issues: 6
  • Pull requests: 69
  • Average time to close issues: 4 days
  • Average time to close pull requests: 5 days
  • Issue authors: 6
  • Pull request authors: 7
  • Average comments per issue: 0.17
  • Average comments per pull request: 0.23
  • Merged pull requests: 51
  • Bot issues: 1
  • Bot pull requests: 52
Top Authors
Issue Authors
  • tirkarthi (2)
  • smalldatascaled (2)
  • dependabot[bot] (1)
  • adrien731 (1)
  • progenysw (1)
  • Goldziher (1)
  • dagrha (1)
  • wanghaisheng (1)
  • nmstoker (1)
  • munichong (1)
  • ShiJunyiSJ1 (1)
  • dashu101 (1)
  • cliff88 (1)
  • Sandy4321 (1)
  • xasg (1)
Pull Request Authors
  • pre-commit-ci[bot] (75)
  • dependabot-preview[bot] (34)
  • dependabot[bot] (34)
  • johnfraney (9)
  • sloria (8)
  • pyup-bot (3)
  • tirkarthi (3)
  • bhavyajammu (3)
  • jonathangreen (2)
  • rahulgit7 (2)
  • smontanaro (2)
  • t2kpbraune (2)
  • kjdeveloper8 (2)
  • jschnurr (2)
  • sudoguy (1)
Top Labels
Issue Labels
enhancement (3) ready (2) announcement (1) please-help (1) dependencies (1) python (1)
Pull Request Labels
dependencies (68) python (32) security (3) github_actions (2)

Packages

  • Total packages: 3
  • Total downloads:
    • pypi 1,420,555 last-month
  • Total docker downloads: 382,076
  • Total dependent packages: 97
    (may contain duplicates)
  • Total dependent repositories: 6,538
    (may contain duplicates)
  • Total versions: 69
  • Total maintainers: 1
pypi.org: textblob

Simple, Pythonic text processing. Sentiment analysis, part-of-speech tagging, noun phrase parsing, and more.

  • Versions: 60
  • Dependent Packages: 93
  • Dependent Repositories: 6,514
  • Downloads: 1,420,555 Last month
  • Docker Downloads: 382,076
Rankings
Dependent repos count: 0.1%
Dependent packages count: 0.3%
Stargazers count: 0.3%
Downloads: 0.4%
Average: 0.5%
Docker downloads count: 0.9%
Forks count: 1.2%
Maintainers (1)
Last synced: 6 months ago
conda-forge.org: textblob
  • Versions: 7
  • Dependent Packages: 4
  • Dependent Repositories: 24
Rankings
Stargazers count: 3.3%
Forks count: 4.5%
Average: 6.9%
Dependent repos count: 7.5%
Dependent packages count: 12.5%
Last synced: 6 months ago
anaconda.org: textblob

TextBlob is a Python library for processing textual data. It provides a simple API for diving into common natural language processing (NLP) tasks such as part-of-speech tagging, noun phrase extraction, sentiment analysis, classification, and more.

  • Versions: 2
  • Dependent Packages: 0
  • Dependent Repositories: 0
Rankings
Dependent packages count: 51.1%
Average: 53.7%
Dependent repos count: 56.2%
Last synced: 6 months ago