Recent Releases of textnets
textnets - Textnets version 0.10.3
Fixes deployment issues.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc about 1 year ago
textnets - Textnets version 0.10.2
- Attempts to avoid the "tofu problem" () in plots by setting an appropriate font for CJK characters.
- Skips stemming during tokenization if no lemmatizer is available.
- Improves documentation.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc about 1 year ago
textnets - Textnets version 0.10.1
- Fixes version metadata.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc about 1 year ago
textnets - Textnets version 0.10.0
- Switches to
matplotlibfor plotting; removes dependency on Cairo. - Adds
figsizeoption totn.params. - Drops support for Python 3.9.
- Updates to spaCy 3.8.
- Removes experimental FCA features.
- Adds a
max_docsoption (contributed by @McWashr in #62). - Enables proper tokenization of Chinese text. (In response to #65.)
- Switches from Poetry to PDM to handle project dependencies and builds.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc about 1 year ago
textnets - Textnets version 0.9.5
- Sets
language_levelcompiler directive for Cython. (In response to #63). - Updates to spaCy 3.7.4 and igraph 0.11.3.
- Removes references to conda-forge package as it has become difficult to keep updated.
- Adds dependency on pyarrow to satisfy future pandas releases.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc over 2 years ago
textnets - Textnets version 0.9.4
- Updates to spaCy 3.7.2 and to igraph 0.10.5.
- Adds Python 3.12 compatibility. (In response to #61.)
- Fixes a bug in how tf-idf is calculated without sublinear scaling.
- If the corpus contains long documents, adjust spaCy's maximum document length parameter as needed.
:rocket:
Scientific Software - Peer-reviewed
- Python
Published by jboynyc over 2 years ago
textnets - Textnets version 0.9.3
- Updates to igraph 0.10.5.
- Updates to spaCy 3.6.0, bringing support for Slovenian.
- Add dependency on
spacy-lookups-datafor better support of languages without pre-trained models (e.g., Turkish).
Scientific Software - Peer-reviewed
- Python
Published by jboynyc almost 3 years ago
textnets - Textnets version 0.9.2
- Improves documentation.
- Updates to scipy 1.10.
- Fixes how the compiled extension is built.
- Builds binary wheels for a greater number of platforms.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc almost 3 years ago
textnets - Textnets version 0.9.1
No substantive change from previous release.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc almost 3 years ago
textnets - Textnets version 0.9.0
- Improves documentation.
- Parallelizes execution of NLP and backbone extraction for large corpora and networks.
- Uses sparse matrices for some operations to avoid running out of memory.
- Makes other improvements to efficiency.
- Introduces a
NodeTypeenum to differentiate document and term nodes. - Updates to pandas 2.0 and spaCy 3.5.3.
- Now also tested for compatibility with PyPy 3.9!
Scientific Software - Peer-reviewed
- Python
Published by jboynyc almost 3 years ago
textnets - Textnets version 0.8.8
- Fixes bug in disparity filter.
- Updates to spaCy 3.5.1, along various other dependencies.
- Improves testing
Scientific Software - Peer-reviewed
- Python
Published by jboynyc about 3 years ago
textnets - Textnets version 0.8.7
- Updates to spaCy 3.5 and igraph 0.10.4.
- When initializing
Corpuswith data that includes duplicated document labels, issue a warning, and concatenate documents with shared labels. (Implemented in response to #54). - Various code quality improvements.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc over 3 years ago
textnets - Textnets version 0.8.6
- Improves documentation.
- Progress bar for long-running operations (NLP and disparity filter). The progress bar is not shown during non-interactive use. To disable, set
tn.params["progress_bar"] = False. - Python 3.11 compatibility now confirmed on Windows, too.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc over 3 years ago
textnets - Textnets version 0.8.5
- Python 3.11 compatibility! (YMMV if using Windows.)
- Adds an optional parameter
remove_weak_edgeswhen initializing aTextnetto create a sparser graph. - Adds
ProjectedTextnet.mproperty to access one-mode graphs' adjacency matrices. - Adds Stoltz and Taylor's (2019) textual spanning measure.
- Adds
TextnetBase.cluster_strengthandTextnetBase.cluster_local_ccto calculate the weighted degree and local clustering coefficient of nodes within the subgraph formed by the cluster they belong to. - Improves display of top nodes per cluster (
top_cluster_nodes). - Adds Korean language code.
- Fixes several bugs that occurred when setting document attributes.
- Updated and expanded documentation.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc over 3 years ago
textnets - Textnets version 0.8.4
- Updates to spaCy 3.4.1,
igraph0.10.1, andleidenalg0.9. - Adds Ukrainian language code.
- Improves type checking.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc over 3 years ago
textnets - Textnets version 0.8.3
- Updates to spaCy 3.4, bringing support for Croatian to textnets.
- Updates various other dependencies.
- Adds optional dependency for experimental Formal Concept Analysis features. To install, run
pip install textnets[fca]. Graphviz must also be installed separately for lattice visualization purposes.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc almost 4 years ago
textnets - Textnets version 0.8.2
- Makes
TextnetBasean abstract base class, since it is not meant to be instantiated. Implements separate graph partition methods for the classesTextnetandProjectedTextnetto work around an occasional bug. - Adds Finnish and Swedish language codes.
- Improves type hints.
- Adds dark mode to docs!
Scientific Software - Peer-reviewed
- Python
Published by jboynyc almost 4 years ago
textnets - Textnets version 0.8.1
- Fix bug #36.
- Updates dependencies, including
igraphand spaCy.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc almost 4 years ago
textnets - Textnets version 0.8.0
- This release removes Python 3.7 compatibility. You now need to use Python 3.8, 3.9 or 3.10 to use
textnets. - Fixes a bug in the HTML representation of the top-level module.
- Updates dependencies, including spaCy.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc about 4 years ago
textnets - Textnets version 0.7.1
- Fixes #35 (invisible edges when scaling by weight).
- Updates some dependencies.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc over 4 years ago
textnets - Textnets version 0.7.0
- Adds abilitiy to save and load an instance of
Corpus,Textnetandparamsto and from file usingCorpus.save,tn.load_corpus,Textnet.save,tn.load_textnet,tn.params.saveandtn.params.load. The same file can be used for all three kinds of objects, so all relevant data for a project can be saved in one file. - Some further optimization of backbone extraction.
- Adds bipartite centrality measures (HITS, CoHITS and BiRank) and a bipartite clustering coefficient.
- Improved testing and type hints.
- Expanded documentation with advanced topics, including the new save/load feature and interacting with other libraries for network analysis and machine learning. Docs now use the PyData theme.
- Improvements to visualization. When plotting, nodes and edges can now be scaled by any attribute.
- Breaking change: Term weighing now happens in the
corpussubmodule, so thesublinearargument has to be passed to the methods for term extraction (tokenized,noun_phrasesandngrams). This change will make it easier to add additional term extraction and weighing options. - Adds
tn.init_seed()utility to quickly initialize pseudorandom number generator. - Adds Python 3.10 compatibility.
- Updates dependencies, including
igraphwith some relevant upstream changes contributed by yours truly, as well as spaCy.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc over 4 years ago
textnets - Textnets version 0.6.0
- Adds
paramsas a container for global parameters. This makes it possible to fix the random seed and to change the resolution parameter for the community detection algorithm, among others. If the parameterautodownloadis set to true, textnets will attempt to download all required spaCy language models automatically. - Added HTML representation for the root module that displays versions of key dependencies.
- Added back string representations of
CorpusandTextnetBase-derived classes. - Adds a
Corpus.from_dictmethod. Corpusnow exposes thelangattribute, so the corpus language can be set after initialization of a class instance.- The bipartite layout optionally used by
Textnet.plotis now horizontal, so node types are arranged in columns rather than rows. That way node labels are less likely to overlap. - Adds
label_nodesargument to theTextnet.plotmethod to label both types of nodes. Defaults toFalse. - Adds
node_opacityandedge_opacityarguments forTextnet.plot. - Makes polygons marking clusters more visually appealing by adding opacity.
- Probably fixes a bug that would occasionally result in an exception being raised during plotting (
IndexError: color index too large). - When initializing an instance of the
Textnetclass, you can now optionally pass the argumentconnected=True, in which case only the largest component of the underlying network is kept. When creating a one-mode projection usingTextnet.project, aconnectedargument can also be passed. - Adds
TextnetBase.save_graphto save the underlying graph (for instance, for further processing in Gephi). - Improved and extended documentation and docstrings.
- Update dependencies.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc over 4 years ago
textnets - Textnets version 0.5.4
This release is an attempt to fix the cross-platform build and deploy pipeline (to ensure binary wheels for all platforms are uploaded to PyPI). Otherwise, there is no substantive change from previous release.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc over 4 years ago
textnets - Textnets version 0.5.3
- Adds Catalan, Macedonian and Russian language models.
- Significantly speeds up backbone extraction by implementing the disparity filter integrand in Cython. (If the compiled extension cannot be loaded for some reason, it falls back on an interpreted function.)
- PyPI should now receive binary wheels for Mac, Windows and Linux (via GitHub Actions) to ease installation on each platform.
- Improved type annotations.
- Update several dependencies.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc over 4 years ago
textnets - Textnets version 0.5.2
- Improve the handling of edge cases when initializing the
CorpusandTextnetclasses, such as empty data being provided. - Added ability to run the tutorial in the documentation interactively using thebe.
- Update to spacy 3.1 and bump other dependencies.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc almost 5 years ago
textnets - Textnets version 0.5.1
- Adds
Corpus.ngramsmethod as alternative toCorpus.noun_phrases. This is useful when working in languages that do not have noun chunks, such as Chinese. The tutorial now also mentions this as an alternative totokenizedandnoun_phrases. (Closes #26.) - Fixes a bug in
Corpus.from_files. (This bug was probably part of the issue in #28.) - Introduces HTML representations of core classes for nicer integration in Jupyter notebooks.
- Updates several dependencies (
pandasandleidenalgamong them).
Scientific Software - Peer-reviewed
- Python
Published by jboynyc almost 5 years ago
textnets - Textnets version 0.5.0
- Migrate continuous integration testing from Travis to GitHub Actions.
- Continuous integration tests now run for MacOS and Windows too.
- Update to Spacy 3 and bump other dependency versions.
- Improvements to documentation.
- Handle dependencies and build project using Poetry (PEP 517 and 518).
- Remove deprecated command-line interface.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc almost 5 years ago
textnets - Textnets version 0.4.11
- Python 3.9 compatibility!
- Updated documentation with conda-forge installation option.
- Bump versions for numerous dependencies.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc over 5 years ago
textnets - Textnets version 0.4.10
- Add
cairocffidependency and update installation docs. - Bump
leidenalgdependency to version 0.8.1.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc over 5 years ago
textnets - Textnets version 0.4.9
This version adds a color_clusters option to Textnet plotting methods. This colors nodes according to their partition using a bespoke color palette.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc almost 6 years ago
textnets - Textnets version 0.4.8
- The
Corpusclass now handles missing data (#13). - Support for more corpus languages. If no statistical language model is available,
Corpustries to use a basic ("blank") model. - Improved documentation around dependencies and language support.
- Added tests.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc almost 6 years ago
textnets - Textnets version 0.4.5
Textnet.plotandProjectedTextnet.plotnow accept arguments to selectively suppress node or edge labels.node_label_filterandedge_label_filteroptionally take a function that is mapped to the iterator of nodes and edges. Only nodes or edges for which the function returnsTrueare displayed in the plot. For example,node_label_filter=lambda n: n.degree() > 2ensures that only nodes with a degree greater than 2 are displayed.- Noun phrases created by
Corpus.noun_phrases()can now be normalized (lemmatized) by passingnormalize=True(defaults toFalse). Corpusnow has a useful string representation.- Documentation updates, particularly to show the label filter functionality.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc almost 6 years ago
textnets - Textnets version 0.4.3
- Python 3.7 compatibility is here, closing issue #8.
- New
circular_layoutoption forTextnet.plot. This is based "Tidier Drawings" and looks very nice for some bipartite graphs. - String representation of
Textnetinstances now gives helpful information. - Updated documentation to note changed Python version requirement.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc almost 6 years ago
textnets - Textnets version 0.4.2
ProjectedTextnet.plotnow takes an argument,alpha, that allows for pruning the graph in order to visualize the "backbone." This is useful when working with hairball graphs, which is common when creating textnets. Right now, it uses Serrano et al.'s disparity filter. That means that edges with an alpha value greater than the one specified are discarded, so lower values mean more extreme pruning.- Language models can now be specified using a short ISO language code.
- Bipartite networks can now be plotted using a layered layout (Sugiyama). Simply pass
sugiyama_layout=TruetoTextnet.plot. - Incremental improvements to documentation.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc almost 6 years ago
textnets - Textnets version 0.4.1
- Documented
TextnetBasemethods to output lists of nodes ranked by various centrality measures:top_betweennessand several more. - Added
top_cluster_nodesto output list of top nodes per cluster found via community detection. This is useful when trying to interpret such clusters as themes/topics (in the projected word-to-word graph) or as groupings (in the document-to-document graph). - Small additions to documentation.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc almost 6 years ago
textnets - Textnets version 0.4.0
Lots of changes, some of them breaking, but overall just providing nicer abstractions over the underlying pandas and igraph stuff.
- Introduced
TextnetBaseandProjectedTextnetclasses, and madeTextneta descendant of the former. - Improved code modularity to make it easier to add features.
Corpusis now based on a Series rather than a DataFrame.- Added methods for creating an instance of
Corpus:from_df,from_csv,from_sql. - Expanded and improved documentation.
- Added bibliography to documentation using a Sphinx bibtex plugin.
- A first contributor!
Scientific Software - Peer-reviewed
- Python
Published by jboynyc almost 6 years ago
textnets - Textnets version 0.3.6
Scientific Software - Peer-reviewed
- Python
Published by jboynyc almost 6 years ago
textnets - Beta release with improved documentation
Scientific Software - Peer-reviewed
- Python
Published by jboynyc almost 6 years ago
textnets - First beta release
See what textnets can now do in this demo notebook.
The package also has more documentation, more unit tests, and should be easier to install.
Scientific Software - Peer-reviewed
- Python
Published by jboynyc almost 6 years ago