https://github.com/alan-turing-institute/nestpackage

https://github.com/alan-turing-institute/nestpackage

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (19.3%) to scientific vocabulary
Last synced: 9 months ago · JSON representation

Repository

Basic Info
  • Host: GitHub
  • Owner: alan-turing-institute
  • Language: Python
  • Default Branch: main
  • Size: 698 KB
Statistics
  • Stars: 5
  • Watchers: 4
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created almost 4 years ago · Last pushed over 2 years ago
Metadata Files
Readme

README.md

NEST

The NEtwork STatistic (NEST) package is designed to give a quick and easy way to produce in depth initial exploratory analysis of a network dataset. Full details can be found in the documentation.

This framework where given an input graph, of any type,

  • directed or undirected,
  • temporal or static,
  • with (or without) node attributes
  • with (or without) edge attributes

node and/or edge attributes, temporal or {static}, and will produce an exhaustive range of statistics summerising this graph, in a nicely presented document in numerous formats, namely:

  • Pdf
  • Csv
  • Html
  • Others to follow

The outputs are designed to allow a researcher/data scientist to quickly evaluate a dataset, or to share crucial statistics of a dataset with others.

The statistics in question cover many different areas of network science, leveraging several python based network science libraries, with a strong emphasis on the excellent networkx library. They include:

  • Basic summary statistics (number of nodes etc)
  • Centrality Measures
  • Community structure
  • Path based measures
  • Spectral measures
  • Motif measures
  • Time series measures

Further, we strongly encourage others to contribute either your own measures to our library, or indeed any other additions to the library that may be helpful to others.

Installation

The most direct way to install the package to use it directly is to install it via pip. For now it can be directly installed from github, and in future it will be available on directly on PyPI

{bash} pip install git+https://github.com/alan-turing-institute/nestpackage

The package can also just be clone directly from our github package, which is the recommended route if you wish to add additional statistics.

Requirements

Required: - matplotlib - networkx - numpy - pandas - scipy - seaborn - scikit-learn

The package also requires at least one of the following: - pandoc - reportlab

Finally, the following packages are optional and are needed for some statistics: - motifcluster - python-louvain

Usage

The package can be used both directly from python or alternatively via a command line tool.

The command line tool has the following options:

Optons:

| Arguments | Description | |---------------|------------------------------------| | -h, --help | show this help message and exit | | --datafile | Csv file path | | --src | Source Columns separated by commas | | --dst | Dest Columns separated by commas | | --directed | Data is directed (Default) | | --no-directed | Data is undirected (Not Default) | | --weight | Weight Column | | --time | Time Column | | --outputfile | Output file | | --data_name | Data set name |

Examples

Specify a data file (represented as an edge list).. bash nest --data_file exampleData.csv

Specify columns that make up the source node . The combination of columns are used the source ID. bash nest --src F,G

Specify columns that make up the destination. The combination of columns are used the source ID. bash nest --dst F,G

Putting this all together, with a specification of the time column we could get the following command:

bash nest --data_file exampleData.csv --src Col1,Col2 --dst Col3,Col4 --time time

Owner

  • Name: The Alan Turing Institute
  • Login: alan-turing-institute
  • Kind: organization
  • Email: info@turing.ac.uk

The UK's national institute for data science and artificial intelligence.

GitHub Events

Total
  • Watch event: 1
Last Year
  • Watch event: 1

Issues and Pull Requests

Last synced: over 1 year ago

All Time
  • Total issues: 0
  • Total pull requests: 4
  • Average time to close issues: N/A
  • Average time to close pull requests: less than a minute
  • Total issue authors: 0
  • Total pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.25
  • Merged pull requests: 3
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
  • andeElliott (4)
Top Labels
Issue Labels
Pull Request Labels

Dependencies

docs/requirements.txt pypi
  • matplotlib *
  • motifcluster *
  • networkx *
  • numpy *
  • pandas *
  • pandoc *
  • python-louvain *
  • reportlab *
  • scikit-learn *
  • scipy *
  • seaborn *
  • statsmodels *
pyproject.toml pypi