https://github.com/alan-turing-institute/nestpackage
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (19.3%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: alan-turing-institute
- Language: Python
- Default Branch: main
- Size: 698 KB
Statistics
- Stars: 5
- Watchers: 4
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
NEST
The NEtwork STatistic (NEST) package is designed to give a quick and easy way to produce in depth initial exploratory analysis of a network dataset. Full details can be found in the documentation.
This framework where given an input graph, of any type,
- directed or undirected,
- temporal or static,
- with (or without) node attributes
- with (or without) edge attributes
node and/or edge attributes, temporal or {static}, and will produce an exhaustive range of statistics summerising this graph, in a nicely presented document in numerous formats, namely:
- Csv
- Html
- Others to follow
The outputs are designed to allow a researcher/data scientist to quickly evaluate a dataset, or to share crucial statistics of a dataset with others.
The statistics in question cover many different areas of network science,
leveraging several python based network science libraries, with a strong
emphasis on the excellent networkx library. They include:
- Basic summary statistics (number of nodes etc)
- Centrality Measures
- Community structure
- Path based measures
- Spectral measures
- Motif measures
- Time series measures
Further, we strongly encourage others to contribute either your own measures to our library, or indeed any other additions to the library that may be helpful to others.
Installation
The most direct way to install the package to use it directly is to install it via pip. For now it can be directly installed from github, and in future it will be available on directly on PyPI
{bash}
pip install git+https://github.com/alan-turing-institute/nestpackage
The package can also just be clone directly from our github package, which is the recommended route if you wish to add additional statistics.
Requirements
Required: - matplotlib - networkx - numpy - pandas - scipy - seaborn - scikit-learn
The package also requires at least one of the following: - pandoc - reportlab
Finally, the following packages are optional and are needed for some statistics: - motifcluster - python-louvain
Usage
The package can be used both directly from python or alternatively via a command line tool.
The command line tool has the following options:
Optons:
| Arguments | Description | |---------------|------------------------------------| | -h, --help | show this help message and exit | | --datafile | Csv file path | | --src | Source Columns separated by commas | | --dst | Dest Columns separated by commas | | --directed | Data is directed (Default) | | --no-directed | Data is undirected (Not Default) | | --weight | Weight Column | | --time | Time Column | | --outputfile | Output file | | --data_name | Data set name |
Examples
Specify a data file (represented as an edge list)..
bash
nest --data_file exampleData.csv
Specify columns that make up the source node . The combination of columns are used the source ID.
bash
nest --src F,G
Specify columns that make up the destination. The combination of columns are used the source ID.
bash
nest --dst F,G
Putting this all together, with a specification of the time column we could get the following command:
bash
nest --data_file exampleData.csv --src Col1,Col2 --dst Col3,Col4 --time time
Owner
- Name: The Alan Turing Institute
- Login: alan-turing-institute
- Kind: organization
- Email: info@turing.ac.uk
- Website: https://turing.ac.uk
- Repositories: 477
- Profile: https://github.com/alan-turing-institute
The UK's national institute for data science and artificial intelligence.
GitHub Events
Total
- Watch event: 1
Last Year
- Watch event: 1
Issues and Pull Requests
Last synced: over 1 year ago
All Time
- Total issues: 0
- Total pull requests: 4
- Average time to close issues: N/A
- Average time to close pull requests: less than a minute
- Total issue authors: 0
- Total pull request authors: 1
- Average comments per issue: 0
- Average comments per pull request: 0.25
- Merged pull requests: 3
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
- andeElliott (4)
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- matplotlib *
- motifcluster *
- networkx *
- numpy *
- pandas *
- pandoc *
- python-louvain *
- reportlab *
- scikit-learn *
- scipy *
- seaborn *
- statsmodels *