datacrumbs

A multi-level dataflow profiler for capture I/O calls from worklows.

https://github.com/llnl/datacrumbs

Science Score: 52.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
    Organization llnl has institutional domain (software.llnl.gov)
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.0%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

A multi-level dataflow profiler for capture I/O calls from worklows.

Basic Info
  • Host: GitHub
  • Owner: LLNL
  • License: mit
  • Language: Jupyter Notebook
  • Default Branch: develop
  • Size: 385 MB
Statistics
  • Stars: 3
  • Watchers: 5
  • Forks: 3
  • Open Issues: 5
  • Releases: 4
Created over 1 year ago · Last pushed 6 months ago
Metadata Files
Readme License Citation Codeowners

README.ares.md

Readme for Ares Cluster

Manage Environments

bash export DATACRUMBS_DIR=/mnt/common/$USER/datacrumbs export DATACRUMBS_INSTALL_DIR=/mnt/common/$USER/datacrumbs/install

Dependencies

Use release version of bcc 0.35.0

Install system dependencies:

bash sudo apt install libelf-dev libclang-14 libclang-14-dev zip sudo update-alternatives --install /usr/bin/clang clang /usr/bin/clang-14 50

Ensure llvm-config is symlinked using update-alternatives like so:

bash update-alternatives --install /usr/bin/clang clang /usr/bin/clang-14 50

Installation

bash export DATACRUMBS_INSTALL_DIR=/mnt/common/$USER/datacrumbs/install export DATACRUMBS_DIR=/mnt/common/$USER/datacrumbs git clone git@github.com:hariharan-devarajan/datacrumbs.git cd datacrumbs mkdir build cd build cmake -DCMAKE_INSTALL_PREFIX=${DATACRUMBS_INSTALL_DIR} .. make install -j9 cd -

Install Python Environment

Create a python environment

bash virtualenv --python=python3 ${DATACRUMBS_INSTALL_DIR} source ${DATACRUMBS_INSTALL_DIR}/bin/activate pip install --upgrade pip pip install -r ${DATACRUMBS_DIR}/requirements.txt pip install -r ${DATACRUMBS_DIR}/bcc_requirements.txt

Install bccs python bindings

bash cmake -DPYTHON_CMD=python -DCMAKE_INSTALL_PREFIX=${DATACRUMBS_INSTALL_DIR} .. make install -j9

You may have to copy the python bindings manually if the above does not work:

bash cp -rv bcc-python3/bcc $DATACRUMBS_INSTALL_DIR/lib/python3.10/site-packages/

Add this to your activate file. This makes sure when you activate the environment, the paths are set correctly for bcc:

bash BCC_INSTALL_DIR="/mnt/repo/software/bcc" BCC_SITEPKG="$BCC_INSTALL_DIR/lib/python3.10/site-packages/" export PATH=$PATH:$BCC_INSTALL_DIR/bin:$BCC_INSTALL_DIR/sbin export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$BCC_INSTALL_DIR/lib:$BCC_INSTALL_DIR/lib64 export PYTHONPATH=$PYTHONPATH:$BCC_SITEPKG

Or simply run the following commands to set the environment variables:

```bash cat << EOF >> ${DATACRUMBSINSTALLDIR}/bin/activate BCCINSTALLDIR="/mnt/repo/software/bcc" BCCSITEPKG="$BCCINSTALL_DIR/lib/python3.10/site-packages/"

export PATH=$PATH:$BCCINSTALLDIR/bin:$BCCINSTALLDIR/sbin export LDLIBRARYPATH=$LDLIBRARYPATH:$BCCINSTALLDIR/lib:$BCCINSTALLDIR/lib64 export PYTHONPATH=$PYTHONPATH:$BCC_SITEPKG EOF ```

Running the tool

Setup

bash export DATACRUMBS_DIR=/mnt/common/$USER/datacrumbs source ${DATACRUMBS_DIR}/scripts/ares/server/setup.sh source ${DATACRUMBS_DIR}/scripts/ares/server/init.sh source ${DATACRUMBS_DIR}/scripts/ares/server/run.sh

Running Simple Test

bash export DATACRUMBS_DIR=/mnt/common/$USER/datacrumbs export DATACRUMBS_INSTALL_DIR=/mnt/common/$USER/datacrumbs/install source ${DATACRUMBS_DIR}/scripts/ares/client/setup.sh source ${DATACRUMBS_DIR}/scripts/ares/client/tests/test_posix.sh

Slurm integration

Slurm job installation will be added here (need to be done by admin)

Owner

  • Name: Lawrence Livermore National Laboratory
  • Login: LLNL
  • Kind: organization
  • Email: github-admin@llnl.gov
  • Location: Livermore, CA, USA

For over 70 years, the Lawrence Livermore National Laboratory has applied science and technology to make the world a safer place.

Citation (CITATION.cff)

cff-version: 1.2.0
message: If you use this software, please cite the software and the paper.
title: "DataCrumbs"
version: 0.0.3
license: MIT
url: https://github.com/hariharan-devarajan/datacrumbs
repository-code: https://github.com/hariharan-devarajan/datacrumbs
contact:
  - name: Hariharan Devarajan
    email: mani.hariharan@gmail.com
authors:
  - family-names: Devarajan
    given-names: Hariharan
    orcid: https://orcid.org/0000-0001-5625-3494

GitHub Events

Total
  • Create event: 26
  • Issues event: 9
  • Watch event: 1
  • Delete event: 25
  • Member event: 1
  • Issue comment event: 1
  • Push event: 76
  • Pull request review comment event: 12
  • Pull request review event: 22
  • Pull request event: 40
  • Fork event: 1
Last Year
  • Create event: 26
  • Issues event: 9
  • Watch event: 1
  • Delete event: 25
  • Member event: 1
  • Issue comment event: 1
  • Push event: 76
  • Pull request review comment event: 12
  • Pull request review event: 22
  • Pull request event: 40
  • Fork event: 1

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 6
  • Total pull requests: 23
  • Average time to close issues: 2 days
  • Average time to close pull requests: 2 days
  • Total issue authors: 2
  • Total pull request authors: 3
  • Average comments per issue: 0.0
  • Average comments per pull request: 0.09
  • Merged pull requests: 8
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 6
  • Pull requests: 23
  • Average time to close issues: 2 days
  • Average time to close pull requests: 2 days
  • Issue authors: 2
  • Pull request authors: 3
  • Average comments per issue: 0.0
  • Average comments per pull request: 0.09
  • Merged pull requests: 8
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • nrj5k (4)
  • hariharan-devarajan (2)
Pull Request Authors
  • hariharan-devarajan (19)
  • asparsa (3)
  • trws (1)
Top Labels
Issue Labels
Pull Request Labels