datacrumbs
A multi-level dataflow profiler for capture I/O calls from worklows.
Science Score: 52.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
✓Institutional organization owner
Organization llnl has institutional domain (software.llnl.gov) -
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.0%) to scientific vocabulary
Repository
A multi-level dataflow profiler for capture I/O calls from worklows.
Basic Info
- Host: GitHub
- Owner: LLNL
- License: mit
- Language: Jupyter Notebook
- Default Branch: develop
- Size: 385 MB
Statistics
- Stars: 3
- Watchers: 5
- Forks: 3
- Open Issues: 5
- Releases: 4
Metadata Files
README.ares.md
Readme for Ares Cluster
Manage Environments
bash
export DATACRUMBS_DIR=/mnt/common/$USER/datacrumbs
export DATACRUMBS_INSTALL_DIR=/mnt/common/$USER/datacrumbs/install
Dependencies
Use release version of bcc 0.35.0
Install system dependencies:
bash
sudo apt install libelf-dev libclang-14 libclang-14-dev zip
sudo update-alternatives --install /usr/bin/clang clang /usr/bin/clang-14 50
Ensure llvm-config is symlinked using update-alternatives like so:
bash
update-alternatives --install /usr/bin/clang clang /usr/bin/clang-14 50
Installation
bash
export DATACRUMBS_INSTALL_DIR=/mnt/common/$USER/datacrumbs/install
export DATACRUMBS_DIR=/mnt/common/$USER/datacrumbs
git clone git@github.com:hariharan-devarajan/datacrumbs.git
cd datacrumbs
mkdir build
cd build
cmake -DCMAKE_INSTALL_PREFIX=${DATACRUMBS_INSTALL_DIR} ..
make install -j9
cd -
Install Python Environment
Create a python environment
bash
virtualenv --python=python3 ${DATACRUMBS_INSTALL_DIR}
source ${DATACRUMBS_INSTALL_DIR}/bin/activate
pip install --upgrade pip
pip install -r ${DATACRUMBS_DIR}/requirements.txt
pip install -r ${DATACRUMBS_DIR}/bcc_requirements.txt
Install bccs python bindings
bash
cmake -DPYTHON_CMD=python -DCMAKE_INSTALL_PREFIX=${DATACRUMBS_INSTALL_DIR} ..
make install -j9
You may have to copy the python bindings manually if the above does not work:
bash
cp -rv bcc-python3/bcc $DATACRUMBS_INSTALL_DIR/lib/python3.10/site-packages/
Add this to your activate file.
This makes sure when you activate the
environment, the paths are set correctly for bcc:
bash
BCC_INSTALL_DIR="/mnt/repo/software/bcc"
BCC_SITEPKG="$BCC_INSTALL_DIR/lib/python3.10/site-packages/"
export PATH=$PATH:$BCC_INSTALL_DIR/bin:$BCC_INSTALL_DIR/sbin
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$BCC_INSTALL_DIR/lib:$BCC_INSTALL_DIR/lib64
export PYTHONPATH=$PYTHONPATH:$BCC_SITEPKG
Or simply run the following commands to set the environment variables:
```bash cat << EOF >> ${DATACRUMBSINSTALLDIR}/bin/activate BCCINSTALLDIR="/mnt/repo/software/bcc" BCCSITEPKG="$BCCINSTALL_DIR/lib/python3.10/site-packages/"
export PATH=$PATH:$BCCINSTALLDIR/bin:$BCCINSTALLDIR/sbin export LDLIBRARYPATH=$LDLIBRARYPATH:$BCCINSTALLDIR/lib:$BCCINSTALLDIR/lib64 export PYTHONPATH=$PYTHONPATH:$BCC_SITEPKG EOF ```
Running the tool
Setup
bash
export DATACRUMBS_DIR=/mnt/common/$USER/datacrumbs
source ${DATACRUMBS_DIR}/scripts/ares/server/setup.sh
source ${DATACRUMBS_DIR}/scripts/ares/server/init.sh
source ${DATACRUMBS_DIR}/scripts/ares/server/run.sh
Running Simple Test
bash
export DATACRUMBS_DIR=/mnt/common/$USER/datacrumbs
export DATACRUMBS_INSTALL_DIR=/mnt/common/$USER/datacrumbs/install
source ${DATACRUMBS_DIR}/scripts/ares/client/setup.sh
source ${DATACRUMBS_DIR}/scripts/ares/client/tests/test_posix.sh
Slurm integration
Slurm job installation will be added here (need to be done by admin)
Owner
- Name: Lawrence Livermore National Laboratory
- Login: LLNL
- Kind: organization
- Email: github-admin@llnl.gov
- Location: Livermore, CA, USA
- Website: https://software.llnl.gov
- Twitter: LLNL_OpenSource
- Repositories: 520
- Profile: https://github.com/LLNL
For over 70 years, the Lawrence Livermore National Laboratory has applied science and technology to make the world a safer place.
Citation (CITATION.cff)
cff-version: 1.2.0
message: If you use this software, please cite the software and the paper.
title: "DataCrumbs"
version: 0.0.3
license: MIT
url: https://github.com/hariharan-devarajan/datacrumbs
repository-code: https://github.com/hariharan-devarajan/datacrumbs
contact:
- name: Hariharan Devarajan
email: mani.hariharan@gmail.com
authors:
- family-names: Devarajan
given-names: Hariharan
orcid: https://orcid.org/0000-0001-5625-3494
GitHub Events
Total
- Create event: 26
- Issues event: 9
- Watch event: 1
- Delete event: 25
- Member event: 1
- Issue comment event: 1
- Push event: 76
- Pull request review comment event: 12
- Pull request review event: 22
- Pull request event: 40
- Fork event: 1
Last Year
- Create event: 26
- Issues event: 9
- Watch event: 1
- Delete event: 25
- Member event: 1
- Issue comment event: 1
- Push event: 76
- Pull request review comment event: 12
- Pull request review event: 22
- Pull request event: 40
- Fork event: 1
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 6
- Total pull requests: 23
- Average time to close issues: 2 days
- Average time to close pull requests: 2 days
- Total issue authors: 2
- Total pull request authors: 3
- Average comments per issue: 0.0
- Average comments per pull request: 0.09
- Merged pull requests: 8
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 6
- Pull requests: 23
- Average time to close issues: 2 days
- Average time to close pull requests: 2 days
- Issue authors: 2
- Pull request authors: 3
- Average comments per issue: 0.0
- Average comments per pull request: 0.09
- Merged pull requests: 8
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- nrj5k (4)
- hariharan-devarajan (2)
Pull Request Authors
- hariharan-devarajan (19)
- asparsa (3)
- trws (1)