cytosnake-benchmarks

CytoSnake benchmarks

https://github.com/wayscience/cytosnake-benchmarks

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (14.7%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

CytoSnake benchmarks

Basic Info
  • Host: GitHub
  • Owner: WayScience
  • Language: Jupyter Notebook
  • Default Branch: main
  • Size: 1.4 MB
Statistics
  • Stars: 1
  • Watchers: 1
  • Forks: 1
  • Open Issues: 6
  • Releases: 0
Created over 2 years ago · Last pushed about 2 years ago
Metadata Files
Readme Citation

README.md

CytoSnake-Benchmarks

Welcome to the CytoSnake's Benchmark Repository, where we conduct version-controlled performance tracking of CytoSnake's workflows. This repository benchmarks various image-based profiling processing workflows. We use this repo as a version control system to maintain transparency and track changes in performance. Below we document our benchmarking process, including the datasets used, configuration files, performance metrics measured, and the selected workflow benchmarked Python notebooks.

Structure

image

CytoSnake-Benchmark repository structure

The image above provides an overview of the CytoSnake-Benchmark repository structure. The central elements include the src/ directory (red), housing utility functions for processing raw benchmark files and allowing those functions to be modular. Adjacent is the all-benchmarks/ directory (green), acting as a central repository for benchmarks conducted on specific datasets (blue). Within each tested benchmark folder (yellow) (e.g inside benchmark name 1) a jupyter notebook is used to process the benchmark files and generates plots in the image/ dir and benchmarking results in the results/ dir. Each folder features a detailed README specifying test contents and datasets used.

Below is a table that describes all of the currently available benchmarks inside the all-benchmarks directory: | Directory Name | Description | | --------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | | cell-health-cp-cp_process | Benchmarks the cp_process workflow using the cell-health dataset cell profile features |

Installation and Usage

Installation

To get started, follow these steps:

  1. Clone the benchmark repository to your local machine:

bash git clone git@github.com:WayScience/CytoSnake-Benchmarks.git

  1. To set up the required dependencies, create a conda environment by utilizing the cytosnake_benchmarks.yaml file.

bash conda env create -f env.yaml

Note: If you already have mamba installed, you can substitute conda with mamba. It is advised to use mamba as it is widely recognized for being a faster alternative to conda. If you haven't installed mamba yet, you can follow the installation instructions here.

  1. Then install the CytoSnake-Benchmark into the local Python environment:

bash pip install -e .

This will give the notebooks to have access to all the functions within the src/ directory.

Creating a benchmark

To create a benchmark, follow these steps: First, create a folder in the all-benchmarks/ directory with the structure {data_type}_{features}_benchmarks/ for the file name. Next, transfer all the necessary files into this folder, which should include the benchmark generated when executing CytoSnake's benchmarking mode. Utilize notebooks to document and display all the raw code and figures generated during the benchmarking analysis. Lastly, it's recommended to store all intermediate files and figures in the repository, allowing others to review and access them.

Owner

  • Name: The Way Lab
  • Login: WayScience
  • Kind: organization
  • Location: United States of America

The Way Lab at CU Anschutz

Citation (CITATION.cff)

cff-version: 1.0.3
message: Please use the citation below to cite this repository.
title: CytoSnake-Benchmarks
version: 0.0.1
authors:
  - name: Erik Serrano
    orcid: https://orcid.org/0000-0003-4188-9080
    affiliation: University of Colorado, Anschutz Medical Campus
    email: erik.serrano@cuanschutz.edu
license: BSD-3-Clause
keywords:
  - benchmarking
  - workflows
  - snakemake
repository-code: https://github.com/username/repo

# below are datasets used under this repo
dependencies:
  - name: Cell-Health Cell Painting Single Cell Profiles
    version: 5.0.0
    doi: https://doi.org/10.1091/mbc.E20-12-0784.
  - name: NF1 Cell Painting Data
    url: https://github.com/WayScience/nf1_cellpainting_data

GitHub Events

Total
Last Year

Dependencies

setup.py pypi