https://github.com/atlarge-research/sharebench

https://github.com/atlarge-research/sharebench

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (8.3%) to scientific vocabulary
Last synced: 9 months ago · JSON representation

Repository

Basic Info
  • Host: GitHub
  • Owner: atlarge-research
  • License: gpl-3.0
  • Language: Jupyter Notebook
  • Default Branch: main
  • Homepage:
  • Size: 2.38 MB
Statistics
  • Stars: 0
  • Watchers: 9
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created almost 2 years ago · Last pushed almost 2 years ago
Metadata Files
Readme License

README.md

ShareBench

ShareBench is a tool set for real-world performance analysis studies of distributed resource-sharing mechanisms and policies, developed as part of the Computer Science Bachelor Thesis on Performance Characterization of Distributed Resource-Sharing Mechanisms by Lennart K. M. Schulz.

The work consists of two components: ShareBench-Base and ShareBench-Gen. The former is an infrastructure framework for automated real-world performance analysis studies. The latter is a workload generator for OLAP workloads based on the TPC-DS data set and queries. Documentation for each of the components can be found in the docs folder.

The only supported composition is Spark on Kubernetes. Support for other Application Frameworks or Resource Managers is not currently planned.

The work was initially based on the spark-data-generator by Sacheendra Talluri. Code annotations indicate what material is copied/adapted from that source.

Experiment Data

To reduce the size of this repository, the data obtained through the experiments is not directly included but rather provided in sharebench-data.

Citation

When using ShareBench for research, please use the following BibTeX entry for citations: bibtex @thesis{schulz2024Sharebench, author = {Schulz, Lennart K. M.}, title = {{ShareBench}: Performance Characterization of Distributed Resource-Sharing Mechanisms}, institution = {{VU} Amsterdam}, type = {Bachelor Thesis}, date = {2024}, }

Owner

  • Name: @Large Research
  • Login: atlarge-research
  • Kind: organization
  • Email: info@atlarge-research.com

Massivizing Computer Systems

GitHub Events

Total
Last Year

Dependencies

templates/Dockerfile docker
  • apache/spark 3.4.1-scala2.12-java11-ubuntu build
requirements.txt pypi
  • Jinja2 ==3.1.4
  • PyYAML ==6.0.1
  • ipykernel ==6.29.5
  • matplotlib ==3.9.0
  • numpy ==2.0.0
  • pandas ==2.2.2