https://github.com/bytehamster/mphf-experiments

Comparison of different MPHF algorithms

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 48 DOI reference(s) in README
  • Academic publication links
    Links to: researchgate.net
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (6.6%) to scientific vocabulary
Last synced: 7 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: ByteHamster
  • License: gpl-3.0
  • Language: C++
  • Default Branch: main
  • Size: 2.83 MB
Statistics
  • Stars: 7
  • Watchers: 6
  • Forks: 3
  • Open Issues: 0
  • Releases: 1
Created over 3 years ago · Last pushed 11 months ago
Metadata Files
  • Readme
  • License

README.md

MPHF-Experiments

Comparison of a wide range of different minimal perfect hash functions (MPHFs). From the measurements, the framework can generate comprehensive plots, such as Pareto plots, and the simple comparison tables used in several papers.
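A Pareto plot shows only those configurations that are not beaten by another configuration on every axis at once. The following is a minimal sketch of that filtering step, not code from this repository. It assumes two cost axes where lower is better (space in bits per key and construction time); the struct Measurement and the function paretoFront are hypothetical names chosen for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <limits>
#include <string>
#include <vector>

// Hypothetical record of one benchmark run (not the repository's data format).
struct Measurement {
    std::string config;          // label of the tested configuration
    double bitsPerKey;           // space usage, lower is better
    double constructionSeconds;  // construction time, lower is better
};

// Returns all measurements that are not dominated by another measurement,
// sorted by increasing space usage.
std::vector<Measurement> paretoFront(std::vector<Measurement> points) {
    // Sort by space first; break ties by time so the faster point comes first.
    std::sort(points.begin(), points.end(),
              [](const Measurement &a, const Measurement &b) {
                  if (a.bitsPerKey != b.bitsPerKey) {
                      return a.bitsPerKey < b.bitsPerKey;
                  }
                  return a.constructionSeconds < b.constructionSeconds;
              });
    std::vector<Measurement> front;
    double bestTime = std::numeric_limits<double>::infinity();
    for (const Measurement &p : points) {
        // Every earlier point uses at most as much space, so p survives
        // only if it is strictly faster than all of them.
        if (p.constructionSeconds < bestTime) {
            front.push_back(p);
            bestTime = p.constructionSeconds;
        }
    }
    return front;
}
```

A configuration that needs more space than another one stays on the front only if it is strictly faster to construct, which is why a single pass over the sorted points is sufficient.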

The framework provides a unified interface to test basically all modern MPHF constructions that are currently available.

Cloning the Repository

This repository contains submodules. To clone the repository including submodules, use the following command.

git clone --recursive https://github.com/ByteHamster/MPHF-Experiments.git

Running the Experiments Directly

Compiling works as with any CMake project.

cmake -B ./build -DCMAKE_BUILD_TYPE=Release
cmake --build ./build -j

This might take about 5-15 minutes because of the large number of competitors. You can then run one of the benchmarks, for example ./build/TablePtrHash --help or ./build/Comparison --help.

Code Structure

The main comparison code can be found in the src directory. This includes the tabular comparisons used in different papers, as well as the more general Pareto plot in src/Comparison.cpp. To add a new competitor to the framework, have a look at the contenders directory. For each contender, there are two files: (1) a general wrapper header class that unifies the interface of the competitor, and (2) a cpp file that tests a wide range of configurations for the general Pareto plot. The cpp file should contain all meaningful configurations to cover all possible trade-offs. After adding a contender, make sure to re-run cmake. If you want to add a new comparison table, make sure to also adapt the CMakeLists.txt file accordingly.
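To illustrate what a wrapper that unifies the interface might look like, here is a minimal sketch. It is not the repository's actual base class: the names Contender, SortedRankContender, and isMinimalPerfect, as well as all method signatures, are assumptions made for this example. The toy competitor simply returns the rank of a key among the sorted keys, which is a valid but space-inefficient MPHF.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical unified interface; the real wrapper headers in the
// contenders directory may look different.
class Contender {
public:
    virtual ~Contender() = default;
    virtual std::string name() const = 0;
    virtual void construct(const std::vector<std::string> &keys) = 0;
    virtual std::size_t query(const std::string &key) const = 0;
    virtual std::size_t sizeBits() const = 0;
};

// Toy competitor used only to exercise the interface: it stores the
// sorted keys and maps each key to its rank.
class SortedRankContender : public Contender {
    std::vector<std::string> sorted;

public:
    std::string name() const override { return "SortedRank"; }

    void construct(const std::vector<std::string> &keys) override {
        sorted = keys;
        std::sort(sorted.begin(), sorted.end());
        // An MPHF is only defined for a set of distinct keys.
        assert(std::adjacent_find(sorted.begin(), sorted.end()) == sorted.end());
    }

    std::size_t query(const std::string &key) const override {
        auto it = std::lower_bound(sorted.begin(), sorted.end(), key);
        return static_cast<std::size_t>(it - sorted.begin());
    }

    std::size_t sizeBits() const override {
        std::size_t bytes = 0;
        for (const std::string &k : sorted) bytes += k.size();
        return 8 * bytes;  // counts key characters only
    }
};

// Checks that every key is mapped to a distinct value in [0, n).
bool isMinimalPerfect(const Contender &c, const std::vector<std::string> &keys) {
    std::vector<bool> taken(keys.size(), false);
    for (const std::string &key : keys) {
        std::size_t h = c.query(key);
        if (h >= keys.size() || taken[h]) return false;
        taken[h] = true;
    }
    return true;
}
```

With an interface of this shape, a benchmark driver can construct each contender from the same key set, verify it with isMinimalPerfect, and record sizeBits() together with timing measurements, without knowing anything about the underlying library.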

Running the Experiments with Docker

For easier reproducibility and less setup overhead, we provide a Docker image to run the experiments. However, for the measurements in the papers, we run the code directly and with more data points. See Docker.md for details on how to use this repository with Docker.

License

This code is licensed under the GPLv3.

Owner

  • Login: ByteHamster
  • Kind: user
  • Location: Germany
  • Company: Karlsruhe Institute of Technology

I'm a PhD student at Karlsruhe Institute of Technology. In my free time, I maintain AntennaPod and contribute to other projects like K-9 Mail and Baikal Server.

GitHub Events

Total
  • Create event: 6
  • Commit comment event: 2
  • Issues event: 1
  • Watch event: 3
  • Delete event: 5
  • Issue comment event: 4
  • Push event: 79
  • Pull request review event: 2
  • Pull request review comment event: 6
  • Pull request event: 7
  • Fork event: 5
Last Year
  • Create event: 6
  • Commit comment event: 2
  • Issues event: 1
  • Watch event: 3
  • Delete event: 5
  • Issue comment event: 4
  • Push event: 79
  • Pull request review event: 2
  • Pull request review comment event: 6
  • Pull request event: 7
  • Fork event: 5

Committers

Last synced: 12 months ago

All Time
  • Total Commits: 334
  • Total Committers: 3
  • Avg Commits per committer: 111.333
  • Development Distribution Score (DDS): 0.078
Past Year
  • Commits: 110
  • Committers: 3
  • Avg Commits per committer: 36.667
  • Development Distribution Score (DDS): 0.236
Top Committers
Name Email Commits
ByteHamster i****o@b****m 308
Piotr Beling p****g@w****l 25
Stefan Hermann s****n@g****m 1
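The all-time figures above can be reproduced from the commit counts in the table. The sketch below assumes that the Development Distribution Score is defined as one minus the top committer's share of all commits; this page does not state the definition, so treat it as an assumption that happens to match the reported value. The function names are hypothetical.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <numeric>
#include <vector>

// Assumed definition: DDS = 1 - (commits of top committer / total commits).
double developmentDistributionScore(const std::vector<long> &commits) {
    long total = std::accumulate(commits.begin(), commits.end(), 0L);
    if (total == 0) return 0.0;  // no commits, nothing to distribute
    long top = *std::max_element(commits.begin(), commits.end());
    return 1.0 - static_cast<double>(top) / static_cast<double>(total);
}

// Average number of commits per committer.
double averageCommits(const std::vector<long> &commits) {
    if (commits.empty()) return 0.0;
    long total = std::accumulate(commits.begin(), commits.end(), 0L);
    return static_cast<double>(total) / static_cast<double>(commits.size());
}
```

For the commit counts 308, 25, and 1, this gives 1 - 308/334 ≈ 0.078 and 334/3 ≈ 111.333, in line with the all-time statistics listed above.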

Issues and Pull Requests

Last synced: 7 months ago

All Time
  • Total issues: 2
  • Total pull requests: 10
  • Average time to close issues: 1 day
  • Average time to close pull requests: 8 days
  • Total issue authors: 2
  • Total pull request authors: 2
  • Average comments per issue: 1.5
  • Average comments per pull request: 0.3
  • Merged pull requests: 4
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 1
  • Pull requests: 10
  • Average time to close issues: N/A
  • Average time to close pull requests: 8 days
  • Issue authors: 1
  • Pull request authors: 2
  • Average comments per issue: 0.0
  • Average comments per pull request: 0.3
  • Merged pull requests: 4
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • RagnarGrootKoerkamp (1)
  • meling (1)
Pull Request Authors
  • ByteHamster (6)
  • stefanfred (4)