parallel-hashmap

A family of header-only, very fast and memory-friendly hashmap and btree containers.

https://github.com/greg7mdp/parallel-hashmap

Keywords

concurrency hash hash-container memory-footprint multi-thread parallel parallel-programming tables unordered-map unordered-set

Last synced: 10 months ago · JSON representation ·

Repository

A family of header-only, very fast and memory-friendly hashmap and btree containers.

Basic Info

Host: GitHub
Owner: greg7mdp
License: apache-2.0
Language: C++
Default Branch: master
Homepage: https://greg7mdp.github.io/parallel-hashmap/
Size: 2.96 MB

Statistics

Stars: 3,043
Watchers: 68
Forks: 292
Open Issues: 4
Releases: 23

Topics

concurrency hash hash-container memory-footprint multi-thread parallel parallel-programming tables unordered-map unordered-set

Created over 7 years ago · Last pushed 10 months ago

Metadata Files

Readme Funding License Citation

The Parallel Hashmap

Overview

This repository aims to provide a set of excellent hash map implementations, as well as a btree alternative to std::map and std::set, with the following characteristics:

Header only: nothing to build, just copy the parallel_hashmap directory to your project and you are good to go.
drop-in replacement for std::unordered_map, std::unordered_set, std::map and std::set
Compiler with C++11 support required, C++14 and C++17 APIs are provided (such as try_emplace)
Very efficient, significantly faster than your compiler's unordered map/set or Boost's, or than sparsepp
Memory friendly: low memory usage, although a little higher than sparsepp
Supports heterogeneous lookup
Easy to forward declare: just include phmap_fwd_decl.h in your header files to forward declare Parallel Hashmap containers [note: this does not work currently for hash maps with pointer keys]
Dump/load feature: when a flat hash map stores data that is std::trivially_copyable, the table can be dumped to disk and restored as a single array, very efficiently, and without requiring any hash computation. This is typically about 10 times faster than doing element-wise serialization to disk, but it will use 10% to 60% extra disk space. See examples/serialize.cc. (flat hash map/set only)
Tested on Windows (vs2015 & vs2017, vs2019, vs2022, Intel compiler 18 and 19), linux (g++ 4.8, 5, 6, 7, 8, 9, 10, 11, 12, clang++ 3.9 to 16) and MacOS (g++ and clang++) - click on travis and appveyor icons above for detailed test status.
Automatic support for boost's hash_value() method for providing the hash function (see examples/hash_value.h). Also default hash support for std::pair and std::tuple.
natvis visualization support in Visual Studio (hash map/set only)

@byronhe kindly provided this Chinese translation of the README.md.

Parallel-hashmap or GTL?

The observant among us may have noticed that I have two github repos, parallel-hashmap and gtl, which both provide very similar functionality. Indeed the hash tables in both are equivalent and the code mostly the same. The main difference is that parallel-hashmap only requires a C++11 compiler, while gtl requires a C++20 compiler.

My recommendation would be to use gtl if you are compiling with C++20 or higher, and parallel-hashmap otherwise. While the included hash maps are equivalent, gtl is where new development occurs, and it will include useful new classes.

Fast and memory friendly

Click here For a full writeup explaining the design and benefits of the Parallel Hashmap.

The hashmaps and btree provided here are built upon those open sourced by Google in the Abseil library. The hashmaps use closed hashing, where values are stored directly into a memory array, avoiding memory indirections. By using parallel SSE2 instructions, these hashmaps are able to look up items by checking 16 slots in parallel, allowing the implementation to remain fast even when the table is filled up to 87.5% capacity.

IMPORTANT: This repository borrows code from the abseil-cpp repository, with modifications, and may behave differently from the original. This repository is an independent work, with no guarantees implied or provided by the authors. Please visit abseil-cpp for the official Abseil libraries.

Installation

Copy the parallel_hashmap directory to your project. Update your include path. That's all.

If you are using Visual Studio, you probably want to add phmap.natvis to your projects. This will allow for a clear display of the hash table contents in the debugger.

A cmake configuration files (CMakeLists.txt) is provided for building the tests and examples. Command for building and running the tests is:

```sh cmake -DPHMAPBUILDTESTS=ON -DPHMAPBUILDEXAMPLES=ON -B build

cmake --build build

ctest --test-dir build ```

Example

```c++

include

using phmap::flathashmap;

int main() { // Create an unorderedmap of three strings (that map to strings) flathash_map email = { { "tom", "tom@gmail.com"}, { "jeff", "jk@gmail.com"}, { "jim", "jimg@microsoft.com"} };

// Iterate and print keys and values
for (const auto& n : email)
    std::cout << n.first << "'s email is: " << n.second << "\n";

// Add a new entry
email["bill"] = "bg@whatever.com";

// and print it
std::cout << "bill's email is: " << email["bill"] << "\n";

return 0;

} ```

Various hash maps and their pros and cons

The header parallel_hashmap/phmap.h provides the implementation for the following eight hash tables: - phmap::flathashset - phmap::flathashmap - phmap::nodehashset - phmap::nodehashmap - phmap::parallelflathashset - phmap::parallelflathashmap - phmap::parallelnodehashset - phmap::parallelnodehashmap

The header parallel_hashmap/btree.h provides the implementation for the following btree-based ordered containers: - phmap::btreeset - phmap::btreemap - phmap::btreemultiset - phmap::btreemultimap

The btree containers are direct ports from Abseil, and should behave exactly the same as the Abseil ones, modulo small differences (such as supporting std::stringview instead of absl::stringview, and being forward declarable).

When btrees are mutated, values stored within can be moved in memory. This means that pointers or iterators to values stored in btree containers can be invalidated when that btree is modified. This is a significant difference with std::map and std::set, as the std containers do offer a guarantee of pointer stability. The same is true for the 'flat' hash maps and sets.

The full types with template parameters can be found in the parallelhashmap/phmapfwd_decl.h header, which is useful for forward declaring the Parallel Hashmaps when necessary.

Key decision points for hash containers:

The flat hash maps will move the keys and values in memory. So if you keep a pointer to something inside a flat hash map, this pointer may become invalid when the map is mutated. The node hash maps don't, and should be used instead if this is a problem.
The flat hash maps will use less memory, and usually be faster than the node hash maps, so use them if you can. the exception is when the values inserted in the hash map are large (say more than 100 bytes [needs testing]) and costly to move.
The parallel hash maps are preferred when you have a few hash maps that will store a very large number of values. The non-parallel hash maps are preferred if you have a large number of hash maps, each storing a relatively small number of values.
The benefits of the parallel hash maps are: a. reduced peak memory usage (when resizing), and b. multithreading support (and inherent internal parallelism)

Key decision points for btree containers:

Btree containers are ordered containers, which can be used as alternatives to std::map and std::set. They store multiple values in each tree node, and are therefore more cache friendly and use significantly less memory.

Btree containers will usually be preferable to the default red-black trees of the STL, except when: - pointer stability or iterator stability is required - the value_type is large and expensive to move

When an ordering is not needed, a hash container is typically a better choice than a btree one.

Changes to Abseil's hashmaps

The default hash framework is std::hash, not absl::Hash. However, if you prefer the default to be the Abseil hash framework, include the Abseil headers before phmap.h and define the preprocessor macro PHMAP_USE_ABSL_HASH.
The erase(iterator) and erase(const_iterator) both return an iterator to the element following the removed element, as does the std::unordered_map. A non-standard void _erase(iterator) is provided in case the return value is not needed.
No new types, such as absl::string_view, are provided. All types with a std::hash<> implementation are supported by phmap tables (including std::string_view of course if your compiler provides it).
The Abseil hash tables internally randomize a hash seed, so that the table iteration order is non-deterministic. This can be useful to prevent Denial Of Service attacks when a hash table is used for a customer facing web service, but it can make debugging more difficult. The phmap hashmaps by default do not implement this randomization, but it can be enabled by adding #define PHMAP_NON_DETERMINISTIC 1 before including the header phmap.h (as is done in rawhashset_test.cc).
Unlike the Abseil hash maps, we do an internal mixing of the hash value provided. This prevents serious degradation of the hash table performance when the hash function provided by the user has poor entropy distribution. The cost in performance is very minimal, and this helps provide reliable performance even with imperfect hash functions. Disabling this mixing is possible by defining the preprocessor macro PHMAP_DISABLE_MIX=1 before phmap.h is included, but it is not recommended.

Memory usage

| type | memory usage | additional peak memory usage when resizing | |-----------------------|-------------------|-----------------------------------------------| | flat tables | flat_mem_usage | flat_peak_usage | | node tables | node_mem_usage | node_peak_usage | | parallel flat tables | | parallel_flat_peak | | parallel node tables | | parallel_node_peak |

size() is the number of values in the container, as returned by the size() method
load_factor() is the ratio: size() / bucket_count(). It varies between 0.4375 (just after the resize) to 0.875 (just before the resize). The size of the bucket array doubles at each resize.
the value 9 comes from sizeof(void *) + 1, as the node hash maps store one pointer plus one byte of metadata for each entry in the bucket array.
flat tables store the values, plus one byte of metadata per value), directly into the bucket array, hence the sizeof(C::value_type) + 1.
the additional peak memory usage (when resizing) corresponds the the old bucket array (half the size of the new one, hence the 0.5), which contains the values to be copied to the new bucket array, and which is freed when the values have been copied.
the parallel hashmaps, when created with a template parameter N=4, create 16 submaps. When the hash values are well distributed, and in single threaded mode, only one of these 16 submaps resizes at any given time, hence the factor 0.03 roughly equal to 0.5 / 16

Iterator invalidation for hash containers

The rules are the same as for std::unordered_map, and are valid for all the phmap hash containers:

| Operations | Invalidated | |-------------------------------------------|----------------------------| | All read only operations, swap, std::swap | Never | | clear, rehash, reserve, operator= | Always | | insert, emplace, emplace_hint, operator[] | Only if rehash triggered | | erase | Only to the element erased |

Iterator invalidation for btree containers

Unlike for std::map and std::set, any mutating operation may invalidate existing iterators to btree containers.

| Operations | Invalidated | |-------------------------------------------|----------------------------| | All read only operations, swap, std::swap | Never | | clear, operator= | Always | | insert, emplace, emplace_hint, operator[] | Yes | | erase | Yes |

Example 2 - providing a hash function for a user-defined class

In order to use a flathashset or flathashmap, a hash function should be provided. This can be done with one of the following methods:

Provide a hash functor via the HashFcn template parameter
As with boost, you may add a hash_value() friend function in your class.

For example:

```c++

include // minimal header providing phmap::HashState()

include

using std::string;

struct Person { bool operator==(const Person &o) const { return first == o.first && last == o.last && age == o.age; }

friend size_t hash_value(const Person &p)
{
    return phmap::HashState().combine(0, p._first, p._last, p._age);
}

string _first;
string _last;
int    _age;

}; ```

Inject a specialization of std::hash for the class into the "std" namespace. We provide a convenient and small header phmap_utils.h which allows to easily add such specializations.

For example:

file "Person.h"

```c++

include // minimal header providing phmap::HashState()

include

using std::string;

struct Person { bool operator==(const Person &o) const { return first == o.first && last == o.last && age == o.age; }

string _first;
string _last;
int    _age;

};

namespace std { // inject specialization of std::hash for Person into namespace std // ---------------------------------------------------------------- template<> struct hash { std::sizet operator()(Person const &p) const { return phmap::HashState().combine(0, p.first, p.last, p.age); } }; } ```

The std::hash specialization for Person combines the hash values for both first and last name and age, using the convenient phmap::HashState() function, and returns the combined hash value.

file "main.cpp"

```c++

include "Person.h" // defines Person with std::hash specialization

include

int main() { // As we have defined a specialization of std::hash() for Person, // we can now create sparsehashset or sparsehashmap of Persons // ---------------------------------------------------------------- phmap::flathashset persons = { { "John", "Mitchell", 35 }, { "Jane", "Smith", 32 }, { "Jane", "Smith", 30 }, };

for (auto& p: persons)
    std::cout << p._first << ' ' << p._last << " (" << p._age << ")" << '\n';

} ```

Thread safety

Parallel Hashmap containers follow the thread safety rules of the Standard C++ library. In Particular:

A single phmap hash table is thread safe for reading from multiple threads. For example, given a hash table A, it is safe to read A from thread 1 and from thread 2 simultaneously.
If a single hash table is being written to by one thread, then all reads and writes to that hash table on the same or other threads must be protected. For example, given a hash table A, if thread 1 is writing to A, then thread 2 must be prevented from reading from or writing to A.
It is safe to read and write to one instance of a type even if another thread is reading or writing to a different instance of the same type. For example, given hash tables A and B of the same type, it is safe if A is being written in thread 1 and B is being read in thread 2.
The parallel tables can be made internally thread-safe for concurrent read and write access, by providing a synchronization type (for example std::mutex) as the last template argument. Because locking is performed at the submap level, a high level of concurrency can still be achieved. Read access can be done safely using if_contains(), which passes a reference value to the callback while holding the submap lock. Similarly, write access can be done safely using modify_if, try_emplace_l or lazy_emplace_l. However, please be aware that iterators or references returned by standard APIs are not protected by the mutex, so they cannot be used reliably on a hash map which can be changed by another thread.
Examples on how to use various mutex types, including boost::mutex, boost::shared_mutex and absl::Mutex can be found in examples/bench.cc

Using the Parallel Hashmap from languages other than C++

While C++ is the native language of the Parallel Hashmap, we welcome bindings making it available for other languages. One such implementation has been created for Python and is described below:

GetPy - A Simple, Fast, and Small Hash Map for Python: GetPy is a thin and robust binding to The Parallel Hashmap (https://github.com/greg7mdp/parallel-hashmap.git) which is the current state of the art for minimal memory overhead and fast runtime speed. The binding layer is supported by PyBind11 (https://github.com/pybind/pybind11.git) which is fast to compile and simple to extend. Serialization is handled by Cereal (https://github.com/USCiLab/cereal.git) which supports streaming binary serialization, a critical feature for the large hash maps this package is designed to support.

Acknowledgements

Many thanks to the Abseil developers for implementing the swiss table and btree data structures (see abseil-cpp) upon which this work is based, and to Google for releasing it as open-source.

Owner

Name: Gregory Popovitch
Login: greg7mdp
Kind: user
Location: Michigan, USA

Twitter: greg7mdp
Repositories: 45
Profile: https://github.com/greg7mdp

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: Popovitch
    given-names: Gregory
    orcid: https://orcid.org/0009-0005-7245-5930
title: "The Parallel Hashmap C++ library"
license: Apache-2.0
version: 2.0.0
date-released: 2025-01-21

GitHub Events

Total

Create event: 10
Release event: 1
Issues event: 33
Watch event: 478
Delete event: 7
Issue comment event: 59
Push event: 18
Pull request review comment event: 4
Pull request review event: 9
Pull request event: 21
Fork event: 48

Last Year

Create event: 10
Release event: 1
Issues event: 33
Watch event: 478
Delete event: 7
Issue comment event: 59
Push event: 18
Pull request review comment event: 4
Pull request review event: 9
Pull request event: 21
Fork event: 48

Committers

Last synced: about 1 year ago

All Time

Total Commits: 686
Total Committers: 30
Avg Commits per committer: 22.867
Development Distribution Score (DDS): 0.096

Past Year

Commits: 26
Committers: 4
Avg Commits per committer: 6.5
Development Distribution Score (DDS): 0.154

Top Committers

Name	Email	Commits
greg	g**p@g**m	620
Brian McKinnon	b**n@g**m	17
sunkaicheng	s**g@b**g	13
Serge Weinstock	s**k@c**m	3
Sergey Fedorov	v**d@g**m	3
Force Charlie	i**b@q**m	2
Ed Catmur	ed@c****k	2
stdpain	d**8@g**m	2
Dennis Nienhüser	n**r@k**g	2
Colin Gaudreau	c**u@g**m	2
Andrew Dean	A**n@m**m	1
lisa0916lisa@163.com	l**u@l**l	1
tingyu	l**u@b**m	1
Ali Ghaffaari	a**i@g**m	1
Binglin Chang	d**y@g**m	1
Colin Gaudreau	c**u@u**m	1
Dheeraj R Reddy	d**y@g**m	1
Force Charlie	c**o@o**m	1
George Cholerton	g**n@g**m	1
Jendrik Seipp	j**p@g**m	1
Kevin Kiningham	k**h@s**u	1
Maikel van den Hurk	m**k@h**m	1
Patrick Fasano	p**k@p**m	1
byronhe	1****e	1
ilobilo	i**5@g**m	1
madisetti	m**i@j**u	1
myozka	m**a@g**m	1
scivision	1****n	1
tang donghai	t**s@g**m	1
yanz	d**7@g**m	1

Committer Domains (Top 20 + Academic)

jhu.edu: 1 patrickfasano.com: 1 stanford.edu: 1 ubisoft.com: 1 bilibili.com: 1 microsoft.com: 1 kde.org: 1 catmur.uk: 1 qq.com: 1 coremont.com: 1 bigo.sg: 1

Issues and Pull Requests

Last synced: 10 months ago

All Time

Total issues: 137
Total pull requests: 46
Average time to close issues: about 1 month
Average time to close pull requests: about 11 hours
Total issue authors: 100
Total pull request authors: 24
Average comments per issue: 6.19
Average comments per pull request: 2.24
Merged pull requests: 38
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 19
Pull requests: 12
Average time to close issues: about 1 month
Average time to close pull requests: about 5 hours
Issue authors: 19
Pull request authors: 5
Average comments per issue: 2.26
Average comments per pull request: 1.58
Merged pull requests: 9
Bot issues: 0
Bot pull requests: 0

View more stats

Top Authors

Issue Authors

gf777 (4)
bpmckinnon (4)
marioroy (4)
HolyBlackCat (4)
jmbnyc (4)
ila (3)
kanonka (3)
fabioparodi (2)
trsh (2)
PietropaoloFrisoni (2)
kc9jud (2)
StavrosMar (2)
OutTheShade (2)
evan-swinney (2)
Jiboxiake (2)

Pull Request Authors

greg7mdp (15)
bpmckinnon (4)
Earthwings (4)
ecatmur (2)
ilobilo (2)
cartoonist (2)
gtarcoder (2)
stdpain (2)
scivision (2)
aviraxp (2)
AgostonSzepessy (2)
dirtysalt (1)
barracuda156 (1)
amitma1 (1)
bashimao (1)

Top Labels

Issue Labels

enhancement (1)

Pull Request Labels

Packages

Total packages: 4
Total downloads:
- homebrew 43 last-month

Total dependent packages: 1
(may contain duplicates)
Total dependent repositories: 0
(may contain duplicates)
Total versions: 20

proxy.golang.org: github.com/greg7mdp/parallel-hashmap

Documentation: https://pkg.go.dev/github.com/greg7mdp/parallel-hashmap#section-documentation
License: apache-2.0
Latest release: v2.0.0+incompatible
published over 1 year ago

Versions: 8
Dependent Packages: 0
Dependent Repositories: 0

Rankings

Dependent packages count: 9.0%

Average: 9.6%

Dependent repos count: 10.2%

Last synced: 10 months ago

spack.io: parallel-hashmap

A family of header-only, very fast and memory-friendly hashmap and btree containers.

Homepage: https://github.com/greg7mdp/parallel-hashmap
License: []
Latest release: 1.3.12
published almost 2 years ago

Versions: 2
Dependent Packages: 1
Dependent Repositories: 0

Rankings

Dependent repos count: 0.0%

Stargazers count: 5.3%

Forks count: 7.3%

Average: 17.7%

Dependent packages count: 58.0%

Last synced: 10 months ago

conda-forge.org: parallel-hashmap

Homepage: https://github.com/greg7mdp/parallel-hashmap
License: Apache-2.0
Latest release: 1.33
published about 5 years ago

Versions: 2
Dependent Packages: 0
Dependent Repositories: 0

Rankings

Stargazers count: 8.9%

Forks count: 12.9%

Average: 26.7%

Dependent repos count: 34.0%

Dependent packages count: 51.2%

Last synced: 10 months ago

formulae.brew.sh: parallel-hashmap

Family of header-only, fast, memory-friendly C++ hashmap and btree containers

Homepage: https://greg7mdp.github.io/parallel-hashmap/
License: Apache-2.0
Latest release: 2.0.0
published over 1 year ago

Versions: 8
Dependent Packages: 0
Dependent Repositories: 0
Downloads: 43 Last month

Rankings

Stargazers count: 11.6%

Forks count: 12.3%

Dependent packages count: 19.0%

Average: 35.6%

Dependent repos count: 50.7%

Downloads: 84.3%

Last synced: 10 months ago

parallel-hashmap

Science Score: 54.0%

Keywords

Repository

Basic Info

Statistics

Topics

Metadata Files

README.md

The Parallel Hashmap

Overview

Parallel-hashmap or GTL?

Fast and memory friendly

Installation

Example

include

include

include

Various hash maps and their pros and cons

Changes to Abseil's hashmaps

Memory usage

Iterator invalidation for hash containers

Iterator invalidation for btree containers

Example 2 - providing a hash function for a user-defined class

include // minimal header providing phmap::HashState()

include

file "Person.h"

include // minimal header providing phmap::HashState()

include

file "main.cpp"

include "Person.h" // defines Person with std::hash specialization

include

include

Thread safety

Using the Parallel Hashmap from languages other than C++

Acknowledgements

Owner

Citation (CITATION.cff)

GitHub Events

Total

Last Year

Committers

All Time

Past Year

Top Committers

Committer Domains (Top 20 + Academic)

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

Packages

proxy.golang.org: github.com/greg7mdp/parallel-hashmap

Rankings

spack.io: parallel-hashmap

Rankings

conda-forge.org: parallel-hashmap

Rankings

formulae.brew.sh: parallel-hashmap

Rankings

Dependencies