Auto sharding Datalog store #303

huahaiy · 2025-01-20T16:18:29Z

To enhance scalability of Datalevin Datalog store, we can distribute data into multiple LMDB files based on hashing of entity id, so data are evenly distributed. Transaction isolation is still managed in the scope of a transaction in a single meta info LMDB, so ACID can be retained.

Because LMDB is B-tree based and has a single writer, as the data volume grows larger, everything get slower and slower. Automatically sharding the data according to data volume also allows concurrent storage as we can write to different shards at the same time. This should improve read/write performance at high data volume.

huahaiy · 2025-01-25T23:47:41Z

Use this https://en.wikipedia.org/wiki/Rendezvous_hashing

huahaiy added the enhancement New feature or request label Jan 20, 2025

huahaiy added this to the 1.0.0 milestone Jan 20, 2025

huahaiy removed this from the 1.0.0 milestone Feb 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Auto sharding Datalog store #303

Auto sharding Datalog store #303

huahaiy commented Jan 20, 2025

huahaiy commented Jan 25, 2025

Auto sharding Datalog store #303

Auto sharding Datalog store #303

Comments

huahaiy commented Jan 20, 2025

huahaiy commented Jan 25, 2025