Auto sharding Datalog store #303

Open
huahaiy opened this issue Jan 20, 2025 · 1 comment
Labels
enhancement New feature or request

Comments


huahaiy commented Jan 20, 2025

To enhance the scalability of the Datalevin Datalog store, we can distribute data across multiple LMDB files by hashing the entity id, so that data are evenly distributed. Transaction isolation would still be managed within the scope of a single transaction in one meta-info LMDB, so ACID guarantees can be retained.
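A minimal sketch of the routing idea, in Python for illustration only. It assumes entity ids are non-negative integers and that the number of shards is fixed; the names `mix64`, `shard_for`, and `shard_counts` are hypothetical and not part of Datalevin. The mixing function is the SplitMix64 finalizer, swapped in here as one example of a stable hash; the issue does not specify which hash would be used.

```python
from collections import Counter

MASK64 = (1 << 64) - 1


def mix64(x: int) -> int:
    """SplitMix64 finalizer: scrambles a 64-bit integer so nearby ids spread out."""
    x = (x + 0x9E3779B97F4A7C15) & MASK64
    x = ((x ^ (x >> 30)) * 0xBF58476D1CE4E5B9) & MASK64
    x = ((x ^ (x >> 27)) * 0x94D049BB133111EB) & MASK64
    return x ^ (x >> 31)


def shard_for(entity_id: int, num_shards: int) -> int:
    """Map an entity id to a shard index in [0, num_shards). Hypothetical helper."""
    if num_shards <= 0:
        raise ValueError("num_shards must be positive")
    return mix64(entity_id) % num_shards


def shard_counts(entity_ids, num_shards: int) -> Counter:
    """Count how many of the given entity ids land in each shard."""
    return Counter(shard_for(e, num_shards) for e in entity_ids)
```

Because the shard is a function of the entity id alone, every datom of a given entity would land in the same file under this scheme, so entity-level lookups touch one shard. Calling `shard_counts(range(10_000), 4)` gives a quick view of how evenly a run of sequential ids spreads.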

Because LMDB is B-tree based and has a single writer, everything gets slower as the data volume grows. Automatically sharding the data according to data volume also allows concurrent writes, since different shards can be written to at the same time. This should improve read/write performance at high data volumes.
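The concurrency argument can be sketched as follows, again in Python and under loud assumptions: each shard is modeled as an in-memory dict guarded by its own lock, standing in for one LMDB file with its single writer. `ShardedStore` and `write_batch` are hypothetical names, plain modulo is a placeholder for the real hash, and the meta-info LMDB that would coordinate the transaction is not modeled.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


class ShardedStore:
    """Toy stand-in: one dict plus one writer lock per shard (not real LMDB)."""

    def __init__(self, num_shards: int):
        self.num_shards = num_shards
        self.shards = [dict() for _ in range(num_shards)]
        self.locks = [threading.Lock() for _ in range(num_shards)]

    def shard_for(self, entity_id: int) -> int:
        # Placeholder routing; a real design would hash the entity id.
        return entity_id % self.num_shards

    def _write_shard(self, idx: int, datoms) -> int:
        # Only one writer per shard at a time, mirroring LMDB's single writer.
        with self.locks[idx]:
            for e, a, v in datoms:
                self.shards[idx].setdefault(e, {})[a] = v
        return len(datoms)

    def write_batch(self, datoms) -> int:
        """Group (entity, attribute, value) triples by shard, then write shards in parallel."""
        by_shard = {}
        for e, a, v in datoms:
            by_shard.setdefault(self.shard_for(e), []).append((e, a, v))
        with ThreadPoolExecutor(max_workers=self.num_shards) as pool:
            futures = [pool.submit(self._write_shard, idx, items)
                       for idx, items in by_shard.items()]
            return sum(f.result() for f in futures)
```

Writers to different shards never contend for the same lock, which is the property the proposal relies on. This sketch shows the structure only; it does not demonstrate a speedup, and it leaves out splitting shards as volume grows and committing atomically across shards.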

@huahaiy huahaiy added the enhancement New feature or request label Jan 20, 2025
@huahaiy huahaiy added this to the 1.0.0 milestone Jan 20, 2025

huahaiy commented Jan 25, 2025

@huahaiy huahaiy removed this from the 1.0.0 milestone Feb 7, 2025