Revised connection strings, improved performance #61

Merged (2 commits) on Feb 19, 2025

@dpp (Contributor) commented Feb 17, 2025

💻 Description of Change(s) (w/ context)

Revised the connection strings to use text forms such as contained:up rather than glyphs.

Added support for saving bundles every n items processed; n defaults to 50,000.

Parallelized some caching operations (MD5 hashing of the index and pre-caching CBOR for items).
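
The periodic bundle saving described above might look roughly like the following. This is a minimal sketch, not the code in this PR: `BundleSaver`, `processAll`, `saveEvery`, and `saveBundle` are hypothetical names, and the only detail taken from the description is the default of 50,000 items.

```scala
// Hypothetical sketch: none of these names come from the PR itself.
object BundleSaver {
  // Default taken from the PR description (save every 50,000 items).
  val DefaultSaveEvery: Int = 50000

  /** Walks `items` in order and hands each group of at most `saveEvery`
    * items to `saveBundle`. The final group may be smaller.
    * Returns the number of bundles saved.
    */
  def processAll[A](items: Iterator[A], saveEvery: Int = DefaultSaveEvery)(
      saveBundle: Seq[A] => Unit
  ): Int = {
    require(saveEvery > 0, "saveEvery must be positive")
    var bundles = 0
    // Iterator.grouped yields consecutive chunks of up to `saveEvery` items.
    items.grouped(saveEvery).foreach { chunk =>
      saveBundle(chunk.toSeq)
      bundles += 1
    }
    bundles
  }
}
```

Working from an `Iterator` keeps only one chunk in memory at a time, which is the usual reason to save incrementally rather than at the end.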

🧠 Rationale Behind Change(s)

Why were these changes made? What tradeoffs were considered?

📝 Test Plan

Passes tests, and Big Tent can read and process the created Clusters.

dpp added 2 commits February 15, 2025 15:59
…Added build information to the Cluster envelope. Capped the number of items processed in a batch; if the cap is exceeded, the batch is broken into sub-batches. Used Scala parallel collections to speed up MD5 hashing of the index and to pre-cache CBOR for items.

Signed-off-by: David Pollak <[email protected]>
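
For readers unfamiliar with the hashing speed-up mentioned in the commit message, here is a rough illustration of computing MD5 digests concurrently. The commit uses Scala parallel collections; this sketch swaps in `Future`s so that it runs with only the standard library, and `ParallelMd5`, `md5Hex`, and `hashAll` are hypothetical names that do not appear in the PR.

```scala
import java.security.MessageDigest
import scala.concurrent.{Await, Future}
import scala.concurrent.ExecutionContext.Implicits.global
import scala.concurrent.duration._

// Hypothetical sketch: uses Futures instead of the parallel collections named in the commit.
object ParallelMd5 {
  /** Lower-case hex MD5 of a byte array. A fresh MessageDigest is created
    * per call because MessageDigest instances are not thread-safe.
    */
  def md5Hex(bytes: Array[Byte]): String =
    MessageDigest
      .getInstance("MD5")
      .digest(bytes)
      .map(b => f"${b & 0xff}%02x")
      .mkString

  /** Hashes every payload on the global execution context and returns the
    * digests in the same order as the input.
    */
  def hashAll(payloads: Seq[Array[Byte]]): Seq[String] = {
    val futures = payloads.map(p => Future(md5Hex(p)))
    Await.result(Future.sequence(futures), 1.minute)
  }
}
```

With the separate scala-parallel-collections module on the classpath, the body of `hashAll` could instead be written with `.par`, which is closer to what the commit message describes.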
@dpp dpp merged commit 19ecbd0 into main Feb 19, 2025
2 checks passed
@dpp dpp deleted the dpp/better_connections branch February 19, 2025 15:52

3 participants