[SPARK] Set Auto Bucket Partitioner to be the default partitioning strategy #252

rachel-mack · 2025-05-08T13:25:17Z

Pull Request Info

PR Reviewing Guidelines

JIRA - https://jira.mongodb.org/browse/DOCSP-49645

Staging Links

batch-mode/batch-read-config

release-notes

Self-Review Checklist

Is this free of any warnings or errors in the RST?
Did you run a spell-check?
Did you run a grammar-check?
Are all the links working?
Are the facets and meta keywords accurate?
Are the page titles greater than 20 characters long and SEO relevant?

netlify · 2025-05-08T13:25:28Z

✅ Deploy Preview for docs-spark-connector ready!

Name	Link
🔨 Latest commit	`fef704a`
🔍 Latest deploy log	https://app.netlify.com/sites/docs-spark-connector/deploys/681d005d77d56b0008b53e5d
😎 Deploy Preview	https://deploy-preview-252--docs-spark-connector.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

stephmarie17

LGTM with a small fix.

source/batch-mode/batch-read-config.txt

rachel-mack · 2025-05-08T19:03:27Z

source/batch-mode/batch-read-config.txt

+The ``AutoBucketPartitioner`` is the default partitioner configuration. It
+samples the data to generate partitions and uses
+the :manual:`$bucketAuto </reference/operator/aggregation/bucketAuto/>`
+aggregation stage to paginate. By using this configuration, you can partition
+the data across single or multiple fields, including nested fields.
+
+.. note:: Compound Keys
+
+  The ``AutoBucketPartitioner`` configuration requires {+mdb-server+} version
+  7.0 or higher to support compound keys.


I moved the entire AutoBucketPartitioner section to the top because it's the default, so the entire section is showing up as new, but this highlighted section, and the SamplePartitioner section below are the only content I've changed.

rachel-mack · 2025-05-08T19:07:32Z

source/batch-mode/batch-read-config.txt

+The ``SamplePartitioner`` configuration is similar to the
+:ref:`AutoBucketPartitioner <conf-autobucketpartitioner>` configuration, but
+does not use the ``$bucketAuto`` aggregation stage. This
+configuration lets you specify a partition field, partition size, and number of
+samples per partition. 


Also modified, see note above.

rozza

LGTM!

default partitioner

70557c6

rachel-mack added 3 commits May 8, 2025 09:47

formatting

fd280b5

tweak

4e67ed5

move example

0dbc436

rachel-mack marked this pull request as ready for review May 8, 2025 14:40

stephmarie17 approved these changes May 8, 2025

View reviewed changes

source/batch-mode/batch-read-config.txt Outdated Show resolved Hide resolved

release note

9e9f4e7

rachel-mack commented May 8, 2025

View reviewed changes

typo

fef704a

rachel-mack commented May 8, 2025

View reviewed changes

rozza approved these changes May 12, 2025

View reviewed changes

rachel-mack merged commit c879f28 into mongodb:master May 12, 2025
6 checks passed

rachel-mack deleted the DOCSP-49645 branch May 12, 2025 12:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK] Set Auto Bucket Partitioner to be the default partitioning strategy #252

[SPARK] Set Auto Bucket Partitioner to be the default partitioning strategy #252

rachel-mack commented May 8, 2025 •

edited by github-actions bot

Loading

Uh oh!

netlify bot commented May 8, 2025 •

edited

Loading

Uh oh!

stephmarie17 left a comment

Uh oh!

Uh oh!

rachel-mack May 8, 2025

Uh oh!

rachel-mack May 8, 2025

Uh oh!

rozza left a comment

Uh oh!

Uh oh!

Uh oh!

[SPARK] Set Auto Bucket Partitioner to be the default partitioning strategy #252

[SPARK] Set Auto Bucket Partitioner to be the default partitioning strategy #252

Conversation

rachel-mack commented May 8, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Info

Staging Links

Self-Review Checklist

Uh oh!

netlify bot commented May 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for docs-spark-connector ready!

Uh oh!

stephmarie17 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rachel-mack May 8, 2025

Choose a reason for hiding this comment

Uh oh!

rachel-mack May 8, 2025

Choose a reason for hiding this comment

Uh oh!

rozza left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

rachel-mack commented May 8, 2025 •

edited by github-actions bot

Loading

netlify bot commented May 8, 2025 •

edited

Loading