[SPARK] Set Auto Bucket Partitioner to be the default partitioning strategy #252
✅ Deploy Preview for docs-spark-connector ready!
LGTM with a small fix.
The ``AutoBucketPartitioner`` is the default partitioner configuration. It
samples the data to generate partitions and uses
the :manual:`$bucketAuto </reference/operator/aggregation/bucketAuto/>`
aggregation stage to paginate. By using this configuration, you can partition
the data across single or multiple fields, including nested fields.

.. note:: Compound Keys

   The ``AutoBucketPartitioner`` configuration requires {+mdb-server+} version
   7.0 or higher to support compound keys.
I moved the entire AutoBucketPartitioner section to the top because it's the default, so the whole section shows up as new. The only content I've actually changed is this highlighted section and the SamplePartitioner section below.
The ``SamplePartitioner`` configuration is similar to the
:ref:`AutoBucketPartitioner <conf-autobucketpartitioner>` configuration, but
does not use the ``$bucketAuto`` aggregation stage. This
configuration lets you specify a partition field, partition size, and number of
samples per partition.
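For context, the three ``SamplePartitioner`` settings named above could be assembled roughly as follows. This is a minimal sketch and not part of the PR: the helper name `sample_partitioner_options`, the field `order_id`, and the chosen values are hypothetical. The option keys and class name are assumed to match the connector's read configuration reference and should be checked against it.

```python
# Hypothetical helper (not part of the connector): builds read options
# that select the SamplePartitioner and tune its three settings.
# The class name and option keys are assumptions; verify them against
# the connector's read configuration reference.
SAMPLE_PARTITIONER = (
    "com.mongodb.spark.sql.connector.read.partitioner.SamplePartitioner"
)


def sample_partitioner_options(field="_id", size_mb=64, samples_per_partition=10):
    """Return read options as a plain dict of string values."""
    if size_mb <= 0 or samples_per_partition <= 0:
        raise ValueError("size_mb and samples_per_partition must be positive")
    return {
        "partitioner": SAMPLE_PARTITIONER,
        "partitioner.options.partition.field": field,
        "partitioner.options.partition.size": str(size_mb),
        "partitioner.options.samples.per.partition": str(samples_per_partition),
    }


options = sample_partitioner_options(field="order_id", size_mb=32)
# With PySpark and the connector installed, these might be passed as:
#   spark.read.format("mongodb").options(**options).load()
```

Keeping the options in a plain dict makes it easy to switch back to the default partitioner by omitting the `partitioner` key.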
Also modified, see note above.
LGTM!
Pull Request Info
PR Reviewing Guidelines
JIRA - https://jira.mongodb.org/browse/DOCSP-49645
Staging Links
Self-Review Checklist