Skip to content

Conversation

@masseyke
Copy link
Member

@masseyke masseyke commented Oct 29, 2025

This change checks for a sampling config associated with any index a request has passed through (as opposed to just the index the user initially sent the data to). Regardless of which sample matches (and multiple of them can), it stores the original user data. If an index request is rerouted to multiple indices, it will be evaluated against the sampling configuration of each index it is rerouted to, meaning that the original document could wind up in multiple samples.

@masseyke masseyke added >non-issue :Data Management/Ingest Node Execution or management of Ingest Pipelines including GeoIP v9.3.0 labels Oct 29, 2025
@masseyke masseyke requested a review from seanzatzdev October 30, 2025 19:01
@masseyke masseyke marked this pull request as ready for review October 30, 2025 19:01
@elasticsearchmachine elasticsearchmachine added the Team:Data Management Meta label for data/management team label Oct 30, 2025
@elasticsearchmachine
Copy link
Collaborator

Pinging @elastic/es-data-management (Team:Data Management)

@elastic elastic locked and limited conversation to collaborators Oct 30, 2025
@elastic elastic unlocked this conversation Oct 30, 2025
Copy link
Contributor

@seanzatzdev seanzatzdev left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the earlier discussion offline, lgtm!!

@masseyke masseyke changed the title Prototype of random sampling of child streams Evaluate random samples for all rerouted indices Oct 30, 2025
@masseyke masseyke merged commit fa252fa into elastic:main Oct 30, 2025
35 checks passed
@masseyke masseyke deleted the random-sampling-multiple-configs-prototype branch October 30, 2025 20:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

:Data Management/Ingest Node Execution or management of Ingest Pipelines including GeoIP >non-issue Team:Data Management Meta label for data/management team v9.3.0

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants