chore: Aggregation groupings for by() and without() #19928

spiridonov · 2025-11-19T20:46:26Z

What this PR does / why we need it:

This PR introduces proper groupings (a list of columns and a mode by/without). Previously it was represented only a s list of columns and without was not supported at all.

aggregator is changed to aggregate by an arbitrary variable list of labels. The resulted record will have a union of all columns seen during aggregation.
Pushdown optimizations are changed. For example, if a range aggregation has without () we have to read all columns from data objects and nothing can be pushed down.
Printers are changed to reflect gropings mode.
Proto changes for physical plans.

Which issue(s) this PR fixes:
Fixes #

Special notes for your reviewer:

Checklist

Reviewed the CONTRIBUTING.md guide (required)
Documentation added
Tests updated
Title matches the required conventional commits format, see here
- Note that Promtail is considered to be feature complete, and future development for logs collection will be in Grafana Alloy. As such, feat PRs are unlikely to be accepted unless a case can be made for the feature actually being a bug fix to existing behavior.
Changes that require user attention or interaction to upgrade are documented in docs/sources/setup/upgrade/_index.md
If the change is deprecating or removing a configuration option, update the deprecated-config.yaml and deleted-config.yaml files respectively in the tools/deprecated-config-checker directory. Example PR

…uncs

…spiridonov-more-funcs

ashwanthgoli · 2025-11-25T13:04:19Z

pkg/engine/compat.go


 		// One of the parsed columns
-		case ident.ColumnType() == types.ColumnTypeParsed:
+		case ident.ColumnType() == types.ColumnTypeParsed || (ident.ColumnType() == types.ColumnTypeGenerated &&


is this added to handle error columns that are of type Generated?

ashwanthgoli · 2025-11-25T13:05:03Z

pkg/logql/bench/generator_query.go

 			fmt.Sprintf(`count_over_time(%s[%s])`, selector, rangeInterval),
 			fmt.Sprintf(`count_over_time(%s | detected_level=~"error|warn" [%s])`, selector, rangeInterval),
 			fmt.Sprintf(`count_over_time(%s |= "level" [%s])`, selector, rangeInterval),
+			//fmt.Sprintf(`avg_over_time(%s | json | unwrap rows_affected [%s])`, selector, rangeInterval),


i think we should uncomment these if tests are passing or do that in a follow-up after fixing the correctness issues

ashwanthgoli · 2025-11-25T13:06:34Z

pkg/engine/internal/types/grouping.go

+
+const (
+	GroupingModeInvalid         GroupingMode = iota
+	GroupingModeByEmptySet                   // Grouping by empty label set: <operation> by () (<expr>)


do we need 4 of these? GroupingModeByEmptySet is the same as GroupingModeByLabelSet without any groupings right

This can also be a flag. without=(true|false)

ashwanthgoli · 2025-11-25T13:08:08Z

pkg/engine/internal/proto/physicalpb/physicalpb.proto


  // Aggregation operation to perform over the underlying range vector.
-  AggregateVectorOp operation = 2;
+  AggregateVectorOp operation = 3;


nit: no need to reorder this entry

ashwanthgoli · 2025-11-25T13:18:09Z

pkg/engine/internal/executor/aggregator.go

+		panic("len(labels) != len(labelValues)")
+	}
+
+	for _, label := range labels {


this can be expensive as we run it for each row. can we check the benchmarks for a metric test with this change?

ashwanthgoli · 2025-11-25T13:18:25Z

pkg/engine/internal/executor/aggregator_test.go


 	t.Run("basic SUM aggregation with record building", func(t *testing.T) {
-		agg := newAggregator(groupBy, 10, aggregationOperationSum)
+		agg := newAggregator(10, aggregationOperationSum)


can we add more aggregation tests that call Add() with different labels as the existing ones assume the same labels each time?

ashwanthgoli · 2025-11-25T13:18:42Z

pkg/engine/internal/executor/range_aggregation.go

 	// rangeAggregationOperations holds the mapping of range aggregation types to operations for an aggregator.
 	rangeAggregationOperations = map[types.RangeAggregationType]aggregationOperation{
 		types.RangeAggregationTypeSum:   aggregationOperationSum,
 		types.RangeAggregationTypeCount: aggregationOperationCount,


any reason why these are removed?

ashwanthgoli · 2025-11-25T13:25:30Z

pkg/engine/internal/executor/range_aggregation.go


 				for _, w := range windows {
-					r.aggregator.Add(w.end, value, labelValues)
+					r.aggregator.Add(w.end, value, labels, labelValues)


as we know the labels of a record, can we give a hint to the aggregator once per record to update its label set instead of doing it once per row?

spiridonov added 5 commits November 17, 2025 11:57

aggregation groupings

741be75

Merge branch 'main' of github.com:grafana/loki into spiridonov-more-f…

296a9b8

…uncs

hacks

12de19e

Merge branch 'main' of github.com:grafana/loki into spiridonov-more-f…

bcf4455

…uncs

lint

91586e7

pull-request-size bot added the size/XXL label Nov 19, 2025

spiridonov added 5 commits November 20, 2025 09:54

part 1

a6f32f6

Merge branch 'main' of github.com:grafana/loki into spiridonov-more-f…

49bbec7

…uncs

rollback

ef010e5

Merge branch 'main' of github.com:grafana/loki into spiridonov-more-f…

223ad95

…uncs

minor

bd180c2

spiridonov changed the title ~~chore: Aggregation grouping and min/max/avg functions~~ chore: Aggregation groupings for by() and without() Nov 20, 2025

Merge branch 'main' of github.com:grafana/loki into spiridonov-more-f…

47d3764

…uncs

spiridonov marked this pull request as ready for review November 20, 2025 15:42

spiridonov requested a review from a team as a code owner November 20, 2025 15:42

spiridonov added 6 commits November 20, 2025 10:43

lint

74db8d2

Merge branch 'main' into spiridonov-more-funcs

a6616cd

Merge branch 'main' of github.com:grafana/loki into spiridonov-more-f…

195e774

…uncs

Merge branch 'spiridonov-more-funcs' of github.com:grafana/loki into …

166ac75

…spiridonov-more-funcs

Merge branch 'main' into spiridonov-more-funcs

c35dd89

Merge branch 'main' into spiridonov-more-funcs

de52aa3

ashwanthgoli reviewed Nov 25, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

chore: Aggregation groupings for by() and without() #19928

chore: Aggregation groupings for by() and without() #19928

Uh oh!

spiridonov commented Nov 19, 2025 •

edited

Loading

Uh oh!

ashwanthgoli Nov 25, 2025

Uh oh!

ashwanthgoli Nov 25, 2025

Uh oh!

ashwanthgoli Nov 25, 2025

Uh oh!

ashwanthgoli Nov 25, 2025

Uh oh!

ashwanthgoli Nov 25, 2025

Uh oh!

ashwanthgoli Nov 25, 2025

Uh oh!

ashwanthgoli Nov 25, 2025

Uh oh!

ashwanthgoli Nov 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

chore: Aggregation groupings for by() and without() #19928

Are you sure you want to change the base?

chore: Aggregation groupings for by() and without() #19928

Uh oh!

Conversation

spiridonov commented Nov 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

spiridonov commented Nov 19, 2025 •

edited

Loading