-
Notifications
You must be signed in to change notification settings - Fork 1.5k
feat: support ApproxDistinct with utf8view #15200
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: support ApproxDistinct with utf8view #15200
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you @zhuqi-lucas -- looks good to me
query III | ||
SELECT c2, approx_distinct(c1), approx_distinct(c5) FROM aggregate_test_100_utf8view GROUP BY c2 ORDER BY c2; | ||
---- | ||
1 5 22 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
it is the same as above 💯
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @zhuqi-lucas , LGTM. I left some comments for you to consider.
where | ||
T: OffsetSizeTrait, | ||
{ | ||
/// new approx_distinct accumulator |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I understand you follow the previous pattern, but the comment seems unclear vague
/// new approx_distinct accumulator |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good point @Weijun-H , addressed in latest PR.
c5 | ||
FROM aggregate_test_100; | ||
|
||
# Test approx_distinct for varchar / int |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
# Test approx_distinct for varchar / int | |
# Test approx_distinct for varchar(with Utf8View) / int |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you @Weijun-H , addressed in latest PR.
Thanks @zhuqi-lucas and @alamb |
Which issue does this PR close?
VARCHAR
fromUtf8
toUtf8View
#15096Rationale for this change
support ApproxDistinct with utf8view
What changes are included in this PR?
support ApproxDistinct with utf8view
Are these changes tested?
Yes
Are there any user-facing changes?
support ApproxDistinct with utf8view