[red-knot] gather type prevalence statistics #15834

carljm · 2025-01-30T18:48:46Z

Something Alex and I threw together during our 1:1 this morning. Allows us to collect statistics on the prevalence of various types in a file, most usefully TODO types or other dynamic types.

carljm · 2025-01-30T19:01:35Z

Hmm, the test failure here is very odd! Test definitely passes locally. Will have to look into this later.

AlexWaygood · 2025-01-30T21:44:43Z

That is really weird. It passes for me locally too (and it's obviously passing on all but one of our CI jobs).

That said, the statistic given by the one job that's failing is a statistic that makes more sense intuitively to me...

carljm · 2025-01-31T00:04:06Z

The issue is that in a release build we intentionally (for perf reasons) omit tracking of distinct messages and file-and-line information for Todo types, so a release build produces fewer distinct kinds of Todo type. This interacts with a second bug: we are using FxHashMap::extend wrongly to combine statistics from different scopes. When a key exists in both maps, extend overwrites the existing count instead of summing the two. That bug only shows up if the same type occurs in more than one scope, which only happens in our tests in release mode, where all Todo types are the same.
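To illustrate the merging bug, here is a minimal sketch rather than the actual red-knot code. It assumes the statistics are a map from a type's display name to a count, and it uses std::collections::HashMap as a stand-in for FxHashMap (both implement Extend the same way). The names Counts, counts, merge_with_extend and merge_by_summing are hypothetical.

```rust
use std::collections::HashMap;

/// Hypothetical stand-in for per-scope statistics: type name -> occurrence count.
type Counts = HashMap<String, u32>;

/// Hypothetical helper that builds a `Counts` map from literal pairs.
fn counts(pairs: &[(&str, u32)]) -> Counts {
    pairs.iter().map(|(k, v)| (k.to_string(), *v)).collect()
}

/// Buggy merge: `extend` inserts each (key, value) pair, so a key present in
/// both maps ends up with the count from `other` instead of the sum.
fn merge_with_extend(mut total: Counts, other: Counts) -> Counts {
    total.extend(other);
    total
}

/// Summing merge: add each count to the existing entry, starting from zero
/// when the key is not present yet.
fn merge_by_summing(mut total: Counts, other: Counts) -> Counts {
    for (ty, count) in other {
        *total.entry(ty).or_default() += count;
    }
    total
}
```

With one scope holding two occurrences of a type and another scope holding three of the same type, merge_with_extend reports three while merge_by_summing reports five. If every scope contributes only types that no other scope has, the two functions agree, which is why the bug can stay hidden.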

carljm · 2025-01-31T00:32:01Z

Pushed a commit that fixes statistics merging, and fixes the tests.

It remains the case that we can't add any tests specifically testing that the statistics for "different" Todo types are differentiated, or those tests will fail on a release build. (We could switch any such tests off in release builds.) We can still build features making use of this differentiation, but they will only work in debug builds.
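One way to switch such a test off in release builds is sketched below. It assumes the debug/release difference is keyed on Rust's built-in debug_assertions cfg, which may not be how red-knot gates it, and todo_label is a hypothetical stand-in for however a Todo type is rendered.

```rust
/// Hypothetical stand-in for how a Todo type is rendered: the message is kept
/// only when debug assertions are enabled (the default for debug builds).
fn todo_label(message: &str) -> String {
    if cfg!(debug_assertions) {
        format!("@Todo({})", message)
    } else {
        "@Todo".to_string()
    }
}

/// Compiled only when debug assertions are on, so a release test run skips it
/// instead of failing.
#[test]
#[cfg(debug_assertions)]
fn distinct_todo_messages_are_distinguished() {
    assert_ne!(todo_label("generics"), todo_label("overloads"));
}
```

Under this assumption, any feature that relies on telling Todo types apart would need the same gating, since all labels collapse to one value in a release build.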

[red-knot] gather type prevalence statistics

0951a6d

carljm added the red-knot Multi-file analysis & type inference label Jan 30, 2025

carljm requested review from MichaReiser, AlexWaygood and sharkdp as code owners January 30, 2025 18:48

fix summing TypeStatistics

c6d8206
