Optimize `std::transform` for `vector<bool>` #5769

AlexGuteniev · 2025-10-07T08:02:01Z

Towards #625, specifically #625 (comment) items 1 and 2.

🦖 Optimization

When a standard functor, either transparent or integer-specialized, is passed to `transform` and all of the iterators are `vector<bool>` iterators, map that functor to a bitwise one that operates on the underlying type.
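A minimal sketch of why such a mapping is valid, assuming a 32-bit word holds the packed bits. The names `word`, `apply_bitwise` and `apply_per_bit` are hypothetical and are not the ones used in this PR; the point is only that a logical functor applied bit by bit gives the same word as the matching bitwise operation applied once.

```cpp
#include <cstdint>
#include <functional>

// Hypothetical stand-in for the underlying word type of vector<bool>.
using word = std::uint32_t;

// Bitwise counterparts of the logical functors, applied to a whole word at once.
constexpr word apply_bitwise(std::logical_and<>, word a, word b) { return a & b; }
constexpr word apply_bitwise(std::logical_or<>, word a, word b) { return a | b; }
constexpr word apply_bitwise(std::equal_to<>, word a, word b) { return ~(a ^ b); } // XNOR
constexpr word apply_bitwise(std::not_equal_to<>, word a, word b) { return a ^ b; }

// Reference implementation: call the functor on each pair of bits separately.
template <class Fn>
constexpr word apply_per_bit(Fn fn, word a, word b) {
    word result = 0;
    for (int i = 0; i < 32; ++i) {
        const bool x = ((a >> i) & 1u) != 0;
        const bool y = ((b >> i) & 1u) != 0;
        if (fn(x, y)) {
            result |= word{1} << i;
        }
    }
    return result;
}

// Both approaches agree, but the bitwise one handles 32 elements per operation.
static_assert(apply_bitwise(std::logical_and<>{}, 0b1100u, 0b1010u)
                  == apply_per_bit(std::logical_and<>{}, 0b1100u, 0b1010u),
    "logical_and on bits matches & on words");
```

Note that `equal_to` needs an XNOR rather than a plain XOR, which matches the `_Bit_xnor` mapping visible in the `<functional>` diff.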

The mapping is done via template specialization rather than `if constexpr`, so that the dispatch works even when `<functional>` is not included and the functors are not defined.
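The idea can be sketched roughly as follows. All names here (`map_vb_functor`, `has_word_fast_path`, `my_logical_and`, `bit_and_op`) are hypothetical and only illustrate the pattern: the primary template can live next to the algorithm and knows nothing about any functor, while each specialization lives next to the functor it maps.

```cpp
#include <type_traits>

// Part 1: what the algorithm side needs. No functor is named here,
// so this compiles without the header that defines the functors.
template <class Fn>
struct map_vb_functor {
    using type = void; // void means "no bitwise fast path for Fn"
};

template <class Fn>
constexpr bool has_word_fast_path =
    !std::is_void<typename map_vb_functor<Fn>::type>::value;

// Part 2: what the functor side adds, next to the functor definitions.
struct my_logical_and { // stand-in for a standard logical functor
    constexpr bool operator()(bool a, bool b) const { return a && b; }
};

struct bit_and_op { // the word-wise replacement
    constexpr unsigned operator()(unsigned a, unsigned b) const { return a & b; }
};

template <>
struct map_vb_functor<my_logical_and> {
    using type = bit_and_op;
};

// A functor without a specialization simply falls back to the generic path.
struct unrelated_functor {
    constexpr bool operator()(bool, bool) const { return false; }
};
```

With `if constexpr (is_same_v<Fn, some_functor>)` the algorithm itself would have to name every functor; with specializations it only names the primary template. The specialization has to be visible before the mapping is first instantiated for that functor.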

Only do this when the offset is zero. Supporting every possible combination of offsets adds a lot of complexity for little gain. Remember `copy`.

Extract pointers from the iterators to help the compiler auto-vectorize. Yes, it does not auto-vectorize when the loop uses the whole iterators. Auto-vectorization needs loops written in the simplest possible way.
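A sketch of the kind of loop this aims for, assuming the words are 32-bit and have already been extracted from the iterators as raw pointers. `word` and `transform_words` are hypothetical names, not the PR's helpers.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

using word = std::uint32_t; // assumed underlying word type

// A plain pointer loop with one trip count and no per-bit bookkeeping.
// Loops of this shape are what auto-vectorizers usually recognize.
// Precondition: [first1, last1) is a valid range of words.
template <class BitOp>
word* transform_words(const word* first1, const word* last1, const word* first2, word* dest, BitOp op) {
    assert(first1 <= last1);
    for (; first1 != last1; ++first1, ++first2, ++dest) {
        *dest = op(*first1, *first2);
    }
    return dest;
}
```

A caller would pass something like `v.data()` for a `std::vector<word>`; for `vector<bool>` the pointers would have to come from the implementation's internal storage. Whether a given compiler actually vectorizes this depends on the optimization flags and the operation.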

Don't call `transform` again: the operation is simple, and this avoids unnecessary recursion.

~~Don't process tails explicitly, yield to the existing loop for now.~~
Actually, let's go for it, it is not that hard. Process the tails by applying a bit mask.
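One way to picture the tail handling, assuming 32-bit words and a zero starting offset. `word`, `low_bits_mask` and `transform_bits` are hypothetical names for this sketch, not the functions added by the PR.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

using word = std::uint32_t; // assumed underlying word type
constexpr std::size_t bits_per_word = 32;

// Mask with the lowest n bits set. n must be below the word width,
// because shifting by the full width is undefined behavior.
inline word low_bits_mask(std::size_t n) {
    assert(n < bits_per_word);
    return (word{1} << n) - 1;
}

// Applies op to the first bit_count bits of a and b, writing to dest.
template <class BitOp>
void transform_bits(const word* a, const word* b, word* dest, std::size_t bit_count, BitOp op) {
    const std::size_t full_words = bit_count / bits_per_word;
    const std::size_t tail_bits  = bit_count % bits_per_word;

    // Whole words need no masking.
    for (std::size_t i = 0; i != full_words; ++i) {
        dest[i] = op(a[i], b[i]);
    }

    // Tail: write only the low tail_bits bits, keep the rest of dest intact.
    if (tail_bits != 0) {
        const word mask   = low_bits_mask(tail_bits);
        const word result = op(a[full_words], b[full_words]);
        dest[full_words]  = (dest[full_words] & ~mask) | (result & mask);
    }
}
```

The mask matters for the destination: bits past the end of the output range may belong to elements that must not change, so they are preserved rather than overwritten.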

Don't do ranges yet. Other `vector<bool>` optimizations don't do them either. It is getting complicated, so instead of handling ranges separately, we should finally look into #1754.

🏁 Benchmark

Feed the randomizer with some seed to make the inputs different 🐦

Since (auto-)vectorization is (expected to be) engaged, use an alignment-controlling allocator.
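For readers unfamiliar with the technique, an alignment-controlling allocator could look roughly like the sketch below. This is not the allocator used by the benchmark (which is not shown here); `aligned_allocator` is a hypothetical name, and the code assumes C++17 aligned `operator new`.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <new>
#include <vector>

// Minimal allocator that returns storage aligned to Align bytes.
template <class T, std::size_t Align>
struct aligned_allocator {
    static_assert((Align & (Align - 1)) == 0 && Align >= alignof(T),
        "Align must be a power of two and at least alignof(T)");

    using value_type = T;

    // rebind is spelled out because of the non-type template parameter.
    template <class U>
    struct rebind {
        using other = aligned_allocator<U, Align>;
    };

    aligned_allocator() = default;

    template <class U>
    aligned_allocator(const aligned_allocator<U, Align>&) noexcept {}

    T* allocate(std::size_t n) {
        assert(n <= static_cast<std::size_t>(-1) / sizeof(T)); // no size overflow
        return static_cast<T*>(::operator new(n * sizeof(T), std::align_val_t{Align}));
    }

    void deallocate(T* p, std::size_t) noexcept {
        ::operator delete(p, std::align_val_t{Align});
    }
};

// The allocator is stateless, so all instances compare equal.
template <class T, class U, std::size_t A>
bool operator==(const aligned_allocator<T, A>&, const aligned_allocator<U, A>&) noexcept {
    return true;
}

template <class T, class U, std::size_t A>
bool operator!=(const aligned_allocator<T, A>&, const aligned_allocator<U, A>&) noexcept {
    return false;
}
```

Fixing the alignment keeps the measured loop from depending on whatever address the default allocator happens to return, which makes vectorized runs more comparable between iterations.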

⏱️ Benchmark results

| Benchmark | Before | After | Speedup |
|---|---|---|---|
| `transform_two_inputs_aligned<logical_and<>>/64` | 108 ns | 2.55 ns | 42.4 |
| `transform_two_inputs_aligned<logical_and<>>/4096` | 13869 ns | 9.44 ns | 1470 |
| `transform_two_inputs_aligned<logical_and<>>/65536` | 416424 ns | 115 ns | 3620 |
| `transform_two_inputs_aligned<logical_or<>>/64` | 123 ns | 2.59 ns | 47.40 |
| `transform_two_inputs_aligned<logical_or<>>/4096` | 14377 ns | 9.07 ns | 1590 |
| `transform_two_inputs_aligned<logical_or<>>/65536` | 409012 ns | 112 ns | 3650 |
| `transform_one_input_aligned<logical_not<>>/64` | 83.7 ns | 2.14 ns | 39.10 |
| `transform_one_input_aligned<logical_not<>>/4096` | 6891 ns | 7.28 ns | 947 |
| `transform_one_input_aligned<logical_not<>>/65536` | 264957 ns | 82.7 ns | 3200 |

AlexGuteniev · 2025-10-07T11:36:20Z

stl/inc/functional

+
+template <class _Ty>
+struct _Map_vb_functor<equal_to<_Ty>> {
+    using _Type = conditional_t<_Is_vbool_functor_arg<_Ty>, _Bit_xnor, void>;


Alternatively, we can map to `_Map_vb_functor` itself and define `operator()` right here, which saves one struct.

AlexGuteniev · 2025-10-15T06:45:52Z

stl/inc/vector

 }

+template <class _VbIt, class _OutIt, class _Mapped_fn>
+_CONSTEXPR20 _OutIt _Transform_vbool_aligned(


I moved this out of `<algorithm>` into `<vector>` because the other algorithms were moved out the same way.
However, I'm not sure it is useful.

For accessing the `vector<bool>` representation it is not strictly necessary. Most of what is needed are template-dependent member functions and data members. The only exception is `_Vbase`, which can still be deduced from the iterators.

For throughput it does not look useful either. `<vector>` is included more often than `<algorithm>`, so it appears more useful to offload `<vector>` instead.

For reference, 0f24d45 is the commit where this movement was made.

AlexGuteniev requested a review from a team as a code owner October 7, 2025 08:02

github-project-automation bot added this to STL Code Reviews Oct 7, 2025

github-project-automation bot moved this to Initial Review in STL Code Reviews Oct 7, 2025

AlexGuteniev force-pushed the dinosaurs! branch from 2251e80 to efb5539 Compare October 7, 2025 08:13

AlexGuteniev added 3 commits October 7, 2025 11:25

benchmark

6651787

coverage

8578a69

optimization

dc9cb95

AlexGuteniev force-pushed the dinosaurs! branch from efb5539 to dc9cb95 Compare October 7, 2025 08:25

tails

e09858c

AlexGuteniev force-pushed the dinosaurs! branch from 4c83d72 to e09858c Compare October 7, 2025 09:50

-parens

461defb

AlexGuteniev commented Oct 7, 2025

StephanTLavavej added the performance Must go faster label Oct 8, 2025

StephanTLavavej self-assigned this Oct 8, 2025

move out

0f24d45

AlexGuteniev commented Oct 15, 2025
