Scc instead of loop entry #8852

eddyz87 · 2025-04-26T10:23:11Z

No description provided.

@0

Compute strongly connected components in the program CFG. Assign an SCC number to each instruction, recorded in env->insn_aux[*].scc. Use Tarjan's algorithm for SCC computation adapted to run non-recursively. For debug purposes print out computed SCCs as a part of full program dump in compute_live_registers() at log level 2, e.g.: func#0 @0 Live regs before insn: 0: .......... (b4) w6 = 10 2 1: ......6... (18) r1 = 0xffff88810bbb5565 2 3: .1....6... (b4) w2 = 2 2 4: .12...6... (85) call bpf_trace_printk#6 2 5: ......6... (04) w6 += -1 2 6: ......6... (56) if w6 != 0x0 goto pc-6 7: .......... (b4) w6 = 5 1 8: ......6... (18) r1 = 0xffff88810bbb5567 1 10: .1....6... (b4) w2 = 2 1 11: .12...6... (85) call bpf_trace_printk#6 1 12: ......6... (04) w6 += -1 1 13: ......6... (56) if w6 != 0x0 goto pc-6 14: .......... (b4) w0 = 0 15: 0......... (95) exit ^^^ SCC number for the instruction Signed-off-by: Eduard Zingerman <[email protected]>

A function to return IP for a given frame in a call stack of a state. Will be used by a next patch. Signed-off-by: Eduard Zingerman <[email protected]>

Replace loop_entry-based exact states comparison logic. Instead, for states within an iterator based loop, mark all registers as read and precise. Use control flow graph strongly connected components information to detect states that are members of a loop. See comments for mark_all_regs_read_and_precise() for a detailed explanation. This change addresses the cases described in [1]. These cases can be illustrated with the following diagram: .-> A --. Assume the states are visited in the order A, B, C. | | | Assume that state B reaches a state equivalent to state A. | v v At this point, state C is not processed yet, so state A '-- B C has not received any read or precision marks from C. As a result, these marks won't be propagated to B. If B has incomplete marks, it is unsafe to use it in states_equal() checks. See selftests later in a series for examples of unsafe programs that are not detected without this change. [1] https://lore.kernel.org/bpf/[email protected]/ Fixes: 2a09928 ("bpf: correct loop detection for iterators convergence") Signed-off-by: Eduard Zingerman <[email protected]>

The test case absent_mark_in_the_middle_state is equivalent of the following C program: 1: r8 = bpf_get_prandom_u32(); 2: r6 = -32; 3: bpf_iter_num_new(&fp[-8], 0, 10); 4: if (unlikely(bpf_get_prandom_u32())) 5: r6 = -31; 6: for (;;) { 7: if (!bpf_iter_num_next(&fp[-8])) 8: break; 9: if (unlikely(bpf_get_prandom_u32())) 10: *(u64 *)(fp + r6) = 7; 11: } 12: bpf_iter_num_destroy(&fp[-8]); 13: return 0; W/o a fix that instructs verifier to ignore branches count for loop entries verification proceeds as follows: - 1-4, state is {r6=-32,fp-8=active}; - 6, checkpoint A is created with {r6=-32,fp-8=active}; - 7, checkpoint B is created with {r6=-32,fp-8=active}, push state {r6=-32,fp-8=active} from 7 to 9; - 8,12,13, {r6=-32,fp-8=drained}, exit; - pop state with {r6=-32,fp-8=active} from 7 to 9; - 9, push state {r6=-32,fp-8=active} from 9 to 10; - 6, checkpoint C is created with {r6=-32,fp-8=active}; - 7, checkpoint A is hit, no precision propagated for r6 to C; - pop state {r6=-32,fp-8=active} from 9 to 10; - 10, state is {r6=-31,fp-8=active}, r6 is marked as read and precise, these marks are propagated to checkpoints A and B (but not C, as it is not the parent of current state; - 6, {r6=-31,fp-8=active} checkpoint C is hit, because r6 is not marked precise for this checkpoint; - the program is accepted, despite a possibility of unaligned u64 stack access at offset -31. The test case absent_mark_in_the_middle_state2 is similar except the following change: r8 = bpf_get_prandom_u32(); r6 = -32; bpf_iter_num_new(&fp[-8], 0, 10); if (unlikely(bpf_get_prandom_u32())) { r6 = -31; + jump_into_loop: + goto +0; + goto loop; + } + if (unlikely(bpf_get_prandom_u32())) + goto jump_into_loop; + loop: for (;;) { if (!bpf_iter_num_next(&fp[-8])) break; if (unlikely(bpf_get_prandom_u32())) *(u64 *)(fp + r6) = 7; } bpf_iter_num_destroy(&fp[-8]) return 0 The goal is to check that read/precision marks are propagated to checkpoint created at 'goto +0' that resides outside of the loop. Signed-off-by: Eduard Zingerman <[email protected]>

kernel-patches-daemon-bpf bot force-pushed the bpf-next_base branch from a041a61 to b2666a3 Compare April 28, 2025 15:56

eddyz87 added 2 commits April 28, 2025 09:56

bpf: frame_insn_idx() utility function

bfcf9be

A function to return IP for a given frame in a call stack of a state. Will be used by a next patch. Signed-off-by: Eduard Zingerman <[email protected]>

eddyz87 force-pushed the scc-instead-of-loop-entry branch from 959bbb2 to 0ce57f0 Compare April 29, 2025 03:39

eddyz87 added 2 commits April 28, 2025 23:32

eddyz87 force-pushed the scc-instead-of-loop-entry branch from 0ce57f0 to f41d03a Compare April 29, 2025 06:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Scc instead of loop entry #8852

Scc instead of loop entry #8852

eddyz87 commented Apr 26, 2025

Scc instead of loop entry #8852

Are you sure you want to change the base?

Scc instead of loop entry #8852

Conversation

eddyz87 commented Apr 26, 2025