Conversation

@rescrv
Contributor

@rescrv rescrv commented Oct 6, 2025

Description of changes

Unscientifically, minio can push 1k triggers per second with batching
that keeps the latency under one second. No need for sciencing this
one.

Test plan

Local + CI

Migration plan

N/A

Observability plan

N/A

Documentation Changes

N/A

@github-actions

github-actions bot commented Oct 6, 2025

Reviewer Checklist

Please leverage this checklist to ensure your code review is thorough before approving

Testing, Bugs, Errors, Logs, Documentation

  • Can you think of any use cases in which the code does not behave as intended? Have they been tested?
  • Can you think of any inputs or external events that could break the code? Is user input validated and safe? Have these cases been tested?
  • If appropriate, are there adequate property based tests?
  • If appropriate, are there adequate unit tests?
  • Should any logging, debugging, tracing information be added or removed?
  • Are error messages user-friendly?
  • Have all documentation changes needed been made?
  • Have all non-obvious changes been commented?

System Compatibility

  • Are there any potential impacts on other parts of the system or backward compatibility?
  • Does this change intersect with any items on our roadmap, and if so, is there a plan for fitting them together?

Quality

  • Is this code of unexpectedly high quality (Readability, Modularity, Intuitiveness)?

@propel-code-bot
Contributor

propel-code-bot bot commented Oct 6, 2025

Add Benchmark for s3heap and Explicit Benchmark Dependency

This PR introduces a new benchmarking tool for the s3heap Rust module, designed to measure throughput and latency of S3-backed heap operations under high load. It adds the file rust/s3heap/examples/s3heap-benchmark.rs, which launches a synthetic, highly parallelized workload that stresses the system with configurable throughput and scheduling parameters. There are also supporting changes: guacamole is added as a development dependency in rust/s3heap/Cargo.toml, and the workspace's Cargo.lock is updated to reflect this. The tool utilizes large Tokio channel buffers and batching for stress testing; code and benchmark design choices were discussed in the review and are acknowledged by the author.

Key Changes

• Added new benchmark example file rust/s3heap/examples/s3heap-benchmark.rs to provide high-throughput load simulation for s3heap.
• Introduced benchmark options for runtime, target_throughput, and max_tokio_tasks to control stress test parameters.
• Added guacamole = { version = "0.11", default-features = false } to [dev-dependencies] in rust/s3heap/Cargo.toml.
• Updated Cargo.lock to include guacamole and synchronize crates with new dependency.

Affected Areas

rust/s3heap/examples/s3heap-benchmark.rs (new benchmark file)
rust/s3heap/Cargo.toml ([dev-dependencies] section)
Cargo.lock (dependency graph)

This summary was automatically generated by @propel-code-bot

@rescrv rescrv requested a review from tanujnay112 October 6, 2025 20:45
Comment on lines +17 to +32
```rust
#[derive(Clone, Eq, PartialEq)]
pub struct Options {
    pub runtime: usize,
    pub target_throughput: usize,
    pub max_tokio_tasks: usize,
}

impl Default for Options {
    fn default() -> Self {
        Options {
            runtime: 60,
            target_throughput: 100_000,
            max_tokio_tasks: 10_000_000,
        }
    }
}
```
Contributor

[BestPractice]

The design of this benchmark could be refined to provide more realistic and clearer results.

1. **Misleading Configuration and High Memory Usage**: The `max_tokio_tasks` field in `Options` is used to define a very large channel capacity (over 10 million elements). This could lead to high memory consumption (potentially >600MB) if the consumer task falls behind, which might mask performance bottlenecks by simply buffering them.
2. **Ineffective Task Limit Check**: The check `if tasks_alive > options.max_tokio_tasks` is unlikely to ever trigger since only one long-lived task is spawned.

Consider simplifying this by removing `max_tokio_tasks` and sizing the channel relative to the throughput. Using `tx.send(...).await` instead of `try_send()` would also introduce back-pressure, giving a better signal of the sustainable throughput of the system under test.

For example:

```rust
// In the Options struct, remove max_tokio_tasks
pub struct Options {
    pub runtime: usize,
    pub target_throughput: usize,
}

// In main(), size the channel relative to throughput
let (tx, mut rx) =
    tokio::sync::mpsc::channel::<Schedule>(options.target_throughput * 2); // e.g. a 2s buffer

// In the producer loop, use an awaitable send and remove the task check
// ...
if tx.send(Schedule { ... }).await.is_err() {
    // The receiver has been dropped, so we can stop.
    break;
}
// ...
```

3. **I/O in Hot Loop**: The `eprintln!` in the consumer's hot loop can introduce I/O overhead and affect measurements. It would be better to aggregate statistics and print a summary only at the end of the run.

File: rust/s3heap/examples/s3heap-benchmark.rs
Line: 32

@blacksmith-sh blacksmith-sh bot deleted a comment from rescrv Oct 6, 2025
```rust
        name: "demo".to_string(),
    },
    nonce,
    next_scheduled: Utc::now()
```
Contributor

We could make a configuration of this benchmark where next_scheduled is spread out a bit more to see how we do when we don't bucket that well.

Contributor Author

Good idea. We can add that.

Contributor

@tanujnay112 tanujnay112 left a comment

Left one comment

@rescrv rescrv force-pushed the rescrv/heap-scheduler branch from 5e54c6d to 52da7a6 Compare October 7, 2025 16:19
@rescrv rescrv force-pushed the rescrv/s3heap-benchmark branch from 74bfed0 to d5312b4 Compare October 7, 2025 16:26
```rust
);
let (tx, mut rx) =
    tokio::sync::mpsc::channel::<Schedule>(options.target_throughput + options.max_tokio_tasks);
let count = Arc::new(AtomicU64::new(0));
```
Contributor

[PerformanceOptimization]

The channel capacity is set to options.target_throughput + options.max_tokio_tasks, which is over 10 million with the default options. This will result in a channel buffer that consumes a large amount of memory (~650MB) and can lead to very large batches being processed by the consumer. This might be intentional for stress-testing, but if the goal is to simulate more frequent, smaller batches, you might consider reducing this capacity. Using just options.target_throughput would still allow for significant batching while using less memory.

File: rust/s3heap/examples/s3heap-benchmark.rs
Line: 48

@blacksmith-sh blacksmith-sh bot deleted a comment from rescrv Oct 7, 2025
```rust
let mut next = Duration::ZERO;
loop {
    let gap = interarrival_duration(options.target_throughput as f64)(&mut guac);
    let future = interarrival_duration(1.0 / 60.0)(&mut guac);
```
Contributor

[BestPractice]

The variable name future is a bit confusing in an async context, as it can be mistaken for a Rust Future type. Since it represents a std::time::Duration, consider renaming it to something more descriptive like schedule_delay to improve clarity. You'll also need to update its usage on line 96.


File: rust/s3heap/examples/s3heap-benchmark.rs
Line: 78

@rescrv rescrv force-pushed the rescrv/heap-scheduler branch from 52da7a6 to 43edec4 Compare October 7, 2025 17:42
@rescrv rescrv force-pushed the rescrv/s3heap-benchmark branch from d4ceee4 to b18c541 Compare October 7, 2025 17:44
@rescrv rescrv force-pushed the rescrv/heap-scheduler branch 2 times, most recently from 4007de1 to 08647ab Compare October 8, 2025 15:37
@rescrv rescrv force-pushed the rescrv/heap-scheduler branch 2 times, most recently from 75ff5ab to 1de60cc Compare October 14, 2025 23:09
@rescrv rescrv force-pushed the rescrv/s3heap-benchmark branch from b18c541 to 0c2442a Compare October 15, 2025 15:36
@rescrv rescrv changed the base branch from rescrv/heap-scheduler to main October 15, 2025 16:03
```rust
        {
            break;
        }
        eprintln!("HEAP::PUSH {}", buffer.len());
```
Contributor

[BestPractice]

For a more accurate performance measurement, it's generally better to avoid I/O operations like eprintln! inside the hot loop of a benchmark. Printing to stderr can introduce latency and skew the results. The summary statistics printed at the end of the run are sufficient for reporting.


File: rust/s3heap/examples/s3heap-benchmark.rs
Line: 66

@blacksmith-sh blacksmith-sh bot deleted a comment from rescrv Oct 15, 2025
@rescrv rescrv merged commit 2bcd28f into main Oct 15, 2025
59 checks passed
@rescrv rescrv deleted the rescrv/s3heap-benchmark branch October 15, 2025 16:47