Conversation

johnaohara
Member

No description provided.

#[Buckets = 27, SubBuckets = 2048]
----------------------------------------------------------
92597 requests in 20.00s, 9.01MB read <6>
Socket errors: connect 5, read 0, write 0, timeout 40
Contributor

Socket errors? Some have timed out... is that important info?

Member Author

Good catch, and tbh I did not capture that info. It means we are likely missing measurements for 5 requests (out of ~92.5k)... but I don't think that invalidates the message in the post.

Av Throughput: 5142.425405 req/sec <6>

----
<1> The average RTT was 0.240 ms
Contributor

The order and phrasing of the content in the bullet points is slightly different from the one to compare with, i.e. https://github.com/RedHatPerf/redhatperf.github.io/pull/21/files#diff-e4c84ea749a91a7a6497412e17ff5f4973aec06e593c39f88e6c5abab5211a6aR432,

and my eyes are flipping back and forth to detect which parts are relevant/different for comparison.

A tabular/summary/chart view would help IMO.


https://github.com/johnaohara/bpf_validator/[bpf_validator] allows us to independently verify that the numbers produced by a Load Driver are not biased, and contain the full sample count.

He tested against https://hyperfoil.io/[Hyperfoil] and https://github.com/giltene/wrk2[wrk2] to confirm the results presented by Hyperfoil are an accurate representation of what happened during a load test.

We instead of He?


During our validation, we discovered:

* Hyperfoil adds 0.072ms on average to RTT, and the summary statistics report all requests sent.

Explain what RTT stands for?

A typical load test will be of the form shown above.

* **System Under Test (SUT)**: a dedicated system that contains an operating system, a hosted application and related processes to support the application. The application typically runs as a service, listening for requests on a particular port. When a request is received, the network stack processes the request and passes it to the application, which handles the request and sends a response.
* **Load Driver**: The job of the load driver is to replicate virtual users and measure their experience. Typical measurements include samples of Round Trip Time (RTT) or Throughput (req/sec)

Move the explanation for RTT to the first time it's referenced?

* **System Under Test (SUT)**: a dedicated system that contains an operating system, a hosted application and related processes to support the application. The application typically runs as a service, listening for requests on a particular port. When a request is received, the network stack processes the request and passes it to the application, which handles the request and sends a response.
* **Load Driver**: The job of the load driver is to replicate virtual users and measure their experience. Typical measurements include samples of Round Trip Time (RTT) or Throughput (req/sec)

The purpose of the load driver is to characterize the SUT, from the perspective of a virtual user.

The purpose of the load driver is to characterize the user/client, not the SUT?
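As a side note for readers of this thread, here is a minimal, hypothetical sketch of the kind of measurement loop the quoted bullets describe: one virtual user timing each request/response round trip with a monotonic clock and deriving throughput from the sample count. The address, port and 20 s duration are illustrative assumptions, and this is a closed-loop driver, whereas drivers such as Hyperfoil can instead schedule requests at a fixed rate (open model).

[source,c]
----
/* Hypothetical single-user, closed-loop load driver: send a request, wait
 * for the response, record the round trip time (RTT), repeat.
 * Names and constants are illustrative only. */
#include <arpa/inet.h>
#include <netinet/in.h>
#include <stdio.h>
#include <string.h>
#include <sys/socket.h>
#include <time.h>
#include <unistd.h>

static double now_ms(void)
{
    struct timespec ts;
    clock_gettime(CLOCK_MONOTONIC, &ts);
    return ts.tv_sec * 1e3 + ts.tv_nsec / 1e6;
}

int main(void)
{
    const char *req =
        "GET / HTTP/1.1\r\nHost: 127.0.0.1\r\nConnection: keep-alive\r\n\r\n";
    struct sockaddr_in addr = { .sin_family = AF_INET, .sin_port = htons(8080) };
    inet_pton(AF_INET, "127.0.0.1", &addr.sin_addr);  /* SUT address: assumption */

    int fd = socket(AF_INET, SOCK_STREAM, 0);
    if (fd < 0 || connect(fd, (struct sockaddr *)&addr, sizeof(addr)) != 0)
        return 1;

    char buf[4096];
    long samples = 0;
    double total_rtt = 0.0, start = now_ms();

    while (now_ms() - start < 20000.0) {      /* 20 s run, as in the post's output */
        double t0 = now_ms();
        if (send(fd, req, strlen(req), 0) < 0)
            break;
        /* Simplification: one recv() is assumed to return the full response. */
        if (recv(fd, buf, sizeof(buf), 0) <= 0)
            break;
        total_rtt += now_ms() - t0;           /* one RTT sample */
        samples++;
    }

    double elapsed_s = (now_ms() - start) / 1e3;
    printf("%ld requests in %.2fs\n", samples, elapsed_s);
    if (samples > 0)
        printf("Av RTT: %f ms, Av Throughput: %f req/sec\n",
               total_rtt / samples, samples / elapsed_s);
    close(fd);
    return 0;
}
----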


By filtering out all packets that are sent to, and received from, a specific remote port, a timestamp is recorded when a packet is sent and also when a full HTTP response has been received.

A map of timestamps is maintained within the kernel code, which calculates the RTT for each individual request. The timestamps are sent to a user space application, that records the timestamps in a http://www.hdrhistogram.org/[hdrHistogram]

Add . to the end
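To make the quoted mechanism more concrete, here is a hedged sketch: it is not the actual bpf_validator source, it assumes libbpf-style BTF map definitions (vmlinux.h plus bpf_helpers.h), and it approximates the request/response boundary with the client's write()/read() syscalls rather than real packet filtering by remote port. A kernel hash map keyed by thread id holds the send timestamp, the RTT is computed when the response returns, and each sample is pushed to user space over a ring buffer for the hdrHistogram recorder to consume.

[source,c]
----
/* Illustrative only -- not the bpf_validator source.  The real tool filters
 * TCP packets by remote port and detects complete HTTP responses; here the
 * request/response boundary is approximated by write()/read() syscalls. */
#include "vmlinux.h"
#include <bpf/bpf_helpers.h>

struct rtt_sample {
    u64 send_ns;   /* timestamp taken when the request was written */
    u64 rtt_ns;    /* receive timestamp minus send timestamp       */
};

struct {
    __uint(type, BPF_MAP_TYPE_HASH);
    __uint(max_entries, 16384);
    __type(key, u32);      /* thread id of the virtual user */
    __type(value, u64);    /* send timestamp in ns          */
} send_ts SEC(".maps");

struct {
    __uint(type, BPF_MAP_TYPE_RINGBUF);
    __uint(max_entries, 1 << 20);
} samples SEC(".maps");    /* consumed by a user-space hdrHistogram recorder */

SEC("tracepoint/syscalls/sys_enter_write")
int request_sent(void *ctx)
{
    u32 tid = (u32)bpf_get_current_pid_tgid();
    u64 now = bpf_ktime_get_ns();
    bpf_map_update_elem(&send_ts, &tid, &now, BPF_ANY);
    return 0;
}

SEC("tracepoint/syscalls/sys_exit_read")
int response_received(void *ctx)
{
    u32 tid = (u32)bpf_get_current_pid_tgid();
    u64 *start = bpf_map_lookup_elem(&send_ts, &tid);
    if (!start)
        return 0;

    struct rtt_sample *s = bpf_ringbuf_reserve(&samples, sizeof(*s), 0);
    if (s) {
        s->send_ns = *start;
        s->rtt_ns = bpf_ktime_get_ns() - *start;
        bpf_ringbuf_submit(s, 0);   /* hand the sample to user space */
    }
    bpf_map_delete_elem(&send_ts, &tid);
    return 0;
}

char LICENSE[] SEC("license") = "GPL";
----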

@franz1981 franz1981 requested a review from Copilot June 30, 2025 14:50
@franz1981
Contributor

Let's see if @copilot is smart enough (will do on my article too)

Copilot AI left a comment

Pull Request Overview

This PR introduces a comprehensive AsciiDoc post that explains the concept of uncoordinated omission in HTTP load testing and demonstrates how to validate load generators using the bpf_validator tool. Key changes include:

  • Creating a detailed guide with performance results and comparison tests between Hyperfoil and wrk2.
  • Providing step-by-step instructions and sample command outputs for running the bpf_validator tool.
  • Including comprehensive statistical analysis and comparisons of request rates and latencies.

What can we do about it? How can we determine if our load driver is

* a) accurate
* b) reporting summary results from all the sample
Copilot AI Jun 30, 2025

Consider correcting 'sample' to 'samples' for grammatical accuracy.

Suggested change
* b) reporting summary results from all the sample
* b) reporting summary results from all the samples


99.9th Percentile: 0.841215
99.99th Percentile: 1.735679

99954 requests in 9995496s <3>
Copilot AI Jun 30, 2025

The test duration value '9995496s' appears incorrect; please verify and update it to the intended duration format (e.g., '20.00s').

Suggested change
99954 requests in 9995496s <3>
99954 requests in 99954s <3>


Contributor

It was by accident, but 9995496s is indeed suspicious (good catch, AI?!)

@franz1981
Contributor

franz1981 commented Sep 17, 2025

FYI @lampajr : https://tanelpoder.com/posts/optimizing-ebpf-biolatency-accounting/

Since https://github.com/johnaohara/bpf_validator/blob/26145dee0c3e37e55a3996190edc5ba7ac2a4dd2/accept.bpf.c#L34 is using the same map type, the article shows the benefit of moving to BPF_MAP_TYPE_PERCPU_HASH instead.
This is not relevant in all the call paths, but at https://github.com/johnaohara/bpf_validator/blob/26145dee0c3e37e55a3996190edc5ba7ac2a4dd2/accept.bpf.c#L135 and https://github.com/johnaohara/bpf_validator/blob/26145dee0c3e37e55a3996190edc5ba7ac2a4dd2/accept.bpf.c#L200-L206 a similar pattern is used, and entering and exiting the same read syscall usually doesn't need a shared map type (per-CPU maps solve that, but can only be used in specific cases).

To verify that improving this would matter, we clearly first need to know whether capturing at the point "read" is called means capturing something important; but if it does, getting there without introducing a scalability problem is a good idea IMO.
It can be tested either with a JMH program or a full-fat load gen, adding more threads (to Hyperfoil) or single-threaded too (still on Hyperfoil), attaching a profiler like async-profiler but with unconfined/unprivileged access to walk the kernel stack traces: an attached eBPF program should be visible there too AFAIK.
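For reference, the map-type change being suggested would look roughly like the following in libbpf-style declarations. This is a sketch only, not the actual bpf_validator code, with the "specific cases" caveat from the comment spelled out in the code comments.

[source,c]
----
/* Sketch of the suggested map-type change, not actual bpf_validator code.
 * Assumes vmlinux.h + libbpf's bpf_helpers.h. */
#include "vmlinux.h"
#include <bpf/bpf_helpers.h>

/* Before: a single hash map shared by all CPUs -- updates and lookups from
 * probes running on different CPUs contend on the same buckets under load. */
struct {
    __uint(type, BPF_MAP_TYPE_HASH);
    __uint(max_entries, 16384);
    __type(key, u32);    /* thread id            */
    __type(value, u64);  /* entry timestamp, ns  */
} read_start SEC(".maps");

/* After: one value slot per CPU, so entry/exit probes that run on the same
 * CPU never contend.  Caveat (the "specific cases" above): if the traced
 * read() blocks and the task wakes up on another CPU, the exit probe will
 * not see the timestamp the entry probe stored on the original CPU. */
struct {
    __uint(type, BPF_MAP_TYPE_PERCPU_HASH);
    __uint(max_entries, 16384);
    __type(key, u32);
    __type(value, u64);
} read_start_percpu SEC(".maps");
----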
