Add initial version of benchmark experiment runner #1266

dannycjones · 2025-02-11T17:15:54Z

In order to investigate performance in Mountpoint, we want to be able to vary different parameters. In fact, it can be very useful to vary these parameters together to see how performance (such as sequential read throughput) changes as we vary two parameters together.

This change introduces a new benchmark running script which uses the Python framework Hydra to enumerate combinations of parameters, and then execute some function with each combination. The script manages the lifecycle of the mount-s3 file system and collecting data into an output folder.

The change currently does not reuse the FIO definitions used by our regression benchmarks. In the mid-term, these should be reconciled.

This pull request (PR) supersedes a previous PR: #986.

Does this change impact existing behavior?

No, this adds a new benchmark runner and benchmark definitions. This does not impact the Mountpoint file system.

Does this change need a changelog entry? Does it require a version change?

No, no impact to Mountpoint file system or crates.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license and I agree to the terms of the Developer Certificate of Origin (DCO).

Signed-off-by: Daniel Carl Jones <[email protected]>

benchmark/benchmark.py

muddyfish · 2025-02-12T17:02:56Z

benchmark/benchmark.py

+        try:
+            # TODO: Add resource monitoring during FIO job
+            _run_fio(cfg, mount_dir)
+            success = True


try/except/else also is probably wanted here as well

I don't follow what we want the else for?

renanmagagnin · 2025-02-12T16:53:08Z

benchmark/README.md

Should we add a section on benchmarks and how to create a new one?

Added a short desc on how the benchmark works, where its defined

renanmagagnin · 2025-02-12T16:59:18Z

benchmark/benchmark.py

+        mp_version = _mount_mp(cfg, metadata, mount_dir)
+        mounted = True
+        metadata["mp_version"] = mp_version


Would it be useful to parameterise mountpoint flags? If so, we could also add them to metadata.

I'm not sure what you mean here, can you rephrase?

benchmark/benchmark.py

renanmagagnin · 2025-02-12T17:10:09Z

benchmark/README.md

+This will run the default experiment, including many different configuration combinations.
+Output is written to `multirun/` within directories for the date, time, and job run.


Maybe it could be nice to include an example output?

I described the output a bit more. I'm worried it'll be hard to maintain and become outdated quickly here.

Signed-off-by: Daniel Carl Jones <[email protected]>

benchmark/benchmark.py

sahityadg · 2025-02-19T14:29:32Z

benchmark/benchmark.py

+    ]
+    subprocess_env = {}
+
+    if cfg['s3_prefix'] is not None:


Should we just use mountpoint args as they are than have a different mapping? For example, we could just use prefix instead of s3_prefix, debug-crt instead of mountpoint_debug_crt etc.,

My thinking is that it doesn't map very well as a straight forward mapping. Right now, it wraps both Mountpoint and FIO - maybe we might extend either (more benchmarking tools or more layers of MP, though I'm not confident on how well it can be generalized).

Additionally, there's the parameters mentioned in this script as well as when they are read in Pandas when analysing results. I diverged from MP naming to try and make the results clearer (for example, using fuse_threads instead of max-threads).

I'm leaning more to keep them diverged for now.

Signed-off-by: Daniel Carl Jones <[email protected]>

benchmark/benchmark.py

…n.dump Signed-off-by: Daniel Carl Jones <[email protected]>

muddyfish · 2025-02-20T11:02:13Z

LGTM

Add initial version of benchmark experiment runner

a473b2b

Signed-off-by: Daniel Carl Jones <[email protected]>

dannycjones temporarily deployed to PR integration tests February 11, 2025 17:16 — with GitHub Actions Inactive

dannycjones marked this pull request as ready for review February 12, 2025 09:18

dannycjones requested review from mansi153 and a team February 12, 2025 09:18

muddyfish reviewed Feb 12, 2025

View reviewed changes

renanmagagnin reviewed Feb 12, 2025

View reviewed changes

dannycjones added 3 commits February 19, 2025 08:37

Update benchmark README.md based on PR feedback

9a4befb

Signed-off-by: Daniel Carl Jones <[email protected]>

Update benchmark.py based on PR feedback

8b28235

Signed-off-by: Daniel Carl Jones <[email protected]>

Update benchmark.py based on Mansi and Sahitya feedback

67d5dc2

Signed-off-by: Daniel Carl Jones <[email protected]>

dannycjones temporarily deployed to PR integration tests February 19, 2025 10:30 — with GitHub Actions Inactive

dannycjones requested review from renanmagagnin and muddyfish February 19, 2025 10:32

muddyfish reviewed Feb 19, 2025

View reviewed changes

benchmark/benchmark.py Outdated Show resolved Hide resolved

benchmark/benchmark.py Outdated Show resolved Hide resolved

sahityadg reviewed Feb 19, 2025

View reviewed changes

Update benchmark.py based on latest PR feedback

6aa364a

Signed-off-by: Daniel Carl Jones <[email protected]>

dannycjones requested a deployment to PR integration tests February 19, 2025 15:08 — with GitHub Actions Waiting

muddyfish reviewed Feb 19, 2025

View reviewed changes

benchmark/benchmark.py Outdated Show resolved Hide resolved

benchmark/benchmark.py Outdated Show resolved Hide resolved

benchmark/benchmark.py Outdated Show resolved Hide resolved

sahityadg requested review from muddyfish and sahityadg February 19, 2025 16:08

Update to remove unnecessary str() calls, replace json.dumps with jso…

64c7a37

…n.dump Signed-off-by: Daniel Carl Jones <[email protected]>

dannycjones temporarily deployed to PR integration tests February 20, 2025 08:22 — with GitHub Actions Inactive

muddyfish approved these changes Feb 20, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add initial version of benchmark experiment runner #1266

Add initial version of benchmark experiment runner #1266

dannycjones commented Feb 11, 2025

muddyfish Feb 12, 2025

dannycjones Feb 19, 2025

renanmagagnin Feb 12, 2025

dannycjones Feb 19, 2025

renanmagagnin Feb 12, 2025

dannycjones Feb 13, 2025

renanmagagnin Feb 12, 2025

dannycjones Feb 19, 2025

sahityadg Feb 19, 2025 •

edited

Loading

dannycjones Feb 19, 2025

muddyfish commented Feb 20, 2025

		This will run the default experiment, including many different configuration combinations.
		Output is written to `multirun/` within directories for the date, time, and job run.

Add initial version of benchmark experiment runner #1266

Are you sure you want to change the base?

Add initial version of benchmark experiment runner #1266

Conversation

dannycjones commented Feb 11, 2025

Does this change impact existing behavior?

Does this change need a changelog entry? Does it require a version change?

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sahityadg Feb 19, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

muddyfish commented Feb 20, 2025

sahityadg Feb 19, 2025 •

edited

Loading