Memory profiler #2061

jainapurva · 2025-04-15T22:13:21Z

This pull request introduces memory profiling capabilities to the microbenchmarking framework, alongside enhancements to the existing profiling infrastructure. Key changes include the addition of a memory profiler, updates to configuration files to enable memory profiling, and new test cases to validate the functionality.

Memory Profiling Enhancements:

Memory Profiling Implementation: Added generate_memory_profile and visualize_memory_profile functions in benchmarks/microbenchmarks/profiler.py to enable CUDA memory profiling and visualization. This includes memory snapshots, stats collection, and HTML visualizations of memory usage.
Validation for Pickle Files: Introduced _validate_pickle_file in profiler.py to ensure memory profile files are valid and readable.

Configuration Updates:

New Profiling Options: Updated benchmarks/microbenchmarks/README.md and benchmarks/microbenchmarks/test/benchmark_config.yml to include enable_memory_profiler as a configuration option. This allows users to enable memory profiling for specific models.

Benchmarking Framework Changes:

Integration of Memory Profiling: Updated benchmark_inference.py to run memory profiling when enable_memory_profiler is set. Results include memory stats and visualization paths.
Extended BenchmarkConfig and BenchmarkResult: Added enable_memory_profiler to BenchmarkConfig and memory profiling-related fields (memory_profile_path, memory_visualization_path, memory_stats) to BenchmarkResult in utils.py.

These changes enhance the profiling capabilities of the benchmarking framework, providing deeper insights into memory usage during inference, especially for CUDA-enabled devices.Add support for memory profiler

[ghstack-poisoned]

…shapes_config

pytorch-bot · 2025-04-15T22:13:25Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2061

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

…_profiler

Copilot

Pull Request Overview

This PR introduces memory profiling capabilities to the microbenchmarking framework, enabling CUDA memory profiling and visualization alongside the standard profiling functionality. Key changes include the implementation of memory profiling functions in profiler.py, updates to configuration files and utility classes to support new profiling options, and the addition of comprehensive tests to validate the functionality.

Reviewed Changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
benchmarks/microbenchmarks/utils.py	Added new fields and updated to_dict to carry memory profiling info
benchmarks/microbenchmarks/test/test_benchmark_profiler.py	Added tests for memory profile generation and visualization
benchmarks/microbenchmarks/test/benchmark_config.yml	Updated YAML config to include enable_memory_profiler flags
benchmarks/microbenchmarks/profiler.py	Added generate_memory_profile and visualize_memory_profile functions, including pickle file validation
benchmarks/microbenchmarks/benchmark_inference.py	Integrated memory profiling functions to run alongside model profiler
benchmarks/microbenchmarks/README.md	Documented new memory profiling options and usage

benchmarks/microbenchmarks/test/test_benchmark_profiler.py

Co-authored-by: Copilot <[email protected]>

HDCharles

lgtm

jainapurva and others added 13 commits April 8, 2025 14:35

Update

8b22a68

[ghstack-poisoned]

Add profiler

04f39ef

Add support for different models and different shapes

4b7ea5d

Add ruff fixes

33fa3ca

Updates

5ee6b58

Updates

345a00c

Merge remote-tracking branch 'origin/bench-gpu-profiling' into model_…

6e88306

…shapes_config

Updates

5895b7e

Updates

bbcba36

Memory profiler

62a1e70

updates

d5bdb4a

Merge remote-tracking branch 'origin/bench-gpu-profiling' into model_…

7677902

…shapes_config

Updates to memory_profiler

7c15006

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 15, 2025

jainapurva added 3 commits April 17, 2025 22:17

Merge remote-tracking branch 'origin/main' into model_shapes_config

06f5ee7

Added a future todo

784ec94

Merge remote-tracking branch 'origin/model_shapes_config' into memory…

ceded86

…_profiler

jainapurva changed the base branch from main to model_shapes_config April 18, 2025 19:01

jainapurva added topic: performance Use this tag if this PR improves the performance of a feature topic: for developers Use this tag if this PR is mainly developer facing labels Apr 18, 2025

jainapurva requested review from Copilot and HDCharles April 18, 2025 21:32

Copilot AI reviewed Apr 18, 2025

View reviewed changes

benchmarks/microbenchmarks/test/test_benchmark_profiler.py Outdated Show resolved Hide resolved

Update benchmarks/microbenchmarks/test/test_benchmark_profiler.py

19dcb3d

Co-authored-by: Copilot <[email protected]>

jainapurva marked this pull request as ready for review April 21, 2025 18:05

HDCharles approved these changes Apr 22, 2025

View reviewed changes

jainapurva added 3 commits April 22, 2025 19:38

Merge remote-tracking branch 'origin/main' into memory_profiler

aadcd91

Test fix

1ae84a8

Merge remote-tracking branch 'origin/main' into memory_profiler

391f6d8

jainapurva changed the base branch from model_shapes_config to main April 25, 2025 16:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Memory profiler #2061

Memory profiler #2061

jainapurva commented Apr 15, 2025 •

edited

Loading

pytorch-bot bot commented Apr 15, 2025 •

edited

Loading

Copilot AI left a comment

HDCharles left a comment

Memory profiler #2061

Are you sure you want to change the base?

Memory profiler #2061

Conversation

jainapurva commented Apr 15, 2025 • edited Loading

Memory Profiling Enhancements:

Configuration Updates:

Benchmarking Framework Changes:

pytorch-bot bot commented Apr 15, 2025 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2061

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

HDCharles left a comment

Choose a reason for hiding this comment

jainapurva commented Apr 15, 2025 •

edited

Loading

pytorch-bot bot commented Apr 15, 2025 •

edited

Loading