upload to spanner and add min input and output len #43

kaushikmitr · 2025-06-20T00:59:52Z

This pull request introduces significant enhancements to the benchmark_serving.py script and related files, focusing on data validation, Spanner integration, and improved configurability for benchmarking datasets. Key changes include adding support for uploading benchmark results to Google Cloud Spanner, introducing minimum input/output length filters, and enabling additional arguments for dataset filtering and Spanner configuration.

Enhancements to Benchmarking and Data Validation:

Added safe_json_value function to handle NaN and Infinity values for JSON serialization, ensuring compatibility with Spanner and other systems. (benchmark_serving.py, benchmark_serving.pyR45-R239)
Introduced min_input_len and min_output_len parameters in get_filtered_dataset to filter datasets based on minimum sequence lengths. (benchmark_serving.py, [1] [2]

Integration with Google Cloud Spanner:

Implemented upload_to_spanner_batch_with_retry function to upload benchmark results to Spanner with retry logic for batch uploads. (benchmark_serving.py, benchmark_serving.pyR45-R239)
Added Spanner-related arguments (--spanner-instance-id, --spanner-database-id) to the CLI parser for configuring Spanner uploads. (benchmark_serving.py, benchmark_serving.pyR1345-R1356)
Modified save_json_results to optionally upload results to Spanner, controlled by the spanner_upload flag. (benchmark_serving.py, benchmark_serving.pyR837-R847)

Updates to Benchmark Workflow:

Enhanced async def benchmark to pass minimum input/output lengths and enable Spanner uploads. (benchmark_serving.py, [1] [2]
Updated print_and_save_result to support Spanner uploads and optional server metrics scraping. (benchmark_serving.py, [1] [2]

Shell Script Modifications:

Added support for --min-input-length, --min-output-length, --spanner-instance-id, and --spanner-database-id in latency_throughput_curve.sh. (latency_throughput_curve.sh, [1] [2]

upload to spanner and add min input and output len

2d2e809

kaushikmitr requested review from Bslabe123 and achandrasekar June 20, 2025 01:00

kaushikmitr added 2 commits June 20, 2025 01:54

update requirements.txt

0598cde

include inferenceobjective and target model headers

536313a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

upload to spanner and add min input and output len #43

upload to spanner and add min input and output len #43

Uh oh!

kaushikmitr commented Jun 20, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

upload to spanner and add min input and output len #43

Are you sure you want to change the base?

upload to spanner and add min input and output len #43

Uh oh!

Conversation

kaushikmitr commented Jun 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Enhancements to Benchmarking and Data Validation:

Integration with Google Cloud Spanner:

Updates to Benchmark Workflow:

Shell Script Modifications:

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

kaushikmitr commented Jun 20, 2025 •

edited

Loading