fix(taskworker) Add metric to see how long we wait #94864

markstory · 2025-07-03T14:44:37Z

I'm interested in understanding how much time we're losing to waiting on empty multiprocessing queues. This metric will help understand this, so we can tune workers more.

codecov · 2025-07-03T15:03:49Z

Codecov Report

Attention: Patch coverage is 50.00000% with 1 line in your changes missing coverage. Please review.

✅ All tests successful. No failed tests found.

Files with missing lines	Patch %	Lines
src/sentry/taskworker/workerchild.py	50.00%	1 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff            @@
##           master   #94864    +/-   ##
========================================
  Coverage   87.89%   87.89%            
========================================
  Files       10440    10442     +2     
  Lines      603678   604023   +345     
  Branches    23505    23505            
========================================
+ Hits       530577   530902   +325     
- Misses      72734    72754    +20     
  Partials      367      367

evanh · 2025-07-03T18:53:05Z

src/sentry/taskworker/workerchild.py

            try:
+                # If the queue is empty, this could block for a second.
+                # We could be losing a bunch of throughput here.


I don't think this is accurate. If this blocks, it means there's no work to be done anyways. E.g. the throughput can't be "lost" here. Pausing here isn't causing the worker to not do work.

Fair point. I'll trim that out. I'm interested to see if making this timeout shorter, and adjusting worker buffer sizes could help us get more throughput from workers.

fix(taskworker) Add metric to see how long we wait

9fbcbc9

I'm interested in understanding how much time we're losing to waiting on empty multiprocessing queues. This metric will help understand this, so we can tune workers more.

markstory requested a review from a team as a code owner July 3, 2025 14:44

github-actions bot added the Scope: Backend Automatically applied to PRs that change backend components label Jul 3, 2025

vercel bot deployed to Preview July 3, 2025 14:45 View deployment

evanh approved these changes Jul 3, 2025

View reviewed changes

Reduce comment.

deea453

vercel bot deployed to Preview July 4, 2025 03:16 View deployment

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

fix(taskworker) Add metric to see how long we wait #94864

fix(taskworker) Add metric to see how long we wait #94864

markstory commented Jul 3, 2025

Uh oh!

codecov bot commented Jul 3, 2025 •

edited

Loading

Uh oh!

evanh Jul 3, 2025

Uh oh!

markstory Jul 4, 2025

Uh oh!

Uh oh!

Uh oh!

fix(taskworker) Add metric to see how long we wait #94864

Are you sure you want to change the base?

fix(taskworker) Add metric to see how long we wait #94864

Conversation

markstory commented Jul 3, 2025

Uh oh!

codecov bot commented Jul 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

evanh Jul 3, 2025

Choose a reason for hiding this comment

Uh oh!

markstory Jul 4, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

codecov bot commented Jul 3, 2025 •

edited

Loading