cuda-perf

cuda-perf #135

Triggered via schedule June 18, 2026 11:43

Christoffer-JL

⁠ 081e1c8

main

Status Cancelled

Total duration 2d 0h 15m 49s

Artifacts –

cuda-perf.yml

on: schedule

set-parameters

10s

Matrix: export-models

Matrix: benchmark-cuda

upload-benchmark-results

1m 58s

Annotations

56 errors and 2 warnings

export-models (openai/whisper-small, non-quantized, openai_whisper-small, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

export-models (openai/whisper-small, quantized-int4-weight-only, openai_whisper-small, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

export-models (mistralai/Voxtral-Mini-3B-2507, quantized-int4-tile-packed, mistralai_Voxtral-Mini... / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

export-models (mistralai/Voxtral-Mini-3B-2507, non-quantized, mistralai_Voxtral-Mini-3B-2507, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

export-models (openai/whisper-small, quantized-int4-tile-packed, openai_whisper-small, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

export-models (openai/whisper-medium, non-quantized, openai_whisper-medium, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

export-models (openai/whisper-medium, quantized-int4-tile-packed, openai_whisper-medium, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

export-models (openai/whisper-large-v3-turbo, quantized-int4-weight-only, openai_whisper-large-v3... / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

export-models (openai/whisper-medium, quantized-int4-weight-only, openai_whisper-medium, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

export-models (mistralai/Voxtral-Mini-3B-2507, quantized-int4-weight-only, mistralai_Voxtral-Mini... / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

export-models (openai/whisper-large-v3-turbo, quantized-int4-tile-packed, openai_whisper-large-v3... / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

export-models (openai/whisper-large-v3-turbo, non-quantized, openai_whisper-large-v3-turbo, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

export-models (google/gemma-3-4b-it, quantized-int4-tile-packed, google_gemma-3-4b-it, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

export-models (nvidia/parakeet-tdt, quantized-int4-weight-only, nvidia_parakeet-tdt, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

export-models (google/gemma-3-4b-it, quantized-int4-weight-only, google_gemma-3-4b-it, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

export-models (google/gemma-3-4b-it, non-quantized, google_gemma-3-4b-it, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

export-models (nvidia/parakeet-tdt, quantized-int4-tile-packed, nvidia_parakeet-tdt, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

export-models (nvidia/parakeet-tdt, non-quantized, nvidia_parakeet-tdt, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

benchmark-cuda (openai/whisper-medium, non-quantized, openai_whisper-medium, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (mistralai/Voxtral-Mini-3B-2507, quantized-int4-tile-packed, mistralai_Voxtral-Min... / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (google/gemma-3-4b-it, quantized-int4-tile-packed, google_gemma-3-4b-it, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (nvidia/parakeet-tdt, quantized-int4-tile-packed, nvidia_parakeet-tdt, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (google/gemma-3-4b-it, non-quantized, google_gemma-3-4b-it, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (openai/whisper-medium, quantized-int4-tile-packed, openai_whisper-medium, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (openai/whisper-large-v3-turbo, quantized-int4-weight-only, openai_whisper-large-v... / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (nvidia/parakeet-tdt, quantized-int4-weight-only, nvidia_parakeet-tdt, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (google/gemma-3-4b-it, quantized-int4-weight-only, google_gemma-3-4b-it, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (openai/whisper-large-v3-turbo, quantized-int4-tile-packed, openai_whisper-large-v... / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (openai/whisper-large-v3-turbo, non-quantized, openai_whisper-large-v3-turbo, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (openai/whisper-medium, quantized-int4-weight-only, openai_whisper-medium, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (openai/whisper-small, non-quantized, openai_whisper-small, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (nvidia/parakeet-tdt, non-quantized, nvidia_parakeet-tdt, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (openai/whisper-small, quantized-int4-weight-only, openai_whisper-small, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (openai/whisper-small, quantized-int4-tile-packed, openai_whisper-small, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (mistralai/Voxtral-Mini-3B-2507, non-quantized, mistralai_Voxtral-Mini-3B-2507, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (mistralai/Voxtral-Mini-3B-2507, quantized-int4-weight-only, mistralai_Voxtral-Min... / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

upload-benchmark-results

Could not assume role with OIDC: Not authorized to perform sts:AssumeRoleWithWebIdentity

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists

set-parameters

Node.js 20 is deprecated. The following actions target Node.js 20 but are being forced to run on Node.js 24: actions/checkout@v3, actions/setup-python@v4. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/

upload-benchmark-results

Node.js 20 is deprecated. The following actions target Node.js 20 but are being forced to run on Node.js 24: actions/checkout@v3, actions/download-artifact@v4, actions/setup-python@v4, aws-actions/configure-aws-credentials@v4. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cuda-perf #135

Summary

cuda-perf #135

Uh oh!

cuda-perf.yml

Annotations