cuda-perf #135
Triggered via schedule
June 18, 2026 11:43
Status: Cancelled
Total duration: 2d 0h 15m 49s
Artifacts: –
cuda-perf.yml
on: schedule
set-parameters (10s)
Matrix: export-models
Matrix: benchmark-cuda
upload-benchmark-results (1m 58s)
Annotations
56 errors and 2 warnings
export-models (openai/whisper-small, non-quantized, openai_whisper-small, 50) / linux-job
Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists
export-models (openai/whisper-small, quantized-int4-weight-only, openai_whisper-small, 50) / linux-job
Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists
export-models (mistralai/Voxtral-Mini-3B-2507, quantized-int4-tile-packed, mistralai_Voxtral-Mini... / linux-job
Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists
export-models (mistralai/Voxtral-Mini-3B-2507, non-quantized, mistralai_Voxtral-Mini-3B-2507, 50) / linux-job
Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists
export-models (openai/whisper-small, quantized-int4-tile-packed, openai_whisper-small, 50) / linux-job
Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists
export-models (openai/whisper-medium, non-quantized, openai_whisper-medium, 50) / linux-job
Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists
export-models (openai/whisper-medium, quantized-int4-tile-packed, openai_whisper-medium, 50) / linux-job
Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists
export-models (openai/whisper-large-v3-turbo, quantized-int4-weight-only, openai_whisper-large-v3... / linux-job
Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists
export-models (openai/whisper-medium, quantized-int4-weight-only, openai_whisper-medium, 50) / linux-job
Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists
export-models (mistralai/Voxtral-Mini-3B-2507, quantized-int4-weight-only, mistralai_Voxtral-Mini... / linux-job
Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists
export-models (openai/whisper-large-v3-turbo, quantized-int4-tile-packed, openai_whisper-large-v3... / linux-job
Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists
export-models (openai/whisper-large-v3-turbo, non-quantized, openai_whisper-large-v3-turbo, 50) / linux-job
Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists
export-models (google/gemma-3-4b-it, quantized-int4-tile-packed, google_gemma-3-4b-it, 50) / linux-job
Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists
export-models (nvidia/parakeet-tdt, quantized-int4-weight-only, nvidia_parakeet-tdt, 50) / linux-job
Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists
export-models (google/gemma-3-4b-it, quantized-int4-weight-only, google_gemma-3-4b-it, 50) / linux-job
Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists
export-models (google/gemma-3-4b-it, non-quantized, google_gemma-3-4b-it, 50) / linux-job
Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists
export-models (nvidia/parakeet-tdt, quantized-int4-tile-packed, nvidia_parakeet-tdt, 50) / linux-job
Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists
export-models (nvidia/parakeet-tdt, non-quantized, nvidia_parakeet-tdt, 50) / linux-job
Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists
benchmark-cuda (openai/whisper-medium, non-quantized, openai_whisper-medium, 50) / linux-job
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
benchmark-cuda (mistralai/Voxtral-Mini-3B-2507, quantized-int4-tile-packed, mistralai_Voxtral-Min... / linux-job
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
benchmark-cuda (google/gemma-3-4b-it, quantized-int4-tile-packed, google_gemma-3-4b-it, 50) / linux-job
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
benchmark-cuda (nvidia/parakeet-tdt, quantized-int4-tile-packed, nvidia_parakeet-tdt, 50) / linux-job
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
benchmark-cuda (google/gemma-3-4b-it, non-quantized, google_gemma-3-4b-it, 50) / linux-job
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
benchmark-cuda (openai/whisper-medium, quantized-int4-tile-packed, openai_whisper-medium, 50) / linux-job
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
benchmark-cuda (openai/whisper-large-v3-turbo, quantized-int4-weight-only, openai_whisper-large-v... / linux-job
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
benchmark-cuda (nvidia/parakeet-tdt, quantized-int4-weight-only, nvidia_parakeet-tdt, 50) / linux-job
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
benchmark-cuda (google/gemma-3-4b-it, quantized-int4-weight-only, google_gemma-3-4b-it, 50) / linux-job
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
benchmark-cuda (openai/whisper-large-v3-turbo, quantized-int4-tile-packed, openai_whisper-large-v... / linux-job
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
benchmark-cuda (openai/whisper-large-v3-turbo, non-quantized, openai_whisper-large-v3-turbo, 50) / linux-job
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
benchmark-cuda (openai/whisper-medium, quantized-int4-weight-only, openai_whisper-medium, 50) / linux-job
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
benchmark-cuda (openai/whisper-small, non-quantized, openai_whisper-small, 50) / linux-job
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
benchmark-cuda (nvidia/parakeet-tdt, non-quantized, nvidia_parakeet-tdt, 50) / linux-job
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
benchmark-cuda (openai/whisper-small, quantized-int4-weight-only, openai_whisper-small, 50) / linux-job
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
benchmark-cuda (openai/whisper-small, quantized-int4-tile-packed, openai_whisper-small, 50) / linux-job
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
benchmark-cuda (mistralai/Voxtral-Mini-3B-2507, non-quantized, mistralai_Voxtral-Mini-3B-2507, 50) / linux-job
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
benchmark-cuda (mistralai/Voxtral-Mini-3B-2507, quantized-int4-weight-only, mistralai_Voxtral-Min... / linux-job
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
upload-benchmark-results
Could not assume role with OIDC: Not authorized to perform sts:AssumeRoleWithWebIdentity
cuda-perf (19 identical annotations)
Canceling since a higher priority waiting request for cuda-perf-main-081e1c81e878d103a465924e1a67d9c2b476f214-false-true exists
set-parameters
Node.js 20 is deprecated. The following actions target Node.js 20 but are being forced to run on Node.js 24: actions/checkout@v3, actions/setup-python@v4. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
upload-benchmark-results
Node.js 20 is deprecated. The following actions target Node.js 20 but are being forced to run on Node.js 24: actions/checkout@v3, actions/download-artifact@v4, actions/setup-python@v4, aws-actions/configure-aws-credentials@v4. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/