Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

adds retries around install artifacts if checksums fail #402

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

jaxesn
Copy link
Member

@jaxesn jaxesn commented Mar 17, 2025

Issue #, if available:

Description of changes:

When adding gpg validation to ssm, I added retries around both the downloading and the checksum validation to catch cases where the checksum fails due to a bad download. I actually saw something like this in 1 test run in my devstack.

This PR changes the remaining artifact downloading to match this pattern where checksum validation can trigger a retry.

Note:

The new unit tests are basically the same between the different artifacts.

Testing (if applicable):

Documentation added/planned (if applicable):

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@jaxesn jaxesn force-pushed the jgw/add-retries-to-checksums branch 5 times, most recently from 54c365d to c3e409c Compare March 18, 2025 16:59
@jaxesn jaxesn force-pushed the jgw/add-retries-to-checksums branch from c3e409c to 4552411 Compare April 2, 2025 13:58
@g-gaston g-gaston requested a review from Copilot April 2, 2025 19:54
Copy link

@Copilot Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR adds retry logic to artifact installations when checksum validation fails, thereby increasing the robustness of artifact downloads. The changes update install functions across multiple components (kubelet, kubectl, image credential provider, IAM Roles Anywhere, IAM authenticator, and CNI plugins) while unifying constant naming and error messaging.

Reviewed Changes

Copilot reviewed 19 out of 19 changed files in this pull request and generated no comments.

Show a summary per file
File Description
internal/test/install.go Adds retry test cases to simulate flaky HTTP/checksum scenarios.
internal/node/hybrid/aws.go Updates constant naming for signing helper path.
internal/kubelet/kubeconfig.go Uses new default constants for authenticator paths.
internal/kubelet/install.go Implements retry logic and renames install options constants.
internal/kubectl/install.go Implements retry logic and renames install options constants.
internal/imagecredentialprovider/install.go Implements retry logic and renames install options constants.
internal/iamrolesanywhere/install.go Implements retry logic and renames install options constants.
internal/iamauthenticator/install.go Implements retry logic and renames install options constants.
internal/flows/install.go Updates installation call patterns to use the new options structures.
internal/cni/install.go Implements retry logic and renames install options constants for CNI.
internal/aws/source.go Updates error wrapping for clarity and consistency.
internal/artifact/source.go Updates error wrapping in checksum parsing for clarity.
Comments suppressed due to low confidence (3)

internal/test/install.go:40

  • [nitpick] Several test cases share similar patterns across artifacts. Consider refactoring common test logic into a helper to minimize duplication and simplify maintenance.
tests := []struct {

internal/aws/source.go:173

  • [nitpick] The error wrapping style here differs from other parts of the code. Consider using a consistent approach (e.g., errors.Wrap) throughout the codebase.
return nil, fmt.Errorf("getting artifact file reader: %w", err)

internal/artifact/source.go:35

  • [nitpick] Ensure that error messages maintain a consistent format and wrapping style across modules for better clarity and debugging.
return nil, fmt.Errorf("parsing expected checksum: %w", err)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant