Skip to content

Attempt to fix CI timeouts#1811

Open
Rolv-Apneseth wants to merge 9 commits into
containers:mainfrom
Rolv-Apneseth:fix-ci
Open

Attempt to fix CI timeouts#1811
Rolv-Apneseth wants to merge 9 commits into
containers:mainfrom
Rolv-Apneseth:fix-ci

Conversation

@Rolv-Apneseth

Copy link
Copy Markdown
Contributor

Attempt to fix CI timeouts with suggestions from #1805.

  1. Add --command-timeout to skopeo copy calls so they can't hang forever
  2. When setting up the local Docker registry, add a step to wait and ensure it's ready before proceeding

@debarshiray I chose to go with 10m timeout for the one over the network and 60s for the ones happening locally (in case there's some really slow IO). Did also take the openssl s_client approach like you suggested too. WDYT?

For some reason, calls to skopeo copy sometimes hang indefinitely. Avoid
this issue by specifying --command-timeout with reasonable values
(longer for call which must fetch over the network).

Signed-off-by: Rolv Apneseth <rolv.apneseth@gmail.com>

@gemini-code-assist gemini-code-assist Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces timeouts to various skopeo copy commands and adds a retry loop to wait for the Docker registry to become ready before logging in, aiming to prevent CI hangs. The reviewer suggested wrapping the openssl s_client readiness check with a timeout command to prevent the check itself from hanging indefinitely if the registry is unresponsive during the TLS handshake.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

Comment thread test/system/libs/helpers.bash Outdated
…oceeding

To try and avoid potential races, ensure that the local docker registry
is ready before finishing the setup.

Signed-off-by: Rolv Apneseth <rolv.apneseth@gmail.com>
@Rolv-Apneseth

Copy link
Copy Markdown
Contributor Author

It does seem like an issue on the registry or maybe the node side of things. skopeo shouldn't hang, but it seems that the registry only actually finishes the response when skopeo hits its timeout.

Updating the registry image to ghcr.io/distribution/distribution:3 gets us past the setup failure but as you can see from the latest runs, one of the tests hangs on toolbox create with the same fedora-toolbox:34 image.

Test passes fine locally:

$ bats --filter "custom image that needs an authentication file" test/system/101-create.bats
   test suite: Set up
101-create.bats
 ✓ create: With a custom image that needs an authentication file
   test suite: Tear down

1 test, 0 failures

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant