-
-
Notifications
You must be signed in to change notification settings - Fork 11
docker-ssh-agent builds consistently timeout on ci.jenkins.io #4557
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
The Windows build time has increased from 7 minutes to 17 minutes. While we're working to reduce the Windows build time, let's unblock this build by allowing it to run longer. jenkins-infra/helpdesk#4557 is the Jenkins infra help desk that records the issue with ci.jenkins.io Windows build performance. Jobs that passed in less than 20 minutes (but more tthan 15 minutes): * https://ci.jenkins.io/job/Core/job/remoting/job/master/796/ * https://ci.jenkins.io/job/Core/job/remoting/view/change-requests/job/PR-781/ * https://ci.jenkins.io/job/Core/job/remoting/view/change-requests/job/PR-782/
Increase of the remoting timeout from 15 minutes to 25 minutes has been merged from: |
The Windows build time has increased from 7 minutes to 17 minutes. While we're working to reduce the Windows build time, let's unblock this build by allowing it to run longer. jenkins-infra/helpdesk#4557 is the Jenkins infra help desk that records the issue with ci.jenkins.io Windows build performance. Jobs that passed in less than 20 minutes (but more tthan 15 minutes): * https://ci.jenkins.io/job/Core/job/remoting/job/master/796/ * https://ci.jenkins.io/job/Core/job/remoting/view/change-requests/job/PR-781/ * https://ci.jenkins.io/job/Core/job/remoting/view/change-requests/job/PR-782/
it's not due to a lack of memory or cpu, metrics are ok, seems more a problem with docker on windows. |
I did test on a windows VM from the agent template and it went through correctly:
so we need to test on an agent currently running, this would need some network adaptation to allow RDS connexion |
…a issue is resolved
PR disabling 3 failing tests until this infra issue is resolved to allow new releases in the meantime, ready for review and merging: |
…g-test-for-now chore: disable failing test(s) until jenkins-infra/helpdesk#4557 infra issue is resolved
Update: the "longpath" for git is now enabled as per #4574 (comment) and we were able to release a new version. Let's resume work to understand what is the problem when running the last tests in an EC2 agent: jenkinsci/docker-ssh-agent#496 |
Note: the jenkinsci/docker-agent repository also seems to have the same kind of issue as reported and described by @lemeurherveCB Example in jenkinsci/docker-agent#949 for instance. Symptoms are close (stuck tests in Pester harness, not on Linux), but there might be different root causes. @lemeurherveCB is expected to try without the |
…-failing-test-for-now Revert "chore: disable failing test(s) until jenkins-infra/helpdesk#4557 infra issue is resolved"
Closing as per jenkinsci/docker-ssh-agent#496 (tests are back) |
Uh oh!
There was an error while loading. Please reload this page.
Service(s)
ci.jenkins.io
Summary
The builds for remoting and docker-ssh-agent fail on ci.jenkins.io due to a timeout. The timeout failures first became visible after the ci.jenkins.io transition from Azure to AWS.
The failures due to timeout may have very different causes, since the remoting timeout seems to always be on Windows agents while the docker-ssh-agent timeout seems to be on Linux.
The remoting timeout is resolved by extending the timeout from 15 seconds to 25 seconds. The docker-ssh-agent timeout is not resolved. Can be confirmed at
Reproduction steps
The text was updated successfully, but these errors were encountered: