Skip to content

ci-test-infra-triage perma failing #34863

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
BenTheElder opened this issue May 27, 2025 · 8 comments
Open

ci-test-infra-triage perma failing #34863

BenTheElder opened this issue May 27, 2025 · 8 comments
Assignees
Labels
kind/bug Categorizes issue or PR as related to a bug. sig/testing Categorizes an issue or PR as relevant to SIG Testing.

Comments

@BenTheElder
Copy link
Member

https://prow.k8s.io/?job=ci-test-infra-triage

cc @dims @aojea

/sig testing

timing out at 3h.

@BenTheElder BenTheElder added the kind/bug Categorizes issue or PR as related to a bug. label May 27, 2025
@k8s-ci-robot k8s-ci-robot added the sig/testing Categorizes an issue or PR as relevant to SIG Testing. label May 27, 2025
@BenTheElder
Copy link
Member Author

We seem to be stuck on data from the 26th now

@BenTheElder
Copy link
Member Author

This was passing reliably until recently:
Image

first failure:

Test started last Saturday at 3:23 PM failed after 3h15m2s
https://prow.k8s.io/view/gs/kubernetes-ci-logs/logs/ci-test-infra-triage/1926403800193568768

green before that:

Test started last Saturday at 11:23 AM passed after 2h39m42s.

We don't have a diff link, but it was during 2025-04-25 that it started failing

@BenTheElder
Copy link
Member Author

Something seriously regressed the performance, it was completing in ~50 minutes:

Image

Now taking 3+ hours and timing out:

Image

@BenTheElder
Copy link
Member Author

So maybe something that changed around ~2025-05-16, possibly more data in the CI results or some regression to the tool

@BenTheElder
Copy link
Member Author

#34879 (comment)
So the image with #34876 should've been available by 7:06 AM pacific

A few hours later we are currently running at https://prow.k8s.io/view/gs/kubernetes-ci-logs/logs/ci-test-infra-triage/1928154351960854528

@dims
Copy link
Member

dims commented May 29, 2025

@BenTheElder i was watching the previous run, and my change did not solve the issue. i've filed another one #34882 after looking through the logs, let's try that one next. If i don't make progress, will file reverts for both.

@BenTheElder
Copy link
Member Author

@dims your changes sound reasonable to keep IMHO, though I haven't had a chance to read in detail.

Something recent caused this to grow pretty steadily, seems like pathological data. Haven't been able to confirm and I'm out tomorrow (back monday)

@dims
Copy link
Member

dims commented May 31, 2025

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Categorizes issue or PR as related to a bug. sig/testing Categorizes an issue or PR as relevant to SIG Testing.
Projects
None yet
Development

No branches or pull requests

3 participants