roachtest: sql-stats/mixed-version failed #146699
Comments
Removing release blocker as we've seen this flake before.
Seems like node 1 was draining/shutting down while this happened. Here's when we're making our request:
Here's node 1 starting shutdown prior to our request:
Perhaps we should have longer timeouts to account for this possibility during mixed-version testing. Or we should reduce our own timeouts in the code to fail faster. This would be the code here: https://github.com/cockroachdb/cockroach/blob/master/pkg/server/statements.go#L77-L94
Pretty suspicious that we set
Previously, we had some HTTP requests to SQL Stats which did not retry. Retries have been added there. Additionally, we will retry on HTTP errors as well to deal with timeout errors and cluster network issues. This test is bringing nodes up and down during upgrades and a retry is sometimes necessary to make sure we don't get stuck waiting on a draining node.
Resolves: cockroachdb#146699
Release note: None
Ended up adding retries to the test, not the server. I attempted adding them to the server but realized that it is best to push this down to the client and have the server not mask the issue. Otherwise, we end up with more conditional behavior on the server and need to cross-reference with timeout settings. Simpler for the client to expect to retry.
roachtest.sql-stats/mixed-version failed with artifacts on release-24.3 @ c0f83306a8557f08d060eb776186b47b92c18615:
Parameters:
arch=arm64
cloud=aws
coverageBuild=false
cpu=4
encrypted=false
fs=ext4
localSSD=true
mvtDeploymentMode=shared-process
mvtVersions=v23.2.10 → v24.1.14 → release-24.3
runtimeAssertionsBuild=false
ssd=0
Help
See: roachtest README
See: How To Investigate (internal)
Grafana is not yet available for aws clusters
This test on roachdash | Improve this report!
Jira issue: CRDB-50661