Skip to content

[AUTOCUT] Gradle Check Flaky Test Report for IndexShardTests #14321

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
opensearch-ci-bot opened this issue Jun 13, 2024 · 4 comments
Open

[AUTOCUT] Gradle Check Flaky Test Report for IndexShardTests #14321

opensearch-ci-bot opened this issue Jun 13, 2024 · 4 comments
Assignees
Labels
autocut flaky-test Random test failure that succeeds on second run Storage Issues and PRs relating to data and metadata storage >test-failure Test failure from CI, local build, etc.

Comments

@opensearch-ci-bot
Copy link
Collaborator

opensearch-ci-bot commented Jun 13, 2024

Flaky Test Report for IndexShardTests

Noticed the IndexShardTests has some flaky, failing tests that failed during post-merge actions.

Details

Git Reference Merged Pull Request Build Details Test Name
17e8935 13844 39312 org.opensearch.index.shard.IndexShardTests.testRefreshLevelRestoreShardFromRemoteStore
1bdf43d 13866 39441 org.opensearch.index.shard.IndexShardTests.testRefreshLevelRestoreShardFromRemoteStore
679ccac 14248 40881 org.opensearch.index.shard.IndexShardTests.testRefreshLevelRestoreShardFromRemoteStore
a8b1734 18087 57282 org.opensearch.index.shard.IndexShardTests.testRestoreSearchOnlyShardFromStore

The other pull requests, besides those involved in post-merge actions, that contain failing tests with the IndexShardTests class are:

For more details on the failed tests refer to OpenSearch Gradle Check Metrics dashboard.

@sachinpkale
Copy link
Member

Fixed with #14418

@ankitkala ankitkala added Storage Issues and PRs relating to data and metadata storage and removed untriaged labels Jun 24, 2024
@vikasvb90 vikasvb90 removed the Indexing Indexing, Bulk Indexing and anything related to indexing label Jul 8, 2024
@linuxpi
Copy link
Contributor

linuxpi commented Jul 25, 2024

[Storage Triage - attendees 1 2 3 4 5 6 7 8]

@sachinpkale Looks like the issue was not fixed! Please feel free to raise a PR to address!

@linuxpi linuxpi moved this from 🆕 New to Ready To Be Picked in Storage Project Board Jul 25, 2024
@ashking94
Copy link
Member

ashking94 commented Jan 27, 2025

Tried reproducing this test by running the test for 2k+ iterations, but the failure did not occur. I also could not find any stack trace since all the build links have expired.

@github-project-automation github-project-automation bot moved this from Ready To Be Picked to ✅ Done in Storage Project Board Jan 27, 2025
@github-project-automation github-project-automation bot moved this from ✅ Done to 🏗 In progress in Storage Project Board Apr 28, 2025
@andrross
Copy link
Member

REPRODUCE WITH: ./gradlew ':server:test' --tests "org.opensearch.index.shard.IndexShardTests.testRestoreSearchOnlyShardFromStore" -Dtests.seed=90940F9D9F354EDC -Dtests.security.manager=true -Dtests.jvm.argline="-XX:TieredStopAtLevel=1 -XX:ReservedCodeCacheSize=64m" -Dtests.locale=ee-TG -Dtests.timezone=Asia/Ashgabat -Druntime.java=21

IndexShardTests > testRestoreSearchOnlyShardFromStore FAILED
    org.apache.lucene.index.CorruptIndexException: misplaced codec footer (file truncated?): length=0 but footerLength==16 (resource=metadata__9223372036854775746__9223372036854775804__9223372036854775802__9223372036854775805__-1039764442__9223370290998210694__2)
        at __randomizedtesting.SeedInfo.seed([90940F9D9F354EDC:C6CDC876E014858E]:0)
        at app//org.apache.lucene.codecs.CodecUtil.checksumEntireFile(CodecUtil.java:616)
        at app//org.opensearch.common.io.VersionedCodecStreamWrapper.readStream(VersionedCodecStreamWrapper.java:64)
        at app//org.opensearch.index.store.RemoteSegmentStoreDirectory.readMetadataFile(RemoteSegmentStoreDirectory.java:258)
        at app//org.opensearch.index.store.RemoteSegmentStoreDirectory.readLatestMetadataFile(RemoteSegmentStoreDirectory.java:247)
        at app//org.opensearch.index.store.RemoteSegmentStoreDirectory.init(RemoteSegmentStoreDirectory.java:154)
        at app//org.opensearch.index.store.RemoteSegmentStoreDirectory.<init>(RemoteSegmentStoreDirectory.java:141)
        at app//org.opensearch.index.shard.IndexShardTestCase.createRemoteSegmentStoreDirectory(IndexShardTestCase.java:814)
        at app//org.opensearch.index.shard.IndexShardTestCase.createRemoteStore(IndexShardTestCase.java:802)
        at app//org.opensearch.index.shard.IndexShardTestCase.newShard(IndexShardTestCase.java:672)
        at app//org.opensearch.index.shard.IndexShardTestCase.newShard(IndexShardTestCase.java:528)
        at app//org.opensearch.index.shard.IndexShardTestCase.reinitShard(IndexShardTestCase.java:892)
        at app//org.opensearch.index.shard.IndexShardTestCase.reinitShard(IndexShardTestCase.java:863)
        at app//org.opensearch.index.shard.IndexShardTestCase.reinitShard(IndexShardTestCase.java:858)
        at app//org.opensearch.index.shard.IndexShardTests.testRestoreSearchOnlyShardFromStore(IndexShardTests.java:3056)

@mch2 @vinaykpud Is this perhaps related to the reader/writer separation work?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
autocut flaky-test Random test failure that succeeds on second run Storage Issues and PRs relating to data and metadata storage >test-failure Test failure from CI, local build, etc.
Projects
Status: 🏗 In progress
Development

No branches or pull requests

9 participants