[BUG]org.opensearch.search.aggregations.bucket.TermsFixedDocCountErrorIT.testSliceLevelDocCountErrorMultiShard {p0={"search.concurrent_segment_search.enabled":"true"}} flaky #11950

Closed
dblock opened this issue Jan 19, 2024 · 2 comments · Fixed by #11976
Labels
bug (Something isn't working), flaky-test (Random test failure that succeeds on second run), Storage:Durability (Issues and PRs related to the durability framework), untriaged

Comments

Member

dblock commented Jan 19, 2024

Describe the bug

java.lang.AssertionError: expected:<2> but was:<3>

Related component

Storage:Durability

To Reproduce

REPRODUCE WITH: ./gradlew ':server:internalClusterTest' --tests "org.opensearch.search.aggregations.bucket.TermsFixedDocCountErrorIT" -Dtests.method="testSliceLevelDocCountErrorMultiShard {p0={"search.concurrent_segment_search.enabled":"true"}}" -Dtests.seed=41B9C6A508DBF95D -Dtests.security.manager=true -Dtests.jvm.argline="-XX:TieredStopAtLevel=1 -XX:ReservedCodeCacheSize=64m" -Dtests.locale=es-MX -Dtests.timezone=Australia/North -Druntime.java=21
NOTE: leaving temporary files on disk at: /var/jenkins/workspace/gradle-check/search/server/build/testrun/internalClusterTest/temp/org.opensearch.search.aggregations.bucket.TermsFixedDocCountErrorIT_41B9C6A508DBF95D-001
NOTE: test params are: codec=Asserting(Lucene99): {s_value=Lucene99, index_uuid=Lucene99, _id=Lucene99, type=PostingsFormat(name=Asserting)}, docValues:{s_value=DocValuesFormat(name=Lucene90), _seq_no=DocValuesFormat(name=Asserting), _primary_term=DocValuesFormat(name=Lucene90), _version=DocValuesFormat(name=Lucene90)}, maxPointsInLeafNode=1593, maxMBSortInHeap=6.303042065496907, sim=Asserting(RandomSimilarity(queryNorm=false): {}), locale=es-MX, timezone=Australia/North
NOTE: Linux 5.15.0-1048-aws amd64/Eclipse Adoptium 21.0.1 (64-bit)/cpus=32,threads=1,free=460063632,total=933232640
NOTE: All tests run in this JVM: [RejectionActionIT, CloneIndexIT, GetIndexIT, ActiveShardsObserverIT, MinimumClusterManagerNodesIT, VotingConfigurationIT, ClusterSettingsIT, AliasedIndexDocumentActionsIT, HiddenIndexIT, MultiFieldsIntegrationIT, IndexingMemoryControllerIT, IndexPrimaryRelocationIT, UpdateNumberOfReplicasIT, SimpleNodesInfoIT, RemoteIndexRecoveryIT, RemoteStoreMultipartFileCorruptionIT, AggregationsIntegrationIT, GlobalIT, TermsFixedDocCountErrorIT]

Expected behavior

Test to pass.

Additional Details

https://build.ci.opensearch.org/job/gradle-check/32347/testReport/junit/org.opensearch.search.aggregations.bucket/TermsFixedDocCountErrorIT/testSliceLevelDocCountErrorMultiShard__p0___search_concurrent_segment_search_enabled___true___/

dblock added the bug, untriaged, and flaky-test labels Jan 19, 2024
github-actions bot added the Storage:Durability label Jan 19, 2024
jed326 self-assigned this Jan 22, 2024
Collaborator

jed326 commented Jan 22, 2024

This is a new test I added in #11680, will take a look.

Collaborator

jed326 commented Jan 22, 2024

assertEquals(2, segmentResponse.getIndices().get("idx_mshard_2").getShards().get(0).getShards()[0].getSegments().size());

The failure comes from the segment-count check on one of the indices: in the shared output we are getting 3 segments instead of 2. The segment count needs to be stable, and the docs need to be indexed in a specific way, for the doc count errors to be accurate here, so I will need to think more about how we can accomplish that.
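
To illustrate why an extra segment matters, here is a rough sketch of how a slice-level doc count error upper bound can shift when the same docs land in 3 segments instead of 2. This is a simplified model, not the OpenSearch implementation: the class `DocCountErrorSketch`, its methods, and the sample term counts are all hypothetical, and it assumes one slice per segment, where each slice returns its top `sliceSize` terms by count and contributes the count of its last returned term as its error.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Map;

// Hypothetical, simplified model of a slice-level doc count error upper bound.
// Assumption: one slice per segment; each slice returns its top `sliceSize` terms.
class DocCountErrorSketch {

    // Error contributed by one slice: 0 if it holds fewer than sliceSize terms,
    // otherwise the doc count of the last (smallest) term it returned.
    public static long sliceError(Map<String, Long> sliceCounts, int sliceSize) {
        if (sliceCounts.size() < sliceSize) {
            return 0;
        }
        List<Long> counts = new ArrayList<>(sliceCounts.values());
        counts.sort(Comparator.reverseOrder());
        return counts.get(sliceSize - 1);
    }

    // Upper bound for the whole request: the sum of the per-slice errors.
    public static long upperBound(List<Map<String, Long>> slices, int sliceSize) {
        long total = 0;
        for (Map<String, Long> slice : slices) {
            total += sliceError(slice, sliceSize);
        }
        return total;
    }

    public static void main(String[] args) {
        // Layout the test expects: the docs end up in two segments.
        List<Map<String, Long>> twoSegments = List.of(
            Map.of("a", 5L, "b", 3L, "c", 1L),
            Map.of("a", 2L, "b", 4L, "c", 2L)
        );
        // Same docs, but the second segment ended up split in two.
        List<Map<String, Long>> threeSegments = List.of(
            Map.of("a", 5L, "b", 3L, "c", 1L),
            Map.of("a", 2L, "b", 2L, "c", 1L),
            Map.of("b", 2L, "c", 1L)
        );
        System.out.println(upperBound(twoSegments, 2));   // prints 5
        System.out.println(upperBound(threeSegments, 2)); // prints 6
    }
}
```

Under this model the same documents give a bound of 5 with two segments and 6 with three, which is why an assertion on an exact doc count error only holds when the segment layout is deterministic.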
