Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] org.opensearch.remotestore.RemoteStoreClusterStateRestoreIT.testDataStreamPostRemoteStateRestore is flaky #11483

Closed
reta opened this issue Dec 5, 2023 · 1 comment
Assignees
Labels
bug Something isn't working Cluster Manager flaky-test Random test failure that succeeds on second run Storage:Remote

Comments

@reta
Copy link
Collaborator

reta commented Dec 5, 2023

Describe the bug
The test case org.opensearch.remotestore.RemoteStoreClusterStateRestoreIT.testDataStreamPostRemoteStateRestore is flaky:

org.opensearch.remotestore.RemoteStoreClusterStateRestoreIT.testDataStreamPostRemoteStateRestore

com.carrotsearch.randomizedtesting.UncaughtExceptionError: Captured an uncaught exception in thread: Thread[id=2588, name=opensearch[node_t2][remote_refresh_retry][T#1], state=RUNNABLE, group=TGRP-RemoteStoreClusterStateRestoreIT]
	at __randomizedtesting.SeedInfo.seed([6F53DEF57F1BE2B6:704F47A55513DDB]:0)
Caused by: org.opensearch.core.concurrency.OpenSearchRejectedExecutionException: rejected execution of java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask@ea40f13[Not completed, task = java.util.concurrent.Executors$RunnableAdapter@7665f2c5[Wrapped task = [threaded] org.opensearch.index.shard.ReleasableRetryableRefreshListener$$Lambda/0x00007ff53ca2e3a8@509663e1]] on org.opensearch.threadpool.Scheduler$SafeScheduledThreadPoolExecutor@421eefc[Terminated, pool size = 0, active threads = 0, queued tasks = 0, completed tasks = 34]
	at __randomizedtesting.SeedInfo.seed([6F53DEF57F1BE2B6]:0)
	at app//org.opensearch.common.util.concurrent.OpenSearchAbortPolicy.rejectedExecution(OpenSearchAbortPolicy.java:67)
	at [email protected]/java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:841)
	at [email protected]/java.util.concurrent.ScheduledThreadPoolExecutor.delayedExecute(ScheduledThreadPoolExecutor.java:340)
	at [email protected]/java.util.concurrent.ScheduledThreadPoolExecutor.schedule(ScheduledThreadPoolExecutor.java:562)
	at app//org.opensearch.threadpool.ThreadPool.schedule(ThreadPool.java:487)
	at app//org.opensearch.index.shard.ReleasableRetryableRefreshListener.scheduleRetry(ReleasableRetryableRefreshListener.java:125)
	at app//org.opensearch.index.shard.ReleasableRetryableRefreshListener.scheduleRetry(ReleasableRetryableRefreshListener.java:178)
	at app//org.opensearch.index.shard.ReleasableRetryableRefreshListener.runAfterRefreshWithPermit(ReleasableRetryableRefreshListener.java:167)
	at app//org.opensearch.index.shard.ReleasableRetryableRefreshListener.lambda$scheduleRetry$2(ReleasableRetryableRefreshListener.java:126)
	at app//org.opensearch.common.util.concurrent.ThreadContext$ContextPreservingRunnable.run(ThreadContext.java:852)
	at [email protected]/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1144)
	at [email protected]/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:642)
	at [email protected]/java.lang.Thread.run(Thread.java:1583)

To Reproduce

./gradlew ':server:internalClusterTest' --tests "org.opensearch.remotestore.RemoteStoreClusterStateRestoreIT.testDataStreamPostRemoteStateRestore" -Dtests.seed=6F53DEF57F1BE2B6

Expected behavior
The test should always pass

Plugins
Standard

Screenshots
If applicable, add screenshots to help explain your problem.

Host/Environment (please complete the following information):

  • CI

Additional context

@reta
Copy link
Collaborator Author

reta commented Jun 19, 2024

Closing in favour of #14326

@reta reta closed this as completed Jun 19, 2024
@github-project-automation github-project-automation bot moved this from 🆕 New to ✅ Done in Cluster Manager Project Board Jun 19, 2024
@github-project-automation github-project-automation bot moved this from 🆕 New to ✅ Done in Storage Project Board Jun 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working Cluster Manager flaky-test Random test failure that succeeds on second run Storage:Remote
Projects
Status: ✅ Done
Status: ✅ Done
Development

No branches or pull requests

5 participants