[qob] Query on Batch worker stages should fail fast #14597

Open
daniel-goldstein opened this issue Jul 2, 2024 · 0 comments · May be fixed by #14745

What happened?

Currently, Query on Batch waits for every worker in a stage to complete before collecting the results, even when a worker has already failed. Now that Query on Batch uses job groups, we can use the job group's cancel_after_n_failures setting to cancel the remaining jobs in a stage once a given number of them have failed.

Query on Batch should set cancel_after_n_failures = 1 so that the user can see the error of the failed partition without waiting for all the partitions to run (and paying for them).
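
To illustrate the intended behavior, here is a minimal, self-contained sketch that simulates a stage of partitions in plain Python. It does not call the Batch API: `run_stage`, `StageResult`, `ok`, and `bad` are hypothetical names made up for this example, and only the `cancel_after_n_failures` parameter name is taken from the job group setting described above. Jobs run one at a time here for determinism, whereas real workers run concurrently.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class StageResult:
    """Outcome of a simulated stage (hypothetical type for this sketch)."""
    completed: List[int] = field(default_factory=list)
    failed: List[int] = field(default_factory=list)
    cancelled: List[int] = field(default_factory=list)
    first_error: Optional[str] = None


def run_stage(
    partitions: List[Callable[[], None]],
    cancel_after_n_failures: Optional[int] = None,
) -> StageResult:
    """Run partitions in order; once the failure threshold is reached,
    mark every remaining partition as cancelled instead of running it."""
    result = StageResult()
    for index, work in enumerate(partitions):
        threshold_hit = (
            cancel_after_n_failures is not None
            and len(result.failed) >= cancel_after_n_failures
        )
        if threshold_hit:
            result.cancelled.append(index)
            continue
        try:
            work()
            result.completed.append(index)
        except Exception as exc:
            result.failed.append(index)
            if result.first_error is None:
                result.first_error = f"partition {index}: {exc}"
    return result


def ok() -> None:
    """A partition that succeeds."""


def bad() -> None:
    """A partition that fails."""
    raise ValueError("boom")


stage = [ok, bad, ok, ok]

# Current behavior: every partition runs even though partition 1 failed.
wait_for_all = run_stage(stage)

# Proposed behavior: the first failure cancels the rest of the stage.
fail_fast = run_stage(stage, cancel_after_n_failures=1)
```

In this simulation, `wait_for_all` runs all four partitions before the error is reported, while `fail_fast` stops after partition 1 and leaves partitions 2 and 3 cancelled, so the error surfaces without running (or paying for) the rest.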

Version

0.2.131

Relevant log output

No response

@daniel-goldstein daniel-goldstein removed the needs-triage A brand new issue that needs triaging. label Jul 9, 2024
@patrick-schultz patrick-schultz linked a pull request Oct 24, 2024 that will close this issue

2 participants