
Run job_config handlers in new tasks #2637

Conversation

@majamassarini (Member) commented Nov 12, 2024

There is a correlation between this new exception in Sentry
and some hard time limit errors; look at the graphs.

In Splunk, the above Celery task ID had more than 25,528 events associated with it and ran for a really long time.

If we run all the handlers for all the job_configs sequentially, for every entry in the DB that needs babysitting, we can reach the hard time limit for the task.
If we fail to babysit those DB entries, the pending tasks add up, leading to even more "hard time limit" exceptions.
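
Below is a minimal sketch of the idea behind the change, with hypothetical names (it is not the actual packit-service code): instead of running every handler for every job_config inside the single babysit task, each handler run is sent to Celery as its own task, so no single task has to finish all the work within the hard time limit.

# Minimal sketch with hypothetical names; not the actual packit-service code.
from celery import Celery

# The broker URL is just an example.
app = Celery("packit_service", broker="redis://localhost:6379/0")


@app.task
def run_handler(handler_name: str, job_config: dict) -> None:
    # Each handler/job_config pair runs in its own Celery task, with its own
    # time limit, instead of accumulating inside one long babysit task.
    ...


def babysit_entry(handler_names: list[str], job_configs: list[dict]) -> None:
    # The babysit task only enqueues the work; it no longer runs the handlers
    # sequentially, so it stays well under the hard time limit.
    for handler_name in handler_names:
        for job_config in job_configs:
            run_handler.delay(handler_name, job_config)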

@majamassarini majamassarini requested a review from a team as a code owner November 12, 2024 14:47

@majamassarini majamassarini force-pushed the parallelize-work-for-babysit-tasks branch 2 times, most recently from 0038fd1 to 6d41c24 Compare November 13, 2024 11:21

@majamassarini majamassarini force-pushed the parallelize-work-for-babysit-tasks branch from 6bc5293 to fc19acd Compare November 13, 2024 11:30

@majamassarini majamassarini force-pushed the parallelize-work-for-babysit-tasks branch from f222051 to fe2d4f1 Compare November 13, 2024 11:36

@majamassarini majamassarini force-pushed the parallelize-work-for-babysit-tasks branch from 90806f8 to 507bb5e Compare November 13, 2024 12:41

@lbarcziova (Member) left a comment

this is a great improvement, thanks Maja!

@@ -55,6 +57,13 @@
logger = logging.getLogger(__name__)


def celery_run_async(signatures: list[Signature]) -> None:
    logger.debug("Signatures are going to be sent to Celery (from update_copr_build_state).")

this can also be called from update_testing_farm_run, so the log message should be adjusted
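
A possible shape for such a helper, with a caller-agnostic log message as suggested above; this is only a sketch, not necessarily what the PR ends up with:

# Sketch only; the actual helper in the PR may differ.
import logging

from celery.canvas import Signature

logger = logging.getLogger(__name__)


def celery_run_async(signatures: list[Signature]) -> None:
    # Send each prepared signature to the Celery broker without waiting for
    # a result, so every handler runs in its own task.
    logger.debug("Signatures are going to be sent to Celery.")
    for signature in signatures:
        signature.apply_async()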

If we run all the handlers for all the job_configs sequentially,
for every entry in the DB that needs babysitting,
we can reach the hard time limit for the task.

I did not make the VM image build handlers run in parallel,
because the code currently needs the tasks' output.

Co-authored-by: Nikola Forró <[email protected]>
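
As a hedged illustration of that last point (hypothetical names, not the actual packit-service code): the VM image build path stays sequential because the calling code consumes each handler's result right away, so the handlers cannot simply be fired off as Celery signatures without also collecting the task results.

# Hypothetical contrast; not the actual packit-service code.
def run_vm_image_build_handlers(handlers: list, job_configs: list) -> list:
    results = []
    for handler in handlers:
        for job_config in job_configs:
            # The return value is needed immediately by the caller, so this
            # cannot be replaced by signature.apply_async() without also
            # gathering the results of the dispatched tasks.
            results.append(handler(job_config))
    return results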
@majamassarini majamassarini force-pushed the parallelize-work-for-babysit-tasks branch from 507bb5e to b24c82c Compare November 13, 2024 13:16

@majamassarini majamassarini added the mergeit label (When set, zuul will gate and merge the PR.) Nov 13, 2024

Build succeeded (gate pipeline).
https://softwarefactory-project.io/zuul/t/packit-service/buildset/cf0aa521a72441fdbf91899c4c56c326

✔️ pre-commit SUCCESS in 2m 07s

@softwarefactory-project-zuul softwarefactory-project-zuul bot merged commit 2004555 into packit:main Nov 13, 2024
4 checks passed
Labels
mergeit When set, zuul will gate and merge the PR.
3 participants