Job monitoring utilities #2

wlandau · 2023-10-20T14:47:47Z

It would be great to have standalone R function utilities to manage batch jobs. These would run in the user's interactive session outside the targets pipeline / crew controller. I am thinking of covering the same functionality as qsub, qstat, and qdel in SGE (sbatch, squeue, and scancel in SLURM), plus log files. Proposal:

crew_aws_batch_submit(): submit a job that runs some code (R or shell). This could help e.g. submit a targets pipeline as a Batch job which submits other Batch jobs.
crew_aws_batch_status(): get the status of jobs in a given job queue / job definition.
crew_aws_batch_terminate(): terminate one or more jobs with specific job names/IDs/ARNs.
crew_aws_batch_logs(): log files for one or more jobs, or for an entire job definition. This would really help detect tricky worker-level errors such as running out of memory or hitting a price spike that terminates spot instances.

The text was updated successfully, but these errors were encountered:

DyfanJones · 2023-11-06T14:09:55Z

Hi @wlandau for crew_aws_batch_logs you could hack smdocker logging method. In short when it is building a docker using AWS CodeBuild it returns the AWS CloudWatch logs back the to the console for R users to monitor and check.

https://github.com/DyfanJones/sm-docker/blob/main/R/logs.R

I am happy to contribute on this if you think it is possible solution for your problem :)

wlandau · 2023-12-08T21:14:14Z

Thanks for the input, @DyfanJones! I think I implemented what I need in https://github.com/wlandau/crew.aws.batch#job-management, but it would be amazing to have help with paws-r/paws#721 so I can request paginated downloads for log files.

wlandau self-assigned this Oct 20, 2023

This was referenced Dec 8, 2023

Allow daemons to exit immediately when the connection terminates? shikokuchuo/mirai#87

Closed

Infinite loop while paginating paws.management::cloudwatchlogs()$get_log_events() paws-r/paws#721

Closed

wlandau-lilly closed this as completed in 93f4db8 Dec 8, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Job monitoring utilities #2

Job monitoring utilities #2

wlandau commented Oct 20, 2023

DyfanJones commented Nov 6, 2023

wlandau commented Dec 8, 2023

Job monitoring utilities #2

Job monitoring utilities #2

Comments

wlandau commented Oct 20, 2023

DyfanJones commented Nov 6, 2023

wlandau commented Dec 8, 2023