Description
When a job operation is decorated with @with_job and @cmd in an environment that uses jsrun to launch jobs on the compute nodes (e.g. Summit), the job fails.
To reproduce
The following examples will fail:
    @Project.operation
    @flow.with_job
    @flow.cmd
    # ... pre and post conditions ...
    def foo(job):
        return ('trap "some commands --args" EXIT')
    @Project.operation
    @flow.cmd
    # ... pre and post conditions ...
    def foo(job):
        return ('trap "cd {}; some commands --args" EXIT'.format(job.ws))
The following will run:
    @Project.operation
    @flow.cmd
    # ... pre and post conditions ...
    def gen_pqr(job):
        return ("cd {}; some commands --args".format(job.ws))
Error output
bash-4.2$ jsrun -n1 python flowprojects/project.py run -o gen_pqr
[h50n13:02512] PMIX ERROR: INVALID-NAMESPACE in file dstore_base.c at line 1739
Error (No such file or directory) executing process: trap
Using environment configuration: SummitEnvironment
ERROR: Encountered error during program execution: 'Command 'jsrun -n 1 -a 1 -c 1 -g 0 -d packed -b rs trap"cd /path/to/job/ws/; some commands --args" EXIT' returned non-zero exit status 210.'
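One possible reading of the error above: jsrun tries to execute the first word after its own options as a program, and trap is a shell builtin rather than an executable, hence "No such file or directory". The sketch below only does string handling to show which word ends up in that position. The prefix is copied from the error message; the helper names `prefixed` and `launched_program` are hypothetical and not part of signac-flow.

```python
import shlex
import shutil

# Launcher options copied verbatim from the error message above.
JSRUN_PREFIX = "jsrun -n 1 -a 1 -c 1 -g 0 -d packed -b rs"


def prefixed(cmd):
    """Hypothetical helper: prepend the launcher prefix to an operation's command."""
    return "{} {}".format(JSRUN_PREFIX, cmd)


def launched_program(full_cmd):
    """Return the first token after the launcher options.

    Assumption: this is the word the launcher will try to execute.
    """
    tokens = shlex.split(full_cmd)
    n_prefix = len(shlex.split(JSRUN_PREFIX))
    return tokens[n_prefix]


failing = prefixed('trap "cd /path/to/job/ws/; some commands --args" EXIT')
print(launched_program(failing))

# trap is a shell builtin, so a PATH lookup usually finds no executable for it.
print(shutil.which("trap"))
```

If this reading is right, any command string that begins with a shell builtin would hit the same failure once the launcher prefix is placed in front of it.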
System configuration
Please complete the following information:
Operating System [e.g. macOS]: Red Hat Enterprise Linux (RHEL) version 7.6
Version of Python [e.g. 3.7]: 3.7.0
Version of signac [e.g. 1.0]: 1.6.0
Version of signac-flow: 0.12.0
klywang changed the title from "@flow.with_jobs does not work with @cmd in environments that use jsrun" to "@with_jobs does not work with @cmd in environments that use jsrun" on Apr 12, 2021
    return 'trap "cd $(pwd)" EXIT && cd {} && {}'.format(job.ws, func(job))
to be separate from what we submit. For example, rather than submitting jsrun -n 1 -a 1 -c 1 -g 0 -d packed -b rs trap "cd /path/to/job/ws/; some commands --args" EXIT, signac-flow would submit trap "cd {job.ws}; jsrun -n 1 -a 1 -c 1 -g 0 -d packed -b rs some commands --args
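The reordering suggested here can be sketched with plain string formatting. This is an illustration under assumptions, not signac-flow's actual implementation: `wrap_with_job`, `add_launcher`, `current_order`, and `proposed_order` are hypothetical names, the wrapper template is the one quoted above, and the launcher options are copied from the error output.

```python
# Launcher options copied from the error output; treated as an opaque prefix.
LAUNCHER = "jsrun -n 1 -a 1 -c 1 -g 0 -d packed -b rs"


def wrap_with_job(workspace, cmd):
    """Hypothetical stand-in for the with_job wrapper, using the quoted template."""
    return 'trap "cd $(pwd)" EXIT && cd {} && {}'.format(workspace, cmd)


def add_launcher(cmd):
    """Hypothetical stand-in for the environment prepending its launcher."""
    return "{} {}".format(LAUNCHER, cmd)


def current_order(workspace, cmd):
    # The launcher is applied to the already wrapped string,
    # so the launcher is asked to run `trap`.
    return add_launcher(wrap_with_job(workspace, cmd))


def proposed_order(workspace, cmd):
    # The launcher is applied to the user's command only;
    # trap and cd stay in the submitting shell.
    return wrap_with_job(workspace, add_launcher(cmd))


ws = "/path/to/job/ws"
print(current_order(ws, "some commands --args"))
print(proposed_order(ws, "some commands --args"))
```

In the second form the shell handles trap and cd itself, and the launcher only ever sees the user's command.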