Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

avoid using fqdn? #58

Open
dcherian opened this issue Sep 7, 2021 · 4 comments
Open

avoid using fqdn? #58

dcherian opened this issue Sep 7, 2021 · 4 comments

Comments

@dcherian
Copy link
Contributor

dcherian commented Sep 7, 2021

I landed on a casper node that was named crthc02.hpc.ucar.edu instead of crhtc02.hpc.ucar.edu which broke ncar_jobqueue's regex.

I emailed cislhelp and they fixed it but also suggested not using the FQDN...

I'd also suggest that you avoid if you can, using the FQDN as an identifier for whatever purpose you're using it for.

Perhaps we should talk to them and figure out a better solution.

@kmpaul
Copy link
Contributor

kmpaul commented Sep 13, 2021

Did they suggest an alternative solution? I don't know of any other mechanism to determine if your node is in the Casper cluster or the Cheyenne cluster.

@dcherian
Copy link
Contributor Author

I didn't ask them. I thought it would be better for xdev to open up a new conversation rather than extending the scope of that ticket.

@andersy005
Copy link
Contributor

Did they suggest an alternative solution? I don't know of any other mechanism to determine if your node is in the Casper cluster or the Cheyenne cluster.

Ccing @jbaksta

@jbaksta
Copy link

jbaksta commented Sep 15, 2021

Why not just explicitly state which resource you're targeting as part of a job submission process? Is there a reason to tie you to a piece of hardware so to speak rather than just set an environment variable that says you submitted to Casper or Cheyenne? Basically, why inspect when you can be explicit on a submission? Hostnames are likely to be much more fluid; especially as we look at higher levels of enablement w/ Linux namespaces.

An alternative could be to inspect the $PBS_JOBID. Usually the CSG modules loaded set a specific environment variable too because they use something like that for $PATH building since we have shared application storage. At least with default modules on Cheyenne and Casper you'll have the two following set:

NCAR_HOST=cheyenne

NCAR_HOST=dav

Note that cross submission between clusters (new-ish PBS capability we're enabling), the environment may get reset during job submission, but loading the ncarenv module gives you the above.

@andersy005 andersy005 added this to Xdev Oct 22, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: No status
Development

No branches or pull requests

4 participants