-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add GPFS health check #71
base: dev
Are you sure you want to change the base?
Conversation
One thing that could probably be improved is allowing path to mmhealth to be changed to avoid hardcoding the value. |
Made path to mmhealth configurable and updated README. |
We noticed something with GPFS can cause |
We currently run
So very similar. Worth nothing is that this was broken for a little while pretty recently (some version of 4.2.3.x it must have been), and in the interim we had to do this instead:
IBM helped us figure that one out (which I guess is only fair as they broke For
|
This looks awesome, Trey! This will go into nhc/dev as soon as 1.4.3 is out the door. Thanks much! |
I have only deployed this onto one system and one where I knew there were GPFS network issues with nodes not using RDMA that was configured:
Configured check:
One thing I am not sure on for behavior is what to do if the configured component isn't found in output, right now if you do
check_gpfs_health FOO
, there is no warning of failure.