[gpfsug-discuss] "mmhealth cluster show" returns error
Anna Christina Wagner
Anna.Wagner at de.ibm.com
Thu May 11 12:28:22 BST 2017
Hello Bob,
4.2.2 is the release where we introduced "mmhealth cluster show". And you
are totally right, it can be a little fragile at times.
So a short explanation:
We saw this situation on test machines as well. Because of problems with
the system, not only the mm commands but also ordinary Linux commands took
more than 10 seconds to return. Internally we have a default timeout of
10 seconds for CLI commands. Our cluster state manager (CSM) runs on the
cluster manager node. So if you had a failover situation in which the
cluster manager changed, and the mmlsmgr command did not return within
10 seconds, the node does not know that it is the CSM and will not start
the corresponding service.
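
To check whether a node is in the slow state described above, one option is to time a command against the same 10-second limit. The following is only a minimal sketch and not how mmhealth does it internally: the helper name check_command and the constant CLI_TIMEOUT_S are made up for illustration, and the 10-second value is simply taken from the explanation above.

```python
import subprocess
import time

# Assumption: mirrors the 10-second default CLI timeout mentioned above.
CLI_TIMEOUT_S = 10.0


def check_command(cmd, timeout_s=CLI_TIMEOUT_S):
    """Run cmd and report whether it returned within timeout_s.

    Returns (status, elapsed_seconds) where status is one of:
      "ok"      - the command returned in time (exit code is not checked)
      "timeout" - the command was still running after timeout_s
      "missing" - the command is not installed on this node
    """
    start = time.monotonic()
    try:
        subprocess.run(
            cmd,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout_s,
        )
        status = "ok"
    except subprocess.TimeoutExpired:
        # subprocess.run kills the child before raising this.
        status = "timeout"
    except FileNotFoundError:
        status = "missing"
    return status, time.monotonic() - start


if __name__ == "__main__":
    # mmlsmgr is the command named above; uptime stands in for
    # "usual Linux commands".
    for cmd in (["mmlsmgr"], ["uptime"]):
        status, elapsed = check_command(cmd)
        print(f"{' '.join(cmd)}: {status} after {elapsed:.1f}s")
```

If both commands report "timeout", the node is probably in the condition described above, and the CSM may not have been started on the new cluster manager.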
If you want me to look further into it, or if you have feedback regarding
mmhealth, please feel free to send me an email (Anna.Wagner at de.ibm.com).
Kind regards
Wagner, Anna Christina
Software Engineer, Spectrum Scale Development
IBM Systems
IBM Deutschland Research & Development GmbH / Chairwoman of the
Supervisory Board: Martina Koederitz
Management: Dirk Wittkopp
Registered office: Böblingen / Court of registration: Amtsgericht Stuttgart,
HRB 243294
From: "Oesterlin, Robert" <Robert.Oesterlin at nuance.com>
To: gpfsug main discussion list <gpfsug-discuss at spectrumscale.org>
Date: 10.05.2017 18:21
Subject: Re: [gpfsug-discuss] "mmhealth cluster show" returns error
Sent by: gpfsug-discuss-bounces at spectrumscale.org
Yea, it's fine.
I did manage to get it to respond after I did a "mmsysmoncontrol restart"
but it's still not showing proper status across the cluster.
Seems a bit fragile :-)
Bob Oesterlin
Sr Principal Storage Engineer, Nuance
On 5/10/17, 10:46 AM, "gpfsug-discuss-bounces at spectrumscale.org on behalf
of valdis.kletnieks at vt.edu" wrote:
On Wed, 10 May 2017 14:13:56 -0000, "Oesterlin, Robert" said:
> [root]# mmhealth cluster show
> nrg1-gpfs16.nrg1.us.grid.nuance.com: Could not find the cluster state
> manager. It may be in an failover process. Please try again in a few
> seconds.
Does 'mmlsmgr' return something sane?
_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss