[gpfsug-discuss] "mmhealth cluster show" returns error

Anna Christina Wagner Anna.Wagner at de.ibm.com
Thu May 11 12:28:22 BST 2017


Hello Bob,

4.2.2 is the release were we introduced "mmhealth cluster show". And you 
are totally right, it can be a little fragile at times.

So a short explanation: 
We had this situation on test machines as well. Because of issues with the 
system not only the mm-commands but also usual Linux commands 
took more than 10 seconds to return. We have internally a default time out 
of 10 seconds for cli commands. So if you had a failover situation, in 
which the cluster manager 
was changed (we have our cluster state manager (CSM) on the cluster 
manager) and the mmlsmgr command did not return in 10 seconds the node 
does not
know, that it is the CSM and will not start the corresponding service for 
that. 


If you want me to look further into it or if you have feedback regarding 
mmhealth please feel free to send me an email (Anna.Wagner at de.ibm.com)

Mit freundlichen Grüßen / Kind regards

Wagner, Anna Christina

Software Engineer, Spectrum Scale Development
IBM Systems

IBM Deutschland Research & Development GmbH / Vorsitzende des 
Aufsichtsrats: Martina Koederitz
Geschäftsführung: Dirk Wittkopp
Sitz der Gesellschaft: Böblingen / Registergericht: Amtsgericht Stuttgart, 
HRB 243294 



From:   "Oesterlin, Robert" <Robert.Oesterlin at nuance.com>
To:     gpfsug main discussion list <gpfsug-discuss at spectrumscale.org>
Date:   10.05.2017 18:21
Subject:        Re: [gpfsug-discuss] "mmhealth cluster show" returns error
Sent by:        gpfsug-discuss-bounces at spectrumscale.org



Yea, it?s fine. 

I did manage to get it to respond after I did a ?mmsysmoncontrol restart? 
but it?s still not showing proper status across the cluster.

Seems a bit fragile :-) 

Bob Oesterlin
Sr Principal Storage Engineer, Nuance
 
 

On 5/10/17, 10:46 AM, "gpfsug-discuss-bounces at spectrumscale.org on behalf 
of valdis.kletnieks at vt.edu" <gpfsug-discuss-bounces at spectrumscale.org on 
behalf of valdis.kletnieks at vt.edu> wrote:

    On Wed, 10 May 2017 14:13:56 -0000, "Oesterlin, Robert" said:
 
    > [root]# mmhealth cluster show
    > nrg1-gpfs16.nrg1.us.grid.nuance.com: Could not find the cluster 
state manager. It may be in an failover process. Please try again in a few 
seconds.
 
    Does 'mmlsmgr' return something sane?
 

_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss




-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://gpfsug.org/pipermail/gpfsug-discuss_gpfsug.org/attachments/20170511/7c811cf3/attachment-0002.htm>


More information about the gpfsug-discuss mailing list