<html dir="ltr">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=Windows-1252">
<style type="text/css" id="owaParaStyle">P {margin-top:0;margin-bottom:0;}</style>
</head>
<body fpstyle="1" ocsi="0">
<div style="direction: ltr;font-family: Tahoma;color: #000000;font-size: 10pt;">
<div>Hello Daniel,</div>
<div>I solved my problem by disabling the check (I run GPFS v4.2.3-5), which I did by putting</div>
<div><br>
</div>
<div>ib_rdma_enable_monitoring=False</div>
<div><br>
</div>
<div>in the [network] section of the file /var/mmfs/mmsysmon/mmsysmonitor.conf and then restarting mmsysmonitor.</div>
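<div><br>
</div>
<div>For anyone who wants to script that edit, here is a minimal Python sketch. It assumes that mmsysmonitor.conf is plain INI syntax that configparser can read, which is an assumption and not something confirmed in this thread; the function name is made up for illustration. Note that configparser drops comments when it rewrites a file, so keep a backup copy first.</div>
<pre>
```python
import configparser


def disable_ib_rdma_monitoring(conf_path):
    """Set ib_rdma_enable_monitoring=False in the [network] section of conf_path."""
    # interpolation=None keeps percent signs literal; strict=False and
    # allow_no_value=True make the parser tolerant of an unusual file.
    parser = configparser.ConfigParser(
        interpolation=None, strict=False, allow_no_value=True
    )
    parser.optionxform = str  # keep option names exactly as written
    parser.read(conf_path)

    if not parser.has_section("network"):
        parser.add_section("network")
    parser.set("network", "ib_rdma_enable_monitoring", "False")

    # Rewrites the whole file: comments in the original are not preserved.
    with open(conf_path, "w") as handle:
        parser.write(handle, space_around_delimiters=False)


# Usage (path taken from the message above; run as root on the affected node):
# disable_ib_rdma_monitoring("/var/mmfs/mmsysmon/mmsysmonitor.conf")
```
</pre>
<div>The monitor still has to be restarted afterwards for the change to take effect.</div>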
<div><br>
</div>
<div>There was a thread in this group about this problem.</div>
<div><br>
</div>
<div>   A<br>
</div>
<div><br>
</div>
<div style="font-family: Times New Roman; color: #000000; font-size: 16px">
<hr tabindex="-1">
<div id="divRpF969749" style="direction: ltr;"><font size="2" face="Tahoma" color="#000000"><b>From:</b> gpfsug-discuss-bounces@spectrumscale.org [gpfsug-discuss-bounces@spectrumscale.org] on behalf of Yaron Daniel [YARD@il.ibm.com]<br>
<b>Sent:</b> Sunday, July 01, 2018 7:17 PM<br>
<b>To:</b> gpfsug main discussion list<br>
<b>Subject:</b> Re: [gpfsug-discuss] How to get rid of very old mmhealth events<br>
</font><br>
</div>
<div></div>
<div><span style="font-size:10pt; font-family:sans-serif">Hi</span><br>
<br>
<span style="font-size:10pt; font-family:sans-serif">There was an issue with a Scale 5.x GUI error -
</span><span style="font-size:10pt; font-family:Tahoma">ib_rdma_nic_unrecognized(mlx5_0/2)</span><br>
<br>
<span style="font-size:10pt; font-family:sans-serif">Check if you have the patch:</span><br>
<br>
<span style="font-size:12pt">[root@gssio1 ~]#  diff /usr/lpp/mmfs/lib/mmsysmon/NetworkService.py /tmp/NetworkService.py<br>
229c229,230<br>
<         recognizedNICs = set(re.findall(r"verbsConnectPorts\[\d+\] +: (\w+/\d+)/\d+\n", mmfsadm))<br>
---<br>
>         #recognizedNICs = set(re.findall(r"verbsConnectPorts\[\d+\] +: (\w+/\d+)/\d+\n", mmfsadm))<br>
>          recognizedNICs = set(re.findall(r"verbsConnectPorts\[\d+\] +: (\w+/\d+)/\d+/\d+\n", mmfsadm))</span><span style="font-size:10pt; font-family:sans-serif"><br>
</span><br>
<br>
<span style="font-size:9pt; font-family:Arial">Then restart the system monitor: </span><span style="font-size:12pt"><b>mmsysmoncontrol restart
</b></span><span style="font-size:9pt; font-family:Arial"> </span><br>
<br>
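<span style="font-size:10pt; font-family:sans-serif">The effect of the regular-expression change in the diff above can be illustrated with a short Python sketch. The two patterns are copied from the diff; the sample lines are hypothetical strings built only to fit each pattern and are not real mmfsadm output, and the helper name is made up.</span><br>
<pre>
```python
import re

# Patterns copied from the diff: the original expects "device/port/x",
# the patched one expects "device/port/x/y".
ORIGINAL = r"verbsConnectPorts\[\d+\] +: (\w+/\d+)/\d+\n"
PATCHED = r"verbsConnectPorts\[\d+\] +: (\w+/\d+)/\d+/\d+\n"


def recognized_nics(pattern, mmfsadm_output):
    """Return the set of device/port names that the pattern extracts."""
    return set(re.findall(pattern, mmfsadm_output))


# Hypothetical sample lines, constructed only to fit the two patterns.
three_field = "verbsConnectPorts[0] : mlx5_0/1/0\n"
four_field = "verbsConnectPorts[0] : mlx5_0/1/0/0\n"

print(recognized_nics(ORIGINAL, four_field))  # set()
print(recognized_nics(PATCHED, four_field))   # {'mlx5_0/1'}
```
</pre>
<span style="font-size:10pt; font-family:sans-serif">If the original pattern finds no match, the set of recognized NICs stays empty, which would be consistent with an ib_rdma_nic_unrecognized event being raised.</span><br>
<br>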
<span style="font-size:10pt; font-family:Arial">Regards</span><br>
<span style="font-size:9pt; font-family:Arial"> </span><br>
<table style="border-collapse:collapse" width="780">
<tbody>
<tr height="8">
<td colspan="4" style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="780">
<div align="center">
<hr noshade="">
</div>
<br>
<span style="font-size:1pt; font-family:Arial"> </span></td>
</tr>
<tr height="8">
<td colspan="4" style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="780">
<span style="font-size:1pt; font-family:Arial"> </span></td>
</tr>
<tr height="8">
<td colspan="2" style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="516">
<span style="font-size:10pt; color:blue; font-family:Arial"><b>Yaron Daniel</b></span></td>
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="168">
<span style="font-size:10pt; color:#5f5f5f; font-family:Arial"> 94 Em Ha'Moshavot Rd</span></td>
<td rowspan="3" style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="96">
<div align="right"><img src="cid:_1_0B5B5F080B5B5954005EFD8BC22582BD" style="border:0px solid" align="bottom"></div>
</td>
</tr>
<tr height="8">
<td colspan="2" style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="516">
<span style="font-size:10pt; color:blue; font-family:Arial"><b>Storage Architect – IL Lab Services (Storage)</b></span></td>
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="168">
<span style="font-size:10pt; color:#5f5f5f; font-family:Arial"> Petach Tiqva, 49527</span></td>
</tr>
<tr height="8">
<td colspan="2" style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="516">
<span style="font-size:10pt; color:blue; font-family:Arial"><b>IBM Global Markets, Systems HW Sales</b></span></td>
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="168">
<span style="font-size:10pt; color:#5f5f5f; font-family:Arial"> Israel</span></td>
</tr>
<tr height="8">
<td colspan="2" style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="516">
<span style="font-size:10pt; color:blue; font-family:Arial"><b> </b></span></td>
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="168">
<span style="font-size:10pt; color:#5f5f5f; font-family:Arial"> </span></td>
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="96">
<span style="font-size:9pt; font-family:Arial"> </span></td>
</tr>
<tr height="8">
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="90">
<span style="font-size:10pt; color:#5f5f5f; font-family:Arial">Phone:</span></td>
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="426">
<span style="font-size:10pt; color:#5f5f5f; font-family:Arial">+972-3-916-5672</span></td>
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="168">
<span style="font-size:10pt; color:#5f5f5f; font-family:Arial"> </span></td>
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="96">
<span style="font-size:10pt"> </span></td>
</tr>
<tr height="8">
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="90">
<span style="font-size:10pt; color:#5f5f5f; font-family:Arial">Fax:</span></td>
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="426">
<span style="font-size:10pt; color:#5f5f5f; font-family:Arial">+972-3-916-5672</span></td>
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="168">
<span style="font-size:10pt; color:#5f5f5f; font-family:Arial">  </span></td>
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="96">
<span style="font-size:10pt"> </span></td>
</tr>
<tr height="8">
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="90">
<span style="font-size:10pt; color:#5f5f5f; font-family:Arial">Mobile:</span></td>
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="426">
<span style="font-size:10pt; color:#5f5f5f; font-family:Arial">+972-52-8395593</span></td>
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="168">
<span style="font-size:10pt; color:#5f5f5f; font-family:Arial">  </span></td>
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="96">
<span style="font-size:10pt"> </span></td>
</tr>
<tr height="8">
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="90">
<span style="font-size:10pt; color:#5f5f5f; font-family:Arial">e-mail:</span></td>
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="426">
<span style="font-size:10pt; color:#5f5f5f; font-family:Arial">yard@il.ibm.com</span></td>
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="168">
<span style="font-size:10pt; color:#5f5f5f; font-family:Arial">  </span></td>
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="96">
<span style="font-size:10pt"> </span></td>
</tr>
<tr height="8">
<td colspan="2" style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="516">
<a href="http://www.ibm.com/il/he/" target="_blank" rel="noopener noreferrer"><span style="font-size:10pt; color:blue; font-family:Arial"><u>IBM Israel</u></span></a></td>
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="168">
<span style="font-size:10pt; color:#5f5f5f; font-family:Arial">  </span></td>
<td style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="96">
<span style="font-size:10pt"> </span></td>
</tr>
<tr height="8">
<td colspan="4" style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="780">
<span style="font-size:9pt; color:#5f5f5f; font-family:Arial"> </span></td>
</tr>
<tr height="8">
<td colspan="4" style="border-style:none none none none; border-color:#000000; border-width:0px 0px 0px 0px; padding:0px 0px" width="780">
<span style="font-size:9pt; color:#5f5f5f; font-family:Arial"> </span></td>
</tr>
</tbody>
</table>
<p style="margin-top:0px; margin-bottom:0px"></p>
<br>
<br>
<br>
<br>
<span style="font-size:9pt; color:#5f5f5f; font-family:sans-serif">From:        </span><span style="font-size:9pt; font-family:sans-serif">"Andrew Beattie" &lt;abeattie@au1.ibm.com&gt;</span><br>
<span style="font-size:9pt; color:#5f5f5f; font-family:sans-serif">To:        </span><span style="font-size:9pt; font-family:sans-serif">gpfsug-discuss@spectrumscale.org</span><br>
<span style="font-size:9pt; color:#5f5f5f; font-family:sans-serif">Date:        </span><span style="font-size:9pt; font-family:sans-serif">06/28/2018 11:16 AM</span><br>
<span style="font-size:9pt; color:#5f5f5f; font-family:sans-serif">Subject:        </span><span style="font-size:9pt; font-family:sans-serif">Re: [gpfsug-discuss] How to get rid of very old mmhealth events</span><br>
<span style="font-size:9pt; color:#5f5f5f; font-family:sans-serif">Sent by:        </span><span style="font-size:9pt; font-family:sans-serif">gpfsug-discuss-bounces@spectrumscale.org</span><br>
<hr noshade="">
<br>
<br>
<br>
<span style="font-size:11pt; font-family:Arial">Do you know if there is actually a cable plugged into port 2?</span><br>
<span style="font-size:11pt; font-family:Arial"> </span><br>
<span style="font-size:11pt; font-family:Arial">The system will work fine as long as there is network connectivity, but you may have an issue with redundancy or loss of bandwidth if you do not have every port cabled and configured correctly.</span><br>
<span style="font-size:11pt; font-family:Arial"> </span><br>
<span style="font-size:11pt; font-family:Arial">Regards</span><br>
<span style="font-size:12pt; color:#80803f; font-family:sans-serif"><b>Andrew Beattie</b></span><br>
<span style="font-size:10pt; font-family:sans-serif"><b>Software Defined Storage  - IT Specialist</b></span><br>
<span style="font-size:8pt; color:#3f8080; font-family:sans-serif"><b>Phone: </b>
</span><span style="font-size:8pt; font-family:sans-serif">614-2133-7927</span><br>
<span style="font-size:8pt; color:#3f8080; font-family:sans-serif"><b>E-mail: </b>
</span><a href="mailto:abeattie@au1.ibm.com" target="_blank" rel="noopener noreferrer"><span style="font-size:8pt; color:#4f4f4f; font-family:sans-serif"><u>abeattie@au1.ibm.com</u></span></a><br>
<span style="font-size:11pt; font-family:Arial"> </span><br>
<span style="font-size:11pt; font-family:Arial"> </span><br>
<span style="font-size:11pt; font-family:Arial">----- Original message -----<br>
From: "Dorigo Alvise (PSI)" &lt;alvise.dorigo@psi.ch&gt;<br>
Sent by: gpfsug-discuss-bounces@spectrumscale.org<br>
To: "gpfsug-discuss@spectrumscale.org" &lt;gpfsug-discuss@spectrumscale.org&gt;<br>
Cc:<br>
Subject: [gpfsug-discuss] How to get rid of very old mmhealth events<br>
Date: Thu, Jun 28, 2018 6:08 PM<br>
</span><br>
<span style="font-size:10pt; font-family:Tahoma">Dear experts,</span><br>
<span style="font-size:10pt; font-family:Tahoma">I have a GL2 IBM system running Spectrum Scale v4.2.3-6 (RHEL 7.3).</span><br>
<span style="font-size:10pt; font-family:Tahoma">The system is working properly, but mmhealth reports a DEGRADED status for the NETWORK component:</span><br>
<span style="font-size:10pt; font-family:Tahoma"> </span><br>
<span style="font-size:10pt; font-family:Tahoma">[root@sf-gssio1 ~]# mmhealth node show<br>
<br>
Node name:      sf-gssio1.psi.ch<br>
Node status:    DEGRADED<br>
Status Change:  23 min. ago<br>
<br>
Component       Status        Status Change     Reasons<br>
-------------------------------------------------------------------------------------------------------------------------------------------<br>
GPFS            HEALTHY       22 min. ago       -<br>
NETWORK         DEGRADED      145 days ago      ib_rdma_link_down(mlx5_0/2), ib_rdma_nic_down(mlx5_0/2), ib_rdma_nic_unrecognized(mlx5_0/2)</span><br>
<span style="font-size:10pt; font-family:Tahoma">[...]</span><br>
<span style="font-size:10pt; font-family:Tahoma"> </span><br>
<span style="font-size:10pt; font-family:Tahoma">This event is clearly spurious, because the network, verbs and IB are all working correctly:</span><br>
<span style="font-size:10pt; font-family:Tahoma"> </span><br>
<span style="font-size:10pt; font-family:Tahoma">[root@sf-gssio1 ~]# mmfsadm test verbs status<br>
VERBS RDMA status: started</span><br>
<span style="font-size:10pt; font-family:Tahoma"> </span><br>
<span style="font-size:10pt; font-family:Tahoma">[root@sf-gssio1 ~]# mmlsconfig verbsPorts|grep gssio1<br>
verbsPorts mlx5_0/1 [sf-ems1,sf-gssio1,sf-gssio2]</span><br>
<span style="font-size:10pt; font-family:Tahoma"> </span><br>
<span style="font-size:10pt; font-family:Tahoma">[root@sf-gssio1 ~]# mmdiag --config|grep verbsPorts<br>
! verbsPorts mlx5_0/1</span><br>
<span style="font-size:10pt; font-family:Tahoma"> </span><br>
<span style="font-size:10pt; font-family:Tahoma">[root@sf-gssio1 ~]# ibstat  mlx5_0<br>
CA 'mlx5_0'<br>
   CA type: MT4113<br>
   Number of ports: 2<br>
   Firmware version: 10.16.1020<br>
   Hardware version: 0<br>
   Node GUID: 0xec0d9a03002b5db0<br>
   System image GUID: 0xec0d9a03002b5db0<br>
   Port 1:<br>
       State: Active<br>
       Physical state: LinkUp<br>
       Rate: 56<br>
       Base lid: 42<br>
       LMC: 0<br>
       SM lid: 1<br>
       Capability mask: 0x26516848<br>
       Port GUID: 0xec0d9a03002b5db0<br>
       Link layer: InfiniBand<br>
   Port 2:<br>
       State: Down<br>
       Physical state: Disabled<br>
       Rate: 10<br>
       Base lid: 65535<br>
       LMC: 0<br>
       SM lid: 0<br>
       Capability mask: 0x26516848<br>
       Port GUID: 0xec0d9a03002b5db8<br>
       Link layer: InfiniBand</span><br>
<span style="font-size:10pt; font-family:Tahoma"> </span><br>
<span style="font-size:10pt; font-family:Tahoma">That event has been there for 145 days, and it did not go away after a daemon restart (mmshutdown/mmstartup).</span><br>
<span style="font-size:10pt; font-family:Tahoma">My question is: how can I get rid of this event and restore the mmhealth output to HEALTHY? This is important because I have Nagios sensors that periodically parse the "mmhealth -Y ..." output, and at the moment I have to disable their email notifications (which is not good if a real problem occurs).</span><br>
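<span style="font-size:10pt; font-family:Tahoma">A check of the kind described above might look roughly like the following Python sketch. It parses the human-readable table printed by "mmhealth node show" (sample copied from this message, with the dashed rule shortened), not the -Y format, whose layout is not shown here; the function name is hypothetical.</span><br>
<pre>
```python
import re

SAMPLE = """\
Component       Status        Status Change     Reasons
-----------------------------------------------------------
GPFS            HEALTHY       22 min. ago       -
NETWORK         DEGRADED      145 days ago      ib_rdma_link_down(mlx5_0/2), ib_rdma_nic_down(mlx5_0/2), ib_rdma_nic_unrecognized(mlx5_0/2)
"""


def unhealthy_components(table_text):
    """Map each non-HEALTHY component to its list of reasons."""
    problems = {}
    for line in table_text.splitlines():
        # Columns are separated by runs of two or more spaces.
        fields = re.split(r"\s{2,}", line.strip(), maxsplit=3)
        # Skip the header, the dashed rule and anything that is not a 4-column row.
        if len(fields) != 4 or fields[0] == "Component":
            continue
        component, status, _changed, reasons = fields
        if status != "HEALTHY":
            problems[component] = [r.strip() for r in reasons.split(",")]
    return problems


print(unhealthy_components(SAMPLE))
```
</pre>
<span style="font-size:10pt; font-family:Tahoma">With the sample above, only NETWORK is returned, together with its three ib_rdma reasons for mlx5_0/2.</span><br>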
<span style="font-size:10pt; font-family:Tahoma"> </span><br>
<span style="font-size:10pt; font-family:Tahoma">Thanks,</span><br>
<span style="font-size:10pt; font-family:Tahoma"> </span><br>
<span style="font-size:10pt; font-family:Tahoma">  Alvise</span><br>
<tt><span style="font-size:10pt">_______________________________________________<br>
gpfsug-discuss mailing list<br>
gpfsug-discuss at spectrumscale.org</span></tt><tt><span style="font-size:10pt; color:blue"><u><br>
</u></span></tt><a href="http://gpfsug.org/mailman/listinfo/gpfsug-discuss" target="_blank" rel="noopener noreferrer"><tt><span style="font-size:10pt; color:blue"><u>http://gpfsug.org/mailman/listinfo/gpfsug-discuss</u></span></tt></a><br>
<span style="font-size:11pt; font-family:Arial"> </span><br>
<br>
<br>
</div>
</div>
</div>
</body>
</html>