[gpfsug-discuss] Odd behavior - GPSF failed to start after initial node add

IBM Spectrum Scale scale at us.ibm.com
Mon Jun 5 22:49:23 BST 2017


Looks like a bug in the code.  The command hung in grep command.  It has
missing argument.

Please open a PMR to have this fix.

Instead of "service gpfs start", can you use mmstartup?  You can also try
to run mm list command before service gpfs start as a workaround.

Regards, The Spectrum Scale (GPFS) team

------------------------------------------------------------------------------------------------------------------

If you feel that your question can benefit other users of  Spectrum Scale
(GPFS), then please post it to the public IBM developerWroks Forum at
https://www.ibm.com/developerworks/community/forums/html/forum?id=11111111-0000-0000-0000-000000000479.


If your query concerns a potential software error in Spectrum Scale (GPFS)
and you have an IBM software maintenance contract please contact
1-800-237-5511 in the United States or your local IBM Service Center in
other countries.

The forum is informally monitored as time permits and should not be used
for priority messages to the Spectrum Scale (GPFS) team.



From:	"Oesterlin, Robert" <Robert.Oesterlin at nuance.com>
To:	gpfsug main discussion list <gpfsug-discuss at spectrumscale.org>
Date:	06/05/2017 11:54 AM
Subject:	[gpfsug-discuss] Odd behavior - GPSF failed to start after
            initial	node add
Sent by:	gpfsug-discuss-bounces at spectrumscale.org



Our node build process re-adds a node to the cluster and then does a
“service gpfs start”, but GPFS doesn’t start.  From the build log:

+ ssh -o StrictHostKeyChecking=no nrg1-gpfs01.nrg1.us.grid.nuance.com
'/usr/local/sbin/addnode.sh cnq-r02r09u27.nrg1.us.grid.nuance.com'
+ rc=0
+ chkconfig gpfs on
+ service gpfs start

The “service gpfs start” command hangs and never seems to return.

If I look at the process tree:

[root at cnq-r02r09u27 ~]# ps ax | egrep "mm|gpfs"
11715 ?        S      0:00 /bin/bash ./nrgX_gpfs_post
12191 ?        Ssl    0:00 /usr/lpp/mmfs/bin/mmsdrserv 1191 10
10 /var/adm/ras/mmsdrserv.log 128 yes no
12208 ?        S
0:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmccrmonitor 15
12271 ?        S      0:00 /bin/sh /sbin/service gpfs start
12276 ?        S      0:00 /bin/sh /etc/init.d/gpfs start
12278 ?        S
0:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmautoload reboot
12292 ?        S
0:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmautoload reboot
12293 ?        S      0:00 /bin/grep -lw /var/mmfs/gen/nodeFiles/*.num
12294 ?        S      0:00 /bin/sed -e s%/var/mmfs/gen/nodeFiles/....%% -e
s/\.num$//
21639 ?        S
0:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmccrmonitor 15

This is GPFS 4.2.2-1

This seems to occur only on the initial startup after build - if I try to
start GPFS again, it works just fine - any ideas on what it’s sitting here
waiting? Nothing in mmfslog (does not exist)

Bob Oesterlin
Sr Principal Storage Engineer, Nuance
507-269-0413

 _______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://gpfsug.org/pipermail/gpfsug-discuss_gpfsug.org/attachments/20170605/0274715a/attachment-0002.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: graycol.gif
Type: image/gif
Size: 105 bytes
Desc: not available
URL: <http://gpfsug.org/pipermail/gpfsug-discuss_gpfsug.org/attachments/20170605/0274715a/attachment-0002.gif>


More information about the gpfsug-discuss mailing list