[gpfsug-discuss] Remote cluster mount failing

Yuri L Volobuev volobuev at us.ibm.com
Wed Sep 7 17:58:07 BST 2016


It's unclear what's wrong.  I'd have two main suspects: (1) TLS protocol
version confusion, due to a difference in GSKit version and/or
configuration (e.g. NIST SP800 compliance) between the two sides, and (2) a
firewall.  TLS issues are usually messy and tedious to work through.  I'd
recommend opening a PMR to facilitate debug data collection and analysis.  A
lot of gory detail may be needed to figure out what's going on.
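
For the firewall suspect, a quick first check is whether the failing client
node can open a plain TCP connection to the remote cluster's nodes at all.
The following is only a minimal sketch: the function name can_connect is
made up for illustration, and port 1191 is assumed to be the GPFS daemon
port in use (adjust it if your clusters are configured differently).  A
successful connect says nothing about TLS; it only shows the TCP path is
open.

```python
import socket

# Assumption: the GPFS daemon listens on its usual default port, 1191.
GPFS_DAEMON_PORT = 1191


def can_connect(host, port=GPFS_DAEMON_PORT, timeout=5.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused, unreachable, and timed-out connections.
        return False
```

Running can_connect("10.0.0.181") and can_connect("10.0.0.182") on both a
working and a failing client node would show whether the two see the remote
nodes differently at the TCP level.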

yuri



From:	"Simon Thompson (Research Computing - IT Services)"
            <S.J.Thompson at bham.ac.uk>
To:	"gpfsug-discuss at spectrumscale.org"
            <gpfsug-discuss at spectrumscale.org>,
Date:	09/07/2016 05:37 AM
Subject:	[gpfsug-discuss] Remote cluster mount failing
Sent by:	gpfsug-discuss-bounces at spectrumscale.org



Hi All,

I'm trying to get a multi-cluster setup working between two of our GPFS
clusters.

In the "client" cluster, when trying to mount the "remote" cluster, I get:

# mmmount gpfs
Wed  7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ...
mount: mount /dev/gpfs on /gpfs failed: Connection timed out
mmmount: Command failed. Examine previous error messages to determine
cause.


And in the log file:
Wed Sep  7 13:33:07.481 2016: [N] The client side TLS handshake with node
10.0.0.182 was cancelled: connection reset by peer (return code 420).
Wed Sep  7 13:33:07.486 2016: [N] The client side TLS handshake with node
10.0.0.181 was cancelled: connection reset by peer (return code 420).
Wed Sep  7 13:33:07.487 2016: [E] Failed to join remote cluster
GPFS_STORAGE.CLUSTER
Wed Sep  7 13:33:07.488 2016: [W] Command: err 78: mount
GPFS_STORAGE.CLUSTER:gpfs
Wed Sep  7 13:33:07.489 2016: Connection timed out

In the remote cluster, I see:

Wed Sep  7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222
failed with error 447 (server side).
Wed Sep  7 13:33:07.488 2016: [X] Connection from 10.10.0.35 <c0p174>
refused, authentication failed
Wed Sep  7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err
703
Wed Sep  7 13:33:07.490 2016: Operation not permitted
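
When comparing logs from the two sides, it can help to pull out which peer
addresses each side reports in its TLS handshake messages.  The sketch
below is illustrative only: the helper names unwrap and handshake_peers are
made up, and the regular expressions are assumptions based on the message
layout seen in the excerpts above, not on any documented log format.

```python
import re
from collections import Counter

# Log excerpt copied from the messages above, wrapped as in the mail.
LOG = """\
Wed Sep  7 13:33:07.481 2016: [N] The client side TLS handshake with node
10.0.0.182 was cancelled: connection reset by peer (return code 420).
Wed Sep  7 13:33:07.486 2016: [N] The client side TLS handshake with node
10.0.0.181 was cancelled: connection reset by peer (return code 420).
Wed Sep  7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222
failed with error 447 (server side).
"""

# Assumed timestamp layout, e.g. "Wed Sep  7 13:33:07.481 2016: ".
TIMESTAMP = re.compile(
    r"^[A-Z][a-z]{2} [A-Z][a-z]{2} [ \d]\d \d\d:\d\d:\d\d\.\d+ \d{4}: "
)
HANDSHAKE = re.compile(r"TLS handshake with node (\d+\.\d+\.\d+\.\d+)")


def unwrap(text):
    """Join wrapped continuation lines onto the preceding timestamped line."""
    entries = []
    for line in text.splitlines():
        if TIMESTAMP.match(line) or not entries:
            entries.append(line.strip())
        else:
            entries[-1] += " " + line.strip()
    return entries


def handshake_peers(text):
    """Count TLS handshake messages per peer IPv4 address."""
    peers = Counter()
    for entry in unwrap(text):
        match = HANDSHAKE.search(entry)
        if match:
            peers[match.group(1)] += 1
    return peers
```

Calling handshake_peers(LOG) gives one count per address named in a
handshake message, which makes it easier to line up the client-side and
remote-side views of the same attempt.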



Weirdly, though, the mount succeeds on other nodes in the client cluster,
so I think I have the mmauth and mmremotecluster settings configured
correctly.

Any suggestions?

Thanks

Simon
