[gpfsug-discuss] Remote cluster mount failing
Felipe Knop
knop at us.ibm.com
Mon Sep 12 06:17:05 BST 2016
There is a chance the problem might be related to an upgrade from 3.5 to
4.1, or perhaps a remote mount between versions 3.5 and 4.1. It would be
useful to know details related to any such migration and different
releases when the PMR is opened.
Thanks,
Felipe
----
Felipe Knop knop at us.ibm.com
GPFS Development and Security
IBM Systems
IBM Building 008
2455 South Rd, Poughkeepsie, NY 12601
(845) 433-9314 T/L 293-9314
From: Yuri L Volobuev/Austin/IBM at IBMUS
To: gpfsug main discussion list <gpfsug-discuss at spectrumscale.org>
Date: 09/09/2016 12:30 PM
Subject: Re: [gpfsug-discuss] Remote cluster mount failing
Sent by: gpfsug-discuss-bounces at spectrumscale.org
It could be "easy" in the end, e.g. regenerating the key ("mmauth genkey
new") may fix the issue. Figuring out exactly what is going wrong is messy
though, and requires looking at a number of debug data points, something
that's awkward to do on a public mailing list. I don't think you want to
post certificates and the like on a mailing list. The PMR channel is more
appropriate for this kind of thing.
yuri
From: "Simon Thompson (Research Computing - IT Services)"
<S.J.Thompson at bham.ac.uk>
To: gpfsug main discussion list <gpfsug-discuss at spectrumscale.org>
Date: 09/09/2016 07:37 AM
Subject: Re: [gpfsug-discuss] Remote cluster mount failing
Sent by: gpfsug-discuss-bounces at spectrumscale.org
That’s sorta what I was expecting. Though I was hoping someone might have
said 'oh just run mmchconfig ....' or something easy.
PMR on its way in.
Thanks!
Simon
From: <gpfsug-discuss-bounces at spectrumscale.org> on behalf of Yuri L
Volobuev <volobuev at us.ibm.com>
Reply-To: "gpfsug-discuss at spectrumscale.org" <gpfsug-discuss at spectrumscale.org>
Date: Wednesday, 7 September 2016 at 17:58
To: "gpfsug-discuss at spectrumscale.org" <gpfsug-discuss at spectrumscale.org>
Subject: Re: [gpfsug-discuss] Remote cluster mount failing
It's unclear what's wrong. I'd have two main suspects: (1) TLS protocol
version confusion, due to a difference in GSKit version and/or
configuration (e.g. NIST SP800 compliance) on the two sides; (2) a firewall.
TLS issues are usually messy and tedious to work through. I'd recommend
opening a PMR to facilitate debug data collection and analysis. A lot of
gory detail may be needed to figure out what's going on.
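
For the firewall suspect, a quick TCP reachability check from the failing client node can help narrow things down before the PMR is opened. The following is only a minimal sketch: the function name can_connect is hypothetical, and port 1191 is assumed to be the GPFS daemon port (the usual default; a site may have changed it). A successful connect only shows that TCP gets through. It says nothing about the TLS handshake itself.

```python
import socket


def can_connect(host: str, port: int = 1191, timeout: float = 3.0) -> bool:
    """Return True if a plain TCP connection to host:port can be opened.

    Port 1191 is an assumed default for the GPFS daemon; adjust it if the
    cluster uses a different port.
    """
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unreachable hosts.
        return False
```

Run from the failing node and from a node that mounts correctly, e.g. can_connect("10.0.0.181") and can_connect("10.0.0.182"), and compare the results.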
yuri
From: "Simon Thompson (Research Computing - IT Services)" <S.J.Thompson at bham.ac.uk>
To: "gpfsug-discuss at spectrumscale.org" <gpfsug-discuss at spectrumscale.org>
Date: 09/07/2016 05:37 AM
Subject: [gpfsug-discuss] Remote cluster mount failing
Sent by: gpfsug-discuss-bounces at spectrumscale.org
Hi All,
I'm trying to get a multi-cluster setup working between two of our GPFS
clusters.
In the "client" cluster, when trying to mount the "remote" cluster, I get:
# mmmount gpfs
Wed 7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ...
mount: mount /dev/gpfs on /gpfs failed: Connection timed out
mmmount: Command failed. Examine previous error messages to determine cause.
And in the log file:
Wed Sep 7 13:33:07.481 2016: [N] The client side TLS handshake with node 10.0.0.182 was cancelled: connection reset by peer (return code 420).
Wed Sep 7 13:33:07.486 2016: [N] The client side TLS handshake with node 10.0.0.181 was cancelled: connection reset by peer (return code 420).
Wed Sep 7 13:33:07.487 2016: [E] Failed to join remote cluster GPFS_STORAGE.CLUSTER
Wed Sep 7 13:33:07.488 2016: [W] Command: err 78: mount GPFS_STORAGE.CLUSTER:gpfs
Wed Sep 7 13:33:07.489 2016: Connection timed out
In the remote cluster, I see:
Wed Sep 7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222 failed with error 447 (server side).
Wed Sep 7 13:33:07.488 2016: [X] Connection from 10.10.0.35 <c0p174> refused, authentication failed
Wed Sep 7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err 703
Wed Sep 7 13:33:07.490 2016: Operation not permitted
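
When comparing a failing node against a working one, it can help to tally the TLS handshake failures per peer and error code instead of reading the logs by eye. The sketch below is an illustration only: the names HANDSHAKE_RE and handshake_failures are hypothetical, and the pattern is based solely on the message wording quoted in this thread, so other GPFS releases may phrase these lines differently.

```python
import re
from collections import Counter

# Matches the two handshake messages quoted above (client and server side).
HANDSHAKE_RE = re.compile(
    r"TLS handshake with node (?P<node>\d+\.\d+\.\d+\.\d+) "
    r"(?:was cancelled: .*?\(return code (?P<rc>\d+)\)"
    r"|failed with error (?P<err>\d+))"
)


def handshake_failures(log_text: str) -> Counter:
    """Count TLS handshake failures per (node address, code) pair."""
    # Collapse all whitespace so entries wrapped across lines still match.
    flat = " ".join(log_text.split())
    counts = Counter()
    for match in HANDSHAKE_RE.finditer(flat):
        code = match.group("rc") or match.group("err")
        counts[(match.group("node"), code)] += 1
    return counts
```

Feeding it the contents of the GPFS log from each cluster gives a Counter keyed by (node, code), which makes it easy to see whether every failure involves the same client node.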
Weirdly, though, the mount succeeds on other nodes in the client cluster,
so I think I have all the mmauth and mmremotecluster bits configured
correctly.
Any suggestions?
Thanks
Simon
_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss