[gpfsug-discuss] NTP leap-second can take out GPFS server - it just has
bevans at canditmedia.co.uk
Mon Jul 2 22:25:02 BST 2012
This has so far hit almost all of the places I work with (not so much GPFS crashing, but certainly storage manager going berserk), and the majority of them do not use public NTP servers. In most cases no one actually noticed until it was pointed out, so it is well worth a quick 'top' on your storage servers if you're using Engenio/LSI/NetApp based units (i.e. DS3/4/5000).
The fix is here: http://blog.wpkg.org/2012/07/01/java-leap-second-bug-30-june-1-july-2012-fix/
But of course I wouldn't mess with the time in production unless you've shut GPFS down first.
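For anyone wanting to script the order of operations, here is a rough sketch only. It assumes the workaround is the widely circulated one of stopping NTP and re-setting the system clock to its current value with `date -s`, wrapped in a GPFS shutdown and startup on the node as suggested above. The function names `leap_second_reset_plan` and `run_plan` are made up for illustration, the `ntpd` service name is an assumption for RHEL-style systems, and it defaults to a dry run that only prints the commands.

```python
import shlex
import subprocess
from datetime import datetime


def leap_second_reset_plan(now=None, ntp_service="ntpd"):
    """Return the commands, in order, for the assumed clock-reset workaround.

    Hypothetical helper: the service name and the exact ordering are
    assumptions, so check them against your own distribution and cluster.
    """
    now = now or datetime.now()
    stamp = now.strftime("%Y-%m-%d %H:%M:%S")
    return [
        ["mmshutdown"],                    # stop GPFS on this node first
        ["service", ntp_service, "stop"],  # keep NTP from fighting the reset
        ["date", "-s", stamp],             # re-set the clock to "now"
        ["service", ntp_service, "start"],
        ["mmstartup"],                     # bring GPFS back on this node
    ]


def run_plan(plan, dry_run=True):
    """Print each command; only execute it when dry_run is False."""
    for cmd in plan:
        print(shlex.join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    run_plan(leap_second_reset_plan())  # dry run: prints, changes nothing
```

Run it as-is to see the commands it would issue. Passing `dry_run=False` needs root and will take the node out of the cluster while it runs, so try it on a non-production node first.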
On 2 Jul 2012, at 14:59, Jonathan Buzzard wrote:
> On 02/07/12 14:03, Jez Tucker wrote:
>> Just had a lovely one.
>> As I’m sure all of you are aware by now, there’s been much fun with
>> some of the NTP Stratum 1 servers not correctly accounting for the
>> leap-second last night.
>> You may wish to turn off ntp on your servers and correct your NTP to
>> trusted servers.
>> A clock skew from ntp.pool.org just took out one of our servers and the
>> node was expelled from the cluster.
> Hum, not sure I would run my production servers directly off something
> from ntp.pool.org, I would at least put a local server in between.
> Not noticed any problems here, but then we are running the latest RHEL 5.8
> and the latest IBM Storage Manager (10.83) :-)
> Jonathan A. Buzzard Tel: +441382-386998
> Storage Administrator, College of Life Sciences
> University of Dundee, DD1 5EH
> The University of Dundee is a registered Scottish Charity, No: SC015096