[gpfsug-discuss] Large in doubt on fileset
Jonathan Buzzard
jonathan.buzzard at strath.ac.uk
Tue Oct 8 11:45:38 BST 2019
On Mon, 2019-10-07 at 19:22 +0300, Tomer Perry wrote:
[SNIP]
>
> So, do you experience large number of node expels/crashes etc. that
> might be related to that ( otherwise, it might be some other bug that
> needs to be fixed...).
>
Not as far as I can determine. The logs show only 58 expels in the last
six months and around 2/3rds of those where on essentially dormant
nodes that where being used for development work on fixing issues with
the xcat node deployment for the compute nodes (triggering an rinstall
on a node that was up with GPFS mounted but actually doing nothing).
I have done an mmcheckquota which didn't take long to complete and now
I the "in doubt" is a more reasonable sub 10GB. I shall monitor what
happens more closely in future.
JAB.
--
Jonathan A. Buzzard Tel: +44141-5483420
HPC System Administrator, ARCHIE-WeSt.
University of Strathclyde, John Anderson Building, Glasgow. G4 0NG
More information about the gpfsug-discuss
mailing list