[gpfsug-discuss] Large in doubt on fileset

Jonathan Buzzard jonathan.buzzard at strath.ac.uk
Tue Oct 8 11:45:38 BST 2019


On Mon, 2019-10-07 at 19:22 +0300, Tomer Perry wrote:

[SNIP]

> 
> So, do you experience large number of node expels/crashes etc. that
> might be related to that ( otherwise, it might be some other bug that
> needs to be fixed...). 
> 

Not as far as I can determine. The logs show only 58 expels in the last
six months and around 2/3rds of those where on essentially dormant
nodes that where being used for development work on fixing issues with
the xcat node deployment for the compute nodes (triggering an rinstall
on a node that was up with GPFS mounted but actually doing nothing).

I have done an mmcheckquota which didn't take long to complete and now
I the "in doubt" is a more reasonable sub 10GB. I shall monitor what
happens more closely in future.


JAB.

-- 
Jonathan A. Buzzard                         Tel: +44141-5483420
HPC System Administrator, ARCHIE-WeSt.
University of Strathclyde, John Anderson Building, Glasgow. G4 0NG





More information about the gpfsug-discuss mailing list