[gpfsug-discuss] mmchdisk /dev/fs start -a progress

Jonathan Buzzard jonathan.buzzard at strath.ac.uk
Sat Nov 21 10:13:42 GMT 2020


On 21/11/2020 00:37, Peter van Hooft wrote:
> 
> Hello,
> 
> Is it possible to find out the progress of the 'mmchdisk /dev/fs start -a'
> command when the controlling terminal had been lost?
> 

I don't think so. You are lucky it is still running

> We can see the task running on the fs manager node with 'mmdiag --commands' with
> attributes 'hold PIT/disk waitTime 0'
> We are starting to worry the mmchdisk is taking too long, and see continuously waiters like
> Waiting 3.1946 sec since 01:28:23, ignored, thread 22092 TSCHDISKCmdThread: on ThCond 0x180267573D0 (SGManagementMgrDataCondvar), reason 'waiting for stripe group to recover'
> 
> Thanks for any hints.
> 

Not that this is going to help this time, but it is why you should 
*ALWAYS* without exception run these sorts of commands within a 
screen/tmux session so when you loose the connection to the server you 
can just reconnect and pick it up again.

This is introductory system administration 101. No critical or long 
running command should ever be dependant on a remote controlling 
terminal. If you can't run them locally then run them in a screen or 
tmux session.

There are plenty of good howto's for both screen and tmux on the 
internet. Depending on which distribution you use I would note that 
RedHat have very annoyingly and for completely specious reasons removed 
screen from RHEL8 and left tmux. So if you are starting from scratch 
tmux is the one to learn :-(


JAB.

-- 
Jonathan A. Buzzard                         Tel: +44141-5483420
HPC System Administrator, ARCHIE-WeSt.
University of Strathclyde, John Anderson Building, Glasgow. G4 0NG



More information about the gpfsug-discuss mailing list