[gpfsug-discuss] GPFS GUI

David D. Johnson david_johnson at brown.edu
Wed May 17 12:58:15 BST 2017


I have issues with the GUI as well. The one most similar to yours came about
because I had installed the collector RPM and enabled collectors on two server
nodes, but the GUI was only getting data from one of them. Each client randomly
selected a collector to deliver data to.
So how are multiple collectors supposed to work?  Active/passive? Failover pairs?
Shared storage?  Better not be on GPFS…   Maybe there is a place in the GUI config
to tell it to keep track of multiple collectors, but I gave up looking, turned off the
second collector service, and removed it from the candidates.
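
To make the symptom concrete, here is a toy simulation of what I think was going
on. It is only a sketch: the node names, the two collector names, and the
assumption that the GUI queries exactly one collector are all made up for
illustration, and nothing here calls a real GPFS or pmcollector API.

```python
import random

def assign_collectors(clients, collectors, seed=0):
    """Each client picks one collector at random (hypothetical model)."""
    rng = random.Random(seed)
    return {client: rng.choice(collectors) for client in clients}

def visible_clients(assignment, queried_collector):
    """Clients whose data lands on the one collector the GUI queries."""
    return sorted(c for c, col in assignment.items() if col == queried_collector)

# Hypothetical names, not taken from any real cluster.
clients = [f"client{i:02d}" for i in range(1, 11)]
collectors = ["collector-a", "collector-b"]

assignment = assign_collectors(clients, collectors, seed=1)
seen = visible_clients(assignment, "collector-a")
missing = sorted(set(clients) - set(seen))

print("GUI sees data for:", seen)
print("GUI shows no data for:", missing)
```

If the collectors do not share data with each other, every client that happened
to pick the other collector shows up in the GUI with no data, which matches what
I saw until I went back to a single collector.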

The other issue, which I mentioned before, is that the GUI is totally confused about
how many nodes are in the cluster (it thinks 21, with 3 unhealthy) when there are only
12 nodes in all, all healthy. The nodes dashboard never finishes loading, and there is
no means of digging deeper (text-based info) to find out why it is wedged.

 — ddj

> On May 17, 2017, at 7:44 AM, Wilson, Neil <neil.wilson at metoffice.gov.uk> wrote:
> 
> Hello all,
>  
> Does anyone have any experience with troubleshooting the new GPFS GUI?
> I’ve got it up and running but have a few weird problems with it...
> Maybe someone can help or point me in the right direction?
>  
> 1.       It keeps generating an alert saying that the cluster is down, when it isn’t??
>  
> Event name: gui_cluster_down
> Component: GUI
> Entity type: Node
> Entity name:
> Event time: 17/05/2017 12:19:29
> Message: The GUI detected that the cluster is down.
> Description: The GUI checks the cluster state.
> Cause: The GUI calculated that an insufficient amount of quorum nodes is up and running.
> User action: Check why the cluster lost quorum.
> Reporting node:
> Event type: Active health state of an entity which is monitored by the system.
>  
> 2.       It is collecting sensor data from the NSD nodes without any issue, but it won’t collect sensor data from any of the client nodes?
> I have the pmsensors package installed on all the nodes in question, the service is enabled and running, and the logs show that it has connected to the collector.
> However, in the GUI it just says “Performance collector did not return any data”
>  
> 3.       The NSD nodes are returning performance data, but are all displaying a state of unknown.
>   
>  
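
On the gui_cluster_down alert: GPFS node quorum is one plus half of the defined
quorum nodes. I do not know how the GUI actually computes this, so the sketch
below is a guess at the logic, with made-up node names and state strings. It
shows how a GUI that reads node states as "unknown" could decide quorum is lost
even though the cluster itself is fine.

```python
def quorum_needed(num_quorum_nodes):
    """Node quorum: one plus half of the defined quorum nodes."""
    return num_quorum_nodes // 2 + 1

def has_quorum(quorum_node_states):
    """True if enough quorum nodes are reported as 'active'.

    The state strings are assumptions for illustration; anything other
    than 'active' (including 'unknown') is counted as not up.
    """
    up = sum(1 for state in quorum_node_states.values() if state == "active")
    return up >= quorum_needed(len(quorum_node_states))

# Hypothetical three-node quorum as the cluster itself might report it.
actual = {"nsd01": "active", "nsd02": "active", "nsd03": "active"}
# The same nodes as a GUI with stale or missing state data might see them.
as_seen_by_gui = {"nsd01": "unknown", "nsd02": "unknown", "nsd03": "unknown"}

print("cluster view has quorum:", has_quorum(actual))
print("GUI view has quorum:", has_quorum(as_seen_by_gui))
```

If something like this is happening, the alert and the "unknown" state in item 3
may be the same problem, so it could be worth comparing what the GUI shows
against the state the cluster reports on the command line.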
> Would be great if anyone has any experience or ideas on how to troubleshoot this!
>  
> Thanks
> Neil
> _______________________________________________
> gpfsug-discuss mailing list
> gpfsug-discuss at spectrumscale.org
> http://gpfsug.org/mailman/listinfo/gpfsug-discuss