[gpfsug-discuss] gui_refresh_task_failed for HW_INVENTORY with two active GUI nodes

Ulrich Sibiller u.sibiller at science-computing.de
Thu Jan 30 14:43:54 GMT 2020


On 1/29/20 2:05 PM, Billich Heinrich Rainer (ID SD) wrote:
> Hello,
> 
> Can I change the times at which the GUI runs HW_INVENTORY and related tasks?
> 
> we frequently get  messages like
> 
>     gui_refresh_task_failed     GUI           WARNING     12 hours ago      The following GUI 
> refresh task(s) failed: HW_INVENTORY
> 
> The tasks fail due to timeouts. Running the task manually most times succeeds. We do run two gui 
> nodes per cluster and I noted that both servers seem run the HW_INVENTORY at the exact same time 
> which may lead to locking or congestion issues, actually the logs show messages like
> 
> EFSSA0194I Waiting for concurrent operation to complete.
> 
> The gui calls ‘rinv’ on the xCat servers. Rinv for a single   little-endian  server takes a long 
> time – about 2-3 minutes , while it finishes in  about 15s for big-endian server.
> 
> Hence the long runtime of rinv on little-endian systems may be an issue, too
> 
> We run 5.0.4-1 efix9 on the gui and ESS  5.3.4.1 on the GNR systems  (5.0.3.2 efix4). We run a mix 
> of ppc64 and ppc64le systems, which a separate xCat/ems server for each type. The GUI nodes are ppc64le.
> 
> We did see this issue with several gpfs version on the gui and with at least two ESS/xCat versions.
> 
> Just to be sure I did purge the Posgresql tables.
> 
> I did try
> 
> /usr/lpp/mmfs/gui/cli/lstasklog HW_INVENTORY
> 
> /usr/lpp/mmfs/gui/cli/runtask HW_INVENTORY –debug
> 
> And also tried to read the logs in /var/log/cnlog/mgtsrv/ - but they are difficult.


I have seen the same on ppc64le. From time to time it recovers but then it starts again. The
timeouts are okay, it is the hardware. I haven opened a call at IBM and they suggested upgrading to
ESS 5.3.5 because of the new firmwares which I am currently doing. I can dig out more details if you
want.

Uli
-- 
Science + Computing AG
Vorstandsvorsitzender/Chairman of the board of management:
Dr. Martin Matzke
Vorstand/Board of Management:
Matthias Schempp, Sabine Hohenstein
Vorsitzender des Aufsichtsrats/
Chairman of the Supervisory Board:
Philippe Miltin
Aufsichtsrat/Supervisory Board:
Martin Wibbe, Ursula Morgenstern
Sitz/Registered Office: Tuebingen
Registergericht/Registration Court: Stuttgart
Registernummer/Commercial Register No.: HRB 382196


More information about the gpfsug-discuss mailing list