<html dir="ltr">

<head>

<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">

<style type="text/css" id="owaParaStyle">P {margin-top:0;margin-bottom:0;}</style>

</head>

<body fpstyle="1" ocsi="0">

<div style="direction: ltr;font-family: Tahoma;color: #000000;font-size: 10pt;">

<div>More precisely the problem is the following:</div>

<div><br>

</div>

<div>If I set period=1 for a "rate" sensor (network speed, NSD read/write speed, PDisk read/write speed) everything is correct because every second the sensors get the valuess of the cumulative counters (and do not divide it by 1, which is not affecting anything

 for 1 second).</div>

<div>If I set the period=2, the "rate" sensors collect the values from the cumulative counters every two seconds but they do not divide by 2 those values (because pmsensors do not actually divide; they seem to silly report what they read which is understand-able

 from a performance point of view); then grafana receives as double as the real speed.</div>

<div><br>

</div>

<div>I've to correct myself: here the point is not how sampling/downsampling is done by grafana/grafana-bridge/whatever as I wrongly wrote in my first email.</div>

<div>The point is: if I collect data every N seconds (because I do not want to overloads the pmcollector node), how can I divide (in grafana) the reported collected data by N to get real avg speed in that N-seconds time interval ??</div>

<div><br>

</div>

<div>At the moment it seems that the only option is using N=1, which is bad because, as I stated, it overloads the collector when many nodes run many pmsensors...<br>

</div>

<div><br>

</div>

<div>  A</div>

<div><br>

</div>

<div style="font-family: Times New Roman; color: #000000; font-size: 16px">

<hr tabindex="-1">

<div id="divRpF147333" style="direction: ltr;"><font size="2" face="Tahoma" color="#000000"><b>From:</b> gpfsug-discuss-bounces@spectrumscale.org [gpfsug-discuss-bounces@spectrumscale.org] on behalf of IBM Spectrum Scale [scale@us.ibm.com]<br>

<b>Sent:</b> Friday, July 27, 2018 8:27 PM<br>

<b>To:</b> gpfsug main discussion list<br>

<b>Subject:</b> Re: [gpfsug-discuss] How Zimon/Grafana-bridge process data<br>

</font><br>

</div>

<div></div>

<div>

<p><font size="2">Hi,</font><br>

<font size="2">as there are more often similar questions rising, we just put an article about the topic on the Spectrum Scale Wiki</font><br>

<a href="https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/General%20Parallel%20File%20System%20(GPFS)/page/Downsampling%2C%20Upsampling%20and%20Aggregation%20of%20the%20performance%20data" target="_blank" rel="noopener noreferrer"><font size="2">https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/General%20Parallel%20File%20System%20(GPFS)/page/Downsampling%2C%20Upsampling%20and%20Aggregation%20of%20the%20performance%20data</font></a><br>

<br>

<font size="2">While there will be some minor updates on the article in the next time, it might already explain your questions.</font><br>

<br>

<font size="2">Regards, The Spectrum Scale (GPFS) team<br>

</font><font size="2"><br>

------------------------------------------------------------------------------------------------------------------<br>

If you feel that your question can benefit other users of Spectrum Scale (GPFS), then please post it to the public IBM developerWroks Forum at

<a href="https://www.ibm.com/developerworks/community/forums/html/forum?id=11111111-0000-0000-0000-000000000479" target="_blank" rel="noopener noreferrer">

https://www.ibm.com/developerworks/community/forums/html/forum?id=11111111-0000-0000-0000-000000000479</a>.

<br>

<br>

If your query concerns a potential software error in Spectrum Scale (GPFS) and you have an IBM software maintenance contract please contact 1-800-237-5511 in the United States or your local IBM Service Center in other countries.

<br>

<br>

The forum is informally monitored as time permits and should not be used for priority messages to the Spectrum Scale (GPFS) team.</font><br>

<br>

<img src="cid:1__=0ABB0844DFF76EC08f9e8a93df938690918c0AB@" alt="Inactive hide details for "Dorigo Alvise (PSI)" ---13.07.2018 12:08:59---Hi, I've a GL2 cluster based on gpfs 4.2.3-6, with 1 s" width="16" height="16" border="0"><font size="2" color="#424282">"Dorigo

 Alvise (PSI)" ---13.07.2018 12:08:59---Hi, I've a GL2 cluster based on gpfs 4.2.3-6, with 1 support node and 2 IO/NSD nodes.</font><br>

<br>

<font size="2" color="#5F5F5F">From: </font><font size="2">"Dorigo Alvise (PSI)" <alvise.dorigo@psi.ch></font><br>

<font size="2" color="#5F5F5F">To: </font><font size="2">"gpfsug-discuss@spectrumscale.org" <gpfsug-discuss@spectrumscale.org></font><br>

<font size="2" color="#5F5F5F">Date: </font><font size="2">13.07.2018 12:08</font><br>

<font size="2" color="#5F5F5F">Subject: </font><font size="2">[gpfsug-discuss] How Zimon/Grafana-bridge process data</font><br>

<font size="2" color="#5F5F5F">Sent by: </font><font size="2">gpfsug-discuss-bounces@spectrumscale.org</font><br>

</p>

<hr style="color:#8091A5" width="100%" size="2" noshade="" align="left">

<br>

<br>

<br>

<font size="2" face="Tahoma">Hi,</font><br>

<font size="2" face="Tahoma">I've a GL2 cluster based on gpfs 4.2.3-6, with 1 support node and 2 IO/NSD nodes.</font><br>

<br>

<font size="2" face="Tahoma">I've the following perfmon configuration for the metric-group GPFSNSDDisk:</font><br>

<br>

<font size="2" face="Tahoma">{<br>

name = "GPFSNSDDisk"<br>

period = 2<br>

restrict = "nsdNodes"<br>

}, </font><br>

<br>

<font size="2" face="Tahoma">that, as far as I know sends data to the collector every 2 seconds (correct ?). But how ? does it send what it reads from the counter every two seconds ? or does it aggregated in some way ? or what else ?</font><br>

<br>

<font size="2" face="Tahoma">In the collector node pmcollector, grafana-bridge and grafana-server run.</font><br>

<br>

<font size="2" face="Tahoma">Now I need to understand how to play with the grafana parameters:</font><br>

<font size="2" face="Tahoma">- Down sample (or Disable downsampling)</font><br>

<font size="2" face="Tahoma">- Aggregator (following on the same row the metrics).</font><br>

<br>

<font size="2" face="Tahoma">See attached picture 4s.png as reference.</font><br>

<br>

<font size="2" face="Tahoma">In the past I had the period set to 1. And grafana used to display correct data (bytes/s for the metric gpfs_nsdds_bytes_written) with aggregator set to "sum", which AFAIK means "sum all that metrics that match the filter below"

 (again see the attached picture to see how the filter is set to only collect data from the IO nodes).</font><br>

<br>

<font size="2" face="Tahoma">Today I've changed to "period=2"... and grafana started to display funny data rate (the double, or quad of the real rate).</font><br>

<br>

<font size="2" face="Tahoma">I had to play (almost randomly) with "Aggregator" (from sum to avg, which as fas as I undestand doesn't mean anything in my case... average between the two IO nodes ? or what ?) and "Down sample" (from empty to 2s, and then to 4s)

 to get back real data rate which is compliant with what I do get with dstat.</font><br>

<br>

<font size="2" face="Tahoma">Can someone kindly explain how to play with these parameters when zimon sensor's period is changed ?

</font><br>

<br>

<font size="2" face="Tahoma">Many thanks in advance</font><br>

<font size="2" face="Tahoma">Regards,</font><br>

<br>

<font size="2" face="Tahoma">Alvise Dorigo[attachment "4s.png" deleted by Manfred Haubrich/Germany/IBM]

</font><tt><font size="2">_______________________________________________<br>

gpfsug-discuss mailing list<br>

gpfsug-discuss at spectrumscale.org<br>

</font></tt><tt><font size="2"><a href="http://gpfsug.org/mailman/listinfo/gpfsug-discuss" target="_blank" rel="noopener noreferrer">http://gpfsug.org/mailman/listinfo/gpfsug-discuss</a></font></tt><tt><font size="2"><br>

</font></tt><br>

<br>

<br>

</div>

</div>

</div>

</body>

</html>