<div class="socmaildefaultfont" dir="ltr" style="font-family:Arial;font-size:10.5pt" ><div dir="ltr" >Hi</div>
<div dir="ltr" > </div>
<div dir="ltr" >My 2 cents.</div>
<div dir="ltr" > </div>
<div dir="ltr" >Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr)</div>
<div dir="ltr" > </div>
<div dir="ltr" >About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks.</div>
<div dir="ltr" > </div>
<div dir="ltr" >No<strong> warranty at all</strong> but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea)</div>
<div dir="ltr" > </div>
<div dir="ltr" ><div>POSIX<br>find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out</div>
<div>GPFS<br>cd /usr/lpp/mmfs/samples/ilm<br>gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile<br>./mmfind /gpfs/shared -ls -type f > find_ls_files.out</div>
<div> CONVERT to CSV</div>
<div><br>POSIX<br>cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv<br>GPFS<br>cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv</div>
<div> LOAD in octave</div>
<div><br>FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ","));</div>
<div> Clean the second column (OPTIONAL as the next clean up will do the same)</div>
<div><br>FILESIZE(:,[2]) = [];</div>
<div> If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!)</div>
<div><br>FILESIZE(FILESIZE<=3584) =[];</div>
<div> If we are not we need to clean the 0 size files</div>
<div><br>FILESIZE(FILESIZE==0) =[];</div>
<div> Median</div>
<div><br>FILESIZEMEDIAN = int32 (median (FILESIZE))</div>
<div> Mean</div>
<div><br>FILESIZEMEAN = int32 (mean (FILESIZE))</div>
<div> Variance</div>
<div><br>int32 (var (FILESIZE))</div>
<div> iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data.</div>
<div><br>int32 (iqr (FILESIZE))</div>
<div> Standard deviation</div>
<div> </div>
<div> </div>
<div>For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop.</div></div>
<div dir="ltr" > </div>
<div dir="ltr" > </div>
<div dir="ltr" ><br>--<br>Ystävällisin terveisin / Kind regards / Saludos cordiales / Salutations<br><br>Luis Bolinches<br>Lab Services<br>http://www-03.ibm.com/systems/services/labservices/<br><br>IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland<br>Phone: +358 503112585<br><br>"If you continually give you will continually have." Anonymous</div>
<div dir="ltr" > </div>
<div dir="ltr" > </div>
<blockquote data-history-content-modified="1" dir="ltr" style="border-left:solid #aaaaaa 2px; margin-left:5px; padding-left:5px; direction:ltr; margin-right:0px" >----- Original message -----<br>From: Stef Coene <stef.coene@docum.org><br>Sent by: gpfsug-discuss-bounces@spectrumscale.org<br>To: gpfsug main discussion list <gpfsug-discuss@spectrumscale.org><br>Cc:<br>Subject: Re: [gpfsug-discuss] Blocksize<br>Date: Thu, Sep 22, 2016 10:30 PM<br>
<div><font size="2" face="Default Monospace,Courier New,Courier,monospace" >On 09/22/2016 09:07 PM, J. Eric Wonderley wrote:<br>> It defaults to 4k:<br>> mmlsfs testbs8M -i<br>> flag value description<br>> ------------------- ------------------------<br>> -----------------------------------<br>> -i 4096 Inode size in bytes<br>><br>> I think you can make as small as 512b. Gpfs will store very small<br>> files in the inode.<br>><br>> Typically you want your average file size to be your blocksize and your<br>> filesystem has one blocksize and one inodesize.<br><br>The files are not small, but around 20 MB on average.<br>So I calculated with IBM that a 1 MB or 2 MB block size is best.<br><br>But I'm not sure if it's better to use a smaller block size for the<br>metadata.<br><br>The file system is not that large (400 TB) and will hold backup data<br>from CommVault.<br><br><br>Stef<br>_______________________________________________<br>gpfsug-discuss mailing list<br>gpfsug-discuss at spectrumscale.org<br><a href="http://gpfsug.org/mailman/listinfo/gpfsug-discuss" target="_blank" >http://gpfsug.org/mailman/listinfo/gpfsug-discuss</a></font><br> </div></blockquote>
<div dir="ltr" > </div></div><BR>
Ellei edellä ole toisin mainittu: / Unless stated otherwise above:<BR>
Oy IBM Finland Ab<BR>
PL 265, 00101 Helsinki, Finland<BR>
Business ID, Y-tunnus: 0195876-3 <BR>
Registered in Finland<BR>
<BR>