From taylorm at us.ibm.com Mon Aug 1 17:42:19 2016 From: taylorm at us.ibm.com (Michael L Taylor) Date: Mon, 1 Aug 2016 09:42:19 -0700 Subject: [gpfsug-discuss] Spectrum Scale 4.2.1 Released In-Reply-To: References: Message-ID: Thanks for sharing Bob. Since some folks asked previously, if you go to the 4.2.1 FAQ PDF version there will be change bars on the left for what changed in FAQ from previous version as well as a FAQ July updates table near the top to quickly highlight the changes from last FAQ. http://www.ibm.com/support/knowledgecenter/en/STXKQY/gpfsclustersfaq.pdf?view=kc Also, two short blogs on the 4.2.1 release on the Storage Community might be of interest: http://storagecommunity.org/easyblog -------------- next part -------------- An HTML attachment was scrubbed... URL: From raot at bnl.gov Mon Aug 1 19:36:15 2016 From: raot at bnl.gov (Tejas Rao) Date: Mon, 1 Aug 2016 14:36:15 -0400 Subject: [gpfsug-discuss] HAWC (Highly available write cache) Message-ID: <7953aa8c-904a-cee5-34be-7d40e55b46db@bnl.gov> I have enabled write cache (HAWC) by running the below commands. The recovery logs are supposedly placed in the replicated system metadata pool (SSDs). I do not have a "system.log" pool as it is only needed if recovery logs are stored on the client nodes. mmchfs gpfs01 --write-cache-threshold 64K mmchfs gpfs01 -L 1024M mmchconfig logPingPongSector=no I have recycled the daemon on all nodes in the cluster (including the NSD nodes). I still see small synchronous writes (4K) from the clients going to the data drives (data pool). I am checking this by looking at "mmdiag --iohist" output. Should they not be going to the system pool? Do I need to do something else? How can I confirm that HAWC is working as advertised? Thanks. From oehmes at gmail.com Mon Aug 1 19:49:37 2016 From: oehmes at gmail.com (Sven Oehme) Date: Mon, 1 Aug 2016 11:49:37 -0700 Subject: [gpfsug-discuss] HAWC (Highly available write cache) In-Reply-To: <7953aa8c-904a-cee5-34be-7d40e55b46db@bnl.gov> References: <7953aa8c-904a-cee5-34be-7d40e55b46db@bnl.gov> Message-ID: when you say 'synchronous write' what do you mean by that ? if you are talking about using direct i/o (O_DIRECT flag), they don't leverage HAWC data path, its by design. sven On Mon, Aug 1, 2016 at 11:36 AM, Tejas Rao wrote: > I have enabled write cache (HAWC) by running the below commands. The > recovery logs are supposedly placed in the replicated system metadata pool > (SSDs). I do not have a "system.log" pool as it is only needed if recovery > logs are stored on the client nodes. > > mmchfs gpfs01 --write-cache-threshold 64K > mmchfs gpfs01 -L 1024M > mmchconfig logPingPongSector=no > > I have recycled the daemon on all nodes in the cluster (including the NSD > nodes). > > I still see small synchronous writes (4K) from the clients going to the > data drives (data pool). I am checking this by looking at "mmdiag --iohist" > output. Should they not be going to the system pool? > > Do I need to do something else? How can I confirm that HAWC is working as > advertised? > > Thanks. > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... 
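One common source of direct I/O on a GPFS client is qemu/KVM with cache=none, which opens the image file with O_DIRECT and therefore bypasses HAWC by the rule Sven describes; cache=writeback or cache=writethrough goes through the page cache instead. A quick way to check what cache mode guests are using, assuming libvirt-managed guests (the guest name vm01 is a placeholder):

# per-guest disk cache setting (look for cache='none' in the driver element)
virsh dumpxml vm01 | grep "<driver"

# or scan the running qemu processes for their cache= settings
ps -eo args | grep [q]emu | grep -o "cache=[a-z]*" | sort | uniq -c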
URL: From raot at bnl.gov Mon Aug 1 20:05:52 2016 From: raot at bnl.gov (Tejas Rao) Date: Mon, 1 Aug 2016 15:05:52 -0400 Subject: [gpfsug-discuss] HAWC (Highly available write cache) In-Reply-To: References: <7953aa8c-904a-cee5-34be-7d40e55b46db@bnl.gov> Message-ID: <5629f550-05c9-25dd-bbe1-bdea618e8ae0@bnl.gov> In my case GPFS storage is used to store VM images (KVM) and hence the small IO. I always see lots of small 4K writes and the GPFS filesystem block size is 8MB. I thought the reason for the small writes is that the linux kernel requests GPFS to initiate a periodic sync which by default is every 5 seconds and can be controlled by "vm.dirty_writeback_centisecs". I thought HAWC would help in such cases and would harden (coalesce) the small writes in the "system" pool and would flush to the "data" pool in larger block size. Note - I am not doing direct i/o explicitly. On 8/1/2016 14:49, Sven Oehme wrote: > when you say 'synchronous write' what do you mean by that ? > if you are talking about using direct i/o (O_DIRECT flag), they don't > leverage HAWC data path, its by design. > > sven > > On Mon, Aug 1, 2016 at 11:36 AM, Tejas Rao > wrote: > > I have enabled write cache (HAWC) by running the below commands. > The recovery logs are supposedly placed in the replicated system > metadata pool (SSDs). I do not have a "system.log" pool as it is > only needed if recovery logs are stored on the client nodes. > > mmchfs gpfs01 --write-cache-threshold 64K > mmchfs gpfs01 -L 1024M > mmchconfig logPingPongSector=no > > I have recycled the daemon on all nodes in the cluster (including > the NSD nodes). > > I still see small synchronous writes (4K) from the clients going > to the data drives (data pool). I am checking this by looking at > "mmdiag --iohist" output. Should they not be going to the system pool? > > Do I need to do something else? How can I confirm that HAWC is > working as advertised? > > Thanks. > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From dhildeb at us.ibm.com Mon Aug 1 20:50:09 2016 From: dhildeb at us.ibm.com (Dean Hildebrand) Date: Mon, 1 Aug 2016 12:50:09 -0700 Subject: [gpfsug-discuss] HAWC (Highly available write cache) In-Reply-To: <5629f550-05c9-25dd-bbe1-bdea618e8ae0@bnl.gov> References: <7953aa8c-904a-cee5-34be-7d40e55b46db@bnl.gov> <5629f550-05c9-25dd-bbe1-bdea618e8ae0@bnl.gov> Message-ID: Hi Tejas, Do you know the workload in the VM? The workload which enters into HAWC may or may not be the same as the workload that eventually goes into the data pool....it all depends on whether the 4KB writes entering HAWC can be coalesced or not. For example, sequential 4KB writes can all be coalesced into a single large chunk. So 4KB writes into HAWC will convert into 8MB writes to data pool (in your system). But random 4KB writes into HAWC may end up being 4KB writes into the data pool if there are no adjoining 4KB writes (i.e., if 4KB blocks are all dispersed, they can't be coalesced). The goal of HAWC though, whether the 4KB blocks are coalesced or not, is to reduce app latency by ensuring that writing the blocks back to the data pool is done in the background. 
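(One way to see that difference in practice is to drive a sequential and a random small-synchronous-write job against the filesystem and watch "mmdiag --iohist" on the client and the NSD servers while each one runs. The sketch below assumes fio is installed and uses a made-up test directory; the writes must stay at or below the configured 64K write-cache-threshold to be HAWC candidates.)

# sequential 4K writes, fdatasync after each one (good coalescing candidates)
fio --name=seq4k --directory=/gpfs/gpfs01/hawctest --rw=write --bs=4k --size=256m --fdatasync=1

# random 4K writes, fdatasync after each one (much harder to coalesce)
fio --name=rand4k --directory=/gpfs/gpfs01/hawctest --rw=randwrite --bs=4k --size=256m --fdatasync=1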
So while 4KB blocks may still be hitting the data pool, hopefully the application is seeing the latency of your presumably lower latency system pool. Dean From: Tejas Rao To: gpfsug main discussion list Date: 08/01/2016 12:06 PM Subject: Re: [gpfsug-discuss] HAWC (Highly available write cache) Sent by: gpfsug-discuss-bounces at spectrumscale.org In my case GPFS storage is used to store VM images (KVM) and hence the small IO. I always see lots of small 4K writes and the GPFS filesystem block size is 8MB. I thought the reason for the small writes is that the linux kernel requests GPFS to initiate a periodic sync which by default is every 5 seconds and can be controlled by "vm.dirty_writeback_centisecs". I thought HAWC would help in such cases and would harden (coalesce) the small writes in the "system" pool and would flush to the "data" pool in larger block size. Note - I am not doing direct i/o explicitly. On 8/1/2016 14:49, Sven Oehme wrote: when you say 'synchronous write' what do you mean by that ?? if you are talking about using direct i/o (O_DIRECT flag), they don't leverage HAWC data path, its by design. sven On Mon, Aug 1, 2016 at 11:36 AM, Tejas Rao wrote: I have enabled write cache (HAWC) by running the below commands. The recovery logs are supposedly placed in the replicated system metadata pool (SSDs). I do not have a "system.log" pool as it is only needed if recovery logs are stored on the client nodes. mmchfs gpfs01 --write-cache-threshold 64K mmchfs gpfs01 -L 1024M mmchconfig logPingPongSector=no I have recycled the daemon on all nodes in the cluster (including the NSD nodes). I still see small synchronous writes (4K) from the clients going to the data drives (data pool). I am checking this by looking at "mmdiag --iohist" output. Should they not be going to the system pool? Do I need to do something else? How can I confirm that HAWC is working as advertised? Thanks. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From raot at bnl.gov Mon Aug 1 21:42:06 2016 From: raot at bnl.gov (Tejas Rao) Date: Mon, 1 Aug 2016 16:42:06 -0400 Subject: [gpfsug-discuss] HAWC (Highly available write cache) In-Reply-To: References: <7953aa8c-904a-cee5-34be-7d40e55b46db@bnl.gov> <5629f550-05c9-25dd-bbe1-bdea618e8ae0@bnl.gov> Message-ID: <04707e32-83fc-f42d-10cf-99139c136371@bnl.gov> I am not 100% sure what the workload of the VMs is. We have 100's of VMs all used differently, so the workload is rather mixed. I do see 4K writes going to "system" pool, they are tagged as "logData" in 'mmdiag --iohist'. But I also see 4K writes going to the data drives, so it looks like everything is not getting coalesced and these are random writes. Could these 4k writes labelled as "logData" be the writes going to HAWC log files? On 8/1/2016 15:50, Dean Hildebrand wrote: > > Hi Tejas, > > Do you know the workload in the VM? 
> > The workload which enters into HAWC may or may not be the same as the > workload that eventually goes into the data pool....it all depends on > whether the 4KB writes entering HAWC can be coalesced or not. For > example, sequential 4KB writes can all be coalesced into a single > large chunk. So 4KB writes into HAWC will convert into 8MB writes to > data pool (in your system). But random 4KB writes into HAWC may end up > being 4KB writes into the data pool if there are no adjoining 4KB > writes (i.e., if 4KB blocks are all dispersed, they can't be > coalesced). The goal of HAWC though, whether the 4KB blocks are > coalesced or not, is to reduce app latency by ensuring that writing > the blocks back to the data pool is done in the background. So while > 4KB blocks may still be hitting the data pool, hopefully the > application is seeing the latency of your presumably lower latency > system pool. > > Dean > > > Inactive hide details for Tejas Rao ---08/01/2016 12:06:15 PM---In my > case GPFS storage is used to store VM images (KVM) and heTejas Rao > ---08/01/2016 12:06:15 PM---In my case GPFS storage is used to store > VM images (KVM) and hence the small IO. > > From: Tejas Rao > To: gpfsug main discussion list > Date: 08/01/2016 12:06 PM > Subject: Re: [gpfsug-discuss] HAWC (Highly available write cache) > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------------------------------------------------ > > > > In my case GPFS storage is used to store VM images (KVM) and hence the > small IO. > > I always see lots of small 4K writes and the GPFS filesystem block > size is 8MB. I thought the reason for the small writes is that the > linux kernel requests GPFS to initiate a periodic sync which by > default is every 5 seconds and can be controlled by > "vm.dirty_writeback_centisecs". > > I thought HAWC would help in such cases and would harden (coalesce) > the small writes in the "system" pool and would flush to the "data" > pool in larger block size. > > Note - I am not doing direct i/o explicitly. > > > > On 8/1/2016 14:49, Sven Oehme wrote: > > when you say 'synchronous write' what do you mean by that ? > if you are talking about using direct i/o (O_DIRECT flag), > they don't leverage HAWC data path, its by design. > > sven > > On Mon, Aug 1, 2016 at 11:36 AM, Tejas Rao <_raot at bnl.gov_ > > wrote: > I have enabled write cache (HAWC) by running the below > commands. The recovery logs are supposedly placed in the > replicated system metadata pool (SSDs). I do not have a > "system.log" pool as it is only needed if recovery logs > are stored on the client nodes. > > mmchfs gpfs01 --write-cache-threshold 64K > mmchfs gpfs01 -L 1024M > mmchconfig logPingPongSector=no > > I have recycled the daemon on all nodes in the cluster > (including the NSD nodes). > > I still see small synchronous writes (4K) from the clients > going to the data drives (data pool). I am checking this > by looking at "mmdiag --iohist" output. Should they not be > going to the system pool? > > Do I need to do something else? How can I confirm that > HAWC is working as advertised? > > Thanks. 
> > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at _spectrumscale.org_ > _ > __http://gpfsug.org/mailman/listinfo/gpfsug-discuss_ > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > _http://gpfsug.org/mailman/listinfo/gpfsug-discuss_ > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 105 bytes Desc: not available URL: From dhildeb at us.ibm.com Mon Aug 1 21:55:28 2016 From: dhildeb at us.ibm.com (Dean Hildebrand) Date: Mon, 1 Aug 2016 13:55:28 -0700 Subject: [gpfsug-discuss] HAWC (Highly available write cache) In-Reply-To: <04707e32-83fc-f42d-10cf-99139c136371@bnl.gov> References: <7953aa8c-904a-cee5-34be-7d40e55b46db@bnl.gov><5629f550-05c9-25dd-bbe1-bdea618e8ae0@bnl.gov> <04707e32-83fc-f42d-10cf-99139c136371@bnl.gov> Message-ID: Hi Tejas, Yes, most likely those 4k writes are the HAWC writes...hopefully those 4KB writes have a lower latency than the 4k writes to your data pool so you are realizing the benefits. Dean From: Tejas Rao To: gpfsug main discussion list Date: 08/01/2016 01:42 PM Subject: Re: [gpfsug-discuss] HAWC (Highly available write cache) Sent by: gpfsug-discuss-bounces at spectrumscale.org I am not 100% sure what the workload of the VMs is. We have 100's of VMs all used differently, so the workload is rather mixed. I do see 4K writes going to "system" pool, they are tagged as "logData" in 'mmdiag --iohist'. But I also see 4K writes going to the data drives, so it looks like everything is not getting coalesced and these are random writes. Could these 4k writes labelled as "logData" be the writes going to HAWC log files? On 8/1/2016 15:50, Dean Hildebrand wrote: Hi Tejas, Do you know the workload in the VM? The workload which enters into HAWC may or may not be the same as the workload that eventually goes into the data pool....it all depends on whether the 4KB writes entering HAWC can be coalesced or not. For example, sequential 4KB writes can all be coalesced into a single large chunk. So 4KB writes into HAWC will convert into 8MB writes to data pool (in your system). But random 4KB writes into HAWC may end up being 4KB writes into the data pool if there are no adjoining 4KB writes (i.e., if 4KB blocks are all dispersed, they can't be coalesced). The goal of HAWC though, whether the 4KB blocks are coalesced or not, is to reduce app latency by ensuring that writing the blocks back to the data pool is done in the background. So while 4KB blocks may still be hitting the data pool, hopefully the application is seeing the latency of your presumably lower latency system pool. Dean Inactive hide details for Tejas Rao ---08/01/2016 12:06:15 PM---In my case GPFS storage is used to store VM images (KVM) and heTejas Rao ---08/01/2016 12:06:15 PM---In my case GPFS storage is used to store VM images (KVM) and hence the small IO. 
From: Tejas Rao To: gpfsug main discussion list Date: 08/01/2016 12:06 PM Subject: Re: [gpfsug-discuss] HAWC (Highly available write cache) Sent by: gpfsug-discuss-bounces at spectrumscale.org In my case GPFS storage is used to store VM images (KVM) and hence the small IO. I always see lots of small 4K writes and the GPFS filesystem block size is 8MB. I thought the reason for the small writes is that the linux kernel requests GPFS to initiate a periodic sync which by default is every 5 seconds and can be controlled by "vm.dirty_writeback_centisecs". I thought HAWC would help in such cases and would harden (coalesce) the small writes in the "system" pool and would flush to the "data" pool in larger block size. Note - I am not doing direct i/o explicitly. On 8/1/2016 14:49, Sven Oehme wrote: when you say 'synchronous write' what do you mean by that ? if you are talking about using direct i/o (O_DIRECT flag), they don't leverage HAWC data path, its by design. sven On Mon, Aug 1, 2016 at 11:36 AM, Tejas Rao wrote: I have enabled write cache (HAWC) by running the below commands. The recovery logs are supposedly placed in the replicated system metadata pool (SSDs). I do not have a "system.log" pool as it is only needed if recovery logs are stored on the client nodes. mmchfs gpfs01 --write-cache-threshold 64K mmchfs gpfs01 -L 1024M mmchconfig logPingPongSector=no I have recycled the daemon on all nodes in the cluster (including the NSD nodes). I still see small synchronous writes (4K) from the clients going to the data drives (data pool). I am checking this by looking at "mmdiag --iohist" output. Should they not be going to the system pool? Do I need to do something else? How can I confirm that HAWC is working as advertised? Thanks. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From Greg.Lehmann at csiro.au Wed Aug 3 06:06:32 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Wed, 3 Aug 2016 05:06:32 +0000 Subject: [gpfsug-discuss] SS 4.2.1.0 upgrade pain Message-ID: <04fbf3c0ae40468d912293821905197d@exch1-cdc.nexus.csiro.au> On Debian I am seeing this when trying to upgrade: mmshutdown dpkg -I gpfs.base_4.2.1-0_amd64.deb gpfs.docs_4.2.1-0_all.deb gpfs.ext_4.2.1-0_amd64.deb gpfs.gpl_4.2.1-0_all.deb gpfs.gskit_8.0.50-57_amd64.deb gpfs.msg.en-us_4.2.1-0_all.deb (Reading database ... 65194 files and directories currently installed.) Preparing to replace gpfs.base 4.1.0-6 (using gpfs.base_4.2.1-0_amd64.deb) ... Unpacking replacement gpfs.base ... 
Preparing to replace gpfs.docs 4.1.0-6 (using gpfs.docs_4.2.1-0_all.deb) ... Unpacking replacement gpfs.docs ... Preparing to replace gpfs.ext 4.1.0-6 (using gpfs.ext_4.2.1-0_amd64.deb) ... Unpacking replacement gpfs.ext ... Etc. Unpacking replacement gpfs.gpl ... Preparing to replace gpfs.gskit 8.0.50-32 (using gpfs.gskit_8.0.50-57_amd64.deb) ... Unpacking replacement gpfs.gskit ... Preparing to replace gpfs.msg.en-us 4.1.0-6 (using gpfs.msg.en-us_4.2.1-0_all.deb) ... Unpacking replacement gpfs.msg.en-us ... Setting up gpfs.base (4.2.1-0) ... At which point it hangs. A ps shows this: ps -ef | grep mm root 21269 1 0 14:18 pts/0 00:00:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmccrmonitor 15 root 21276 21150 1 14:18 pts/0 00:00:03 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmsysmoncontrol start root 21363 1 0 14:18 ? 00:00:00 /usr/lpp/mmfs/bin/mmsdrserv 1191 10 10 /var/adm/ras/mmsdrserv.log 128 yes root 22485 21276 0 14:18 pts/0 00:00:00 python /usr/lpp/mmfs/bin/mmsysmon.py root 22486 22485 0 14:18 pts/0 00:00:00 /bin/sh -c /usr/lpp/mmfs/bin/mmlsmgr -c root 22488 22486 1 14:18 pts/0 00:00:03 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmlsmgr -c root 24420 22488 0 14:18 pts/0 00:00:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmcommon linkCommand hadoop1-12-cdc-ib2.it.csiro.au /var/mmfs/tmp/nodefile.mmlsmgr.22488 mmlsmgr -c root 24439 24420 0 14:18 pts/0 00:00:00 /usr/bin/perl /usr/lpp/mmfs/bin/mmdsh -svL gpfs-07-cdc-ib2.san.csiro.au /usr/lpp/mmfs/bin/mmremote mmrpc:1:1:1510:mmrc_mmlsmgr_hadoop1-12-cdc-ib2.it.csiro.au_24420_1470197923_: runCmd _NO_FILE_COPY_ _NO_MOUNT_CHECK_ NULL _LINK_ mmlsmgr -c root 24446 24439 0 14:18 pts/0 00:00:00 /usr/bin/ssh gpfs-07-cdc-ib2.san.csiro.au -n -l root /bin/ksh -c ' LANG=en_US.UTF-8 LC_ALL= LC_COLLATE= LC_TYPE= LC_MONETARY= LC_NUMERIC= LC_TIME= LC_MESSAGES= MMMODE=lc environmentType=lc2 GPFS_rshPath=/usr/bin/ssh GPFS_rcpPath=/usr/bin/scp mmScriptTrace= GPFSCMDPORTRANGE=0 GPFS_CIM_MSG_FORMAT= /usr/lpp/mmfs/bin/mmremote mmrpc:1:1:1510:mmrc_mmlsmgr_hadoop1-12-cdc-ib2.it.csiro.au_24420_1470197923_: runCmd _NO_FILE_COPY_ _NO_MOUNT_CHECK_ NULL _LINK_ mmlsmgr -c ' root 24546 21269 0 14:23 pts/0 00:00:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmccrmonitor 15 root 24548 24455 0 14:23 pts/1 00:00:00 grep mm It is trying to connect with ssh to one of my nsd servers, that it does not have permission to? I am guessing that is where the hang is. Anybody else seen this? I have a workaround - remove from cluster before the update, but this is a bit of extra work I can do without. I have not had to this for previous versions starting with 4.1.0.0. Greg -------------- next part -------------- An HTML attachment was scrubbed... URL: From Greg.Lehmann at csiro.au Wed Aug 3 08:32:43 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Wed, 3 Aug 2016 07:32:43 +0000 Subject: [gpfsug-discuss] SS 4.2.1.0 upgrade pain In-Reply-To: <04fbf3c0ae40468d912293821905197d@exch1-cdc.nexus.csiro.au> References: <04fbf3c0ae40468d912293821905197d@exch1-cdc.nexus.csiro.au> Message-ID: <663114b24b0b403aa076a83791f32c58@exch1-cdc.nexus.csiro.au> And I am seeing the same behaviour on a SLES 12 SP1 update from 4.2.04 to 4.2.1.0. 
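One way to confirm where the postinst is stuck is to run the same calls it is making by hand, outside of dpkg (a sketch based on the ps output above, using the hostname shown there):

# does passwordless root ssh to that NSD server work from this node at all?
ssh gpfs-07-cdc-ib2.san.csiro.au date

# does the command the postinst wraps hang the same way on its own?
/usr/lpp/mmfs/bin/mmlsmgr -c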
From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Greg.Lehmann at csiro.au Sent: Wednesday, 3 August 2016 3:07 PM To: gpfsug-discuss at spectrumscale.org Subject: [ExternalEmail] [gpfsug-discuss] SS 4.2.1.0 upgrade pain On Debian I am seeing this when trying to upgrade: mmshutdown dpkg -I gpfs.base_4.2.1-0_amd64.deb gpfs.docs_4.2.1-0_all.deb gpfs.ext_4.2.1-0_amd64.deb gpfs.gpl_4.2.1-0_all.deb gpfs.gskit_8.0.50-57_amd64.deb gpfs.msg.en-us_4.2.1-0_all.deb (Reading database ... 65194 files and directories currently installed.) Preparing to replace gpfs.base 4.1.0-6 (using gpfs.base_4.2.1-0_amd64.deb) ... Unpacking replacement gpfs.base ... Preparing to replace gpfs.docs 4.1.0-6 (using gpfs.docs_4.2.1-0_all.deb) ... Unpacking replacement gpfs.docs ... Preparing to replace gpfs.ext 4.1.0-6 (using gpfs.ext_4.2.1-0_amd64.deb) ... Unpacking replacement gpfs.ext ... Etc. Unpacking replacement gpfs.gpl ... Preparing to replace gpfs.gskit 8.0.50-32 (using gpfs.gskit_8.0.50-57_amd64.deb) ... Unpacking replacement gpfs.gskit ... Preparing to replace gpfs.msg.en-us 4.1.0-6 (using gpfs.msg.en-us_4.2.1-0_all.deb) ... Unpacking replacement gpfs.msg.en-us ... Setting up gpfs.base (4.2.1-0) ... At which point it hangs. A ps shows this: ps -ef | grep mm root 21269 1 0 14:18 pts/0 00:00:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmccrmonitor 15 root 21276 21150 1 14:18 pts/0 00:00:03 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmsysmoncontrol start root 21363 1 0 14:18 ? 00:00:00 /usr/lpp/mmfs/bin/mmsdrserv 1191 10 10 /var/adm/ras/mmsdrserv.log 128 yes root 22485 21276 0 14:18 pts/0 00:00:00 python /usr/lpp/mmfs/bin/mmsysmon.py root 22486 22485 0 14:18 pts/0 00:00:00 /bin/sh -c /usr/lpp/mmfs/bin/mmlsmgr -c root 22488 22486 1 14:18 pts/0 00:00:03 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmlsmgr -c root 24420 22488 0 14:18 pts/0 00:00:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmcommon linkCommand hadoop1-12-cdc-ib2.it.csiro.au /var/mmfs/tmp/nodefile.mmlsmgr.22488 mmlsmgr -c root 24439 24420 0 14:18 pts/0 00:00:00 /usr/bin/perl /usr/lpp/mmfs/bin/mmdsh -svL gpfs-07-cdc-ib2.san.csiro.au /usr/lpp/mmfs/bin/mmremote mmrpc:1:1:1510:mmrc_mmlsmgr_hadoop1-12-cdc-ib2.it.csiro.au_24420_1470197923_: runCmd _NO_FILE_COPY_ _NO_MOUNT_CHECK_ NULL _LINK_ mmlsmgr -c root 24446 24439 0 14:18 pts/0 00:00:00 /usr/bin/ssh gpfs-07-cdc-ib2.san.csiro.au -n -l root /bin/ksh -c ' LANG=en_US.UTF-8 LC_ALL= LC_COLLATE= LC_TYPE= LC_MONETARY= LC_NUMERIC= LC_TIME= LC_MESSAGES= MMMODE=lc environmentType=lc2 GPFS_rshPath=/usr/bin/ssh GPFS_rcpPath=/usr/bin/scp mmScriptTrace= GPFSCMDPORTRANGE=0 GPFS_CIM_MSG_FORMAT= /usr/lpp/mmfs/bin/mmremote mmrpc:1:1:1510:mmrc_mmlsmgr_hadoop1-12-cdc-ib2.it.csiro.au_24420_1470197923_: runCmd _NO_FILE_COPY_ _NO_MOUNT_CHECK_ NULL _LINK_ mmlsmgr -c ' root 24546 21269 0 14:23 pts/0 00:00:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmccrmonitor 15 root 24548 24455 0 14:23 pts/1 00:00:00 grep mm It is trying to connect with ssh to one of my nsd servers, that it does not have permission to? I am guessing that is where the hang is. Anybody else seen this? I have a workaround - remove from cluster before the update, but this is a bit of extra work I can do without. I have not had to this for previous versions starting with 4.1.0.0. Greg -------------- next part -------------- An HTML attachment was scrubbed... 
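The remove-from-cluster workaround could look roughly like the following for a plain client node (a sketch, untested here; the node and package names are taken from the output above, and a node with quorum, manager or NSD server roles would need more care):

# on the node being upgraded
mmshutdown

# from a node that stays in the cluster
mmdelnode -N hadoop1-12-cdc-ib2.it.csiro.au

# back on the node, upgrade the packages while it is outside the cluster
dpkg -i gpfs.base_4.2.1-0_amd64.deb gpfs.docs_4.2.1-0_all.deb gpfs.ext_4.2.1-0_amd64.deb gpfs.gpl_4.2.1-0_all.deb gpfs.gskit_8.0.50-57_amd64.deb gpfs.msg.en-us_4.2.1-0_all.deb

# rebuild the portability layer for the running kernel
/usr/lpp/mmfs/bin/mmbuildgpl

# re-add it and bring GPFS back up (run from an existing cluster node)
mmaddnode -N hadoop1-12-cdc-ib2.it.csiro.au
mmchlicense client --accept -N hadoop1-12-cdc-ib2.it.csiro.au
mmstartup -N hadoop1-12-cdc-ib2.it.csiro.au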
URL: From kenneth.waegeman at ugent.be Wed Aug 3 09:54:30 2016 From: kenneth.waegeman at ugent.be (Kenneth Waegeman) Date: Wed, 3 Aug 2016 10:54:30 +0200 Subject: [gpfsug-discuss] Upgrade from 4.1.1 to 4.2.1 Message-ID: <57A1B146.9070505@ugent.be> Hi, In the upgrade procedure (prerequisites) of 4.2.1, I read: "If you are coming from 4.1.1-X, you must first upgrade to 4.2.0-0. You may use this 4.2.1-0 package to perform a First Time Install or to upgrade from an existing 4.2.0-X level." What does this mean exactly. Should we just install the 4.2.0 rpms first, and then the 4.2.1 rpms, or should we install the 4.2.0 rpms, start up gpfs, bring gpfs down again and then do the 4.2.1 rpms? But if we re-install a 4.1.1 node, we can immediately install 4.2.1 ? Thanks! Kenneth From bbanister at jumptrading.com Wed Aug 3 15:53:52 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Wed, 3 Aug 2016 14:53:52 +0000 Subject: [gpfsug-discuss] Upgrade from 4.1.1 to 4.2.1 In-Reply-To: <57A1B146.9070505@ugent.be> References: <57A1B146.9070505@ugent.be> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB062B3718@CHI-EXCHANGEW1.w2k.jumptrading.com> Your first process is correct. Install the 4.2.0-0 rpms first, then install the 4.2.1 rpms after. -Bryan -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Kenneth Waegeman Sent: Wednesday, August 03, 2016 3:55 AM To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] Upgrade from 4.1.1 to 4.2.1 Hi, In the upgrade procedure (prerequisites) of 4.2.1, I read: "If you are coming from 4.1.1-X, you must first upgrade to 4.2.0-0. You may use this 4.2.1-0 package to perform a First Time Install or to upgrade from an existing 4.2.0-X level." What does this mean exactly. Should we just install the 4.2.0 rpms first, and then the 4.2.1 rpms, or should we install the 4.2.0 rpms, start up gpfs, bring gpfs down again and then do the 4.2.1 rpms? But if we re-install a 4.1.1 node, we can immediately install 4.2.1 ? Thanks! Kenneth _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. From pinto at scinet.utoronto.ca Wed Aug 3 17:22:27 2016 From: pinto at scinet.utoronto.ca (Jaime Pinto) Date: Wed, 03 Aug 2016 12:22:27 -0400 Subject: [gpfsug-discuss] quota on secondary groups for a user? Message-ID: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Suppose I want to set both USR and GRP quotas for a user, however GRP is not the primary group. Will gpfs enforce the secondary group quota for that user? 
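(For concreteness, the kind of setup being described might look like the following; a sketch with made-up names and limits, using the 4.x mmsetquota syntax:)

# per-user and per-group block quotas on the same filesystem
mmsetquota gpfs01 --user user1 --block 500G:550G
mmsetquota gpfs01 --group group2 --block 1000G:1100G

# check what is currently charged where
mmlsquota -u user1 gpfs01
mmlsquota -g group2 gpfs01
mmrepquota -u -g gpfs01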
What I mean is, if the user keeps writing files with secondary group as the attribute, and that overall group quota is reached, will that user be stopped by gpfs? Thanks Jaime ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. From oehmes at gmail.com Wed Aug 3 17:35:39 2016 From: oehmes at gmail.com (Sven Oehme) Date: Wed, 3 Aug 2016 09:35:39 -0700 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: Hi, quotas are only counted against primary group sven On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto wrote: > Suppose I want to set both USR and GRP quotas for a user, however GRP is > not the primary group. Will gpfs enforce the secondary group quota for that > user? > > What I mean is, if the user keeps writing files with secondary group as > the attribute, and that overall group quota is reached, will that user be > stopped by gpfs? > > Thanks > Jaime > > > > > ************************************ > TELL US ABOUT YOUR SUCCESS STORIES > http://www.scinethpc.ca/testimonials > ************************************ > --- > Jaime Pinto > SciNet HPC Consortium - Compute/Calcul Canada > www.scinet.utoronto.ca - www.computecanada.org > University of Toronto > 256 McCaul Street, Room 235 > Toronto, ON, M5T1W5 > P: 416-978-2755 > C: 416-505-1477 > > ---------------------------------------------------------------- > This message was sent using IMP at SciNet Consortium, University of > Toronto. > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From pinto at scinet.utoronto.ca Wed Aug 3 17:41:24 2016 From: pinto at scinet.utoronto.ca (Jaime Pinto) Date: Wed, 03 Aug 2016 12:41:24 -0400 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: <20160803124124.21815zz1w4exmuus@support.scinet.utoronto.ca> Quoting "Sven Oehme" : > Hi, > > quotas are only counted against primary group > > sven Thanks Sven I kind of suspected, but needed an independent confirmation. Jaime > > > On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > wrote: > >> Suppose I want to set both USR and GRP quotas for a user, however GRP is >> not the primary group. Will gpfs enforce the secondary group quota for that >> user? >> >> What I mean is, if the user keeps writing files with secondary group as >> the attribute, and that overall group quota is reached, will that user be >> stopped by gpfs? 
>> >> Thanks >> Jaime >> >> >> >> >> ************************************ >> TELL US ABOUT YOUR SUCCESS STORIES >> http://www.scinethpc.ca/testimonials >> ************************************ >> --- >> Jaime Pinto >> SciNet HPC Consortium - Compute/Calcul Canada >> www.scinet.utoronto.ca - www.computecanada.org >> University of Toronto >> 256 McCaul Street, Room 235 >> Toronto, ON, M5T1W5 >> P: 416-978-2755 >> C: 416-505-1477 >> >> ---------------------------------------------------------------- >> This message was sent using IMP at SciNet Consortium, University of >> Toronto. >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. From jonathan at buzzard.me.uk Wed Aug 3 17:44:01 2016 From: jonathan at buzzard.me.uk (Jonathan Buzzard) Date: Wed, 3 Aug 2016 17:44:01 +0100 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> On 03/08/16 17:22, Jaime Pinto wrote: > Suppose I want to set both USR and GRP quotas for a user, however GRP is > not the primary group. Will gpfs enforce the secondary group quota for > that user? Nope that's not how POSIX schematics work for group quotas. As far as I can tell only your primary group is used for group quotas. It basically makes group quotas in Unix a waste of time in my opinion. At least I have never come across a real world scenario where they work in a useful manner. > What I mean is, if the user keeps writing files with secondary group as > the attribute, and that overall group quota is reached, will that user > be stopped by gpfs? > File sets are the answer to your problems, but retrospectively applying them to a file system is a pain. You create a file set for a directory and can then apply a quota to the file set. Even better you can apply per file set user and group quotas. So if file set A has a 1TB quota you could limit user X to 100GB in the file set, but outside the file set they could have a different quota or even no quota. Only issue is a limit of ~10,000 file sets per file system JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. From pinto at scinet.utoronto.ca Wed Aug 3 17:55:43 2016 From: pinto at scinet.utoronto.ca (Jaime Pinto) Date: Wed, 03 Aug 2016 12:55:43 -0400 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> Message-ID: <20160803125543.11831ypcdi8i189b@support.scinet.utoronto.ca> I guess I have a bit of a puzzle to solve, combining quotas on filesets, paths and USR/GRP attributes So much for the "standard" built-in linux account creation script, in which by default every new user is created with primary GID=UID, doesn't really help any of us. Jaime Quoting "Jonathan Buzzard" : > On 03/08/16 17:22, Jaime Pinto wrote: >> Suppose I want to set both USR and GRP quotas for a user, however GRP is >> not the primary group. Will gpfs enforce the secondary group quota for >> that user? 
> > Nope that's not how POSIX schematics work for group quotas. As far as I > can tell only your primary group is used for group quotas. It basically > makes group quotas in Unix a waste of time in my opinion. At least I > have never come across a real world scenario where they work in a > useful manner. > >> What I mean is, if the user keeps writing files with secondary group as >> the attribute, and that overall group quota is reached, will that user >> be stopped by gpfs? >> > > File sets are the answer to your problems, but retrospectively applying > them to a file system is a pain. You create a file set for a directory > and can then apply a quota to the file set. Even better you can apply > per file set user and group quotas. So if file set A has a 1TB quota > you could limit user X to 100GB in the file set, but outside the file > set they could have a different quota or even no quota. > > Only issue is a limit of ~10,000 file sets per file system > > > JAB. > > -- > Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk > Fife, United Kingdom. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. From Kevin.Buterbaugh at Vanderbilt.Edu Wed Aug 3 19:06:34 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 3 Aug 2016 18:06:34 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: Hi Sven, Wait - am I misunderstanding something here? Let?s say that I have ?user1? who has primary group ?group1? and secondary group ?group2?. And let?s say that they write to a directory where the bit on the directory forces all files created in that directory to have group2 associated with them. Are you saying that those files still count against group1?s group quota??? Thanks for clarifying? Kevin On Aug 3, 2016, at 11:35 AM, Sven Oehme > wrote: Hi, quotas are only counted against primary group sven On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > wrote: Suppose I want to set both USR and GRP quotas for a user, however GRP is not the primary group. Will gpfs enforce the secondary group quota for that user? What I mean is, if the user keeps writing files with secondary group as the attribute, and that overall group quota is reached, will that user be stopped by gpfs? Thanks Jaime ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? 
Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From pinto at scinet.utoronto.ca Wed Aug 3 19:30:08 2016 From: pinto at scinet.utoronto.ca (Jaime Pinto) Date: Wed, 03 Aug 2016 14:30:08 -0400 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> Quoting "Buterbaugh, Kevin L" : > Hi Sven, > > Wait - am I misunderstanding something here? Let?s say that I have > ?user1? who has primary group ?group1? and secondary group ?group2?. > And let?s say that they write to a directory where the bit on the > directory forces all files created in that directory to have group2 > associated with them. Are you saying that those files still count > against group1?s group quota??? > > Thanks for clarifying? > > Kevin Not really, My interpretation is that all files written with group2 will count towards the quota on that group. However any users with group2 as the primary group will be prevented from writing any further when the group2 quota is reached. However the culprit user1 with primary group as group1 won't be detected by gpfs, and can just keep going on writing group2 files. As far as the individual user quota, it doesn't matter: group1 or group2 it will be counted towards the usage of that user. It would be interesting if the behavior was more as expected. I just checked with my Lustre counter-parts and they tell me whichever secondary group is hit first, however many there may be, the user will be stopped. The problem then becomes identifying which of the secondary groups hit the limit for that user. Jaime > > On Aug 3, 2016, at 11:35 AM, Sven Oehme > > wrote: > > Hi, > > quotas are only counted against primary group > > sven > > > On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > > wrote: > Suppose I want to set both USR and GRP quotas for a user, however > GRP is not the primary group. Will gpfs enforce the secondary group > quota for that user? > > What I mean is, if the user keeps writing files with secondary group > as the attribute, and that overall group quota is reached, will > that user be stopped by gpfs? > > Thanks > Jaime > > > > > ************************************ > TELL US ABOUT YOUR SUCCESS STORIES > http://www.scinethpc.ca/testimonials > ************************************ > --- > Jaime Pinto > SciNet HPC Consortium - Compute/Calcul Canada > www.scinet.utoronto.ca - > www.computecanada.org > University of Toronto > 256 McCaul Street, Room 235 > Toronto, ON, M5T1W5 > P: 416-978-2755 > C: 416-505-1477 > ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. From Kevin.Buterbaugh at Vanderbilt.Edu Wed Aug 3 19:34:21 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 3 Aug 2016 18:34:21 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? 
In-Reply-To: <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> Message-ID: <78DAAA7C-C0C2-42C2-B6B9-B5EC6CC3A3F4@vanderbilt.edu> Hi Jaime / Sven, If Jaime?s interpretation is correct about user1 continuing to be able to write to ?group2? files even though that group is at their hard limit, then that?s a bug that needs fixing. I haven?t tested that myself, and we?re in a downtime right now so I?m a tad bit busy, but if I need to I?ll test it on our test cluster later this week. Kevin On Aug 3, 2016, at 1:30 PM, Jaime Pinto > wrote: Quoting "Buterbaugh, Kevin L" >: Hi Sven, Wait - am I misunderstanding something here? Let?s say that I have ?user1? who has primary group ?group1? and secondary group ?group2?. And let?s say that they write to a directory where the bit on the directory forces all files created in that directory to have group2 associated with them. Are you saying that those files still count against group1?s group quota??? Thanks for clarifying? Kevin Not really, My interpretation is that all files written with group2 will count towards the quota on that group. However any users with group2 as the primary group will be prevented from writing any further when the group2 quota is reached. However the culprit user1 with primary group as group1 won't be detected by gpfs, and can just keep going on writing group2 files. As far as the individual user quota, it doesn't matter: group1 or group2 it will be counted towards the usage of that user. It would be interesting if the behavior was more as expected. I just checked with my Lustre counter-parts and they tell me whichever secondary group is hit first, however many there may be, the user will be stopped. The problem then becomes identifying which of the secondary groups hit the limit for that user. Jaime On Aug 3, 2016, at 11:35 AM, Sven Oehme > wrote: Hi, quotas are only counted against primary group sven On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > wrote: Suppose I want to set both USR and GRP quotas for a user, however GRP is not the primary group. Will gpfs enforce the secondary group quota for that user? What I mean is, if the user keeps writing files with secondary group as the attribute, and that overall group quota is reached, will that user be stopped by gpfs? Thanks Jaime ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From jonathan at buzzard.me.uk Wed Aug 3 19:46:54 2016 From: jonathan at buzzard.me.uk (Jonathan Buzzard) Date: Wed, 3 Aug 2016 19:46:54 +0100 Subject: [gpfsug-discuss] quota on secondary groups for a user? 
In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: On 03/08/16 19:06, Buterbaugh, Kevin L wrote: > Hi Sven, > > Wait - am I misunderstanding something here? Let?s say that I have > ?user1? who has primary group ?group1? and secondary group ?group2?. > And let?s say that they write to a directory where the bit on the > directory forces all files created in that directory to have group2 > associated with them. Are you saying that those files still count > against group1?s group quota??? > Yeah, but bastard user from hell over here then does chgrp group1 myevilfile.txt and your set group id bit becomes irrelevant because it is only ever indicative. In fact there is nothing that guarantees the set group id bit is honored because there is nothing stopping the user or a program coming in immediately after the file is created and changing that. Not pointing fingers at the OSX SMB client when Unix extensions are active on a Samba server in any way there. As such Unix group quotas are in the real world a total waste of space. This is if you ask me why XFS and Lustre have project quotas and GPFS has file sets. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. From Kevin.Buterbaugh at Vanderbilt.Edu Wed Aug 3 19:55:01 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 3 Aug 2016 18:55:01 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: JAB, The set group id bit is tangential to my point. I expect GPFS to count any files a user owns against their user quota. If they are a member of multiple groups then I also expect it to count it against the group quota of whatever group is associated with that file. I.e., if they do a chgrp then GPFS should subtract from one group and add to another. Kevin On Aug 3, 2016, at 1:46 PM, Jonathan Buzzard > wrote: On 03/08/16 19:06, Buterbaugh, Kevin L wrote: Hi Sven, Wait - am I misunderstanding something here? Let?s say that I have ?user1? who has primary group ?group1? and secondary group ?group2?. And let?s say that they write to a directory where the bit on the directory forces all files created in that directory to have group2 associated with them. Are you saying that those files still count against group1?s group quota??? Yeah, but bastard user from hell over here then does chgrp group1 myevilfile.txt and your set group id bit becomes irrelevant because it is only ever indicative. In fact there is nothing that guarantees the set group id bit is honored because there is nothing stopping the user or a program coming in immediately after the file is created and changing that. Not pointing fingers at the OSX SMB client when Unix extensions are active on a Samba server in any way there. As such Unix group quotas are in the real world a total waste of space. This is if you ask me why XFS and Lustre have project quotas and GPFS has file sets. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... 
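One way to test that expectation directly is to write a file, flip its group, and compare the group report before and after (a sketch; the path, names and file size are examples):

# write a file that lands under group1, then change its group
dd if=/dev/zero of=/gpfs/gpfs01/quotatest/file1 bs=1M count=8
mmrepquota -g gpfs01
chgrp group2 /gpfs/gpfs01/quotatest/file1
mmrepquota -g gpfs01

# if the in-memory accounting looks stale, reconcile it against what is on disk
mmcheckquota gpfs01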
URL: From jonathan at buzzard.me.uk Wed Aug 3 20:13:09 2016 From: jonathan at buzzard.me.uk (Jonathan Buzzard) Date: Wed, 3 Aug 2016 20:13:09 +0100 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <78DAAA7C-C0C2-42C2-B6B9-B5EC6CC3A3F4@vanderbilt.edu> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> <78DAAA7C-C0C2-42C2-B6B9-B5EC6CC3A3F4@vanderbilt.edu> Message-ID: <2b823e10-34e8-ce9d-956c-267df4e6042b@buzzard.me.uk> On 03/08/16 19:34, Buterbaugh, Kevin L wrote: > Hi Jaime / Sven, > > If Jaime?s interpretation is correct about user1 continuing to be able > to write to ?group2? files even though that group is at their hard > limit, then that?s a bug that needs fixing. I haven?t tested that > myself, and we?re in a downtime right now so I?m a tad bit busy, but if > I need to I?ll test it on our test cluster later this week. > Even if Jamie's interpretation is wrong it shows the other massive failure of group quotas under Unix and why they are not fit for purpose in the real world. So bufh here can deliberately or accidentally do a denial of service on other users and tracking down the offending user is a right pain in the backside. The point of being able to change group ownership on a file is to indicate the massive weakness of the whole group quota system, and why in my experience nobody actually uses it, and "project" quota options have been implemented in many "enterprise" Unix file systems. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. From Kevin.Buterbaugh at Vanderbilt.Edu Wed Aug 3 20:18:11 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 3 Aug 2016 19:18:11 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <2b823e10-34e8-ce9d-956c-267df4e6042b@buzzard.me.uk> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> <78DAAA7C-C0C2-42C2-B6B9-B5EC6CC3A3F4@vanderbilt.edu> <2b823e10-34e8-ce9d-956c-267df4e6042b@buzzard.me.uk> Message-ID: <6B06DA37-321E-4730-A3D1-61E41E4C6187@vanderbilt.edu> JAB, Our scratch filesystem uses user and group quotas. It started out as a traditional scratch filesystem but then we decided (for better or worse) to allow groups to purchase quota on it (and we don?t purge it, as many sites do). We have many users in multiple groups, so if this is not working right it?s a potential issue for us. But you?re right, I?m a nobody? Kevin On Aug 3, 2016, at 2:13 PM, Jonathan Buzzard > wrote: On 03/08/16 19:34, Buterbaugh, Kevin L wrote: Hi Jaime / Sven, If Jaime?s interpretation is correct about user1 continuing to be able to write to ?group2? files even though that group is at their hard limit, then that?s a bug that needs fixing. I haven?t tested that myself, and we?re in a downtime right now so I?m a tad bit busy, but if I need to I?ll test it on our test cluster later this week. Even if Jamie's interpretation is wrong it shows the other massive failure of group quotas under Unix and why they are not fit for purpose in the real world. So bufh here can deliberately or accidentally do a denial of service on other users and tracking down the offending user is a right pain in the backside. 
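For what it is worth, hunting the offender down usually comes back to the per-user report plus a metadata scan, along these lines (a sketch; the filesystem name, GID and output prefix are examples and the policy rule is untested):

# per-user usage report for the filesystem
mmrepquota -u gpfs01

# list files owned by the suspect group (GID 1002 here), showing owner and allocated KB
cat > /tmp/bygroup.pol <<'EOF'
RULE EXTERNAL LIST 'g2files' EXEC ''
RULE 'g2' LIST 'g2files' SHOW(varchar(USER_ID) || ' ' || varchar(KB_ALLOCATED)) WHERE GROUP_ID = 1002
EOF
mmapplypolicy gpfs01 -P /tmp/bygroup.pol -I defer -f /tmp/g2scan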
The point of being able to change group ownership on a file is to indicate the massive weakness of the whole group quota system, and why in my experience nobody actually uses it, and "project" quota options have been implemented in many "enterprise" Unix file systems. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From oehmes at gmail.com Wed Aug 3 21:32:32 2016 From: oehmes at gmail.com (Sven Oehme) Date: Wed, 3 Aug 2016 13:32:32 -0700 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <6B06DA37-321E-4730-A3D1-61E41E4C6187@vanderbilt.edu> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> <78DAAA7C-C0C2-42C2-B6B9-B5EC6CC3A3F4@vanderbilt.edu> <2b823e10-34e8-ce9d-956c-267df4e6042b@buzzard.me.uk> <6B06DA37-321E-4730-A3D1-61E41E4C6187@vanderbilt.edu> Message-ID: i can't contribute much to the usefulness of tracking primary or secondary group. depending on who you ask you get a 50/50 answer why its great or broken either way. Jonathan explanation was correct, we only track/enforce primary groups , we don't do anything with secondary groups in regards to quotas. if there is 'doubt' of correct quotation of files on the disk in the filesystem one could always run mmcheckquota, its i/o intensive but will match quota usage of the in memory 'assumption' and update it from the actual data thats stored on disk. sven On Wed, Aug 3, 2016 at 12:18 PM, Buterbaugh, Kevin L < Kevin.Buterbaugh at vanderbilt.edu> wrote: > JAB, > > Our scratch filesystem uses user and group quotas. It started out as a > traditional scratch filesystem but then we decided (for better or worse) to > allow groups to purchase quota on it (and we don?t purge it, as many sites > do). > > We have many users in multiple groups, so if this is not working right > it?s a potential issue for us. But you?re right, I?m a nobody? > > Kevin > > On Aug 3, 2016, at 2:13 PM, Jonathan Buzzard > wrote: > > On 03/08/16 19:34, Buterbaugh, Kevin L wrote: > > Hi Jaime / Sven, > > If Jaime?s interpretation is correct about user1 continuing to be able > to write to ?group2? files even though that group is at their hard > limit, then that?s a bug that needs fixing. I haven?t tested that > myself, and we?re in a downtime right now so I?m a tad bit busy, but if > I need to I?ll test it on our test cluster later this week. > > > Even if Jamie's interpretation is wrong it shows the other massive failure > of group quotas under Unix and why they are not fit for purpose in the real > world. > > So bufh here can deliberately or accidentally do a denial of service on > other users and tracking down the offending user is a right pain in the > backside. > > The point of being able to change group ownership on a file is to indicate > the massive weakness of the whole group quota system, and why in my > experience nobody actually uses it, and "project" quota options have been > implemented in many "enterprise" Unix file systems. > > JAB. > > -- > Jonathan A. 
Buzzard Email: jonathan (at) buzzard.me.uk > Fife, United Kingdom. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > ? > Kevin Buterbaugh - Senior System Administrator > Vanderbilt University - Advanced Computing Center for Research and > Education > Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From Greg.Lehmann at csiro.au Thu Aug 4 00:03:47 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Wed, 3 Aug 2016 23:03:47 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <20160803125543.11831ypcdi8i189b@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> <20160803125543.11831ypcdi8i189b@support.scinet.utoronto.ca> Message-ID: <762ff4f5796c4992b3bceb23b26fdbf3@exch1-cdc.nexus.csiro.au> The GID selection rules for account creation are Linux distribution specific. It sounds like you are familiar with Red Hat, where I think this idea of GID=UID started. sles12sp1-brc:/dev/disk/by-uuid # useradd testout sles12sp1-brc:/dev/disk/by-uuid # grep testout /etc/passwd testout:x:1001:100::/home/testout:/bin/bash sles12sp1-brc:/dev/disk/by-uuid # grep 100 /etc/group users:x:100: sles12sp1-brc:/dev/disk/by-uuid # Cheers, Greg -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Jaime Pinto Sent: Thursday, 4 August 2016 2:56 AM To: gpfsug main discussion list ; Jonathan Buzzard Cc: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] quota on secondary groups for a user? I guess I have a bit of a puzzle to solve, combining quotas on filesets, paths and USR/GRP attributes So much for the "standard" built-in linux account creation script, in which by default every new user is created with primary GID=UID, doesn't really help any of us. Jaime Quoting "Jonathan Buzzard" : > On 03/08/16 17:22, Jaime Pinto wrote: >> Suppose I want to set both USR and GRP quotas for a user, however GRP >> is not the primary group. Will gpfs enforce the secondary group quota >> for that user? > > Nope that's not how POSIX schematics work for group quotas. As far as > I can tell only your primary group is used for group quotas. It > basically makes group quotas in Unix a waste of time in my opinion. At > least I have never come across a real world scenario where they work > in a useful manner. > >> What I mean is, if the user keeps writing files with secondary group >> as the attribute, and that overall group quota is reached, will that >> user be stopped by gpfs? >> > > File sets are the answer to your problems, but retrospectively > applying them to a file system is a pain. You create a file set for a > directory and can then apply a quota to the file set. Even better you > can apply per file set user and group quotas. So if file set A has a > 1TB quota you could limit user X to 100GB in the file set, but outside > the file set they could have a different quota or even no quota. > > Only issue is a limit of ~10,000 file sets per file system > > > JAB. > > -- > Jonathan A. 
Buzzard Email: jonathan (at) buzzard.me.uk > Fife, United Kingdom. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From Greg.Lehmann at csiro.au Thu Aug 4 03:41:55 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Thu, 4 Aug 2016 02:41:55 +0000 Subject: [gpfsug-discuss] 4.2.1 documentation Message-ID: <8033d4a67d9745f4a52f148538423066@exch1-cdc.nexus.csiro.au> I see only 4 pdfs now with slightly different titles to the previous 5 pdfs available with 4.2.0. Just checking there are only supposed to be 4 now? Greg -------------- next part -------------- An HTML attachment was scrubbed... URL: From kenneth.waegeman at ugent.be Thu Aug 4 09:13:29 2016 From: kenneth.waegeman at ugent.be (Kenneth Waegeman) Date: Thu, 4 Aug 2016 10:13:29 +0200 Subject: [gpfsug-discuss] 4.2.1 documentation In-Reply-To: <8033d4a67d9745f4a52f148538423066@exch1-cdc.nexus.csiro.au> References: <8033d4a67d9745f4a52f148538423066@exch1-cdc.nexus.csiro.au> Message-ID: <57A2F929.8000003@ugent.be> This is new, it is explained how they are merged at http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1xx_soc.htm Cheers! K On 04/08/16 04:41, Greg.Lehmann at csiro.au wrote: > > I see only 4 pdfs now with slightly different titles to the previous 5 > pdfs available with 4.2.0. Just checking there are only supposed to be > 4 now? > > Greg > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Thu Aug 4 09:13:51 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Thu, 4 Aug 2016 08:13:51 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> Message-ID: 1000 isn't it?! We've always worked on that assumption. -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Jonathan Buzzard Sent: 03 August 2016 17:44 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] quota on secondary groups for a user? in the file set, but outside the file set they could have a different quota or even no quota. Only issue is a limit of ~10,000 file sets per file system JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From r.sobey at imperial.ac.uk Thu Aug 4 09:17:01 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Thu, 4 Aug 2016 08:17:01 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? 
In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> Message-ID: Ah. Dependent vs independent. (10,000 and 1000 respectively). -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Sobey, Richard A Sent: 04 August 2016 09:14 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] quota on secondary groups for a user? 1000 isn't it?! We've always worked on that assumption. -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Jonathan Buzzard Sent: 03 August 2016 17:44 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] quota on secondary groups for a user? in the file set, but outside the file set they could have a different quota or even no quota. Only issue is a limit of ~10,000 file sets per file system JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From st.graf at fz-juelich.de Thu Aug 4 09:20:42 2016 From: st.graf at fz-juelich.de (Stephan Graf) Date: Thu, 4 Aug 2016 10:20:42 +0200 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> Message-ID: <57A2FADA.1060508@fz-juelich.de> Hi! I have tested it with dependent filesets in GPFS 4.1.1.X and there the limit is 10.000. Stephan On 08/04/16 10:13, Sobey, Richard A wrote: > 1000 isn't it?! We've always worked on that assumption. > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Jonathan Buzzard > Sent: 03 August 2016 17:44 > To: gpfsug-discuss at spectrumscale.org > Subject: Re: [gpfsug-discuss] quota on secondary groups for a user? > in the file set, but outside the file set they could have a different quota or even no quota. > > Only issue is a limit of ~10,000 file sets per file system > > > JAB. > -- Stephan Graf Juelich Supercomputing Centre Institute for Advanced Simulation Forschungszentrum Juelich GmbH 52425 Juelich, Germany Phone: +49-2461-61-6578 Fax: +49-2461-61-6656 E-mail: st.graf at fz-juelich.de WWW: http://www.fz-juelich.de/jsc/ ------------------------------------------------------------------------------------------------ ------------------------------------------------------------------------------------------------ Forschungszentrum Juelich GmbH 52425 Juelich Sitz der Gesellschaft: Juelich Eingetragen im Handelsregister des Amtsgerichts Dueren Nr. HR B 3498 Vorsitzender des Aufsichtsrats: MinDir Dr. Karl Eugen Huthmacher Geschaeftsfuehrung: Prof. Dr.-Ing. Wolfgang Marquardt (Vorsitzender), Karsten Beneke (stellv. Vorsitzender), Prof. Dr.-Ing. Harald Bolt, Prof. Dr. Sebastian M. 
Schmidt ------------------------------------------------------------------------------------------------ ------------------------------------------------------------------------------------------------ From daniel.kidger at uk.ibm.com Thu Aug 4 09:22:36 2016 From: daniel.kidger at uk.ibm.com (Daniel Kidger) Date: Thu, 4 Aug 2016 08:22:36 +0000 Subject: [gpfsug-discuss] 4.2.1 documentation In-Reply-To: <8033d4a67d9745f4a52f148538423066@exch1-cdc.nexus.csiro.au> Message-ID: Yes they have been re arranged. My observation is that the Admin and Advanced Admin have merged into one PDFs, and the DMAPI manual is now a chapter of the new Programming guide (along with the complete set of man pages which have moved out of the Admin guide). Table 3 on page 26 of the Concepts, Planning and Install guide describes these change. IMHO The new format is much better as all Admin is in one place not two. ps. I couldn't find in the programming guide a chapter yet on Light Weight Events. Anyone in product development care to comment? :-) Daniel IBM Spectrum Storage Software +44 (0)7818 522266 Sent from my iPad using IBM Verse On 4 Aug 2016, 03:42:21, Greg.Lehmann at csiro.au wrote: From: Greg.Lehmann at csiro.au To: gpfsug-discuss at spectrumscale.org Cc: Date: 4 Aug 2016 03:42:21 Subject: [gpfsug-discuss] 4.2.1 documentation I see only 4 pdfs now with slightly different titles to the previous 5 pdfs available with 4.2.0. Just checking there are only supposed to be 4 now? GregUnless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU -------------- next part -------------- An HTML attachment was scrubbed... URL: From pinto at scinet.utoronto.ca Thu Aug 4 16:59:31 2016 From: pinto at scinet.utoronto.ca (Jaime Pinto) Date: Thu, 04 Aug 2016 11:59:31 -0400 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> Message-ID: <20160804115931.26601tycacksqhcz@support.scinet.utoronto.ca> Since there were inconsistencies in the responses, I decided to rig a couple of accounts/groups on our LDAP to test "My interpretation", and determined that I was wrong. When Kevin mentioned it would mean a bug I had to double-check: If a user hits the hard quota or exceeds the grace period on the soft quota on any of the secondary groups that user will be stopped from further writing to those groups as well, just as in the primary group. I hope this clears the waters a bit. I still have to solve my puzzle. Thanks everyone for the feedback. Jaime Quoting "Jaime Pinto" : > Quoting "Buterbaugh, Kevin L" : > >> Hi Sven, >> >> Wait - am I misunderstanding something here? Let?s say that I have >> ?user1? who has primary group ?group1? and secondary group >> ?group2?. And let?s say that they write to a directory where the >> bit on the directory forces all files created in that directory to >> have group2 associated with them. Are you saying that those >> files still count against group1?s group quota??? >> >> Thanks for clarifying? >> >> Kevin > > Not really, > > My interpretation is that all files written with group2 will count > towards the quota on that group. However any users with group2 as the > primary group will be prevented from writing any further when the > group2 quota is reached. 
However the culprit user1 with primary group > as group1 won't be detected by gpfs, and can just keep going on writing > group2 files. > > As far as the individual user quota, it doesn't matter: group1 or > group2 it will be counted towards the usage of that user. > > It would be interesting if the behavior was more as expected. I just > checked with my Lustre counter-parts and they tell me whichever > secondary group is hit first, however many there may be, the user will > be stopped. The problem then becomes identifying which of the secondary > groups hit the limit for that user. > > Jaime > > >> >> On Aug 3, 2016, at 11:35 AM, Sven Oehme >> > wrote: >> >> Hi, >> >> quotas are only counted against primary group >> >> sven >> >> >> On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto >> > wrote: >> Suppose I want to set both USR and GRP quotas for a user, however >> GRP is not the primary group. Will gpfs enforce the secondary group >> quota for that user? >> >> What I mean is, if the user keeps writing files with secondary >> group as the attribute, and that overall group quota is reached, >> will that user be stopped by gpfs? >> >> Thanks >> Jaime >> >> >> >> >> ************************************ >> TELL US ABOUT YOUR SUCCESS STORIES >> http://www.scinethpc.ca/testimonials >> ************************************ >> --- >> Jaime Pinto >> SciNet HPC Consortium - Compute/Calcul Canada >> www.scinet.utoronto.ca - >> www.computecanada.org >> University of Toronto >> 256 McCaul Street, Room 235 >> Toronto, ON, M5T1W5 >> P: 416-978-2755 >> C: 416-505-1477 >> > > > ---------------------------------------------------------------- > This message was sent using IMP at SciNet Consortium, University of Toronto. > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. From Kevin.Buterbaugh at Vanderbilt.Edu Thu Aug 4 17:08:30 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Thu, 4 Aug 2016 16:08:30 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <20160804115931.26601tycacksqhcz@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> <20160804115931.26601tycacksqhcz@support.scinet.utoronto.ca> Message-ID: <7C0606E3-37D9-4301-8676-5060A0984FF2@vanderbilt.edu> Hi Jaime, Thank you sooooo much for doing this and reporting back the results! They?re in line with what I would expect to happen. I was going to test this as well, but we have had to extend our downtime until noontime tomorrow, so I haven?t had a chance to do so yet. Now I don?t have to? ;-) Kevin On Aug 4, 2016, at 10:59 AM, Jaime Pinto > wrote: Since there were inconsistencies in the responses, I decided to rig a couple of accounts/groups on our LDAP to test "My interpretation", and determined that I was wrong. 
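For anyone who wants to repeat that kind of check on a test cluster, a minimal sketch (recent 4.x mmsetquota syntax, older releases use mmedquota; user1, group2, gpfs01 and the paths are placeholders, and user1 has group2 only as a secondary group):

# give group2 a small hard limit
mmsetquota gpfs01 --group group2 --block 64M:64M

# write as user1 with group2 active via sg; dd should fail with
# "Disk quota exceeded" (EDQUOT) once the limit is hit
su - user1 -c 'sg group2 -c "dd if=/dev/zero of=/gpfs/gpfs01/scratch/blob bs=1M count=128"'

mmlsquota -g group2 gpfs01      # group2 should now show as over its limit
# if the accounting ever looks stale, mmcheckquota gpfs01 rescans it (I/O heavy)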
When Kevin mentioned it would mean a bug I had to double-check: If a user hits the hard quota or exceeds the grace period on the soft quota on any of the secondary groups that user will be stopped from further writing to those groups as well, just as in the primary group. I hope this clears the waters a bit. I still have to solve my puzzle. Thanks everyone for the feedback. Jaime Quoting "Jaime Pinto" >: Quoting "Buterbaugh, Kevin L" >: Hi Sven, Wait - am I misunderstanding something here? Let?s say that I have ?user1? who has primary group ?group1? and secondary group ?group2?. And let?s say that they write to a directory where the bit on the directory forces all files created in that directory to have group2 associated with them. Are you saying that those files still count against group1?s group quota??? Thanks for clarifying? Kevin Not really, My interpretation is that all files written with group2 will count towards the quota on that group. However any users with group2 as the primary group will be prevented from writing any further when the group2 quota is reached. However the culprit user1 with primary group as group1 won't be detected by gpfs, and can just keep going on writing group2 files. As far as the individual user quota, it doesn't matter: group1 or group2 it will be counted towards the usage of that user. It would be interesting if the behavior was more as expected. I just checked with my Lustre counter-parts and they tell me whichever secondary group is hit first, however many there may be, the user will be stopped. The problem then becomes identifying which of the secondary groups hit the limit for that user. Jaime On Aug 3, 2016, at 11:35 AM, Sven Oehme > wrote: Hi, quotas are only counted against primary group sven On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > wrote: Suppose I want to set both USR and GRP quotas for a user, however GRP is not the primary group. Will gpfs enforce the secondary group quota for that user? What I mean is, if the user keeps writing files with secondary group as the attribute, and that overall group quota is reached, will that user be stopped by gpfs? Thanks Jaime ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? 
Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From pinto at scinet.utoronto.ca Thu Aug 4 17:34:09 2016 From: pinto at scinet.utoronto.ca (Jaime Pinto) Date: Thu, 04 Aug 2016 12:34:09 -0400 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <7C0606E3-37D9-4301-8676-5060A0984FF2@vanderbilt.edu> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> <20160804115931.26601tycacksqhcz@support.scinet.utoronto.ca> <7C0606E3-37D9-4301-8676-5060A0984FF2@vanderbilt.edu> Message-ID: <20160804123409.18403cy3iz123gxt@support.scinet.utoronto.ca> OK More info: Users can apply the 'sg group1' or 'sq group2' command from a shell or script to switch the group mask from that point on, and dodge the quota that may have been exceeded on a group. However, as the group owner or other member of the group on the limit, I could not find a tool they can use on their own to find out who is(are) the largest user(s); 'du' takes too long, and some users don't give read permissions on their directories. As part of the puzzle solution I have to come up with a root wrapper that can make the contents of the mmrepquota report available to them. Jaime Quoting "Buterbaugh, Kevin L" : > Hi Jaime, > > Thank you sooooo much for doing this and reporting back the results! > They?re in line with what I would expect to happen. I was going > to test this as well, but we have had to extend our downtime until > noontime tomorrow, so I haven?t had a chance to do so yet. Now I > don?t have to? ;-) > > Kevin > > On Aug 4, 2016, at 10:59 AM, Jaime Pinto > > wrote: > > Since there were inconsistencies in the responses, I decided to rig > a couple of accounts/groups on our LDAP to test "My interpretation", > and determined that I was wrong. When Kevin mentioned it would mean > a bug I had to double-check: > > If a user hits the hard quota or exceeds the grace period on the > soft quota on any of the secondary groups that user will be stopped > from further writing to those groups as well, just as in the primary > group. > > I hope this clears the waters a bit. I still have to solve my puzzle. > > Thanks everyone for the feedback. > Jaime > > > > Quoting "Jaime Pinto" > >: > > Quoting "Buterbaugh, Kevin L" > >: > > Hi Sven, > > Wait - am I misunderstanding something here? Let?s say that I have > ?user1? who has primary group ?group1? and secondary group > ?group2?. And let?s say that they write to a directory where the > bit on the directory forces all files created in that directory to > have group2 associated with them. Are you saying that those files > still count against group1?s group quota??? > > Thanks for clarifying? > > Kevin > > Not really, > > My interpretation is that all files written with group2 will count > towards the quota on that group. However any users with group2 as the > primary group will be prevented from writing any further when the > group2 quota is reached. However the culprit user1 with primary group > as group1 won't be detected by gpfs, and can just keep going on writing > group2 files. > > As far as the individual user quota, it doesn't matter: group1 or > group2 it will be counted towards the usage of that user. > > It would be interesting if the behavior was more as expected. 
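One possible shape for that wrapper, assuming it is published through sudo so that SUDO_USER identifies the real caller (untested sketch; the script path, filesystem name and the three-header-line assumption are placeholders):

#!/bin/bash
# /usr/local/sbin/mygrpquota - print only the mmrepquota -g lines whose
# first column is one of the calling user's groups
fs=gpfs01
caller=${SUDO_USER:-$USER}
pattern=$(id -Gn "$caller" | tr ' ' '|')
/usr/lpp/mmfs/bin/mmrepquota -g "$fs" | awk -v pat="^(${pattern})$" 'NR<=3 || $1 ~ pat'

Users would then run something like "sudo mygrpquota", with a sudoers entry restricted to that one script.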
I just > checked with my Lustre counter-parts and they tell me whichever > secondary group is hit first, however many there may be, the user will > be stopped. The problem then becomes identifying which of the secondary > groups hit the limit for that user. > > Jaime > > > > On Aug 3, 2016, at 11:35 AM, Sven Oehme > > > wrote: > > Hi, > > quotas are only counted against primary group > > sven > > > On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > > > wrote: > Suppose I want to set both USR and GRP quotas for a user, however > GRP is not the primary group. Will gpfs enforce the secondary group > quota for that user? > > What I mean is, if the user keeps writing files with secondary > group as the attribute, and that overall group quota is reached, > will that user be stopped by gpfs? > > Thanks > Jaime > > > > > ************************************ > TELL US ABOUT YOUR SUCCESS STORIES > http://www.scinethpc.ca/testimonials > ************************************ > --- > Jaime Pinto > SciNet HPC Consortium - Compute/Calcul Canada > www.scinet.utoronto.ca - > www.computecanada.org > University of Toronto > 256 McCaul Street, Room 235 > Toronto, ON, M5T1W5 > P: 416-978-2755 > C: 416-505-1477 > > > > ---------------------------------------------------------------- > This message was sent using IMP at SciNet Consortium, University of Toronto. > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > > > ************************************ > TELL US ABOUT YOUR SUCCESS STORIES > http://www.scinethpc.ca/testimonials > ************************************ > --- > Jaime Pinto > SciNet HPC Consortium - Compute/Calcul Canada > www.scinet.utoronto.ca - > www.computecanada.org > University of Toronto > 256 McCaul Street, Room 235 > Toronto, ON, M5T1W5 > P: 416-978-2755 > C: 416-505-1477 > > ---------------------------------------------------------------- > This message was sent using IMP at SciNet Consortium, University of Toronto. > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ? > Kevin Buterbaugh - Senior System Administrator > Vanderbilt University - Advanced Computing Center for Research and Education > Kevin.Buterbaugh at vanderbilt.edu - > (615)875-9633 > > > > ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. From Kevin.Buterbaugh at Vanderbilt.Edu Wed Aug 10 22:00:26 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 10 Aug 2016 21:00:26 +0000 Subject: [gpfsug-discuss] User group meeting at SC16? Message-ID: Hi All, Just got an e-mail from DDN announcing that they are holding their user group meeting at SC16 on Monday afternoon like they always do, which is prompting me to inquire if IBM is going to be holding a meeting at SC16? Last year in Austin the IBM meeting was on Sunday afternoon, which worked out great as far as I was concerned. Thanks? ? 
Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Wed Aug 10 22:04:11 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Wed, 10 Aug 2016 21:04:11 +0000 Subject: [gpfsug-discuss] User group meeting at SC16? In-Reply-To: References: Message-ID: <95126B16-B4DB-4406-862B-AA81E37F04E6@nuance.com> We're still trying to schedule that - The thinking right now is staying where last year. (Sunday afternoon) There is never a perfect time at these sorts of event - bound to step on something! If anyone has feedback (positive or negative) - let us know. Look for a formal announcement in early September. Bob Oesterlin GPFS-UG Co-Principal Sr Storage Engineer, Nuance HPC Grid From: on behalf of "Buterbaugh, Kevin L" Reply-To: gpfsug main discussion list Date: Wednesday, August 10, 2016 at 4:00 PM To: gpfsug main discussion list Subject: [EXTERNAL] [gpfsug-discuss] User group meeting at SC16? Hi All, Just got an e-mail from DDN announcing that they are holding their user group meeting at SC16 on Monday afternoon like they always do, which is prompting me to inquire if IBM is going to be holding a meeting at SC16? Last year in Austin the IBM meeting was on Sunday afternoon, which worked out great as far as I was concerned. Thanks? ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From malone12 at illinois.edu Wed Aug 10 22:43:15 2016 From: malone12 at illinois.edu (Maloney, John Daniel) Date: Wed, 10 Aug 2016 21:43:15 +0000 Subject: [gpfsug-discuss] User group meeting at SC16? Message-ID: <4AD486D7-D452-465A-85EC-1BDDE2C5DCFD@illinois.edu> Hi Bob, Thanks for the update! The couple storage folks from NCSA going to SC16 won?t be available Sunday (I?m not able to get in until Monday morning). Agree completely there is never a perfect time, just giving our feedback. Thanks again, J.D. Maloney Storage Engineer | Storage Enabling Technologies Group National Center for Supercomputing Applications (NCSA) From: > on behalf of "Oesterlin, Robert" > Reply-To: gpfsug main discussion list > Date: Wednesday, August 10, 2016 at 4:04 PM To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] User group meeting at SC16? We're still trying to schedule that - The thinking right now is staying where last year. (Sunday afternoon) There is never a perfect time at these sorts of event - bound to step on something! If anyone has feedback (positive or negative) - let us know. Look for a formal announcement in early September. Bob Oesterlin GPFS-UG Co-Principal Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Buterbaugh, Kevin L" > Reply-To: gpfsug main discussion list > Date: Wednesday, August 10, 2016 at 4:00 PM To: gpfsug main discussion list > Subject: [EXTERNAL] [gpfsug-discuss] User group meeting at SC16? Hi All, Just got an e-mail from DDN announcing that they are holding their user group meeting at SC16 on Monday afternoon like they always do, which is prompting me to inquire if IBM is going to be holding a meeting at SC16? Last year in Austin the IBM meeting was on Sunday afternoon, which worked out great as far as I was concerned. Thanks? ? 
Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Thu Aug 11 05:47:17 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Thu, 11 Aug 2016 00:47:17 -0400 Subject: [gpfsug-discuss] GPFS and SELinux Message-ID: Hi Everyone, I'm passing this along on behalf of one of our security guys. Just wondering what feedback/thoughts others have on the topic. Current IBM guidance on GPFS and SELinux indicates that the default context for services (initrc_t) is insufficient for GPFS operations. See: https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/General+Parallel+File+System+(GPFS)/page/Using+GPFS+with+SElinux That part is true (by design), but IBM goes further to say use runcon out of rc.local and configure the gpfs service to not start via init. I believe these latter two (rc.local/runcon and no-init) can be addressed, relatively trivially, through the application of a small selinux policy. Ideally, I would hope for IBM to develop, test, and send out the policy, but I'm happy to offer the following suggestions. I believe "a)" could be developed in a relatively short period of time. "b)" would take more time, effort and experience. a) consider SELinux context transition. As an example, consider: https://github.com/TresysTechnology/refpolicy/tree/master/policy/modules/services (specifically, the ssh components) On a normal centOS/RHEL system sshd has the file context of sshd_exec_t, and runs under sshd_t Referencing ssh.te, you see several references to sshd_exec_t in: domtrans_pattern init_daemon_domain daemontools_service_domain (and so on) These configurations allow init to fire sshd off, setting its runtime context to sshd_t, based on the file context of sshd_exec_t. This should be duplicable for the gpfs daemon, altho I note it seems to be fired through a layer of abstraction in mmstartup. A simple policy that allows INIT to transition GPFS to unconfined_t would go a long way towards easing integration. b) file contexts of gpfs_daemon_t and gpfs_util_t, perhaps, that when executed, would pick up a context of gpfs_t? Which then could be mapped through standard SELinux policy to allow access to configuration files (gpfs_etc_t?), block devices, etc? I admit, in b, I am speculating heavily. -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From janfrode at tanso.net Thu Aug 11 10:54:27 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Thu, 11 Aug 2016 11:54:27 +0200 Subject: [gpfsug-discuss] GPFS and SELinux In-Reply-To: References: Message-ID: I believe the runcon part is no longer necessary, at least on my RHEL7 based systems mmfsd is running unconfined by default: [root at flexscale01 ~]# ps -efZ|grep mmfsd unconfined_u:unconfined_r:unconfined_t:s0-s0:c0.c1023 root 18018 17709 0 aug.05 ? 00:24:53 /usr/lpp/mmfs/bin/mmfsd and I've never seen any problems with that for base GPFS. I suspect doing a proper selinux domain for GPFS will be quite close to unconfined, so maybe not worth the effort... -jf On Thu, Aug 11, 2016 at 6:47 AM, Aaron Knister wrote: > Hi Everyone, > > I'm passing this along on behalf of one of our security guys. Just > wondering what feedback/thoughts others have on the topic. 
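To make option (a) concrete, an untested sketch of such a transition module, built with the refpolicy devel Makefile (needs the selinux-policy-devel package; the type gpfs_exec_t and the module name are made up for illustration, and depending on how mmstartup/runmmfs exec things the label may need to go on a parent script rather than mmfsd itself):

cat > gpfs_local.te <<'EOF'
policy_module(gpfs_local, 1.0)

require {
        type initrc_t;
        type unconfined_t;
}

# new entrypoint label for the GPFS daemon binary
type gpfs_exec_t;
files_type(gpfs_exec_t)

# allow init scripts (initrc_t) to transition anything labelled
# gpfs_exec_t into unconfined_t when it is exec'd
domtrans_pattern(initrc_t, gpfs_exec_t, unconfined_t)
EOF

make -f /usr/share/selinux/devel/Makefile gpfs_local.pp
semodule -i gpfs_local.pp
semanage fcontext -a -t gpfs_exec_t '/usr/lpp/mmfs/bin/mmfsd'
restorecon -v /usr/lpp/mmfs/bin/mmfsd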
> > > Current IBM guidance on GPFS and SELinux indicates that the default > context for services (initrc_t) is insufficient for GPFS operations. > > See: > https://www.ibm.com/developerworks/community/wikis/home? > lang=en#!/wiki/General+Parallel+File+System+(GPFS)/ > page/Using+GPFS+with+SElinux > > > That part is true (by design), but IBM goes further to say use runcon > out of rc.local and configure the gpfs service to not start via init. > > I believe these latter two (rc.local/runcon and no-init) can be > addressed, relatively trivially, through the application of a small > selinux policy. > > Ideally, I would hope for IBM to develop, test, and send out the policy, > but I'm happy to offer the following suggestions. I believe "a)" could > be developed in a relatively short period of time. "b)" would take more > time, effort and experience. > > a) consider SELinux context transition. > > As an example, consider: > https://github.com/TresysTechnology/refpolicy/tree/master/ > policy/modules/services > > > (specifically, the ssh components) > > On a normal centOS/RHEL system sshd has the file context of sshd_exec_t, > and runs under sshd_t > > Referencing ssh.te, you see several references to sshd_exec_t in: > domtrans_pattern > init_daemon_domain > daemontools_service_domain > (and so on) > > These configurations allow init to fire sshd off, setting its runtime > context to sshd_t, based on the file context of sshd_exec_t. > > This should be duplicable for the gpfs daemon, altho I note it seems to > be fired through a layer of abstraction in mmstartup. > > A simple policy that allows INIT to transition GPFS to unconfined_t > would go a long way towards easing integration. > > b) file contexts of gpfs_daemon_t and gpfs_util_t, perhaps, that when > executed, would pick up a context of gpfs_t? Which then could be mapped > through standard SELinux policy to allow access to configuration files > (gpfs_etc_t?), block devices, etc? > > I admit, in b, I am speculating heavily. > > > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From douglasof at us.ibm.com Fri Aug 12 20:40:27 2016 From: douglasof at us.ibm.com (Douglas O'flaherty) Date: Fri, 12 Aug 2016 19:40:27 +0000 Subject: [gpfsug-discuss] HPCwire Readers Choice Message-ID: Reminder... Get your stories in today! To view this email in your browser, click here. Last Call for Readers' Choice Award Nominations! Deadline: Friday, August 12th at 11:50pm! Only 3 days left until nominations for the 2016 HPCwire Readers' Choice Awards come to a close! Be sure to submit your picks for the best in HPC and make your voice heard before it's too late! These annual awards are a way for our community to recognize the best and brightest innovators within the global HPC community. Time is running out for you to nominate what you think are the greatest achievements in HPC for 2016, so cast your ballot today! 
The 2016 Categories Include the Following: * Best Use of HPC Application in Life Sciences * Best Use of HPC Application in Manufacturing * Best Use of HPC Application in Energy (previously 'Oil and Gas') * Best Use of HPC in Automotive * Best Use of HPC in Financial Services * Best Use of HPC in Entertainment * Best Use of HPC in the Cloud * Best Use of High Performance Data Analytics * Best Implementation of Energy-Efficient HPC * Best HPC Server Product or Technology * Best HPC Storage Product or Technology * Best HPC Software Product or Technology * Best HPC Visualization Product or Technology * Best HPC Interconnect Product or Technology * Best HPC Cluster Solution or Technology * Best Data-Intensive System (End-User Focused) * Best HPC Collaboration Between Government & Industry * Best HPC Collaboration Between Academia & Industry * Top Supercomputing Achievement * Top 5 New Products or Technologies to Watch * Top 5 Vendors to Watch * Workforce Diversity Leadership Award * Outstanding Leadership in HPC Nominations are accepted from readers, users, vendors - virtually anyone who is connected to the HPC community and is a reader of HPCwire. Nominations will close on August 12, 2016 at 11:59pm. Make your voice heard! Help tell the story of HPC in 2016 by submitting your nominations for the HPCwire Readers' Choice Awards now! Nominations close on August 12, 2016. All nominations are subject to review by the editors of HPCwire with only the most relevant being accepted. Voting begins August 22, 2015. The final presentation of these prestigious and highly anticipated awards to each organization's leading executives will take place live during SC '16 in Salt Lake City, UT. The finalist(s) in each category who receive the most votes will win this year's awards. Open to HPCwire readers only. HPCwire Subscriber Services This email was sent to lwestoby at us.ibm.com. You are receiving this email message as an HPCwire subscriber. To forward this email to a friend, click here. Unsubscribe from this list. Copyright ? 2016 Tabor Communications Inc. All rights reserved. 8445 Camino Santa Fe San Diego, California 92121 P: 858.625.0070 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 40078 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 5880 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: From r.sobey at imperial.ac.uk Mon Aug 15 10:59:34 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Mon, 15 Aug 2016 09:59:34 +0000 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? Message-ID: Hi all, If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger it over a few days, perhaps up to 2 weeks or will I run into problems if they're on different versions? Cheers Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Mon Aug 15 12:22:31 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Mon, 15 Aug 2016 11:22:31 +0000 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? In-Reply-To: References: Message-ID: In general, yes, it's common practice to do the 'rolling upgrades'. If I had to do my whole cluster at once, with an outage, I'd probably never upgrade. :) Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: on behalf of "Sobey, Richard A" Reply-To: gpfsug main discussion list Date: Monday, August 15, 2016 at 4:59 AM To: "'gpfsug-discuss at spectrumscale.org'" Subject: [EXTERNAL] [gpfsug-discuss] Minor GPFS versions coexistence problems? Hi all, If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger it over a few days, perhaps up to 2 weeks or will I run into problems if they?re on different versions? Cheers Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From Kevin.Buterbaugh at Vanderbilt.Edu Mon Aug 15 13:45:25 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Mon, 15 Aug 2016 12:45:25 +0000 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? In-Reply-To: References: Message-ID: <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu> Richard, I will second what Bob said with one caveat ? on one occasion we had an issue with our multi-cluster setup because the PTF?s were incompatible. However, that was clearly documented in the release notes, which we obviously hadn?t read carefully enough. 
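For reference, the node-at-a-time procedure usually looks roughly like this (a sketch only; the exact package list and the portability-layer rebuild step depend on the release and distro):

mmgetstate -a                 # check quorum will survive losing this node
mmshutdown -N nsd01           # stop GPFS on the node being updated
rpm -Uvh gpfs.*.rpm           # apply the PTF packages extracted from the update
# rebuild the portability layer for the running kernel
# (mmbuildgpl on 4.1+, the make Autoconfig/World/InstallImages steps on 3.5)
mmstartup -N nsd01
mmgetstate -N nsd01           # wait for 'active' before moving to the next node
mmdiag --version              # run on the node to confirm the new build level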
While we generally do rolling upgrades over a two to three week period, we have run for months with clients at differing PTF levels. HTHAL? Kevin On Aug 15, 2016, at 6:22 AM, Oesterlin, Robert > wrote: In general, yes, it's common practice to do the 'rolling upgrades'. If I had to do my whole cluster at once, with an outage, I'd probably never upgrade. :) Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Sobey, Richard A" > Reply-To: gpfsug main discussion list > Date: Monday, August 15, 2016 at 4:59 AM To: "'gpfsug-discuss at spectrumscale.org'" > Subject: [EXTERNAL] [gpfsug-discuss] Minor GPFS versions coexistence problems? Hi all, If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger it over a few days, perhaps up to 2 weeks or will I run into problems if they?re on different versions? Cheers Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Mon Aug 15 13:58:47 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Mon, 15 Aug 2016 12:58:47 +0000 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? In-Reply-To: <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu> References: <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu> Message-ID: Thanks Kevin and Bob. PTF = minor version? I can?t think what it might stand for. Something Time Fix? Point in time fix? From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: 15 August 2016 13:45 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Minor GPFS versions coexistence problems? Richard, I will second what Bob said with one caveat ? on one occasion we had an issue with our multi-cluster setup because the PTF?s were incompatible. However, that was clearly documented in the release notes, which we obviously hadn?t read carefully enough. While we generally do rolling upgrades over a two to three week period, we have run for months with clients at differing PTF levels. HTHAL? Kevin On Aug 15, 2016, at 6:22 AM, Oesterlin, Robert > wrote: In general, yes, it's common practice to do the 'rolling upgrades'. If I had to do my whole cluster at once, with an outage, I'd probably never upgrade. :) Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Sobey, Richard A" > Reply-To: gpfsug main discussion list > Date: Monday, August 15, 2016 at 4:59 AM To: "'gpfsug-discuss at spectrumscale.org'" > Subject: [EXTERNAL] [gpfsug-discuss] Minor GPFS versions coexistence problems? Hi all, If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger it over a few days, perhaps up to 2 weeks or will I run into problems if they?re on different versions? Cheers Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? 
Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From jamiedavis at us.ibm.com Mon Aug 15 14:02:13 2016 From: jamiedavis at us.ibm.com (James Davis) Date: Mon, 15 Aug 2016 13:02:13 +0000 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? In-Reply-To: References: , <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu> Message-ID: An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Mon Aug 15 14:05:01 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Mon, 15 Aug 2016 13:05:01 +0000 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? Message-ID: <28479088-C492-4441-A761-F49E1556E13E@nuance.com> PTF = Program Temporary Fix. IBM-Speak for a fix for a particular problem. Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: on behalf of "Sobey, Richard A" Reply-To: gpfsug main discussion list Date: Monday, August 15, 2016 at 7:58 AM To: gpfsug main discussion list Subject: [EXTERNAL] Re: [gpfsug-discuss] Minor GPFS versions coexistence problems? Thanks Kevin and Bob. PTF = minor version? I can?t think what it might stand for. Something Time Fix? Point in time fix? From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: 15 August 2016 13:45 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Minor GPFS versions coexistence problems? Richard, I will second what Bob said with one caveat ? on one occasion we had an issue with our multi-cluster setup because the PTF?s were incompatible. However, that was clearly documented in the release notes, which we obviously hadn?t read carefully enough. While we generally do rolling upgrades over a two to three week period, we have run for months with clients at differing PTF levels. HTHAL? Kevin On Aug 15, 2016, at 6:22 AM, Oesterlin, Robert > wrote: In general, yes, it's common practice to do the 'rolling upgrades'. If I had to do my whole cluster at once, with an outage, I'd probably never upgrade. :) Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Sobey, Richard A" > Reply-To: gpfsug main discussion list > Date: Monday, August 15, 2016 at 4:59 AM To: "'gpfsug-discuss at spectrumscale.org'" > Subject: [EXTERNAL] [gpfsug-discuss] Minor GPFS versions coexistence problems? Hi all, If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger it over a few days, perhaps up to 2 weeks or will I run into problems if they?re on different versions? Cheers Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From kdball at us.ibm.com Mon Aug 15 15:12:07 2016 From: kdball at us.ibm.com (Keith D Ball) Date: Mon, 15 Aug 2016 14:12:07 +0000 Subject: [gpfsug-discuss] gpfsug-discuss Digest, Vol 55, Issue 16 In-Reply-To: References: Message-ID: An HTML attachment was scrubbed... 
URL: From jake.carroll at uq.edu.au Mon Aug 15 22:08:58 2016 From: jake.carroll at uq.edu.au (Jake Carroll) Date: Mon, 15 Aug 2016 21:08:58 +0000 Subject: [gpfsug-discuss] More on AFM cache chaining Message-ID: <94AB3BCD-B551-4F3E-9128-65B582A4ABC6@uq.edu.au> Hi there. In the spirit of a conversation a friend showed me a couple of weeks ago from Radhika Parameswaran and Luke Raimbach, we?re doing something similar to Luke (kind of), or at least attempting it, in regards to cache chaining. We?ve got a large research storage platform in Brisbane, Queensland, Australia and we?re trying to leverage a few different modes of operation. Currently: Cache A (IW) connects to what would be a Home (B) which then is effectively an NFS mount to (C) a DMF based NFS export. To a point, this works. It kind of allows us to use ?home? as the ultimate sink, and data migration in and out of DMF seems to be working nicely when GPFS pulls things from (B) which don?t appear to currently be in (A) due to policy, or a HWM was hit (thus emptying cache). We?ve tested it as far out as the data ONLY being offline in tape media inside (C) and it still works, cleanly coming back to (A) within a very reasonable time-frame. ? We hit ?problem 1? which is in and around NFS v4 ACL?s which aren?t surfacing or mapping correctly (as we?d expect). I guess this might be the caveat of trying to backend the cache to a home and have it sitting inside DMF (over an NFS Export) for surfacing of the data for clients. Where we?d like to head: We haven?t seen it yet, but as Luke and Radhika were discussing last month, we really liked the idea of an IW Cache (A, where instruments dump huge data) which then via AFM ends up at (B) (might also be technically ?home? but IW) which is then also a function of (C) which might also be another cache that sits next to a HPC platform for reading and writing data into quickly and out of in parallel. We like the idea of chained caches because it gives us extremely flexibility in the premise of our ?Data anywhere? fabric. We appreciate that this has some challenges, in that we know if you?ve got multiple IW scenarios the last write will always win ? this we can control with workload guidelines. But we?d like to add our voices to this idea of having caches chained all the way back to some point such that data is being pulled all the way from C --> B --> A and along the way, inflection points of IO might be written and read at point C and point B AND point A such that everyone would see the distribution and consistent data in the end. We?re also working on surfacing data via object and file simultaneously for different needs. This is coming along relatively well, but we?re still learning about where and where this does not make sense so far. A moving target, from how it all appears on the surface. Some might say that is effectively asking for a globally eventually (always) consistent filesystem within Scale?. Anyway ? just some thoughts. Regards, -jc -------------- next part -------------- An HTML attachment was scrubbed... 
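For readers who have not set up the building blocks being chained here, a minimal sketch of one IW cache fileset backed by an NFS-exported home (fileset, export and junction names are placeholders; this on its own is a single cache-to-home hop, not the multi-level A -> B -> C chain described above):

# on the cache cluster: independent-writer AFM fileset pointing at the home export
mmcrfileset gpfs01 instr_cache --inode-space=new \
    -p afmtarget=nfs://homenas/export/projects/instr -p afmmode=iw
mmlinkfileset gpfs01 instr_cache -J /gpfs/gpfs01/instr_cache

# check the cache/home relationship and queue state
mmafmctl gpfs01 getstate -j instr_cache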
URL: From aaron.s.knister at nasa.gov Tue Aug 16 03:22:17 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 15 Aug 2016 22:22:17 -0400 Subject: [gpfsug-discuss] mmfsadm test pit Message-ID: I just discovered this interesting gem poking at mmfsadm: test pit fsname list|suspend|status|resume|stop [jobId] There have been times where I've kicked off a restripe and either intentionally or accidentally ctrl-c'd it only to realize that many times it's disappeared into the ether and is still running. The only way I've known so far to stop it is with a chgmgr. A far more painful instance happened when I ran a rebalance on an fs w/more than 31 nsds using more than 31 pit workers and hit *that* fun APAR which locked up access for a single filesystem to all 3.5k nodes. We spent 48 hours round the clock rebooting nodes as jobs drained to clear it up. I would have killed in that instance for a way to cancel the PIT job (the chmgr trick didn't work). It looks like you might actually be able to do this with mmfsadm, although how wise this is, I do not know (kinda curious about that). Here's an example. I kicked off a restripe and then ctrl-c'd it on a client node. Then ran these commands from the fs manager: root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list JobId 785979015170 PitJobStatus PIT_JOB_RUNNING progress 0.00 debug: statusListP D40E2C70 root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal stop 785979015170 debug: statusListP 0 root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list JobId 785979015170 PitJobStatus PIT_JOB_STOPPING progress 4.01 debug: statusListP D4013E70 ... some time passes ... root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list debug: statusListP 0 Interesting. -Aaron -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From volobuev at us.ibm.com Tue Aug 16 16:21:13 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Tue, 16 Aug 2016 08:21:13 -0700 Subject: [gpfsug-discuss] 4.2.1 documentation In-Reply-To: References: <8033d4a67d9745f4a52f148538423066@exch1-cdc.nexus.csiro.au> Message-ID: Light Weight Event support is not fully baked yet, and thus not documented. It's getting there. yuri From: "Daniel Kidger" To: "gpfsug main discussion list" , Cc: "gpfsug-discuss" Date: 08/04/2016 01:23 AM Subject: Re: [gpfsug-discuss] 4.2.1 documentation Sent by: gpfsug-discuss-bounces at spectrumscale.org Yes they have been re arranged. My observation is that the Admin and Advanced Admin have merged into one PDFs, and the DMAPI manual is now a chapter of the new Programming guide (along with the complete set of man pages which have moved out of the Admin guide). Table 3 on page 26 of the Concepts, Planning and Install guide describes these change. IMHO The new format is much better as all Admin is in one place not two. ps. I couldn't find in the programming guide a chapter yet on Light Weight Events. Anyone in product development care to comment? :-) Daniel IBM Spectrum Storage Software +44 (0)7818 522266 Sent from my iPad using IBM Verse On 4 Aug 2016, 03:42:21, Greg.Lehmann at csiro.au wrote: From: Greg.Lehmann at csiro.au To: gpfsug-discuss at spectrumscale.org Cc: Date: 4 Aug 2016 03:42:21 Subject: [gpfsug-discuss] 4.2.1 documentation I see only 4 pdfs now with slightly different titles to the previous 5 pdfs available with 4.2.0. Just checking there are only supposed to be 4 now? 
Greg Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From volobuev at us.ibm.com Tue Aug 16 16:42:33 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Tue, 16 Aug 2016 08:42:33 -0700 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <20160804123409.18403cy3iz123gxt@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca><20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca><20160804115931.26601tycacksqhcz@support.scinet.utoronto.ca><7C0606E3-37D9-4301-8676-5060A0984FF2@vanderbilt.edu> <20160804123409.18403cy3iz123gxt@support.scinet.utoronto.ca> Message-ID: This is a long discussion thread, touching on several related subjects, but as far as the original "secondary groups" question, things are quite simple. A file in a Unix file system has an owning user and an owning group. Those are two IDs that are stored in the inode on disk, and those IDs are used to charge the corresponding user and group quotas. Exactly how the owning GID gets set is an entirely separate question. It may be the current user's primary group, or a secondary group, or a result of chown, etc. To GPFS code it doesn't matter what supplementary GIDs a given thread has in its security context for the purposes of charging group quota, the only thing that matters is the GID in the file inode. yuri From: "Jaime Pinto" To: "gpfsug main discussion list" , "Buterbaugh, Kevin L" , Date: 08/04/2016 09:34 AM Subject: Re: [gpfsug-discuss] quota on secondary groups for a user? Sent by: gpfsug-discuss-bounces at spectrumscale.org OK More info: Users can apply the 'sg group1' or 'sq group2' command from a shell or script to switch the group mask from that point on, and dodge the quota that may have been exceeded on a group. However, as the group owner or other member of the group on the limit, I could not find a tool they can use on their own to find out who is(are) the largest user(s); 'du' takes too long, and some users don't give read permissions on their directories. As part of the puzzle solution I have to come up with a root wrapper that can make the contents of the mmrepquota report available to them. Jaime Quoting "Buterbaugh, Kevin L" : > Hi Jaime, > > Thank you sooooo much for doing this and reporting back the results! > They?re in line with what I would expect to happen. I was going > to test this as well, but we have had to extend our downtime until > noontime tomorrow, so I haven?t had a chance to do so yet. Now I > don?t have to? ;-) > > Kevin > > On Aug 4, 2016, at 10:59 AM, Jaime Pinto > > wrote: > > Since there were inconsistencies in the responses, I decided to rig > a couple of accounts/groups on our LDAP to test "My interpretation", > and determined that I was wrong. 
When Kevin mentioned it would mean > a bug I had to double-check: > > If a user hits the hard quota or exceeds the grace period on the > soft quota on any of the secondary groups that user will be stopped > from further writing to those groups as well, just as in the primary > group. > > I hope this clears the waters a bit. I still have to solve my puzzle. > > Thanks everyone for the feedback. > Jaime > > > > Quoting "Jaime Pinto" > >: > > Quoting "Buterbaugh, Kevin L" > >: > > Hi Sven, > > Wait - am I misunderstanding something here? Let?s say that I have > ?user1? who has primary group ?group1? and secondary group > ?group2?. And let?s say that they write to a directory where the > bit on the directory forces all files created in that directory to > have group2 associated with them. Are you saying that those files > still count against group1?s group quota??? > > Thanks for clarifying? > > Kevin > > Not really, > > My interpretation is that all files written with group2 will count > towards the quota on that group. However any users with group2 as the > primary group will be prevented from writing any further when the > group2 quota is reached. However the culprit user1 with primary group > as group1 won't be detected by gpfs, and can just keep going on writing > group2 files. > > As far as the individual user quota, it doesn't matter: group1 or > group2 it will be counted towards the usage of that user. > > It would be interesting if the behavior was more as expected. I just > checked with my Lustre counter-parts and they tell me whichever > secondary group is hit first, however many there may be, the user will > be stopped. The problem then becomes identifying which of the secondary > groups hit the limit for that user. > > Jaime > > > > On Aug 3, 2016, at 11:35 AM, Sven Oehme > > > wrote: > > Hi, > > quotas are only counted against primary group > > sven > > > On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > < mailto:pinto at scinet.utoronto.ca>> > wrote: > Suppose I want to set both USR and GRP quotas for a user, however > GRP is not the primary group. Will gpfs enforce the secondary group > quota for that user? > > What I mean is, if the user keeps writing files with secondary > group as the attribute, and that overall group quota is reached, > will that user be stopped by gpfs? > > Thanks > Jaime > > > > > ************************************ > TELL US ABOUT YOUR SUCCESS STORIES > http://www.scinethpc.ca/testimonials > ************************************ > --- > Jaime Pinto > SciNet HPC Consortium - Compute/Calcul Canada > www.scinet.utoronto.ca< http://www.scinet.utoronto.ca/> - > www.computecanada.org< http://www.computecanada.org/> > University of Toronto > 256 McCaul Street, Room 235 > Toronto, ON, M5T1W5 > P: 416-978-2755 > C: 416-505-1477 > > > > ---------------------------------------------------------------- > This message was sent using IMP at SciNet Consortium, University of Toronto. 
> > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > > > ************************************ > TELL US ABOUT YOUR SUCCESS STORIES > http://www.scinethpc.ca/testimonials > ************************************ > --- > Jaime Pinto > SciNet HPC Consortium - Compute/Calcul Canada > www.scinet.utoronto.ca - > www.computecanada.org > University of Toronto > 256 McCaul Street, Room 235 > Toronto, ON, M5T1W5 > P: 416-978-2755 > C: 416-505-1477 > > ---------------------------------------------------------------- > This message was sent using IMP at SciNet Consortium, University of Toronto. > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ? > Kevin Buterbaugh - Senior System Administrator > Vanderbilt University - Advanced Computing Center for Research and Education > Kevin.Buterbaugh at vanderbilt.edu - > (615)875-9633 > > > > ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From Robert.Oesterlin at nuance.com Tue Aug 16 16:59:13 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Tue, 16 Aug 2016 15:59:13 +0000 Subject: [gpfsug-discuss] Attending IBM Edge? Sessions of note and possible meet-up Message-ID: <29EA4D63-8885-42C5-876C-D68EB9E1CFDE@nuance.com> For those of you on the mailing list attending the IBM Edge conference in September, there will be at least one NDA session on Spectrum Scale and its future directions. I've heard that there will be a session on licensing as well. (always a hot topic). I have a couple of talks: Spectrum Scale with Transparent Cloud Tiering and on Spectrum Scale with Spectrum Control. I'll try and organize some sort of informal meetup one of the nights - thoughts on when would be welcome. Probably not Tuesday night, as that's the entertainment night. :-) Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid -------------- next part -------------- An HTML attachment was scrubbed... URL: From jfosburg at mdanderson.org Tue Aug 16 17:13:17 2016 From: jfosburg at mdanderson.org (Fosburgh,Jonathan) Date: Tue, 16 Aug 2016 16:13:17 +0000 Subject: [gpfsug-discuss] Attending IBM Edge? 
Sessions of note and possible meet-up In-Reply-To: <29EA4D63-8885-42C5-876C-D68EB9E1CFDE@nuance.com> References: <29EA4D63-8885-42C5-876C-D68EB9E1CFDE@nuance.com> Message-ID: <57c145ab-4207-7550-af57-ff07d6ac8f2d@mdanderson.org> I am speaking: SNP-2408 : Implementing a Research Storage Environment Using IBM Spectrum Software at MD Anderson Cancer Center Program : Enabling Cognitive IT with Storage and Software Defined Solutions Track : Building Oceans of Data Session Type : Breakout Session Date/Time : Tue, 20-Sep, 05:00 PM-06:00 PM Location : MGM Grand - Room 104 Presenter(s):Jonathan Fosburgh, UT MD Anderson This is primarily dealing with Scale and Archive, and also includes Protect. -- Jonathan Fosburgh Principal Application Systems Analyst Storage Team IT Operations jfosburg at mdanderson.org (713) 745-9346 On 08/16/2016 10:59 AM, Oesterlin, Robert wrote: For those of you on the mailing list attending the IBM Edge conference in September, there will be at least one NDA session on Spectrum Scale and its future directions. I've heard that there will be a session on licensing as well. (always a hot topic). I have a couple of talks: Spectrum Scale with Transparent Cloud Tiering and on Spectrum Scale with Spectrum Control. I'll try and organize some sort of informal meetup one of the nights - thoughts on when would be welcome. Probably not Tuesday night, as that's the entertainment night. :-) Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Tue Aug 16 22:09:35 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Tue, 16 Aug 2016 17:09:35 -0400 Subject: [gpfsug-discuss] mmfsadm test pit In-Reply-To: References: Message-ID: I was surprised to read that Ctrl-C did not really kill restripe. It's supposed to! If it doesn't that's a bug. I ran this by my expert within IBM and he wrote to me: First of all a "PIT job" such as restripe, deldisk, delsnapshot, and such should be easy to stop by ^C the management program that started them. The SG manager daemon holds open a socket to the client program for the purposes of sending command output, progress updates, error messages and the like. The PIT code checks this socket periodically and aborts the PIT process cleanly if the socket is closed. If this cleanup doesn't occur, it is a bug and should be worth reporting. However, there's no exact guarantee on how quickly each thread on the SG mgr will notice and then how quickly the helper nodes can be stopped and so forth. 
The interval between socket checks depends among other things on how long it takes to process each file, if there are a few very large files, the delay can be significant. In the limiting case, where most of the FS storage is contained in a few files, this mechanism doesn't work [elided] well. So it can be quite involved and slow sometimes to wrap up a PIT operation. The simplest way to determine if the command has really stopped is with the mmdiag --commands issued on the SG manager node. This shows running commands with the command line, start time, socket, flags, etc. After ^Cing the client program, the entry here should linger for a while, then go away. When it exits you'll see an entry in the GPFS log file where it fails with err 50. If this doesn't stop the command after a while, it is worth looking into. If the command wasn't issued on the SG mgr node and you can't find the where the client command is running, the socket is still a useful hint. While tedious, it should be possible to trace this socket back to node where that command was originally run using netstat or equivalent. Poking around inside a GPFS internaldump will also provide clues; there should be an outstanding sgmMsgSGClientCmd command listed in the dump tscomm section. Once you find it, just 'kill `pidof mmrestripefs` or similar. I'd like to warn the OP away from mmfsadm test pit. These commands are of course unsupported and unrecommended for any purpose (even internal test and development purposes, as far as I know). You are definitely working without a net there. When I was improving the integration between PIT and snapshot quiesce a few years ago, I looked into this and couldn't figure out how to (easily) make these stop and resume commands safe to use, so as far as I know they remain unsafe. The list command, however, is probably fairly okay; but it would probably be better to use mmfsadm saferdump pit. From: Aaron Knister To: Date: 08/15/2016 10:49 PM Subject: [gpfsug-discuss] mmfsadm test pit Sent by: gpfsug-discuss-bounces at spectrumscale.org I just discovered this interesting gem poking at mmfsadm: test pit fsname list|suspend|status|resume|stop [jobId] There have been times where I've kicked off a restripe and either intentionally or accidentally ctrl-c'd it only to realize that many times it's disappeared into the ether and is still running. The only way I've known so far to stop it is with a chgmgr. A far more painful instance happened when I ran a rebalance on an fs w/more than 31 nsds using more than 31 pit workers and hit *that* fun APAR which locked up access for a single filesystem to all 3.5k nodes. We spent 48 hours round the clock rebooting nodes as jobs drained to clear it up. I would have killed in that instance for a way to cancel the PIT job (the chmgr trick didn't work). It looks like you might actually be able to do this with mmfsadm, although how wise this is, I do not know (kinda curious about that). Here's an example. I kicked off a restripe and then ctrl-c'd it on a client node. Then ran these commands from the fs manager: root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list JobId 785979015170 PitJobStatus PIT_JOB_RUNNING progress 0.00 debug: statusListP D40E2C70 root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal stop 785979015170 debug: statusListP 0 root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list JobId 785979015170 PitJobStatus PIT_JOB_STOPPING progress 4.01 debug: statusListP D4013E70 ... some time passes ... 
root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list debug: statusListP 0 Interesting. -Aaron -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Tue Aug 16 22:55:19 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Tue, 16 Aug 2016 17:55:19 -0400 Subject: [gpfsug-discuss] mmfsadm test pit In-Reply-To: References: Message-ID: Thanks Marc! That's incredibly helpful info. I'll uh, not use the test pit command :) -Aaron On 8/16/16 5:09 PM, Marc A Kaplan wrote: > I was surprised to read that Ctrl-C did not really kill restripe. It's > supposed to! If it doesn't that's a bug. > > I ran this by my expert within IBM and he wrote to me: > > First of all a "PIT job" such as restripe, deldisk, delsnapshot, and > such should be easy to stop by ^C the management program that started > them. The SG manager daemon holds open a socket to the client program > for the purposes of sending command output, progress updates, error > messages and the like. The PIT code checks this socket periodically and > aborts the PIT process cleanly if the socket is closed. If this cleanup > doesn't occur, it is a bug and should be worth reporting. However, > there's no exact guarantee on how quickly each thread on the SG mgr will > notice and then how quickly the helper nodes can be stopped and so > forth. The interval between socket checks depends among other things on > how long it takes to process each file, if there are a few very large > files, the delay can be significant. In the limiting case, where most > of the FS storage is contained in a few files, this mechanism doesn't > work [elided] well. So it can be quite involved and slow sometimes to > wrap up a PIT operation. > > The simplest way to determine if the command has really stopped is with > the mmdiag --commands issued on the SG manager node. This shows running > commands with the command line, start time, socket, flags, etc. After > ^Cing the client program, the entry here should linger for a while, then > go away. When it exits you'll see an entry in the GPFS log file where > it fails with err 50. If this doesn't stop the command after a while, > it is worth looking into. > > If the command wasn't issued on the SG mgr node and you can't find the > where the client command is running, the socket is still a useful hint. > While tedious, it should be possible to trace this socket back to node > where that command was originally run using netstat or equivalent. > Poking around inside a GPFS internaldump will also provide clues; there > should be an outstanding sgmMsgSGClientCmd command listed in the dump > tscomm section. Once you find it, just 'kill `pidof mmrestripefs` or > similar. > > I'd like to warn the OP away from mmfsadm test pit. These commands are > of course unsupported and unrecommended for any purpose (even internal > test and development purposes, as far as I know). You are definitely > working without a net there. When I was improving the integration > between PIT and snapshot quiesce a few years ago, I looked into this and > couldn't figure out how to (easily) make these stop and resume commands > safe to use, so as far as I know they remain unsafe. 
The list command, > however, is probably fairly okay; but it would probably be better to use > mmfsadm saferdump pit. > > > > > > From: Aaron Knister > To: > Date: 08/15/2016 10:49 PM > Subject: [gpfsug-discuss] mmfsadm test pit > Sent by: gpfsug-discuss-bounces at spectrumscale.org > ------------------------------------------------------------------------ > > > > I just discovered this interesting gem poking at mmfsadm: > > test pit fsname list|suspend|status|resume|stop [jobId] > > There have been times where I've kicked off a restripe and either > intentionally or accidentally ctrl-c'd it only to realize that many > times it's disappeared into the ether and is still running. The only way > I've known so far to stop it is with a chgmgr. > > A far more painful instance happened when I ran a rebalance on an fs > w/more than 31 nsds using more than 31 pit workers and hit *that* fun > APAR which locked up access for a single filesystem to all 3.5k nodes. > We spent 48 hours round the clock rebooting nodes as jobs drained to > clear it up. I would have killed in that instance for a way to cancel > the PIT job (the chmgr trick didn't work). It looks like you might > actually be able to do this with mmfsadm, although how wise this is, I > do not know (kinda curious about that). > > Here's an example. I kicked off a restripe and then ctrl-c'd it on a > client node. Then ran these commands from the fs manager: > > root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list > JobId 785979015170 PitJobStatus PIT_JOB_RUNNING progress 0.00 > debug: statusListP D40E2C70 > > root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal stop > 785979015170 > debug: statusListP 0 > > root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list > JobId 785979015170 PitJobStatus PIT_JOB_STOPPING progress 4.01 > debug: statusListP D4013E70 > > ... some time passes ... > > root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list > debug: statusListP 0 > > Interesting. > > -Aaron > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From aaron.s.knister at nasa.gov Wed Aug 17 02:46:39 2016 From: aaron.s.knister at nasa.gov (Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]) Date: Wed, 17 Aug 2016 01:46:39 +0000 Subject: [gpfsug-discuss] Monitor NSD server queue? Message-ID: <5F910253243E6A47B81A9A2EB424BBA101CC6514@NDJSMBX404.ndc.nasa.gov> Hi Everyone, We ran into a rather interesting situation over the past week. We had a job that was pounding the ever loving crap out of one of our filesystems (called dnb02) doing about 15GB/s of reads. We had other jobs experience a slowdown on a different filesystem (called dnb41) that uses entirely separate backend storage. What I can't figure out is why this other filesystem was affected. I've checked IB bandwidth and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth congestion, looked at the mmpmon nsd_ds counters (including disk request wait time), and checked out the disk iowait values from collectl. 
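For reference, the sort of sampling I've been doing looks roughly like this (just a sketch; intervals are arbitrary and paths assume a default install):

echo nsd_ds | /usr/lpp/mmfs/bin/mmpmon -p        # NSD server-side per-disk counters, including request wait times
/usr/lpp/mmfs/bin/mmdiag --iohist | tail -50     # recent I/O history with per-I/O service times
collectl -sD -i 10                               # per-disk utilization and iowait on the NSD servers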
I simply can't account for the slowdown on the other filesystem. The only thing I can think of is the high latency on dnb02's NSDs caused the mmfsd NSD queues to back up. Here's my question-- how can I monitor the state of th NSD queues? I can't find anything in mmdiag. An mmfsadm saferdump NSD shows me the queues and their status. I'm just not sure calling saferdump NSD every 10 seconds to monitor this data is going to end well. I've seen saferdump NSD cause mmfsd to die and that's from a task we only run every 6 hours that calls saferdump NSD. Any thoughts/ideas here would be great. Thanks! -Aaron -------------- next part -------------- An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Wed Aug 17 12:45:04 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Wed, 17 Aug 2016 11:45:04 +0000 Subject: [gpfsug-discuss] Monitor NSD server queue? In-Reply-To: <5F910253243E6A47B81A9A2EB424BBA101CC6514@NDJSMBX404.ndc.nasa.gov> References: <5F910253243E6A47B81A9A2EB424BBA101CC6514@NDJSMBX404.ndc.nasa.gov> Message-ID: <7BFE2D50-9AA9-4A78-A05A-08D5DEB0A2E1@nuance.com> Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latency on the disk subsystem causing a backup in the NSD queues. I was doing what you suggested not to do - "mmfsadm saferdump nsd' and looking at the queues. In my case 'mmfsadm saferdump" would usually work or hang, rather than kill mmfsd. But - the hang usually resulted it a tied up thread in mmfsd, so that's no good either. I wish I had better news - this is the only way I've found to get visibility to these queues. IBM hasn't seen fit to gives us a way to safely look at these. I personally think it's a bug that we can't safely dump these structures, as they give insight as to what's actually going on inside the NSD server. Yuri, Sven - thoughts? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: on behalf of "Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]" Reply-To: gpfsug main discussion list Date: Tuesday, August 16, 2016 at 8:46 PM To: gpfsug main discussion list Subject: [EXTERNAL] [gpfsug-discuss] Monitor NSD server queue? Hi Everyone, We ran into a rather interesting situation over the past week. We had a job that was pounding the ever loving crap out of one of our filesystems (called dnb02) doing about 15GB/s of reads. We had other jobs experience a slowdown on a different filesystem (called dnb41) that uses entirely separate backend storage. What I can't figure out is why this other filesystem was affected. I've checked IB bandwidth and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth congestion, looked at the mmpmon nsd_ds counters (including disk request wait time), and checked out the disk iowait values from collectl. I simply can't account for the slowdown on the other filesystem. The only thing I can think of is the high latency on dnb02's NSDs caused the mmfsd NSD queues to back up. Here's my question-- how can I monitor the state of th NSD queues? I can't find anything in mmdiag. An mmfsadm saferdump NSD shows me the queues and their status. I'm just not sure calling saferdump NSD every 10 seconds to monitor this data is going to end well. I've seen saferdump NSD cause mmfsd to die and that's from a task we only run every 6 hours that calls saferdump NSD. Any thoughts/ideas here would be great. Thanks! -Aaron -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From volobuev at us.ibm.com Wed Aug 17 21:34:57 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Wed, 17 Aug 2016 13:34:57 -0700 Subject: [gpfsug-discuss] Monitor NSD server queue? In-Reply-To: <7BFE2D50-9AA9-4A78-A05A-08D5DEB0A2E1@nuance.com> References: <5F910253243E6A47B81A9A2EB424BBA101CC6514@NDJSMBX404.ndc.nasa.gov> <7BFE2D50-9AA9-4A78-A05A-08D5DEB0A2E1@nuance.com> Message-ID: Unfortunately, at the moment there's no safe mechanism to show the usage statistics for different NSD queues. "mmfsadm saferdump nsd" as implemented doesn't acquire locks when parsing internal data structures. Now, NSD data structures are fairly static, as much things go, so the risk of following a stale pointer and hitting a segfault isn't particularly significant. I don't think I remember ever seeing mmfsd crash with NSD dump code on the stack. That said, this isn't code that's tested and known to be safe for production use. I haven't seen a case myself where an mmfsd thread gets stuck running this dump command, either, but Bob has. If that condition ever reoccurs, I'd be interested in seeing debug data. I agree that there's value in giving a sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics. mmdiag should be enhanced to allow this. That'd be a very reasonable (and doable) RFE. yuri From: "Oesterlin, Robert" To: gpfsug main discussion list , Date: 08/17/2016 04:45 AM Subject: Re: [gpfsug-discuss] Monitor NSD server queue? Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latency on the disk subsystem causing a backup in the NSD queues. I was doing what you suggested not to do - "mmfsadm saferdump nsd' and looking at the queues. In my case 'mmfsadm saferdump" would usually work or hang, rather than kill mmfsd. But - the hang usually resulted it a tied up thread in mmfsd, so that's no good either. I wish I had better news - this is the only way I've found to get visibility to these queues. IBM hasn't seen fit to gives us a way to safely look at these. I personally think it's a bug that we can't safely dump these structures, as they give insight as to what's actually going on inside the NSD server. Yuri, Sven - thoughts? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: on behalf of "Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]" Reply-To: gpfsug main discussion list Date: Tuesday, August 16, 2016 at 8:46 PM To: gpfsug main discussion list Subject: [EXTERNAL] [gpfsug-discuss] Monitor NSD server queue? Hi Everyone, We ran into a rather interesting situation over the past week. We had a job that was pounding the ever loving crap out of one of our filesystems (called dnb02) doing about 15GB/s of reads. We had other jobs experience a slowdown on a different filesystem (called dnb41) that uses entirely separate backend storage. What I can't figure out is why this other filesystem was affected. I've checked IB bandwidth and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth congestion, looked at the mmpmon nsd_ds counters (including disk request wait time), and checked out the disk iowait values from collectl. I simply can't account for the slowdown on the other filesystem. The only thing I can think of is the high latency on dnb02's NSDs caused the mmfsd NSD queues to back up. Here's my question-- how can I monitor the state of th NSD queues? I can't find anything in mmdiag. 
An mmfsadm saferdump NSD shows me the queues and their status. I'm just not sure calling saferdump NSD every 10 seconds to monitor this data is going to end well. I've seen saferdump NSD cause mmfsd to die and that's from a task we only run every 6 hours that calls saferdump NSD. Any thoughts/ideas here would be great. Thanks! -Aaron_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From SAnderson at convergeone.com Wed Aug 17 22:11:25 2016 From: SAnderson at convergeone.com (Shaun Anderson) Date: Wed, 17 Aug 2016 21:11:25 +0000 Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Message-ID: <1471468285737.63407@convergeone.com> ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 [sig] [RH_CertifiedSysAdmin_CMYK] [Linux on IBM Power Systems - Sales 2016] [IBM Spectrum Storage - Sales 2016] NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.png Type: image/png Size: 14134 bytes Desc: image001.png URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image003.jpg Type: image/jpeg Size: 2593 bytes Desc: image003.jpg URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image005.png Type: image/png Size: 11635 bytes Desc: image005.png URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image007.png Type: image/png Size: 11505 bytes Desc: image007.png URL: From YARD at il.ibm.com Thu Aug 18 00:11:52 2016 From: YARD at il.ibm.com (Yaron Daniel) Date: Thu, 18 Aug 2016 02:11:52 +0300 Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive In-Reply-To: <1471468285737.63407@convergeone.com> References: <1471468285737.63407@convergeone.com> Message-ID: Hi Do u use CES protocols nodes ? Or Samba on each of the Server ? 
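If CES protocol nodes are already configured on the 4.2 side, something like this (just a sketch; output depends on which services you enabled) will show what is running and how file authentication / ID mapping is currently set up there:

mmces node list
mmces service list -a
mmuserauth service list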
Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services - Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: "gpfsug-discuss at spectrumscale.org" Date: 08/18/2016 12:11 AM Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 1851 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 14134 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 2593 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 11635 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 11505 bytes Desc: not available URL: From SAnderson at convergeone.com Thu Aug 18 02:51:38 2016 From: SAnderson at convergeone.com (Shaun Anderson) Date: Thu, 18 Aug 2016 01:51:38 +0000 Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive In-Reply-To: References: <1471468285737.63407@convergeone.com>, Message-ID: <1471485097896.49269@convergeone.com> ?We are currently running samba on the 3.5 node, but wanting to migrate everything into using CES once we get everything up to 4.2. SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Yaron Daniel Sent: Wednesday, August 17, 2016 5:11 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Hi Do u use CES protocols nodes ? Or Samba on each of the Server ? 
Regards ________________________________ Yaron Daniel 94 Em Ha'Moshavot Rd [cid:_1_0DDE2A700DDE24DC007F6D32C2258012] Server, Storage and Data Services- Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: "gpfsug-discuss at spectrumscale.org" Date: 08/18/2016 12:11 AM Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O208.577.2112 M214.263.7014 [sig] [RH_CertifiedSysAdmin_CMYK] [Linux on IBM Power Systems - Sales 2016] [IBM Spectrum Storage - Sales 2016] NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT00001.gif Type: image/gif Size: 1851 bytes Desc: ATT00001.gif URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT00002.png Type: image/png Size: 14134 bytes Desc: ATT00002.png URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT00003.jpg Type: image/jpeg Size: 2593 bytes Desc: ATT00003.jpg URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT00004.png Type: image/png Size: 11635 bytes Desc: ATT00004.png URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT00005.png Type: image/png Size: 11505 bytes Desc: ATT00005.png URL: From YARD at il.ibm.com Thu Aug 18 04:56:50 2016 From: YARD at il.ibm.com (Yaron Daniel) Date: Thu, 18 Aug 2016 06:56:50 +0300 Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive In-Reply-To: <1471485097896.49269@convergeone.com> References: <1471468285737.63407@convergeone.com>, <1471485097896.49269@convergeone.com> Message-ID: So - the procedure you are asking related to Samba. 
Please check at redhat Site the process of upgrade Samba - u will need to backup the tdb files and restore them. But pay attention that the Samba ids will remain the same after moving to CES - please review the Authentication Section. Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services - Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: gpfsug main discussion list Date: 08/18/2016 04:52 AM Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?We are currently running samba on the 3.5 node, but wanting to migrate everything into using CES once we get everything up to 4.2. SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Yaron Daniel Sent: Wednesday, August 17, 2016 5:11 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Hi Do u use CES protocols nodes ? Or Samba on each of the Server ? Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services- Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: "gpfsug-discuss at spectrumscale.org" Date: 08/18/2016 12:11 AM Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O208.577.2112 M214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: not available Type: image/gif Size: 1851 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 1851 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 14134 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 2593 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 11635 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 11505 bytes Desc: not available URL: From Robert.Oesterlin at nuance.com Thu Aug 18 15:47:25 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Thu, 18 Aug 2016 14:47:25 +0000 Subject: [gpfsug-discuss] Monitor NSD server queue? Message-ID: <2702740E-EC6A-4998-BA1A-35A1EF5B5EDC@nuance.com> Done. Notification generated at: 18 Aug 2016, 10:46 AM Eastern Time (ET) ID: 93260 Headline: Give sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics Submitted on: 18 Aug 2016, 10:46 AM Eastern Time (ET) Brand: Servers and Systems Software Product: Spectrum Scale (formerly known as GPFS) - Public RFEs Link: http://www.ibm.com/developerworks/rfe/execute?use_case=viewRfe&CR_ID=93260 Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid 507-269-0413 From: on behalf of Yuri L Volobuev Reply-To: gpfsug main discussion list Date: Wednesday, August 17, 2016 at 3:34 PM To: gpfsug main discussion list Subject: [EXTERNAL] Re: [gpfsug-discuss] Monitor NSD server queue? Unfortunately, at the moment there's no safe mechanism to show the usage statistics for different NSD queues. "mmfsadm saferdump nsd" as implemented doesn't acquire locks when parsing internal data structures. Now, NSD data structures are fairly static, as much things go, so the risk of following a stale pointer and hitting a segfault isn't particularly significant. I don't think I remember ever seeing mmfsd crash with NSD dump code on the stack. That said, this isn't code that's tested and known to be safe for production use. I haven't seen a case myself where an mmfsd thread gets stuck running this dump command, either, but Bob has. If that condition ever reoccurs, I'd be interested in seeing debug data. I agree that there's value in giving a sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics. mmdiag should be enhanced to allow this. That'd be a very reasonable (and doable) RFE. yuri [nactive hide details for "Oesterlin, Robert" ---08/17/2016 04:45:30 AM---]"Oesterlin, Robert" ---08/17/2016 04:45:30 AM---Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latenc From: "Oesterlin, Robert" To: gpfsug main discussion list , Date: 08/17/2016 04:45 AM Subject: Re: [gpfsug-discuss] Monitor NSD server queue? Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latency on the disk subsystem causing a backup in the NSD queues. I was doing what you suggested not to do - "mmfsadm saferdump nsd' and looking at the queues. 
In my case 'mmfsadm saferdump" would usually work or hang, rather than kill mmfsd. But - the hang usually resulted it a tied up thread in mmfsd, so that's no good either. I wish I had better news - this is the only way I've found to get visibility to these queues. IBM hasn't seen fit to gives us a way to safely look at these. I personally think it's a bug that we can't safely dump these structures, as they give insight as to what's actually going on inside the NSD server. Yuri, Sven - thoughts? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: on behalf of "Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]" Reply-To: gpfsug main discussion list Date: Tuesday, August 16, 2016 at 8:46 PM To: gpfsug main discussion list Subject: [EXTERNAL] [gpfsug-discuss] Monitor NSD server queue? Hi Everyone, We ran into a rather interesting situation over the past week. We had a job that was pounding the ever loving crap out of one of our filesystems (called dnb02) doing about 15GB/s of reads. We had other jobs experience a slowdown on a different filesystem (called dnb41) that uses entirely separate backend storage. What I can't figure out is why this other filesystem was affected. I've checked IB bandwidth and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth congestion, looked at the mmpmon nsd_ds counters (including disk request wait time), and checked out the disk iowait values from collectl. I simply can't account for the slowdown on the other filesystem. The only thing I can think of is the high latency on dnb02's NSDs caused the mmfsd NSD queues to back up. Here's my question-- how can I monitor the state of th NSD queues? I can't find anything in mmdiag. An mmfsadm saferdump NSD shows me the queues and their status. I'm just not sure calling saferdump NSD every 10 seconds to monitor this data is going to end well. I've seen saferdump NSD cause mmfsd to die and that's from a task we only run every 6 hours that calls saferdump NSD. Any thoughts/ideas here would be great. Thanks! -Aaron_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.gif Type: image/gif Size: 106 bytes Desc: image001.gif URL: From bbanister at jumptrading.com Thu Aug 18 16:00:21 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Thu, 18 Aug 2016 15:00:21 +0000 Subject: [gpfsug-discuss] Monitor NSD server queue? In-Reply-To: <2702740E-EC6A-4998-BA1A-35A1EF5B5EDC@nuance.com> References: <2702740E-EC6A-4998-BA1A-35A1EF5B5EDC@nuance.com> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB062FC26E@CHI-EXCHANGEW1.w2k.jumptrading.com> Great stuff? I added my vote, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Oesterlin, Robert Sent: Thursday, August 18, 2016 9:47 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Monitor NSD server queue? Done. 
Notification generated at: 18 Aug 2016, 10:46 AM Eastern Time (ET) ID: 93260 Headline: Give sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics Submitted on: 18 Aug 2016, 10:46 AM Eastern Time (ET) Brand: Servers and Systems Software Product: Spectrum Scale (formerly known as GPFS) - Public RFEs Link: http://www.ibm.com/developerworks/rfe/execute?use_case=viewRfe&CR_ID=93260 Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid 507-269-0413 From: > on behalf of Yuri L Volobuev > Reply-To: gpfsug main discussion list > Date: Wednesday, August 17, 2016 at 3:34 PM To: gpfsug main discussion list > Subject: [EXTERNAL] Re: [gpfsug-discuss] Monitor NSD server queue? Unfortunately, at the moment there's no safe mechanism to show the usage statistics for different NSD queues. "mmfsadm saferdump nsd" as implemented doesn't acquire locks when parsing internal data structures. Now, NSD data structures are fairly static, as much things go, so the risk of following a stale pointer and hitting a segfault isn't particularly significant. I don't think I remember ever seeing mmfsd crash with NSD dump code on the stack. That said, this isn't code that's tested and known to be safe for production use. I haven't seen a case myself where an mmfsd thread gets stuck running this dump command, either, but Bob has. If that condition ever reoccurs, I'd be interested in seeing debug data. I agree that there's value in giving a sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics. mmdiag should be enhanced to allow this. That'd be a very reasonable (and doable) RFE. yuri [nactive hide details for "Oesterlin, Robert" ---08/17/2016 04:45:30 AM---]"Oesterlin, Robert" ---08/17/2016 04:45:30 AM---Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latenc From: "Oesterlin, Robert" > To: gpfsug main discussion list >, Date: 08/17/2016 04:45 AM Subject: Re: [gpfsug-discuss] Monitor NSD server queue? Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latency on the disk subsystem causing a backup in the NSD queues. I was doing what you suggested not to do - "mmfsadm saferdump nsd' and looking at the queues. In my case 'mmfsadm saferdump" would usually work or hang, rather than kill mmfsd. But - the hang usually resulted it a tied up thread in mmfsd, so that's no good either. I wish I had better news - this is the only way I've found to get visibility to these queues. IBM hasn't seen fit to gives us a way to safely look at these. I personally think it's a bug that we can't safely dump these structures, as they give insight as to what's actually going on inside the NSD server. Yuri, Sven - thoughts? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]" > Reply-To: gpfsug main discussion list > Date: Tuesday, August 16, 2016 at 8:46 PM To: gpfsug main discussion list > Subject: [EXTERNAL] [gpfsug-discuss] Monitor NSD server queue? Hi Everyone, We ran into a rather interesting situation over the past week. We had a job that was pounding the ever loving crap out of one of our filesystems (called dnb02) doing about 15GB/s of reads. We had other jobs experience a slowdown on a different filesystem (called dnb41) that uses entirely separate backend storage. 
What I can't figure out is why this other filesystem was affected. I've checked IB bandwidth and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth congestion, looked at the mmpmon nsd_ds counters (including disk request wait time), and checked out the disk iowait values from collectl. I simply can't account for the slowdown on the other filesystem. The only thing I can think of is the high latency on dnb02's NSDs caused the mmfsd NSD queues to back up. Here's my question-- how can I monitor the state of th NSD queues? I can't find anything in mmdiag. An mmfsadm saferdump NSD shows me the queues and their status. I'm just not sure calling saferdump NSD every 10 seconds to monitor this data is going to end well. I've seen saferdump NSD cause mmfsd to die and that's from a task we only run every 6 hours that calls saferdump NSD. Any thoughts/ideas here would be great. Thanks! -Aaron_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.gif Type: image/gif Size: 106 bytes Desc: image001.gif URL: From mimarsh2 at vt.edu Thu Aug 18 16:15:38 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Thu, 18 Aug 2016 11:15:38 -0400 Subject: [gpfsug-discuss] NSD Server BIOS setting - snoop mode Message-ID: All, Is there any best practice or recommendation for the Snoop Mode memory setting for NSD Servers? Default is Early Snoop. On compute nodes, I am using Cluster On Die, which creates 2 NUMA nodes per processor. This setup has 2 x 16-core Broadwell processors in each NSD server. Brian -------------- next part -------------- An HTML attachment was scrubbed... URL: From gmcpheeters at anl.gov Thu Aug 18 16:14:11 2016 From: gmcpheeters at anl.gov (McPheeters, Gordon) Date: Thu, 18 Aug 2016 15:14:11 +0000 Subject: [gpfsug-discuss] Monitor NSD server queue? In-Reply-To: <21BC488F0AEA2245B2C3E83FC0B33DBB062FC26E@CHI-EXCHANGEW1.w2k.jumptrading.com> References: <2702740E-EC6A-4998-BA1A-35A1EF5B5EDC@nuance.com> <21BC488F0AEA2245B2C3E83FC0B33DBB062FC26E@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: <97F08A04-D7C4-4985-840F-DC026E8606F4@anl.gov> Got my vote - thanks Robert. Gordon McPheeters ALCF Storage (630) 252-6430 gmcpheeters at anl.gov On Aug 18, 2016, at 10:00 AM, Bryan Banister > wrote: Great stuff? 
I added my vote, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Oesterlin, Robert Sent: Thursday, August 18, 2016 9:47 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Monitor NSD server queue? Done. Notification generated at: 18 Aug 2016, 10:46 AM Eastern Time (ET) ID: 93260 Headline: Give sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics Submitted on: 18 Aug 2016, 10:46 AM Eastern Time (ET) Brand: Servers and Systems Software Product: Spectrum Scale (formerly known as GPFS) - Public RFEs Link: http://www.ibm.com/developerworks/rfe/execute?use_case=viewRfe&CR_ID=93260 Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid 507-269-0413 From: > on behalf of Yuri L Volobuev > Reply-To: gpfsug main discussion list > Date: Wednesday, August 17, 2016 at 3:34 PM To: gpfsug main discussion list > Subject: [EXTERNAL] Re: [gpfsug-discuss] Monitor NSD server queue? Unfortunately, at the moment there's no safe mechanism to show the usage statistics for different NSD queues. "mmfsadm saferdump nsd" as implemented doesn't acquire locks when parsing internal data structures. Now, NSD data structures are fairly static, as much things go, so the risk of following a stale pointer and hitting a segfault isn't particularly significant. I don't think I remember ever seeing mmfsd crash with NSD dump code on the stack. That said, this isn't code that's tested and known to be safe for production use. I haven't seen a case myself where an mmfsd thread gets stuck running this dump command, either, but Bob has. If that condition ever reoccurs, I'd be interested in seeing debug data. I agree that there's value in giving a sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics. mmdiag should be enhanced to allow this. That'd be a very reasonable (and doable) RFE. yuri "Oesterlin, Robert" ---08/17/2016 04:45:30 AM---Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latenc From: "Oesterlin, Robert" > To: gpfsug main discussion list >, Date: 08/17/2016 04:45 AM Subject: Re: [gpfsug-discuss] Monitor NSD server queue? Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latency on the disk subsystem causing a backup in the NSD queues. I was doing what you suggested not to do - "mmfsadm saferdump nsd' and looking at the queues. In my case 'mmfsadm saferdump" would usually work or hang, rather than kill mmfsd. But - the hang usually resulted it a tied up thread in mmfsd, so that's no good either. I wish I had better news - this is the only way I've found to get visibility to these queues. IBM hasn't seen fit to gives us a way to safely look at these. I personally think it's a bug that we can't safely dump these structures, as they give insight as to what's actually going on inside the NSD server. Yuri, Sven - thoughts? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]" > Reply-To: gpfsug main discussion list > Date: Tuesday, August 16, 2016 at 8:46 PM To: gpfsug main discussion list > Subject: [EXTERNAL] [gpfsug-discuss] Monitor NSD server queue? Hi Everyone, We ran into a rather interesting situation over the past week. 
We had a job that was pounding the ever loving crap out of one of our filesystems (called dnb02) doing about 15GB/s of reads. We had other jobs experience a slowdown on a different filesystem (called dnb41) that uses entirely separate backend storage. What I can't figure out is why this other filesystem was affected. I've checked IB bandwidth and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth congestion, looked at the mmpmon nsd_ds counters (including disk request wait time), and checked out the disk iowait values from collectl. I simply can't account for the slowdown on the other filesystem. The only thing I can think of is the high latency on dnb02's NSDs caused the mmfsd NSD queues to back up. Here's my question-- how can I monitor the state of th NSD queues? I can't find anything in mmdiag. An mmfsadm saferdump NSD shows me the queues and their status. I'm just not sure calling saferdump NSD every 10 seconds to monitor this data is going to end well. I've seen saferdump NSD cause mmfsd to die and that's from a task we only run every 6 hours that calls saferdump NSD. Any thoughts/ideas here would be great. Thanks! -Aaron_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From christof.schmitt at us.ibm.com Thu Aug 18 18:50:12 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Thu, 18 Aug 2016 10:50:12 -0700 Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive In-Reply-To: <1471485097896.49269@convergeone.com> References: <1471468285737.63407@convergeone.com>, <1471485097896.49269@convergeone.com> Message-ID: Samba as supported in Spectrum Scale uses the "autorid" module for creating internal id mappings (see man idmap_autorid for some details). Officially supported are also methods to retrieve id mappings from an external server: https://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_adfofile.htm The earlier email states that they have a " .tdb backend for id mapping on their current server. ". How exactly is that configured in Samba? Which Samba version is used here? So the plan is to upgrade the cluster, and then switch to the Samba version provided with CES? Should the same id mappings be used? 
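For anyone gathering that information on an existing Samba 3.x node ahead of a migration, a rough sketch of the commands involved -- the grep pattern is only illustrative, and exact flags can vary between Samba builds:

smbd -V                                   # which Samba version is actually running
testparm -s 2>/dev/null | grep -i idmap   # the idmap configuration as Samba itself parses it
net idmap dump > idmap-dump.txt           # export the existing id mappings, as used later in this thread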
Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Shaun Anderson To: gpfsug main discussion list Date: 08/17/2016 06:52 PM Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?We are currently running samba on the 3.5 node, but wanting to migrate everything into using CES once we get everything up to 4.2. SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Yaron Daniel Sent: Wednesday, August 17, 2016 5:11 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Hi Do u use CES protocols nodes ? Or Samba on each of the Server ? Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services- Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: "gpfsug-discuss at spectrumscale.org" Date: 08/18/2016 12:11 AM Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O208.577.2112 M214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From SAnderson at convergeone.com Thu Aug 18 19:11:02 2016 From: SAnderson at convergeone.com (Shaun Anderson) Date: Thu, 18 Aug 2016 18:11:02 +0000 Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive In-Reply-To: References: <1471468285737.63407@convergeone.com>, <1471485097896.49269@convergeone.com> Message-ID: Correct. We are upgrading their existing configuration and want to switch to CES provided Samba. They are using Samba 3.6.24 currently on RHEL 6.6. 
Here is the head of the smb.conf file: =================================================== [global] workgroup = SL1 netbios name = SLTLTFSEE server string = LTFSEE Server realm = removed.ORG security = ads encrypt passwords = yes default = global browseable = no socket options = TCP_NODELAY SO_KEEPALIVE TCP_KEEPCNT=4 TCP_KEEPIDLE=240 TCP_KEEPINTVL=15 idmap config * : backend = tdb idmap config * : range = 1000000-9000000 template shell = /bash/bin writable = yes allow trusted domains = yes client ntlmv2 auth = yes auth methods = guest sam winbind passdb backend = tdbsam groupdb:backend = tdb interfaces = eth1 lo username map = /etc/samba/smbusers map to guest = bad uid guest account = nobody ===================================================== Does that make sense? Regards, SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Christof Schmitt Sent: Thursday, August 18, 2016 11:50 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive Samba as supported in Spectrum Scale uses the "autorid" module for creating internal id mappings (see man idmap_autorid for some details). Officially supported are also methods to retrieve id mappings from an external server: https://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_adfofile.htm The earlier email states that they have a " .tdb backend for id mapping on their current server. ". How exactly is that configured in Samba? Which Samba version is used here? So the plan is to upgrade the cluster, and then switch to the Samba version provided with CES? Should the same id mappings be used? Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Shaun Anderson To: gpfsug main discussion list Date: 08/17/2016 06:52 PM Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?We are currently running samba on the 3.5 node, but wanting to migrate everything into using CES once we get everything up to 4.2. SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Yaron Daniel Sent: Wednesday, August 17, 2016 5:11 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Hi Do u use CES protocols nodes ? Or Samba on each of the Server ? Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services- Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: "gpfsug-discuss at spectrumscale.org" Date: 08/18/2016 12:11 AM Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. 
I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O208.577.2112 M214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. From Kevin.Buterbaugh at Vanderbilt.Edu Thu Aug 18 20:05:03 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Thu, 18 Aug 2016 19:05:03 +0000 Subject: [gpfsug-discuss] Please ignore - debugging an issue Message-ID: Please ignore. I am working with the list admins on an issue and need to send an e-mail to the list to duplicate the problem. I apologize that this necessitates this e-mail to the list. Thanks? ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From christof.schmitt at us.ibm.com Thu Aug 18 20:43:50 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Thu, 18 Aug 2016 12:43:50 -0700 Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive In-Reply-To: References: <1471468285737.63407@convergeone.com>, <1471485097896.49269@convergeone.com> Message-ID: There are a few points to consider here: CES uses Samba in cluster mode with ctdb. That means that the tdb database is shared through ctdb on all protocol nodes, and the internal format is slightly different since it contains additional information for tracking the cross-node status of the individual records. Spectrum Scale officially supports the autorid module for internal id mapping. That approach is different than the older idmap_tdb since it basically only has one record per AD domain, and not one record per user or group. This is known to scale better in environments where many users and groups require id mappings. The downside is that data from idmap_tdb cannot be directly imported. 
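One way to gauge the practical impact before committing to either approach is to compare a few of the old tdb mappings against what the new winbind configuration computes for the same accounts. A sketch, assuming the CES-shipped tools live under /usr/lpp/mmfs/bin and using placeholder names:

/usr/lpp/mmfs/bin/wbinfo --name-to-sid 'DOMAIN\someuser'    # resolve the account to its SID
/usr/lpp/mmfs/bin/wbinfo --sid-to-uid S-1-5-21-placeholder  # the UID the new mapping assigns to that SID
/usr/lpp/mmfs/bin/wbinfo -i 'DOMAIN\someuser'               # full passwd-style entry, including primary group

If the computed UIDs differ from the ones already stored on disk, either a chown pass or one of the migration options below may be needed.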
While not officially supported Spectrum Scale also ships the idmap_tdb module. You could configure authentication and internal id mapping on Spectrum Scale, and then overwrite the config manually to use the old idmap module (the idmap-range-size is required, but not relevant later on): mmuserauth service create ... --idmap-range 1000000-9000000 --idmap-range-size 100000 /usr/lpp/mmfs/bin/net conf setparm global 'idmap config * : backend' tdb mmdsh -N CesNodes systemctl restart gpfs-winbind mmdsh -N CesNodes /usr/lpp/mmfs/bin/net cache flush With the old Samba, export the idmap data to a file: net idmap dump > idmap-dump.txt And on a node running CES Samba import that data, and remove any old cached entries: /usr/lpp/mmfs/bin/net idmap restore idmap-dump.txt mmdsh -N CesNodes /usr/lpp/mmfs/bin/net cache flush Just to be clear: This is untested and if there is a problem with the id mapping in that configuration, it will likely be pointed to the unsupported configuration. The way to request this as an official feature would be through a RFE, although i cannot say whether that would be picked up by product management. Another option would be creating the id mappings in the Active Directory records or in a external LDAP server based on the old mappings, and point the CES Samba to that data. That would again be a supported configuration. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Shaun Anderson To: Christof Schmitt/Tucson/IBM at IBMUS Cc: gpfsug main discussion list Date: 08/18/2016 11:11 AM Subject: RE: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive Correct. We are upgrading their existing configuration and want to switch to CES provided Samba. They are using Samba 3.6.24 currently on RHEL 6.6. Here is the head of the smb.conf file: =================================================== [global] workgroup = SL1 netbios name = SLTLTFSEE server string = LTFSEE Server realm = removed.ORG security = ads encrypt passwords = yes default = global browseable = no socket options = TCP_NODELAY SO_KEEPALIVE TCP_KEEPCNT=4 TCP_KEEPIDLE=240 TCP_KEEPINTVL=15 idmap config * : backend = tdb idmap config * : range = 1000000-9000000 template shell = /bash/bin writable = yes allow trusted domains = yes client ntlmv2 auth = yes auth methods = guest sam winbind passdb backend = tdbsam groupdb:backend = tdb interfaces = eth1 lo username map = /etc/samba/smbusers map to guest = bad uid guest account = nobody ===================================================== Does that make sense? Regards, SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [ mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Christof Schmitt Sent: Thursday, August 18, 2016 11:50 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive Samba as supported in Spectrum Scale uses the "autorid" module for creating internal id mappings (see man idmap_autorid for some details). Officially supported are also methods to retrieve id mappings from an external server: https://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_adfofile.htm The earlier email states that they have a " .tdb backend for id mapping on their current server. ". How exactly is that configured in Samba? Which Samba version is used here? 
So the plan is to upgrade the cluster, and then switch to the Samba version provided with CES? Should the same id mappings be used? Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Shaun Anderson To: gpfsug main discussion list Date: 08/17/2016 06:52 PM Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?We are currently running samba on the 3.5 node, but wanting to migrate everything into using CES once we get everything up to 4.2. SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Yaron Daniel Sent: Wednesday, August 17, 2016 5:11 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Hi Do u use CES protocols nodes ? Or Samba on each of the Server ? Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services- Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: "gpfsug-discuss at spectrumscale.org" Date: 08/18/2016 12:11 AM Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O208.577.2112 M214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. 
If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. From jez.tucker at gpfsug.org Thu Aug 18 20:57:00 2016 From: jez.tucker at gpfsug.org (Jez Tucker) Date: Thu, 18 Aug 2016 20:57:00 +0100 Subject: [gpfsug-discuss] If you are experiencing mail stuck in spam / bounces Message-ID: Hi all As the discussion group is a mailing list, it is possible that members can experience the list traffic being interpreted as spam. In such instances, you may experience better results if you whitelist the mailing list addresses or create a 'Not Spam' filter (E.G. gmail) gpfsug-discuss at spectrumscale.org gpfsug-discuss at gpfsug.org You can test that you can receive a response from the mailing list server by sending an email to: gpfsug-discuss-request at spectrumscale.org with the subject of: help Should you experience further trouble, please ping us at: gpfsug-discuss-owner at spectrumscale.org All the best, Jez From aaron.s.knister at nasa.gov Fri Aug 19 05:12:26 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Fri, 19 Aug 2016 00:12:26 -0400 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? In-Reply-To: <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu> References: <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu> Message-ID: <140fab1a-e043-5c20-eb1f-d5ef7e91d89d@nasa.gov> Figured I'd throw in my "me too!" as well. We have ~3500 nodes and 60 gpfs server nodes and we've done several rounds of rolling upgrades starting with 3.5.0.19 -> 3.5.0.24. We've had the cluster with a mix of both versions for quite some time (We're actually in that state right now as it would happen and have been for several months). I've not seen any issue with it. Of course, as Richard alluded to, its good to check the release notes :) -Aaron On 8/15/16 8:45 AM, Buterbaugh, Kevin L wrote: > Richard, > > I will second what Bob said with one caveat ? on one occasion we had an > issue with our multi-cluster setup because the PTF?s were incompatible. > However, that was clearly documented in the release notes, which we > obviously hadn?t read carefully enough. > > While we generally do rolling upgrades over a two to three week period, > we have run for months with clients at differing PTF levels. HTHAL? > > Kevin > >> On Aug 15, 2016, at 6:22 AM, Oesterlin, Robert >> > wrote: >> >> In general, yes, it's common practice to do the 'rolling upgrades'. If >> I had to do my whole cluster at once, with an outage, I'd probably >> never upgrade. :) >> >> >> Bob Oesterlin >> Sr Storage Engineer, Nuance HPC Grid >> >> >> *From: *> > on behalf of >> "Sobey, Richard A" > > >> *Reply-To: *gpfsug main discussion list >> > > >> *Date: *Monday, August 15, 2016 at 4:59 AM >> *To: *"'gpfsug-discuss at spectrumscale.org >> '" >> > > >> *Subject: *[EXTERNAL] [gpfsug-discuss] Minor GPFS versions coexistence >> problems? >> >> Hi all, >> >> If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to >> 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger >> it over a few days, perhaps up to 2 weeks or will I run into problems >> if they?re on different versions? >> >> Cheers >> >> Richard >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ? 
> Kevin Buterbaugh - Senior System Administrator > Vanderbilt University - Advanced Computing Center for Research and Education > Kevin.Buterbaugh at vanderbilt.edu > - (615)875-9633 > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From aaron.s.knister at nasa.gov Fri Aug 19 05:13:06 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Fri, 19 Aug 2016 00:13:06 -0400 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? In-Reply-To: <140fab1a-e043-5c20-eb1f-d5ef7e91d89d@nasa.gov> References: <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu> <140fab1a-e043-5c20-eb1f-d5ef7e91d89d@nasa.gov> Message-ID: <70e33e6d-cd6b-5a5e-1e2d-f0ad16def5f4@nasa.gov> Oops... I meant Kevin, not Richard. On 8/19/16 12:12 AM, Aaron Knister wrote: > Figured I'd throw in my "me too!" as well. We have ~3500 nodes and 60 > gpfs server nodes and we've done several rounds of rolling upgrades > starting with 3.5.0.19 -> 3.5.0.24. We've had the cluster with a mix of > both versions for quite some time (We're actually in that state right > now as it would happen and have been for several months). I've not seen > any issue with it. Of course, as Richard alluded to, its good to check > the release notes :) > > -Aaron > > On 8/15/16 8:45 AM, Buterbaugh, Kevin L wrote: >> Richard, >> >> I will second what Bob said with one caveat ? on one occasion we had an >> issue with our multi-cluster setup because the PTF?s were incompatible. >> However, that was clearly documented in the release notes, which we >> obviously hadn?t read carefully enough. >> >> While we generally do rolling upgrades over a two to three week period, >> we have run for months with clients at differing PTF levels. HTHAL? >> >> Kevin >> >>> On Aug 15, 2016, at 6:22 AM, Oesterlin, Robert >>> > >>> wrote: >>> >>> In general, yes, it's common practice to do the 'rolling upgrades'. If >>> I had to do my whole cluster at once, with an outage, I'd probably >>> never upgrade. :) >>> >>> >>> Bob Oesterlin >>> Sr Storage Engineer, Nuance HPC Grid >>> >>> >>> *From: *>> > on behalf of >>> "Sobey, Richard A" >> > >>> *Reply-To: *gpfsug main discussion list >>> >> > >>> *Date: *Monday, August 15, 2016 at 4:59 AM >>> *To: *"'gpfsug-discuss at spectrumscale.org >>> '" >>> >> > >>> *Subject: *[EXTERNAL] [gpfsug-discuss] Minor GPFS versions coexistence >>> problems? >>> >>> Hi all, >>> >>> If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to >>> 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger >>> it over a few days, perhaps up to 2 weeks or will I run into problems >>> if they?re on different versions? >>> >>> Cheers >>> >>> Richard >>> _______________________________________________ >>> gpfsug-discuss mailing list >>> gpfsug-discuss at spectrumscale.org >>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> ? 
>> Kevin Buterbaugh - Senior System Administrator >> Vanderbilt University - Advanced Computing Center for Research and >> Education >> Kevin.Buterbaugh at vanderbilt.edu >> - (615)875-9633 >> >> >> >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From bdeluca at gmail.com Fri Aug 19 05:15:00 2016 From: bdeluca at gmail.com (Ben De Luca) Date: Fri, 19 Aug 2016 07:15:00 +0300 Subject: [gpfsug-discuss] If you are experiencing mail stuck in spam / bounces In-Reply-To: References: Message-ID: Hey Jez, Its because the mailing list doesn't have an SPF record in your DNS, being neutral is a good way to be picked up as spam. On 18 August 2016 at 22:57, Jez Tucker wrote: > Hi all > > As the discussion group is a mailing list, it is possible that members can > experience the list traffic being interpreted as spam. > > > In such instances, you may experience better results if you whitelist the > mailing list addresses or create a 'Not Spam' filter (E.G. gmail) > > gpfsug-discuss at spectrumscale.org > > gpfsug-discuss at gpfsug.org > > > You can test that you can receive a response from the mailing list server by > sending an email to: gpfsug-discuss-request at spectrumscale.org with the > subject of: help > > > Should you experience further trouble, please ping us at: > gpfsug-discuss-owner at spectrumscale.org > > > All the best, > > > Jez > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss From jez.tucker at gpfsug.org Fri Aug 19 08:51:20 2016 From: jez.tucker at gpfsug.org (Jez Tucker) Date: Fri, 19 Aug 2016 08:51:20 +0100 Subject: [gpfsug-discuss] If you are experiencing mail stuck in spam / bounces In-Reply-To: References: Message-ID: <0c9d81b2-ac41-b6a5-e4f1-a816558711b7@gpfsug.org> Hi Yes, we looked at that some time ago and I recall we had an issues with setting up the SPF. However, probably a good time as any to look at it again. I'll ping Arif and Simon and they can look at their respective domains. Jez On 19/08/16 05:15, Ben De Luca wrote: > Hey Jez, > Its because the mailing list doesn't have an SPF record in your > DNS, being neutral is a good way to be picked up as spam. > > > > On 18 August 2016 at 22:57, Jez Tucker wrote: >> Hi all >> >> As the discussion group is a mailing list, it is possible that members can >> experience the list traffic being interpreted as spam. >> >> >> In such instances, you may experience better results if you whitelist the >> mailing list addresses or create a 'Not Spam' filter (E.G. 
gmail) >> >> gpfsug-discuss at spectrumscale.org >> >> gpfsug-discuss at gpfsug.org >> >> >> You can test that you can receive a response from the mailing list server by >> sending an email to: gpfsug-discuss-request at spectrumscale.org with the >> subject of: help >> >> >> Should you experience further trouble, please ping us at: >> gpfsug-discuss-owner at spectrumscale.org >> >> >> All the best, >> >> >> Jez >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss From aaron.s.knister at nasa.gov Fri Aug 19 23:06:57 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Fri, 19 Aug 2016 18:06:57 -0400 Subject: [gpfsug-discuss] Monitor NSD server queue? In-Reply-To: <97F08A04-D7C4-4985-840F-DC026E8606F4@anl.gov> References: <2702740E-EC6A-4998-BA1A-35A1EF5B5EDC@nuance.com> <21BC488F0AEA2245B2C3E83FC0B33DBB062FC26E@CHI-EXCHANGEW1.w2k.jumptrading.com> <97F08A04-D7C4-4985-840F-DC026E8606F4@anl.gov> Message-ID: <5ca238de-bb95-2854-68bd-36d1b8df2810@nasa.gov> Thanks everyone! I also have a PMR open for this, so hopefully the RFE gets some traction. On 8/18/16 11:14 AM, McPheeters, Gordon wrote: > Got my vote - thanks Robert. > > > Gordon McPheeters > ALCF Storage > (630) 252-6430 > gmcpheeters at anl.gov > > > >> On Aug 18, 2016, at 10:00 AM, Bryan Banister >> > wrote: >> >> Great stuff? I added my vote, >> -Bryan >> >> *From:* gpfsug-discuss-bounces at spectrumscale.org >> [mailto:gpfsug-discuss-bounces at spectrumscale.org] *On >> Behalf Of *Oesterlin, Robert >> *Sent:* Thursday, August 18, 2016 9:47 AM >> *To:* gpfsug main discussion list >> *Subject:* Re: [gpfsug-discuss] Monitor NSD server queue? >> >> Done. >> >> Notification generated at: 18 Aug 2016, 10:46 AM Eastern Time (ET) >> >> ID: 93260 >> Headline: Give sysadmin insight >> into the inner workings of the NSD server machinery, in particular the >> queue dynamics >> Submitted on: 18 Aug 2016, 10:46 AM Eastern >> Time (ET) >> Brand: Servers and Systems >> Software >> Product: Spectrum Scale (formerly >> known as GPFS) - Public RFEs >> >> Link: >> http://www.ibm.com/developerworks/rfe/execute?use_case=viewRfe&CR_ID=93260 >> >> >> Bob Oesterlin >> Sr Storage Engineer, Nuance HPC Grid >> 507-269-0413 >> >> >> *From: *> > on behalf of Yuri L >> Volobuev > >> *Reply-To: *gpfsug main discussion list >> > > >> *Date: *Wednesday, August 17, 2016 at 3:34 PM >> *To: *gpfsug main discussion list > > >> *Subject: *[EXTERNAL] Re: [gpfsug-discuss] Monitor NSD server queue? >> >> >> Unfortunately, at the moment there's no safe mechanism to show the >> usage statistics for different NSD queues. "mmfsadm saferdump nsd" as >> implemented doesn't acquire locks when parsing internal data >> structures. Now, NSD data structures are fairly static, as much things >> go, so the risk of following a stale pointer and hitting a segfault >> isn't particularly significant. I don't think I remember ever seeing >> mmfsd crash with NSD dump code on the stack. That said, this isn't >> code that's tested and known to be safe for production use. I haven't >> seen a case myself where an mmfsd thread gets stuck running this dump >> command, either, but Bob has. If that condition ever reoccurs, I'd be >> interested in seeing debug data. 
>> >> I agree that there's value in giving a sysadmin insight into the inner >> workings of the NSD server machinery, in particular the queue >> dynamics. mmdiag should be enhanced to allow this. That'd be a very >> reasonable (and doable) RFE. >> >> yuri >> >> "Oesterlin, Robert" ---08/17/2016 04:45:30 AM---Hi Aaron >> You did a perfect job of explaining a situation I've run into time >> after time - high latenc >> >> From: "Oesterlin, Robert" > > >> To: gpfsug main discussion list > >, >> Date: 08/17/2016 04:45 AM >> Subject: Re: [gpfsug-discuss] Monitor NSD server queue? >> Sent by: gpfsug-discuss-bounces at spectrumscale.org >> >> >> ------------------------------------------------------------------------ >> >> >> >> >> Hi Aaron >> >> You did a perfect job of explaining a situation I've run into time >> after time - high latency on the disk subsystem causing a backup in >> the NSD queues. I was doing what you suggested not to do - "mmfsadm >> saferdump nsd' and looking at the queues. In my case 'mmfsadm >> saferdump" would usually work or hang, rather than kill mmfsd. But - >> the hang usually resulted it a tied up thread in mmfsd, so that's no >> good either. >> >> I wish I had better news - this is the only way I've found to get >> visibility to these queues. IBM hasn't seen fit to gives us a way to >> safely look at these. I personally think it's a bug that we can't >> safely dump these structures, as they give insight as to what's >> actually going on inside the NSD server. >> >> Yuri, Sven - thoughts? >> >> >> Bob Oesterlin >> Sr Storage Engineer, Nuance HPC Grid >> >> >> >> *From: *> > on behalf of >> "Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]" >> >* >> Reply-To: *gpfsug main discussion list >> > >* >> Date: *Tuesday, August 16, 2016 at 8:46 PM* >> To: *gpfsug main discussion list > >* >> Subject: *[EXTERNAL] [gpfsug-discuss] Monitor NSD server queue? >> >> Hi Everyone, >> >> We ran into a rather interesting situation over the past week. We had >> a job that was pounding the ever loving crap out of one of our >> filesystems (called dnb02) doing about 15GB/s of reads. We had other >> jobs experience a slowdown on a different filesystem (called dnb41) >> that uses entirely separate backend storage. What I can't figure out >> is why this other filesystem was affected. I've checked IB bandwidth >> and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth >> congestion, looked at the mmpmon nsd_ds counters (including disk >> request wait time), and checked out the disk iowait values from >> collectl. I simply can't account for the slowdown on the other >> filesystem. The only thing I can think of is the high latency on >> dnb02's NSDs caused the mmfsd NSD queues to back up. >> >> Here's my question-- how can I monitor the state of th NSD queues? I >> can't find anything in mmdiag. An mmfsadm saferdump NSD shows me the >> queues and their status. I'm just not sure calling saferdump NSD every >> 10 seconds to monitor this data is going to end well. I've seen >> saferdump NSD cause mmfsd to die and that's from a task we only run >> every 6 hours that calls saferdump NSD. >> >> Any thoughts/ideas here would be great. >> >> Thanks! 
>> >> -Aaron_______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> >> ------------------------------------------------------------------------ >> >> Note: This email is for the confidential use of the named addressee(s) >> only and may contain proprietary, confidential or privileged >> information. If you are not the intended recipient, you are hereby >> notified that any review, dissemination or copying of this email is >> strictly prohibited, and to please notify the sender immediately and >> destroy this email and any attachments. Email transmission cannot be >> guaranteed to be secure or error-free. The Company, therefore, does >> not make any guarantees as to the completeness or accuracy of this >> email or any attachments. This email is for informational purposes >> only and does not constitute a recommendation, offer, request or >> solicitation of any kind to buy, sell, subscribe, redeem or perform >> any type of transaction of a financial product. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From r.sobey at imperial.ac.uk Mon Aug 22 12:59:16 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Mon, 22 Aug 2016 11:59:16 +0000 Subject: [gpfsug-discuss] CES and mmuserauth command Message-ID: Hi all, We're just about to start testing a new CES 4.2.0 cluster and at the stage of "joining" the cluster to our AD. What's the bare minimum we need to get going with this? My Windows guy (who is more Linux but whatever) has suggested the following: mmuserauth service create --type ad --data-access-method file --netbios-name store --user-name USERNAME --password --enable-nfs-kerberos --enable-kerberos --servers list,of,servers --idmap-range-size 1000000 --idmap-range 3000000 - 3500000 --unixmap-domains 'DOMAIN(500 - 2000000)' He has also asked what the following is: --idmap-role ??? --idmap-range-size ?? All our LDAP GID/UIDs are coming from a system outside of GPFS so do we leave this blank, or say master Or, now I've re-read and mmuserauth page, is this purely for when you have AFM relationships and one GPFS cluster (the subordinate / the second cluster) gets its UIDs and GIDs from another GPFS cluster (the master / the first one)? For idmap-range-size is this essentially the highest number of users and groups you can have defined within Spectrum Scale? (I love how I'm using GPFS and SS interchangeably.. forgive me!) Many thanks Richard Richard Sobey Storage Area Network (SAN) Analyst Technical Operations, ICT Imperial College London South Kensington 403, City & Guilds Building London SW7 2AZ Tel: +44 (0)20 7594 6915 Email: r.sobey at imperial.ac.uk http://www.imperial.ac.uk/admin-services/ict/ -------------- next part -------------- An HTML attachment was scrubbed... 
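Purely as an illustrative starting point (not a validated recipe): folding in the points raised further down this thread -- AD accepts only a single --servers entry, --enable-kerberos is documented for LDAP only, and a stand-alone cluster takes --idmap-role master -- the invocation might reduce to something like the following, with the netbios name, bind account, domain controller and ranges as placeholders to check against the mmuserauth documentation:

mmuserauth service create --type ad --data-access-method file \
    --netbios-name store --user-name USERNAME --password \
    --servers dc1.example.com \
    --idmap-role master \
    --idmap-range-size 1000000 --idmap-range 3000000-3500000 \
    --unixmap-domains 'DOMAIN(500-2000000)' \
    --enable-nfs-kerberos

mmuserauth service list   # confirm what actually got configured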
URL: From r.sobey at imperial.ac.uk Mon Aug 22 14:28:01 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Mon, 22 Aug 2016 13:28:01 +0000 Subject: [gpfsug-discuss] CES mmsmb options Message-ID: Related to my previous question in so far as it's to do with CES, what's this all about: [root at ces]# mmsmb config change --key-info supported Supported smb options with allowed values: gpfs:dfreequota = yes, no restrict anonymous = 0, 2 server string = any mmsmb config list shows many more options. Are they static... for example log size / location / dmapi support? I'm surely missing something obvious. It's SS 4.2.0 btw. Thanks Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From taylorm at us.ibm.com Tue Aug 23 00:30:10 2016 From: taylorm at us.ibm.com (Michael L Taylor) Date: Mon, 22 Aug 2016 16:30:10 -0700 Subject: [gpfsug-discuss] CES mmsmb options In-Reply-To: References: Message-ID: Looks like there is a per export and a global listing. These are values that can be set per export : /usr/lpp/mmfs/bin/mmsmb export change --key-info supported Supported smb options with allowed values: admin users = any // any valid user browseable = yes, no comment = any // A free text description of the export. csc policy = manual, disable, documents, programs fileid:algorithm = fsname, hostname, fsname_nodirs, fsname_norootdir gpfs:leases = yes, no gpfs:recalls = yes, no gpfs:sharemodes = yes, no gpfs:syncio = yes, no hide unreadable = yes, no oplocks = yes, no posix locking = yes, no read only = yes, no smb encrypt = auto, default, mandatory, disabled syncops:onclose = yes, no These are the values that are set globally: /usr/lpp/mmfs/bin/mmsmb config change --key-info supported Supported smb options with allowed values: gpfs:dfreequota = yes, no restrict anonymous = 0, 2 server string = any -------------- next part -------------- An HTML attachment was scrubbed... URL: From mimarsh2 at vt.edu Tue Aug 23 03:23:40 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Mon, 22 Aug 2016 22:23:40 -0400 Subject: [gpfsug-discuss] GPFS FPO Message-ID: Does anyone have any experiences to share (good or bad) about setting up and utilizing FPO for hadoop compute on top of GPFS? -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Tue Aug 23 03:37:00 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 22 Aug 2016 22:37:00 -0400 Subject: [gpfsug-discuss] GPFS FPO In-Reply-To: References: Message-ID: Yes, indeed. Note that these are my personal opinions. It seems to work quite well and it's not terribly hard to set up or get running. That said, if you've got a traditional HPC cluster with reasonably good bandwidth (and especially if your data is already on the HPC cluster) I wouldn't bother with FPO and just use something like magpie (https://github.com/LLNL/magpie) to run your hadoopy workload on GPFS on your traditional HPC cluster. I believe FPO (and by extension data locality) is important when the available bandwidth between your clients and servers/disks (in a traditional GPFS environment) is less than the bandwidth available within a node (e.g. between your local disks and the host CPU). -Aaron On 8/22/16 10:23 PM, Brian Marshall wrote: > Does anyone have any experiences to share (good or bad) about setting up > and utilizing FPO for hadoop compute on top of GPFS? 
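For anyone who does go the FPO route rather than running the analytics workload against a conventional pool, the locality behaviour is driven mostly by a handful of pool attributes in the stanza file used at file system or disk creation time. A rough sketch from memory -- treat the attribute names and values as assumptions to verify against the FPO documentation for your release:

%pool:
  pool=fpodata
  blockSize=2M
  usage=dataOnly
  layoutMap=cluster
  allowWriteAffinity=yes
  writeAffinityDepth=1
  blockGroupFactor=128

%nsd:
  nsd=node1_sda
  device=/dev/sda
  servers=node1
  usage=dataOnly
  pool=fpodata

Roughly speaking, allowWriteAffinity=yes is what makes it an FPO pool (writes land on the node doing the writing), while writeAffinityDepth and blockGroupFactor control where replicas go and how many consecutive blocks are kept together.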
> > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From mimarsh2 at vt.edu Tue Aug 23 12:56:22 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Tue, 23 Aug 2016 07:56:22 -0400 Subject: [gpfsug-discuss] GPFS FPO In-Reply-To: References: Message-ID: Aaron, Do you have experience running this on native GPFS? The docs say Lustre and any NFS filesystem. Thanks, Brian On Aug 22, 2016 10:37 PM, "Aaron Knister" wrote: > Yes, indeed. Note that these are my personal opinions. > > It seems to work quite well and it's not terribly hard to set up or get > running. That said, if you've got a traditional HPC cluster with reasonably > good bandwidth (and especially if your data is already on the HPC cluster) > I wouldn't bother with FPO and just use something like magpie ( > https://github.com/LLNL/magpie) to run your hadoopy workload on GPFS on > your traditional HPC cluster. I believe FPO (and by extension data > locality) is important when the available bandwidth between your clients > and servers/disks (in a traditional GPFS environment) is less than the > bandwidth available within a node (e.g. between your local disks and the > host CPU). > > -Aaron > > On 8/22/16 10:23 PM, Brian Marshall wrote: > >> Does anyone have any experiences to share (good or bad) about setting up >> and utilizing FPO for hadoop compute on top of GPFS? >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From janfrode at tanso.net Tue Aug 23 13:15:24 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Tue, 23 Aug 2016 14:15:24 +0200 Subject: [gpfsug-discuss] CES and mmuserauth command In-Reply-To: References: Message-ID: Sorry to see no authoritative answers yet.. I'm doing lots of CES installations, but have not quite yet gotten the full understanding of this.. Simple stuff first: --servers You can only have one with AD. --enable-kerberos shouldn't be used, as that's only for LDAP according to the documentation. Guess kerberos is implied with AD. --idmap-role -- I've been using "master". Man-page says ID map role of a stand?alone or singular system deployment must be selected "master" What the idmap options seems to be doing is configure the idmap options for Samba. Maybe best explained by: https://wiki.samba.org/index.php/Idmap_config_ad Your suggested options will then give you the samba idmap configuration: idmap config * : rangesize = 1000000 idmap config * : range = 3000000-3500000 idmap config * : read only = no idmap:cache = no idmap config * : backend = autorid idmap config DOMAIN : schema_mode = rfc2307 idmap config DOMAIN : range = 500-2000000 idmap config DOMAIN : backend = ad Most likely you want to replace DOMAIN by your AD domain name.. 
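Once the mmuserauth command has been run, a quick sanity check that this is the configuration actually in effect, and that a known AD account maps to the expected UID/GID, might look like this (the /usr/lpp/mmfs/bin paths for the CES-shipped tools and the account name are assumptions/placeholders):

/usr/lpp/mmfs/bin/mmuserauth service list          # what CES believes is configured
/usr/lpp/mmfs/bin/net conf list | grep -i idmap    # the Samba idmap settings that were generated
/usr/lpp/mmfs/bin/wbinfo -i 'DOMAIN\someuser'      # UID/GID winbind resolves for a real account
id 'DOMAIN\someuser'                               # only meaningful if winbind is wired into nsswitch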
So the --idmap options sets some defaults, that you probably won't care about, since all your users are likely covered by the specific "idmap config DOMAIN" config. Hope this helps somewhat, now I'll follow up with something I'm wondering myself...: Is the netbios name just a name, without any connection to anything in AD? Is the --user-name/--password a one-time used account that's only necessary when executing the mmuserauth command, or will it also be for communication between CES and AD while the services are running? -jf On Mon, Aug 22, 2016 at 1:59 PM, Sobey, Richard A wrote: > Hi all, > > > > We?re just about to start testing a new CES 4.2.0 cluster and at the stage > of ?joining? the cluster to our AD. What?s the bare minimum we need to get > going with this? My Windows guy (who is more Linux but whatever) has > suggested the following: > > > > mmuserauth service create --type ad --data-access-method file > > --netbios-name store --user-name USERNAME --password > > --enable-nfs-kerberos --enable-kerberos > > --servers list,of,servers > > --idmap-range-size 1000000 --idmap-range 3000000 - 3500000 > --unixmap-domains 'DOMAIN(500 - 2000000)' > > > > He has also asked what the following is: > > > > --idmap-role ??? > > --idmap-range-size ?? > > > > All our LDAP GID/UIDs are coming from a system outside of GPFS so do we > leave this blank, or say master Or, now I?ve re-read and mmuserauth page, > is this purely for when you have AFM relationships and one GPFS cluster > (the subordinate / the second cluster) gets its UIDs and GIDs from another > GPFS cluster (the master / the first one)? > > > > For idmap-range-size is this essentially the highest number of users and > groups you can have defined within Spectrum Scale? (I love how I?m using > GPFS and SS interchangeably.. forgive me!) > > > > Many thanks > > > > Richard > > > > > > Richard Sobey > > Storage Area Network (SAN) Analyst > Technical Operations, ICT > Imperial College London > South Kensington > 403, City & Guilds Building > London SW7 2AZ > Tel: +44 (0)20 7594 6915 > Email: r.sobey at imperial.ac.uk > http://www.imperial.ac.uk/admin-services/ict/ > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Tue Aug 23 14:58:17 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Tue, 23 Aug 2016 13:58:17 +0000 Subject: [gpfsug-discuss] Odd entries in quota listing Message-ID: In one of my file systems, I have some odd entries that seem to not be associated with a user - any ideas on the cause or how to track these down? This is a snippet from mmprepquota: Block Limits | File Limits Name type KB quota limit in_doubt grace | files quota limit in_doubt grace 2751555824 USR 0 1073741824 5368709120 0 none | 0 0 0 0 none 2348898617 USR 0 1073741824 5368709120 0 none | 0 0 0 0 none 2348895209 USR 0 1073741824 5368709120 0 none | 0 0 0 0 none 1610682073 USR 0 1073741824 5368709120 0 none | 0 0 0 0 none 536964752 USR 0 1073741824 5368709120 0 none | 0 0 0 0 none 403325529 USR 0 1073741824 5368709120 0 none | 0 0 0 0 none Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid -------------- next part -------------- An HTML attachment was scrubbed... 
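A couple of low-risk checks that may help narrow down where rows like these come from (the mount point is a placeholder; the numeric ID is taken from the listing above):

getent passwd 2751555824                        # does the ID resolve to any account the OS knows about?
find /gpfs/gpfs01 -uid 2751555824 -ls | head    # any files still owned by that ID? (a policy LIST rule would be faster on a large file system)

If nothing resolves and no files turn up, the rows are most likely left-over per-UID quota accounting for users that no longer exist, or for IDs that never existed in the first place -- see the replies that follow.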
URL: From jonathan at buzzard.me.uk Tue Aug 23 15:06:50 2016 From: jonathan at buzzard.me.uk (Jonathan Buzzard) Date: Tue, 23 Aug 2016 15:06:50 +0100 Subject: [gpfsug-discuss] Odd entries in quota listing In-Reply-To: References: Message-ID: <1471961210.30100.88.camel@buzzard.phy.strath.ac.uk> On Tue, 2016-08-23 at 13:58 +0000, Oesterlin, Robert wrote: > In one of my file systems, I have some odd entries that seem to not be > associated with a user - any ideas on the cause or how to track these > down? This is a snippet from mmprepquota: > > > > Block Limits > | File Limits > > Name type KB quota limit in_doubt > grace | files quota limit in_doubt grace > > 2751555824 USR 0 1073741824 5368709120 0 > none | 0 0 0 0 none > > 2348898617 USR 0 1073741824 5368709120 0 > none | 0 0 0 0 none > > 2348895209 USR 0 1073741824 5368709120 0 > none | 0 0 0 0 none > > 1610682073 USR 0 1073741824 5368709120 0 > none | 0 0 0 0 none > > 536964752 USR 0 1073741824 5368709120 0 > none | 0 0 0 0 none > > 403325529 USR 0 1073741824 5368709120 0 > none | 0 0 0 0 none > I am guessing they are quotas that have been set for users that are now deleted. GPFS stores the quota for a user under their UID, and deleting the user and all their data is not enough to remove the entry from the quota reporting, you also have to delete their quota. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. From Robert.Oesterlin at nuance.com Tue Aug 23 15:10:22 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Tue, 23 Aug 2016 14:10:22 +0000 Subject: [gpfsug-discuss] Odd entries in quota listing Message-ID: <93B0F53A-4ECD-4527-A67D-DD6C9B00F8E7@nuance.com> Well - good idea, but these large numbers in no way reflect valid ID numbers in our environment. Wondering how they got there? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: on behalf of Jonathan Buzzard Reply-To: gpfsug main discussion list Date: Tuesday, August 23, 2016 at 9:06 AM To: "gpfsug-discuss at spectrumscale.org" Subject: [EXTERNAL] Re: [gpfsug-discuss] Odd entries in quota listing I am guessing they are quotas that have been set for users that are now deleted. GPFS stores the quota for a user under their UID, and deleting the user and all their data is not enough to remove the entry from the quota reporting, you also have to delete their quota. -------------- next part -------------- An HTML attachment was scrubbed... URL: From jonathan at buzzard.me.uk Tue Aug 23 15:16:05 2016 From: jonathan at buzzard.me.uk (Jonathan Buzzard) Date: Tue, 23 Aug 2016 15:16:05 +0100 Subject: [gpfsug-discuss] Odd entries in quota listing In-Reply-To: <93B0F53A-4ECD-4527-A67D-DD6C9B00F8E7@nuance.com> References: <93B0F53A-4ECD-4527-A67D-DD6C9B00F8E7@nuance.com> Message-ID: <1471961765.30100.90.camel@buzzard.phy.strath.ac.uk> On Tue, 2016-08-23 at 14:10 +0000, Oesterlin, Robert wrote: > Well - good idea, but these large numbers in no way reflect valid ID > numbers in our environment. Wondering how they got there? > I was guessing generating UID's from Windows RID's? Alternatively some script generated them automatically and the UID's are bogus. You can create a quota for any random UID and GPFS won't complain. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. 
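If Windows-derived IDs are a plausible source, one quick way to test that theory on a node where winbind is (or was) in use -- the SID shown is a placeholder:

wbinfo --uid-to-sid 2751555824             # does the numeric ID map back to an AD SID?
wbinfo --sid-to-name S-1-5-21-placeholder  # and if so, which account owns that SID?

If the IDs map to nothing, the other explanation above -- something having set quotas against bogus UIDs, which GPFS accepts without complaint -- becomes the more likely one.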
From aaron.s.knister at nasa.gov Wed Aug 24 17:43:56 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Wed, 24 Aug 2016 12:43:56 -0400 Subject: [gpfsug-discuss] GPFS FPO In-Reply-To: References: Message-ID: <6f5a7284-c910-bbda-5e53-7f78e4289ad9@nasa.gov> To tell you the truth, I don't. It's on my radar but I haven't done it yet. I *have* run hadoop on GPFS w/o magpie though and on only a couple of nodes was able to pound 1GB/s out to GPFS w/ the terasort benchmark. I know our GPFS FS can go much faster than that but java was cpu-bound as it often seems to be. -Aaron On 8/23/16 7:56 AM, Brian Marshall wrote: > Aaron, > > Do you have experience running this on native GPFS? The docs say Lustre > and any NFS filesystem. > > Thanks, > Brian > > > On Aug 22, 2016 10:37 PM, "Aaron Knister" > wrote: > > Yes, indeed. Note that these are my personal opinions. > > It seems to work quite well and it's not terribly hard to set up or > get running. That said, if you've got a traditional HPC cluster with > reasonably good bandwidth (and especially if your data is already on > the HPC cluster) I wouldn't bother with FPO and just use something > like magpie (https://github.com/LLNL/magpie > ) to run your hadoopy workload on > GPFS on your traditional HPC cluster. I believe FPO (and by > extension data locality) is important when the available bandwidth > between your clients and servers/disks (in a traditional GPFS > environment) is less than the bandwidth available within a node > (e.g. between your local disks and the host CPU). > > -Aaron > > On 8/22/16 10:23 PM, Brian Marshall wrote: > > Does anyone have any experiences to share (good or bad) about > setting up > and utilizing FPO for hadoop compute on top of GPFS? > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From SAnderson at convergeone.com Thu Aug 25 17:32:48 2016 From: SAnderson at convergeone.com (Shaun Anderson) Date: Thu, 25 Aug 2016 16:32:48 +0000 Subject: [gpfsug-discuss] mmcessmbchconfig command Message-ID: <1472142769455.35752@convergeone.com> ?I'm not seeing many of the 'mmces' commands listed and there is no man page for many of them. I'm specifically looking at the mmcessmbchconfig command and my syntax isn't being taken. Any insight? SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From bbanister at jumptrading.com Thu Aug 25 17:47:00 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Thu, 25 Aug 2016 16:47:00 +0000 Subject: [gpfsug-discuss] mmcessmbchconfig command In-Reply-To: <1472142769455.35752@convergeone.com> References: <1472142769455.35752@convergeone.com> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB0630BF86@CHI-EXCHANGEW1.w2k.jumptrading.com> My general rule is that if there isn?t a man page or ?-h? option to explain the usage of the command, then it isn?t meant to be run by an user administrator. I wish that the commands that should never be run by a user admin (or without direction from IBM support) would be put in a different directory that clearly indicated they are for internal GPFS use. RFE worthy? Cheers, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Shaun Anderson Sent: Thursday, August 25, 2016 11:33 AM To: gpfsug main discussion list Subject: [gpfsug-discuss] mmcessmbchconfig command ?I'm not seeing many of the 'mmces' commands listed and there is no man page for many of them. I'm specifically looking at the mmcessmbchconfig command and my syntax isn't being taken. Any insight? SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. -------------- next part -------------- An HTML attachment was scrubbed... URL: From bbanister at jumptrading.com Thu Aug 25 17:50:20 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Thu, 25 Aug 2016 16:50:20 +0000 Subject: [gpfsug-discuss] mmcessmbchconfig command In-Reply-To: <21BC488F0AEA2245B2C3E83FC0B33DBB0630BF86@CHI-EXCHANGEW1.w2k.jumptrading.com> References: <1472142769455.35752@convergeone.com> <21BC488F0AEA2245B2C3E83FC0B33DBB0630BF86@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB0630BFD5@CHI-EXCHANGEW1.w2k.jumptrading.com> I realize this was totally tangential to your question. Sorry I can?t help with the syntax, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Bryan Banister Sent: Thursday, August 25, 2016 11:47 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] mmcessmbchconfig command My general rule is that if there isn?t a man page or ?-h? option to explain the usage of the command, then it isn?t meant to be run by an user administrator. 
I wish that the commands that should never be run by a user admin (or without direction from IBM support) would be put in a different directory that clearly indicated they are for internal GPFS use. RFE worthy? Cheers, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Shaun Anderson Sent: Thursday, August 25, 2016 11:33 AM To: gpfsug main discussion list > Subject: [gpfsug-discuss] mmcessmbchconfig command ?I'm not seeing many of the 'mmces' commands listed and there is no man page for many of them. I'm specifically looking at the mmcessmbchconfig command and my syntax isn't being taken. Any insight? SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. -------------- next part -------------- An HTML attachment was scrubbed... URL: From taylorm at us.ibm.com Thu Aug 25 17:55:44 2016 From: taylorm at us.ibm.com (Michael L Taylor) Date: Thu, 25 Aug 2016 09:55:44 -0700 Subject: [gpfsug-discuss] mmcessmbchconfig command In-Reply-To: References: Message-ID: Not sure where mmcessmbchconfig command is coming from? mmsmb is the proper CLI syntax [root at smaug-vm1 installer]# /usr/lpp/mmfs/bin/mmsmb Usage: mmsmb export Administer SMB exports. mmsmb exportacl Administer SMB export ACLs. mmsmb config Administer SMB global configuration. [root at smaug-vm1 installer]# /usr/lpp/mmfs/bin/mmsmb export -h Usage: mmsmb export list List SMB exports. mmsmb export add Add SMB exports. mmsmb export change Change SMB exports. mmsmb export remove Remove SMB exports. 
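As a quick illustration of the same CLI (the share name and path below are made up, not taken from the original post, and the directory is assumed to exist already), creating and then verifying an export looks like this:

    # Hypothetical example: export an existing GPFS directory over SMB
    /usr/lpp/mmfs/bin/mmsmb export add projects /gpfs/gpfs01/projects
    /usr/lpp/mmfs/bin/mmsmb export list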
[root at smaug-vm1 installer]# man mmsmb
http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_mmsmb.htm

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From mweil at wustl.edu Thu Aug 25 19:50:52 2016
From: mweil at wustl.edu (Matt Weil)
Date: Thu, 25 Aug 2016 13:50:52 -0500
Subject: [gpfsug-discuss] Backup on object stores
Message-ID: <5cc4ae43-2d0f-e548-b256-84f1890fe2d3@wustl.edu>

Hello all,

Just brainstorming here, mainly, but I want to know how you are all approaching this. Do you replicate using GPFS and forget about backups?

> https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adv_osbackup.htm

This seems good for a full recovery, but what if I just lost one object? It seems that if the objectizer is in use, then both Tivoli and space management can be used on the file.

Thanks in advance for your responses.

Matt

________________________________
The materials in this message are private and may contain Protected Healthcare Information or other information of a sensitive nature. If you are not the intended recipient, be advised that any unauthorized use, disclosure, copying or the taking of any action in reliance on the contents of this information is strictly prohibited.
If you have received this email in error, please immediately notify the sender via telephone or return mail. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From Greg.Lehmann at csiro.au Fri Aug 26 00:14:57 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Thu, 25 Aug 2016 23:14:57 +0000 Subject: [gpfsug-discuss] mmcessmbchconfig command In-Reply-To: <21BC488F0AEA2245B2C3E83FC0B33DBB0630BF86@CHI-EXCHANGEW1.w2k.jumptrading.com> References: <1472142769455.35752@convergeone.com> <21BC488F0AEA2245B2C3E83FC0B33DBB0630BF86@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: <156b078bfb2d48d8b77d5250dba7e928@exch1-cdc.nexus.csiro.au> I agree with an RFE. From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Bryan Banister Sent: Friday, 26 August 2016 2:47 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] mmcessmbchconfig command My general rule is that if there isn?t a man page or ?-h? option to explain the usage of the command, then it isn?t meant to be run by an user administrator. I wish that the commands that should never be run by a user admin (or without direction from IBM support) would be put in a different directory that clearly indicated they are for internal GPFS use. RFE worthy? Cheers, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Shaun Anderson Sent: Thursday, August 25, 2016 11:33 AM To: gpfsug main discussion list > Subject: [gpfsug-discuss] mmcessmbchconfig command ?I'm not seeing many of the 'mmces' commands listed and there is no man page for many of them. I'm specifically looking at the mmcessmbchconfig command and my syntax isn't being taken. Any insight? SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From syi at ca.ibm.com Fri Aug 26 00:15:46 2016 From: syi at ca.ibm.com (Yi Sun) Date: Thu, 25 Aug 2016 19:15:46 -0400 Subject: [gpfsug-discuss] mmcessmbchconfig command In-Reply-To: References: Message-ID: You may check mmsmb command, not sure if it is what you look for. https://www.ibm.com/support/knowledgecenter/STXKQY_4.1.1/com.ibm.spectrum.scale.v4r11.adm.doc/bl1adm_mmsmb.htm#mmsmb ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- From: Shaun Anderson To: gpfsug main discussion list Subject: [gpfsug-discuss] mmcessmbchconfig command Message-ID: <1472142769455.35752 at convergeone.com> Content-Type: text/plain; charset="iso-8859-1" ?I'm not seeing many of the 'mmces' commands listed and there is no man page for many of them. I'm specifically looking at the mmcessmbchconfig command and my syntax isn't being taken. Any insight? SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 -------------- next part -------------- An HTML attachment was scrubbed... URL: From christof.schmitt at us.ibm.com Fri Aug 26 00:49:12 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Thu, 25 Aug 2016 19:49:12 -0400 Subject: [gpfsug-discuss] CES and mmuserauth command In-Reply-To: References: Message-ID: To clarify and expand on some of these: --servers takes the AD Domain Controller that is contacted first during configuration. Later and during normal operations the list of DCs is retrieved from DNS and the fastest (or closest one according to the AD sites) is used. The initially one used does not have a special role. --idmap-role allows dedicating one cluster as a master, and a second cluster (e.g. a AFM replication target) as "subordinate". Only the master will allocate idmap ranges which can then be imported to the subordiate to have consistent id mappings. --idmap-range-size and --idmap-range are used for the internal idmap allocation which is used for every domain that is not explicitly using another domain. "man idmap_autorid" explains the approach taken. As long as the default does not overlap with any other ids, that can be used. The "netbios" name is used to create the machine account for the cluster when joining the AD domain. That is how the AD administrator will identify the CES cluster. It is also important in SMB deployments when Kerberos should be used with SMB: The same names as the netbios name has to be defined in DNS for the public CES IP addresses. When the name matches, then SMB clients can acquire a Kerberos ticket from AD to establish a SMB connection. When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Jan-Frode Myklebust To: gpfsug main discussion list Date: 08/23/2016 08:15 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org Sorry to see no authoritative answers yet.. I'm doing lots of CES installations, but have not quite yet gotten the full understanding of this.. 
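A quick way to see where that line is drawn on a given cluster is to compare the full configuration with the admin-changeable subset. Both commands below appear elsewhere in this thread and only list information; they do not change anything:

    # List every Samba option currently in effect, including internal ones
    /usr/lpp/mmfs/bin/mmsmb config list
    # List only the options an administrator is allowed to change
    /usr/lpp/mmfs/bin/mmsmb config change --key-info supported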
Simple stuff first: --servers You can only have one with AD. --enable-kerberos shouldn't be used, as that's only for LDAP according to the documentation. Guess kerberos is implied with AD. --idmap-role -- I've been using "master". Man-page says ID map role of a stand?alone or singular system deployment must be selected "master" What the idmap options seems to be doing is configure the idmap options for Samba. Maybe best explained by: https://wiki.samba.org/index.php/Idmap_config_ad Your suggested options will then give you the samba idmap configuration: idmap config * : rangesize = 1000000 idmap config * : range = 3000000-3500000 idmap config * : read only = no idmap:cache = no idmap config * : backend = autorid idmap config DOMAIN : schema_mode = rfc2307 idmap config DOMAIN : range = 500-2000000 idmap config DOMAIN : backend = ad Most likely you want to replace DOMAIN by your AD domain name.. So the --idmap options sets some defaults, that you probably won't care about, since all your users are likely covered by the specific "idmap config DOMAIN" config. Hope this helps somewhat, now I'll follow up with something I'm wondering myself...: Is the netbios name just a name, without any connection to anything in AD? Is the --user-name/--password a one-time used account that's only necessary when executing the mmuserauth command, or will it also be for communication between CES and AD while the services are running? -jf On Mon, Aug 22, 2016 at 1:59 PM, Sobey, Richard A wrote: Hi all, We?re just about to start testing a new CES 4.2.0 cluster and at the stage of ?joining? the cluster to our AD. What?s the bare minimum we need to get going with this? My Windows guy (who is more Linux but whatever) has suggested the following: mmuserauth service create --type ad --data-access-method file --netbios-name store --user-name USERNAME --password --enable-nfs-kerberos --enable-kerberos --servers list,of,servers --idmap-range-size 1000000 --idmap-range 3000000 - 3500000 --unixmap-domains 'DOMAIN(500 - 2000000)' He has also asked what the following is: --idmap-role ??? --idmap-range-size ?? All our LDAP GID/UIDs are coming from a system outside of GPFS so do we leave this blank, or say master Or, now I?ve re-read and mmuserauth page, is this purely for when you have AFM relationships and one GPFS cluster (the subordinate / the second cluster) gets its UIDs and GIDs from another GPFS cluster (the master / the first one)? For idmap-range-size is this essentially the highest number of users and groups you can have defined within Spectrum Scale? (I love how I?m using GPFS and SS interchangeably.. forgive me!) 
Many thanks Richard Richard Sobey Storage Area Network (SAN) Analyst Technical Operations, ICT Imperial College London South Kensington 403, City & Guilds Building London SW7 2AZ Tel: +44 (0)20 7594 6915 Email: r.sobey at imperial.ac.uk http://www.imperial.ac.uk/admin-services/ict/ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From christof.schmitt at us.ibm.com Fri Aug 26 00:49:12 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Thu, 25 Aug 2016 19:49:12 -0400 Subject: [gpfsug-discuss] mmcessmbchconfig command In-Reply-To: <1472142769455.35752@convergeone.com> References: <1472142769455.35752@convergeone.com> Message-ID: The mmcessmb* commands are scripts that are run from the corresponding mmsmb subcommands. mmsmb is documented and should be used instead of calling the mmcesmb* scripts directly. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Shaun Anderson To: gpfsug main discussion list Date: 08/25/2016 12:33 PM Subject: [gpfsug-discuss] mmcessmbchconfig command Sent by: gpfsug-discuss-bounces at spectrumscale.org ?I'm not seeing many of the 'mmces' commands listed and there is no man page for many of them. I'm specifically looking at the mmcessmbchconfig command and my syntax isn't being taken. Any insight? SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From christof.schmitt at us.ibm.com Fri Aug 26 00:52:50 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Thu, 25 Aug 2016 19:52:50 -0400 Subject: [gpfsug-discuss] CES mmsmb options In-Reply-To: References: Message-ID: The options listed in " mmsmb config change --key-info supported" are supported to be changed by administrator of the cluster. "mmsmb config list" lists the whole Samba config, including the options that are set internally. We do not want to support any random Samba configuration, hence the line between "supported" option and everything else. If there is a usecase that requires other Samba options than the ones listed as "supported", one way forward would be opening a RFE that describes the usecase and the Samba option to support it. 
Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: "Sobey, Richard A" To: "'gpfsug-discuss at spectrumscale.org'" Date: 08/22/2016 09:28 AM Subject: [gpfsug-discuss] CES mmsmb options Sent by: gpfsug-discuss-bounces at spectrumscale.org Related to my previous question in so far as it?s to do with CES, what?s this all about: [root at ces]# mmsmb config change --key-info supported Supported smb options with allowed values: gpfs:dfreequota = yes, no restrict anonymous = 0, 2 server string = any mmsmb config list shows many more options. Are they static? for example log size / location / dmapi support? I?m surely missing something obvious. It?s SS 4.2.0 btw. Thanks Richard_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From gaurang.tapase at in.ibm.com Fri Aug 26 08:53:12 2016 From: gaurang.tapase at in.ibm.com (Gaurang Tapase) Date: Fri, 26 Aug 2016 13:23:12 +0530 Subject: [gpfsug-discuss] Blogs and publications on Spectrum Scale Message-ID: Hello, On Request from Bob Oesterlin, we post these links on User Group - Here are the latest publications and Blogs on Spectrum Scale. We encourage the User Group to follow the Spectrum Scale blogs on the http://storagecommunity.org or the Usergroup admin to register the email group of the feeds. A total of 25 recent Blogs on IBM Spectrum Scale by developers IBM Spectrum Scale Security IBM Spectrum Scale: Security Blog Series http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-blog-series , Spectrum Scale Security Blog Series: Introduction, http://storagecommunity.org/easyblog/entry/spectrum-scale-security-blog-series-introduction IBM Spectrum Scale Security: VLANs and Protocol nodes, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-vlans-and-protocol-nodes IBM Spectrum Scale Security: Firewall Overview http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-firewall-overview IBM Spectrum Scale Security Blog Series: Security with Spectrum Scale OpenStack Storage Drivers http://storagecommunity.org/easyblog/entry/security-with-spectrum-scale-openstack-storage-drivers , IBM Spectrum Scale Security Blog Series: Authorization http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-blog-series-authorization IBM Spectrum Scale: Object (OpenStack Swift, S3) Authorization, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-object-openstack-swift-s3-authorization , IBM Spectrum Scale Security: Secure Data at Rest, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-secure-data-at-rest IBM Spectrum Scale Security Blog Series: Secure Data in Transit, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-blog-series-secure-data-in-transit-1 IBM Spectrum Scale Security Blog Series: Sudo based Secure Administration and Admin Command Logging, http://storagecommunity.org/easyblog/entry/spectrum-scale-security-blog-series-sudo-based-secure-administration-and-admin-command-logging IBM Spectrum Scale Security: Security Features of Transparent Cloud Tiering (TCT), http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-security-features-of-transparent-cloud-tiering-tct IBM Spectrum Scale: Immutability, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-immutability IBM Spectrum Scale : FILE protocols authentication 
http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-file-protocols-authentication IBM Spectrum Scale : Object Authentication, http://storagecommunity.org/easyblog/entry/protocol-authentication-object, IBM Spectrum Scale Security: Anti-Virus bulk scanning, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-anti-virus-bulk-scanning , Spectrum Scale 4.2.1 - What's New http://storagecommunity.org/easyblog/entry/spectrum-scale-4-2-1-what-s-new IBM Spectrum Scale 4.2.1 : diving deeper, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-4-2-1-diving-deeper NEW DEMO: Using IBM Cloud Object Storage as IBM Spectrum Scale Transparent Cloud Tier, http://storagecommunity.org/easyblog/entry/new-demo-using-ibm-cloud-object-storage-as-ibm-spectrum-scale-transparent-cloud-tier Spectrum Scale transparent cloud tiering, http://storagecommunity.org/easyblog/entry/spectrum-scale-transparent-cloud-tiering Spectrum Scale in Wonderland - Introducing transparent cloud tiering with Spectrum Scale 4.2.1, http://storagecommunity.org/easyblog/entry/spectrum-scale-in-wonderland, Spectrum Scale Object Related Blogs IBM Spectrum Scale 4.2.1 - What's new in Object, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-4-2-1-what-s-new-in-object , Hot cakes or hot objects, they better be served fast http://storagecommunity.org/easyblog/entry/hot-cakes-or-hot-objects-they-better-be-served-fast IBM Spectrum Scale: Object (OpenStack Swift, S3) Authorization, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-object-openstack-swift-s3-authorization , IBM Spectrum Scale : Object Authentication, http://storagecommunity.org/easyblog/entry/protocol-authentication-object, Spectrum Scale BD&A IBM Spectrum Scale: new features of HDFS Transparency, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-new-features-of-hdfs-transparency , Regards, ------------------------------------------------------------------------ Gaurang S Tapase Spectrum Scale & OpenStack Development IBM India Storage Lab, Pune (India) Email : gaurang.tapase at in.ibm.com Phone : +91-20-42025699 (W), +91-9860082042(Cell) ------------------------------------------------------------------------- -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Fri Aug 26 09:17:55 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 26 Aug 2016 08:17:55 +0000 Subject: [gpfsug-discuss] CES mmsmb options In-Reply-To: References: Message-ID: Thanks Christof, and for the detailed posting on the mmuserauth settings. I do not know why we have changed dmapi support in our existing smb.conf, but perhaps it was for some legacy stuff. Richard -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Christof Schmitt Sent: 26 August 2016 00:53 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] CES mmsmb options The options listed in " mmsmb config change --key-info supported" are supported to be changed by administrator of the cluster. "mmsmb config list" lists the whole Samba config, including the options that are set internally. We do not want to support any random Samba configuration, hence the line between "supported" option and everything else. If there is a usecase that requires other Samba options than the ones listed as "supported", one way forward would be opening a RFE that describes the usecase and the Samba option to support it. 
Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: "Sobey, Richard A" To: "'gpfsug-discuss at spectrumscale.org'" Date: 08/22/2016 09:28 AM Subject: [gpfsug-discuss] CES mmsmb options Sent by: gpfsug-discuss-bounces at spectrumscale.org Related to my previous question in so far as it?s to do with CES, what?s this all about: [root at ces]# mmsmb config change --key-info supported Supported smb options with allowed values: gpfs:dfreequota = yes, no restrict anonymous = 0, 2 server string = any mmsmb config list shows many more options. Are they static? for example log size / location / dmapi support? I?m surely missing something obvious. It?s SS 4.2.0 btw. Thanks Richard_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From r.sobey at imperial.ac.uk Fri Aug 26 09:48:24 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 26 Aug 2016 08:48:24 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Message-ID: Sorry all, prepare for a deluge of emails like this, hopefully it'll help other people implementing CES in the future. I'm trying to stop SMB on a node, but getting the following output: [root at cesnode ~]# mmces service stop smb smb: Request denied. Please stop NFS first [root at cesnode ~]# mmces service list Enabled services: SMB SMB is running As you can see there is no way to stop NFS when it's not running but it seems to be blocking me. It's happening on all the nodes in the cluster. SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From janfrode at tanso.net Fri Aug 26 10:48:18 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Fri, 26 Aug 2016 09:48:18 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: Message-ID: That was a weird one :-) Don't understand why NFS would block smb.., and I don't see that on my cluster. Would it make sense to suspend the node instead? As a workaround. mmces node suspend -jf fre. 26. aug. 2016 kl. 10.48 skrev Sobey, Richard A : > Sorry all, prepare for a deluge of emails like this, hopefully it?ll help > other people implementing CES in the future. > > > > I?m trying to stop SMB on a node, but getting the following output: > > > > [root at cesnode ~]# mmces service stop smb > > smb: Request denied. Please stop NFS first > > > > [root at cesnode ~]# mmces service list > > Enabled services: SMB > > SMB is running > > > > As you can see there is no way to stop NFS when it?s not running but it > seems to be blocking me. It?s happening on all the nodes in the cluster. > > > > SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. > > > > Richard > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From konstantin.arnold at unibas.ch Fri Aug 26 10:56:28 2016 From: konstantin.arnold at unibas.ch (Konstantin Arnold) Date: Fri, 26 Aug 2016 11:56:28 +0200 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? 
In-Reply-To: References: Message-ID: <57C0124C.7050404@unibas.ch>

Hi Richard,

I ran into the same issue and asked whether 'systemctl reload gpfs-smb.service' would work. I got the following answer: "... Now in regards to your question about stopping NFS, yes this is an expected behavior and yes you could also restart through systemctl."

Maybe that helps.
Konstantin

From janfrode at tanso.net Fri Aug 26 10:59:34 2016
From: janfrode at tanso.net (Jan-Frode Myklebust)
Date: Fri, 26 Aug 2016 11:59:34 +0200
Subject: Re: [gpfsug-discuss] CES and mmuserauth command
In-Reply-To: References: Message-ID:

On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt <christof.schmitt at us.ibm.com> wrote:

> When joining the AD domain, --user-name, --password and --server are only
> used to initially identify and logon to the AD and to create the machine
> account for the cluster. Once that is done, that information is no longer
> used, and e.g. the account from --user-name could be deleted, the password
> changed or the specified DC could be removed from the domain (as long as
> other DCs are remaining).

That was my initial understanding of the --user-name, but when reading the man page I get the impression that it's also used to connect to AD to do user and group lookups:

------------------------------------------------------------------------------------------------------
--user-name userName
        Specifies the user name to be used to perform operations
        against the authentication server. The specified user
        name must have sufficient permissions to read user and
        group attributes from the authentication server.
-------------------------------------------------------------------------------------------------------

Also, it's strange that "mmuserauth service list" would list the USER_NAME if it was only something that was used at configuration time..?

-jf
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From christof.schmitt at us.ibm.com Fri Aug 26 17:29:31 2016
From: christof.schmitt at us.ibm.com (Christof Schmitt)
Date: Fri, 26 Aug 2016 12:29:31 -0400
Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first?
In-Reply-To: References: Message-ID:

That would be the case when Active Directory is configured for authentication. In that case the SMB service includes two aspects: one is the actual SMB file server, and the second is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running.

Regards,

Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ
christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469)

From: "Sobey, Richard A"
To: "'gpfsug-discuss at spectrumscale.org'"
Date: 08/26/2016 04:48 AM
Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first?
Sent by: gpfsug-discuss-bounces at spectrumscale.org

Sorry all, prepare for a deluge of emails like this; hopefully it'll help other people implementing CES in the future.

I'm trying to stop SMB on a node, but I'm getting the following output:

[root at cesnode ~]# mmces service stop smb
smb: Request denied. Please stop NFS first

[root at cesnode ~]# mmces service list
Enabled services: SMB
SMB is running

As you can see there is no way to stop NFS when it's not running, but it seems to be blocking me. It's happening on all the nodes in the cluster.

SS version is 4.2.0 running on a fully up to date RHEL 7.1 server.
Richard_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From christof.schmitt at us.ibm.com Fri Aug 26 17:29:31 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Fri, 26 Aug 2016 12:29:31 -0400 Subject: [gpfsug-discuss] CES and mmuserauth command In-Reply-To: References: Message-ID: The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Jan-Frode Myklebust To: gpfsug main discussion list Date: 08/26/2016 05:59 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: ------------------------------------------------------------------------------------------------------ ??user?name userName Specifies the user name to be used to perform operations against the authentication server. The specified user name must have sufficient permissions to read user and group attributes from the authentication server. ------------------------------------------------------------------------------------------------------- Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? -jf_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From dacalder at co.ibm.com Sat Aug 27 13:52:44 2016 From: dacalder at co.ibm.com (Danny Alexander Calderon Rodriguez) Date: Sat, 27 Aug 2016 12:52:44 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: Message-ID: Hi Richard This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly Just do this. 
edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py Change if authType == 'ad' and not nodeState.nfsStopped: to nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: You need to stop the gpfs service in each node where you apply the change " after change the lines please use tap key" Enviado desde mi iPhone > El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: > > Send gpfsug-discuss mailing list submissions to > gpfsug-discuss at spectrumscale.org > > To subscribe or unsubscribe via the World Wide Web, visit > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > or, via email, send a message with subject or body 'help' to > gpfsug-discuss-request at spectrumscale.org > > You can reach the person managing the list at > gpfsug-discuss-owner at spectrumscale.org > > When replying, please edit your Subject line so it is more specific > than "Re: Contents of gpfsug-discuss digest..." > > > Today's Topics: > > 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) > 2. Re: CES and mmuserauth command (Christof Schmitt) > > > ---------------------------------------------------------------------- > > Message: 1 > Date: Fri, 26 Aug 2016 12:29:31 -0400 > From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > Message-ID: > > > Content-Type: text/plain; charset="UTF-8" > > That would be the case when Active Directory is configured for > authentication. In that case the SMB service includes two aspects: One is > the actual SMB file server, and the second one is the service for the > Active Directory integration. Since NFS depends on authentication and id > mapping services, it requires SMB to be running. > > Regards, > > Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ > christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) > > > > From: "Sobey, Richard A" > To: "'gpfsug-discuss at spectrumscale.org'" > > Date: 08/26/2016 04:48 AM > Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Sorry all, prepare for a deluge of emails like this, hopefully it?ll help > other people implementing CES in the future. > > I?m trying to stop SMB on a node, but getting the following output: > > [root at cesnode ~]# mmces service stop smb > smb: Request denied. Please stop NFS first > > [root at cesnode ~]# mmces service list > Enabled services: SMB > SMB is running > > As you can see there is no way to stop NFS when it?s not running but it > seems to be blocking me. It?s happening on all the nodes in the cluster. > > SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. > > Richard_______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > ------------------------------ > > Message: 2 > Date: Fri, 26 Aug 2016 12:29:31 -0400 > From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] CES and mmuserauth command > Message-ID: > > > Content-Type: text/plain; charset="ISO-2022-JP" > > The --user-name option applies to both, AD and LDAP authentication. In the > LDAP case, this information is correct. I will try to get some > clarification added for the AD case. > > The same applies to the information shown in "service list". 
There is a > common field that holds the information and the parameter from the initial > "service create" is stored there. The meaning is different for AD and > LDAP: For LDAP it is the username being used to access the LDAP server, > while in the AD case it was only the user initially used until the machine > account was created. > > Regards, > > Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ > christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) > > > > From: Jan-Frode Myklebust > To: gpfsug main discussion list > Date: 08/26/2016 05:59 AM > Subject: Re: [gpfsug-discuss] CES and mmuserauth command > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < > christof.schmitt at us.ibm.com> wrote: > > When joinging the AD domain, --user-name, --password and --server are only > used to initially identify and logon to the AD and to create the machine > account for the cluster. Once that is done, that information is no longer > used, and e.g. the account from --user-name could be deleted, the password > changed or the specified DC could be removed from the domain (as long as > other DCs are remaining). > > > That was my initial understanding of the --user-name, but when reading the > man-page I get the impression that it's also used to do connect to AD to > do user and group lookups: > > ------------------------------------------------------------------------------------------------------ > ??user?name userName > Specifies the user name to be used to perform operations > against the authentication server. The specified user > name must have sufficient permissions to read user and > group attributes from the authentication server. > ------------------------------------------------------------------------------------------------------- > > Also it's strange that "mmuserauth service list" would list the USER_NAME > if it was only somthing that was used at configuration time..? > > > > -jf_______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > > ------------------------------ > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > End of gpfsug-discuss Digest, Vol 55, Issue 44 > ********************************************** > -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Sat Aug 27 20:06:45 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Sat, 27 Aug 2016 19:06:45 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: Message-ID: Hi, Thanks for the info! I think I?ll perform an upgrade to 4.2.1, the cluster is still in a pre-production state and I?ve yet to really start testing client access. Richard From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Danny Alexander Calderon Rodriguez Sent: 27 August 2016 13:53 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Hi Richard This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly Just do this. 
edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py Change if authType == 'ad' and not nodeState.nfsStopped: to nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: You need to stop the gpfs service in each node where you apply the change " after change the lines please use tap key" Enviado desde mi iPhone El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) 2. Re: CES and mmuserauth command (Christof Schmitt) ---------------------------------------------------------------------- Message: 1 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Message-ID: > Content-Type: text/plain; charset="UTF-8" That would be the case when Active Directory is configured for authentication. In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: "Sobey, Richard A" > To: "'gpfsug-discuss at spectrumscale.org'" > Date: 08/26/2016 04:48 AM Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Sent by: gpfsug-discuss-bounces at spectrumscale.org Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. I?m trying to stop SMB on a node, but getting the following output: [root at cesnode ~]# mmces service stop smb smb: Request denied. Please stop NFS first [root at cesnode ~]# mmces service list Enabled services: SMB SMB is running As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. It?s happening on all the nodes in the cluster. SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. Richard_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ Message: 2 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] CES and mmuserauth command Message-ID: > Content-Type: text/plain; charset="ISO-2022-JP" The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. 
The meaning is different for AD and LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Jan-Frode Myklebust > To: gpfsug main discussion list > Date: 08/26/2016 05:59 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: ------------------------------------------------------------------------------------------------------ ??user?name userName Specifies the user name to be used to perform operations against the authentication server. The specified user name must have sufficient permissions to read user and group attributes from the authentication server. ------------------------------------------------------------------------------------------------------- Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? -jf_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss End of gpfsug-discuss Digest, Vol 55, Issue 44 ********************************************** -------------- next part -------------- An HTML attachment was scrubbed... URL: From Greg.Lehmann at csiro.au Mon Aug 29 00:57:21 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Sun, 28 Aug 2016 23:57:21 +0000 Subject: [gpfsug-discuss] Blogs and publications on Spectrum Scale In-Reply-To: References: Message-ID: <57496841ec784222b5e291a921280c38@exch1-cdc.nexus.csiro.au> It would be nice if the Spectrum Scale User Group website had links to these, perhaps a separate page for blogs links. From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Gaurang Tapase Sent: Friday, 26 August 2016 5:53 PM To: gpfsug main discussion list Cc: Sandeep Ramesh Subject: [gpfsug-discuss] Blogs and publications on Spectrum Scale Hello, On Request from Bob Oesterlin, we post these links on User Group - Here are the latest publications and Blogs on Spectrum Scale. We encourage the User Group to follow the Spectrum Scale blogs on the http://storagecommunity.orgor the Usergroup admin to register the email group of the feeds. 
A total of 25 recent Blogs on IBM Spectrum Scale by developers IBM Spectrum Scale Security IBM Spectrum Scale: Security Blog Series http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-blog-series, Spectrum Scale Security Blog Series: Introduction, http://storagecommunity.org/easyblog/entry/spectrum-scale-security-blog-series-introduction IBM Spectrum Scale Security: VLANs and Protocol nodes, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-vlans-and-protocol-nodes IBM Spectrum Scale Security: Firewall Overview http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-firewall-overview IBM Spectrum Scale Security Blog Series: Security with Spectrum Scale OpenStack Storage Drivers http://storagecommunity.org/easyblog/entry/security-with-spectrum-scale-openstack-storage-drivers, IBM Spectrum Scale Security Blog Series: Authorization http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-blog-series-authorization IBM Spectrum Scale: Object (OpenStack Swift, S3) Authorization, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-object-openstack-swift-s3-authorization, IBM Spectrum Scale Security: Secure Data at Rest, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-secure-data-at-rest IBM Spectrum Scale Security Blog Series: Secure Data in Transit, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-blog-series-secure-data-in-transit-1 IBM Spectrum Scale Security Blog Series: Sudo based Secure Administration and Admin Command Logging, http://storagecommunity.org/easyblog/entry/spectrum-scale-security-blog-series-sudo-based-secure-administration-and-admin-command-logging IBM Spectrum Scale Security: Security Features of Transparent Cloud Tiering (TCT), http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-security-features-of-transparent-cloud-tiering-tct IBM Spectrum Scale: Immutability, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-immutability IBM Spectrum Scale : FILE protocols authentication http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-file-protocols-authentication IBM Spectrum Scale : Object Authentication, http://storagecommunity.org/easyblog/entry/protocol-authentication-object, IBM Spectrum Scale Security: Anti-Virus bulk scanning, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-anti-virus-bulk-scanning, Spectrum Scale 4.2.1 - What's New http://storagecommunity.org/easyblog/entry/spectrum-scale-4-2-1-what-s-new IBM Spectrum Scale 4.2.1 : diving deeper, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-4-2-1-diving-deeper NEW DEMO: Using IBM Cloud Object Storage as IBM Spectrum Scale Transparent Cloud Tier, http://storagecommunity.org/easyblog/entry/new-demo-using-ibm-cloud-object-storage-as-ibm-spectrum-scale-transparent-cloud-tier Spectrum Scale transparent cloud tiering, http://storagecommunity.org/easyblog/entry/spectrum-scale-transparent-cloud-tiering Spectrum Scale in Wonderland - Introducing transparent cloud tiering with Spectrum Scale 4.2.1, http://storagecommunity.org/easyblog/entry/spectrum-scale-in-wonderland, Spectrum Scale Object Related Blogs IBM Spectrum Scale 4.2.1 - What's new in Object, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-4-2-1-what-s-new-in-object, Hot cakes or hot objects, they better be served fast http://storagecommunity.org/easyblog/entry/hot-cakes-or-hot-objects-they-better-be-served-fast IBM Spectrum Scale: Object (OpenStack Swift, 
S3) Authorization, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-object-openstack-swift-s3-authorization, IBM Spectrum Scale : Object Authentication, http://storagecommunity.org/easyblog/entry/protocol-authentication-object, Spectrum Scale BD&A IBM Spectrum Scale: new features of HDFS Transparency, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-new-features-of-hdfs-transparency, Regards, ------------------------------------------------------------------------ Gaurang S Tapase Spectrum Scale & OpenStack Development IBM India Storage Lab, Pune (India) Email : gaurang.tapase at in.ibm.com Phone : +91-20-42025699 (W), +91-9860082042(Cell) ------------------------------------------------------------------------- -------------- next part -------------- An HTML attachment was scrubbed... URL: From douglasof at us.ibm.com Mon Aug 29 06:34:03 2016 From: douglasof at us.ibm.com (Douglas O'flaherty) Date: Sun, 28 Aug 2016 22:34:03 -0700 Subject: [gpfsug-discuss] Edge Attendees Message-ID: Greetings: I am organizing an NDA round-table with the IBM Offering Managers at IBM Edge on Tuesday, September 20th at 1pm. The subject will be "The Future of IBM Spectrum Scale." IBM Offering Managers are the Product Owners at IBM. There will be discussions covering licensing, the roadmap for IBM Spectrum Scale RAID (aka GNR), new hardware platforms, etc. This is a unique opportunity to get feedback to the drivers of the IBM Spectrum Scale business plans. It should be a great companion to the content we get from Engineering and Research at most User Group meetings. To get an invitation, please email me privately at douglasof us.ibm.com. All who have a valid NDA are invited. I only need an approximate headcount of attendees. Try not to spam the mailing list. I am pushing to get the Offering Managers to have a similar session at SC16 as an IBM Multi-client Briefing. You can add your voice to that call on this thread, or email me directly. Spectrum Scale User Group at SC16 will once again take place on Sunday afternoon with cocktails to follow. I hope we can blow out the attendance numbers and the number of site speakers we had last year! I know Simon, Bob, and Kristy are already working the agenda. Get your ideas in to them or to me. See you in Vegas, Vegas, SLC, Vegas this Fall... Maybe Australia in between? doug Douglas O'Flaherty IBM Spectrum Storage Marketing -------------- next part -------------- An HTML attachment was scrubbed... URL: From stef.coene at docum.org Mon Aug 29 07:39:05 2016 From: stef.coene at docum.org (Stef Coene) Date: Mon, 29 Aug 2016 08:39:05 +0200 Subject: [gpfsug-discuss] Blogs and publications on Spectrum Scale In-Reply-To: References: Message-ID: <9bb8d52e-86a3-3ff7-daaf-dc6bf0a3bd82@docum.org> Hi, When trying to register on the website, I each time get the error: "Session expired. Please try again later." Stef From kraemerf at de.ibm.com Mon Aug 29 08:20:46 2016 From: kraemerf at de.ibm.com (Frank Kraemer) Date: Mon, 29 Aug 2016 09:20:46 +0200 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" Message-ID: Hi all, In the last months several customers were asking for the option to use multiple IBM Spectrum Protect servers to protect a single IBM Spectrum Scale file system. Some of these customer reached the server scalability limits, others wanted to increase the parallelism of the server housekeeping processes. 
In consideration of the significant grow of data it can be assumed that more and more customers will be faced with this challenge in the future. Therefore, this paper was written that helps to address this situation. This paper describes the setup and configuration of multiple IBM Spectrum Protect servers to be used to store backup and hsm data of a single IBM Spectrum Scale file system. Beside the setup and configuration several best practices were written to the paper that help to simplify the daily use and administration of such environments. Find the paper here: https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection A big THANK YOU goes to my co-writers Thomas Schreiber and Patrick Luft for their important input and all the tests (...and re-tests and re-tests and re-tests :-) ) they did. ...please share in your communities. Greetings, Dominic. ______________________________________________________________________________________________________________ Dominic Mueller-Wicke | IBM Spectrum Protect Development | Technical Lead | +49 7034 64 32794 | dominic.mueller at de.ibm.com Vorsitzende des Aufsichtsrats: Martina Koederitz; Gesch?ftsf?hrung: Dirk Wittkopp Sitz der Gesellschaft: B?blingen; Registergericht: Amtsgericht Stuttgart, HRB 243294 -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Mon Aug 29 18:33:59 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 29 Aug 2016 13:33:59 -0400 Subject: [gpfsug-discuss] iowait? Message-ID: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> Hi Everyone, Would it be easy to have GPFS report iowait values in linux? This would be a huge help for us in determining whether a node's low utilization is due to some issue with the code running on it or if it's blocked on I/O, especially in a historical context. I naively tried on a test system changing schedule() in cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: again: /* call the scheduler */ if ( waitFlags & INTERRUPTIBLE ) schedule(); else io_schedule(); Seems to actually do what I'm after but generally bad things happen when I start pretending I'm a kernel developer. Any thoughts? If I open an RFE would this be something that's relatively easy to implement (not asking for a commitment *to* implement it, just that I'm not asking for something seemingly simple that's actually fairly hard to implement)? -Aaron -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From chekh at stanford.edu Mon Aug 29 18:50:23 2016 From: chekh at stanford.edu (Alex Chekholko) Date: Mon, 29 Aug 2016 10:50:23 -0700 Subject: [gpfsug-discuss] iowait? In-Reply-To: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> Message-ID: <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> Any reason you can't just use iostat or collectl or any of a number of other standards tools to look at disk utilization? On 08/29/2016 10:33 AM, Aaron Knister wrote: > Hi Everyone, > > Would it be easy to have GPFS report iowait values in linux? This would > be a huge help for us in determining whether a node's low utilization is > due to some issue with the code running on it or if it's blocked on I/O, > especially in a historical context. 
> > I naively tried on a test system changing schedule() in > cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: > > again: > /* call the scheduler */ > if ( waitFlags & INTERRUPTIBLE ) > schedule(); > else > io_schedule(); > > Seems to actually do what I'm after but generally bad things happen when > I start pretending I'm a kernel developer. > > Any thoughts? If I open an RFE would this be something that's relatively > easy to implement (not asking for a commitment *to* implement it, just > that I'm not asking for something seemingly simple that's actually > fairly hard to implement)? > > -Aaron > -- Alex Chekholko chekh at stanford.edu From aaron.s.knister at nasa.gov Mon Aug 29 18:54:12 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 29 Aug 2016 13:54:12 -0400 Subject: [gpfsug-discuss] iowait? In-Reply-To: <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> Message-ID: <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. On 8/29/16 1:50 PM, Alex Chekholko wrote: > Any reason you can't just use iostat or collectl or any of a number of > other standards tools to look at disk utilization? > > On 08/29/2016 10:33 AM, Aaron Knister wrote: >> Hi Everyone, >> >> Would it be easy to have GPFS report iowait values in linux? This would >> be a huge help for us in determining whether a node's low utilization is >> due to some issue with the code running on it or if it's blocked on I/O, >> especially in a historical context. >> >> I naively tried on a test system changing schedule() in >> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >> >> again: >> /* call the scheduler */ >> if ( waitFlags & INTERRUPTIBLE ) >> schedule(); >> else >> io_schedule(); >> >> Seems to actually do what I'm after but generally bad things happen when >> I start pretending I'm a kernel developer. >> >> Any thoughts? If I open an RFE would this be something that's relatively >> easy to implement (not asking for a commitment *to* implement it, just >> that I'm not asking for something seemingly simple that's actually >> fairly hard to implement)? >> >> -Aaron >> > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From bbanister at jumptrading.com Mon Aug 29 18:56:25 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Mon, 29 Aug 2016 17:56:25 +0000 Subject: [gpfsug-discuss] iowait? 
In-Reply-To: <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> There is the iohist data that may have what you're looking for, -Bryan -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister Sent: Monday, August 29, 2016 12:54 PM To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] iowait? Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. On 8/29/16 1:50 PM, Alex Chekholko wrote: > Any reason you can't just use iostat or collectl or any of a number of > other standards tools to look at disk utilization? > > On 08/29/2016 10:33 AM, Aaron Knister wrote: >> Hi Everyone, >> >> Would it be easy to have GPFS report iowait values in linux? This >> would be a huge help for us in determining whether a node's low >> utilization is due to some issue with the code running on it or if >> it's blocked on I/O, especially in a historical context. >> >> I naively tried on a test system changing schedule() in >> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >> >> again: >> /* call the scheduler */ >> if ( waitFlags & INTERRUPTIBLE ) >> schedule(); >> else >> io_schedule(); >> >> Seems to actually do what I'm after but generally bad things happen >> when I start pretending I'm a kernel developer. >> >> Any thoughts? If I open an RFE would this be something that's >> relatively easy to implement (not asking for a commitment *to* >> implement it, just that I'm not asking for something seemingly simple >> that's actually fairly hard to implement)? >> >> -Aaron >> > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. From olaf.weiser at de.ibm.com Mon Aug 29 19:02:38 2016 From: olaf.weiser at de.ibm.com (Olaf Weiser) Date: Mon, 29 Aug 2016 20:02:38 +0200 Subject: [gpfsug-discuss] iowait? 
In-Reply-To: <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov><7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> Message-ID: An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Mon Aug 29 19:04:32 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 29 Aug 2016 14:04:32 -0400 Subject: [gpfsug-discuss] iowait? In-Reply-To: <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: <7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> That's an interesting idea. I took a look at mmdig --iohist on a busy node it doesn't seem to capture more than literally 1 second of history. Is there a better way to grab the data or have gpfs capture more of it? Just to give some more context, as part of our monthly reporting requirements we calculate job efficiency by comparing the number of cpu cores requested by a given job with the cpu % utilization during that job's time window. Currently a job that's doing a sleep 9000 would show up the same as a job blocked on I/O. Having GPFS wait time included in iowait would allow us to easily make this distinction. -Aaron On 8/29/16 1:56 PM, Bryan Banister wrote: > There is the iohist data that may have what you're looking for, > -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister > Sent: Monday, August 29, 2016 12:54 PM > To: gpfsug-discuss at spectrumscale.org > Subject: Re: [gpfsug-discuss] iowait? > > Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. > > On 8/29/16 1:50 PM, Alex Chekholko wrote: >> Any reason you can't just use iostat or collectl or any of a number of >> other standards tools to look at disk utilization? >> >> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>> Hi Everyone, >>> >>> Would it be easy to have GPFS report iowait values in linux? This >>> would be a huge help for us in determining whether a node's low >>> utilization is due to some issue with the code running on it or if >>> it's blocked on I/O, especially in a historical context. >>> >>> I naively tried on a test system changing schedule() in >>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>> >>> again: >>> /* call the scheduler */ >>> if ( waitFlags & INTERRUPTIBLE ) >>> schedule(); >>> else >>> io_schedule(); >>> >>> Seems to actually do what I'm after but generally bad things happen >>> when I start pretending I'm a kernel developer. >>> >>> Any thoughts? If I open an RFE would this be something that's >>> relatively easy to implement (not asking for a commitment *to* >>> implement it, just that I'm not asking for something seemingly simple >>> that's actually fairly hard to implement)? 
>>> >>> -Aaron >>> >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From bbanister at jumptrading.com Mon Aug 29 19:06:36 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Mon, 29 Aug 2016 18:06:36 +0000 Subject: [gpfsug-discuss] iowait? In-Reply-To: <7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> <7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> Try this: mmchconfig ioHistorySize=1024 # Or however big you want! Cheers, -Bryan -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister Sent: Monday, August 29, 2016 1:05 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] iowait? That's an interesting idea. I took a look at mmdig --iohist on a busy node it doesn't seem to capture more than literally 1 second of history. Is there a better way to grab the data or have gpfs capture more of it? Just to give some more context, as part of our monthly reporting requirements we calculate job efficiency by comparing the number of cpu cores requested by a given job with the cpu % utilization during that job's time window. Currently a job that's doing a sleep 9000 would show up the same as a job blocked on I/O. Having GPFS wait time included in iowait would allow us to easily make this distinction. -Aaron On 8/29/16 1:56 PM, Bryan Banister wrote: > There is the iohist data that may have what you're looking for, -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org > [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron > Knister > Sent: Monday, August 29, 2016 12:54 PM > To: gpfsug-discuss at spectrumscale.org > Subject: Re: [gpfsug-discuss] iowait? > > Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. 
That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. > > On 8/29/16 1:50 PM, Alex Chekholko wrote: >> Any reason you can't just use iostat or collectl or any of a number >> of other standards tools to look at disk utilization? >> >> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>> Hi Everyone, >>> >>> Would it be easy to have GPFS report iowait values in linux? This >>> would be a huge help for us in determining whether a node's low >>> utilization is due to some issue with the code running on it or if >>> it's blocked on I/O, especially in a historical context. >>> >>> I naively tried on a test system changing schedule() in >>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>> >>> again: >>> /* call the scheduler */ >>> if ( waitFlags & INTERRUPTIBLE ) >>> schedule(); >>> else >>> io_schedule(); >>> >>> Seems to actually do what I'm after but generally bad things happen >>> when I start pretending I'm a kernel developer. >>> >>> Any thoughts? If I open an RFE would this be something that's >>> relatively easy to implement (not asking for a commitment *to* >>> implement it, just that I'm not asking for something seemingly >>> simple that's actually fairly hard to implement)? >>> >>> -Aaron >>> >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight > Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. 
The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. From aaron.s.knister at nasa.gov Mon Aug 29 19:09:36 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 29 Aug 2016 14:09:36 -0400 Subject: [gpfsug-discuss] iowait? In-Reply-To: <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> <7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: <5f563924-61bb-9623-aa84-02d97bd8f379@nasa.gov> Nice! Thanks Bryan. I wonder what the implications are of setting it to something high enough that we could capture data every 10s. I figure if 512 events only takes me to 1 second I would need to log in the realm of 10k to capture every 10 seconds and account for spikes in I/O. -Aaron On 8/29/16 2:06 PM, Bryan Banister wrote: > Try this: > > mmchconfig ioHistorySize=1024 # Or however big you want! > > Cheers, > -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister > Sent: Monday, August 29, 2016 1:05 PM > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] iowait? > > That's an interesting idea. I took a look at mmdig --iohist on a busy node it doesn't seem to capture more than literally 1 second of history. > Is there a better way to grab the data or have gpfs capture more of it? > > Just to give some more context, as part of our monthly reporting requirements we calculate job efficiency by comparing the number of cpu cores requested by a given job with the cpu % utilization during that job's time window. Currently a job that's doing a sleep 9000 would show up the same as a job blocked on I/O. Having GPFS wait time included in iowait would allow us to easily make this distinction. > > -Aaron > > On 8/29/16 1:56 PM, Bryan Banister wrote: >> There is the iohist data that may have what you're looking for, -Bryan >> >> -----Original Message----- >> From: gpfsug-discuss-bounces at spectrumscale.org >> [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron >> Knister >> Sent: Monday, August 29, 2016 12:54 PM >> To: gpfsug-discuss at spectrumscale.org >> Subject: Re: [gpfsug-discuss] iowait? >> >> Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. >> >> On 8/29/16 1:50 PM, Alex Chekholko wrote: >>> Any reason you can't just use iostat or collectl or any of a number >>> of other standards tools to look at disk utilization? >>> >>> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>>> Hi Everyone, >>>> >>>> Would it be easy to have GPFS report iowait values in linux? 
This >>>> would be a huge help for us in determining whether a node's low >>>> utilization is due to some issue with the code running on it or if >>>> it's blocked on I/O, especially in a historical context. >>>> >>>> I naively tried on a test system changing schedule() in >>>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>>> >>>> again: >>>> /* call the scheduler */ >>>> if ( waitFlags & INTERRUPTIBLE ) >>>> schedule(); >>>> else >>>> io_schedule(); >>>> >>>> Seems to actually do what I'm after but generally bad things happen >>>> when I start pretending I'm a kernel developer. >>>> >>>> Any thoughts? If I open an RFE would this be something that's >>>> relatively easy to implement (not asking for a commitment *to* >>>> implement it, just that I'm not asking for something seemingly >>>> simple that's actually fairly hard to implement)? >>>> >>>> -Aaron >>>> >>> >> >> -- >> Aaron Knister >> NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight >> Center >> (301) 286-2776 >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> ________________________________ >> >> Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. 
> _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From bbanister at jumptrading.com Mon Aug 29 19:11:05 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Mon, 29 Aug 2016 18:11:05 +0000 Subject: [gpfsug-discuss] iowait? In-Reply-To: <5f563924-61bb-9623-aa84-02d97bd8f379@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> <7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> <5f563924-61bb-9623-aa84-02d97bd8f379@nasa.gov> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB063147A9@CHI-EXCHANGEW1.w2k.jumptrading.com> That's a good question, but I don't expect it should cause you much of a problem. Of course testing and trying to measure any impact would be wise, -Bryan -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister Sent: Monday, August 29, 2016 1:10 PM To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] iowait? Nice! Thanks Bryan. I wonder what the implications are of setting it to something high enough that we could capture data every 10s. I figure if 512 events only takes me to 1 second I would need to log in the realm of 10k to capture every 10 seconds and account for spikes in I/O. -Aaron On 8/29/16 2:06 PM, Bryan Banister wrote: > Try this: > > mmchconfig ioHistorySize=1024 # Or however big you want! > > Cheers, > -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org > [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron > Knister > Sent: Monday, August 29, 2016 1:05 PM > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] iowait? > > That's an interesting idea. I took a look at mmdig --iohist on a busy node it doesn't seem to capture more than literally 1 second of history. > Is there a better way to grab the data or have gpfs capture more of it? > > Just to give some more context, as part of our monthly reporting requirements we calculate job efficiency by comparing the number of cpu cores requested by a given job with the cpu % utilization during that job's time window. Currently a job that's doing a sleep 9000 would show up the same as a job blocked on I/O. Having GPFS wait time included in iowait would allow us to easily make this distinction. > > -Aaron > > On 8/29/16 1:56 PM, Bryan Banister wrote: >> There is the iohist data that may have what you're looking for, >> -Bryan >> >> -----Original Message----- >> From: gpfsug-discuss-bounces at spectrumscale.org >> [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron >> Knister >> Sent: Monday, August 29, 2016 12:54 PM >> To: gpfsug-discuss at spectrumscale.org >> Subject: Re: [gpfsug-discuss] iowait? >> >> Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. 
Having GPFS report iowait on client nodes would give us this insight. >> >> On 8/29/16 1:50 PM, Alex Chekholko wrote: >>> Any reason you can't just use iostat or collectl or any of a number >>> of other standards tools to look at disk utilization? >>> >>> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>>> Hi Everyone, >>>> >>>> Would it be easy to have GPFS report iowait values in linux? This >>>> would be a huge help for us in determining whether a node's low >>>> utilization is due to some issue with the code running on it or if >>>> it's blocked on I/O, especially in a historical context. >>>> >>>> I naively tried on a test system changing schedule() in >>>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>>> >>>> again: >>>> /* call the scheduler */ >>>> if ( waitFlags & INTERRUPTIBLE ) >>>> schedule(); >>>> else >>>> io_schedule(); >>>> >>>> Seems to actually do what I'm after but generally bad things happen >>>> when I start pretending I'm a kernel developer. >>>> >>>> Any thoughts? If I open an RFE would this be something that's >>>> relatively easy to implement (not asking for a commitment *to* >>>> implement it, just that I'm not asking for something seemingly >>>> simple that's actually fairly hard to implement)? >>>> >>>> -Aaron >>>> >>> >> >> -- >> Aaron Knister >> NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight >> Center >> (301) 286-2776 >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> ________________________________ >> >> Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight > Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. 
This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. From sfadden at us.ibm.com Mon Aug 29 20:33:14 2016 From: sfadden at us.ibm.com (Scott Fadden) Date: Mon, 29 Aug 2016 12:33:14 -0700 Subject: [gpfsug-discuss] iowait? In-Reply-To: <5f563924-61bb-9623-aa84-02d97bd8f379@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov><7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu><5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov><21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com><7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov><21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> <5f563924-61bb-9623-aa84-02d97bd8f379@nasa.gov> Message-ID: There is a known performance issue that can possibly cause longer than expected network time-outs if you are running iohist too often. So be careful it is best to collect it as a sample, instead of all of the time. Scott Fadden Spectrum Scale - Technical Marketing Phone: (503) 880-5833 sfadden at us.ibm.com http://www.ibm.com/systems/storage/spectrum/scale From: Aaron Knister To: Date: 08/29/2016 11:09 AM Subject: Re: [gpfsug-discuss] iowait? Sent by: gpfsug-discuss-bounces at spectrumscale.org Nice! Thanks Bryan. I wonder what the implications are of setting it to something high enough that we could capture data every 10s. I figure if 512 events only takes me to 1 second I would need to log in the realm of 10k to capture every 10 seconds and account for spikes in I/O. -Aaron On 8/29/16 2:06 PM, Bryan Banister wrote: > Try this: > > mmchconfig ioHistorySize=1024 # Or however big you want! > > Cheers, > -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [ mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister > Sent: Monday, August 29, 2016 1:05 PM > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] iowait? > > That's an interesting idea. I took a look at mmdig --iohist on a busy node it doesn't seem to capture more than literally 1 second of history. > Is there a better way to grab the data or have gpfs capture more of it? 
> > Just to give some more context, as part of our monthly reporting requirements we calculate job efficiency by comparing the number of cpu cores requested by a given job with the cpu % utilization during that job's time window. Currently a job that's doing a sleep 9000 would show up the same as a job blocked on I/O. Having GPFS wait time included in iowait would allow us to easily make this distinction. > > -Aaron > > On 8/29/16 1:56 PM, Bryan Banister wrote: >> There is the iohist data that may have what you're looking for, -Bryan >> >> -----Original Message----- >> From: gpfsug-discuss-bounces at spectrumscale.org >> [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron >> Knister >> Sent: Monday, August 29, 2016 12:54 PM >> To: gpfsug-discuss at spectrumscale.org >> Subject: Re: [gpfsug-discuss] iowait? >> >> Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. >> >> On 8/29/16 1:50 PM, Alex Chekholko wrote: >>> Any reason you can't just use iostat or collectl or any of a number >>> of other standards tools to look at disk utilization? >>> >>> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>>> Hi Everyone, >>>> >>>> Would it be easy to have GPFS report iowait values in linux? This >>>> would be a huge help for us in determining whether a node's low >>>> utilization is due to some issue with the code running on it or if >>>> it's blocked on I/O, especially in a historical context. >>>> >>>> I naively tried on a test system changing schedule() in >>>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>>> >>>> again: >>>> /* call the scheduler */ >>>> if ( waitFlags & INTERRUPTIBLE ) >>>> schedule(); >>>> else >>>> io_schedule(); >>>> >>>> Seems to actually do what I'm after but generally bad things happen >>>> when I start pretending I'm a kernel developer. >>>> >>>> Any thoughts? If I open an RFE would this be something that's >>>> relatively easy to implement (not asking for a commitment *to* >>>> implement it, just that I'm not asking for something seemingly >>>> simple that's actually fairly hard to implement)? >>>> >>>> -Aaron >>>> >>> >> >> -- >> Aaron Knister >> NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight >> Center >> (301) 286-2776 >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> ________________________________ >> >> Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. 
This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From S.J.Thompson at bham.ac.uk Mon Aug 29 20:37:13 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Mon, 29 Aug 2016 19:37:13 +0000 Subject: [gpfsug-discuss] CES mmsmb options In-Reply-To: References: Message-ID: Hi Richard, You can of course change any of the other options with the "net conf" (/usr/lpp/mmfs/bin/net conf) command. As its just stored in the Samba registry. Of course whether or not you end up with a supported configuration is a different matter... When we first rolled out CES/SMB, there were a number of issues with setting it up in the way we needed for our environment (AD for auth, LDAP for identity) which at the time wasn't available through the config tools. I believe this has now changed though I haven't gone back and "reset" our configs. Simon ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sobey, Richard A [r.sobey at imperial.ac.uk] Sent: 22 August 2016 14:28 To: 'gpfsug-discuss at spectrumscale.org' Subject: [gpfsug-discuss] CES mmsmb options Related to my previous question in so far as it?s to do with CES, what?s this all about: [root at ces]# mmsmb config change --key-info supported Supported smb options with allowed values: gpfs:dfreequota = yes, no restrict anonymous = 0, 2 server string = any mmsmb config list shows many more options. Are they static? for example log size / location / dmapi support? 
I?m surely missing something obvious. It?s SS 4.2.0 btw. Thanks Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From usa-principal at gpfsug.org Mon Aug 29 21:13:51 2016 From: usa-principal at gpfsug.org (Spectrum Scale Users Group - USA Principal Kristy Kallback-Rose) Date: Mon, 29 Aug 2016 16:13:51 -0400 Subject: [gpfsug-discuss] SC16 Hold the Date - Spectrum Scale (GPFS) Users Group Event Message-ID: <648FFF79-343D-447E-9CC5-4E0199C29572@gpfsug.org> Hello, I know many of you may be planning your SC16 schedule already. We wanted to give you a heads up that a Spectrum Scale (GPFS) Users Group event is being planned. The event will be much like last year?s event with a combination of technical updates and user experiences and thus far is loosely planned for: Sunday (11/13) ~12p - ~5 PM with a social hour after the meeting. We hope to see you there. More details as planning progresses. Best, Kristy & Bob From S.J.Thompson at bham.ac.uk Mon Aug 29 21:27:28 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Mon, 29 Aug 2016 20:27:28 +0000 Subject: [gpfsug-discuss] SC16 Hold the Date - Spectrum Scale (GPFS) Users Group Event In-Reply-To: <648FFF79-343D-447E-9CC5-4E0199C29572@gpfsug.org> References: <648FFF79-343D-447E-9CC5-4E0199C29572@gpfsug.org> Message-ID: You may also be interested in a panel session on the Friday of SC16: http://sc16.supercomputing.org/presentation/?id=pan120&sess=sess185 This isn't a user group event, but part of the technical programme for SC16, though I'm sure you will recognise some of the names from the storage community. Moderator: Simon Thompson (me) Panel: Sven Oehme (IBM Research) James Coomer (DDN) Sage Weil (RedHat/CEPH) Colin Morey (Hartree/STFC) Pam Gilman (NCAR) Martin Gasthuber (DESY) Friday 8:30 - 10:00 Simon From volobuev at us.ibm.com Mon Aug 29 21:31:17 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Mon, 29 Aug 2016 13:31:17 -0700 Subject: [gpfsug-discuss] iowait? In-Reply-To: <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov><7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu><5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov><21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com><7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: I would advise caution on using "mmdiag --iohist" heavily. In more recent code streams (V4.1, V4.2) there's a problem with internal locking that could, under certain conditions could lead to the symptoms that look very similar to sporadic network blockage. Basically, if "mmdiag --iohist" gets blocked for long periods of time (e.g. due to local disk/NFS performance issues), this may end up blocking an mmfsd receiver thread, delaying RPC processing. The problem was discovered fairly recently, and the fix hasn't made it out to all service streams yet. More generally, IO history is a valuable tool for troubleshooting disk IO performance issues, but the tool doesn't have the right semantics for regular, systemic IO performance sampling and monitoring. The query operation is too expensive, the coverage is subject to load, and the output is somewhat unstructured. With some effort, one can still build some form of a roll-your-own monitoring implement, but this is certainly not an optimal way of approaching the problem. 
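As an illustration of that kind of roll-your-own sampler (and of its limitations, not a supported tool): a minimal Python sketch that polls "mmdiag --iohist" at a generous interval and appends the raw output for later aggregation. The log path and interval are arbitrary example values, the script must run as root on the node being sampled, and, per the caution above, the interval should stay long enough that the query itself cannot back up the daemon.

import datetime
import subprocess
import time

MMDIAG = "/usr/lpp/mmfs/bin/mmdiag"
INTERVAL = 30                                   # seconds; keep this generous
LOGFILE = "/var/log/gpfs-iohist-samples.log"    # arbitrary example path

def sample_once(log):
    stamp = datetime.datetime.now().isoformat()
    try:
        # One short-lived query per interval; never run this in a tight loop.
        out = subprocess.check_output([MMDIAG, "--iohist"])
    except Exception as exc:
        log.write("%s sample failed: %s\n" % (stamp, exc))
        return
    log.write("=== %s ===\n" % stamp)
    log.write(out.decode("utf-8", "replace"))

if __name__ == "__main__":
    with open(LOGFILE, "a") as log:
        while True:
            sample_once(log)
            log.flush()
            time.sleep(INTERVAL)

Pairing a sampler like this with a larger ioHistorySize, as discussed earlier in the thread, reduces the chance of missing events between polls.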
The data should be available in a structured form, through a channel that supports light-weight, flexible querying that doesn't impact mainline IO processing. In Spectrum Scale, this type of data is fed from mmfsd to Zimon, via an mmpmon interface, and end users can then query Zimon for raw or partially processed data. Where it comes to high-volume stats, retaining raw data at its full resolution is only practical for relatively short periods of time (seconds, or perhaps a small number of minutes), and some form of aggregation is necessary for covering longer periods of time (hours to days). In the current versions of the product, there's a very similar type of data available this way: RPC stats. There are plans to make IO history data available in a similar fashion. The entire approach may need to be re-calibrated, however. Making RPC stats available doesn't appear to have generated a surge of user interest. This is probably because the data is too complex for casual processing, and while without doubt a lot of very valuable insight can be gained by analyzing RPC stats, the actual effort required to do so is too much for most users. That is, we need to provide some tools for raw data analytics. Largely the same argument applies to IO stats. In fact, on an NSD client IO stats are actually a subset of RPC stats. With some effort, one can perform a comprehensive analysis of NSD client IO stats by analyzing NSD client-to-server RPC traffic. One can certainly argue that the effort required is a bit much though. Getting back to the original question: would the proposed cxiWaitEventWait () change work? It'll likely result in nr_iowait being incremented every time a thread in GPFS code performs an uninterruptible wait. This could be an act of performing an actual IO request, or something else, e.g. waiting for a lock. Those may be the desirable semantics in some scenarios, but I wouldn't agree that it's the right behavior for any uninterruptible wait. io_schedule() is intended for use for block device IO waits, so using it this way is not in line with the code intent, which is never a good idea. Besides, relative to schedule(), io_schedule() has some overhead that could have performance implications of an uncertain nature. yuri From: Bryan Banister To: gpfsug main discussion list , Date: 08/29/2016 11:06 AM Subject: Re: [gpfsug-discuss] iowait? Sent by: gpfsug-discuss-bounces at spectrumscale.org Try this: mmchconfig ioHistorySize=1024 # Or however big you want! Cheers, -Bryan -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [ mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister Sent: Monday, August 29, 2016 1:05 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] iowait? That's an interesting idea. I took a look at mmdig --iohist on a busy node it doesn't seem to capture more than literally 1 second of history. Is there a better way to grab the data or have gpfs capture more of it? Just to give some more context, as part of our monthly reporting requirements we calculate job efficiency by comparing the number of cpu cores requested by a given job with the cpu % utilization during that job's time window. Currently a job that's doing a sleep 9000 would show up the same as a job blocked on I/O. Having GPFS wait time included in iowait would allow us to easily make this distinction. 
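As a concrete example of pulling structured data over that mmpmon channel today (cumulative I/O counters rather than wait times, which is exactly the gap under discussion): a minimal sketch using the documented fs_io_s request with parseable (-p) output. It assumes root access and the usual /usr/lpp/mmfs/bin path; the keyword names (_fs_, _br_, _bw_, _rdc_, _wc_) are the documented fs_io_s fields for file system name, bytes read/written and read/write calls.

import subprocess

MMPMON = "/usr/lpp/mmfs/bin/mmpmon"

def fs_io_counters():
    # Feed the fs_io_s request on stdin; -p asks for machine-readable
    # keyword/value output, one _fs_io_s_ line per mounted file system.
    proc = subprocess.Popen([MMPMON, "-p"],
                            stdin=subprocess.PIPE,
                            stdout=subprocess.PIPE)
    out, _ = proc.communicate(b"fs_io_s\n")
    rows = []
    for line in out.decode("utf-8", "replace").splitlines():
        if line.startswith("_fs_io_s_"):
            fields = line.split()
            # After the response tag the line alternates keyword/value:
            # ... _fs_ <fsname> _br_ <bytes read> _bw_ <bytes written> ...
            rows.append(dict(zip(fields[1::2], fields[2::2])))
    return rows

if __name__ == "__main__":
    for row in fs_io_counters():
        print("%s bytes_read=%s bytes_written=%s"
              % (row.get("_fs_"), row.get("_br_"), row.get("_bw_")))

Deltas between successive polls give per-file-system rates; this is the same underlying interface that feeds Zimon, as noted above.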
-Aaron On 8/29/16 1:56 PM, Bryan Banister wrote: > There is the iohist data that may have what you're looking for, -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org > [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron > Knister > Sent: Monday, August 29, 2016 12:54 PM > To: gpfsug-discuss at spectrumscale.org > Subject: Re: [gpfsug-discuss] iowait? > > Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. > > On 8/29/16 1:50 PM, Alex Chekholko wrote: >> Any reason you can't just use iostat or collectl or any of a number >> of other standards tools to look at disk utilization? >> >> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>> Hi Everyone, >>> >>> Would it be easy to have GPFS report iowait values in linux? This >>> would be a huge help for us in determining whether a node's low >>> utilization is due to some issue with the code running on it or if >>> it's blocked on I/O, especially in a historical context. >>> >>> I naively tried on a test system changing schedule() in >>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>> >>> again: >>> /* call the scheduler */ >>> if ( waitFlags & INTERRUPTIBLE ) >>> schedule(); >>> else >>> io_schedule(); >>> >>> Seems to actually do what I'm after but generally bad things happen >>> when I start pretending I'm a kernel developer. >>> >>> Any thoughts? If I open an RFE would this be something that's >>> relatively easy to implement (not asking for a commitment *to* >>> implement it, just that I'm not asking for something seemingly >>> simple that's actually fairly hard to implement)? >>> >>> -Aaron >>> >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight > Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. 
> _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From aaron.s.knister at nasa.gov Mon Aug 29 23:58:34 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 29 Aug 2016 18:58:34 -0400 Subject: [gpfsug-discuss] iowait? In-Reply-To: References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> <7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: <8ec95af4-4d30-a904-4ba2-cf253460754a@nasa.gov> Thanks Yuri! I thought calling io_schedule was the right thing to do because the nfs client in the kernel did this directly until fairly recently. Now it calls wait_on_bit_io which I believe ultimately calls io_schedule. Do you see a more targeted approach for having GPFS register IO wait as something that's feasible? (e.g. not registering iowait for locks, as you suggested, but doing so for file/directory operations such as read/write/readdir?) -Aaron On 8/29/16 4:31 PM, Yuri L Volobuev wrote: > I would advise caution on using "mmdiag --iohist" heavily. In more > recent code streams (V4.1, V4.2) there's a problem with internal locking > that could, under certain conditions could lead to the symptoms that > look very similar to sporadic network blockage. Basically, if "mmdiag > --iohist" gets blocked for long periods of time (e.g. due to local > disk/NFS performance issues), this may end up blocking an mmfsd receiver > thread, delaying RPC processing. The problem was discovered fairly > recently, and the fix hasn't made it out to all service streams yet. 
> > More generally, IO history is a valuable tool for troubleshooting disk > IO performance issues, but the tool doesn't have the right semantics for > regular, systemic IO performance sampling and monitoring. The query > operation is too expensive, the coverage is subject to load, and the > output is somewhat unstructured. With some effort, one can still build > some form of a roll-your-own monitoring implement, but this is certainly > not an optimal way of approaching the problem. The data should be > available in a structured form, through a channel that supports > light-weight, flexible querying that doesn't impact mainline IO > processing. In Spectrum Scale, this type of data is fed from mmfsd to > Zimon, via an mmpmon interface, and end users can then query Zimon for > raw or partially processed data. Where it comes to high-volume stats, > retaining raw data at its full resolution is only practical for > relatively short periods of time (seconds, or perhaps a small number of > minutes), and some form of aggregation is necessary for covering longer > periods of time (hours to days). In the current versions of the product, > there's a very similar type of data available this way: RPC stats. There > are plans to make IO history data available in a similar fashion. The > entire approach may need to be re-calibrated, however. Making RPC stats > available doesn't appear to have generated a surge of user interest. > This is probably because the data is too complex for casual processing, > and while without doubt a lot of very valuable insight can be gained by > analyzing RPC stats, the actual effort required to do so is too much for > most users. That is, we need to provide some tools for raw data > analytics. Largely the same argument applies to IO stats. In fact, on an > NSD client IO stats are actually a subset of RPC stats. With some > effort, one can perform a comprehensive analysis of NSD client IO stats > by analyzing NSD client-to-server RPC traffic. One can certainly argue > that the effort required is a bit much though. > > Getting back to the original question: would the proposed > cxiWaitEventWait() change work? It'll likely result in nr_iowait being > incremented every time a thread in GPFS code performs an uninterruptible > wait. This could be an act of performing an actual IO request, or > something else, e.g. waiting for a lock. Those may be the desirable > semantics in some scenarios, but I wouldn't agree that it's the right > behavior for any uninterruptible wait. io_schedule() is intended for use > for block device IO waits, so using it this way is not in line with the > code intent, which is never a good idea. Besides, relative to > schedule(), io_schedule() has some overhead that could have performance > implications of an uncertain nature. > > yuri > > Inactive hide details for Bryan Banister ---08/29/2016 11:06:59 AM---Try > this: mmchconfig ioHistorySize=1024 # Or however big yBryan Banister > ---08/29/2016 11:06:59 AM---Try this: mmchconfig ioHistorySize=1024 # Or > however big you want! > > From: Bryan Banister > To: gpfsug main discussion list , > Date: 08/29/2016 11:06 AM > Subject: Re: [gpfsug-discuss] iowait? > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------------------------------------------------ > > > > Try this: > > mmchconfig ioHistorySize=1024 # Or however big you want! 
> > Cheers, > -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org > [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister > Sent: Monday, August 29, 2016 1:05 PM > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] iowait? > > That's an interesting idea. I took a look at mmdig --iohist on a busy > node it doesn't seem to capture more than literally 1 second of history. > Is there a better way to grab the data or have gpfs capture more of it? > > Just to give some more context, as part of our monthly reporting > requirements we calculate job efficiency by comparing the number of cpu > cores requested by a given job with the cpu % utilization during that > job's time window. Currently a job that's doing a sleep 9000 would show > up the same as a job blocked on I/O. Having GPFS wait time included in > iowait would allow us to easily make this distinction. > > -Aaron > > On 8/29/16 1:56 PM, Bryan Banister wrote: >> There is the iohist data that may have what you're looking for, -Bryan >> >> -----Original Message----- >> From: gpfsug-discuss-bounces at spectrumscale.org >> [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron >> Knister >> Sent: Monday, August 29, 2016 12:54 PM >> To: gpfsug-discuss at spectrumscale.org >> Subject: Re: [gpfsug-discuss] iowait? >> >> Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. >> >> On 8/29/16 1:50 PM, Alex Chekholko wrote: >>> Any reason you can't just use iostat or collectl or any of a number >>> of other standards tools to look at disk utilization? >>> >>> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>>> Hi Everyone, >>>> >>>> Would it be easy to have GPFS report iowait values in linux? This >>>> would be a huge help for us in determining whether a node's low >>>> utilization is due to some issue with the code running on it or if >>>> it's blocked on I/O, especially in a historical context. >>>> >>>> I naively tried on a test system changing schedule() in >>>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>>> >>>> again: >>>> /* call the scheduler */ >>>> if ( waitFlags & INTERRUPTIBLE ) >>>> schedule(); >>>> else >>>> io_schedule(); >>>> >>>> Seems to actually do what I'm after but generally bad things happen >>>> when I start pretending I'm a kernel developer. >>>> >>>> Any thoughts? If I open an RFE would this be something that's >>>> relatively easy to implement (not asking for a commitment *to* >>>> implement it, just that I'm not asking for something seemingly >>>> simple that's actually fairly hard to implement)? >>>> >>>> -Aaron >>>> >>> >> >> -- >> Aaron Knister >> NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight >> Center >> (301) 286-2776 >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> ________________________________ >> >> Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. 
If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) > only and may contain proprietary, confidential or privileged > information. If you are not the intended recipient, you are hereby > notified that any review, dissemination or copying of this email is > strictly prohibited, and to please notify the sender immediately and > destroy this email and any attachments. Email transmission cannot be > guaranteed to be secure or error-free. The Company, therefore, does not > make any guarantees as to the completeness or accuracy of this email or > any attachments. This email is for informational purposes only and does > not constitute a recommendation, offer, request or solicitation of any > kind to buy, sell, subscribe, redeem or perform any type of transaction > of a financial product. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From volobuev at us.ibm.com Tue Aug 30 06:09:21 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Mon, 29 Aug 2016 22:09:21 -0700 Subject: [gpfsug-discuss] iowait? In-Reply-To: <8ec95af4-4d30-a904-4ba2-cf253460754a@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov><7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu><5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov><21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com><7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov><21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> <8ec95af4-4d30-a904-4ba2-cf253460754a@nasa.gov> Message-ID: I don't see a simple fix that can be implemented by tweaking a general-purpose low-level synchronization primitive. It should be possible to integrate GPFS better into the Linux IO accounting infrastructure, but that would require some investigation a likely a non-trivial amount of work to do right. yuri From: Aaron Knister To: , Date: 08/29/2016 03:59 PM Subject: Re: [gpfsug-discuss] iowait? Sent by: gpfsug-discuss-bounces at spectrumscale.org Thanks Yuri! 
I thought calling io_schedule was the right thing to do because the nfs client in the kernel did this directly until fairly recently. Now it calls wait_on_bit_io which I believe ultimately calls io_schedule. Do you see a more targeted approach for having GPFS register IO wait as something that's feasible? (e.g. not registering iowait for locks, as you suggested, but doing so for file/directory operations such as read/write/readdir?) -Aaron On 8/29/16 4:31 PM, Yuri L Volobuev wrote: > I would advise caution on using "mmdiag --iohist" heavily. In more > recent code streams (V4.1, V4.2) there's a problem with internal locking > that could, under certain conditions could lead to the symptoms that > look very similar to sporadic network blockage. Basically, if "mmdiag > --iohist" gets blocked for long periods of time (e.g. due to local > disk/NFS performance issues), this may end up blocking an mmfsd receiver > thread, delaying RPC processing. The problem was discovered fairly > recently, and the fix hasn't made it out to all service streams yet. > > More generally, IO history is a valuable tool for troubleshooting disk > IO performance issues, but the tool doesn't have the right semantics for > regular, systemic IO performance sampling and monitoring. The query > operation is too expensive, the coverage is subject to load, and the > output is somewhat unstructured. With some effort, one can still build > some form of a roll-your-own monitoring implement, but this is certainly > not an optimal way of approaching the problem. The data should be > available in a structured form, through a channel that supports > light-weight, flexible querying that doesn't impact mainline IO > processing. In Spectrum Scale, this type of data is fed from mmfsd to > Zimon, via an mmpmon interface, and end users can then query Zimon for > raw or partially processed data. Where it comes to high-volume stats, > retaining raw data at its full resolution is only practical for > relatively short periods of time (seconds, or perhaps a small number of > minutes), and some form of aggregation is necessary for covering longer > periods of time (hours to days). In the current versions of the product, > there's a very similar type of data available this way: RPC stats. There > are plans to make IO history data available in a similar fashion. The > entire approach may need to be re-calibrated, however. Making RPC stats > available doesn't appear to have generated a surge of user interest. > This is probably because the data is too complex for casual processing, > and while without doubt a lot of very valuable insight can be gained by > analyzing RPC stats, the actual effort required to do so is too much for > most users. That is, we need to provide some tools for raw data > analytics. Largely the same argument applies to IO stats. In fact, on an > NSD client IO stats are actually a subset of RPC stats. With some > effort, one can perform a comprehensive analysis of NSD client IO stats > by analyzing NSD client-to-server RPC traffic. One can certainly argue > that the effort required is a bit much though. > > Getting back to the original question: would the proposed > cxiWaitEventWait() change work? It'll likely result in nr_iowait being > incremented every time a thread in GPFS code performs an uninterruptible > wait. This could be an act of performing an actual IO request, or > something else, e.g. waiting for a lock. 
Those may be the desirable > semantics in some scenarios, but I wouldn't agree that it's the right > behavior for any uninterruptible wait. io_schedule() is intended for use > for block device IO waits, so using it this way is not in line with the > code intent, which is never a good idea. Besides, relative to > schedule(), io_schedule() has some overhead that could have performance > implications of an uncertain nature. > > yuri > > Inactive hide details for Bryan Banister ---08/29/2016 11:06:59 AM---Try > this: mmchconfig ioHistorySize=1024 # Or however big yBryan Banister > ---08/29/2016 11:06:59 AM---Try this: mmchconfig ioHistorySize=1024 # Or > however big you want! > > From: Bryan Banister > To: gpfsug main discussion list , > Date: 08/29/2016 11:06 AM > Subject: Re: [gpfsug-discuss] iowait? > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------------------------------------------------ > > > > Try this: > > mmchconfig ioHistorySize=1024 # Or however big you want! > > Cheers, > -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org > [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister > Sent: Monday, August 29, 2016 1:05 PM > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] iowait? > > That's an interesting idea. I took a look at mmdig --iohist on a busy > node it doesn't seem to capture more than literally 1 second of history. > Is there a better way to grab the data or have gpfs capture more of it? > > Just to give some more context, as part of our monthly reporting > requirements we calculate job efficiency by comparing the number of cpu > cores requested by a given job with the cpu % utilization during that > job's time window. Currently a job that's doing a sleep 9000 would show > up the same as a job blocked on I/O. Having GPFS wait time included in > iowait would allow us to easily make this distinction. > > -Aaron > > On 8/29/16 1:56 PM, Bryan Banister wrote: >> There is the iohist data that may have what you're looking for, -Bryan >> >> -----Original Message----- >> From: gpfsug-discuss-bounces at spectrumscale.org >> [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron >> Knister >> Sent: Monday, August 29, 2016 12:54 PM >> To: gpfsug-discuss at spectrumscale.org >> Subject: Re: [gpfsug-discuss] iowait? >> >> Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. >> >> On 8/29/16 1:50 PM, Alex Chekholko wrote: >>> Any reason you can't just use iostat or collectl or any of a number >>> of other standards tools to look at disk utilization? >>> >>> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>>> Hi Everyone, >>>> >>>> Would it be easy to have GPFS report iowait values in linux? This >>>> would be a huge help for us in determining whether a node's low >>>> utilization is due to some issue with the code running on it or if >>>> it's blocked on I/O, especially in a historical context. 
>>>> >>>> I naively tried on a test system changing schedule() in >>>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>>> >>>> again: >>>> /* call the scheduler */ >>>> if ( waitFlags & INTERRUPTIBLE ) >>>> schedule(); >>>> else >>>> io_schedule(); >>>> >>>> Seems to actually do what I'm after but generally bad things happen >>>> when I start pretending I'm a kernel developer. >>>> >>>> Any thoughts? If I open an RFE would this be something that's >>>> relatively easy to implement (not asking for a commitment *to* >>>> implement it, just that I'm not asking for something seemingly >>>> simple that's actually fairly hard to implement)? >>>> >>>> -Aaron >>>> >>> >> >> -- >> Aaron Knister >> NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight >> Center >> (301) 286-2776 >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> ________________________________ >> >> Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) > only and may contain proprietary, confidential or privileged > information. If you are not the intended recipient, you are hereby > notified that any review, dissemination or copying of this email is > strictly prohibited, and to please notify the sender immediately and > destroy this email and any attachments. Email transmission cannot be > guaranteed to be secure or error-free. The Company, therefore, does not > make any guarantees as to the completeness or accuracy of this email or > any attachments. This email is for informational purposes only and does > not constitute a recommendation, offer, request or solicitation of any > kind to buy, sell, subscribe, redeem or perform any type of transaction > of a financial product. 
> _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From r.sobey at imperial.ac.uk Tue Aug 30 09:34:33 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Tue, 30 Aug 2016 08:34:33 +0000 Subject: [gpfsug-discuss] CES network aliases Message-ID: Hi all, It's Tuesday morning and that means question time :) So from http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_cesnetworkconfig.htm, I've extracted the following: How to use an alias To use an alias address for CES, you need to provide a static IP address that is not already defined as an alias in the /etc/sysconfig/network-scripts directory. Before you enable the node as a CES node, configure the network adapters for each subnet that are represented in the CES address pool: 1. Define a static IP address for the device: 2. /etc/sysconfig/network-scripts/ifcfg-eth0 3. DEVICE=eth1 4. BOOTPROTO=none 5. IPADDR=10.1.1.10 6. NETMASK=255.255.255.0 7. ONBOOT=yes 8. GATEWAY=10.1.1.1 TYPE=Ethernet 1. Ensure that there are no aliases that are defined in the network-scripts directory for this interface: 10.# ls -l /etc/sysconfig/network-scripts/ifcfg-eth1:* ls: /etc/sysconfig/network-scripts/ifcfg-eth1:*: No such file or directory After the node is enabled as a CES node, no further action is required. CES addresses are added as aliases to the already configured adapters. Now, does this mean for every floating (CES) IP address I need a separate ifcfg-ethX on each node? At the moment I simply have an ifcfg-X file representing each physical network adapter, and then the CES IPs defined. I can see IP addresses being added during failover to the primary interface, but now I've read I potentially need to create a separate file. What's the right way to move forward? If I need separate files, I presume the listed IP is a CES IP (not system) and does it also matter what X is in ifcfg-ethX? Many thanks Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From janfrode at tanso.net Tue Aug 30 10:54:31 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Tue, 30 Aug 2016 09:54:31 +0000 Subject: [gpfsug-discuss] CES network aliases In-Reply-To: References: Message-ID: You only need a static address for your ifcfg-ethX on all nodes, and can then have CES manage multiple floating addresses in that subnet. Also, it doesn't matter much what your interfaces are named (ethX, vlanX, bondX, ethX.5), GPFS will just find the interface that covers the floating address in its subnet, and add the alias there. -jf -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From r.sobey at imperial.ac.uk Tue Aug 30 11:30:25 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Tue, 30 Aug 2016 10:30:25 +0000 Subject: [gpfsug-discuss] CES network aliases In-Reply-To: References: Message-ID: Ace thanks jf. From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Jan-Frode Myklebust Sent: 30 August 2016 10:55 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] CES network aliases You only need a static address for your ifcfg-ethX on all nodes, and can then have CES manage multiple floating addresses in that subnet. Also, it doesn't matter much what your interfaces are named (ethX, vlanX, bondX, ethX.5), GPFS will just find the interface that covers the floating address in its subnet, and add the alias there. -jf -------------- next part -------------- An HTML attachment was scrubbed... URL: From mimarsh2 at vt.edu Tue Aug 30 15:58:41 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Tue, 30 Aug 2016 10:58:41 -0400 Subject: [gpfsug-discuss] Data Replication Message-ID: All, If I setup a filesystem to have data replication of 2 (2 copies of data), does the data get replicated at the NSD Server or at the client? i.e. Does the client send 2 copies over the network or does the NSD Server get a single copy and then replicate on storage NSDs? I couldn't find a place in the docs that talked about this specific point. Thank you, Brian Marshall -------------- next part -------------- An HTML attachment was scrubbed... URL: From bbanister at jumptrading.com Tue Aug 30 16:03:38 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Tue, 30 Aug 2016 15:03:38 +0000 Subject: [gpfsug-discuss] Data Replication In-Reply-To: References: Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB063161EE@CHI-EXCHANGEW1.w2k.jumptrading.com> The NSD Client handles the replication and will, as you stated, write one copy to one NSD (using the primary server for this NSD) and one to a different NSD in a different GPFS failure group (using quite likely, but not necessarily, a different NSD server that is the primary server for this alternate NSD). Cheers, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Brian Marshall Sent: Tuesday, August 30, 2016 9:59 AM To: gpfsug main discussion list Subject: [gpfsug-discuss] Data Replication All, If I setup a filesystem to have data replication of 2 (2 copies of data), does the data get replicated at the NSD Server or at the client? i.e. Does the client send 2 copies over the network or does the NSD Server get a single copy and then replicate on storage NSDs? I couldn't find a place in the docs that talked about this specific point. Thank you, Brian Marshall ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. 
This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Tue Aug 30 17:16:37 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Tue, 30 Aug 2016 12:16:37 -0400 Subject: [gpfsug-discuss] gpfs native raid Message-ID: Does anyone know if/when we might see gpfs native raid opened up for the masses on non-IBM hardware? It's hard to answer the question of "why can't GPFS do this? Lustre can" in regards to Lustre's integration with ZFS and support for RAID on commodity hardware. -Aaron -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From bbanister at jumptrading.com Tue Aug 30 17:26:38 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Tue, 30 Aug 2016 16:26:38 +0000 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB06316445@CHI-EXCHANGEW1.w2k.jumptrading.com> I believe that Doug is going to provide more details at the NDA session at Edge... see attached, -B -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister Sent: Tuesday, August 30, 2016 11:17 AM To: gpfsug main discussion list Subject: [gpfsug-discuss] gpfs native raid Does anyone know if/when we might see gpfs native raid opened up for the masses on non-IBM hardware? It's hard to answer the question of "why can't GPFS do this? Lustre can" in regards to Lustre's integration with ZFS and support for RAID on commodity hardware. -Aaron -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. -------------- next part -------------- An embedded message was scrubbed... From: Douglas O'flaherty Subject: [gpfsug-discuss] Edge Attendees Date: Mon, 29 Aug 2016 05:34:03 +0000 Size: 9615 URL: From cdmaestas at us.ibm.com Tue Aug 30 17:47:18 2016 From: cdmaestas at us.ibm.com (Christopher Maestas) Date: Tue, 30 Aug 2016 16:47:18 +0000 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: Message-ID: Interestingly enough, Spectrum Scale can run on zvols. 
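The zvol side is just ordinary ZFS administration -- a rough sketch, with the pool name, volume name and size made up here, and the sync=always requirement that comes up later in this thread:

    # carve a zvol out of an existing pool to use as a GPFS NSD
    zfs create -V 1T -o volblocksize=128k tank/nsd01
    # writes acknowledged to GPFS need to be on stable storage
    zfs set sync=always tank/nsd01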
Check out: http://files.gpfsug.org/presentations/2016/anl-june/LANL_GPFS_ZFS.pdf -cdm On Aug 30, 2016, 9:17:05 AM, aaron.s.knister at nasa.gov wrote: From: aaron.s.knister at nasa.gov To: gpfsug-discuss at spectrumscale.org Cc: Date: Aug 30, 2016 9:17:05 AM Subject: [gpfsug-discuss] gpfs native raid Does anyone know if/when we might see gpfs native raid opened up for the masses on non-IBM hardware? It's hard to answer the question of "why can't GPFS do this? Lustre can" in regards to Lustre's integration with ZFS and support for RAID on commodity hardware. -Aaron -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Tue Aug 30 18:16:03 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Tue, 30 Aug 2016 13:16:03 -0400 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: Message-ID: <96282850-6bfa-73ae-8502-9e8df3a56390@nasa.gov> Thanks Christopher. I've tried GPFS on zvols a couple times and the write throughput I get is terrible because of the required sync=always parameter. Perhaps a couple of SSD's could help get the number up, though. -Aaron On 8/30/16 12:47 PM, Christopher Maestas wrote: > Interestingly enough, Spectrum Scale can run on zvols. Check out: > > http://files.gpfsug.org/presentations/2016/anl-june/LANL_GPFS_ZFS.pdf > > -cdm > > ------------------------------------------------------------------------ > On Aug 30, 2016, 9:17:05 AM, aaron.s.knister at nasa.gov wrote: > > From: aaron.s.knister at nasa.gov > To: gpfsug-discuss at spectrumscale.org > Cc: > Date: Aug 30, 2016 9:17:05 AM > Subject: [gpfsug-discuss] gpfs native raid > > Does anyone know if/when we might see gpfs native raid opened up for the > masses on non-IBM hardware? It's hard to answer the question of "why > can't GPFS do this? Lustre can" in regards to Lustre's integration with > ZFS and support for RAID on commodity hardware. > -Aaron > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From laurence at qsplace.co.uk Tue Aug 30 19:50:51 2016 From: laurence at qsplace.co.uk (Laurence Horrocks-Barlow) Date: Tue, 30 Aug 2016 20:50:51 +0200 Subject: [gpfsug-discuss] Data Replication In-Reply-To: <21BC488F0AEA2245B2C3E83FC0B33DBB063161EE@CHI-EXCHANGEW1.w2k.jumptrading.com> References: <21BC488F0AEA2245B2C3E83FC0B33DBB063161EE@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: Its the client that does all the synchronous replication, this way the cluster is able to scale as the clients do the leg work (so to speak). 
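As a practical aside, the settings involved can be checked and changed per file system -- a sketch only, with "gpfs01" standing in for your file system name:

    # default and maximum replicas for data (-r/-R) and metadata (-m/-M)
    mmlsfs gpfs01 -r -R -m -M
    # raise the defaults so newly created files get two copies
    mmchfs gpfs01 -r 2 -m 2
    # re-replicate existing files to match the new defaults
    mmrestripefs gpfs01 -R

(The maximum values are normally fixed when the file system is created, so check them first.)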
The somewhat "exception" is if a GPFS NSD server (or client with direct NSD) access uses a server bases protocol such as SMB, in this case the SMB server will do the replication as the SMB client doesn't know about GPFS or its replication; essentially the SMB server is the GPFS client. -- Lauz On 30 August 2016 17:03:38 CEST, Bryan Banister wrote: >The NSD Client handles the replication and will, as you stated, write >one copy to one NSD (using the primary server for this NSD) and one to >a different NSD in a different GPFS failure group (using quite likely, >but not necessarily, a different NSD server that is the primary server >for this alternate NSD). >Cheers, >-Bryan > >From: gpfsug-discuss-bounces at spectrumscale.org >[mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Brian >Marshall >Sent: Tuesday, August 30, 2016 9:59 AM >To: gpfsug main discussion list >Subject: [gpfsug-discuss] Data Replication > >All, > >If I setup a filesystem to have data replication of 2 (2 copies of >data), does the data get replicated at the NSD Server or at the client? >i.e. Does the client send 2 copies over the network or does the NSD >Server get a single copy and then replicate on storage NSDs? > >I couldn't find a place in the docs that talked about this specific >point. > >Thank you, >Brian Marshall > >________________________________ > >Note: This email is for the confidential use of the named addressee(s) >only and may contain proprietary, confidential or privileged >information. If you are not the intended recipient, you are hereby >notified that any review, dissemination or copying of this email is >strictly prohibited, and to please notify the sender immediately and >destroy this email and any attachments. Email transmission cannot be >guaranteed to be secure or error-free. The Company, therefore, does not >make any guarantees as to the completeness or accuracy of this email or >any attachments. This email is for informational purposes only and does >not constitute a recommendation, offer, request or solicitation of any >kind to buy, sell, subscribe, redeem or perform any type of transaction >of a financial product. > > >------------------------------------------------------------------------ > >_______________________________________________ >gpfsug-discuss mailing list >gpfsug-discuss at spectrumscale.org >http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Sent from my Android device with K-9 Mail. Please excuse my brevity. -------------- next part -------------- An HTML attachment was scrubbed... URL: From mimarsh2 at vt.edu Tue Aug 30 19:52:54 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Tue, 30 Aug 2016 14:52:54 -0400 Subject: [gpfsug-discuss] Data Replication In-Reply-To: References: <21BC488F0AEA2245B2C3E83FC0B33DBB063161EE@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: Thanks. This confirms the numbers that I am seeing. Brian On Tue, Aug 30, 2016 at 2:50 PM, Laurence Horrocks-Barlow < laurence at qsplace.co.uk> wrote: > Its the client that does all the synchronous replication, this way the > cluster is able to scale as the clients do the leg work (so to speak). > > The somewhat "exception" is if a GPFS NSD server (or client with direct > NSD) access uses a server bases protocol such as SMB, in this case the SMB > server will do the replication as the SMB client doesn't know about GPFS or > its replication; essentially the SMB server is the GPFS client. 
> > -- Lauz > > On 30 August 2016 17:03:38 CEST, Bryan Banister > wrote: > >> The NSD Client handles the replication and will, as you stated, write one >> copy to one NSD (using the primary server for this NSD) and one to a >> different NSD in a different GPFS failure group (using quite likely, but >> not necessarily, a different NSD server that is the primary server for this >> alternate NSD). >> >> Cheers, >> >> -Bryan >> >> >> >> *From:* gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss- >> bounces at spectrumscale.org] *On Behalf Of *Brian Marshall >> *Sent:* Tuesday, August 30, 2016 9:59 AM >> *To:* gpfsug main discussion list >> *Subject:* [gpfsug-discuss] Data Replication >> >> >> >> All, >> >> >> >> If I setup a filesystem to have data replication of 2 (2 copies of data), >> does the data get replicated at the NSD Server or at the client? i.e. Does >> the client send 2 copies over the network or does the NSD Server get a >> single copy and then replicate on storage NSDs? >> >> >> >> I couldn't find a place in the docs that talked about this specific point. >> >> >> >> Thank you, >> >> Brian Marshall >> >> >> ------------------------------ >> >> Note: This email is for the confidential use of the named addressee(s) >> only and may contain proprietary, confidential or privileged information. >> If you are not the intended recipient, you are hereby notified that any >> review, dissemination or copying of this email is strictly prohibited, and >> to please notify the sender immediately and destroy this email and any >> attachments. Email transmission cannot be guaranteed to be secure or >> error-free. The Company, therefore, does not make any guarantees as to the >> completeness or accuracy of this email or any attachments. This email is >> for informational purposes only and does not constitute a recommendation, >> offer, request or solicitation of any kind to buy, sell, subscribe, redeem >> or perform any type of transaction of a financial product. >> >> ------------------------------ >> >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> > -- > Sent from my Android device with K-9 Mail. Please excuse my brevity. > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From S.J.Thompson at bham.ac.uk Tue Aug 30 20:09:05 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Tue, 30 Aug 2016 19:09:05 +0000 Subject: [gpfsug-discuss] Maximum value for data replication? Message-ID: Is there a maximum value for data replication in Spectrum Scale? I have a number of nsd servers which have local storage and Id like each node to have a full copy of all the data in the file-system, say this value is 4, can I set replication to 4 for data and metadata and have each server have a full copy? These are protocol nodes and multi cluster mount another file system (yes I know not supported) and the cesroot is in the remote file system. On several occasions where GPFS has wibbled a bit, this has caused issues with ces locks, so I was thinking of moving the cesroot to a local filesysyem which is replicated on the local ssds in the protocol nodes. I.e. Its a generally quiet file system as its only ces cluster config. 
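In concrete terms, presumably something along these lines (a sketch only -- the paths are made up, and the docs should be checked for the exact conditions under which cesSharedRoot may be changed):

    # stop protocol services on all CES nodes
    mmces service stop NFS -a
    mmces service stop SMB -a
    # copy the existing shared root into the locally replicated file system
    rsync -a /gpfs/remotefs/ces/ /gpfs/localfs/ces/
    # point CES at the new location and restart services
    mmchconfig cesSharedRoot=/gpfs/localfs/ces
    mmces service start NFS -a
    mmces service start SMB -a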
I assume if I stop protocols, rsync the data and then change to the new ces root, I should be able to get this working? Thanks Simon From kevindjo at us.ibm.com Tue Aug 30 20:43:39 2016 From: kevindjo at us.ibm.com (Kevin D Johnson) Date: Tue, 30 Aug 2016 19:43:39 +0000 Subject: [gpfsug-discuss] greetings Message-ID: An HTML attachment was scrubbed... URL: From xhejtman at ics.muni.cz Tue Aug 30 21:39:18 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Tue, 30 Aug 2016 22:39:18 +0200 Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> Hello, does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the latest patch 32) does start but does not mount any file system. The internal mount cmd gets stuck. -- Lukáš Hejtmánek From kevindjo at us.ibm.com Tue Aug 30 21:51:39 2016 From: kevindjo at us.ibm.com (Kevin D Johnson) Date: Tue, 30 Aug 2016 20:51:39 +0000 Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 In-Reply-To: <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> References: <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> Message-ID: An HTML attachment was scrubbed... URL: From mark.bergman at uphs.upenn.edu Tue Aug 30 22:07:21 2016 From: mark.bergman at uphs.upenn.edu (mark.bergman at uphs.upenn.edu) Date: Tue, 30 Aug 2016 17:07:21 -0400 Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 In-Reply-To: Your message of "Tue, 30 Aug 2016 22:39:18 +0200." <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> References: <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> Message-ID: <24437-1472591241.445832@bR6O.TofS.917u>
As it has the title Petascale Data Protection, I think that in Peta scale, you have to deal with millions (well rather hundreds of millions) of files you store in and this is something where TSM does not scale well. Could you give some hints: On the backup site: mmbackup takes ages for: a) scan (try to scan 500M files even in parallel) b) backup - what if 10 % of files get changed - backup process can be blocked several days as mmbackup cannot run in several instances on the same file system, so you have to wait until one run of mmbackup finishes. How long could it take at petascale? On the restore site: how can I restore e.g. 40 millions of file efficiently? dsmc restore '/path/*' runs into serious troubles after say 20M files (maybe wrong internal structures used), however, scanning 1000 more files takes several minutes resulting the dsmc restore never reaches that 40M files. using filelists the situation is even worse. I run dsmc restore -filelist with a filelist consisting of 2.4M files. Running for *two* days without restoring even a single file. dsmc is consuming 100 % CPU. So any hints addressing these issues with really large number of files would be even more appreciated. -- Luk?? Hejtm?nek From oehmes at gmail.com Wed Aug 31 00:24:59 2016 From: oehmes at gmail.com (Sven Oehme) Date: Tue, 30 Aug 2016 16:24:59 -0700 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" In-Reply-To: <20160830220250.yt6r7gvfq7rlvtcs@ics.muni.cz> References: <20160830220250.yt6r7gvfq7rlvtcs@ics.muni.cz> Message-ID: so lets start with some simple questions. when you say mmbackup takes ages, what version of gpfs code are you running ? how do you execute the mmbackup command ? exact parameters would be useful . what HW are you using for the metadata disks ? how much capacity (df -h) and how many inodes (df -i) do you have in the filesystem you try to backup ? sven On Tue, Aug 30, 2016 at 3:02 PM, Lukas Hejtmanek wrote: > Hello, > > On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > > Find the paper here: > > > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/ > Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection > > thank you for the paper, I appreciate it. > > However, I wonder whether it could be extended a little. As it has the > title > Petascale Data Protection, I think that in Peta scale, you have to deal > with > millions (well rather hundreds of millions) of files you store in and this > is > something where TSM does not scale well. > > Could you give some hints: > > On the backup site: > mmbackup takes ages for: > a) scan (try to scan 500M files even in parallel) > b) backup - what if 10 % of files get changed - backup process can be > blocked > several days as mmbackup cannot run in several instances on the same file > system, so you have to wait until one run of mmbackup finishes. How long > could > it take at petascale? > > On the restore site: > how can I restore e.g. 40 millions of file efficiently? dsmc restore > '/path/*' > runs into serious troubles after say 20M files (maybe wrong internal > structures used), however, scanning 1000 more files takes several minutes > resulting the dsmc restore never reaches that 40M files. > > using filelists the situation is even worse. I run dsmc restore -filelist > with a filelist consisting of 2.4M files. Running for *two* days without > restoring even a single file. dsmc is consuming 100 % CPU. 
> > So any hints addressing these issues with really large number of files > would > be even more appreciated. > > -- > Luk?? Hejtm?nek > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Wed Aug 31 05:00:45 2016 From: aaron.s.knister at nasa.gov (Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]) Date: Wed, 31 Aug 2016 04:00:45 +0000 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" References: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" Message-ID: <5F910253243E6A47B81A9A2EB424BBA101CFF7DB@NDMSMBX404.ndc.nasa.gov> Just want to add on to one of the points Sven touched on regarding metadata HW. We have a modest SSD infrastructure for our metadata disks and we can scan 500M inodes in parallel in about 5 hours if my memory serves me right (and I believe we could go faster if we really wanted to). I think having solid metadata disks (no pun intended) will really help with scan times. From: Sven Oehme Sent: 8/30/16, 7:25 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" so lets start with some simple questions. when you say mmbackup takes ages, what version of gpfs code are you running ? how do you execute the mmbackup command ? exact parameters would be useful . what HW are you using for the metadata disks ? how much capacity (df -h) and how many inodes (df -i) do you have in the filesystem you try to backup ? sven On Tue, Aug 30, 2016 at 3:02 PM, Lukas Hejtmanek > wrote: Hello, On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > Find the paper here: > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection thank you for the paper, I appreciate it. However, I wonder whether it could be extended a little. As it has the title Petascale Data Protection, I think that in Peta scale, you have to deal with millions (well rather hundreds of millions) of files you store in and this is something where TSM does not scale well. Could you give some hints: On the backup site: mmbackup takes ages for: a) scan (try to scan 500M files even in parallel) b) backup - what if 10 % of files get changed - backup process can be blocked several days as mmbackup cannot run in several instances on the same file system, so you have to wait until one run of mmbackup finishes. How long could it take at petascale? On the restore site: how can I restore e.g. 40 millions of file efficiently? dsmc restore '/path/*' runs into serious troubles after say 20M files (maybe wrong internal structures used), however, scanning 1000 more files takes several minutes resulting the dsmc restore never reaches that 40M files. using filelists the situation is even worse. I run dsmc restore -filelist with a filelist consisting of 2.4M files. Running for *two* days without restoring even a single file. dsmc is consuming 100 % CPU. So any hints addressing these issues with really large number of files would be even more appreciated. -- Luk?? 
Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From olaf.weiser at de.ibm.com Wed Aug 31 05:52:57 2016 From: olaf.weiser at de.ibm.com (Olaf Weiser) Date: Wed, 31 Aug 2016 06:52:57 +0200 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" In-Reply-To: <5F910253243E6A47B81A9A2EB424BBA101CFF7DB@NDMSMBX404.ndc.nasa.gov> References: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" <5F910253243E6A47B81A9A2EB424BBA101CFF7DB@NDMSMBX404.ndc.nasa.gov> Message-ID: An HTML attachment was scrubbed... URL: From dominic.mueller at de.ibm.com Wed Aug 31 06:52:38 2016 From: dominic.mueller at de.ibm.com (Dominic Mueller-Wicke01) Date: Wed, 31 Aug 2016 07:52:38 +0200 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Dominic Mueller-Wicke) In-Reply-To: References: Message-ID: Thanks for reading the paper. I agree that the restore of a large number of files is a challenge today. The restore is the focus area for future enhancements for the integration between IBM Spectrum Scale and IBM Spectrum Protect. If something will be available that helps to improve the restore capabilities the paper will be updated with this information. Greetings, Dominic. From: gpfsug-discuss-request at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Date: 31.08.2016 01:25 Subject: gpfsug-discuss Digest, Vol 55, Issue 55 Sent by: gpfsug-discuss-bounces at spectrumscale.org Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Maximum value for data replication? (Simon Thompson (Research Computing - IT Services)) 2. greetings (Kevin D Johnson) 3. GPFS 3.5.0 on RHEL 6.8 (Lukas Hejtmanek) 4. Re: GPFS 3.5.0 on RHEL 6.8 (Kevin D Johnson) 5. Re: GPFS 3.5.0 on RHEL 6.8 (mark.bergman at uphs.upenn.edu) 6. Re: *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Lukas Hejtmanek) 7. Re: *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Sven Oehme) ----- Message from "Simon Thompson (Research Computing - IT Services)" on Tue, 30 Aug 2016 19:09:05 +0000 ----- To: "gpfsug-discuss at spectrumscale.org" Subject: [gpfsug-discuss] Maximum value for data replication? Is there a maximum value for data replication in Spectrum Scale? I have a number of nsd servers which have local storage and Id like each node to have a full copy of all the data in the file-system, say this value is 4, can I set replication to 4 for data and metadata and have each server have a full copy? These are protocol nodes and multi cluster mount another file system (yes I know not supported) and the cesroot is in the remote file system. On several occasions where GPFS has wibbled a bit, this has caused issues with ces locks, so I was thinking of moving the cesroot to a local filesysyem which is replicated on the local ssds in the protocol nodes. I.e. 
Its a generally quiet file system as its only ces cluster config. I assume if I stop protocols, rsync the data and then change to the new ces root, I should be able to get this working? Thanks Simon ----- Message from "Kevin D Johnson" on Tue, 30 Aug 2016 19:43:39 +0000 ----- To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] greetings I'm in Lab Services at IBM - just joining and happy to help any way I can. Kevin D. Johnson, MBA, MAFM Spectrum Computing, Senior Managing Consultant IBM Certified Deployment Professional - Spectrum Scale V4.1.1 IBM Certified Deployment Professional - Cloud Object Storage V3.8 720.349.6199 - kevindjo at us.ibm.com ----- Message from Lukas Hejtmanek on Tue, 30 Aug 2016 22:39:18 +0200 ----- To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Hello, does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the latest patch 32) does start but does not mount and file system. The internal mount cmd gets stucked. -- Luk?? Hejtm?nek ----- Message from "Kevin D Johnson" on Tue, 30 Aug 2016 20:51:39 +0000 ----- To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 RHEL 6.8/2.6.32-642 requires 4.1.1.8 or 4.2.1. You can either go to 6.7 for GPFS 3.5 or bump it up to 7.0/7.1. See Table 13, here: http://www.ibm.com/support/knowledgecenter/en/STXKQY/gpfsclustersfaq.html?view=kc#linuxq Kevin D. Johnson, MBA, MAFM Spectrum Computing, Senior Managing Consultant IBM Certified Deployment Professional - Spectrum Scale V4.1.1 IBM Certified Deployment Professional - Cloud Object Storage V3.8 720.349.6199 - kevindjo at us.ibm.com ----- Original message ----- From: Lukas Hejtmanek Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Cc: Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Date: Tue, Aug 30, 2016 4:39 PM Hello, does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the latest patch 32) does start but does not mount and file system. The internal mount cmd gets stucked. -- Luk?? Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ----- Message from mark.bergman at uphs.upenn.edu on Tue, 30 Aug 2016 17:07:21 -0400 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 In the message dated: Tue, 30 Aug 2016 22:39:18 +0200, The pithy ruminations from Lukas Hejtmanek on <[gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8> were: => Hello, GPFS 3.5.0.[23..3-0] work for me under [CentOS|ScientificLinux] 6.8, but at kernel 2.6.32-573 and lower. I've found kernel bugs in blk_cloned_rq_check_limits() in later kernel revs that caused multipath errors, resulting in GPFS being unable to find all NSDs and mount the filesystem. I am not updating to a newer kernel until I'm certain this is resolved. I opened a bug with CentOS: https://bugs.centos.org/view.php?id=10997 and began an extended discussion with the (RH & SUSE) developers of that chunk of kernel code. I don't know if an upstream bug has been opened by RH, but see: https://patchwork.kernel.org/patch/9140337/ => => does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the => latest patch 32) does start but does not mount and file system. The internal => mount cmd gets stucked. => => -- => Luk?? 
Hejtm?nek -- Mark Bergman voice: 215-746-4061 mark.bergman at uphs.upenn.edu fax: 215-614-0266 http://www.cbica.upenn.edu/ IT Technical Director, Center for Biomedical Image Computing and Analytics Department of Radiology University of Pennsylvania PGP Key: http://www.cbica.upenn.edu/sbia/bergman ----- Message from Lukas Hejtmanek on Wed, 31 Aug 2016 00:02:50 +0200 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" Hello, On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > Find the paper here: > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection thank you for the paper, I appreciate it. However, I wonder whether it could be extended a little. As it has the title Petascale Data Protection, I think that in Peta scale, you have to deal with millions (well rather hundreds of millions) of files you store in and this is something where TSM does not scale well. Could you give some hints: On the backup site: mmbackup takes ages for: a) scan (try to scan 500M files even in parallel) b) backup - what if 10 % of files get changed - backup process can be blocked several days as mmbackup cannot run in several instances on the same file system, so you have to wait until one run of mmbackup finishes. How long could it take at petascale? On the restore site: how can I restore e.g. 40 millions of file efficiently? dsmc restore '/path/*' runs into serious troubles after say 20M files (maybe wrong internal structures used), however, scanning 1000 more files takes several minutes resulting the dsmc restore never reaches that 40M files. using filelists the situation is even worse. I run dsmc restore -filelist with a filelist consisting of 2.4M files. Running for *two* days without restoring even a single file. dsmc is consuming 100 % CPU. So any hints addressing these issues with really large number of files would be even more appreciated. -- Luk?? Hejtm?nek ----- Message from Sven Oehme on Tue, 30 Aug 2016 16:24:59 -0700 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" so lets start with some simple questions. when you say mmbackup takes ages, what version of gpfs code are you running ? how do you execute the mmbackup command ? exact parameters would be useful . what HW are you using for the metadata disks ? how much capacity (df -h) and how many inodes (df -i) do you have in the filesystem you try to backup ? sven On Tue, Aug 30, 2016 at 3:02 PM, Lukas Hejtmanek wrote: Hello, On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > Find the paper here: > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection thank you for the paper, I appreciate it. However, I wonder whether it could be extended a little. As it has the title Petascale Data Protection, I think that in Peta scale, you have to deal with millions (well rather hundreds of millions) of files you store in and this is something where TSM does not scale well. Could you give some hints: On the backup site: mmbackup takes ages for: a) scan (try to scan 500M files even in parallel) b) backup - what if 10 % of files get changed - backup process can be blocked several days as mmbackup cannot run in several instances on the same file system, so you have to wait until one run of mmbackup finishes. 
How long could it take at petascale? On the restore site: how can I restore e.g. 40 millions of file efficiently? dsmc restore '/path/*' runs into serious troubles after say 20M files (maybe wrong internal structures used), however, scanning 1000 more files takes several minutes resulting the dsmc restore never reaches that 40M files. using filelists the situation is even worse. I run dsmc restore -filelist with a filelist consisting of 2.4M files. Running for *two* days without restoring even a single file. dsmc is consuming 100 % CPU. So any hints addressing these issues with really large number of files would be even more appreciated. -- Luk?? Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From xhejtman at ics.muni.cz Wed Aug 31 08:03:08 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Wed, 31 Aug 2016 09:03:08 +0200 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Dominic Mueller-Wicke) In-Reply-To: References: Message-ID: <20160831070308.fiogolgc2nhna6ir@ics.muni.cz> On Wed, Aug 31, 2016 at 07:52:38AM +0200, Dominic Mueller-Wicke01 wrote: > Thanks for reading the paper. I agree that the restore of a large number of > files is a challenge today. The restore is the focus area for future > enhancements for the integration between IBM Spectrum Scale and IBM > Spectrum Protect. If something will be available that helps to improve the > restore capabilities the paper will be updated with this information. I guess that one of the reasons that restore is slow is because this: (strace dsmc) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home/jfeit/atlases/atlases/stud/atl_en/_referencenotitsig", F_OK) = -1 ENOENT (No such file or directory) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home/jfeit/atlases/atlases/stud/atl_en", F_OK) = -1 ENOENT (No such file or directory) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home/jfeit/atlases/atlases/stud", F_OK) = -1 ENOENT (No such file or directory) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home/jfeit/atlases/atlases", F_OK) = -1 ENOENT (No such file or directory) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home/jfeit/atlases", F_OK) = -1 ENOENT (No such file or directory) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home/jfeit", F_OK) = -1 ENOENT (No such file or directory) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home", F_OK) = 0 [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum", F_OK) = 0 it seems that dsmc tests access again and again up to root for each item in the file list if I set different location where to place the restored files. -- Luk?? 
Hejtm?nek From duersch at us.ibm.com Wed Aug 31 13:45:12 2016 From: duersch at us.ibm.com (Steve Duersch) Date: Wed, 31 Aug 2016 08:45:12 -0400 Subject: [gpfsug-discuss] Maximum value for data replication? In-Reply-To: References: Message-ID: >>Is there a maximum value for data replication in Spectrum Scale? The maximum value for replication is 3. Steve Duersch Spectrum Scale RAID 845-433-7902 IBM Poughkeepsie, New York From: gpfsug-discuss-request at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Date: 08/30/2016 07:25 PM Subject: gpfsug-discuss Digest, Vol 55, Issue 55 Sent by: gpfsug-discuss-bounces at spectrumscale.org Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Maximum value for data replication? (Simon Thompson (Research Computing - IT Services)) 2. greetings (Kevin D Johnson) 3. GPFS 3.5.0 on RHEL 6.8 (Lukas Hejtmanek) 4. Re: GPFS 3.5.0 on RHEL 6.8 (Kevin D Johnson) 5. Re: GPFS 3.5.0 on RHEL 6.8 (mark.bergman at uphs.upenn.edu) 6. Re: *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Lukas Hejtmanek) 7. Re: *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Sven Oehme) ---------------------------------------------------------------------- Message: 1 Date: Tue, 30 Aug 2016 19:09:05 +0000 From: "Simon Thompson (Research Computing - IT Services)" To: "gpfsug-discuss at spectrumscale.org" Subject: [gpfsug-discuss] Maximum value for data replication? Message-ID: Content-Type: text/plain; charset="us-ascii" Is there a maximum value for data replication in Spectrum Scale? I have a number of nsd servers which have local storage and Id like each node to have a full copy of all the data in the file-system, say this value is 4, can I set replication to 4 for data and metadata and have each server have a full copy? These are protocol nodes and multi cluster mount another file system (yes I know not supported) and the cesroot is in the remote file system. On several occasions where GPFS has wibbled a bit, this has caused issues with ces locks, so I was thinking of moving the cesroot to a local filesysyem which is replicated on the local ssds in the protocol nodes. I.e. Its a generally quiet file system as its only ces cluster config. I assume if I stop protocols, rsync the data and then change to the new ces root, I should be able to get this working? Thanks Simon ------------------------------ Message: 2 Date: Tue, 30 Aug 2016 19:43:39 +0000 From: "Kevin D Johnson" To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] greetings Message-ID: Content-Type: text/plain; charset="us-ascii" An HTML attachment was scrubbed... URL: < http://gpfsug.org/pipermail/gpfsug-discuss/attachments/20160830/5a2e22a3/attachment-0001.html > ------------------------------ Message: 3 Date: Tue, 30 Aug 2016 22:39:18 +0200 From: Lukas Hejtmanek To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: <20160830203917.qptfgqvlmdbzu6wr at ics.muni.cz> Content-Type: text/plain; charset=iso-8859-2 Hello, does it work for anyone? 
As of kernel 2.6.32-642, GPFS 3.5.0 (including the latest patch 32) does start but does not mount and file system. The internal mount cmd gets stucked. -- Luk?? Hejtm?nek ------------------------------ Message: 4 Date: Tue, 30 Aug 2016 20:51:39 +0000 From: "Kevin D Johnson" To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: Content-Type: text/plain; charset="us-ascii" An HTML attachment was scrubbed... URL: < http://gpfsug.org/pipermail/gpfsug-discuss/attachments/20160830/341d5e11/attachment-0001.html > ------------------------------ Message: 5 Date: Tue, 30 Aug 2016 17:07:21 -0400 From: mark.bergman at uphs.upenn.edu To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: <24437-1472591241.445832 at bR6O.TofS.917u> Content-Type: text/plain; charset="UTF-8" In the message dated: Tue, 30 Aug 2016 22:39:18 +0200, The pithy ruminations from Lukas Hejtmanek on <[gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8> were: => Hello, GPFS 3.5.0.[23..3-0] work for me under [CentOS|ScientificLinux] 6.8, but at kernel 2.6.32-573 and lower. I've found kernel bugs in blk_cloned_rq_check_limits() in later kernel revs that caused multipath errors, resulting in GPFS being unable to find all NSDs and mount the filesystem. I am not updating to a newer kernel until I'm certain this is resolved. I opened a bug with CentOS: https://bugs.centos.org/view.php?id=10997 and began an extended discussion with the (RH & SUSE) developers of that chunk of kernel code. I don't know if an upstream bug has been opened by RH, but see: https://patchwork.kernel.org/patch/9140337/ => => does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the => latest patch 32) does start but does not mount and file system. The internal => mount cmd gets stucked. => => -- => Luk?? Hejtm?nek -- Mark Bergman voice: 215-746-4061 mark.bergman at uphs.upenn.edu fax: 215-614-0266 http://www.cbica.upenn.edu/ IT Technical Director, Center for Biomedical Image Computing and Analytics Department of Radiology University of Pennsylvania PGP Key: http://www.cbica.upenn.edu/sbia/bergman ------------------------------ Message: 6 Date: Wed, 31 Aug 2016 00:02:50 +0200 From: Lukas Hejtmanek To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" Message-ID: <20160830220250.yt6r7gvfq7rlvtcs at ics.muni.cz> Content-Type: text/plain; charset=iso-8859-2 Hello, On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > Find the paper here: > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection thank you for the paper, I appreciate it. However, I wonder whether it could be extended a little. As it has the title Petascale Data Protection, I think that in Peta scale, you have to deal with millions (well rather hundreds of millions) of files you store in and this is something where TSM does not scale well. Could you give some hints: On the backup site: mmbackup takes ages for: a) scan (try to scan 500M files even in parallel) b) backup - what if 10 % of files get changed - backup process can be blocked several days as mmbackup cannot run in several instances on the same file system, so you have to wait until one run of mmbackup finishes. How long could it take at petascale? On the restore site: how can I restore e.g. 40 millions of file efficiently? 
dsmc restore '/path/*' runs into serious troubles after say 20M files (maybe wrong internal structures used), however, scanning 1000 more files takes several minutes resulting the dsmc restore never reaches that 40M files. using filelists the situation is even worse. I run dsmc restore -filelist with a filelist consisting of 2.4M files. Running for *two* days without restoring even a single file. dsmc is consuming 100 % CPU. So any hints addressing these issues with really large number of files would be even more appreciated. -- Luk?? Hejtm?nek ------------------------------ Message: 7 Date: Tue, 30 Aug 2016 16:24:59 -0700 From: Sven Oehme To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" Message-ID: Content-Type: text/plain; charset="utf-8" so lets start with some simple questions. when you say mmbackup takes ages, what version of gpfs code are you running ? how do you execute the mmbackup command ? exact parameters would be useful . what HW are you using for the metadata disks ? how much capacity (df -h) and how many inodes (df -i) do you have in the filesystem you try to backup ? sven On Tue, Aug 30, 2016 at 3:02 PM, Lukas Hejtmanek wrote: > Hello, > > On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > > Find the paper here: > > > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/ > Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection > > thank you for the paper, I appreciate it. > > However, I wonder whether it could be extended a little. As it has the > title > Petascale Data Protection, I think that in Peta scale, you have to deal > with > millions (well rather hundreds of millions) of files you store in and this > is > something where TSM does not scale well. > > Could you give some hints: > > On the backup site: > mmbackup takes ages for: > a) scan (try to scan 500M files even in parallel) > b) backup - what if 10 % of files get changed - backup process can be > blocked > several days as mmbackup cannot run in several instances on the same file > system, so you have to wait until one run of mmbackup finishes. How long > could > it take at petascale? > > On the restore site: > how can I restore e.g. 40 millions of file efficiently? dsmc restore > '/path/*' > runs into serious troubles after say 20M files (maybe wrong internal > structures used), however, scanning 1000 more files takes several minutes > resulting the dsmc restore never reaches that 40M files. > > using filelists the situation is even worse. I run dsmc restore -filelist > with a filelist consisting of 2.4M files. Running for *two* days without > restoring even a single file. dsmc is consuming 100 % CPU. > > So any hints addressing these issues with really large number of files > would > be even more appreciated. > > -- > Luk?? Hejtm?nek > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: < http://gpfsug.org/pipermail/gpfsug-discuss/attachments/20160830/d9b3fb68/attachment.html > ------------------------------ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss End of gpfsug-discuss Digest, Vol 55, Issue 55 ********************************************** -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From daniel.kidger at uk.ibm.com Wed Aug 31 15:32:11 2016 From: daniel.kidger at uk.ibm.com (Daniel Kidger) Date: Wed, 31 Aug 2016 14:32:11 +0000 Subject: [gpfsug-discuss] Data Replication In-Reply-To: Message-ID: The other 'Exception' is when a rule is used to convert a 1 way replicated file to 2 way, or when only one failure group is up due to HW problems. It that case the (re-replication) is done by whatever nodes are used for the rule or command-line, which may include an NSD server. Daniel IBM Spectrum Storage Software +44 (0)7818 522266 Sent from my iPad using IBM Verse On 30 Aug 2016, 19:53:31, mimarsh2 at vt.edu wrote: From: mimarsh2 at vt.edu To: gpfsug-discuss at spectrumscale.org Cc: Date: 30 Aug 2016 19:53:31 Subject: Re: [gpfsug-discuss] Data Replication Thanks. This confirms the numbers that I am seeing. Brian On Tue, Aug 30, 2016 at 2:50 PM, Laurence Horrocks-Barlow wrote: Its the client that does all the synchronous replication, this way the cluster is able to scale as the clients do the leg work (so to speak). The somewhat "exception" is if a GPFS NSD server (or client with direct NSD) access uses a server bases protocol such as SMB, in this case the SMB server will do the replication as the SMB client doesn't know about GPFS or its replication; essentially the SMB server is the GPFS client. -- Lauz On 30 August 2016 17:03:38 CEST, Bryan Banister wrote: The NSD Client handles the replication and will, as you stated, write one copy to one NSD (using the primary server for this NSD) and one to a different NSD in a different GPFS failure group (using quite likely, but not necessarily, a different NSD server that is the primary server for this alternate NSD). Cheers, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Brian Marshall Sent: Tuesday, August 30, 2016 9:59 AM To: gpfsug main discussion list Subject: [gpfsug-discuss] Data Replication All, If I setup a filesystem to have data replication of 2 (2 copies of data), does the data get replicated at the NSD Server or at the client? i.e. Does the client send 2 copies over the network or does the NSD Server get a single copy and then replicate on storage NSDs? I couldn't find a place in the docs that talked about this specific point. Thank you, Brian Marshall Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. 
This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. gpfsug-discuss mailing listgpfsug-discuss at spectrumscale.orghttp://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Sent from my Android device with K-9 Mail. Please excuse my brevity. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discussUnless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU -------------- next part -------------- An HTML attachment was scrubbed... URL: From mimarsh2 at vt.edu Wed Aug 31 19:01:45 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Wed, 31 Aug 2016 14:01:45 -0400 Subject: [gpfsug-discuss] Data Replication In-Reply-To: References: Message-ID: Daniel, So here's my use case: I have a Sandisk IF150 (branded as DeepFlash recently) with 128TB of flash acting as a "fast tier" storage pool in our HPC scratch file system. Can I set the filesystem replication level to 1 then write a policy engine rule to send small and/or recent files to the IF150 with a replication of 2? Any other comments on the proposed usage strategy are helpful. Thank you, Brian Marshall On Wed, Aug 31, 2016 at 10:32 AM, Daniel Kidger wrote: > The other 'Exception' is when a rule is used to convert a 1 way replicated > file to 2 way, or when only one failure group is up due to HW problems. It > that case the (re-replication) is done by whatever nodes are used for the > rule or command-line, which may include an NSD server. > > Daniel > > IBM Spectrum Storage Software > +44 (0)7818 522266 <+44%207818%20522266> > Sent from my iPad using IBM Verse > > > ------------------------------ > On 30 Aug 2016, 19:53:31, mimarsh2 at vt.edu wrote: > > From: mimarsh2 at vt.edu > To: gpfsug-discuss at spectrumscale.org > Cc: > Date: 30 Aug 2016 19:53:31 > Subject: Re: [gpfsug-discuss] Data Replication > > > Thanks. This confirms the numbers that I am seeing. > > Brian > > On Tue, Aug 30, 2016 at 2:50 PM, Laurence Horrocks-Barlow < > laurence at qsplace.co.uk> wrote: > >> Its the client that does all the synchronous replication, this way the >> cluster is able to scale as the clients do the leg work (so to speak). >> >> The somewhat "exception" is if a GPFS NSD server (or client with direct >> NSD) access uses a server bases protocol such as SMB, in this case the SMB >> server will do the replication as the SMB client doesn't know about GPFS or >> its replication; essentially the SMB server is the GPFS client. >> >> -- Lauz >> >> On 30 August 2016 17:03:38 CEST, Bryan Banister < >> bbanister at jumptrading.com> wrote: >> >>> The NSD Client handles the replication and will, as you stated, write >>> one copy to one NSD (using the primary server for this NSD) and one to a >>> different NSD in a different GPFS failure group (using quite likely, but >>> not necessarily, a different NSD server that is the primary server for this >>> alternate NSD). 
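For anyone wanting to verify this behaviour on a live file system, the replication the client actually applied to a file can be inspected, and changed, after the fact. Illustrative commands only (the path is made up):

    # show metadata/data replication factors and the storage pool for one file
    mmlsattr -L /gpfs/fs0/some/dir/file.dat

    # raise the data replication of an existing file to 2; the extra copy is
    # written immediately unless '-I defer' is given, in which case a later
    # 'mmrestripefs -r' completes it
    mmchattr -r 2 /gpfs/fs0/some/dir/file.dat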
>>> >>> Cheers, >>> >>> -Bryan >>> >>> >>> >>> *From:* gpfsug-discuss-bounces at spectrumscale.org [mailto: >>> gpfsug-discuss-bounces at spectrumscale.org] *On Behalf Of *Brian Marshall >>> *Sent:* Tuesday, August 30, 2016 9:59 AM >>> *To:* gpfsug main discussion list >>> *Subject:* [gpfsug-discuss] Data Replication >>> >>> >>> >>> All, >>> >>> >>> >>> If I setup a filesystem to have data replication of 2 (2 copies of >>> data), does the data get replicated at the NSD Server or at the client? >>> i.e. Does the client send 2 copies over the network or does the NSD Server >>> get a single copy and then replicate on storage NSDs? >>> >>> >>> >>> I couldn't find a place in the docs that talked about this specific >>> point. >>> >>> >>> >>> Thank you, >>> >>> Brian Marshall >>> >>> >>> ------------------------------ >>> >>> Note: This email is for the confidential use of the named addressee(s) >>> only and may contain proprietary, confidential or privileged information. >>> If you are not the intended recipient, you are hereby notified that any >>> review, dissemination or copying of this email is strictly prohibited, and >>> to please notify the sender immediately and destroy this email and any >>> attachments. Email transmission cannot be guaranteed to be secure or >>> error-free. The Company, therefore, does not make any guarantees as to the >>> completeness or accuracy of this email or any attachments. This email is >>> for informational purposes only and does not constitute a recommendation, >>> offer, request or solicitation of any kind to buy, sell, subscribe, redeem >>> or perform any type of transaction of a financial product. >>> >>> ------------------------------ >>> >>> gpfsug-discuss mailing list >>> gpfsug-discuss at spectrumscale.org >>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>> >>> >> -- >> Sent from my Android device with K-9 Mail. Please excuse my brevity. >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> > Unless stated otherwise above: > IBM United Kingdom Limited - Registered in England and Wales with number > 741598. > Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Wed Aug 31 19:10:07 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Wed, 31 Aug 2016 14:10:07 -0400 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" - how about a Billion files in 140 seconds? In-Reply-To: References: <20160830220250.yt6r7gvfq7rlvtcs@ics.muni.cz> Message-ID: When you write something like "mmbackup takes ages" - that let's us know how you feel, kinda. But we need some facts and data to make a determination if there is a real problem and whether and how it might be improved. Just to do a "back of the envelope" estimate of how long backup operations "ought to" take - we'd need to know how many disks and/or SSDs with what performance characteristics, how many nodes withf what performance characteristics, network "fabric(s)", Number of files to be scanned, Average number of files per directory, GPFS blocksize(s) configured, Backup devices available with speeds and feeds, etc, etc. 
But anyway just to throw ballpark numbers "out there" to give you an idea of what is possible. I can tell you that a 20 months ago Sven and I benchmarked mmapplypolicy scanning 983 Million files in 136 seconds! The command looked like this: mmapplypolicy /ibm/fs2-1m-p01/shared/Btt -g /ibm/fs2-1m-p01/tmp -d 7 -A 256 -a 32 -n 8 -P /ghome/makaplan/sventests/milli.policy -I test -L 1 -N fastclients fastclients was 10 X86_64 commodity nodes The fs2-1m-p01 file system was hosted on just two IBM GSS nodes and everything was on an Infiniband switch. We packed about 7000 files into each directory.... (This admittedly may not be typical...) This is NOT to say you could back up that many files that fast, but Spectrum Scale metadata scanning can be fast, even with relatively modest hardware resources. YMMV ;-) Marc of GPFS -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhejtman at ics.muni.cz Wed Aug 31 19:39:26 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Wed, 31 Aug 2016 20:39:26 +0200 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" In-Reply-To: References: <20160830220250.yt6r7gvfq7rlvtcs@ics.muni.cz> Message-ID: <20160831183926.k4mbwbbrmxybd7a3@ics.muni.cz> On Tue, Aug 30, 2016 at 04:24:59PM -0700, Sven Oehme wrote: > so lets start with some simple questions. > > when you say mmbackup takes ages, what version of gpfs code are you running > ? that was GPFS 3.5.0-8. The mmapplypolicy took over 2 hours but that was the least problem. We developed our own set of backups scripts around mmbackup to address these issues: 1) while mmbackup is running, you cannot run another instance on the same file system. 2) mmbackup can be very slow, but not mmbackup itself but consecutive dsmc selective, sorry for being misleading, but mainly due to the large number of files to be backed up 3) related to the previous, mmbackup scripts seem to be executing a 'grep' cmd for every input file to check whether it has entry in dmsc output log. well guess what happens if you have millions of files at the input and several gigabytes in dsmc outpu log... In our case, the grep storm took several *weeks*. 4) very surprisingly, some of the files were not backed up at all. We cannot find why but dsmc incremental found some old files that were not covered by mmbackup backups. Maybe because the mmbackup process was not gracefully terminated in some cases (node crash) and so on. > how do you execute the mmbackup command ? exact parameters would be useful > . /usr/lpp/mmfs/bin/mmbackup tape_tape -t incremental -v -N fe1 -P ${POLICY_FILE} --tsm-servers SERVER1 -g /gpfs/clusterbase/tmp/ -s /tmp -m 4 -B 9999999999999 -L 0 we had external exec script that split files from policy into chunks that were run in parallel. > what HW are you using for the metadata disks ? 4x SSD > how much capacity (df -h) and how many inodes (df -i) do you have in the > filesystem you try to backup ? 
df -h /dev/tape_tape 1.5P 745T 711T 52% /exports/tape_tape df -hi /dev/tape_tape 1.3G 98M 1.2G 8% /exports/tape_tape (98M inodes used) mmdf tape_tape disk disk size failure holds holds free KB free KB name in KB group metadata data in full blocks in fragments --------------- ------------- -------- -------- ----- -------------------- ------------------- Disks in storage pool: system (Maximum disk size allowed is 175 TB) nsd_t1_5 23437934592 1 No Yes 7342735360 ( 31%) 133872128 ( 1%) nsd_t1_6 23437934592 1 No Yes 7341166592 ( 31%) 133918784 ( 1%) nsd_t1b_2 23437934592 1 No Yes 7343919104 ( 31%) 134165056 ( 1%) nsd_t1b_3 23437934592 1 No Yes 7341283328 ( 31%) 133986560 ( 1%) nsd_ssd_4 770703360 2 Yes No 692172800 ( 90%) 15981952 ( 2%) nsd_ssd_3 770703360 2 Yes No 692252672 ( 90%) 15921856 ( 2%) nsd_ssd_2 770703360 2 Yes No 692189184 ( 90%) 15928832 ( 2%) nsd_ssd_1 770703360 2 Yes No 692197376 ( 90%) 16013248 ( 2%) ------------- -------------------- ------------------- (pool total) 96834551808 32137916416 ( 33%) 599788416 ( 1%) Disks in storage pool: maid (Maximum disk size allowed is 466 TB) nsd8_t2_12 31249989632 1 No Yes 13167828992 ( 42%) 36282048 ( 0%) nsd8_t2_13 31249989632 1 No Yes 13166729216 ( 42%) 36131072 ( 0%) nsd8_t2_14 31249989632 1 No Yes 13166886912 ( 42%) 36371072 ( 0%) nsd8_t2_15 31249989632 1 No Yes 13168209920 ( 42%) 36681728 ( 0%) nsd8_t2_16 31249989632 1 No Yes 13165176832 ( 42%) 36279488 ( 0%) nsd8_t2_17 31249989632 1 No Yes 13159870464 ( 42%) 36002560 ( 0%) nsd8_t2_46 31249989632 1 No Yes 29624694784 ( 95%) 81600 ( 0%) nsd8_t2_45 31249989632 1 No Yes 29623111680 ( 95%) 77184 ( 0%) nsd8_t2_44 31249989632 1 No Yes 29621467136 ( 95%) 61440 ( 0%) nsd8_t2_43 31249989632 1 No Yes 29622964224 ( 95%) 64640 ( 0%) nsd8_t2_18 31249989632 1 No Yes 13166675968 ( 42%) 36147648 ( 0%) nsd8_t2_19 31249989632 1 No Yes 13164529664 ( 42%) 36225216 ( 0%) nsd8_t2_20 31249989632 1 No Yes 13165223936 ( 42%) 36242368 ( 0%) nsd8_t2_21 31249989632 1 No Yes 13167353856 ( 42%) 36007744 ( 0%) nsd8_t2_31 31249989632 1 No Yes 13116979200 ( 42%) 14155200 ( 0%) nsd8_t2_32 31249989632 1 No Yes 13115633664 ( 42%) 14243840 ( 0%) nsd8_t2_33 31249989632 1 No Yes 13115830272 ( 42%) 14235392 ( 0%) nsd8_t2_34 31249989632 1 No Yes 13119727616 ( 42%) 14500608 ( 0%) nsd8_t2_35 31249989632 1 No Yes 13116925952 ( 42%) 14304192 ( 0%) nsd8_t2_0 31249989632 1 No Yes 13145503744 ( 42%) 99222016 ( 0%) nsd8_t2_36 31249989632 1 No Yes 13119858688 ( 42%) 14054784 ( 0%) nsd8_t2_37 31249989632 1 No Yes 13114101760 ( 42%) 14200704 ( 0%) nsd8_t2_38 31249989632 1 No Yes 13116483584 ( 42%) 14174720 ( 0%) nsd8_t2_39 31249989632 1 No Yes 13121257472 ( 42%) 14094720 ( 0%) nsd8_t2_40 31249989632 1 No Yes 29622908928 ( 95%) 84352 ( 0%) nsd8_t2_1 31249989632 1 No Yes 13146089472 ( 42%) 99566784 ( 0%) nsd8_t2_2 31249989632 1 No Yes 13146208256 ( 42%) 99128960 ( 0%) nsd8_t2_3 31249989632 1 No Yes 13146890240 ( 42%) 99766720 ( 0%) nsd8_t2_4 31249989632 1 No Yes 13145143296 ( 42%) 98992576 ( 0%) nsd8_t2_5 31249989632 1 No Yes 13135876096 ( 42%) 99555008 ( 0%) nsd8_t2_6 31249989632 1 No Yes 13142831104 ( 42%) 99728064 ( 0%) nsd8_t2_7 31249989632 1 No Yes 13140283392 ( 42%) 99412480 ( 0%) nsd8_t2_8 31249989632 1 No Yes 13143470080 ( 42%) 99653696 ( 0%) nsd8_t2_9 31249989632 1 No Yes 13143650304 ( 42%) 99224704 ( 0%) nsd8_t2_10 31249989632 1 No Yes 13145440256 ( 42%) 99238528 ( 0%) nsd8_t2_11 31249989632 1 No Yes 13143201792 ( 42%) 99283008 ( 0%) nsd8_t2_22 31249989632 1 No Yes 13171724288 ( 42%) 36040704 ( 0%) nsd8_t2_23 31249989632 1 No Yes 
13166782464 ( 42%) 36212416 ( 0%) nsd8_t2_24 31249989632 1 No Yes 13167990784 ( 42%) 35842368 ( 0%) nsd8_t2_25 31249989632 1 No Yes 13166972928 ( 42%) 36086848 ( 0%) nsd8_t2_26 31249989632 1 No Yes 13167495168 ( 42%) 36114496 ( 0%) nsd8_t2_27 31249989632 1 No Yes 13164419072 ( 42%) 36119680 ( 0%) nsd8_t2_28 31249989632 1 No Yes 13167804416 ( 42%) 36088832 ( 0%) nsd8_t2_29 31249989632 1 No Yes 13166057472 ( 42%) 36107072 ( 0%) nsd8_t2_30 31249989632 1 No Yes 13163673600 ( 42%) 36102528 ( 0%) nsd8_t2_41 31249989632 1 No Yes 29620840448 ( 95%) 70208 ( 0%) nsd8_t2_42 31249989632 1 No Yes 29621110784 ( 95%) 69568 ( 0%) ------------- -------------------- ------------------- (pool total) 1468749512704 733299890176 ( 50%) 2008331584 ( 0%) ============= ==================== =================== (data) 1562501251072 762668994560 ( 49%) 2544274112 ( 0%) (metadata) 3082813440 2768812032 ( 90%) 63845888 ( 2%) ============= ==================== =================== (total) 1565584064512 765437806592 ( 49%) 2608120000 ( 0%) Inode Information ----------------- Number of used inodes: 102026081 Number of free inodes: 72791199 Number of allocated inodes: 174817280 Maximum number of inodes: 1342177280 -- Luk?? Hejtm?nek From xhejtman at ics.muni.cz Wed Aug 31 20:26:26 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Wed, 31 Aug 2016 21:26:26 +0200 Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 In-Reply-To: <24437-1472591241.445832@bR6O.TofS.917u> References: <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> <24437-1472591241.445832@bR6O.TofS.917u> Message-ID: <20160831192626.k4em4iz7ne2e2cmg@ics.muni.cz> Hello, thank you for explanation. I confirm that things are working with 573 kernel. On Tue, Aug 30, 2016 at 05:07:21PM -0400, mark.bergman at uphs.upenn.edu wrote: > In the message dated: Tue, 30 Aug 2016 22:39:18 +0200, > The pithy ruminations from Lukas Hejtmanek on > <[gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8> were: > => Hello, > > GPFS 3.5.0.[23..3-0] work for me under [CentOS|ScientificLinux] 6.8, > but at kernel 2.6.32-573 and lower. > > I've found kernel bugs in blk_cloned_rq_check_limits() in later kernel > revs that caused multipath errors, resulting in GPFS being unable to > find all NSDs and mount the filesystem. > > I am not updating to a newer kernel until I'm certain this is resolved. > > I opened a bug with CentOS: > > https://bugs.centos.org/view.php?id=10997 > > and began an extended discussion with the (RH & SUSE) developers of that > chunk of kernel code. I don't know if an upstream bug has been opened > by RH, but see: > > https://patchwork.kernel.org/patch/9140337/ > => > => does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the > => latest patch 32) does start but does not mount and file system. The internal > => mount cmd gets stucked. > => > => -- > => Luk?? Hejtm?nek > > > -- > Mark Bergman voice: 215-746-4061 > mark.bergman at uphs.upenn.edu fax: 215-614-0266 > http://www.cbica.upenn.edu/ > IT Technical Director, Center for Biomedical Image Computing and Analytics > Department of Radiology University of Pennsylvania > PGP Key: http://www.cbica.upenn.edu/sbia/bergman > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Luk?? 
Hejtm?nek From wilshire at mcs.anl.gov Wed Aug 31 20:39:17 2016 From: wilshire at mcs.anl.gov (John Blaas) Date: Wed, 31 Aug 2016 14:39:17 -0500 Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 In-Reply-To: <4e7507130c674e35a7ac2c3fa16359e1@GEORGE.anl.gov> References: <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> <24437-1472591241.445832@bR6O.TofS.917u> <4e7507130c674e35a7ac2c3fa16359e1@GEORGE.anl.gov> Message-ID: We are running 3.5 w/ patch 32 on nodes with the storage cluster running on Centos 6.8 with kernel at 2.6.32-642.1.1 and the remote compute cluster running 2.6.32-642.3.1 without any issues. That being said we are looking to upgrade as soon as possible to 4.1, but thought I would add that it is possible even if not supported. --- John Blaas On Wed, Aug 31, 2016 at 2:26 PM, Lukas Hejtmanek wrote: > Hello, > > thank you for explanation. I confirm that things are working with 573 kernel. > > On Tue, Aug 30, 2016 at 05:07:21PM -0400, mark.bergman at uphs.upenn.edu wrote: >> In the message dated: Tue, 30 Aug 2016 22:39:18 +0200, >> The pithy ruminations from Lukas Hejtmanek on >> <[gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8> were: >> => Hello, >> >> GPFS 3.5.0.[23..3-0] work for me under [CentOS|ScientificLinux] 6.8, >> but at kernel 2.6.32-573 and lower. >> >> I've found kernel bugs in blk_cloned_rq_check_limits() in later kernel >> revs that caused multipath errors, resulting in GPFS being unable to >> find all NSDs and mount the filesystem. >> >> I am not updating to a newer kernel until I'm certain this is resolved. >> >> I opened a bug with CentOS: >> >> https://bugs.centos.org/view.php?id=10997 >> >> and began an extended discussion with the (RH & SUSE) developers of that >> chunk of kernel code. I don't know if an upstream bug has been opened >> by RH, but see: >> >> https://patchwork.kernel.org/patch/9140337/ >> => >> => does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the >> => latest patch 32) does start but does not mount and file system. The internal >> => mount cmd gets stucked. >> => >> => -- >> => Luk?? Hejtm?nek >> >> >> -- >> Mark Bergman voice: 215-746-4061 >> mark.bergman at uphs.upenn.edu fax: 215-614-0266 >> http://www.cbica.upenn.edu/ >> IT Technical Director, Center for Biomedical Image Computing and Analytics >> Department of Radiology University of Pennsylvania >> PGP Key: http://www.cbica.upenn.edu/sbia/bergman >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -- > Luk?? Hejtm?nek > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss From janfrode at tanso.net Wed Aug 31 21:44:04 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Wed, 31 Aug 2016 22:44:04 +0200 Subject: [gpfsug-discuss] Data Replication In-Reply-To: References: Message-ID: Assuming your DeepFlash pool is named "deep", something like the following should work: RULE 'deepreplicate' migrate from pool 'deep' to pool 'deep' replicate(2) where MISC_ATTRIBUTES NOT LIKE '%2%' and POOL_NAME LIKE 'deep' "mmapplypolicy gpfs0 -P replicate-policy.pol -I yes" and possibly "mmrestripefs gpfs0 -r" afterwards. 
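Laid out as a policy file (replicate-policy.pol is just the name used in the command below), that is:

    /* re-replicate files in the 'deep' pool that do not yet carry 2 data replicas */
    RULE 'deepreplicate'
      MIGRATE FROM POOL 'deep'
        TO POOL 'deep'
        REPLICATE(2)
        WHERE MISC_ATTRIBUTES NOT LIKE '%2%'
          AND POOL_NAME LIKE 'deep'

applied with:

    mmapplypolicy gpfs0 -P replicate-policy.pol -I yes
    mmrestripefs gpfs0 -r

This assumes the file system's maximum data replication (mmcrfs -R) is at least 2, otherwise REPLICATE(2) cannot be honoured.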
-jf On Wed, Aug 31, 2016 at 8:01 PM, Brian Marshall wrote: > Daniel, > > So here's my use case: I have a Sandisk IF150 (branded as DeepFlash > recently) with 128TB of flash acting as a "fast tier" storage pool in our > HPC scratch file system. Can I set the filesystem replication level to 1 > then write a policy engine rule to send small and/or recent files to the > IF150 with a replication of 2? > > Any other comments on the proposed usage strategy are helpful. > > Thank you, > Brian Marshall > > On Wed, Aug 31, 2016 at 10:32 AM, Daniel Kidger > wrote: > >> The other 'Exception' is when a rule is used to convert a 1 way >> replicated file to 2 way, or when only one failure group is up due to HW >> problems. It that case the (re-replication) is done by whatever nodes are >> used for the rule or command-line, which may include an NSD server. >> >> Daniel >> >> IBM Spectrum Storage Software >> +44 (0)7818 522266 <+44%207818%20522266> >> Sent from my iPad using IBM Verse >> >> >> ------------------------------ >> On 30 Aug 2016, 19:53:31, mimarsh2 at vt.edu wrote: >> >> From: mimarsh2 at vt.edu >> To: gpfsug-discuss at spectrumscale.org >> Cc: >> Date: 30 Aug 2016 19:53:31 >> Subject: Re: [gpfsug-discuss] Data Replication >> >> >> Thanks. This confirms the numbers that I am seeing. >> >> Brian >> >> On Tue, Aug 30, 2016 at 2:50 PM, Laurence Horrocks-Barlow < >> laurence at qsplace.co.uk> wrote: >> >>> Its the client that does all the synchronous replication, this way the >>> cluster is able to scale as the clients do the leg work (so to speak). >>> >>> The somewhat "exception" is if a GPFS NSD server (or client with direct >>> NSD) access uses a server bases protocol such as SMB, in this case the SMB >>> server will do the replication as the SMB client doesn't know about GPFS or >>> its replication; essentially the SMB server is the GPFS client. >>> >>> -- Lauz >>> >>> On 30 August 2016 17:03:38 CEST, Bryan Banister < >>> bbanister at jumptrading.com> wrote: >>> >>>> The NSD Client handles the replication and will, as you stated, write >>>> one copy to one NSD (using the primary server for this NSD) and one to a >>>> different NSD in a different GPFS failure group (using quite likely, but >>>> not necessarily, a different NSD server that is the primary server for this >>>> alternate NSD). >>>> >>>> Cheers, >>>> >>>> -Bryan >>>> >>>> >>>> >>>> *From:* gpfsug-discuss-bounces at spectrumscale.org [mailto: >>>> gpfsug-discuss-bounces at spectrumscale.org] *On Behalf Of *Brian Marshall >>>> *Sent:* Tuesday, August 30, 2016 9:59 AM >>>> *To:* gpfsug main discussion list >>>> *Subject:* [gpfsug-discuss] Data Replication >>>> >>>> >>>> >>>> All, >>>> >>>> >>>> >>>> If I setup a filesystem to have data replication of 2 (2 copies of >>>> data), does the data get replicated at the NSD Server or at the client? >>>> i.e. Does the client send 2 copies over the network or does the NSD Server >>>> get a single copy and then replicate on storage NSDs? >>>> >>>> >>>> >>>> I couldn't find a place in the docs that talked about this specific >>>> point. >>>> >>>> >>>> >>>> Thank you, >>>> >>>> Brian Marshall >>>> >>>> >>>> ------------------------------ >>>> >>>> Note: This email is for the confidential use of the named addressee(s) >>>> only and may contain proprietary, confidential or privileged information. 
>>>> If you are not the intended recipient, you are hereby notified that any >>>> review, dissemination or copying of this email is strictly prohibited, and >>>> to please notify the sender immediately and destroy this email and any >>>> attachments. Email transmission cannot be guaranteed to be secure or >>>> error-free. The Company, therefore, does not make any guarantees as to the >>>> completeness or accuracy of this email or any attachments. This email is >>>> for informational purposes only and does not constitute a recommendation, >>>> offer, request or solicitation of any kind to buy, sell, subscribe, redeem >>>> or perform any type of transaction of a financial product. >>>> >>>> ------------------------------ >>>> >>>> gpfsug-discuss mailing list >>>> gpfsug-discuss at spectrumscale.org >>>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>>> >>>> >>> -- >>> Sent from my Android device with K-9 Mail. Please excuse my brevity. >>> >>> _______________________________________________ >>> gpfsug-discuss mailing list >>> gpfsug-discuss at spectrumscale.org >>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>> >>> >> Unless stated otherwise above: >> IBM United Kingdom Limited - Registered in England and Wales with number >> 741598. >> Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -------------- next part -------------- An HTML attachment was scrubbed... URL:
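A footnote on the data-replication thread above: if the goal is for new files that land in the flash pool to get two copies from the start, rather than being re-replicated later by the migrate rule, a file-placement rule can set the replication level at create time. A sketch only - the pool name 'deep' comes from the thread, the fileset name is hypothetical, and size-based conditions cannot be used here because the file size is not known when the placement rule is evaluated:

    /* give new files placed in the flash pool two data replicas at creation */
    RULE 'deep-placement'
      SET POOL 'deep'
      REPLICATE(2)
      WHERE FILESET_NAME LIKE 'scratch%'

Files that already exist when the rule is installed still need the migrate rule (or an explicit restripe) to pick up their second copy.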
_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From raot at bnl.gov Mon Aug 1 21:42:06 2016 From: raot at bnl.gov (Tejas Rao) Date: Mon, 1 Aug 2016 16:42:06 -0400 Subject: [gpfsug-discuss] HAWC (Highly available write cache) In-Reply-To: References: <7953aa8c-904a-cee5-34be-7d40e55b46db@bnl.gov> <5629f550-05c9-25dd-bbe1-bdea618e8ae0@bnl.gov> Message-ID: <04707e32-83fc-f42d-10cf-99139c136371@bnl.gov> I am not 100% sure what the workload of the VMs is. We have 100's of VMs all used differently, so the workload is rather mixed. I do see 4K writes going to "system" pool, they are tagged as "logData" in 'mmdiag --iohist'. But I also see 4K writes going to the data drives, so it looks like everything is not getting coalesced and these are random writes. Could these 4k writes labelled as "logData" be the writes going to HAWC log files? On 8/1/2016 15:50, Dean Hildebrand wrote: > > Hi Tejas, > > Do you know the workload in the VM? > > The workload which enters into HAWC may or may not be the same as the > workload that eventually goes into the data pool....it all depends on > whether the 4KB writes entering HAWC can be coalesced or not. For > example, sequential 4KB writes can all be coalesced into a single > large chunk. So 4KB writes into HAWC will convert into 8MB writes to > data pool (in your system). But random 4KB writes into HAWC may end up > being 4KB writes into the data pool if there are no adjoining 4KB > writes (i.e., if 4KB blocks are all dispersed, they can't be > coalesced). The goal of HAWC though, whether the 4KB blocks are > coalesced or not, is to reduce app latency by ensuring that writing > the blocks back to the data pool is done in the background. So while > 4KB blocks may still be hitting the data pool, hopefully the > application is seeing the latency of your presumably lower latency > system pool. > > Dean > > > Inactive hide details for Tejas Rao ---08/01/2016 12:06:15 PM---In my > case GPFS storage is used to store VM images (KVM) and heTejas Rao > ---08/01/2016 12:06:15 PM---In my case GPFS storage is used to store > VM images (KVM) and hence the small IO. > > From: Tejas Rao > To: gpfsug main discussion list > Date: 08/01/2016 12:06 PM > Subject: Re: [gpfsug-discuss] HAWC (Highly available write cache) > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------------------------------------------------ > > > > In my case GPFS storage is used to store VM images (KVM) and hence the > small IO. > > I always see lots of small 4K writes and the GPFS filesystem block > size is 8MB. I thought the reason for the small writes is that the > linux kernel requests GPFS to initiate a periodic sync which by > default is every 5 seconds and can be controlled by > "vm.dirty_writeback_centisecs". 
> > I thought HAWC would help in such cases and would harden (coalesce) > the small writes in the "system" pool and would flush to the "data" > pool in larger block size. > > Note - I am not doing direct i/o explicitly. > > > > On 8/1/2016 14:49, Sven Oehme wrote: > > when you say 'synchronous write' what do you mean by that ? > if you are talking about using direct i/o (O_DIRECT flag), > they don't leverage HAWC data path, its by design. > > sven > > On Mon, Aug 1, 2016 at 11:36 AM, Tejas Rao <_raot at bnl.gov_ > > wrote: > I have enabled write cache (HAWC) by running the below > commands. The recovery logs are supposedly placed in the > replicated system metadata pool (SSDs). I do not have a > "system.log" pool as it is only needed if recovery logs > are stored on the client nodes. > > mmchfs gpfs01 --write-cache-threshold 64K > mmchfs gpfs01 -L 1024M > mmchconfig logPingPongSector=no > > I have recycled the daemon on all nodes in the cluster > (including the NSD nodes). > > I still see small synchronous writes (4K) from the clients > going to the data drives (data pool). I am checking this > by looking at "mmdiag --iohist" output. Should they not be > going to the system pool? > > Do I need to do something else? How can I confirm that > HAWC is working as advertised? > > Thanks. > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at _spectrumscale.org_ > _ > __http://gpfsug.org/mailman/listinfo/gpfsug-discuss_ > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > _http://gpfsug.org/mailman/listinfo/gpfsug-discuss_ > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 105 bytes Desc: not available URL: From dhildeb at us.ibm.com Mon Aug 1 21:55:28 2016 From: dhildeb at us.ibm.com (Dean Hildebrand) Date: Mon, 1 Aug 2016 13:55:28 -0700 Subject: [gpfsug-discuss] HAWC (Highly available write cache) In-Reply-To: <04707e32-83fc-f42d-10cf-99139c136371@bnl.gov> References: <7953aa8c-904a-cee5-34be-7d40e55b46db@bnl.gov><5629f550-05c9-25dd-bbe1-bdea618e8ae0@bnl.gov> <04707e32-83fc-f42d-10cf-99139c136371@bnl.gov> Message-ID: Hi Tejas, Yes, most likely those 4k writes are the HAWC writes...hopefully those 4KB writes have a lower latency than the 4k writes to your data pool so you are realizing the benefits. Dean From: Tejas Rao To: gpfsug main discussion list Date: 08/01/2016 01:42 PM Subject: Re: [gpfsug-discuss] HAWC (Highly available write cache) Sent by: gpfsug-discuss-bounces at spectrumscale.org I am not 100% sure what the workload of the VMs is. We have 100's of VMs all used differently, so the workload is rather mixed. I do see 4K writes going to "system" pool, they are tagged as "logData" in 'mmdiag --iohist'. But I also see 4K writes going to the data drives, so it looks like everything is not getting coalesced and these are random writes. Could these 4k writes labelled as "logData" be the writes going to HAWC log files? 
On 8/1/2016 15:50, Dean Hildebrand wrote: Hi Tejas, Do you know the workload in the VM? The workload which enters into HAWC may or may not be the same as the workload that eventually goes into the data pool....it all depends on whether the 4KB writes entering HAWC can be coalesced or not. For example, sequential 4KB writes can all be coalesced into a single large chunk. So 4KB writes into HAWC will convert into 8MB writes to data pool (in your system). But random 4KB writes into HAWC may end up being 4KB writes into the data pool if there are no adjoining 4KB writes (i.e., if 4KB blocks are all dispersed, they can't be coalesced). The goal of HAWC though, whether the 4KB blocks are coalesced or not, is to reduce app latency by ensuring that writing the blocks back to the data pool is done in the background. So while 4KB blocks may still be hitting the data pool, hopefully the application is seeing the latency of your presumably lower latency system pool. Dean Inactive hide details for Tejas Rao ---08/01/2016 12:06:15 PM---In my case GPFS storage is used to store VM images (KVM) and heTejas Rao ---08/01/2016 12:06:15 PM---In my case GPFS storage is used to store VM images (KVM) and hence the small IO. From: Tejas Rao To: gpfsug main discussion list Date: 08/01/2016 12:06 PM Subject: Re: [gpfsug-discuss] HAWC (Highly available write cache) Sent by: gpfsug-discuss-bounces at spectrumscale.org In my case GPFS storage is used to store VM images (KVM) and hence the small IO. I always see lots of small 4K writes and the GPFS filesystem block size is 8MB. I thought the reason for the small writes is that the linux kernel requests GPFS to initiate a periodic sync which by default is every 5 seconds and can be controlled by "vm.dirty_writeback_centisecs". I thought HAWC would help in such cases and would harden (coalesce) the small writes in the "system" pool and would flush to the "data" pool in larger block size. Note - I am not doing direct i/o explicitly. On 8/1/2016 14:49, Sven Oehme wrote: when you say 'synchronous write' what do you mean by that ? if you are talking about using direct i/o (O_DIRECT flag), they don't leverage HAWC data path, its by design. sven On Mon, Aug 1, 2016 at 11:36 AM, Tejas Rao wrote: I have enabled write cache (HAWC) by running the below commands. The recovery logs are supposedly placed in the replicated system metadata pool (SSDs). I do not have a "system.log" pool as it is only needed if recovery logs are stored on the client nodes. mmchfs gpfs01 --write-cache-threshold 64K mmchfs gpfs01 -L 1024M mmchconfig logPingPongSector=no I have recycled the daemon on all nodes in the cluster (including the NSD nodes). I still see small synchronous writes (4K) from the clients going to the data drives (data pool). I am checking this by looking at "mmdiag --iohist" output. Should they not be going to the system pool? Do I need to do something else? How can I confirm that HAWC is working as advertised? Thanks. 
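To answer the "how can I confirm that HAWC is working" part in command terms, a rough check looks like this (a sketch only: "gpfs01" is the file system name used earlier in the thread, and the exact mmlsfs output labels vary by release):

    # confirm the recovery log size and write-cache threshold actually took effect
    mmlsfs gpfs01 -L
    mmlsfs gpfs01 | grep -i write-cache

    # on a client doing the writes, watch the I/O history: "logData" entries
    # against system-pool NSDs are the HAWC/recovery-log writes, while the
    # (possibly coalesced) flushes to the data pool happen later in the background
    mmdiag --iohist | head -40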
_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From Greg.Lehmann at csiro.au Wed Aug 3 06:06:32 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Wed, 3 Aug 2016 05:06:32 +0000 Subject: [gpfsug-discuss] SS 4.2.1.0 upgrade pain Message-ID: <04fbf3c0ae40468d912293821905197d@exch1-cdc.nexus.csiro.au> On Debian I am seeing this when trying to upgrade: mmshutdown dpkg -I gpfs.base_4.2.1-0_amd64.deb gpfs.docs_4.2.1-0_all.deb gpfs.ext_4.2.1-0_amd64.deb gpfs.gpl_4.2.1-0_all.deb gpfs.gskit_8.0.50-57_amd64.deb gpfs.msg.en-us_4.2.1-0_all.deb (Reading database ... 65194 files and directories currently installed.) Preparing to replace gpfs.base 4.1.0-6 (using gpfs.base_4.2.1-0_amd64.deb) ... Unpacking replacement gpfs.base ... Preparing to replace gpfs.docs 4.1.0-6 (using gpfs.docs_4.2.1-0_all.deb) ... Unpacking replacement gpfs.docs ... Preparing to replace gpfs.ext 4.1.0-6 (using gpfs.ext_4.2.1-0_amd64.deb) ... Unpacking replacement gpfs.ext ... Etc. Unpacking replacement gpfs.gpl ... Preparing to replace gpfs.gskit 8.0.50-32 (using gpfs.gskit_8.0.50-57_amd64.deb) ... Unpacking replacement gpfs.gskit ... Preparing to replace gpfs.msg.en-us 4.1.0-6 (using gpfs.msg.en-us_4.2.1-0_all.deb) ... Unpacking replacement gpfs.msg.en-us ... Setting up gpfs.base (4.2.1-0) ... At which point it hangs. A ps shows this: ps -ef | grep mm root 21269 1 0 14:18 pts/0 00:00:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmccrmonitor 15 root 21276 21150 1 14:18 pts/0 00:00:03 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmsysmoncontrol start root 21363 1 0 14:18 ? 
00:00:00 /usr/lpp/mmfs/bin/mmsdrserv 1191 10 10 /var/adm/ras/mmsdrserv.log 128 yes root 22485 21276 0 14:18 pts/0 00:00:00 python /usr/lpp/mmfs/bin/mmsysmon.py root 22486 22485 0 14:18 pts/0 00:00:00 /bin/sh -c /usr/lpp/mmfs/bin/mmlsmgr -c root 22488 22486 1 14:18 pts/0 00:00:03 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmlsmgr -c root 24420 22488 0 14:18 pts/0 00:00:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmcommon linkCommand hadoop1-12-cdc-ib2.it.csiro.au /var/mmfs/tmp/nodefile.mmlsmgr.22488 mmlsmgr -c root 24439 24420 0 14:18 pts/0 00:00:00 /usr/bin/perl /usr/lpp/mmfs/bin/mmdsh -svL gpfs-07-cdc-ib2.san.csiro.au /usr/lpp/mmfs/bin/mmremote mmrpc:1:1:1510:mmrc_mmlsmgr_hadoop1-12-cdc-ib2.it.csiro.au_24420_1470197923_: runCmd _NO_FILE_COPY_ _NO_MOUNT_CHECK_ NULL _LINK_ mmlsmgr -c root 24446 24439 0 14:18 pts/0 00:00:00 /usr/bin/ssh gpfs-07-cdc-ib2.san.csiro.au -n -l root /bin/ksh -c ' LANG=en_US.UTF-8 LC_ALL= LC_COLLATE= LC_TYPE= LC_MONETARY= LC_NUMERIC= LC_TIME= LC_MESSAGES= MMMODE=lc environmentType=lc2 GPFS_rshPath=/usr/bin/ssh GPFS_rcpPath=/usr/bin/scp mmScriptTrace= GPFSCMDPORTRANGE=0 GPFS_CIM_MSG_FORMAT= /usr/lpp/mmfs/bin/mmremote mmrpc:1:1:1510:mmrc_mmlsmgr_hadoop1-12-cdc-ib2.it.csiro.au_24420_1470197923_: runCmd _NO_FILE_COPY_ _NO_MOUNT_CHECK_ NULL _LINK_ mmlsmgr -c ' root 24546 21269 0 14:23 pts/0 00:00:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmccrmonitor 15 root 24548 24455 0 14:23 pts/1 00:00:00 grep mm It is trying to connect with ssh to one of my nsd servers, that it does not have permission to? I am guessing that is where the hang is. Anybody else seen this? I have a workaround - remove from cluster before the update, but this is a bit of extra work I can do without. I have not had to this for previous versions starting with 4.1.0.0. Greg -------------- next part -------------- An HTML attachment was scrubbed... URL: From Greg.Lehmann at csiro.au Wed Aug 3 08:32:43 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Wed, 3 Aug 2016 07:32:43 +0000 Subject: [gpfsug-discuss] SS 4.2.1.0 upgrade pain In-Reply-To: <04fbf3c0ae40468d912293821905197d@exch1-cdc.nexus.csiro.au> References: <04fbf3c0ae40468d912293821905197d@exch1-cdc.nexus.csiro.au> Message-ID: <663114b24b0b403aa076a83791f32c58@exch1-cdc.nexus.csiro.au> And I am seeing the same behaviour on a SLES 12 SP1 update from 4.2.04 to 4.2.1.0. From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Greg.Lehmann at csiro.au Sent: Wednesday, 3 August 2016 3:07 PM To: gpfsug-discuss at spectrumscale.org Subject: [ExternalEmail] [gpfsug-discuss] SS 4.2.1.0 upgrade pain On Debian I am seeing this when trying to upgrade: mmshutdown dpkg -I gpfs.base_4.2.1-0_amd64.deb gpfs.docs_4.2.1-0_all.deb gpfs.ext_4.2.1-0_amd64.deb gpfs.gpl_4.2.1-0_all.deb gpfs.gskit_8.0.50-57_amd64.deb gpfs.msg.en-us_4.2.1-0_all.deb (Reading database ... 65194 files and directories currently installed.) Preparing to replace gpfs.base 4.1.0-6 (using gpfs.base_4.2.1-0_amd64.deb) ... Unpacking replacement gpfs.base ... Preparing to replace gpfs.docs 4.1.0-6 (using gpfs.docs_4.2.1-0_all.deb) ... Unpacking replacement gpfs.docs ... Preparing to replace gpfs.ext 4.1.0-6 (using gpfs.ext_4.2.1-0_amd64.deb) ... Unpacking replacement gpfs.ext ... Etc. Unpacking replacement gpfs.gpl ... Preparing to replace gpfs.gskit 8.0.50-32 (using gpfs.gskit_8.0.50-57_amd64.deb) ... Unpacking replacement gpfs.gskit ... Preparing to replace gpfs.msg.en-us 4.1.0-6 (using gpfs.msg.en-us_4.2.1-0_all.deb) ... 
Unpacking replacement gpfs.msg.en-us ... Setting up gpfs.base (4.2.1-0) ... At which point it hangs. A ps shows this: ps -ef | grep mm root 21269 1 0 14:18 pts/0 00:00:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmccrmonitor 15 root 21276 21150 1 14:18 pts/0 00:00:03 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmsysmoncontrol start root 21363 1 0 14:18 ? 00:00:00 /usr/lpp/mmfs/bin/mmsdrserv 1191 10 10 /var/adm/ras/mmsdrserv.log 128 yes root 22485 21276 0 14:18 pts/0 00:00:00 python /usr/lpp/mmfs/bin/mmsysmon.py root 22486 22485 0 14:18 pts/0 00:00:00 /bin/sh -c /usr/lpp/mmfs/bin/mmlsmgr -c root 22488 22486 1 14:18 pts/0 00:00:03 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmlsmgr -c root 24420 22488 0 14:18 pts/0 00:00:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmcommon linkCommand hadoop1-12-cdc-ib2.it.csiro.au /var/mmfs/tmp/nodefile.mmlsmgr.22488 mmlsmgr -c root 24439 24420 0 14:18 pts/0 00:00:00 /usr/bin/perl /usr/lpp/mmfs/bin/mmdsh -svL gpfs-07-cdc-ib2.san.csiro.au /usr/lpp/mmfs/bin/mmremote mmrpc:1:1:1510:mmrc_mmlsmgr_hadoop1-12-cdc-ib2.it.csiro.au_24420_1470197923_: runCmd _NO_FILE_COPY_ _NO_MOUNT_CHECK_ NULL _LINK_ mmlsmgr -c root 24446 24439 0 14:18 pts/0 00:00:00 /usr/bin/ssh gpfs-07-cdc-ib2.san.csiro.au -n -l root /bin/ksh -c ' LANG=en_US.UTF-8 LC_ALL= LC_COLLATE= LC_TYPE= LC_MONETARY= LC_NUMERIC= LC_TIME= LC_MESSAGES= MMMODE=lc environmentType=lc2 GPFS_rshPath=/usr/bin/ssh GPFS_rcpPath=/usr/bin/scp mmScriptTrace= GPFSCMDPORTRANGE=0 GPFS_CIM_MSG_FORMAT= /usr/lpp/mmfs/bin/mmremote mmrpc:1:1:1510:mmrc_mmlsmgr_hadoop1-12-cdc-ib2.it.csiro.au_24420_1470197923_: runCmd _NO_FILE_COPY_ _NO_MOUNT_CHECK_ NULL _LINK_ mmlsmgr -c ' root 24546 21269 0 14:23 pts/0 00:00:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmccrmonitor 15 root 24548 24455 0 14:23 pts/1 00:00:00 grep mm It is trying to connect with ssh to one of my nsd servers, that it does not have permission to? I am guessing that is where the hang is. Anybody else seen this? I have a workaround - remove from cluster before the update, but this is a bit of extra work I can do without. I have not had to this for previous versions starting with 4.1.0.0. Greg -------------- next part -------------- An HTML attachment was scrubbed... URL: From kenneth.waegeman at ugent.be Wed Aug 3 09:54:30 2016 From: kenneth.waegeman at ugent.be (Kenneth Waegeman) Date: Wed, 3 Aug 2016 10:54:30 +0200 Subject: [gpfsug-discuss] Upgrade from 4.1.1 to 4.2.1 Message-ID: <57A1B146.9070505@ugent.be> Hi, In the upgrade procedure (prerequisites) of 4.2.1, I read: "If you are coming from 4.1.1-X, you must first upgrade to 4.2.0-0. You may use this 4.2.1-0 package to perform a First Time Install or to upgrade from an existing 4.2.0-X level." What does this mean exactly. Should we just install the 4.2.0 rpms first, and then the 4.2.1 rpms, or should we install the 4.2.0 rpms, start up gpfs, bring gpfs down again and then do the 4.2.1 rpms? But if we re-install a 4.1.1 node, we can immediately install 4.2.1 ? Thanks! Kenneth From bbanister at jumptrading.com Wed Aug 3 15:53:52 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Wed, 3 Aug 2016 14:53:52 +0000 Subject: [gpfsug-discuss] Upgrade from 4.1.1 to 4.2.1 In-Reply-To: <57A1B146.9070505@ugent.be> References: <57A1B146.9070505@ugent.be> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB062B3718@CHI-EXCHANGEW1.w2k.jumptrading.com> Your first process is correct. Install the 4.2.0-0 rpms first, then install the 4.2.1 rpms after. 
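Spelled out as commands on one node, that order would look roughly like the sketch below (the /path/to directories are placeholders for wherever the extracted 4.2.0-0 and 4.2.1-0 packages live, and the GPL portability layer needs rebuilding afterwards):

    mmshutdown
    # step 1: bring the node to 4.2.0-0 first
    rpm -Uvh /path/to/4.2.0-0/gpfs.*.rpm
    # step 2: then apply 4.2.1-0 on top
    rpm -Uvh /path/to/4.2.1-0/gpfs.*.rpm
    mmbuildgpl     # rebuild the GPL/portability layer against the running kernel
    mmstartup
    # only once every node in the cluster is running 4.2.1:
    #   mmchconfig release=LATEST
    #   mmchfs <filesystem> -V full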
-Bryan -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Kenneth Waegeman Sent: Wednesday, August 03, 2016 3:55 AM To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] Upgrade from 4.1.1 to 4.2.1 Hi, In the upgrade procedure (prerequisites) of 4.2.1, I read: "If you are coming from 4.1.1-X, you must first upgrade to 4.2.0-0. You may use this 4.2.1-0 package to perform a First Time Install or to upgrade from an existing 4.2.0-X level." What does this mean exactly. Should we just install the 4.2.0 rpms first, and then the 4.2.1 rpms, or should we install the 4.2.0 rpms, start up gpfs, bring gpfs down again and then do the 4.2.1 rpms? But if we re-install a 4.1.1 node, we can immediately install 4.2.1 ? Thanks! Kenneth _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. From pinto at scinet.utoronto.ca Wed Aug 3 17:22:27 2016 From: pinto at scinet.utoronto.ca (Jaime Pinto) Date: Wed, 03 Aug 2016 12:22:27 -0400 Subject: [gpfsug-discuss] quota on secondary groups for a user? Message-ID: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Suppose I want to set both USR and GRP quotas for a user, however GRP is not the primary group. Will gpfs enforce the secondary group quota for that user? What I mean is, if the user keeps writing files with secondary group as the attribute, and that overall group quota is reached, will that user be stopped by gpfs? Thanks Jaime ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. From oehmes at gmail.com Wed Aug 3 17:35:39 2016 From: oehmes at gmail.com (Sven Oehme) Date: Wed, 3 Aug 2016 09:35:39 -0700 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: Hi, quotas are only counted against primary group sven On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto wrote: > Suppose I want to set both USR and GRP quotas for a user, however GRP is > not the primary group. Will gpfs enforce the secondary group quota for that > user? 
> > What I mean is, if the user keeps writing files with secondary group as > the attribute, and that overall group quota is reached, will that user be > stopped by gpfs? > > Thanks > Jaime > > > > > ************************************ > TELL US ABOUT YOUR SUCCESS STORIES > http://www.scinethpc.ca/testimonials > ************************************ > --- > Jaime Pinto > SciNet HPC Consortium - Compute/Calcul Canada > www.scinet.utoronto.ca - www.computecanada.org > University of Toronto > 256 McCaul Street, Room 235 > Toronto, ON, M5T1W5 > P: 416-978-2755 > C: 416-505-1477 > > ---------------------------------------------------------------- > This message was sent using IMP at SciNet Consortium, University of > Toronto. > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From pinto at scinet.utoronto.ca Wed Aug 3 17:41:24 2016 From: pinto at scinet.utoronto.ca (Jaime Pinto) Date: Wed, 03 Aug 2016 12:41:24 -0400 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: <20160803124124.21815zz1w4exmuus@support.scinet.utoronto.ca> Quoting "Sven Oehme" : > Hi, > > quotas are only counted against primary group > > sven Thanks Sven I kind of suspected, but needed an independent confirmation. Jaime > > > On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > wrote: > >> Suppose I want to set both USR and GRP quotas for a user, however GRP is >> not the primary group. Will gpfs enforce the secondary group quota for that >> user? >> >> What I mean is, if the user keeps writing files with secondary group as >> the attribute, and that overall group quota is reached, will that user be >> stopped by gpfs? >> >> Thanks >> Jaime >> >> >> >> >> ************************************ >> TELL US ABOUT YOUR SUCCESS STORIES >> http://www.scinethpc.ca/testimonials >> ************************************ >> --- >> Jaime Pinto >> SciNet HPC Consortium - Compute/Calcul Canada >> www.scinet.utoronto.ca - www.computecanada.org >> University of Toronto >> 256 McCaul Street, Room 235 >> Toronto, ON, M5T1W5 >> P: 416-978-2755 >> C: 416-505-1477 >> >> ---------------------------------------------------------------- >> This message was sent using IMP at SciNet Consortium, University of >> Toronto. >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. From jonathan at buzzard.me.uk Wed Aug 3 17:44:01 2016 From: jonathan at buzzard.me.uk (Jonathan Buzzard) Date: Wed, 3 Aug 2016 17:44:01 +0100 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> On 03/08/16 17:22, Jaime Pinto wrote: > Suppose I want to set both USR and GRP quotas for a user, however GRP is > not the primary group. Will gpfs enforce the secondary group quota for > that user? Nope that's not how POSIX schematics work for group quotas. 
As far as I can tell only your primary group is used for group quotas. It basically makes group quotas in Unix a waste of time in my opinion. At least I have never come across a real world scenario where they work in a useful manner. > What I mean is, if the user keeps writing files with secondary group as > the attribute, and that overall group quota is reached, will that user > be stopped by gpfs? > File sets are the answer to your problems, but retrospectively applying them to a file system is a pain. You create a file set for a directory and can then apply a quota to the file set. Even better you can apply per file set user and group quotas. So if file set A has a 1TB quota you could limit user X to 100GB in the file set, but outside the file set they could have a different quota or even no quota. Only issue is a limit of ~10,000 file sets per file system JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. From pinto at scinet.utoronto.ca Wed Aug 3 17:55:43 2016 From: pinto at scinet.utoronto.ca (Jaime Pinto) Date: Wed, 03 Aug 2016 12:55:43 -0400 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> Message-ID: <20160803125543.11831ypcdi8i189b@support.scinet.utoronto.ca> I guess I have a bit of a puzzle to solve, combining quotas on filesets, paths and USR/GRP attributes So much for the "standard" built-in linux account creation script, in which by default every new user is created with primary GID=UID, doesn't really help any of us. Jaime Quoting "Jonathan Buzzard" : > On 03/08/16 17:22, Jaime Pinto wrote: >> Suppose I want to set both USR and GRP quotas for a user, however GRP is >> not the primary group. Will gpfs enforce the secondary group quota for >> that user? > > Nope that's not how POSIX schematics work for group quotas. As far as I > can tell only your primary group is used for group quotas. It basically > makes group quotas in Unix a waste of time in my opinion. At least I > have never come across a real world scenario where they work in a > useful manner. > >> What I mean is, if the user keeps writing files with secondary group as >> the attribute, and that overall group quota is reached, will that user >> be stopped by gpfs? >> > > File sets are the answer to your problems, but retrospectively applying > them to a file system is a pain. You create a file set for a directory > and can then apply a quota to the file set. Even better you can apply > per file set user and group quotas. So if file set A has a 1TB quota > you could limit user X to 100GB in the file set, but outside the file > set they could have a different quota or even no quota. > > Only issue is a limit of ~10,000 file sets per file system > > > JAB. > > -- > Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk > Fife, United Kingdom. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. From Kevin.Buterbaugh at Vanderbilt.Edu Wed Aug 3 19:06:34 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 3 Aug 2016 18:06:34 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? 
In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: Hi Sven, Wait - am I misunderstanding something here? Let?s say that I have ?user1? who has primary group ?group1? and secondary group ?group2?. And let?s say that they write to a directory where the bit on the directory forces all files created in that directory to have group2 associated with them. Are you saying that those files still count against group1?s group quota??? Thanks for clarifying? Kevin On Aug 3, 2016, at 11:35 AM, Sven Oehme > wrote: Hi, quotas are only counted against primary group sven On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > wrote: Suppose I want to set both USR and GRP quotas for a user, however GRP is not the primary group. Will gpfs enforce the secondary group quota for that user? What I mean is, if the user keeps writing files with secondary group as the attribute, and that overall group quota is reached, will that user be stopped by gpfs? Thanks Jaime ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From pinto at scinet.utoronto.ca Wed Aug 3 19:30:08 2016 From: pinto at scinet.utoronto.ca (Jaime Pinto) Date: Wed, 03 Aug 2016 14:30:08 -0400 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> Quoting "Buterbaugh, Kevin L" : > Hi Sven, > > Wait - am I misunderstanding something here? Let?s say that I have > ?user1? who has primary group ?group1? and secondary group ?group2?. > And let?s say that they write to a directory where the bit on the > directory forces all files created in that directory to have group2 > associated with them. Are you saying that those files still count > against group1?s group quota??? > > Thanks for clarifying? > > Kevin Not really, My interpretation is that all files written with group2 will count towards the quota on that group. However any users with group2 as the primary group will be prevented from writing any further when the group2 quota is reached. However the culprit user1 with primary group as group1 won't be detected by gpfs, and can just keep going on writing group2 files. As far as the individual user quota, it doesn't matter: group1 or group2 it will be counted towards the usage of that user. It would be interesting if the behavior was more as expected. 
I just checked with my Lustre counter-parts and they tell me whichever secondary group is hit first, however many there may be, the user will be stopped. The problem then becomes identifying which of the secondary groups hit the limit for that user. Jaime > > On Aug 3, 2016, at 11:35 AM, Sven Oehme > > wrote: > > Hi, > > quotas are only counted against primary group > > sven > > > On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > > wrote: > Suppose I want to set both USR and GRP quotas for a user, however > GRP is not the primary group. Will gpfs enforce the secondary group > quota for that user? > > What I mean is, if the user keeps writing files with secondary group > as the attribute, and that overall group quota is reached, will > that user be stopped by gpfs? > > Thanks > Jaime > > > > > ************************************ > TELL US ABOUT YOUR SUCCESS STORIES > http://www.scinethpc.ca/testimonials > ************************************ > --- > Jaime Pinto > SciNet HPC Consortium - Compute/Calcul Canada > www.scinet.utoronto.ca - > www.computecanada.org > University of Toronto > 256 McCaul Street, Room 235 > Toronto, ON, M5T1W5 > P: 416-978-2755 > C: 416-505-1477 > ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. From Kevin.Buterbaugh at Vanderbilt.Edu Wed Aug 3 19:34:21 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 3 Aug 2016 18:34:21 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> Message-ID: <78DAAA7C-C0C2-42C2-B6B9-B5EC6CC3A3F4@vanderbilt.edu> Hi Jaime / Sven, If Jaime?s interpretation is correct about user1 continuing to be able to write to ?group2? files even though that group is at their hard limit, then that?s a bug that needs fixing. I haven?t tested that myself, and we?re in a downtime right now so I?m a tad bit busy, but if I need to I?ll test it on our test cluster later this week. Kevin On Aug 3, 2016, at 1:30 PM, Jaime Pinto > wrote: Quoting "Buterbaugh, Kevin L" >: Hi Sven, Wait - am I misunderstanding something here? Let?s say that I have ?user1? who has primary group ?group1? and secondary group ?group2?. And let?s say that they write to a directory where the bit on the directory forces all files created in that directory to have group2 associated with them. Are you saying that those files still count against group1?s group quota??? Thanks for clarifying? Kevin Not really, My interpretation is that all files written with group2 will count towards the quota on that group. However any users with group2 as the primary group will be prevented from writing any further when the group2 quota is reached. However the culprit user1 with primary group as group1 won't be detected by gpfs, and can just keep going on writing group2 files. As far as the individual user quota, it doesn't matter: group1 or group2 it will be counted towards the usage of that user. It would be interesting if the behavior was more as expected. I just checked with my Lustre counter-parts and they tell me whichever secondary group is hit first, however many there may be, the user will be stopped. The problem then becomes identifying which of the secondary groups hit the limit for that user. 
Jaime On Aug 3, 2016, at 11:35 AM, Sven Oehme > wrote: Hi, quotas are only counted against primary group sven On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > wrote: Suppose I want to set both USR and GRP quotas for a user, however GRP is not the primary group. Will gpfs enforce the secondary group quota for that user? What I mean is, if the user keeps writing files with secondary group as the attribute, and that overall group quota is reached, will that user be stopped by gpfs? Thanks Jaime ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From jonathan at buzzard.me.uk Wed Aug 3 19:46:54 2016 From: jonathan at buzzard.me.uk (Jonathan Buzzard) Date: Wed, 3 Aug 2016 19:46:54 +0100 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: On 03/08/16 19:06, Buterbaugh, Kevin L wrote: > Hi Sven, > > Wait - am I misunderstanding something here? Let?s say that I have > ?user1? who has primary group ?group1? and secondary group ?group2?. > And let?s say that they write to a directory where the bit on the > directory forces all files created in that directory to have group2 > associated with them. Are you saying that those files still count > against group1?s group quota??? > Yeah, but bastard user from hell over here then does chgrp group1 myevilfile.txt and your set group id bit becomes irrelevant because it is only ever indicative. In fact there is nothing that guarantees the set group id bit is honored because there is nothing stopping the user or a program coming in immediately after the file is created and changing that. Not pointing fingers at the OSX SMB client when Unix extensions are active on a Samba server in any way there. As such Unix group quotas are in the real world a total waste of space. This is if you ask me why XFS and Lustre have project quotas and GPFS has file sets. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. From Kevin.Buterbaugh at Vanderbilt.Edu Wed Aug 3 19:55:01 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 3 Aug 2016 18:55:01 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: JAB, The set group id bit is tangential to my point. I expect GPFS to count any files a user owns against their user quota. If they are a member of multiple groups then I also expect it to count it against the group quota of whatever group is associated with that file. I.e., if they do a chgrp then GPFS should subtract from one group and add to another. Kevin On Aug 3, 2016, at 1:46 PM, Jonathan Buzzard > wrote: On 03/08/16 19:06, Buterbaugh, Kevin L wrote: Hi Sven, Wait - am I misunderstanding something here? 
Let?s say that I have ?user1? who has primary group ?group1? and secondary group ?group2?. And let?s say that they write to a directory where the bit on the directory forces all files created in that directory to have group2 associated with them. Are you saying that those files still count against group1?s group quota??? Yeah, but bastard user from hell over here then does chgrp group1 myevilfile.txt and your set group id bit becomes irrelevant because it is only ever indicative. In fact there is nothing that guarantees the set group id bit is honored because there is nothing stopping the user or a program coming in immediately after the file is created and changing that. Not pointing fingers at the OSX SMB client when Unix extensions are active on a Samba server in any way there. As such Unix group quotas are in the real world a total waste of space. This is if you ask me why XFS and Lustre have project quotas and GPFS has file sets. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From jonathan at buzzard.me.uk Wed Aug 3 20:13:09 2016 From: jonathan at buzzard.me.uk (Jonathan Buzzard) Date: Wed, 3 Aug 2016 20:13:09 +0100 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <78DAAA7C-C0C2-42C2-B6B9-B5EC6CC3A3F4@vanderbilt.edu> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> <78DAAA7C-C0C2-42C2-B6B9-B5EC6CC3A3F4@vanderbilt.edu> Message-ID: <2b823e10-34e8-ce9d-956c-267df4e6042b@buzzard.me.uk> On 03/08/16 19:34, Buterbaugh, Kevin L wrote: > Hi Jaime / Sven, > > If Jaime?s interpretation is correct about user1 continuing to be able > to write to ?group2? files even though that group is at their hard > limit, then that?s a bug that needs fixing. I haven?t tested that > myself, and we?re in a downtime right now so I?m a tad bit busy, but if > I need to I?ll test it on our test cluster later this week. > Even if Jamie's interpretation is wrong it shows the other massive failure of group quotas under Unix and why they are not fit for purpose in the real world. So bufh here can deliberately or accidentally do a denial of service on other users and tracking down the offending user is a right pain in the backside. The point of being able to change group ownership on a file is to indicate the massive weakness of the whole group quota system, and why in my experience nobody actually uses it, and "project" quota options have been implemented in many "enterprise" Unix file systems. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. From Kevin.Buterbaugh at Vanderbilt.Edu Wed Aug 3 20:18:11 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 3 Aug 2016 19:18:11 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? 
In-Reply-To: <2b823e10-34e8-ce9d-956c-267df4e6042b@buzzard.me.uk> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> <78DAAA7C-C0C2-42C2-B6B9-B5EC6CC3A3F4@vanderbilt.edu> <2b823e10-34e8-ce9d-956c-267df4e6042b@buzzard.me.uk> Message-ID: <6B06DA37-321E-4730-A3D1-61E41E4C6187@vanderbilt.edu> JAB, Our scratch filesystem uses user and group quotas. It started out as a traditional scratch filesystem but then we decided (for better or worse) to allow groups to purchase quota on it (and we don?t purge it, as many sites do). We have many users in multiple groups, so if this is not working right it?s a potential issue for us. But you?re right, I?m a nobody? Kevin On Aug 3, 2016, at 2:13 PM, Jonathan Buzzard > wrote: On 03/08/16 19:34, Buterbaugh, Kevin L wrote: Hi Jaime / Sven, If Jaime?s interpretation is correct about user1 continuing to be able to write to ?group2? files even though that group is at their hard limit, then that?s a bug that needs fixing. I haven?t tested that myself, and we?re in a downtime right now so I?m a tad bit busy, but if I need to I?ll test it on our test cluster later this week. Even if Jamie's interpretation is wrong it shows the other massive failure of group quotas under Unix and why they are not fit for purpose in the real world. So bufh here can deliberately or accidentally do a denial of service on other users and tracking down the offending user is a right pain in the backside. The point of being able to change group ownership on a file is to indicate the massive weakness of the whole group quota system, and why in my experience nobody actually uses it, and "project" quota options have been implemented in many "enterprise" Unix file systems. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From oehmes at gmail.com Wed Aug 3 21:32:32 2016 From: oehmes at gmail.com (Sven Oehme) Date: Wed, 3 Aug 2016 13:32:32 -0700 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <6B06DA37-321E-4730-A3D1-61E41E4C6187@vanderbilt.edu> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> <78DAAA7C-C0C2-42C2-B6B9-B5EC6CC3A3F4@vanderbilt.edu> <2b823e10-34e8-ce9d-956c-267df4e6042b@buzzard.me.uk> <6B06DA37-321E-4730-A3D1-61E41E4C6187@vanderbilt.edu> Message-ID: i can't contribute much to the usefulness of tracking primary or secondary group. depending on who you ask you get a 50/50 answer why its great or broken either way. Jonathan explanation was correct, we only track/enforce primary groups , we don't do anything with secondary groups in regards to quotas. if there is 'doubt' of correct quotation of files on the disk in the filesystem one could always run mmcheckquota, its i/o intensive but will match quota usage of the in memory 'assumption' and update it from the actual data thats stored on disk. 
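In command terms, the check-and-reconcile described above looks roughly like this ("gpfs01" and "group2" are just the example names used earlier in the thread):

    mmrepquota -g gpfs01         # per-group usage and limits as currently accounted
    mmlsquota -g group2 gpfs01   # a single group's usage against its limits
    mmcheckquota gpfs01          # re-scan the file system and reconcile the in-memory
                                 # usage with what is actually on disk (I/O intensive)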
sven On Wed, Aug 3, 2016 at 12:18 PM, Buterbaugh, Kevin L < Kevin.Buterbaugh at vanderbilt.edu> wrote: > JAB, > > Our scratch filesystem uses user and group quotas. It started out as a > traditional scratch filesystem but then we decided (for better or worse) to > allow groups to purchase quota on it (and we don?t purge it, as many sites > do). > > We have many users in multiple groups, so if this is not working right > it?s a potential issue for us. But you?re right, I?m a nobody? > > Kevin > > On Aug 3, 2016, at 2:13 PM, Jonathan Buzzard > wrote: > > On 03/08/16 19:34, Buterbaugh, Kevin L wrote: > > Hi Jaime / Sven, > > If Jaime?s interpretation is correct about user1 continuing to be able > to write to ?group2? files even though that group is at their hard > limit, then that?s a bug that needs fixing. I haven?t tested that > myself, and we?re in a downtime right now so I?m a tad bit busy, but if > I need to I?ll test it on our test cluster later this week. > > > Even if Jamie's interpretation is wrong it shows the other massive failure > of group quotas under Unix and why they are not fit for purpose in the real > world. > > So bufh here can deliberately or accidentally do a denial of service on > other users and tracking down the offending user is a right pain in the > backside. > > The point of being able to change group ownership on a file is to indicate > the massive weakness of the whole group quota system, and why in my > experience nobody actually uses it, and "project" quota options have been > implemented in many "enterprise" Unix file systems. > > JAB. > > -- > Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk > Fife, United Kingdom. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > ? > Kevin Buterbaugh - Senior System Administrator > Vanderbilt University - Advanced Computing Center for Research and > Education > Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From Greg.Lehmann at csiro.au Thu Aug 4 00:03:47 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Wed, 3 Aug 2016 23:03:47 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <20160803125543.11831ypcdi8i189b@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> <20160803125543.11831ypcdi8i189b@support.scinet.utoronto.ca> Message-ID: <762ff4f5796c4992b3bceb23b26fdbf3@exch1-cdc.nexus.csiro.au> The GID selection rules for account creation are Linux distribution specific. It sounds like you are familiar with Red Hat, where I think this idea of GID=UID started. 
sles12sp1-brc:/dev/disk/by-uuid # useradd testout sles12sp1-brc:/dev/disk/by-uuid # grep testout /etc/passwd testout:x:1001:100::/home/testout:/bin/bash sles12sp1-brc:/dev/disk/by-uuid # grep 100 /etc/group users:x:100: sles12sp1-brc:/dev/disk/by-uuid # Cheers, Greg -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Jaime Pinto Sent: Thursday, 4 August 2016 2:56 AM To: gpfsug main discussion list ; Jonathan Buzzard Cc: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] quota on secondary groups for a user? I guess I have a bit of a puzzle to solve, combining quotas on filesets, paths and USR/GRP attributes So much for the "standard" built-in linux account creation script, in which by default every new user is created with primary GID=UID, doesn't really help any of us. Jaime Quoting "Jonathan Buzzard" : > On 03/08/16 17:22, Jaime Pinto wrote: >> Suppose I want to set both USR and GRP quotas for a user, however GRP >> is not the primary group. Will gpfs enforce the secondary group quota >> for that user? > > Nope that's not how POSIX schematics work for group quotas. As far as > I can tell only your primary group is used for group quotas. It > basically makes group quotas in Unix a waste of time in my opinion. At > least I have never come across a real world scenario where they work > in a useful manner. > >> What I mean is, if the user keeps writing files with secondary group >> as the attribute, and that overall group quota is reached, will that >> user be stopped by gpfs? >> > > File sets are the answer to your problems, but retrospectively > applying them to a file system is a pain. You create a file set for a > directory and can then apply a quota to the file set. Even better you > can apply per file set user and group quotas. So if file set A has a > 1TB quota you could limit user X to 100GB in the file set, but outside > the file set they could have a different quota or even no quota. > > Only issue is a limit of ~10,000 file sets per file system > > > JAB. > > -- > Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk > Fife, United Kingdom. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From Greg.Lehmann at csiro.au Thu Aug 4 03:41:55 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Thu, 4 Aug 2016 02:41:55 +0000 Subject: [gpfsug-discuss] 4.2.1 documentation Message-ID: <8033d4a67d9745f4a52f148538423066@exch1-cdc.nexus.csiro.au> I see only 4 pdfs now with slightly different titles to the previous 5 pdfs available with 4.2.0. Just checking there are only supposed to be 4 now? Greg -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From kenneth.waegeman at ugent.be Thu Aug 4 09:13:29 2016 From: kenneth.waegeman at ugent.be (Kenneth Waegeman) Date: Thu, 4 Aug 2016 10:13:29 +0200 Subject: [gpfsug-discuss] 4.2.1 documentation In-Reply-To: <8033d4a67d9745f4a52f148538423066@exch1-cdc.nexus.csiro.au> References: <8033d4a67d9745f4a52f148538423066@exch1-cdc.nexus.csiro.au> Message-ID: <57A2F929.8000003@ugent.be> This is new, it is explained how they are merged at http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1xx_soc.htm Cheers! K On 04/08/16 04:41, Greg.Lehmann at csiro.au wrote: > > I see only 4 pdfs now with slightly different titles to the previous 5 > pdfs available with 4.2.0. Just checking there are only supposed to be > 4 now? > > Greg > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Thu Aug 4 09:13:51 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Thu, 4 Aug 2016 08:13:51 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> Message-ID: 1000 isn't it?! We've always worked on that assumption. -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Jonathan Buzzard Sent: 03 August 2016 17:44 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] quota on secondary groups for a user? in the file set, but outside the file set they could have a different quota or even no quota. Only issue is a limit of ~10,000 file sets per file system JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From r.sobey at imperial.ac.uk Thu Aug 4 09:17:01 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Thu, 4 Aug 2016 08:17:01 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> Message-ID: Ah. Dependent vs independent. (10,000 and 1000 respectively). -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Sobey, Richard A Sent: 04 August 2016 09:14 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] quota on secondary groups for a user? 1000 isn't it?! We've always worked on that assumption. -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Jonathan Buzzard Sent: 03 August 2016 17:44 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] quota on secondary groups for a user? in the file set, but outside the file set they could have a different quota or even no quota. Only issue is a limit of ~10,000 file sets per file system JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. 
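(A quick sketch of the per-fileset quota approach discussed in this thread. The filesystem, fileset and user names are made up, and the mmsetquota syntax shown is the newer 4.1-style form, so check the command reference for the level actually in use:)

mmcrfileset gpfs01 projA --inode-space new
mmlinkfileset gpfs01 projA -J /gpfs/gpfs01/projA
mmchfs gpfs01 --perfileset-quota                        # allow per-user/group quotas inside filesets
mmsetquota gpfs01:projA --block 900G:1T                 # soft:hard limit for the fileset as a whole
mmsetquota gpfs01:projA --user user1 --block 90G:100G   # user1's limit inside this fileset only
mmrepquota -j gpfs01                                    # report fileset-level usage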
_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From st.graf at fz-juelich.de Thu Aug 4 09:20:42 2016 From: st.graf at fz-juelich.de (Stephan Graf) Date: Thu, 4 Aug 2016 10:20:42 +0200 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> Message-ID: <57A2FADA.1060508@fz-juelich.de> Hi! I have tested it with dependent filesets in GPFS 4.1.1.X and there the limit is 10.000. Stephan On 08/04/16 10:13, Sobey, Richard A wrote: > 1000 isn't it?! We've always worked on that assumption. > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Jonathan Buzzard > Sent: 03 August 2016 17:44 > To: gpfsug-discuss at spectrumscale.org > Subject: Re: [gpfsug-discuss] quota on secondary groups for a user? > in the file set, but outside the file set they could have a different quota or even no quota. > > Only issue is a limit of ~10,000 file sets per file system > > > JAB. > -- Stephan Graf Juelich Supercomputing Centre Institute for Advanced Simulation Forschungszentrum Juelich GmbH 52425 Juelich, Germany Phone: +49-2461-61-6578 Fax: +49-2461-61-6656 E-mail: st.graf at fz-juelich.de WWW: http://www.fz-juelich.de/jsc/ ------------------------------------------------------------------------------------------------ ------------------------------------------------------------------------------------------------ Forschungszentrum Juelich GmbH 52425 Juelich Sitz der Gesellschaft: Juelich Eingetragen im Handelsregister des Amtsgerichts Dueren Nr. HR B 3498 Vorsitzender des Aufsichtsrats: MinDir Dr. Karl Eugen Huthmacher Geschaeftsfuehrung: Prof. Dr.-Ing. Wolfgang Marquardt (Vorsitzender), Karsten Beneke (stellv. Vorsitzender), Prof. Dr.-Ing. Harald Bolt, Prof. Dr. Sebastian M. Schmidt ------------------------------------------------------------------------------------------------ ------------------------------------------------------------------------------------------------ From daniel.kidger at uk.ibm.com Thu Aug 4 09:22:36 2016 From: daniel.kidger at uk.ibm.com (Daniel Kidger) Date: Thu, 4 Aug 2016 08:22:36 +0000 Subject: [gpfsug-discuss] 4.2.1 documentation In-Reply-To: <8033d4a67d9745f4a52f148538423066@exch1-cdc.nexus.csiro.au> Message-ID: Yes they have been re arranged. My observation is that the Admin and Advanced Admin have merged into one PDFs, and the DMAPI manual is now a chapter of the new Programming guide (along with the complete set of man pages which have moved out of the Admin guide). Table 3 on page 26 of the Concepts, Planning and Install guide describes these change. IMHO The new format is much better as all Admin is in one place not two. ps. I couldn't find in the programming guide a chapter yet on Light Weight Events. Anyone in product development care to comment? 
:-) Daniel IBM Spectrum Storage Software +44 (0)7818 522266 Sent from my iPad using IBM Verse On 4 Aug 2016, 03:42:21, Greg.Lehmann at csiro.au wrote: From: Greg.Lehmann at csiro.au To: gpfsug-discuss at spectrumscale.org Cc: Date: 4 Aug 2016 03:42:21 Subject: [gpfsug-discuss] 4.2.1 documentation I see only 4 pdfs now with slightly different titles to the previous 5 pdfs available with 4.2.0. Just checking there are only supposed to be 4 now? GregUnless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU -------------- next part -------------- An HTML attachment was scrubbed... URL: From pinto at scinet.utoronto.ca Thu Aug 4 16:59:31 2016 From: pinto at scinet.utoronto.ca (Jaime Pinto) Date: Thu, 04 Aug 2016 11:59:31 -0400 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> Message-ID: <20160804115931.26601tycacksqhcz@support.scinet.utoronto.ca> Since there were inconsistencies in the responses, I decided to rig a couple of accounts/groups on our LDAP to test "My interpretation", and determined that I was wrong. When Kevin mentioned it would mean a bug I had to double-check: If a user hits the hard quota or exceeds the grace period on the soft quota on any of the secondary groups that user will be stopped from further writing to those groups as well, just as in the primary group. I hope this clears the waters a bit. I still have to solve my puzzle. Thanks everyone for the feedback. Jaime Quoting "Jaime Pinto" : > Quoting "Buterbaugh, Kevin L" : > >> Hi Sven, >> >> Wait - am I misunderstanding something here? Let?s say that I have >> ?user1? who has primary group ?group1? and secondary group >> ?group2?. And let?s say that they write to a directory where the >> bit on the directory forces all files created in that directory to >> have group2 associated with them. Are you saying that those >> files still count against group1?s group quota??? >> >> Thanks for clarifying? >> >> Kevin > > Not really, > > My interpretation is that all files written with group2 will count > towards the quota on that group. However any users with group2 as the > primary group will be prevented from writing any further when the > group2 quota is reached. However the culprit user1 with primary group > as group1 won't be detected by gpfs, and can just keep going on writing > group2 files. > > As far as the individual user quota, it doesn't matter: group1 or > group2 it will be counted towards the usage of that user. > > It would be interesting if the behavior was more as expected. I just > checked with my Lustre counter-parts and they tell me whichever > secondary group is hit first, however many there may be, the user will > be stopped. The problem then becomes identifying which of the secondary > groups hit the limit for that user. > > Jaime > > >> >> On Aug 3, 2016, at 11:35 AM, Sven Oehme >> > wrote: >> >> Hi, >> >> quotas are only counted against primary group >> >> sven >> >> >> On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto >> > wrote: >> Suppose I want to set both USR and GRP quotas for a user, however >> GRP is not the primary group. Will gpfs enforce the secondary group >> quota for that user? 
>> >> What I mean is, if the user keeps writing files with secondary >> group as the attribute, and that overall group quota is reached, >> will that user be stopped by gpfs? >> >> Thanks >> Jaime >> >> >> >> >> ************************************ >> TELL US ABOUT YOUR SUCCESS STORIES >> http://www.scinethpc.ca/testimonials >> ************************************ >> --- >> Jaime Pinto >> SciNet HPC Consortium - Compute/Calcul Canada >> www.scinet.utoronto.ca - >> www.computecanada.org >> University of Toronto >> 256 McCaul Street, Room 235 >> Toronto, ON, M5T1W5 >> P: 416-978-2755 >> C: 416-505-1477 >> > > > ---------------------------------------------------------------- > This message was sent using IMP at SciNet Consortium, University of Toronto. > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. From Kevin.Buterbaugh at Vanderbilt.Edu Thu Aug 4 17:08:30 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Thu, 4 Aug 2016 16:08:30 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <20160804115931.26601tycacksqhcz@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> <20160804115931.26601tycacksqhcz@support.scinet.utoronto.ca> Message-ID: <7C0606E3-37D9-4301-8676-5060A0984FF2@vanderbilt.edu> Hi Jaime, Thank you sooooo much for doing this and reporting back the results! They?re in line with what I would expect to happen. I was going to test this as well, but we have had to extend our downtime until noontime tomorrow, so I haven?t had a chance to do so yet. Now I don?t have to? ;-) Kevin On Aug 4, 2016, at 10:59 AM, Jaime Pinto > wrote: Since there were inconsistencies in the responses, I decided to rig a couple of accounts/groups on our LDAP to test "My interpretation", and determined that I was wrong. When Kevin mentioned it would mean a bug I had to double-check: If a user hits the hard quota or exceeds the grace period on the soft quota on any of the secondary groups that user will be stopped from further writing to those groups as well, just as in the primary group. I hope this clears the waters a bit. I still have to solve my puzzle. Thanks everyone for the feedback. Jaime Quoting "Jaime Pinto" >: Quoting "Buterbaugh, Kevin L" >: Hi Sven, Wait - am I misunderstanding something here? Let?s say that I have ?user1? who has primary group ?group1? and secondary group ?group2?. And let?s say that they write to a directory where the bit on the directory forces all files created in that directory to have group2 associated with them. Are you saying that those files still count against group1?s group quota??? Thanks for clarifying? Kevin Not really, My interpretation is that all files written with group2 will count towards the quota on that group. 
However any users with group2 as the primary group will be prevented from writing any further when the group2 quota is reached. However the culprit user1 with primary group as group1 won't be detected by gpfs, and can just keep going on writing group2 files. As far as the individual user quota, it doesn't matter: group1 or group2 it will be counted towards the usage of that user. It would be interesting if the behavior was more as expected. I just checked with my Lustre counter-parts and they tell me whichever secondary group is hit first, however many there may be, the user will be stopped. The problem then becomes identifying which of the secondary groups hit the limit for that user. Jaime On Aug 3, 2016, at 11:35 AM, Sven Oehme > wrote: Hi, quotas are only counted against primary group sven On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > wrote: Suppose I want to set both USR and GRP quotas for a user, however GRP is not the primary group. Will gpfs enforce the secondary group quota for that user? What I mean is, if the user keeps writing files with secondary group as the attribute, and that overall group quota is reached, will that user be stopped by gpfs? Thanks Jaime ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From pinto at scinet.utoronto.ca Thu Aug 4 17:34:09 2016 From: pinto at scinet.utoronto.ca (Jaime Pinto) Date: Thu, 04 Aug 2016 12:34:09 -0400 Subject: [gpfsug-discuss] quota on secondary groups for a user? 
In-Reply-To: <7C0606E3-37D9-4301-8676-5060A0984FF2@vanderbilt.edu>
References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> <20160804115931.26601tycacksqhcz@support.scinet.utoronto.ca> <7C0606E3-37D9-4301-8676-5060A0984FF2@vanderbilt.edu>
Message-ID: <20160804123409.18403cy3iz123gxt@support.scinet.utoronto.ca>

OK. More info:

Users can apply the 'sg group1' or 'sg group2' command from a shell or script to switch the group mask from that point on, and dodge the quota that may have been exceeded on a group.

However, as the group owner or other member of the group on the limit, I could not find a tool they can use on their own to find out who is (are) the largest user(s); 'du' takes too long, and some users don't give read permissions on their directories. As part of the puzzle solution I have to come up with a root wrapper that can make the contents of the mmrepquota report available to them.

Jaime

Quoting "Buterbaugh, Kevin L" :

> Hi Jaime,
>
> Thank you sooooo much for doing this and reporting back the results!
> They're in line with what I would expect to happen. I was going
> to test this as well, but we have had to extend our downtime until
> noontime tomorrow, so I haven't had a chance to do so yet. Now I
> don't have to... ;-)
>
> Kevin
>
> On Aug 4, 2016, at 10:59 AM, Jaime Pinto > > wrote:
>
> Since there were inconsistencies in the responses, I decided to rig
> a couple of accounts/groups on our LDAP to test "My interpretation",
> and determined that I was wrong. When Kevin mentioned it would mean
> a bug I had to double-check:
>
> If a user hits the hard quota or exceeds the grace period on the
> soft quota on any of the secondary groups that user will be stopped
> from further writing to those groups as well, just as in the primary
> group.
>
> I hope this clears the waters a bit. I still have to solve my puzzle.
>
> Thanks everyone for the feedback.
> Jaime
>
> Quoting "Jaime Pinto" > >:
>
> Quoting "Buterbaugh, Kevin L" > >:
>
> Hi Sven,
>
> Wait - am I misunderstanding something here? Let's say that I have
> "user1" who has primary group "group1" and secondary group
> "group2". And let's say that they write to a directory where the setgid
> bit on the directory forces all files created in that directory to
> have group2 associated with them. Are you saying that those files
> still count against group1's group quota???
>
> Thanks for clarifying...
>
> Kevin
>
> Not really,
>
> My interpretation is that all files written with group2 will count
> towards the quota on that group. However any users with group2 as the
> primary group will be prevented from writing any further when the
> group2 quota is reached. However the culprit user1 with primary group
> as group1 won't be detected by gpfs, and can just keep going on writing
> group2 files.
>
> As far as the individual user quota, it doesn't matter: group1 or
> group2 it will be counted towards the usage of that user.
>
> It would be interesting if the behavior was more as expected. I just
> checked with my Lustre counter-parts and they tell me whichever
> secondary group is hit first, however many there may be, the user will
> be stopped. The problem then becomes identifying which of the secondary
> groups hit the limit for that user.
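(On the root wrapper Jaime mentions above -- a minimal sketch of the sort of script that could sit behind sudo so group members can read the relevant mmrepquota lines without getting root. The filesystem name, the assumption that the account name is the first column of the mmrepquota output, and the use of getent are all local details to adapt:)

#!/bin/bash
# group-usage: show a group's quota total and the per-user usage of its members.
# Intended to be invoked via sudo, e.g. "sudo group-usage group2".
FS=gpfs01                                  # assumed filesystem name, adjust locally
GROUP=${1:?usage: group-usage <groupname>}
# only members of the group may query it
id -Gn "$SUDO_USER" | tr ' ' '\n' | grep -qx "$GROUP" || { echo "not a member of $GROUP" >&2; exit 1; }
echo "== $GROUP total =="
/usr/lpp/mmfs/bin/mmrepquota -g "$FS" | awk -v g="$GROUP" '$1 == g'
echo "== per-member usage =="
# note: getent only lists supplementary members, so users whose *primary*
# group is $GROUP would need to be added from LDAP/passwd as well
for u in $(getent group "$GROUP" | cut -d: -f4 | tr ',' ' '); do
    /usr/lpp/mmfs/bin/mmrepquota -u "$FS" | awk -v n="$u" '$1 == n'
done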
> > Jaime > > > > On Aug 3, 2016, at 11:35 AM, Sven Oehme > > > wrote: > > Hi, > > quotas are only counted against primary group > > sven > > > On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > > > wrote: > Suppose I want to set both USR and GRP quotas for a user, however > GRP is not the primary group. Will gpfs enforce the secondary group > quota for that user? > > What I mean is, if the user keeps writing files with secondary > group as the attribute, and that overall group quota is reached, > will that user be stopped by gpfs? > > Thanks > Jaime > > > > > ************************************ > TELL US ABOUT YOUR SUCCESS STORIES > http://www.scinethpc.ca/testimonials > ************************************ > --- > Jaime Pinto > SciNet HPC Consortium - Compute/Calcul Canada > www.scinet.utoronto.ca - > www.computecanada.org > University of Toronto > 256 McCaul Street, Room 235 > Toronto, ON, M5T1W5 > P: 416-978-2755 > C: 416-505-1477 > > > > ---------------------------------------------------------------- > This message was sent using IMP at SciNet Consortium, University of Toronto. > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > > > ************************************ > TELL US ABOUT YOUR SUCCESS STORIES > http://www.scinethpc.ca/testimonials > ************************************ > --- > Jaime Pinto > SciNet HPC Consortium - Compute/Calcul Canada > www.scinet.utoronto.ca - > www.computecanada.org > University of Toronto > 256 McCaul Street, Room 235 > Toronto, ON, M5T1W5 > P: 416-978-2755 > C: 416-505-1477 > > ---------------------------------------------------------------- > This message was sent using IMP at SciNet Consortium, University of Toronto. > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ? > Kevin Buterbaugh - Senior System Administrator > Vanderbilt University - Advanced Computing Center for Research and Education > Kevin.Buterbaugh at vanderbilt.edu - > (615)875-9633 > > > > ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. From Kevin.Buterbaugh at Vanderbilt.Edu Wed Aug 10 22:00:26 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 10 Aug 2016 21:00:26 +0000 Subject: [gpfsug-discuss] User group meeting at SC16? Message-ID: Hi All, Just got an e-mail from DDN announcing that they are holding their user group meeting at SC16 on Monday afternoon like they always do, which is prompting me to inquire if IBM is going to be holding a meeting at SC16? Last year in Austin the IBM meeting was on Sunday afternoon, which worked out great as far as I was concerned. Thanks? ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From Robert.Oesterlin at nuance.com Wed Aug 10 22:04:11 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Wed, 10 Aug 2016 21:04:11 +0000 Subject: [gpfsug-discuss] User group meeting at SC16? In-Reply-To: References: Message-ID: <95126B16-B4DB-4406-862B-AA81E37F04E6@nuance.com> We're still trying to schedule that - The thinking right now is staying where last year. (Sunday afternoon) There is never a perfect time at these sorts of event - bound to step on something! If anyone has feedback (positive or negative) - let us know. Look for a formal announcement in early September. Bob Oesterlin GPFS-UG Co-Principal Sr Storage Engineer, Nuance HPC Grid From: on behalf of "Buterbaugh, Kevin L" Reply-To: gpfsug main discussion list Date: Wednesday, August 10, 2016 at 4:00 PM To: gpfsug main discussion list Subject: [EXTERNAL] [gpfsug-discuss] User group meeting at SC16? Hi All, Just got an e-mail from DDN announcing that they are holding their user group meeting at SC16 on Monday afternoon like they always do, which is prompting me to inquire if IBM is going to be holding a meeting at SC16? Last year in Austin the IBM meeting was on Sunday afternoon, which worked out great as far as I was concerned. Thanks? ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From malone12 at illinois.edu Wed Aug 10 22:43:15 2016 From: malone12 at illinois.edu (Maloney, John Daniel) Date: Wed, 10 Aug 2016 21:43:15 +0000 Subject: [gpfsug-discuss] User group meeting at SC16? Message-ID: <4AD486D7-D452-465A-85EC-1BDDE2C5DCFD@illinois.edu> Hi Bob, Thanks for the update! The couple storage folks from NCSA going to SC16 won?t be available Sunday (I?m not able to get in until Monday morning). Agree completely there is never a perfect time, just giving our feedback. Thanks again, J.D. Maloney Storage Engineer | Storage Enabling Technologies Group National Center for Supercomputing Applications (NCSA) From: > on behalf of "Oesterlin, Robert" > Reply-To: gpfsug main discussion list > Date: Wednesday, August 10, 2016 at 4:04 PM To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] User group meeting at SC16? We're still trying to schedule that - The thinking right now is staying where last year. (Sunday afternoon) There is never a perfect time at these sorts of event - bound to step on something! If anyone has feedback (positive or negative) - let us know. Look for a formal announcement in early September. Bob Oesterlin GPFS-UG Co-Principal Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Buterbaugh, Kevin L" > Reply-To: gpfsug main discussion list > Date: Wednesday, August 10, 2016 at 4:00 PM To: gpfsug main discussion list > Subject: [EXTERNAL] [gpfsug-discuss] User group meeting at SC16? Hi All, Just got an e-mail from DDN announcing that they are holding their user group meeting at SC16 on Monday afternoon like they always do, which is prompting me to inquire if IBM is going to be holding a meeting at SC16? Last year in Austin the IBM meeting was on Sunday afternoon, which worked out great as far as I was concerned. Thanks? ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From aaron.s.knister at nasa.gov Thu Aug 11 05:47:17 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Thu, 11 Aug 2016 00:47:17 -0400 Subject: [gpfsug-discuss] GPFS and SELinux Message-ID: Hi Everyone, I'm passing this along on behalf of one of our security guys. Just wondering what feedback/thoughts others have on the topic. Current IBM guidance on GPFS and SELinux indicates that the default context for services (initrc_t) is insufficient for GPFS operations. See: https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/General+Parallel+File+System+(GPFS)/page/Using+GPFS+with+SElinux That part is true (by design), but IBM goes further to say use runcon out of rc.local and configure the gpfs service to not start via init. I believe these latter two (rc.local/runcon and no-init) can be addressed, relatively trivially, through the application of a small selinux policy. Ideally, I would hope for IBM to develop, test, and send out the policy, but I'm happy to offer the following suggestions. I believe "a)" could be developed in a relatively short period of time. "b)" would take more time, effort and experience. a) consider SELinux context transition. As an example, consider: https://github.com/TresysTechnology/refpolicy/tree/master/policy/modules/services (specifically, the ssh components) On a normal centOS/RHEL system sshd has the file context of sshd_exec_t, and runs under sshd_t Referencing ssh.te, you see several references to sshd_exec_t in: domtrans_pattern init_daemon_domain daemontools_service_domain (and so on) These configurations allow init to fire sshd off, setting its runtime context to sshd_t, based on the file context of sshd_exec_t. This should be duplicable for the gpfs daemon, altho I note it seems to be fired through a layer of abstraction in mmstartup. A simple policy that allows INIT to transition GPFS to unconfined_t would go a long way towards easing integration. b) file contexts of gpfs_daemon_t and gpfs_util_t, perhaps, that when executed, would pick up a context of gpfs_t? Which then could be mapped through standard SELinux policy to allow access to configuration files (gpfs_etc_t?), block devices, etc? I admit, in b, I am speculating heavily. -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From janfrode at tanso.net Thu Aug 11 10:54:27 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Thu, 11 Aug 2016 11:54:27 +0200 Subject: [gpfsug-discuss] GPFS and SELinux In-Reply-To: References: Message-ID: I believe the runcon part is no longer necessary, at least on my RHEL7 based systems mmfsd is running unconfined by default: [root at flexscale01 ~]# ps -efZ|grep mmfsd unconfined_u:unconfined_r:unconfined_t:s0-s0:c0.c1023 root 18018 17709 0 aug.05 ? 00:24:53 /usr/lpp/mmfs/bin/mmfsd and I've never seen any problems with that for base GPFS. I suspect doing a proper selinux domain for GPFS will be quite close to unconfined, so maybe not worth the effort... -jf On Thu, Aug 11, 2016 at 6:47 AM, Aaron Knister wrote: > Hi Everyone, > > I'm passing this along on behalf of one of our security guys. Just > wondering what feedback/thoughts others have on the topic. > > > Current IBM guidance on GPFS and SELinux indicates that the default > context for services (initrc_t) is insufficient for GPFS operations. > > See: > https://www.ibm.com/developerworks/community/wikis/home? 
> lang=en#!/wiki/General+Parallel+File+System+(GPFS)/ > page/Using+GPFS+with+SElinux > > > That part is true (by design), but IBM goes further to say use runcon > out of rc.local and configure the gpfs service to not start via init. > > I believe these latter two (rc.local/runcon and no-init) can be > addressed, relatively trivially, through the application of a small > selinux policy. > > Ideally, I would hope for IBM to develop, test, and send out the policy, > but I'm happy to offer the following suggestions. I believe "a)" could > be developed in a relatively short period of time. "b)" would take more > time, effort and experience. > > a) consider SELinux context transition. > > As an example, consider: > https://github.com/TresysTechnology/refpolicy/tree/master/ > policy/modules/services > > > (specifically, the ssh components) > > On a normal centOS/RHEL system sshd has the file context of sshd_exec_t, > and runs under sshd_t > > Referencing ssh.te, you see several references to sshd_exec_t in: > domtrans_pattern > init_daemon_domain > daemontools_service_domain > (and so on) > > These configurations allow init to fire sshd off, setting its runtime > context to sshd_t, based on the file context of sshd_exec_t. > > This should be duplicable for the gpfs daemon, altho I note it seems to > be fired through a layer of abstraction in mmstartup. > > A simple policy that allows INIT to transition GPFS to unconfined_t > would go a long way towards easing integration. > > b) file contexts of gpfs_daemon_t and gpfs_util_t, perhaps, that when > executed, would pick up a context of gpfs_t? Which then could be mapped > through standard SELinux policy to allow access to configuration files > (gpfs_etc_t?), block devices, etc? > > I admit, in b, I am speculating heavily. > > > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From douglasof at us.ibm.com Fri Aug 12 20:40:27 2016 From: douglasof at us.ibm.com (Douglas O'flaherty) Date: Fri, 12 Aug 2016 19:40:27 +0000 Subject: [gpfsug-discuss] HPCwire Readers Choice Message-ID: Reminder... Get your stories in today! To view this email in your browser, click here. Last Call for Readers' Choice Award Nominations! Deadline: Friday, August 12th at 11:50pm! Only 3 days left until nominations for the 2016 HPCwire Readers' Choice Awards come to a close! Be sure to submit your picks for the best in HPC and make your voice heard before it's too late! These annual awards are a way for our community to recognize the best and brightest innovators within the global HPC community. Time is running out for you to nominate what you think are the greatest achievements in HPC for 2016, so cast your ballot today! 
The 2016 Categories Include the Following: * Best Use of HPC Application in Life Sciences * Best Use of HPC Application in Manufacturing * Best Use of HPC Application in Energy (previously 'Oil and Gas') * Best Use of HPC in Automotive * Best Use of HPC in Financial Services * Best Use of HPC in Entertainment * Best Use of HPC in the Cloud * Best Use of High Performance Data Analytics * Best Implementation of Energy-Efficient HPC * Best HPC Server Product or Technology * Best HPC Storage Product or Technology * Best HPC Software Product or Technology * Best HPC Visualization Product or Technology * Best HPC Interconnect Product or Technology * Best HPC Cluster Solution or Technology * Best Data-Intensive System (End-User Focused) * Best HPC Collaboration Between Government & Industry * Best HPC Collaboration Between Academia & Industry * Top Supercomputing Achievement * Top 5 New Products or Technologies to Watch * Top 5 Vendors to Watch * Workforce Diversity Leadership Award * Outstanding Leadership in HPC

Nominations are accepted from readers, users, vendors - virtually anyone who is connected to the HPC community and is a reader of HPCwire. Nominations will close on August 12, 2016 at 11:59pm. Make your voice heard! Help tell the story of HPC in 2016 by submitting your nominations for the HPCwire Readers' Choice Awards now! Nominations close on August 12, 2016. All nominations are subject to review by the editors of HPCwire with only the most relevant being accepted. Voting begins August 22, 2015. The final presentation of these prestigious and highly anticipated awards to each organization's leading executives will take place live during SC '16 in Salt Lake City, UT. The finalist(s) in each category who receive the most votes will win this year's awards. Open to HPCwire readers only.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
From r.sobey at imperial.ac.uk Mon Aug 15 10:59:34 2016
From: r.sobey at imperial.ac.uk (Sobey, Richard A)
Date: Mon, 15 Aug 2016 09:59:34 +0000
Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems?
Message-ID:

Hi all, If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger it over a few days, perhaps up to 2 weeks or will I run into problems if they're on different versions? Cheers Richard
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From Robert.Oesterlin at nuance.com Mon Aug 15 12:22:31 2016
From: Robert.Oesterlin at nuance.com (Oesterlin, Robert)
Date: Mon, 15 Aug 2016 11:22:31 +0000
Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems?
In-Reply-To: References: Message-ID:

In general, yes, it's common practice to do the 'rolling upgrades'. If I had to do my whole cluster at once, with an outage, I'd probably never upgrade. :)

Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid

From: on behalf of "Sobey, Richard A" Reply-To: gpfsug main discussion list Date: Monday, August 15, 2016 at 4:59 AM To: "'gpfsug-discuss at spectrumscale.org'" Subject: [EXTERNAL] [gpfsug-discuss] Minor GPFS versions coexistence problems?

Hi all, If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger it over a few days, perhaps up to 2 weeks or will I run into problems if they?re on different versions? Cheers Richard
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From Kevin.Buterbaugh at Vanderbilt.Edu Mon Aug 15 13:45:25 2016
From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L)
Date: Mon, 15 Aug 2016 12:45:25 +0000
Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems?
In-Reply-To: References: Message-ID: <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu>

Richard, I will second what Bob said with one caveat ? on one occasion we had an issue with our multi-cluster setup because the PTF?s were incompatible. However, that was clearly documented in the release notes, which we obviously hadn?t read carefully enough.
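(For anyone following along, the per-node sequence for this sort of rolling PTF update on 3.5.x is roughly the one below. The package file names and the portability-layer rebuild step are written from memory, so treat it as a sketch and follow the README shipped with the update:)

# on each NSD server in turn, once its disks are being served by a partner node:
mmumount all
mmshutdown
rpm -Uvh gpfs.base-3.5.0-27.x86_64.update.rpm gpfs.gpl-3.5.0-27.noarch.rpm \
         gpfs.docs-3.5.0-27.noarch.rpm gpfs.msg.en_US-3.5.0-27.noarch.rpm
cd /usr/lpp/mmfs/src && make Autoconfig && make World && make InstallImages   # rebuild the portability layer
mmstartup
mmgetstate        # wait for "active" before moving on to the next node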
While we generally do rolling upgrades over a two to three week period, we have run for months with clients at differing PTF levels. HTHAL? Kevin On Aug 15, 2016, at 6:22 AM, Oesterlin, Robert > wrote: In general, yes, it's common practice to do the 'rolling upgrades'. If I had to do my whole cluster at once, with an outage, I'd probably never upgrade. :) Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Sobey, Richard A" > Reply-To: gpfsug main discussion list > Date: Monday, August 15, 2016 at 4:59 AM To: "'gpfsug-discuss at spectrumscale.org'" > Subject: [EXTERNAL] [gpfsug-discuss] Minor GPFS versions coexistence problems? Hi all, If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger it over a few days, perhaps up to 2 weeks or will I run into problems if they?re on different versions? Cheers Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Mon Aug 15 13:58:47 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Mon, 15 Aug 2016 12:58:47 +0000 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? In-Reply-To: <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu> References: <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu> Message-ID: Thanks Kevin and Bob. PTF = minor version? I can?t think what it might stand for. Something Time Fix? Point in time fix? From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: 15 August 2016 13:45 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Minor GPFS versions coexistence problems? Richard, I will second what Bob said with one caveat ? on one occasion we had an issue with our multi-cluster setup because the PTF?s were incompatible. However, that was clearly documented in the release notes, which we obviously hadn?t read carefully enough. While we generally do rolling upgrades over a two to three week period, we have run for months with clients at differing PTF levels. HTHAL? Kevin On Aug 15, 2016, at 6:22 AM, Oesterlin, Robert > wrote: In general, yes, it's common practice to do the 'rolling upgrades'. If I had to do my whole cluster at once, with an outage, I'd probably never upgrade. :) Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Sobey, Richard A" > Reply-To: gpfsug main discussion list > Date: Monday, August 15, 2016 at 4:59 AM To: "'gpfsug-discuss at spectrumscale.org'" > Subject: [EXTERNAL] [gpfsug-discuss] Minor GPFS versions coexistence problems? Hi all, If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger it over a few days, perhaps up to 2 weeks or will I run into problems if they?re on different versions? Cheers Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? 
Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From jamiedavis at us.ibm.com Mon Aug 15 14:02:13 2016 From: jamiedavis at us.ibm.com (James Davis) Date: Mon, 15 Aug 2016 13:02:13 +0000 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? In-Reply-To: References: , <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu> Message-ID: An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Mon Aug 15 14:05:01 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Mon, 15 Aug 2016 13:05:01 +0000 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? Message-ID: <28479088-C492-4441-A761-F49E1556E13E@nuance.com> PTF = Program Temporary Fix. IBM-Speak for a fix for a particular problem. Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: on behalf of "Sobey, Richard A" Reply-To: gpfsug main discussion list Date: Monday, August 15, 2016 at 7:58 AM To: gpfsug main discussion list Subject: [EXTERNAL] Re: [gpfsug-discuss] Minor GPFS versions coexistence problems? Thanks Kevin and Bob. PTF = minor version? I can?t think what it might stand for. Something Time Fix? Point in time fix? From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: 15 August 2016 13:45 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Minor GPFS versions coexistence problems? Richard, I will second what Bob said with one caveat ? on one occasion we had an issue with our multi-cluster setup because the PTF?s were incompatible. However, that was clearly documented in the release notes, which we obviously hadn?t read carefully enough. While we generally do rolling upgrades over a two to three week period, we have run for months with clients at differing PTF levels. HTHAL? Kevin On Aug 15, 2016, at 6:22 AM, Oesterlin, Robert > wrote: In general, yes, it's common practice to do the 'rolling upgrades'. If I had to do my whole cluster at once, with an outage, I'd probably never upgrade. :) Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Sobey, Richard A" > Reply-To: gpfsug main discussion list > Date: Monday, August 15, 2016 at 4:59 AM To: "'gpfsug-discuss at spectrumscale.org'" > Subject: [EXTERNAL] [gpfsug-discuss] Minor GPFS versions coexistence problems? Hi all, If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger it over a few days, perhaps up to 2 weeks or will I run into problems if they?re on different versions? Cheers Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From kdball at us.ibm.com Mon Aug 15 15:12:07 2016 From: kdball at us.ibm.com (Keith D Ball) Date: Mon, 15 Aug 2016 14:12:07 +0000 Subject: [gpfsug-discuss] gpfsug-discuss Digest, Vol 55, Issue 16 In-Reply-To: References: Message-ID: An HTML attachment was scrubbed... 
URL: From jake.carroll at uq.edu.au Mon Aug 15 22:08:58 2016 From: jake.carroll at uq.edu.au (Jake Carroll) Date: Mon, 15 Aug 2016 21:08:58 +0000 Subject: [gpfsug-discuss] More on AFM cache chaining Message-ID: <94AB3BCD-B551-4F3E-9128-65B582A4ABC6@uq.edu.au> Hi there. In the spirit of a conversation a friend showed me a couple of weeks ago from Radhika Parameswaran and Luke Raimbach, we?re doing something similar to Luke (kind of), or at least attempting it, in regards to cache chaining. We?ve got a large research storage platform in Brisbane, Queensland, Australia and we?re trying to leverage a few different modes of operation. Currently: Cache A (IW) connects to what would be a Home (B) which then is effectively an NFS mount to (C) a DMF based NFS export. To a point, this works. It kind of allows us to use ?home? as the ultimate sink, and data migration in and out of DMF seems to be working nicely when GPFS pulls things from (B) which don?t appear to currently be in (A) due to policy, or a HWM was hit (thus emptying cache). We?ve tested it as far out as the data ONLY being offline in tape media inside (C) and it still works, cleanly coming back to (A) within a very reasonable time-frame. ? We hit ?problem 1? which is in and around NFS v4 ACL?s which aren?t surfacing or mapping correctly (as we?d expect). I guess this might be the caveat of trying to backend the cache to a home and have it sitting inside DMF (over an NFS Export) for surfacing of the data for clients. Where we?d like to head: We haven?t seen it yet, but as Luke and Radhika were discussing last month, we really liked the idea of an IW Cache (A, where instruments dump huge data) which then via AFM ends up at (B) (might also be technically ?home? but IW) which is then also a function of (C) which might also be another cache that sits next to a HPC platform for reading and writing data into quickly and out of in parallel. We like the idea of chained caches because it gives us extremely flexibility in the premise of our ?Data anywhere? fabric. We appreciate that this has some challenges, in that we know if you?ve got multiple IW scenarios the last write will always win ? this we can control with workload guidelines. But we?d like to add our voices to this idea of having caches chained all the way back to some point such that data is being pulled all the way from C --> B --> A and along the way, inflection points of IO might be written and read at point C and point B AND point A such that everyone would see the distribution and consistent data in the end. We?re also working on surfacing data via object and file simultaneously for different needs. This is coming along relatively well, but we?re still learning about where and where this does not make sense so far. A moving target, from how it all appears on the surface. Some might say that is effectively asking for a globally eventually (always) consistent filesystem within Scale?. Anyway ? just some thoughts. Regards, -jc -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From aaron.s.knister at nasa.gov Tue Aug 16 03:22:17 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 15 Aug 2016 22:22:17 -0400 Subject: [gpfsug-discuss] mmfsadm test pit Message-ID: I just discovered this interesting gem poking at mmfsadm: test pit fsname list|suspend|status|resume|stop [jobId] There have been times where I've kicked off a restripe and either intentionally or accidentally ctrl-c'd it only to realize that many times it's disappeared into the ether and is still running. The only way I've known so far to stop it is with a chgmgr. A far more painful instance happened when I ran a rebalance on an fs w/more than 31 nsds using more than 31 pit workers and hit *that* fun APAR which locked up access for a single filesystem to all 3.5k nodes. We spent 48 hours round the clock rebooting nodes as jobs drained to clear it up. I would have killed in that instance for a way to cancel the PIT job (the chmgr trick didn't work). It looks like you might actually be able to do this with mmfsadm, although how wise this is, I do not know (kinda curious about that). Here's an example. I kicked off a restripe and then ctrl-c'd it on a client node. Then ran these commands from the fs manager: root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list JobId 785979015170 PitJobStatus PIT_JOB_RUNNING progress 0.00 debug: statusListP D40E2C70 root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal stop 785979015170 debug: statusListP 0 root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list JobId 785979015170 PitJobStatus PIT_JOB_STOPPING progress 4.01 debug: statusListP D4013E70 ... some time passes ... root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list debug: statusListP 0 Interesting. -Aaron -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From volobuev at us.ibm.com Tue Aug 16 16:21:13 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Tue, 16 Aug 2016 08:21:13 -0700 Subject: [gpfsug-discuss] 4.2.1 documentation In-Reply-To: References: <8033d4a67d9745f4a52f148538423066@exch1-cdc.nexus.csiro.au> Message-ID: Light Weight Event support is not fully baked yet, and thus not documented. It's getting there. yuri From: "Daniel Kidger" To: "gpfsug main discussion list" , Cc: "gpfsug-discuss" Date: 08/04/2016 01:23 AM Subject: Re: [gpfsug-discuss] 4.2.1 documentation Sent by: gpfsug-discuss-bounces at spectrumscale.org Yes they have been re arranged. My observation is that the Admin and Advanced Admin have merged into one PDFs, and the DMAPI manual is now a chapter of the new Programming guide (along with the complete set of man pages which have moved out of the Admin guide). Table 3 on page 26 of the Concepts, Planning and Install guide describes these change. IMHO The new format is much better as all Admin is in one place not two. ps. I couldn't find in the programming guide a chapter yet on Light Weight Events. Anyone in product development care to comment? :-) Daniel IBM Spectrum Storage Software +44 (0)7818 522266 Sent from my iPad using IBM Verse On 4 Aug 2016, 03:42:21, Greg.Lehmann at csiro.au wrote: From: Greg.Lehmann at csiro.au To: gpfsug-discuss at spectrumscale.org Cc: Date: 4 Aug 2016 03:42:21 Subject: [gpfsug-discuss] 4.2.1 documentation I see only 4 pdfs now with slightly different titles to the previous 5 pdfs available with 4.2.0. Just checking there are only supposed to be 4 now? 
Greg Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From volobuev at us.ibm.com Tue Aug 16 16:42:33 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Tue, 16 Aug 2016 08:42:33 -0700 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <20160804123409.18403cy3iz123gxt@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca><20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca><20160804115931.26601tycacksqhcz@support.scinet.utoronto.ca><7C0606E3-37D9-4301-8676-5060A0984FF2@vanderbilt.edu> <20160804123409.18403cy3iz123gxt@support.scinet.utoronto.ca> Message-ID: This is a long discussion thread, touching on several related subjects, but as far as the original "secondary groups" question, things are quite simple. A file in a Unix file system has an owning user and an owning group. Those are two IDs that are stored in the inode on disk, and those IDs are used to charge the corresponding user and group quotas. Exactly how the owning GID gets set is an entirely separate question. It may be the current user's primary group, or a secondary group, or a result of chown, etc. To GPFS code it doesn't matter what supplementary GIDs a given thread has in its security context for the purposes of charging group quota, the only thing that matters is the GID in the file inode. yuri From: "Jaime Pinto" To: "gpfsug main discussion list" , "Buterbaugh, Kevin L" , Date: 08/04/2016 09:34 AM Subject: Re: [gpfsug-discuss] quota on secondary groups for a user? Sent by: gpfsug-discuss-bounces at spectrumscale.org OK More info: Users can apply the 'sg group1' or 'sq group2' command from a shell or script to switch the group mask from that point on, and dodge the quota that may have been exceeded on a group. However, as the group owner or other member of the group on the limit, I could not find a tool they can use on their own to find out who is(are) the largest user(s); 'du' takes too long, and some users don't give read permissions on their directories. As part of the puzzle solution I have to come up with a root wrapper that can make the contents of the mmrepquota report available to them. Jaime Quoting "Buterbaugh, Kevin L" : > Hi Jaime, > > Thank you sooooo much for doing this and reporting back the results! > They?re in line with what I would expect to happen. I was going > to test this as well, but we have had to extend our downtime until > noontime tomorrow, so I haven?t had a chance to do so yet. Now I > don?t have to? ;-) > > Kevin > > On Aug 4, 2016, at 10:59 AM, Jaime Pinto > > wrote: > > Since there were inconsistencies in the responses, I decided to rig > a couple of accounts/groups on our LDAP to test "My interpretation", > and determined that I was wrong. 
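(A quick way to see Yuri's point above in practice: the GID recorded in the file's inode is what gets charged, regardless of the writer's primary group. Paths and names below are made up:)

sg group2 -c 'touch /gpfs/gpfs01/proj/newfile'    # create a file with the secondary group active
stat -c '%U:%G %n' /gpfs/gpfs01/proj/newfile      # owning UID/GID stored in the inode
chgrp group1 /gpfs/gpfs01/proj/newfile            # re-charges the file against group1's quota instead
mmrepquota -g gpfs01 | egrep 'group1|group2'      # group totals follow the owning GID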
When Kevin mentioned it would mean > a bug I had to double-check: > > If a user hits the hard quota or exceeds the grace period on the > soft quota on any of the secondary groups that user will be stopped > from further writing to those groups as well, just as in the primary > group. > > I hope this clears the waters a bit. I still have to solve my puzzle. > > Thanks everyone for the feedback. > Jaime > > > > Quoting "Jaime Pinto" > >: > > Quoting "Buterbaugh, Kevin L" > >: > > Hi Sven, > > Wait - am I misunderstanding something here? Let?s say that I have > ?user1? who has primary group ?group1? and secondary group > ?group2?. And let?s say that they write to a directory where the > bit on the directory forces all files created in that directory to > have group2 associated with them. Are you saying that those files > still count against group1?s group quota??? > > Thanks for clarifying? > > Kevin > > Not really, > > My interpretation is that all files written with group2 will count > towards the quota on that group. However any users with group2 as the > primary group will be prevented from writing any further when the > group2 quota is reached. However the culprit user1 with primary group > as group1 won't be detected by gpfs, and can just keep going on writing > group2 files. > > As far as the individual user quota, it doesn't matter: group1 or > group2 it will be counted towards the usage of that user. > > It would be interesting if the behavior was more as expected. I just > checked with my Lustre counter-parts and they tell me whichever > secondary group is hit first, however many there may be, the user will > be stopped. The problem then becomes identifying which of the secondary > groups hit the limit for that user. > > Jaime > > > > On Aug 3, 2016, at 11:35 AM, Sven Oehme > > > wrote: > > Hi, > > quotas are only counted against primary group > > sven > > > On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > < mailto:pinto at scinet.utoronto.ca>> > wrote: > Suppose I want to set both USR and GRP quotas for a user, however > GRP is not the primary group. Will gpfs enforce the secondary group > quota for that user? > > What I mean is, if the user keeps writing files with secondary > group as the attribute, and that overall group quota is reached, > will that user be stopped by gpfs? > > Thanks > Jaime > > > > > ************************************ > TELL US ABOUT YOUR SUCCESS STORIES > http://www.scinethpc.ca/testimonials > ************************************ > --- > Jaime Pinto > SciNet HPC Consortium - Compute/Calcul Canada > www.scinet.utoronto.ca< http://www.scinet.utoronto.ca/> - > www.computecanada.org< http://www.computecanada.org/> > University of Toronto > 256 McCaul Street, Room 235 > Toronto, ON, M5T1W5 > P: 416-978-2755 > C: 416-505-1477 > > > > ---------------------------------------------------------------- > This message was sent using IMP at SciNet Consortium, University of Toronto. 
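One possible shape for that root wrapper is a small script run through sudo that filters the mmrepquota group report down to the caller's own groups (the script name, file system name and report layout are assumptions, not anything from this thread):

#!/bin/bash
# /usr/local/sbin/groupquota -- intended to be invoked via a sudoers rule
fs=gpfs01                                       # example file system
caller=${SUDO_USER:-$USER}
membership=$(id -Gn "$caller" | tr ' ' '|')     # groups the caller belongs to
# keep the report headers plus only the rows for the caller's groups;
# adjust the pattern if the mmrepquota column layout differs on your release
/usr/lpp/mmfs/bin/mmrepquota -g "$fs" | egrep "Block|Name|^(${membership})[[:space:]]"

A matching sudoers entry such as '%labusers ALL=(root) NOPASSWD: /usr/local/sbin/groupquota' would let group members run only this report and nothing else.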
> > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > > > ************************************ > TELL US ABOUT YOUR SUCCESS STORIES > http://www.scinethpc.ca/testimonials > ************************************ > --- > Jaime Pinto > SciNet HPC Consortium - Compute/Calcul Canada > www.scinet.utoronto.ca - > www.computecanada.org > University of Toronto > 256 McCaul Street, Room 235 > Toronto, ON, M5T1W5 > P: 416-978-2755 > C: 416-505-1477 > > ---------------------------------------------------------------- > This message was sent using IMP at SciNet Consortium, University of Toronto. > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ? > Kevin Buterbaugh - Senior System Administrator > Vanderbilt University - Advanced Computing Center for Research and Education > Kevin.Buterbaugh at vanderbilt.edu - > (615)875-9633 > > > > ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From Robert.Oesterlin at nuance.com Tue Aug 16 16:59:13 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Tue, 16 Aug 2016 15:59:13 +0000 Subject: [gpfsug-discuss] Attending IBM Edge? Sessions of note and possible meet-up Message-ID: <29EA4D63-8885-42C5-876C-D68EB9E1CFDE@nuance.com> For those of you on the mailing list attending the IBM Edge conference in September, there will be at least one NDA session on Spectrum Scale and its future directions. I've heard that there will be a session on licensing as well. (always a hot topic). I have a couple of talks: Spectrum Scale with Transparent Cloud Tiering and on Spectrum Scale with Spectrum Control. I'll try and organize some sort of informal meetup one of the nights - thoughts on when would be welcome. Probably not Tuesday night, as that's the entertainment night. :-) Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid -------------- next part -------------- An HTML attachment was scrubbed... URL: From jfosburg at mdanderson.org Tue Aug 16 17:13:17 2016 From: jfosburg at mdanderson.org (Fosburgh,Jonathan) Date: Tue, 16 Aug 2016 16:13:17 +0000 Subject: [gpfsug-discuss] Attending IBM Edge? 
Sessions of note and possible meet-up In-Reply-To: <29EA4D63-8885-42C5-876C-D68EB9E1CFDE@nuance.com> References: <29EA4D63-8885-42C5-876C-D68EB9E1CFDE@nuance.com> Message-ID: <57c145ab-4207-7550-af57-ff07d6ac8f2d@mdanderson.org> I am speaking: SNP-2408 : Implementing a Research Storage Environment Using IBM Spectrum Software at MD Anderson Cancer Center Program : Enabling Cognitive IT with Storage and Software Defined Solutions Track : Building Oceans of Data Session Type : Breakout Session Date/Time : Tue, 20-Sep, 05:00 PM-06:00 PM Location : MGM Grand - Room 104 Presenter(s):Jonathan Fosburgh, UT MD Anderson This is primarily dealing with Scale and Archive, and also includes Protect. -- Jonathan Fosburgh Principal Application Systems Analyst Storage Team IT Operations jfosburg at mdanderson.org (713) 745-9346 On 08/16/2016 10:59 AM, Oesterlin, Robert wrote: For those of you on the mailing list attending the IBM Edge conference in September, there will be at least one NDA session on Spectrum Scale and its future directions. I've heard that there will be a session on licensing as well. (always a hot topic). I have a couple of talks: Spectrum Scale with Transparent Cloud Tiering and on Spectrum Scale with Spectrum Control. I'll try and organize some sort of informal meetup one of the nights - thoughts on when would be welcome. Probably not Tuesday night, as that's the entertainment night. :-) Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Tue Aug 16 22:09:35 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Tue, 16 Aug 2016 17:09:35 -0400 Subject: [gpfsug-discuss] mmfsadm test pit In-Reply-To: References: Message-ID: I was surprised to read that Ctrl-C did not really kill restripe. It's supposed to! If it doesn't that's a bug. I ran this by my expert within IBM and he wrote to me: First of all a "PIT job" such as restripe, deldisk, delsnapshot, and such should be easy to stop by ^C the management program that started them. The SG manager daemon holds open a socket to the client program for the purposes of sending command output, progress updates, error messages and the like. The PIT code checks this socket periodically and aborts the PIT process cleanly if the socket is closed. If this cleanup doesn't occur, it is a bug and should be worth reporting. However, there's no exact guarantee on how quickly each thread on the SG mgr will notice and then how quickly the helper nodes can be stopped and so forth. 
The interval between socket checks depends among other things on how long it takes to process each file, if there are a few very large files, the delay can be significant. In the limiting case, where most of the FS storage is contained in a few files, this mechanism doesn't work [elided] well. So it can be quite involved and slow sometimes to wrap up a PIT operation. The simplest way to determine if the command has really stopped is with the mmdiag --commands issued on the SG manager node. This shows running commands with the command line, start time, socket, flags, etc. After ^Cing the client program, the entry here should linger for a while, then go away. When it exits you'll see an entry in the GPFS log file where it fails with err 50. If this doesn't stop the command after a while, it is worth looking into. If the command wasn't issued on the SG mgr node and you can't find the where the client command is running, the socket is still a useful hint. While tedious, it should be possible to trace this socket back to node where that command was originally run using netstat or equivalent. Poking around inside a GPFS internaldump will also provide clues; there should be an outstanding sgmMsgSGClientCmd command listed in the dump tscomm section. Once you find it, just 'kill `pidof mmrestripefs` or similar. I'd like to warn the OP away from mmfsadm test pit. These commands are of course unsupported and unrecommended for any purpose (even internal test and development purposes, as far as I know). You are definitely working without a net there. When I was improving the integration between PIT and snapshot quiesce a few years ago, I looked into this and couldn't figure out how to (easily) make these stop and resume commands safe to use, so as far as I know they remain unsafe. The list command, however, is probably fairly okay; but it would probably be better to use mmfsadm saferdump pit. From: Aaron Knister To: Date: 08/15/2016 10:49 PM Subject: [gpfsug-discuss] mmfsadm test pit Sent by: gpfsug-discuss-bounces at spectrumscale.org I just discovered this interesting gem poking at mmfsadm: test pit fsname list|suspend|status|resume|stop [jobId] There have been times where I've kicked off a restripe and either intentionally or accidentally ctrl-c'd it only to realize that many times it's disappeared into the ether and is still running. The only way I've known so far to stop it is with a chgmgr. A far more painful instance happened when I ran a rebalance on an fs w/more than 31 nsds using more than 31 pit workers and hit *that* fun APAR which locked up access for a single filesystem to all 3.5k nodes. We spent 48 hours round the clock rebooting nodes as jobs drained to clear it up. I would have killed in that instance for a way to cancel the PIT job (the chmgr trick didn't work). It looks like you might actually be able to do this with mmfsadm, although how wise this is, I do not know (kinda curious about that). Here's an example. I kicked off a restripe and then ctrl-c'd it on a client node. Then ran these commands from the fs manager: root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list JobId 785979015170 PitJobStatus PIT_JOB_RUNNING progress 0.00 debug: statusListP D40E2C70 root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal stop 785979015170 debug: statusListP 0 root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list JobId 785979015170 PitJobStatus PIT_JOB_STOPPING progress 4.01 debug: statusListP D4013E70 ... some time passes ... 
root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list debug: statusListP 0 Interesting. -Aaron -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Tue Aug 16 22:55:19 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Tue, 16 Aug 2016 17:55:19 -0400 Subject: [gpfsug-discuss] mmfsadm test pit In-Reply-To: References: Message-ID: Thanks Marc! That's incredibly helpful info. I'll uh, not use the test pit command :) -Aaron On 8/16/16 5:09 PM, Marc A Kaplan wrote: > I was surprised to read that Ctrl-C did not really kill restripe. It's > supposed to! If it doesn't that's a bug. > > I ran this by my expert within IBM and he wrote to me: > > First of all a "PIT job" such as restripe, deldisk, delsnapshot, and > such should be easy to stop by ^C the management program that started > them. The SG manager daemon holds open a socket to the client program > for the purposes of sending command output, progress updates, error > messages and the like. The PIT code checks this socket periodically and > aborts the PIT process cleanly if the socket is closed. If this cleanup > doesn't occur, it is a bug and should be worth reporting. However, > there's no exact guarantee on how quickly each thread on the SG mgr will > notice and then how quickly the helper nodes can be stopped and so > forth. The interval between socket checks depends among other things on > how long it takes to process each file, if there are a few very large > files, the delay can be significant. In the limiting case, where most > of the FS storage is contained in a few files, this mechanism doesn't > work [elided] well. So it can be quite involved and slow sometimes to > wrap up a PIT operation. > > The simplest way to determine if the command has really stopped is with > the mmdiag --commands issued on the SG manager node. This shows running > commands with the command line, start time, socket, flags, etc. After > ^Cing the client program, the entry here should linger for a while, then > go away. When it exits you'll see an entry in the GPFS log file where > it fails with err 50. If this doesn't stop the command after a while, > it is worth looking into. > > If the command wasn't issued on the SG mgr node and you can't find the > where the client command is running, the socket is still a useful hint. > While tedious, it should be possible to trace this socket back to node > where that command was originally run using netstat or equivalent. > Poking around inside a GPFS internaldump will also provide clues; there > should be an outstanding sgmMsgSGClientCmd command listed in the dump > tscomm section. Once you find it, just 'kill `pidof mmrestripefs` or > similar. > > I'd like to warn the OP away from mmfsadm test pit. These commands are > of course unsupported and unrecommended for any purpose (even internal > test and development purposes, as far as I know). You are definitely > working without a net there. When I was improving the integration > between PIT and snapshot quiesce a few years ago, I looked into this and > couldn't figure out how to (easily) make these stop and resume commands > safe to use, so as far as I know they remain unsafe. 
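As a rough sketch of the checking sequence Marc describes (the file system and node names below are made up):

/usr/lpp/mmfs/bin/mmlsmgr gpfs01                  # note which node is the fs (SG) manager
ssh nsd01 /usr/lpp/mmfs/bin/mmdiag --commands     # look for a lingering mmrestripefs entry
# once the node that originally launched the restripe has been identified
# (e.g. by tracing the socket shown in the output), on that node:
kill $(pidof mmrestripefs)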
The list command, > however, is probably fairly okay; but it would probably be better to use > mmfsadm saferdump pit. > > > > > > From: Aaron Knister > To: > Date: 08/15/2016 10:49 PM > Subject: [gpfsug-discuss] mmfsadm test pit > Sent by: gpfsug-discuss-bounces at spectrumscale.org > ------------------------------------------------------------------------ > > > > I just discovered this interesting gem poking at mmfsadm: > > test pit fsname list|suspend|status|resume|stop [jobId] > > There have been times where I've kicked off a restripe and either > intentionally or accidentally ctrl-c'd it only to realize that many > times it's disappeared into the ether and is still running. The only way > I've known so far to stop it is with a chgmgr. > > A far more painful instance happened when I ran a rebalance on an fs > w/more than 31 nsds using more than 31 pit workers and hit *that* fun > APAR which locked up access for a single filesystem to all 3.5k nodes. > We spent 48 hours round the clock rebooting nodes as jobs drained to > clear it up. I would have killed in that instance for a way to cancel > the PIT job (the chmgr trick didn't work). It looks like you might > actually be able to do this with mmfsadm, although how wise this is, I > do not know (kinda curious about that). > > Here's an example. I kicked off a restripe and then ctrl-c'd it on a > client node. Then ran these commands from the fs manager: > > root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list > JobId 785979015170 PitJobStatus PIT_JOB_RUNNING progress 0.00 > debug: statusListP D40E2C70 > > root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal stop > 785979015170 > debug: statusListP 0 > > root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list > JobId 785979015170 PitJobStatus PIT_JOB_STOPPING progress 4.01 > debug: statusListP D4013E70 > > ... some time passes ... > > root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list > debug: statusListP 0 > > Interesting. > > -Aaron > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From aaron.s.knister at nasa.gov Wed Aug 17 02:46:39 2016 From: aaron.s.knister at nasa.gov (Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]) Date: Wed, 17 Aug 2016 01:46:39 +0000 Subject: [gpfsug-discuss] Monitor NSD server queue? Message-ID: <5F910253243E6A47B81A9A2EB424BBA101CC6514@NDJSMBX404.ndc.nasa.gov> Hi Everyone, We ran into a rather interesting situation over the past week. We had a job that was pounding the ever loving crap out of one of our filesystems (called dnb02) doing about 15GB/s of reads. We had other jobs experience a slowdown on a different filesystem (called dnb41) that uses entirely separate backend storage. What I can't figure out is why this other filesystem was affected. I've checked IB bandwidth and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth congestion, looked at the mmpmon nsd_ds counters (including disk request wait time), and checked out the disk iowait values from collectl. 
I simply can't account for the slowdown on the other filesystem. The only thing I can think of is the high latency on dnb02's NSDs caused the mmfsd NSD queues to back up. Here's my question-- how can I monitor the state of th NSD queues? I can't find anything in mmdiag. An mmfsadm saferdump NSD shows me the queues and their status. I'm just not sure calling saferdump NSD every 10 seconds to monitor this data is going to end well. I've seen saferdump NSD cause mmfsd to die and that's from a task we only run every 6 hours that calls saferdump NSD. Any thoughts/ideas here would be great. Thanks! -Aaron -------------- next part -------------- An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Wed Aug 17 12:45:04 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Wed, 17 Aug 2016 11:45:04 +0000 Subject: [gpfsug-discuss] Monitor NSD server queue? In-Reply-To: <5F910253243E6A47B81A9A2EB424BBA101CC6514@NDJSMBX404.ndc.nasa.gov> References: <5F910253243E6A47B81A9A2EB424BBA101CC6514@NDJSMBX404.ndc.nasa.gov> Message-ID: <7BFE2D50-9AA9-4A78-A05A-08D5DEB0A2E1@nuance.com> Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latency on the disk subsystem causing a backup in the NSD queues. I was doing what you suggested not to do - "mmfsadm saferdump nsd' and looking at the queues. In my case 'mmfsadm saferdump" would usually work or hang, rather than kill mmfsd. But - the hang usually resulted it a tied up thread in mmfsd, so that's no good either. I wish I had better news - this is the only way I've found to get visibility to these queues. IBM hasn't seen fit to gives us a way to safely look at these. I personally think it's a bug that we can't safely dump these structures, as they give insight as to what's actually going on inside the NSD server. Yuri, Sven - thoughts? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: on behalf of "Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]" Reply-To: gpfsug main discussion list Date: Tuesday, August 16, 2016 at 8:46 PM To: gpfsug main discussion list Subject: [EXTERNAL] [gpfsug-discuss] Monitor NSD server queue? Hi Everyone, We ran into a rather interesting situation over the past week. We had a job that was pounding the ever loving crap out of one of our filesystems (called dnb02) doing about 15GB/s of reads. We had other jobs experience a slowdown on a different filesystem (called dnb41) that uses entirely separate backend storage. What I can't figure out is why this other filesystem was affected. I've checked IB bandwidth and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth congestion, looked at the mmpmon nsd_ds counters (including disk request wait time), and checked out the disk iowait values from collectl. I simply can't account for the slowdown on the other filesystem. The only thing I can think of is the high latency on dnb02's NSDs caused the mmfsd NSD queues to back up. Here's my question-- how can I monitor the state of th NSD queues? I can't find anything in mmdiag. An mmfsadm saferdump NSD shows me the queues and their status. I'm just not sure calling saferdump NSD every 10 seconds to monitor this data is going to end well. I've seen saferdump NSD cause mmfsd to die and that's from a task we only run every 6 hours that calls saferdump NSD. Any thoughts/ideas here would be great. Thanks! -Aaron -------------- next part -------------- An HTML attachment was scrubbed... 
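If periodic sampling of the queues is attempted despite the caveats above, a very conservative loop along these lines at least bounds the exposure (the interval, the timeout and the grep pattern are guesses; the dump output format is undocumented and the risks discussed in this thread still apply):

while sleep 300; do                 # sample every 5 minutes rather than every 10 seconds
    date
    timeout 30 /usr/lpp/mmfs/bin/mmfsadm saferdump nsd 2>/dev/null | grep -i queue
done >> /var/log/nsd-queue-sample.log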
URL: From volobuev at us.ibm.com Wed Aug 17 21:34:57 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Wed, 17 Aug 2016 13:34:57 -0700 Subject: [gpfsug-discuss] Monitor NSD server queue? In-Reply-To: <7BFE2D50-9AA9-4A78-A05A-08D5DEB0A2E1@nuance.com> References: <5F910253243E6A47B81A9A2EB424BBA101CC6514@NDJSMBX404.ndc.nasa.gov> <7BFE2D50-9AA9-4A78-A05A-08D5DEB0A2E1@nuance.com> Message-ID: Unfortunately, at the moment there's no safe mechanism to show the usage statistics for different NSD queues. "mmfsadm saferdump nsd" as implemented doesn't acquire locks when parsing internal data structures. Now, NSD data structures are fairly static, as much things go, so the risk of following a stale pointer and hitting a segfault isn't particularly significant. I don't think I remember ever seeing mmfsd crash with NSD dump code on the stack. That said, this isn't code that's tested and known to be safe for production use. I haven't seen a case myself where an mmfsd thread gets stuck running this dump command, either, but Bob has. If that condition ever reoccurs, I'd be interested in seeing debug data. I agree that there's value in giving a sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics. mmdiag should be enhanced to allow this. That'd be a very reasonable (and doable) RFE. yuri From: "Oesterlin, Robert" To: gpfsug main discussion list , Date: 08/17/2016 04:45 AM Subject: Re: [gpfsug-discuss] Monitor NSD server queue? Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latency on the disk subsystem causing a backup in the NSD queues. I was doing what you suggested not to do - "mmfsadm saferdump nsd' and looking at the queues. In my case 'mmfsadm saferdump" would usually work or hang, rather than kill mmfsd. But - the hang usually resulted it a tied up thread in mmfsd, so that's no good either. I wish I had better news - this is the only way I've found to get visibility to these queues. IBM hasn't seen fit to gives us a way to safely look at these. I personally think it's a bug that we can't safely dump these structures, as they give insight as to what's actually going on inside the NSD server. Yuri, Sven - thoughts? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: on behalf of "Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]" Reply-To: gpfsug main discussion list Date: Tuesday, August 16, 2016 at 8:46 PM To: gpfsug main discussion list Subject: [EXTERNAL] [gpfsug-discuss] Monitor NSD server queue? Hi Everyone, We ran into a rather interesting situation over the past week. We had a job that was pounding the ever loving crap out of one of our filesystems (called dnb02) doing about 15GB/s of reads. We had other jobs experience a slowdown on a different filesystem (called dnb41) that uses entirely separate backend storage. What I can't figure out is why this other filesystem was affected. I've checked IB bandwidth and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth congestion, looked at the mmpmon nsd_ds counters (including disk request wait time), and checked out the disk iowait values from collectl. I simply can't account for the slowdown on the other filesystem. The only thing I can think of is the high latency on dnb02's NSDs caused the mmfsd NSD queues to back up. Here's my question-- how can I monitor the state of th NSD queues? I can't find anything in mmdiag. 
An mmfsadm saferdump NSD shows me the queues and their status. I'm just not sure calling saferdump NSD every 10 seconds to monitor this data is going to end well. I've seen saferdump NSD cause mmfsd to die and that's from a task we only run every 6 hours that calls saferdump NSD. Any thoughts/ideas here would be great. Thanks! -Aaron_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From SAnderson at convergeone.com Wed Aug 17 22:11:25 2016 From: SAnderson at convergeone.com (Shaun Anderson) Date: Wed, 17 Aug 2016 21:11:25 +0000 Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Message-ID: <1471468285737.63407@convergeone.com> ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 [sig] [RH_CertifiedSysAdmin_CMYK] [Linux on IBM Power Systems - Sales 2016] [IBM Spectrum Storage - Sales 2016] NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.png Type: image/png Size: 14134 bytes Desc: image001.png URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image003.jpg Type: image/jpeg Size: 2593 bytes Desc: image003.jpg URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image005.png Type: image/png Size: 11635 bytes Desc: image005.png URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image007.png Type: image/png Size: 11505 bytes Desc: image007.png URL: From YARD at il.ibm.com Thu Aug 18 00:11:52 2016 From: YARD at il.ibm.com (Yaron Daniel) Date: Thu, 18 Aug 2016 02:11:52 +0300 Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive In-Reply-To: <1471468285737.63407@convergeone.com> References: <1471468285737.63407@convergeone.com> Message-ID: Hi Do u use CES protocols nodes ? Or Samba on each of the Server ? 
Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services - Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: "gpfsug-discuss at spectrumscale.org" Date: 08/18/2016 12:11 AM Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 1851 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 14134 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 2593 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 11635 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 11505 bytes Desc: not available URL: From SAnderson at convergeone.com Thu Aug 18 02:51:38 2016 From: SAnderson at convergeone.com (Shaun Anderson) Date: Thu, 18 Aug 2016 01:51:38 +0000 Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive In-Reply-To: References: <1471468285737.63407@convergeone.com>, Message-ID: <1471485097896.49269@convergeone.com> ?We are currently running samba on the 3.5 node, but wanting to migrate everything into using CES once we get everything up to 4.2. SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Yaron Daniel Sent: Wednesday, August 17, 2016 5:11 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Hi Do u use CES protocols nodes ? Or Samba on each of the Server ? 
Regards ________________________________ Yaron Daniel 94 Em Ha'Moshavot Rd [cid:_1_0DDE2A700DDE24DC007F6D32C2258012] Server, Storage and Data Services- Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: "gpfsug-discuss at spectrumscale.org" Date: 08/18/2016 12:11 AM Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O208.577.2112 M214.263.7014 [sig] [RH_CertifiedSysAdmin_CMYK] [Linux on IBM Power Systems - Sales 2016] [IBM Spectrum Storage - Sales 2016] NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT00001.gif Type: image/gif Size: 1851 bytes Desc: ATT00001.gif URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT00002.png Type: image/png Size: 14134 bytes Desc: ATT00002.png URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT00003.jpg Type: image/jpeg Size: 2593 bytes Desc: ATT00003.jpg URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT00004.png Type: image/png Size: 11635 bytes Desc: ATT00004.png URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT00005.png Type: image/png Size: 11505 bytes Desc: ATT00005.png URL: From YARD at il.ibm.com Thu Aug 18 04:56:50 2016 From: YARD at il.ibm.com (Yaron Daniel) Date: Thu, 18 Aug 2016 06:56:50 +0300 Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive In-Reply-To: <1471485097896.49269@convergeone.com> References: <1471468285737.63407@convergeone.com>, <1471485097896.49269@convergeone.com> Message-ID: So - the procedure you are asking related to Samba. 
Please check at redhat Site the process of upgrade Samba - u will need to backup the tdb files and restore them. But pay attention that the Samba ids will remain the same after moving to CES - please review the Authentication Section. Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services - Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: gpfsug main discussion list Date: 08/18/2016 04:52 AM Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?We are currently running samba on the 3.5 node, but wanting to migrate everything into using CES once we get everything up to 4.2. SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Yaron Daniel Sent: Wednesday, August 17, 2016 5:11 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Hi Do u use CES protocols nodes ? Or Samba on each of the Server ? Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services- Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: "gpfsug-discuss at spectrumscale.org" Date: 08/18/2016 12:11 AM Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O208.577.2112 M214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: not available Type: image/gif Size: 1851 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 1851 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 14134 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 2593 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 11635 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 11505 bytes Desc: not available URL: From Robert.Oesterlin at nuance.com Thu Aug 18 15:47:25 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Thu, 18 Aug 2016 14:47:25 +0000 Subject: [gpfsug-discuss] Monitor NSD server queue? Message-ID: <2702740E-EC6A-4998-BA1A-35A1EF5B5EDC@nuance.com> Done. Notification generated at: 18 Aug 2016, 10:46 AM Eastern Time (ET) ID: 93260 Headline: Give sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics Submitted on: 18 Aug 2016, 10:46 AM Eastern Time (ET) Brand: Servers and Systems Software Product: Spectrum Scale (formerly known as GPFS) - Public RFEs Link: http://www.ibm.com/developerworks/rfe/execute?use_case=viewRfe&CR_ID=93260 Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid 507-269-0413 From: on behalf of Yuri L Volobuev Reply-To: gpfsug main discussion list Date: Wednesday, August 17, 2016 at 3:34 PM To: gpfsug main discussion list Subject: [EXTERNAL] Re: [gpfsug-discuss] Monitor NSD server queue? Unfortunately, at the moment there's no safe mechanism to show the usage statistics for different NSD queues. "mmfsadm saferdump nsd" as implemented doesn't acquire locks when parsing internal data structures. Now, NSD data structures are fairly static, as much things go, so the risk of following a stale pointer and hitting a segfault isn't particularly significant. I don't think I remember ever seeing mmfsd crash with NSD dump code on the stack. That said, this isn't code that's tested and known to be safe for production use. I haven't seen a case myself where an mmfsd thread gets stuck running this dump command, either, but Bob has. If that condition ever reoccurs, I'd be interested in seeing debug data. I agree that there's value in giving a sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics. mmdiag should be enhanced to allow this. That'd be a very reasonable (and doable) RFE. yuri [nactive hide details for "Oesterlin, Robert" ---08/17/2016 04:45:30 AM---]"Oesterlin, Robert" ---08/17/2016 04:45:30 AM---Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latenc From: "Oesterlin, Robert" To: gpfsug main discussion list , Date: 08/17/2016 04:45 AM Subject: Re: [gpfsug-discuss] Monitor NSD server queue? Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latency on the disk subsystem causing a backup in the NSD queues. I was doing what you suggested not to do - "mmfsadm saferdump nsd' and looking at the queues. 
In my case 'mmfsadm saferdump" would usually work or hang, rather than kill mmfsd. But - the hang usually resulted it a tied up thread in mmfsd, so that's no good either. I wish I had better news - this is the only way I've found to get visibility to these queues. IBM hasn't seen fit to gives us a way to safely look at these. I personally think it's a bug that we can't safely dump these structures, as they give insight as to what's actually going on inside the NSD server. Yuri, Sven - thoughts? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: on behalf of "Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]" Reply-To: gpfsug main discussion list Date: Tuesday, August 16, 2016 at 8:46 PM To: gpfsug main discussion list Subject: [EXTERNAL] [gpfsug-discuss] Monitor NSD server queue? Hi Everyone, We ran into a rather interesting situation over the past week. We had a job that was pounding the ever loving crap out of one of our filesystems (called dnb02) doing about 15GB/s of reads. We had other jobs experience a slowdown on a different filesystem (called dnb41) that uses entirely separate backend storage. What I can't figure out is why this other filesystem was affected. I've checked IB bandwidth and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth congestion, looked at the mmpmon nsd_ds counters (including disk request wait time), and checked out the disk iowait values from collectl. I simply can't account for the slowdown on the other filesystem. The only thing I can think of is the high latency on dnb02's NSDs caused the mmfsd NSD queues to back up. Here's my question-- how can I monitor the state of th NSD queues? I can't find anything in mmdiag. An mmfsadm saferdump NSD shows me the queues and their status. I'm just not sure calling saferdump NSD every 10 seconds to monitor this data is going to end well. I've seen saferdump NSD cause mmfsd to die and that's from a task we only run every 6 hours that calls saferdump NSD. Any thoughts/ideas here would be great. Thanks! -Aaron_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.gif Type: image/gif Size: 106 bytes Desc: image001.gif URL: From bbanister at jumptrading.com Thu Aug 18 16:00:21 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Thu, 18 Aug 2016 15:00:21 +0000 Subject: [gpfsug-discuss] Monitor NSD server queue? In-Reply-To: <2702740E-EC6A-4998-BA1A-35A1EF5B5EDC@nuance.com> References: <2702740E-EC6A-4998-BA1A-35A1EF5B5EDC@nuance.com> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB062FC26E@CHI-EXCHANGEW1.w2k.jumptrading.com> Great stuff? I added my vote, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Oesterlin, Robert Sent: Thursday, August 18, 2016 9:47 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Monitor NSD server queue? Done. 
Notification generated at: 18 Aug 2016, 10:46 AM Eastern Time (ET) ID: 93260 Headline: Give sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics Submitted on: 18 Aug 2016, 10:46 AM Eastern Time (ET) Brand: Servers and Systems Software Product: Spectrum Scale (formerly known as GPFS) - Public RFEs Link: http://www.ibm.com/developerworks/rfe/execute?use_case=viewRfe&CR_ID=93260 Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid 507-269-0413 From: > on behalf of Yuri L Volobuev > Reply-To: gpfsug main discussion list > Date: Wednesday, August 17, 2016 at 3:34 PM To: gpfsug main discussion list > Subject: [EXTERNAL] Re: [gpfsug-discuss] Monitor NSD server queue? Unfortunately, at the moment there's no safe mechanism to show the usage statistics for different NSD queues. "mmfsadm saferdump nsd" as implemented doesn't acquire locks when parsing internal data structures. Now, NSD data structures are fairly static, as much things go, so the risk of following a stale pointer and hitting a segfault isn't particularly significant. I don't think I remember ever seeing mmfsd crash with NSD dump code on the stack. That said, this isn't code that's tested and known to be safe for production use. I haven't seen a case myself where an mmfsd thread gets stuck running this dump command, either, but Bob has. If that condition ever reoccurs, I'd be interested in seeing debug data. I agree that there's value in giving a sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics. mmdiag should be enhanced to allow this. That'd be a very reasonable (and doable) RFE. yuri [nactive hide details for "Oesterlin, Robert" ---08/17/2016 04:45:30 AM---]"Oesterlin, Robert" ---08/17/2016 04:45:30 AM---Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latenc From: "Oesterlin, Robert" > To: gpfsug main discussion list >, Date: 08/17/2016 04:45 AM Subject: Re: [gpfsug-discuss] Monitor NSD server queue? Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latency on the disk subsystem causing a backup in the NSD queues. I was doing what you suggested not to do - "mmfsadm saferdump nsd' and looking at the queues. In my case 'mmfsadm saferdump" would usually work or hang, rather than kill mmfsd. But - the hang usually resulted it a tied up thread in mmfsd, so that's no good either. I wish I had better news - this is the only way I've found to get visibility to these queues. IBM hasn't seen fit to gives us a way to safely look at these. I personally think it's a bug that we can't safely dump these structures, as they give insight as to what's actually going on inside the NSD server. Yuri, Sven - thoughts? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]" > Reply-To: gpfsug main discussion list > Date: Tuesday, August 16, 2016 at 8:46 PM To: gpfsug main discussion list > Subject: [EXTERNAL] [gpfsug-discuss] Monitor NSD server queue? Hi Everyone, We ran into a rather interesting situation over the past week. We had a job that was pounding the ever loving crap out of one of our filesystems (called dnb02) doing about 15GB/s of reads. We had other jobs experience a slowdown on a different filesystem (called dnb41) that uses entirely separate backend storage. 
What I can't figure out is why this other filesystem was affected. I've checked IB bandwidth and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth congestion, looked at the mmpmon nsd_ds counters (including disk request wait time), and checked out the disk iowait values from collectl. I simply can't account for the slowdown on the other filesystem. The only thing I can think of is the high latency on dnb02's NSDs caused the mmfsd NSD queues to back up. Here's my question-- how can I monitor the state of th NSD queues? I can't find anything in mmdiag. An mmfsadm saferdump NSD shows me the queues and their status. I'm just not sure calling saferdump NSD every 10 seconds to monitor this data is going to end well. I've seen saferdump NSD cause mmfsd to die and that's from a task we only run every 6 hours that calls saferdump NSD. Any thoughts/ideas here would be great. Thanks! -Aaron_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.gif Type: image/gif Size: 106 bytes Desc: image001.gif URL: From mimarsh2 at vt.edu Thu Aug 18 16:15:38 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Thu, 18 Aug 2016 11:15:38 -0400 Subject: [gpfsug-discuss] NSD Server BIOS setting - snoop mode Message-ID: All, Is there any best practice or recommendation for the Snoop Mode memory setting for NSD Servers? Default is Early Snoop. On compute nodes, I am using Cluster On Die, which creates 2 NUMA nodes per processor. This setup has 2 x 16-core Broadwell processors in each NSD server. Brian -------------- next part -------------- An HTML attachment was scrubbed... URL: From gmcpheeters at anl.gov Thu Aug 18 16:14:11 2016 From: gmcpheeters at anl.gov (McPheeters, Gordon) Date: Thu, 18 Aug 2016 15:14:11 +0000 Subject: [gpfsug-discuss] Monitor NSD server queue? In-Reply-To: <21BC488F0AEA2245B2C3E83FC0B33DBB062FC26E@CHI-EXCHANGEW1.w2k.jumptrading.com> References: <2702740E-EC6A-4998-BA1A-35A1EF5B5EDC@nuance.com> <21BC488F0AEA2245B2C3E83FC0B33DBB062FC26E@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: <97F08A04-D7C4-4985-840F-DC026E8606F4@anl.gov> Got my vote - thanks Robert. Gordon McPheeters ALCF Storage (630) 252-6430 gmcpheeters at anl.gov On Aug 18, 2016, at 10:00 AM, Bryan Banister > wrote: Great stuff? 
I added my vote, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Oesterlin, Robert Sent: Thursday, August 18, 2016 9:47 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Monitor NSD server queue? Done. Notification generated at: 18 Aug 2016, 10:46 AM Eastern Time (ET) ID: 93260 Headline: Give sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics Submitted on: 18 Aug 2016, 10:46 AM Eastern Time (ET) Brand: Servers and Systems Software Product: Spectrum Scale (formerly known as GPFS) - Public RFEs Link: http://www.ibm.com/developerworks/rfe/execute?use_case=viewRfe&CR_ID=93260 Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid 507-269-0413 From: > on behalf of Yuri L Volobuev > Reply-To: gpfsug main discussion list > Date: Wednesday, August 17, 2016 at 3:34 PM To: gpfsug main discussion list > Subject: [EXTERNAL] Re: [gpfsug-discuss] Monitor NSD server queue? Unfortunately, at the moment there's no safe mechanism to show the usage statistics for different NSD queues. "mmfsadm saferdump nsd" as implemented doesn't acquire locks when parsing internal data structures. Now, NSD data structures are fairly static, as much things go, so the risk of following a stale pointer and hitting a segfault isn't particularly significant. I don't think I remember ever seeing mmfsd crash with NSD dump code on the stack. That said, this isn't code that's tested and known to be safe for production use. I haven't seen a case myself where an mmfsd thread gets stuck running this dump command, either, but Bob has. If that condition ever reoccurs, I'd be interested in seeing debug data. I agree that there's value in giving a sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics. mmdiag should be enhanced to allow this. That'd be a very reasonable (and doable) RFE. yuri "Oesterlin, Robert" ---08/17/2016 04:45:30 AM---Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latenc From: "Oesterlin, Robert" > To: gpfsug main discussion list >, Date: 08/17/2016 04:45 AM Subject: Re: [gpfsug-discuss] Monitor NSD server queue? Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latency on the disk subsystem causing a backup in the NSD queues. I was doing what you suggested not to do - "mmfsadm saferdump nsd' and looking at the queues. In my case 'mmfsadm saferdump" would usually work or hang, rather than kill mmfsd. But - the hang usually resulted it a tied up thread in mmfsd, so that's no good either. I wish I had better news - this is the only way I've found to get visibility to these queues. IBM hasn't seen fit to gives us a way to safely look at these. I personally think it's a bug that we can't safely dump these structures, as they give insight as to what's actually going on inside the NSD server. Yuri, Sven - thoughts? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]" > Reply-To: gpfsug main discussion list > Date: Tuesday, August 16, 2016 at 8:46 PM To: gpfsug main discussion list > Subject: [EXTERNAL] [gpfsug-discuss] Monitor NSD server queue? Hi Everyone, We ran into a rather interesting situation over the past week. 
We had a job that was pounding the ever loving crap out of one of our filesystems (called dnb02) doing about 15GB/s of reads. We had other jobs experience a slowdown on a different filesystem (called dnb41) that uses entirely separate backend storage. What I can't figure out is why this other filesystem was affected. I've checked IB bandwidth and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth congestion, looked at the mmpmon nsd_ds counters (including disk request wait time), and checked out the disk iowait values from collectl. I simply can't account for the slowdown on the other filesystem. The only thing I can think of is the high latency on dnb02's NSDs caused the mmfsd NSD queues to back up. Here's my question-- how can I monitor the state of the NSD queues? I can't find anything in mmdiag. An mmfsadm saferdump NSD shows me the queues and their status. I'm just not sure calling saferdump NSD every 10 seconds to monitor this data is going to end well. I've seen saferdump NSD cause mmfsd to die and that's from a task we only run every 6 hours that calls saferdump NSD. Any thoughts/ideas here would be great. Thanks! -Aaron_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
From christof.schmitt at us.ibm.com Thu Aug 18 18:50:12 2016
From: christof.schmitt at us.ibm.com (Christof Schmitt)
Date: Thu, 18 Aug 2016 10:50:12 -0700
Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive
In-Reply-To: <1471485097896.49269@convergeone.com>
References: <1471468285737.63407@convergeone.com>, <1471485097896.49269@convergeone.com>
Message-ID:
Samba as supported in Spectrum Scale uses the "autorid" module for creating internal id mappings (see man idmap_autorid for some details). Officially supported are also methods to retrieve id mappings from an external server: https://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_adfofile.htm The earlier email states that they have a " .tdb backend for id mapping on their current server. ". How exactly is that configured in Samba? Which Samba version is used here?
Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Shaun Anderson To: gpfsug main discussion list Date: 08/17/2016 06:52 PM Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?We are currently running samba on the 3.5 node, but wanting to migrate everything into using CES once we get everything up to 4.2. SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Yaron Daniel Sent: Wednesday, August 17, 2016 5:11 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Hi Do u use CES protocols nodes ? Or Samba on each of the Server ? Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services- Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: "gpfsug-discuss at spectrumscale.org" Date: 08/18/2016 12:11 AM Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O208.577.2112 M214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From SAnderson at convergeone.com Thu Aug 18 19:11:02 2016 From: SAnderson at convergeone.com (Shaun Anderson) Date: Thu, 18 Aug 2016 18:11:02 +0000 Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive In-Reply-To: References: <1471468285737.63407@convergeone.com>, <1471485097896.49269@convergeone.com> Message-ID: Correct. We are upgrading their existing configuration and want to switch to CES provided Samba. They are using Samba 3.6.24 currently on RHEL 6.6. 
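(For anyone following along: before migrating, it can help to confirm what the existing Samba server is actually using for id mapping. A minimal sketch, assuming a standard Samba install; the grep pattern is only illustrative:
smbd -V
testparm -s 2>/dev/null | grep -i 'idmap config'
testparm dumps the effective smb.conf, so the backend and range lines discussed in this thread should show up there.)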
Here is the head of the smb.conf file: =================================================== [global] workgroup = SL1 netbios name = SLTLTFSEE server string = LTFSEE Server realm = removed.ORG security = ads encrypt passwords = yes default = global browseable = no socket options = TCP_NODELAY SO_KEEPALIVE TCP_KEEPCNT=4 TCP_KEEPIDLE=240 TCP_KEEPINTVL=15 idmap config * : backend = tdb idmap config * : range = 1000000-9000000 template shell = /bash/bin writable = yes allow trusted domains = yes client ntlmv2 auth = yes auth methods = guest sam winbind passdb backend = tdbsam groupdb:backend = tdb interfaces = eth1 lo username map = /etc/samba/smbusers map to guest = bad uid guest account = nobody ===================================================== Does that make sense? Regards, SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Christof Schmitt Sent: Thursday, August 18, 2016 11:50 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive Samba as supported in Spectrum Scale uses the "autorid" module for creating internal id mappings (see man idmap_autorid for some details). Officially supported are also methods to retrieve id mappings from an external server: https://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_adfofile.htm The earlier email states that they have a " .tdb backend for id mapping on their current server. ". How exactly is that configured in Samba? Which Samba version is used here? So the plan is to upgrade the cluster, and then switch to the Samba version provided with CES? Should the same id mappings be used? Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Shaun Anderson To: gpfsug main discussion list Date: 08/17/2016 06:52 PM Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?We are currently running samba on the 3.5 node, but wanting to migrate everything into using CES once we get everything up to 4.2. SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Yaron Daniel Sent: Wednesday, August 17, 2016 5:11 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Hi Do u use CES protocols nodes ? Or Samba on each of the Server ? Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services- Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: "gpfsug-discuss at spectrumscale.org" Date: 08/18/2016 12:11 AM Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. 
I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O208.577.2112 M214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. From Kevin.Buterbaugh at Vanderbilt.Edu Thu Aug 18 20:05:03 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Thu, 18 Aug 2016 19:05:03 +0000 Subject: [gpfsug-discuss] Please ignore - debugging an issue Message-ID: Please ignore. I am working with the list admins on an issue and need to send an e-mail to the list to duplicate the problem. I apologize that this necessitates this e-mail to the list. Thanks? ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From christof.schmitt at us.ibm.com Thu Aug 18 20:43:50 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Thu, 18 Aug 2016 12:43:50 -0700 Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive In-Reply-To: References: <1471468285737.63407@convergeone.com>, <1471485097896.49269@convergeone.com> Message-ID: There are a few points to consider here: CES uses Samba in cluster mode with ctdb. That means that the tdb database is shared through ctdb on all protocol nodes, and the internal format is slightly different since it contains additional information for tracking the cross-node status of the individual records. Spectrum Scale officially supports the autorid module for internal id mapping. That approach is different than the older idmap_tdb since it basically only has one record per AD domain, and not one record per user or group. This is known to scale better in environments where many users and groups require id mappings. The downside is that data from idmap_tdb cannot be directly imported. 
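For illustration only, an autorid-based setup ends up with Samba settings along these lines; the values below are placeholders rather than anything mmuserauth would necessarily pick, but they show the shape of it: one deterministic range per domain instead of one tdb record per user or group:
idmap config * : backend = autorid
idmap config * : rangesize = 1000000
idmap config * : range = 10000000-299999999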
While not officially supported Spectrum Scale also ships the idmap_tdb module. You could configure authentication and internal id mapping on Spectrum Scale, and then overwrite the config manually to use the old idmap module (the idmap-range-size is required, but not relevant later on): mmuserauth service create ... --idmap-range 1000000-9000000 --idmap-range-size 100000 /usr/lpp/mmfs/bin/net conf setparm global 'idmap config * : backend' tdb mmdsh -N CesNodes systemctl restart gpfs-winbind mmdsh -N CesNodes /usr/lpp/mmfs/bin/net cache flush With the old Samba, export the idmap data to a file: net idmap dump > idmap-dump.txt And on a node running CES Samba import that data, and remove any old cached entries: /usr/lpp/mmfs/bin/net idmap restore idmap-dump.txt mmdsh -N CesNodes /usr/lpp/mmfs/bin/net cache flush Just to be clear: This is untested and if there is a problem with the id mapping in that configuration, it will likely be pointed to the unsupported configuration. The way to request this as an official feature would be through a RFE, although i cannot say whether that would be picked up by product management. Another option would be creating the id mappings in the Active Directory records or in a external LDAP server based on the old mappings, and point the CES Samba to that data. That would again be a supported configuration. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Shaun Anderson To: Christof Schmitt/Tucson/IBM at IBMUS Cc: gpfsug main discussion list Date: 08/18/2016 11:11 AM Subject: RE: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive Correct. We are upgrading their existing configuration and want to switch to CES provided Samba. They are using Samba 3.6.24 currently on RHEL 6.6. Here is the head of the smb.conf file: =================================================== [global] workgroup = SL1 netbios name = SLTLTFSEE server string = LTFSEE Server realm = removed.ORG security = ads encrypt passwords = yes default = global browseable = no socket options = TCP_NODELAY SO_KEEPALIVE TCP_KEEPCNT=4 TCP_KEEPIDLE=240 TCP_KEEPINTVL=15 idmap config * : backend = tdb idmap config * : range = 1000000-9000000 template shell = /bash/bin writable = yes allow trusted domains = yes client ntlmv2 auth = yes auth methods = guest sam winbind passdb backend = tdbsam groupdb:backend = tdb interfaces = eth1 lo username map = /etc/samba/smbusers map to guest = bad uid guest account = nobody ===================================================== Does that make sense? Regards, SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [ mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Christof Schmitt Sent: Thursday, August 18, 2016 11:50 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive Samba as supported in Spectrum Scale uses the "autorid" module for creating internal id mappings (see man idmap_autorid for some details). Officially supported are also methods to retrieve id mappings from an external server: https://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_adfofile.htm The earlier email states that they have a " .tdb backend for id mapping on their current server. ". How exactly is that configured in Samba? Which Samba version is used here? 
So the plan is to upgrade the cluster, and then switch to the Samba version provided with CES? Should the same id mappings be used? Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Shaun Anderson To: gpfsug main discussion list Date: 08/17/2016 06:52 PM Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?We are currently running samba on the 3.5 node, but wanting to migrate everything into using CES once we get everything up to 4.2. SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Yaron Daniel Sent: Wednesday, August 17, 2016 5:11 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Hi Do u use CES protocols nodes ? Or Samba on each of the Server ? Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services- Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: "gpfsug-discuss at spectrumscale.org" Date: 08/18/2016 12:11 AM Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O208.577.2112 M214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. 
If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. From jez.tucker at gpfsug.org Thu Aug 18 20:57:00 2016 From: jez.tucker at gpfsug.org (Jez Tucker) Date: Thu, 18 Aug 2016 20:57:00 +0100 Subject: [gpfsug-discuss] If you are experiencing mail stuck in spam / bounces Message-ID: Hi all As the discussion group is a mailing list, it is possible that members can experience the list traffic being interpreted as spam. In such instances, you may experience better results if you whitelist the mailing list addresses or create a 'Not Spam' filter (E.G. gmail) gpfsug-discuss at spectrumscale.org gpfsug-discuss at gpfsug.org You can test that you can receive a response from the mailing list server by sending an email to: gpfsug-discuss-request at spectrumscale.org with the subject of: help Should you experience further trouble, please ping us at: gpfsug-discuss-owner at spectrumscale.org All the best, Jez From aaron.s.knister at nasa.gov Fri Aug 19 05:12:26 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Fri, 19 Aug 2016 00:12:26 -0400 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? In-Reply-To: <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu> References: <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu> Message-ID: <140fab1a-e043-5c20-eb1f-d5ef7e91d89d@nasa.gov> Figured I'd throw in my "me too!" as well. We have ~3500 nodes and 60 gpfs server nodes and we've done several rounds of rolling upgrades starting with 3.5.0.19 -> 3.5.0.24. We've had the cluster with a mix of both versions for quite some time (We're actually in that state right now as it would happen and have been for several months). I've not seen any issue with it. Of course, as Richard alluded to, its good to check the release notes :) -Aaron On 8/15/16 8:45 AM, Buterbaugh, Kevin L wrote: > Richard, > > I will second what Bob said with one caveat ? on one occasion we had an > issue with our multi-cluster setup because the PTF?s were incompatible. > However, that was clearly documented in the release notes, which we > obviously hadn?t read carefully enough. > > While we generally do rolling upgrades over a two to three week period, > we have run for months with clients at differing PTF levels. HTHAL? > > Kevin > >> On Aug 15, 2016, at 6:22 AM, Oesterlin, Robert >> > wrote: >> >> In general, yes, it's common practice to do the 'rolling upgrades'. If >> I had to do my whole cluster at once, with an outage, I'd probably >> never upgrade. :) >> >> >> Bob Oesterlin >> Sr Storage Engineer, Nuance HPC Grid >> >> >> *From: *> > on behalf of >> "Sobey, Richard A" > > >> *Reply-To: *gpfsug main discussion list >> > > >> *Date: *Monday, August 15, 2016 at 4:59 AM >> *To: *"'gpfsug-discuss at spectrumscale.org >> '" >> > > >> *Subject: *[EXTERNAL] [gpfsug-discuss] Minor GPFS versions coexistence >> problems? >> >> Hi all, >> >> If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to >> 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger >> it over a few days, perhaps up to 2 weeks or will I run into problems >> if they?re on different versions? >> >> Cheers >> >> Richard >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ? 
> Kevin Buterbaugh - Senior System Administrator > Vanderbilt University - Advanced Computing Center for Research and Education > Kevin.Buterbaugh at vanderbilt.edu > - (615)875-9633 > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From aaron.s.knister at nasa.gov Fri Aug 19 05:13:06 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Fri, 19 Aug 2016 00:13:06 -0400 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? In-Reply-To: <140fab1a-e043-5c20-eb1f-d5ef7e91d89d@nasa.gov> References: <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu> <140fab1a-e043-5c20-eb1f-d5ef7e91d89d@nasa.gov> Message-ID: <70e33e6d-cd6b-5a5e-1e2d-f0ad16def5f4@nasa.gov> Oops... I meant Kevin, not Richard. On 8/19/16 12:12 AM, Aaron Knister wrote: > Figured I'd throw in my "me too!" as well. We have ~3500 nodes and 60 > gpfs server nodes and we've done several rounds of rolling upgrades > starting with 3.5.0.19 -> 3.5.0.24. We've had the cluster with a mix of > both versions for quite some time (We're actually in that state right > now as it would happen and have been for several months). I've not seen > any issue with it. Of course, as Richard alluded to, its good to check > the release notes :) > > -Aaron > > On 8/15/16 8:45 AM, Buterbaugh, Kevin L wrote: >> Richard, >> >> I will second what Bob said with one caveat ? on one occasion we had an >> issue with our multi-cluster setup because the PTF?s were incompatible. >> However, that was clearly documented in the release notes, which we >> obviously hadn?t read carefully enough. >> >> While we generally do rolling upgrades over a two to three week period, >> we have run for months with clients at differing PTF levels. HTHAL? >> >> Kevin >> >>> On Aug 15, 2016, at 6:22 AM, Oesterlin, Robert >>> > >>> wrote: >>> >>> In general, yes, it's common practice to do the 'rolling upgrades'. If >>> I had to do my whole cluster at once, with an outage, I'd probably >>> never upgrade. :) >>> >>> >>> Bob Oesterlin >>> Sr Storage Engineer, Nuance HPC Grid >>> >>> >>> *From: *>> > on behalf of >>> "Sobey, Richard A" >> > >>> *Reply-To: *gpfsug main discussion list >>> >> > >>> *Date: *Monday, August 15, 2016 at 4:59 AM >>> *To: *"'gpfsug-discuss at spectrumscale.org >>> '" >>> >> > >>> *Subject: *[EXTERNAL] [gpfsug-discuss] Minor GPFS versions coexistence >>> problems? >>> >>> Hi all, >>> >>> If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to >>> 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger >>> it over a few days, perhaps up to 2 weeks or will I run into problems >>> if they?re on different versions? >>> >>> Cheers >>> >>> Richard >>> _______________________________________________ >>> gpfsug-discuss mailing list >>> gpfsug-discuss at spectrumscale.org >>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> ? 
>> Kevin Buterbaugh - Senior System Administrator >> Vanderbilt University - Advanced Computing Center for Research and >> Education >> Kevin.Buterbaugh at vanderbilt.edu >> - (615)875-9633 >> >> >> >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From bdeluca at gmail.com Fri Aug 19 05:15:00 2016 From: bdeluca at gmail.com (Ben De Luca) Date: Fri, 19 Aug 2016 07:15:00 +0300 Subject: [gpfsug-discuss] If you are experiencing mail stuck in spam / bounces In-Reply-To: References: Message-ID: Hey Jez, Its because the mailing list doesn't have an SPF record in your DNS, being neutral is a good way to be picked up as spam. On 18 August 2016 at 22:57, Jez Tucker wrote: > Hi all > > As the discussion group is a mailing list, it is possible that members can > experience the list traffic being interpreted as spam. > > > In such instances, you may experience better results if you whitelist the > mailing list addresses or create a 'Not Spam' filter (E.G. gmail) > > gpfsug-discuss at spectrumscale.org > > gpfsug-discuss at gpfsug.org > > > You can test that you can receive a response from the mailing list server by > sending an email to: gpfsug-discuss-request at spectrumscale.org with the > subject of: help > > > Should you experience further trouble, please ping us at: > gpfsug-discuss-owner at spectrumscale.org > > > All the best, > > > Jez > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss From jez.tucker at gpfsug.org Fri Aug 19 08:51:20 2016 From: jez.tucker at gpfsug.org (Jez Tucker) Date: Fri, 19 Aug 2016 08:51:20 +0100 Subject: [gpfsug-discuss] If you are experiencing mail stuck in spam / bounces In-Reply-To: References: Message-ID: <0c9d81b2-ac41-b6a5-e4f1-a816558711b7@gpfsug.org> Hi Yes, we looked at that some time ago and I recall we had an issues with setting up the SPF. However, probably a good time as any to look at it again. I'll ping Arif and Simon and they can look at their respective domains. Jez On 19/08/16 05:15, Ben De Luca wrote: > Hey Jez, > Its because the mailing list doesn't have an SPF record in your > DNS, being neutral is a good way to be picked up as spam. > > > > On 18 August 2016 at 22:57, Jez Tucker wrote: >> Hi all >> >> As the discussion group is a mailing list, it is possible that members can >> experience the list traffic being interpreted as spam. >> >> >> In such instances, you may experience better results if you whitelist the >> mailing list addresses or create a 'Not Spam' filter (E.G. 
gmail) >> >> gpfsug-discuss at spectrumscale.org >> >> gpfsug-discuss at gpfsug.org >> >> >> You can test that you can receive a response from the mailing list server by >> sending an email to: gpfsug-discuss-request at spectrumscale.org with the >> subject of: help >> >> >> Should you experience further trouble, please ping us at: >> gpfsug-discuss-owner at spectrumscale.org >> >> >> All the best, >> >> >> Jez >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss From aaron.s.knister at nasa.gov Fri Aug 19 23:06:57 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Fri, 19 Aug 2016 18:06:57 -0400 Subject: [gpfsug-discuss] Monitor NSD server queue? In-Reply-To: <97F08A04-D7C4-4985-840F-DC026E8606F4@anl.gov> References: <2702740E-EC6A-4998-BA1A-35A1EF5B5EDC@nuance.com> <21BC488F0AEA2245B2C3E83FC0B33DBB062FC26E@CHI-EXCHANGEW1.w2k.jumptrading.com> <97F08A04-D7C4-4985-840F-DC026E8606F4@anl.gov> Message-ID: <5ca238de-bb95-2854-68bd-36d1b8df2810@nasa.gov> Thanks everyone! I also have a PMR open for this, so hopefully the RFE gets some traction. On 8/18/16 11:14 AM, McPheeters, Gordon wrote: > Got my vote - thanks Robert. > > > Gordon McPheeters > ALCF Storage > (630) 252-6430 > gmcpheeters at anl.gov > > > >> On Aug 18, 2016, at 10:00 AM, Bryan Banister >> > wrote: >> >> Great stuff? I added my vote, >> -Bryan >> >> *From:* gpfsug-discuss-bounces at spectrumscale.org >> [mailto:gpfsug-discuss-bounces at spectrumscale.org] *On >> Behalf Of *Oesterlin, Robert >> *Sent:* Thursday, August 18, 2016 9:47 AM >> *To:* gpfsug main discussion list >> *Subject:* Re: [gpfsug-discuss] Monitor NSD server queue? >> >> Done. >> >> Notification generated at: 18 Aug 2016, 10:46 AM Eastern Time (ET) >> >> ID: 93260 >> Headline: Give sysadmin insight >> into the inner workings of the NSD server machinery, in particular the >> queue dynamics >> Submitted on: 18 Aug 2016, 10:46 AM Eastern >> Time (ET) >> Brand: Servers and Systems >> Software >> Product: Spectrum Scale (formerly >> known as GPFS) - Public RFEs >> >> Link: >> http://www.ibm.com/developerworks/rfe/execute?use_case=viewRfe&CR_ID=93260 >> >> >> Bob Oesterlin >> Sr Storage Engineer, Nuance HPC Grid >> 507-269-0413 >> >> >> *From: *> > on behalf of Yuri L >> Volobuev > >> *Reply-To: *gpfsug main discussion list >> > > >> *Date: *Wednesday, August 17, 2016 at 3:34 PM >> *To: *gpfsug main discussion list > > >> *Subject: *[EXTERNAL] Re: [gpfsug-discuss] Monitor NSD server queue? >> >> >> Unfortunately, at the moment there's no safe mechanism to show the >> usage statistics for different NSD queues. "mmfsadm saferdump nsd" as >> implemented doesn't acquire locks when parsing internal data >> structures. Now, NSD data structures are fairly static, as much things >> go, so the risk of following a stale pointer and hitting a segfault >> isn't particularly significant. I don't think I remember ever seeing >> mmfsd crash with NSD dump code on the stack. That said, this isn't >> code that's tested and known to be safe for production use. I haven't >> seen a case myself where an mmfsd thread gets stuck running this dump >> command, either, but Bob has. If that condition ever reoccurs, I'd be >> interested in seeing debug data. 
>> >> I agree that there's value in giving a sysadmin insight into the inner >> workings of the NSD server machinery, in particular the queue >> dynamics. mmdiag should be enhanced to allow this. That'd be a very >> reasonable (and doable) RFE. >> >> yuri >> >> "Oesterlin, Robert" ---08/17/2016 04:45:30 AM---Hi Aaron >> You did a perfect job of explaining a situation I've run into time >> after time - high latenc >> >> From: "Oesterlin, Robert" > > >> To: gpfsug main discussion list > >, >> Date: 08/17/2016 04:45 AM >> Subject: Re: [gpfsug-discuss] Monitor NSD server queue? >> Sent by: gpfsug-discuss-bounces at spectrumscale.org >> >> >> ------------------------------------------------------------------------ >> >> >> >> >> Hi Aaron >> >> You did a perfect job of explaining a situation I've run into time >> after time - high latency on the disk subsystem causing a backup in >> the NSD queues. I was doing what you suggested not to do - "mmfsadm >> saferdump nsd' and looking at the queues. In my case 'mmfsadm >> saferdump" would usually work or hang, rather than kill mmfsd. But - >> the hang usually resulted it a tied up thread in mmfsd, so that's no >> good either. >> >> I wish I had better news - this is the only way I've found to get >> visibility to these queues. IBM hasn't seen fit to gives us a way to >> safely look at these. I personally think it's a bug that we can't >> safely dump these structures, as they give insight as to what's >> actually going on inside the NSD server. >> >> Yuri, Sven - thoughts? >> >> >> Bob Oesterlin >> Sr Storage Engineer, Nuance HPC Grid >> >> >> >> *From: *> > on behalf of >> "Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]" >> >* >> Reply-To: *gpfsug main discussion list >> > >* >> Date: *Tuesday, August 16, 2016 at 8:46 PM* >> To: *gpfsug main discussion list > >* >> Subject: *[EXTERNAL] [gpfsug-discuss] Monitor NSD server queue? >> >> Hi Everyone, >> >> We ran into a rather interesting situation over the past week. We had >> a job that was pounding the ever loving crap out of one of our >> filesystems (called dnb02) doing about 15GB/s of reads. We had other >> jobs experience a slowdown on a different filesystem (called dnb41) >> that uses entirely separate backend storage. What I can't figure out >> is why this other filesystem was affected. I've checked IB bandwidth >> and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth >> congestion, looked at the mmpmon nsd_ds counters (including disk >> request wait time), and checked out the disk iowait values from >> collectl. I simply can't account for the slowdown on the other >> filesystem. The only thing I can think of is the high latency on >> dnb02's NSDs caused the mmfsd NSD queues to back up. >> >> Here's my question-- how can I monitor the state of th NSD queues? I >> can't find anything in mmdiag. An mmfsadm saferdump NSD shows me the >> queues and their status. I'm just not sure calling saferdump NSD every >> 10 seconds to monitor this data is going to end well. I've seen >> saferdump NSD cause mmfsd to die and that's from a task we only run >> every 6 hours that calls saferdump NSD. >> >> Any thoughts/ideas here would be great. >> >> Thanks! 
>> >> -Aaron_______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> >> ------------------------------------------------------------------------ >> >> Note: This email is for the confidential use of the named addressee(s) >> only and may contain proprietary, confidential or privileged >> information. If you are not the intended recipient, you are hereby >> notified that any review, dissemination or copying of this email is >> strictly prohibited, and to please notify the sender immediately and >> destroy this email and any attachments. Email transmission cannot be >> guaranteed to be secure or error-free. The Company, therefore, does >> not make any guarantees as to the completeness or accuracy of this >> email or any attachments. This email is for informational purposes >> only and does not constitute a recommendation, offer, request or >> solicitation of any kind to buy, sell, subscribe, redeem or perform >> any type of transaction of a financial product. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From r.sobey at imperial.ac.uk Mon Aug 22 12:59:16 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Mon, 22 Aug 2016 11:59:16 +0000 Subject: [gpfsug-discuss] CES and mmuserauth command Message-ID: Hi all, We're just about to start testing a new CES 4.2.0 cluster and at the stage of "joining" the cluster to our AD. What's the bare minimum we need to get going with this? My Windows guy (who is more Linux but whatever) has suggested the following: mmuserauth service create --type ad --data-access-method file --netbios-name store --user-name USERNAME --password --enable-nfs-kerberos --enable-kerberos --servers list,of,servers --idmap-range-size 1000000 --idmap-range 3000000 - 3500000 --unixmap-domains 'DOMAIN(500 - 2000000)' He has also asked what the following is: --idmap-role ??? --idmap-range-size ?? All our LDAP GID/UIDs are coming from a system outside of GPFS so do we leave this blank, or say master Or, now I've re-read and mmuserauth page, is this purely for when you have AFM relationships and one GPFS cluster (the subordinate / the second cluster) gets its UIDs and GIDs from another GPFS cluster (the master / the first one)? For idmap-range-size is this essentially the highest number of users and groups you can have defined within Spectrum Scale? (I love how I'm using GPFS and SS interchangeably.. forgive me!) Many thanks Richard Richard Sobey Storage Area Network (SAN) Analyst Technical Operations, ICT Imperial College London South Kensington 403, City & Guilds Building London SW7 2AZ Tel: +44 (0)20 7594 6915 Email: r.sobey at imperial.ac.uk http://www.imperial.ac.uk/admin-services/ict/ -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From r.sobey at imperial.ac.uk Mon Aug 22 14:28:01 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Mon, 22 Aug 2016 13:28:01 +0000 Subject: [gpfsug-discuss] CES mmsmb options Message-ID: Related to my previous question in so far as it's to do with CES, what's this all about: [root at ces]# mmsmb config change --key-info supported Supported smb options with allowed values: gpfs:dfreequota = yes, no restrict anonymous = 0, 2 server string = any mmsmb config list shows many more options. Are they static... for example log size / location / dmapi support? I'm surely missing something obvious. It's SS 4.2.0 btw. Thanks Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From taylorm at us.ibm.com Tue Aug 23 00:30:10 2016 From: taylorm at us.ibm.com (Michael L Taylor) Date: Mon, 22 Aug 2016 16:30:10 -0700 Subject: [gpfsug-discuss] CES mmsmb options In-Reply-To: References: Message-ID: Looks like there is a per export and a global listing. These are values that can be set per export : /usr/lpp/mmfs/bin/mmsmb export change --key-info supported Supported smb options with allowed values: admin users = any // any valid user browseable = yes, no comment = any // A free text description of the export. csc policy = manual, disable, documents, programs fileid:algorithm = fsname, hostname, fsname_nodirs, fsname_norootdir gpfs:leases = yes, no gpfs:recalls = yes, no gpfs:sharemodes = yes, no gpfs:syncio = yes, no hide unreadable = yes, no oplocks = yes, no posix locking = yes, no read only = yes, no smb encrypt = auto, default, mandatory, disabled syncops:onclose = yes, no These are the values that are set globally: /usr/lpp/mmfs/bin/mmsmb config change --key-info supported Supported smb options with allowed values: gpfs:dfreequota = yes, no restrict anonymous = 0, 2 server string = any -------------- next part -------------- An HTML attachment was scrubbed... URL: From mimarsh2 at vt.edu Tue Aug 23 03:23:40 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Mon, 22 Aug 2016 22:23:40 -0400 Subject: [gpfsug-discuss] GPFS FPO Message-ID: Does anyone have any experiences to share (good or bad) about setting up and utilizing FPO for hadoop compute on top of GPFS? -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Tue Aug 23 03:37:00 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 22 Aug 2016 22:37:00 -0400 Subject: [gpfsug-discuss] GPFS FPO In-Reply-To: References: Message-ID: Yes, indeed. Note that these are my personal opinions. It seems to work quite well and it's not terribly hard to set up or get running. That said, if you've got a traditional HPC cluster with reasonably good bandwidth (and especially if your data is already on the HPC cluster) I wouldn't bother with FPO and just use something like magpie (https://github.com/LLNL/magpie) to run your hadoopy workload on GPFS on your traditional HPC cluster. I believe FPO (and by extension data locality) is important when the available bandwidth between your clients and servers/disks (in a traditional GPFS environment) is less than the bandwidth available within a node (e.g. between your local disks and the host CPU). -Aaron On 8/22/16 10:23 PM, Brian Marshall wrote: > Does anyone have any experiences to share (good or bad) about setting up > and utilizing FPO for hadoop compute on top of GPFS? 
> > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From mimarsh2 at vt.edu Tue Aug 23 12:56:22 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Tue, 23 Aug 2016 07:56:22 -0400 Subject: [gpfsug-discuss] GPFS FPO In-Reply-To: References: Message-ID: Aaron, Do you have experience running this on native GPFS? The docs say Lustre and any NFS filesystem. Thanks, Brian On Aug 22, 2016 10:37 PM, "Aaron Knister" wrote: > Yes, indeed. Note that these are my personal opinions. > > It seems to work quite well and it's not terribly hard to set up or get > running. That said, if you've got a traditional HPC cluster with reasonably > good bandwidth (and especially if your data is already on the HPC cluster) > I wouldn't bother with FPO and just use something like magpie ( > https://github.com/LLNL/magpie) to run your hadoopy workload on GPFS on > your traditional HPC cluster. I believe FPO (and by extension data > locality) is important when the available bandwidth between your clients > and servers/disks (in a traditional GPFS environment) is less than the > bandwidth available within a node (e.g. between your local disks and the > host CPU). > > -Aaron > > On 8/22/16 10:23 PM, Brian Marshall wrote: > >> Does anyone have any experiences to share (good or bad) about setting up >> and utilizing FPO for hadoop compute on top of GPFS? >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From janfrode at tanso.net Tue Aug 23 13:15:24 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Tue, 23 Aug 2016 14:15:24 +0200 Subject: [gpfsug-discuss] CES and mmuserauth command In-Reply-To: References: Message-ID: Sorry to see no authoritative answers yet.. I'm doing lots of CES installations, but have not quite yet gotten the full understanding of this.. Simple stuff first: --servers You can only have one with AD. --enable-kerberos shouldn't be used, as that's only for LDAP according to the documentation. Guess kerberos is implied with AD. --idmap-role -- I've been using "master". Man-page says ID map role of a stand?alone or singular system deployment must be selected "master" What the idmap options seems to be doing is configure the idmap options for Samba. Maybe best explained by: https://wiki.samba.org/index.php/Idmap_config_ad Your suggested options will then give you the samba idmap configuration: idmap config * : rangesize = 1000000 idmap config * : range = 3000000-3500000 idmap config * : read only = no idmap:cache = no idmap config * : backend = autorid idmap config DOMAIN : schema_mode = rfc2307 idmap config DOMAIN : range = 500-2000000 idmap config DOMAIN : backend = ad Most likely you want to replace DOMAIN by your AD domain name.. 
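If it helps, one way to see what those options actually turned into after running mmuserauth is to compare the Scale view with the underlying Samba settings. A sketch, with DOMAIN\someuser as a placeholder account:
mmuserauth service list
/usr/lpp/mmfs/bin/net conf list | grep -i 'idmap config'
id 'DOMAIN\someuser'
The last command is just a sanity check that winbind resolves a known AD user into the expected uid range.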
So the --idmap options sets some defaults, that you probably won't care about, since all your users are likely covered by the specific "idmap config DOMAIN" config. Hope this helps somewhat, now I'll follow up with something I'm wondering myself...: Is the netbios name just a name, without any connection to anything in AD? Is the --user-name/--password a one-time used account that's only necessary when executing the mmuserauth command, or will it also be for communication between CES and AD while the services are running? -jf On Mon, Aug 22, 2016 at 1:59 PM, Sobey, Richard A wrote: > Hi all, > > > > We?re just about to start testing a new CES 4.2.0 cluster and at the stage > of ?joining? the cluster to our AD. What?s the bare minimum we need to get > going with this? My Windows guy (who is more Linux but whatever) has > suggested the following: > > > > mmuserauth service create --type ad --data-access-method file > > --netbios-name store --user-name USERNAME --password > > --enable-nfs-kerberos --enable-kerberos > > --servers list,of,servers > > --idmap-range-size 1000000 --idmap-range 3000000 - 3500000 > --unixmap-domains 'DOMAIN(500 - 2000000)' > > > > He has also asked what the following is: > > > > --idmap-role ??? > > --idmap-range-size ?? > > > > All our LDAP GID/UIDs are coming from a system outside of GPFS so do we > leave this blank, or say master Or, now I?ve re-read and mmuserauth page, > is this purely for when you have AFM relationships and one GPFS cluster > (the subordinate / the second cluster) gets its UIDs and GIDs from another > GPFS cluster (the master / the first one)? > > > > For idmap-range-size is this essentially the highest number of users and > groups you can have defined within Spectrum Scale? (I love how I?m using > GPFS and SS interchangeably.. forgive me!) > > > > Many thanks > > > > Richard > > > > > > Richard Sobey > > Storage Area Network (SAN) Analyst > Technical Operations, ICT > Imperial College London > South Kensington > 403, City & Guilds Building > London SW7 2AZ > Tel: +44 (0)20 7594 6915 > Email: r.sobey at imperial.ac.uk > http://www.imperial.ac.uk/admin-services/ict/ > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Tue Aug 23 14:58:17 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Tue, 23 Aug 2016 13:58:17 +0000 Subject: [gpfsug-discuss] Odd entries in quota listing Message-ID: In one of my file systems, I have some odd entries that seem to not be associated with a user - any ideas on the cause or how to track these down? This is a snippet from mmprepquota: Block Limits | File Limits Name type KB quota limit in_doubt grace | files quota limit in_doubt grace 2751555824 USR 0 1073741824 5368709120 0 none | 0 0 0 0 none 2348898617 USR 0 1073741824 5368709120 0 none | 0 0 0 0 none 2348895209 USR 0 1073741824 5368709120 0 none | 0 0 0 0 none 1610682073 USR 0 1073741824 5368709120 0 none | 0 0 0 0 none 536964752 USR 0 1073741824 5368709120 0 none | 0 0 0 0 none 403325529 USR 0 1073741824 5368709120 0 none | 0 0 0 0 none Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From jonathan at buzzard.me.uk Tue Aug 23 15:06:50 2016 From: jonathan at buzzard.me.uk (Jonathan Buzzard) Date: Tue, 23 Aug 2016 15:06:50 +0100 Subject: [gpfsug-discuss] Odd entries in quota listing In-Reply-To: References: Message-ID: <1471961210.30100.88.camel@buzzard.phy.strath.ac.uk> On Tue, 2016-08-23 at 13:58 +0000, Oesterlin, Robert wrote: > In one of my file systems, I have some odd entries that seem to not be > associated with a user - any ideas on the cause or how to track these > down? This is a snippet from mmprepquota: > > > > Block Limits > | File Limits > > Name type KB quota limit in_doubt > grace | files quota limit in_doubt grace > > 2751555824 USR 0 1073741824 5368709120 0 > none | 0 0 0 0 none > > 2348898617 USR 0 1073741824 5368709120 0 > none | 0 0 0 0 none > > 2348895209 USR 0 1073741824 5368709120 0 > none | 0 0 0 0 none > > 1610682073 USR 0 1073741824 5368709120 0 > none | 0 0 0 0 none > > 536964752 USR 0 1073741824 5368709120 0 > none | 0 0 0 0 none > > 403325529 USR 0 1073741824 5368709120 0 > none | 0 0 0 0 none > I am guessing they are quotas that have been set for users that are now deleted. GPFS stores the quota for a user under their UID, and deleting the user and all their data is not enough to remove the entry from the quota reporting, you also have to delete their quota. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. From Robert.Oesterlin at nuance.com Tue Aug 23 15:10:22 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Tue, 23 Aug 2016 14:10:22 +0000 Subject: [gpfsug-discuss] Odd entries in quota listing Message-ID: <93B0F53A-4ECD-4527-A67D-DD6C9B00F8E7@nuance.com> Well - good idea, but these large numbers in no way reflect valid ID numbers in our environment. Wondering how they got there? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: on behalf of Jonathan Buzzard Reply-To: gpfsug main discussion list Date: Tuesday, August 23, 2016 at 9:06 AM To: "gpfsug-discuss at spectrumscale.org" Subject: [EXTERNAL] Re: [gpfsug-discuss] Odd entries in quota listing I am guessing they are quotas that have been set for users that are now deleted. GPFS stores the quota for a user under their UID, and deleting the user and all their data is not enough to remove the entry from the quota reporting, you also have to delete their quota. -------------- next part -------------- An HTML attachment was scrubbed... URL: From jonathan at buzzard.me.uk Tue Aug 23 15:16:05 2016 From: jonathan at buzzard.me.uk (Jonathan Buzzard) Date: Tue, 23 Aug 2016 15:16:05 +0100 Subject: [gpfsug-discuss] Odd entries in quota listing In-Reply-To: <93B0F53A-4ECD-4527-A67D-DD6C9B00F8E7@nuance.com> References: <93B0F53A-4ECD-4527-A67D-DD6C9B00F8E7@nuance.com> Message-ID: <1471961765.30100.90.camel@buzzard.phy.strath.ac.uk> On Tue, 2016-08-23 at 14:10 +0000, Oesterlin, Robert wrote: > Well - good idea, but these large numbers in no way reflect valid ID > numbers in our environment. Wondering how they got there? > I was guessing generating UID's from Windows RID's? Alternatively some script generated them automatically and the UID's are bogus. You can create a quota for any random UID and GPFS won't complain. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. 
From aaron.s.knister at nasa.gov Wed Aug 24 17:43:56 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Wed, 24 Aug 2016 12:43:56 -0400 Subject: [gpfsug-discuss] GPFS FPO In-Reply-To: References: Message-ID: <6f5a7284-c910-bbda-5e53-7f78e4289ad9@nasa.gov> To tell you the truth, I don't. It's on my radar but I haven't done it yet. I *have* run hadoop on GPFS w/o magpie though and on only a couple of nodes was able to pound 1GB/s out to GPFS w/ the terasort benchmark. I know our GPFS FS can go much faster than that but java was cpu-bound as it often seems to be. -Aaron On 8/23/16 7:56 AM, Brian Marshall wrote: > Aaron, > > Do you have experience running this on native GPFS? The docs say Lustre > and any NFS filesystem. > > Thanks, > Brian > > > On Aug 22, 2016 10:37 PM, "Aaron Knister" > wrote: > > Yes, indeed. Note that these are my personal opinions. > > It seems to work quite well and it's not terribly hard to set up or > get running. That said, if you've got a traditional HPC cluster with > reasonably good bandwidth (and especially if your data is already on > the HPC cluster) I wouldn't bother with FPO and just use something > like magpie (https://github.com/LLNL/magpie > ) to run your hadoopy workload on > GPFS on your traditional HPC cluster. I believe FPO (and by > extension data locality) is important when the available bandwidth > between your clients and servers/disks (in a traditional GPFS > environment) is less than the bandwidth available within a node > (e.g. between your local disks and the host CPU). > > -Aaron > > On 8/22/16 10:23 PM, Brian Marshall wrote: > > Does anyone have any experiences to share (good or bad) about > setting up > and utilizing FPO for hadoop compute on top of GPFS? > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From SAnderson at convergeone.com Thu Aug 25 17:32:48 2016 From: SAnderson at convergeone.com (Shaun Anderson) Date: Thu, 25 Aug 2016 16:32:48 +0000 Subject: [gpfsug-discuss] mmcessmbchconfig command Message-ID: <1472142769455.35752@convergeone.com> ?I'm not seeing many of the 'mmces' commands listed and there is no man page for many of them. I'm specifically looking at the mmcessmbchconfig command and my syntax isn't being taken. Any insight? SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From bbanister at jumptrading.com Thu Aug 25 17:47:00 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Thu, 25 Aug 2016 16:47:00 +0000 Subject: [gpfsug-discuss] mmcessmbchconfig command In-Reply-To: <1472142769455.35752@convergeone.com> References: <1472142769455.35752@convergeone.com> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB0630BF86@CHI-EXCHANGEW1.w2k.jumptrading.com> My general rule is that if there isn?t a man page or ?-h? option to explain the usage of the command, then it isn?t meant to be run by an user administrator. I wish that the commands that should never be run by a user admin (or without direction from IBM support) would be put in a different directory that clearly indicated they are for internal GPFS use. RFE worthy? Cheers, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Shaun Anderson Sent: Thursday, August 25, 2016 11:33 AM To: gpfsug main discussion list Subject: [gpfsug-discuss] mmcessmbchconfig command ?I'm not seeing many of the 'mmces' commands listed and there is no man page for many of them. I'm specifically looking at the mmcessmbchconfig command and my syntax isn't being taken. Any insight? SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. -------------- next part -------------- An HTML attachment was scrubbed... URL: From bbanister at jumptrading.com Thu Aug 25 17:50:20 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Thu, 25 Aug 2016 16:50:20 +0000 Subject: [gpfsug-discuss] mmcessmbchconfig command In-Reply-To: <21BC488F0AEA2245B2C3E83FC0B33DBB0630BF86@CHI-EXCHANGEW1.w2k.jumptrading.com> References: <1472142769455.35752@convergeone.com> <21BC488F0AEA2245B2C3E83FC0B33DBB0630BF86@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB0630BFD5@CHI-EXCHANGEW1.w2k.jumptrading.com> I realize this was totally tangential to your question. Sorry I can?t help with the syntax, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Bryan Banister Sent: Thursday, August 25, 2016 11:47 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] mmcessmbchconfig command My general rule is that if there isn?t a man page or ?-h? option to explain the usage of the command, then it isn?t meant to be run by an user administrator. 
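A rough way to apply that rule of thumb on a node (just a sketch, assuming the standard /usr/lpp/mmfs/bin install path):

    # documented, administrator-facing front end: has a man page and usage text
    man mmsmb
    /usr/lpp/mmfs/bin/mmsmb config list

    # no man page and no usage output usually means an internal helper script
    man mmcessmbchconfig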
I wish that the commands that should never be run by a user admin (or without direction from IBM support) would be put in a different directory that clearly indicated they are for internal GPFS use. RFE worthy? Cheers, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Shaun Anderson Sent: Thursday, August 25, 2016 11:33 AM To: gpfsug main discussion list > Subject: [gpfsug-discuss] mmcessmbchconfig command ?I'm not seeing many of the 'mmces' commands listed and there is no man page for many of them. I'm specifically looking at the mmcessmbchconfig command and my syntax isn't being taken. Any insight? SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. -------------- next part -------------- An HTML attachment was scrubbed... URL: From taylorm at us.ibm.com Thu Aug 25 17:55:44 2016 From: taylorm at us.ibm.com (Michael L Taylor) Date: Thu, 25 Aug 2016 09:55:44 -0700 Subject: [gpfsug-discuss] mmcessmbchconfig command In-Reply-To: References: Message-ID: Not sure where mmcessmbchconfig command is coming from? mmsmb is the proper CLI syntax [root at smaug-vm1 installer]# /usr/lpp/mmfs/bin/mmsmb Usage: mmsmb export Administer SMB exports. mmsmb exportacl Administer SMB export ACLs. mmsmb config Administer SMB global configuration. [root at smaug-vm1 installer]# /usr/lpp/mmfs/bin/mmsmb export -h Usage: mmsmb export list List SMB exports. mmsmb export add Add SMB exports. mmsmb export change Change SMB exports. mmsmb export remove Remove SMB exports. 
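For example, creating and checking a share through the documented front end might look roughly like this (export name and path are made up for illustration, and the directory must already exist):

    /usr/lpp/mmfs/bin/mmsmb export add testshare /gpfs/fs0/testshare
    /usr/lpp/mmfs/bin/mmsmb export list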
[root at smaug-vm1 installer]# man mmsmb http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_mmsmb.htm -------------- next part -------------- An HTML attachment was scrubbed... URL: From mweil at wustl.edu Thu Aug 25 19:50:52 2016 From: mweil at wustl.edu (Matt Weil) Date: Thu, 25 Aug 2016 13:50:52 -0500 Subject: [gpfsug-discuss] Backup on object stores Message-ID: <5cc4ae43-2d0f-e548-b256-84f1890fe2d3@wustl.edu> Hello all, Just brain storming here mainly but want to know how you are all approaching this. Do you replicate using GPFS and forget about backups? > https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adv_osbackup.htm This seems good for a full recovery but what if I just lost one object? Seems if objectizer is in use then both tivoli and space management can be used on the file. Thanks in advance for your responses. Matt ________________________________ The materials in this message are private and may contain Protected Healthcare Information or other information of a sensitive nature. If you are not the intended recipient, be advised that any unauthorized use, disclosure, copying or the taking of any action in reliance on the contents of this information is strictly prohibited. If you have received this email in error, please immediately notify the sender via telephone or return mail. From billowen at us.ibm.com Thu Aug 25 20:55:33 2016 From: billowen at us.ibm.com (Bill Owen) Date: Thu, 25 Aug 2016 12:55:33 -0700 Subject: [gpfsug-discuss] Backup on object stores In-Reply-To: <5cc4ae43-2d0f-e548-b256-84f1890fe2d3@wustl.edu> References: <5cc4ae43-2d0f-e548-b256-84f1890fe2d3@wustl.edu> Message-ID: Hi Matt, With Spectrum Scale object storage, you can create storage policies, and then assign containers to those policies. Each policy will map to a GPFS independent fileset. That way, you can subdivide object storage and manage different types of objects based on the type of data stored in the container/storage policy (i.e., back up some types of object data nightly, some weekly, some not at all). Today, we don't have a cli to simplify to restoring individual objects. But using commands like swift-get-nodes, you can determine the filesystem path to an object, and then restore only that item. And if you are using storage policies with file & object access enabled, you can access the object/files by file path directly. Regards, Bill Owen billowen at us.ibm.com Spectrum Scale Object Storage 520-799-4829 From: Matt Weil To: Date: 08/25/2016 11:51 AM Subject: [gpfsug-discuss] Backup on object stores Sent by: gpfsug-discuss-bounces at spectrumscale.org Hello all, Just brain storming here mainly but want to know how you are all approaching this. Do you replicate using GPFS and forget about backups? > https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adv_osbackup.htm This seems good for a full recovery but what if I just lost one object? Seems if objectizer is in use then both tivoli and space management can be used on the file. Thanks in advance for your responses. Matt ________________________________ The materials in this message are private and may contain Protected Healthcare Information or other information of a sensitive nature. If you are not the intended recipient, be advised that any unauthorized use, disclosure, copying or the taking of any action in reliance on the contents of this information is strictly prohibited. 
If you have received this email in error, please immediately notify the sender via telephone or return mail. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From Greg.Lehmann at csiro.au Fri Aug 26 00:14:57 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Thu, 25 Aug 2016 23:14:57 +0000 Subject: [gpfsug-discuss] mmcessmbchconfig command In-Reply-To: <21BC488F0AEA2245B2C3E83FC0B33DBB0630BF86@CHI-EXCHANGEW1.w2k.jumptrading.com> References: <1472142769455.35752@convergeone.com> <21BC488F0AEA2245B2C3E83FC0B33DBB0630BF86@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: <156b078bfb2d48d8b77d5250dba7e928@exch1-cdc.nexus.csiro.au> I agree with an RFE. From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Bryan Banister Sent: Friday, 26 August 2016 2:47 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] mmcessmbchconfig command My general rule is that if there isn?t a man page or ?-h? option to explain the usage of the command, then it isn?t meant to be run by an user administrator. I wish that the commands that should never be run by a user admin (or without direction from IBM support) would be put in a different directory that clearly indicated they are for internal GPFS use. RFE worthy? Cheers, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Shaun Anderson Sent: Thursday, August 25, 2016 11:33 AM To: gpfsug main discussion list > Subject: [gpfsug-discuss] mmcessmbchconfig command ?I'm not seeing many of the 'mmces' commands listed and there is no man page for many of them. I'm specifically looking at the mmcessmbchconfig command and my syntax isn't being taken. Any insight? SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From syi at ca.ibm.com Fri Aug 26 00:15:46 2016 From: syi at ca.ibm.com (Yi Sun) Date: Thu, 25 Aug 2016 19:15:46 -0400 Subject: [gpfsug-discuss] mmcessmbchconfig command In-Reply-To: References: Message-ID: You may check mmsmb command, not sure if it is what you look for. https://www.ibm.com/support/knowledgecenter/STXKQY_4.1.1/com.ibm.spectrum.scale.v4r11.adm.doc/bl1adm_mmsmb.htm#mmsmb ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- From: Shaun Anderson To: gpfsug main discussion list Subject: [gpfsug-discuss] mmcessmbchconfig command Message-ID: <1472142769455.35752 at convergeone.com> Content-Type: text/plain; charset="iso-8859-1" ?I'm not seeing many of the 'mmces' commands listed and there is no man page for many of them. I'm specifically looking at the mmcessmbchconfig command and my syntax isn't being taken. Any insight? SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 -------------- next part -------------- An HTML attachment was scrubbed... URL: From christof.schmitt at us.ibm.com Fri Aug 26 00:49:12 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Thu, 25 Aug 2016 19:49:12 -0400 Subject: [gpfsug-discuss] CES and mmuserauth command In-Reply-To: References: Message-ID: To clarify and expand on some of these: --servers takes the AD Domain Controller that is contacted first during configuration. Later and during normal operations the list of DCs is retrieved from DNS and the fastest (or closest one according to the AD sites) is used. The initially one used does not have a special role. --idmap-role allows dedicating one cluster as a master, and a second cluster (e.g. a AFM replication target) as "subordinate". Only the master will allocate idmap ranges which can then be imported to the subordiate to have consistent id mappings. --idmap-range-size and --idmap-range are used for the internal idmap allocation which is used for every domain that is not explicitly using another domain. "man idmap_autorid" explains the approach taken. As long as the default does not overlap with any other ids, that can be used. The "netbios" name is used to create the machine account for the cluster when joining the AD domain. That is how the AD administrator will identify the CES cluster. It is also important in SMB deployments when Kerberos should be used with SMB: The same names as the netbios name has to be defined in DNS for the public CES IP addresses. When the name matches, then SMB clients can acquire a Kerberos ticket from AD to establish a SMB connection. When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Jan-Frode Myklebust To: gpfsug main discussion list Date: 08/23/2016 08:15 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org Sorry to see no authoritative answers yet.. I'm doing lots of CES installations, but have not quite yet gotten the full understanding of this.. 
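A quick way to sanity-check this on a protocol node is something like the following (a sketch; it assumes the CES Samba tools live under /usr/lpp/mmfs/bin, which is where current packages put them):

    # show the configured authentication and run the built-in checks
    mmuserauth service list
    mmuserauth service check --data-access-method file

    # ask winbind whether it can currently reach a domain controller
    /usr/lpp/mmfs/bin/wbinfo --ping-dc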
Simple stuff first: --servers You can only have one with AD. --enable-kerberos shouldn't be used, as that's only for LDAP according to the documentation. Guess kerberos is implied with AD. --idmap-role -- I've been using "master". Man-page says ID map role of a stand?alone or singular system deployment must be selected "master" What the idmap options seems to be doing is configure the idmap options for Samba. Maybe best explained by: https://wiki.samba.org/index.php/Idmap_config_ad Your suggested options will then give you the samba idmap configuration: idmap config * : rangesize = 1000000 idmap config * : range = 3000000-3500000 idmap config * : read only = no idmap:cache = no idmap config * : backend = autorid idmap config DOMAIN : schema_mode = rfc2307 idmap config DOMAIN : range = 500-2000000 idmap config DOMAIN : backend = ad Most likely you want to replace DOMAIN by your AD domain name.. So the --idmap options sets some defaults, that you probably won't care about, since all your users are likely covered by the specific "idmap config DOMAIN" config. Hope this helps somewhat, now I'll follow up with something I'm wondering myself...: Is the netbios name just a name, without any connection to anything in AD? Is the --user-name/--password a one-time used account that's only necessary when executing the mmuserauth command, or will it also be for communication between CES and AD while the services are running? -jf On Mon, Aug 22, 2016 at 1:59 PM, Sobey, Richard A wrote: Hi all, We?re just about to start testing a new CES 4.2.0 cluster and at the stage of ?joining? the cluster to our AD. What?s the bare minimum we need to get going with this? My Windows guy (who is more Linux but whatever) has suggested the following: mmuserauth service create --type ad --data-access-method file --netbios-name store --user-name USERNAME --password --enable-nfs-kerberos --enable-kerberos --servers list,of,servers --idmap-range-size 1000000 --idmap-range 3000000 - 3500000 --unixmap-domains 'DOMAIN(500 - 2000000)' He has also asked what the following is: --idmap-role ??? --idmap-range-size ?? All our LDAP GID/UIDs are coming from a system outside of GPFS so do we leave this blank, or say master Or, now I?ve re-read and mmuserauth page, is this purely for when you have AFM relationships and one GPFS cluster (the subordinate / the second cluster) gets its UIDs and GIDs from another GPFS cluster (the master / the first one)? For idmap-range-size is this essentially the highest number of users and groups you can have defined within Spectrum Scale? (I love how I?m using GPFS and SS interchangeably.. forgive me!) 
Many thanks Richard Richard Sobey Storage Area Network (SAN) Analyst Technical Operations, ICT Imperial College London South Kensington 403, City & Guilds Building London SW7 2AZ Tel: +44 (0)20 7594 6915 Email: r.sobey at imperial.ac.uk http://www.imperial.ac.uk/admin-services/ict/ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From christof.schmitt at us.ibm.com Fri Aug 26 00:49:12 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Thu, 25 Aug 2016 19:49:12 -0400 Subject: [gpfsug-discuss] mmcessmbchconfig command In-Reply-To: <1472142769455.35752@convergeone.com> References: <1472142769455.35752@convergeone.com> Message-ID: The mmcessmb* commands are scripts that are run from the corresponding mmsmb subcommands. mmsmb is documented and should be used instead of calling the mmcesmb* scripts directly. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Shaun Anderson To: gpfsug main discussion list Date: 08/25/2016 12:33 PM Subject: [gpfsug-discuss] mmcessmbchconfig command Sent by: gpfsug-discuss-bounces at spectrumscale.org ?I'm not seeing many of the 'mmces' commands listed and there is no man page for many of them. I'm specifically looking at the mmcessmbchconfig command and my syntax isn't being taken. Any insight? SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From christof.schmitt at us.ibm.com Fri Aug 26 00:52:50 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Thu, 25 Aug 2016 19:52:50 -0400 Subject: [gpfsug-discuss] CES mmsmb options In-Reply-To: References: Message-ID: The options listed in " mmsmb config change --key-info supported" are supported to be changed by administrator of the cluster. "mmsmb config list" lists the whole Samba config, including the options that are set internally. We do not want to support any random Samba configuration, hence the line between "supported" option and everything else. If there is a usecase that requires other Samba options than the ones listed as "supported", one way forward would be opening a RFE that describes the usecase and the Samba option to support it. 
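For the options that are listed as supported, the change itself is a one-liner along these lines (a sketch; the --option form mirrors the mmsmb export syntax and should be checked against the man page of the installed release):

    # see what is set and which keys may be changed
    mmsmb config list
    mmsmb config change --key-info supported

    # change one of the supported keys, e.g. the server string
    mmsmb config change --option "server string=Spectrum Scale CES cluster"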
Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: "Sobey, Richard A" To: "'gpfsug-discuss at spectrumscale.org'" Date: 08/22/2016 09:28 AM Subject: [gpfsug-discuss] CES mmsmb options Sent by: gpfsug-discuss-bounces at spectrumscale.org Related to my previous question in so far as it?s to do with CES, what?s this all about: [root at ces]# mmsmb config change --key-info supported Supported smb options with allowed values: gpfs:dfreequota = yes, no restrict anonymous = 0, 2 server string = any mmsmb config list shows many more options. Are they static? for example log size / location / dmapi support? I?m surely missing something obvious. It?s SS 4.2.0 btw. Thanks Richard_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From gaurang.tapase at in.ibm.com Fri Aug 26 08:53:12 2016 From: gaurang.tapase at in.ibm.com (Gaurang Tapase) Date: Fri, 26 Aug 2016 13:23:12 +0530 Subject: [gpfsug-discuss] Blogs and publications on Spectrum Scale Message-ID: Hello, On Request from Bob Oesterlin, we post these links on User Group - Here are the latest publications and Blogs on Spectrum Scale. We encourage the User Group to follow the Spectrum Scale blogs on the http://storagecommunity.org or the Usergroup admin to register the email group of the feeds. A total of 25 recent Blogs on IBM Spectrum Scale by developers IBM Spectrum Scale Security IBM Spectrum Scale: Security Blog Series http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-blog-series , Spectrum Scale Security Blog Series: Introduction, http://storagecommunity.org/easyblog/entry/spectrum-scale-security-blog-series-introduction IBM Spectrum Scale Security: VLANs and Protocol nodes, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-vlans-and-protocol-nodes IBM Spectrum Scale Security: Firewall Overview http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-firewall-overview IBM Spectrum Scale Security Blog Series: Security with Spectrum Scale OpenStack Storage Drivers http://storagecommunity.org/easyblog/entry/security-with-spectrum-scale-openstack-storage-drivers , IBM Spectrum Scale Security Blog Series: Authorization http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-blog-series-authorization IBM Spectrum Scale: Object (OpenStack Swift, S3) Authorization, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-object-openstack-swift-s3-authorization , IBM Spectrum Scale Security: Secure Data at Rest, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-secure-data-at-rest IBM Spectrum Scale Security Blog Series: Secure Data in Transit, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-blog-series-secure-data-in-transit-1 IBM Spectrum Scale Security Blog Series: Sudo based Secure Administration and Admin Command Logging, http://storagecommunity.org/easyblog/entry/spectrum-scale-security-blog-series-sudo-based-secure-administration-and-admin-command-logging IBM Spectrum Scale Security: Security Features of Transparent Cloud Tiering (TCT), http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-security-features-of-transparent-cloud-tiering-tct IBM Spectrum Scale: Immutability, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-immutability IBM Spectrum Scale : FILE protocols authentication 
http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-file-protocols-authentication IBM Spectrum Scale : Object Authentication, http://storagecommunity.org/easyblog/entry/protocol-authentication-object, IBM Spectrum Scale Security: Anti-Virus bulk scanning, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-anti-virus-bulk-scanning , Spectrum Scale 4.2.1 - What's New http://storagecommunity.org/easyblog/entry/spectrum-scale-4-2-1-what-s-new IBM Spectrum Scale 4.2.1 : diving deeper, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-4-2-1-diving-deeper NEW DEMO: Using IBM Cloud Object Storage as IBM Spectrum Scale Transparent Cloud Tier, http://storagecommunity.org/easyblog/entry/new-demo-using-ibm-cloud-object-storage-as-ibm-spectrum-scale-transparent-cloud-tier Spectrum Scale transparent cloud tiering, http://storagecommunity.org/easyblog/entry/spectrum-scale-transparent-cloud-tiering Spectrum Scale in Wonderland - Introducing transparent cloud tiering with Spectrum Scale 4.2.1, http://storagecommunity.org/easyblog/entry/spectrum-scale-in-wonderland, Spectrum Scale Object Related Blogs IBM Spectrum Scale 4.2.1 - What's new in Object, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-4-2-1-what-s-new-in-object , Hot cakes or hot objects, they better be served fast http://storagecommunity.org/easyblog/entry/hot-cakes-or-hot-objects-they-better-be-served-fast IBM Spectrum Scale: Object (OpenStack Swift, S3) Authorization, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-object-openstack-swift-s3-authorization , IBM Spectrum Scale : Object Authentication, http://storagecommunity.org/easyblog/entry/protocol-authentication-object, Spectrum Scale BD&A IBM Spectrum Scale: new features of HDFS Transparency, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-new-features-of-hdfs-transparency , Regards, ------------------------------------------------------------------------ Gaurang S Tapase Spectrum Scale & OpenStack Development IBM India Storage Lab, Pune (India) Email : gaurang.tapase at in.ibm.com Phone : +91-20-42025699 (W), +91-9860082042(Cell) ------------------------------------------------------------------------- -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Fri Aug 26 09:17:55 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 26 Aug 2016 08:17:55 +0000 Subject: [gpfsug-discuss] CES mmsmb options In-Reply-To: References: Message-ID: Thanks Christof, and for the detailed posting on the mmuserauth settings. I do not know why we have changed dmapi support in our existing smb.conf, but perhaps it was for some legacy stuff. Richard -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Christof Schmitt Sent: 26 August 2016 00:53 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] CES mmsmb options The options listed in " mmsmb config change --key-info supported" are supported to be changed by administrator of the cluster. "mmsmb config list" lists the whole Samba config, including the options that are set internally. We do not want to support any random Samba configuration, hence the line between "supported" option and everything else. If there is a usecase that requires other Samba options than the ones listed as "supported", one way forward would be opening a RFE that describes the usecase and the Samba option to support it. 
Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: "Sobey, Richard A" To: "'gpfsug-discuss at spectrumscale.org'" Date: 08/22/2016 09:28 AM Subject: [gpfsug-discuss] CES mmsmb options Sent by: gpfsug-discuss-bounces at spectrumscale.org Related to my previous question in so far as it?s to do with CES, what?s this all about: [root at ces]# mmsmb config change --key-info supported Supported smb options with allowed values: gpfs:dfreequota = yes, no restrict anonymous = 0, 2 server string = any mmsmb config list shows many more options. Are they static? for example log size / location / dmapi support? I?m surely missing something obvious. It?s SS 4.2.0 btw. Thanks Richard_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From r.sobey at imperial.ac.uk Fri Aug 26 09:48:24 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 26 Aug 2016 08:48:24 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Message-ID: Sorry all, prepare for a deluge of emails like this, hopefully it'll help other people implementing CES in the future. I'm trying to stop SMB on a node, but getting the following output: [root at cesnode ~]# mmces service stop smb smb: Request denied. Please stop NFS first [root at cesnode ~]# mmces service list Enabled services: SMB SMB is running As you can see there is no way to stop NFS when it's not running but it seems to be blocking me. It's happening on all the nodes in the cluster. SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From janfrode at tanso.net Fri Aug 26 10:48:18 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Fri, 26 Aug 2016 09:48:18 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: Message-ID: That was a weird one :-) Don't understand why NFS would block smb.., and I don't see that on my cluster. Would it make sense to suspend the node instead? As a workaround. mmces node suspend -jf fre. 26. aug. 2016 kl. 10.48 skrev Sobey, Richard A : > Sorry all, prepare for a deluge of emails like this, hopefully it?ll help > other people implementing CES in the future. > > > > I?m trying to stop SMB on a node, but getting the following output: > > > > [root at cesnode ~]# mmces service stop smb > > smb: Request denied. Please stop NFS first > > > > [root at cesnode ~]# mmces service list > > Enabled services: SMB > > SMB is running > > > > As you can see there is no way to stop NFS when it?s not running but it > seems to be blocking me. It?s happening on all the nodes in the cluster. > > > > SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. > > > > Richard > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From konstantin.arnold at unibas.ch Fri Aug 26 10:56:28 2016 From: konstantin.arnold at unibas.ch (Konstantin Arnold) Date: Fri, 26 Aug 2016 11:56:28 +0200 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? 
In-Reply-To: References: Message-ID: <57C0124C.7050404@unibas.ch> Hi Richard, I ran into the same issue and asked if 'systemctl reload gpfs-smb.service' would work? I got the following answer: "... Now in regards to your question about stopping NFS, yes this is an expected behavior and yes you could also restart through systemctl." Maybe that helps. Konstantin From janfrode at tanso.net Fri Aug 26 10:59:34 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Fri, 26 Aug 2016 11:59:34 +0200 Subject: [gpfsug-discuss] CES and mmuserauth command In-Reply-To: References: Message-ID: On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: > > When joinging the AD domain, --user-name, --password and --server are only > used to initially identify and logon to the AD and to create the machine > account for the cluster. Once that is done, that information is no longer > used, and e.g. the account from --user-name could be deleted, the password > changed or the specified DC could be removed from the domain (as long as > other DCs are remaining). > > That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: ------------------------------------------------------------------------------------------------------ ??user?name userName Specifies the user name to be used to perform operations against the authentication server. The specified user name must have sufficient permissions to read user and group attributes from the authentication server. ------------------------------------------------------------------------------------------------------- Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? -jf -------------- next part -------------- An HTML attachment was scrubbed... URL: From christof.schmitt at us.ibm.com Fri Aug 26 17:29:31 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Fri, 26 Aug 2016 12:29:31 -0400 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: Message-ID: That would be the case when Active Directory is configured for authentication. In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: "Sobey, Richard A" To: "'gpfsug-discuss at spectrumscale.org'" Date: 08/26/2016 04:48 AM Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Sent by: gpfsug-discuss-bounces at spectrumscale.org Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. I?m trying to stop SMB on a node, but getting the following output: [root at cesnode ~]# mmces service stop smb smb: Request denied. Please stop NFS first [root at cesnode ~]# mmces service list Enabled services: SMB SMB is running As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. It?s happening on all the nodes in the cluster. SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. 
Richard_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From christof.schmitt at us.ibm.com Fri Aug 26 17:29:31 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Fri, 26 Aug 2016 12:29:31 -0400 Subject: [gpfsug-discuss] CES and mmuserauth command In-Reply-To: References: Message-ID: The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Jan-Frode Myklebust To: gpfsug main discussion list Date: 08/26/2016 05:59 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: ------------------------------------------------------------------------------------------------------ ??user?name userName Specifies the user name to be used to perform operations against the authentication server. The specified user name must have sufficient permissions to read user and group attributes from the authentication server. ------------------------------------------------------------------------------------------------------- Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? -jf_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From dacalder at co.ibm.com Sat Aug 27 13:52:44 2016 From: dacalder at co.ibm.com (Danny Alexander Calderon Rodriguez) Date: Sat, 27 Aug 2016 12:52:44 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: Message-ID: Hi Richard This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly Just do this. 
edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py Change if authType == 'ad' and not nodeState.nfsStopped: to nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: You need to stop the gpfs service in each node where you apply the change " after change the lines please use tap key" Enviado desde mi iPhone > El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: > > Send gpfsug-discuss mailing list submissions to > gpfsug-discuss at spectrumscale.org > > To subscribe or unsubscribe via the World Wide Web, visit > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > or, via email, send a message with subject or body 'help' to > gpfsug-discuss-request at spectrumscale.org > > You can reach the person managing the list at > gpfsug-discuss-owner at spectrumscale.org > > When replying, please edit your Subject line so it is more specific > than "Re: Contents of gpfsug-discuss digest..." > > > Today's Topics: > > 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) > 2. Re: CES and mmuserauth command (Christof Schmitt) > > > ---------------------------------------------------------------------- > > Message: 1 > Date: Fri, 26 Aug 2016 12:29:31 -0400 > From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > Message-ID: > > > Content-Type: text/plain; charset="UTF-8" > > That would be the case when Active Directory is configured for > authentication. In that case the SMB service includes two aspects: One is > the actual SMB file server, and the second one is the service for the > Active Directory integration. Since NFS depends on authentication and id > mapping services, it requires SMB to be running. > > Regards, > > Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ > christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) > > > > From: "Sobey, Richard A" > To: "'gpfsug-discuss at spectrumscale.org'" > > Date: 08/26/2016 04:48 AM > Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Sorry all, prepare for a deluge of emails like this, hopefully it?ll help > other people implementing CES in the future. > > I?m trying to stop SMB on a node, but getting the following output: > > [root at cesnode ~]# mmces service stop smb > smb: Request denied. Please stop NFS first > > [root at cesnode ~]# mmces service list > Enabled services: SMB > SMB is running > > As you can see there is no way to stop NFS when it?s not running but it > seems to be blocking me. It?s happening on all the nodes in the cluster. > > SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. > > Richard_______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > ------------------------------ > > Message: 2 > Date: Fri, 26 Aug 2016 12:29:31 -0400 > From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] CES and mmuserauth command > Message-ID: > > > Content-Type: text/plain; charset="ISO-2022-JP" > > The --user-name option applies to both, AD and LDAP authentication. In the > LDAP case, this information is correct. I will try to get some > clarification added for the AD case. > > The same applies to the information shown in "service list". 
There is a > common field that holds the information and the parameter from the initial > "service create" is stored there. The meaning is different for AD and > LDAP: For LDAP it is the username being used to access the LDAP server, > while in the AD case it was only the user initially used until the machine > account was created. > > Regards, > > Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ > christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) > > > > From: Jan-Frode Myklebust > To: gpfsug main discussion list > Date: 08/26/2016 05:59 AM > Subject: Re: [gpfsug-discuss] CES and mmuserauth command > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < > christof.schmitt at us.ibm.com> wrote: > > When joinging the AD domain, --user-name, --password and --server are only > used to initially identify and logon to the AD and to create the machine > account for the cluster. Once that is done, that information is no longer > used, and e.g. the account from --user-name could be deleted, the password > changed or the specified DC could be removed from the domain (as long as > other DCs are remaining). > > > That was my initial understanding of the --user-name, but when reading the > man-page I get the impression that it's also used to do connect to AD to > do user and group lookups: > > ------------------------------------------------------------------------------------------------------ > ??user?name userName > Specifies the user name to be used to perform operations > against the authentication server. The specified user > name must have sufficient permissions to read user and > group attributes from the authentication server. > ------------------------------------------------------------------------------------------------------- > > Also it's strange that "mmuserauth service list" would list the USER_NAME > if it was only somthing that was used at configuration time..? > > > > -jf_______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > > ------------------------------ > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > End of gpfsug-discuss Digest, Vol 55, Issue 44 > ********************************************** > -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Sat Aug 27 20:06:45 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Sat, 27 Aug 2016 19:06:45 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: Message-ID: Hi, Thanks for the info! I think I?ll perform an upgrade to 4.2.1, the cluster is still in a pre-production state and I?ve yet to really start testing client access. Richard From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Danny Alexander Calderon Rodriguez Sent: 27 August 2016 13:53 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Hi Richard This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly Just do this. 
edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py Change if authType == 'ad' and not nodeState.nfsStopped: to nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: You need to stop the gpfs service in each node where you apply the change " after change the lines please use tap key" Enviado desde mi iPhone El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) 2. Re: CES and mmuserauth command (Christof Schmitt) ---------------------------------------------------------------------- Message: 1 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Message-ID: > Content-Type: text/plain; charset="UTF-8" That would be the case when Active Directory is configured for authentication. In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: "Sobey, Richard A" > To: "'gpfsug-discuss at spectrumscale.org'" > Date: 08/26/2016 04:48 AM Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Sent by: gpfsug-discuss-bounces at spectrumscale.org Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. I?m trying to stop SMB on a node, but getting the following output: [root at cesnode ~]# mmces service stop smb smb: Request denied. Please stop NFS first [root at cesnode ~]# mmces service list Enabled services: SMB SMB is running As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. It?s happening on all the nodes in the cluster. SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. Richard_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ Message: 2 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] CES and mmuserauth command Message-ID: > Content-Type: text/plain; charset="ISO-2022-JP" The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. 
The meaning is different for AD and LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Jan-Frode Myklebust > To: gpfsug main discussion list > Date: 08/26/2016 05:59 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: ------------------------------------------------------------------------------------------------------ ??user?name userName Specifies the user name to be used to perform operations against the authentication server. The specified user name must have sufficient permissions to read user and group attributes from the authentication server. ------------------------------------------------------------------------------------------------------- Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? -jf_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss End of gpfsug-discuss Digest, Vol 55, Issue 44 ********************************************** -------------- next part -------------- An HTML attachment was scrubbed... URL: From Greg.Lehmann at csiro.au Mon Aug 29 00:57:21 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Sun, 28 Aug 2016 23:57:21 +0000 Subject: [gpfsug-discuss] Blogs and publications on Spectrum Scale In-Reply-To: References: Message-ID: <57496841ec784222b5e291a921280c38@exch1-cdc.nexus.csiro.au> It would be nice if the Spectrum Scale User Group website had links to these, perhaps a separate page for blogs links. From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Gaurang Tapase Sent: Friday, 26 August 2016 5:53 PM To: gpfsug main discussion list Cc: Sandeep Ramesh Subject: [gpfsug-discuss] Blogs and publications on Spectrum Scale Hello, On Request from Bob Oesterlin, we post these links on User Group - Here are the latest publications and Blogs on Spectrum Scale. We encourage the User Group to follow the Spectrum Scale blogs on the http://storagecommunity.orgor the Usergroup admin to register the email group of the feeds. 
A total of 25 recent Blogs on IBM Spectrum Scale by developers IBM Spectrum Scale Security IBM Spectrum Scale: Security Blog Series http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-blog-series, Spectrum Scale Security Blog Series: Introduction, http://storagecommunity.org/easyblog/entry/spectrum-scale-security-blog-series-introduction IBM Spectrum Scale Security: VLANs and Protocol nodes, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-vlans-and-protocol-nodes IBM Spectrum Scale Security: Firewall Overview http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-firewall-overview IBM Spectrum Scale Security Blog Series: Security with Spectrum Scale OpenStack Storage Drivers http://storagecommunity.org/easyblog/entry/security-with-spectrum-scale-openstack-storage-drivers, IBM Spectrum Scale Security Blog Series: Authorization http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-blog-series-authorization IBM Spectrum Scale: Object (OpenStack Swift, S3) Authorization, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-object-openstack-swift-s3-authorization, IBM Spectrum Scale Security: Secure Data at Rest, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-secure-data-at-rest IBM Spectrum Scale Security Blog Series: Secure Data in Transit, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-blog-series-secure-data-in-transit-1 IBM Spectrum Scale Security Blog Series: Sudo based Secure Administration and Admin Command Logging, http://storagecommunity.org/easyblog/entry/spectrum-scale-security-blog-series-sudo-based-secure-administration-and-admin-command-logging IBM Spectrum Scale Security: Security Features of Transparent Cloud Tiering (TCT), http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-security-features-of-transparent-cloud-tiering-tct IBM Spectrum Scale: Immutability, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-immutability IBM Spectrum Scale : FILE protocols authentication http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-file-protocols-authentication IBM Spectrum Scale : Object Authentication, http://storagecommunity.org/easyblog/entry/protocol-authentication-object, IBM Spectrum Scale Security: Anti-Virus bulk scanning, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-anti-virus-bulk-scanning, Spectrum Scale 4.2.1 - What's New http://storagecommunity.org/easyblog/entry/spectrum-scale-4-2-1-what-s-new IBM Spectrum Scale 4.2.1 : diving deeper, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-4-2-1-diving-deeper NEW DEMO: Using IBM Cloud Object Storage as IBM Spectrum Scale Transparent Cloud Tier, http://storagecommunity.org/easyblog/entry/new-demo-using-ibm-cloud-object-storage-as-ibm-spectrum-scale-transparent-cloud-tier Spectrum Scale transparent cloud tiering, http://storagecommunity.org/easyblog/entry/spectrum-scale-transparent-cloud-tiering Spectrum Scale in Wonderland - Introducing transparent cloud tiering with Spectrum Scale 4.2.1, http://storagecommunity.org/easyblog/entry/spectrum-scale-in-wonderland, Spectrum Scale Object Related Blogs IBM Spectrum Scale 4.2.1 - What's new in Object, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-4-2-1-what-s-new-in-object, Hot cakes or hot objects, they better be served fast http://storagecommunity.org/easyblog/entry/hot-cakes-or-hot-objects-they-better-be-served-fast IBM Spectrum Scale: Object (OpenStack Swift, 
S3) Authorization, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-object-openstack-swift-s3-authorization, IBM Spectrum Scale : Object Authentication, http://storagecommunity.org/easyblog/entry/protocol-authentication-object, Spectrum Scale BD&A IBM Spectrum Scale: new features of HDFS Transparency, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-new-features-of-hdfs-transparency, Regards, ------------------------------------------------------------------------ Gaurang S Tapase Spectrum Scale & OpenStack Development IBM India Storage Lab, Pune (India) Email : gaurang.tapase at in.ibm.com Phone : +91-20-42025699 (W), +91-9860082042(Cell) ------------------------------------------------------------------------- -------------- next part -------------- An HTML attachment was scrubbed... URL: From douglasof at us.ibm.com Mon Aug 29 06:34:03 2016 From: douglasof at us.ibm.com (Douglas O'flaherty) Date: Sun, 28 Aug 2016 22:34:03 -0700 Subject: [gpfsug-discuss] Edge Attendees Message-ID: Greetings: I am organizing an NDA round-table with the IBM Offering Managers at IBM Edge on Tuesday, September 20th at 1pm. The subject will be "The Future of IBM Spectrum Scale." IBM Offering Managers are the Product Owners at IBM. There will be discussions covering licensing, the roadmap for IBM Spectrum Scale RAID (aka GNR), new hardware platforms, etc. This is a unique opportunity to get feedback to the drivers of the IBM Spectrum Scale business plans. It should be a great companion to the content we get from Engineering and Research at most User Group meetings. To get an invitation, please email me privately at douglasof us.ibm.com. All who have a valid NDA are invited. I only need an approximate headcount of attendees. Try not to spam the mailing list. I am pushing to get the Offering Managers to have a similar session at SC16 as an IBM Multi-client Briefing. You can add your voice to that call on this thread, or email me directly. Spectrum Scale User Group at SC16 will once again take place on Sunday afternoon with cocktails to follow. I hope we can blow out the attendance numbers and the number of site speakers we had last year! I know Simon, Bob, and Kristy are already working the agenda. Get your ideas in to them or to me. See you in Vegas, Vegas, SLC, Vegas this Fall... Maybe Australia in between? doug Douglas O'Flaherty IBM Spectrum Storage Marketing -------------- next part -------------- An HTML attachment was scrubbed... URL: From stef.coene at docum.org Mon Aug 29 07:39:05 2016 From: stef.coene at docum.org (Stef Coene) Date: Mon, 29 Aug 2016 08:39:05 +0200 Subject: [gpfsug-discuss] Blogs and publications on Spectrum Scale In-Reply-To: References: Message-ID: <9bb8d52e-86a3-3ff7-daaf-dc6bf0a3bd82@docum.org> Hi, When trying to register on the website, I each time get the error: "Session expired. Please try again later." Stef From kraemerf at de.ibm.com Mon Aug 29 08:20:46 2016 From: kraemerf at de.ibm.com (Frank Kraemer) Date: Mon, 29 Aug 2016 09:20:46 +0200 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" Message-ID: Hi all, In the last months several customers were asking for the option to use multiple IBM Spectrum Protect servers to protect a single IBM Spectrum Scale file system. Some of these customer reached the server scalability limits, others wanted to increase the parallelism of the server housekeeping processes. 
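As a rough illustration of the split described in the paper linked below: mmbackup can drive more than one Spectrum Protect server for the same file system via its --tsm-servers option, with one server stanza per target in dsm.sys. This is only a sketch, not the paper's procedure -- the stanza names, addresses and directory layout here are invented placeholders, and the paper itself is the authoritative reference for a supported configuration.

    # Hypothetical dsm.sys stanzas on the backup nodes -- one per Spectrum Protect server
    # (server names and addresses are invented placeholders)
    cat >> /opt/tivoli/tsm/client/ba/bin/dsm.sys <<'EOF'
    SErvername         tsmsrv1
      COMMMethod       TCPip
      TCPServeraddress tsmsrv1.example.com
    SErvername         tsmsrv2
      COMMMethod       TCPip
      TCPServeraddress tsmsrv2.example.com
    EOF

    # Drive different parts of the file system to different servers, so backup traffic
    # and server-side housekeeping (expiration, reclamation) are spread across both
    /usr/lpp/mmfs/bin/mmbackup /gpfs/gpfs01/projects -t incremental --tsm-servers tsmsrv1
    /usr/lpp/mmfs/bin/mmbackup /gpfs/gpfs01/scratch  -t incremental --tsm-servers tsmsrv2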
In consideration of the significant growth of data, it can be assumed that more and more customers will be faced with this challenge in the future. Therefore, this paper was written to help address this situation. This paper describes the setup and configuration of multiple IBM Spectrum Protect servers to be used to store backup and HSM data of a single IBM Spectrum Scale file system. Besides the setup and configuration, the paper also contains several best practices that help to simplify the daily use and administration of such environments. Find the paper here: https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection A big THANK YOU goes to my co-writers Thomas Schreiber and Patrick Luft for their important input and all the tests (...and re-tests and re-tests and re-tests :-) ) they did. ...please share in your communities. Greetings, Dominic. ______________________________________________________________________________________________________________ Dominic Mueller-Wicke | IBM Spectrum Protect Development | Technical Lead | +49 7034 64 32794 | dominic.mueller at de.ibm.com Vorsitzende des Aufsichtsrats: Martina Koederitz; Geschäftsführung: Dirk Wittkopp Sitz der Gesellschaft: Böblingen; Registergericht: Amtsgericht Stuttgart, HRB 243294 -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Mon Aug 29 18:33:59 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 29 Aug 2016 13:33:59 -0400 Subject: [gpfsug-discuss] iowait? Message-ID: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> Hi Everyone, Would it be easy to have GPFS report iowait values in linux? This would be a huge help for us in determining whether a node's low utilization is due to some issue with the code running on it or if it's blocked on I/O, especially in a historical context. I naively tried on a test system changing schedule() in cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: again: /* call the scheduler */ if ( waitFlags & INTERRUPTIBLE ) schedule(); else io_schedule(); Seems to actually do what I'm after, but generally bad things happen when I start pretending I'm a kernel developer. Any thoughts? If I open an RFE would this be something that's relatively easy to implement (not asking for a commitment *to* implement it, just that I'm not asking for something seemingly simple that's actually fairly hard to implement)? -Aaron -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From chekh at stanford.edu Mon Aug 29 18:50:23 2016 From: chekh at stanford.edu (Alex Chekholko) Date: Mon, 29 Aug 2016 10:50:23 -0700 Subject: [gpfsug-discuss] iowait? In-Reply-To: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> Message-ID: <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> Any reason you can't just use iostat or collectl or any of a number of other standard tools to look at disk utilization? On 08/29/2016 10:33 AM, Aaron Knister wrote: > Hi Everyone, > > Would it be easy to have GPFS report iowait values in linux? This would > be a huge help for us in determining whether a node's low utilization is > due to some issue with the code running on it or if it's blocked on I/O, > especially in a historical context.
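To make the gap concrete: the iowait figure the standard tools report comes from the kernel's CPU counters, which only advance while some task is blocked in an io_schedule()-style wait, so time spent blocked inside the GPFS client (which uses plain schedule()) never lands there. A generic sampling sketch follows; it assumes the usual /proc/stat layout and is nothing GPFS-specific.

    # The 5th value after the "cpu" label in /proc/stat is cumulative iowait time in ticks
    read -r _ user nice system idle iowait _ < /proc/stat
    sleep 10
    read -r _ user2 nice2 system2 idle2 iowait2 _ < /proc/stat
    echo "iowait ticks over the last 10s: $((iowait2 - iowait))"

    # Or let sysstat do the math as a percentage: 10-second samples, 6 of them
    sar -u 10 6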
> > I naively tried on a test system changing schedule() in > cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: > > again: > /* call the scheduler */ > if ( waitFlags & INTERRUPTIBLE ) > schedule(); > else > io_schedule(); > > Seems to actually do what I'm after but generally bad things happen when > I start pretending I'm a kernel developer. > > Any thoughts? If I open an RFE would this be something that's relatively > easy to implement (not asking for a commitment *to* implement it, just > that I'm not asking for something seemingly simple that's actually > fairly hard to implement)? > > -Aaron > -- Alex Chekholko chekh at stanford.edu From aaron.s.knister at nasa.gov Mon Aug 29 18:54:12 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 29 Aug 2016 13:54:12 -0400 Subject: [gpfsug-discuss] iowait? In-Reply-To: <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> Message-ID: <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. On 8/29/16 1:50 PM, Alex Chekholko wrote: > Any reason you can't just use iostat or collectl or any of a number of > other standards tools to look at disk utilization? > > On 08/29/2016 10:33 AM, Aaron Knister wrote: >> Hi Everyone, >> >> Would it be easy to have GPFS report iowait values in linux? This would >> be a huge help for us in determining whether a node's low utilization is >> due to some issue with the code running on it or if it's blocked on I/O, >> especially in a historical context. >> >> I naively tried on a test system changing schedule() in >> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >> >> again: >> /* call the scheduler */ >> if ( waitFlags & INTERRUPTIBLE ) >> schedule(); >> else >> io_schedule(); >> >> Seems to actually do what I'm after but generally bad things happen when >> I start pretending I'm a kernel developer. >> >> Any thoughts? If I open an RFE would this be something that's relatively >> easy to implement (not asking for a commitment *to* implement it, just >> that I'm not asking for something seemingly simple that's actually >> fairly hard to implement)? >> >> -Aaron >> > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From bbanister at jumptrading.com Mon Aug 29 18:56:25 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Mon, 29 Aug 2016 17:56:25 +0000 Subject: [gpfsug-discuss] iowait? 
In-Reply-To: <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> There is the iohist data that may have what you're looking for, -Bryan -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister Sent: Monday, August 29, 2016 12:54 PM To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] iowait? Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. On 8/29/16 1:50 PM, Alex Chekholko wrote: > Any reason you can't just use iostat or collectl or any of a number of > other standards tools to look at disk utilization? > > On 08/29/2016 10:33 AM, Aaron Knister wrote: >> Hi Everyone, >> >> Would it be easy to have GPFS report iowait values in linux? This >> would be a huge help for us in determining whether a node's low >> utilization is due to some issue with the code running on it or if >> it's blocked on I/O, especially in a historical context. >> >> I naively tried on a test system changing schedule() in >> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >> >> again: >> /* call the scheduler */ >> if ( waitFlags & INTERRUPTIBLE ) >> schedule(); >> else >> io_schedule(); >> >> Seems to actually do what I'm after but generally bad things happen >> when I start pretending I'm a kernel developer. >> >> Any thoughts? If I open an RFE would this be something that's >> relatively easy to implement (not asking for a commitment *to* >> implement it, just that I'm not asking for something seemingly simple >> that's actually fairly hard to implement)? >> >> -Aaron >> > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. From olaf.weiser at de.ibm.com Mon Aug 29 19:02:38 2016 From: olaf.weiser at de.ibm.com (Olaf Weiser) Date: Mon, 29 Aug 2016 20:02:38 +0200 Subject: [gpfsug-discuss] iowait? 
In-Reply-To: <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov><7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> Message-ID: An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Mon Aug 29 19:04:32 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 29 Aug 2016 14:04:32 -0400 Subject: [gpfsug-discuss] iowait? In-Reply-To: <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: <7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> That's an interesting idea. I took a look at mmdig --iohist on a busy node it doesn't seem to capture more than literally 1 second of history. Is there a better way to grab the data or have gpfs capture more of it? Just to give some more context, as part of our monthly reporting requirements we calculate job efficiency by comparing the number of cpu cores requested by a given job with the cpu % utilization during that job's time window. Currently a job that's doing a sleep 9000 would show up the same as a job blocked on I/O. Having GPFS wait time included in iowait would allow us to easily make this distinction. -Aaron On 8/29/16 1:56 PM, Bryan Banister wrote: > There is the iohist data that may have what you're looking for, > -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister > Sent: Monday, August 29, 2016 12:54 PM > To: gpfsug-discuss at spectrumscale.org > Subject: Re: [gpfsug-discuss] iowait? > > Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. > > On 8/29/16 1:50 PM, Alex Chekholko wrote: >> Any reason you can't just use iostat or collectl or any of a number of >> other standards tools to look at disk utilization? >> >> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>> Hi Everyone, >>> >>> Would it be easy to have GPFS report iowait values in linux? This >>> would be a huge help for us in determining whether a node's low >>> utilization is due to some issue with the code running on it or if >>> it's blocked on I/O, especially in a historical context. >>> >>> I naively tried on a test system changing schedule() in >>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>> >>> again: >>> /* call the scheduler */ >>> if ( waitFlags & INTERRUPTIBLE ) >>> schedule(); >>> else >>> io_schedule(); >>> >>> Seems to actually do what I'm after but generally bad things happen >>> when I start pretending I'm a kernel developer. >>> >>> Any thoughts? If I open an RFE would this be something that's >>> relatively easy to implement (not asking for a commitment *to* >>> implement it, just that I'm not asking for something seemingly simple >>> that's actually fairly hard to implement)? 
>>> >>> -Aaron >>> >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From bbanister at jumptrading.com Mon Aug 29 19:06:36 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Mon, 29 Aug 2016 18:06:36 +0000 Subject: [gpfsug-discuss] iowait? In-Reply-To: <7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> <7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> Try this: mmchconfig ioHistorySize=1024 # Or however big you want! Cheers, -Bryan -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister Sent: Monday, August 29, 2016 1:05 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] iowait? That's an interesting idea. I took a look at mmdig --iohist on a busy node it doesn't seem to capture more than literally 1 second of history. Is there a better way to grab the data or have gpfs capture more of it? Just to give some more context, as part of our monthly reporting requirements we calculate job efficiency by comparing the number of cpu cores requested by a given job with the cpu % utilization during that job's time window. Currently a job that's doing a sleep 9000 would show up the same as a job blocked on I/O. Having GPFS wait time included in iowait would allow us to easily make this distinction. -Aaron On 8/29/16 1:56 PM, Bryan Banister wrote: > There is the iohist data that may have what you're looking for, -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org > [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron > Knister > Sent: Monday, August 29, 2016 12:54 PM > To: gpfsug-discuss at spectrumscale.org > Subject: Re: [gpfsug-discuss] iowait? > > Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. 
That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. > > On 8/29/16 1:50 PM, Alex Chekholko wrote: >> Any reason you can't just use iostat or collectl or any of a number >> of other standards tools to look at disk utilization? >> >> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>> Hi Everyone, >>> >>> Would it be easy to have GPFS report iowait values in linux? This >>> would be a huge help for us in determining whether a node's low >>> utilization is due to some issue with the code running on it or if >>> it's blocked on I/O, especially in a historical context. >>> >>> I naively tried on a test system changing schedule() in >>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>> >>> again: >>> /* call the scheduler */ >>> if ( waitFlags & INTERRUPTIBLE ) >>> schedule(); >>> else >>> io_schedule(); >>> >>> Seems to actually do what I'm after but generally bad things happen >>> when I start pretending I'm a kernel developer. >>> >>> Any thoughts? If I open an RFE would this be something that's >>> relatively easy to implement (not asking for a commitment *to* >>> implement it, just that I'm not asking for something seemingly >>> simple that's actually fairly hard to implement)? >>> >>> -Aaron >>> >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight > Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. 
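A sketch of the sampling approach being discussed here, for what it's worth. The buffer size and interval are just the ballpark figures from this thread and would need testing on your own nodes (a larger ioHistorySize may need the usual mmchconfig -i or a daemon recycle to take effect, check the docs), and Scott's warning further down about not polling iohist too aggressively applies.

    # Enlarge the in-memory I/O history (default 512 entries); ~10k is the figure
    # mentioned above for covering a 10-second window on a busy node
    /usr/lpp/mmfs/bin/mmchconfig ioHistorySize=10240

    # Sample it on a fixed interval rather than continuously, e.g.:
    while sleep 10; do
        { date '+%F %T'; /usr/lpp/mmfs/bin/mmdiag --iohist; } >> /var/log/gpfs-iohist.log
    done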
The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. From aaron.s.knister at nasa.gov Mon Aug 29 19:09:36 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 29 Aug 2016 14:09:36 -0400 Subject: [gpfsug-discuss] iowait? In-Reply-To: <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> <7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: <5f563924-61bb-9623-aa84-02d97bd8f379@nasa.gov> Nice! Thanks Bryan. I wonder what the implications are of setting it to something high enough that we could capture data every 10s. I figure if 512 events only takes me to 1 second I would need to log in the realm of 10k to capture every 10 seconds and account for spikes in I/O. -Aaron On 8/29/16 2:06 PM, Bryan Banister wrote: > Try this: > > mmchconfig ioHistorySize=1024 # Or however big you want! > > Cheers, > -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister > Sent: Monday, August 29, 2016 1:05 PM > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] iowait? > > That's an interesting idea. I took a look at mmdig --iohist on a busy node it doesn't seem to capture more than literally 1 second of history. > Is there a better way to grab the data or have gpfs capture more of it? > > Just to give some more context, as part of our monthly reporting requirements we calculate job efficiency by comparing the number of cpu cores requested by a given job with the cpu % utilization during that job's time window. Currently a job that's doing a sleep 9000 would show up the same as a job blocked on I/O. Having GPFS wait time included in iowait would allow us to easily make this distinction. > > -Aaron > > On 8/29/16 1:56 PM, Bryan Banister wrote: >> There is the iohist data that may have what you're looking for, -Bryan >> >> -----Original Message----- >> From: gpfsug-discuss-bounces at spectrumscale.org >> [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron >> Knister >> Sent: Monday, August 29, 2016 12:54 PM >> To: gpfsug-discuss at spectrumscale.org >> Subject: Re: [gpfsug-discuss] iowait? >> >> Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. >> >> On 8/29/16 1:50 PM, Alex Chekholko wrote: >>> Any reason you can't just use iostat or collectl or any of a number >>> of other standards tools to look at disk utilization? >>> >>> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>>> Hi Everyone, >>>> >>>> Would it be easy to have GPFS report iowait values in linux? 
This >>>> would be a huge help for us in determining whether a node's low >>>> utilization is due to some issue with the code running on it or if >>>> it's blocked on I/O, especially in a historical context. >>>> >>>> I naively tried on a test system changing schedule() in >>>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>>> >>>> again: >>>> /* call the scheduler */ >>>> if ( waitFlags & INTERRUPTIBLE ) >>>> schedule(); >>>> else >>>> io_schedule(); >>>> >>>> Seems to actually do what I'm after but generally bad things happen >>>> when I start pretending I'm a kernel developer. >>>> >>>> Any thoughts? If I open an RFE would this be something that's >>>> relatively easy to implement (not asking for a commitment *to* >>>> implement it, just that I'm not asking for something seemingly >>>> simple that's actually fairly hard to implement)? >>>> >>>> -Aaron >>>> >>> >> >> -- >> Aaron Knister >> NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight >> Center >> (301) 286-2776 >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> ________________________________ >> >> Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. 
> _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From bbanister at jumptrading.com Mon Aug 29 19:11:05 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Mon, 29 Aug 2016 18:11:05 +0000 Subject: [gpfsug-discuss] iowait? In-Reply-To: <5f563924-61bb-9623-aa84-02d97bd8f379@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> <7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> <5f563924-61bb-9623-aa84-02d97bd8f379@nasa.gov> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB063147A9@CHI-EXCHANGEW1.w2k.jumptrading.com> That's a good question, but I don't expect it should cause you much of a problem. Of course testing and trying to measure any impact would be wise, -Bryan -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister Sent: Monday, August 29, 2016 1:10 PM To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] iowait? Nice! Thanks Bryan. I wonder what the implications are of setting it to something high enough that we could capture data every 10s. I figure if 512 events only takes me to 1 second I would need to log in the realm of 10k to capture every 10 seconds and account for spikes in I/O. -Aaron On 8/29/16 2:06 PM, Bryan Banister wrote: > Try this: > > mmchconfig ioHistorySize=1024 # Or however big you want! > > Cheers, > -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org > [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron > Knister > Sent: Monday, August 29, 2016 1:05 PM > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] iowait? > > That's an interesting idea. I took a look at mmdig --iohist on a busy node it doesn't seem to capture more than literally 1 second of history. > Is there a better way to grab the data or have gpfs capture more of it? > > Just to give some more context, as part of our monthly reporting requirements we calculate job efficiency by comparing the number of cpu cores requested by a given job with the cpu % utilization during that job's time window. Currently a job that's doing a sleep 9000 would show up the same as a job blocked on I/O. Having GPFS wait time included in iowait would allow us to easily make this distinction. > > -Aaron > > On 8/29/16 1:56 PM, Bryan Banister wrote: >> There is the iohist data that may have what you're looking for, >> -Bryan >> >> -----Original Message----- >> From: gpfsug-discuss-bounces at spectrumscale.org >> [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron >> Knister >> Sent: Monday, August 29, 2016 12:54 PM >> To: gpfsug-discuss at spectrumscale.org >> Subject: Re: [gpfsug-discuss] iowait? >> >> Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. 
Having GPFS report iowait on client nodes would give us this insight. >> >> On 8/29/16 1:50 PM, Alex Chekholko wrote: >>> Any reason you can't just use iostat or collectl or any of a number >>> of other standards tools to look at disk utilization? >>> >>> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>>> Hi Everyone, >>>> >>>> Would it be easy to have GPFS report iowait values in linux? This >>>> would be a huge help for us in determining whether a node's low >>>> utilization is due to some issue with the code running on it or if >>>> it's blocked on I/O, especially in a historical context. >>>> >>>> I naively tried on a test system changing schedule() in >>>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>>> >>>> again: >>>> /* call the scheduler */ >>>> if ( waitFlags & INTERRUPTIBLE ) >>>> schedule(); >>>> else >>>> io_schedule(); >>>> >>>> Seems to actually do what I'm after but generally bad things happen >>>> when I start pretending I'm a kernel developer. >>>> >>>> Any thoughts? If I open an RFE would this be something that's >>>> relatively easy to implement (not asking for a commitment *to* >>>> implement it, just that I'm not asking for something seemingly >>>> simple that's actually fairly hard to implement)? >>>> >>>> -Aaron >>>> >>> >> >> -- >> Aaron Knister >> NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight >> Center >> (301) 286-2776 >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> ________________________________ >> >> Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight > Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. 
This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. From sfadden at us.ibm.com Mon Aug 29 20:33:14 2016 From: sfadden at us.ibm.com (Scott Fadden) Date: Mon, 29 Aug 2016 12:33:14 -0700 Subject: [gpfsug-discuss] iowait? In-Reply-To: <5f563924-61bb-9623-aa84-02d97bd8f379@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov><7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu><5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov><21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com><7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov><21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> <5f563924-61bb-9623-aa84-02d97bd8f379@nasa.gov> Message-ID: There is a known performance issue that can possibly cause longer than expected network time-outs if you are running iohist too often. So be careful it is best to collect it as a sample, instead of all of the time. Scott Fadden Spectrum Scale - Technical Marketing Phone: (503) 880-5833 sfadden at us.ibm.com http://www.ibm.com/systems/storage/spectrum/scale From: Aaron Knister To: Date: 08/29/2016 11:09 AM Subject: Re: [gpfsug-discuss] iowait? Sent by: gpfsug-discuss-bounces at spectrumscale.org Nice! Thanks Bryan. I wonder what the implications are of setting it to something high enough that we could capture data every 10s. I figure if 512 events only takes me to 1 second I would need to log in the realm of 10k to capture every 10 seconds and account for spikes in I/O. -Aaron On 8/29/16 2:06 PM, Bryan Banister wrote: > Try this: > > mmchconfig ioHistorySize=1024 # Or however big you want! > > Cheers, > -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [ mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister > Sent: Monday, August 29, 2016 1:05 PM > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] iowait? > > That's an interesting idea. I took a look at mmdig --iohist on a busy node it doesn't seem to capture more than literally 1 second of history. > Is there a better way to grab the data or have gpfs capture more of it? 
> > Just to give some more context, as part of our monthly reporting requirements we calculate job efficiency by comparing the number of cpu cores requested by a given job with the cpu % utilization during that job's time window. Currently a job that's doing a sleep 9000 would show up the same as a job blocked on I/O. Having GPFS wait time included in iowait would allow us to easily make this distinction. > > -Aaron > > On 8/29/16 1:56 PM, Bryan Banister wrote: >> There is the iohist data that may have what you're looking for, -Bryan >> >> -----Original Message----- >> From: gpfsug-discuss-bounces at spectrumscale.org >> [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron >> Knister >> Sent: Monday, August 29, 2016 12:54 PM >> To: gpfsug-discuss at spectrumscale.org >> Subject: Re: [gpfsug-discuss] iowait? >> >> Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. >> >> On 8/29/16 1:50 PM, Alex Chekholko wrote: >>> Any reason you can't just use iostat or collectl or any of a number >>> of other standards tools to look at disk utilization? >>> >>> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>>> Hi Everyone, >>>> >>>> Would it be easy to have GPFS report iowait values in linux? This >>>> would be a huge help for us in determining whether a node's low >>>> utilization is due to some issue with the code running on it or if >>>> it's blocked on I/O, especially in a historical context. >>>> >>>> I naively tried on a test system changing schedule() in >>>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>>> >>>> again: >>>> /* call the scheduler */ >>>> if ( waitFlags & INTERRUPTIBLE ) >>>> schedule(); >>>> else >>>> io_schedule(); >>>> >>>> Seems to actually do what I'm after but generally bad things happen >>>> when I start pretending I'm a kernel developer. >>>> >>>> Any thoughts? If I open an RFE would this be something that's >>>> relatively easy to implement (not asking for a commitment *to* >>>> implement it, just that I'm not asking for something seemingly >>>> simple that's actually fairly hard to implement)? >>>> >>>> -Aaron >>>> >>> >> >> -- >> Aaron Knister >> NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight >> Center >> (301) 286-2776 >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> ________________________________ >> >> Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. 
This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From S.J.Thompson at bham.ac.uk Mon Aug 29 20:37:13 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Mon, 29 Aug 2016 19:37:13 +0000 Subject: [gpfsug-discuss] CES mmsmb options In-Reply-To: References: Message-ID: Hi Richard, You can of course change any of the other options with the "net conf" (/usr/lpp/mmfs/bin/net conf) command. As its just stored in the Samba registry. Of course whether or not you end up with a supported configuration is a different matter... When we first rolled out CES/SMB, there were a number of issues with setting it up in the way we needed for our environment (AD for auth, LDAP for identity) which at the time wasn't available through the config tools. I believe this has now changed though I haven't gone back and "reset" our configs. Simon ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sobey, Richard A [r.sobey at imperial.ac.uk] Sent: 22 August 2016 14:28 To: 'gpfsug-discuss at spectrumscale.org' Subject: [gpfsug-discuss] CES mmsmb options Related to my previous question in so far as it?s to do with CES, what?s this all about: [root at ces]# mmsmb config change --key-info supported Supported smb options with allowed values: gpfs:dfreequota = yes, no restrict anonymous = 0, 2 server string = any mmsmb config list shows many more options. Are they static? for example log size / location / dmapi support? 
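For readers following along, the two levels Simon and Richard are discussing look roughly like this. The setparm key and value in the last line are purely illustrative, and as Simon notes above, editing the Samba registry directly may leave you outside the supported configuration.

    # Supported path: what the CES tooling exposes and will let you change
    /usr/lpp/mmfs/bin/mmsmb config list
    /usr/lpp/mmfs/bin/mmsmb config change --key-info supported

    # Possible-but-at-your-own-risk path: the clustered Samba registry via the bundled net command
    /usr/lpp/mmfs/bin/net conf list
    /usr/lpp/mmfs/bin/net conf setparm global 'server string' 'CES SMB cluster'   # example key/value only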
I?m surely missing something obvious. It?s SS 4.2.0 btw. Thanks Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From usa-principal at gpfsug.org Mon Aug 29 21:13:51 2016 From: usa-principal at gpfsug.org (Spectrum Scale Users Group - USA Principal Kristy Kallback-Rose) Date: Mon, 29 Aug 2016 16:13:51 -0400 Subject: [gpfsug-discuss] SC16 Hold the Date - Spectrum Scale (GPFS) Users Group Event Message-ID: <648FFF79-343D-447E-9CC5-4E0199C29572@gpfsug.org> Hello, I know many of you may be planning your SC16 schedule already. We wanted to give you a heads up that a Spectrum Scale (GPFS) Users Group event is being planned. The event will be much like last year?s event with a combination of technical updates and user experiences and thus far is loosely planned for: Sunday (11/13) ~12p - ~5 PM with a social hour after the meeting. We hope to see you there. More details as planning progresses. Best, Kristy & Bob From S.J.Thompson at bham.ac.uk Mon Aug 29 21:27:28 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Mon, 29 Aug 2016 20:27:28 +0000 Subject: [gpfsug-discuss] SC16 Hold the Date - Spectrum Scale (GPFS) Users Group Event In-Reply-To: <648FFF79-343D-447E-9CC5-4E0199C29572@gpfsug.org> References: <648FFF79-343D-447E-9CC5-4E0199C29572@gpfsug.org> Message-ID: You may also be interested in a panel session on the Friday of SC16: http://sc16.supercomputing.org/presentation/?id=pan120&sess=sess185 This isn't a user group event, but part of the technical programme for SC16, though I'm sure you will recognise some of the names from the storage community. Moderator: Simon Thompson (me) Panel: Sven Oehme (IBM Research) James Coomer (DDN) Sage Weil (RedHat/CEPH) Colin Morey (Hartree/STFC) Pam Gilman (NCAR) Martin Gasthuber (DESY) Friday 8:30 - 10:00 Simon From volobuev at us.ibm.com Mon Aug 29 21:31:17 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Mon, 29 Aug 2016 13:31:17 -0700 Subject: [gpfsug-discuss] iowait? In-Reply-To: <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov><7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu><5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov><21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com><7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: I would advise caution on using "mmdiag --iohist" heavily. In more recent code streams (V4.1, V4.2) there's a problem with internal locking that could, under certain conditions could lead to the symptoms that look very similar to sporadic network blockage. Basically, if "mmdiag --iohist" gets blocked for long periods of time (e.g. due to local disk/NFS performance issues), this may end up blocking an mmfsd receiver thread, delaying RPC processing. The problem was discovered fairly recently, and the fix hasn't made it out to all service streams yet. More generally, IO history is a valuable tool for troubleshooting disk IO performance issues, but the tool doesn't have the right semantics for regular, systemic IO performance sampling and monitoring. The query operation is too expensive, the coverage is subject to load, and the output is somewhat unstructured. With some effort, one can still build some form of a roll-your-own monitoring implement, but this is certainly not an optimal way of approaching the problem. 
The data should be available in a structured form, through a channel that supports light-weight, flexible querying that doesn't impact mainline IO processing. In Spectrum Scale, this type of data is fed from mmfsd to Zimon, via an mmpmon interface, and end users can then query Zimon for raw or partially processed data. Where it comes to high-volume stats, retaining raw data at its full resolution is only practical for relatively short periods of time (seconds, or perhaps a small number of minutes), and some form of aggregation is necessary for covering longer periods of time (hours to days). In the current versions of the product, there's a very similar type of data available this way: RPC stats. There are plans to make IO history data available in a similar fashion. The entire approach may need to be re-calibrated, however. Making RPC stats available doesn't appear to have generated a surge of user interest. This is probably because the data is too complex for casual processing, and while without doubt a lot of very valuable insight can be gained by analyzing RPC stats, the actual effort required to do so is too much for most users. That is, we need to provide some tools for raw data analytics. Largely the same argument applies to IO stats. In fact, on an NSD client IO stats are actually a subset of RPC stats. With some effort, one can perform a comprehensive analysis of NSD client IO stats by analyzing NSD client-to-server RPC traffic. One can certainly argue that the effort required is a bit much though. Getting back to the original question: would the proposed cxiWaitEventWait () change work? It'll likely result in nr_iowait being incremented every time a thread in GPFS code performs an uninterruptible wait. This could be an act of performing an actual IO request, or something else, e.g. waiting for a lock. Those may be the desirable semantics in some scenarios, but I wouldn't agree that it's the right behavior for any uninterruptible wait. io_schedule() is intended for use for block device IO waits, so using it this way is not in line with the code intent, which is never a good idea. Besides, relative to schedule(), io_schedule() has some overhead that could have performance implications of an uncertain nature. yuri From: Bryan Banister To: gpfsug main discussion list , Date: 08/29/2016 11:06 AM Subject: Re: [gpfsug-discuss] iowait? Sent by: gpfsug-discuss-bounces at spectrumscale.org Try this: mmchconfig ioHistorySize=1024 # Or however big you want! Cheers, -Bryan -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [ mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister Sent: Monday, August 29, 2016 1:05 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] iowait? That's an interesting idea. I took a look at mmdig --iohist on a busy node it doesn't seem to capture more than literally 1 second of history. Is there a better way to grab the data or have gpfs capture more of it? Just to give some more context, as part of our monthly reporting requirements we calculate job efficiency by comparing the number of cpu cores requested by a given job with the cpu % utilization during that job's time window. Currently a job that's doing a sleep 9000 would show up the same as a job blocked on I/O. Having GPFS wait time included in iowait would allow us to easily make this distinction. 
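As background for the mmpmon logging Aaron describes and the mmfsd-to-Zimon path mentioned above, here is a minimal sketch of pulling those per-node counters by hand. Request names other than io_s/fs_io_s, and the exact flag behaviour, should be checked against the mmpmon documentation for your release.

    # One-shot, script-parsable (-p) snapshot of node-wide GPFS I/O counters
    echo io_s | /usr/lpp/mmfs/bin/mmpmon -p -s

    # Repeated sampling from a small request file: 6 cycles, 10 seconds apart
    echo fs_io_s > /tmp/mmpmon.cmd
    /usr/lpp/mmfs/bin/mmpmon -p -s -i /tmp/mmpmon.cmd -r 6 -d 10000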
-Aaron On 8/29/16 1:56 PM, Bryan Banister wrote: > There is the iohist data that may have what you're looking for, -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org > [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron > Knister > Sent: Monday, August 29, 2016 12:54 PM > To: gpfsug-discuss at spectrumscale.org > Subject: Re: [gpfsug-discuss] iowait? > > Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. > > On 8/29/16 1:50 PM, Alex Chekholko wrote: >> Any reason you can't just use iostat or collectl or any of a number >> of other standards tools to look at disk utilization? >> >> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>> Hi Everyone, >>> >>> Would it be easy to have GPFS report iowait values in linux? This >>> would be a huge help for us in determining whether a node's low >>> utilization is due to some issue with the code running on it or if >>> it's blocked on I/O, especially in a historical context. >>> >>> I naively tried on a test system changing schedule() in >>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>> >>> again: >>> /* call the scheduler */ >>> if ( waitFlags & INTERRUPTIBLE ) >>> schedule(); >>> else >>> io_schedule(); >>> >>> Seems to actually do what I'm after but generally bad things happen >>> when I start pretending I'm a kernel developer. >>> >>> Any thoughts? If I open an RFE would this be something that's >>> relatively easy to implement (not asking for a commitment *to* >>> implement it, just that I'm not asking for something seemingly >>> simple that's actually fairly hard to implement)? >>> >>> -Aaron >>> >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight > Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. 
> _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From aaron.s.knister at nasa.gov Mon Aug 29 23:58:34 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 29 Aug 2016 18:58:34 -0400 Subject: [gpfsug-discuss] iowait? In-Reply-To: References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> <7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: <8ec95af4-4d30-a904-4ba2-cf253460754a@nasa.gov> Thanks Yuri! I thought calling io_schedule was the right thing to do because the nfs client in the kernel did this directly until fairly recently. Now it calls wait_on_bit_io which I believe ultimately calls io_schedule. Do you see a more targeted approach for having GPFS register IO wait as something that's feasible? (e.g. not registering iowait for locks, as you suggested, but doing so for file/directory operations such as read/write/readdir?) -Aaron On 8/29/16 4:31 PM, Yuri L Volobuev wrote: > I would advise caution on using "mmdiag --iohist" heavily. In more > recent code streams (V4.1, V4.2) there's a problem with internal locking > that could, under certain conditions could lead to the symptoms that > look very similar to sporadic network blockage. Basically, if "mmdiag > --iohist" gets blocked for long periods of time (e.g. due to local > disk/NFS performance issues), this may end up blocking an mmfsd receiver > thread, delaying RPC processing. The problem was discovered fairly > recently, and the fix hasn't made it out to all service streams yet. 
> > More generally, IO history is a valuable tool for troubleshooting disk > IO performance issues, but the tool doesn't have the right semantics for > regular, systemic IO performance sampling and monitoring. The query > operation is too expensive, the coverage is subject to load, and the > output is somewhat unstructured. With some effort, one can still build > some form of a roll-your-own monitoring implement, but this is certainly > not an optimal way of approaching the problem. The data should be > available in a structured form, through a channel that supports > light-weight, flexible querying that doesn't impact mainline IO > processing. In Spectrum Scale, this type of data is fed from mmfsd to > Zimon, via an mmpmon interface, and end users can then query Zimon for > raw or partially processed data. Where it comes to high-volume stats, > retaining raw data at its full resolution is only practical for > relatively short periods of time (seconds, or perhaps a small number of > minutes), and some form of aggregation is necessary for covering longer > periods of time (hours to days). In the current versions of the product, > there's a very similar type of data available this way: RPC stats. There > are plans to make IO history data available in a similar fashion. The > entire approach may need to be re-calibrated, however. Making RPC stats > available doesn't appear to have generated a surge of user interest. > This is probably because the data is too complex for casual processing, > and while without doubt a lot of very valuable insight can be gained by > analyzing RPC stats, the actual effort required to do so is too much for > most users. That is, we need to provide some tools for raw data > analytics. Largely the same argument applies to IO stats. In fact, on an > NSD client IO stats are actually a subset of RPC stats. With some > effort, one can perform a comprehensive analysis of NSD client IO stats > by analyzing NSD client-to-server RPC traffic. One can certainly argue > that the effort required is a bit much though. > > Getting back to the original question: would the proposed > cxiWaitEventWait() change work? It'll likely result in nr_iowait being > incremented every time a thread in GPFS code performs an uninterruptible > wait. This could be an act of performing an actual IO request, or > something else, e.g. waiting for a lock. Those may be the desirable > semantics in some scenarios, but I wouldn't agree that it's the right > behavior for any uninterruptible wait. io_schedule() is intended for use > for block device IO waits, so using it this way is not in line with the > code intent, which is never a good idea. Besides, relative to > schedule(), io_schedule() has some overhead that could have performance > implications of an uncertain nature. > > yuri > > Inactive hide details for Bryan Banister ---08/29/2016 11:06:59 AM---Try > this: mmchconfig ioHistorySize=1024 # Or however big yBryan Banister > ---08/29/2016 11:06:59 AM---Try this: mmchconfig ioHistorySize=1024 # Or > however big you want! > > From: Bryan Banister > To: gpfsug main discussion list , > Date: 08/29/2016 11:06 AM > Subject: Re: [gpfsug-discuss] iowait? > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------------------------------------------------ > > > > Try this: > > mmchconfig ioHistorySize=1024 # Or however big you want! 
> > Cheers, > -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org > [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister > Sent: Monday, August 29, 2016 1:05 PM > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] iowait? > > That's an interesting idea. I took a look at mmdig --iohist on a busy > node it doesn't seem to capture more than literally 1 second of history. > Is there a better way to grab the data or have gpfs capture more of it? > > Just to give some more context, as part of our monthly reporting > requirements we calculate job efficiency by comparing the number of cpu > cores requested by a given job with the cpu % utilization during that > job's time window. Currently a job that's doing a sleep 9000 would show > up the same as a job blocked on I/O. Having GPFS wait time included in > iowait would allow us to easily make this distinction. > > -Aaron > > On 8/29/16 1:56 PM, Bryan Banister wrote: >> There is the iohist data that may have what you're looking for, -Bryan >> >> -----Original Message----- >> From: gpfsug-discuss-bounces at spectrumscale.org >> [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron >> Knister >> Sent: Monday, August 29, 2016 12:54 PM >> To: gpfsug-discuss at spectrumscale.org >> Subject: Re: [gpfsug-discuss] iowait? >> >> Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. >> >> On 8/29/16 1:50 PM, Alex Chekholko wrote: >>> Any reason you can't just use iostat or collectl or any of a number >>> of other standards tools to look at disk utilization? >>> >>> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>>> Hi Everyone, >>>> >>>> Would it be easy to have GPFS report iowait values in linux? This >>>> would be a huge help for us in determining whether a node's low >>>> utilization is due to some issue with the code running on it or if >>>> it's blocked on I/O, especially in a historical context. >>>> >>>> I naively tried on a test system changing schedule() in >>>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>>> >>>> again: >>>> /* call the scheduler */ >>>> if ( waitFlags & INTERRUPTIBLE ) >>>> schedule(); >>>> else >>>> io_schedule(); >>>> >>>> Seems to actually do what I'm after but generally bad things happen >>>> when I start pretending I'm a kernel developer. >>>> >>>> Any thoughts? If I open an RFE would this be something that's >>>> relatively easy to implement (not asking for a commitment *to* >>>> implement it, just that I'm not asking for something seemingly >>>> simple that's actually fairly hard to implement)? >>>> >>>> -Aaron >>>> >>> >> >> -- >> Aaron Knister >> NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight >> Center >> (301) 286-2776 >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> ________________________________ >> >> Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. 
If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) > only and may contain proprietary, confidential or privileged > information. If you are not the intended recipient, you are hereby > notified that any review, dissemination or copying of this email is > strictly prohibited, and to please notify the sender immediately and > destroy this email and any attachments. Email transmission cannot be > guaranteed to be secure or error-free. The Company, therefore, does not > make any guarantees as to the completeness or accuracy of this email or > any attachments. This email is for informational purposes only and does > not constitute a recommendation, offer, request or solicitation of any > kind to buy, sell, subscribe, redeem or perform any type of transaction > of a financial product. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From volobuev at us.ibm.com Tue Aug 30 06:09:21 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Mon, 29 Aug 2016 22:09:21 -0700 Subject: [gpfsug-discuss] iowait? In-Reply-To: <8ec95af4-4d30-a904-4ba2-cf253460754a@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov><7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu><5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov><21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com><7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov><21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> <8ec95af4-4d30-a904-4ba2-cf253460754a@nasa.gov> Message-ID: I don't see a simple fix that can be implemented by tweaking a general-purpose low-level synchronization primitive. It should be possible to integrate GPFS better into the Linux IO accounting infrastructure, but that would require some investigation a likely a non-trivial amount of work to do right. yuri From: Aaron Knister To: , Date: 08/29/2016 03:59 PM Subject: Re: [gpfsug-discuss] iowait? Sent by: gpfsug-discuss-bounces at spectrumscale.org Thanks Yuri! 
I thought calling io_schedule was the right thing to do because the nfs client in the kernel did this directly until fairly recently. Now it calls wait_on_bit_io which I believe ultimately calls io_schedule. Do you see a more targeted approach for having GPFS register IO wait as something that's feasible? (e.g. not registering iowait for locks, as you suggested, but doing so for file/directory operations such as read/write/readdir?) -Aaron On 8/29/16 4:31 PM, Yuri L Volobuev wrote: > I would advise caution on using "mmdiag --iohist" heavily. In more > recent code streams (V4.1, V4.2) there's a problem with internal locking > that could, under certain conditions could lead to the symptoms that > look very similar to sporadic network blockage. Basically, if "mmdiag > --iohist" gets blocked for long periods of time (e.g. due to local > disk/NFS performance issues), this may end up blocking an mmfsd receiver > thread, delaying RPC processing. The problem was discovered fairly > recently, and the fix hasn't made it out to all service streams yet. > > More generally, IO history is a valuable tool for troubleshooting disk > IO performance issues, but the tool doesn't have the right semantics for > regular, systemic IO performance sampling and monitoring. The query > operation is too expensive, the coverage is subject to load, and the > output is somewhat unstructured. With some effort, one can still build > some form of a roll-your-own monitoring implement, but this is certainly > not an optimal way of approaching the problem. The data should be > available in a structured form, through a channel that supports > light-weight, flexible querying that doesn't impact mainline IO > processing. In Spectrum Scale, this type of data is fed from mmfsd to > Zimon, via an mmpmon interface, and end users can then query Zimon for > raw or partially processed data. Where it comes to high-volume stats, > retaining raw data at its full resolution is only practical for > relatively short periods of time (seconds, or perhaps a small number of > minutes), and some form of aggregation is necessary for covering longer > periods of time (hours to days). In the current versions of the product, > there's a very similar type of data available this way: RPC stats. There > are plans to make IO history data available in a similar fashion. The > entire approach may need to be re-calibrated, however. Making RPC stats > available doesn't appear to have generated a surge of user interest. > This is probably because the data is too complex for casual processing, > and while without doubt a lot of very valuable insight can be gained by > analyzing RPC stats, the actual effort required to do so is too much for > most users. That is, we need to provide some tools for raw data > analytics. Largely the same argument applies to IO stats. In fact, on an > NSD client IO stats are actually a subset of RPC stats. With some > effort, one can perform a comprehensive analysis of NSD client IO stats > by analyzing NSD client-to-server RPC traffic. One can certainly argue > that the effort required is a bit much though. > > Getting back to the original question: would the proposed > cxiWaitEventWait() change work? It'll likely result in nr_iowait being > incremented every time a thread in GPFS code performs an uninterruptible > wait. This could be an act of performing an actual IO request, or > something else, e.g. waiting for a lock. 
Those may be the desirable > semantics in some scenarios, but I wouldn't agree that it's the right > behavior for any uninterruptible wait. io_schedule() is intended for use > for block device IO waits, so using it this way is not in line with the > code intent, which is never a good idea. Besides, relative to > schedule(), io_schedule() has some overhead that could have performance > implications of an uncertain nature. > > yuri > > Inactive hide details for Bryan Banister ---08/29/2016 11:06:59 AM---Try > this: mmchconfig ioHistorySize=1024 # Or however big yBryan Banister > ---08/29/2016 11:06:59 AM---Try this: mmchconfig ioHistorySize=1024 # Or > however big you want! > > From: Bryan Banister > To: gpfsug main discussion list , > Date: 08/29/2016 11:06 AM > Subject: Re: [gpfsug-discuss] iowait? > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------------------------------------------------ > > > > Try this: > > mmchconfig ioHistorySize=1024 # Or however big you want! > > Cheers, > -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org > [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister > Sent: Monday, August 29, 2016 1:05 PM > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] iowait? > > That's an interesting idea. I took a look at mmdig --iohist on a busy > node it doesn't seem to capture more than literally 1 second of history. > Is there a better way to grab the data or have gpfs capture more of it? > > Just to give some more context, as part of our monthly reporting > requirements we calculate job efficiency by comparing the number of cpu > cores requested by a given job with the cpu % utilization during that > job's time window. Currently a job that's doing a sleep 9000 would show > up the same as a job blocked on I/O. Having GPFS wait time included in > iowait would allow us to easily make this distinction. > > -Aaron > > On 8/29/16 1:56 PM, Bryan Banister wrote: >> There is the iohist data that may have what you're looking for, -Bryan >> >> -----Original Message----- >> From: gpfsug-discuss-bounces at spectrumscale.org >> [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron >> Knister >> Sent: Monday, August 29, 2016 12:54 PM >> To: gpfsug-discuss at spectrumscale.org >> Subject: Re: [gpfsug-discuss] iowait? >> >> Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. >> >> On 8/29/16 1:50 PM, Alex Chekholko wrote: >>> Any reason you can't just use iostat or collectl or any of a number >>> of other standards tools to look at disk utilization? >>> >>> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>>> Hi Everyone, >>>> >>>> Would it be easy to have GPFS report iowait values in linux? This >>>> would be a huge help for us in determining whether a node's low >>>> utilization is due to some issue with the code running on it or if >>>> it's blocked on I/O, especially in a historical context. 
>>>> >>>> I naively tried on a test system changing schedule() in >>>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>>> >>>> again: >>>> /* call the scheduler */ >>>> if ( waitFlags & INTERRUPTIBLE ) >>>> schedule(); >>>> else >>>> io_schedule(); >>>> >>>> Seems to actually do what I'm after but generally bad things happen >>>> when I start pretending I'm a kernel developer. >>>> >>>> Any thoughts? If I open an RFE would this be something that's >>>> relatively easy to implement (not asking for a commitment *to* >>>> implement it, just that I'm not asking for something seemingly >>>> simple that's actually fairly hard to implement)? >>>> >>>> -Aaron >>>> >>> >> >> -- >> Aaron Knister >> NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight >> Center >> (301) 286-2776 >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> ________________________________ >> >> Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) > only and may contain proprietary, confidential or privileged > information. If you are not the intended recipient, you are hereby > notified that any review, dissemination or copying of this email is > strictly prohibited, and to please notify the sender immediately and > destroy this email and any attachments. Email transmission cannot be > guaranteed to be secure or error-free. The Company, therefore, does not > make any guarantees as to the completeness or accuracy of this email or > any attachments. This email is for informational purposes only and does > not constitute a recommendation, offer, request or solicitation of any > kind to buy, sell, subscribe, redeem or perform any type of transaction > of a financial product. 
> _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From r.sobey at imperial.ac.uk Tue Aug 30 09:34:33 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Tue, 30 Aug 2016 08:34:33 +0000 Subject: [gpfsug-discuss] CES network aliases Message-ID: Hi all, It's Tuesday morning and that means question time :) So from http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_cesnetworkconfig.htm, I've extracted the following: How to use an alias To use an alias address for CES, you need to provide a static IP address that is not already defined as an alias in the /etc/sysconfig/network-scripts directory. Before you enable the node as a CES node, configure the network adapters for each subnet that are represented in the CES address pool: 1. Define a static IP address for the device: 2. /etc/sysconfig/network-scripts/ifcfg-eth0 3. DEVICE=eth1 4. BOOTPROTO=none 5. IPADDR=10.1.1.10 6. NETMASK=255.255.255.0 7. ONBOOT=yes 8. GATEWAY=10.1.1.1 TYPE=Ethernet 1. Ensure that there are no aliases that are defined in the network-scripts directory for this interface: 10.# ls -l /etc/sysconfig/network-scripts/ifcfg-eth1:* ls: /etc/sysconfig/network-scripts/ifcfg-eth1:*: No such file or directory After the node is enabled as a CES node, no further action is required. CES addresses are added as aliases to the already configured adapters. Now, does this mean for every floating (CES) IP address I need a separate ifcfg-ethX on each node? At the moment I simply have an ifcfg-X file representing each physical network adapter, and then the CES IPs defined. I can see IP addresses being added during failover to the primary interface, but now I've read I potentially need to create a separate file. What's the right way to move forward? If I need separate files, I presume the listed IP is a CES IP (not system) and does it also matter what X is in ifcfg-ethX? Many thanks Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From janfrode at tanso.net Tue Aug 30 10:54:31 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Tue, 30 Aug 2016 09:54:31 +0000 Subject: [gpfsug-discuss] CES network aliases In-Reply-To: References: Message-ID: You only need a static address for your ifcfg-ethX on all nodes, and can then have CES manage multiple floating addresses in that subnet. Also, it doesn't matter much what your interfaces are named (ethX, vlanX, bondX, ethX.5), GPFS will just find the interface that covers the floating address in its subnet, and add the alias there. -jf -------------- next part -------------- An HTML attachment was scrubbed... 
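In practice that means one static ifcfg file per physical interface on each protocol node, and the floating addresses handed to CES to manage, roughly as follows (device name and addresses below are examples only):

# /etc/sysconfig/network-scripts/ifcfg-eth1  -- static base address, no aliases defined by hand
DEVICE=eth1
BOOTPROTO=none
IPADDR=10.1.1.10
NETMASK=255.255.255.0
ONBOOT=yes

# CES then adds and removes the floating aliases on that interface itself:
mmces address add --ces-ip 10.1.1.100,10.1.1.101
mmces address list
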
URL: From r.sobey at imperial.ac.uk Tue Aug 30 11:30:25 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Tue, 30 Aug 2016 10:30:25 +0000 Subject: [gpfsug-discuss] CES network aliases In-Reply-To: References: Message-ID: Ace thanks jf. From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Jan-Frode Myklebust Sent: 30 August 2016 10:55 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] CES network aliases You only need a static address for your ifcfg-ethX on all nodes, and can then have CES manage multiple floating addresses in that subnet. Also, it doesn't matter much what your interfaces are named (ethX, vlanX, bondX, ethX.5), GPFS will just find the interface that covers the floating address in its subnet, and add the alias there. -jf -------------- next part -------------- An HTML attachment was scrubbed... URL: From mimarsh2 at vt.edu Tue Aug 30 15:58:41 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Tue, 30 Aug 2016 10:58:41 -0400 Subject: [gpfsug-discuss] Data Replication Message-ID: All, If I setup a filesystem to have data replication of 2 (2 copies of data), does the data get replicated at the NSD Server or at the client? i.e. Does the client send 2 copies over the network or does the NSD Server get a single copy and then replicate on storage NSDs? I couldn't find a place in the docs that talked about this specific point. Thank you, Brian Marshall -------------- next part -------------- An HTML attachment was scrubbed... URL: From bbanister at jumptrading.com Tue Aug 30 16:03:38 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Tue, 30 Aug 2016 15:03:38 +0000 Subject: [gpfsug-discuss] Data Replication In-Reply-To: References: Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB063161EE@CHI-EXCHANGEW1.w2k.jumptrading.com> The NSD Client handles the replication and will, as you stated, write one copy to one NSD (using the primary server for this NSD) and one to a different NSD in a different GPFS failure group (using quite likely, but not necessarily, a different NSD server that is the primary server for this alternate NSD). Cheers, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Brian Marshall Sent: Tuesday, August 30, 2016 9:59 AM To: gpfsug main discussion list Subject: [gpfsug-discuss] Data Replication All, If I setup a filesystem to have data replication of 2 (2 copies of data), does the data get replicated at the NSD Server or at the client? i.e. Does the client send 2 copies over the network or does the NSD Server get a single copy and then replicate on storage NSDs? I couldn't find a place in the docs that talked about this specific point. Thank you, Brian Marshall ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. 
This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Tue Aug 30 17:16:37 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Tue, 30 Aug 2016 12:16:37 -0400 Subject: [gpfsug-discuss] gpfs native raid Message-ID: Does anyone know if/when we might see gpfs native raid opened up for the masses on non-IBM hardware? It's hard to answer the question of "why can't GPFS do this? Lustre can" in regards to Lustre's integration with ZFS and support for RAID on commodity hardware. -Aaron -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From bbanister at jumptrading.com Tue Aug 30 17:26:38 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Tue, 30 Aug 2016 16:26:38 +0000 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB06316445@CHI-EXCHANGEW1.w2k.jumptrading.com> I believe that Doug is going to provide more details at the NDA session at Edge... see attached, -B -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister Sent: Tuesday, August 30, 2016 11:17 AM To: gpfsug main discussion list Subject: [gpfsug-discuss] gpfs native raid Does anyone know if/when we might see gpfs native raid opened up for the masses on non-IBM hardware? It's hard to answer the question of "why can't GPFS do this? Lustre can" in regards to Lustre's integration with ZFS and support for RAID on commodity hardware. -Aaron -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. -------------- next part -------------- An embedded message was scrubbed... From: Douglas O'flaherty Subject: [gpfsug-discuss] Edge Attendees Date: Mon, 29 Aug 2016 05:34:03 +0000 Size: 9615 URL: From cdmaestas at us.ibm.com Tue Aug 30 17:47:18 2016 From: cdmaestas at us.ibm.com (Christopher Maestas) Date: Tue, 30 Aug 2016 16:47:18 +0000 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: Message-ID: Interestingly enough, Spectrum Scale can run on zvols. 
Check out: http://files.gpfsug.org/presentations/2016/anl-june/LANL_GPFS_ZFS.pdf -cdm On Aug 30, 2016, 9:17:05 AM, aaron.s.knister at nasa.gov wrote: From: aaron.s.knister at nasa.gov To: gpfsug-discuss at spectrumscale.org Cc: Date: Aug 30, 2016 9:17:05 AM Subject: [gpfsug-discuss] gpfs native raid Does anyone know if/when we might see gpfs native raid opened up for the masses on non-IBM hardware? It's hard to answer the question of "why can't GPFS do this? Lustre can" in regards to Lustre's integration with ZFS and support for RAID on commodity hardware. -Aaron -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Tue Aug 30 18:16:03 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Tue, 30 Aug 2016 13:16:03 -0400 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: Message-ID: <96282850-6bfa-73ae-8502-9e8df3a56390@nasa.gov> Thanks Christopher. I've tried GPFS on zvols a couple times and the write throughput I get is terrible because of the required sync=always parameter. Perhaps a couple of SSD's could help get the number up, though. -Aaron On 8/30/16 12:47 PM, Christopher Maestas wrote: > Interestingly enough, Spectrum Scale can run on zvols. Check out: > > http://files.gpfsug.org/presentations/2016/anl-june/LANL_GPFS_ZFS.pdf > > -cdm > > ------------------------------------------------------------------------ > On Aug 30, 2016, 9:17:05 AM, aaron.s.knister at nasa.gov wrote: > > From: aaron.s.knister at nasa.gov > To: gpfsug-discuss at spectrumscale.org > Cc: > Date: Aug 30, 2016 9:17:05 AM > Subject: [gpfsug-discuss] gpfs native raid > > Does anyone know if/when we might see gpfs native raid opened up for the > masses on non-IBM hardware? It's hard to answer the question of "why > can't GPFS do this? Lustre can" in regards to Lustre's integration with > ZFS and support for RAID on commodity hardware. > -Aaron > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From laurence at qsplace.co.uk Tue Aug 30 19:50:51 2016 From: laurence at qsplace.co.uk (Laurence Horrocks-Barlow) Date: Tue, 30 Aug 2016 20:50:51 +0200 Subject: [gpfsug-discuss] Data Replication In-Reply-To: <21BC488F0AEA2245B2C3E83FC0B33DBB063161EE@CHI-EXCHANGEW1.w2k.jumptrading.com> References: <21BC488F0AEA2245B2C3E83FC0B33DBB063161EE@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: Its the client that does all the synchronous replication, this way the cluster is able to scale as the clients do the leg work (so to speak). 
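Which copy lands where is driven by the failure groups assigned to the NSDs and the replication settings on the file system, along these lines (names and values are illustrative only):

# Two NSDs placed in different failure groups via their stanzas:
%nsd: nsd=nsd_a servers=nsdserver01 usage=dataAndMetadata failureGroup=1
%nsd: nsd=nsd_b servers=nsdserver02 usage=dataAndMetadata failureGroup=2

# File system created with two copies of data and metadata:
mmcrfs gpfs01 -F nsd.stanza -m 2 -M 2 -r 2 -R 2

# Each client then writes one copy of every block to an NSD in each failure group.
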
The somewhat "exception" is if a GPFS NSD server (or client with direct NSD) access uses a server bases protocol such as SMB, in this case the SMB server will do the replication as the SMB client doesn't know about GPFS or its replication; essentially the SMB server is the GPFS client. -- Lauz On 30 August 2016 17:03:38 CEST, Bryan Banister wrote: >The NSD Client handles the replication and will, as you stated, write >one copy to one NSD (using the primary server for this NSD) and one to >a different NSD in a different GPFS failure group (using quite likely, >but not necessarily, a different NSD server that is the primary server >for this alternate NSD). >Cheers, >-Bryan > >From: gpfsug-discuss-bounces at spectrumscale.org >[mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Brian >Marshall >Sent: Tuesday, August 30, 2016 9:59 AM >To: gpfsug main discussion list >Subject: [gpfsug-discuss] Data Replication > >All, > >If I setup a filesystem to have data replication of 2 (2 copies of >data), does the data get replicated at the NSD Server or at the client? >i.e. Does the client send 2 copies over the network or does the NSD >Server get a single copy and then replicate on storage NSDs? > >I couldn't find a place in the docs that talked about this specific >point. > >Thank you, >Brian Marshall > >________________________________ > >Note: This email is for the confidential use of the named addressee(s) >only and may contain proprietary, confidential or privileged >information. If you are not the intended recipient, you are hereby >notified that any review, dissemination or copying of this email is >strictly prohibited, and to please notify the sender immediately and >destroy this email and any attachments. Email transmission cannot be >guaranteed to be secure or error-free. The Company, therefore, does not >make any guarantees as to the completeness or accuracy of this email or >any attachments. This email is for informational purposes only and does >not constitute a recommendation, offer, request or solicitation of any >kind to buy, sell, subscribe, redeem or perform any type of transaction >of a financial product. > > >------------------------------------------------------------------------ > >_______________________________________________ >gpfsug-discuss mailing list >gpfsug-discuss at spectrumscale.org >http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Sent from my Android device with K-9 Mail. Please excuse my brevity. -------------- next part -------------- An HTML attachment was scrubbed... URL: From mimarsh2 at vt.edu Tue Aug 30 19:52:54 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Tue, 30 Aug 2016 14:52:54 -0400 Subject: [gpfsug-discuss] Data Replication In-Reply-To: References: <21BC488F0AEA2245B2C3E83FC0B33DBB063161EE@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: Thanks. This confirms the numbers that I am seeing. Brian On Tue, Aug 30, 2016 at 2:50 PM, Laurence Horrocks-Barlow < laurence at qsplace.co.uk> wrote: > Its the client that does all the synchronous replication, this way the > cluster is able to scale as the clients do the leg work (so to speak). > > The somewhat "exception" is if a GPFS NSD server (or client with direct > NSD) access uses a server bases protocol such as SMB, in this case the SMB > server will do the replication as the SMB client doesn't know about GPFS or > its replication; essentially the SMB server is the GPFS client. 
> > -- Lauz > > On 30 August 2016 17:03:38 CEST, Bryan Banister > wrote: > >> The NSD Client handles the replication and will, as you stated, write one >> copy to one NSD (using the primary server for this NSD) and one to a >> different NSD in a different GPFS failure group (using quite likely, but >> not necessarily, a different NSD server that is the primary server for this >> alternate NSD). >> >> Cheers, >> >> -Bryan >> >> >> >> *From:* gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss- >> bounces at spectrumscale.org] *On Behalf Of *Brian Marshall >> *Sent:* Tuesday, August 30, 2016 9:59 AM >> *To:* gpfsug main discussion list >> *Subject:* [gpfsug-discuss] Data Replication >> >> >> >> All, >> >> >> >> If I setup a filesystem to have data replication of 2 (2 copies of data), >> does the data get replicated at the NSD Server or at the client? i.e. Does >> the client send 2 copies over the network or does the NSD Server get a >> single copy and then replicate on storage NSDs? >> >> >> >> I couldn't find a place in the docs that talked about this specific point. >> >> >> >> Thank you, >> >> Brian Marshall >> >> >> ------------------------------ >> >> Note: This email is for the confidential use of the named addressee(s) >> only and may contain proprietary, confidential or privileged information. >> If you are not the intended recipient, you are hereby notified that any >> review, dissemination or copying of this email is strictly prohibited, and >> to please notify the sender immediately and destroy this email and any >> attachments. Email transmission cannot be guaranteed to be secure or >> error-free. The Company, therefore, does not make any guarantees as to the >> completeness or accuracy of this email or any attachments. This email is >> for informational purposes only and does not constitute a recommendation, >> offer, request or solicitation of any kind to buy, sell, subscribe, redeem >> or perform any type of transaction of a financial product. >> >> ------------------------------ >> >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> > -- > Sent from my Android device with K-9 Mail. Please excuse my brevity. > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From S.J.Thompson at bham.ac.uk Tue Aug 30 20:09:05 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Tue, 30 Aug 2016 19:09:05 +0000 Subject: [gpfsug-discuss] Maximum value for data replication? Message-ID: Is there a maximum value for data replication in Spectrum Scale? I have a number of nsd servers which have local storage and Id like each node to have a full copy of all the data in the file-system, say this value is 4, can I set replication to 4 for data and metadata and have each server have a full copy? These are protocol nodes and multi cluster mount another file system (yes I know not supported) and the cesroot is in the remote file system. On several occasions where GPFS has wibbled a bit, this has caused issues with ces locks, so I was thinking of moving the cesroot to a local filesysyem which is replicated on the local ssds in the protocol nodes. I.e. Its a generally quiet file system as its only ces cluster config. 
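Roughly what I have in mind, as a sketch only (option names from memory, so they may need checking against the docs):

mmces service stop NFS -a          # and SMB / OBJ if enabled
rsync -a /gpfs/remotefs/ces/ /gpfs/localfs/ces/
mmchconfig cesSharedRoot=/gpfs/localfs/ces    # may also need the CES nodes suspended first
mmces service start NFS -a
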
I assume if I stop protocols, rsync the data and then change to the new ces root, I should be able to get this working? Thanks Simon From kevindjo at us.ibm.com Tue Aug 30 20:43:39 2016 From: kevindjo at us.ibm.com (Kevin D Johnson) Date: Tue, 30 Aug 2016 19:43:39 +0000 Subject: [gpfsug-discuss] greetings Message-ID: An HTML attachment was scrubbed... URL: From xhejtman at ics.muni.cz Tue Aug 30 21:39:18 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Tue, 30 Aug 2016 22:39:18 +0200 Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> Hello, does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the latest patch 32) does start but does not mount and file system. The internal mount cmd gets stucked. -- Luk?? Hejtm?nek From kevindjo at us.ibm.com Tue Aug 30 21:51:39 2016 From: kevindjo at us.ibm.com (Kevin D Johnson) Date: Tue, 30 Aug 2016 20:51:39 +0000 Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 In-Reply-To: <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> References: <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> Message-ID: An HTML attachment was scrubbed... URL: From mark.bergman at uphs.upenn.edu Tue Aug 30 22:07:21 2016 From: mark.bergman at uphs.upenn.edu (mark.bergman at uphs.upenn.edu) Date: Tue, 30 Aug 2016 17:07:21 -0400 Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 In-Reply-To: Your message of "Tue, 30 Aug 2016 22:39:18 +0200." <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> References: <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> Message-ID: <24437-1472591241.445832@bR6O.TofS.917u> In the message dated: Tue, 30 Aug 2016 22:39:18 +0200, The pithy ruminations from Lukas Hejtmanek on <[gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8> were: => Hello, GPFS 3.5.0.[23..3-0] work for me under [CentOS|ScientificLinux] 6.8, but at kernel 2.6.32-573 and lower. I've found kernel bugs in blk_cloned_rq_check_limits() in later kernel revs that caused multipath errors, resulting in GPFS being unable to find all NSDs and mount the filesystem. I am not updating to a newer kernel until I'm certain this is resolved. I opened a bug with CentOS: https://bugs.centos.org/view.php?id=10997 and began an extended discussion with the (RH & SUSE) developers of that chunk of kernel code. I don't know if an upstream bug has been opened by RH, but see: https://patchwork.kernel.org/patch/9140337/ => => does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the => latest patch 32) does start but does not mount and file system. The internal => mount cmd gets stucked. => => -- => Luk?? Hejtm?nek -- Mark Bergman voice: 215-746-4061 mark.bergman at uphs.upenn.edu fax: 215-614-0266 http://www.cbica.upenn.edu/ IT Technical Director, Center for Biomedical Image Computing and Analytics Department of Radiology University of Pennsylvania PGP Key: http://www.cbica.upenn.edu/sbia/bergman From xhejtman at ics.muni.cz Tue Aug 30 23:02:50 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Wed, 31 Aug 2016 00:02:50 +0200 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" In-Reply-To: References: Message-ID: <20160830220250.yt6r7gvfq7rlvtcs@ics.muni.cz> Hello, On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > Find the paper here: > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection thank you for the paper, I appreciate it. However, I wonder whether it could be extended a little. 
As it has the title Petascale Data Protection, I think that in Peta scale, you have to deal with millions (well rather hundreds of millions) of files you store in and this is something where TSM does not scale well. Could you give some hints: On the backup site: mmbackup takes ages for: a) scan (try to scan 500M files even in parallel) b) backup - what if 10 % of files get changed - backup process can be blocked several days as mmbackup cannot run in several instances on the same file system, so you have to wait until one run of mmbackup finishes. How long could it take at petascale? On the restore site: how can I restore e.g. 40 millions of file efficiently? dsmc restore '/path/*' runs into serious troubles after say 20M files (maybe wrong internal structures used), however, scanning 1000 more files takes several minutes resulting the dsmc restore never reaches that 40M files. using filelists the situation is even worse. I run dsmc restore -filelist with a filelist consisting of 2.4M files. Running for *two* days without restoring even a single file. dsmc is consuming 100 % CPU. So any hints addressing these issues with really large number of files would be even more appreciated. -- Luk?? Hejtm?nek From oehmes at gmail.com Wed Aug 31 00:24:59 2016 From: oehmes at gmail.com (Sven Oehme) Date: Tue, 30 Aug 2016 16:24:59 -0700 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" In-Reply-To: <20160830220250.yt6r7gvfq7rlvtcs@ics.muni.cz> References: <20160830220250.yt6r7gvfq7rlvtcs@ics.muni.cz> Message-ID: so lets start with some simple questions. when you say mmbackup takes ages, what version of gpfs code are you running ? how do you execute the mmbackup command ? exact parameters would be useful . what HW are you using for the metadata disks ? how much capacity (df -h) and how many inodes (df -i) do you have in the filesystem you try to backup ? sven On Tue, Aug 30, 2016 at 3:02 PM, Lukas Hejtmanek wrote: > Hello, > > On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > > Find the paper here: > > > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/ > Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection > > thank you for the paper, I appreciate it. > > However, I wonder whether it could be extended a little. As it has the > title > Petascale Data Protection, I think that in Peta scale, you have to deal > with > millions (well rather hundreds of millions) of files you store in and this > is > something where TSM does not scale well. > > Could you give some hints: > > On the backup site: > mmbackup takes ages for: > a) scan (try to scan 500M files even in parallel) > b) backup - what if 10 % of files get changed - backup process can be > blocked > several days as mmbackup cannot run in several instances on the same file > system, so you have to wait until one run of mmbackup finishes. How long > could > it take at petascale? > > On the restore site: > how can I restore e.g. 40 millions of file efficiently? dsmc restore > '/path/*' > runs into serious troubles after say 20M files (maybe wrong internal > structures used), however, scanning 1000 more files takes several minutes > resulting the dsmc restore never reaches that 40M files. > > using filelists the situation is even worse. I run dsmc restore -filelist > with a filelist consisting of 2.4M files. Running for *two* days without > restoring even a single file. dsmc is consuming 100 % CPU. 
> > So any hints addressing these issues with really large number of files > would > be even more appreciated. > > -- > Luk?? Hejtm?nek > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Wed Aug 31 05:00:45 2016 From: aaron.s.knister at nasa.gov (Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]) Date: Wed, 31 Aug 2016 04:00:45 +0000 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" References: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" Message-ID: <5F910253243E6A47B81A9A2EB424BBA101CFF7DB@NDMSMBX404.ndc.nasa.gov> Just want to add on to one of the points Sven touched on regarding metadata HW. We have a modest SSD infrastructure for our metadata disks and we can scan 500M inodes in parallel in about 5 hours if my memory serves me right (and I believe we could go faster if we really wanted to). I think having solid metadata disks (no pun intended) will really help with scan times. From: Sven Oehme Sent: 8/30/16, 7:25 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" so lets start with some simple questions. when you say mmbackup takes ages, what version of gpfs code are you running ? how do you execute the mmbackup command ? exact parameters would be useful . what HW are you using for the metadata disks ? how much capacity (df -h) and how many inodes (df -i) do you have in the filesystem you try to backup ? sven On Tue, Aug 30, 2016 at 3:02 PM, Lukas Hejtmanek > wrote: Hello, On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > Find the paper here: > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection thank you for the paper, I appreciate it. However, I wonder whether it could be extended a little. As it has the title Petascale Data Protection, I think that in Peta scale, you have to deal with millions (well rather hundreds of millions) of files you store in and this is something where TSM does not scale well. Could you give some hints: On the backup site: mmbackup takes ages for: a) scan (try to scan 500M files even in parallel) b) backup - what if 10 % of files get changed - backup process can be blocked several days as mmbackup cannot run in several instances on the same file system, so you have to wait until one run of mmbackup finishes. How long could it take at petascale? On the restore site: how can I restore e.g. 40 millions of file efficiently? dsmc restore '/path/*' runs into serious troubles after say 20M files (maybe wrong internal structures used), however, scanning 1000 more files takes several minutes resulting the dsmc restore never reaches that 40M files. using filelists the situation is even worse. I run dsmc restore -filelist with a filelist consisting of 2.4M files. Running for *two* days without restoring even a single file. dsmc is consuming 100 % CPU. So any hints addressing these issues with really large number of files would be even more appreciated. -- Luk?? 
Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From olaf.weiser at de.ibm.com Wed Aug 31 05:52:57 2016 From: olaf.weiser at de.ibm.com (Olaf Weiser) Date: Wed, 31 Aug 2016 06:52:57 +0200 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" In-Reply-To: <5F910253243E6A47B81A9A2EB424BBA101CFF7DB@NDMSMBX404.ndc.nasa.gov> References: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" <5F910253243E6A47B81A9A2EB424BBA101CFF7DB@NDMSMBX404.ndc.nasa.gov> Message-ID: An HTML attachment was scrubbed... URL: From dominic.mueller at de.ibm.com Wed Aug 31 06:52:38 2016 From: dominic.mueller at de.ibm.com (Dominic Mueller-Wicke01) Date: Wed, 31 Aug 2016 07:52:38 +0200 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Dominic Mueller-Wicke) In-Reply-To: References: Message-ID: Thanks for reading the paper. I agree that the restore of a large number of files is a challenge today. The restore is the focus area for future enhancements for the integration between IBM Spectrum Scale and IBM Spectrum Protect. If something will be available that helps to improve the restore capabilities the paper will be updated with this information. Greetings, Dominic. From: gpfsug-discuss-request at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Date: 31.08.2016 01:25 Subject: gpfsug-discuss Digest, Vol 55, Issue 55 Sent by: gpfsug-discuss-bounces at spectrumscale.org Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Maximum value for data replication? (Simon Thompson (Research Computing - IT Services)) 2. greetings (Kevin D Johnson) 3. GPFS 3.5.0 on RHEL 6.8 (Lukas Hejtmanek) 4. Re: GPFS 3.5.0 on RHEL 6.8 (Kevin D Johnson) 5. Re: GPFS 3.5.0 on RHEL 6.8 (mark.bergman at uphs.upenn.edu) 6. Re: *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Lukas Hejtmanek) 7. Re: *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Sven Oehme) ----- Message from "Simon Thompson (Research Computing - IT Services)" on Tue, 30 Aug 2016 19:09:05 +0000 ----- To: "gpfsug-discuss at spectrumscale.org" Subject: [gpfsug-discuss] Maximum value for data replication? Is there a maximum value for data replication in Spectrum Scale? I have a number of nsd servers which have local storage and Id like each node to have a full copy of all the data in the file-system, say this value is 4, can I set replication to 4 for data and metadata and have each server have a full copy? These are protocol nodes and multi cluster mount another file system (yes I know not supported) and the cesroot is in the remote file system. On several occasions where GPFS has wibbled a bit, this has caused issues with ces locks, so I was thinking of moving the cesroot to a local filesysyem which is replicated on the local ssds in the protocol nodes. I.e. 
Its a generally quiet file system as its only ces cluster config. I assume if I stop protocols, rsync the data and then change to the new ces root, I should be able to get this working? Thanks Simon ----- Message from "Kevin D Johnson" on Tue, 30 Aug 2016 19:43:39 +0000 ----- To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] greetings I'm in Lab Services at IBM - just joining and happy to help any way I can. Kevin D. Johnson, MBA, MAFM Spectrum Computing, Senior Managing Consultant IBM Certified Deployment Professional - Spectrum Scale V4.1.1 IBM Certified Deployment Professional - Cloud Object Storage V3.8 720.349.6199 - kevindjo at us.ibm.com ----- Message from Lukas Hejtmanek on Tue, 30 Aug 2016 22:39:18 +0200 ----- To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Hello, does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the latest patch 32) does start but does not mount and file system. The internal mount cmd gets stucked. -- Luk?? Hejtm?nek ----- Message from "Kevin D Johnson" on Tue, 30 Aug 2016 20:51:39 +0000 ----- To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 RHEL 6.8/2.6.32-642 requires 4.1.1.8 or 4.2.1. You can either go to 6.7 for GPFS 3.5 or bump it up to 7.0/7.1. See Table 13, here: http://www.ibm.com/support/knowledgecenter/en/STXKQY/gpfsclustersfaq.html?view=kc#linuxq Kevin D. Johnson, MBA, MAFM Spectrum Computing, Senior Managing Consultant IBM Certified Deployment Professional - Spectrum Scale V4.1.1 IBM Certified Deployment Professional - Cloud Object Storage V3.8 720.349.6199 - kevindjo at us.ibm.com ----- Original message ----- From: Lukas Hejtmanek Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Cc: Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Date: Tue, Aug 30, 2016 4:39 PM Hello, does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the latest patch 32) does start but does not mount and file system. The internal mount cmd gets stucked. -- Luk?? Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ----- Message from mark.bergman at uphs.upenn.edu on Tue, 30 Aug 2016 17:07:21 -0400 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 In the message dated: Tue, 30 Aug 2016 22:39:18 +0200, The pithy ruminations from Lukas Hejtmanek on <[gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8> were: => Hello, GPFS 3.5.0.[23..3-0] work for me under [CentOS|ScientificLinux] 6.8, but at kernel 2.6.32-573 and lower. I've found kernel bugs in blk_cloned_rq_check_limits() in later kernel revs that caused multipath errors, resulting in GPFS being unable to find all NSDs and mount the filesystem. I am not updating to a newer kernel until I'm certain this is resolved. I opened a bug with CentOS: https://bugs.centos.org/view.php?id=10997 and began an extended discussion with the (RH & SUSE) developers of that chunk of kernel code. I don't know if an upstream bug has been opened by RH, but see: https://patchwork.kernel.org/patch/9140337/ => => does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the => latest patch 32) does start but does not mount and file system. The internal => mount cmd gets stucked. => => -- => Luk?? 
Hejtm?nek -- Mark Bergman voice: 215-746-4061 mark.bergman at uphs.upenn.edu fax: 215-614-0266 http://www.cbica.upenn.edu/ IT Technical Director, Center for Biomedical Image Computing and Analytics Department of Radiology University of Pennsylvania PGP Key: http://www.cbica.upenn.edu/sbia/bergman ----- Message from Lukas Hejtmanek on Wed, 31 Aug 2016 00:02:50 +0200 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" Hello, On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > Find the paper here: > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection thank you for the paper, I appreciate it. However, I wonder whether it could be extended a little. As it has the title Petascale Data Protection, I think that in Peta scale, you have to deal with millions (well rather hundreds of millions) of files you store in and this is something where TSM does not scale well. Could you give some hints: On the backup site: mmbackup takes ages for: a) scan (try to scan 500M files even in parallel) b) backup - what if 10 % of files get changed - backup process can be blocked several days as mmbackup cannot run in several instances on the same file system, so you have to wait until one run of mmbackup finishes. How long could it take at petascale? On the restore site: how can I restore e.g. 40 millions of file efficiently? dsmc restore '/path/*' runs into serious troubles after say 20M files (maybe wrong internal structures used), however, scanning 1000 more files takes several minutes resulting the dsmc restore never reaches that 40M files. using filelists the situation is even worse. I run dsmc restore -filelist with a filelist consisting of 2.4M files. Running for *two* days without restoring even a single file. dsmc is consuming 100 % CPU. So any hints addressing these issues with really large number of files would be even more appreciated. -- Luk?? Hejtm?nek ----- Message from Sven Oehme on Tue, 30 Aug 2016 16:24:59 -0700 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" so lets start with some simple questions. when you say mmbackup takes ages, what version of gpfs code are you running ? how do you execute the mmbackup command ? exact parameters would be useful . what HW are you using for the metadata disks ? how much capacity (df -h) and how many inodes (df -i) do you have in the filesystem you try to backup ? sven On Tue, Aug 30, 2016 at 3:02 PM, Lukas Hejtmanek wrote: Hello, On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > Find the paper here: > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection thank you for the paper, I appreciate it. However, I wonder whether it could be extended a little. As it has the title Petascale Data Protection, I think that in Peta scale, you have to deal with millions (well rather hundreds of millions) of files you store in and this is something where TSM does not scale well. Could you give some hints: On the backup site: mmbackup takes ages for: a) scan (try to scan 500M files even in parallel) b) backup - what if 10 % of files get changed - backup process can be blocked several days as mmbackup cannot run in several instances on the same file system, so you have to wait until one run of mmbackup finishes. 
How long could it take at petascale? On the restore site: how can I restore e.g. 40 millions of file efficiently? dsmc restore '/path/*' runs into serious troubles after say 20M files (maybe wrong internal structures used), however, scanning 1000 more files takes several minutes resulting the dsmc restore never reaches that 40M files. using filelists the situation is even worse. I run dsmc restore -filelist with a filelist consisting of 2.4M files. Running for *two* days without restoring even a single file. dsmc is consuming 100 % CPU. So any hints addressing these issues with really large number of files would be even more appreciated. -- Luk?? Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From xhejtman at ics.muni.cz Wed Aug 31 08:03:08 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Wed, 31 Aug 2016 09:03:08 +0200 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Dominic Mueller-Wicke) In-Reply-To: References: Message-ID: <20160831070308.fiogolgc2nhna6ir@ics.muni.cz> On Wed, Aug 31, 2016 at 07:52:38AM +0200, Dominic Mueller-Wicke01 wrote: > Thanks for reading the paper. I agree that the restore of a large number of > files is a challenge today. The restore is the focus area for future > enhancements for the integration between IBM Spectrum Scale and IBM > Spectrum Protect. If something will be available that helps to improve the > restore capabilities the paper will be updated with this information. I guess that one of the reasons that restore is slow is because this: (strace dsmc) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home/jfeit/atlases/atlases/stud/atl_en/_referencenotitsig", F_OK) = -1 ENOENT (No such file or directory) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home/jfeit/atlases/atlases/stud/atl_en", F_OK) = -1 ENOENT (No such file or directory) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home/jfeit/atlases/atlases/stud", F_OK) = -1 ENOENT (No such file or directory) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home/jfeit/atlases/atlases", F_OK) = -1 ENOENT (No such file or directory) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home/jfeit/atlases", F_OK) = -1 ENOENT (No such file or directory) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home/jfeit", F_OK) = -1 ENOENT (No such file or directory) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home", F_OK) = 0 [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum", F_OK) = 0 it seems that dsmc tests access again and again up to root for each item in the file list if I set different location where to place the restored files. -- Luk?? 
Hejtm?nek From duersch at us.ibm.com Wed Aug 31 13:45:12 2016 From: duersch at us.ibm.com (Steve Duersch) Date: Wed, 31 Aug 2016 08:45:12 -0400 Subject: [gpfsug-discuss] Maximum value for data replication? In-Reply-To: References: Message-ID: >>Is there a maximum value for data replication in Spectrum Scale? The maximum value for replication is 3. Steve Duersch Spectrum Scale RAID 845-433-7902 IBM Poughkeepsie, New York From: gpfsug-discuss-request at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Date: 08/30/2016 07:25 PM Subject: gpfsug-discuss Digest, Vol 55, Issue 55 Sent by: gpfsug-discuss-bounces at spectrumscale.org Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Maximum value for data replication? (Simon Thompson (Research Computing - IT Services)) 2. greetings (Kevin D Johnson) 3. GPFS 3.5.0 on RHEL 6.8 (Lukas Hejtmanek) 4. Re: GPFS 3.5.0 on RHEL 6.8 (Kevin D Johnson) 5. Re: GPFS 3.5.0 on RHEL 6.8 (mark.bergman at uphs.upenn.edu) 6. Re: *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Lukas Hejtmanek) 7. Re: *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Sven Oehme) ---------------------------------------------------------------------- Message: 1 Date: Tue, 30 Aug 2016 19:09:05 +0000 From: "Simon Thompson (Research Computing - IT Services)" To: "gpfsug-discuss at spectrumscale.org" Subject: [gpfsug-discuss] Maximum value for data replication? Message-ID: Content-Type: text/plain; charset="us-ascii" Is there a maximum value for data replication in Spectrum Scale? I have a number of nsd servers which have local storage and Id like each node to have a full copy of all the data in the file-system, say this value is 4, can I set replication to 4 for data and metadata and have each server have a full copy? These are protocol nodes and multi cluster mount another file system (yes I know not supported) and the cesroot is in the remote file system. On several occasions where GPFS has wibbled a bit, this has caused issues with ces locks, so I was thinking of moving the cesroot to a local filesysyem which is replicated on the local ssds in the protocol nodes. I.e. Its a generally quiet file system as its only ces cluster config. I assume if I stop protocols, rsync the data and then change to the new ces root, I should be able to get this working? Thanks Simon ------------------------------ Message: 2 Date: Tue, 30 Aug 2016 19:43:39 +0000 From: "Kevin D Johnson" To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] greetings Message-ID: Content-Type: text/plain; charset="us-ascii" An HTML attachment was scrubbed... URL: < http://gpfsug.org/pipermail/gpfsug-discuss/attachments/20160830/5a2e22a3/attachment-0001.html > ------------------------------ Message: 3 Date: Tue, 30 Aug 2016 22:39:18 +0200 From: Lukas Hejtmanek To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: <20160830203917.qptfgqvlmdbzu6wr at ics.muni.cz> Content-Type: text/plain; charset=iso-8859-2 Hello, does it work for anyone? 
As of kernel 2.6.32-642, GPFS 3.5.0 (including the latest patch 32) does start but does not mount and file system. The internal mount cmd gets stucked. -- Luk?? Hejtm?nek ------------------------------ Message: 4 Date: Tue, 30 Aug 2016 20:51:39 +0000 From: "Kevin D Johnson" To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: Content-Type: text/plain; charset="us-ascii" An HTML attachment was scrubbed... URL: < http://gpfsug.org/pipermail/gpfsug-discuss/attachments/20160830/341d5e11/attachment-0001.html > ------------------------------ Message: 5 Date: Tue, 30 Aug 2016 17:07:21 -0400 From: mark.bergman at uphs.upenn.edu To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: <24437-1472591241.445832 at bR6O.TofS.917u> Content-Type: text/plain; charset="UTF-8" In the message dated: Tue, 30 Aug 2016 22:39:18 +0200, The pithy ruminations from Lukas Hejtmanek on <[gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8> were: => Hello, GPFS 3.5.0.[23..3-0] work for me under [CentOS|ScientificLinux] 6.8, but at kernel 2.6.32-573 and lower. I've found kernel bugs in blk_cloned_rq_check_limits() in later kernel revs that caused multipath errors, resulting in GPFS being unable to find all NSDs and mount the filesystem. I am not updating to a newer kernel until I'm certain this is resolved. I opened a bug with CentOS: https://bugs.centos.org/view.php?id=10997 and began an extended discussion with the (RH & SUSE) developers of that chunk of kernel code. I don't know if an upstream bug has been opened by RH, but see: https://patchwork.kernel.org/patch/9140337/ => => does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the => latest patch 32) does start but does not mount and file system. The internal => mount cmd gets stucked. => => -- => Luk?? Hejtm?nek -- Mark Bergman voice: 215-746-4061 mark.bergman at uphs.upenn.edu fax: 215-614-0266 http://www.cbica.upenn.edu/ IT Technical Director, Center for Biomedical Image Computing and Analytics Department of Radiology University of Pennsylvania PGP Key: http://www.cbica.upenn.edu/sbia/bergman ------------------------------ Message: 6 Date: Wed, 31 Aug 2016 00:02:50 +0200 From: Lukas Hejtmanek To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" Message-ID: <20160830220250.yt6r7gvfq7rlvtcs at ics.muni.cz> Content-Type: text/plain; charset=iso-8859-2 Hello, On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > Find the paper here: > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection thank you for the paper, I appreciate it. However, I wonder whether it could be extended a little. As it has the title Petascale Data Protection, I think that in Peta scale, you have to deal with millions (well rather hundreds of millions) of files you store in and this is something where TSM does not scale well. Could you give some hints: On the backup site: mmbackup takes ages for: a) scan (try to scan 500M files even in parallel) b) backup - what if 10 % of files get changed - backup process can be blocked several days as mmbackup cannot run in several instances on the same file system, so you have to wait until one run of mmbackup finishes. How long could it take at petascale? On the restore site: how can I restore e.g. 40 millions of file efficiently? 
dsmc restore '/path/*' runs into serious troubles after say 20M files (maybe wrong internal structures used), however, scanning 1000 more files takes several minutes resulting the dsmc restore never reaches that 40M files. using filelists the situation is even worse. I run dsmc restore -filelist with a filelist consisting of 2.4M files. Running for *two* days without restoring even a single file. dsmc is consuming 100 % CPU. So any hints addressing these issues with really large number of files would be even more appreciated. -- Luk?? Hejtm?nek ------------------------------ Message: 7 Date: Tue, 30 Aug 2016 16:24:59 -0700 From: Sven Oehme To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" Message-ID: Content-Type: text/plain; charset="utf-8" so lets start with some simple questions. when you say mmbackup takes ages, what version of gpfs code are you running ? how do you execute the mmbackup command ? exact parameters would be useful . what HW are you using for the metadata disks ? how much capacity (df -h) and how many inodes (df -i) do you have in the filesystem you try to backup ? sven On Tue, Aug 30, 2016 at 3:02 PM, Lukas Hejtmanek wrote: > Hello, > > On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > > Find the paper here: > > > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/ > Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection > > thank you for the paper, I appreciate it. > > However, I wonder whether it could be extended a little. As it has the > title > Petascale Data Protection, I think that in Peta scale, you have to deal > with > millions (well rather hundreds of millions) of files you store in and this > is > something where TSM does not scale well. > > Could you give some hints: > > On the backup site: > mmbackup takes ages for: > a) scan (try to scan 500M files even in parallel) > b) backup - what if 10 % of files get changed - backup process can be > blocked > several days as mmbackup cannot run in several instances on the same file > system, so you have to wait until one run of mmbackup finishes. How long > could > it take at petascale? > > On the restore site: > how can I restore e.g. 40 millions of file efficiently? dsmc restore > '/path/*' > runs into serious troubles after say 20M files (maybe wrong internal > structures used), however, scanning 1000 more files takes several minutes > resulting the dsmc restore never reaches that 40M files. > > using filelists the situation is even worse. I run dsmc restore -filelist > with a filelist consisting of 2.4M files. Running for *two* days without > restoring even a single file. dsmc is consuming 100 % CPU. > > So any hints addressing these issues with really large number of files > would > be even more appreciated. > > -- > Luk?? Hejtm?nek > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: < http://gpfsug.org/pipermail/gpfsug-discuss/attachments/20160830/d9b3fb68/attachment.html > ------------------------------ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss End of gpfsug-discuss Digest, Vol 55, Issue 55 ********************************************** -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From daniel.kidger at uk.ibm.com Wed Aug 31 15:32:11 2016 From: daniel.kidger at uk.ibm.com (Daniel Kidger) Date: Wed, 31 Aug 2016 14:32:11 +0000 Subject: [gpfsug-discuss] Data Replication In-Reply-To: Message-ID: The other 'Exception' is when a rule is used to convert a 1 way replicated file to 2 way, or when only one failure group is up due to HW problems. It that case the (re-replication) is done by whatever nodes are used for the rule or command-line, which may include an NSD server. Daniel IBM Spectrum Storage Software +44 (0)7818 522266 Sent from my iPad using IBM Verse On 30 Aug 2016, 19:53:31, mimarsh2 at vt.edu wrote: From: mimarsh2 at vt.edu To: gpfsug-discuss at spectrumscale.org Cc: Date: 30 Aug 2016 19:53:31 Subject: Re: [gpfsug-discuss] Data Replication Thanks. This confirms the numbers that I am seeing. Brian On Tue, Aug 30, 2016 at 2:50 PM, Laurence Horrocks-Barlow wrote: Its the client that does all the synchronous replication, this way the cluster is able to scale as the clients do the leg work (so to speak). The somewhat "exception" is if a GPFS NSD server (or client with direct NSD) access uses a server bases protocol such as SMB, in this case the SMB server will do the replication as the SMB client doesn't know about GPFS or its replication; essentially the SMB server is the GPFS client. -- Lauz On 30 August 2016 17:03:38 CEST, Bryan Banister wrote: The NSD Client handles the replication and will, as you stated, write one copy to one NSD (using the primary server for this NSD) and one to a different NSD in a different GPFS failure group (using quite likely, but not necessarily, a different NSD server that is the primary server for this alternate NSD). Cheers, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Brian Marshall Sent: Tuesday, August 30, 2016 9:59 AM To: gpfsug main discussion list Subject: [gpfsug-discuss] Data Replication All, If I setup a filesystem to have data replication of 2 (2 copies of data), does the data get replicated at the NSD Server or at the client? i.e. Does the client send 2 copies over the network or does the NSD Server get a single copy and then replicate on storage NSDs? I couldn't find a place in the docs that talked about this specific point. Thank you, Brian Marshall Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. 
This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. gpfsug-discuss mailing listgpfsug-discuss at spectrumscale.orghttp://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Sent from my Android device with K-9 Mail. Please excuse my brevity. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discussUnless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU -------------- next part -------------- An HTML attachment was scrubbed... URL: From mimarsh2 at vt.edu Wed Aug 31 19:01:45 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Wed, 31 Aug 2016 14:01:45 -0400 Subject: [gpfsug-discuss] Data Replication In-Reply-To: References: Message-ID: Daniel, So here's my use case: I have a Sandisk IF150 (branded as DeepFlash recently) with 128TB of flash acting as a "fast tier" storage pool in our HPC scratch file system. Can I set the filesystem replication level to 1 then write a policy engine rule to send small and/or recent files to the IF150 with a replication of 2? Any other comments on the proposed usage strategy are helpful. Thank you, Brian Marshall On Wed, Aug 31, 2016 at 10:32 AM, Daniel Kidger wrote: > The other 'Exception' is when a rule is used to convert a 1 way replicated > file to 2 way, or when only one failure group is up due to HW problems. It > that case the (re-replication) is done by whatever nodes are used for the > rule or command-line, which may include an NSD server. > > Daniel > > IBM Spectrum Storage Software > +44 (0)7818 522266 <+44%207818%20522266> > Sent from my iPad using IBM Verse > > > ------------------------------ > On 30 Aug 2016, 19:53:31, mimarsh2 at vt.edu wrote: > > From: mimarsh2 at vt.edu > To: gpfsug-discuss at spectrumscale.org > Cc: > Date: 30 Aug 2016 19:53:31 > Subject: Re: [gpfsug-discuss] Data Replication > > > Thanks. This confirms the numbers that I am seeing. > > Brian > > On Tue, Aug 30, 2016 at 2:50 PM, Laurence Horrocks-Barlow < > laurence at qsplace.co.uk> wrote: > >> Its the client that does all the synchronous replication, this way the >> cluster is able to scale as the clients do the leg work (so to speak). >> >> The somewhat "exception" is if a GPFS NSD server (or client with direct >> NSD) access uses a server bases protocol such as SMB, in this case the SMB >> server will do the replication as the SMB client doesn't know about GPFS or >> its replication; essentially the SMB server is the GPFS client. >> >> -- Lauz >> >> On 30 August 2016 17:03:38 CEST, Bryan Banister < >> bbanister at jumptrading.com> wrote: >> >>> The NSD Client handles the replication and will, as you stated, write >>> one copy to one NSD (using the primary server for this NSD) and one to a >>> different NSD in a different GPFS failure group (using quite likely, but >>> not necessarily, a different NSD server that is the primary server for this >>> alternate NSD). 
>>> >>> Cheers, >>> >>> -Bryan >>> >>> >>> >>> *From:* gpfsug-discuss-bounces at spectrumscale.org [mailto: >>> gpfsug-discuss-bounces at spectrumscale.org] *On Behalf Of *Brian Marshall >>> *Sent:* Tuesday, August 30, 2016 9:59 AM >>> *To:* gpfsug main discussion list >>> *Subject:* [gpfsug-discuss] Data Replication >>> >>> >>> >>> All, >>> >>> >>> >>> If I setup a filesystem to have data replication of 2 (2 copies of >>> data), does the data get replicated at the NSD Server or at the client? >>> i.e. Does the client send 2 copies over the network or does the NSD Server >>> get a single copy and then replicate on storage NSDs? >>> >>> >>> >>> I couldn't find a place in the docs that talked about this specific >>> point. >>> >>> >>> >>> Thank you, >>> >>> Brian Marshall >>> >>> >>> ------------------------------ >>> >>> Note: This email is for the confidential use of the named addressee(s) >>> only and may contain proprietary, confidential or privileged information. >>> If you are not the intended recipient, you are hereby notified that any >>> review, dissemination or copying of this email is strictly prohibited, and >>> to please notify the sender immediately and destroy this email and any >>> attachments. Email transmission cannot be guaranteed to be secure or >>> error-free. The Company, therefore, does not make any guarantees as to the >>> completeness or accuracy of this email or any attachments. This email is >>> for informational purposes only and does not constitute a recommendation, >>> offer, request or solicitation of any kind to buy, sell, subscribe, redeem >>> or perform any type of transaction of a financial product. >>> >>> ------------------------------ >>> >>> gpfsug-discuss mailing list >>> gpfsug-discuss at spectrumscale.org >>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>> >>> >> -- >> Sent from my Android device with K-9 Mail. Please excuse my brevity. >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> > Unless stated otherwise above: > IBM United Kingdom Limited - Registered in England and Wales with number > 741598. > Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Wed Aug 31 19:10:07 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Wed, 31 Aug 2016 14:10:07 -0400 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" - how about a Billion files in 140 seconds? In-Reply-To: References: <20160830220250.yt6r7gvfq7rlvtcs@ics.muni.cz> Message-ID: When you write something like "mmbackup takes ages" - that let's us know how you feel, kinda. But we need some facts and data to make a determination if there is a real problem and whether and how it might be improved. Just to do a "back of the envelope" estimate of how long backup operations "ought to" take - we'd need to know how many disks and/or SSDs with what performance characteristics, how many nodes withf what performance characteristics, network "fabric(s)", Number of files to be scanned, Average number of files per directory, GPFS blocksize(s) configured, Backup devices available with speeds and feeds, etc, etc. 
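(Purely to illustrate the arithmetic, with assumed numbers rather than measurements: if a parallel policy scan sustains on the order of 2 million inodes per second across the helper nodes, scanning 500M inodes takes roughly 500M / 2M per second = 250 seconds; if 10% of those files changed and the backup clients together move about 5,000 files per second, the transfer phase alone is 50M / 5,000 per second = 10,000 seconds, close to three hours, before any per-file overhead is added. With reasonably sized hardware the scan itself need not be the long pole.)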
But anyway just to throw ballpark numbers "out there" to give you an idea of what is possible. I can tell you that a 20 months ago Sven and I benchmarked mmapplypolicy scanning 983 Million files in 136 seconds! The command looked like this: mmapplypolicy /ibm/fs2-1m-p01/shared/Btt -g /ibm/fs2-1m-p01/tmp -d 7 -A 256 -a 32 -n 8 -P /ghome/makaplan/sventests/milli.policy -I test -L 1 -N fastclients fastclients was 10 X86_64 commodity nodes The fs2-1m-p01 file system was hosted on just two IBM GSS nodes and everything was on an Infiniband switch. We packed about 7000 files into each directory.... (This admittedly may not be typical...) This is NOT to say you could back up that many files that fast, but Spectrum Scale metadata scanning can be fast, even with relatively modest hardware resources. YMMV ;-) Marc of GPFS -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhejtman at ics.muni.cz Wed Aug 31 19:39:26 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Wed, 31 Aug 2016 20:39:26 +0200 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" In-Reply-To: References: <20160830220250.yt6r7gvfq7rlvtcs@ics.muni.cz> Message-ID: <20160831183926.k4mbwbbrmxybd7a3@ics.muni.cz> On Tue, Aug 30, 2016 at 04:24:59PM -0700, Sven Oehme wrote: > so lets start with some simple questions. > > when you say mmbackup takes ages, what version of gpfs code are you running > ? that was GPFS 3.5.0-8. The mmapplypolicy took over 2 hours but that was the least problem. We developed our own set of backups scripts around mmbackup to address these issues: 1) while mmbackup is running, you cannot run another instance on the same file system. 2) mmbackup can be very slow, but not mmbackup itself but consecutive dsmc selective, sorry for being misleading, but mainly due to the large number of files to be backed up 3) related to the previous, mmbackup scripts seem to be executing a 'grep' cmd for every input file to check whether it has entry in dmsc output log. well guess what happens if you have millions of files at the input and several gigabytes in dsmc outpu log... In our case, the grep storm took several *weeks*. 4) very surprisingly, some of the files were not backed up at all. We cannot find why but dsmc incremental found some old files that were not covered by mmbackup backups. Maybe because the mmbackup process was not gracefully terminated in some cases (node crash) and so on. > how do you execute the mmbackup command ? exact parameters would be useful > . /usr/lpp/mmfs/bin/mmbackup tape_tape -t incremental -v -N fe1 -P ${POLICY_FILE} --tsm-servers SERVER1 -g /gpfs/clusterbase/tmp/ -s /tmp -m 4 -B 9999999999999 -L 0 we had external exec script that split files from policy into chunks that were run in parallel. > what HW are you using for the metadata disks ? 4x SSD > how much capacity (df -h) and how many inodes (df -i) do you have in the > filesystem you try to backup ? 
df -h /dev/tape_tape 1.5P 745T 711T 52% /exports/tape_tape df -hi /dev/tape_tape 1.3G 98M 1.2G 8% /exports/tape_tape (98M inodes used) mmdf tape_tape disk disk size failure holds holds free KB free KB name in KB group metadata data in full blocks in fragments --------------- ------------- -------- -------- ----- -------------------- ------------------- Disks in storage pool: system (Maximum disk size allowed is 175 TB) nsd_t1_5 23437934592 1 No Yes 7342735360 ( 31%) 133872128 ( 1%) nsd_t1_6 23437934592 1 No Yes 7341166592 ( 31%) 133918784 ( 1%) nsd_t1b_2 23437934592 1 No Yes 7343919104 ( 31%) 134165056 ( 1%) nsd_t1b_3 23437934592 1 No Yes 7341283328 ( 31%) 133986560 ( 1%) nsd_ssd_4 770703360 2 Yes No 692172800 ( 90%) 15981952 ( 2%) nsd_ssd_3 770703360 2 Yes No 692252672 ( 90%) 15921856 ( 2%) nsd_ssd_2 770703360 2 Yes No 692189184 ( 90%) 15928832 ( 2%) nsd_ssd_1 770703360 2 Yes No 692197376 ( 90%) 16013248 ( 2%) ------------- -------------------- ------------------- (pool total) 96834551808 32137916416 ( 33%) 599788416 ( 1%) Disks in storage pool: maid (Maximum disk size allowed is 466 TB) nsd8_t2_12 31249989632 1 No Yes 13167828992 ( 42%) 36282048 ( 0%) nsd8_t2_13 31249989632 1 No Yes 13166729216 ( 42%) 36131072 ( 0%) nsd8_t2_14 31249989632 1 No Yes 13166886912 ( 42%) 36371072 ( 0%) nsd8_t2_15 31249989632 1 No Yes 13168209920 ( 42%) 36681728 ( 0%) nsd8_t2_16 31249989632 1 No Yes 13165176832 ( 42%) 36279488 ( 0%) nsd8_t2_17 31249989632 1 No Yes 13159870464 ( 42%) 36002560 ( 0%) nsd8_t2_46 31249989632 1 No Yes 29624694784 ( 95%) 81600 ( 0%) nsd8_t2_45 31249989632 1 No Yes 29623111680 ( 95%) 77184 ( 0%) nsd8_t2_44 31249989632 1 No Yes 29621467136 ( 95%) 61440 ( 0%) nsd8_t2_43 31249989632 1 No Yes 29622964224 ( 95%) 64640 ( 0%) nsd8_t2_18 31249989632 1 No Yes 13166675968 ( 42%) 36147648 ( 0%) nsd8_t2_19 31249989632 1 No Yes 13164529664 ( 42%) 36225216 ( 0%) nsd8_t2_20 31249989632 1 No Yes 13165223936 ( 42%) 36242368 ( 0%) nsd8_t2_21 31249989632 1 No Yes 13167353856 ( 42%) 36007744 ( 0%) nsd8_t2_31 31249989632 1 No Yes 13116979200 ( 42%) 14155200 ( 0%) nsd8_t2_32 31249989632 1 No Yes 13115633664 ( 42%) 14243840 ( 0%) nsd8_t2_33 31249989632 1 No Yes 13115830272 ( 42%) 14235392 ( 0%) nsd8_t2_34 31249989632 1 No Yes 13119727616 ( 42%) 14500608 ( 0%) nsd8_t2_35 31249989632 1 No Yes 13116925952 ( 42%) 14304192 ( 0%) nsd8_t2_0 31249989632 1 No Yes 13145503744 ( 42%) 99222016 ( 0%) nsd8_t2_36 31249989632 1 No Yes 13119858688 ( 42%) 14054784 ( 0%) nsd8_t2_37 31249989632 1 No Yes 13114101760 ( 42%) 14200704 ( 0%) nsd8_t2_38 31249989632 1 No Yes 13116483584 ( 42%) 14174720 ( 0%) nsd8_t2_39 31249989632 1 No Yes 13121257472 ( 42%) 14094720 ( 0%) nsd8_t2_40 31249989632 1 No Yes 29622908928 ( 95%) 84352 ( 0%) nsd8_t2_1 31249989632 1 No Yes 13146089472 ( 42%) 99566784 ( 0%) nsd8_t2_2 31249989632 1 No Yes 13146208256 ( 42%) 99128960 ( 0%) nsd8_t2_3 31249989632 1 No Yes 13146890240 ( 42%) 99766720 ( 0%) nsd8_t2_4 31249989632 1 No Yes 13145143296 ( 42%) 98992576 ( 0%) nsd8_t2_5 31249989632 1 No Yes 13135876096 ( 42%) 99555008 ( 0%) nsd8_t2_6 31249989632 1 No Yes 13142831104 ( 42%) 99728064 ( 0%) nsd8_t2_7 31249989632 1 No Yes 13140283392 ( 42%) 99412480 ( 0%) nsd8_t2_8 31249989632 1 No Yes 13143470080 ( 42%) 99653696 ( 0%) nsd8_t2_9 31249989632 1 No Yes 13143650304 ( 42%) 99224704 ( 0%) nsd8_t2_10 31249989632 1 No Yes 13145440256 ( 42%) 99238528 ( 0%) nsd8_t2_11 31249989632 1 No Yes 13143201792 ( 42%) 99283008 ( 0%) nsd8_t2_22 31249989632 1 No Yes 13171724288 ( 42%) 36040704 ( 0%) nsd8_t2_23 31249989632 1 No Yes 
13166782464 ( 42%) 36212416 ( 0%) nsd8_t2_24 31249989632 1 No Yes 13167990784 ( 42%) 35842368 ( 0%) nsd8_t2_25 31249989632 1 No Yes 13166972928 ( 42%) 36086848 ( 0%) nsd8_t2_26 31249989632 1 No Yes 13167495168 ( 42%) 36114496 ( 0%) nsd8_t2_27 31249989632 1 No Yes 13164419072 ( 42%) 36119680 ( 0%) nsd8_t2_28 31249989632 1 No Yes 13167804416 ( 42%) 36088832 ( 0%) nsd8_t2_29 31249989632 1 No Yes 13166057472 ( 42%) 36107072 ( 0%) nsd8_t2_30 31249989632 1 No Yes 13163673600 ( 42%) 36102528 ( 0%) nsd8_t2_41 31249989632 1 No Yes 29620840448 ( 95%) 70208 ( 0%) nsd8_t2_42 31249989632 1 No Yes 29621110784 ( 95%) 69568 ( 0%) ------------- -------------------- ------------------- (pool total) 1468749512704 733299890176 ( 50%) 2008331584 ( 0%) ============= ==================== =================== (data) 1562501251072 762668994560 ( 49%) 2544274112 ( 0%) (metadata) 3082813440 2768812032 ( 90%) 63845888 ( 2%) ============= ==================== =================== (total) 1565584064512 765437806592 ( 49%) 2608120000 ( 0%) Inode Information ----------------- Number of used inodes: 102026081 Number of free inodes: 72791199 Number of allocated inodes: 174817280 Maximum number of inodes: 1342177280 -- Luk?? Hejtm?nek From xhejtman at ics.muni.cz Wed Aug 31 20:26:26 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Wed, 31 Aug 2016 21:26:26 +0200 Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 In-Reply-To: <24437-1472591241.445832@bR6O.TofS.917u> References: <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> <24437-1472591241.445832@bR6O.TofS.917u> Message-ID: <20160831192626.k4em4iz7ne2e2cmg@ics.muni.cz> Hello, thank you for explanation. I confirm that things are working with 573 kernel. On Tue, Aug 30, 2016 at 05:07:21PM -0400, mark.bergman at uphs.upenn.edu wrote: > In the message dated: Tue, 30 Aug 2016 22:39:18 +0200, > The pithy ruminations from Lukas Hejtmanek on > <[gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8> were: > => Hello, > > GPFS 3.5.0.[23..3-0] work for me under [CentOS|ScientificLinux] 6.8, > but at kernel 2.6.32-573 and lower. > > I've found kernel bugs in blk_cloned_rq_check_limits() in later kernel > revs that caused multipath errors, resulting in GPFS being unable to > find all NSDs and mount the filesystem. > > I am not updating to a newer kernel until I'm certain this is resolved. > > I opened a bug with CentOS: > > https://bugs.centos.org/view.php?id=10997 > > and began an extended discussion with the (RH & SUSE) developers of that > chunk of kernel code. I don't know if an upstream bug has been opened > by RH, but see: > > https://patchwork.kernel.org/patch/9140337/ > => > => does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the > => latest patch 32) does start but does not mount and file system. The internal > => mount cmd gets stucked. > => > => -- > => Luk?? Hejtm?nek > > > -- > Mark Bergman voice: 215-746-4061 > mark.bergman at uphs.upenn.edu fax: 215-614-0266 > http://www.cbica.upenn.edu/ > IT Technical Director, Center for Biomedical Image Computing and Analytics > Department of Radiology University of Pennsylvania > PGP Key: http://www.cbica.upenn.edu/sbia/bergman > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Luk?? 
Hejtm?nek From wilshire at mcs.anl.gov Wed Aug 31 20:39:17 2016 From: wilshire at mcs.anl.gov (John Blaas) Date: Wed, 31 Aug 2016 14:39:17 -0500 Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 In-Reply-To: <4e7507130c674e35a7ac2c3fa16359e1@GEORGE.anl.gov> References: <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> <24437-1472591241.445832@bR6O.TofS.917u> <4e7507130c674e35a7ac2c3fa16359e1@GEORGE.anl.gov> Message-ID: We are running 3.5 w/ patch 32 on nodes with the storage cluster running on Centos 6.8 with kernel at 2.6.32-642.1.1 and the remote compute cluster running 2.6.32-642.3.1 without any issues. That being said we are looking to upgrade as soon as possible to 4.1, but thought I would add that it is possible even if not supported. --- John Blaas On Wed, Aug 31, 2016 at 2:26 PM, Lukas Hejtmanek wrote: > Hello, > > thank you for explanation. I confirm that things are working with 573 kernel. > > On Tue, Aug 30, 2016 at 05:07:21PM -0400, mark.bergman at uphs.upenn.edu wrote: >> In the message dated: Tue, 30 Aug 2016 22:39:18 +0200, >> The pithy ruminations from Lukas Hejtmanek on >> <[gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8> were: >> => Hello, >> >> GPFS 3.5.0.[23..3-0] work for me under [CentOS|ScientificLinux] 6.8, >> but at kernel 2.6.32-573 and lower. >> >> I've found kernel bugs in blk_cloned_rq_check_limits() in later kernel >> revs that caused multipath errors, resulting in GPFS being unable to >> find all NSDs and mount the filesystem. >> >> I am not updating to a newer kernel until I'm certain this is resolved. >> >> I opened a bug with CentOS: >> >> https://bugs.centos.org/view.php?id=10997 >> >> and began an extended discussion with the (RH & SUSE) developers of that >> chunk of kernel code. I don't know if an upstream bug has been opened >> by RH, but see: >> >> https://patchwork.kernel.org/patch/9140337/ >> => >> => does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the >> => latest patch 32) does start but does not mount and file system. The internal >> => mount cmd gets stucked. >> => >> => -- >> => Luk?? Hejtm?nek >> >> >> -- >> Mark Bergman voice: 215-746-4061 >> mark.bergman at uphs.upenn.edu fax: 215-614-0266 >> http://www.cbica.upenn.edu/ >> IT Technical Director, Center for Biomedical Image Computing and Analytics >> Department of Radiology University of Pennsylvania >> PGP Key: http://www.cbica.upenn.edu/sbia/bergman >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -- > Luk?? Hejtm?nek > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss From janfrode at tanso.net Wed Aug 31 21:44:04 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Wed, 31 Aug 2016 22:44:04 +0200 Subject: [gpfsug-discuss] Data Replication In-Reply-To: References: Message-ID: Assuming your DeepFlash pool is named "deep", something like the following should work: RULE 'deepreplicate' migrate from pool 'deep' to pool 'deep' replicate(2) where MISC_ATTRIBUTES NOT LIKE '%2%' and POOL_NAME LIKE 'deep' "mmapplypolicy gpfs0 -P replicate-policy.pol -I yes" and possibly "mmrestripefs gpfs0 -r" afterwards. 
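For the placement side of the question (getting small and/or recently written files into the flash pool with two copies in the first place), a rough, untested sketch of the sort of rules a periodic mmapplypolicy run could use -- the pool names 'data' and 'deep', the 4 MB size cut-off and the age windows are assumptions to adapt, and replicate(2) is only accepted if the file system's maximum data replicas is at least 2:

  /* hypothetical tiering rules, run e.g. via "mmapplypolicy gpfs0 -P tier.pol -I yes" */
  RULE 'small_or_recent_to_deep'
    MIGRATE FROM POOL 'data' TO POOL 'deep' REPLICATE(2)
    WHERE KB_ALLOCATED <= 4096
       OR (DAYS(CURRENT_TIMESTAMP) - DAYS(MODIFICATION_TIME)) <= 7

  RULE 'cold_back_to_data'
    MIGRATE FROM POOL 'deep' THRESHOLD(90,70) TO POOL 'data' REPLICATE(1)
    WHERE (DAYS(CURRENT_TIMESTAMP) - DAYS(MODIFICATION_TIME)) > 30

(A create-time SET POOL rule can't key off file size, since the size isn't known yet, so size-based tiering has to happen in a scheduled policy run like the one above.)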
-jf On Wed, Aug 31, 2016 at 8:01 PM, Brian Marshall wrote: > Daniel, > > So here's my use case: I have a Sandisk IF150 (branded as DeepFlash > recently) with 128TB of flash acting as a "fast tier" storage pool in our > HPC scratch file system. Can I set the filesystem replication level to 1 > then write a policy engine rule to send small and/or recent files to the > IF150 with a replication of 2? > > Any other comments on the proposed usage strategy are helpful. > > Thank you, > Brian Marshall > > On Wed, Aug 31, 2016 at 10:32 AM, Daniel Kidger > wrote: > >> The other 'Exception' is when a rule is used to convert a 1 way >> replicated file to 2 way, or when only one failure group is up due to HW >> problems. It that case the (re-replication) is done by whatever nodes are >> used for the rule or command-line, which may include an NSD server. >> >> Daniel >> >> IBM Spectrum Storage Software >> +44 (0)7818 522266 <+44%207818%20522266> >> Sent from my iPad using IBM Verse >> >> >> ------------------------------ >> On 30 Aug 2016, 19:53:31, mimarsh2 at vt.edu wrote: >> >> From: mimarsh2 at vt.edu >> To: gpfsug-discuss at spectrumscale.org >> Cc: >> Date: 30 Aug 2016 19:53:31 >> Subject: Re: [gpfsug-discuss] Data Replication >> >> >> Thanks. This confirms the numbers that I am seeing. >> >> Brian >> >> On Tue, Aug 30, 2016 at 2:50 PM, Laurence Horrocks-Barlow < >> laurence at qsplace.co.uk> wrote: >> >>> Its the client that does all the synchronous replication, this way the >>> cluster is able to scale as the clients do the leg work (so to speak). >>> >>> The somewhat "exception" is if a GPFS NSD server (or client with direct >>> NSD) access uses a server bases protocol such as SMB, in this case the SMB >>> server will do the replication as the SMB client doesn't know about GPFS or >>> its replication; essentially the SMB server is the GPFS client. >>> >>> -- Lauz >>> >>> On 30 August 2016 17:03:38 CEST, Bryan Banister < >>> bbanister at jumptrading.com> wrote: >>> >>>> The NSD Client handles the replication and will, as you stated, write >>>> one copy to one NSD (using the primary server for this NSD) and one to a >>>> different NSD in a different GPFS failure group (using quite likely, but >>>> not necessarily, a different NSD server that is the primary server for this >>>> alternate NSD). >>>> >>>> Cheers, >>>> >>>> -Bryan >>>> >>>> >>>> >>>> *From:* gpfsug-discuss-bounces at spectrumscale.org [mailto: >>>> gpfsug-discuss-bounces at spectrumscale.org] *On Behalf Of *Brian Marshall >>>> *Sent:* Tuesday, August 30, 2016 9:59 AM >>>> *To:* gpfsug main discussion list >>>> *Subject:* [gpfsug-discuss] Data Replication >>>> >>>> >>>> >>>> All, >>>> >>>> >>>> >>>> If I setup a filesystem to have data replication of 2 (2 copies of >>>> data), does the data get replicated at the NSD Server or at the client? >>>> i.e. Does the client send 2 copies over the network or does the NSD Server >>>> get a single copy and then replicate on storage NSDs? >>>> >>>> >>>> >>>> I couldn't find a place in the docs that talked about this specific >>>> point. >>>> >>>> >>>> >>>> Thank you, >>>> >>>> Brian Marshall >>>> >>>> >>>> ------------------------------ >>>> >>>> Note: This email is for the confidential use of the named addressee(s) >>>> only and may contain proprietary, confidential or privileged information. 
>>>> If you are not the intended recipient, you are hereby notified that any >>>> review, dissemination or copying of this email is strictly prohibited, and >>>> to please notify the sender immediately and destroy this email and any >>>> attachments. Email transmission cannot be guaranteed to be secure or >>>> error-free. The Company, therefore, does not make any guarantees as to the >>>> completeness or accuracy of this email or any attachments. This email is >>>> for informational purposes only and does not constitute a recommendation, >>>> offer, request or solicitation of any kind to buy, sell, subscribe, redeem >>>> or perform any type of transaction of a financial product. >>>> >>>> ------------------------------ >>>> >>>> gpfsug-discuss mailing list >>>> gpfsug-discuss at spectrumscale.org >>>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>>> >>>> >>> -- >>> Sent from my Android device with K-9 Mail. Please excuse my brevity. >>> >>> _______________________________________________ >>> gpfsug-discuss mailing list >>> gpfsug-discuss at spectrumscale.org >>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>> >>> >> Unless stated otherwise above: >> IBM United Kingdom Limited - Registered in England and Wales with number >> 741598. >> Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From taylorm at us.ibm.com Mon Aug 1 17:42:19 2016 From: taylorm at us.ibm.com (Michael L Taylor) Date: Mon, 1 Aug 2016 09:42:19 -0700 Subject: [gpfsug-discuss] Spectrum Scale 4.2.1 Released In-Reply-To: References: Message-ID: Thanks for sharing Bob. Since some folks asked previously, if you go to the 4.2.1 FAQ PDF version there will be change bars on the left for what changed in FAQ from previous version as well as a FAQ July updates table near the top to quickly highlight the changes from last FAQ. http://www.ibm.com/support/knowledgecenter/en/STXKQY/gpfsclustersfaq.pdf?view=kc Also, two short blogs on the 4.2.1 release on the Storage Community might be of interest: http://storagecommunity.org/easyblog -------------- next part -------------- An HTML attachment was scrubbed... URL: From raot at bnl.gov Mon Aug 1 19:36:15 2016 From: raot at bnl.gov (Tejas Rao) Date: Mon, 1 Aug 2016 14:36:15 -0400 Subject: [gpfsug-discuss] HAWC (Highly available write cache) Message-ID: <7953aa8c-904a-cee5-34be-7d40e55b46db@bnl.gov> I have enabled write cache (HAWC) by running the below commands. The recovery logs are supposedly placed in the replicated system metadata pool (SSDs). I do not have a "system.log" pool as it is only needed if recovery logs are stored on the client nodes. mmchfs gpfs01 --write-cache-threshold 64K mmchfs gpfs01 -L 1024M mmchconfig logPingPongSector=no I have recycled the daemon on all nodes in the cluster (including the NSD nodes). I still see small synchronous writes (4K) from the clients going to the data drives (data pool). I am checking this by looking at "mmdiag --iohist" output. Should they not be going to the system pool? Do I need to do something else? How can I confirm that HAWC is working as advertised? Thanks. 
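A rough way to sanity-check a HAWC setup like the one above (a sketch only -- the attribute names in the mmlsfs output and the iohist tags can differ between releases):

  # confirm the new settings took effect (grep patterns are guesses at the attribute names)
  mmlsfs gpfs01 | grep -Ei 'write-cache|log'
  # while a small synchronous-write workload runs, watch the I/O history on a client;
  # recovery-log (HAWC) writes appear tagged as logData rather than as ordinary data I/O
  mmdiag --iohist | grep -i logData

If the 4K writes show up as logData against the system-pool NSDs, the log path is in use; the data-pool writes that remain are the background destaging of whatever could not be coalesced, as the follow-ups in this thread discuss.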
From oehmes at gmail.com Mon Aug 1 19:49:37 2016 From: oehmes at gmail.com (Sven Oehme) Date: Mon, 1 Aug 2016 11:49:37 -0700 Subject: [gpfsug-discuss] HAWC (Highly available write cache) In-Reply-To: <7953aa8c-904a-cee5-34be-7d40e55b46db@bnl.gov> References: <7953aa8c-904a-cee5-34be-7d40e55b46db@bnl.gov> Message-ID: when you say 'synchronous write' what do you mean by that ? if you are talking about using direct i/o (O_DIRECT flag), they don't leverage HAWC data path, its by design. sven On Mon, Aug 1, 2016 at 11:36 AM, Tejas Rao wrote: > I have enabled write cache (HAWC) by running the below commands. The > recovery logs are supposedly placed in the replicated system metadata pool > (SSDs). I do not have a "system.log" pool as it is only needed if recovery > logs are stored on the client nodes. > > mmchfs gpfs01 --write-cache-threshold 64K > mmchfs gpfs01 -L 1024M > mmchconfig logPingPongSector=no > > I have recycled the daemon on all nodes in the cluster (including the NSD > nodes). > > I still see small synchronous writes (4K) from the clients going to the > data drives (data pool). I am checking this by looking at "mmdiag --iohist" > output. Should they not be going to the system pool? > > Do I need to do something else? How can I confirm that HAWC is working as > advertised? > > Thanks. > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From raot at bnl.gov Mon Aug 1 20:05:52 2016 From: raot at bnl.gov (Tejas Rao) Date: Mon, 1 Aug 2016 15:05:52 -0400 Subject: [gpfsug-discuss] HAWC (Highly available write cache) In-Reply-To: References: <7953aa8c-904a-cee5-34be-7d40e55b46db@bnl.gov> Message-ID: <5629f550-05c9-25dd-bbe1-bdea618e8ae0@bnl.gov> In my case GPFS storage is used to store VM images (KVM) and hence the small IO. I always see lots of small 4K writes and the GPFS filesystem block size is 8MB. I thought the reason for the small writes is that the linux kernel requests GPFS to initiate a periodic sync which by default is every 5 seconds and can be controlled by "vm.dirty_writeback_centisecs". I thought HAWC would help in such cases and would harden (coalesce) the small writes in the "system" pool and would flush to the "data" pool in larger block size. Note - I am not doing direct i/o explicitly. On 8/1/2016 14:49, Sven Oehme wrote: > when you say 'synchronous write' what do you mean by that ? > if you are talking about using direct i/o (O_DIRECT flag), they don't > leverage HAWC data path, its by design. > > sven > > On Mon, Aug 1, 2016 at 11:36 AM, Tejas Rao > wrote: > > I have enabled write cache (HAWC) by running the below commands. > The recovery logs are supposedly placed in the replicated system > metadata pool (SSDs). I do not have a "system.log" pool as it is > only needed if recovery logs are stored on the client nodes. > > mmchfs gpfs01 --write-cache-threshold 64K > mmchfs gpfs01 -L 1024M > mmchconfig logPingPongSector=no > > I have recycled the daemon on all nodes in the cluster (including > the NSD nodes). > > I still see small synchronous writes (4K) from the clients going > to the data drives (data pool). I am checking this by looking at > "mmdiag --iohist" output. Should they not be going to the system pool? > > Do I need to do something else? How can I confirm that HAWC is > working as advertised? > > Thanks. 
> > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From dhildeb at us.ibm.com Mon Aug 1 20:50:09 2016 From: dhildeb at us.ibm.com (Dean Hildebrand) Date: Mon, 1 Aug 2016 12:50:09 -0700 Subject: [gpfsug-discuss] HAWC (Highly available write cache) In-Reply-To: <5629f550-05c9-25dd-bbe1-bdea618e8ae0@bnl.gov> References: <7953aa8c-904a-cee5-34be-7d40e55b46db@bnl.gov> <5629f550-05c9-25dd-bbe1-bdea618e8ae0@bnl.gov> Message-ID: Hi Tejas, Do you know the workload in the VM? The workload which enters into HAWC may or may not be the same as the workload that eventually goes into the data pool....it all depends on whether the 4KB writes entering HAWC can be coalesced or not. For example, sequential 4KB writes can all be coalesced into a single large chunk. So 4KB writes into HAWC will convert into 8MB writes to data pool (in your system). But random 4KB writes into HAWC may end up being 4KB writes into the data pool if there are no adjoining 4KB writes (i.e., if 4KB blocks are all dispersed, they can't be coalesced). The goal of HAWC though, whether the 4KB blocks are coalesced or not, is to reduce app latency by ensuring that writing the blocks back to the data pool is done in the background. So while 4KB blocks may still be hitting the data pool, hopefully the application is seeing the latency of your presumably lower latency system pool. Dean From: Tejas Rao To: gpfsug main discussion list Date: 08/01/2016 12:06 PM Subject: Re: [gpfsug-discuss] HAWC (Highly available write cache) Sent by: gpfsug-discuss-bounces at spectrumscale.org In my case GPFS storage is used to store VM images (KVM) and hence the small IO. I always see lots of small 4K writes and the GPFS filesystem block size is 8MB. I thought the reason for the small writes is that the linux kernel requests GPFS to initiate a periodic sync which by default is every 5 seconds and can be controlled by "vm.dirty_writeback_centisecs". I thought HAWC would help in such cases and would harden (coalesce) the small writes in the "system" pool and would flush to the "data" pool in larger block size. Note - I am not doing direct i/o explicitly. On 8/1/2016 14:49, Sven Oehme wrote: when you say 'synchronous write' what do you mean by that ?? if you are talking about using direct i/o (O_DIRECT flag), they don't leverage HAWC data path, its by design. sven On Mon, Aug 1, 2016 at 11:36 AM, Tejas Rao wrote: I have enabled write cache (HAWC) by running the below commands. The recovery logs are supposedly placed in the replicated system metadata pool (SSDs). I do not have a "system.log" pool as it is only needed if recovery logs are stored on the client nodes. mmchfs gpfs01 --write-cache-threshold 64K mmchfs gpfs01 -L 1024M mmchconfig logPingPongSector=no I have recycled the daemon on all nodes in the cluster (including the NSD nodes). I still see small synchronous writes (4K) from the clients going to the data drives (data pool). I am checking this by looking at "mmdiag --iohist" output. Should they not be going to the system pool? Do I need to do something else? How can I confirm that HAWC is working as advertised? Thanks. 
_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From raot at bnl.gov Mon Aug 1 21:42:06 2016 From: raot at bnl.gov (Tejas Rao) Date: Mon, 1 Aug 2016 16:42:06 -0400 Subject: [gpfsug-discuss] HAWC (Highly available write cache) In-Reply-To: References: <7953aa8c-904a-cee5-34be-7d40e55b46db@bnl.gov> <5629f550-05c9-25dd-bbe1-bdea618e8ae0@bnl.gov> Message-ID: <04707e32-83fc-f42d-10cf-99139c136371@bnl.gov> I am not 100% sure what the workload of the VMs is. We have 100's of VMs all used differently, so the workload is rather mixed. I do see 4K writes going to "system" pool, they are tagged as "logData" in 'mmdiag --iohist'. But I also see 4K writes going to the data drives, so it looks like everything is not getting coalesced and these are random writes. Could these 4k writes labelled as "logData" be the writes going to HAWC log files? On 8/1/2016 15:50, Dean Hildebrand wrote: > > Hi Tejas, > > Do you know the workload in the VM? > > The workload which enters into HAWC may or may not be the same as the > workload that eventually goes into the data pool....it all depends on > whether the 4KB writes entering HAWC can be coalesced or not. For > example, sequential 4KB writes can all be coalesced into a single > large chunk. So 4KB writes into HAWC will convert into 8MB writes to > data pool (in your system). But random 4KB writes into HAWC may end up > being 4KB writes into the data pool if there are no adjoining 4KB > writes (i.e., if 4KB blocks are all dispersed, they can't be > coalesced). The goal of HAWC though, whether the 4KB blocks are > coalesced or not, is to reduce app latency by ensuring that writing > the blocks back to the data pool is done in the background. So while > 4KB blocks may still be hitting the data pool, hopefully the > application is seeing the latency of your presumably lower latency > system pool. > > Dean > > > Inactive hide details for Tejas Rao ---08/01/2016 12:06:15 PM---In my > case GPFS storage is used to store VM images (KVM) and heTejas Rao > ---08/01/2016 12:06:15 PM---In my case GPFS storage is used to store > VM images (KVM) and hence the small IO. > > From: Tejas Rao > To: gpfsug main discussion list > Date: 08/01/2016 12:06 PM > Subject: Re: [gpfsug-discuss] HAWC (Highly available write cache) > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------------------------------------------------ > > > > In my case GPFS storage is used to store VM images (KVM) and hence the > small IO. > > I always see lots of small 4K writes and the GPFS filesystem block > size is 8MB. I thought the reason for the small writes is that the > linux kernel requests GPFS to initiate a periodic sync which by > default is every 5 seconds and can be controlled by > "vm.dirty_writeback_centisecs". 
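(A side note on that tunable: it is exposed through sysctl and expressed in hundredths of a second, so the 5-second default appears as 500. The second line below is purely an illustrative change, not a recommendation for VM workloads.)

# inspect the current writeback interval
sysctl vm.dirty_writeback_centisecs

# example only: stretch it to 15 seconds so more dirty pages are batched per sync
sysctl -w vm.dirty_writeback_centisecs=1500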
> > I thought HAWC would help in such cases and would harden (coalesce) > the small writes in the "system" pool and would flush to the "data" > pool in larger block size. > > Note - I am not doing direct i/o explicitly. > > > > On 8/1/2016 14:49, Sven Oehme wrote: > > when you say 'synchronous write' what do you mean by that ? > if you are talking about using direct i/o (O_DIRECT flag), > they don't leverage HAWC data path, its by design. > > sven > > On Mon, Aug 1, 2016 at 11:36 AM, Tejas Rao <_raot at bnl.gov_ > > wrote: > I have enabled write cache (HAWC) by running the below > commands. The recovery logs are supposedly placed in the > replicated system metadata pool (SSDs). I do not have a > "system.log" pool as it is only needed if recovery logs > are stored on the client nodes. > > mmchfs gpfs01 --write-cache-threshold 64K > mmchfs gpfs01 -L 1024M > mmchconfig logPingPongSector=no > > I have recycled the daemon on all nodes in the cluster > (including the NSD nodes). > > I still see small synchronous writes (4K) from the clients > going to the data drives (data pool). I am checking this > by looking at "mmdiag --iohist" output. Should they not be > going to the system pool? > > Do I need to do something else? How can I confirm that > HAWC is working as advertised? > > Thanks. > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at _spectrumscale.org_ > _ > __http://gpfsug.org/mailman/listinfo/gpfsug-discuss_ > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > _http://gpfsug.org/mailman/listinfo/gpfsug-discuss_ > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 105 bytes Desc: not available URL: From dhildeb at us.ibm.com Mon Aug 1 21:55:28 2016 From: dhildeb at us.ibm.com (Dean Hildebrand) Date: Mon, 1 Aug 2016 13:55:28 -0700 Subject: [gpfsug-discuss] HAWC (Highly available write cache) In-Reply-To: <04707e32-83fc-f42d-10cf-99139c136371@bnl.gov> References: <7953aa8c-904a-cee5-34be-7d40e55b46db@bnl.gov><5629f550-05c9-25dd-bbe1-bdea618e8ae0@bnl.gov> <04707e32-83fc-f42d-10cf-99139c136371@bnl.gov> Message-ID: Hi Tejas, Yes, most likely those 4k writes are the HAWC writes...hopefully those 4KB writes have a lower latency than the 4k writes to your data pool so you are realizing the benefits. Dean From: Tejas Rao To: gpfsug main discussion list Date: 08/01/2016 01:42 PM Subject: Re: [gpfsug-discuss] HAWC (Highly available write cache) Sent by: gpfsug-discuss-bounces at spectrumscale.org I am not 100% sure what the workload of the VMs is. We have 100's of VMs all used differently, so the workload is rather mixed. I do see 4K writes going to "system" pool, they are tagged as "logData" in 'mmdiag --iohist'. But I also see 4K writes going to the data drives, so it looks like everything is not getting coalesced and these are random writes. Could these 4k writes labelled as "logData" be the writes going to HAWC log files? 
On 8/1/2016 15:50, Dean Hildebrand wrote: Hi Tejas, Do you know the workload in the VM? The workload which enters into HAWC may or may not be the same as the workload that eventually goes into the data pool....it all depends on whether the 4KB writes entering HAWC can be coalesced or not. For example, sequential 4KB writes can all be coalesced into a single large chunk. So 4KB writes into HAWC will convert into 8MB writes to data pool (in your system). But random 4KB writes into HAWC may end up being 4KB writes into the data pool if there are no adjoining 4KB writes (i.e., if 4KB blocks are all dispersed, they can't be coalesced). The goal of HAWC though, whether the 4KB blocks are coalesced or not, is to reduce app latency by ensuring that writing the blocks back to the data pool is done in the background. So while 4KB blocks may still be hitting the data pool, hopefully the application is seeing the latency of your presumably lower latency system pool. Dean Inactive hide details for Tejas Rao ---08/01/2016 12:06:15 PM---In my case GPFS storage is used to store VM images (KVM) and heTejas Rao ---08/01/2016 12:06:15 PM---In my case GPFS storage is used to store VM images (KVM) and hence the small IO. From: Tejas Rao To: gpfsug main discussion list Date: 08/01/2016 12:06 PM Subject: Re: [gpfsug-discuss] HAWC (Highly available write cache) Sent by: gpfsug-discuss-bounces at spectrumscale.org In my case GPFS storage is used to store VM images (KVM) and hence the small IO. I always see lots of small 4K writes and the GPFS filesystem block size is 8MB. I thought the reason for the small writes is that the linux kernel requests GPFS to initiate a periodic sync which by default is every 5 seconds and can be controlled by "vm.dirty_writeback_centisecs". I thought HAWC would help in such cases and would harden (coalesce) the small writes in the "system" pool and would flush to the "data" pool in larger block size. Note - I am not doing direct i/o explicitly. On 8/1/2016 14:49, Sven Oehme wrote: when you say 'synchronous write' what do you mean by that ? if you are talking about using direct i/o (O_DIRECT flag), they don't leverage HAWC data path, its by design. sven On Mon, Aug 1, 2016 at 11:36 AM, Tejas Rao wrote: I have enabled write cache (HAWC) by running the below commands. The recovery logs are supposedly placed in the replicated system metadata pool (SSDs). I do not have a "system.log" pool as it is only needed if recovery logs are stored on the client nodes. mmchfs gpfs01 --write-cache-threshold 64K mmchfs gpfs01 -L 1024M mmchconfig logPingPongSector=no I have recycled the daemon on all nodes in the cluster (including the NSD nodes). I still see small synchronous writes (4K) from the clients going to the data drives (data pool). I am checking this by looking at "mmdiag --iohist" output. Should they not be going to the system pool? Do I need to do something else? How can I confirm that HAWC is working as advertised? Thanks. 
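Pulling the thread together, the practical check that emerges here is to watch the client-side I/O history while the VM workload runs and confirm that the small writes are being absorbed as recovery-log records on the system pool, with the data-pool write-back happening in the background. A rough sketch, using the names from this thread:

# on a client node while the VMs are busy
mmdiag --iohist | grep logData    # HAWC log hardening -- these I/Os should hit the system-pool (SSD) NSDs
mmdiag --iohist | grep -w data    # regular data-pool writes -- done in the background, so application
                                  # latency should track the logData entries rather than these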
_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From Greg.Lehmann at csiro.au Wed Aug 3 06:06:32 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Wed, 3 Aug 2016 05:06:32 +0000 Subject: [gpfsug-discuss] SS 4.2.1.0 upgrade pain Message-ID: <04fbf3c0ae40468d912293821905197d@exch1-cdc.nexus.csiro.au> On Debian I am seeing this when trying to upgrade: mmshutdown dpkg -I gpfs.base_4.2.1-0_amd64.deb gpfs.docs_4.2.1-0_all.deb gpfs.ext_4.2.1-0_amd64.deb gpfs.gpl_4.2.1-0_all.deb gpfs.gskit_8.0.50-57_amd64.deb gpfs.msg.en-us_4.2.1-0_all.deb (Reading database ... 65194 files and directories currently installed.) Preparing to replace gpfs.base 4.1.0-6 (using gpfs.base_4.2.1-0_amd64.deb) ... Unpacking replacement gpfs.base ... Preparing to replace gpfs.docs 4.1.0-6 (using gpfs.docs_4.2.1-0_all.deb) ... Unpacking replacement gpfs.docs ... Preparing to replace gpfs.ext 4.1.0-6 (using gpfs.ext_4.2.1-0_amd64.deb) ... Unpacking replacement gpfs.ext ... Etc. Unpacking replacement gpfs.gpl ... Preparing to replace gpfs.gskit 8.0.50-32 (using gpfs.gskit_8.0.50-57_amd64.deb) ... Unpacking replacement gpfs.gskit ... Preparing to replace gpfs.msg.en-us 4.1.0-6 (using gpfs.msg.en-us_4.2.1-0_all.deb) ... Unpacking replacement gpfs.msg.en-us ... Setting up gpfs.base (4.2.1-0) ... At which point it hangs. A ps shows this: ps -ef | grep mm root 21269 1 0 14:18 pts/0 00:00:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmccrmonitor 15 root 21276 21150 1 14:18 pts/0 00:00:03 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmsysmoncontrol start root 21363 1 0 14:18 ? 
00:00:00 /usr/lpp/mmfs/bin/mmsdrserv 1191 10 10 /var/adm/ras/mmsdrserv.log 128 yes root 22485 21276 0 14:18 pts/0 00:00:00 python /usr/lpp/mmfs/bin/mmsysmon.py root 22486 22485 0 14:18 pts/0 00:00:00 /bin/sh -c /usr/lpp/mmfs/bin/mmlsmgr -c root 22488 22486 1 14:18 pts/0 00:00:03 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmlsmgr -c root 24420 22488 0 14:18 pts/0 00:00:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmcommon linkCommand hadoop1-12-cdc-ib2.it.csiro.au /var/mmfs/tmp/nodefile.mmlsmgr.22488 mmlsmgr -c root 24439 24420 0 14:18 pts/0 00:00:00 /usr/bin/perl /usr/lpp/mmfs/bin/mmdsh -svL gpfs-07-cdc-ib2.san.csiro.au /usr/lpp/mmfs/bin/mmremote mmrpc:1:1:1510:mmrc_mmlsmgr_hadoop1-12-cdc-ib2.it.csiro.au_24420_1470197923_: runCmd _NO_FILE_COPY_ _NO_MOUNT_CHECK_ NULL _LINK_ mmlsmgr -c root 24446 24439 0 14:18 pts/0 00:00:00 /usr/bin/ssh gpfs-07-cdc-ib2.san.csiro.au -n -l root /bin/ksh -c ' LANG=en_US.UTF-8 LC_ALL= LC_COLLATE= LC_TYPE= LC_MONETARY= LC_NUMERIC= LC_TIME= LC_MESSAGES= MMMODE=lc environmentType=lc2 GPFS_rshPath=/usr/bin/ssh GPFS_rcpPath=/usr/bin/scp mmScriptTrace= GPFSCMDPORTRANGE=0 GPFS_CIM_MSG_FORMAT= /usr/lpp/mmfs/bin/mmremote mmrpc:1:1:1510:mmrc_mmlsmgr_hadoop1-12-cdc-ib2.it.csiro.au_24420_1470197923_: runCmd _NO_FILE_COPY_ _NO_MOUNT_CHECK_ NULL _LINK_ mmlsmgr -c ' root 24546 21269 0 14:23 pts/0 00:00:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmccrmonitor 15 root 24548 24455 0 14:23 pts/1 00:00:00 grep mm It is trying to connect with ssh to one of my nsd servers, that it does not have permission to? I am guessing that is where the hang is. Anybody else seen this? I have a workaround - remove from cluster before the update, but this is a bit of extra work I can do without. I have not had to this for previous versions starting with 4.1.0.0. Greg -------------- next part -------------- An HTML attachment was scrubbed... URL: From Greg.Lehmann at csiro.au Wed Aug 3 08:32:43 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Wed, 3 Aug 2016 07:32:43 +0000 Subject: [gpfsug-discuss] SS 4.2.1.0 upgrade pain In-Reply-To: <04fbf3c0ae40468d912293821905197d@exch1-cdc.nexus.csiro.au> References: <04fbf3c0ae40468d912293821905197d@exch1-cdc.nexus.csiro.au> Message-ID: <663114b24b0b403aa076a83791f32c58@exch1-cdc.nexus.csiro.au> And I am seeing the same behaviour on a SLES 12 SP1 update from 4.2.04 to 4.2.1.0. From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Greg.Lehmann at csiro.au Sent: Wednesday, 3 August 2016 3:07 PM To: gpfsug-discuss at spectrumscale.org Subject: [ExternalEmail] [gpfsug-discuss] SS 4.2.1.0 upgrade pain On Debian I am seeing this when trying to upgrade: mmshutdown dpkg -I gpfs.base_4.2.1-0_amd64.deb gpfs.docs_4.2.1-0_all.deb gpfs.ext_4.2.1-0_amd64.deb gpfs.gpl_4.2.1-0_all.deb gpfs.gskit_8.0.50-57_amd64.deb gpfs.msg.en-us_4.2.1-0_all.deb (Reading database ... 65194 files and directories currently installed.) Preparing to replace gpfs.base 4.1.0-6 (using gpfs.base_4.2.1-0_amd64.deb) ... Unpacking replacement gpfs.base ... Preparing to replace gpfs.docs 4.1.0-6 (using gpfs.docs_4.2.1-0_all.deb) ... Unpacking replacement gpfs.docs ... Preparing to replace gpfs.ext 4.1.0-6 (using gpfs.ext_4.2.1-0_amd64.deb) ... Unpacking replacement gpfs.ext ... Etc. Unpacking replacement gpfs.gpl ... Preparing to replace gpfs.gskit 8.0.50-32 (using gpfs.gskit_8.0.50-57_amd64.deb) ... Unpacking replacement gpfs.gskit ... Preparing to replace gpfs.msg.en-us 4.1.0-6 (using gpfs.msg.en-us_4.2.1-0_all.deb) ... 
Unpacking replacement gpfs.msg.en-us ... Setting up gpfs.base (4.2.1-0) ... At which point it hangs. A ps shows this: ps -ef | grep mm root 21269 1 0 14:18 pts/0 00:00:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmccrmonitor 15 root 21276 21150 1 14:18 pts/0 00:00:03 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmsysmoncontrol start root 21363 1 0 14:18 ? 00:00:00 /usr/lpp/mmfs/bin/mmsdrserv 1191 10 10 /var/adm/ras/mmsdrserv.log 128 yes root 22485 21276 0 14:18 pts/0 00:00:00 python /usr/lpp/mmfs/bin/mmsysmon.py root 22486 22485 0 14:18 pts/0 00:00:00 /bin/sh -c /usr/lpp/mmfs/bin/mmlsmgr -c root 22488 22486 1 14:18 pts/0 00:00:03 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmlsmgr -c root 24420 22488 0 14:18 pts/0 00:00:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmcommon linkCommand hadoop1-12-cdc-ib2.it.csiro.au /var/mmfs/tmp/nodefile.mmlsmgr.22488 mmlsmgr -c root 24439 24420 0 14:18 pts/0 00:00:00 /usr/bin/perl /usr/lpp/mmfs/bin/mmdsh -svL gpfs-07-cdc-ib2.san.csiro.au /usr/lpp/mmfs/bin/mmremote mmrpc:1:1:1510:mmrc_mmlsmgr_hadoop1-12-cdc-ib2.it.csiro.au_24420_1470197923_: runCmd _NO_FILE_COPY_ _NO_MOUNT_CHECK_ NULL _LINK_ mmlsmgr -c root 24446 24439 0 14:18 pts/0 00:00:00 /usr/bin/ssh gpfs-07-cdc-ib2.san.csiro.au -n -l root /bin/ksh -c ' LANG=en_US.UTF-8 LC_ALL= LC_COLLATE= LC_TYPE= LC_MONETARY= LC_NUMERIC= LC_TIME= LC_MESSAGES= MMMODE=lc environmentType=lc2 GPFS_rshPath=/usr/bin/ssh GPFS_rcpPath=/usr/bin/scp mmScriptTrace= GPFSCMDPORTRANGE=0 GPFS_CIM_MSG_FORMAT= /usr/lpp/mmfs/bin/mmremote mmrpc:1:1:1510:mmrc_mmlsmgr_hadoop1-12-cdc-ib2.it.csiro.au_24420_1470197923_: runCmd _NO_FILE_COPY_ _NO_MOUNT_CHECK_ NULL _LINK_ mmlsmgr -c ' root 24546 21269 0 14:23 pts/0 00:00:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmccrmonitor 15 root 24548 24455 0 14:23 pts/1 00:00:00 grep mm It is trying to connect with ssh to one of my nsd servers, that it does not have permission to? I am guessing that is where the hang is. Anybody else seen this? I have a workaround - remove from cluster before the update, but this is a bit of extra work I can do without. I have not had to this for previous versions starting with 4.1.0.0. Greg -------------- next part -------------- An HTML attachment was scrubbed... URL: From kenneth.waegeman at ugent.be Wed Aug 3 09:54:30 2016 From: kenneth.waegeman at ugent.be (Kenneth Waegeman) Date: Wed, 3 Aug 2016 10:54:30 +0200 Subject: [gpfsug-discuss] Upgrade from 4.1.1 to 4.2.1 Message-ID: <57A1B146.9070505@ugent.be> Hi, In the upgrade procedure (prerequisites) of 4.2.1, I read: "If you are coming from 4.1.1-X, you must first upgrade to 4.2.0-0. You may use this 4.2.1-0 package to perform a First Time Install or to upgrade from an existing 4.2.0-X level." What does this mean exactly. Should we just install the 4.2.0 rpms first, and then the 4.2.1 rpms, or should we install the 4.2.0 rpms, start up gpfs, bring gpfs down again and then do the 4.2.1 rpms? But if we re-install a 4.1.1 node, we can immediately install 4.2.1 ? Thanks! Kenneth From bbanister at jumptrading.com Wed Aug 3 15:53:52 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Wed, 3 Aug 2016 14:53:52 +0000 Subject: [gpfsug-discuss] Upgrade from 4.1.1 to 4.2.1 In-Reply-To: <57A1B146.9070505@ugent.be> References: <57A1B146.9070505@ugent.be> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB062B3718@CHI-EXCHANGEW1.w2k.jumptrading.com> Your first process is correct. Install the 4.2.0-0 rpms first, then install the 4.2.1 rpms after. 
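In command form, per node, the sequence is roughly the following (a sketch only: the paths are placeholders, and the portability-layer rebuild assumes mmbuildgpl is present at your level):

mmshutdown
rpm -Uvh /path/to/4.2.0-0/gpfs.*.rpm    # first hop: 4.1.1-x -> 4.2.0-0, no need to start GPFS in between
rpm -Uvh /path/to/4.2.1-0/gpfs.*.rpm    # second hop: 4.2.0-0 -> 4.2.1-0
mmbuildgpl                              # rebuild the portability layer for the running kernel
mmstartup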
-Bryan -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Kenneth Waegeman Sent: Wednesday, August 03, 2016 3:55 AM To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] Upgrade from 4.1.1 to 4.2.1 Hi, In the upgrade procedure (prerequisites) of 4.2.1, I read: "If you are coming from 4.1.1-X, you must first upgrade to 4.2.0-0. You may use this 4.2.1-0 package to perform a First Time Install or to upgrade from an existing 4.2.0-X level." What does this mean exactly. Should we just install the 4.2.0 rpms first, and then the 4.2.1 rpms, or should we install the 4.2.0 rpms, start up gpfs, bring gpfs down again and then do the 4.2.1 rpms? But if we re-install a 4.1.1 node, we can immediately install 4.2.1 ? Thanks! Kenneth _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. From pinto at scinet.utoronto.ca Wed Aug 3 17:22:27 2016 From: pinto at scinet.utoronto.ca (Jaime Pinto) Date: Wed, 03 Aug 2016 12:22:27 -0400 Subject: [gpfsug-discuss] quota on secondary groups for a user? Message-ID: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Suppose I want to set both USR and GRP quotas for a user, however GRP is not the primary group. Will gpfs enforce the secondary group quota for that user? What I mean is, if the user keeps writing files with secondary group as the attribute, and that overall group quota is reached, will that user be stopped by gpfs? Thanks Jaime ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. From oehmes at gmail.com Wed Aug 3 17:35:39 2016 From: oehmes at gmail.com (Sven Oehme) Date: Wed, 3 Aug 2016 09:35:39 -0700 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: Hi, quotas are only counted against primary group sven On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto wrote: > Suppose I want to set both USR and GRP quotas for a user, however GRP is > not the primary group. Will gpfs enforce the secondary group quota for that > user? 
> > What I mean is, if the user keeps writing files with secondary group as > the attribute, and that overall group quota is reached, will that user be > stopped by gpfs? > > Thanks > Jaime > > > > > ************************************ > TELL US ABOUT YOUR SUCCESS STORIES > http://www.scinethpc.ca/testimonials > ************************************ > --- > Jaime Pinto > SciNet HPC Consortium - Compute/Calcul Canada > www.scinet.utoronto.ca - www.computecanada.org > University of Toronto > 256 McCaul Street, Room 235 > Toronto, ON, M5T1W5 > P: 416-978-2755 > C: 416-505-1477 > > ---------------------------------------------------------------- > This message was sent using IMP at SciNet Consortium, University of > Toronto. > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From pinto at scinet.utoronto.ca Wed Aug 3 17:41:24 2016 From: pinto at scinet.utoronto.ca (Jaime Pinto) Date: Wed, 03 Aug 2016 12:41:24 -0400 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: <20160803124124.21815zz1w4exmuus@support.scinet.utoronto.ca> Quoting "Sven Oehme" : > Hi, > > quotas are only counted against primary group > > sven Thanks Sven I kind of suspected, but needed an independent confirmation. Jaime > > > On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > wrote: > >> Suppose I want to set both USR and GRP quotas for a user, however GRP is >> not the primary group. Will gpfs enforce the secondary group quota for that >> user? >> >> What I mean is, if the user keeps writing files with secondary group as >> the attribute, and that overall group quota is reached, will that user be >> stopped by gpfs? >> >> Thanks >> Jaime >> >> >> >> >> ************************************ >> TELL US ABOUT YOUR SUCCESS STORIES >> http://www.scinethpc.ca/testimonials >> ************************************ >> --- >> Jaime Pinto >> SciNet HPC Consortium - Compute/Calcul Canada >> www.scinet.utoronto.ca - www.computecanada.org >> University of Toronto >> 256 McCaul Street, Room 235 >> Toronto, ON, M5T1W5 >> P: 416-978-2755 >> C: 416-505-1477 >> >> ---------------------------------------------------------------- >> This message was sent using IMP at SciNet Consortium, University of >> Toronto. >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. From jonathan at buzzard.me.uk Wed Aug 3 17:44:01 2016 From: jonathan at buzzard.me.uk (Jonathan Buzzard) Date: Wed, 3 Aug 2016 17:44:01 +0100 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> On 03/08/16 17:22, Jaime Pinto wrote: > Suppose I want to set both USR and GRP quotas for a user, however GRP is > not the primary group. Will gpfs enforce the secondary group quota for > that user? Nope that's not how POSIX schematics work for group quotas. 
As far as I can tell only your primary group is used for group quotas. It basically makes group quotas in Unix a waste of time in my opinion. At least I have never come across a real world scenario where they work in a useful manner. > What I mean is, if the user keeps writing files with secondary group as > the attribute, and that overall group quota is reached, will that user > be stopped by gpfs? > File sets are the answer to your problems, but retrospectively applying them to a file system is a pain. You create a file set for a directory and can then apply a quota to the file set. Even better you can apply per file set user and group quotas. So if file set A has a 1TB quota you could limit user X to 100GB in the file set, but outside the file set they could have a different quota or even no quota. Only issue is a limit of ~10,000 file sets per file system JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. From pinto at scinet.utoronto.ca Wed Aug 3 17:55:43 2016 From: pinto at scinet.utoronto.ca (Jaime Pinto) Date: Wed, 03 Aug 2016 12:55:43 -0400 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> Message-ID: <20160803125543.11831ypcdi8i189b@support.scinet.utoronto.ca> I guess I have a bit of a puzzle to solve, combining quotas on filesets, paths and USR/GRP attributes So much for the "standard" built-in linux account creation script, in which by default every new user is created with primary GID=UID, doesn't really help any of us. Jaime Quoting "Jonathan Buzzard" : > On 03/08/16 17:22, Jaime Pinto wrote: >> Suppose I want to set both USR and GRP quotas for a user, however GRP is >> not the primary group. Will gpfs enforce the secondary group quota for >> that user? > > Nope that's not how POSIX schematics work for group quotas. As far as I > can tell only your primary group is used for group quotas. It basically > makes group quotas in Unix a waste of time in my opinion. At least I > have never come across a real world scenario where they work in a > useful manner. > >> What I mean is, if the user keeps writing files with secondary group as >> the attribute, and that overall group quota is reached, will that user >> be stopped by gpfs? >> > > File sets are the answer to your problems, but retrospectively applying > them to a file system is a pain. You create a file set for a directory > and can then apply a quota to the file set. Even better you can apply > per file set user and group quotas. So if file set A has a 1TB quota > you could limit user X to 100GB in the file set, but outside the file > set they could have a different quota or even no quota. > > Only issue is a limit of ~10,000 file sets per file system > > > JAB. > > -- > Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk > Fife, United Kingdom. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. From Kevin.Buterbaugh at Vanderbilt.Edu Wed Aug 3 19:06:34 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 3 Aug 2016 18:06:34 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? 
In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: Hi Sven, Wait - am I misunderstanding something here? Let?s say that I have ?user1? who has primary group ?group1? and secondary group ?group2?. And let?s say that they write to a directory where the bit on the directory forces all files created in that directory to have group2 associated with them. Are you saying that those files still count against group1?s group quota??? Thanks for clarifying? Kevin On Aug 3, 2016, at 11:35 AM, Sven Oehme > wrote: Hi, quotas are only counted against primary group sven On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > wrote: Suppose I want to set both USR and GRP quotas for a user, however GRP is not the primary group. Will gpfs enforce the secondary group quota for that user? What I mean is, if the user keeps writing files with secondary group as the attribute, and that overall group quota is reached, will that user be stopped by gpfs? Thanks Jaime ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From pinto at scinet.utoronto.ca Wed Aug 3 19:30:08 2016 From: pinto at scinet.utoronto.ca (Jaime Pinto) Date: Wed, 03 Aug 2016 14:30:08 -0400 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> Quoting "Buterbaugh, Kevin L" : > Hi Sven, > > Wait - am I misunderstanding something here? Let?s say that I have > ?user1? who has primary group ?group1? and secondary group ?group2?. > And let?s say that they write to a directory where the bit on the > directory forces all files created in that directory to have group2 > associated with them. Are you saying that those files still count > against group1?s group quota??? > > Thanks for clarifying? > > Kevin Not really, My interpretation is that all files written with group2 will count towards the quota on that group. However any users with group2 as the primary group will be prevented from writing any further when the group2 quota is reached. However the culprit user1 with primary group as group1 won't be detected by gpfs, and can just keep going on writing group2 files. As far as the individual user quota, it doesn't matter: group1 or group2 it will be counted towards the usage of that user. It would be interesting if the behavior was more as expected. 
I just checked with my Lustre counter-parts and they tell me whichever secondary group is hit first, however many there may be, the user will be stopped. The problem then becomes identifying which of the secondary groups hit the limit for that user. Jaime > > On Aug 3, 2016, at 11:35 AM, Sven Oehme > > wrote: > > Hi, > > quotas are only counted against primary group > > sven > > > On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > > wrote: > Suppose I want to set both USR and GRP quotas for a user, however > GRP is not the primary group. Will gpfs enforce the secondary group > quota for that user? > > What I mean is, if the user keeps writing files with secondary group > as the attribute, and that overall group quota is reached, will > that user be stopped by gpfs? > > Thanks > Jaime > > > > > ************************************ > TELL US ABOUT YOUR SUCCESS STORIES > http://www.scinethpc.ca/testimonials > ************************************ > --- > Jaime Pinto > SciNet HPC Consortium - Compute/Calcul Canada > www.scinet.utoronto.ca - > www.computecanada.org > University of Toronto > 256 McCaul Street, Room 235 > Toronto, ON, M5T1W5 > P: 416-978-2755 > C: 416-505-1477 > ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. From Kevin.Buterbaugh at Vanderbilt.Edu Wed Aug 3 19:34:21 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 3 Aug 2016 18:34:21 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> Message-ID: <78DAAA7C-C0C2-42C2-B6B9-B5EC6CC3A3F4@vanderbilt.edu> Hi Jaime / Sven, If Jaime?s interpretation is correct about user1 continuing to be able to write to ?group2? files even though that group is at their hard limit, then that?s a bug that needs fixing. I haven?t tested that myself, and we?re in a downtime right now so I?m a tad bit busy, but if I need to I?ll test it on our test cluster later this week. Kevin On Aug 3, 2016, at 1:30 PM, Jaime Pinto > wrote: Quoting "Buterbaugh, Kevin L" >: Hi Sven, Wait - am I misunderstanding something here? Let?s say that I have ?user1? who has primary group ?group1? and secondary group ?group2?. And let?s say that they write to a directory where the bit on the directory forces all files created in that directory to have group2 associated with them. Are you saying that those files still count against group1?s group quota??? Thanks for clarifying? Kevin Not really, My interpretation is that all files written with group2 will count towards the quota on that group. However any users with group2 as the primary group will be prevented from writing any further when the group2 quota is reached. However the culprit user1 with primary group as group1 won't be detected by gpfs, and can just keep going on writing group2 files. As far as the individual user quota, it doesn't matter: group1 or group2 it will be counted towards the usage of that user. It would be interesting if the behavior was more as expected. I just checked with my Lustre counter-parts and they tell me whichever secondary group is hit first, however many there may be, the user will be stopped. The problem then becomes identifying which of the secondary groups hit the limit for that user. 
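One blunt way to narrow that down is to walk the user's group list and pull the group quota report for each one -- a hedged sketch, with the user and file system names purely illustrative:

# check every group the user belongs to against its limits
for g in $(id -Gn user1); do
    mmlsquota -g "$g" gpfs01
done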
Jaime On Aug 3, 2016, at 11:35 AM, Sven Oehme > wrote: Hi, quotas are only counted against primary group sven On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > wrote: Suppose I want to set both USR and GRP quotas for a user, however GRP is not the primary group. Will gpfs enforce the secondary group quota for that user? What I mean is, if the user keeps writing files with secondary group as the attribute, and that overall group quota is reached, will that user be stopped by gpfs? Thanks Jaime ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From jonathan at buzzard.me.uk Wed Aug 3 19:46:54 2016 From: jonathan at buzzard.me.uk (Jonathan Buzzard) Date: Wed, 3 Aug 2016 19:46:54 +0100 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: On 03/08/16 19:06, Buterbaugh, Kevin L wrote: > Hi Sven, > > Wait - am I misunderstanding something here? Let?s say that I have > ?user1? who has primary group ?group1? and secondary group ?group2?. > And let?s say that they write to a directory where the bit on the > directory forces all files created in that directory to have group2 > associated with them. Are you saying that those files still count > against group1?s group quota??? > Yeah, but bastard user from hell over here then does chgrp group1 myevilfile.txt and your set group id bit becomes irrelevant because it is only ever indicative. In fact there is nothing that guarantees the set group id bit is honored because there is nothing stopping the user or a program coming in immediately after the file is created and changing that. Not pointing fingers at the OSX SMB client when Unix extensions are active on a Samba server in any way there. As such Unix group quotas are in the real world a total waste of space. This is if you ask me why XFS and Lustre have project quotas and GPFS has file sets. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. From Kevin.Buterbaugh at Vanderbilt.Edu Wed Aug 3 19:55:01 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 3 Aug 2016 18:55:01 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> Message-ID: JAB, The set group id bit is tangential to my point. I expect GPFS to count any files a user owns against their user quota. If they are a member of multiple groups then I also expect it to count it against the group quota of whatever group is associated with that file. I.e., if they do a chgrp then GPFS should subtract from one group and add to another. Kevin On Aug 3, 2016, at 1:46 PM, Jonathan Buzzard > wrote: On 03/08/16 19:06, Buterbaugh, Kevin L wrote: Hi Sven, Wait - am I misunderstanding something here? 
Let?s say that I have ?user1? who has primary group ?group1? and secondary group ?group2?. And let?s say that they write to a directory where the bit on the directory forces all files created in that directory to have group2 associated with them. Are you saying that those files still count against group1?s group quota??? Yeah, but bastard user from hell over here then does chgrp group1 myevilfile.txt and your set group id bit becomes irrelevant because it is only ever indicative. In fact there is nothing that guarantees the set group id bit is honored because there is nothing stopping the user or a program coming in immediately after the file is created and changing that. Not pointing fingers at the OSX SMB client when Unix extensions are active on a Samba server in any way there. As such Unix group quotas are in the real world a total waste of space. This is if you ask me why XFS and Lustre have project quotas and GPFS has file sets. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From jonathan at buzzard.me.uk Wed Aug 3 20:13:09 2016 From: jonathan at buzzard.me.uk (Jonathan Buzzard) Date: Wed, 3 Aug 2016 20:13:09 +0100 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <78DAAA7C-C0C2-42C2-B6B9-B5EC6CC3A3F4@vanderbilt.edu> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> <78DAAA7C-C0C2-42C2-B6B9-B5EC6CC3A3F4@vanderbilt.edu> Message-ID: <2b823e10-34e8-ce9d-956c-267df4e6042b@buzzard.me.uk> On 03/08/16 19:34, Buterbaugh, Kevin L wrote: > Hi Jaime / Sven, > > If Jaime?s interpretation is correct about user1 continuing to be able > to write to ?group2? files even though that group is at their hard > limit, then that?s a bug that needs fixing. I haven?t tested that > myself, and we?re in a downtime right now so I?m a tad bit busy, but if > I need to I?ll test it on our test cluster later this week. > Even if Jamie's interpretation is wrong it shows the other massive failure of group quotas under Unix and why they are not fit for purpose in the real world. So bufh here can deliberately or accidentally do a denial of service on other users and tracking down the offending user is a right pain in the backside. The point of being able to change group ownership on a file is to indicate the massive weakness of the whole group quota system, and why in my experience nobody actually uses it, and "project" quota options have been implemented in many "enterprise" Unix file systems. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. From Kevin.Buterbaugh at Vanderbilt.Edu Wed Aug 3 20:18:11 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 3 Aug 2016 19:18:11 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? 
In-Reply-To: <2b823e10-34e8-ce9d-956c-267df4e6042b@buzzard.me.uk> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> <78DAAA7C-C0C2-42C2-B6B9-B5EC6CC3A3F4@vanderbilt.edu> <2b823e10-34e8-ce9d-956c-267df4e6042b@buzzard.me.uk> Message-ID: <6B06DA37-321E-4730-A3D1-61E41E4C6187@vanderbilt.edu> JAB, Our scratch filesystem uses user and group quotas. It started out as a traditional scratch filesystem but then we decided (for better or worse) to allow groups to purchase quota on it (and we don?t purge it, as many sites do). We have many users in multiple groups, so if this is not working right it?s a potential issue for us. But you?re right, I?m a nobody? Kevin On Aug 3, 2016, at 2:13 PM, Jonathan Buzzard > wrote: On 03/08/16 19:34, Buterbaugh, Kevin L wrote: Hi Jaime / Sven, If Jaime?s interpretation is correct about user1 continuing to be able to write to ?group2? files even though that group is at their hard limit, then that?s a bug that needs fixing. I haven?t tested that myself, and we?re in a downtime right now so I?m a tad bit busy, but if I need to I?ll test it on our test cluster later this week. Even if Jamie's interpretation is wrong it shows the other massive failure of group quotas under Unix and why they are not fit for purpose in the real world. So bufh here can deliberately or accidentally do a denial of service on other users and tracking down the offending user is a right pain in the backside. The point of being able to change group ownership on a file is to indicate the massive weakness of the whole group quota system, and why in my experience nobody actually uses it, and "project" quota options have been implemented in many "enterprise" Unix file systems. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From oehmes at gmail.com Wed Aug 3 21:32:32 2016 From: oehmes at gmail.com (Sven Oehme) Date: Wed, 3 Aug 2016 13:32:32 -0700 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <6B06DA37-321E-4730-A3D1-61E41E4C6187@vanderbilt.edu> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> <78DAAA7C-C0C2-42C2-B6B9-B5EC6CC3A3F4@vanderbilt.edu> <2b823e10-34e8-ce9d-956c-267df4e6042b@buzzard.me.uk> <6B06DA37-321E-4730-A3D1-61E41E4C6187@vanderbilt.edu> Message-ID: i can't contribute much to the usefulness of tracking primary or secondary group. depending on who you ask you get a 50/50 answer why its great or broken either way. Jonathan explanation was correct, we only track/enforce primary groups , we don't do anything with secondary groups in regards to quotas. if there is 'doubt' of correct quotation of files on the disk in the filesystem one could always run mmcheckquota, its i/o intensive but will match quota usage of the in memory 'assumption' and update it from the actual data thats stored on disk. 
sven On Wed, Aug 3, 2016 at 12:18 PM, Buterbaugh, Kevin L < Kevin.Buterbaugh at vanderbilt.edu> wrote: > JAB, > > Our scratch filesystem uses user and group quotas. It started out as a > traditional scratch filesystem but then we decided (for better or worse) to > allow groups to purchase quota on it (and we don?t purge it, as many sites > do). > > We have many users in multiple groups, so if this is not working right > it?s a potential issue for us. But you?re right, I?m a nobody? > > Kevin > > On Aug 3, 2016, at 2:13 PM, Jonathan Buzzard > wrote: > > On 03/08/16 19:34, Buterbaugh, Kevin L wrote: > > Hi Jaime / Sven, > > If Jaime?s interpretation is correct about user1 continuing to be able > to write to ?group2? files even though that group is at their hard > limit, then that?s a bug that needs fixing. I haven?t tested that > myself, and we?re in a downtime right now so I?m a tad bit busy, but if > I need to I?ll test it on our test cluster later this week. > > > Even if Jamie's interpretation is wrong it shows the other massive failure > of group quotas under Unix and why they are not fit for purpose in the real > world. > > So bufh here can deliberately or accidentally do a denial of service on > other users and tracking down the offending user is a right pain in the > backside. > > The point of being able to change group ownership on a file is to indicate > the massive weakness of the whole group quota system, and why in my > experience nobody actually uses it, and "project" quota options have been > implemented in many "enterprise" Unix file systems. > > JAB. > > -- > Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk > Fife, United Kingdom. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > ? > Kevin Buterbaugh - Senior System Administrator > Vanderbilt University - Advanced Computing Center for Research and > Education > Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From Greg.Lehmann at csiro.au Thu Aug 4 00:03:47 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Wed, 3 Aug 2016 23:03:47 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <20160803125543.11831ypcdi8i189b@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> <20160803125543.11831ypcdi8i189b@support.scinet.utoronto.ca> Message-ID: <762ff4f5796c4992b3bceb23b26fdbf3@exch1-cdc.nexus.csiro.au> The GID selection rules for account creation are Linux distribution specific. It sounds like you are familiar with Red Hat, where I think this idea of GID=UID started. 
sles12sp1-brc:/dev/disk/by-uuid # useradd testout sles12sp1-brc:/dev/disk/by-uuid # grep testout /etc/passwd testout:x:1001:100::/home/testout:/bin/bash sles12sp1-brc:/dev/disk/by-uuid # grep 100 /etc/group users:x:100: sles12sp1-brc:/dev/disk/by-uuid # Cheers, Greg -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Jaime Pinto Sent: Thursday, 4 August 2016 2:56 AM To: gpfsug main discussion list ; Jonathan Buzzard Cc: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] quota on secondary groups for a user? I guess I have a bit of a puzzle to solve, combining quotas on filesets, paths and USR/GRP attributes So much for the "standard" built-in linux account creation script, in which by default every new user is created with primary GID=UID, doesn't really help any of us. Jaime Quoting "Jonathan Buzzard" : > On 03/08/16 17:22, Jaime Pinto wrote: >> Suppose I want to set both USR and GRP quotas for a user, however GRP >> is not the primary group. Will gpfs enforce the secondary group quota >> for that user? > > Nope that's not how POSIX schematics work for group quotas. As far as > I can tell only your primary group is used for group quotas. It > basically makes group quotas in Unix a waste of time in my opinion. At > least I have never come across a real world scenario where they work > in a useful manner. > >> What I mean is, if the user keeps writing files with secondary group >> as the attribute, and that overall group quota is reached, will that >> user be stopped by gpfs? >> > > File sets are the answer to your problems, but retrospectively > applying them to a file system is a pain. You create a file set for a > directory and can then apply a quota to the file set. Even better you > can apply per file set user and group quotas. So if file set A has a > 1TB quota you could limit user X to 100GB in the file set, but outside > the file set they could have a different quota or even no quota. > > Only issue is a limit of ~10,000 file sets per file system > > > JAB. > > -- > Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk > Fife, United Kingdom. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From Greg.Lehmann at csiro.au Thu Aug 4 03:41:55 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Thu, 4 Aug 2016 02:41:55 +0000 Subject: [gpfsug-discuss] 4.2.1 documentation Message-ID: <8033d4a67d9745f4a52f148538423066@exch1-cdc.nexus.csiro.au> I see only 4 pdfs now with slightly different titles to the previous 5 pdfs available with 4.2.0. Just checking there are only supposed to be 4 now? Greg -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From kenneth.waegeman at ugent.be Thu Aug 4 09:13:29 2016 From: kenneth.waegeman at ugent.be (Kenneth Waegeman) Date: Thu, 4 Aug 2016 10:13:29 +0200 Subject: [gpfsug-discuss] 4.2.1 documentation In-Reply-To: <8033d4a67d9745f4a52f148538423066@exch1-cdc.nexus.csiro.au> References: <8033d4a67d9745f4a52f148538423066@exch1-cdc.nexus.csiro.au> Message-ID: <57A2F929.8000003@ugent.be> This is new, it is explained how they are merged at http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1xx_soc.htm Cheers! K On 04/08/16 04:41, Greg.Lehmann at csiro.au wrote: > > I see only 4 pdfs now with slightly different titles to the previous 5 > pdfs available with 4.2.0. Just checking there are only supposed to be > 4 now? > > Greg > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Thu Aug 4 09:13:51 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Thu, 4 Aug 2016 08:13:51 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> Message-ID: 1000 isn't it?! We've always worked on that assumption. -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Jonathan Buzzard Sent: 03 August 2016 17:44 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] quota on secondary groups for a user? in the file set, but outside the file set they could have a different quota or even no quota. Only issue is a limit of ~10,000 file sets per file system JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From r.sobey at imperial.ac.uk Thu Aug 4 09:17:01 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Thu, 4 Aug 2016 08:17:01 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> Message-ID: Ah. Dependent vs independent. (10,000 and 1000 respectively). -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Sobey, Richard A Sent: 04 August 2016 09:14 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] quota on secondary groups for a user? 1000 isn't it?! We've always worked on that assumption. -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Jonathan Buzzard Sent: 03 August 2016 17:44 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] quota on secondary groups for a user? in the file set, but outside the file set they could have a different quota or even no quota. Only issue is a limit of ~10,000 file sets per file system JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. 
_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From st.graf at fz-juelich.de Thu Aug 4 09:20:42 2016 From: st.graf at fz-juelich.de (Stephan Graf) Date: Thu, 4 Aug 2016 10:20:42 +0200 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <891fb362-ac69-2803-3664-1a55087868dc@buzzard.me.uk> Message-ID: <57A2FADA.1060508@fz-juelich.de> Hi! I have tested it with dependent filesets in GPFS 4.1.1.X and there the limit is 10.000. Stephan On 08/04/16 10:13, Sobey, Richard A wrote: > 1000 isn't it?! We've always worked on that assumption. > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Jonathan Buzzard > Sent: 03 August 2016 17:44 > To: gpfsug-discuss at spectrumscale.org > Subject: Re: [gpfsug-discuss] quota on secondary groups for a user? > in the file set, but outside the file set they could have a different quota or even no quota. > > Only issue is a limit of ~10,000 file sets per file system > > > JAB. > -- Stephan Graf Juelich Supercomputing Centre Institute for Advanced Simulation Forschungszentrum Juelich GmbH 52425 Juelich, Germany Phone: +49-2461-61-6578 Fax: +49-2461-61-6656 E-mail: st.graf at fz-juelich.de WWW: http://www.fz-juelich.de/jsc/ ------------------------------------------------------------------------------------------------ ------------------------------------------------------------------------------------------------ Forschungszentrum Juelich GmbH 52425 Juelich Sitz der Gesellschaft: Juelich Eingetragen im Handelsregister des Amtsgerichts Dueren Nr. HR B 3498 Vorsitzender des Aufsichtsrats: MinDir Dr. Karl Eugen Huthmacher Geschaeftsfuehrung: Prof. Dr.-Ing. Wolfgang Marquardt (Vorsitzender), Karsten Beneke (stellv. Vorsitzender), Prof. Dr.-Ing. Harald Bolt, Prof. Dr. Sebastian M. Schmidt ------------------------------------------------------------------------------------------------ ------------------------------------------------------------------------------------------------ From daniel.kidger at uk.ibm.com Thu Aug 4 09:22:36 2016 From: daniel.kidger at uk.ibm.com (Daniel Kidger) Date: Thu, 4 Aug 2016 08:22:36 +0000 Subject: [gpfsug-discuss] 4.2.1 documentation In-Reply-To: <8033d4a67d9745f4a52f148538423066@exch1-cdc.nexus.csiro.au> Message-ID: Yes they have been re arranged. My observation is that the Admin and Advanced Admin have merged into one PDFs, and the DMAPI manual is now a chapter of the new Programming guide (along with the complete set of man pages which have moved out of the Admin guide). Table 3 on page 26 of the Concepts, Planning and Install guide describes these change. IMHO The new format is much better as all Admin is in one place not two. ps. I couldn't find in the programming guide a chapter yet on Light Weight Events. Anyone in product development care to comment? 
:-) Daniel IBM Spectrum Storage Software +44 (0)7818 522266 Sent from my iPad using IBM Verse On 4 Aug 2016, 03:42:21, Greg.Lehmann at csiro.au wrote: From: Greg.Lehmann at csiro.au To: gpfsug-discuss at spectrumscale.org Cc: Date: 4 Aug 2016 03:42:21 Subject: [gpfsug-discuss] 4.2.1 documentation I see only 4 pdfs now with slightly different titles to the previous 5 pdfs available with 4.2.0. Just checking there are only supposed to be 4 now? GregUnless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU -------------- next part -------------- An HTML attachment was scrubbed... URL: From pinto at scinet.utoronto.ca Thu Aug 4 16:59:31 2016 From: pinto at scinet.utoronto.ca (Jaime Pinto) Date: Thu, 04 Aug 2016 11:59:31 -0400 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> Message-ID: <20160804115931.26601tycacksqhcz@support.scinet.utoronto.ca> Since there were inconsistencies in the responses, I decided to rig a couple of accounts/groups on our LDAP to test "My interpretation", and determined that I was wrong. When Kevin mentioned it would mean a bug I had to double-check: If a user hits the hard quota or exceeds the grace period on the soft quota on any of the secondary groups that user will be stopped from further writing to those groups as well, just as in the primary group. I hope this clears the waters a bit. I still have to solve my puzzle. Thanks everyone for the feedback. Jaime Quoting "Jaime Pinto" : > Quoting "Buterbaugh, Kevin L" : > >> Hi Sven, >> >> Wait - am I misunderstanding something here? Let?s say that I have >> ?user1? who has primary group ?group1? and secondary group >> ?group2?. And let?s say that they write to a directory where the >> bit on the directory forces all files created in that directory to >> have group2 associated with them. Are you saying that those >> files still count against group1?s group quota??? >> >> Thanks for clarifying? >> >> Kevin > > Not really, > > My interpretation is that all files written with group2 will count > towards the quota on that group. However any users with group2 as the > primary group will be prevented from writing any further when the > group2 quota is reached. However the culprit user1 with primary group > as group1 won't be detected by gpfs, and can just keep going on writing > group2 files. > > As far as the individual user quota, it doesn't matter: group1 or > group2 it will be counted towards the usage of that user. > > It would be interesting if the behavior was more as expected. I just > checked with my Lustre counter-parts and they tell me whichever > secondary group is hit first, however many there may be, the user will > be stopped. The problem then becomes identifying which of the secondary > groups hit the limit for that user. > > Jaime > > >> >> On Aug 3, 2016, at 11:35 AM, Sven Oehme >> > wrote: >> >> Hi, >> >> quotas are only counted against primary group >> >> sven >> >> >> On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto >> > wrote: >> Suppose I want to set both USR and GRP quotas for a user, however >> GRP is not the primary group. Will gpfs enforce the secondary group >> quota for that user? 
>> >> What I mean is, if the user keeps writing files with secondary >> group as the attribute, and that overall group quota is reached, >> will that user be stopped by gpfs? >> >> Thanks >> Jaime >> >> >> >> >> ************************************ >> TELL US ABOUT YOUR SUCCESS STORIES >> http://www.scinethpc.ca/testimonials >> ************************************ >> --- >> Jaime Pinto >> SciNet HPC Consortium - Compute/Calcul Canada >> www.scinet.utoronto.ca - >> www.computecanada.org >> University of Toronto >> 256 McCaul Street, Room 235 >> Toronto, ON, M5T1W5 >> P: 416-978-2755 >> C: 416-505-1477 >> > > > ---------------------------------------------------------------- > This message was sent using IMP at SciNet Consortium, University of Toronto. > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. From Kevin.Buterbaugh at Vanderbilt.Edu Thu Aug 4 17:08:30 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Thu, 4 Aug 2016 16:08:30 +0000 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <20160804115931.26601tycacksqhcz@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> <20160804115931.26601tycacksqhcz@support.scinet.utoronto.ca> Message-ID: <7C0606E3-37D9-4301-8676-5060A0984FF2@vanderbilt.edu> Hi Jaime, Thank you sooooo much for doing this and reporting back the results! They?re in line with what I would expect to happen. I was going to test this as well, but we have had to extend our downtime until noontime tomorrow, so I haven?t had a chance to do so yet. Now I don?t have to? ;-) Kevin On Aug 4, 2016, at 10:59 AM, Jaime Pinto > wrote: Since there were inconsistencies in the responses, I decided to rig a couple of accounts/groups on our LDAP to test "My interpretation", and determined that I was wrong. When Kevin mentioned it would mean a bug I had to double-check: If a user hits the hard quota or exceeds the grace period on the soft quota on any of the secondary groups that user will be stopped from further writing to those groups as well, just as in the primary group. I hope this clears the waters a bit. I still have to solve my puzzle. Thanks everyone for the feedback. Jaime Quoting "Jaime Pinto" >: Quoting "Buterbaugh, Kevin L" >: Hi Sven, Wait - am I misunderstanding something here? Let?s say that I have ?user1? who has primary group ?group1? and secondary group ?group2?. And let?s say that they write to a directory where the bit on the directory forces all files created in that directory to have group2 associated with them. Are you saying that those files still count against group1?s group quota??? Thanks for clarifying? Kevin Not really, My interpretation is that all files written with group2 will count towards the quota on that group. 
However any users with group2 as the primary group will be prevented from writing any further when the group2 quota is reached. However the culprit user1 with primary group as group1 won't be detected by gpfs, and can just keep going on writing group2 files. As far as the individual user quota, it doesn't matter: group1 or group2 it will be counted towards the usage of that user. It would be interesting if the behavior was more as expected. I just checked with my Lustre counter-parts and they tell me whichever secondary group is hit first, however many there may be, the user will be stopped. The problem then becomes identifying which of the secondary groups hit the limit for that user. Jaime On Aug 3, 2016, at 11:35 AM, Sven Oehme > wrote: Hi, quotas are only counted against primary group sven On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > wrote: Suppose I want to set both USR and GRP quotas for a user, however GRP is not the primary group. Will gpfs enforce the secondary group quota for that user? What I mean is, if the user keeps writing files with secondary group as the attribute, and that overall group quota is reached, will that user be stopped by gpfs? Thanks Jaime ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From pinto at scinet.utoronto.ca Thu Aug 4 17:34:09 2016 From: pinto at scinet.utoronto.ca (Jaime Pinto) Date: Thu, 04 Aug 2016 12:34:09 -0400 Subject: [gpfsug-discuss] quota on secondary groups for a user? 
In-Reply-To: <7C0606E3-37D9-4301-8676-5060A0984FF2@vanderbilt.edu> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca> <20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca> <20160804115931.26601tycacksqhcz@support.scinet.utoronto.ca> <7C0606E3-37D9-4301-8676-5060A0984FF2@vanderbilt.edu> Message-ID: <20160804123409.18403cy3iz123gxt@support.scinet.utoronto.ca> OK More info: Users can apply the 'sg group1' or 'sq group2' command from a shell or script to switch the group mask from that point on, and dodge the quota that may have been exceeded on a group. However, as the group owner or other member of the group on the limit, I could not find a tool they can use on their own to find out who is(are) the largest user(s); 'du' takes too long, and some users don't give read permissions on their directories. As part of the puzzle solution I have to come up with a root wrapper that can make the contents of the mmrepquota report available to them. Jaime Quoting "Buterbaugh, Kevin L" : > Hi Jaime, > > Thank you sooooo much for doing this and reporting back the results! > They?re in line with what I would expect to happen. I was going > to test this as well, but we have had to extend our downtime until > noontime tomorrow, so I haven?t had a chance to do so yet. Now I > don?t have to? ;-) > > Kevin > > On Aug 4, 2016, at 10:59 AM, Jaime Pinto > > wrote: > > Since there were inconsistencies in the responses, I decided to rig > a couple of accounts/groups on our LDAP to test "My interpretation", > and determined that I was wrong. When Kevin mentioned it would mean > a bug I had to double-check: > > If a user hits the hard quota or exceeds the grace period on the > soft quota on any of the secondary groups that user will be stopped > from further writing to those groups as well, just as in the primary > group. > > I hope this clears the waters a bit. I still have to solve my puzzle. > > Thanks everyone for the feedback. > Jaime > > > > Quoting "Jaime Pinto" > >: > > Quoting "Buterbaugh, Kevin L" > >: > > Hi Sven, > > Wait - am I misunderstanding something here? Let?s say that I have > ?user1? who has primary group ?group1? and secondary group > ?group2?. And let?s say that they write to a directory where the > bit on the directory forces all files created in that directory to > have group2 associated with them. Are you saying that those files > still count against group1?s group quota??? > > Thanks for clarifying? > > Kevin > > Not really, > > My interpretation is that all files written with group2 will count > towards the quota on that group. However any users with group2 as the > primary group will be prevented from writing any further when the > group2 quota is reached. However the culprit user1 with primary group > as group1 won't be detected by gpfs, and can just keep going on writing > group2 files. > > As far as the individual user quota, it doesn't matter: group1 or > group2 it will be counted towards the usage of that user. > > It would be interesting if the behavior was more as expected. I just > checked with my Lustre counter-parts and they tell me whichever > secondary group is hit first, however many there may be, the user will > be stopped. The problem then becomes identifying which of the secondary > groups hit the limit for that user. 
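For what it's worth, a minimal sketch of the kind of root wrapper mentioned above, which would also let a group member see which of their groups is at its limit. The script name, file system name, sudoers line and the mmrepquota parsing are all assumptions, not a tested tool:

#!/bin/bash
# /usr/local/sbin/show_group_quota - hypothetical wrapper exposed to users via sudo
# Prints the mmrepquota -g lines for every group the calling user belongs to.
FS=gpfs01                                # example file system name
CALLER=${SUDO_USER:-$USER}
REPORT=$(/usr/lpp/mmfs/bin/mmrepquota -g "$FS")
for g in $(id -Gn "$CALLER"); do
    echo "$REPORT" | awk -v grp="$g" '$1 == grp'
done
# sudoers entry (example): %users ALL=(root) NOPASSWD: /usr/local/sbin/show_group_quota

Anything fancier (grace columns, per-fileset reports) is just more awk over the same output.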
> > Jaime > > > > On Aug 3, 2016, at 11:35 AM, Sven Oehme > > > wrote: > > Hi, > > quotas are only counted against primary group > > sven > > > On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > > > wrote: > Suppose I want to set both USR and GRP quotas for a user, however > GRP is not the primary group. Will gpfs enforce the secondary group > quota for that user? > > What I mean is, if the user keeps writing files with secondary > group as the attribute, and that overall group quota is reached, > will that user be stopped by gpfs? > > Thanks > Jaime > > > > > ************************************ > TELL US ABOUT YOUR SUCCESS STORIES > http://www.scinethpc.ca/testimonials > ************************************ > --- > Jaime Pinto > SciNet HPC Consortium - Compute/Calcul Canada > www.scinet.utoronto.ca - > www.computecanada.org > University of Toronto > 256 McCaul Street, Room 235 > Toronto, ON, M5T1W5 > P: 416-978-2755 > C: 416-505-1477 > > > > ---------------------------------------------------------------- > This message was sent using IMP at SciNet Consortium, University of Toronto. > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > > > ************************************ > TELL US ABOUT YOUR SUCCESS STORIES > http://www.scinethpc.ca/testimonials > ************************************ > --- > Jaime Pinto > SciNet HPC Consortium - Compute/Calcul Canada > www.scinet.utoronto.ca - > www.computecanada.org > University of Toronto > 256 McCaul Street, Room 235 > Toronto, ON, M5T1W5 > P: 416-978-2755 > C: 416-505-1477 > > ---------------------------------------------------------------- > This message was sent using IMP at SciNet Consortium, University of Toronto. > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ? > Kevin Buterbaugh - Senior System Administrator > Vanderbilt University - Advanced Computing Center for Research and Education > Kevin.Buterbaugh at vanderbilt.edu - > (615)875-9633 > > > > ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. From Kevin.Buterbaugh at Vanderbilt.Edu Wed Aug 10 22:00:26 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 10 Aug 2016 21:00:26 +0000 Subject: [gpfsug-discuss] User group meeting at SC16? Message-ID: Hi All, Just got an e-mail from DDN announcing that they are holding their user group meeting at SC16 on Monday afternoon like they always do, which is prompting me to inquire if IBM is going to be holding a meeting at SC16? Last year in Austin the IBM meeting was on Sunday afternoon, which worked out great as far as I was concerned. Thanks? ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From Robert.Oesterlin at nuance.com Wed Aug 10 22:04:11 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Wed, 10 Aug 2016 21:04:11 +0000 Subject: [gpfsug-discuss] User group meeting at SC16? In-Reply-To: References: Message-ID: <95126B16-B4DB-4406-862B-AA81E37F04E6@nuance.com> We're still trying to schedule that - The thinking right now is staying where last year. (Sunday afternoon) There is never a perfect time at these sorts of event - bound to step on something! If anyone has feedback (positive or negative) - let us know. Look for a formal announcement in early September. Bob Oesterlin GPFS-UG Co-Principal Sr Storage Engineer, Nuance HPC Grid From: on behalf of "Buterbaugh, Kevin L" Reply-To: gpfsug main discussion list Date: Wednesday, August 10, 2016 at 4:00 PM To: gpfsug main discussion list Subject: [EXTERNAL] [gpfsug-discuss] User group meeting at SC16? Hi All, Just got an e-mail from DDN announcing that they are holding their user group meeting at SC16 on Monday afternoon like they always do, which is prompting me to inquire if IBM is going to be holding a meeting at SC16? Last year in Austin the IBM meeting was on Sunday afternoon, which worked out great as far as I was concerned. Thanks? ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From malone12 at illinois.edu Wed Aug 10 22:43:15 2016 From: malone12 at illinois.edu (Maloney, John Daniel) Date: Wed, 10 Aug 2016 21:43:15 +0000 Subject: [gpfsug-discuss] User group meeting at SC16? Message-ID: <4AD486D7-D452-465A-85EC-1BDDE2C5DCFD@illinois.edu> Hi Bob, Thanks for the update! The couple storage folks from NCSA going to SC16 won?t be available Sunday (I?m not able to get in until Monday morning). Agree completely there is never a perfect time, just giving our feedback. Thanks again, J.D. Maloney Storage Engineer | Storage Enabling Technologies Group National Center for Supercomputing Applications (NCSA) From: > on behalf of "Oesterlin, Robert" > Reply-To: gpfsug main discussion list > Date: Wednesday, August 10, 2016 at 4:04 PM To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] User group meeting at SC16? We're still trying to schedule that - The thinking right now is staying where last year. (Sunday afternoon) There is never a perfect time at these sorts of event - bound to step on something! If anyone has feedback (positive or negative) - let us know. Look for a formal announcement in early September. Bob Oesterlin GPFS-UG Co-Principal Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Buterbaugh, Kevin L" > Reply-To: gpfsug main discussion list > Date: Wednesday, August 10, 2016 at 4:00 PM To: gpfsug main discussion list > Subject: [EXTERNAL] [gpfsug-discuss] User group meeting at SC16? Hi All, Just got an e-mail from DDN announcing that they are holding their user group meeting at SC16 on Monday afternoon like they always do, which is prompting me to inquire if IBM is going to be holding a meeting at SC16? Last year in Austin the IBM meeting was on Sunday afternoon, which worked out great as far as I was concerned. Thanks? ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From aaron.s.knister at nasa.gov Thu Aug 11 05:47:17 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Thu, 11 Aug 2016 00:47:17 -0400 Subject: [gpfsug-discuss] GPFS and SELinux Message-ID: Hi Everyone, I'm passing this along on behalf of one of our security guys. Just wondering what feedback/thoughts others have on the topic. Current IBM guidance on GPFS and SELinux indicates that the default context for services (initrc_t) is insufficient for GPFS operations. See: https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/General+Parallel+File+System+(GPFS)/page/Using+GPFS+with+SElinux That part is true (by design), but IBM goes further to say use runcon out of rc.local and configure the gpfs service to not start via init. I believe these latter two (rc.local/runcon and no-init) can be addressed, relatively trivially, through the application of a small selinux policy. Ideally, I would hope for IBM to develop, test, and send out the policy, but I'm happy to offer the following suggestions. I believe "a)" could be developed in a relatively short period of time. "b)" would take more time, effort and experience. a) consider SELinux context transition. As an example, consider: https://github.com/TresysTechnology/refpolicy/tree/master/policy/modules/services (specifically, the ssh components) On a normal centOS/RHEL system sshd has the file context of sshd_exec_t, and runs under sshd_t Referencing ssh.te, you see several references to sshd_exec_t in: domtrans_pattern init_daemon_domain daemontools_service_domain (and so on) These configurations allow init to fire sshd off, setting its runtime context to sshd_t, based on the file context of sshd_exec_t. This should be duplicable for the gpfs daemon, altho I note it seems to be fired through a layer of abstraction in mmstartup. A simple policy that allows INIT to transition GPFS to unconfined_t would go a long way towards easing integration. b) file contexts of gpfs_daemon_t and gpfs_util_t, perhaps, that when executed, would pick up a context of gpfs_t? Which then could be mapped through standard SELinux policy to allow access to configuration files (gpfs_etc_t?), block devices, etc? I admit, in b, I am speculating heavily. -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From janfrode at tanso.net Thu Aug 11 10:54:27 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Thu, 11 Aug 2016 11:54:27 +0200 Subject: [gpfsug-discuss] GPFS and SELinux In-Reply-To: References: Message-ID: I believe the runcon part is no longer necessary, at least on my RHEL7 based systems mmfsd is running unconfined by default: [root at flexscale01 ~]# ps -efZ|grep mmfsd unconfined_u:unconfined_r:unconfined_t:s0-s0:c0.c1023 root 18018 17709 0 aug.05 ? 00:24:53 /usr/lpp/mmfs/bin/mmfsd and I've never seen any problems with that for base GPFS. I suspect doing a proper selinux domain for GPFS will be quite close to unconfined, so maybe not worth the effort... -jf On Thu, Aug 11, 2016 at 6:47 AM, Aaron Knister wrote: > Hi Everyone, > > I'm passing this along on behalf of one of our security guys. Just > wondering what feedback/thoughts others have on the topic. > > > Current IBM guidance on GPFS and SELinux indicates that the default > context for services (initrc_t) is insufficient for GPFS operations. > > See: > https://www.ibm.com/developerworks/community/wikis/home? 
> lang=en#!/wiki/General+Parallel+File+System+(GPFS)/ > page/Using+GPFS+with+SElinux > > > That part is true (by design), but IBM goes further to say use runcon > out of rc.local and configure the gpfs service to not start via init. > > I believe these latter two (rc.local/runcon and no-init) can be > addressed, relatively trivially, through the application of a small > selinux policy. > > Ideally, I would hope for IBM to develop, test, and send out the policy, > but I'm happy to offer the following suggestions. I believe "a)" could > be developed in a relatively short period of time. "b)" would take more > time, effort and experience. > > a) consider SELinux context transition. > > As an example, consider: > https://github.com/TresysTechnology/refpolicy/tree/master/ > policy/modules/services > > > (specifically, the ssh components) > > On a normal centOS/RHEL system sshd has the file context of sshd_exec_t, > and runs under sshd_t > > Referencing ssh.te, you see several references to sshd_exec_t in: > domtrans_pattern > init_daemon_domain > daemontools_service_domain > (and so on) > > These configurations allow init to fire sshd off, setting its runtime > context to sshd_t, based on the file context of sshd_exec_t. > > This should be duplicable for the gpfs daemon, altho I note it seems to > be fired through a layer of abstraction in mmstartup. > > A simple policy that allows INIT to transition GPFS to unconfined_t > would go a long way towards easing integration. > > b) file contexts of gpfs_daemon_t and gpfs_util_t, perhaps, that when > executed, would pick up a context of gpfs_t? Which then could be mapped > through standard SELinux policy to allow access to configuration files > (gpfs_etc_t?), block devices, etc? > > I admit, in b, I am speculating heavily. > > > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From douglasof at us.ibm.com Fri Aug 12 20:40:27 2016 From: douglasof at us.ibm.com (Douglas O'flaherty) Date: Fri, 12 Aug 2016 19:40:27 +0000 Subject: [gpfsug-discuss] HPCwire Readers Choice Message-ID: Reminder... Get your stories in today! To view this email in your browser, click here. Last Call for Readers' Choice Award Nominations! Deadline: Friday, August 12th at 11:50pm! Only 3 days left until nominations for the 2016 HPCwire Readers' Choice Awards come to a close! Be sure to submit your picks for the best in HPC and make your voice heard before it's too late! These annual awards are a way for our community to recognize the best and brightest innovators within the global HPC community. Time is running out for you to nominate what you think are the greatest achievements in HPC for 2016, so cast your ballot today! 
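Coming back to the SELinux question earlier in this digest: short of writing the proper gpfs_t domain and transition policy Aaron sketches, a cruder but quick alternative is to let audit2allow generate a small local module from the actual AVC denials. A rough sequence, with a made-up module name, assuming auditd is running and SELinux was left in permissive mode long enough to capture the denials:

# build and load a local policy module from the mmfsd denials
ausearch -m avc -c mmfsd | audit2allow -M gpfs_local   # writes gpfs_local.te and gpfs_local.pp
semodule -i gpfs_local.pp                              # load the generated module
ps -efZ | grep mmfsd                                   # check which context the daemon ends up in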
The 2016 Categories Include the Following: * Best Use of HPC Application in Life Sciences * Best Use of HPC Application in Manufacturing * Best Use of HPC Application in Energy (previously 'Oil and Gas') * Best Use of HPC in Automotive * Best Use of HPC in Financial Services * Best Use of HPC in Entertainment * Best Use of HPC in the Cloud * Best Use of High Performance Data Analytics * Best Implementation of Energy-Efficient HPC * Best HPC Server Product or Technology * Best HPC Storage Product or Technology * Best HPC Software Product or Technology * Best HPC Visualization Product or Technology * Best HPC Interconnect Product or Technology * Best HPC Cluster Solution or Technology * Best Data-Intensive System (End-User Focused) * Best HPC Collaboration Between Government & Industry * Best HPC Collaboration Between Academia & Industry * Top Supercomputing Achievement * Top 5 New Products or Technologies to Watch * Top 5 Vendors to Watch * Workforce Diversity Leadership Award * Outstanding Leadership in HPC Nominations are accepted from readers, users, vendors - virtually anyone who is connected to the HPC community and is a reader of HPCwire. Nominations will close on August 12, 2016 at 11:59pm. Make your voice heard! Help tell the story of HPC in 2016 by submitting your nominations for the HPCwire Readers' Choice Awards now! Nominations close on August 12, 2016. All nominations are subject to review by the editors of HPCwire with only the most relevant being accepted. Voting begins August 22, 2015. The final presentation of these prestigious and highly anticipated awards to each organization's leading executives will take place live during SC '16 in Salt Lake City, UT. The finalist(s) in each category who receive the most votes will win this year's awards. Open to HPCwire readers only. HPCwire Subscriber Services This email was sent to lwestoby at us.ibm.com. You are receiving this email message as an HPCwire subscriber. To forward this email to a friend, click here. Unsubscribe from this list. Copyright ? 2016 Tabor Communications Inc. All rights reserved. 8445 Camino Santa Fe San Diego, California 92121 P: 858.625.0070 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 40078 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: not available Type: image/gif Size: 43 bytes Desc: not available URL: From r.sobey at imperial.ac.uk Mon Aug 15 10:59:34 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Mon, 15 Aug 2016 09:59:34 +0000 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? Message-ID: Hi all, If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger it over a few days, perhaps up to 2 weeks or will I run into problems if they're on different versions? Cheers Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Mon Aug 15 12:22:31 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Mon, 15 Aug 2016 11:22:31 +0000 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? In-Reply-To: References: Message-ID: In general, yes, it's common practice to do the 'rolling upgrades'. If I had to do my whole cluster at once, with an outage, I'd probably never upgrade. :) Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: on behalf of "Sobey, Richard A" Reply-To: gpfsug main discussion list Date: Monday, August 15, 2016 at 4:59 AM To: "'gpfsug-discuss at spectrumscale.org'" Subject: [EXTERNAL] [gpfsug-discuss] Minor GPFS versions coexistence problems? Hi all, If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger it over a few days, perhaps up to 2 weeks or will I run into problems if they're on different versions? Cheers Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From Kevin.Buterbaugh at Vanderbilt.Edu Mon Aug 15 13:45:25 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Mon, 15 Aug 2016 12:45:25 +0000 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? In-Reply-To: References: Message-ID: <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu> Richard, I will second what Bob said with one caveat - on one occasion we had an issue with our multi-cluster setup because the PTF's were incompatible. However, that was clearly documented in the release notes, which we obviously hadn't read carefully enough.
While we generally do rolling upgrades over a two to three week period, we have run for months with clients at differing PTF levels. HTHAL? Kevin On Aug 15, 2016, at 6:22 AM, Oesterlin, Robert > wrote: In general, yes, it's common practice to do the 'rolling upgrades'. If I had to do my whole cluster at once, with an outage, I'd probably never upgrade. :) Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Sobey, Richard A" > Reply-To: gpfsug main discussion list > Date: Monday, August 15, 2016 at 4:59 AM To: "'gpfsug-discuss at spectrumscale.org'" > Subject: [EXTERNAL] [gpfsug-discuss] Minor GPFS versions coexistence problems? Hi all, If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger it over a few days, perhaps up to 2 weeks or will I run into problems if they?re on different versions? Cheers Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Mon Aug 15 13:58:47 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Mon, 15 Aug 2016 12:58:47 +0000 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? In-Reply-To: <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu> References: <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu> Message-ID: Thanks Kevin and Bob. PTF = minor version? I can?t think what it might stand for. Something Time Fix? Point in time fix? From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: 15 August 2016 13:45 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Minor GPFS versions coexistence problems? Richard, I will second what Bob said with one caveat ? on one occasion we had an issue with our multi-cluster setup because the PTF?s were incompatible. However, that was clearly documented in the release notes, which we obviously hadn?t read carefully enough. While we generally do rolling upgrades over a two to three week period, we have run for months with clients at differing PTF levels. HTHAL? Kevin On Aug 15, 2016, at 6:22 AM, Oesterlin, Robert > wrote: In general, yes, it's common practice to do the 'rolling upgrades'. If I had to do my whole cluster at once, with an outage, I'd probably never upgrade. :) Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Sobey, Richard A" > Reply-To: gpfsug main discussion list > Date: Monday, August 15, 2016 at 4:59 AM To: "'gpfsug-discuss at spectrumscale.org'" > Subject: [EXTERNAL] [gpfsug-discuss] Minor GPFS versions coexistence problems? Hi all, If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger it over a few days, perhaps up to 2 weeks or will I run into problems if they?re on different versions? Cheers Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? 
Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From jamiedavis at us.ibm.com Mon Aug 15 14:02:13 2016 From: jamiedavis at us.ibm.com (James Davis) Date: Mon, 15 Aug 2016 13:02:13 +0000 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? In-Reply-To: References: , <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu> Message-ID: An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Mon Aug 15 14:05:01 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Mon, 15 Aug 2016 13:05:01 +0000 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? Message-ID: <28479088-C492-4441-A761-F49E1556E13E@nuance.com> PTF = Program Temporary Fix. IBM-Speak for a fix for a particular problem. Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: on behalf of "Sobey, Richard A" Reply-To: gpfsug main discussion list Date: Monday, August 15, 2016 at 7:58 AM To: gpfsug main discussion list Subject: [EXTERNAL] Re: [gpfsug-discuss] Minor GPFS versions coexistence problems? Thanks Kevin and Bob. PTF = minor version? I can?t think what it might stand for. Something Time Fix? Point in time fix? From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: 15 August 2016 13:45 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Minor GPFS versions coexistence problems? Richard, I will second what Bob said with one caveat ? on one occasion we had an issue with our multi-cluster setup because the PTF?s were incompatible. However, that was clearly documented in the release notes, which we obviously hadn?t read carefully enough. While we generally do rolling upgrades over a two to three week period, we have run for months with clients at differing PTF levels. HTHAL? Kevin On Aug 15, 2016, at 6:22 AM, Oesterlin, Robert > wrote: In general, yes, it's common practice to do the 'rolling upgrades'. If I had to do my whole cluster at once, with an outage, I'd probably never upgrade. :) Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Sobey, Richard A" > Reply-To: gpfsug main discussion list > Date: Monday, August 15, 2016 at 4:59 AM To: "'gpfsug-discuss at spectrumscale.org'" > Subject: [EXTERNAL] [gpfsug-discuss] Minor GPFS versions coexistence problems? Hi all, If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger it over a few days, perhaps up to 2 weeks or will I run into problems if they?re on different versions? Cheers Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From kdball at us.ibm.com Mon Aug 15 15:12:07 2016 From: kdball at us.ibm.com (Keith D Ball) Date: Mon, 15 Aug 2016 14:12:07 +0000 Subject: [gpfsug-discuss] gpfsug-discuss Digest, Vol 55, Issue 16 In-Reply-To: References: Message-ID: An HTML attachment was scrubbed... 
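On the mixed-version point, it can be handy during a rolling upgrade to see exactly what each node is running and what level the cluster has been committed to. A rough sketch (assumes mmdsh is usable in your cluster and that the package is named gpfs.base on your distro):

# per-node daemon and package levels, run from a node with admin access to all others
mmdsh -N all "/usr/lpp/mmfs/bin/mmdiag --version"
mmdsh -N all "rpm -q gpfs.base" | sort
# the cluster-wide compatibility level only moves when you deliberately run mmchconfig release=LATEST
mmlsconfig minReleaseLevel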
URL: From jake.carroll at uq.edu.au Mon Aug 15 22:08:58 2016 From: jake.carroll at uq.edu.au (Jake Carroll) Date: Mon, 15 Aug 2016 21:08:58 +0000 Subject: [gpfsug-discuss] More on AFM cache chaining Message-ID: <94AB3BCD-B551-4F3E-9128-65B582A4ABC6@uq.edu.au> Hi there. In the spirit of a conversation a friend showed me a couple of weeks ago from Radhika Parameswaran and Luke Raimbach, we?re doing something similar to Luke (kind of), or at least attempting it, in regards to cache chaining. We?ve got a large research storage platform in Brisbane, Queensland, Australia and we?re trying to leverage a few different modes of operation. Currently: Cache A (IW) connects to what would be a Home (B) which then is effectively an NFS mount to (C) a DMF based NFS export. To a point, this works. It kind of allows us to use ?home? as the ultimate sink, and data migration in and out of DMF seems to be working nicely when GPFS pulls things from (B) which don?t appear to currently be in (A) due to policy, or a HWM was hit (thus emptying cache). We?ve tested it as far out as the data ONLY being offline in tape media inside (C) and it still works, cleanly coming back to (A) within a very reasonable time-frame. ? We hit ?problem 1? which is in and around NFS v4 ACL?s which aren?t surfacing or mapping correctly (as we?d expect). I guess this might be the caveat of trying to backend the cache to a home and have it sitting inside DMF (over an NFS Export) for surfacing of the data for clients. Where we?d like to head: We haven?t seen it yet, but as Luke and Radhika were discussing last month, we really liked the idea of an IW Cache (A, where instruments dump huge data) which then via AFM ends up at (B) (might also be technically ?home? but IW) which is then also a function of (C) which might also be another cache that sits next to a HPC platform for reading and writing data into quickly and out of in parallel. We like the idea of chained caches because it gives us extremely flexibility in the premise of our ?Data anywhere? fabric. We appreciate that this has some challenges, in that we know if you?ve got multiple IW scenarios the last write will always win ? this we can control with workload guidelines. But we?d like to add our voices to this idea of having caches chained all the way back to some point such that data is being pulled all the way from C --> B --> A and along the way, inflection points of IO might be written and read at point C and point B AND point A such that everyone would see the distribution and consistent data in the end. We?re also working on surfacing data via object and file simultaneously for different needs. This is coming along relatively well, but we?re still learning about where and where this does not make sense so far. A moving target, from how it all appears on the surface. Some might say that is effectively asking for a globally eventually (always) consistent filesystem within Scale?. Anyway ? just some thoughts. Regards, -jc -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From aaron.s.knister at nasa.gov Tue Aug 16 03:22:17 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 15 Aug 2016 22:22:17 -0400 Subject: [gpfsug-discuss] mmfsadm test pit Message-ID: I just discovered this interesting gem poking at mmfsadm: test pit fsname list|suspend|status|resume|stop [jobId] There have been times where I've kicked off a restripe and either intentionally or accidentally ctrl-c'd it only to realize that many times it's disappeared into the ether and is still running. The only way I've known so far to stop it is with a chgmgr. A far more painful instance happened when I ran a rebalance on an fs w/more than 31 nsds using more than 31 pit workers and hit *that* fun APAR which locked up access for a single filesystem to all 3.5k nodes. We spent 48 hours round the clock rebooting nodes as jobs drained to clear it up. I would have killed in that instance for a way to cancel the PIT job (the chmgr trick didn't work). It looks like you might actually be able to do this with mmfsadm, although how wise this is, I do not know (kinda curious about that). Here's an example. I kicked off a restripe and then ctrl-c'd it on a client node. Then ran these commands from the fs manager: root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list JobId 785979015170 PitJobStatus PIT_JOB_RUNNING progress 0.00 debug: statusListP D40E2C70 root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal stop 785979015170 debug: statusListP 0 root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list JobId 785979015170 PitJobStatus PIT_JOB_STOPPING progress 4.01 debug: statusListP D4013E70 ... some time passes ... root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list debug: statusListP 0 Interesting. -Aaron -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From volobuev at us.ibm.com Tue Aug 16 16:21:13 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Tue, 16 Aug 2016 08:21:13 -0700 Subject: [gpfsug-discuss] 4.2.1 documentation In-Reply-To: References: <8033d4a67d9745f4a52f148538423066@exch1-cdc.nexus.csiro.au> Message-ID: Light Weight Event support is not fully baked yet, and thus not documented. It's getting there. yuri From: "Daniel Kidger" To: "gpfsug main discussion list" , Cc: "gpfsug-discuss" Date: 08/04/2016 01:23 AM Subject: Re: [gpfsug-discuss] 4.2.1 documentation Sent by: gpfsug-discuss-bounces at spectrumscale.org Yes they have been re arranged. My observation is that the Admin and Advanced Admin have merged into one PDFs, and the DMAPI manual is now a chapter of the new Programming guide (along with the complete set of man pages which have moved out of the Admin guide). Table 3 on page 26 of the Concepts, Planning and Install guide describes these change. IMHO The new format is much better as all Admin is in one place not two. ps. I couldn't find in the programming guide a chapter yet on Light Weight Events. Anyone in product development care to comment? :-) Daniel IBM Spectrum Storage Software +44 (0)7818 522266 Sent from my iPad using IBM Verse On 4 Aug 2016, 03:42:21, Greg.Lehmann at csiro.au wrote: From: Greg.Lehmann at csiro.au To: gpfsug-discuss at spectrumscale.org Cc: Date: 4 Aug 2016 03:42:21 Subject: [gpfsug-discuss] 4.2.1 documentation I see only 4 pdfs now with slightly different titles to the previous 5 pdfs available with 4.2.0. Just checking there are only supposed to be 4 now? 
Greg Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From volobuev at us.ibm.com Tue Aug 16 16:42:33 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Tue, 16 Aug 2016 08:42:33 -0700 Subject: [gpfsug-discuss] quota on secondary groups for a user? In-Reply-To: <20160804123409.18403cy3iz123gxt@support.scinet.utoronto.ca> References: <20160803122227.13743yk89dn1dbur@support.scinet.utoronto.ca><20160803143008.11673qj876dtqjw0@support.scinet.utoronto.ca><20160804115931.26601tycacksqhcz@support.scinet.utoronto.ca><7C0606E3-37D9-4301-8676-5060A0984FF2@vanderbilt.edu> <20160804123409.18403cy3iz123gxt@support.scinet.utoronto.ca> Message-ID: This is a long discussion thread, touching on several related subjects, but as far as the original "secondary groups" question, things are quite simple. A file in a Unix file system has an owning user and an owning group. Those are two IDs that are stored in the inode on disk, and those IDs are used to charge the corresponding user and group quotas. Exactly how the owning GID gets set is an entirely separate question. It may be the current user's primary group, or a secondary group, or a result of chown, etc. To GPFS code it doesn't matter what supplementary GIDs a given thread has in its security context for the purposes of charging group quota, the only thing that matters is the GID in the file inode. yuri From: "Jaime Pinto" To: "gpfsug main discussion list" , "Buterbaugh, Kevin L" , Date: 08/04/2016 09:34 AM Subject: Re: [gpfsug-discuss] quota on secondary groups for a user? Sent by: gpfsug-discuss-bounces at spectrumscale.org OK More info: Users can apply the 'sg group1' or 'sq group2' command from a shell or script to switch the group mask from that point on, and dodge the quota that may have been exceeded on a group. However, as the group owner or other member of the group on the limit, I could not find a tool they can use on their own to find out who is(are) the largest user(s); 'du' takes too long, and some users don't give read permissions on their directories. As part of the puzzle solution I have to come up with a root wrapper that can make the contents of the mmrepquota report available to them. Jaime Quoting "Buterbaugh, Kevin L" : > Hi Jaime, > > Thank you sooooo much for doing this and reporting back the results! > They?re in line with what I would expect to happen. I was going > to test this as well, but we have had to extend our downtime until > noontime tomorrow, so I haven?t had a chance to do so yet. Now I > don?t have to? ;-) > > Kevin > > On Aug 4, 2016, at 10:59 AM, Jaime Pinto > > wrote: > > Since there were inconsistencies in the responses, I decided to rig > a couple of accounts/groups on our LDAP to test "My interpretation", > and determined that I was wrong. 
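A quick way to convince yourself that it is the file's owning GID, not the writer's primary group, that gets charged (the paths, group names and sizes below are just examples):

# write some data, then change the file's group; the usage follows the owning GID
dd if=/dev/zero of=/gpfs/gpfs01/demo/bigfile bs=1M count=512
ls -ln /gpfs/gpfs01/demo/bigfile          # numeric owning UID/GID as stored in the inode
chgrp group2 /gpfs/gpfs01/demo/bigfile    # the blocks are now charged to group2
mmlsquota -g group1 gpfs01
mmlsquota -g group2 gpfs01                # in-doubt values may take a moment to settle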
When Kevin mentioned it would mean > a bug I had to double-check: > > If a user hits the hard quota or exceeds the grace period on the > soft quota on any of the secondary groups that user will be stopped > from further writing to those groups as well, just as in the primary > group. > > I hope this clears the waters a bit. I still have to solve my puzzle. > > Thanks everyone for the feedback. > Jaime > > > > Quoting "Jaime Pinto" > >: > > Quoting "Buterbaugh, Kevin L" > >: > > Hi Sven, > > Wait - am I misunderstanding something here? Let?s say that I have > ?user1? who has primary group ?group1? and secondary group > ?group2?. And let?s say that they write to a directory where the > bit on the directory forces all files created in that directory to > have group2 associated with them. Are you saying that those files > still count against group1?s group quota??? > > Thanks for clarifying? > > Kevin > > Not really, > > My interpretation is that all files written with group2 will count > towards the quota on that group. However any users with group2 as the > primary group will be prevented from writing any further when the > group2 quota is reached. However the culprit user1 with primary group > as group1 won't be detected by gpfs, and can just keep going on writing > group2 files. > > As far as the individual user quota, it doesn't matter: group1 or > group2 it will be counted towards the usage of that user. > > It would be interesting if the behavior was more as expected. I just > checked with my Lustre counter-parts and they tell me whichever > secondary group is hit first, however many there may be, the user will > be stopped. The problem then becomes identifying which of the secondary > groups hit the limit for that user. > > Jaime > > > > On Aug 3, 2016, at 11:35 AM, Sven Oehme > > > wrote: > > Hi, > > quotas are only counted against primary group > > sven > > > On Wed, Aug 3, 2016 at 9:22 AM, Jaime Pinto > < mailto:pinto at scinet.utoronto.ca>> > wrote: > Suppose I want to set both USR and GRP quotas for a user, however > GRP is not the primary group. Will gpfs enforce the secondary group > quota for that user? > > What I mean is, if the user keeps writing files with secondary > group as the attribute, and that overall group quota is reached, > will that user be stopped by gpfs? > > Thanks > Jaime > > > > > ************************************ > TELL US ABOUT YOUR SUCCESS STORIES > http://www.scinethpc.ca/testimonials > ************************************ > --- > Jaime Pinto > SciNet HPC Consortium - Compute/Calcul Canada > www.scinet.utoronto.ca< http://www.scinet.utoronto.ca/> - > www.computecanada.org< http://www.computecanada.org/> > University of Toronto > 256 McCaul Street, Room 235 > Toronto, ON, M5T1W5 > P: 416-978-2755 > C: 416-505-1477 > > > > ---------------------------------------------------------------- > This message was sent using IMP at SciNet Consortium, University of Toronto. 
> > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > > > ************************************ > TELL US ABOUT YOUR SUCCESS STORIES > http://www.scinethpc.ca/testimonials > ************************************ > --- > Jaime Pinto > SciNet HPC Consortium - Compute/Calcul Canada > www.scinet.utoronto.ca - > www.computecanada.org > University of Toronto > 256 McCaul Street, Room 235 > Toronto, ON, M5T1W5 > P: 416-978-2755 > C: 416-505-1477 > > ---------------------------------------------------------------- > This message was sent using IMP at SciNet Consortium, University of Toronto. > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ? > Kevin Buterbaugh - Senior System Administrator > Vanderbilt University - Advanced Computing Center for Research and Education > Kevin.Buterbaugh at vanderbilt.edu - > (615)875-9633 > > > > ************************************ TELL US ABOUT YOUR SUCCESS STORIES http://www.scinethpc.ca/testimonials ************************************ --- Jaime Pinto SciNet HPC Consortium - Compute/Calcul Canada www.scinet.utoronto.ca - www.computecanada.org University of Toronto 256 McCaul Street, Room 235 Toronto, ON, M5T1W5 P: 416-978-2755 C: 416-505-1477 ---------------------------------------------------------------- This message was sent using IMP at SciNet Consortium, University of Toronto. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From Robert.Oesterlin at nuance.com Tue Aug 16 16:59:13 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Tue, 16 Aug 2016 15:59:13 +0000 Subject: [gpfsug-discuss] Attending IBM Edge? Sessions of note and possible meet-up Message-ID: <29EA4D63-8885-42C5-876C-D68EB9E1CFDE@nuance.com> For those of you on the mailing list attending the IBM Edge conference in September, there will be at least one NDA session on Spectrum Scale and its future directions. I've heard that there will be a session on licensing as well. (always a hot topic). I have a couple of talks: Spectrum Scale with Transparent Cloud Tiering and on Spectrum Scale with Spectrum Control. I'll try and organize some sort of informal meetup one of the nights - thoughts on when would be welcome. Probably not Tuesday night, as that's the entertainment night. :-) Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid -------------- next part -------------- An HTML attachment was scrubbed... URL: From jfosburg at mdanderson.org Tue Aug 16 17:13:17 2016 From: jfosburg at mdanderson.org (Fosburgh,Jonathan) Date: Tue, 16 Aug 2016 16:13:17 +0000 Subject: [gpfsug-discuss] Attending IBM Edge? 
Sessions of note and possible meet-up In-Reply-To: <29EA4D63-8885-42C5-876C-D68EB9E1CFDE@nuance.com> References: <29EA4D63-8885-42C5-876C-D68EB9E1CFDE@nuance.com> Message-ID: <57c145ab-4207-7550-af57-ff07d6ac8f2d@mdanderson.org> I am speaking: SNP-2408 : Implementing a Research Storage Environment Using IBM Spectrum Software at MD Anderson Cancer Center Program : Enabling Cognitive IT with Storage and Software Defined Solutions Track : Building Oceans of Data Session Type : Breakout Session Date/Time : Tue, 20-Sep, 05:00 PM-06:00 PM Location : MGM Grand - Room 104 Presenter(s):Jonathan Fosburgh, UT MD Anderson This is primarily dealing with Scale and Archive, and also includes Protect. -- Jonathan Fosburgh Principal Application Systems Analyst Storage Team IT Operations jfosburg at mdanderson.org (713) 745-9346 On 08/16/2016 10:59 AM, Oesterlin, Robert wrote: For those of you on the mailing list attending the IBM Edge conference in September, there will be at least one NDA session on Spectrum Scale and its future directions. I've heard that there will be a session on licensing as well. (always a hot topic). I have a couple of talks: Spectrum Scale with Transparent Cloud Tiering and on Spectrum Scale with Spectrum Control. I'll try and organize some sort of informal meetup one of the nights - thoughts on when would be welcome. Probably not Tuesday night, as that's the entertainment night. :-) Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Tue Aug 16 22:09:35 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Tue, 16 Aug 2016 17:09:35 -0400 Subject: [gpfsug-discuss] mmfsadm test pit In-Reply-To: References: Message-ID: I was surprised to read that Ctrl-C did not really kill restripe. It's supposed to! If it doesn't that's a bug. I ran this by my expert within IBM and he wrote to me: First of all a "PIT job" such as restripe, deldisk, delsnapshot, and such should be easy to stop by ^C the management program that started them. The SG manager daemon holds open a socket to the client program for the purposes of sending command output, progress updates, error messages and the like. The PIT code checks this socket periodically and aborts the PIT process cleanly if the socket is closed. If this cleanup doesn't occur, it is a bug and should be worth reporting. However, there's no exact guarantee on how quickly each thread on the SG mgr will notice and then how quickly the helper nodes can be stopped and so forth. 
The interval between socket checks depends among other things on how long it takes to process each file, if there are a few very large files, the delay can be significant. In the limiting case, where most of the FS storage is contained in a few files, this mechanism doesn't work [elided] well. So it can be quite involved and slow sometimes to wrap up a PIT operation. The simplest way to determine if the command has really stopped is with the mmdiag --commands issued on the SG manager node. This shows running commands with the command line, start time, socket, flags, etc. After ^Cing the client program, the entry here should linger for a while, then go away. When it exits you'll see an entry in the GPFS log file where it fails with err 50. If this doesn't stop the command after a while, it is worth looking into. If the command wasn't issued on the SG mgr node and you can't find the where the client command is running, the socket is still a useful hint. While tedious, it should be possible to trace this socket back to node where that command was originally run using netstat or equivalent. Poking around inside a GPFS internaldump will also provide clues; there should be an outstanding sgmMsgSGClientCmd command listed in the dump tscomm section. Once you find it, just 'kill `pidof mmrestripefs` or similar. I'd like to warn the OP away from mmfsadm test pit. These commands are of course unsupported and unrecommended for any purpose (even internal test and development purposes, as far as I know). You are definitely working without a net there. When I was improving the integration between PIT and snapshot quiesce a few years ago, I looked into this and couldn't figure out how to (easily) make these stop and resume commands safe to use, so as far as I know they remain unsafe. The list command, however, is probably fairly okay; but it would probably be better to use mmfsadm saferdump pit. From: Aaron Knister To: Date: 08/15/2016 10:49 PM Subject: [gpfsug-discuss] mmfsadm test pit Sent by: gpfsug-discuss-bounces at spectrumscale.org I just discovered this interesting gem poking at mmfsadm: test pit fsname list|suspend|status|resume|stop [jobId] There have been times where I've kicked off a restripe and either intentionally or accidentally ctrl-c'd it only to realize that many times it's disappeared into the ether and is still running. The only way I've known so far to stop it is with a chgmgr. A far more painful instance happened when I ran a rebalance on an fs w/more than 31 nsds using more than 31 pit workers and hit *that* fun APAR which locked up access for a single filesystem to all 3.5k nodes. We spent 48 hours round the clock rebooting nodes as jobs drained to clear it up. I would have killed in that instance for a way to cancel the PIT job (the chmgr trick didn't work). It looks like you might actually be able to do this with mmfsadm, although how wise this is, I do not know (kinda curious about that). Here's an example. I kicked off a restripe and then ctrl-c'd it on a client node. Then ran these commands from the fs manager: root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list JobId 785979015170 PitJobStatus PIT_JOB_RUNNING progress 0.00 debug: statusListP D40E2C70 root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal stop 785979015170 debug: statusListP 0 root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list JobId 785979015170 PitJobStatus PIT_JOB_STOPPING progress 4.01 debug: statusListP D4013E70 ... some time passes ... 
root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list debug: statusListP 0 Interesting. -Aaron -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Tue Aug 16 22:55:19 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Tue, 16 Aug 2016 17:55:19 -0400 Subject: [gpfsug-discuss] mmfsadm test pit In-Reply-To: References: Message-ID: Thanks Marc! That's incredibly helpful info. I'll uh, not use the test pit command :) -Aaron On 8/16/16 5:09 PM, Marc A Kaplan wrote: > I was surprised to read that Ctrl-C did not really kill restripe. It's > supposed to! If it doesn't that's a bug. > > I ran this by my expert within IBM and he wrote to me: > > First of all a "PIT job" such as restripe, deldisk, delsnapshot, and > such should be easy to stop by ^C the management program that started > them. The SG manager daemon holds open a socket to the client program > for the purposes of sending command output, progress updates, error > messages and the like. The PIT code checks this socket periodically and > aborts the PIT process cleanly if the socket is closed. If this cleanup > doesn't occur, it is a bug and should be worth reporting. However, > there's no exact guarantee on how quickly each thread on the SG mgr will > notice and then how quickly the helper nodes can be stopped and so > forth. The interval between socket checks depends among other things on > how long it takes to process each file, if there are a few very large > files, the delay can be significant. In the limiting case, where most > of the FS storage is contained in a few files, this mechanism doesn't > work [elided] well. So it can be quite involved and slow sometimes to > wrap up a PIT operation. > > The simplest way to determine if the command has really stopped is with > the mmdiag --commands issued on the SG manager node. This shows running > commands with the command line, start time, socket, flags, etc. After > ^Cing the client program, the entry here should linger for a while, then > go away. When it exits you'll see an entry in the GPFS log file where > it fails with err 50. If this doesn't stop the command after a while, > it is worth looking into. > > If the command wasn't issued on the SG mgr node and you can't find the > where the client command is running, the socket is still a useful hint. > While tedious, it should be possible to trace this socket back to node > where that command was originally run using netstat or equivalent. > Poking around inside a GPFS internaldump will also provide clues; there > should be an outstanding sgmMsgSGClientCmd command listed in the dump > tscomm section. Once you find it, just 'kill `pidof mmrestripefs` or > similar. > > I'd like to warn the OP away from mmfsadm test pit. These commands are > of course unsupported and unrecommended for any purpose (even internal > test and development purposes, as far as I know). You are definitely > working without a net there. When I was improving the integration > between PIT and snapshot quiesce a few years ago, I looked into this and > couldn't figure out how to (easily) make these stop and resume commands > safe to use, so as far as I know they remain unsafe. 
The list command, > however, is probably fairly okay; but it would probably be better to use > mmfsadm saferdump pit. > > > > > > From: Aaron Knister > To: > Date: 08/15/2016 10:49 PM > Subject: [gpfsug-discuss] mmfsadm test pit > Sent by: gpfsug-discuss-bounces at spectrumscale.org > ------------------------------------------------------------------------ > > > > I just discovered this interesting gem poking at mmfsadm: > > test pit fsname list|suspend|status|resume|stop [jobId] > > There have been times where I've kicked off a restripe and either > intentionally or accidentally ctrl-c'd it only to realize that many > times it's disappeared into the ether and is still running. The only way > I've known so far to stop it is with a chgmgr. > > A far more painful instance happened when I ran a rebalance on an fs > w/more than 31 nsds using more than 31 pit workers and hit *that* fun > APAR which locked up access for a single filesystem to all 3.5k nodes. > We spent 48 hours round the clock rebooting nodes as jobs drained to > clear it up. I would have killed in that instance for a way to cancel > the PIT job (the chmgr trick didn't work). It looks like you might > actually be able to do this with mmfsadm, although how wise this is, I > do not know (kinda curious about that). > > Here's an example. I kicked off a restripe and then ctrl-c'd it on a > client node. Then ran these commands from the fs manager: > > root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list > JobId 785979015170 PitJobStatus PIT_JOB_RUNNING progress 0.00 > debug: statusListP D40E2C70 > > root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal stop > 785979015170 > debug: statusListP 0 > > root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list > JobId 785979015170 PitJobStatus PIT_JOB_STOPPING progress 4.01 > debug: statusListP D4013E70 > > ... some time passes ... > > root at loremds19:~ # /usr/lpp/mmfs/bin/mmfsadm test pit tlocal list > debug: statusListP 0 > > Interesting. > > -Aaron > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From aaron.s.knister at nasa.gov Wed Aug 17 02:46:39 2016 From: aaron.s.knister at nasa.gov (Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]) Date: Wed, 17 Aug 2016 01:46:39 +0000 Subject: [gpfsug-discuss] Monitor NSD server queue? Message-ID: <5F910253243E6A47B81A9A2EB424BBA101CC6514@NDJSMBX404.ndc.nasa.gov> Hi Everyone, We ran into a rather interesting situation over the past week. We had a job that was pounding the ever loving crap out of one of our filesystems (called dnb02) doing about 15GB/s of reads. We had other jobs experience a slowdown on a different filesystem (called dnb41) that uses entirely separate backend storage. What I can't figure out is why this other filesystem was affected. I've checked IB bandwidth and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth congestion, looked at the mmpmon nsd_ds counters (including disk request wait time), and checked out the disk iowait values from collectl. 
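The mmpmon counters mentioned here can be sampled over an interval to see which filesystem the traffic is actually landing on. A minimal sketch using fs_io_s, the documented per-filesystem request (the counters are cumulative, so successive samples have to be differenced; run as root on each NSD server of interest):

  echo fs_io_s > /tmp/mmpmon.req
  /usr/lpp/mmfs/bin/mmpmon -p -i /tmp/mmpmon.req -r 6 -d 10000

Here -r 6 -d 10000 takes six samples ten seconds apart, and -p produces output that is easier to parse in a script.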
I simply can't account for the slowdown on the other filesystem. The only thing I can think of is the high latency on dnb02's NSDs caused the mmfsd NSD queues to back up. Here's my question-- how can I monitor the state of th NSD queues? I can't find anything in mmdiag. An mmfsadm saferdump NSD shows me the queues and their status. I'm just not sure calling saferdump NSD every 10 seconds to monitor this data is going to end well. I've seen saferdump NSD cause mmfsd to die and that's from a task we only run every 6 hours that calls saferdump NSD. Any thoughts/ideas here would be great. Thanks! -Aaron -------------- next part -------------- An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Wed Aug 17 12:45:04 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Wed, 17 Aug 2016 11:45:04 +0000 Subject: [gpfsug-discuss] Monitor NSD server queue? In-Reply-To: <5F910253243E6A47B81A9A2EB424BBA101CC6514@NDJSMBX404.ndc.nasa.gov> References: <5F910253243E6A47B81A9A2EB424BBA101CC6514@NDJSMBX404.ndc.nasa.gov> Message-ID: <7BFE2D50-9AA9-4A78-A05A-08D5DEB0A2E1@nuance.com> Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latency on the disk subsystem causing a backup in the NSD queues. I was doing what you suggested not to do - "mmfsadm saferdump nsd' and looking at the queues. In my case 'mmfsadm saferdump" would usually work or hang, rather than kill mmfsd. But - the hang usually resulted it a tied up thread in mmfsd, so that's no good either. I wish I had better news - this is the only way I've found to get visibility to these queues. IBM hasn't seen fit to gives us a way to safely look at these. I personally think it's a bug that we can't safely dump these structures, as they give insight as to what's actually going on inside the NSD server. Yuri, Sven - thoughts? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: on behalf of "Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]" Reply-To: gpfsug main discussion list Date: Tuesday, August 16, 2016 at 8:46 PM To: gpfsug main discussion list Subject: [EXTERNAL] [gpfsug-discuss] Monitor NSD server queue? Hi Everyone, We ran into a rather interesting situation over the past week. We had a job that was pounding the ever loving crap out of one of our filesystems (called dnb02) doing about 15GB/s of reads. We had other jobs experience a slowdown on a different filesystem (called dnb41) that uses entirely separate backend storage. What I can't figure out is why this other filesystem was affected. I've checked IB bandwidth and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth congestion, looked at the mmpmon nsd_ds counters (including disk request wait time), and checked out the disk iowait values from collectl. I simply can't account for the slowdown on the other filesystem. The only thing I can think of is the high latency on dnb02's NSDs caused the mmfsd NSD queues to back up. Here's my question-- how can I monitor the state of th NSD queues? I can't find anything in mmdiag. An mmfsadm saferdump NSD shows me the queues and their status. I'm just not sure calling saferdump NSD every 10 seconds to monitor this data is going to end well. I've seen saferdump NSD cause mmfsd to die and that's from a task we only run every 6 hours that calls saferdump NSD. Any thoughts/ideas here would be great. Thanks! -Aaron -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From volobuev at us.ibm.com Wed Aug 17 21:34:57 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Wed, 17 Aug 2016 13:34:57 -0700 Subject: [gpfsug-discuss] Monitor NSD server queue? In-Reply-To: <7BFE2D50-9AA9-4A78-A05A-08D5DEB0A2E1@nuance.com> References: <5F910253243E6A47B81A9A2EB424BBA101CC6514@NDJSMBX404.ndc.nasa.gov> <7BFE2D50-9AA9-4A78-A05A-08D5DEB0A2E1@nuance.com> Message-ID: Unfortunately, at the moment there's no safe mechanism to show the usage statistics for different NSD queues. "mmfsadm saferdump nsd" as implemented doesn't acquire locks when parsing internal data structures. Now, NSD data structures are fairly static, as much things go, so the risk of following a stale pointer and hitting a segfault isn't particularly significant. I don't think I remember ever seeing mmfsd crash with NSD dump code on the stack. That said, this isn't code that's tested and known to be safe for production use. I haven't seen a case myself where an mmfsd thread gets stuck running this dump command, either, but Bob has. If that condition ever reoccurs, I'd be interested in seeing debug data. I agree that there's value in giving a sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics. mmdiag should be enhanced to allow this. That'd be a very reasonable (and doable) RFE. yuri From: "Oesterlin, Robert" To: gpfsug main discussion list , Date: 08/17/2016 04:45 AM Subject: Re: [gpfsug-discuss] Monitor NSD server queue? Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latency on the disk subsystem causing a backup in the NSD queues. I was doing what you suggested not to do - "mmfsadm saferdump nsd' and looking at the queues. In my case 'mmfsadm saferdump" would usually work or hang, rather than kill mmfsd. But - the hang usually resulted it a tied up thread in mmfsd, so that's no good either. I wish I had better news - this is the only way I've found to get visibility to these queues. IBM hasn't seen fit to gives us a way to safely look at these. I personally think it's a bug that we can't safely dump these structures, as they give insight as to what's actually going on inside the NSD server. Yuri, Sven - thoughts? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: on behalf of "Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]" Reply-To: gpfsug main discussion list Date: Tuesday, August 16, 2016 at 8:46 PM To: gpfsug main discussion list Subject: [EXTERNAL] [gpfsug-discuss] Monitor NSD server queue? Hi Everyone, We ran into a rather interesting situation over the past week. We had a job that was pounding the ever loving crap out of one of our filesystems (called dnb02) doing about 15GB/s of reads. We had other jobs experience a slowdown on a different filesystem (called dnb41) that uses entirely separate backend storage. What I can't figure out is why this other filesystem was affected. I've checked IB bandwidth and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth congestion, looked at the mmpmon nsd_ds counters (including disk request wait time), and checked out the disk iowait values from collectl. I simply can't account for the slowdown on the other filesystem. The only thing I can think of is the high latency on dnb02's NSDs caused the mmfsd NSD queues to back up. Here's my question-- how can I monitor the state of th NSD queues? I can't find anything in mmdiag. 
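Until something along the lines of that RFE lands in mmdiag, the only way to see the queues is the dump interface discussed in this thread, and it should be treated accordingly: capture a dump occasionally rather than polling it, and keep the cautions about unsupported commands in mind. A minimal sketch, run on an NSD server:

  /usr/lpp/mmfs/bin/mmfsadm saferdump nsd > /tmp/nsd.dump
  grep -i queue /tmp/nsd.dump | head -50

The exact layout of the queue lines varies by release, so inspect a dump by hand before building any monitoring on top of it.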
An mmfsadm saferdump NSD shows me the queues and their status. I'm just not sure calling saferdump NSD every 10 seconds to monitor this data is going to end well. I've seen saferdump NSD cause mmfsd to die and that's from a task we only run every 6 hours that calls saferdump NSD. Any thoughts/ideas here would be great. Thanks! -Aaron_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From SAnderson at convergeone.com Wed Aug 17 22:11:25 2016 From: SAnderson at convergeone.com (Shaun Anderson) Date: Wed, 17 Aug 2016 21:11:25 +0000 Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Message-ID: <1471468285737.63407@convergeone.com> ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 [sig] [RH_CertifiedSysAdmin_CMYK] [Linux on IBM Power Systems - Sales 2016] [IBM Spectrum Storage - Sales 2016] NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.png Type: image/png Size: 14134 bytes Desc: image001.png URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image003.jpg Type: image/jpeg Size: 2593 bytes Desc: image003.jpg URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image005.png Type: image/png Size: 11635 bytes Desc: image005.png URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image007.png Type: image/png Size: 11505 bytes Desc: image007.png URL: From YARD at il.ibm.com Thu Aug 18 00:11:52 2016 From: YARD at il.ibm.com (Yaron Daniel) Date: Thu, 18 Aug 2016 02:11:52 +0300 Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive In-Reply-To: <1471468285737.63407@convergeone.com> References: <1471468285737.63407@convergeone.com> Message-ID: Hi Do u use CES protocols nodes ? Or Samba on each of the Server ? 
Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services - Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: "gpfsug-discuss at spectrumscale.org" Date: 08/18/2016 12:11 AM Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 1851 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 14134 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 2593 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 11635 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 11505 bytes Desc: not available URL: From SAnderson at convergeone.com Thu Aug 18 02:51:38 2016 From: SAnderson at convergeone.com (Shaun Anderson) Date: Thu, 18 Aug 2016 01:51:38 +0000 Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive In-Reply-To: References: <1471468285737.63407@convergeone.com>, Message-ID: <1471485097896.49269@convergeone.com> ?We are currently running samba on the 3.5 node, but wanting to migrate everything into using CES once we get everything up to 4.2. SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Yaron Daniel Sent: Wednesday, August 17, 2016 5:11 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Hi Do u use CES protocols nodes ? Or Samba on each of the Server ? 
Regards ________________________________ Yaron Daniel 94 Em Ha'Moshavot Rd [cid:_1_0DDE2A700DDE24DC007F6D32C2258012] Server, Storage and Data Services- Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: "gpfsug-discuss at spectrumscale.org" Date: 08/18/2016 12:11 AM Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O208.577.2112 M214.263.7014 [sig] [RH_CertifiedSysAdmin_CMYK] [Linux on IBM Power Systems - Sales 2016] [IBM Spectrum Storage - Sales 2016] NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT00001.gif Type: image/gif Size: 1851 bytes Desc: ATT00001.gif URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT00002.png Type: image/png Size: 14134 bytes Desc: ATT00002.png URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT00003.jpg Type: image/jpeg Size: 2593 bytes Desc: ATT00003.jpg URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT00004.png Type: image/png Size: 11635 bytes Desc: ATT00004.png URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT00005.png Type: image/png Size: 11505 bytes Desc: ATT00005.png URL: From YARD at il.ibm.com Thu Aug 18 04:56:50 2016 From: YARD at il.ibm.com (Yaron Daniel) Date: Thu, 18 Aug 2016 06:56:50 +0300 Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive In-Reply-To: <1471485097896.49269@convergeone.com> References: <1471468285737.63407@convergeone.com>, <1471485097896.49269@convergeone.com> Message-ID: So - the procedure you are asking related to Samba. 
Please check at redhat Site the process of upgrade Samba - u will need to backup the tdb files and restore them. But pay attention that the Samba ids will remain the same after moving to CES - please review the Authentication Section. Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services - Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: gpfsug main discussion list Date: 08/18/2016 04:52 AM Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?We are currently running samba on the 3.5 node, but wanting to migrate everything into using CES once we get everything up to 4.2. SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Yaron Daniel Sent: Wednesday, August 17, 2016 5:11 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Hi Do u use CES protocols nodes ? Or Samba on each of the Server ? Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services- Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: "gpfsug-discuss at spectrumscale.org" Date: 08/18/2016 12:11 AM Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O208.577.2112 M214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: not available Type: image/gif Size: 1851 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 1851 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 14134 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/jpeg Size: 2593 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 11635 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 11505 bytes Desc: not available URL: From Robert.Oesterlin at nuance.com Thu Aug 18 15:47:25 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Thu, 18 Aug 2016 14:47:25 +0000 Subject: [gpfsug-discuss] Monitor NSD server queue? Message-ID: <2702740E-EC6A-4998-BA1A-35A1EF5B5EDC@nuance.com> Done. Notification generated at: 18 Aug 2016, 10:46 AM Eastern Time (ET) ID: 93260 Headline: Give sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics Submitted on: 18 Aug 2016, 10:46 AM Eastern Time (ET) Brand: Servers and Systems Software Product: Spectrum Scale (formerly known as GPFS) - Public RFEs Link: http://www.ibm.com/developerworks/rfe/execute?use_case=viewRfe&CR_ID=93260 Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid 507-269-0413 From: on behalf of Yuri L Volobuev Reply-To: gpfsug main discussion list Date: Wednesday, August 17, 2016 at 3:34 PM To: gpfsug main discussion list Subject: [EXTERNAL] Re: [gpfsug-discuss] Monitor NSD server queue? Unfortunately, at the moment there's no safe mechanism to show the usage statistics for different NSD queues. "mmfsadm saferdump nsd" as implemented doesn't acquire locks when parsing internal data structures. Now, NSD data structures are fairly static, as much things go, so the risk of following a stale pointer and hitting a segfault isn't particularly significant. I don't think I remember ever seeing mmfsd crash with NSD dump code on the stack. That said, this isn't code that's tested and known to be safe for production use. I haven't seen a case myself where an mmfsd thread gets stuck running this dump command, either, but Bob has. If that condition ever reoccurs, I'd be interested in seeing debug data. I agree that there's value in giving a sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics. mmdiag should be enhanced to allow this. That'd be a very reasonable (and doable) RFE. yuri [nactive hide details for "Oesterlin, Robert" ---08/17/2016 04:45:30 AM---]"Oesterlin, Robert" ---08/17/2016 04:45:30 AM---Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latenc From: "Oesterlin, Robert" To: gpfsug main discussion list , Date: 08/17/2016 04:45 AM Subject: Re: [gpfsug-discuss] Monitor NSD server queue? Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latency on the disk subsystem causing a backup in the NSD queues. I was doing what you suggested not to do - "mmfsadm saferdump nsd' and looking at the queues. 
In my case 'mmfsadm saferdump" would usually work or hang, rather than kill mmfsd. But - the hang usually resulted it a tied up thread in mmfsd, so that's no good either. I wish I had better news - this is the only way I've found to get visibility to these queues. IBM hasn't seen fit to gives us a way to safely look at these. I personally think it's a bug that we can't safely dump these structures, as they give insight as to what's actually going on inside the NSD server. Yuri, Sven - thoughts? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: on behalf of "Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]" Reply-To: gpfsug main discussion list Date: Tuesday, August 16, 2016 at 8:46 PM To: gpfsug main discussion list Subject: [EXTERNAL] [gpfsug-discuss] Monitor NSD server queue? Hi Everyone, We ran into a rather interesting situation over the past week. We had a job that was pounding the ever loving crap out of one of our filesystems (called dnb02) doing about 15GB/s of reads. We had other jobs experience a slowdown on a different filesystem (called dnb41) that uses entirely separate backend storage. What I can't figure out is why this other filesystem was affected. I've checked IB bandwidth and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth congestion, looked at the mmpmon nsd_ds counters (including disk request wait time), and checked out the disk iowait values from collectl. I simply can't account for the slowdown on the other filesystem. The only thing I can think of is the high latency on dnb02's NSDs caused the mmfsd NSD queues to back up. Here's my question-- how can I monitor the state of th NSD queues? I can't find anything in mmdiag. An mmfsadm saferdump NSD shows me the queues and their status. I'm just not sure calling saferdump NSD every 10 seconds to monitor this data is going to end well. I've seen saferdump NSD cause mmfsd to die and that's from a task we only run every 6 hours that calls saferdump NSD. Any thoughts/ideas here would be great. Thanks! -Aaron_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.gif Type: image/gif Size: 106 bytes Desc: image001.gif URL: From bbanister at jumptrading.com Thu Aug 18 16:00:21 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Thu, 18 Aug 2016 15:00:21 +0000 Subject: [gpfsug-discuss] Monitor NSD server queue? In-Reply-To: <2702740E-EC6A-4998-BA1A-35A1EF5B5EDC@nuance.com> References: <2702740E-EC6A-4998-BA1A-35A1EF5B5EDC@nuance.com> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB062FC26E@CHI-EXCHANGEW1.w2k.jumptrading.com> Great stuff? I added my vote, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Oesterlin, Robert Sent: Thursday, August 18, 2016 9:47 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Monitor NSD server queue? Done. 
Notification generated at: 18 Aug 2016, 10:46 AM Eastern Time (ET) ID: 93260 Headline: Give sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics Submitted on: 18 Aug 2016, 10:46 AM Eastern Time (ET) Brand: Servers and Systems Software Product: Spectrum Scale (formerly known as GPFS) - Public RFEs Link: http://www.ibm.com/developerworks/rfe/execute?use_case=viewRfe&CR_ID=93260 Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid 507-269-0413 From: > on behalf of Yuri L Volobuev > Reply-To: gpfsug main discussion list > Date: Wednesday, August 17, 2016 at 3:34 PM To: gpfsug main discussion list > Subject: [EXTERNAL] Re: [gpfsug-discuss] Monitor NSD server queue? Unfortunately, at the moment there's no safe mechanism to show the usage statistics for different NSD queues. "mmfsadm saferdump nsd" as implemented doesn't acquire locks when parsing internal data structures. Now, NSD data structures are fairly static, as much things go, so the risk of following a stale pointer and hitting a segfault isn't particularly significant. I don't think I remember ever seeing mmfsd crash with NSD dump code on the stack. That said, this isn't code that's tested and known to be safe for production use. I haven't seen a case myself where an mmfsd thread gets stuck running this dump command, either, but Bob has. If that condition ever reoccurs, I'd be interested in seeing debug data. I agree that there's value in giving a sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics. mmdiag should be enhanced to allow this. That'd be a very reasonable (and doable) RFE. yuri [nactive hide details for "Oesterlin, Robert" ---08/17/2016 04:45:30 AM---]"Oesterlin, Robert" ---08/17/2016 04:45:30 AM---Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latenc From: "Oesterlin, Robert" > To: gpfsug main discussion list >, Date: 08/17/2016 04:45 AM Subject: Re: [gpfsug-discuss] Monitor NSD server queue? Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latency on the disk subsystem causing a backup in the NSD queues. I was doing what you suggested not to do - "mmfsadm saferdump nsd' and looking at the queues. In my case 'mmfsadm saferdump" would usually work or hang, rather than kill mmfsd. But - the hang usually resulted it a tied up thread in mmfsd, so that's no good either. I wish I had better news - this is the only way I've found to get visibility to these queues. IBM hasn't seen fit to gives us a way to safely look at these. I personally think it's a bug that we can't safely dump these structures, as they give insight as to what's actually going on inside the NSD server. Yuri, Sven - thoughts? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]" > Reply-To: gpfsug main discussion list > Date: Tuesday, August 16, 2016 at 8:46 PM To: gpfsug main discussion list > Subject: [EXTERNAL] [gpfsug-discuss] Monitor NSD server queue? Hi Everyone, We ran into a rather interesting situation over the past week. We had a job that was pounding the ever loving crap out of one of our filesystems (called dnb02) doing about 15GB/s of reads. We had other jobs experience a slowdown on a different filesystem (called dnb41) that uses entirely separate backend storage. 
What I can't figure out is why this other filesystem was affected. I've checked IB bandwidth and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth congestion, looked at the mmpmon nsd_ds counters (including disk request wait time), and checked out the disk iowait values from collectl. I simply can't account for the slowdown on the other filesystem. The only thing I can think of is the high latency on dnb02's NSDs caused the mmfsd NSD queues to back up. Here's my question-- how can I monitor the state of th NSD queues? I can't find anything in mmdiag. An mmfsadm saferdump NSD shows me the queues and their status. I'm just not sure calling saferdump NSD every 10 seconds to monitor this data is going to end well. I've seen saferdump NSD cause mmfsd to die and that's from a task we only run every 6 hours that calls saferdump NSD. Any thoughts/ideas here would be great. Thanks! -Aaron_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.gif Type: image/gif Size: 106 bytes Desc: image001.gif URL: From mimarsh2 at vt.edu Thu Aug 18 16:15:38 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Thu, 18 Aug 2016 11:15:38 -0400 Subject: [gpfsug-discuss] NSD Server BIOS setting - snoop mode Message-ID: All, Is there any best practice or recommendation for the Snoop Mode memory setting for NSD Servers? Default is Early Snoop. On compute nodes, I am using Cluster On Die, which creates 2 NUMA nodes per processor. This setup has 2 x 16-core Broadwell processors in each NSD server. Brian -------------- next part -------------- An HTML attachment was scrubbed... URL: From gmcpheeters at anl.gov Thu Aug 18 16:14:11 2016 From: gmcpheeters at anl.gov (McPheeters, Gordon) Date: Thu, 18 Aug 2016 15:14:11 +0000 Subject: [gpfsug-discuss] Monitor NSD server queue? In-Reply-To: <21BC488F0AEA2245B2C3E83FC0B33DBB062FC26E@CHI-EXCHANGEW1.w2k.jumptrading.com> References: <2702740E-EC6A-4998-BA1A-35A1EF5B5EDC@nuance.com> <21BC488F0AEA2245B2C3E83FC0B33DBB062FC26E@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: <97F08A04-D7C4-4985-840F-DC026E8606F4@anl.gov> Got my vote - thanks Robert. Gordon McPheeters ALCF Storage (630) 252-6430 gmcpheeters at anl.gov On Aug 18, 2016, at 10:00 AM, Bryan Banister > wrote: Great stuff? 
I added my vote, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Oesterlin, Robert Sent: Thursday, August 18, 2016 9:47 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Monitor NSD server queue? Done. Notification generated at: 18 Aug 2016, 10:46 AM Eastern Time (ET) ID: 93260 Headline: Give sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics Submitted on: 18 Aug 2016, 10:46 AM Eastern Time (ET) Brand: Servers and Systems Software Product: Spectrum Scale (formerly known as GPFS) - Public RFEs Link: http://www.ibm.com/developerworks/rfe/execute?use_case=viewRfe&CR_ID=93260 Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid 507-269-0413 From: > on behalf of Yuri L Volobuev > Reply-To: gpfsug main discussion list > Date: Wednesday, August 17, 2016 at 3:34 PM To: gpfsug main discussion list > Subject: [EXTERNAL] Re: [gpfsug-discuss] Monitor NSD server queue? Unfortunately, at the moment there's no safe mechanism to show the usage statistics for different NSD queues. "mmfsadm saferdump nsd" as implemented doesn't acquire locks when parsing internal data structures. Now, NSD data structures are fairly static, as much things go, so the risk of following a stale pointer and hitting a segfault isn't particularly significant. I don't think I remember ever seeing mmfsd crash with NSD dump code on the stack. That said, this isn't code that's tested and known to be safe for production use. I haven't seen a case myself where an mmfsd thread gets stuck running this dump command, either, but Bob has. If that condition ever reoccurs, I'd be interested in seeing debug data. I agree that there's value in giving a sysadmin insight into the inner workings of the NSD server machinery, in particular the queue dynamics. mmdiag should be enhanced to allow this. That'd be a very reasonable (and doable) RFE. yuri "Oesterlin, Robert" ---08/17/2016 04:45:30 AM---Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latenc From: "Oesterlin, Robert" > To: gpfsug main discussion list >, Date: 08/17/2016 04:45 AM Subject: Re: [gpfsug-discuss] Monitor NSD server queue? Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi Aaron You did a perfect job of explaining a situation I've run into time after time - high latency on the disk subsystem causing a backup in the NSD queues. I was doing what you suggested not to do - "mmfsadm saferdump nsd' and looking at the queues. In my case 'mmfsadm saferdump" would usually work or hang, rather than kill mmfsd. But - the hang usually resulted it a tied up thread in mmfsd, so that's no good either. I wish I had better news - this is the only way I've found to get visibility to these queues. IBM hasn't seen fit to gives us a way to safely look at these. I personally think it's a bug that we can't safely dump these structures, as they give insight as to what's actually going on inside the NSD server. Yuri, Sven - thoughts? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]" > Reply-To: gpfsug main discussion list > Date: Tuesday, August 16, 2016 at 8:46 PM To: gpfsug main discussion list > Subject: [EXTERNAL] [gpfsug-discuss] Monitor NSD server queue? Hi Everyone, We ran into a rather interesting situation over the past week. 
We had a job that was pounding the ever loving crap out of one of our filesystems (called dnb02) doing about 15GB/s of reads. We had other jobs experience a slowdown on a different filesystem (called dnb41) that uses entirely separate backend storage. What I can't figure out is why this other filesystem was affected. I've checked IB bandwidth and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth congestion, looked at the mmpmon nsd_ds counters (including disk request wait time), and checked out the disk iowait values from collectl. I simply can't account for the slowdown on the other filesystem. The only thing I can think of is the high latency on dnb02's NSDs caused the mmfsd NSD queues to back up. Here's my question-- how can I monitor the state of th NSD queues? I can't find anything in mmdiag. An mmfsadm saferdump NSD shows me the queues and their status. I'm just not sure calling saferdump NSD every 10 seconds to monitor this data is going to end well. I've seen saferdump NSD cause mmfsd to die and that's from a task we only run every 6 hours that calls saferdump NSD. Any thoughts/ideas here would be great. Thanks! -Aaron_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From christof.schmitt at us.ibm.com Thu Aug 18 18:50:12 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Thu, 18 Aug 2016 10:50:12 -0700 Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive In-Reply-To: <1471485097896.49269@convergeone.com> References: <1471468285737.63407@convergeone.com>, <1471485097896.49269@convergeone.com> Message-ID: Samba as supported in Spectrum Scale uses the "autorid" module for creating internal id mappings (see man idmap_autorid for some details). Officially supported are also methods to retrieve id mappings from an external server: https://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_adfofile.htm The earlier email states that they have a " .tdb backend for id mapping on their current server. ". How exactly is that configured in Samba? Which Samba version is used here? So the plan is to upgrade the cluster, and then switch to the Samba version provided with CES? Should the same id mappings be used? 
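The first two of those questions can be answered on the existing node with a couple of commands; a small sketch (paths and package layout vary by distribution):

  smbd -V                                             # exact Samba version in use
  testparm -s 2>/dev/null | grep -iE 'idmap|passdb'   # effective id-mapping and passdb settings

testparm prints the effective configuration after defaults are applied, which is usually more reliable than reading smb.conf by eye.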
Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Shaun Anderson To: gpfsug main discussion list Date: 08/17/2016 06:52 PM Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?We are currently running samba on the 3.5 node, but wanting to migrate everything into using CES once we get everything up to 4.2. SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Yaron Daniel Sent: Wednesday, August 17, 2016 5:11 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Hi Do u use CES protocols nodes ? Or Samba on each of the Server ? Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services- Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: "gpfsug-discuss at spectrumscale.org" Date: 08/18/2016 12:11 AM Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O208.577.2112 M214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From SAnderson at convergeone.com Thu Aug 18 19:11:02 2016 From: SAnderson at convergeone.com (Shaun Anderson) Date: Thu, 18 Aug 2016 18:11:02 +0000 Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive In-Reply-To: References: <1471468285737.63407@convergeone.com>, <1471485097896.49269@convergeone.com> Message-ID: Correct. We are upgrading their existing configuration and want to switch to CES provided Samba. They are using Samba 3.6.24 currently on RHEL 6.6. 
Here is the head of the smb.conf file: =================================================== [global] workgroup = SL1 netbios name = SLTLTFSEE server string = LTFSEE Server realm = removed.ORG security = ads encrypt passwords = yes default = global browseable = no socket options = TCP_NODELAY SO_KEEPALIVE TCP_KEEPCNT=4 TCP_KEEPIDLE=240 TCP_KEEPINTVL=15 idmap config * : backend = tdb idmap config * : range = 1000000-9000000 template shell = /bash/bin writable = yes allow trusted domains = yes client ntlmv2 auth = yes auth methods = guest sam winbind passdb backend = tdbsam groupdb:backend = tdb interfaces = eth1 lo username map = /etc/samba/smbusers map to guest = bad uid guest account = nobody ===================================================== Does that make sense? Regards, SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Christof Schmitt Sent: Thursday, August 18, 2016 11:50 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive Samba as supported in Spectrum Scale uses the "autorid" module for creating internal id mappings (see man idmap_autorid for some details). Officially supported are also methods to retrieve id mappings from an external server: https://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_adfofile.htm The earlier email states that they have a " .tdb backend for id mapping on their current server. ". How exactly is that configured in Samba? Which Samba version is used here? So the plan is to upgrade the cluster, and then switch to the Samba version provided with CES? Should the same id mappings be used? Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Shaun Anderson To: gpfsug main discussion list Date: 08/17/2016 06:52 PM Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?We are currently running samba on the 3.5 node, but wanting to migrate everything into using CES once we get everything up to 4.2. SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Yaron Daniel Sent: Wednesday, August 17, 2016 5:11 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Hi Do u use CES protocols nodes ? Or Samba on each of the Server ? Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services- Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: "gpfsug-discuss at spectrumscale.org" Date: 08/18/2016 12:11 AM Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. 
I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O208.577.2112 M214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. From Kevin.Buterbaugh at Vanderbilt.Edu Thu Aug 18 20:05:03 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Thu, 18 Aug 2016 19:05:03 +0000 Subject: [gpfsug-discuss] Please ignore - debugging an issue Message-ID: Please ignore. I am working with the list admins on an issue and need to send an e-mail to the list to duplicate the problem. I apologize that this necessitates this e-mail to the list. Thanks? ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From christof.schmitt at us.ibm.com Thu Aug 18 20:43:50 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Thu, 18 Aug 2016 12:43:50 -0700 Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive In-Reply-To: References: <1471468285737.63407@convergeone.com>, <1471485097896.49269@convergeone.com> Message-ID: There are a few points to consider here: CES uses Samba in cluster mode with ctdb. That means that the tdb database is shared through ctdb on all protocol nodes, and the internal format is slightly different since it contains additional information for tracking the cross-node status of the individual records. Spectrum Scale officially supports the autorid module for internal id mapping. That approach is different than the older idmap_tdb since it basically only has one record per AD domain, and not one record per user or group. This is known to scale better in environments where many users and groups require id mappings. The downside is that data from idmap_tdb cannot be directly imported. 
While not officially supported Spectrum Scale also ships the idmap_tdb module. You could configure authentication and internal id mapping on Spectrum Scale, and then overwrite the config manually to use the old idmap module (the idmap-range-size is required, but not relevant later on): mmuserauth service create ... --idmap-range 1000000-9000000 --idmap-range-size 100000 /usr/lpp/mmfs/bin/net conf setparm global 'idmap config * : backend' tdb mmdsh -N CesNodes systemctl restart gpfs-winbind mmdsh -N CesNodes /usr/lpp/mmfs/bin/net cache flush With the old Samba, export the idmap data to a file: net idmap dump > idmap-dump.txt And on a node running CES Samba import that data, and remove any old cached entries: /usr/lpp/mmfs/bin/net idmap restore idmap-dump.txt mmdsh -N CesNodes /usr/lpp/mmfs/bin/net cache flush Just to be clear: This is untested and if there is a problem with the id mapping in that configuration, it will likely be pointed to the unsupported configuration. The way to request this as an official feature would be through a RFE, although i cannot say whether that would be picked up by product management. Another option would be creating the id mappings in the Active Directory records or in a external LDAP server based on the old mappings, and point the CES Samba to that data. That would again be a supported configuration. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Shaun Anderson To: Christof Schmitt/Tucson/IBM at IBMUS Cc: gpfsug main discussion list Date: 08/18/2016 11:11 AM Subject: RE: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive Correct. We are upgrading their existing configuration and want to switch to CES provided Samba. They are using Samba 3.6.24 currently on RHEL 6.6. Here is the head of the smb.conf file: =================================================== [global] workgroup = SL1 netbios name = SLTLTFSEE server string = LTFSEE Server realm = removed.ORG security = ads encrypt passwords = yes default = global browseable = no socket options = TCP_NODELAY SO_KEEPALIVE TCP_KEEPCNT=4 TCP_KEEPIDLE=240 TCP_KEEPINTVL=15 idmap config * : backend = tdb idmap config * : range = 1000000-9000000 template shell = /bash/bin writable = yes allow trusted domains = yes client ntlmv2 auth = yes auth methods = guest sam winbind passdb backend = tdbsam groupdb:backend = tdb interfaces = eth1 lo username map = /etc/samba/smbusers map to guest = bad uid guest account = nobody ===================================================== Does that make sense? Regards, SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [ mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Christof Schmitt Sent: Thursday, August 18, 2016 11:50 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE toSpectrumArchive Samba as supported in Spectrum Scale uses the "autorid" module for creating internal id mappings (see man idmap_autorid for some details). Officially supported are also methods to retrieve id mappings from an external server: https://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_adfofile.htm The earlier email states that they have a " .tdb backend for id mapping on their current server. ". How exactly is that configured in Samba? Which Samba version is used here? 
So the plan is to upgrade the cluster, and then switch to the Samba version provided with CES? Should the same id mappings be used? Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Shaun Anderson To: gpfsug main discussion list Date: 08/17/2016 06:52 PM Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?We are currently running samba on the 3.5 node, but wanting to migrate everything into using CES once we get everything up to 4.2. SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 From: gpfsug-discuss-bounces at spectrumscale.org on behalf of Yaron Daniel Sent: Wednesday, August 17, 2016 5:11 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Hi Do u use CES protocols nodes ? Or Samba on each of the Server ? Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services- Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Shaun Anderson To: "gpfsug-discuss at spectrumscale.org" Date: 08/18/2016 12:11 AM Subject: [gpfsug-discuss] Migrate 3.5 to 4.2 and LTFSEE to Spectrum Archive Sent by: gpfsug-discuss-bounces at spectrumscale.org ?I am in process of migrating from 3.5 to 4.2 and LTFSEE to Spectrum Archive. 1 node cluster (currently) connected to V3700 storage and TS4500 backend. We have upgraded their 2nd node to 4.2 and have successfully tested joining the domain, created smb shares, and validated their ability to access and control permissions on those shares. They are using .tdb backend for id mapping on their current server. I'm looking to discuss with someone the best method of migrating those tdb databases to the second server, or understand how Spectrum Scale does id mapping and where it stores that information. Any hints would be greatly appreciated. Regards, SHAUN ANDERSON STORAGE ARCHITECT O208.577.2112 M214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. 
If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. From jez.tucker at gpfsug.org Thu Aug 18 20:57:00 2016 From: jez.tucker at gpfsug.org (Jez Tucker) Date: Thu, 18 Aug 2016 20:57:00 +0100 Subject: [gpfsug-discuss] If you are experiencing mail stuck in spam / bounces Message-ID: Hi all As the discussion group is a mailing list, it is possible that members can experience the list traffic being interpreted as spam. In such instances, you may experience better results if you whitelist the mailing list addresses or create a 'Not Spam' filter (E.G. gmail) gpfsug-discuss at spectrumscale.org gpfsug-discuss at gpfsug.org You can test that you can receive a response from the mailing list server by sending an email to: gpfsug-discuss-request at spectrumscale.org with the subject of: help Should you experience further trouble, please ping us at: gpfsug-discuss-owner at spectrumscale.org All the best, Jez From aaron.s.knister at nasa.gov Fri Aug 19 05:12:26 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Fri, 19 Aug 2016 00:12:26 -0400 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? In-Reply-To: <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu> References: <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu> Message-ID: <140fab1a-e043-5c20-eb1f-d5ef7e91d89d@nasa.gov> Figured I'd throw in my "me too!" as well. We have ~3500 nodes and 60 gpfs server nodes and we've done several rounds of rolling upgrades starting with 3.5.0.19 -> 3.5.0.24. We've had the cluster with a mix of both versions for quite some time (We're actually in that state right now as it would happen and have been for several months). I've not seen any issue with it. Of course, as Richard alluded to, its good to check the release notes :) -Aaron On 8/15/16 8:45 AM, Buterbaugh, Kevin L wrote: > Richard, > > I will second what Bob said with one caveat ? on one occasion we had an > issue with our multi-cluster setup because the PTF?s were incompatible. > However, that was clearly documented in the release notes, which we > obviously hadn?t read carefully enough. > > While we generally do rolling upgrades over a two to three week period, > we have run for months with clients at differing PTF levels. HTHAL? > > Kevin > >> On Aug 15, 2016, at 6:22 AM, Oesterlin, Robert >> > wrote: >> >> In general, yes, it's common practice to do the 'rolling upgrades'. If >> I had to do my whole cluster at once, with an outage, I'd probably >> never upgrade. :) >> >> >> Bob Oesterlin >> Sr Storage Engineer, Nuance HPC Grid >> >> >> *From: *> > on behalf of >> "Sobey, Richard A" > > >> *Reply-To: *gpfsug main discussion list >> > > >> *Date: *Monday, August 15, 2016 at 4:59 AM >> *To: *"'gpfsug-discuss at spectrumscale.org >> '" >> > > >> *Subject: *[EXTERNAL] [gpfsug-discuss] Minor GPFS versions coexistence >> problems? >> >> Hi all, >> >> If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to >> 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger >> it over a few days, perhaps up to 2 weeks or will I run into problems >> if they?re on different versions? >> >> Cheers >> >> Richard >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ? 
> Kevin Buterbaugh - Senior System Administrator > Vanderbilt University - Advanced Computing Center for Research and Education > Kevin.Buterbaugh at vanderbilt.edu > - (615)875-9633 > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From aaron.s.knister at nasa.gov Fri Aug 19 05:13:06 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Fri, 19 Aug 2016 00:13:06 -0400 Subject: [gpfsug-discuss] Minor GPFS versions coexistence problems? In-Reply-To: <140fab1a-e043-5c20-eb1f-d5ef7e91d89d@nasa.gov> References: <9691E717-690C-48C7-8017-BA6F001B5461@vanderbilt.edu> <140fab1a-e043-5c20-eb1f-d5ef7e91d89d@nasa.gov> Message-ID: <70e33e6d-cd6b-5a5e-1e2d-f0ad16def5f4@nasa.gov> Oops... I meant Kevin, not Richard. On 8/19/16 12:12 AM, Aaron Knister wrote: > Figured I'd throw in my "me too!" as well. We have ~3500 nodes and 60 > gpfs server nodes and we've done several rounds of rolling upgrades > starting with 3.5.0.19 -> 3.5.0.24. We've had the cluster with a mix of > both versions for quite some time (We're actually in that state right > now as it would happen and have been for several months). I've not seen > any issue with it. Of course, as Richard alluded to, its good to check > the release notes :) > > -Aaron > > On 8/15/16 8:45 AM, Buterbaugh, Kevin L wrote: >> Richard, >> >> I will second what Bob said with one caveat ? on one occasion we had an >> issue with our multi-cluster setup because the PTF?s were incompatible. >> However, that was clearly documented in the release notes, which we >> obviously hadn?t read carefully enough. >> >> While we generally do rolling upgrades over a two to three week period, >> we have run for months with clients at differing PTF levels. HTHAL? >> >> Kevin >> >>> On Aug 15, 2016, at 6:22 AM, Oesterlin, Robert >>> > >>> wrote: >>> >>> In general, yes, it's common practice to do the 'rolling upgrades'. If >>> I had to do my whole cluster at once, with an outage, I'd probably >>> never upgrade. :) >>> >>> >>> Bob Oesterlin >>> Sr Storage Engineer, Nuance HPC Grid >>> >>> >>> *From: *>> > on behalf of >>> "Sobey, Richard A" >> > >>> *Reply-To: *gpfsug main discussion list >>> >> > >>> *Date: *Monday, August 15, 2016 at 4:59 AM >>> *To: *"'gpfsug-discuss at spectrumscale.org >>> '" >>> >> > >>> *Subject: *[EXTERNAL] [gpfsug-discuss] Minor GPFS versions coexistence >>> problems? >>> >>> Hi all, >>> >>> If I wanted to upgrade my NSD nodes one at a time from 3.5.0.22 to >>> 3.5.0.27 (or whatever the latest in that branch is) am I ok to stagger >>> it over a few days, perhaps up to 2 weeks or will I run into problems >>> if they?re on different versions? >>> >>> Cheers >>> >>> Richard >>> _______________________________________________ >>> gpfsug-discuss mailing list >>> gpfsug-discuss at spectrumscale.org >>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> ? 
>> Kevin Buterbaugh - Senior System Administrator >> Vanderbilt University - Advanced Computing Center for Research and >> Education >> Kevin.Buterbaugh at vanderbilt.edu >> - (615)875-9633 >> >> >> >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From bdeluca at gmail.com Fri Aug 19 05:15:00 2016 From: bdeluca at gmail.com (Ben De Luca) Date: Fri, 19 Aug 2016 07:15:00 +0300 Subject: [gpfsug-discuss] If you are experiencing mail stuck in spam / bounces In-Reply-To: References: Message-ID: Hey Jez, Its because the mailing list doesn't have an SPF record in your DNS, being neutral is a good way to be picked up as spam. On 18 August 2016 at 22:57, Jez Tucker wrote: > Hi all > > As the discussion group is a mailing list, it is possible that members can > experience the list traffic being interpreted as spam. > > > In such instances, you may experience better results if you whitelist the > mailing list addresses or create a 'Not Spam' filter (E.G. gmail) > > gpfsug-discuss at spectrumscale.org > > gpfsug-discuss at gpfsug.org > > > You can test that you can receive a response from the mailing list server by > sending an email to: gpfsug-discuss-request at spectrumscale.org with the > subject of: help > > > Should you experience further trouble, please ping us at: > gpfsug-discuss-owner at spectrumscale.org > > > All the best, > > > Jez > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss From jez.tucker at gpfsug.org Fri Aug 19 08:51:20 2016 From: jez.tucker at gpfsug.org (Jez Tucker) Date: Fri, 19 Aug 2016 08:51:20 +0100 Subject: [gpfsug-discuss] If you are experiencing mail stuck in spam / bounces In-Reply-To: References: Message-ID: <0c9d81b2-ac41-b6a5-e4f1-a816558711b7@gpfsug.org> Hi Yes, we looked at that some time ago and I recall we had an issues with setting up the SPF. However, probably a good time as any to look at it again. I'll ping Arif and Simon and they can look at their respective domains. Jez On 19/08/16 05:15, Ben De Luca wrote: > Hey Jez, > Its because the mailing list doesn't have an SPF record in your > DNS, being neutral is a good way to be picked up as spam. > > > > On 18 August 2016 at 22:57, Jez Tucker wrote: >> Hi all >> >> As the discussion group is a mailing list, it is possible that members can >> experience the list traffic being interpreted as spam. >> >> >> In such instances, you may experience better results if you whitelist the >> mailing list addresses or create a 'Not Spam' filter (E.G. 
gmail) >> >> gpfsug-discuss at spectrumscale.org >> >> gpfsug-discuss at gpfsug.org >> >> >> You can test that you can receive a response from the mailing list server by >> sending an email to: gpfsug-discuss-request at spectrumscale.org with the >> subject of: help >> >> >> Should you experience further trouble, please ping us at: >> gpfsug-discuss-owner at spectrumscale.org >> >> >> All the best, >> >> >> Jez >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss From aaron.s.knister at nasa.gov Fri Aug 19 23:06:57 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Fri, 19 Aug 2016 18:06:57 -0400 Subject: [gpfsug-discuss] Monitor NSD server queue? In-Reply-To: <97F08A04-D7C4-4985-840F-DC026E8606F4@anl.gov> References: <2702740E-EC6A-4998-BA1A-35A1EF5B5EDC@nuance.com> <21BC488F0AEA2245B2C3E83FC0B33DBB062FC26E@CHI-EXCHANGEW1.w2k.jumptrading.com> <97F08A04-D7C4-4985-840F-DC026E8606F4@anl.gov> Message-ID: <5ca238de-bb95-2854-68bd-36d1b8df2810@nasa.gov> Thanks everyone! I also have a PMR open for this, so hopefully the RFE gets some traction. On 8/18/16 11:14 AM, McPheeters, Gordon wrote: > Got my vote - thanks Robert. > > > Gordon McPheeters > ALCF Storage > (630) 252-6430 > gmcpheeters at anl.gov > > > >> On Aug 18, 2016, at 10:00 AM, Bryan Banister >> > wrote: >> >> Great stuff? I added my vote, >> -Bryan >> >> *From:* gpfsug-discuss-bounces at spectrumscale.org >> [mailto:gpfsug-discuss-bounces at spectrumscale.org] *On >> Behalf Of *Oesterlin, Robert >> *Sent:* Thursday, August 18, 2016 9:47 AM >> *To:* gpfsug main discussion list >> *Subject:* Re: [gpfsug-discuss] Monitor NSD server queue? >> >> Done. >> >> Notification generated at: 18 Aug 2016, 10:46 AM Eastern Time (ET) >> >> ID: 93260 >> Headline: Give sysadmin insight >> into the inner workings of the NSD server machinery, in particular the >> queue dynamics >> Submitted on: 18 Aug 2016, 10:46 AM Eastern >> Time (ET) >> Brand: Servers and Systems >> Software >> Product: Spectrum Scale (formerly >> known as GPFS) - Public RFEs >> >> Link: >> http://www.ibm.com/developerworks/rfe/execute?use_case=viewRfe&CR_ID=93260 >> >> >> Bob Oesterlin >> Sr Storage Engineer, Nuance HPC Grid >> 507-269-0413 >> >> >> *From: *> > on behalf of Yuri L >> Volobuev > >> *Reply-To: *gpfsug main discussion list >> > > >> *Date: *Wednesday, August 17, 2016 at 3:34 PM >> *To: *gpfsug main discussion list > > >> *Subject: *[EXTERNAL] Re: [gpfsug-discuss] Monitor NSD server queue? >> >> >> Unfortunately, at the moment there's no safe mechanism to show the >> usage statistics for different NSD queues. "mmfsadm saferdump nsd" as >> implemented doesn't acquire locks when parsing internal data >> structures. Now, NSD data structures are fairly static, as much things >> go, so the risk of following a stale pointer and hitting a segfault >> isn't particularly significant. I don't think I remember ever seeing >> mmfsd crash with NSD dump code on the stack. That said, this isn't >> code that's tested and known to be safe for production use. I haven't >> seen a case myself where an mmfsd thread gets stuck running this dump >> command, either, but Bob has. If that condition ever reoccurs, I'd be >> interested in seeing debug data. 
>> >> I agree that there's value in giving a sysadmin insight into the inner >> workings of the NSD server machinery, in particular the queue >> dynamics. mmdiag should be enhanced to allow this. That'd be a very >> reasonable (and doable) RFE. >> >> yuri >> >> "Oesterlin, Robert" ---08/17/2016 04:45:30 AM---Hi Aaron >> You did a perfect job of explaining a situation I've run into time >> after time - high latenc >> >> From: "Oesterlin, Robert" > > >> To: gpfsug main discussion list > >, >> Date: 08/17/2016 04:45 AM >> Subject: Re: [gpfsug-discuss] Monitor NSD server queue? >> Sent by: gpfsug-discuss-bounces at spectrumscale.org >> >> >> ------------------------------------------------------------------------ >> >> >> >> >> Hi Aaron >> >> You did a perfect job of explaining a situation I've run into time >> after time - high latency on the disk subsystem causing a backup in >> the NSD queues. I was doing what you suggested not to do - "mmfsadm >> saferdump nsd' and looking at the queues. In my case 'mmfsadm >> saferdump" would usually work or hang, rather than kill mmfsd. But - >> the hang usually resulted it a tied up thread in mmfsd, so that's no >> good either. >> >> I wish I had better news - this is the only way I've found to get >> visibility to these queues. IBM hasn't seen fit to gives us a way to >> safely look at these. I personally think it's a bug that we can't >> safely dump these structures, as they give insight as to what's >> actually going on inside the NSD server. >> >> Yuri, Sven - thoughts? >> >> >> Bob Oesterlin >> Sr Storage Engineer, Nuance HPC Grid >> >> >> >> *From: *> > on behalf of >> "Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]" >> >* >> Reply-To: *gpfsug main discussion list >> > >* >> Date: *Tuesday, August 16, 2016 at 8:46 PM* >> To: *gpfsug main discussion list > >* >> Subject: *[EXTERNAL] [gpfsug-discuss] Monitor NSD server queue? >> >> Hi Everyone, >> >> We ran into a rather interesting situation over the past week. We had >> a job that was pounding the ever loving crap out of one of our >> filesystems (called dnb02) doing about 15GB/s of reads. We had other >> jobs experience a slowdown on a different filesystem (called dnb41) >> that uses entirely separate backend storage. What I can't figure out >> is why this other filesystem was affected. I've checked IB bandwidth >> and congestion, Fibre channel bandwidth and errors, Ethernet bandwidth >> congestion, looked at the mmpmon nsd_ds counters (including disk >> request wait time), and checked out the disk iowait values from >> collectl. I simply can't account for the slowdown on the other >> filesystem. The only thing I can think of is the high latency on >> dnb02's NSDs caused the mmfsd NSD queues to back up. >> >> Here's my question-- how can I monitor the state of th NSD queues? I >> can't find anything in mmdiag. An mmfsadm saferdump NSD shows me the >> queues and their status. I'm just not sure calling saferdump NSD every >> 10 seconds to monitor this data is going to end well. I've seen >> saferdump NSD cause mmfsd to die and that's from a task we only run >> every 6 hours that calls saferdump NSD. >> >> Any thoughts/ideas here would be great. >> >> Thanks! 
>> >> -Aaron_______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> >> ------------------------------------------------------------------------ >> >> Note: This email is for the confidential use of the named addressee(s) >> only and may contain proprietary, confidential or privileged >> information. If you are not the intended recipient, you are hereby >> notified that any review, dissemination or copying of this email is >> strictly prohibited, and to please notify the sender immediately and >> destroy this email and any attachments. Email transmission cannot be >> guaranteed to be secure or error-free. The Company, therefore, does >> not make any guarantees as to the completeness or accuracy of this >> email or any attachments. This email is for informational purposes >> only and does not constitute a recommendation, offer, request or >> solicitation of any kind to buy, sell, subscribe, redeem or perform >> any type of transaction of a financial product. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From r.sobey at imperial.ac.uk Mon Aug 22 12:59:16 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Mon, 22 Aug 2016 11:59:16 +0000 Subject: [gpfsug-discuss] CES and mmuserauth command Message-ID: Hi all, We're just about to start testing a new CES 4.2.0 cluster and at the stage of "joining" the cluster to our AD. What's the bare minimum we need to get going with this? My Windows guy (who is more Linux but whatever) has suggested the following: mmuserauth service create --type ad --data-access-method file --netbios-name store --user-name USERNAME --password --enable-nfs-kerberos --enable-kerberos --servers list,of,servers --idmap-range-size 1000000 --idmap-range 3000000 - 3500000 --unixmap-domains 'DOMAIN(500 - 2000000)' He has also asked what the following is: --idmap-role ??? --idmap-range-size ?? All our LDAP GID/UIDs are coming from a system outside of GPFS so do we leave this blank, or say master Or, now I've re-read and mmuserauth page, is this purely for when you have AFM relationships and one GPFS cluster (the subordinate / the second cluster) gets its UIDs and GIDs from another GPFS cluster (the master / the first one)? For idmap-range-size is this essentially the highest number of users and groups you can have defined within Spectrum Scale? (I love how I'm using GPFS and SS interchangeably.. forgive me!) Many thanks Richard Richard Sobey Storage Area Network (SAN) Analyst Technical Operations, ICT Imperial College London South Kensington 403, City & Guilds Building London SW7 2AZ Tel: +44 (0)20 7594 6915 Email: r.sobey at imperial.ac.uk http://www.imperial.ac.uk/admin-services/ict/ -------------- next part -------------- An HTML attachment was scrubbed... 
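For reference, the overall shape of that join, plus a quick sanity check afterwards, would look roughly like the sketch below. The server name, the ranges and the DOMAIN placeholder are illustrative assumptions rather than recommended values, and the command should prompt for the bind account's password when --password is not supplied on the command line:

mmuserauth service create --data-access-method file --type ad \
    --netbios-name store --user-name USERNAME \
    --servers ad1.example.com \
    --idmap-role master \
    --idmap-range 10000000-299999999 --idmap-range-size 1000000 \
    --enable-nfs-kerberos \
    --unixmap-domains 'DOMAIN(500-2000000)'

# confirm what was configured and that AD users resolve on the protocol nodes
mmuserauth service list
id 'DOMAIN\jbloggs'    # hypothetical user; UID/GID should come from the AD RFC2307 attributes

The mmuserauth service list output confirms the configured server, ranges and unixmap domains, and the id lookup shows whether the UID and GID actually resolve from the directory rather than from an autorid allocation.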
URL: From r.sobey at imperial.ac.uk Mon Aug 22 14:28:01 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Mon, 22 Aug 2016 13:28:01 +0000 Subject: [gpfsug-discuss] CES mmsmb options Message-ID: Related to my previous question in so far as it's to do with CES, what's this all about: [root at ces]# mmsmb config change --key-info supported Supported smb options with allowed values: gpfs:dfreequota = yes, no restrict anonymous = 0, 2 server string = any mmsmb config list shows many more options. Are they static... for example log size / location / dmapi support? I'm surely missing something obvious. It's SS 4.2.0 btw. Thanks Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From taylorm at us.ibm.com Tue Aug 23 00:30:10 2016 From: taylorm at us.ibm.com (Michael L Taylor) Date: Mon, 22 Aug 2016 16:30:10 -0700 Subject: [gpfsug-discuss] CES mmsmb options In-Reply-To: References: Message-ID: Looks like there is a per export and a global listing. These are values that can be set per export : /usr/lpp/mmfs/bin/mmsmb export change --key-info supported Supported smb options with allowed values: admin users = any // any valid user browseable = yes, no comment = any // A free text description of the export. csc policy = manual, disable, documents, programs fileid:algorithm = fsname, hostname, fsname_nodirs, fsname_norootdir gpfs:leases = yes, no gpfs:recalls = yes, no gpfs:sharemodes = yes, no gpfs:syncio = yes, no hide unreadable = yes, no oplocks = yes, no posix locking = yes, no read only = yes, no smb encrypt = auto, default, mandatory, disabled syncops:onclose = yes, no These are the values that are set globally: /usr/lpp/mmfs/bin/mmsmb config change --key-info supported Supported smb options with allowed values: gpfs:dfreequota = yes, no restrict anonymous = 0, 2 server string = any -------------- next part -------------- An HTML attachment was scrubbed... URL: From mimarsh2 at vt.edu Tue Aug 23 03:23:40 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Mon, 22 Aug 2016 22:23:40 -0400 Subject: [gpfsug-discuss] GPFS FPO Message-ID: Does anyone have any experiences to share (good or bad) about setting up and utilizing FPO for hadoop compute on top of GPFS? -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Tue Aug 23 03:37:00 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 22 Aug 2016 22:37:00 -0400 Subject: [gpfsug-discuss] GPFS FPO In-Reply-To: References: Message-ID: Yes, indeed. Note that these are my personal opinions. It seems to work quite well and it's not terribly hard to set up or get running. That said, if you've got a traditional HPC cluster with reasonably good bandwidth (and especially if your data is already on the HPC cluster) I wouldn't bother with FPO and just use something like magpie (https://github.com/LLNL/magpie) to run your hadoopy workload on GPFS on your traditional HPC cluster. I believe FPO (and by extension data locality) is important when the available bandwidth between your clients and servers/disks (in a traditional GPFS environment) is less than the bandwidth available within a node (e.g. between your local disks and the host CPU). -Aaron On 8/22/16 10:23 PM, Brian Marshall wrote: > Does anyone have any experiences to share (good or bad) about setting up > and utilizing FPO for hadoop compute on top of GPFS? 
> > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From mimarsh2 at vt.edu Tue Aug 23 12:56:22 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Tue, 23 Aug 2016 07:56:22 -0400 Subject: [gpfsug-discuss] GPFS FPO In-Reply-To: References: Message-ID: Aaron, Do you have experience running this on native GPFS? The docs say Lustre and any NFS filesystem. Thanks, Brian On Aug 22, 2016 10:37 PM, "Aaron Knister" wrote: > Yes, indeed. Note that these are my personal opinions. > > It seems to work quite well and it's not terribly hard to set up or get > running. That said, if you've got a traditional HPC cluster with reasonably > good bandwidth (and especially if your data is already on the HPC cluster) > I wouldn't bother with FPO and just use something like magpie ( > https://github.com/LLNL/magpie) to run your hadoopy workload on GPFS on > your traditional HPC cluster. I believe FPO (and by extension data > locality) is important when the available bandwidth between your clients > and servers/disks (in a traditional GPFS environment) is less than the > bandwidth available within a node (e.g. between your local disks and the > host CPU). > > -Aaron > > On 8/22/16 10:23 PM, Brian Marshall wrote: > >> Does anyone have any experiences to share (good or bad) about setting up >> and utilizing FPO for hadoop compute on top of GPFS? >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From janfrode at tanso.net Tue Aug 23 13:15:24 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Tue, 23 Aug 2016 14:15:24 +0200 Subject: [gpfsug-discuss] CES and mmuserauth command In-Reply-To: References: Message-ID: Sorry to see no authoritative answers yet.. I'm doing lots of CES installations, but have not quite yet gotten the full understanding of this.. Simple stuff first: --servers You can only have one with AD. --enable-kerberos shouldn't be used, as that's only for LDAP according to the documentation. Guess kerberos is implied with AD. --idmap-role -- I've been using "master". Man-page says ID map role of a stand?alone or singular system deployment must be selected "master" What the idmap options seems to be doing is configure the idmap options for Samba. Maybe best explained by: https://wiki.samba.org/index.php/Idmap_config_ad Your suggested options will then give you the samba idmap configuration: idmap config * : rangesize = 1000000 idmap config * : range = 3000000-3500000 idmap config * : read only = no idmap:cache = no idmap config * : backend = autorid idmap config DOMAIN : schema_mode = rfc2307 idmap config DOMAIN : range = 500-2000000 idmap config DOMAIN : backend = ad Most likely you want to replace DOMAIN by your AD domain name.. 
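If you want to double check what actually lands in the CES Samba registry after running mmuserauth, a quick look on one of the protocol nodes is enough. This is only a sketch, assuming the bundled net utility under /usr/lpp/mmfs/bin as used elsewhere in this thread:

# dump the registry-based smb.conf settings and pick out the idmap lines
/usr/lpp/mmfs/bin/net conf list | grep -i 'idmap config'
# the same information from the Spectrum Scale side
mmuserauth service list

The idmap lines printed by net conf list should match the block above, with DOMAIN replaced by your AD domain name.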
So the --idmap options sets some defaults, that you probably won't care about, since all your users are likely covered by the specific "idmap config DOMAIN" config. Hope this helps somewhat, now I'll follow up with something I'm wondering myself...: Is the netbios name just a name, without any connection to anything in AD? Is the --user-name/--password a one-time used account that's only necessary when executing the mmuserauth command, or will it also be for communication between CES and AD while the services are running? -jf On Mon, Aug 22, 2016 at 1:59 PM, Sobey, Richard A wrote: > Hi all, > > > > We?re just about to start testing a new CES 4.2.0 cluster and at the stage > of ?joining? the cluster to our AD. What?s the bare minimum we need to get > going with this? My Windows guy (who is more Linux but whatever) has > suggested the following: > > > > mmuserauth service create --type ad --data-access-method file > > --netbios-name store --user-name USERNAME --password > > --enable-nfs-kerberos --enable-kerberos > > --servers list,of,servers > > --idmap-range-size 1000000 --idmap-range 3000000 - 3500000 > --unixmap-domains 'DOMAIN(500 - 2000000)' > > > > He has also asked what the following is: > > > > --idmap-role ??? > > --idmap-range-size ?? > > > > All our LDAP GID/UIDs are coming from a system outside of GPFS so do we > leave this blank, or say master Or, now I?ve re-read and mmuserauth page, > is this purely for when you have AFM relationships and one GPFS cluster > (the subordinate / the second cluster) gets its UIDs and GIDs from another > GPFS cluster (the master / the first one)? > > > > For idmap-range-size is this essentially the highest number of users and > groups you can have defined within Spectrum Scale? (I love how I?m using > GPFS and SS interchangeably.. forgive me!) > > > > Many thanks > > > > Richard > > > > > > Richard Sobey > > Storage Area Network (SAN) Analyst > Technical Operations, ICT > Imperial College London > South Kensington > 403, City & Guilds Building > London SW7 2AZ > Tel: +44 (0)20 7594 6915 > Email: r.sobey at imperial.ac.uk > http://www.imperial.ac.uk/admin-services/ict/ > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Tue Aug 23 14:58:17 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Tue, 23 Aug 2016 13:58:17 +0000 Subject: [gpfsug-discuss] Odd entries in quota listing Message-ID: In one of my file systems, I have some odd entries that seem to not be associated with a user - any ideas on the cause or how to track these down? This is a snippet from mmprepquota: Block Limits | File Limits Name type KB quota limit in_doubt grace | files quota limit in_doubt grace 2751555824 USR 0 1073741824 5368709120 0 none | 0 0 0 0 none 2348898617 USR 0 1073741824 5368709120 0 none | 0 0 0 0 none 2348895209 USR 0 1073741824 5368709120 0 none | 0 0 0 0 none 1610682073 USR 0 1073741824 5368709120 0 none | 0 0 0 0 none 536964752 USR 0 1073741824 5368709120 0 none | 0 0 0 0 none 403325529 USR 0 1073741824 5368709120 0 none | 0 0 0 0 none Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From jonathan at buzzard.me.uk Tue Aug 23 15:06:50 2016 From: jonathan at buzzard.me.uk (Jonathan Buzzard) Date: Tue, 23 Aug 2016 15:06:50 +0100 Subject: [gpfsug-discuss] Odd entries in quota listing In-Reply-To: References: Message-ID: <1471961210.30100.88.camel@buzzard.phy.strath.ac.uk> On Tue, 2016-08-23 at 13:58 +0000, Oesterlin, Robert wrote: > In one of my file systems, I have some odd entries that seem to not be > associated with a user - any ideas on the cause or how to track these > down? This is a snippet from mmprepquota: > > > > Block Limits > | File Limits > > Name type KB quota limit in_doubt > grace | files quota limit in_doubt grace > > 2751555824 USR 0 1073741824 5368709120 0 > none | 0 0 0 0 none > > 2348898617 USR 0 1073741824 5368709120 0 > none | 0 0 0 0 none > > 2348895209 USR 0 1073741824 5368709120 0 > none | 0 0 0 0 none > > 1610682073 USR 0 1073741824 5368709120 0 > none | 0 0 0 0 none > > 536964752 USR 0 1073741824 5368709120 0 > none | 0 0 0 0 none > > 403325529 USR 0 1073741824 5368709120 0 > none | 0 0 0 0 none > I am guessing they are quotas that have been set for users that are now deleted. GPFS stores the quota for a user under their UID, and deleting the user and all their data is not enough to remove the entry from the quota reporting, you also have to delete their quota. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. From Robert.Oesterlin at nuance.com Tue Aug 23 15:10:22 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Tue, 23 Aug 2016 14:10:22 +0000 Subject: [gpfsug-discuss] Odd entries in quota listing Message-ID: <93B0F53A-4ECD-4527-A67D-DD6C9B00F8E7@nuance.com> Well - good idea, but these large numbers in no way reflect valid ID numbers in our environment. Wondering how they got there? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: on behalf of Jonathan Buzzard Reply-To: gpfsug main discussion list Date: Tuesday, August 23, 2016 at 9:06 AM To: "gpfsug-discuss at spectrumscale.org" Subject: [EXTERNAL] Re: [gpfsug-discuss] Odd entries in quota listing I am guessing they are quotas that have been set for users that are now deleted. GPFS stores the quota for a user under their UID, and deleting the user and all their data is not enough to remove the entry from the quota reporting, you also have to delete their quota. -------------- next part -------------- An HTML attachment was scrubbed... URL: From jonathan at buzzard.me.uk Tue Aug 23 15:16:05 2016 From: jonathan at buzzard.me.uk (Jonathan Buzzard) Date: Tue, 23 Aug 2016 15:16:05 +0100 Subject: [gpfsug-discuss] Odd entries in quota listing In-Reply-To: <93B0F53A-4ECD-4527-A67D-DD6C9B00F8E7@nuance.com> References: <93B0F53A-4ECD-4527-A67D-DD6C9B00F8E7@nuance.com> Message-ID: <1471961765.30100.90.camel@buzzard.phy.strath.ac.uk> On Tue, 2016-08-23 at 14:10 +0000, Oesterlin, Robert wrote: > Well - good idea, but these large numbers in no way reflect valid ID > numbers in our environment. Wondering how they got there? > I was guessing generating UID's from Windows RID's? Alternatively some script generated them automatically and the UID's are bogus. You can create a quota for any random UID and GPFS won't complain. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. 
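One way to chase that down is to walk the per-user quota report and flag any entry whose name no longer resolves to an account. A rough sketch only: the filesystem name is a placeholder, and the exact mmedquota form for resetting an entry back to the default quota should be checked against the man page for your release:

# list USR quota entries and flag any whose name/UID has no matching account
mmrepquota -u gpfs01 | awk '$2 == "USR" {print $1}' | while read name; do
    getent passwd "$name" > /dev/null || echo "no matching account for quota entry: $name"
done

# for an entry that really is orphaned, re-establish the default quota for that id
mmedquota -d -u 2751555824

If the flagged ids line up with old RID-derived or script-generated UIDs, that would support the guesses above.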
From aaron.s.knister at nasa.gov Wed Aug 24 17:43:56 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Wed, 24 Aug 2016 12:43:56 -0400 Subject: [gpfsug-discuss] GPFS FPO In-Reply-To: References: Message-ID: <6f5a7284-c910-bbda-5e53-7f78e4289ad9@nasa.gov> To tell you the truth, I don't. It's on my radar but I haven't done it yet. I *have* run hadoop on GPFS w/o magpie though and on only a couple of nodes was able to pound 1GB/s out to GPFS w/ the terasort benchmark. I know our GPFS FS can go much faster than that but java was cpu-bound as it often seems to be. -Aaron On 8/23/16 7:56 AM, Brian Marshall wrote: > Aaron, > > Do you have experience running this on native GPFS? The docs say Lustre > and any NFS filesystem. > > Thanks, > Brian > > > On Aug 22, 2016 10:37 PM, "Aaron Knister" > wrote: > > Yes, indeed. Note that these are my personal opinions. > > It seems to work quite well and it's not terribly hard to set up or > get running. That said, if you've got a traditional HPC cluster with > reasonably good bandwidth (and especially if your data is already on > the HPC cluster) I wouldn't bother with FPO and just use something > like magpie (https://github.com/LLNL/magpie > ) to run your hadoopy workload on > GPFS on your traditional HPC cluster. I believe FPO (and by > extension data locality) is important when the available bandwidth > between your clients and servers/disks (in a traditional GPFS > environment) is less than the bandwidth available within a node > (e.g. between your local disks and the host CPU). > > -Aaron > > On 8/22/16 10:23 PM, Brian Marshall wrote: > > Does anyone have any experiences to share (good or bad) about > setting up > and utilizing FPO for hadoop compute on top of GPFS? > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From SAnderson at convergeone.com Thu Aug 25 17:32:48 2016 From: SAnderson at convergeone.com (Shaun Anderson) Date: Thu, 25 Aug 2016 16:32:48 +0000 Subject: [gpfsug-discuss] mmcessmbchconfig command Message-ID: <1472142769455.35752@convergeone.com> ?I'm not seeing many of the 'mmces' commands listed and there is no man page for many of them. I'm specifically looking at the mmcessmbchconfig command and my syntax isn't being taken. Any insight? SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From bbanister at jumptrading.com Thu Aug 25 17:47:00 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Thu, 25 Aug 2016 16:47:00 +0000 Subject: Re: [gpfsug-discuss] mmcessmbchconfig command In-Reply-To: <1472142769455.35752@convergeone.com> References: <1472142769455.35752@convergeone.com> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB0630BF86@CHI-EXCHANGEW1.w2k.jumptrading.com> My general rule is that if there isn't a man page or '-h' option to explain the usage of the command, then it isn't meant to be run by a user administrator.
I wish that the commands that should never be run by a user admin (or without direction from IBM support) would be put in a different directory that clearly indicated they are for internal GPFS use. RFE worthy? Cheers, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Shaun Anderson Sent: Thursday, August 25, 2016 11:33 AM To: gpfsug main discussion list > Subject: [gpfsug-discuss] mmcessmbchconfig command ?I'm not seeing many of the 'mmces' commands listed and there is no man page for many of them. I'm specifically looking at the mmcessmbchconfig command and my syntax isn't being taken. Any insight? SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it. ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. -------------- next part -------------- An HTML attachment was scrubbed... URL: From taylorm at us.ibm.com Thu Aug 25 17:55:44 2016 From: taylorm at us.ibm.com (Michael L Taylor) Date: Thu, 25 Aug 2016 09:55:44 -0700 Subject: [gpfsug-discuss] mmcessmbchconfig command In-Reply-To: References: Message-ID: Not sure where mmcessmbchconfig command is coming from? mmsmb is the proper CLI syntax [root at smaug-vm1 installer]# /usr/lpp/mmfs/bin/mmsmb Usage: mmsmb export Administer SMB exports. mmsmb exportacl Administer SMB export ACLs. mmsmb config Administer SMB global configuration. [root at smaug-vm1 installer]# /usr/lpp/mmfs/bin/mmsmb export -h Usage: mmsmb export list List SMB exports. mmsmb export add Add SMB exports. mmsmb export change Change SMB exports. mmsmb export remove Remove SMB exports. 
[root at smaug-vm1 installer]# man mmsmb
http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_mmsmb.htm
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From mweil at wustl.edu  Thu Aug 25 19:50:52 2016
From: mweil at wustl.edu (Matt Weil)
Date: Thu, 25 Aug 2016 13:50:52 -0500
Subject: [gpfsug-discuss] Backup on object stores
Message-ID: <5cc4ae43-2d0f-e548-b256-84f1890fe2d3@wustl.edu>

Hello all,

Just brainstorming here mainly, but I want to know how you are all approaching this. Do you replicate using GPFS and forget about backups?

> https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adv_osbackup.htm

This seems good for a full recovery, but what if I just lost one object? It seems that if the objectizer is in use, then both Tivoli backup and space management can be used on the file.

Thanks in advance for your responses.

Matt

________________________________
The materials in this message are private and may contain Protected Healthcare Information or other information of a sensitive nature. If you are not the intended recipient, be advised that any unauthorized use, disclosure, copying or the taking of any action in reliance on the contents of this information is strictly prohibited. If you have received this email in error, please immediately notify the sender via telephone or return mail.

From billowen at us.ibm.com  Thu Aug 25 20:55:33 2016
From: billowen at us.ibm.com (Bill Owen)
Date: Thu, 25 Aug 2016 12:55:33 -0700
Subject: [gpfsug-discuss] Backup on object stores
In-Reply-To: <5cc4ae43-2d0f-e548-b256-84f1890fe2d3@wustl.edu>
References: <5cc4ae43-2d0f-e548-b256-84f1890fe2d3@wustl.edu>
Message-ID: 

Hi Matt,
With Spectrum Scale object storage, you can create storage policies, and then assign containers to those policies. Each policy will map to a GPFS independent fileset. That way, you can subdivide object storage and manage different types of objects based on the type of data stored in the container/storage policy (i.e., back up some types of object data nightly, some weekly, some not at all).

Today, we don't have a CLI to simplify restoring individual objects. But using commands like swift-get-nodes, you can determine the filesystem path to an object, and then restore only that item. And if you are using storage policies with file & object access enabled, you can access the object/files by file path directly.

Regards,
Bill Owen
billowen at us.ibm.com
Spectrum Scale Object Storage
520-799-4829
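To make that manual path concrete, a rough sketch follows. The ring file location, account, container and object names are placeholders, and the on-disk path is illustrative only; take the real one from the swift-get-nodes output on your own cluster:

    # map an object to the servers and on-disk partition path that hold it
    swift-get-nodes /etc/swift/object.ring.gz AUTH_myaccount mycontainer myobject

    # the output lists, among other things, an "ls" command with a path such as
    #   .../objects/<partition>/<suffix>/<hash>/<timestamp>.data
    # which on a Spectrum Scale object deployment sits inside the object fileset

    # that single file can then be restored with the normal TSM/Spectrum Protect client
    dsmc restore "/ibm/gpfs0/object_fileset/o/z1device1/objects/<partition>/<suffix>/<hash>/<timestamp>.data"

With file & object access enabled, as Bill notes, the simpler route is to restore the file directly by its file path.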
_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: graycol.gif
Type: image/gif
Size: 105 bytes
Desc: not available
URL: 

From Greg.Lehmann at csiro.au  Fri Aug 26 00:14:57 2016
From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au)
Date: Thu, 25 Aug 2016 23:14:57 +0000
Subject: [gpfsug-discuss] mmcessmbchconfig command
In-Reply-To: <21BC488F0AEA2245B2C3E83FC0B33DBB0630BF86@CHI-EXCHANGEW1.w2k.jumptrading.com>
References: <1472142769455.35752@convergeone.com> <21BC488F0AEA2245B2C3E83FC0B33DBB0630BF86@CHI-EXCHANGEW1.w2k.jumptrading.com>
Message-ID: <156b078bfb2d48d8b77d5250dba7e928@exch1-cdc.nexus.csiro.au>

I agree with an RFE.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From syi at ca.ibm.com  Fri Aug 26 00:15:46 2016
From: syi at ca.ibm.com (Yi Sun)
Date: Thu, 25 Aug 2016 19:15:46 -0400
Subject: [gpfsug-discuss] mmcessmbchconfig command
In-Reply-To: 
References: 
Message-ID: 

You may check the mmsmb command; not sure if it is what you are looking for.
https://www.ibm.com/support/knowledgecenter/STXKQY_4.1.1/com.ibm.spectrum.scale.v4r11.adm.doc/bl1adm_mmsmb.htm#mmsmb

------------------------------------------------------------------------

From: Shaun Anderson
To: gpfsug main discussion list
Subject: [gpfsug-discuss] mmcessmbchconfig command
Message-ID: <1472142769455.35752 at convergeone.com>
Content-Type: text/plain; charset="iso-8859-1"

I'm not seeing many of the 'mmces' commands listed and there is no man page for many of them. I'm specifically looking at the mmcessmbchconfig command and my syntax isn't being taken. Any insight?

SHAUN ANDERSON
STORAGE ARCHITECT
O 208.577.2112
M 214.263.7014

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From christof.schmitt at us.ibm.com  Fri Aug 26 00:49:12 2016
From: christof.schmitt at us.ibm.com (Christof Schmitt)
Date: Thu, 25 Aug 2016 19:49:12 -0400
Subject: [gpfsug-discuss] CES and mmuserauth command
In-Reply-To: 
References: 
Message-ID: 

To clarify and expand on some of these:

--servers takes the AD Domain Controller that is contacted first during configuration. Later, and during normal operations, the list of DCs is retrieved from DNS and the fastest one (or the closest one according to the AD sites) is used. The DC used initially does not have a special role.

--idmap-role allows dedicating one cluster as a master, and a second cluster (e.g. an AFM replication target) as "subordinate". Only the master will allocate idmap ranges, which can then be imported to the subordinate to have consistent id mappings.

--idmap-range-size and --idmap-range are used for the internal idmap allocation, which is used for every domain that is not explicitly using another domain. "man idmap_autorid" explains the approach taken. As long as the default does not overlap with any other ids, that can be used.

The "netbios" name is used to create the machine account for the cluster when joining the AD domain. That is how the AD administrator will identify the CES cluster. It is also important in SMB deployments when Kerberos should be used with SMB: the same name as the netbios name has to be defined in DNS for the public CES IP addresses. When the name matches, SMB clients can acquire a Kerberos ticket from AD to establish an SMB connection.

When joining the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining).

Regards,

Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ
christof.schmitt at us.ibm.com  ||  +1-520-799-2469    (T/L: 321-2469)

From: Jan-Frode Myklebust
To: gpfsug main discussion list
Date: 08/23/2016 08:15 AM
Subject: Re: [gpfsug-discuss] CES and mmuserauth command
Sent by: gpfsug-discuss-bounces at spectrumscale.org

Sorry to see no authoritative answers yet.. I'm doing lots of CES installations, but have not quite yet gotten the full understanding of this..
Simple stuff first: --servers You can only have one with AD. --enable-kerberos shouldn't be used, as that's only for LDAP according to the documentation. Guess kerberos is implied with AD. --idmap-role -- I've been using "master". Man-page says ID map role of a stand?alone or singular system deployment must be selected "master" What the idmap options seems to be doing is configure the idmap options for Samba. Maybe best explained by: https://wiki.samba.org/index.php/Idmap_config_ad Your suggested options will then give you the samba idmap configuration: idmap config * : rangesize = 1000000 idmap config * : range = 3000000-3500000 idmap config * : read only = no idmap:cache = no idmap config * : backend = autorid idmap config DOMAIN : schema_mode = rfc2307 idmap config DOMAIN : range = 500-2000000 idmap config DOMAIN : backend = ad Most likely you want to replace DOMAIN by your AD domain name.. So the --idmap options sets some defaults, that you probably won't care about, since all your users are likely covered by the specific "idmap config DOMAIN" config. Hope this helps somewhat, now I'll follow up with something I'm wondering myself...: Is the netbios name just a name, without any connection to anything in AD? Is the --user-name/--password a one-time used account that's only necessary when executing the mmuserauth command, or will it also be for communication between CES and AD while the services are running? -jf On Mon, Aug 22, 2016 at 1:59 PM, Sobey, Richard A wrote: Hi all, We?re just about to start testing a new CES 4.2.0 cluster and at the stage of ?joining? the cluster to our AD. What?s the bare minimum we need to get going with this? My Windows guy (who is more Linux but whatever) has suggested the following: mmuserauth service create --type ad --data-access-method file --netbios-name store --user-name USERNAME --password --enable-nfs-kerberos --enable-kerberos --servers list,of,servers --idmap-range-size 1000000 --idmap-range 3000000 - 3500000 --unixmap-domains 'DOMAIN(500 - 2000000)' He has also asked what the following is: --idmap-role ??? --idmap-range-size ?? All our LDAP GID/UIDs are coming from a system outside of GPFS so do we leave this blank, or say master Or, now I?ve re-read and mmuserauth page, is this purely for when you have AFM relationships and one GPFS cluster (the subordinate / the second cluster) gets its UIDs and GIDs from another GPFS cluster (the master / the first one)? For idmap-range-size is this essentially the highest number of users and groups you can have defined within Spectrum Scale? (I love how I?m using GPFS and SS interchangeably.. forgive me!) 
Many thanks Richard Richard Sobey Storage Area Network (SAN) Analyst Technical Operations, ICT Imperial College London South Kensington 403, City & Guilds Building London SW7 2AZ Tel: +44 (0)20 7594 6915 Email: r.sobey at imperial.ac.uk http://www.imperial.ac.uk/admin-services/ict/ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From christof.schmitt at us.ibm.com Fri Aug 26 00:49:12 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Thu, 25 Aug 2016 19:49:12 -0400 Subject: [gpfsug-discuss] mmcessmbchconfig command In-Reply-To: <1472142769455.35752@convergeone.com> References: <1472142769455.35752@convergeone.com> Message-ID: The mmcessmb* commands are scripts that are run from the corresponding mmsmb subcommands. mmsmb is documented and should be used instead of calling the mmcesmb* scripts directly. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Shaun Anderson To: gpfsug main discussion list Date: 08/25/2016 12:33 PM Subject: [gpfsug-discuss] mmcessmbchconfig command Sent by: gpfsug-discuss-bounces at spectrumscale.org ?I'm not seeing many of the 'mmces' commands listed and there is no man page for many of them. I'm specifically looking at the mmcessmbchconfig command and my syntax isn't being taken. Any insight? SHAUN ANDERSON STORAGE ARCHITECT O 208.577.2112 M 214.263.7014 NOTICE: This email message and any attachments here to may contain confidential information. Any unauthorized review, use, disclosure, or distribution of such information is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy the original message and all copies of it._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From christof.schmitt at us.ibm.com Fri Aug 26 00:52:50 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Thu, 25 Aug 2016 19:52:50 -0400 Subject: [gpfsug-discuss] CES mmsmb options In-Reply-To: References: Message-ID: The options listed in " mmsmb config change --key-info supported" are supported to be changed by administrator of the cluster. "mmsmb config list" lists the whole Samba config, including the options that are set internally. We do not want to support any random Samba configuration, hence the line between "supported" option and everything else. If there is a usecase that requires other Samba options than the ones listed as "supported", one way forward would be opening a RFE that describes the usecase and the Samba option to support it. 
Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: "Sobey, Richard A" To: "'gpfsug-discuss at spectrumscale.org'" Date: 08/22/2016 09:28 AM Subject: [gpfsug-discuss] CES mmsmb options Sent by: gpfsug-discuss-bounces at spectrumscale.org Related to my previous question in so far as it?s to do with CES, what?s this all about: [root at ces]# mmsmb config change --key-info supported Supported smb options with allowed values: gpfs:dfreequota = yes, no restrict anonymous = 0, 2 server string = any mmsmb config list shows many more options. Are they static? for example log size / location / dmapi support? I?m surely missing something obvious. It?s SS 4.2.0 btw. Thanks Richard_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From gaurang.tapase at in.ibm.com Fri Aug 26 08:53:12 2016 From: gaurang.tapase at in.ibm.com (Gaurang Tapase) Date: Fri, 26 Aug 2016 13:23:12 +0530 Subject: [gpfsug-discuss] Blogs and publications on Spectrum Scale Message-ID: Hello, On Request from Bob Oesterlin, we post these links on User Group - Here are the latest publications and Blogs on Spectrum Scale. We encourage the User Group to follow the Spectrum Scale blogs on the http://storagecommunity.org or the Usergroup admin to register the email group of the feeds. A total of 25 recent Blogs on IBM Spectrum Scale by developers IBM Spectrum Scale Security IBM Spectrum Scale: Security Blog Series http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-blog-series , Spectrum Scale Security Blog Series: Introduction, http://storagecommunity.org/easyblog/entry/spectrum-scale-security-blog-series-introduction IBM Spectrum Scale Security: VLANs and Protocol nodes, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-vlans-and-protocol-nodes IBM Spectrum Scale Security: Firewall Overview http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-firewall-overview IBM Spectrum Scale Security Blog Series: Security with Spectrum Scale OpenStack Storage Drivers http://storagecommunity.org/easyblog/entry/security-with-spectrum-scale-openstack-storage-drivers , IBM Spectrum Scale Security Blog Series: Authorization http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-blog-series-authorization IBM Spectrum Scale: Object (OpenStack Swift, S3) Authorization, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-object-openstack-swift-s3-authorization , IBM Spectrum Scale Security: Secure Data at Rest, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-secure-data-at-rest IBM Spectrum Scale Security Blog Series: Secure Data in Transit, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-blog-series-secure-data-in-transit-1 IBM Spectrum Scale Security Blog Series: Sudo based Secure Administration and Admin Command Logging, http://storagecommunity.org/easyblog/entry/spectrum-scale-security-blog-series-sudo-based-secure-administration-and-admin-command-logging IBM Spectrum Scale Security: Security Features of Transparent Cloud Tiering (TCT), http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-security-features-of-transparent-cloud-tiering-tct IBM Spectrum Scale: Immutability, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-immutability IBM Spectrum Scale : FILE protocols authentication 
http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-file-protocols-authentication IBM Spectrum Scale : Object Authentication, http://storagecommunity.org/easyblog/entry/protocol-authentication-object, IBM Spectrum Scale Security: Anti-Virus bulk scanning, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-security-anti-virus-bulk-scanning , Spectrum Scale 4.2.1 - What's New http://storagecommunity.org/easyblog/entry/spectrum-scale-4-2-1-what-s-new IBM Spectrum Scale 4.2.1 : diving deeper, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-4-2-1-diving-deeper NEW DEMO: Using IBM Cloud Object Storage as IBM Spectrum Scale Transparent Cloud Tier, http://storagecommunity.org/easyblog/entry/new-demo-using-ibm-cloud-object-storage-as-ibm-spectrum-scale-transparent-cloud-tier Spectrum Scale transparent cloud tiering, http://storagecommunity.org/easyblog/entry/spectrum-scale-transparent-cloud-tiering Spectrum Scale in Wonderland - Introducing transparent cloud tiering with Spectrum Scale 4.2.1, http://storagecommunity.org/easyblog/entry/spectrum-scale-in-wonderland, Spectrum Scale Object Related Blogs IBM Spectrum Scale 4.2.1 - What's new in Object, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-4-2-1-what-s-new-in-object , Hot cakes or hot objects, they better be served fast http://storagecommunity.org/easyblog/entry/hot-cakes-or-hot-objects-they-better-be-served-fast IBM Spectrum Scale: Object (OpenStack Swift, S3) Authorization, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-object-openstack-swift-s3-authorization , IBM Spectrum Scale : Object Authentication, http://storagecommunity.org/easyblog/entry/protocol-authentication-object, Spectrum Scale BD&A IBM Spectrum Scale: new features of HDFS Transparency, http://storagecommunity.org/easyblog/entry/ibm-spectrum-scale-new-features-of-hdfs-transparency , Regards, ------------------------------------------------------------------------ Gaurang S Tapase Spectrum Scale & OpenStack Development IBM India Storage Lab, Pune (India) Email : gaurang.tapase at in.ibm.com Phone : +91-20-42025699 (W), +91-9860082042(Cell) ------------------------------------------------------------------------- -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Fri Aug 26 09:17:55 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 26 Aug 2016 08:17:55 +0000 Subject: [gpfsug-discuss] CES mmsmb options In-Reply-To: References: Message-ID: Thanks Christof, and for the detailed posting on the mmuserauth settings. I do not know why we have changed dmapi support in our existing smb.conf, but perhaps it was for some legacy stuff. Richard -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Christof Schmitt Sent: 26 August 2016 00:53 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] CES mmsmb options The options listed in " mmsmb config change --key-info supported" are supported to be changed by administrator of the cluster. "mmsmb config list" lists the whole Samba config, including the options that are set internally. We do not want to support any random Samba configuration, hence the line between "supported" option and everything else. If there is a usecase that requires other Samba options than the ones listed as "supported", one way forward would be opening a RFE that describes the usecase and the Samba option to support it. 
Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: "Sobey, Richard A" To: "'gpfsug-discuss at spectrumscale.org'" Date: 08/22/2016 09:28 AM Subject: [gpfsug-discuss] CES mmsmb options Sent by: gpfsug-discuss-bounces at spectrumscale.org Related to my previous question in so far as it?s to do with CES, what?s this all about: [root at ces]# mmsmb config change --key-info supported Supported smb options with allowed values: gpfs:dfreequota = yes, no restrict anonymous = 0, 2 server string = any mmsmb config list shows many more options. Are they static? for example log size / location / dmapi support? I?m surely missing something obvious. It?s SS 4.2.0 btw. Thanks Richard_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From r.sobey at imperial.ac.uk Fri Aug 26 09:48:24 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 26 Aug 2016 08:48:24 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Message-ID: Sorry all, prepare for a deluge of emails like this, hopefully it'll help other people implementing CES in the future. I'm trying to stop SMB on a node, but getting the following output: [root at cesnode ~]# mmces service stop smb smb: Request denied. Please stop NFS first [root at cesnode ~]# mmces service list Enabled services: SMB SMB is running As you can see there is no way to stop NFS when it's not running but it seems to be blocking me. It's happening on all the nodes in the cluster. SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From janfrode at tanso.net Fri Aug 26 10:48:18 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Fri, 26 Aug 2016 09:48:18 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: Message-ID: That was a weird one :-) Don't understand why NFS would block smb.., and I don't see that on my cluster. Would it make sense to suspend the node instead? As a workaround. mmces node suspend -jf fre. 26. aug. 2016 kl. 10.48 skrev Sobey, Richard A : > Sorry all, prepare for a deluge of emails like this, hopefully it?ll help > other people implementing CES in the future. > > > > I?m trying to stop SMB on a node, but getting the following output: > > > > [root at cesnode ~]# mmces service stop smb > > smb: Request denied. Please stop NFS first > > > > [root at cesnode ~]# mmces service list > > Enabled services: SMB > > SMB is running > > > > As you can see there is no way to stop NFS when it?s not running but it > seems to be blocking me. It?s happening on all the nodes in the cluster. > > > > SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. > > > > Richard > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From konstantin.arnold at unibas.ch Fri Aug 26 10:56:28 2016 From: konstantin.arnold at unibas.ch (Konstantin Arnold) Date: Fri, 26 Aug 2016 11:56:28 +0200 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? 
In-Reply-To: References: Message-ID: <57C0124C.7050404@unibas.ch> Hi Richard, I ran into the same issue and asked if 'systemctl reload gpfs-smb.service' would work? I got the following answer: "... Now in regards to your question about stopping NFS, yes this is an expected behavior and yes you could also restart through systemctl." Maybe that helps. Konstantin From janfrode at tanso.net Fri Aug 26 10:59:34 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Fri, 26 Aug 2016 11:59:34 +0200 Subject: [gpfsug-discuss] CES and mmuserauth command In-Reply-To: References: Message-ID: On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: > > When joinging the AD domain, --user-name, --password and --server are only > used to initially identify and logon to the AD and to create the machine > account for the cluster. Once that is done, that information is no longer > used, and e.g. the account from --user-name could be deleted, the password > changed or the specified DC could be removed from the domain (as long as > other DCs are remaining). > > That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: ------------------------------------------------------------------------------------------------------ ??user?name userName Specifies the user name to be used to perform operations against the authentication server. The specified user name must have sufficient permissions to read user and group attributes from the authentication server. ------------------------------------------------------------------------------------------------------- Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? -jf -------------- next part -------------- An HTML attachment was scrubbed... URL: From christof.schmitt at us.ibm.com Fri Aug 26 17:29:31 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Fri, 26 Aug 2016 12:29:31 -0400 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: Message-ID: That would be the case when Active Directory is configured for authentication. In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: "Sobey, Richard A" To: "'gpfsug-discuss at spectrumscale.org'" Date: 08/26/2016 04:48 AM Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Sent by: gpfsug-discuss-bounces at spectrumscale.org Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. I?m trying to stop SMB on a node, but getting the following output: [root at cesnode ~]# mmces service stop smb smb: Request denied. Please stop NFS first [root at cesnode ~]# mmces service list Enabled services: SMB SMB is running As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. It?s happening on all the nodes in the cluster. SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. 
Richard
_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss

From christof.schmitt at us.ibm.com  Fri Aug 26 17:29:31 2016
From: christof.schmitt at us.ibm.com (Christof Schmitt)
Date: Fri, 26 Aug 2016 12:29:31 -0400
Subject: [gpfsug-discuss] CES and mmuserauth command
In-Reply-To: 
References: 
Message-ID: 

The --user-name option applies to both AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case.

The same applies to the information shown in "service list". There is a common field that holds the information, and the parameter from the initial "service create" is stored there. The meaning is different for AD and LDAP: for LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created.

Regards,

Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ
christof.schmitt at us.ibm.com  ||  +1-520-799-2469    (T/L: 321-2469)

From: Jan-Frode Myklebust
To: gpfsug main discussion list
Date: 08/26/2016 05:59 AM
Subject: Re: [gpfsug-discuss] CES and mmuserauth command
Sent by: gpfsug-discuss-bounces at spectrumscale.org

That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to connect to AD to do user and group lookups. Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only something that was used at configuration time..?

-jf
_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss

From dacalder at co.ibm.com  Sat Aug 27 13:52:44 2016
From: dacalder at co.ibm.com (Danny Alexander Calderon Rodriguez)
Date: Sat, 27 Aug 2016 12:52:44 +0000
Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first?
In-Reply-To: 
Message-ID: 

Hi Richard

This is fixed in release 4.2.1; if you can't upgrade now, you can fix it manually. Just do this:

Edit the file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py and change

    if authType == 'ad' and not nodeState.nfsStopped:

to

    nfsEnabled = utils.isProtocolEnabled("NFS", self.logger)
    if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled:

You need to stop and restart the GPFS service on each node where you apply the change. After changing the lines, use the tab key so the indentation matches the surrounding code.

Sent from my iPhone
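For anyone applying this by hand, a conservative sequence might look like the following. It is only a sketch, the node name is a placeholder, and since it edits an IBM-supplied file you should keep the backup around and expect to drop the change after upgrading to 4.2.1:

    # keep a pristine copy before editing
    cp -p /usr/lpp/mmfs/lib/mmcesmon/SMBService.py /usr/lpp/mmfs/lib/mmcesmon/SMBService.py.orig

    # apply the two-line change described above, preserving indentation
    vi /usr/lpp/mmfs/lib/mmcesmon/SMBService.py

    # restart GPFS on that node so the change takes effect
    mmshutdown -N cesnode1
    mmstartup -N cesnode1

    # afterwards, stopping SMB without NFS should no longer be refused
    mmces service stop smb

Repeat on each CES node, as noted above, since the file is local to every node.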
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From r.sobey at imperial.ac.uk  Sat Aug 27 20:06:45 2016
From: r.sobey at imperial.ac.uk (Sobey, Richard A)
Date: Sat, 27 Aug 2016 19:06:45 +0000
Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first?
In-Reply-To: 
References: 
Message-ID: 

Hi,

Thanks for the info! I think I'll perform an upgrade to 4.2.1; the cluster is still in a pre-production state and I've yet to really start testing client access.

Richard

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From Greg.Lehmann at csiro.au  Mon Aug 29 00:57:21 2016
From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au)
Date: Sun, 28 Aug 2016 23:57:21 +0000
Subject: [gpfsug-discuss] Blogs and publications on Spectrum Scale
In-Reply-To: 
References: 
Message-ID: <57496841ec784222b5e291a921280c38@exch1-cdc.nexus.csiro.au>

It would be nice if the Spectrum Scale User Group website had links to these, perhaps a separate page for blog links.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From douglasof at us.ibm.com  Mon Aug 29 06:34:03 2016
From: douglasof at us.ibm.com (Douglas O'flaherty)
Date: Sun, 28 Aug 2016 22:34:03 -0700
Subject: [gpfsug-discuss] Edge Attendees
Message-ID: 

Greetings:

I am organizing an NDA round-table with the IBM Offering Managers at IBM Edge on Tuesday, September 20th at 1pm. The subject will be "The Future of IBM Spectrum Scale." IBM Offering Managers are the Product Owners at IBM. There will be discussions covering licensing, the roadmap for IBM Spectrum Scale RAID (aka GNR), new hardware platforms, etc. This is a unique opportunity to get feedback to the drivers of the IBM Spectrum Scale business plans. It should be a great companion to the content we get from Engineering and Research at most User Group meetings.

To get an invitation, please email me privately at douglasof us.ibm.com. All who have a valid NDA are invited. I only need an approximate headcount of attendees. Try not to spam the mailing list.

I am pushing to get the Offering Managers to have a similar session at SC16 as an IBM Multi-client Briefing. You can add your voice to that call on this thread, or email me directly.

Spectrum Scale User Group at SC16 will once again take place on Sunday afternoon, with cocktails to follow. I hope we can blow out the attendance numbers and the number of site speakers we had last year! I know Simon, Bob, and Kristy are already working the agenda. Get your ideas in to them or to me.

See you in Vegas, Vegas, SLC, Vegas this Fall... Maybe Australia in between?

doug

Douglas O'Flaherty
IBM Spectrum Storage Marketing

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From stef.coene at docum.org  Mon Aug 29 07:39:05 2016
From: stef.coene at docum.org (Stef Coene)
Date: Mon, 29 Aug 2016 08:39:05 +0200
Subject: [gpfsug-discuss] Blogs and publications on Spectrum Scale
In-Reply-To: 
References: 
Message-ID: <9bb8d52e-86a3-3ff7-daaf-dc6bf0a3bd82@docum.org>

Hi,

Each time I try to register on the website, I get the error:
"Session expired. Please try again later."

Stef

From kraemerf at de.ibm.com  Mon Aug 29 08:20:46 2016
From: kraemerf at de.ibm.com (Frank Kraemer)
Date: Mon, 29 Aug 2016 09:20:46 +0200
Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection"
Message-ID: 

Hi all,

In the last months several customers were asking for the option to use multiple IBM Spectrum Protect servers to protect a single IBM Spectrum Scale file system. Some of these customers reached the server scalability limits, others wanted to increase the parallelism of the server housekeeping processes.
In consideration of the significant grow of data it can be assumed that more and more customers will be faced with this challenge in the future. Therefore, this paper was written that helps to address this situation. This paper describes the setup and configuration of multiple IBM Spectrum Protect servers to be used to store backup and hsm data of a single IBM Spectrum Scale file system. Beside the setup and configuration several best practices were written to the paper that help to simplify the daily use and administration of such environments. Find the paper here: https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection A big THANK YOU goes to my co-writers Thomas Schreiber and Patrick Luft for their important input and all the tests (...and re-tests and re-tests and re-tests :-) ) they did. ...please share in your communities. Greetings, Dominic. ______________________________________________________________________________________________________________ Dominic Mueller-Wicke | IBM Spectrum Protect Development | Technical Lead | +49 7034 64 32794 | dominic.mueller at de.ibm.com Vorsitzende des Aufsichtsrats: Martina Koederitz; Gesch?ftsf?hrung: Dirk Wittkopp Sitz der Gesellschaft: B?blingen; Registergericht: Amtsgericht Stuttgart, HRB 243294 -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Mon Aug 29 18:33:59 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 29 Aug 2016 13:33:59 -0400 Subject: [gpfsug-discuss] iowait? Message-ID: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> Hi Everyone, Would it be easy to have GPFS report iowait values in linux? This would be a huge help for us in determining whether a node's low utilization is due to some issue with the code running on it or if it's blocked on I/O, especially in a historical context. I naively tried on a test system changing schedule() in cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: again: /* call the scheduler */ if ( waitFlags & INTERRUPTIBLE ) schedule(); else io_schedule(); Seems to actually do what I'm after but generally bad things happen when I start pretending I'm a kernel developer. Any thoughts? If I open an RFE would this be something that's relatively easy to implement (not asking for a commitment *to* implement it, just that I'm not asking for something seemingly simple that's actually fairly hard to implement)? -Aaron -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From chekh at stanford.edu Mon Aug 29 18:50:23 2016 From: chekh at stanford.edu (Alex Chekholko) Date: Mon, 29 Aug 2016 10:50:23 -0700 Subject: [gpfsug-discuss] iowait? In-Reply-To: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> Message-ID: <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> Any reason you can't just use iostat or collectl or any of a number of other standards tools to look at disk utilization? On 08/29/2016 10:33 AM, Aaron Knister wrote: > Hi Everyone, > > Would it be easy to have GPFS report iowait values in linux? This would > be a huge help for us in determining whether a node's low utilization is > due to some issue with the code running on it or if it's blocked on I/O, > especially in a historical context. 
> > I naively tried on a test system changing schedule() in > cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: > > again: > /* call the scheduler */ > if ( waitFlags & INTERRUPTIBLE ) > schedule(); > else > io_schedule(); > > Seems to actually do what I'm after but generally bad things happen when > I start pretending I'm a kernel developer. > > Any thoughts? If I open an RFE would this be something that's relatively > easy to implement (not asking for a commitment *to* implement it, just > that I'm not asking for something seemingly simple that's actually > fairly hard to implement)? > > -Aaron > -- Alex Chekholko chekh at stanford.edu From aaron.s.knister at nasa.gov Mon Aug 29 18:54:12 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 29 Aug 2016 13:54:12 -0400 Subject: [gpfsug-discuss] iowait? In-Reply-To: <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> Message-ID: <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. On 8/29/16 1:50 PM, Alex Chekholko wrote: > Any reason you can't just use iostat or collectl or any of a number of > other standards tools to look at disk utilization? > > On 08/29/2016 10:33 AM, Aaron Knister wrote: >> Hi Everyone, >> >> Would it be easy to have GPFS report iowait values in linux? This would >> be a huge help for us in determining whether a node's low utilization is >> due to some issue with the code running on it or if it's blocked on I/O, >> especially in a historical context. >> >> I naively tried on a test system changing schedule() in >> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >> >> again: >> /* call the scheduler */ >> if ( waitFlags & INTERRUPTIBLE ) >> schedule(); >> else >> io_schedule(); >> >> Seems to actually do what I'm after but generally bad things happen when >> I start pretending I'm a kernel developer. >> >> Any thoughts? If I open an RFE would this be something that's relatively >> easy to implement (not asking for a commitment *to* implement it, just >> that I'm not asking for something seemingly simple that's actually >> fairly hard to implement)? >> >> -Aaron >> > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From bbanister at jumptrading.com Mon Aug 29 18:56:25 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Mon, 29 Aug 2016 17:56:25 +0000 Subject: [gpfsug-discuss] iowait? 
In-Reply-To: <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> There is the iohist data that may have what you're looking for, -Bryan -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister Sent: Monday, August 29, 2016 12:54 PM To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] iowait? Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. On 8/29/16 1:50 PM, Alex Chekholko wrote: > Any reason you can't just use iostat or collectl or any of a number of > other standards tools to look at disk utilization? > > On 08/29/2016 10:33 AM, Aaron Knister wrote: >> Hi Everyone, >> >> Would it be easy to have GPFS report iowait values in linux? This >> would be a huge help for us in determining whether a node's low >> utilization is due to some issue with the code running on it or if >> it's blocked on I/O, especially in a historical context. >> >> I naively tried on a test system changing schedule() in >> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >> >> again: >> /* call the scheduler */ >> if ( waitFlags & INTERRUPTIBLE ) >> schedule(); >> else >> io_schedule(); >> >> Seems to actually do what I'm after but generally bad things happen >> when I start pretending I'm a kernel developer. >> >> Any thoughts? If I open an RFE would this be something that's >> relatively easy to implement (not asking for a commitment *to* >> implement it, just that I'm not asking for something seemingly simple >> that's actually fairly hard to implement)? >> >> -Aaron >> > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. From olaf.weiser at de.ibm.com Mon Aug 29 19:02:38 2016 From: olaf.weiser at de.ibm.com (Olaf Weiser) Date: Mon, 29 Aug 2016 20:02:38 +0200 Subject: [gpfsug-discuss] iowait? 
In-Reply-To: <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov><7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> Message-ID: An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Mon Aug 29 19:04:32 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 29 Aug 2016 14:04:32 -0400 Subject: Re: [gpfsug-discuss] iowait? In-Reply-To: <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: <7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> That's an interesting idea. I took a look at mmdiag --iohist on a busy node and it doesn't seem to capture more than literally 1 second of history. Is there a better way to grab the data or have gpfs capture more of it? Just to give some more context, as part of our monthly reporting requirements we calculate job efficiency by comparing the number of cpu cores requested by a given job with the cpu % utilization during that job's time window. Currently a job that's doing a sleep 9000 would show up the same as a job blocked on I/O. Having GPFS wait time included in iowait would allow us to easily make this distinction. -Aaron On 8/29/16 1:56 PM, Bryan Banister wrote: > There is the iohist data that may have what you're looking for, > -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister > Sent: Monday, August 29, 2016 12:54 PM > To: gpfsug-discuss at spectrumscale.org > Subject: Re: [gpfsug-discuss] iowait? > > Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. > > On 8/29/16 1:50 PM, Alex Chekholko wrote: >> Any reason you can't just use iostat or collectl or any of a number of >> other standards tools to look at disk utilization? >> >> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>> Hi Everyone, >>> >>> Would it be easy to have GPFS report iowait values in linux? This >>> would be a huge help for us in determining whether a node's low >>> utilization is due to some issue with the code running on it or if >>> it's blocked on I/O, especially in a historical context. >>> >>> I naively tried on a test system changing schedule() in >>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>> >>> again: >>> /* call the scheduler */ >>> if ( waitFlags & INTERRUPTIBLE ) >>> schedule(); >>> else >>> io_schedule(); >>> >>> Seems to actually do what I'm after but generally bad things happen >>> when I start pretending I'm a kernel developer. >>> >>> Any thoughts? If I open an RFE would this be something that's >>> relatively easy to implement (not asking for a commitment *to* >>> implement it, just that I'm not asking for something seemingly simple >>> that's actually fairly hard to implement)?
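(To make the efficiency calculation above concrete -- the exact accounting will vary by site -- one common form is: efficiency = cpu_seconds_used / (cores_requested * walltime_seconds). A 16-core job that runs for 3600 s but accumulates only 5760 CPU-seconds scores 10%, and without an iowait signal that looks identical to a job that was simply idle.)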
>>> >>> -Aaron >>> >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From bbanister at jumptrading.com Mon Aug 29 19:06:36 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Mon, 29 Aug 2016 18:06:36 +0000 Subject: [gpfsug-discuss] iowait? In-Reply-To: <7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> <7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> Try this: mmchconfig ioHistorySize=1024 # Or however big you want! Cheers, -Bryan -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister Sent: Monday, August 29, 2016 1:05 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] iowait? That's an interesting idea. I took a look at mmdig --iohist on a busy node it doesn't seem to capture more than literally 1 second of history. Is there a better way to grab the data or have gpfs capture more of it? Just to give some more context, as part of our monthly reporting requirements we calculate job efficiency by comparing the number of cpu cores requested by a given job with the cpu % utilization during that job's time window. Currently a job that's doing a sleep 9000 would show up the same as a job blocked on I/O. Having GPFS wait time included in iowait would allow us to easily make this distinction. -Aaron On 8/29/16 1:56 PM, Bryan Banister wrote: > There is the iohist data that may have what you're looking for, -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org > [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron > Knister > Sent: Monday, August 29, 2016 12:54 PM > To: gpfsug-discuss at spectrumscale.org > Subject: Re: [gpfsug-discuss] iowait? > > Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. 
That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. > > On 8/29/16 1:50 PM, Alex Chekholko wrote: >> Any reason you can't just use iostat or collectl or any of a number >> of other standards tools to look at disk utilization? >> >> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>> Hi Everyone, >>> >>> Would it be easy to have GPFS report iowait values in linux? This >>> would be a huge help for us in determining whether a node's low >>> utilization is due to some issue with the code running on it or if >>> it's blocked on I/O, especially in a historical context. >>> >>> I naively tried on a test system changing schedule() in >>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>> >>> again: >>> /* call the scheduler */ >>> if ( waitFlags & INTERRUPTIBLE ) >>> schedule(); >>> else >>> io_schedule(); >>> >>> Seems to actually do what I'm after but generally bad things happen >>> when I start pretending I'm a kernel developer. >>> >>> Any thoughts? If I open an RFE would this be something that's >>> relatively easy to implement (not asking for a commitment *to* >>> implement it, just that I'm not asking for something seemingly >>> simple that's actually fairly hard to implement)? >>> >>> -Aaron >>> >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight > Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. 
The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. From aaron.s.knister at nasa.gov Mon Aug 29 19:09:36 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 29 Aug 2016 14:09:36 -0400 Subject: [gpfsug-discuss] iowait? In-Reply-To: <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> <7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: <5f563924-61bb-9623-aa84-02d97bd8f379@nasa.gov> Nice! Thanks Bryan. I wonder what the implications are of setting it to something high enough that we could capture data every 10s. I figure if 512 events only takes me to 1 second I would need to log in the realm of 10k to capture every 10 seconds and account for spikes in I/O. -Aaron On 8/29/16 2:06 PM, Bryan Banister wrote: > Try this: > > mmchconfig ioHistorySize=1024 # Or however big you want! > > Cheers, > -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister > Sent: Monday, August 29, 2016 1:05 PM > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] iowait? > > That's an interesting idea. I took a look at mmdig --iohist on a busy node it doesn't seem to capture more than literally 1 second of history. > Is there a better way to grab the data or have gpfs capture more of it? > > Just to give some more context, as part of our monthly reporting requirements we calculate job efficiency by comparing the number of cpu cores requested by a given job with the cpu % utilization during that job's time window. Currently a job that's doing a sleep 9000 would show up the same as a job blocked on I/O. Having GPFS wait time included in iowait would allow us to easily make this distinction. > > -Aaron > > On 8/29/16 1:56 PM, Bryan Banister wrote: >> There is the iohist data that may have what you're looking for, -Bryan >> >> -----Original Message----- >> From: gpfsug-discuss-bounces at spectrumscale.org >> [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron >> Knister >> Sent: Monday, August 29, 2016 12:54 PM >> To: gpfsug-discuss at spectrumscale.org >> Subject: Re: [gpfsug-discuss] iowait? >> >> Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. >> >> On 8/29/16 1:50 PM, Alex Chekholko wrote: >>> Any reason you can't just use iostat or collectl or any of a number >>> of other standards tools to look at disk utilization? >>> >>> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>>> Hi Everyone, >>>> >>>> Would it be easy to have GPFS report iowait values in linux? 
This >>>> would be a huge help for us in determining whether a node's low >>>> utilization is due to some issue with the code running on it or if >>>> it's blocked on I/O, especially in a historical context. >>>> >>>> I naively tried on a test system changing schedule() in >>>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>>> >>>> again: >>>> /* call the scheduler */ >>>> if ( waitFlags & INTERRUPTIBLE ) >>>> schedule(); >>>> else >>>> io_schedule(); >>>> >>>> Seems to actually do what I'm after but generally bad things happen >>>> when I start pretending I'm a kernel developer. >>>> >>>> Any thoughts? If I open an RFE would this be something that's >>>> relatively easy to implement (not asking for a commitment *to* >>>> implement it, just that I'm not asking for something seemingly >>>> simple that's actually fairly hard to implement)? >>>> >>>> -Aaron >>>> >>> >> >> -- >> Aaron Knister >> NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight >> Center >> (301) 286-2776 >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> ________________________________ >> >> Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. 
> _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From bbanister at jumptrading.com Mon Aug 29 19:11:05 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Mon, 29 Aug 2016 18:11:05 +0000 Subject: [gpfsug-discuss] iowait? In-Reply-To: <5f563924-61bb-9623-aa84-02d97bd8f379@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> <7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> <5f563924-61bb-9623-aa84-02d97bd8f379@nasa.gov> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB063147A9@CHI-EXCHANGEW1.w2k.jumptrading.com> That's a good question, but I don't expect it should cause you much of a problem. Of course testing and trying to measure any impact would be wise, -Bryan -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister Sent: Monday, August 29, 2016 1:10 PM To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] iowait? Nice! Thanks Bryan. I wonder what the implications are of setting it to something high enough that we could capture data every 10s. I figure if 512 events only takes me to 1 second I would need to log in the realm of 10k to capture every 10 seconds and account for spikes in I/O. -Aaron On 8/29/16 2:06 PM, Bryan Banister wrote: > Try this: > > mmchconfig ioHistorySize=1024 # Or however big you want! > > Cheers, > -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org > [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron > Knister > Sent: Monday, August 29, 2016 1:05 PM > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] iowait? > > That's an interesting idea. I took a look at mmdig --iohist on a busy node it doesn't seem to capture more than literally 1 second of history. > Is there a better way to grab the data or have gpfs capture more of it? > > Just to give some more context, as part of our monthly reporting requirements we calculate job efficiency by comparing the number of cpu cores requested by a given job with the cpu % utilization during that job's time window. Currently a job that's doing a sleep 9000 would show up the same as a job blocked on I/O. Having GPFS wait time included in iowait would allow us to easily make this distinction. > > -Aaron > > On 8/29/16 1:56 PM, Bryan Banister wrote: >> There is the iohist data that may have what you're looking for, >> -Bryan >> >> -----Original Message----- >> From: gpfsug-discuss-bounces at spectrumscale.org >> [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron >> Knister >> Sent: Monday, August 29, 2016 12:54 PM >> To: gpfsug-discuss at spectrumscale.org >> Subject: Re: [gpfsug-discuss] iowait? >> >> Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. 
Having GPFS report iowait on client nodes would give us this insight. >> >> On 8/29/16 1:50 PM, Alex Chekholko wrote: >>> Any reason you can't just use iostat or collectl or any of a number >>> of other standards tools to look at disk utilization? >>> >>> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>>> Hi Everyone, >>>> >>>> Would it be easy to have GPFS report iowait values in linux? This >>>> would be a huge help for us in determining whether a node's low >>>> utilization is due to some issue with the code running on it or if >>>> it's blocked on I/O, especially in a historical context. >>>> >>>> I naively tried on a test system changing schedule() in >>>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>>> >>>> again: >>>> /* call the scheduler */ >>>> if ( waitFlags & INTERRUPTIBLE ) >>>> schedule(); >>>> else >>>> io_schedule(); >>>> >>>> Seems to actually do what I'm after but generally bad things happen >>>> when I start pretending I'm a kernel developer. >>>> >>>> Any thoughts? If I open an RFE would this be something that's >>>> relatively easy to implement (not asking for a commitment *to* >>>> implement it, just that I'm not asking for something seemingly >>>> simple that's actually fairly hard to implement)? >>>> >>>> -Aaron >>>> >>> >> >> -- >> Aaron Knister >> NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight >> Center >> (301) 286-2776 >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> ________________________________ >> >> Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight > Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. 
This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. From sfadden at us.ibm.com Mon Aug 29 20:33:14 2016 From: sfadden at us.ibm.com (Scott Fadden) Date: Mon, 29 Aug 2016 12:33:14 -0700 Subject: [gpfsug-discuss] iowait? In-Reply-To: <5f563924-61bb-9623-aa84-02d97bd8f379@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov><7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu><5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov><21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com><7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov><21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> <5f563924-61bb-9623-aa84-02d97bd8f379@nasa.gov> Message-ID: There is a known performance issue that can possibly cause longer than expected network time-outs if you are running iohist too often. So be careful it is best to collect it as a sample, instead of all of the time. Scott Fadden Spectrum Scale - Technical Marketing Phone: (503) 880-5833 sfadden at us.ibm.com http://www.ibm.com/systems/storage/spectrum/scale From: Aaron Knister To: Date: 08/29/2016 11:09 AM Subject: Re: [gpfsug-discuss] iowait? Sent by: gpfsug-discuss-bounces at spectrumscale.org Nice! Thanks Bryan. I wonder what the implications are of setting it to something high enough that we could capture data every 10s. I figure if 512 events only takes me to 1 second I would need to log in the realm of 10k to capture every 10 seconds and account for spikes in I/O. -Aaron On 8/29/16 2:06 PM, Bryan Banister wrote: > Try this: > > mmchconfig ioHistorySize=1024 # Or however big you want! > > Cheers, > -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [ mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister > Sent: Monday, August 29, 2016 1:05 PM > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] iowait? > > That's an interesting idea. I took a look at mmdig --iohist on a busy node it doesn't seem to capture more than literally 1 second of history. > Is there a better way to grab the data or have gpfs capture more of it? 
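Taking Scott's advice to sample rather than poll, one rough way to get longer coverage is to raise the history depth and collect on a coarse interval. A sketch only -- the interval, history size and log path are arbitrary, and the iohist locking caveat Yuri raises later in this thread still applies:

mmchconfig ioHistorySize=16384
while sleep 60; do
    date
    /usr/lpp/mmfs/bin/mmdiag --iohist
done >> /var/tmp/iohist.$(hostname).log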
> > Just to give some more context, as part of our monthly reporting requirements we calculate job efficiency by comparing the number of cpu cores requested by a given job with the cpu % utilization during that job's time window. Currently a job that's doing a sleep 9000 would show up the same as a job blocked on I/O. Having GPFS wait time included in iowait would allow us to easily make this distinction. > > -Aaron > > On 8/29/16 1:56 PM, Bryan Banister wrote: >> There is the iohist data that may have what you're looking for, -Bryan >> >> -----Original Message----- >> From: gpfsug-discuss-bounces at spectrumscale.org >> [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron >> Knister >> Sent: Monday, August 29, 2016 12:54 PM >> To: gpfsug-discuss at spectrumscale.org >> Subject: Re: [gpfsug-discuss] iowait? >> >> Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. >> >> On 8/29/16 1:50 PM, Alex Chekholko wrote: >>> Any reason you can't just use iostat or collectl or any of a number >>> of other standards tools to look at disk utilization? >>> >>> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>>> Hi Everyone, >>>> >>>> Would it be easy to have GPFS report iowait values in linux? This >>>> would be a huge help for us in determining whether a node's low >>>> utilization is due to some issue with the code running on it or if >>>> it's blocked on I/O, especially in a historical context. >>>> >>>> I naively tried on a test system changing schedule() in >>>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>>> >>>> again: >>>> /* call the scheduler */ >>>> if ( waitFlags & INTERRUPTIBLE ) >>>> schedule(); >>>> else >>>> io_schedule(); >>>> >>>> Seems to actually do what I'm after but generally bad things happen >>>> when I start pretending I'm a kernel developer. >>>> >>>> Any thoughts? If I open an RFE would this be something that's >>>> relatively easy to implement (not asking for a commitment *to* >>>> implement it, just that I'm not asking for something seemingly >>>> simple that's actually fairly hard to implement)? >>>> >>>> -Aaron >>>> >>> >> >> -- >> Aaron Knister >> NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight >> Center >> (301) 286-2776 >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> ________________________________ >> >> Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. 
This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From S.J.Thompson at bham.ac.uk Mon Aug 29 20:37:13 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Mon, 29 Aug 2016 19:37:13 +0000 Subject: [gpfsug-discuss] CES mmsmb options In-Reply-To: References: Message-ID: Hi Richard, You can of course change any of the other options with the "net conf" (/usr/lpp/mmfs/bin/net conf) command. As its just stored in the Samba registry. Of course whether or not you end up with a supported configuration is a different matter... When we first rolled out CES/SMB, there were a number of issues with setting it up in the way we needed for our environment (AD for auth, LDAP for identity) which at the time wasn't available through the config tools. I believe this has now changed though I haven't gone back and "reset" our configs. Simon ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sobey, Richard A [r.sobey at imperial.ac.uk] Sent: 22 August 2016 14:28 To: 'gpfsug-discuss at spectrumscale.org' Subject: [gpfsug-discuss] CES mmsmb options Related to my previous question in so far as it?s to do with CES, what?s this all about: [root at ces]# mmsmb config change --key-info supported Supported smb options with allowed values: gpfs:dfreequota = yes, no restrict anonymous = 0, 2 server string = any mmsmb config list shows many more options. Are they static? for example log size / location / dmapi support? 
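As a concrete illustration of the registry route Simon describes above (the option and value are examples only, and anything changed outside the mmsmb tooling may well leave you in unsupported territory):

/usr/lpp/mmfs/bin/net conf list                              # dump the full Samba registry configuration
/usr/lpp/mmfs/bin/net conf getparm global 'log level'
/usr/lpp/mmfs/bin/net conf setparm global 'log level' '1'
mmsmb config list                                            # cross-check what the CES tooling now reports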
I?m surely missing something obvious. It?s SS 4.2.0 btw. Thanks Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From usa-principal at gpfsug.org Mon Aug 29 21:13:51 2016 From: usa-principal at gpfsug.org (Spectrum Scale Users Group - USA Principal Kristy Kallback-Rose) Date: Mon, 29 Aug 2016 16:13:51 -0400 Subject: [gpfsug-discuss] SC16 Hold the Date - Spectrum Scale (GPFS) Users Group Event Message-ID: <648FFF79-343D-447E-9CC5-4E0199C29572@gpfsug.org> Hello, I know many of you may be planning your SC16 schedule already. We wanted to give you a heads up that a Spectrum Scale (GPFS) Users Group event is being planned. The event will be much like last year?s event with a combination of technical updates and user experiences and thus far is loosely planned for: Sunday (11/13) ~12p - ~5 PM with a social hour after the meeting. We hope to see you there. More details as planning progresses. Best, Kristy & Bob From S.J.Thompson at bham.ac.uk Mon Aug 29 21:27:28 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Mon, 29 Aug 2016 20:27:28 +0000 Subject: [gpfsug-discuss] SC16 Hold the Date - Spectrum Scale (GPFS) Users Group Event In-Reply-To: <648FFF79-343D-447E-9CC5-4E0199C29572@gpfsug.org> References: <648FFF79-343D-447E-9CC5-4E0199C29572@gpfsug.org> Message-ID: You may also be interested in a panel session on the Friday of SC16: http://sc16.supercomputing.org/presentation/?id=pan120&sess=sess185 This isn't a user group event, but part of the technical programme for SC16, though I'm sure you will recognise some of the names from the storage community. Moderator: Simon Thompson (me) Panel: Sven Oehme (IBM Research) James Coomer (DDN) Sage Weil (RedHat/CEPH) Colin Morey (Hartree/STFC) Pam Gilman (NCAR) Martin Gasthuber (DESY) Friday 8:30 - 10:00 Simon From volobuev at us.ibm.com Mon Aug 29 21:31:17 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Mon, 29 Aug 2016 13:31:17 -0700 Subject: [gpfsug-discuss] iowait? In-Reply-To: <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov><7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu><5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov><21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com><7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: I would advise caution on using "mmdiag --iohist" heavily. In more recent code streams (V4.1, V4.2) there's a problem with internal locking that could, under certain conditions could lead to the symptoms that look very similar to sporadic network blockage. Basically, if "mmdiag --iohist" gets blocked for long periods of time (e.g. due to local disk/NFS performance issues), this may end up blocking an mmfsd receiver thread, delaying RPC processing. The problem was discovered fairly recently, and the fix hasn't made it out to all service streams yet. More generally, IO history is a valuable tool for troubleshooting disk IO performance issues, but the tool doesn't have the right semantics for regular, systemic IO performance sampling and monitoring. The query operation is too expensive, the coverage is subject to load, and the output is somewhat unstructured. With some effort, one can still build some form of a roll-your-own monitoring implement, but this is certainly not an optimal way of approaching the problem. 
The data should be available in a structured form, through a channel that supports light-weight, flexible querying that doesn't impact mainline IO processing. In Spectrum Scale, this type of data is fed from mmfsd to Zimon, via an mmpmon interface, and end users can then query Zimon for raw or partially processed data. Where it comes to high-volume stats, retaining raw data at its full resolution is only practical for relatively short periods of time (seconds, or perhaps a small number of minutes), and some form of aggregation is necessary for covering longer periods of time (hours to days). In the current versions of the product, there's a very similar type of data available this way: RPC stats. There are plans to make IO history data available in a similar fashion. The entire approach may need to be re-calibrated, however. Making RPC stats available doesn't appear to have generated a surge of user interest. This is probably because the data is too complex for casual processing, and while without doubt a lot of very valuable insight can be gained by analyzing RPC stats, the actual effort required to do so is too much for most users. That is, we need to provide some tools for raw data analytics. Largely the same argument applies to IO stats. In fact, on an NSD client IO stats are actually a subset of RPC stats. With some effort, one can perform a comprehensive analysis of NSD client IO stats by analyzing NSD client-to-server RPC traffic. One can certainly argue that the effort required is a bit much though. Getting back to the original question: would the proposed cxiWaitEventWait () change work? It'll likely result in nr_iowait being incremented every time a thread in GPFS code performs an uninterruptible wait. This could be an act of performing an actual IO request, or something else, e.g. waiting for a lock. Those may be the desirable semantics in some scenarios, but I wouldn't agree that it's the right behavior for any uninterruptible wait. io_schedule() is intended for use for block device IO waits, so using it this way is not in line with the code intent, which is never a good idea. Besides, relative to schedule(), io_schedule() has some overhead that could have performance implications of an uncertain nature. yuri From: Bryan Banister To: gpfsug main discussion list , Date: 08/29/2016 11:06 AM Subject: Re: [gpfsug-discuss] iowait? Sent by: gpfsug-discuss-bounces at spectrumscale.org Try this: mmchconfig ioHistorySize=1024 # Or however big you want! Cheers, -Bryan -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [ mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister Sent: Monday, August 29, 2016 1:05 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] iowait? That's an interesting idea. I took a look at mmdig --iohist on a busy node it doesn't seem to capture more than literally 1 second of history. Is there a better way to grab the data or have gpfs capture more of it? Just to give some more context, as part of our monthly reporting requirements we calculate job efficiency by comparing the number of cpu cores requested by a given job with the cpu % utilization during that job's time window. Currently a job that's doing a sleep 9000 would show up the same as a job blocked on I/O. Having GPFS wait time included in iowait would allow us to easily make this distinction. 
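As a small illustration of the mmpmon channel Yuri refers to above (a sketch only -- output format and preferred flags can vary by release), per-filesystem I/O counters can already be pulled in a parseable form on each node:

echo fs_io_s > /tmp/mmpmon.cmd
/usr/lpp/mmfs/bin/mmpmon -p -i /tmp/mmpmon.cmd     # -p prints prefixed, machine-readable records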
-Aaron On 8/29/16 1:56 PM, Bryan Banister wrote: > There is the iohist data that may have what you're looking for, -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org > [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron > Knister > Sent: Monday, August 29, 2016 12:54 PM > To: gpfsug-discuss at spectrumscale.org > Subject: Re: [gpfsug-discuss] iowait? > > Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. > > On 8/29/16 1:50 PM, Alex Chekholko wrote: >> Any reason you can't just use iostat or collectl or any of a number >> of other standards tools to look at disk utilization? >> >> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>> Hi Everyone, >>> >>> Would it be easy to have GPFS report iowait values in linux? This >>> would be a huge help for us in determining whether a node's low >>> utilization is due to some issue with the code running on it or if >>> it's blocked on I/O, especially in a historical context. >>> >>> I naively tried on a test system changing schedule() in >>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>> >>> again: >>> /* call the scheduler */ >>> if ( waitFlags & INTERRUPTIBLE ) >>> schedule(); >>> else >>> io_schedule(); >>> >>> Seems to actually do what I'm after but generally bad things happen >>> when I start pretending I'm a kernel developer. >>> >>> Any thoughts? If I open an RFE would this be something that's >>> relatively easy to implement (not asking for a commitment *to* >>> implement it, just that I'm not asking for something seemingly >>> simple that's actually fairly hard to implement)? >>> >>> -Aaron >>> >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight > Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. 
> _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From aaron.s.knister at nasa.gov Mon Aug 29 23:58:34 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 29 Aug 2016 18:58:34 -0400 Subject: [gpfsug-discuss] iowait? In-Reply-To: References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov> <7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu> <5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com> <7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov> <21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: <8ec95af4-4d30-a904-4ba2-cf253460754a@nasa.gov> Thanks Yuri! I thought calling io_schedule was the right thing to do because the nfs client in the kernel did this directly until fairly recently. Now it calls wait_on_bit_io which I believe ultimately calls io_schedule. Do you see a more targeted approach for having GPFS register IO wait as something that's feasible? (e.g. not registering iowait for locks, as you suggested, but doing so for file/directory operations such as read/write/readdir?) -Aaron On 8/29/16 4:31 PM, Yuri L Volobuev wrote: > I would advise caution on using "mmdiag --iohist" heavily. In more > recent code streams (V4.1, V4.2) there's a problem with internal locking > that could, under certain conditions could lead to the symptoms that > look very similar to sporadic network blockage. Basically, if "mmdiag > --iohist" gets blocked for long periods of time (e.g. due to local > disk/NFS performance issues), this may end up blocking an mmfsd receiver > thread, delaying RPC processing. The problem was discovered fairly > recently, and the fix hasn't made it out to all service streams yet. 
> > More generally, IO history is a valuable tool for troubleshooting disk > IO performance issues, but the tool doesn't have the right semantics for > regular, systemic IO performance sampling and monitoring. The query > operation is too expensive, the coverage is subject to load, and the > output is somewhat unstructured. With some effort, one can still build > some form of a roll-your-own monitoring implement, but this is certainly > not an optimal way of approaching the problem. The data should be > available in a structured form, through a channel that supports > light-weight, flexible querying that doesn't impact mainline IO > processing. In Spectrum Scale, this type of data is fed from mmfsd to > Zimon, via an mmpmon interface, and end users can then query Zimon for > raw or partially processed data. Where it comes to high-volume stats, > retaining raw data at its full resolution is only practical for > relatively short periods of time (seconds, or perhaps a small number of > minutes), and some form of aggregation is necessary for covering longer > periods of time (hours to days). In the current versions of the product, > there's a very similar type of data available this way: RPC stats. There > are plans to make IO history data available in a similar fashion. The > entire approach may need to be re-calibrated, however. Making RPC stats > available doesn't appear to have generated a surge of user interest. > This is probably because the data is too complex for casual processing, > and while without doubt a lot of very valuable insight can be gained by > analyzing RPC stats, the actual effort required to do so is too much for > most users. That is, we need to provide some tools for raw data > analytics. Largely the same argument applies to IO stats. In fact, on an > NSD client IO stats are actually a subset of RPC stats. With some > effort, one can perform a comprehensive analysis of NSD client IO stats > by analyzing NSD client-to-server RPC traffic. One can certainly argue > that the effort required is a bit much though. > > Getting back to the original question: would the proposed > cxiWaitEventWait() change work? It'll likely result in nr_iowait being > incremented every time a thread in GPFS code performs an uninterruptible > wait. This could be an act of performing an actual IO request, or > something else, e.g. waiting for a lock. Those may be the desirable > semantics in some scenarios, but I wouldn't agree that it's the right > behavior for any uninterruptible wait. io_schedule() is intended for use > for block device IO waits, so using it this way is not in line with the > code intent, which is never a good idea. Besides, relative to > schedule(), io_schedule() has some overhead that could have performance > implications of an uncertain nature. > > yuri > > Inactive hide details for Bryan Banister ---08/29/2016 11:06:59 AM---Try > this: mmchconfig ioHistorySize=1024 # Or however big yBryan Banister > ---08/29/2016 11:06:59 AM---Try this: mmchconfig ioHistorySize=1024 # Or > however big you want! > > From: Bryan Banister > To: gpfsug main discussion list , > Date: 08/29/2016 11:06 AM > Subject: Re: [gpfsug-discuss] iowait? > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------------------------------------------------ > > > > Try this: > > mmchconfig ioHistorySize=1024 # Or however big you want! 
> > Cheers, > -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org > [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister > Sent: Monday, August 29, 2016 1:05 PM > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] iowait? > > That's an interesting idea. I took a look at mmdig --iohist on a busy > node it doesn't seem to capture more than literally 1 second of history. > Is there a better way to grab the data or have gpfs capture more of it? > > Just to give some more context, as part of our monthly reporting > requirements we calculate job efficiency by comparing the number of cpu > cores requested by a given job with the cpu % utilization during that > job's time window. Currently a job that's doing a sleep 9000 would show > up the same as a job blocked on I/O. Having GPFS wait time included in > iowait would allow us to easily make this distinction. > > -Aaron > > On 8/29/16 1:56 PM, Bryan Banister wrote: >> There is the iohist data that may have what you're looking for, -Bryan >> >> -----Original Message----- >> From: gpfsug-discuss-bounces at spectrumscale.org >> [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron >> Knister >> Sent: Monday, August 29, 2016 12:54 PM >> To: gpfsug-discuss at spectrumscale.org >> Subject: Re: [gpfsug-discuss] iowait? >> >> Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. >> >> On 8/29/16 1:50 PM, Alex Chekholko wrote: >>> Any reason you can't just use iostat or collectl or any of a number >>> of other standards tools to look at disk utilization? >>> >>> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>>> Hi Everyone, >>>> >>>> Would it be easy to have GPFS report iowait values in linux? This >>>> would be a huge help for us in determining whether a node's low >>>> utilization is due to some issue with the code running on it or if >>>> it's blocked on I/O, especially in a historical context. >>>> >>>> I naively tried on a test system changing schedule() in >>>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>>> >>>> again: >>>> /* call the scheduler */ >>>> if ( waitFlags & INTERRUPTIBLE ) >>>> schedule(); >>>> else >>>> io_schedule(); >>>> >>>> Seems to actually do what I'm after but generally bad things happen >>>> when I start pretending I'm a kernel developer. >>>> >>>> Any thoughts? If I open an RFE would this be something that's >>>> relatively easy to implement (not asking for a commitment *to* >>>> implement it, just that I'm not asking for something seemingly >>>> simple that's actually fairly hard to implement)? >>>> >>>> -Aaron >>>> >>> >> >> -- >> Aaron Knister >> NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight >> Center >> (301) 286-2776 >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> ________________________________ >> >> Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. 
If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) > only and may contain proprietary, confidential or privileged > information. If you are not the intended recipient, you are hereby > notified that any review, dissemination or copying of this email is > strictly prohibited, and to please notify the sender immediately and > destroy this email and any attachments. Email transmission cannot be > guaranteed to be secure or error-free. The Company, therefore, does not > make any guarantees as to the completeness or accuracy of this email or > any attachments. This email is for informational purposes only and does > not constitute a recommendation, offer, request or solicitation of any > kind to buy, sell, subscribe, redeem or perform any type of transaction > of a financial product. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From volobuev at us.ibm.com Tue Aug 30 06:09:21 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Mon, 29 Aug 2016 22:09:21 -0700 Subject: [gpfsug-discuss] iowait? In-Reply-To: <8ec95af4-4d30-a904-4ba2-cf253460754a@nasa.gov> References: <546699b0-b939-9764-b047-d58007ba4a74@nasa.gov><7ec8c8b4-5f89-46fe-585d-c69964342a58@stanford.edu><5078d80d-8a15-892c-0db6-006f0350e0cc@nasa.gov><21BC488F0AEA2245B2C3E83FC0B33DBB063146F7@CHI-EXCHANGEW1.w2k.jumptrading.com><7dc7b4d8-502c-c691-5516-955fd6562e56@nasa.gov><21BC488F0AEA2245B2C3E83FC0B33DBB0631475C@CHI-EXCHANGEW1.w2k.jumptrading.com> <8ec95af4-4d30-a904-4ba2-cf253460754a@nasa.gov> Message-ID: I don't see a simple fix that can be implemented by tweaking a general-purpose low-level synchronization primitive. It should be possible to integrate GPFS better into the Linux IO accounting infrastructure, but that would require some investigation a likely a non-trivial amount of work to do right. yuri From: Aaron Knister To: , Date: 08/29/2016 03:59 PM Subject: Re: [gpfsug-discuss] iowait? Sent by: gpfsug-discuss-bounces at spectrumscale.org Thanks Yuri! 
I thought calling io_schedule was the right thing to do because the nfs client in the kernel did this directly until fairly recently. Now it calls wait_on_bit_io which I believe ultimately calls io_schedule. Do you see a more targeted approach for having GPFS register IO wait as something that's feasible? (e.g. not registering iowait for locks, as you suggested, but doing so for file/directory operations such as read/write/readdir?) -Aaron On 8/29/16 4:31 PM, Yuri L Volobuev wrote: > I would advise caution on using "mmdiag --iohist" heavily. In more > recent code streams (V4.1, V4.2) there's a problem with internal locking > that could, under certain conditions could lead to the symptoms that > look very similar to sporadic network blockage. Basically, if "mmdiag > --iohist" gets blocked for long periods of time (e.g. due to local > disk/NFS performance issues), this may end up blocking an mmfsd receiver > thread, delaying RPC processing. The problem was discovered fairly > recently, and the fix hasn't made it out to all service streams yet. > > More generally, IO history is a valuable tool for troubleshooting disk > IO performance issues, but the tool doesn't have the right semantics for > regular, systemic IO performance sampling and monitoring. The query > operation is too expensive, the coverage is subject to load, and the > output is somewhat unstructured. With some effort, one can still build > some form of a roll-your-own monitoring implement, but this is certainly > not an optimal way of approaching the problem. The data should be > available in a structured form, through a channel that supports > light-weight, flexible querying that doesn't impact mainline IO > processing. In Spectrum Scale, this type of data is fed from mmfsd to > Zimon, via an mmpmon interface, and end users can then query Zimon for > raw or partially processed data. Where it comes to high-volume stats, > retaining raw data at its full resolution is only practical for > relatively short periods of time (seconds, or perhaps a small number of > minutes), and some form of aggregation is necessary for covering longer > periods of time (hours to days). In the current versions of the product, > there's a very similar type of data available this way: RPC stats. There > are plans to make IO history data available in a similar fashion. The > entire approach may need to be re-calibrated, however. Making RPC stats > available doesn't appear to have generated a surge of user interest. > This is probably because the data is too complex for casual processing, > and while without doubt a lot of very valuable insight can be gained by > analyzing RPC stats, the actual effort required to do so is too much for > most users. That is, we need to provide some tools for raw data > analytics. Largely the same argument applies to IO stats. In fact, on an > NSD client IO stats are actually a subset of RPC stats. With some > effort, one can perform a comprehensive analysis of NSD client IO stats > by analyzing NSD client-to-server RPC traffic. One can certainly argue > that the effort required is a bit much though. > > Getting back to the original question: would the proposed > cxiWaitEventWait() change work? It'll likely result in nr_iowait being > incremented every time a thread in GPFS code performs an uninterruptible > wait. This could be an act of performing an actual IO request, or > something else, e.g. waiting for a lock. 
Those may be the desirable > semantics in some scenarios, but I wouldn't agree that it's the right > behavior for any uninterruptible wait. io_schedule() is intended for use > for block device IO waits, so using it this way is not in line with the > code intent, which is never a good idea. Besides, relative to > schedule(), io_schedule() has some overhead that could have performance > implications of an uncertain nature. > > yuri > > Inactive hide details for Bryan Banister ---08/29/2016 11:06:59 AM---Try > this: mmchconfig ioHistorySize=1024 # Or however big yBryan Banister > ---08/29/2016 11:06:59 AM---Try this: mmchconfig ioHistorySize=1024 # Or > however big you want! > > From: Bryan Banister > To: gpfsug main discussion list , > Date: 08/29/2016 11:06 AM > Subject: Re: [gpfsug-discuss] iowait? > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------------------------------------------------ > > > > Try this: > > mmchconfig ioHistorySize=1024 # Or however big you want! > > Cheers, > -Bryan > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org > [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister > Sent: Monday, August 29, 2016 1:05 PM > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] iowait? > > That's an interesting idea. I took a look at mmdig --iohist on a busy > node it doesn't seem to capture more than literally 1 second of history. > Is there a better way to grab the data or have gpfs capture more of it? > > Just to give some more context, as part of our monthly reporting > requirements we calculate job efficiency by comparing the number of cpu > cores requested by a given job with the cpu % utilization during that > job's time window. Currently a job that's doing a sleep 9000 would show > up the same as a job blocked on I/O. Having GPFS wait time included in > iowait would allow us to easily make this distinction. > > -Aaron > > On 8/29/16 1:56 PM, Bryan Banister wrote: >> There is the iohist data that may have what you're looking for, -Bryan >> >> -----Original Message----- >> From: gpfsug-discuss-bounces at spectrumscale.org >> [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron >> Knister >> Sent: Monday, August 29, 2016 12:54 PM >> To: gpfsug-discuss at spectrumscale.org >> Subject: Re: [gpfsug-discuss] iowait? >> >> Sure, we can and we do use both iostat/sar and collectl to collect disk utilization on our nsd servers. That doesn't give us insight, though, into any individual client node of which we've got 3500. We do log mmpmon data from each node but that doesn't give us any insight into how much time is being spent waiting on I/O. Having GPFS report iowait on client nodes would give us this insight. >> >> On 8/29/16 1:50 PM, Alex Chekholko wrote: >>> Any reason you can't just use iostat or collectl or any of a number >>> of other standards tools to look at disk utilization? >>> >>> On 08/29/2016 10:33 AM, Aaron Knister wrote: >>>> Hi Everyone, >>>> >>>> Would it be easy to have GPFS report iowait values in linux? This >>>> would be a huge help for us in determining whether a node's low >>>> utilization is due to some issue with the code running on it or if >>>> it's blocked on I/O, especially in a historical context. 
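If GPFS waits were charged to iowait as asked here, the per-node efficiency number could be derived directly from /proc/stat deltas. A rough sketch of the arithmetic (it ignores irq/softirq/steal time, so treat the percentages as approximate):

# first "cpu" line of /proc/stat: user nice system idle iowait ...
read -r _ u1 n1 s1 id1 io1 rest < /proc/stat
sleep 60
read -r _ u2 n2 s2 id2 io2 rest < /proc/stat
tot=$(( (u2+n2+s2+id2+io2) - (u1+n1+s1+id1+io1) ))
echo "busy%:   $(( 100 * ((u2+n2+s2) - (u1+n1+s1)) / tot ))"
echo "iowait%: $(( 100 * (io2 - io1) / tot ))"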
>>>> >>>> I naively tried on a test system changing schedule() in >>>> cxiWaitEventWait() on line ~2832 in gpl-linux/cxiSystem.c to this: >>>> >>>> again: >>>> /* call the scheduler */ >>>> if ( waitFlags & INTERRUPTIBLE ) >>>> schedule(); >>>> else >>>> io_schedule(); >>>> >>>> Seems to actually do what I'm after but generally bad things happen >>>> when I start pretending I'm a kernel developer. >>>> >>>> Any thoughts? If I open an RFE would this be something that's >>>> relatively easy to implement (not asking for a commitment *to* >>>> implement it, just that I'm not asking for something seemingly >>>> simple that's actually fairly hard to implement)? >>>> >>>> -Aaron >>>> >>> >> >> -- >> Aaron Knister >> NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight >> Center >> (301) 286-2776 >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> ________________________________ >> >> Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > ________________________________ > > Note: This email is for the confidential use of the named addressee(s) > only and may contain proprietary, confidential or privileged > information. If you are not the intended recipient, you are hereby > notified that any review, dissemination or copying of this email is > strictly prohibited, and to please notify the sender immediately and > destroy this email and any attachments. Email transmission cannot be > guaranteed to be secure or error-free. The Company, therefore, does not > make any guarantees as to the completeness or accuracy of this email or > any attachments. This email is for informational purposes only and does > not constitute a recommendation, offer, request or solicitation of any > kind to buy, sell, subscribe, redeem or perform any type of transaction > of a financial product. 
> _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From r.sobey at imperial.ac.uk Tue Aug 30 09:34:33 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Tue, 30 Aug 2016 08:34:33 +0000 Subject: [gpfsug-discuss] CES network aliases Message-ID: Hi all, It's Tuesday morning and that means question time :) So from http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_cesnetworkconfig.htm, I've extracted the following: How to use an alias To use an alias address for CES, you need to provide a static IP address that is not already defined as an alias in the /etc/sysconfig/network-scripts directory. Before you enable the node as a CES node, configure the network adapters for each subnet that are represented in the CES address pool: 1. Define a static IP address for the device: 2. /etc/sysconfig/network-scripts/ifcfg-eth0 3. DEVICE=eth1 4. BOOTPROTO=none 5. IPADDR=10.1.1.10 6. NETMASK=255.255.255.0 7. ONBOOT=yes 8. GATEWAY=10.1.1.1 TYPE=Ethernet 1. Ensure that there are no aliases that are defined in the network-scripts directory for this interface: 10.# ls -l /etc/sysconfig/network-scripts/ifcfg-eth1:* ls: /etc/sysconfig/network-scripts/ifcfg-eth1:*: No such file or directory After the node is enabled as a CES node, no further action is required. CES addresses are added as aliases to the already configured adapters. Now, does this mean for every floating (CES) IP address I need a separate ifcfg-ethX on each node? At the moment I simply have an ifcfg-X file representing each physical network adapter, and then the CES IPs defined. I can see IP addresses being added during failover to the primary interface, but now I've read I potentially need to create a separate file. What's the right way to move forward? If I need separate files, I presume the listed IP is a CES IP (not system) and does it also matter what X is in ifcfg-ethX? Many thanks Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From janfrode at tanso.net Tue Aug 30 10:54:31 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Tue, 30 Aug 2016 09:54:31 +0000 Subject: [gpfsug-discuss] CES network aliases In-Reply-To: References: Message-ID: You only need a static address for your ifcfg-ethX on all nodes, and can then have CES manage multiple floating addresses in that subnet. Also, it doesn't matter much what your interfaces are named (ethX, vlanX, bondX, ethX.5), GPFS will just find the interface that covers the floating address in its subnet, and add the alias there. -jf -------------- next part -------------- An HTML attachment was scrubbed... 
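In practice that means each node only needs its one static base interface file; the floating addresses are simply handed to CES, which aliases them onto whichever interface matches their subnet. A hedged sketch with example addresses (substitute your own pool):

mmces address add --ces-ip 10.1.1.100,10.1.1.101
mmces address list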
URL: From r.sobey at imperial.ac.uk Tue Aug 30 11:30:25 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Tue, 30 Aug 2016 10:30:25 +0000 Subject: [gpfsug-discuss] CES network aliases In-Reply-To: References: Message-ID: Ace thanks jf. From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Jan-Frode Myklebust Sent: 30 August 2016 10:55 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] CES network aliases You only need a static address for your ifcfg-ethX on all nodes, and can then have CES manage multiple floating addresses in that subnet. Also, it doesn't matter much what your interfaces are named (ethX, vlanX, bondX, ethX.5), GPFS will just find the interface that covers the floating address in its subnet, and add the alias there. -jf -------------- next part -------------- An HTML attachment was scrubbed... URL: From mimarsh2 at vt.edu Tue Aug 30 15:58:41 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Tue, 30 Aug 2016 10:58:41 -0400 Subject: [gpfsug-discuss] Data Replication Message-ID: All, If I setup a filesystem to have data replication of 2 (2 copies of data), does the data get replicated at the NSD Server or at the client? i.e. Does the client send 2 copies over the network or does the NSD Server get a single copy and then replicate on storage NSDs? I couldn't find a place in the docs that talked about this specific point. Thank you, Brian Marshall -------------- next part -------------- An HTML attachment was scrubbed... URL: From bbanister at jumptrading.com Tue Aug 30 16:03:38 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Tue, 30 Aug 2016 15:03:38 +0000 Subject: [gpfsug-discuss] Data Replication In-Reply-To: References: Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB063161EE@CHI-EXCHANGEW1.w2k.jumptrading.com> The NSD Client handles the replication and will, as you stated, write one copy to one NSD (using the primary server for this NSD) and one to a different NSD in a different GPFS failure group (using quite likely, but not necessarily, a different NSD server that is the primary server for this alternate NSD). Cheers, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Brian Marshall Sent: Tuesday, August 30, 2016 9:59 AM To: gpfsug main discussion list Subject: [gpfsug-discuss] Data Replication All, If I setup a filesystem to have data replication of 2 (2 copies of data), does the data get replicated at the NSD Server or at the client? i.e. Does the client send 2 copies over the network or does the NSD Server get a single copy and then replicate on storage NSDs? I couldn't find a place in the docs that talked about this specific point. Thank you, Brian Marshall ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. 
This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Tue Aug 30 17:16:37 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Tue, 30 Aug 2016 12:16:37 -0400 Subject: [gpfsug-discuss] gpfs native raid Message-ID: Does anyone know if/when we might see gpfs native raid opened up for the masses on non-IBM hardware? It's hard to answer the question of "why can't GPFS do this? Lustre can" in regards to Lustre's integration with ZFS and support for RAID on commodity hardware. -Aaron -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From bbanister at jumptrading.com Tue Aug 30 17:26:38 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Tue, 30 Aug 2016 16:26:38 +0000 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB06316445@CHI-EXCHANGEW1.w2k.jumptrading.com> I believe that Doug is going to provide more details at the NDA session at Edge... see attached, -B -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Aaron Knister Sent: Tuesday, August 30, 2016 11:17 AM To: gpfsug main discussion list Subject: [gpfsug-discuss] gpfs native raid Does anyone know if/when we might see gpfs native raid opened up for the masses on non-IBM hardware? It's hard to answer the question of "why can't GPFS do this? Lustre can" in regards to Lustre's integration with ZFS and support for RAID on commodity hardware. -Aaron -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. -------------- next part -------------- An embedded message was scrubbed... From: Douglas O'flaherty Subject: [gpfsug-discuss] Edge Attendees Date: Mon, 29 Aug 2016 05:34:03 +0000 Size: 9615 URL: From cdmaestas at us.ibm.com Tue Aug 30 17:47:18 2016 From: cdmaestas at us.ibm.com (Christopher Maestas) Date: Tue, 30 Aug 2016 16:47:18 +0000 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: Message-ID: Interestingly enough, Spectrum Scale can run on zvols. 
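The general shape, for anyone who wants to experiment, might look like the sketch below -- pool and volume names are made up, and the LANL slides linked just after this have the real details:

# carve a zvol out of a raidz2 pool and hand it to GPFS as an NSD
zpool create tank raidz2 sdb sdc sdd sde sdf sdg
zfs create -V 10T -o volblocksize=128k tank/nsd01
zfs set sync=always tank/nsd01   # keeps GPFS writes stable, at a throughput cost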
Check out: http://files.gpfsug.org/presentations/2016/anl-june/LANL_GPFS_ZFS.pdf -cdm On Aug 30, 2016, 9:17:05 AM, aaron.s.knister at nasa.gov wrote: From: aaron.s.knister at nasa.gov To: gpfsug-discuss at spectrumscale.org Cc: Date: Aug 30, 2016 9:17:05 AM Subject: [gpfsug-discuss] gpfs native raid Does anyone know if/when we might see gpfs native raid opened up for the masses on non-IBM hardware? It's hard to answer the question of "why can't GPFS do this? Lustre can" in regards to Lustre's integration with ZFS and support for RAID on commodity hardware. -Aaron -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Tue Aug 30 18:16:03 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Tue, 30 Aug 2016 13:16:03 -0400 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: Message-ID: <96282850-6bfa-73ae-8502-9e8df3a56390@nasa.gov> Thanks Christopher. I've tried GPFS on zvols a couple times and the write throughput I get is terrible because of the required sync=always parameter. Perhaps a couple of SSD's could help get the number up, though. -Aaron On 8/30/16 12:47 PM, Christopher Maestas wrote: > Interestingly enough, Spectrum Scale can run on zvols. Check out: > > http://files.gpfsug.org/presentations/2016/anl-june/LANL_GPFS_ZFS.pdf > > -cdm > > ------------------------------------------------------------------------ > On Aug 30, 2016, 9:17:05 AM, aaron.s.knister at nasa.gov wrote: > > From: aaron.s.knister at nasa.gov > To: gpfsug-discuss at spectrumscale.org > Cc: > Date: Aug 30, 2016 9:17:05 AM > Subject: [gpfsug-discuss] gpfs native raid > > Does anyone know if/when we might see gpfs native raid opened up for the > masses on non-IBM hardware? It's hard to answer the question of "why > can't GPFS do this? Lustre can" in regards to Lustre's integration with > ZFS and support for RAID on commodity hardware. > -Aaron > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From laurence at qsplace.co.uk Tue Aug 30 19:50:51 2016 From: laurence at qsplace.co.uk (Laurence Horrocks-Barlow) Date: Tue, 30 Aug 2016 20:50:51 +0200 Subject: [gpfsug-discuss] Data Replication In-Reply-To: <21BC488F0AEA2245B2C3E83FC0B33DBB063161EE@CHI-EXCHANGEW1.w2k.jumptrading.com> References: <21BC488F0AEA2245B2C3E83FC0B33DBB063161EE@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: Its the client that does all the synchronous replication, this way the cluster is able to scale as the clients do the leg work (so to speak). 
The somewhat "exception" is if a GPFS NSD server (or client with direct NSD) access uses a server bases protocol such as SMB, in this case the SMB server will do the replication as the SMB client doesn't know about GPFS or its replication; essentially the SMB server is the GPFS client. -- Lauz On 30 August 2016 17:03:38 CEST, Bryan Banister wrote: >The NSD Client handles the replication and will, as you stated, write >one copy to one NSD (using the primary server for this NSD) and one to >a different NSD in a different GPFS failure group (using quite likely, >but not necessarily, a different NSD server that is the primary server >for this alternate NSD). >Cheers, >-Bryan > >From: gpfsug-discuss-bounces at spectrumscale.org >[mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Brian >Marshall >Sent: Tuesday, August 30, 2016 9:59 AM >To: gpfsug main discussion list >Subject: [gpfsug-discuss] Data Replication > >All, > >If I setup a filesystem to have data replication of 2 (2 copies of >data), does the data get replicated at the NSD Server or at the client? >i.e. Does the client send 2 copies over the network or does the NSD >Server get a single copy and then replicate on storage NSDs? > >I couldn't find a place in the docs that talked about this specific >point. > >Thank you, >Brian Marshall > >________________________________ > >Note: This email is for the confidential use of the named addressee(s) >only and may contain proprietary, confidential or privileged >information. If you are not the intended recipient, you are hereby >notified that any review, dissemination or copying of this email is >strictly prohibited, and to please notify the sender immediately and >destroy this email and any attachments. Email transmission cannot be >guaranteed to be secure or error-free. The Company, therefore, does not >make any guarantees as to the completeness or accuracy of this email or >any attachments. This email is for informational purposes only and does >not constitute a recommendation, offer, request or solicitation of any >kind to buy, sell, subscribe, redeem or perform any type of transaction >of a financial product. > > >------------------------------------------------------------------------ > >_______________________________________________ >gpfsug-discuss mailing list >gpfsug-discuss at spectrumscale.org >http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Sent from my Android device with K-9 Mail. Please excuse my brevity. -------------- next part -------------- An HTML attachment was scrubbed... URL: From mimarsh2 at vt.edu Tue Aug 30 19:52:54 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Tue, 30 Aug 2016 14:52:54 -0400 Subject: [gpfsug-discuss] Data Replication In-Reply-To: References: <21BC488F0AEA2245B2C3E83FC0B33DBB063161EE@CHI-EXCHANGEW1.w2k.jumptrading.com> Message-ID: Thanks. This confirms the numbers that I am seeing. Brian On Tue, Aug 30, 2016 at 2:50 PM, Laurence Horrocks-Barlow < laurence at qsplace.co.uk> wrote: > Its the client that does all the synchronous replication, this way the > cluster is able to scale as the clients do the leg work (so to speak). > > The somewhat "exception" is if a GPFS NSD server (or client with direct > NSD) access uses a server bases protocol such as SMB, in this case the SMB > server will do the replication as the SMB client doesn't know about GPFS or > its replication; essentially the SMB server is the GPFS client. 
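For reference, the knobs involved look roughly like this (a sketch; -r/-m can only be raised up to the -R/-M maximums chosen when the file system was created):

mmlsfs gpfs01 -r -m -R -M     # current default and maximum replica counts
mmlsdisk gpfs01 -L            # confirm the NSDs sit in more than one failure group
mmchfs gpfs01 -r 2 -m 2       # default data and metadata replication of 2
mmrestripefs gpfs01 -R        # re-replicate existing files (I/O heavy, run off-peak)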
> > -- Lauz > > On 30 August 2016 17:03:38 CEST, Bryan Banister > wrote: > >> The NSD Client handles the replication and will, as you stated, write one >> copy to one NSD (using the primary server for this NSD) and one to a >> different NSD in a different GPFS failure group (using quite likely, but >> not necessarily, a different NSD server that is the primary server for this >> alternate NSD). >> >> Cheers, >> >> -Bryan >> >> >> >> *From:* gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss- >> bounces at spectrumscale.org] *On Behalf Of *Brian Marshall >> *Sent:* Tuesday, August 30, 2016 9:59 AM >> *To:* gpfsug main discussion list >> *Subject:* [gpfsug-discuss] Data Replication >> >> >> >> All, >> >> >> >> If I setup a filesystem to have data replication of 2 (2 copies of data), >> does the data get replicated at the NSD Server or at the client? i.e. Does >> the client send 2 copies over the network or does the NSD Server get a >> single copy and then replicate on storage NSDs? >> >> >> >> I couldn't find a place in the docs that talked about this specific point. >> >> >> >> Thank you, >> >> Brian Marshall >> >> >> ------------------------------ >> >> Note: This email is for the confidential use of the named addressee(s) >> only and may contain proprietary, confidential or privileged information. >> If you are not the intended recipient, you are hereby notified that any >> review, dissemination or copying of this email is strictly prohibited, and >> to please notify the sender immediately and destroy this email and any >> attachments. Email transmission cannot be guaranteed to be secure or >> error-free. The Company, therefore, does not make any guarantees as to the >> completeness or accuracy of this email or any attachments. This email is >> for informational purposes only and does not constitute a recommendation, >> offer, request or solicitation of any kind to buy, sell, subscribe, redeem >> or perform any type of transaction of a financial product. >> >> ------------------------------ >> >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> > -- > Sent from my Android device with K-9 Mail. Please excuse my brevity. > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From S.J.Thompson at bham.ac.uk Tue Aug 30 20:09:05 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Tue, 30 Aug 2016 19:09:05 +0000 Subject: [gpfsug-discuss] Maximum value for data replication? Message-ID: Is there a maximum value for data replication in Spectrum Scale? I have a number of nsd servers which have local storage and Id like each node to have a full copy of all the data in the file-system, say this value is 4, can I set replication to 4 for data and metadata and have each server have a full copy? These are protocol nodes and multi cluster mount another file system (yes I know not supported) and the cesroot is in the remote file system. On several occasions where GPFS has wibbled a bit, this has caused issues with ces locks, so I was thinking of moving the cesroot to a local filesysyem which is replicated on the local ssds in the protocol nodes. I.e. Its a generally quiet file system as its only ces cluster config. 
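Roughly what I have in mind, with placeholder paths (and I would double-check the exact cesSharedRoot change prerequisites against the docs first):

mmces service stop NFS -a
mmces service stop SMB -a
rsync -aHAX /gpfs/remotefs/.ces/ /gpfs/localfs/.ces/
mmchconfig cesSharedRoot=/gpfs/localfs/.ces
mmces service start NFS -a
mmces service start SMB -a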
I assume if I stop protocols, rsync the data and then change to the new ces root, I should be able to get this working? Thanks Simon
From kevindjo at us.ibm.com Tue Aug 30 20:43:39 2016 From: kevindjo at us.ibm.com (Kevin D Johnson) Date: Tue, 30 Aug 2016 19:43:39 +0000 Subject: [gpfsug-discuss] greetings Message-ID: An HTML attachment was scrubbed... URL:
From xhejtman at ics.muni.cz Tue Aug 30 21:39:18 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Tue, 30 Aug 2016 22:39:18 +0200 Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> Hello, does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the latest patch 32) does start but does not mount any file system. The internal mount cmd gets stuck. -- Lukáš Hejtmánek
From kevindjo at us.ibm.com Tue Aug 30 21:51:39 2016 From: kevindjo at us.ibm.com (Kevin D Johnson) Date: Tue, 30 Aug 2016 20:51:39 +0000 Subject: Re: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 In-Reply-To: <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> References: <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> Message-ID: An HTML attachment was scrubbed... URL:
From mark.bergman at uphs.upenn.edu Tue Aug 30 22:07:21 2016 From: mark.bergman at uphs.upenn.edu (mark.bergman at uphs.upenn.edu) Date: Tue, 30 Aug 2016 17:07:21 -0400 Subject: Re: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 In-Reply-To: Your message of "Tue, 30 Aug 2016 22:39:18 +0200." <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> References: <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> Message-ID: <24437-1472591241.445832@bR6O.TofS.917u> In the message dated: Tue, 30 Aug 2016 22:39:18 +0200, The pithy ruminations from Lukas Hejtmanek on <[gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8> were: => Hello, GPFS 3.5.0.[23..3-0] work for me under [CentOS|ScientificLinux] 6.8, but at kernel 2.6.32-573 and lower. I've found kernel bugs in blk_cloned_rq_check_limits() in later kernel revs that caused multipath errors, resulting in GPFS being unable to find all NSDs and mount the filesystem. I am not updating to a newer kernel until I'm certain this is resolved. I opened a bug with CentOS: https://bugs.centos.org/view.php?id=10997 and began an extended discussion with the (RH & SUSE) developers of that chunk of kernel code. I don't know if an upstream bug has been opened by RH, but see: https://patchwork.kernel.org/patch/9140337/ => => does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the => latest patch 32) does start but does not mount any file system. The internal => mount cmd gets stuck. => => -- => Lukáš Hejtmánek -- Mark Bergman voice: 215-746-4061 mark.bergman at uphs.upenn.edu fax: 215-614-0266 http://www.cbica.upenn.edu/ IT Technical Director, Center for Biomedical Image Computing and Analytics Department of Radiology University of Pennsylvania PGP Key: http://www.cbica.upenn.edu/sbia/bergman
From xhejtman at ics.muni.cz Tue Aug 30 23:02:50 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Wed, 31 Aug 2016 00:02:50 +0200 Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" In-Reply-To: References: Message-ID: <20160830220250.yt6r7gvfq7rlvtcs@ics.muni.cz> Hello, On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > Find the paper here: > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection thank you for the paper, I appreciate it. However, I wonder whether it could be extended a little.
As it has the title Petascale Data Protection, I think that in Peta scale, you have to deal with millions (well rather hundreds of millions) of files you store in and this is something where TSM does not scale well. Could you give some hints: On the backup site: mmbackup takes ages for: a) scan (try to scan 500M files even in parallel) b) backup - what if 10 % of files get changed - backup process can be blocked several days as mmbackup cannot run in several instances on the same file system, so you have to wait until one run of mmbackup finishes. How long could it take at petascale? On the restore site: how can I restore e.g. 40 millions of file efficiently? dsmc restore '/path/*' runs into serious troubles after say 20M files (maybe wrong internal structures used), however, scanning 1000 more files takes several minutes resulting the dsmc restore never reaches that 40M files. using filelists the situation is even worse. I run dsmc restore -filelist with a filelist consisting of 2.4M files. Running for *two* days without restoring even a single file. dsmc is consuming 100 % CPU. So any hints addressing these issues with really large number of files would be even more appreciated. -- Luk?? Hejtm?nek From oehmes at gmail.com Wed Aug 31 00:24:59 2016 From: oehmes at gmail.com (Sven Oehme) Date: Tue, 30 Aug 2016 16:24:59 -0700 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" In-Reply-To: <20160830220250.yt6r7gvfq7rlvtcs@ics.muni.cz> References: <20160830220250.yt6r7gvfq7rlvtcs@ics.muni.cz> Message-ID: so lets start with some simple questions. when you say mmbackup takes ages, what version of gpfs code are you running ? how do you execute the mmbackup command ? exact parameters would be useful . what HW are you using for the metadata disks ? how much capacity (df -h) and how many inodes (df -i) do you have in the filesystem you try to backup ? sven On Tue, Aug 30, 2016 at 3:02 PM, Lukas Hejtmanek wrote: > Hello, > > On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > > Find the paper here: > > > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/ > Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection > > thank you for the paper, I appreciate it. > > However, I wonder whether it could be extended a little. As it has the > title > Petascale Data Protection, I think that in Peta scale, you have to deal > with > millions (well rather hundreds of millions) of files you store in and this > is > something where TSM does not scale well. > > Could you give some hints: > > On the backup site: > mmbackup takes ages for: > a) scan (try to scan 500M files even in parallel) > b) backup - what if 10 % of files get changed - backup process can be > blocked > several days as mmbackup cannot run in several instances on the same file > system, so you have to wait until one run of mmbackup finishes. How long > could > it take at petascale? > > On the restore site: > how can I restore e.g. 40 millions of file efficiently? dsmc restore > '/path/*' > runs into serious troubles after say 20M files (maybe wrong internal > structures used), however, scanning 1000 more files takes several minutes > resulting the dsmc restore never reaches that 40M files. > > using filelists the situation is even worse. I run dsmc restore -filelist > with a filelist consisting of 2.4M files. Running for *two* days without > restoring even a single file. dsmc is consuming 100 % CPU. 
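One workaround people try for the filelist case is to shard the list and run several client sessions side by side; a sketch only, not a tuned recipe, and too many parallel sessions can simply move the bottleneck to the TSM server:

# split the list into chunks and restore them in parallel to a target directory
split -l 200000 restore.filelist chunk.
for f in chunk.*; do
    dsmc restore -filelist="$f" /gpfs01/restored/ &
done
wait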
> > So any hints addressing these issues with really large number of files > would > be even more appreciated. > > -- > Luk?? Hejtm?nek > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Wed Aug 31 05:00:45 2016 From: aaron.s.knister at nasa.gov (Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]) Date: Wed, 31 Aug 2016 04:00:45 +0000 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" References: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" Message-ID: <5F910253243E6A47B81A9A2EB424BBA101CFF7DB@NDMSMBX404.ndc.nasa.gov> Just want to add on to one of the points Sven touched on regarding metadata HW. We have a modest SSD infrastructure for our metadata disks and we can scan 500M inodes in parallel in about 5 hours if my memory serves me right (and I believe we could go faster if we really wanted to). I think having solid metadata disks (no pun intended) will really help with scan times. From: Sven Oehme Sent: 8/30/16, 7:25 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" so lets start with some simple questions. when you say mmbackup takes ages, what version of gpfs code are you running ? how do you execute the mmbackup command ? exact parameters would be useful . what HW are you using for the metadata disks ? how much capacity (df -h) and how many inodes (df -i) do you have in the filesystem you try to backup ? sven On Tue, Aug 30, 2016 at 3:02 PM, Lukas Hejtmanek > wrote: Hello, On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > Find the paper here: > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection thank you for the paper, I appreciate it. However, I wonder whether it could be extended a little. As it has the title Petascale Data Protection, I think that in Peta scale, you have to deal with millions (well rather hundreds of millions) of files you store in and this is something where TSM does not scale well. Could you give some hints: On the backup site: mmbackup takes ages for: a) scan (try to scan 500M files even in parallel) b) backup - what if 10 % of files get changed - backup process can be blocked several days as mmbackup cannot run in several instances on the same file system, so you have to wait until one run of mmbackup finishes. How long could it take at petascale? On the restore site: how can I restore e.g. 40 millions of file efficiently? dsmc restore '/path/*' runs into serious troubles after say 20M files (maybe wrong internal structures used), however, scanning 1000 more files takes several minutes resulting the dsmc restore never reaches that 40M files. using filelists the situation is even worse. I run dsmc restore -filelist with a filelist consisting of 2.4M files. Running for *two* days without restoring even a single file. dsmc is consuming 100 % CPU. So any hints addressing these issues with really large number of files would be even more appreciated. -- Luk?? 
Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From olaf.weiser at de.ibm.com Wed Aug 31 05:52:57 2016 From: olaf.weiser at de.ibm.com (Olaf Weiser) Date: Wed, 31 Aug 2016 06:52:57 +0200 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" In-Reply-To: <5F910253243E6A47B81A9A2EB424BBA101CFF7DB@NDMSMBX404.ndc.nasa.gov> References: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" <5F910253243E6A47B81A9A2EB424BBA101CFF7DB@NDMSMBX404.ndc.nasa.gov> Message-ID: An HTML attachment was scrubbed... URL: From dominic.mueller at de.ibm.com Wed Aug 31 06:52:38 2016 From: dominic.mueller at de.ibm.com (Dominic Mueller-Wicke01) Date: Wed, 31 Aug 2016 07:52:38 +0200 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Dominic Mueller-Wicke) In-Reply-To: References: Message-ID: Thanks for reading the paper. I agree that the restore of a large number of files is a challenge today. The restore is the focus area for future enhancements for the integration between IBM Spectrum Scale and IBM Spectrum Protect. If something will be available that helps to improve the restore capabilities the paper will be updated with this information. Greetings, Dominic. From: gpfsug-discuss-request at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Date: 31.08.2016 01:25 Subject: gpfsug-discuss Digest, Vol 55, Issue 55 Sent by: gpfsug-discuss-bounces at spectrumscale.org Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Maximum value for data replication? (Simon Thompson (Research Computing - IT Services)) 2. greetings (Kevin D Johnson) 3. GPFS 3.5.0 on RHEL 6.8 (Lukas Hejtmanek) 4. Re: GPFS 3.5.0 on RHEL 6.8 (Kevin D Johnson) 5. Re: GPFS 3.5.0 on RHEL 6.8 (mark.bergman at uphs.upenn.edu) 6. Re: *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Lukas Hejtmanek) 7. Re: *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Sven Oehme) ----- Message from "Simon Thompson (Research Computing - IT Services)" on Tue, 30 Aug 2016 19:09:05 +0000 ----- To: "gpfsug-discuss at spectrumscale.org" Subject: [gpfsug-discuss] Maximum value for data replication? Is there a maximum value for data replication in Spectrum Scale? I have a number of nsd servers which have local storage and Id like each node to have a full copy of all the data in the file-system, say this value is 4, can I set replication to 4 for data and metadata and have each server have a full copy? These are protocol nodes and multi cluster mount another file system (yes I know not supported) and the cesroot is in the remote file system. On several occasions where GPFS has wibbled a bit, this has caused issues with ces locks, so I was thinking of moving the cesroot to a local filesysyem which is replicated on the local ssds in the protocol nodes. I.e. 
Its a generally quiet file system as its only ces cluster config. I assume if I stop protocols, rsync the data and then change to the new ces root, I should be able to get this working? Thanks Simon ----- Message from "Kevin D Johnson" on Tue, 30 Aug 2016 19:43:39 +0000 ----- To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] greetings I'm in Lab Services at IBM - just joining and happy to help any way I can. Kevin D. Johnson, MBA, MAFM Spectrum Computing, Senior Managing Consultant IBM Certified Deployment Professional - Spectrum Scale V4.1.1 IBM Certified Deployment Professional - Cloud Object Storage V3.8 720.349.6199 - kevindjo at us.ibm.com ----- Message from Lukas Hejtmanek on Tue, 30 Aug 2016 22:39:18 +0200 ----- To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Hello, does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the latest patch 32) does start but does not mount and file system. The internal mount cmd gets stucked. -- Luk?? Hejtm?nek ----- Message from "Kevin D Johnson" on Tue, 30 Aug 2016 20:51:39 +0000 ----- To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 RHEL 6.8/2.6.32-642 requires 4.1.1.8 or 4.2.1. You can either go to 6.7 for GPFS 3.5 or bump it up to 7.0/7.1. See Table 13, here: http://www.ibm.com/support/knowledgecenter/en/STXKQY/gpfsclustersfaq.html?view=kc#linuxq Kevin D. Johnson, MBA, MAFM Spectrum Computing, Senior Managing Consultant IBM Certified Deployment Professional - Spectrum Scale V4.1.1 IBM Certified Deployment Professional - Cloud Object Storage V3.8 720.349.6199 - kevindjo at us.ibm.com ----- Original message ----- From: Lukas Hejtmanek Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Cc: Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Date: Tue, Aug 30, 2016 4:39 PM Hello, does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the latest patch 32) does start but does not mount and file system. The internal mount cmd gets stucked. -- Luk?? Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ----- Message from mark.bergman at uphs.upenn.edu on Tue, 30 Aug 2016 17:07:21 -0400 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 In the message dated: Tue, 30 Aug 2016 22:39:18 +0200, The pithy ruminations from Lukas Hejtmanek on <[gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8> were: => Hello, GPFS 3.5.0.[23..3-0] work for me under [CentOS|ScientificLinux] 6.8, but at kernel 2.6.32-573 and lower. I've found kernel bugs in blk_cloned_rq_check_limits() in later kernel revs that caused multipath errors, resulting in GPFS being unable to find all NSDs and mount the filesystem. I am not updating to a newer kernel until I'm certain this is resolved. I opened a bug with CentOS: https://bugs.centos.org/view.php?id=10997 and began an extended discussion with the (RH & SUSE) developers of that chunk of kernel code. I don't know if an upstream bug has been opened by RH, but see: https://patchwork.kernel.org/patch/9140337/ => => does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the => latest patch 32) does start but does not mount and file system. The internal => mount cmd gets stucked. => => -- => Luk?? 
Hejtm?nek -- Mark Bergman voice: 215-746-4061 mark.bergman at uphs.upenn.edu fax: 215-614-0266 http://www.cbica.upenn.edu/ IT Technical Director, Center for Biomedical Image Computing and Analytics Department of Radiology University of Pennsylvania PGP Key: http://www.cbica.upenn.edu/sbia/bergman ----- Message from Lukas Hejtmanek on Wed, 31 Aug 2016 00:02:50 +0200 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" Hello, On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > Find the paper here: > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection thank you for the paper, I appreciate it. However, I wonder whether it could be extended a little. As it has the title Petascale Data Protection, I think that in Peta scale, you have to deal with millions (well rather hundreds of millions) of files you store in and this is something where TSM does not scale well. Could you give some hints: On the backup site: mmbackup takes ages for: a) scan (try to scan 500M files even in parallel) b) backup - what if 10 % of files get changed - backup process can be blocked several days as mmbackup cannot run in several instances on the same file system, so you have to wait until one run of mmbackup finishes. How long could it take at petascale? On the restore site: how can I restore e.g. 40 millions of file efficiently? dsmc restore '/path/*' runs into serious troubles after say 20M files (maybe wrong internal structures used), however, scanning 1000 more files takes several minutes resulting the dsmc restore never reaches that 40M files. using filelists the situation is even worse. I run dsmc restore -filelist with a filelist consisting of 2.4M files. Running for *two* days without restoring even a single file. dsmc is consuming 100 % CPU. So any hints addressing these issues with really large number of files would be even more appreciated. -- Luk?? Hejtm?nek ----- Message from Sven Oehme on Tue, 30 Aug 2016 16:24:59 -0700 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" so lets start with some simple questions. when you say mmbackup takes ages, what version of gpfs code are you running ? how do you execute the mmbackup command ? exact parameters would be useful . what HW are you using for the metadata disks ? how much capacity (df -h) and how many inodes (df -i) do you have in the filesystem you try to backup ? sven On Tue, Aug 30, 2016 at 3:02 PM, Lukas Hejtmanek wrote: Hello, On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > Find the paper here: > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection thank you for the paper, I appreciate it. However, I wonder whether it could be extended a little. As it has the title Petascale Data Protection, I think that in Peta scale, you have to deal with millions (well rather hundreds of millions) of files you store in and this is something where TSM does not scale well. Could you give some hints: On the backup site: mmbackup takes ages for: a) scan (try to scan 500M files even in parallel) b) backup - what if 10 % of files get changed - backup process can be blocked several days as mmbackup cannot run in several instances on the same file system, so you have to wait until one run of mmbackup finishes. 
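For reference, an mmbackup invocation that spreads the scan and the client sessions over several nodes has roughly this shape (a sketch: the node class, work directories and server stanza are placeholders, and the flags should be checked against the code level in use):

mmbackup /gpfs/gpfs01 -t incremental -N backupnodes -g /gpfs/gpfs01/.mmbackupWork -s /tmp --tsm-servers TSMSERVER1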
How long could it take at petascale? On the restore site: how can I restore e.g. 40 millions of file efficiently? dsmc restore '/path/*' runs into serious troubles after say 20M files (maybe wrong internal structures used), however, scanning 1000 more files takes several minutes resulting the dsmc restore never reaches that 40M files. using filelists the situation is even worse. I run dsmc restore -filelist with a filelist consisting of 2.4M files. Running for *two* days without restoring even a single file. dsmc is consuming 100 % CPU. So any hints addressing these issues with really large number of files would be even more appreciated. -- Luk?? Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From xhejtman at ics.muni.cz Wed Aug 31 08:03:08 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Wed, 31 Aug 2016 09:03:08 +0200 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Dominic Mueller-Wicke) In-Reply-To: References: Message-ID: <20160831070308.fiogolgc2nhna6ir@ics.muni.cz> On Wed, Aug 31, 2016 at 07:52:38AM +0200, Dominic Mueller-Wicke01 wrote: > Thanks for reading the paper. I agree that the restore of a large number of > files is a challenge today. The restore is the focus area for future > enhancements for the integration between IBM Spectrum Scale and IBM > Spectrum Protect. If something will be available that helps to improve the > restore capabilities the paper will be updated with this information. I guess that one of the reasons that restore is slow is because this: (strace dsmc) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home/jfeit/atlases/atlases/stud/atl_en/_referencenotitsig", F_OK) = -1 ENOENT (No such file or directory) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home/jfeit/atlases/atlases/stud/atl_en", F_OK) = -1 ENOENT (No such file or directory) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home/jfeit/atlases/atlases/stud", F_OK) = -1 ENOENT (No such file or directory) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home/jfeit/atlases/atlases", F_OK) = -1 ENOENT (No such file or directory) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home/jfeit/atlases", F_OK) = -1 ENOENT (No such file or directory) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home/jfeit", F_OK) = -1 ENOENT (No such file or directory) [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum/home", F_OK) = 0 [pid 9022] access("/exports/tape_tape/admin/restored/disk_error/1/VO_metacentrum", F_OK) = 0 it seems that dsmc tests access again and again up to root for each item in the file list if I set different location where to place the restored files. -- Luk?? 
Hejtm?nek From duersch at us.ibm.com Wed Aug 31 13:45:12 2016 From: duersch at us.ibm.com (Steve Duersch) Date: Wed, 31 Aug 2016 08:45:12 -0400 Subject: [gpfsug-discuss] Maximum value for data replication? In-Reply-To: References: Message-ID: >>Is there a maximum value for data replication in Spectrum Scale? The maximum value for replication is 3. Steve Duersch Spectrum Scale RAID 845-433-7902 IBM Poughkeepsie, New York From: gpfsug-discuss-request at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Date: 08/30/2016 07:25 PM Subject: gpfsug-discuss Digest, Vol 55, Issue 55 Sent by: gpfsug-discuss-bounces at spectrumscale.org Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Maximum value for data replication? (Simon Thompson (Research Computing - IT Services)) 2. greetings (Kevin D Johnson) 3. GPFS 3.5.0 on RHEL 6.8 (Lukas Hejtmanek) 4. Re: GPFS 3.5.0 on RHEL 6.8 (Kevin D Johnson) 5. Re: GPFS 3.5.0 on RHEL 6.8 (mark.bergman at uphs.upenn.edu) 6. Re: *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Lukas Hejtmanek) 7. Re: *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Sven Oehme) ---------------------------------------------------------------------- Message: 1 Date: Tue, 30 Aug 2016 19:09:05 +0000 From: "Simon Thompson (Research Computing - IT Services)" To: "gpfsug-discuss at spectrumscale.org" Subject: [gpfsug-discuss] Maximum value for data replication? Message-ID: Content-Type: text/plain; charset="us-ascii" Is there a maximum value for data replication in Spectrum Scale? I have a number of nsd servers which have local storage and Id like each node to have a full copy of all the data in the file-system, say this value is 4, can I set replication to 4 for data and metadata and have each server have a full copy? These are protocol nodes and multi cluster mount another file system (yes I know not supported) and the cesroot is in the remote file system. On several occasions where GPFS has wibbled a bit, this has caused issues with ces locks, so I was thinking of moving the cesroot to a local filesysyem which is replicated on the local ssds in the protocol nodes. I.e. Its a generally quiet file system as its only ces cluster config. I assume if I stop protocols, rsync the data and then change to the new ces root, I should be able to get this working? Thanks Simon ------------------------------ Message: 2 Date: Tue, 30 Aug 2016 19:43:39 +0000 From: "Kevin D Johnson" To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] greetings Message-ID: Content-Type: text/plain; charset="us-ascii" An HTML attachment was scrubbed... URL: < http://gpfsug.org/pipermail/gpfsug-discuss/attachments/20160830/5a2e22a3/attachment-0001.html > ------------------------------ Message: 3 Date: Tue, 30 Aug 2016 22:39:18 +0200 From: Lukas Hejtmanek To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: <20160830203917.qptfgqvlmdbzu6wr at ics.muni.cz> Content-Type: text/plain; charset=iso-8859-2 Hello, does it work for anyone? 
As of kernel 2.6.32-642, GPFS 3.5.0 (including the latest patch 32) does start but does not mount and file system. The internal mount cmd gets stucked. -- Luk?? Hejtm?nek ------------------------------ Message: 4 Date: Tue, 30 Aug 2016 20:51:39 +0000 From: "Kevin D Johnson" To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: Content-Type: text/plain; charset="us-ascii" An HTML attachment was scrubbed... URL: < http://gpfsug.org/pipermail/gpfsug-discuss/attachments/20160830/341d5e11/attachment-0001.html > ------------------------------ Message: 5 Date: Tue, 30 Aug 2016 17:07:21 -0400 From: mark.bergman at uphs.upenn.edu To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: <24437-1472591241.445832 at bR6O.TofS.917u> Content-Type: text/plain; charset="UTF-8" In the message dated: Tue, 30 Aug 2016 22:39:18 +0200, The pithy ruminations from Lukas Hejtmanek on <[gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8> were: => Hello, GPFS 3.5.0.[23..3-0] work for me under [CentOS|ScientificLinux] 6.8, but at kernel 2.6.32-573 and lower. I've found kernel bugs in blk_cloned_rq_check_limits() in later kernel revs that caused multipath errors, resulting in GPFS being unable to find all NSDs and mount the filesystem. I am not updating to a newer kernel until I'm certain this is resolved. I opened a bug with CentOS: https://bugs.centos.org/view.php?id=10997 and began an extended discussion with the (RH & SUSE) developers of that chunk of kernel code. I don't know if an upstream bug has been opened by RH, but see: https://patchwork.kernel.org/patch/9140337/ => => does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the => latest patch 32) does start but does not mount and file system. The internal => mount cmd gets stucked. => => -- => Luk?? Hejtm?nek -- Mark Bergman voice: 215-746-4061 mark.bergman at uphs.upenn.edu fax: 215-614-0266 http://www.cbica.upenn.edu/ IT Technical Director, Center for Biomedical Image Computing and Analytics Department of Radiology University of Pennsylvania PGP Key: http://www.cbica.upenn.edu/sbia/bergman ------------------------------ Message: 6 Date: Wed, 31 Aug 2016 00:02:50 +0200 From: Lukas Hejtmanek To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" Message-ID: <20160830220250.yt6r7gvfq7rlvtcs at ics.muni.cz> Content-Type: text/plain; charset=iso-8859-2 Hello, On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > Find the paper here: > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection thank you for the paper, I appreciate it. However, I wonder whether it could be extended a little. As it has the title Petascale Data Protection, I think that in Peta scale, you have to deal with millions (well rather hundreds of millions) of files you store in and this is something where TSM does not scale well. Could you give some hints: On the backup site: mmbackup takes ages for: a) scan (try to scan 500M files even in parallel) b) backup - what if 10 % of files get changed - backup process can be blocked several days as mmbackup cannot run in several instances on the same file system, so you have to wait until one run of mmbackup finishes. How long could it take at petascale? On the restore site: how can I restore e.g. 40 millions of file efficiently? 
dsmc restore '/path/*' runs into serious troubles after say 20M files (maybe wrong internal structures used), however, scanning 1000 more files takes several minutes resulting the dsmc restore never reaches that 40M files. using filelists the situation is even worse. I run dsmc restore -filelist with a filelist consisting of 2.4M files. Running for *two* days without restoring even a single file. dsmc is consuming 100 % CPU. So any hints addressing these issues with really large number of files would be even more appreciated. -- Luk?? Hejtm?nek ------------------------------ Message: 7 Date: Tue, 30 Aug 2016 16:24:59 -0700 From: Sven Oehme To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" Message-ID: Content-Type: text/plain; charset="utf-8" so lets start with some simple questions. when you say mmbackup takes ages, what version of gpfs code are you running ? how do you execute the mmbackup command ? exact parameters would be useful . what HW are you using for the metadata disks ? how much capacity (df -h) and how many inodes (df -i) do you have in the filesystem you try to backup ? sven On Tue, Aug 30, 2016 at 3:02 PM, Lukas Hejtmanek wrote: > Hello, > > On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > > Find the paper here: > > > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/ > Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection > > thank you for the paper, I appreciate it. > > However, I wonder whether it could be extended a little. As it has the > title > Petascale Data Protection, I think that in Peta scale, you have to deal > with > millions (well rather hundreds of millions) of files you store in and this > is > something where TSM does not scale well. > > Could you give some hints: > > On the backup site: > mmbackup takes ages for: > a) scan (try to scan 500M files even in parallel) > b) backup - what if 10 % of files get changed - backup process can be > blocked > several days as mmbackup cannot run in several instances on the same file > system, so you have to wait until one run of mmbackup finishes. How long > could > it take at petascale? > > On the restore site: > how can I restore e.g. 40 millions of file efficiently? dsmc restore > '/path/*' > runs into serious troubles after say 20M files (maybe wrong internal > structures used), however, scanning 1000 more files takes several minutes > resulting the dsmc restore never reaches that 40M files. > > using filelists the situation is even worse. I run dsmc restore -filelist > with a filelist consisting of 2.4M files. Running for *two* days without > restoring even a single file. dsmc is consuming 100 % CPU. > > So any hints addressing these issues with really large number of files > would > be even more appreciated. > > -- > Luk?? Hejtm?nek > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: < http://gpfsug.org/pipermail/gpfsug-discuss/attachments/20160830/d9b3fb68/attachment.html > ------------------------------ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss End of gpfsug-discuss Digest, Vol 55, Issue 55 ********************************************** -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From daniel.kidger at uk.ibm.com Wed Aug 31 15:32:11 2016 From: daniel.kidger at uk.ibm.com (Daniel Kidger) Date: Wed, 31 Aug 2016 14:32:11 +0000 Subject: [gpfsug-discuss] Data Replication In-Reply-To: Message-ID: The other 'Exception' is when a rule is used to convert a 1 way replicated file to 2 way, or when only one failure group is up due to HW problems. It that case the (re-replication) is done by whatever nodes are used for the rule or command-line, which may include an NSD server. Daniel IBM Spectrum Storage Software +44 (0)7818 522266 Sent from my iPad using IBM Verse On 30 Aug 2016, 19:53:31, mimarsh2 at vt.edu wrote: From: mimarsh2 at vt.edu To: gpfsug-discuss at spectrumscale.org Cc: Date: 30 Aug 2016 19:53:31 Subject: Re: [gpfsug-discuss] Data Replication Thanks. This confirms the numbers that I am seeing. Brian On Tue, Aug 30, 2016 at 2:50 PM, Laurence Horrocks-Barlow wrote: Its the client that does all the synchronous replication, this way the cluster is able to scale as the clients do the leg work (so to speak). The somewhat "exception" is if a GPFS NSD server (or client with direct NSD) access uses a server bases protocol such as SMB, in this case the SMB server will do the replication as the SMB client doesn't know about GPFS or its replication; essentially the SMB server is the GPFS client. -- Lauz On 30 August 2016 17:03:38 CEST, Bryan Banister wrote: The NSD Client handles the replication and will, as you stated, write one copy to one NSD (using the primary server for this NSD) and one to a different NSD in a different GPFS failure group (using quite likely, but not necessarily, a different NSD server that is the primary server for this alternate NSD). Cheers, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Brian Marshall Sent: Tuesday, August 30, 2016 9:59 AM To: gpfsug main discussion list Subject: [gpfsug-discuss] Data Replication All, If I setup a filesystem to have data replication of 2 (2 copies of data), does the data get replicated at the NSD Server or at the client? i.e. Does the client send 2 copies over the network or does the NSD Server get a single copy and then replicate on storage NSDs? I couldn't find a place in the docs that talked about this specific point. Thank you, Brian Marshall Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. 
This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. gpfsug-discuss mailing listgpfsug-discuss at spectrumscale.orghttp://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Sent from my Android device with K-9 Mail. Please excuse my brevity. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discussUnless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU -------------- next part -------------- An HTML attachment was scrubbed... URL: From mimarsh2 at vt.edu Wed Aug 31 19:01:45 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Wed, 31 Aug 2016 14:01:45 -0400 Subject: [gpfsug-discuss] Data Replication In-Reply-To: References: Message-ID: Daniel, So here's my use case: I have a Sandisk IF150 (branded as DeepFlash recently) with 128TB of flash acting as a "fast tier" storage pool in our HPC scratch file system. Can I set the filesystem replication level to 1 then write a policy engine rule to send small and/or recent files to the IF150 with a replication of 2? Any other comments on the proposed usage strategy are helpful. Thank you, Brian Marshall On Wed, Aug 31, 2016 at 10:32 AM, Daniel Kidger wrote: > The other 'Exception' is when a rule is used to convert a 1 way replicated > file to 2 way, or when only one failure group is up due to HW problems. It > that case the (re-replication) is done by whatever nodes are used for the > rule or command-line, which may include an NSD server. > > Daniel > > IBM Spectrum Storage Software > +44 (0)7818 522266 <+44%207818%20522266> > Sent from my iPad using IBM Verse > > > ------------------------------ > On 30 Aug 2016, 19:53:31, mimarsh2 at vt.edu wrote: > > From: mimarsh2 at vt.edu > To: gpfsug-discuss at spectrumscale.org > Cc: > Date: 30 Aug 2016 19:53:31 > Subject: Re: [gpfsug-discuss] Data Replication > > > Thanks. This confirms the numbers that I am seeing. > > Brian > > On Tue, Aug 30, 2016 at 2:50 PM, Laurence Horrocks-Barlow < > laurence at qsplace.co.uk> wrote: > >> Its the client that does all the synchronous replication, this way the >> cluster is able to scale as the clients do the leg work (so to speak). >> >> The somewhat "exception" is if a GPFS NSD server (or client with direct >> NSD) access uses a server bases protocol such as SMB, in this case the SMB >> server will do the replication as the SMB client doesn't know about GPFS or >> its replication; essentially the SMB server is the GPFS client. >> >> -- Lauz >> >> On 30 August 2016 17:03:38 CEST, Bryan Banister < >> bbanister at jumptrading.com> wrote: >> >>> The NSD Client handles the replication and will, as you stated, write >>> one copy to one NSD (using the primary server for this NSD) and one to a >>> different NSD in a different GPFS failure group (using quite likely, but >>> not necessarily, a different NSD server that is the primary server for this >>> alternate NSD). 
>>> >>> Cheers, >>> >>> -Bryan >>> >>> >>> >>> *From:* gpfsug-discuss-bounces at spectrumscale.org [mailto: >>> gpfsug-discuss-bounces at spectrumscale.org] *On Behalf Of *Brian Marshall >>> *Sent:* Tuesday, August 30, 2016 9:59 AM >>> *To:* gpfsug main discussion list >>> *Subject:* [gpfsug-discuss] Data Replication >>> >>> >>> >>> All, >>> >>> >>> >>> If I setup a filesystem to have data replication of 2 (2 copies of >>> data), does the data get replicated at the NSD Server or at the client? >>> i.e. Does the client send 2 copies over the network or does the NSD Server >>> get a single copy and then replicate on storage NSDs? >>> >>> >>> >>> I couldn't find a place in the docs that talked about this specific >>> point. >>> >>> >>> >>> Thank you, >>> >>> Brian Marshall >>> >>> >>> ------------------------------ >>> >>> Note: This email is for the confidential use of the named addressee(s) >>> only and may contain proprietary, confidential or privileged information. >>> If you are not the intended recipient, you are hereby notified that any >>> review, dissemination or copying of this email is strictly prohibited, and >>> to please notify the sender immediately and destroy this email and any >>> attachments. Email transmission cannot be guaranteed to be secure or >>> error-free. The Company, therefore, does not make any guarantees as to the >>> completeness or accuracy of this email or any attachments. This email is >>> for informational purposes only and does not constitute a recommendation, >>> offer, request or solicitation of any kind to buy, sell, subscribe, redeem >>> or perform any type of transaction of a financial product. >>> >>> ------------------------------ >>> >>> gpfsug-discuss mailing list >>> gpfsug-discuss at spectrumscale.org >>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>> >>> >> -- >> Sent from my Android device with K-9 Mail. Please excuse my brevity. >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> > Unless stated otherwise above: > IBM United Kingdom Limited - Registered in England and Wales with number > 741598. > Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Wed Aug 31 19:10:07 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Wed, 31 Aug 2016 14:10:07 -0400 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" - how about a Billion files in 140 seconds? In-Reply-To: References: <20160830220250.yt6r7gvfq7rlvtcs@ics.muni.cz> Message-ID: When you write something like "mmbackup takes ages" - that let's us know how you feel, kinda. But we need some facts and data to make a determination if there is a real problem and whether and how it might be improved. Just to do a "back of the envelope" estimate of how long backup operations "ought to" take - we'd need to know how many disks and/or SSDs with what performance characteristics, how many nodes withf what performance characteristics, network "fabric(s)", Number of files to be scanned, Average number of files per directory, GPFS blocksize(s) configured, Backup devices available with speeds and feeds, etc, etc. 
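As a rough illustration of that kind of envelope math (every figure below is an assumed placeholder, not a measurement from any real cluster), the estimate boils down to two divisions:

  # Back-of-the-envelope sketch only; all numbers are assumptions, adjust to your own cluster.
  FILES_TOTAL=500000000      # files in the filesystem
  SCAN_RATE=2000000          # files/sec a parallel policy scan is assumed to sustain
  CHANGED_PCT=10             # percent of files changed since the last backup
  AVG_FILE_KB=1024           # assumed average size of a changed file, in KB
  BACKUP_MBS=2000            # assumed aggregate MB/s into the backup servers

  echo "scan:     $(( FILES_TOTAL / SCAN_RATE / 60 )) minutes"
  echo "transfer: $(( FILES_TOTAL / 100 * CHANGED_PCT * AVG_FILE_KB / 1024 / BACKUP_MBS / 3600 )) hours"
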
But anyway just to throw ballpark numbers "out there" to give you an idea of what is possible. I can tell you that a 20 months ago Sven and I benchmarked mmapplypolicy scanning 983 Million files in 136 seconds! The command looked like this: mmapplypolicy /ibm/fs2-1m-p01/shared/Btt -g /ibm/fs2-1m-p01/tmp -d 7 -A 256 -a 32 -n 8 -P /ghome/makaplan/sventests/milli.policy -I test -L 1 -N fastclients fastclients was 10 X86_64 commodity nodes The fs2-1m-p01 file system was hosted on just two IBM GSS nodes and everything was on an Infiniband switch. We packed about 7000 files into each directory.... (This admittedly may not be typical...) This is NOT to say you could back up that many files that fast, but Spectrum Scale metadata scanning can be fast, even with relatively modest hardware resources. YMMV ;-) Marc of GPFS -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhejtman at ics.muni.cz Wed Aug 31 19:39:26 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Wed, 31 Aug 2016 20:39:26 +0200 Subject: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" In-Reply-To: References: <20160830220250.yt6r7gvfq7rlvtcs@ics.muni.cz> Message-ID: <20160831183926.k4mbwbbrmxybd7a3@ics.muni.cz> On Tue, Aug 30, 2016 at 04:24:59PM -0700, Sven Oehme wrote: > so lets start with some simple questions. > > when you say mmbackup takes ages, what version of gpfs code are you running > ? that was GPFS 3.5.0-8. The mmapplypolicy took over 2 hours but that was the least problem. We developed our own set of backups scripts around mmbackup to address these issues: 1) while mmbackup is running, you cannot run another instance on the same file system. 2) mmbackup can be very slow, but not mmbackup itself but consecutive dsmc selective, sorry for being misleading, but mainly due to the large number of files to be backed up 3) related to the previous, mmbackup scripts seem to be executing a 'grep' cmd for every input file to check whether it has entry in dmsc output log. well guess what happens if you have millions of files at the input and several gigabytes in dsmc outpu log... In our case, the grep storm took several *weeks*. 4) very surprisingly, some of the files were not backed up at all. We cannot find why but dsmc incremental found some old files that were not covered by mmbackup backups. Maybe because the mmbackup process was not gracefully terminated in some cases (node crash) and so on. > how do you execute the mmbackup command ? exact parameters would be useful > . /usr/lpp/mmfs/bin/mmbackup tape_tape -t incremental -v -N fe1 -P ${POLICY_FILE} --tsm-servers SERVER1 -g /gpfs/clusterbase/tmp/ -s /tmp -m 4 -B 9999999999999 -L 0 we had external exec script that split files from policy into chunks that were run in parallel. > what HW are you using for the metadata disks ? 4x SSD > how much capacity (df -h) and how many inodes (df -i) do you have in the > filesystem you try to backup ? 
df -h /dev/tape_tape 1.5P 745T 711T 52% /exports/tape_tape df -hi /dev/tape_tape 1.3G 98M 1.2G 8% /exports/tape_tape (98M inodes used) mmdf tape_tape disk disk size failure holds holds free KB free KB name in KB group metadata data in full blocks in fragments --------------- ------------- -------- -------- ----- -------------------- ------------------- Disks in storage pool: system (Maximum disk size allowed is 175 TB) nsd_t1_5 23437934592 1 No Yes 7342735360 ( 31%) 133872128 ( 1%) nsd_t1_6 23437934592 1 No Yes 7341166592 ( 31%) 133918784 ( 1%) nsd_t1b_2 23437934592 1 No Yes 7343919104 ( 31%) 134165056 ( 1%) nsd_t1b_3 23437934592 1 No Yes 7341283328 ( 31%) 133986560 ( 1%) nsd_ssd_4 770703360 2 Yes No 692172800 ( 90%) 15981952 ( 2%) nsd_ssd_3 770703360 2 Yes No 692252672 ( 90%) 15921856 ( 2%) nsd_ssd_2 770703360 2 Yes No 692189184 ( 90%) 15928832 ( 2%) nsd_ssd_1 770703360 2 Yes No 692197376 ( 90%) 16013248 ( 2%) ------------- -------------------- ------------------- (pool total) 96834551808 32137916416 ( 33%) 599788416 ( 1%) Disks in storage pool: maid (Maximum disk size allowed is 466 TB) nsd8_t2_12 31249989632 1 No Yes 13167828992 ( 42%) 36282048 ( 0%) nsd8_t2_13 31249989632 1 No Yes 13166729216 ( 42%) 36131072 ( 0%) nsd8_t2_14 31249989632 1 No Yes 13166886912 ( 42%) 36371072 ( 0%) nsd8_t2_15 31249989632 1 No Yes 13168209920 ( 42%) 36681728 ( 0%) nsd8_t2_16 31249989632 1 No Yes 13165176832 ( 42%) 36279488 ( 0%) nsd8_t2_17 31249989632 1 No Yes 13159870464 ( 42%) 36002560 ( 0%) nsd8_t2_46 31249989632 1 No Yes 29624694784 ( 95%) 81600 ( 0%) nsd8_t2_45 31249989632 1 No Yes 29623111680 ( 95%) 77184 ( 0%) nsd8_t2_44 31249989632 1 No Yes 29621467136 ( 95%) 61440 ( 0%) nsd8_t2_43 31249989632 1 No Yes 29622964224 ( 95%) 64640 ( 0%) nsd8_t2_18 31249989632 1 No Yes 13166675968 ( 42%) 36147648 ( 0%) nsd8_t2_19 31249989632 1 No Yes 13164529664 ( 42%) 36225216 ( 0%) nsd8_t2_20 31249989632 1 No Yes 13165223936 ( 42%) 36242368 ( 0%) nsd8_t2_21 31249989632 1 No Yes 13167353856 ( 42%) 36007744 ( 0%) nsd8_t2_31 31249989632 1 No Yes 13116979200 ( 42%) 14155200 ( 0%) nsd8_t2_32 31249989632 1 No Yes 13115633664 ( 42%) 14243840 ( 0%) nsd8_t2_33 31249989632 1 No Yes 13115830272 ( 42%) 14235392 ( 0%) nsd8_t2_34 31249989632 1 No Yes 13119727616 ( 42%) 14500608 ( 0%) nsd8_t2_35 31249989632 1 No Yes 13116925952 ( 42%) 14304192 ( 0%) nsd8_t2_0 31249989632 1 No Yes 13145503744 ( 42%) 99222016 ( 0%) nsd8_t2_36 31249989632 1 No Yes 13119858688 ( 42%) 14054784 ( 0%) nsd8_t2_37 31249989632 1 No Yes 13114101760 ( 42%) 14200704 ( 0%) nsd8_t2_38 31249989632 1 No Yes 13116483584 ( 42%) 14174720 ( 0%) nsd8_t2_39 31249989632 1 No Yes 13121257472 ( 42%) 14094720 ( 0%) nsd8_t2_40 31249989632 1 No Yes 29622908928 ( 95%) 84352 ( 0%) nsd8_t2_1 31249989632 1 No Yes 13146089472 ( 42%) 99566784 ( 0%) nsd8_t2_2 31249989632 1 No Yes 13146208256 ( 42%) 99128960 ( 0%) nsd8_t2_3 31249989632 1 No Yes 13146890240 ( 42%) 99766720 ( 0%) nsd8_t2_4 31249989632 1 No Yes 13145143296 ( 42%) 98992576 ( 0%) nsd8_t2_5 31249989632 1 No Yes 13135876096 ( 42%) 99555008 ( 0%) nsd8_t2_6 31249989632 1 No Yes 13142831104 ( 42%) 99728064 ( 0%) nsd8_t2_7 31249989632 1 No Yes 13140283392 ( 42%) 99412480 ( 0%) nsd8_t2_8 31249989632 1 No Yes 13143470080 ( 42%) 99653696 ( 0%) nsd8_t2_9 31249989632 1 No Yes 13143650304 ( 42%) 99224704 ( 0%) nsd8_t2_10 31249989632 1 No Yes 13145440256 ( 42%) 99238528 ( 0%) nsd8_t2_11 31249989632 1 No Yes 13143201792 ( 42%) 99283008 ( 0%) nsd8_t2_22 31249989632 1 No Yes 13171724288 ( 42%) 36040704 ( 0%) nsd8_t2_23 31249989632 1 No Yes 
13166782464 ( 42%) 36212416 ( 0%) nsd8_t2_24 31249989632 1 No Yes 13167990784 ( 42%) 35842368 ( 0%) nsd8_t2_25 31249989632 1 No Yes 13166972928 ( 42%) 36086848 ( 0%) nsd8_t2_26 31249989632 1 No Yes 13167495168 ( 42%) 36114496 ( 0%) nsd8_t2_27 31249989632 1 No Yes 13164419072 ( 42%) 36119680 ( 0%) nsd8_t2_28 31249989632 1 No Yes 13167804416 ( 42%) 36088832 ( 0%) nsd8_t2_29 31249989632 1 No Yes 13166057472 ( 42%) 36107072 ( 0%) nsd8_t2_30 31249989632 1 No Yes 13163673600 ( 42%) 36102528 ( 0%) nsd8_t2_41 31249989632 1 No Yes 29620840448 ( 95%) 70208 ( 0%) nsd8_t2_42 31249989632 1 No Yes 29621110784 ( 95%) 69568 ( 0%) ------------- -------------------- ------------------- (pool total) 1468749512704 733299890176 ( 50%) 2008331584 ( 0%) ============= ==================== =================== (data) 1562501251072 762668994560 ( 49%) 2544274112 ( 0%) (metadata) 3082813440 2768812032 ( 90%) 63845888 ( 2%) ============= ==================== =================== (total) 1565584064512 765437806592 ( 49%) 2608120000 ( 0%) Inode Information ----------------- Number of used inodes: 102026081 Number of free inodes: 72791199 Number of allocated inodes: 174817280 Maximum number of inodes: 1342177280 -- Luk?? Hejtm?nek From xhejtman at ics.muni.cz Wed Aug 31 20:26:26 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Wed, 31 Aug 2016 21:26:26 +0200 Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 In-Reply-To: <24437-1472591241.445832@bR6O.TofS.917u> References: <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> <24437-1472591241.445832@bR6O.TofS.917u> Message-ID: <20160831192626.k4em4iz7ne2e2cmg@ics.muni.cz> Hello, thank you for explanation. I confirm that things are working with 573 kernel. On Tue, Aug 30, 2016 at 05:07:21PM -0400, mark.bergman at uphs.upenn.edu wrote: > In the message dated: Tue, 30 Aug 2016 22:39:18 +0200, > The pithy ruminations from Lukas Hejtmanek on > <[gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8> were: > => Hello, > > GPFS 3.5.0.[23..3-0] work for me under [CentOS|ScientificLinux] 6.8, > but at kernel 2.6.32-573 and lower. > > I've found kernel bugs in blk_cloned_rq_check_limits() in later kernel > revs that caused multipath errors, resulting in GPFS being unable to > find all NSDs and mount the filesystem. > > I am not updating to a newer kernel until I'm certain this is resolved. > > I opened a bug with CentOS: > > https://bugs.centos.org/view.php?id=10997 > > and began an extended discussion with the (RH & SUSE) developers of that > chunk of kernel code. I don't know if an upstream bug has been opened > by RH, but see: > > https://patchwork.kernel.org/patch/9140337/ > => > => does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the > => latest patch 32) does start but does not mount and file system. The internal > => mount cmd gets stucked. > => > => -- > => Luk?? Hejtm?nek > > > -- > Mark Bergman voice: 215-746-4061 > mark.bergman at uphs.upenn.edu fax: 215-614-0266 > http://www.cbica.upenn.edu/ > IT Technical Director, Center for Biomedical Image Computing and Analytics > Department of Radiology University of Pennsylvania > PGP Key: http://www.cbica.upenn.edu/sbia/bergman > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Luk?? 
Hejtm?nek From wilshire at mcs.anl.gov Wed Aug 31 20:39:17 2016 From: wilshire at mcs.anl.gov (John Blaas) Date: Wed, 31 Aug 2016 14:39:17 -0500 Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 In-Reply-To: <4e7507130c674e35a7ac2c3fa16359e1@GEORGE.anl.gov> References: <20160830203917.qptfgqvlmdbzu6wr@ics.muni.cz> <24437-1472591241.445832@bR6O.TofS.917u> <4e7507130c674e35a7ac2c3fa16359e1@GEORGE.anl.gov> Message-ID: We are running 3.5 w/ patch 32 on nodes with the storage cluster running on Centos 6.8 with kernel at 2.6.32-642.1.1 and the remote compute cluster running 2.6.32-642.3.1 without any issues. That being said we are looking to upgrade as soon as possible to 4.1, but thought I would add that it is possible even if not supported. --- John Blaas On Wed, Aug 31, 2016 at 2:26 PM, Lukas Hejtmanek wrote: > Hello, > > thank you for explanation. I confirm that things are working with 573 kernel. > > On Tue, Aug 30, 2016 at 05:07:21PM -0400, mark.bergman at uphs.upenn.edu wrote: >> In the message dated: Tue, 30 Aug 2016 22:39:18 +0200, >> The pithy ruminations from Lukas Hejtmanek on >> <[gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8> were: >> => Hello, >> >> GPFS 3.5.0.[23..3-0] work for me under [CentOS|ScientificLinux] 6.8, >> but at kernel 2.6.32-573 and lower. >> >> I've found kernel bugs in blk_cloned_rq_check_limits() in later kernel >> revs that caused multipath errors, resulting in GPFS being unable to >> find all NSDs and mount the filesystem. >> >> I am not updating to a newer kernel until I'm certain this is resolved. >> >> I opened a bug with CentOS: >> >> https://bugs.centos.org/view.php?id=10997 >> >> and began an extended discussion with the (RH & SUSE) developers of that >> chunk of kernel code. I don't know if an upstream bug has been opened >> by RH, but see: >> >> https://patchwork.kernel.org/patch/9140337/ >> => >> => does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the >> => latest patch 32) does start but does not mount and file system. The internal >> => mount cmd gets stucked. >> => >> => -- >> => Luk?? Hejtm?nek >> >> >> -- >> Mark Bergman voice: 215-746-4061 >> mark.bergman at uphs.upenn.edu fax: 215-614-0266 >> http://www.cbica.upenn.edu/ >> IT Technical Director, Center for Biomedical Image Computing and Analytics >> Department of Radiology University of Pennsylvania >> PGP Key: http://www.cbica.upenn.edu/sbia/bergman >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -- > Luk?? Hejtm?nek > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss From janfrode at tanso.net Wed Aug 31 21:44:04 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Wed, 31 Aug 2016 22:44:04 +0200 Subject: [gpfsug-discuss] Data Replication In-Reply-To: References: Message-ID: Assuming your DeepFlash pool is named "deep", something like the following should work: RULE 'deepreplicate' migrate from pool 'deep' to pool 'deep' replicate(2) where MISC_ATTRIBUTES NOT LIKE '%2%' and POOL_NAME LIKE 'deep' "mmapplypolicy gpfs0 -P replicate-policy.pol -I yes" and possibly "mmrestripefs gpfs0 -r" afterwards. 
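Written out as a policy file, that suggestion would look roughly like the sketch below (the file name replicate-policy.pol simply matches the mmapplypolicy example, and 'deep' is an assumed pool name to be replaced with the real DeepFlash pool name):

  /* replicate-policy.pol: sketch of the rule above; 'deep' is an assumed pool name */
  RULE 'deepreplicate'
      MIGRATE FROM POOL 'deep'
      TO POOL 'deep'
      REPLICATE(2)
      WHERE MISC_ATTRIBUTES NOT LIKE '%2%'
        AND POOL_NAME LIKE 'deep'

Migrating from a pool back into the same pool is the usual way to change only the replication attribute: no data moves between pools, files whose MISC_ATTRIBUTES already contain the '2' flag are left alone, and anything the policy run misses can be picked up by the mmrestripefs -r pass afterwards.
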
-jf On Wed, Aug 31, 2016 at 8:01 PM, Brian Marshall wrote: > Daniel, > > So here's my use case: I have a Sandisk IF150 (branded as DeepFlash > recently) with 128TB of flash acting as a "fast tier" storage pool in our > HPC scratch file system. Can I set the filesystem replication level to 1 > then write a policy engine rule to send small and/or recent files to the > IF150 with a replication of 2? > > Any other comments on the proposed usage strategy are helpful. > > Thank you, > Brian Marshall > > On Wed, Aug 31, 2016 at 10:32 AM, Daniel Kidger > wrote: > >> The other 'Exception' is when a rule is used to convert a 1 way >> replicated file to 2 way, or when only one failure group is up due to HW >> problems. It that case the (re-replication) is done by whatever nodes are >> used for the rule or command-line, which may include an NSD server. >> >> Daniel >> >> IBM Spectrum Storage Software >> +44 (0)7818 522266 <+44%207818%20522266> >> Sent from my iPad using IBM Verse >> >> >> ------------------------------ >> On 30 Aug 2016, 19:53:31, mimarsh2 at vt.edu wrote: >> >> From: mimarsh2 at vt.edu >> To: gpfsug-discuss at spectrumscale.org >> Cc: >> Date: 30 Aug 2016 19:53:31 >> Subject: Re: [gpfsug-discuss] Data Replication >> >> >> Thanks. This confirms the numbers that I am seeing. >> >> Brian >> >> On Tue, Aug 30, 2016 at 2:50 PM, Laurence Horrocks-Barlow < >> laurence at qsplace.co.uk> wrote: >> >>> Its the client that does all the synchronous replication, this way the >>> cluster is able to scale as the clients do the leg work (so to speak). >>> >>> The somewhat "exception" is if a GPFS NSD server (or client with direct >>> NSD) access uses a server bases protocol such as SMB, in this case the SMB >>> server will do the replication as the SMB client doesn't know about GPFS or >>> its replication; essentially the SMB server is the GPFS client. >>> >>> -- Lauz >>> >>> On 30 August 2016 17:03:38 CEST, Bryan Banister < >>> bbanister at jumptrading.com> wrote: >>> >>>> The NSD Client handles the replication and will, as you stated, write >>>> one copy to one NSD (using the primary server for this NSD) and one to a >>>> different NSD in a different GPFS failure group (using quite likely, but >>>> not necessarily, a different NSD server that is the primary server for this >>>> alternate NSD). >>>> >>>> Cheers, >>>> >>>> -Bryan >>>> >>>> >>>> >>>> *From:* gpfsug-discuss-bounces at spectrumscale.org [mailto: >>>> gpfsug-discuss-bounces at spectrumscale.org] *On Behalf Of *Brian Marshall >>>> *Sent:* Tuesday, August 30, 2016 9:59 AM >>>> *To:* gpfsug main discussion list >>>> *Subject:* [gpfsug-discuss] Data Replication >>>> >>>> >>>> >>>> All, >>>> >>>> >>>> >>>> If I setup a filesystem to have data replication of 2 (2 copies of >>>> data), does the data get replicated at the NSD Server or at the client? >>>> i.e. Does the client send 2 copies over the network or does the NSD Server >>>> get a single copy and then replicate on storage NSDs? >>>> >>>> >>>> >>>> I couldn't find a place in the docs that talked about this specific >>>> point. >>>> >>>> >>>> >>>> Thank you, >>>> >>>> Brian Marshall >>>> >>>> >>>> ------------------------------ >>>> >>>> Note: This email is for the confidential use of the named addressee(s) >>>> only and may contain proprietary, confidential or privileged information. 
>>>> If you are not the intended recipient, you are hereby notified that any >>>> review, dissemination or copying of this email is strictly prohibited, and >>>> to please notify the sender immediately and destroy this email and any >>>> attachments. Email transmission cannot be guaranteed to be secure or >>>> error-free. The Company, therefore, does not make any guarantees as to the >>>> completeness or accuracy of this email or any attachments. This email is >>>> for informational purposes only and does not constitute a recommendation, >>>> offer, request or solicitation of any kind to buy, sell, subscribe, redeem >>>> or perform any type of transaction of a financial product. >>>> >>>> ------------------------------ >>>> >>>> gpfsug-discuss mailing list >>>> gpfsug-discuss at spectrumscale.org >>>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>>> >>>> >>> -- >>> Sent from my Android device with K-9 Mail. Please excuse my brevity. >>> >>> _______________________________________________ >>> gpfsug-discuss mailing list >>> gpfsug-discuss at spectrumscale.org >>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>> >>> >> Unless stated otherwise above: >> IBM United Kingdom Limited - Registered in England and Wales with number >> 741598. >> Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > -------------- next part -------------- An HTML attachment was scrubbed... URL:
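Following the same pattern, a sketch of the earlier "small and/or recent files to the flash tier with two copies" idea might look like the rule below. The pool names 'data' and 'deep', the 4 MiB cutoff and the one-week window are all assumptions, and REPLICATE(2) can only place a second copy if the target pool has NSDs in at least two failure groups:

  /* Sketch only: pool names, size cutoff and age window are assumptions.
     The filesystem-wide default replication stays at 1; only matching files get 2 copies. */
  RULE 'small-or-recent-to-flash'
      MIGRATE FROM POOL 'data'
      TO POOL 'deep'
      REPLICATE(2)
      WHERE FILE_SIZE <= 4194304                                /* 4 MiB or smaller */
         OR (DAYS(CURRENT_TIMESTAMP) - DAYS(ACCESS_TIME)) <= 7  /* accessed in the last week */

After a policy run, checking a few of the affected files with mmlsattr -L should show whether their data replication factor really went to 2.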