From makaplan at us.ibm.com Thu Sep 1 00:40:13 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Wed, 31 Aug 2016 19:40:13 -0400 Subject: [gpfsug-discuss] Data Replication In-Reply-To: References: Message-ID: You can leave out the WHERE ... AND POOL_NAME LIKE 'deep' - that is redundant with the FROM POOL 'deep' clause. In fact at a slight additional overhead in mmapplypolicy processing due to begin checked a little later in the game, you can leave out MISC_ATTRIBUTES NOT LIKE '%2%' since the code is smart enough to not operate on files already marked as replicate(2). I believe mmapplypolicy .... -I yes means do any necessary data movement and/or replication "now" Alternatively you can say -I defer, which will leave the files "ill-replicated" and then fix them up with mmrestripefs later. The -I yes vs -I defer choice is the same as for mmchattr. Think of mmapplypolicy as a fast, parallel way to do find ... | xargs mmchattr ... Advert: see also samples/ilm/mmfind -- the latest version should have an -xargs option From: Jan-Frode Myklebust To: gpfsug main discussion list Date: 08/31/2016 04:44 PM Subject: Re: [gpfsug-discuss] Data Replication Sent by: gpfsug-discuss-bounces at spectrumscale.org Assuming your DeepFlash pool is named "deep", something like the following should work: RULE 'deepreplicate' migrate from pool 'deep' to pool 'deep' replicate(2) where MISC_ATTRIBUTES NOT LIKE '%2%' and POOL_NAME LIKE 'deep' "mmapplypolicy gpfs0 -P replicate-policy.pol -I yes" and possibly "mmrestripefs gpfs0 -r" afterwards. -jf On Wed, Aug 31, 2016 at 8:01 PM, Brian Marshall wrote: Daniel, So here's my use case: I have a Sandisk IF150 (branded as DeepFlash recently) with 128TB of flash acting as a "fast tier" storage pool in our HPC scratch file system. Can I set the filesystem replication level to 1 then write a policy engine rule to send small and/or recent files to the IF150 with a replication of 2? Any other comments on the proposed usage strategy are helpful. Thank you, Brian Marshall On Wed, Aug 31, 2016 at 10:32 AM, Daniel Kidger wrote: The other 'Exception' is when a rule is used to convert a 1 way replicated file to 2 way, or when only one failure group is up due to HW problems. It that case the (re-replication) is done by whatever nodes are used for the rule or command-line, which may include an NSD server. Daniel IBM Spectrum Storage Software +44 (0)7818 522266 Sent from my iPad using IBM Verse On 30 Aug 2016, 19:53:31, mimarsh2 at vt.edu wrote: From: mimarsh2 at vt.edu To: gpfsug-discuss at spectrumscale.org Cc: Date: 30 Aug 2016 19:53:31 Subject: Re: [gpfsug-discuss] Data Replication Thanks. This confirms the numbers that I am seeing. Brian On Tue, Aug 30, 2016 at 2:50 PM, Laurence Horrocks-Barlow < laurence at qsplace.co.uk> wrote: Its the client that does all the synchronous replication, this way the cluster is able to scale as the clients do the leg work (so to speak). The somewhat "exception" is if a GPFS NSD server (or client with direct NSD) access uses a server bases protocol such as SMB, in this case the SMB server will do the replication as the SMB client doesn't know about GPFS or its replication; essentially the SMB server is the GPFS client. 
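Putting Marc's simplification together with Jan-Frode's original commands, a minimal sketch of the whole sequence (reusing the same 'gpfs0' file system, 'deep' pool and policy file name that appear above in this thread) would be roughly:

RULE 'deepreplicate'
  MIGRATE FROM POOL 'deep' TO POOL 'deep' REPLICATE(2)

# mmapplypolicy gpfs0 -P replicate-policy.pol -I yes

or, to defer the data movement and fix up the ill-replicated files afterwards:

# mmapplypolicy gpfs0 -P replicate-policy.pol -I defer
# mmrestripefs gpfs0 -r
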
-- Lauz On 30 August 2016 17:03:38 CEST, Bryan Banister wrote: The NSD Client handles the replication and will, as you stated, write one copy to one NSD (using the primary server for this NSD) and one to a different NSD in a different GPFS failure group (using quite likely, but not necessarily, a different NSD server that is the primary server for this alternate NSD). Cheers, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto: gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Brian Marshall Sent: Tuesday, August 30, 2016 9:59 AM To: gpfsug main discussion list Subject: [gpfsug-discuss] Data Replication All, If I setup a filesystem to have data replication of 2 (2 copies of data), does the data get replicated at the NSD Server or at the client? i.e. Does the client send 2 copies over the network or does the NSD Server get a single copy and then replicate on storage NSDs? I couldn't find a place in the docs that talked about this specific point. Thank you, Brian Marshall Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Sent from my Android device with K-9 Mail. Please excuse my brevity. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From daniel.kidger at uk.ibm.com Thu Sep 1 11:29:48 2016 From: daniel.kidger at uk.ibm.com (Daniel Kidger) Date: Thu, 1 Sep 2016 10:29:48 +0000 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: <96282850-6bfa-73ae-8502-9e8df3a56390@nasa.gov> Message-ID: Aaron, GNR is a key differentiator for IBM's (and Lenovo's) Storage hardware appliance. ESS and GSS are otherwise commodity storage arrays connected to commodity NSD servers, albeit with a high degree of tuning and rigorous testing and validation. This competes with equivalent DDN and Seagate appliances as well other non s/w Raid offerings from other IBM partners. 
GNR only works for a small number of disk arrays and then only in certain configurations. GNR then might be thought of as 'firmware' for the hardware rather than part of a software defined products at is Spectrum Scale. If you beleive the viewpoint that hardware Raid 'is dead' then GNR will not be the only s/w Raid that will be used to underly Spectrum Scale. As well as vendor specific offerings from DDN, Seagate, etc. ZFS is likely to be a popular choice but is today not well understood or tested. This will change as more 3rd parties publish their experiences and tuning optimisations, and also as storage solution vendors bidding Spectrum Scale find they can't compete without a software Raid component in their offering. Disclaimer: the above are my own views and not necessarily an IBM official viewpoint. Daniel IBM Spectrum Storage Software +44 (0)7818 522266 Sent from my iPad using IBM Verse On 30 Aug 2016, 18:17:01, aaron.s.knister at nasa.gov wrote: From: aaron.s.knister at nasa.gov To: gpfsug-discuss at spectrumscale.org Cc: Date: 30 Aug 2016 18:17:01 Subject: Re: [gpfsug-discuss] gpfs native raid Thanks Christopher. I've tried GPFS on zvols a couple times and the write throughput I get is terrible because of the required sync=always parameter. Perhaps a couple of SSD's could help get the number up, though. -Aaron On 8/30/16 12:47 PM, Christopher Maestas wrote: > Interestingly enough, Spectrum Scale can run on zvols. Check out: > > http://files.gpfsug.org/presentations/2016/anl-june/LANL_GPFS_ZFS.pdf > > -cdm > > ------------------------------------------------------------------------ > On Aug 30, 2016, 9:17:05 AM, aaron.s.knister at nasa.gov wrote: > > From: aaron.s.knister at nasa.gov > To: gpfsug-discuss at spectrumscale.org > Cc: > Date: Aug 30, 2016 9:17:05 AM > Subject: [gpfsug-discuss] gpfs native raid > > Does anyone know if/when we might see gpfs native raid opened up for the > masses on non-IBM hardware? It's hard to answer the question of "why > can't GPFS do this? Lustre can" in regards to Lustre's integration with > ZFS and support for RAID on commodity hardware. > -Aaron > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discussUnless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU -------------- next part -------------- An HTML attachment was scrubbed... URL: From daniel.kidger at uk.ibm.com Thu Sep 1 12:22:47 2016 From: daniel.kidger at uk.ibm.com (Daniel Kidger) Date: Thu, 1 Sep 2016 11:22:47 +0000 Subject: [gpfsug-discuss] Maximum value for data replication? In-Reply-To: References: , Message-ID: An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: Image.1__=0ABB0AB3DFD67DBA8f9e8a93df938 at us.ibm.com.gif Type: image/gif Size: 105 bytes Desc: not available URL: From bauer at cesnet.cz Thu Sep 1 14:30:23 2016 From: bauer at cesnet.cz (Miroslav Bauer) Date: Thu, 1 Sep 2016 15:30:23 +0200 Subject: [gpfsug-discuss] Migration to separate metadata and data disks Message-ID: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> Hello, I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the filesystem metadata from state: -m = 2 (default metadata replicas) - SATA disks (dataAndMetadata, failGroup=1) - SSDs (metadataOnly, failGroup=3) to the desired state: -m = 1 - SATA disks (dataOnly, failGroup=1) - SSDs (metadataOnly, failGroup=3) I have done the following steps in the following order: 1) change SATA disks to dataOnly (stanza file modifies the 'usage' attribute only): # mmchdisk fs1 change -F dataOnly_disks.stanza Attention: Disk parameters were changed. Use the mmrestripefs command with the -r option to relocate data and metadata. Verifying file system configuration information ... mmchdisk: Propagating the cluster configuration data to all affected nodes. This is an asynchronous process. 2) change default metadata replicas number 2->1 # mmchfs fs1 -m 1 3) run mmrestripefs as suggested by output of 1) # mmrestripefs fs1 -r Scanning file system metadata, phase 1 ... Error processing inodes. No space left on device mmrestripefs: Command failed. Examine previous error messages to determine cause. It is, however, still possible to create new files on the filesystem. When I return one of the SATA disks as a dataAndMetadata disk, the mmrestripefs command stops complaining about No space left on device. Both df and mmdf say that there is enough space both for data (SATA) and metadata (SSDs). Does anyone have an idea why is it complaining? Thanks, -- Miroslav Bauer -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 3716 bytes Desc: S/MIME Cryptographic Signature URL: From aaron.s.knister at nasa.gov Thu Sep 1 14:36:32 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Thu, 1 Sep 2016 09:36:32 -0400 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> Message-ID: I must admit, I'm curious as to the reason you're dropping the replication factor from 2 down to 1. There are some serious advantages we've seen to having multiple metadata replicas, as far as error recovery is concerned. Could you paste an output of mmlsdisk for the filesystem? -Aaron On 9/1/16 9:30 AM, Miroslav Bauer wrote: > Hello, > > I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the > filesystem metadata from state: > -m = 2 (default metadata replicas) > - SATA disks (dataAndMetadata, failGroup=1) > - SSDs (metadataOnly, failGroup=3) > to the desired state: > -m = 1 > - SATA disks (dataOnly, failGroup=1) > - SSDs (metadataOnly, failGroup=3) > > I have done the following steps in the following order: > 1) change SATA disks to dataOnly (stanza file modifies the 'usage' > attribute only): > # mmchdisk fs1 change -F dataOnly_disks.stanza > Attention: Disk parameters were changed. > Use the mmrestripefs command with the -r option to relocate data and > metadata. > Verifying file system configuration information ... > mmchdisk: Propagating the cluster configuration data to all > affected nodes. This is an asynchronous process. 
> > 2) change default metadata replicas number 2->1 > # mmchfs fs1 -m 1 > > 3) run mmrestripefs as suggested by output of 1) > # mmrestripefs fs1 -r > Scanning file system metadata, phase 1 ... > Error processing inodes. > No space left on device > mmrestripefs: Command failed. Examine previous error messages to > determine cause. > > It is, however, still possible to create new files on the filesystem. > When I return one of the SATA disks as a dataAndMetadata disk, the > mmrestripefs > command stops complaining about No space left on device. Both df and mmdf > say that there is enough space both for data (SATA) and metadata (SSDs). > Does anyone have an idea why is it complaining? > > Thanks, > > -- > Miroslav Bauer > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From aaron.s.knister at nasa.gov Thu Sep 1 14:39:17 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Thu, 1 Sep 2016 09:39:17 -0400 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> Message-ID: By the way, I suspect the no space on device errors are because GPFS believes for some reason that it is unable to maintain the metadata replication factor of 2 that's likely set on all previously created inodes. On 9/1/16 9:36 AM, Aaron Knister wrote: > I must admit, I'm curious as to the reason you're dropping the > replication factor from 2 down to 1. There are some serious advantages > we've seen to having multiple metadata replicas, as far as error > recovery is concerned. > > Could you paste an output of mmlsdisk for the filesystem? > > -Aaron > > On 9/1/16 9:30 AM, Miroslav Bauer wrote: >> Hello, >> >> I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the >> filesystem metadata from state: >> -m = 2 (default metadata replicas) >> - SATA disks (dataAndMetadata, failGroup=1) >> - SSDs (metadataOnly, failGroup=3) >> to the desired state: >> -m = 1 >> - SATA disks (dataOnly, failGroup=1) >> - SSDs (metadataOnly, failGroup=3) >> >> I have done the following steps in the following order: >> 1) change SATA disks to dataOnly (stanza file modifies the 'usage' >> attribute only): >> # mmchdisk fs1 change -F dataOnly_disks.stanza >> Attention: Disk parameters were changed. >> Use the mmrestripefs command with the -r option to relocate data and >> metadata. >> Verifying file system configuration information ... >> mmchdisk: Propagating the cluster configuration data to all >> affected nodes. This is an asynchronous process. >> >> 2) change default metadata replicas number 2->1 >> # mmchfs fs1 -m 1 >> >> 3) run mmrestripefs as suggested by output of 1) >> # mmrestripefs fs1 -r >> Scanning file system metadata, phase 1 ... >> Error processing inodes. >> No space left on device >> mmrestripefs: Command failed. Examine previous error messages to >> determine cause. >> >> It is, however, still possible to create new files on the filesystem. >> When I return one of the SATA disks as a dataAndMetadata disk, the >> mmrestripefs >> command stops complaining about No space left on device. Both df and mmdf >> say that there is enough space both for data (SATA) and metadata (SSDs). >> Does anyone have an idea why is it complaining? 
>> >> Thanks, >> >> -- >> Miroslav Bauer >> >> >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From jonathan at buzzard.me.uk Thu Sep 1 14:49:11 2016 From: jonathan at buzzard.me.uk (Jonathan Buzzard) Date: Thu, 01 Sep 2016 14:49:11 +0100 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> Message-ID: <1472737751.25479.22.camel@buzzard.phy.strath.ac.uk> On Thu, 2016-09-01 at 09:39 -0400, Aaron Knister wrote: > By the way, I suspect the no space on device errors are because GPFS > believes for some reason that it is unable to maintain the metadata > replication factor of 2 that's likely set on all previously created inodes. > Hazarding a guess, but there is only one SSD NSD, so if all the metadata is going to go on SSD there is no point in replicating. It would also explain why it would believe it can't maintain the metadata replication factor. Though it could just be a simple metadata size is larger than the available SSD size. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. From makaplan at us.ibm.com Thu Sep 1 14:59:28 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Thu, 1 Sep 2016 09:59:28 -0400 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: <96282850-6bfa-73ae-8502-9e8df3a56390@nasa.gov> Message-ID: I've been told that it is a big leap to go from supporting GSS and ESS to allowing and supporting native raid for customers who may throw together "any" combination of hardware they might choose. In particular the GNR "disk hospital" functions... https://www.ibm.com/support/knowledgecenter/SSFKCN_3.5.0/com.ibm.cluster.gpfs.v3r5.gpfs200.doc/bl1adv_introdiskhospital.htm will be tricky to support on umpteen different vendor boxes -- and keep in mind, those will be from IBM competitors! That said, ESS and GSS show that IBM has some good tech in this area and IBM has shown with the Spectrum Scale product (sans GNR) it can support just about any semi-reasonable hardware configuration and a good slew of OS versions and architectures... Heck I have a demo/test version of GPFS running on a 5 year old Thinkpad laptop.... And we have some GSSs in the lab... Not to mention Power hardware and mainframe System Z (think 360, 370, 290, Z) -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Thu Sep 1 15:02:50 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Thu, 1 Sep 2016 10:02:50 -0400 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> Message-ID: <2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> Oh! I think you've already provided the info I was looking for :) I thought that failGroup=3 meant there were 3 failure groups within the SSDs. I suspect that's not at all what you meant and that actually is the failure group of all of those disks. That I think explains what's going on-- there's only one failure group's worth of metadata-capable disks available and as such GPFS can't place the 2nd replica for existing files. 
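A quick way to confirm that layout before changing anything (both commands are already referenced in this thread; 'fs1' is the file system name used above):

# mmlsdisk fs1
(check the 'failure group' and 'holds metadata' columns -- a second metadata replica needs metadata-capable disks in at least two different failure groups)
# mmdf fs1
(check how much free space is left on the metadataOnly disks)
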
Here's what I would suggest: - Create at least 2 failure groups within the SSDs - Put the default metadata replication factor back to 2 - Run a restripefs -R to shuffle files around and restore the metadata replication factor of 2 to any files created while it was set to 1 If you're not interested in replication for metadata then perhaps all you need to do is the mmrestripefs -R. I think that should un-replicate the file from the SATA disks leaving the copy on the SSDs. Hope that helps. -Aaron On 9/1/16 9:39 AM, Aaron Knister wrote: > By the way, I suspect the no space on device errors are because GPFS > believes for some reason that it is unable to maintain the metadata > replication factor of 2 that's likely set on all previously created inodes. > > On 9/1/16 9:36 AM, Aaron Knister wrote: >> I must admit, I'm curious as to the reason you're dropping the >> replication factor from 2 down to 1. There are some serious advantages >> we've seen to having multiple metadata replicas, as far as error >> recovery is concerned. >> >> Could you paste an output of mmlsdisk for the filesystem? >> >> -Aaron >> >> On 9/1/16 9:30 AM, Miroslav Bauer wrote: >>> Hello, >>> >>> I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the >>> filesystem metadata from state: >>> -m = 2 (default metadata replicas) >>> - SATA disks (dataAndMetadata, failGroup=1) >>> - SSDs (metadataOnly, failGroup=3) >>> to the desired state: >>> -m = 1 >>> - SATA disks (dataOnly, failGroup=1) >>> - SSDs (metadataOnly, failGroup=3) >>> >>> I have done the following steps in the following order: >>> 1) change SATA disks to dataOnly (stanza file modifies the 'usage' >>> attribute only): >>> # mmchdisk fs1 change -F dataOnly_disks.stanza >>> Attention: Disk parameters were changed. >>> Use the mmrestripefs command with the -r option to relocate data and >>> metadata. >>> Verifying file system configuration information ... >>> mmchdisk: Propagating the cluster configuration data to all >>> affected nodes. This is an asynchronous process. >>> >>> 2) change default metadata replicas number 2->1 >>> # mmchfs fs1 -m 1 >>> >>> 3) run mmrestripefs as suggested by output of 1) >>> # mmrestripefs fs1 -r >>> Scanning file system metadata, phase 1 ... >>> Error processing inodes. >>> No space left on device >>> mmrestripefs: Command failed. Examine previous error messages to >>> determine cause. >>> >>> It is, however, still possible to create new files on the filesystem. >>> When I return one of the SATA disks as a dataAndMetadata disk, the >>> mmrestripefs >>> command stops complaining about No space left on device. Both df and >>> mmdf >>> say that there is enough space both for data (SATA) and metadata (SSDs). >>> Does anyone have an idea why is it complaining? >>> >>> Thanks, >>> >>> -- >>> Miroslav Bauer >>> >>> >>> >>> >>> _______________________________________________ >>> gpfsug-discuss mailing list >>> gpfsug-discuss at spectrumscale.org >>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>> >> > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From makaplan at us.ibm.com Thu Sep 1 15:14:18 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Thu, 1 Sep 2016 10:14:18 -0400 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: <2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> <2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> Message-ID: I believe the OP left out a step. 
I am not saying this is a good idea, but ... One must change the replication factors marked in each inode for each file... This could be done using an mmapplypolicy rule: RULE 'one' MIGRATE FROM POOL 'yourdatapool' TO POOL 'yourdatapool' REPLICATE(1,1) (repeat rule for each POOL you have) Put that (those) rules in a file and do a "one time" run like mmapplypolicy yourfilesystem -P /path/to/rule -N nodelist-to-do-this-work -g /filesystem/bigtemp -I defer Then try your restripe again. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 21994 bytes Desc: not available URL: From bauer at cesnet.cz Thu Sep 1 15:28:36 2016 From: bauer at cesnet.cz (Miroslav Bauer) Date: Thu, 1 Sep 2016 16:28:36 +0200 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: <2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> <2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> Message-ID: <505ff859-d49a-04cc-bd9d-50f7b2a8df0b@cesnet.cz> Yes, failure group id is exactly what I meant :). Unfortunately, mmrestripefs with -R behaves the same as with -r. I also believed that mmrestripefs -R is the correct tool for fixing the replication settings on inodes (according to manpages), but I will try possible solutions you and Marc suggested and let you know how it went. Thank you, -- Miroslav Bauer On 09/01/2016 04:02 PM, Aaron Knister wrote: > Oh! I think you've already provided the info I was looking for :) I > thought that failGroup=3 meant there were 3 failure groups within the > SSDs. I suspect that's not at all what you meant and that actually is > the failure group of all of those disks. That I think explains what's > going on-- there's only one failure group's worth of metadata-capable > disks available and as such GPFS can't place the 2nd replica for > existing files. > > Here's what I would suggest: > > - Create at least 2 failure groups within the SSDs > - Put the default metadata replication factor back to 2 > - Run a restripefs -R to shuffle files around and restore the metadata > replication factor of 2 to any files created while it was set to 1 > > If you're not interested in replication for metadata then perhaps all > you need to do is the mmrestripefs -R. I think that should > un-replicate the file from the SATA disks leaving the copy on the SSDs. > > Hope that helps. > > -Aaron > > On 9/1/16 9:39 AM, Aaron Knister wrote: >> By the way, I suspect the no space on device errors are because GPFS >> believes for some reason that it is unable to maintain the metadata >> replication factor of 2 that's likely set on all previously created >> inodes. >> >> On 9/1/16 9:36 AM, Aaron Knister wrote: >>> I must admit, I'm curious as to the reason you're dropping the >>> replication factor from 2 down to 1. There are some serious advantages >>> we've seen to having multiple metadata replicas, as far as error >>> recovery is concerned. >>> >>> Could you paste an output of mmlsdisk for the filesystem? 
>>> >>> -Aaron >>> >>> On 9/1/16 9:30 AM, Miroslav Bauer wrote: >>>> Hello, >>>> >>>> I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the >>>> filesystem metadata from state: >>>> -m = 2 (default metadata replicas) >>>> - SATA disks (dataAndMetadata, failGroup=1) >>>> - SSDs (metadataOnly, failGroup=3) >>>> to the desired state: >>>> -m = 1 >>>> - SATA disks (dataOnly, failGroup=1) >>>> - SSDs (metadataOnly, failGroup=3) >>>> >>>> I have done the following steps in the following order: >>>> 1) change SATA disks to dataOnly (stanza file modifies the 'usage' >>>> attribute only): >>>> # mmchdisk fs1 change -F dataOnly_disks.stanza >>>> Attention: Disk parameters were changed. >>>> Use the mmrestripefs command with the -r option to relocate data and >>>> metadata. >>>> Verifying file system configuration information ... >>>> mmchdisk: Propagating the cluster configuration data to all >>>> affected nodes. This is an asynchronous process. >>>> >>>> 2) change default metadata replicas number 2->1 >>>> # mmchfs fs1 -m 1 >>>> >>>> 3) run mmrestripefs as suggested by output of 1) >>>> # mmrestripefs fs1 -r >>>> Scanning file system metadata, phase 1 ... >>>> Error processing inodes. >>>> No space left on device >>>> mmrestripefs: Command failed. Examine previous error messages to >>>> determine cause. >>>> >>>> It is, however, still possible to create new files on the filesystem. >>>> When I return one of the SATA disks as a dataAndMetadata disk, the >>>> mmrestripefs >>>> command stops complaining about No space left on device. Both df and >>>> mmdf >>>> say that there is enough space both for data (SATA) and metadata >>>> (SSDs). >>>> Does anyone have an idea why is it complaining? >>>> >>>> Thanks, >>>> >>>> -- >>>> Miroslav Bauer >>>> >>>> >>>> >>>> >>>> _______________________________________________ >>>> gpfsug-discuss mailing list >>>> gpfsug-discuss at spectrumscale.org >>>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>>> >>> >> > -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 3716 bytes Desc: S/MIME Cryptographic Signature URL: From S.J.Thompson at bham.ac.uk Thu Sep 1 22:06:44 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Thu, 1 Sep 2016 21:06:44 +0000 Subject: [gpfsug-discuss] Maximum value for data replication? In-Reply-To: References: , , Message-ID: I have two protocol node in each of two data centres. So four protocol nodes in the cluster. Plus I also have a quorum vm which is lockstep/ha so guaranteed to survive in one of the data centres should we lose power. The protocol servers being protocol servers don't have access to the fibre channel storage. And we've seen ces do bad things when the storage cluster it is remotely mounting (and the ces root is on) fails/is under load etc. So the four full copies is about guaranteeing there are two full copies in both data centres. And remember this is only for the cesroot, so lock data for the ces ips, the smb registry I think as well. I was hoping that by making the cesroot in the protocol node cluster rather than a fileset on a remote mounted filesysyem, that it would fix the ces weirdness we see as it would become a local gpfs file system. I guess three copies would maybe work. But also in another cluster, we have been thinking about adding NVMe into NSD servers for metadata and system.log and so I can se there are cases there where having higher numbers of copies would be useful. 
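As a very rough sketch of the kind of small, local, heavily replicated cesroot file system being described here (device names, NSD names and the mount path are made up for illustration; note Steve's answer further down that replication tops out at 3 copies rather than 4, and the exact procedure for repointing cesSharedRoot is worth checking against the docs):

%nsd: nsd=ces_nsd1 device=/dev/sdb servers=proto1 usage=dataAndMetadata failureGroup=1 pool=system
%nsd: nsd=ces_nsd2 device=/dev/sdb servers=proto2 usage=dataAndMetadata failureGroup=2 pool=system
%nsd: nsd=ces_nsd3 device=/dev/sdb servers=proto3 usage=dataAndMetadata failureGroup=3 pool=system
(one stanza per protocol node, each node's local SSD in its own failure group)

# mmcrnsd -F ces_nsd.stanza
# mmcrfs cesfs -F ces_nsd.stanza -M 3 -R 3 -m 3 -r 3 -T /gpfs/cesfs

then stop the protocol services, rsync the existing ces root across, and repoint it:

# mmchconfig cesSharedRoot=/gpfs/cesfs/ces
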
Yes I take the point that more copies means more load for the client, but in these cases, we aren't thinking about gpfs as the fastest possible hpc file system, but for other infrastructure purposes, which is one of the ways the product seems to be moving. Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Daniel Kidger [daniel.kidger at uk.ibm.com] Sent: 01 September 2016 12:22 To: gpfsug-discuss at spectrumscale.org Cc: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] Maximum value for data replication? Simon, Hi. Can you explain why you would like a full copy of all blocks on all 4 NSD servers ? Is there a particular use case, and hence an interest from product development? Otherwise remember that with 4 NSD servers, with one failure group per (storage rich) NSD server, then all 4 disk arrays will be loaded equally, as new files will get written to any 3 (or 2 or 1) of the 4 failure groups. Also remember that as you add more replication then there is more network load on the gpfs client as it has to perform all the writes itself. Perhaps someone technical can comment on the logic that determines which '3' out of 4 failure groups, a particular block is written to. Daniel [/spectrum_storage-banne] [Spectrum Scale Logo] Dr Daniel Kidger IBM Technical Sales Specialist Software Defined Solution Sales +44-07818 522 266 daniel.kidger at uk.ibm.com ----- Original message ----- From: Steve Duersch Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Cc: Subject: Re: [gpfsug-discuss] Maximum value for data replication? Date: Wed, Aug 31, 2016 1:45 PM >>Is there a maximum value for data replication in Spectrum Scale? The maximum value for replication is 3. Steve Duersch Spectrum Scale RAID 845-433-7902 IBM Poughkeepsie, New York [Inactive hide details for gpfsug-discuss-request---08/30/2016 07:25:24 PM---Send gpfsug-discuss mailing list submissions to gp]gpfsug-discuss-request---08/30/2016 07:25:24 PM---Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org From: gpfsug-discuss-request at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Date: 08/30/2016 07:25 PM Subject: gpfsug-discuss Digest, Vol 55, Issue 55 Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Maximum value for data replication? (Simon Thompson (Research Computing - IT Services)) 2. greetings (Kevin D Johnson) 3. GPFS 3.5.0 on RHEL 6.8 (Lukas Hejtmanek) 4. Re: GPFS 3.5.0 on RHEL 6.8 (Kevin D Johnson) 5. Re: GPFS 3.5.0 on RHEL 6.8 (mark.bergman at uphs.upenn.edu) 6. Re: *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Lukas Hejtmanek) 7. 
Re: *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Sven Oehme) ---------------------------------------------------------------------- Message: 1 Date: Tue, 30 Aug 2016 19:09:05 +0000 From: "Simon Thompson (Research Computing - IT Services)" To: "gpfsug-discuss at spectrumscale.org" Subject: [gpfsug-discuss] Maximum value for data replication? Message-ID: Content-Type: text/plain; charset="us-ascii" Is there a maximum value for data replication in Spectrum Scale? I have a number of nsd servers which have local storage and Id like each node to have a full copy of all the data in the file-system, say this value is 4, can I set replication to 4 for data and metadata and have each server have a full copy? These are protocol nodes and multi cluster mount another file system (yes I know not supported) and the cesroot is in the remote file system. On several occasions where GPFS has wibbled a bit, this has caused issues with ces locks, so I was thinking of moving the cesroot to a local filesysyem which is replicated on the local ssds in the protocol nodes. I.e. Its a generally quiet file system as its only ces cluster config. I assume if I stop protocols, rsync the data and then change to the new ces root, I should be able to get this working? Thanks Simon ------------------------------ Message: 2 Date: Tue, 30 Aug 2016 19:43:39 +0000 From: "Kevin D Johnson" To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] greetings Message-ID: Content-Type: text/plain; charset="us-ascii" An HTML attachment was scrubbed... URL: ------------------------------ Message: 3 Date: Tue, 30 Aug 2016 22:39:18 +0200 From: Lukas Hejtmanek To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: <20160830203917.qptfgqvlmdbzu6wr at ics.muni.cz> Content-Type: text/plain; charset=iso-8859-2 Hello, does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the latest patch 32) does start but does not mount and file system. The internal mount cmd gets stucked. -- Luk?? Hejtm?nek ------------------------------ Message: 4 Date: Tue, 30 Aug 2016 20:51:39 +0000 From: "Kevin D Johnson" To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: Content-Type: text/plain; charset="us-ascii" An HTML attachment was scrubbed... URL: ------------------------------ Message: 5 Date: Tue, 30 Aug 2016 17:07:21 -0400 From: mark.bergman at uphs.upenn.edu To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: <24437-1472591241.445832 at bR6O.TofS.917u> Content-Type: text/plain; charset="UTF-8" In the message dated: Tue, 30 Aug 2016 22:39:18 +0200, The pithy ruminations from Lukas Hejtmanek on <[gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8> were: => Hello, GPFS 3.5.0.[23..3-0] work for me under [CentOS|ScientificLinux] 6.8, but at kernel 2.6.32-573 and lower. I've found kernel bugs in blk_cloned_rq_check_limits() in later kernel revs that caused multipath errors, resulting in GPFS being unable to find all NSDs and mount the filesystem. I am not updating to a newer kernel until I'm certain this is resolved. I opened a bug with CentOS: https://bugs.centos.org/view.php?id=10997 and began an extended discussion with the (RH & SUSE) developers of that chunk of kernel code. I don't know if an upstream bug has been opened by RH, but see: https://patchwork.kernel.org/patch/9140337/ => => does it work for anyone? 
As of kernel 2.6.32-642, GPFS 3.5.0 (including the => latest patch 32) does start but does not mount and file system. The internal => mount cmd gets stucked. => => -- => Luk?? Hejtm?nek -- Mark Bergman voice: 215-746-4061 mark.bergman at uphs.upenn.edu fax: 215-614-0266 http://www.cbica.upenn.edu/ IT Technical Director, Center for Biomedical Image Computing and Analytics Department of Radiology University of Pennsylvania PGP Key: http://www.cbica.upenn.edu/sbia/bergman ------------------------------ Message: 6 Date: Wed, 31 Aug 2016 00:02:50 +0200 From: Lukas Hejtmanek To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" Message-ID: <20160830220250.yt6r7gvfq7rlvtcs at ics.muni.cz> Content-Type: text/plain; charset=iso-8859-2 Hello, On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > Find the paper here: > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection thank you for the paper, I appreciate it. However, I wonder whether it could be extended a little. As it has the title Petascale Data Protection, I think that in Peta scale, you have to deal with millions (well rather hundreds of millions) of files you store in and this is something where TSM does not scale well. Could you give some hints: On the backup site: mmbackup takes ages for: a) scan (try to scan 500M files even in parallel) b) backup - what if 10 % of files get changed - backup process can be blocked several days as mmbackup cannot run in several instances on the same file system, so you have to wait until one run of mmbackup finishes. How long could it take at petascale? On the restore site: how can I restore e.g. 40 millions of file efficiently? dsmc restore '/path/*' runs into serious troubles after say 20M files (maybe wrong internal structures used), however, scanning 1000 more files takes several minutes resulting the dsmc restore never reaches that 40M files. using filelists the situation is even worse. I run dsmc restore -filelist with a filelist consisting of 2.4M files. Running for *two* days without restoring even a single file. dsmc is consuming 100 % CPU. So any hints addressing these issues with really large number of files would be even more appreciated. -- Luk?? Hejtm?nek ------------------------------ Message: 7 Date: Tue, 30 Aug 2016 16:24:59 -0700 From: Sven Oehme To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" Message-ID: Content-Type: text/plain; charset="utf-8" so lets start with some simple questions. when you say mmbackup takes ages, what version of gpfs code are you running ? how do you execute the mmbackup command ? exact parameters would be useful . what HW are you using for the metadata disks ? how much capacity (df -h) and how many inodes (df -i) do you have in the filesystem you try to backup ? sven On Tue, Aug 30, 2016 at 3:02 PM, Lukas Hejtmanek wrote: > Hello, > > On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > > Find the paper here: > > > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/ > Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection > > thank you for the paper, I appreciate it. > > However, I wonder whether it could be extended a little. 
As it has the > title > Petascale Data Protection, I think that in Peta scale, you have to deal > with > millions (well rather hundreds of millions) of files you store in and this > is > something where TSM does not scale well. > > Could you give some hints: > > On the backup site: > mmbackup takes ages for: > a) scan (try to scan 500M files even in parallel) > b) backup - what if 10 % of files get changed - backup process can be > blocked > several days as mmbackup cannot run in several instances on the same file > system, so you have to wait until one run of mmbackup finishes. How long > could > it take at petascale? > > On the restore site: > how can I restore e.g. 40 millions of file efficiently? dsmc restore > '/path/*' > runs into serious troubles after say 20M files (maybe wrong internal > structures used), however, scanning 1000 more files takes several minutes > resulting the dsmc restore never reaches that 40M files. > > using filelists the situation is even worse. I run dsmc restore -filelist > with a filelist consisting of 2.4M files. Running for *two* days without > restoring even a single file. dsmc is consuming 100 % CPU. > > So any hints addressing these issues with really large number of files > would > be even more appreciated. > > -- > Luk?? Hejtm?nek > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: ------------------------------ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss End of gpfsug-discuss Digest, Vol 55, Issue 55 ********************************************** _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.1__=0ABB0AB3DFD67DBA8f9e8a93df938 at us.ibm.com.gif Type: image/gif Size: 105 bytes Desc: Image.1__=0ABB0AB3DFD67DBA8f9e8a93df938 at us.ibm.com.gif URL: From r.sobey at imperial.ac.uk Fri Sep 2 14:37:26 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 2 Sep 2016 13:37:26 +0000 Subject: [gpfsug-discuss] CES node responding on system IP address Message-ID: Hi all, *Should* a CES node, 4.2.0 OR 4.2.1, be responding on its system IP address? The nodes in my cluster, seemingly randomly, either give me a list of shares, or prompt me to enter a username and password. For example, Start > Run \\cesnode.fqdn I get a prompt for a username and password. If I add the system IP into my hosts file and call it clustername.fqdn it responds normally i.e. no prompt for username or password. Should I be worried about the inconsistencies here? Richard Sobey Storage Area Network (SAN) Analyst Technical Operations, ICT Imperial College London South Kensington 403, City & Guilds Building London SW7 2AZ Tel: +44 (0)20 7594 6915 Email: r.sobey at imperial.ac.uk http://www.imperial.ac.uk/admin-services/ict/ -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From r.sobey at imperial.ac.uk Fri Sep 2 16:10:59 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 2 Sep 2016 15:10:59 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: Message-ID: I?ve verified the upgrade has fixed this issue so thanks again. However I?ve noticed that stopping SMB doesn?t trigger an IP address failover. I think it should. mmces node suspend (or rebooting, or mmces address move, or etc..) seems to trigger the failover. Richard From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Danny Alexander Calderon Rodriguez Sent: 27 August 2016 13:53 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Hi Richard This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly Just do this. edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py Change if authType == 'ad' and not nodeState.nfsStopped: to nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: You need to stop the gpfs service in each node where you apply the change " after change the lines please use tap key" Enviado desde mi iPhone El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) 2. Re: CES and mmuserauth command (Christof Schmitt) ---------------------------------------------------------------------- Message: 1 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Message-ID: > Content-Type: text/plain; charset="UTF-8" That would be the case when Active Directory is configured for authentication. In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: "Sobey, Richard A" > To: "'gpfsug-discuss at spectrumscale.org'" > Date: 08/26/2016 04:48 AM Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Sent by: gpfsug-discuss-bounces at spectrumscale.org Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. I?m trying to stop SMB on a node, but getting the following output: [root at cesnode ~]# mmces service stop smb smb: Request denied. Please stop NFS first [root at cesnode ~]# mmces service list Enabled services: SMB SMB is running As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. It?s happening on all the nodes in the cluster. SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. 
Richard_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ Message: 2 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] CES and mmuserauth command Message-ID: > Content-Type: text/plain; charset="ISO-2022-JP" The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Jan-Frode Myklebust > To: gpfsug main discussion list > Date: 08/26/2016 05:59 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: ------------------------------------------------------------------------------------------------------ ??user?name userName Specifies the user name to be used to perform operations against the authentication server. The specified user name must have sufficient permissions to read user and group attributes from the authentication server. ------------------------------------------------------------------------------------------------------- Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? -jf_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss End of gpfsug-discuss Digest, Vol 55, Issue 44 ********************************************** -------------- next part -------------- An HTML attachment was scrubbed... URL: From S.J.Thompson at bham.ac.uk Fri Sep 2 16:15:30 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Fri, 2 Sep 2016 15:15:30 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: , Message-ID: Should it? If you were running nfs and smb, would you necessarily want to fail the ip over? 
Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sobey, Richard A [r.sobey at imperial.ac.uk] Sent: 02 September 2016 16:10 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? I?ve verified the upgrade has fixed this issue so thanks again. However I?ve noticed that stopping SMB doesn?t trigger an IP address failover. I think it should. mmces node suspend (or rebooting, or mmces address move, or etc..) seems to trigger the failover. Richard From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Danny Alexander Calderon Rodriguez Sent: 27 August 2016 13:53 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Hi Richard This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly Just do this. edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py Change if authType == 'ad' and not nodeState.nfsStopped: to nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: You need to stop the gpfs service in each node where you apply the change " after change the lines please use tap key" Enviado desde mi iPhone El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) 2. Re: CES and mmuserauth command (Christof Schmitt) ---------------------------------------------------------------------- Message: 1 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Message-ID: > Content-Type: text/plain; charset="UTF-8" That would be the case when Active Directory is configured for authentication. In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: "Sobey, Richard A" > To: "'gpfsug-discuss at spectrumscale.org'" > Date: 08/26/2016 04:48 AM Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Sent by: gpfsug-discuss-bounces at spectrumscale.org Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. I?m trying to stop SMB on a node, but getting the following output: [root at cesnode ~]# mmces service stop smb smb: Request denied. Please stop NFS first [root at cesnode ~]# mmces service list Enabled services: SMB SMB is running As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. 
It?s happening on all the nodes in the cluster. SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. Richard_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ Message: 2 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] CES and mmuserauth command Message-ID: > Content-Type: text/plain; charset="ISO-2022-JP" The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Jan-Frode Myklebust > To: gpfsug main discussion list > Date: 08/26/2016 05:59 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: ------------------------------------------------------------------------------------------------------ ??user?name userName Specifies the user name to be used to perform operations against the authentication server. The specified user name must have sufficient permissions to read user and group attributes from the authentication server. ------------------------------------------------------------------------------------------------------- Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? -jf_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss End of gpfsug-discuss Digest, Vol 55, Issue 44 ********************************************** From r.sobey at imperial.ac.uk Fri Sep 2 16:23:28 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 2 Sep 2016 15:23:28 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? 
In-Reply-To: References: , Message-ID: A fair point, but since we're not running NFS, a failure of the only other service [SMB], whether it stops through user input or some other means, should cause the node to go unhealthy (in CTDB parlance) and trigger a failover. That would be my preference. Otoh if you were running NFS and SMB and one of those services crashed, do you still want a node in the cluster that could potentially respond and fails to do so? I guess it's a question for each organisation to answer themselves. -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Simon Thompson (Research Computing - IT Services) Sent: 02 September 2016 16:16 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Should it? If you were running nfs and smb, would you necessarily want to fail the ip over? Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sobey, Richard A [r.sobey at imperial.ac.uk] Sent: 02 September 2016 16:10 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? I've verified the upgrade has fixed this issue so thanks again. However I've noticed that stopping SMB doesn't trigger an IP address failover. I think it should. mmces node suspend (or rebooting, or mmces address move, or etc..) seems to trigger the failover. Richard From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Danny Alexander Calderon Rodriguez Sent: 27 August 2016 13:53 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Hi Richard This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly Just do this. edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py Change if authType == 'ad' and not nodeState.nfsStopped: to nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: You need to stop the gpfs service in each node where you apply the change " after change the lines please use tap key" Enviado desde mi iPhone El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) 2. Re: CES and mmuserauth command (Christof Schmitt) ---------------------------------------------------------------------- Message: 1 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Message-ID: > Content-Type: text/plain; charset="UTF-8" That would be the case when Active Directory is configured for authentication. 
In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: "Sobey, Richard A" > To: "'gpfsug-discuss at spectrumscale.org'" > Date: 08/26/2016 04:48 AM Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Sent by: gpfsug-discuss-bounces at spectrumscale.org Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. I?m trying to stop SMB on a node, but getting the following output: [root at cesnode ~]# mmces service stop smb smb: Request denied. Please stop NFS first [root at cesnode ~]# mmces service list Enabled services: SMB SMB is running As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. It?s happening on all the nodes in the cluster. SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. Richard_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ Message: 2 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] CES and mmuserauth command Message-ID: > Content-Type: text/plain; charset="ISO-2022-JP" The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Jan-Frode Myklebust > To: gpfsug main discussion list > Date: 08/26/2016 05:59 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: ------------------------------------------------------------------------------------------------------ ??user?name userName Specifies the user name to be used to perform operations against the authentication server. The specified user name must have sufficient permissions to read user and group attributes from the authentication server. 
------------------------------------------------------------------------------------------------------- Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? -jf_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss End of gpfsug-discuss Digest, Vol 55, Issue 44 ********************************************** _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From ulmer at ulmer.org Fri Sep 2 17:02:44 2016 From: ulmer at ulmer.org (Stephen Ulmer) Date: Fri, 2 Sep 2016 12:02:44 -0400 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: Message-ID: I think that stopping SMB is an explicitly different assertion than suspending the node, et cetera. When you ask the service to stop, it should stop -- not start a game of whack-a-mole. If you wanted to move the service there are other other ways. If it fails, clearly it the IP address should move. Liberty, -- Stephen > On Sep 2, 2016, at 11:23 AM, Sobey, Richard A wrote: > > A fair point, but since we're not running NFS, a failure of the only other service [SMB], whether it stops through user input or some other means, should cause the node to go unhealthy (in CTDB parlance) and trigger a failover. That would be my preference. > > Otoh if you were running NFS and SMB and one of those services crashed, do you still want a node in the cluster that could potentially respond and fails to do so? I guess it's a question for each organisation to answer themselves. > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Simon Thompson (Research Computing - IT Services) > Sent: 02 September 2016 16:16 > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > > > Should it? > > If you were running nfs and smb, would you necessarily want to fail the ip over? > > Simon > ________________________________________ > From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sobey, Richard A [r.sobey at imperial.ac.uk] > Sent: 02 September 2016 16:10 > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > > I've verified the upgrade has fixed this issue so thanks again. > > However I've noticed that stopping SMB doesn't trigger an IP address failover. I think it should. mmces node suspend (or rebooting, or mmces address move, or etc..) seems to trigger the failover. > > Richard > > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Danny Alexander Calderon Rodriguez > Sent: 27 August 2016 13:53 > To: gpfsug-discuss at spectrumscale.org > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > > Hi Richard > > This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly > > > Just do this. 
> > edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py > > > > Change > > if authType == 'ad' and not nodeState.nfsStopped: > > to > > > > nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) > if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: > > > You need to stop the gpfs service in each node where you apply the change > > > " after change the lines please use tap key" > > > > Enviado desde mi iPhone > > El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: > Send gpfsug-discuss mailing list submissions to > gpfsug-discuss at spectrumscale.org > > To subscribe or unsubscribe via the World Wide Web, visit > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > or, via email, send a message with subject or body 'help' to > gpfsug-discuss-request at spectrumscale.org > > You can reach the person managing the list at > gpfsug-discuss-owner at spectrumscale.org > > When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." > > > Today's Topics: > > 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) > 2. Re: CES and mmuserauth command (Christof Schmitt) > > > ---------------------------------------------------------------------- > > Message: 1 > Date: Fri, 26 Aug 2016 12:29:31 -0400 > From: "Christof Schmitt" > > To: gpfsug main discussion list > > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > Message-ID: > > > > Content-Type: text/plain; charset="UTF-8" > > That would be the case when Active Directory is configured for authentication. In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. > > Regards, > > Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ > christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) > > > > From: "Sobey, Richard A" > > To: "'gpfsug-discuss at spectrumscale.org'" > > > Date: 08/26/2016 04:48 AM > Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. > > I?m trying to stop SMB on a node, but getting the following output: > > [root at cesnode ~]# mmces service stop smb > smb: Request denied. Please stop NFS first > > [root at cesnode ~]# mmces service list > Enabled services: SMB > SMB is running > > As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. It?s happening on all the nodes in the cluster. > > SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. > > Richard_______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > ------------------------------ > > Message: 2 > Date: Fri, 26 Aug 2016 12:29:31 -0400 > From: "Christof Schmitt" > > To: gpfsug main discussion list > > Subject: Re: [gpfsug-discuss] CES and mmuserauth command > Message-ID: > > > > Content-Type: text/plain; charset="ISO-2022-JP" > > The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. > > The same applies to the information shown in "service list". 
There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and > LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. > > Regards, > > Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ > christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) > > > > From: Jan-Frode Myklebust > > To: gpfsug main discussion list > > Date: 08/26/2016 05:59 AM > Subject: Re: [gpfsug-discuss] CES and mmuserauth command > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: > > When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). > > > That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: > > ------------------------------------------------------------------------------------------------------ > ??user?name userName > Specifies the user name to be used to perform operations > against the authentication server. The specified user > name must have sufficient permissions to read user and > group attributes from the authentication server. > ------------------------------------------------------------------------------------------------------- > > Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? > > > > -jf_______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > > ------------------------------ > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > End of gpfsug-discuss Digest, Vol 55, Issue 44 > ********************************************** > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss From laurence at qsplace.co.uk Fri Sep 2 18:54:02 2016 From: laurence at qsplace.co.uk (Laurence Horrors-Barlow) Date: Fri, 2 Sep 2016 19:54:02 +0200 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: Message-ID: <721250E5-767B-4C44-A9E1-5DD255FD4F7D@qsplace.co.uk> I believe the services auto restart on a crash (or kill), a change I noticed between 4.1.1 and 4.2 hence no IP fail over. Suspending a node to force a fail over is possible the most sensible approach. -- Lauz Sent from my iPad > On 2 Sep 2016, at 18:02, Stephen Ulmer wrote: > > I think that stopping SMB is an explicitly different assertion than suspending the node, et cetera. 
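To make the suspend/move route concrete, a short sketch using the commands already mentioned in this thread. The node names and the address are placeholders, and the option spellings are recalled from the 4.2 mmces documentation rather than tested here:

# See which CES addresses each protocol node currently hosts
mmces address list

# Suspending a node fails its addresses over to the remaining CES nodes
mmces node suspend -N cesnode1
# ...do the maintenance, then bring it back...
mmces node resume -N cesnode1

# Alternatively, move a single address explicitly instead of suspending the whole node
mmces address move --ces-ip 10.0.0.42 --ces-node cesnode2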
When you ask the service to stop, it should stop -- not start a game of whack-a-mole. > > If you wanted to move the service there are other other ways. If it fails, clearly it the IP address should move. > > Liberty, > > -- > Stephen > > > >> On Sep 2, 2016, at 11:23 AM, Sobey, Richard A wrote: >> >> A fair point, but since we're not running NFS, a failure of the only other service [SMB], whether it stops through user input or some other means, should cause the node to go unhealthy (in CTDB parlance) and trigger a failover. That would be my preference. >> >> Otoh if you were running NFS and SMB and one of those services crashed, do you still want a node in the cluster that could potentially respond and fails to do so? I guess it's a question for each organisation to answer themselves. >> >> -----Original Message----- >> From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Simon Thompson (Research Computing - IT Services) >> Sent: 02 September 2016 16:16 >> To: gpfsug main discussion list >> Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? >> >> >> Should it? >> >> If you were running nfs and smb, would you necessarily want to fail the ip over? >> >> Simon >> ________________________________________ >> From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sobey, Richard A [r.sobey at imperial.ac.uk] >> Sent: 02 September 2016 16:10 >> To: gpfsug main discussion list >> Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? >> >> I've verified the upgrade has fixed this issue so thanks again. >> >> However I've noticed that stopping SMB doesn't trigger an IP address failover. I think it should. mmces node suspend (or rebooting, or mmces address move, or etc..) seems to trigger the failover. >> >> Richard >> >> From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Danny Alexander Calderon Rodriguez >> Sent: 27 August 2016 13:53 >> To: gpfsug-discuss at spectrumscale.org >> Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? >> >> Hi Richard >> >> This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly >> >> >> Just do this. >> >> edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py >> >> >> >> Change >> >> if authType == 'ad' and not nodeState.nfsStopped: >> >> to >> >> >> >> nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) >> if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: >> >> >> You need to stop the gpfs service in each node where you apply the change >> >> >> " after change the lines please use tap key" >> >> >> >> Enviado desde mi iPhone >> >> El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: >> Send gpfsug-discuss mailing list submissions to >> gpfsug-discuss at spectrumscale.org >> >> To subscribe or unsubscribe via the World Wide Web, visit >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> or, via email, send a message with subject or body 'help' to >> gpfsug-discuss-request at spectrumscale.org >> >> You can reach the person managing the list at >> gpfsug-discuss-owner at spectrumscale.org >> >> When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." >> >> >> Today's Topics: >> >> 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) >> 2. 
Re: CES and mmuserauth command (Christof Schmitt) >> >> >> ---------------------------------------------------------------------- >> >> Message: 1 >> Date: Fri, 26 Aug 2016 12:29:31 -0400 >> From: "Christof Schmitt" > >> To: gpfsug main discussion list > >> Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? >> Message-ID: >> > >> >> Content-Type: text/plain; charset="UTF-8" >> >> That would be the case when Active Directory is configured for authentication. In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. >> >> Regards, >> >> Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ >> christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) >> >> >> >> From: "Sobey, Richard A" > >> To: "'gpfsug-discuss at spectrumscale.org'" >> > >> Date: 08/26/2016 04:48 AM >> Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? >> Sent by: gpfsug-discuss-bounces at spectrumscale.org >> >> >> >> Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. >> >> I?m trying to stop SMB on a node, but getting the following output: >> >> [root at cesnode ~]# mmces service stop smb >> smb: Request denied. Please stop NFS first >> >> [root at cesnode ~]# mmces service list >> Enabled services: SMB >> SMB is running >> >> As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. It?s happening on all the nodes in the cluster. >> >> SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. >> >> Richard_______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> >> >> >> ------------------------------ >> >> Message: 2 >> Date: Fri, 26 Aug 2016 12:29:31 -0400 >> From: "Christof Schmitt" > >> To: gpfsug main discussion list > >> Subject: Re: [gpfsug-discuss] CES and mmuserauth command >> Message-ID: >> > >> >> Content-Type: text/plain; charset="ISO-2022-JP" >> >> The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. >> >> The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and >> LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. >> >> Regards, >> >> Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ >> christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) >> >> >> >> From: Jan-Frode Myklebust > >> To: gpfsug main discussion list > >> Date: 08/26/2016 05:59 AM >> Subject: Re: [gpfsug-discuss] CES and mmuserauth command >> Sent by: gpfsug-discuss-bounces at spectrumscale.org >> >> >> >> >> On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: >> >> When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. 
the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). >> >> >> That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: >> >> ------------------------------------------------------------------------------------------------------ >> ??user?name userName >> Specifies the user name to be used to perform operations >> against the authentication server. The specified user >> name must have sufficient permissions to read user and >> group attributes from the authentication server. >> ------------------------------------------------------------------------------------------------------- >> >> Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? >> >> >> >> -jf_______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> >> >> >> >> ------------------------------ >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> End of gpfsug-discuss Digest, Vol 55, Issue 44 >> ********************************************** >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > From christof.schmitt at us.ibm.com Fri Sep 2 19:20:45 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Fri, 2 Sep 2016 11:20:45 -0700 Subject: [gpfsug-discuss] CES and mmuserauth command In-Reply-To: References: Message-ID: After looking into this again, the source of confusion is probably from the fact that there are three different authentication schemes present here: When configuring a LDAP server for file or object authentication, then the specified server, user and password are used during normal operations for querying user data. The same applies for configuring object authentication with AD; AD is here treated as a LDAP server. Configuring AD for file authentication is different in that during the "mmuserauth service create", the machine account is created, and then that account is used to connect to a DC that is chosen from the DCs discovered through DNS and not necessarily the one used for the initial configuration. I submitted an internal request to explain this better in the mmuserauth manpage. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Christof Schmitt/Tucson/IBM at IBMUS To: gpfsug main discussion list Date: 08/26/2016 09:30 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. 
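For completeness, the stored value Jan-Frode is asking about can be inspected with the command he quotes; the fields shown differ between AD and LDAP setups:

# Shows the configured authentication, including the USER_NAME field discussed here.
# For LDAP this is the bind user used at run time; for AD it is only the account
# that was used when the machine account was created, as Christof explains.
mmuserauth service list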
I will try to get some clarification added for the AD case. The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Jan-Frode Myklebust To: gpfsug main discussion list Date: 08/26/2016 05:59 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: ------------------------------------------------------------------------------------------------------ ??user?name userName Specifies the user name to be used to perform operations against the authentication server. The specified user name must have sufficient permissions to read user and group attributes from the authentication server. ------------------------------------------------------------------------------------------------------- Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? -jf_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From r.sobey at imperial.ac.uk Fri Sep 2 22:02:03 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 2 Sep 2016 21:02:03 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: , Message-ID: That makes more sense putting it that way. Cheers Richard Get Outlook for Android On Fri, Sep 2, 2016 at 5:04 PM +0100, "Stephen Ulmer" > wrote: I think that stopping SMB is an explicitly different assertion than suspending the node, et cetera. When you ask the service to stop, it should stop -- not start a game of whack-a-mole. If you wanted to move the service there are other other ways. If it fails, clearly it the IP address should move. Liberty, -- Stephen > On Sep 2, 2016, at 11:23 AM, Sobey, Richard A wrote: > > A fair point, but since we're not running NFS, a failure of the only other service [SMB], whether it stops through user input or some other means, should cause the node to go unhealthy (in CTDB parlance) and trigger a failover. That would be my preference. > > Otoh if you were running NFS and SMB and one of those services crashed, do you still want a node in the cluster that could potentially respond and fails to do so? 
I guess it's a question for each organisation to answer themselves. > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Simon Thompson (Research Computing - IT Services) > Sent: 02 September 2016 16:16 > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > > > Should it? > > If you were running nfs and smb, would you necessarily want to fail the ip over? > > Simon > ________________________________________ > From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sobey, Richard A [r.sobey at imperial.ac.uk] > Sent: 02 September 2016 16:10 > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > > I've verified the upgrade has fixed this issue so thanks again. > > However I've noticed that stopping SMB doesn't trigger an IP address failover. I think it should. mmces node suspend (or rebooting, or mmces address move, or etc..) seems to trigger the failover. > > Richard > > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Danny Alexander Calderon Rodriguez > Sent: 27 August 2016 13:53 > To: gpfsug-discuss at spectrumscale.org > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > > Hi Richard > > This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly > > > Just do this. > > edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py > > > > Change > > if authType == 'ad' and not nodeState.nfsStopped: > > to > > > > nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) > if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: > > > You need to stop the gpfs service in each node where you apply the change > > > " after change the lines please use tap key" > > > > Enviado desde mi iPhone > > El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: > Send gpfsug-discuss mailing list submissions to > gpfsug-discuss at spectrumscale.org > > To subscribe or unsubscribe via the World Wide Web, visit > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > or, via email, send a message with subject or body 'help' to > gpfsug-discuss-request at spectrumscale.org > > You can reach the person managing the list at > gpfsug-discuss-owner at spectrumscale.org > > When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." > > > Today's Topics: > > 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) > 2. Re: CES and mmuserauth command (Christof Schmitt) > > > ---------------------------------------------------------------------- > > Message: 1 > Date: Fri, 26 Aug 2016 12:29:31 -0400 > From: "Christof Schmitt" > > To: gpfsug main discussion list > > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > Message-ID: > > > > Content-Type: text/plain; charset="UTF-8" > > That would be the case when Active Directory is configured for authentication. In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. 
> > Regards, > > Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ > christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) > > > > From: "Sobey, Richard A" > > To: "'gpfsug-discuss at spectrumscale.org'" > > > Date: 08/26/2016 04:48 AM > Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. > > I?m trying to stop SMB on a node, but getting the following output: > > [root at cesnode ~]# mmces service stop smb > smb: Request denied. Please stop NFS first > > [root at cesnode ~]# mmces service list > Enabled services: SMB > SMB is running > > As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. It?s happening on all the nodes in the cluster. > > SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. > > Richard_______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > ------------------------------ > > Message: 2 > Date: Fri, 26 Aug 2016 12:29:31 -0400 > From: "Christof Schmitt" > > To: gpfsug main discussion list > > Subject: Re: [gpfsug-discuss] CES and mmuserauth command > Message-ID: > > > > Content-Type: text/plain; charset="ISO-2022-JP" > > The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. > > The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and > LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. > > Regards, > > Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ > christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) > > > > From: Jan-Frode Myklebust > > To: gpfsug main discussion list > > Date: 08/26/2016 05:59 AM > Subject: Re: [gpfsug-discuss] CES and mmuserauth command > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: > > When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). > > > That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: > > ------------------------------------------------------------------------------------------------------ > ??user?name userName > Specifies the user name to be used to perform operations > against the authentication server. The specified user > name must have sufficient permissions to read user and > group attributes from the authentication server. 
> ------------------------------------------------------------------------------------------------------- > > Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? > > > > -jf_______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > > ------------------------------ > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > End of gpfsug-discuss Digest, Vol 55, Issue 44 > ********************************************** > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From bauer at cesnet.cz Mon Sep 5 14:30:54 2016 From: bauer at cesnet.cz (Miroslav Bauer) Date: Mon, 5 Sep 2016 15:30:54 +0200 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state Message-ID: <9bfb2882-10de-5f2c-98c1-35ac2ac958a2@cesnet.cz> Hello, is there any way to recall a migrated file back to a regular state (other than renaming a file)? I would like to free some space on an external pool (TSM), that is being used by migrated files. And it would be desirable to prevent repeated backups of an already backed-up data (due to changed ctime/inode). I guess that you can acheive only premigrated state with dsmrecall tool (two copies of file data - one on GPFS pool and one on external pool). Maybe deleting 'dmapi.IBMPMig' xattr will do the trick but I don't think it's safe, nor clean :). Thank you in advance, -- Miroslav Bauer -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 3716 bytes Desc: S/MIME Cryptographic Signature URL: From janfrode at tanso.net Mon Sep 5 14:51:44 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Mon, 05 Sep 2016 13:51:44 +0000 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: <9bfb2882-10de-5f2c-98c1-35ac2ac958a2@cesnet.cz> References: <9bfb2882-10de-5f2c-98c1-35ac2ac958a2@cesnet.cz> Message-ID: I believe what you're looking for is dsmrecall -RESident. Plus reconcile on tsm-server to free up the space. Ref: http://www.ibm.com/support/knowledgecenter/SSSR2R_7.1.2/com.ibm.itsm.hsmul.doc/r_cmd_dsmrecall.html -jf man. 5. sep. 2016 kl. 15.30 skrev Miroslav Bauer : > Hello, > > is there any way to recall a migrated file back to a regular state > (other than renaming a file)? I would like to free some space > on an external pool (TSM), that is being used by migrated files. > And it would be desirable to prevent repeated backups of an > already backed-up data (due to changed ctime/inode). > > I guess that you can acheive only premigrated state with dsmrecall tool > (two copies of file data - one on GPFS pool and one on external pool). > Maybe deleting 'dmapi.IBMPMig' xattr will do the trick but I don't think > it's safe, nor clean :). 
> > Thank you in advance, > > -- > Miroslav Bauer > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From bauer at cesnet.cz Mon Sep 5 15:13:42 2016 From: bauer at cesnet.cz (Miroslav Bauer) Date: Mon, 5 Sep 2016 16:13:42 +0200 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: References: <9bfb2882-10de-5f2c-98c1-35ac2ac958a2@cesnet.cz> Message-ID: <55108548-0515-49c8-0e76-fca9b247d337@cesnet.cz> That's right, I must have totally overlooked that! Many thanks! :) -- Miroslav Bauer On 09/05/2016 03:51 PM, Jan-Frode Myklebust wrote: > I believe what you're looking for is dsmrecall -RESident. Plus > reconcile on tsm-server to free up the space. > > Ref: > > http://www.ibm.com/support/knowledgecenter/SSSR2R_7.1.2/com.ibm.itsm.hsmul.doc/r_cmd_dsmrecall.html > > > -jf > man. 5. sep. 2016 kl. 15.30 skrev Miroslav Bauer >: > > Hello, > > is there any way to recall a migrated file back to a regular state > (other than renaming a file)? I would like to free some space > on an external pool (TSM), that is being used by migrated files. > And it would be desirable to prevent repeated backups of an > already backed-up data (due to changed ctime/inode). > > I guess that you can acheive only premigrated state with dsmrecall > tool > (two copies of file data - one on GPFS pool and one on external pool). > Maybe deleting 'dmapi.IBMPMig' xattr will do the trick but I don't > think > it's safe, nor clean :). > > Thank you in advance, > > -- > Miroslav Bauer > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 3716 bytes Desc: S/MIME Cryptographic Signature URL: From mark.birmingham at stfc.ac.uk Mon Sep 5 15:27:29 2016 From: mark.birmingham at stfc.ac.uk (mark.birmingham at stfc.ac.uk) Date: Mon, 5 Sep 2016 14:27:29 +0000 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: <55108548-0515-49c8-0e76-fca9b247d337@cesnet.cz> References: <9bfb2882-10de-5f2c-98c1-35ac2ac958a2@cesnet.cz> <55108548-0515-49c8-0e76-fca9b247d337@cesnet.cz> Message-ID: <47B8D67E32CC2D44A587CD18636BECC82BB3A610@exchmbx01> Yes, that's fine. Just submit the request through SBS. Mark Mark Birmingham Development Team Leader High Performance Systems Group STFC Daresbury Laboratory Phone: +44 (0)1925 603381 Email: mark.birmingham at stfc.ac.uk From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Miroslav Bauer Sent: 05 September 2016 15:14 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state That's right, I must have totally overlooked that! Many thanks! :) -- Miroslav Bauer On 09/05/2016 03:51 PM, Jan-Frode Myklebust wrote: I believe what you're looking for is dsmrecall -RESident. Plus reconcile on tsm-server to free up the space. 
Ref: http://www.ibm.com/support/knowledgecenter/SSSR2R_7.1.2/com.ibm.itsm.hsmul.doc/r_cmd_dsmrecall.html -jf man. 5. sep. 2016 kl. 15.30 skrev Miroslav Bauer >: Hello, is there any way to recall a migrated file back to a regular state (other than renaming a file)? I would like to free some space on an external pool (TSM), that is being used by migrated files. And it would be desirable to prevent repeated backups of an already backed-up data (due to changed ctime/inode). I guess that you can acheive only premigrated state with dsmrecall tool (two copies of file data - one on GPFS pool and one on external pool). Maybe deleting 'dmapi.IBMPMig' xattr will do the trick but I don't think it's safe, nor clean :). Thank you in advance, -- Miroslav Bauer _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From mark.birmingham at stfc.ac.uk Mon Sep 5 15:30:53 2016 From: mark.birmingham at stfc.ac.uk (mark.birmingham at stfc.ac.uk) Date: Mon, 5 Sep 2016 14:30:53 +0000 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: <47B8D67E32CC2D44A587CD18636BECC82BB3A610@exchmbx01> References: <9bfb2882-10de-5f2c-98c1-35ac2ac958a2@cesnet.cz> <55108548-0515-49c8-0e76-fca9b247d337@cesnet.cz> <47B8D67E32CC2D44A587CD18636BECC82BB3A610@exchmbx01> Message-ID: <47B8D67E32CC2D44A587CD18636BECC82BB3A62A@exchmbx01> Sorry All! Noob error - replied to the wrong email!!! Mark Mark Birmingham Development Team Leader High Performance Systems Group STFC Daresbury Laboratory Phone: +44 (0)1925 603381 Email: mark.birmingham at stfc.ac.uk From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of mark.birmingham at stfc.ac.uk Sent: 05 September 2016 15:27 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state Yes, that's fine. Just submit the request through SBS. Mark Mark Birmingham Development Team Leader High Performance Systems Group STFC Daresbury Laboratory Phone: +44 (0)1925 603381 Email: mark.birmingham at stfc.ac.uk From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Miroslav Bauer Sent: 05 September 2016 15:14 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state That's right, I must have totally overlooked that! Many thanks! :) -- Miroslav Bauer On 09/05/2016 03:51 PM, Jan-Frode Myklebust wrote: I believe what you're looking for is dsmrecall -RESident. Plus reconcile on tsm-server to free up the space. Ref: http://www.ibm.com/support/knowledgecenter/SSSR2R_7.1.2/com.ibm.itsm.hsmul.doc/r_cmd_dsmrecall.html -jf man. 5. sep. 2016 kl. 15.30 skrev Miroslav Bauer >: Hello, is there any way to recall a migrated file back to a regular state (other than renaming a file)? I would like to free some space on an external pool (TSM), that is being used by migrated files. And it would be desirable to prevent repeated backups of an already backed-up data (due to changed ctime/inode). I guess that you can acheive only premigrated state with dsmrecall tool (two copies of file data - one on GPFS pool and one on external pool). 
Maybe deleting 'dmapi.IBMPMig' xattr will do the trick but I don't think it's safe, nor clean :). Thank you in advance, -- Miroslav Bauer _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From dominic.mueller at de.ibm.com Tue Sep 6 13:04:36 2016 From: dominic.mueller at de.ibm.com (Dominic Mueller-Wicke01) Date: Tue, 6 Sep 2016 14:04:36 +0200 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: References: Message-ID: Hi Miroslav, please use the command: > dsmrecall -resident -detail or use it with file lists Greetings, Dominic. From: gpfsug-discuss-request at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Date: 06.09.2016 13:00 Subject: gpfsug-discuss Digest, Vol 56, Issue 10 Sent by: gpfsug-discuss-bounces at spectrumscale.org Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: DMAPI - Unmigrate file to Regular state (mark.birmingham at stfc.ac.uk) ----- Message from on Mon, 5 Sep 2016 14:30:53 +0000 ----- To: Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state Sorry All! Noob error ? replied to the wrong email!!! Mark Mark Birmingham Development Team Leader High Performance Systems Group STFC Daresbury Laboratory Phone: +44 (0)1925 603381 Email: mark.birmingham at stfc.ac.uk From: gpfsug-discuss-bounces at spectrumscale.org [ mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of mark.birmingham at stfc.ac.uk Sent: 05 September 2016 15:27 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state Yes, that?s fine. Just submit the request through SBS. Mark Mark Birmingham Development Team Leader High Performance Systems Group STFC Daresbury Laboratory Phone: +44 (0)1925 603381 Email: mark.birmingham at stfc.ac.uk From: gpfsug-discuss-bounces at spectrumscale.org [ mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Miroslav Bauer Sent: 05 September 2016 15:14 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state That's right, I must have totally overlooked that! Many thanks! :) -- Miroslav Bauer On 09/05/2016 03:51 PM, Jan-Frode Myklebust wrote: I believe what you're looking for is dsmrecall -RESident. Plus reconcile on tsm-server to free up the space. Ref: http://www.ibm.com/support/knowledgecenter/SSSR2R_7.1.2/com.ibm.itsm.hsmul.doc/r_cmd_dsmrecall.html -jf man. 5. sep. 2016 kl. 15.30 skrev Miroslav Bauer : Hello, is there any way to recall a migrated file back to a regular state (other than renaming a file)? I would like to free some space on an external pool (TSM), that is being used by migrated files. 
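To make Dominic's and Jan-Frode's answers concrete, a small sketch follows. The file paths are placeholders, and the dsmls check, the -filelist form and the dsmreconcile step are recalled from the HSM client documentation rather than taken from this thread:

# Check the current HSM state of a file (resident / premigrated / migrated)
dsmls /gpfs/fs1/archive/bigfile.dat

# Recall it all the way back to resident, dropping the migrated/premigrated state
dsmrecall -resident -detail /gpfs/fs1/archive/bigfile.dat

# Or drive the recall from a file list
dsmrecall -resident -filelist=/tmp/recall.list

# Reconcile afterwards so the space on the TSM side is actually released
dsmreconcile /gpfs/fs1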
And it would be desirable to prevent repeated backups of an already backed-up data (due to changed ctime/inode). I guess that you can acheive only premigrated state with dsmrecall tool (two copies of file data - one on GPFS pool and one on external pool). Maybe deleting 'dmapi.IBMPMig' xattr will do the trick but I don't think it's safe, nor clean :). Thank you in advance, -- Miroslav Bauer _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From volobuev at us.ibm.com Tue Sep 6 20:06:32 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Tue, 6 Sep 2016 12:06:32 -0700 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: <505ff859-d49a-04cc-bd9d-50f7b2a8df0b@cesnet.cz> References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz><2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> <505ff859-d49a-04cc-bd9d-50f7b2a8df0b@cesnet.cz> Message-ID: The correct way to accomplish what you're looking for (in particular, changing the fs-wide level of replication) is mmrestripefs -R. This command also takes care of moving data off disks now marked metadataOnly. The restripe job hits an error trying to move blocks of the inode file, i.e. before it gets to actual user data blocks. Note that at this point the metadata replication factor is still 2. This suggests one of two possibilities: (1) there isn't enough actual free space on the remaining metadataOnly disks, (2) there isn't enough space in some failure groups to allocate two replicas. All of this assumes you're operating within a single storage pool. If multiple storage pools are in play, there are other possibilities. 'mmdf' output would be helpful in providing more helpful advice. With the information at hand, I can only suggest trying to accomplish the task in two phases: (a) deallocated extra metadata replicas, by doing mmchfs -m 1 + mmrestripefs -R (b) move metadata off SATA disks. I do want to point out that metadata replication is a highly recommended insurance policy to have for your file system. As with other kinds of insurance, you may or may not need it, but if you do end up needing it, you'll be very glad you have it. The costs, in terms of extra metadata space and performance overhead, are very reasonable. yuri From: Miroslav Bauer To: gpfsug-discuss at spectrumscale.org, Date: 09/01/2016 07:29 AM Subject: Re: [gpfsug-discuss] Migration to separate metadata and data disks Sent by: gpfsug-discuss-bounces at spectrumscale.org Yes, failure group id is exactly what I meant :). Unfortunately, mmrestripefs with -R behaves the same as with -r. I also believed that mmrestripefs -R is the correct tool for fixing the replication settings on inodes (according to manpages), but I will try possible solutions you and Marc suggested and let you know how it went. Thank you, -- Miroslav Bauer On 09/01/2016 04:02 PM, Aaron Knister wrote: > Oh! 
I think you've already provided the info I was looking for :) I > thought that failGroup=3 meant there were 3 failure groups within the > SSDs. I suspect that's not at all what you meant and that actually is > the failure group of all of those disks. That I think explains what's > going on-- there's only one failure group's worth of metadata-capable > disks available and as such GPFS can't place the 2nd replica for > existing files. > > Here's what I would suggest: > > - Create at least 2 failure groups within the SSDs > - Put the default metadata replication factor back to 2 > - Run a restripefs -R to shuffle files around and restore the metadata > replication factor of 2 to any files created while it was set to 1 > > If you're not interested in replication for metadata then perhaps all > you need to do is the mmrestripefs -R. I think that should > un-replicate the file from the SATA disks leaving the copy on the SSDs. > > Hope that helps. > > -Aaron > > On 9/1/16 9:39 AM, Aaron Knister wrote: >> By the way, I suspect the no space on device errors are because GPFS >> believes for some reason that it is unable to maintain the metadata >> replication factor of 2 that's likely set on all previously created >> inodes. >> >> On 9/1/16 9:36 AM, Aaron Knister wrote: >>> I must admit, I'm curious as to the reason you're dropping the >>> replication factor from 2 down to 1. There are some serious advantages >>> we've seen to having multiple metadata replicas, as far as error >>> recovery is concerned. >>> >>> Could you paste an output of mmlsdisk for the filesystem? >>> >>> -Aaron >>> >>> On 9/1/16 9:30 AM, Miroslav Bauer wrote: >>>> Hello, >>>> >>>> I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the >>>> filesystem metadata from state: >>>> -m = 2 (default metadata replicas) >>>> - SATA disks (dataAndMetadata, failGroup=1) >>>> - SSDs (metadataOnly, failGroup=3) >>>> to the desired state: >>>> -m = 1 >>>> - SATA disks (dataOnly, failGroup=1) >>>> - SSDs (metadataOnly, failGroup=3) >>>> >>>> I have done the following steps in the following order: >>>> 1) change SATA disks to dataOnly (stanza file modifies the 'usage' >>>> attribute only): >>>> # mmchdisk fs1 change -F dataOnly_disks.stanza >>>> Attention: Disk parameters were changed. >>>> Use the mmrestripefs command with the -r option to relocate data and >>>> metadata. >>>> Verifying file system configuration information ... >>>> mmchdisk: Propagating the cluster configuration data to all >>>> affected nodes. This is an asynchronous process. >>>> >>>> 2) change default metadata replicas number 2->1 >>>> # mmchfs fs1 -m 1 >>>> >>>> 3) run mmrestripefs as suggested by output of 1) >>>> # mmrestripefs fs1 -r >>>> Scanning file system metadata, phase 1 ... >>>> Error processing inodes. >>>> No space left on device >>>> mmrestripefs: Command failed. Examine previous error messages to >>>> determine cause. >>>> >>>> It is, however, still possible to create new files on the filesystem. >>>> When I return one of the SATA disks as a dataAndMetadata disk, the >>>> mmrestripefs >>>> command stops complaining about No space left on device. Both df and >>>> mmdf >>>> say that there is enough space both for data (SATA) and metadata >>>> (SSDs). >>>> Does anyone have an idea why is it complaining? 
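Pulling Yuri's two-phase suggestion together as commands, using the filesystem and stanza names from earlier in this thread; the mmlsattr spot-check at the end is an assumption:

# Phase (a): drop the default metadata replication and rewrite existing files/inodes
mmchfs fs1 -m 1
mmrestripefs fs1 -R

# Phase (b): mark the SATA NSDs dataOnly, then relocate data and metadata as
# the mmchdisk message suggests
mmchdisk fs1 change -F dataOnly_disks.stanza
mmrestripefs fs1 -r

# Verify placement and free space afterwards
mmlsdisk fs1
mmdf fs1

# Spot-check the replication settings of an individual file (placeholder path)
mmlsattr -L /gpfs/fs1/some/file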
>>>> >>>> Thanks, >>>> >>>> -- >>>> Miroslav Bauer >>>> >>>> >>>> >>>> >>>> _______________________________________________ >>>> gpfsug-discuss mailing list >>>> gpfsug-discuss at spectrumscale.org >>>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>>> >>> >> > [attachment "smime.p7s" deleted by Yuri L Volobuev/Austin/IBM] _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From bauer at cesnet.cz Wed Sep 7 10:40:19 2016 From: bauer at cesnet.cz (Miroslav Bauer) Date: Wed, 7 Sep 2016 11:40:19 +0200 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> <2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> <505ff859-d49a-04cc-bd9d-50f7b2a8df0b@cesnet.cz> Message-ID: Hello Yuri, here goes the actual mmdf output of filesystem in question: disk disk size failure holds holds free free name group metadata data in full blocks in fragments --------------- ------------- -------- -------- ----- -------------------- ------------------- Disks in storage pool: system (Maximum disk size allowed is 40 TB) dcsh_10C 5T 1 Yes Yes 1.661T ( 33%) 68.48G ( 1%) dcsh_10D 6.828T 1 Yes Yes 2.809T ( 41%) 83.82G ( 1%) dcsh_11C 5T 1 Yes Yes 1.659T ( 33%) 69.01G ( 1%) dcsh_11D 6.828T 1 Yes Yes 2.81T ( 41%) 83.33G ( 1%) dcsh_12C 5T 1 Yes Yes 1.659T ( 33%) 69.48G ( 1%) dcsh_12D 6.828T 1 Yes Yes 2.807T ( 41%) 83.14G ( 1%) dcsh_13C 5T 1 Yes Yes 1.659T ( 33%) 69.35G ( 1%) dcsh_13D 6.828T 1 Yes Yes 2.81T ( 41%) 82.97G ( 1%) dcsh_14C 5T 1 Yes Yes 1.66T ( 33%) 69.06G ( 1%) dcsh_14D 6.828T 1 Yes Yes 2.811T ( 41%) 83.61G ( 1%) dcsh_15C 5T 1 Yes Yes 1.658T ( 33%) 69.38G ( 1%) dcsh_15D 6.828T 1 Yes Yes 2.814T ( 41%) 83.69G ( 1%) dcsd_15D 6.828T 1 Yes Yes 2.811T ( 41%) 83.98G ( 1%) dcsd_15C 5T 1 Yes Yes 1.66T ( 33%) 68.66G ( 1%) dcsd_14D 6.828T 1 Yes Yes 2.81T ( 41%) 84.18G ( 1%) dcsd_14C 5T 1 Yes Yes 1.659T ( 33%) 69.43G ( 1%) dcsd_13D 6.828T 1 Yes Yes 2.81T ( 41%) 83.27G ( 1%) dcsd_13C 5T 1 Yes Yes 1.66T ( 33%) 69.1G ( 1%) dcsd_12D 6.828T 1 Yes Yes 2.81T ( 41%) 83.61G ( 1%) dcsd_12C 5T 1 Yes Yes 1.66T ( 33%) 69.42G ( 1%) dcsd_11D 6.828T 1 Yes Yes 2.811T ( 41%) 83.59G ( 1%) dcsh_10B 5T 1 Yes Yes 1.633T ( 33%) 76.97G ( 2%) dcsh_11A 5T 1 Yes Yes 1.632T ( 33%) 77.29G ( 2%) dcsh_11B 5T 1 Yes Yes 1.633T ( 33%) 76.73G ( 1%) dcsh_12A 5T 1 Yes Yes 1.634T ( 33%) 76.49G ( 1%) dcsd_11C 5T 1 Yes Yes 1.66T ( 33%) 69.25G ( 1%) dcsd_10D 6.828T 1 Yes Yes 2.811T ( 41%) 83.39G ( 1%) dcsh_10A 5T 1 Yes Yes 1.633T ( 33%) 77.06G ( 2%) dcsd_10C 5T 1 Yes Yes 1.66T ( 33%) 69.83G ( 1%) dcsd_15B 5T 1 Yes Yes 1.635T ( 33%) 76.52G ( 1%) dcsd_15A 5T 1 Yes Yes 1.634T ( 33%) 76.24G ( 1%) dcsd_14B 5T 1 Yes Yes 1.634T ( 33%) 76.31G ( 1%) dcsd_14A 5T 1 Yes Yes 1.634T ( 33%) 76.23G ( 1%) dcsd_13B 5T 1 Yes Yes 1.634T ( 33%) 76.13G ( 1%) dcsd_13A 5T 1 Yes Yes 1.634T ( 33%) 76.22G ( 1%) dcsd_12B 5T 1 Yes Yes 1.635T ( 33%) 77.49G ( 2%) dcsd_12A 5T 1 Yes Yes 1.633T ( 33%) 77.13G ( 2%) dcsd_11B 5T 1 Yes Yes 1.633T ( 33%) 76.86G ( 2%) dcsd_11A 5T 1 Yes Yes 1.632T ( 33%) 76.22G ( 1%) dcsd_10B 5T 1 Yes Yes 1.633T ( 33%) 76.79G ( 1%) dcsd_10A 5T 1 Yes Yes 1.633T ( 33%) 77.21G ( 2%) dcsh_15B 5T 1 Yes Yes 1.635T ( 33%) 76.04G ( 1%) dcsh_15A 5T 1 Yes Yes 
1.634T ( 33%) 76.84G ( 2%) dcsh_14B 5T 1 Yes Yes 1.635T ( 33%) 76.75G ( 1%) dcsh_14A 5T 1 Yes Yes 1.633T ( 33%) 76.05G ( 1%) dcsh_13B 5T 1 Yes Yes 1.634T ( 33%) 76.35G ( 1%) dcsh_13A 5T 1 Yes Yes 1.634T ( 33%) 76.68G ( 1%) dcsh_12B 5T 1 Yes Yes 1.635T ( 33%) 76.74G ( 1%) ssd_5_5 80G 3 Yes No 22.31G ( 28%) 7.155G ( 9%) ssd_4_4 80G 3 Yes No 22.21G ( 28%) 7.196G ( 9%) ssd_3_3 80G 3 Yes No 22.2G ( 28%) 7.239G ( 9%) ssd_2_2 80G 3 Yes No 22.24G ( 28%) 7.146G ( 9%) ssd_1_1 80G 3 Yes No 22.29G ( 28%) 7.134G ( 9%) ------------- -------------------- ------------------- (pool total) 262.3T 92.96T ( 35%) 3.621T ( 1%) Disks in storage pool: maid4 (Maximum disk size allowed is 466 TB) ...... ------------- -------------------- ------------------- (pool total) 291T 126.5T ( 43%) 562.6G ( 0%) Disks in storage pool: maid5 (Maximum disk size allowed is 466 TB) ...... ------------- -------------------- ------------------- (pool total) 436.6T 120.8T ( 28%) 25.23G ( 0%) Disks in storage pool: maid6 (Maximum disk size allowed is 466 TB) ....... ------------- -------------------- ------------------- (pool total) 582.1T 358.7T ( 62%) 9.458G ( 0%) ============= ==================== =================== (data) 1.535P 698.9T ( 44%) 4.17T ( 0%) (metadata) 262.3T 92.96T ( 35%) 3.621T ( 1%) ============= ==================== =================== (total) 1.535P 699T ( 44%) 4.205T ( 0%) Inode Information ----------------- Number of used inodes: 79607225 Number of free inodes: 82340423 Number of allocated inodes: 161947648 Maximum number of inodes: 1342177280 I have a smaller testing FS with the same setup (with plenty of free space), and the actual sequence of commands that worked for me was: mmchfs fs1 -m1 mmrestripefs fs1 -R mmrestripefs fs1 -b mmchdisk fs1 change -F ~/nsd_metadata_test (dataAndMetadata -> dataOnly) mmrestripefs fs1 -r Could you please evaluate more on the performance overhead with having metadata on SSD+SATA? Are the read operations automatically directed to faster disks by GPFS? Is each write operation waiting for write to be finished by SATA disks? Thank you, -- Miroslav Bauer On 09/06/2016 09:06 PM, Yuri L Volobuev wrote: > > The correct way to accomplish what you're looking for (in particular, > changing the fs-wide level of replication) is mmrestripefs -R. This > command also takes care of moving data off disks now marked metadataOnly. > > The restripe job hits an error trying to move blocks of the inode > file, i.e. before it gets to actual user data blocks. Note that at > this point the metadata replication factor is still 2. This suggests > one of two possibilities: (1) there isn't enough actual free space on > the remaining metadataOnly disks, (2) there isn't enough space in some > failure groups to allocate two replicas. > > All of this assumes you're operating within a single storage pool. If > multiple storage pools are in play, there are other possibilities. > > 'mmdf' output would be helpful in providing more helpful advice. With > the information at hand, I can only suggest trying to accomplish the > task in two phases: (a) deallocated extra metadata replicas, by doing > mmchfs -m 1 + mmrestripefs -R (b) move metadata off SATA disks. I do > want to point out that metadata replication is a highly recommended > insurance policy to have for your file system. As with other kinds of > insurance, you may or may not need it, but if you do end up needing > it, you'll be very glad you have it. The costs, in terms of extra > metadata space and performance overhead, are very reasonable. 
> > yuri > > > Miroslav Bauer ---09/01/2016 07:29:06 AM---Yes, failure group id is > exactly what I meant :). Unfortunately, mmrestripefs with -R > > From: Miroslav Bauer > To: gpfsug-discuss at spectrumscale.org, > Date: 09/01/2016 07:29 AM > Subject: Re: [gpfsug-discuss] Migration to separate metadata and data > disks > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------------------------------------------------ > > > > Yes, failure group id is exactly what I meant :). Unfortunately, > mmrestripefs with -R > behaves the same as with -r. I also believed that mmrestripefs -R is the > correct tool for > fixing the replication settings on inodes (according to manpages), but I > will try possible > solutions you and Marc suggested and let you know how it went. > > Thank you, > -- > Miroslav Bauer > > On 09/01/2016 04:02 PM, Aaron Knister wrote: > > Oh! I think you've already provided the info I was looking for :) I > > thought that failGroup=3 meant there were 3 failure groups within the > > SSDs. I suspect that's not at all what you meant and that actually is > > the failure group of all of those disks. That I think explains what's > > going on-- there's only one failure group's worth of metadata-capable > > disks available and as such GPFS can't place the 2nd replica for > > existing files. > > > > Here's what I would suggest: > > > > - Create at least 2 failure groups within the SSDs > > - Put the default metadata replication factor back to 2 > > - Run a restripefs -R to shuffle files around and restore the metadata > > replication factor of 2 to any files created while it was set to 1 > > > > If you're not interested in replication for metadata then perhaps all > > you need to do is the mmrestripefs -R. I think that should > > un-replicate the file from the SATA disks leaving the copy on the SSDs. > > > > Hope that helps. > > > > -Aaron > > > > On 9/1/16 9:39 AM, Aaron Knister wrote: > >> By the way, I suspect the no space on device errors are because GPFS > >> believes for some reason that it is unable to maintain the metadata > >> replication factor of 2 that's likely set on all previously created > >> inodes. > >> > >> On 9/1/16 9:36 AM, Aaron Knister wrote: > >>> I must admit, I'm curious as to the reason you're dropping the > >>> replication factor from 2 down to 1. There are some serious advantages > >>> we've seen to having multiple metadata replicas, as far as error > >>> recovery is concerned. > >>> > >>> Could you paste an output of mmlsdisk for the filesystem? > >>> > >>> -Aaron > >>> > >>> On 9/1/16 9:30 AM, Miroslav Bauer wrote: > >>>> Hello, > >>>> > >>>> I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the > >>>> filesystem metadata from state: > >>>> -m = 2 (default metadata replicas) > >>>> - SATA disks (dataAndMetadata, failGroup=1) > >>>> - SSDs (metadataOnly, failGroup=3) > >>>> to the desired state: > >>>> -m = 1 > >>>> - SATA disks (dataOnly, failGroup=1) > >>>> - SSDs (metadataOnly, failGroup=3) > >>>> > >>>> I have done the following steps in the following order: > >>>> 1) change SATA disks to dataOnly (stanza file modifies the 'usage' > >>>> attribute only): > >>>> # mmchdisk fs1 change -F dataOnly_disks.stanza > >>>> Attention: Disk parameters were changed. > >>>> Use the mmrestripefs command with the -r option to relocate > data and > >>>> metadata. > >>>> Verifying file system configuration information ... > >>>> mmchdisk: Propagating the cluster configuration data to all > >>>> affected nodes. 
This is an asynchronous process. > >>>> > >>>> 2) change default metadata replicas number 2->1 > >>>> # mmchfs fs1 -m 1 > >>>> > >>>> 3) run mmrestripefs as suggested by output of 1) > >>>> # mmrestripefs fs1 -r > >>>> Scanning file system metadata, phase 1 ... > >>>> Error processing inodes. > >>>> No space left on device > >>>> mmrestripefs: Command failed. Examine previous error messages to > >>>> determine cause. > >>>> > >>>> It is, however, still possible to create new files on the filesystem. > >>>> When I return one of the SATA disks as a dataAndMetadata disk, the > >>>> mmrestripefs > >>>> command stops complaining about No space left on device. Both df and > >>>> mmdf > >>>> say that there is enough space both for data (SATA) and metadata > >>>> (SSDs). > >>>> Does anyone have an idea why is it complaining? > >>>> > >>>> Thanks, > >>>> > >>>> -- > >>>> Miroslav Bauer > >>>> > >>>> > >>>> > >>>> > >>>> _______________________________________________ > >>>> gpfsug-discuss mailing list > >>>> gpfsug-discuss at spectrumscale.org > >>>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > >>>> > >>> > >> > > > > > [attachment "smime.p7s" deleted by Yuri L Volobuev/Austin/IBM] > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 3716 bytes Desc: S/MIME Cryptographic Signature URL: From S.J.Thompson at bham.ac.uk Wed Sep 7 13:36:48 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Wed, 7 Sep 2016 12:36:48 +0000 Subject: [gpfsug-discuss] Remote cluster mount failing Message-ID: Hi All, I'm trying to get some multi cluster thing working between two of our GPFS clusters. In the "client" cluster, when trying to mount the "remote" cluster, I get: # mmmount gpfs Wed 7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ... mount: mount /dev/gpfs on /gpfs failed: Connection timed out mmmount: Command failed. Examine previous error messages to determine cause. And in the log file: Wed Sep 7 13:33:07.481 2016: [N] The client side TLS handshake with node 10.0.0.182 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.486 2016: [N] The client side TLS handshake with node 10.0.0.181 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.487 2016: [E] Failed to join remote cluster GPFS_STORAGE.CLUSTER Wed Sep 7 13:33:07.488 2016: [W] Command: err 78: mount GPFS_STORAGE.CLUSTER:gpfs Wed Sep 7 13:33:07.489 2016: Connection timed out In the remote cluster, I see: Wed Sep 7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222 failed with error 447 (server side). Wed Sep 7 13:33:07.488 2016: [X] Connection from 10.10.0.35 refused, authentication failed Wed Sep 7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err 703 Wed Sep 7 13:33:07.490 2016: Operation not permitted Weirdly though on other nodes in the client cluster this succeeds fine and can mount, so I think I got all the bits in the mmauth and mmremotecluster configured correctly. Any suggestions? 
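In case it's useful, the bits I've double-checked so far (and which look sane to me) are roughly:

# mmauth show all            (on each cluster, comparing keys and cipherList)
# mmremotecluster show all   (on the client cluster)
# mmremotefs show all        (on the client cluster)

though I may well be missing something subtle there.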
Thanks

Simon

From volobuev at us.ibm.com Wed Sep 7 17:38:03 2016
From: volobuev at us.ibm.com (Yuri L Volobuev)
Date: Wed, 7 Sep 2016 09:38:03 -0700
Subject: [gpfsug-discuss] Migration to separate metadata and data disks
In-Reply-To:
References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> <2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> <505ff859-d49a-04cc-bd9d-50f7b2a8df0b@cesnet.cz>
Message-ID:

Hi Miroslav,

The mmdf output is very helpful. It suggests very strongly what the problem is:

> ssd_5_5                    80G        3 Yes      No           22.31G ( 28%)        7.155G ( 9%)
> ssd_4_4                    80G        3 Yes      No           22.21G ( 28%)        7.196G ( 9%)
> ssd_3_3                    80G        3 Yes      No            22.2G ( 28%)        7.239G ( 9%)
> ssd_2_2                    80G        3 Yes      No           22.24G ( 28%)        7.146G ( 9%)
> ssd_1_1                    80G        3 Yes      No           22.29G ( 28%)        7.134G ( 9%)
> ...
>                         ==================== ===================
> (data)            1.535P               698.9T ( 44%)         4.17T ( 0%)
> (metadata)        262.3T               92.96T ( 35%)        3.621T ( 1%)
> ...
> Number of allocated inodes:  161947648
> Maximum number of inodes:   1342177280

You have 5 80G SSDs. That's not enough. Even with metadata spread across a couple dozen more SATA disks, the SSDs are over 3/4 full. There's no way to accurately estimate the amount of metadata in this file system with the data at hand, but if we (very conservatively) assume that each SATA disk has only as much metadata as each SSD, i.e. ~57G, that would greatly exceed the amount of free space available on your SSDs. You need more free metadata space.

Another way to look at this: you've got 1.5PB of data under management. A reasonable rule-of-thumb estimate for the amount of metadata is 1-2% of the data (this is a typical ratio, but of course every file system is different, and large deviations are possible. A degenerate case is an fs containing nothing but directories, and in that case metadata usage is 100%). So you have to have at least a few TB of metadata storage; even at the 1% end, 1.535P of data works out to roughly 15T of metadata, versus 5 x 80G = 400G of total SSD capacity. 5 80G SSDs aren't enough for an fs of this size.

> Could you please evaluate more on the performance overhead with having metadata
> on SSD+SATA? Are the read operations automatically directed to faster disks by GPFS?
> Is each write operation waiting for write to be finished by SATA disks?

Mixing disks with sharply different performance characteristics within a single storage pool is detrimental to performance. GPFS stripes blocks across all disks in a storage pool, expecting all of them to be equally suitable. If SSDs are mixed with SATA disks, the overall metadata write performance is going to be bottlenecked by SATA drives. On reads, given a choice of two replicas, GPFS V4.1.1+ picks the replica residing on the fastest disk, but given that SSDs represent only a small fraction of your total metadata usage, this likely doesn't help a whole lot. You're on the right track in trying to shift all metadata to SSDs and away from SATA; the overall file system performance will improve as a result.

yuri

> Thank you,
> --
> Miroslav Bauer
>
> On 09/06/2016 09:06 PM, Yuri L Volobuev wrote:
> The correct way to accomplish what you're looking for (in
> particular, changing the fs-wide level of replication) is
> mmrestripefs -R. This command also takes care of moving data off
> disks now marked metadataOnly.
> > The restripe job hits an error trying to move blocks of the inode > file, i.e. before it gets to actual user data blocks. Note that at > this point the metadata replication factor is still 2. This suggests > one of two possibilities: (1) there isn't enough actual free space > on the remaining metadataOnly disks, (2) there isn't enough space in > some failure groups to allocate two replicas. > > All of this assumes you're operating within a single storage pool. > If multiple storage pools are in play, there are other possibilities. > > 'mmdf' output would be helpful in providing more helpful advice. > With the information at hand, I can only suggest trying to > accomplish the task in two phases: (a) deallocated extra metadata > replicas, by doing mmchfs -m 1 + mmrestripefs -R (b) move metadata > off SATA disks. I do want to point out that metadata replication is > a highly recommended insurance policy to have for your file system. > As with other kinds of insurance, you may or may not need it, but if > you do end up needing it, you'll be very glad you have it. The > costs, in terms of extra metadata space and performance overhead, > are very reasonable. > > yuri > > > Miroslav Bauer ---09/01/2016 07:29:06 AM---Yes, failure group id is > exactly what I meant :). Unfortunately, mmrestripefs with -R > > From: Miroslav Bauer > To: gpfsug-discuss at spectrumscale.org, > Date: 09/01/2016 07:29 AM > Subject: Re: [gpfsug-discuss] Migration to separate metadata and data disks > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Yes, failure group id is exactly what I meant :). Unfortunately, > mmrestripefs with -R > behaves the same as with -r. I also believed that mmrestripefs -R is the > correct tool for > fixing the replication settings on inodes (according to manpages), but I > will try possible > solutions you and Marc suggested and let you know how it went. > > Thank you, > -- > Miroslav Bauer > > On 09/01/2016 04:02 PM, Aaron Knister wrote: > > Oh! I think you've already provided the info I was looking for :) I > > thought that failGroup=3 meant there were 3 failure groups within the > > SSDs. I suspect that's not at all what you meant and that actually is > > the failure group of all of those disks. That I think explains what's > > going on-- there's only one failure group's worth of metadata-capable > > disks available and as such GPFS can't place the 2nd replica for > > existing files. > > > > Here's what I would suggest: > > > > - Create at least 2 failure groups within the SSDs > > - Put the default metadata replication factor back to 2 > > - Run a restripefs -R to shuffle files around and restore the metadata > > replication factor of 2 to any files created while it was set to 1 > > > > If you're not interested in replication for metadata then perhaps all > > you need to do is the mmrestripefs -R. I think that should > > un-replicate the file from the SATA disks leaving the copy on the SSDs. > > > > Hope that helps. > > > > -Aaron > > > > On 9/1/16 9:39 AM, Aaron Knister wrote: > >> By the way, I suspect the no space on device errors are because GPFS > >> believes for some reason that it is unable to maintain the metadata > >> replication factor of 2 that's likely set on all previously created > >> inodes. > >> > >> On 9/1/16 9:36 AM, Aaron Knister wrote: > >>> I must admit, I'm curious as to the reason you're dropping the > >>> replication factor from 2 down to 1. 
There are some serious advantages > >>> we've seen to having multiple metadata replicas, as far as error > >>> recovery is concerned. > >>> > >>> Could you paste an output of mmlsdisk for the filesystem? > >>> > >>> -Aaron > >>> > >>> On 9/1/16 9:30 AM, Miroslav Bauer wrote: > >>>> Hello, > >>>> > >>>> I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the > >>>> filesystem metadata from state: > >>>> -m = 2 (default metadata replicas) > >>>> - SATA disks (dataAndMetadata, failGroup=1) > >>>> - SSDs (metadataOnly, failGroup=3) > >>>> to the desired state: > >>>> -m = 1 > >>>> - SATA disks (dataOnly, failGroup=1) > >>>> - SSDs (metadataOnly, failGroup=3) > >>>> > >>>> I have done the following steps in the following order: > >>>> 1) change SATA disks to dataOnly (stanza file modifies the 'usage' > >>>> attribute only): > >>>> # mmchdisk fs1 change -F dataOnly_disks.stanza > >>>> Attention: Disk parameters were changed. > >>>> ? Use the mmrestripefs command with the -r option to relocate data and > >>>> metadata. > >>>> Verifying file system configuration information ... > >>>> mmchdisk: Propagating the cluster configuration data to all > >>>> ? affected nodes. ?This is an asynchronous process. > >>>> > >>>> 2) change default metadata replicas number 2->1 > >>>> # mmchfs fs1 -m 1 > >>>> > >>>> 3) run mmrestripefs as suggested by output of 1) > >>>> # mmrestripefs fs1 -r > >>>> Scanning file system metadata, phase 1 ... > >>>> Error processing inodes. > >>>> No space left on device > >>>> mmrestripefs: Command failed. ?Examine previous error messages to > >>>> determine cause. > >>>> > >>>> It is, however, still possible to create new files on the filesystem. > >>>> When I return one of the SATA disks as a dataAndMetadata disk, the > >>>> mmrestripefs > >>>> command stops complaining about No space left on device. Both df and > >>>> mmdf > >>>> say that there is enough space both for data (SATA) and metadata > >>>> (SSDs). > >>>> Does anyone have an idea why is it complaining? > >>>> > >>>> Thanks, > >>>> > >>>> -- > >>>> Miroslav Bauer > >>>> > >>>> > >>>> > >>>> > >>>> _______________________________________________ > >>>> gpfsug-discuss mailing list > >>>> gpfsug-discuss at spectrumscale.org > >>>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > >>>> > >>> > >> > > > > > [attachment "smime.p7s" deleted by Yuri L Volobuev/Austin/IBM] > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > [attachment "smime.p7s" deleted by Yuri L Volobuev/Austin/IBM] > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Wed Sep 7 17:58:07 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Wed, 7 Sep 2016 09:58:07 -0700 Subject: [gpfsug-discuss] Remote cluster mount failing In-Reply-To: References: Message-ID: It's unclear what's wrong. I'd have two main suspects: (1) TLS protocol version confusion, due to a difference in GSKit version and/or configuration (e.g. NIST SP800 compliance) on two sides (2) firewall. TLS issues are usually messy and tedious to work though. 
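One quick sanity check that sometimes narrows this down is to compare the security-related settings and the GSKit level on a node in each cluster, something along the lines of:

# mmlsconfig cipherList
# mmlsconfig nistCompliance
# rpm -qa | grep gpfs.gskit

and to confirm that the client node can actually reach the contact nodes on the GPFS daemon port (1191/tcp by default).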
I'd recommend opening a PMR to facilitate debug data collection and analysis. A lot of gory detail may be needed to figure out what's going on. yuri From: "Simon Thompson (Research Computing - IT Services)" To: "gpfsug-discuss at spectrumscale.org" , Date: 09/07/2016 05:37 AM Subject: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi All, I'm trying to get some multi cluster thing working between two of our GPFS clusters. In the "client" cluster, when trying to mount the "remote" cluster, I get: # mmmount gpfs Wed 7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ... mount: mount /dev/gpfs on /gpfs failed: Connection timed out mmmount: Command failed. Examine previous error messages to determine cause. And in the log file: Wed Sep 7 13:33:07.481 2016: [N] The client side TLS handshake with node 10.0.0.182 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.486 2016: [N] The client side TLS handshake with node 10.0.0.181 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.487 2016: [E] Failed to join remote cluster GPFS_STORAGE.CLUSTER Wed Sep 7 13:33:07.488 2016: [W] Command: err 78: mount GPFS_STORAGE.CLUSTER:gpfs Wed Sep 7 13:33:07.489 2016: Connection timed out In the remote cluster, I see: Wed Sep 7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222 failed with error 447 (server side). Wed Sep 7 13:33:07.488 2016: [X] Connection from 10.10.0.35 refused, authentication failed Wed Sep 7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err 703 Wed Sep 7 13:33:07.490 2016: Operation not permitted Weirdly though on other nodes in the client cluster this succeeds fine and can mount, so I think I got all the bits in the mmauth and mmremotecluster configured correctly. Any suggestions? Thanks Simon _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From Valdis.Kletnieks at vt.edu Wed Sep 7 19:45:43 2016 From: Valdis.Kletnieks at vt.edu (Valdis Kletnieks) Date: Wed, 07 Sep 2016 14:45:43 -0400 Subject: [gpfsug-discuss] Weirdness with 'mmces address add' Message-ID: <27691.1473273943@turing-police.cc.vt.edu> We're in the middle of deploying Spectrum Archive, and I've hit a snag. We assigned some floating IP addresses, which now need to be changed. So I look at the mmces manpage, and it looks like I need to add the new addresses, and delete the old ones. We're on GPFS 4.2.1.0, if that matters... What 'man mmces' says: 1. To add an address to a specified node, issue this command: mmces address add --ces-node node1 --ces-ip 10.1.2.3 (and at least 6 or 8 more uses of an IP address). What happens when I try it: (And yes, we have an 'isb' ces-group defined with addresses in it already) # mmces address add --ces-group isb --ces-ip 172.28.45.72 Cannot resolve 172.28.45.72; Name or service not known mmces address add: Incorrect value for --ces-ip option Usage: mmces address add [--ces-node Node] [--attribute Attribute] [--ces-group Group] {--ces-ip {IP[,IP...]} Am I missing some special sauce? 
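For what it's worth, a reverse lookup of that address from the node comes up empty, e.g.:

# getent hosts 172.28.45.72
(returns nothing)

so that may or may not be what the command is objecting to.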
(My first guess is that it's complaining because there's no PTR in the DNS for that address yet - but if it was going to do DNS lookups, it should be valid to give a hostname rather than an IP address (and nowhere in the manpage does it even *hint* that --ces-ip can be anything other than a list of IP addresses). Or is it time for me to file a PMR? From xhejtman at ics.muni.cz Wed Sep 7 21:11:11 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Wed, 7 Sep 2016 22:11:11 +0200 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: References: Message-ID: <20160907201111.xmksazqjekk2ihsy@ics.muni.cz> On Tue, Sep 06, 2016 at 02:04:36PM +0200, Dominic Mueller-Wicke01 wrote: > Hi Miroslav, > > please use the command: > dsmrecall -resident -detail > or use it with file lists well, it looks like Client Version 7, Release 1, Level 4.4 leaks file descriptors: 09/07/2016 21:03:07 ANS1587W Unable to read extended attributes for object /exports/tape_tape/VO_metacentrum/home/jfeit/atlases/atlases/novo3/atlases/images/.svn/prop-base due to errno: 24, reason: Too many open files after about 15 minutes of run, I can see 88 opened files in /proc/$PID/fd when using: dsmrecall -R -RESid -D /path/* is it something known fixed in newer versions? -- Luk?? Hejtm?nek From taylorm at us.ibm.com Wed Sep 7 21:40:13 2016 From: taylorm at us.ibm.com (Michael L Taylor) Date: Wed, 7 Sep 2016 13:40:13 -0700 Subject: [gpfsug-discuss] Weirdness with 'mmces address add' In-Reply-To: References: Message-ID: Can't be for certain this is what you're hitting but reverse DNS lookup is documented the KC: http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1ins_protocolnodeipfurtherconfig.htm Note: All CES IPs must have an associated hostname and reverse DNS lookup must be configured for each. For more information, see Adding export IPs in Deploying protocols. http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1ins_deployingprotocolstasks.htm Note: Export IPs must have an associated hostname and reverse DNS lookup must be configured for each. Can you make sure the IPs have reverse DNS lookup and try again? Will get the mmces man page updated for address add -------------- next part -------------- An HTML attachment was scrubbed... URL: From Valdis.Kletnieks at vt.edu Wed Sep 7 22:23:30 2016 From: Valdis.Kletnieks at vt.edu (Valdis.Kletnieks at vt.edu) Date: Wed, 07 Sep 2016 17:23:30 -0400 Subject: [gpfsug-discuss] Weirdness with 'mmces address add' In-Reply-To: References: Message-ID: <41089.1473283410@turing-police.cc.vt.edu> On Wed, 07 Sep 2016 13:40:13 -0700, "Michael L Taylor" said: > Can't be for certain this is what you're hitting but reverse DNS lookup is > documented the KC: > Note: All CES IPs must have an associated hostname and reverse DNS lookup > must be configured for each. For more information, see Adding export IPs in > Deploying protocols. Bingo. That was it. Since the DNS will take a while to fix, I fed the appropriate entries to /etc/hosts and it worked fine. I got thrown for a loop because if there is enough code to do that checking, it should be able to accept a hostname as well (RFE time? 
:) From ulmer at ulmer.org Wed Sep 7 22:34:07 2016 From: ulmer at ulmer.org (Stephen Ulmer) Date: Wed, 7 Sep 2016 17:34:07 -0400 Subject: [gpfsug-discuss] Weirdness with 'mmces address add' In-Reply-To: <41089.1473283410@turing-police.cc.vt.edu> References: <41089.1473283410@turing-police.cc.vt.edu> Message-ID: Hostnames can have many A records. IPs *generally* only have one PTR (though it?s not restricted, multiple PTRs is not recommended). Just knowing that you can see why allowing names would create more questions than it answers. So if it did take names instead of IP addresses, it would usually only do what you meant part of the time -- and sometimes none of the time. :) -- Stephen > On Sep 7, 2016, at 5:23 PM, Valdis.Kletnieks at vt.edu wrote: > > On Wed, 07 Sep 2016 13:40:13 -0700, "Michael L Taylor" said: > >> Can't be for certain this is what you're hitting but reverse DNS lookup is >> documented the KC: > >> Note: All CES IPs must have an associated hostname and reverse DNS lookup >> must be configured for each. For more information, see Adding export IPs in >> Deploying protocols. > > Bingo. That was it. Since the DNS will take a while to fix, I fed > the appropriate entries to /etc/hosts and it worked fine. > > I got thrown for a loop because if there is enough code to do that checking, > it should be able to accept a hostname as well (RFE time? :) > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss From Valdis.Kletnieks at vt.edu Wed Sep 7 22:54:05 2016 From: Valdis.Kletnieks at vt.edu (Valdis.Kletnieks at vt.edu) Date: Wed, 07 Sep 2016 17:54:05 -0400 Subject: [gpfsug-discuss] Weirdness with 'mmces address add' In-Reply-To: References: <41089.1473283410@turing-police.cc.vt.edu> Message-ID: <43934.1473285245@turing-police.cc.vt.edu> On Wed, 07 Sep 2016 17:34:07 -0400, Stephen Ulmer said: > Hostnames can have many A records. And quad-A records. :) (Despite our best efforts, we're still one of the 100 biggest IPv6 deployments according to http://www.worldipv6launch.org/measurements/ - were's sitting at 84th in traffic volume and 18th by percent penetration, mostly because we deployed it in production literally last century...) From janfrode at tanso.net Thu Sep 8 06:08:47 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Thu, 08 Sep 2016 05:08:47 +0000 Subject: [gpfsug-discuss] Weirdness with 'mmces address add' In-Reply-To: <27691.1473273943@turing-police.cc.vt.edu> References: <27691.1473273943@turing-police.cc.vt.edu> Message-ID: I believe your first guess is correct. The ces-ip needs to be resolvable for some reason... Just put a name for it in /etc/hosts, if you can't add it to your dns. -jf ons. 7. sep. 2016 kl. 20.45 skrev Valdis Kletnieks : > We're in the middle of deploying Spectrum Archive, and I've hit a > snag. We assigned some floating IP addresses, which now need to > be changed. So I look at the mmces manpage, and it looks like I need > to add the new addresses, and delete the old ones. > > We're on GPFS 4.2.1.0, if that matters... > > What 'man mmces' says: > > 1. To add an address to a specified node, issue this command: > > mmces address add --ces-node node1 --ces-ip 10.1.2.3 > > (and at least 6 or 8 more uses of an IP address). 
> > What happens when I try it: (And yes, we have an 'isb' ces-group defined > with > addresses in it already) > > # mmces address add --ces-group isb --ces-ip 172.28.45.72 > Cannot resolve 172.28.45.72; Name or service not known > mmces address add: Incorrect value for --ces-ip option > Usage: > mmces address add [--ces-node Node] [--attribute Attribute] [--ces-group > Group] > {--ces-ip {IP[,IP...]} > > Am I missing some special sauce? (My first guess is that it's complaining > because there's no PTR in the DNS for that address yet - but if it was > going > to do DNS lookups, it should be valid to give a hostname rather than an IP > address (and nowhere in the manpage does it even *hint* that --ces-ip can > be anything other than a list of IP addresses). > > Or is it time for me to file a PMR? > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From dominic.mueller at de.ibm.com Thu Sep 8 06:35:55 2016 From: dominic.mueller at de.ibm.com (Dominic Mueller-Wicke01) Date: Thu, 8 Sep 2016 07:35:55 +0200 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: References: Message-ID: Please open a PMR for the not working "recall to resident". Some investigation is needed here. Thanks. Greetings, Dominic. From: gpfsug-discuss-request at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Date: 07.09.2016 23:23 Subject: gpfsug-discuss Digest, Vol 56, Issue 14 Sent by: gpfsug-discuss-bounces at spectrumscale.org Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: Remote cluster mount failing (Yuri L Volobuev) 2. Weirdness with 'mmces address add' (Valdis Kletnieks) 3. Re: DMAPI - Unmigrate file to Regular state (Lukas Hejtmanek) 4. Weirdness with 'mmces address add' (Michael L Taylor) 5. Re: Weirdness with 'mmces address add' (Valdis.Kletnieks at vt.edu) ----- Message from "Yuri L Volobuev" on Wed, 7 Sep 2016 09:58:07 -0700 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Remote cluster mount failing It's unclear what's wrong. I'd have two main suspects: (1) TLS protocol version confusion, due to a difference in GSKit version and/or configuration (e.g. NIST SP800 compliance) on two sides (2) firewall. TLS issues are usually messy and tedious to work though. I'd recommend opening a PMR to facilitate debug data collection and analysis. A lot of gory detail may be needed to figure out what's going on. 
yuri Inactive hide details for "Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to"Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to get some multi cluster thing working between two of our GPFS From: "Simon Thompson (Research Computing - IT Services)" To: "gpfsug-discuss at spectrumscale.org" , Date: 09/07/2016 05:37 AM Subject: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi All, I'm trying to get some multi cluster thing working between two of our GPFS clusters. In the "client" cluster, when trying to mount the "remote" cluster, I get: # mmmount gpfs Wed 7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ... mount: mount /dev/gpfs on /gpfs failed: Connection timed out mmmount: Command failed. Examine previous error messages to determine cause. And in the log file: Wed Sep 7 13:33:07.481 2016: [N] The client side TLS handshake with node 10.0.0.182 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.486 2016: [N] The client side TLS handshake with node 10.0.0.181 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.487 2016: [E] Failed to join remote cluster GPFS_STORAGE.CLUSTER Wed Sep 7 13:33:07.488 2016: [W] Command: err 78: mount GPFS_STORAGE.CLUSTER:gpfs Wed Sep 7 13:33:07.489 2016: Connection timed out In the remote cluster, I see: Wed Sep 7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222 failed with error 447 (server side). Wed Sep 7 13:33:07.488 2016: [X] Connection from 10.10.0.35 refused, authentication failed Wed Sep 7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err 703 Wed Sep 7 13:33:07.490 2016: Operation not permitted Weirdly though on other nodes in the client cluster this succeeds fine and can mount, so I think I got all the bits in the mmauth and mmremotecluster configured correctly. Any suggestions? Thanks Simon _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ----- Message from Valdis Kletnieks on Wed, 07 Sep 2016 14:45:43 -0400 ----- To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] Weirdness with 'mmces address add' We're in the middle of deploying Spectrum Archive, and I've hit a snag. We assigned some floating IP addresses, which now need to be changed. So I look at the mmces manpage, and it looks like I need to add the new addresses, and delete the old ones. We're on GPFS 4.2.1.0, if that matters... What 'man mmces' says: 1. To add an address to a specified node, issue this command: mmces address add --ces-node node1 --ces-ip 10.1.2.3 (and at least 6 or 8 more uses of an IP address). What happens when I try it: (And yes, we have an 'isb' ces-group defined with addresses in it already) # mmces address add --ces-group isb --ces-ip 172.28.45.72 Cannot resolve 172.28.45.72; Name or service not known mmces address add: Incorrect value for --ces-ip option Usage: mmces address add [--ces-node Node] [--attribute Attribute] [--ces-group Group] {--ces-ip {IP[,IP...]} Am I missing some special sauce? (My first guess is that it's complaining because there's no PTR in the DNS for that address yet - but if it was going to do DNS lookups, it should be valid to give a hostname rather than an IP address (and nowhere in the manpage does it even *hint* that --ces-ip can be anything other than a list of IP addresses). 
Or is it time for me to file a PMR? ----- Message from Lukas Hejtmanek on Wed, 7 Sep 2016 22:11:11 +0200 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state On Tue, Sep 06, 2016 at 02:04:36PM +0200, Dominic Mueller-Wicke01 wrote: > Hi Miroslav, > > please use the command: > dsmrecall -resident -detail > or use it with file lists well, it looks like Client Version 7, Release 1, Level 4.4 leaks file descriptors: 09/07/2016 21:03:07 ANS1587W Unable to read extended attributes for object /exports/tape_tape/VO_metacentrum/home/jfeit/atlases/atlases/novo3/atlases/images/.svn/prop-base due to errno: 24, reason: Too many open files after about 15 minutes of run, I can see 88 opened files in /proc/$PID/fd when using: dsmrecall -R -RESid -D /path/* is it something known fixed in newer versions? -- Luk?? Hejtm?nek ----- Message from "Michael L Taylor" on Wed, 7 Sep 2016 13:40:13 -0700 ----- To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] Weirdness with 'mmces address add' Can't be for certain this is what you're hitting but reverse DNS lookup is documented the KC: http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1ins_protocolnodeipfurtherconfig.htm Note: All CES IPs must have an associated hostname and reverse DNS lookup must be configured for each. For more information, see Adding export IPs in Deploying protocols. http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1ins_deployingprotocolstasks.htm Note: Export IPs must have an associated hostname and reverse DNS lookup must be configured for each. Can you make sure the IPs have reverse DNS lookup and try again? Will get the mmces man page updated for address add ----- Message from Valdis.Kletnieks at vt.edu on Wed, 07 Sep 2016 17:23:30 -0400 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Weirdness with 'mmces address add' On Wed, 07 Sep 2016 13:40:13 -0700, "Michael L Taylor" said: > Can't be for certain this is what you're hitting but reverse DNS lookup is > documented the KC: > Note: All CES IPs must have an associated hostname and reverse DNS lookup > must be configured for each. For more information, see Adding export IPs in > Deploying protocols. Bingo. That was it. Since the DNS will take a while to fix, I fed the appropriate entries to /etc/hosts and it worked fine. I got thrown for a loop because if there is enough code to do that checking, it should be able to accept a hostname as well (RFE time? :) _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From S.J.Thompson at bham.ac.uk Fri Sep 9 15:37:28 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Fri, 9 Sep 2016 14:37:28 +0000 Subject: [gpfsug-discuss] Remote cluster mount failing In-Reply-To: References: Message-ID: That?s sorta what I was expecting. Though I was hoping someone might have said 'oh just run mmchconfig ....' or something easy. PMR on its way in. Thanks! 
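(If it does turn out to be the NIST/GSKit angle, I'm guessing the eventual fix is something like mmchconfig nistCompliance=... to bring the two sides in line, or a fresh mmauth genkey new / mmauth update exchange - but I'll let the PMR confirm that rather than poke at it blind.)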
Simon From: > on behalf of Yuri L Volobuev > Reply-To: "gpfsug-discuss at spectrumscale.org" > Date: Wednesday, 7 September 2016 at 17:58 To: "gpfsug-discuss at spectrumscale.org" > Subject: Re: [gpfsug-discuss] Remote cluster mount failing It's unclear what's wrong. I'd have two main suspects: (1) TLS protocol version confusion, due to a difference in GSKit version and/or configuration (e.g. NIST SP800 compliance) on two sides (2) firewall. TLS issues are usually messy and tedious to work though. I'd recommend opening a PMR to facilitate debug data collection and analysis. A lot of gory detail may be needed to figure out what's going on. yuri [Inactive hide details for "Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to]"Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to get some multi cluster thing working between two of our GPFS From: "Simon Thompson (Research Computing - IT Services)" > To: "gpfsug-discuss at spectrumscale.org" >, Date: 09/07/2016 05:37 AM Subject: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi All, I'm trying to get some multi cluster thing working between two of our GPFS clusters. In the "client" cluster, when trying to mount the "remote" cluster, I get: # mmmount gpfs Wed 7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ... mount: mount /dev/gpfs on /gpfs failed: Connection timed out mmmount: Command failed. Examine previous error messages to determine cause. And in the log file: Wed Sep 7 13:33:07.481 2016: [N] The client side TLS handshake with node 10.0.0.182 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.486 2016: [N] The client side TLS handshake with node 10.0.0.181 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.487 2016: [E] Failed to join remote cluster GPFS_STORAGE.CLUSTER Wed Sep 7 13:33:07.488 2016: [W] Command: err 78: mount GPFS_STORAGE.CLUSTER:gpfs Wed Sep 7 13:33:07.489 2016: Connection timed out In the remote cluster, I see: Wed Sep 7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222 failed with error 447 (server side). Wed Sep 7 13:33:07.488 2016: [X] Connection from 10.10.0.35 refused, authentication failed Wed Sep 7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err 703 Wed Sep 7 13:33:07.490 2016: Operation not permitted Weirdly though on other nodes in the client cluster this succeeds fine and can mount, so I think I got all the bits in the mmauth and mmremotecluster configured correctly. Any suggestions? Thanks Simon _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: graycol.gif URL: From volobuev at us.ibm.com Fri Sep 9 17:29:35 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Fri, 9 Sep 2016 09:29:35 -0700 Subject: [gpfsug-discuss] Remote cluster mount failing In-Reply-To: References: Message-ID: It could be "easy" in the end, e.g. regenerating the key ("mmauth genkey new") may fix the issue. 
Figuring out exactly what is going wrong is messy though, and requires looking at a number of debug data points, something that's awkward to do on a public mailing list. I don't think you want to post certificates et al on a mailing list. The PMR channel is more appropriate for this kind of thing. yuri From: "Simon Thompson (Research Computing - IT Services)" To: gpfsug main discussion list , Date: 09/09/2016 07:37 AM Subject: Re: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org That?s sorta what I was expecting. Though I was hoping someone might have said 'oh just run mmchconfig ....' or something easy. PMR on its way in. Thanks! Simon From: on behalf of Yuri L Volobuev Reply-To: "gpfsug-discuss at spectrumscale.org" < gpfsug-discuss at spectrumscale.org> Date: Wednesday, 7 September 2016 at 17:58 To: "gpfsug-discuss at spectrumscale.org" Subject: Re: [gpfsug-discuss] Remote cluster mount failing It's unclear what's wrong. I'd have two main suspects: (1) TLS protocol version confusion, due to a difference in GSKit version and/or configuration (e.g. NIST SP800 compliance) on two sides (2) firewall. TLS issues are usually messy and tedious to work though. I'd recommend opening a PMR to facilitate debug data collection and analysis. A lot of gory detail may be needed to figure out what's going on. yuri Inactive hide details for "Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to"Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to get some multi cluster thing working between two of our GPFS From: "Simon Thompson (Research Computing - IT Services)" < S.J.Thompson at bham.ac.uk> To: "gpfsug-discuss at spectrumscale.org" , Date: 09/07/2016 05:37 AM Subject: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi All, I'm trying to get some multi cluster thing working between two of our GPFS clusters. In the "client" cluster, when trying to mount the "remote" cluster, I get: # mmmount gpfs Wed 7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ... mount: mount /dev/gpfs on /gpfs failed: Connection timed out mmmount: Command failed. Examine previous error messages to determine cause. And in the log file: Wed Sep 7 13:33:07.481 2016: [N] The client side TLS handshake with node 10.0.0.182 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.486 2016: [N] The client side TLS handshake with node 10.0.0.181 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.487 2016: [E] Failed to join remote cluster GPFS_STORAGE.CLUSTER Wed Sep 7 13:33:07.488 2016: [W] Command: err 78: mount GPFS_STORAGE.CLUSTER:gpfs Wed Sep 7 13:33:07.489 2016: Connection timed out In the remote cluster, I see: Wed Sep 7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222 failed with error 447 (server side). Wed Sep 7 13:33:07.488 2016: [X] Connection from 10.10.0.35 refused, authentication failed Wed Sep 7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err 703 Wed Sep 7 13:33:07.490 2016: Operation not permitted Weirdly though on other nodes in the client cluster this succeeds fine and can mount, so I think I got all the bits in the mmauth and mmremotecluster configured correctly. Any suggestions? 
Thanks Simon _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss [attachment "graycol.gif" deleted by Yuri L Volobuev/Austin/IBM] -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From bbanister at jumptrading.com Sat Sep 10 22:50:25 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Sat, 10 Sep 2016 21:50:25 +0000 Subject: [gpfsug-discuss] Edge Attendees In-Reply-To: References: Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB063297AB@CHI-EXCHANGEW1.w2k.jumptrading.com> Hi Doug, Found out that I get to attend this year. Please put me down for the SS NDA round-table, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Douglas O'flaherty Sent: Monday, August 29, 2016 12:34 AM To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] Edge Attendees Greetings: I am organizing an NDA round-table with the IBM Offering Managers at IBM Edge on Tuesday, September 20th at 1pm. The subject will be "The Future of IBM Spectrum Scale." IBM Offering Managers are the Product Owners at IBM. There will be discussions covering licensing, the roadmap for IBM Spectrum Scale RAID (aka GNR), new hardware platforms, etc. This is a unique opportunity to get feedback to the drivers of the IBM Spectrum Scale business plans. It should be a great companion to the content we get from Engineering and Research at most User Group meetings. To get an invitation, please email me privately at douglasof us.ibm.com. All who have a valid NDA are invited. I only need an approximate headcount of attendees. Try not to spam the mailing list. I am pushing to get the Offering Managers to have a similar session at SC16 as an IBM Multi-client Briefing. You can add your voice to that call on this thread, or email me directly. Spectrum Scale User Group at SC16 will once again take place on Sunday afternoon with cocktails to follow. I hope we can blow out the attendance numbers and the number of site speakers we had last year! I know Simon, Bob, and Kristy are already working the agenda. Get your ideas in to them or to me. See you in Vegas, Vegas, SLC, Vegas this Fall... Maybe Australia in between? doug Douglas O'Flaherty IBM Spectrum Storage Marketing ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. 
-------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Sun Sep 11 22:02:48 2016 From: aaron.s.knister at nasa.gov (Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]) Date: Sun, 11 Sep 2016 21:02:48 +0000 Subject: [gpfsug-discuss] GPFS Routers Message-ID: <5F910253243E6A47B81A9A2EB424BBA101D137D1@NDMSMBX404.ndc.nasa.gov> Hi Everyone, A while back I seem to recall hearing about a mechanism being developed that would function similarly to Lustre's LNET routers and effectively allow a single set of NSD servers to talk to multiple RDMA fabrics without requiring the NSD servers to have infiniband interfaces on each RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each fabric that would in effect proxy the RDMA requests to the NSD server. Does anyone know what I'm talking about? Just curious if it's still on the roadmap. -Aaron -------------- next part -------------- An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Sun Sep 11 23:31:56 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Sun, 11 Sep 2016 22:31:56 +0000 Subject: [gpfsug-discuss] Grafana Bridge Code - for GPFS Performance Sensors - Now on the IBM Wiki Message-ID: <2B003708-B2E3-474B-8035-F3A080CB2EAF@nuance.com> IBM has formally published this bridge code - and you can get the details and download it here: IBM Spectrum Scale Performance Monitoring Bridge https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/General%20Parallel%20File%20System%20(GPFS)/page/IBM%20Spectrum%20Scale%20Performance Monitoring%20Bridge Also, see this Storage Community Blog Post (it references version 4.2.2, but I think they mean 4.2.1) http://storagecommunity.org/easyblog/entry/performance-data-graphical-visualization-for-ibm-spectrum-scale-environment I've been using it for a while - if you have any questions, let me know. Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Mon Sep 12 01:00:32 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Sun, 11 Sep 2016 20:00:32 -0400 Subject: [gpfsug-discuss] GPFS Routers In-Reply-To: <5F910253243E6A47B81A9A2EB424BBA101D137D1@NDMSMBX404.ndc.nasa.gov> References: <5F910253243E6A47B81A9A2EB424BBA101D137D1@NDMSMBX404.ndc.nasa.gov> Message-ID: <712e5024-e6eb-9195-d9cd-f59a9b145e60@nasa.gov> After some googling around, I wonder if perhaps what I'm thinking of was an I/O forwarding layer that I understood was being developed for x86_64 type machines rather than some type of GPFS protocol router or proxy. -Aaron On 9/11/16 5:02 PM, Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP] wrote: > Hi Everyone, > > A while back I seem to recall hearing about a mechanism being developed > that would function similarly to Lustre's LNET routers and effectively > allow a single set of NSD servers to talk to multiple RDMA fabrics > without requiring the NSD servers to have infiniband interfaces on each > RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each > fabric that would in effect proxy the RDMA requests to the NSD server. > Does anyone know what I'm talking about? Just curious if it's still on > the roadmap. 
> > -Aaron > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From douglasof at us.ibm.com Mon Sep 12 02:38:08 2016 From: douglasof at us.ibm.com (Douglas O'flaherty) Date: Sun, 11 Sep 2016 21:38:08 -0400 Subject: [gpfsug-discuss] gpfsug-discuss Digest, Vol 56, Issue 17 In-Reply-To: References: Message-ID: See you... and anyone else who can make it in Vegas in a couple weeks! From: gpfsug-discuss-request at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Date: 09/11/2016 07:00 AM Subject: gpfsug-discuss Digest, Vol 56, Issue 17 Sent by: gpfsug-discuss-bounces at spectrumscale.org Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: Edge Attendees (Bryan Banister) ----- Message from Bryan Banister on Sat, 10 Sep 2016 21:50:25 +0000 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Edge Attendees Hi Doug, Found out that I get to attend this year. Please put me down for the SS NDA round-table, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [ mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Douglas O'flaherty Sent: Monday, August 29, 2016 12:34 AM To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] Edge Attendees Greetings: I am organizing an NDA round-table with the IBM Offering Managers at IBM Edge on Tuesday, September 20th at 1pm. The subject will be "The Future of IBM Spectrum Scale." IBM Offering Managers are the Product Owners at IBM. There will be discussions covering licensing, the roadmap for IBM Spectrum Scale RAID (aka GNR), new hardware platforms, etc. This is a unique opportunity to get feedback to the drivers of the IBM Spectrum Scale business plans. It should be a great companion to the content we get from Engineering and Research at most User Group meetings. To get an invitation, please email me privately at douglasof us.ibm.com. All who have a valid NDA are invited. I only need an approximate headcount of attendees. Try not to spam the mailing list. I am pushing to get the Offering Managers to have a similar session at SC16 as an IBM Multi-client Briefing. You can add your voice to that call on this thread, or email me directly. Spectrum Scale User Group at SC16 will once again take place on Sunday afternoon with cocktails to follow. I hope we can blow out the attendance numbers and the number of site speakers we had last year! I know Simon, Bob, and Kristy are already working the agenda. Get your ideas in to them or to me. See you in Vegas, Vegas, SLC, Vegas this Fall... Maybe Australia in between? doug Douglas O'Flaherty IBM Spectrum Storage Marketing Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. 
If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From knop at us.ibm.com Mon Sep 12 06:17:05 2016 From: knop at us.ibm.com (Felipe Knop) Date: Mon, 12 Sep 2016 01:17:05 -0400 Subject: [gpfsug-discuss] Remote cluster mount failing In-Reply-To: References: Message-ID: There is a chance the problem might be related to an upgrade from 3.5 to 4.1, or perhaps a remote mount between versions 3.5 and 4.1. It would be useful to know details related to any such migration and different releases when the PMR is opened. Thanks, Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 From: Yuri L Volobuev/Austin/IBM at IBMUS To: gpfsug main discussion list Date: 09/09/2016 12:30 PM Subject: Re: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org It could be "easy" in the end, e.g. regenerating the key ("mmauth genkey new") may fix the issue. Figuring out exactly what is going wrong is messy though, and requires looking at a number of debug data points, something that's awkward to do on a public mailing list. I don't think you want to post certificates et al on a mailing list. The PMR channel is more appropriate for this kind of thing. yuri "Simon Thompson (Research Computing - IT Services)" ---09/09/2016 07:37:52 AM---That?s sorta what I was expecting. Though I was hoping someone might have said 'oh just run mmchconf From: "Simon Thompson (Research Computing - IT Services)" To: gpfsug main discussion list , Date: 09/09/2016 07:37 AM Subject: Re: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org That?s sorta what I was expecting. Though I was hoping someone might have said 'oh just run mmchconfig ....' or something easy. PMR on its way in. Thanks! Simon From: on behalf of Yuri L Volobuev Reply-To: "gpfsug-discuss at spectrumscale.org" < gpfsug-discuss at spectrumscale.org> Date: Wednesday, 7 September 2016 at 17:58 To: "gpfsug-discuss at spectrumscale.org" Subject: Re: [gpfsug-discuss] Remote cluster mount failing It's unclear what's wrong. I'd have two main suspects: (1) TLS protocol version confusion, due to a difference in GSKit version and/or configuration (e.g. NIST SP800 compliance) on two sides (2) firewall. TLS issues are usually messy and tedious to work though. I'd recommend opening a PMR to facilitate debug data collection and analysis. A lot of gory detail may be needed to figure out what's going on. 
yuri "Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to get some multi cluster thing working between two of our GPFS From: "Simon Thompson (Research Computing - IT Services)" < S.J.Thompson at bham.ac.uk> To: "gpfsug-discuss at spectrumscale.org" , Date: 09/07/2016 05:37 AM Subject: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi All, I'm trying to get some multi cluster thing working between two of our GPFS clusters. In the "client" cluster, when trying to mount the "remote" cluster, I get: # mmmount gpfs Wed 7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ... mount: mount /dev/gpfs on /gpfs failed: Connection timed out mmmount: Command failed. Examine previous error messages to determine cause. And in the log file: Wed Sep 7 13:33:07.481 2016: [N] The client side TLS handshake with node 10.0.0.182 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.486 2016: [N] The client side TLS handshake with node 10.0.0.181 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.487 2016: [E] Failed to join remote cluster GPFS_STORAGE.CLUSTER Wed Sep 7 13:33:07.488 2016: [W] Command: err 78: mount GPFS_STORAGE.CLUSTER:gpfs Wed Sep 7 13:33:07.489 2016: Connection timed out In the remote cluster, I see: Wed Sep 7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222 failed with error 447 (server side). Wed Sep 7 13:33:07.488 2016: [X] Connection from 10.10.0.35 refused, authentication failed Wed Sep 7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err 703 Wed Sep 7 13:33:07.490 2016: Operation not permitted Weirdly though on other nodes in the client cluster this succeeds fine and can mount, so I think I got all the bits in the mmauth and mmremotecluster configured correctly. Any suggestions? Thanks Simon _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss [attachment "graycol.gif" deleted by Yuri L Volobuev/Austin/IBM] _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 105 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 105 bytes Desc: not available URL: From makaplan at us.ibm.com Mon Sep 12 15:48:56 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Mon, 12 Sep 2016 10:48:56 -0400 Subject: [gpfsug-discuss] GPFS Routers In-Reply-To: <712e5024-e6eb-9195-d9cd-f59a9b145e60@nasa.gov> References: <5F910253243E6A47B81A9A2EB424BBA101D137D1@NDMSMBX404.ndc.nasa.gov> <712e5024-e6eb-9195-d9cd-f59a9b145e60@nasa.gov> Message-ID: Perhaps if you clearly describe what equipment and connections you have in place and what you're trying to accomplish, someone on this board can propose a solution. In principle, it's always possible to insert proxies/routers to "fake" any two endpoints into "believing" they are communicating directly. 
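For instance, if the NSD servers do have an HCA on each fabric, the fabric-number field of verbsPorts is the usual way today to let one set of servers speak RDMA natively to clients on several fabrics, with anything that cannot match a fabric number falling back to TCP/IP. A rough sketch only -- the device names and node classes below are made up, and the device/port/fabric notation and its exact behaviour should be checked against the documentation for your release:

mmchconfig verbsRdma=enable -N nsdServers,fabric1Nodes,fabric2Nodes
# NSD servers: one port on fabric 1, one on fabric 2 (the fabric numbers are arbitrary labels)
mmchconfig verbsPorts="mlx5_0/1/1 mlx5_1/1/2" -N nsdServers
# clients advertise only the fabric they actually sit on
mmchconfig verbsPorts="mlx5_0/1/1" -N fabric1Nodes
mmchconfig verbsPorts="mlx5_0/1/2" -N fabric2Nodes

That still requires an HCA per fabric in every NSD server, which is exactly the cost an LNET-style router layer would remove, so it is a stop-gap rather than an answer to the original question.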
From: Aaron Knister To: Date: 09/11/2016 08:01 PM Subject: Re: [gpfsug-discuss] GPFS Routers Sent by: gpfsug-discuss-bounces at spectrumscale.org After some googling around, I wonder if perhaps what I'm thinking of was an I/O forwarding layer that I understood was being developed for x86_64 type machines rather than some type of GPFS protocol router or proxy. -Aaron On 9/11/16 5:02 PM, Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP] wrote: > Hi Everyone, > > A while back I seem to recall hearing about a mechanism being developed > that would function similarly to Lustre's LNET routers and effectively > allow a single set of NSD servers to talk to multiple RDMA fabrics > without requiring the NSD servers to have infiniband interfaces on each > RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each > fabric that would in effect proxy the RDMA requests to the NSD server. > Does anyone know what I'm talking about? Just curious if it's still on > the roadmap. > > -Aaron > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhejtman at ics.muni.cz Mon Sep 12 15:57:55 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Mon, 12 Sep 2016 16:57:55 +0200 Subject: [gpfsug-discuss] gpfs 4.2.1 and samba export Message-ID: <20160912145755.xhx2du4c3aimkkxt@ics.muni.cz> Hello, I have GPFS version 4.2.1 on Centos 7.2 (kernel 3.10.0-327.22.2.el7.x86_64) and I have got some weird behavior of samba. Windows clients get stucked for almost 1 minute when copying files. I traced down the problematic syscall: 27887 16:39:28.000401 utimensat(AT_FDCWD, "000000-My_Documents/Windows/InfusedApps/Packages/Microsoft.Messaging_1.10.22012.0_x86__8wekyb3d8bbwe/SkypeApp/View/HomePage.xaml", {{1473691167, 940424000}, {1473691168, 295355}}, 0) = 0 <74.999775> [...] 27887 16:44:24.000310 utimensat(AT_FDCWD, "000000-My_Documents/Windows/InfusedApps/Packages/Microsoft.Windows.Photos_15.1001.16470.0_x64__8wekyb3d8bbwe/Assets/PhotosAppList.contrast-white_targetsize-16.png", {{1473691463, 931319000}, {1473691464, 96608}}, 0) = 0 <74.999841> [...] 27887 16:50:34.002274 utimensat(AT_FDCWD, "000000-My_Documents/Windows/InfusedApps/Packages/Microsoft.XboxApp_9.9.30030.0_x64__8wekyb3d8bbwe/_Resources/50.rsrc", {{1473691833, 952166000}, {1473691834, 2166223}}, 0) = 0 <74.997877> [...] 27887 16:53:11.000240 utimensat(AT_FDCWD, "000000-My_Documents/Windows/InfusedApps/Packages/Microsoft.ZuneVideo_3.6.13251.0_x64__8wekyb3d8bbwe/Styles/CommonBrushes.xbf", {{1473691990, 948668000}, {1473691991, 131221}}, 0) = 0 <74.999540> it seems that from time to time, utimensat(2) call takes over 70 (!!) seconds. Normal utimensat syscall looks like: 27887 16:55:16.238132 utimensat(AT_FDCWD, "000000-My_Documents/Windows/Installer/$PatchCache$/Managed/00004109210000000000000000F01FEC/14.0.7015/ACEODDBS.DLL", {{1473692116, 196458000}, {1351702318, 0}}, 0) = 0 <0.000065> At the same time, there is untar running. When samba freezes at utimensat call, untar continues to write data to GPFS (same fs as samba), so it does not seem to me as buffers flush. 
When the syscall is stucked, I/O utilization of all GPFS disks is below 10 %. mmfsadm dump waiters shows nothing waiting and any cluster node. So any ideas? Or should I just fire PMR? This is cluster config: clusterId 2745894253048382857 autoload no dmapiFileHandleSize 32 minReleaseLevel 4.2.1.0 ccrEnabled yes maxMBpS 20000 maxblocksize 8M cipherList AUTHONLY maxFilesToCache 10000 nsdSmallThreadRatio 1 nsdMaxWorkerThreads 480 ignorePrefetchLUNCount yes pagepool 48G prefetchThreads 320 worker1Threads 320 writebehindThreshhold 10485760 cifsBypassShareLocksOnRename yes cifsBypassTraversalChecking yes allowWriteWithDeleteChild yes adminMode central And this is file system config: flag value description ------------------- ------------------------ ----------------------------------- -f 65536 Minimum fragment size in bytes -i 4096 Inode size in bytes -I 32768 Indirect block size in bytes -m 1 Default number of metadata replicas -M 2 Maximum number of metadata replicas -r 1 Default number of data replicas -R 2 Maximum number of data replicas -j cluster Block allocation type -D nfs4 File locking semantics in effect -k all ACL semantics in effect -n 32 Estimated number of nodes that will mount file system -B 2097152 Block size -Q user;group;fileset Quotas accounting enabled user;group;fileset Quotas enforced none Default quotas enabled --perfileset-quota Yes Per-fileset quota enforcement --filesetdf Yes Fileset df enabled? -V 15.01 (4.2.0.0) File system version --create-time Wed Aug 24 17:38:40 2016 File system creation time -z No Is DMAPI enabled? -L 4194304 Logfile size -E Yes Exact mtime mount option -S No Suppress atime mount option -K whenpossible Strict replica allocation option --fastea Yes Fast external attributes enabled? --encryption No Encryption enabled? --inode-limit 402653184 Maximum number of inodes in all inode spaces --log-replicas 0 Number of log replicas --is4KAligned Yes is4KAligned? --rapid-repair Yes rapidRepair enabled? --write-cache-threshold 0 HAWC Threshold (max 65536) -P system Disk storage pools in file system -d nsd_A_m;nsd_B_m;nsd_C_m;nsd_D_m;nsd_A_LV1_d;nsd_A_LV2_d;nsd_A_LV3_d;nsd_A_LV4_d;nsd_B_LV1_d;nsd_B_LV2_d;nsd_B_LV3_d;nsd_B_LV4_d;nsd_C_LV1_d;nsd_C_LV2_d;nsd_C_LV3_d; -d nsd_C_LV4_d;nsd_D_LV1_d;nsd_D_LV2_d;nsd_D_LV3_d;nsd_D_LV4_d Disks in file system -A yes Automatic mount option -o none Additional mount options -T /gpfs/vol1 Default mount point --mount-priority 1 Mount priority -- Luk?? Hejtm?nek From chekh at stanford.edu Mon Sep 12 20:03:15 2016 From: chekh at stanford.edu (Alex Chekholko) Date: Mon, 12 Sep 2016 12:03:15 -0700 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? Message-ID: Hi, For a fileset with a quota on it, we have mmlsquota reporting 39TB utilization (out of 50TB quota), with 0 in_doubt. Running a 'du' on the same directory (where the fileset is junctioned) shows 21TB usage. I looked for sparse files (files that report different size via ls vs du). I looked at 'du --apparent-size ...' https://en.wikipedia.org/wiki/Sparse_file What else could it be? Is there some attribute I can scan for inside GPFS? Maybe where FILE_SIZE does not equal KB_ALLOCATED? 
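A policy scan along these lines should list such files (the policy file name, list name and output prefix are illustrative). Since the quota charge is well above the apparent-size total, the interesting direction is allocation exceeding file size, and the 1 MiB threshold just keeps ordinary block rounding out of the report:

RULE EXTERNAL LIST 'mismatch' EXEC ''
RULE 'findmismatch' LIST 'mismatch'
     SHOW(VARCHAR(KB_ALLOCATED) || ' KB allocated for ' || VARCHAR(FILE_SIZE) || ' bytes')
     WHERE KB_ALLOCATED * 1024 > FILE_SIZE + 1048576

mmapplypolicy /srv/gsfs0/projects/gbsc -P mismatch.pol -I defer -f /tmp/gbsc

With -I defer and -f no data is moved; the matching paths should land in /tmp/gbsc.list.mismatch for inspection.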
https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_usngfileattrbts.htm [root at scg-gs0 ~]# du -sm --apparent-size /srv/gsfs0/projects/gbsc/* 3977 /srv/gsfs0/projects/gbsc/Backups 1 /srv/gsfs0/projects/gbsc/benchmark 13109 /srv/gsfs0/projects/gbsc/Billing 198719 /srv/gsfs0/projects/gbsc/Clinical 1 /srv/gsfs0/projects/gbsc/Clinical_Vendors 1206523 /srv/gsfs0/projects/gbsc/Data 1 /srv/gsfs0/projects/gbsc/iPoP 123165 /srv/gsfs0/projects/gbsc/Macrogen 58676 /srv/gsfs0/projects/gbsc/Misc 6625890 /srv/gsfs0/projects/gbsc/mva 1 /srv/gsfs0/projects/gbsc/Proj 17 /srv/gsfs0/projects/gbsc/Projects 3290502 /srv/gsfs0/projects/gbsc/Resources 1 /srv/gsfs0/projects/gbsc/SeqCenter 1 /srv/gsfs0/projects/gbsc/share 514041 /srv/gsfs0/projects/gbsc/SNAP_Scoring 1 /srv/gsfs0/projects/gbsc/TCGA_Variants 267873 /srv/gsfs0/projects/gbsc/tools 9597797 /srv/gsfs0/projects/gbsc/workspace (adds up to about 21TB) [root at scg-gs0 ~]# mmlsquota -j projects.gbsc --block-size=G gsfs0 Block Limits | File Limits Filesystem type GB quota limit in_doubt grace | files quota limit in_doubt grace Remarks gsfs0 FILESET 39889 51200 51200 0 none | 1663212 0 0 4 none [root at scg-gs0 ~]# mmlsfileset gsfs0 |grep gbsc projects.gbsc Linked /srv/gsfs0/projects/gbsc Regards, -- Alex Chekholko chekh at stanford.edu From bbanister at jumptrading.com Mon Sep 12 20:06:59 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Mon, 12 Sep 2016 19:06:59 +0000 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? In-Reply-To: References: Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB0632A645@CHI-EXCHANGEW1.w2k.jumptrading.com> I'd recommend running a mmcheckquota and then check mmlsquota again, -Bryan -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Alex Chekholko Sent: Monday, September 12, 2016 2:03 PM To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? Hi, For a fileset with a quota on it, we have mmlsquota reporting 39TB utilization (out of 50TB quota), with 0 in_doubt. Running a 'du' on the same directory (where the fileset is junctioned) shows 21TB usage. I looked for sparse files (files that report different size via ls vs du). I looked at 'du --apparent-size ...' https://en.wikipedia.org/wiki/Sparse_file What else could it be? Is there some attribute I can scan for inside GPFS? Maybe where FILE_SIZE does not equal KB_ALLOCATED? 
https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_usngfileattrbts.htm [root at scg-gs0 ~]# du -sm --apparent-size /srv/gsfs0/projects/gbsc/* 3977 /srv/gsfs0/projects/gbsc/Backups 1 /srv/gsfs0/projects/gbsc/benchmark 13109 /srv/gsfs0/projects/gbsc/Billing 198719 /srv/gsfs0/projects/gbsc/Clinical 1 /srv/gsfs0/projects/gbsc/Clinical_Vendors 1206523 /srv/gsfs0/projects/gbsc/Data 1 /srv/gsfs0/projects/gbsc/iPoP 123165 /srv/gsfs0/projects/gbsc/Macrogen 58676 /srv/gsfs0/projects/gbsc/Misc 6625890 /srv/gsfs0/projects/gbsc/mva 1 /srv/gsfs0/projects/gbsc/Proj 17 /srv/gsfs0/projects/gbsc/Projects 3290502 /srv/gsfs0/projects/gbsc/Resources 1 /srv/gsfs0/projects/gbsc/SeqCenter 1 /srv/gsfs0/projects/gbsc/share 514041 /srv/gsfs0/projects/gbsc/SNAP_Scoring 1 /srv/gsfs0/projects/gbsc/TCGA_Variants 267873 /srv/gsfs0/projects/gbsc/tools 9597797 /srv/gsfs0/projects/gbsc/workspace (adds up to about 21TB) [root at scg-gs0 ~]# mmlsquota -j projects.gbsc --block-size=G gsfs0 Block Limits | File Limits Filesystem type GB quota limit in_doubt grace | files quota limit in_doubt grace Remarks gsfs0 FILESET 39889 51200 51200 0 none | 1663212 0 0 4 none [root at scg-gs0 ~]# mmlsfileset gsfs0 |grep gbsc projects.gbsc Linked /srv/gsfs0/projects/gbsc Regards, -- Alex Chekholko chekh at stanford.edu _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. From Kevin.Buterbaugh at Vanderbilt.Edu Mon Sep 12 20:08:28 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Mon, 12 Sep 2016 19:08:28 +0000 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? In-Reply-To: References: Message-ID: Hi Alex, While the numbers don?t match exactly, they?re close enough to prompt me to ask if data replication is possibly set to two? Thanks? Kevin On Sep 12, 2016, at 2:03 PM, Alex Chekholko > wrote: Hi, For a fileset with a quota on it, we have mmlsquota reporting 39TB utilization (out of 50TB quota), with 0 in_doubt. Running a 'du' on the same directory (where the fileset is junctioned) shows 21TB usage. I looked for sparse files (files that report different size via ls vs du). I looked at 'du --apparent-size ...' https://en.wikipedia.org/wiki/Sparse_file What else could it be? Is there some attribute I can scan for inside GPFS? Maybe where FILE_SIZE does not equal KB_ALLOCATED? 
https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_usngfileattrbts.htm [root at scg-gs0 ~]# du -sm --apparent-size /srv/gsfs0/projects/gbsc/* 3977 /srv/gsfs0/projects/gbsc/Backups 1 /srv/gsfs0/projects/gbsc/benchmark 13109 /srv/gsfs0/projects/gbsc/Billing 198719 /srv/gsfs0/projects/gbsc/Clinical 1 /srv/gsfs0/projects/gbsc/Clinical_Vendors 1206523 /srv/gsfs0/projects/gbsc/Data 1 /srv/gsfs0/projects/gbsc/iPoP 123165 /srv/gsfs0/projects/gbsc/Macrogen 58676 /srv/gsfs0/projects/gbsc/Misc 6625890 /srv/gsfs0/projects/gbsc/mva 1 /srv/gsfs0/projects/gbsc/Proj 17 /srv/gsfs0/projects/gbsc/Projects 3290502 /srv/gsfs0/projects/gbsc/Resources 1 /srv/gsfs0/projects/gbsc/SeqCenter 1 /srv/gsfs0/projects/gbsc/share 514041 /srv/gsfs0/projects/gbsc/SNAP_Scoring 1 /srv/gsfs0/projects/gbsc/TCGA_Variants 267873 /srv/gsfs0/projects/gbsc/tools 9597797 /srv/gsfs0/projects/gbsc/workspace (adds up to about 21TB) [root at scg-gs0 ~]# mmlsquota -j projects.gbsc --block-size=G gsfs0 Block Limits | File Limits Filesystem type GB quota limit in_doubt grace | files quota limit in_doubt grace Remarks gsfs0 FILESET 39889 51200 51200 0 none | 1663212 0 0 4 none [root at scg-gs0 ~]# mmlsfileset gsfs0 |grep gbsc projects.gbsc Linked /srv/gsfs0/projects/gbsc Regards, -- Alex Chekholko chekh at stanford.edu _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Mon Sep 12 21:26:51 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Mon, 12 Sep 2016 20:26:51 +0000 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? In-Reply-To: References: Message-ID: My thoughts exactly. Richard From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: 12 September 2016 20:08 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? Hi Alex, While the numbers don?t match exactly, they?re close enough to prompt me to ask if data replication is possibly set to two? Thanks? Kevin On Sep 12, 2016, at 2:03 PM, Alex Chekholko > wrote: Hi, For a fileset with a quota on it, we have mmlsquota reporting 39TB utilization (out of 50TB quota), with 0 in_doubt. Running a 'du' on the same directory (where the fileset is junctioned) shows 21TB usage. I looked for sparse files (files that report different size via ls vs du). I looked at 'du --apparent-size ...' https://en.wikipedia.org/wiki/Sparse_file What else could it be? Is there some attribute I can scan for inside GPFS? Maybe where FILE_SIZE does not equal KB_ALLOCATED? 
https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_usngfileattrbts.htm [root at scg-gs0 ~]# du -sm --apparent-size /srv/gsfs0/projects/gbsc/* 3977 /srv/gsfs0/projects/gbsc/Backups 1 /srv/gsfs0/projects/gbsc/benchmark 13109 /srv/gsfs0/projects/gbsc/Billing 198719 /srv/gsfs0/projects/gbsc/Clinical 1 /srv/gsfs0/projects/gbsc/Clinical_Vendors 1206523 /srv/gsfs0/projects/gbsc/Data 1 /srv/gsfs0/projects/gbsc/iPoP 123165 /srv/gsfs0/projects/gbsc/Macrogen 58676 /srv/gsfs0/projects/gbsc/Misc 6625890 /srv/gsfs0/projects/gbsc/mva 1 /srv/gsfs0/projects/gbsc/Proj 17 /srv/gsfs0/projects/gbsc/Projects 3290502 /srv/gsfs0/projects/gbsc/Resources 1 /srv/gsfs0/projects/gbsc/SeqCenter 1 /srv/gsfs0/projects/gbsc/share 514041 /srv/gsfs0/projects/gbsc/SNAP_Scoring 1 /srv/gsfs0/projects/gbsc/TCGA_Variants 267873 /srv/gsfs0/projects/gbsc/tools 9597797 /srv/gsfs0/projects/gbsc/workspace (adds up to about 21TB) [root at scg-gs0 ~]# mmlsquota -j projects.gbsc --block-size=G gsfs0 Block Limits | File Limits Filesystem type GB quota limit in_doubt grace | files quota limit in_doubt grace Remarks gsfs0 FILESET 39889 51200 51200 0 none | 1663212 0 0 4 none [root at scg-gs0 ~]# mmlsfileset gsfs0 |grep gbsc projects.gbsc Linked /srv/gsfs0/projects/gbsc Regards, -- Alex Chekholko chekh at stanford.edu _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From laurence at qsplace.co.uk Mon Sep 12 21:46:55 2016 From: laurence at qsplace.co.uk (Laurence Horrocks-Barlow) Date: Mon, 12 Sep 2016 21:46:55 +0100 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? In-Reply-To: References: Message-ID: <2C38B1C8-66DB-45C6-AA5D-E612F5BFE935@qsplace.co.uk> However replicated files should show up with ls as taking about double the space. I.e. "ls -lash" 49G -r-------- 1 root root 25G Sep 12 21:11 Somefile I know you've said you checked ls vs du for allocated space it might be worth a double check. Also check that you haven't got a load of snapshots, especially if you have high file churn which will create new blocks; although with your figures it'd have to be very high file churn. -- Lauz On 12 September 2016 21:26:51 BST, "Sobey, Richard A" wrote: >My thoughts exactly. > >Richard > >From: gpfsug-discuss-bounces at spectrumscale.org >[mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of >Buterbaugh, Kevin L >Sent: 12 September 2016 20:08 >To: gpfsug main discussion list >Subject: Re: [gpfsug-discuss] big difference between output of >'mmlsquota' and 'du'? > >Hi Alex, > >While the numbers don?t match exactly, they?re close enough to prompt >me to ask if data replication is possibly set to two? Thanks? > >Kevin > >On Sep 12, 2016, at 2:03 PM, Alex Chekholko >> wrote: > >Hi, > >For a fileset with a quota on it, we have mmlsquota reporting 39TB >utilization (out of 50TB quota), with 0 in_doubt. > >Running a 'du' on the same directory (where the fileset is junctioned) >shows 21TB usage. > >I looked for sparse files (files that report different size via ls vs >du). I looked at 'du --apparent-size ...' > >https://en.wikipedia.org/wiki/Sparse_file > >What else could it be? 
> >Is there some attribute I can scan for inside GPFS? >Maybe where FILE_SIZE does not equal KB_ALLOCATED? >https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_usngfileattrbts.htm > > >[root at scg-gs0 ~]# du -sm --apparent-size /srv/gsfs0/projects/gbsc/* >3977 /srv/gsfs0/projects/gbsc/Backups >1 /srv/gsfs0/projects/gbsc/benchmark >13109 /srv/gsfs0/projects/gbsc/Billing >198719 /srv/gsfs0/projects/gbsc/Clinical >1 /srv/gsfs0/projects/gbsc/Clinical_Vendors >1206523 /srv/gsfs0/projects/gbsc/Data >1 /srv/gsfs0/projects/gbsc/iPoP >123165 /srv/gsfs0/projects/gbsc/Macrogen >58676 /srv/gsfs0/projects/gbsc/Misc >6625890 /srv/gsfs0/projects/gbsc/mva >1 /srv/gsfs0/projects/gbsc/Proj >17 /srv/gsfs0/projects/gbsc/Projects >3290502 /srv/gsfs0/projects/gbsc/Resources >1 /srv/gsfs0/projects/gbsc/SeqCenter >1 /srv/gsfs0/projects/gbsc/share >514041 /srv/gsfs0/projects/gbsc/SNAP_Scoring >1 /srv/gsfs0/projects/gbsc/TCGA_Variants >267873 /srv/gsfs0/projects/gbsc/tools >9597797 /srv/gsfs0/projects/gbsc/workspace > >(adds up to about 21TB) > >[root at scg-gs0 ~]# mmlsquota -j projects.gbsc --block-size=G gsfs0 > Block Limits | File Limits >Filesystem type GB quota limit in_doubt >grace | files quota limit in_doubt grace Remarks >gsfs0 FILESET 39889 51200 51200 0 >none | 1663212 0 0 4 none > > >[root at scg-gs0 ~]# mmlsfileset gsfs0 |grep gbsc >projects.gbsc Linked /srv/gsfs0/projects/gbsc > >Regards, >-- >Alex Chekholko chekh at stanford.edu > >_______________________________________________ >gpfsug-discuss mailing list >gpfsug-discuss at spectrumscale.org >http://gpfsug.org/mailman/listinfo/gpfsug-discuss > >? >Kevin Buterbaugh - Senior System Administrator >Vanderbilt University - Advanced Computing Center for Research and >Education >Kevin.Buterbaugh at vanderbilt.edu >- (615)875-9633 > > > > > >------------------------------------------------------------------------ > >_______________________________________________ >gpfsug-discuss mailing list >gpfsug-discuss at spectrumscale.org >http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Sent from my Android device with K-9 Mail. Please excuse my brevity. -------------- next part -------------- An HTML attachment was scrubbed... URL: From janfrode at tanso.net Mon Sep 12 22:37:08 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Mon, 12 Sep 2016 21:37:08 +0000 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? In-Reply-To: References: Message-ID: Maybe you have a huge file open, that's been unlinked and still growing? -jf -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Mon Sep 12 22:59:36 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Mon, 12 Sep 2016 14:59:36 -0700 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and'du'? In-Reply-To: References: Message-ID: 'du' tallies up 'blocks allocated', not file sizes. So it shouldn't matter whether any sparse files are present. GPFS doesn't charge quota for data in snapshots (whether it should is a separate question). The observed discrepancy has two plausible causes: 1) Inaccuracy in quota accounting (more likely) 2) Artefacts of data replication (less likely) Running mmcheckquota in this situation would be a good idea. yuri From: Alex Chekholko To: gpfsug-discuss at spectrumscale.org, Date: 09/12/2016 12:04 PM Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? 
Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi, For a fileset with a quota on it, we have mmlsquota reporting 39TB utilization (out of 50TB quota), with 0 in_doubt. Running a 'du' on the same directory (where the fileset is junctioned) shows 21TB usage. I looked for sparse files (files that report different size via ls vs du). I looked at 'du --apparent-size ...' https://en.wikipedia.org/wiki/Sparse_file What else could it be? Is there some attribute I can scan for inside GPFS? Maybe where FILE_SIZE does not equal KB_ALLOCATED? https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_usngfileattrbts.htm [root at scg-gs0 ~]# du -sm --apparent-size /srv/gsfs0/projects/gbsc/* 3977 /srv/gsfs0/projects/gbsc/Backups 1 /srv/gsfs0/projects/gbsc/benchmark 13109 /srv/gsfs0/projects/gbsc/Billing 198719 /srv/gsfs0/projects/gbsc/Clinical 1 /srv/gsfs0/projects/gbsc/Clinical_Vendors 1206523 /srv/gsfs0/projects/gbsc/Data 1 /srv/gsfs0/projects/gbsc/iPoP 123165 /srv/gsfs0/projects/gbsc/Macrogen 58676 /srv/gsfs0/projects/gbsc/Misc 6625890 /srv/gsfs0/projects/gbsc/mva 1 /srv/gsfs0/projects/gbsc/Proj 17 /srv/gsfs0/projects/gbsc/Projects 3290502 /srv/gsfs0/projects/gbsc/Resources 1 /srv/gsfs0/projects/gbsc/SeqCenter 1 /srv/gsfs0/projects/gbsc/share 514041 /srv/gsfs0/projects/gbsc/SNAP_Scoring 1 /srv/gsfs0/projects/gbsc/TCGA_Variants 267873 /srv/gsfs0/projects/gbsc/tools 9597797 /srv/gsfs0/projects/gbsc/workspace (adds up to about 21TB) [root at scg-gs0 ~]# mmlsquota -j projects.gbsc --block-size=G gsfs0 Block Limits | File Limits Filesystem type GB quota limit in_doubt grace | files quota limit in_doubt grace Remarks gsfs0 FILESET 39889 51200 51200 0 none | 1663212 0 0 4 none [root at scg-gs0 ~]# mmlsfileset gsfs0 |grep gbsc projects.gbsc Linked /srv/gsfs0/projects/gbsc Regards, -- Alex Chekholko chekh at stanford.edu _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From chekh at stanford.edu Mon Sep 12 23:11:12 2016 From: chekh at stanford.edu (Alex Chekholko) Date: Mon, 12 Sep 2016 15:11:12 -0700 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? In-Reply-To: References: Message-ID: Thanks for all the responses. I will look through the filesystem clients for open file handles; we have definitely had deleted open log files of multi-TB size before. The filesystem has replication set to 1. We don't use snapshots. I'm running a 'mmrestripefs -r' (some files were ill-placed from aborted pool migrations) and then I will run an 'mmcheckquota'. On 9/12/16 2:37 PM, Jan-Frode Myklebust wrote: > Maybe you have a huge file open, that's been unlinked and still growing? 
> > > > -jf > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Alex Chekholko chekh at stanford.edu From xhejtman at ics.muni.cz Mon Sep 12 23:30:19 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Tue, 13 Sep 2016 00:30:19 +0200 Subject: [gpfsug-discuss] gpfs snapshots Message-ID: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> Hello, using gpfs 4.2.1, I do about 60 snapshots per day (one snapshot per 15 minutes during working hours). It seems that snapid is increasing only number. Should I be fine with such a number of snapshots per day? I guess we could reach snapid 100,000. I remove all these snapshots during night so I do not keep huge number of snapshots. -- Luk?? Hejtm?nek From volobuev at us.ibm.com Mon Sep 12 23:42:00 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Mon, 12 Sep 2016 15:42:00 -0700 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> Message-ID: The increasing value of snapId is not a problem. Creating snapshots every 15 min is somewhat more frequent than what is customary, but as long as you're able to delete filesets at the same rate you're creating them, this should work OK. yuri From: Lukas Hejtmanek To: gpfsug-discuss at spectrumscale.org, Date: 09/12/2016 03:30 PM Subject: [gpfsug-discuss] gpfs snapshots Sent by: gpfsug-discuss-bounces at spectrumscale.org Hello, using gpfs 4.2.1, I do about 60 snapshots per day (one snapshot per 15 minutes during working hours). It seems that snapid is increasing only number. Should I be fine with such a number of snapshots per day? I guess we could reach snapid 100,000. I remove all these snapshots during night so I do not keep huge number of snapshots. -- Luk?? Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From r.sobey at imperial.ac.uk Tue Sep 13 04:19:30 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Tue, 13 Sep 2016 03:19:30 +0000 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> Message-ID: Don't worry. We do 400+ snapshots every 4 hours and that number is only getting bigger. Don't know what our current snapid count is mind you, can find out when in the office. Get Outlook for Android On Mon, Sep 12, 2016 at 11:30 PM +0100, "Lukas Hejtmanek" > wrote: Hello, using gpfs 4.2.1, I do about 60 snapshots per day (one snapshot per 15 minutes during working hours). It seems that snapid is increasing only number. Should I be fine with such a number of snapshots per day? I guess we could reach snapid 100,000. I remove all these snapshots during night so I do not keep huge number of snapshots. -- Luk?? Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From laurence at qsplace.co.uk Tue Sep 13 05:06:42 2016 From: laurence at qsplace.co.uk (Laurence Horrocks-Barlow) Date: Tue, 13 Sep 2016 05:06:42 +0100 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> Message-ID: <7EAC0DD4-6FC1-4DF5-825E-9E2DD966BA4E@qsplace.co.uk> There are many people doing the same thing so nothing to worry about. As your using 4.2.1 you can at least bulk delete the snapshots using a comma separated list, making life just that little bit easier. -- Lauz On 13 September 2016 04:19:30 BST, "Sobey, Richard A" wrote: >Don't worry. We do 400+ snapshots every 4 hours and that number is only >getting bigger. Don't know what our current snapid count is mind you, >can find out when in the office. > >Get Outlook for Android > > > >On Mon, Sep 12, 2016 at 11:30 PM +0100, "Lukas Hejtmanek" >> wrote: > >Hello, > >using gpfs 4.2.1, I do about 60 snapshots per day (one snapshot per 15 >minutes >during working hours). It seems that snapid is increasing only number. >Should >I be fine with such a number of snapshots per day? I guess we could >reach >snapid 100,000. I remove all these snapshots during night so I do not >keep >huge number of snapshots. > >-- >Luk?? Hejtm?nek >_______________________________________________ >gpfsug-discuss mailing list >gpfsug-discuss at spectrumscale.org >http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > >------------------------------------------------------------------------ > >_______________________________________________ >gpfsug-discuss mailing list >gpfsug-discuss at spectrumscale.org >http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Sent from my Android device with K-9 Mail. Please excuse my brevity. -------------- next part -------------- An HTML attachment was scrubbed... URL: From Valdis.Kletnieks at vt.edu Tue Sep 13 05:32:24 2016 From: Valdis.Kletnieks at vt.edu (Valdis.Kletnieks at vt.edu) Date: Tue, 13 Sep 2016 00:32:24 -0400 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> Message-ID: <20635.1473741144@turing-police.cc.vt.edu> On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: > I guess we could reach snapid 100,000. It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. What you *do* want to do is make the snap *name* meaningful, using a timestamp or something to keep your sanity. mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. From jtucker at pixitmedia.com Tue Sep 13 10:10:02 2016 From: jtucker at pixitmedia.com (Jez Tucker) Date: Tue, 13 Sep 2016 10:10:02 +0100 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: <20635.1473741144@turing-police.cc.vt.edu> References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> <20635.1473741144@turing-police.cc.vt.edu> Message-ID: <00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com> Hey Yuri, Perhaps an RFE here, but could I suggest there is much value in adding a -c option to mmcrsnapshot? Use cases: mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "Before phase 2" and mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "expire:GMT-2017.04.21-16.00.00" Ideally also: mmcrsnapshot fs1 fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr Then it's easy to iterate over snapshots and subsequently mmdelsnapshot snaps which are no longer required. 
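Even without a comment field, that loop is workable today when the snapshot name itself carries the creation time. A rough sketch, assuming snapshots were created with names like 160913-1015 (i.e. date +%y%m%d-%H%M) and that mmlssnapshot prints its usual two header lines, which is worth double-checking on your release:

#!/bin/bash
# illustrative only: drop global snapshots of gpfs0 older than 48 hours
fs=gpfs0
cutoff=$(date -d '48 hours ago' +%y%m%d-%H%M)
# column 1 of mmlssnapshot is the snapshot name; keep only timestamp-shaped names
expired=$(/usr/lpp/mmfs/bin/mmlssnapshot $fs | awk -v c="$cutoff" \
    'NR > 2 && $1 ~ /^[0-9][0-9][0-9][0-9][0-9][0-9]-[0-9][0-9][0-9][0-9]$/ && $1 < c {print $1}' | paste -sd, -)
# 4.2.1 accepts a comma-separated list (as noted earlier in the thread), so one call per sweep;
# fileset-level snapshots would need the matching -j option
[ -n "$expired" ] && /usr/lpp/mmfs/bin/mmdelsnapshot $fs "$expired"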
There are lots of methods to achieve this, but without external databases / suchlike, this is rather simple and effective for end users. Alternatively a second comment like -expire flag as user metadata may be preferential. Thoughts? Jez On 13/09/16 05:32, Valdis.Kletnieks at vt.edu wrote: > On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: >> I guess we could reach snapid 100,000. > It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. > > What you *do* want to do is make the snap *name* meaningful, using > a timestamp or something to keep your sanity. > > mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Jez Tucker Head of Research & Product Development Pixit Media www.pixitmedia.com -- This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email. -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Tue Sep 13 21:51:16 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Tue, 13 Sep 2016 13:51:16 -0700 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: <00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com> References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz><20635.1473741144@turing-police.cc.vt.edu> <00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com> Message-ID: Hi Jez, It sounds to me like the functionality that you're _really_ looking for is an ability to to do automated snapshot management, similar to what's available on other storage systems. For example, "create a new snapshot of filesets X, Y, Z every 30 min, keep the last 16 snapshots". I've seen many examples of sysadmins rolling their own snapshot management system along those lines, and an ability to add an expiration string as a snapshot "comment" appears to be merely an aid in keeping such DIY snapshot management scripts a bit simpler -- not by much though. The end user would still be on the hook for some heavy lifting, in particular figuring out a way to run an equivalent of a cluster-aware cron with acceptable fault tolerance semantics. That is, if a snapshot creation is scheduled, only one node in the cluster should attempt to create the snapshot, but if that node fails, another node needs to step in (as opposed to skipping the scheduled snapshot creation). This is doable outside of GPFS, of course, but is not trivial. Architecturally, the right place to implement a fault-tolerant cluster-aware scheduling framework is GPFS itself, as the most complex pieces are already there. We have some plans for work along those lines, but if you want to reinforce the point with an RFE, that would be fine, too. yuri From: Jez Tucker To: gpfsug-discuss at spectrumscale.org, Date: 09/13/2016 02:10 AM Subject: Re: [gpfsug-discuss] gpfs snapshots Sent by: gpfsug-discuss-bounces at spectrumscale.org Hey Yuri, ? 
Perhaps an RFE here, but could I suggest there is much value in adding a -c option to mmcrsnapshot? Use cases: mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "Before phase 2" and mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "expire:GMT-2017.04.21-16.00.00" Ideally also: mmcrsnapshot fs1 fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr Then it's easy to iterate over snapshots and subsequently mmdelsnapshot snaps which are no longer required. There are lots of methods to achieve this, but without external databases / suchlike, this is rather simple and effective for end users. Alternatively a second comment like -expire flag as user metadata may be preferential. Thoughts? Jez On 13/09/16 05:32, Valdis.Kletnieks at vt.edu wrote: On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: I guess we could reach snapid 100,000. It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. What you *do* want to do is make the snap *name* meaningful, using a timestamp or something to keep your sanity. mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Jez Tucker Head of Research & Product Development Pixit Media www.pixitmedia.com This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From xhejtman at ics.muni.cz Tue Sep 13 21:57:52 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Tue, 13 Sep 2016 22:57:52 +0200 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> Message-ID: <20160913205752.3lmmfbhm25mu77j4@ics.muni.cz> Yuri et al. thank you for answers, I should be fine with snapshots as you suggest. On Mon, Sep 12, 2016 at 03:42:00PM -0700, Yuri L Volobuev wrote: > The increasing value of snapId is not a problem. Creating snapshots every > 15 min is somewhat more frequent than what is customary, but as long as > you're able to delete filesets at the same rate you're creating them, this > should work OK. > > yuri > > > > From: Lukas Hejtmanek > To: gpfsug-discuss at spectrumscale.org, > Date: 09/12/2016 03:30 PM > Subject: [gpfsug-discuss] gpfs snapshots > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Hello, > > using gpfs 4.2.1, I do about 60 snapshots per day (one snapshot per 15 > minutes > during working hours). It seems that snapid is increasing only number. > Should > I be fine with such a number of snapshots per day? I guess we could reach > snapid 100,000. 
I remove all these snapshots during night so I do not keep > huge number of snapshots. > > -- > Luk?? Hejtm?nek > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Luk?? Hejtm?nek From S.J.Thompson at bham.ac.uk Tue Sep 13 22:21:59 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Tue, 13 Sep 2016 21:21:59 +0000 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz><20635.1473741144@turing-police.cc.vt.edu> <00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com>, Message-ID: I thought the GUI implemented some form of snapshot scheduler. Personal opinion is that is the wrong place and I agree that is should be core functionality to ensure that the scheduler is running properly. But I would suggest that it might be more than just snapshots people might want to schedule. E.g. An ilm pool flush. Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Yuri L Volobuev [volobuev at us.ibm.com] Sent: 13 September 2016 21:51 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] gpfs snapshots Hi Jez, It sounds to me like the functionality that you're _really_ looking for is an ability to to do automated snapshot management, similar to what's available on other storage systems. For example, "create a new snapshot of filesets X, Y, Z every 30 min, keep the last 16 snapshots". I've seen many examples of sysadmins rolling their own snapshot management system along those lines, and an ability to add an expiration string as a snapshot "comment" appears to be merely an aid in keeping such DIY snapshot management scripts a bit simpler -- not by much though. The end user would still be on the hook for some heavy lifting, in particular figuring out a way to run an equivalent of a cluster-aware cron with acceptable fault tolerance semantics. That is, if a snapshot creation is scheduled, only one node in the cluster should attempt to create the snapshot, but if that node fails, another node needs to step in (as opposed to skipping the scheduled snapshot creation). This is doable outside of GPFS, of course, but is not trivial. Architecturally, the right place to implement a fault-tolerant cluster-aware scheduling framework is GPFS itself, as the most complex pieces are already there. We have some plans for work along those lines, but if you want to reinforce the point with an RFE, that would be fine, too. yuri [Inactive hide details for Jez Tucker ---09/13/2016 02:10:31 AM---Hey Yuri, Perhaps an RFE here, but could I suggest there is]Jez Tucker ---09/13/2016 02:10:31 AM---Hey Yuri, Perhaps an RFE here, but could I suggest there is much value in From: Jez Tucker To: gpfsug-discuss at spectrumscale.org, Date: 09/13/2016 02:10 AM Subject: Re: [gpfsug-discuss] gpfs snapshots Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hey Yuri, Perhaps an RFE here, but could I suggest there is much value in adding a -c option to mmcrsnapshot? 
Use cases: mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "Before phase 2" and mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "expire:GMT-2017.04.21-16.00.00" Ideally also: mmcrsnapshot fs1 fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr Then it's easy to iterate over snapshots and subsequently mmdelsnapshot snaps which are no longer required. There are lots of methods to achieve this, but without external databases / suchlike, this is rather simple and effective for end users. Alternatively a second comment like -expire flag as user metadata may be preferential. Thoughts? Jez On 13/09/16 05:32, Valdis.Kletnieks at vt.edu wrote: On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: I guess we could reach snapid 100,000. It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. What you *do* want to do is make the snap *name* meaningful, using a timestamp or something to keep your sanity. mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Jez Tucker Head of Research & Product Development Pixit Media www.pixitmedia.com This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: graycol.gif URL: From mark.bergman at uphs.upenn.edu Tue Sep 13 22:23:57 2016 From: mark.bergman at uphs.upenn.edu (mark.bergman at uphs.upenn.edu) Date: Tue, 13 Sep 2016 17:23:57 -0400 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: Your message of "Tue, 13 Sep 2016 13:51:16 -0700." References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz><20635.1473741144@turing-police.cc.vt.edu> <00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com> Message-ID: <19294-1473801837.563347@J_5h.TM7K.YXzn> In the message dated: Tue, 13 Sep 2016 13:51:16 -0700, The pithy ruminations from Yuri L Volobuev on were: => => Hi Jez, => => It sounds to me like the functionality that you're _really_ looking for is => an ability to to do automated snapshot management, similar to what's Yep. => available on other storage systems. For example, "create a new snapshot of => filesets X, Y, Z every 30 min, keep the last 16 snapshots". I've seen many Or, take a snapshot every 15min, keep the 4 most recent, expire all except 4 that were created within 6hrs, only 4 created between 6:01-24:00 hh:mm ago, and expire all-but-2 created between 24:01-48:00, etc, as we do. => examples of sysadmins rolling their own snapshot management system along => those lines, and an ability to add an expiration string as a snapshot I'd be glad to distribute our local example of this exercise. 
=> "comment" appears to be merely an aid in keeping such DIY snapshot => management scripts a bit simpler -- not by much though. The end user would => still be on the hook for some heavy lifting, in particular figuring out a => way to run an equivalent of a cluster-aware cron with acceptable fault => tolerance semantics. That is, if a snapshot creation is scheduled, only => one node in the cluster should attempt to create the snapshot, but if that => node fails, another node needs to step in (as opposed to skipping the => scheduled snapshot creation). This is doable outside of GPFS, of course, => but is not trivial. Architecturally, the right place to implement a Ah, that part really is trivial....In our case, the snapshot program takes the filesystem name as an argument... we simply rely on the GPFS fault detection/failover. The job itself runs (via cron) on every GPFS server node, but only creates the snapshot on the server that is the active manager for the specified filesystem: ############################################################################## # Check if the node where this script is running is the GPFS manager node for the # specified filesystem manager=`/usr/lpp/mmfs/bin/mmlsmgr $filesys | grep -w "^$filesys" |awk '{print $2}'` ip addr list | grep -qw "$manager" if [ $? != 0 ] ; then # This node is not the manager...exit exit fi # else ... continue and create the snapshot ################################################################################################### => => yuri => => -- Mark Bergman voice: 215-746-4061 mark.bergman at uphs.upenn.edu fax: 215-614-0266 http://www.cbica.upenn.edu/ IT Technical Director, Center for Biomedical Image Computing and Analytics Department of Radiology University of Pennsylvania PGP Key: http://www.cbica.upenn.edu/sbia/bergman From jtolson at us.ibm.com Tue Sep 13 22:47:02 2016 From: jtolson at us.ibm.com (John T Olson) Date: Tue, 13 Sep 2016 14:47:02 -0700 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz><20635.1473741144@turing-police.cc.vt.edu><00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com>, Message-ID: We do have a general-purpose scheduler on the books as an item that is needed for a future release and as Yuri mentioned it would be cluster wide to avoid the single point of failure with tools like Cron. However, it's one of many things we want to try to get into the product and so we don't have any definite timeline yet. Thanks, John John T. Olson, Ph.D., MI.C., K.EY. Master Inventor, Software Defined Storage 957/9032-1 Tucson, AZ, 85744 (520) 799-5185, tie 321-5185 (FAX: 520-799-4237) Email: jtolson at us.ibm.com "Do or do not. There is no try." - Yoda Olson's Razor: Any situation that we, as humans, can encounter in life can be modeled by either an episode of The Simpsons or Seinfeld. From: "Simon Thompson (Research Computing - IT Services)" To: gpfsug main discussion list Date: 09/13/2016 02:22 PM Subject: Re: [gpfsug-discuss] gpfs snapshots Sent by: gpfsug-discuss-bounces at spectrumscale.org I thought the GUI implemented some form of snapshot scheduler. Personal opinion is that is the wrong place and I agree that is should be core functionality to ensure that the scheduler is running properly. But I would suggest that it might be more than just snapshots people might want to schedule. E.g. An ilm pool flush. 
Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Yuri L Volobuev [volobuev at us.ibm.com] Sent: 13 September 2016 21:51 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] gpfs snapshots Hi Jez, It sounds to me like the functionality that you're _really_ looking for is an ability to to do automated snapshot management, similar to what's available on other storage systems. For example, "create a new snapshot of filesets X, Y, Z every 30 min, keep the last 16 snapshots". I've seen many examples of sysadmins rolling their own snapshot management system along those lines, and an ability to add an expiration string as a snapshot "comment" appears to be merely an aid in keeping such DIY snapshot management scripts a bit simpler -- not by much though. The end user would still be on the hook for some heavy lifting, in particular figuring out a way to run an equivalent of a cluster-aware cron with acceptable fault tolerance semantics. That is, if a snapshot creation is scheduled, only one node in the cluster should attempt to create the snapshot, but if that node fails, another node needs to step in (as opposed to skipping the scheduled snapshot creation). This is doable outside of GPFS, of course, but is not trivial. Architecturally, the right place to implement a fault-tolerant cluster-aware scheduling framework is GPFS itself, as the most complex pieces are already there. We have some plans for work along those lines, but if you want to reinforce the point with an RFE, that would be fine, too. yuri [Inactive hide details for Jez Tucker ---09/13/2016 02:10:31 AM---Hey Yuri, Perhaps an RFE here, but could I suggest there is]Jez Tucker ---09/13/2016 02:10:31 AM---Hey Yuri, Perhaps an RFE here, but could I suggest there is much value in From: Jez Tucker To: gpfsug-discuss at spectrumscale.org, Date: 09/13/2016 02:10 AM Subject: Re: [gpfsug-discuss] gpfs snapshots Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hey Yuri, Perhaps an RFE here, but could I suggest there is much value in adding a -c option to mmcrsnapshot? Use cases: mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "Before phase 2" and mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "expire:GMT-2017.04.21-16.00.00" Ideally also: mmcrsnapshot fs1 fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr Then it's easy to iterate over snapshots and subsequently mmdelsnapshot snaps which are no longer required. There are lots of methods to achieve this, but without external databases / suchlike, this is rather simple and effective for end users. Alternatively a second comment like -expire flag as user metadata may be preferential. Thoughts? Jez On 13/09/16 05:32, Valdis.Kletnieks at vt.edu wrote: On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: I guess we could reach snapid 100,000. It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. What you *do* want to do is make the snap *name* meaningful, using a timestamp or something to keep your sanity. mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. 
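For illustration only, a minimal rotation sketch stitched together from the commands already in this thread; the file system name, fileset name and retention count are placeholders, the mmlssnapshot parsing is deliberately naive (it assumes only this fileset uses @GMT- snapshot names), and the -j form follows the examples above (newer releases also accept a fileset:snapshot form, as comes up later in this archive):

fs=gpfs0 ; fset=myfilesetname ; keep=16                  # placeholders
snap="@GMT-$(date -u +%Y.%m.%d-%H.%M.%S)"                # timestamped, sorts chronologically
/usr/lpp/mmfs/bin/mmcrsnapshot $fs $snap -j $fset
# keep only the newest $keep snapshots (GNU head)
/usr/lpp/mmfs/bin/mmlssnapshot $fs | awk '$1 ~ /^@GMT-/ {print $1}' | sort | head -n -$keep |
while read old ; do
    /usr/lpp/mmfs/bin/mmdelsnapshot $fs $old -j $fset
done

Run from a single designated node (or gated with an mmlsmgr test like the one shown earlier in this archive) it covers the "only one node creates the snapshot" concern without any external database.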
_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Jez Tucker Head of Research & Product Development Pixit Media www.pixitmedia.com This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss (See attached file: graycol.gif) _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From jtucker at pixitmedia.com Tue Sep 13 23:28:22 2016 From: jtucker at pixitmedia.com (Jez Tucker) Date: Tue, 13 Sep 2016 23:28:22 +0100 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> <20635.1473741144@turing-police.cc.vt.edu> <00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com> Message-ID: <2336bbd5-39ca-dc0d-e1b4-7a301c6b9f2e@pixitmedia.com> Hey So yes, you're quite right - we have higher order fault tolerant cluster wide methods of dealing with such requirements already. However, I still think the end user should be empowered to be able construct such methods themselves if needs be. Yes, the comment is merely an aid [but also useful as a generic comment field] and as such could be utilised to encode basic metadata into the comment field. I'll log an RFE and see where we go from here. Cheers Jez On 13/09/16 21:51, Yuri L Volobuev wrote: > > Hi Jez, > > It sounds to me like the functionality that you're _really_ looking > for is an ability to to do automated snapshot management, similar to > what's available on other storage systems. For example, "create a new > snapshot of filesets X, Y, Z every 30 min, keep the last 16 > snapshots". I've seen many examples of sysadmins rolling their own > snapshot management system along those lines, and an ability to add an > expiration string as a snapshot "comment" appears to be merely an aid > in keeping such DIY snapshot management scripts a bit simpler -- not > by much though. The end user would still be on the hook for some heavy > lifting, in particular figuring out a way to run an equivalent of a > cluster-aware cron with acceptable fault tolerance semantics. That is, > if a snapshot creation is scheduled, only one node in the cluster > should attempt to create the snapshot, but if that node fails, another > node needs to step in (as opposed to skipping the scheduled snapshot > creation). 
This is doable outside of GPFS, of course, but is not > trivial. Architecturally, the right place to implement a > fault-tolerant cluster-aware scheduling framework is GPFS itself, as > the most complex pieces are already there. We have some plans for work > along those lines, but if you want to reinforce the point with an RFE, > that would be fine, too. > > yuri > > Inactive hide details for Jez Tucker ---09/13/2016 02:10:31 AM---Hey > Yuri, Perhaps an RFE here, but could I suggest there isJez Tucker > ---09/13/2016 02:10:31 AM---Hey Yuri, Perhaps an RFE here, but could I > suggest there is much value in > > From: Jez Tucker > To: gpfsug-discuss at spectrumscale.org, > Date: 09/13/2016 02:10 AM > Subject: Re: [gpfsug-discuss] gpfs snapshots > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------------------------------------------------ > > > > Hey Yuri, > > Perhaps an RFE here, but could I suggest there is much value in > adding a -c option to mmcrsnapshot? > > Use cases: > > mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c > "Before phase 2" > > and > > mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c > "expire:GMT-2017.04.21-16.00.00" > > Ideally also: mmcrsnapshot fs1 > fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr > > Then it's easy to iterate over snapshots and subsequently > mmdelsnapshot snaps which are no longer required. > There are lots of methods to achieve this, but without external > databases / suchlike, this is rather simple and effective for end users. > > Alternatively a second comment like -expire flag as user metadata may > be preferential. > > Thoughts? > > Jez > > > On 13/09/16 05:32, _Valdis.Kletnieks at vt.edu_ > wrote: > > On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: > I guess we could reach snapid 100,000. > > It probably stores the snap ID as a 32 or 64 bit int, so 100K > is peanuts. > > What you *do* want to do is make the snap *name* meaningful, using > a timestamp or something to keep your sanity. > > mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > _http://gpfsug.org/mailman/listinfo/gpfsug-discuss_ > > > > -- > Jez Tucker > Head of Research & Product Development > Pixit Media_ > __www.pixitmedia.com_ > > > This email is confidential in that it is intended for the exclusive > attention of the addressee(s) indicated. If you are not the intended > recipient, this email should not be read or disclosed to any other > person. Please notify the sender immediately and delete this email > from your computer system. 
Any opinions expressed are not necessarily > those of the company from which this email was sent and, whilst to the > best of our knowledge no viruses or defects exist, no responsibility > can be accepted for any loss or damage arising from its receipt or > subsequent use of this > email._______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Jez Tucker Head of Research & Product Development Pixit Media Mobile: +44 (0) 776 419 3820 www.pixitmedia.com -- This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 105 bytes Desc: not available URL: From service at metamodul.com Wed Sep 14 19:10:37 2016 From: service at metamodul.com (service at metamodul.com) Date: Wed, 14 Sep 2016 20:10:37 +0200 Subject: [gpfsug-discuss] gpfs snapshots Message-ID: Why not use a GPFS user extented attribut for that ? In a certain way i see GPFS as a database. ^_^ Hajo Von Samsung Mobile gesendet
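Purely as a hypothetical sketch of that idea (the marker file, junction path and attribute name below are invented, and it assumes the usual .snapshots directory under an independent fileset's junction): stamp the expiry into the fileset with a user extended attribute just before the snapshot is taken, so the read-only snapshot carries it and a cleanup pass can read it back later.

junction=/gpfs/myfsname/myfilesetname                 # placeholder: the fileset junction path
touch $junction/.snapexpiry                           # hypothetical marker file
setfattr -n user.snapexpire -v "GMT-2017.04.21-16.00.00" $junction/.snapexpiry
mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname
# later, read the expiry back from the snapshot's copy of the marker:
getfattr -n user.snapexpire --only-values \
    $junction/.snapshots/@GMT-2016.09.13-10.00.00/.snapexpiry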
-------- Original message --------
From: Jez Tucker
Date: 2016.09.13 11:10 (GMT+01:00)
To: gpfsug-discuss at spectrumscale.org
Subject: Re: [gpfsug-discuss] gpfs snapshots
Hey Yuri, Perhaps an RFE here, but could I suggest there is much value in adding a -c option to mmcrsnapshot? Use cases: mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "Before phase 2" and mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "expire:GMT-2017.04.21-16.00.00" Ideally also: mmcrsnapshot fs1 fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr Then it's easy to iterate over snapshots and subsequently mmdelsnapshot snaps which are no longer required. There are lots of methods to achieve this, but without external databases / suchlike, this is rather simple and effective for end users. Alternatively a second comment like -expire flag as user metadata may be preferential. Thoughts? Jez On 13/09/16 05:32, Valdis.Kletnieks at vt.edu wrote: On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: I guess we could reach snapid 100,000. It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. What you *do* want to do is make the snap *name* meaningful, using a timestamp or something to keep your sanity. mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Jez Tucker Head of Research & Product Development Pixit Media www.pixitmedia.com This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email. -------------- next part -------------- An HTML attachment was scrubbed... URL: From service at metamodul.com Wed Sep 14 19:21:20 2016 From: service at metamodul.com (service at metamodul.com) Date: Wed, 14 Sep 2016 20:21:20 +0200 Subject: [gpfsug-discuss] gpfs snapshots Message-ID: <4fojjlpuwqoalkffaahy7snf.1473877280415@email.android.com> I am missing since ages such a framework. I had my simple one devoloped on the gpfs callbacks which allowed me to have a centralized cron (HA) up to oracle also ?high available and ha nfs on Aix. Hajo Universal Inventor? -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From jtucker at pixitmedia.com Wed Sep 14 19:49:36 2016 From: jtucker at pixitmedia.com (Jez Tucker) Date: Wed, 14 Sep 2016 19:49:36 +0100 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: References: Message-ID: Hi I still think I'm coming down on the side of simplistic ease of use: Example: [jtucker at pixstor ~]# mmlssnapshot mmfs1 Snapshots in file system mmfs1: Directory SnapId Status Created Fileset Comment @GMT-2016.09.13-23.00.14 551 Valid Wed Sep 14 00:00:02 2016 myproject Prior to phase 1 @GMT-2016.09.14-05.00.14 552 Valid Wed Sep 14 06:00:01 2016 myproject Added this and that @GMT-2016.09.14-11.00.14 553 Valid Wed Sep 14 12:00:01 2016 myproject Merged project2 @GMT-2016.09.14-17.00.14 554 Valid Wed Sep 14 18:00:02 2016 myproject Before clean of .xmp @GMT-2016.09.14-17.05.30 555 Valid Wed Sep 14 18:05:03 2016 myproject Archival Jez On 14/09/16 19:10, service at metamodul.com wrote: > Why not use a GPFS user extented attribut for that ? > In a certain way i see GPFS as a database. ^_^ > Hajo > > > > Von Samsung Mobile gesendet > > > -------- Urspr?ngliche Nachricht -------- > Von: Jez Tucker > Datum:2016.09.13 11:10 (GMT+01:00) > An: gpfsug-discuss at spectrumscale.org > Betreff: Re: [gpfsug-discuss] gpfs snapshots > > Hey Yuri, > > Perhaps an RFE here, but could I suggest there is much value in > adding a -c option to mmcrsnapshot? > > Use cases: > > mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c > "Before phase 2" > > and > > mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c > "expire:GMT-2017.04.21-16.00.00" > > Ideally also: mmcrsnapshot fs1 > fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr > > Then it's easy to iterate over snapshots and subsequently > mmdelsnapshot snaps which are no longer required. > There are lots of methods to achieve this, but without external > databases / suchlike, this is rather simple and effective for end users. > > Alternatively a second comment like -expire flag as user metadata may > be preferential. > > Thoughts? > > Jez > > > On 13/09/16 05:32, Valdis.Kletnieks at vt.edu wrote: >> On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: >>> I guess we could reach snapid 100,000. >> It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. >> >> What you *do* want to do is make the snap *name* meaningful, using >> a timestamp or something to keep your sanity. >> >> mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > -- > Jez Tucker > Head of Research & Product Development > Pixit Media > www.pixitmedia.com > -- Jez Tucker Head of Research & Product Development Pixit Media www.pixitmedia.com -- This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email. -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From secretary at gpfsug.org Thu Sep 15 09:42:54 2016 From: secretary at gpfsug.org (Secretary GPFS UG) Date: Thu, 15 Sep 2016 09:42:54 +0100 Subject: [gpfsug-discuss] SSUG Meet the Devs - Cloud Workshop Message-ID: Hi everyone, Back by popular demand! We are holding a UK 'Meet the Developers' event to focus on Cloud topics. We are very lucky to have Dean Hildebrand, Master Inventor, Cloud Storage Software from IBM over in the UK to lead this session. IS IT FOR ME? Slightly different to the past meet the devs format, this is a cloud workshop aimed at looking how Spectrum Scale fits in the world of cloud. Rather than being a series of presentations and discussions led by IBM, this workshop aims to look at how Spectrum Scale can be used in cloud environments. This will include using Spectrum Scale as an infrastructure tool to power private cloud deployments. We will also look at the challenges of accessing data from cloud deployments and discuss ways in which this might be accomplished. If you are currently deploying OpenStack on Spectrum Scale, or plan to in the near future, then this workshop is for you. Also if you currently have Spectrum Scale and are wondering how you might get that data into cloud-enabled workloads or are currently doing so, then again you should attend. To ensure that the workshop is focused, numbers are limited and we will initially be limiting to 2 people per organisation/project/site. WHAT WILL BE DISCUSSED? Our topics for the day will include on-premise (private) clouds, on-premise self-service (public) clouds, off-premise clouds (Amazon etc.) as well as covering technologies including OpenStack, Docker, Kubernetes and security requirements around multi-tenancy. We probably don't have all the answers for these, but we'd like to understand the requirements and hear people's ideas. Please let us know what you would like to discuss when you register. Arrival is from 10:00 with discussion kicking off from 10:30. The agenda is open discussion though we do aim to talk over a number of key topics. We hope to have the ever popular (though usually late!) pizza for lunch. WHEN Thursday 20th October 2016 from 10:00 AM to 3:30 PM WHERE IT Services, University of Birmingham - Elms Road Edgbaston, Birmingham, B15 2TT REGISTER Please register for the event in advance: https://www.eventbrite.com/e/ssug-meet-the-devs-cloud-workshop-tickets-27725390389 [1] Numbers are limited and we will initially be limiting to 2 people per organisation/project/site. We look forward to seeing you there! -- Claire O'Toole Spectrum Scale/GPFS User Group Secretary +44 (0)7508 033896 www.spectrumscaleug.org Links: ------ [1] https://www.eventbrite.com/e/ssug-meet-the-devs-cloud-workshop-tickets-27725390389 -------------- next part -------------- An HTML attachment was scrubbed... URL: From peter.botcherby at kcl.ac.uk Thu Sep 15 09:45:47 2016 From: peter.botcherby at kcl.ac.uk (Botcherby, Peter) Date: Thu, 15 Sep 2016 08:45:47 +0000 Subject: [gpfsug-discuss] SSUG Meet the Devs - Cloud Workshop In-Reply-To: References: Message-ID: Hi Claire, Hope you are well - I will be away for this as going to Indonesia on the 18th October for my nephew?s wedding. Regards Peter From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Secretary GPFS UG Sent: 15 September 2016 09:43 To: gpfsug main discussion list Subject: [gpfsug-discuss] SSUG Meet the Devs - Cloud Workshop Hi everyone, Back by popular demand! 
We are holding a UK 'Meet the Developers' event to focus on Cloud topics. We are very lucky to have Dean Hildebrand, Master Inventor, Cloud Storage Software from IBM over in the UK to lead this session. IS IT FOR ME? Slightly different to the past meet the devs format, this is a cloud workshop aimed at looking how Spectrum Scale fits in the world of cloud. Rather than being a series of presentations and discussions led by IBM, this workshop aims to look at how Spectrum Scale can be used in cloud environments. This will include using Spectrum Scale as an infrastructure tool to power private cloud deployments. We will also look at the challenges of accessing data from cloud deployments and discuss ways in which this might be accomplished. If you are currently deploying OpenStack on Spectrum Scale, or plan to in the near future, then this workshop is for you. Also if you currently have Spectrum Scale and are wondering how you might get that data into cloud-enabled workloads or are currently doing so, then again you should attend. To ensure that the workshop is focused, numbers are limited and we will initially be limiting to 2 people per organisation/project/site. WHAT WILL BE DISCUSSED? Our topics for the day will include on-premise (private) clouds, on-premise self-service (public) clouds, off-premise clouds (Amazon etc.) as well as covering technologies including OpenStack, Docker, Kubernetes and security requirements around multi-tenancy. We probably don't have all the answers for these, but we'd like to understand the requirements and hear people's ideas. Please let us know what you would like to discuss when you register. Arrival is from 10:00 with discussion kicking off from 10:30. The agenda is open discussion though we do aim to talk over a number of key topics. We hope to have the ever popular (though usually late!) pizza for lunch. WHEN Thursday 20th October 2016 from 10:00 AM to 3:30 PM WHERE IT Services, University of Birmingham - Elms Road Edgbaston, Birmingham, B15 2TT REGISTER Please register for the event in advance: https://www.eventbrite.com/e/ssug-meet-the-devs-cloud-workshop-tickets-27725390389 Numbers are limited and we will initially be limiting to 2 people per organisation/project/site. We look forward to seeing you there! -- Claire O'Toole Spectrum Scale/GPFS User Group Secretary +44 (0)7508 033896 www.spectrumscaleug.org -------------- next part -------------- An HTML attachment was scrubbed... URL: From mimarsh2 at vt.edu Thu Sep 15 17:49:27 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Thu, 15 Sep 2016 12:49:27 -0400 Subject: [gpfsug-discuss] EDR and omnipath Message-ID: All, I see in the GPFS FAQ A6.3 the statement below. Is it possible to have GPFS do RDMA over EDR infiniband and non-RDMA communication over omnipath (IP over fabric) when each NSD server has an EDR card and a OPA card installed? RDMA is not supported on a node when both Mellanox HCAs and Intel Omni-Path HFIs are enabled for RDMA. -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Thu Sep 15 16:33:17 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Thu, 15 Sep 2016 15:33:17 +0000 Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes Message-ID: Why was mmdelsnapshot (and possibly other snapshot related commands) changed from: Mmdelsnapshot device snapshotname -j filesetname TO Mmdelsnapshot device filesetname:snapshotname ..between 4.2.0 and 4.2.1? It's mildly irritating to say the least! 
Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Fri Sep 16 15:21:58 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Fri, 16 Sep 2016 10:21:58 -0400 Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes In-Reply-To: References: Message-ID: I think at least the most popular old forms still work, even if the documentation and usage messages were scrubbed. So generally, for example, neither your scripts nor your fingers will break. ;-) From: "Sobey, Richard A" To: "'gpfsug-discuss at spectrumscale.org'" Date: 09/16/2016 07:02 AM Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes Sent by: gpfsug-discuss-bounces at spectrumscale.org Why was mmdelsnapshot (and possibly other snapshot related commands) changed from: Mmdelsnapshot device snapshotname ?j filesetname TO Mmdelsnapshot device filesetname:snapshotname ..between 4.2.0 and 4.2.1? It?s mildly irritating to say the least! Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Fri Sep 16 15:40:52 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 16 Sep 2016 14:40:52 +0000 Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes In-Reply-To: References: Message-ID: Thanks Marc. Regrettably in this case, the only way I knew to delete a snapshot (listed below) has broken going from 3.5 to 4.2.1. Creating snaps has suffered the same fate. From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Marc A Kaplan Sent: 16 September 2016 15:22 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes I think at least the most popular old forms still work, even if the documentation and usage messages were scrubbed. So generally, for example, neither your scripts nor your fingers will break. ;-) From: "Sobey, Richard A" > To: "'gpfsug-discuss at spectrumscale.org'" > Date: 09/16/2016 07:02 AM Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Why was mmdelsnapshot (and possibly other snapshot related commands) changed from: Mmdelsnapshot device snapshotname ?j filesetname TO Mmdelsnapshot device filesetname:snapshotname ..between 4.2.0 and 4.2.1? It?s mildly irritating to say the least! Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From Paul.Sanchez at deshaw.com Fri Sep 16 20:49:14 2016 From: Paul.Sanchez at deshaw.com (Sanchez, Paul) Date: Fri, 16 Sep 2016 19:49:14 +0000 Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes In-Reply-To: References: Message-ID: <3e1f02b30e1a49ef950de7910801f5d1@mbxtoa1.winmail.deshaw.com> The old syntax works unless have a colon in your snapshot names. In that case, the portion before the first colon will be interpreted as a fileset name. 
So if you use RFC 3339/ISO 8601 date/times, that?s a problem: The syntax for creating and deleting snapshots goes from this: mm{cr|del}snapshot fs100 SNAP at 2016-07-31T13:00:07Z ?j 1000466 to this: mm{cr|del}snapshot fs100 1000466:SNAP at 2016-07-31T13:00:07Z If you are dealing with filesystem level snapshots then you just need a leading colon: mm{cr|del}snapshot fs100 :SNAP at 2016-07-31T13:00:07Z Thx Paul From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Marc A Kaplan Sent: Friday, September 16, 2016 10:22 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes I think at least the most popular old forms still work, even if the documentation and usage messages were scrubbed. So generally, for example, neither your scripts nor your fingers will break. ;-) From: "Sobey, Richard A" > To: "'gpfsug-discuss at spectrumscale.org'" > Date: 09/16/2016 07:02 AM Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Why was mmdelsnapshot (and possibly other snapshot related commands) changed from: Mmdelsnapshot device snapshotname ?j filesetname TO Mmdelsnapshot device filesetname:snapshotname ..between 4.2.0 and 4.2.1? It?s mildly irritating to say the least! Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From viccornell at gmail.com Mon Sep 19 08:11:38 2016 From: viccornell at gmail.com (Vic Cornell) Date: Mon, 19 Sep 2016 08:11:38 +0100 Subject: [gpfsug-discuss] EDR and omnipath In-Reply-To: References: Message-ID: <87E46193-4D65-41A3-AB0E-B12987F6FFC3@gmail.com> Bump I can see no reason why that wouldn't work. But it would be nice to a have an official answer or evidence that it works. Vic > On 15 Sep 2016, at 5:49 pm, Brian Marshall wrote: > > All, > > I see in the GPFS FAQ A6.3 the statement below. Is it possible to have GPFS do RDMA over EDR infiniband and non-RDMA communication over omnipath (IP over fabric) when each NSD server has an EDR card and a OPA card installed? > > > > RDMA is not supported on a node when both Mellanox HCAs and Intel Omni-Path HFIs are enabled for RDMA. > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From mweil at wustl.edu Mon Sep 19 20:18:18 2016 From: mweil at wustl.edu (Matt Weil) Date: Mon, 19 Sep 2016 14:18:18 -0500 Subject: [gpfsug-discuss] increasing inode Message-ID: All, What exactly happens that makes the clients hang when a file set inodes are increased? ________________________________ The materials in this message are private and may contain Protected Healthcare Information or other information of a sensitive nature. If you are not the intended recipient, be advised that any unauthorized use, disclosure, copying or the taking of any action in reliance on the contents of this information is strictly prohibited. If you have received this email in error, please immediately notify the sender via telephone or return mail. 
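Not an answer to the "what exactly happens" part, but for anyone planning the operation, a minimal sketch of inspecting and pre-growing a fileset's inode allocation during a quiet window; the file system and fileset names are placeholders and exact option spellings vary a little between releases:

mmlsfileset gpfs0 somefileset -L          # current maximum and allocated inodes for the fileset
# raise the maximum and preallocate part of it in one step, at a planned time
mmchfileset gpfs0 somefileset --inode-limit 20000000:10000000
mmdf gpfs0 -F                             # inode usage summary for the file system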
From aaron.s.knister at nasa.gov Mon Sep 19 21:34:53 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 19 Sep 2016 16:34:53 -0400 Subject: [gpfsug-discuss] EDR and omnipath In-Reply-To: <87E46193-4D65-41A3-AB0E-B12987F6FFC3@gmail.com> References: <87E46193-4D65-41A3-AB0E-B12987F6FFC3@gmail.com> Message-ID: <6f33c20f-ff8f-9ccc-4609-920b35058138@nasa.gov> I must admit, I'm curious as to why one cannot use GPFS with IB and OPA both in RDMA mode. Granted, I know very little about OPA but if it just presents as another verbs device I wonder why it wouldn't "Just work" as long as GPFS is configured correctly. -Aaron On 9/19/16 3:11 AM, Vic Cornell wrote: > Bump > > I can see no reason why that wouldn't work. But it would be nice to a > have an official answer or evidence that it works. > > Vic > > >> On 15 Sep 2016, at 5:49 pm, Brian Marshall > > wrote: >> >> All, >> >> I see in the GPFS FAQ A6.3 the statement below. Is it possible to >> have GPFS do RDMA over EDR infiniband and non-RDMA communication over >> omnipath (IP over fabric) when each NSD server has an EDR card and a >> OPA card installed? >> >> >> >> RDMA is not supported on a node when both Mellanox HCAs and Intel >> Omni-Path HFIs are enabled for RDMA. >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From oehmes at us.ibm.com Mon Sep 19 21:43:31 2016 From: oehmes at us.ibm.com (Sven Oehme) Date: Mon, 19 Sep 2016 20:43:31 +0000 Subject: [gpfsug-discuss] EDR and omnipath In-Reply-To: <6f33c20f-ff8f-9ccc-4609-920b35058138@nasa.gov> Message-ID: Because they both require a different distribution of OFED, which are mutual exclusive to install. in theory if you deploy plain OFED it might work, but that will be hard to find somebody to support. Sent from IBM Verse Aaron Knister --- Re: [gpfsug-discuss] EDR and omnipath --- From:"Aaron Knister" To:gpfsug-discuss at spectrumscale.orgDate:Mon, Sep 19, 2016 1:35 PMSubject:Re: [gpfsug-discuss] EDR and omnipath I must admit, I'm curious as to why one cannot use GPFS with IB and OPA both in RDMA mode. Granted, I know very little about OPA but if it just presents as another verbs device I wonder why it wouldn't "Just work" as long as GPFS is configured correctly.-AaronOn 9/19/16 3:11 AM, Vic Cornell wrote:> Bump>> I can see no reason why that wouldn't work. But it would be nice to a> have an official answer or evidence that it works.>> Vic>>>> On 15 Sep 2016, at 5:49 pm, Brian Marshall > > wrote:>>>> All,>>>> I see in the GPFS FAQ A6.3 the statement below. 
Is it possible to>> have GPFS do RDMA over EDR infiniband and non-RDMA communication over>> omnipath (IP over fabric) when each NSD server has an EDR card and a>> OPA card installed?>>>>>>>> RDMA is not supported on a node when both Mellanox HCAs and Intel>> Omni-Path HFIs are enabled for RDMA.>>>> _______________________________________________>> gpfsug-discuss mailing list>> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss>>>> _______________________________________________> gpfsug-discuss mailing list> gpfsug-discuss at spectrumscale.org> http://gpfsug.org/mailman/listinfo/gpfsug-discuss>-- Aaron KnisterNASA Center for Climate Simulation (Code 606.2)Goddard Space Flight Center(301) 286-2776_______________________________________________gpfsug-discuss mailing listgpfsug-discuss at spectrumscale.orghttp://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Mon Sep 19 21:55:32 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 19 Sep 2016 16:55:32 -0400 Subject: [gpfsug-discuss] EDR and omnipath In-Reply-To: References: Message-ID: Ah, that makes complete sense. Thanks! I had been doing some reading about OmniPath and for some reason was under the impression the OmniPath adapter could load itself as a driver under the verbs stack of OFED. Even so, that raises support concerns as you say. I wonder what folks are doing who have IB-based block storage fabrics but wanting to connect to OmniPath-based fabrics? I'm also curious how GNR customers would be able to serve both IB-based and an OmniPath-based fabrics over RDMA where performance is best? This is is along the lines of my GPFS protocol router question from the other day. -Aaron On 9/19/16 4:43 PM, Sven Oehme wrote: > Because they both require a different distribution of OFED, which are > mutual exclusive to install. > in theory if you deploy plain OFED it might work, but that will be hard > to find somebody to support. > > > Sent from IBM Verse > > Aaron Knister --- Re: [gpfsug-discuss] EDR and omnipath --- > > From: "Aaron Knister" > To: gpfsug-discuss at spectrumscale.org > Date: Mon, Sep 19, 2016 1:35 PM > Subject: Re: [gpfsug-discuss] EDR and omnipath > > ------------------------------------------------------------------------ > > I must admit, I'm curious as to why one cannot use GPFS with IB and OPA > both in RDMA mode. Granted, I know very little about OPA but if it just > presents as another verbs device I wonder why it wouldn't "Just work" as > long as GPFS is configured correctly. > > -Aaron > > On 9/19/16 3:11 AM, Vic Cornell wrote: >> Bump >> >> I can see no reason why that wouldn't work. But it would be nice to a >> have an official answer or evidence that it works. >> >> Vic >> >> >>> On 15 Sep 2016, at 5:49 pm, Brian Marshall >> > wrote: >>> >>> All, >>> >>> I see in the GPFS FAQ A6.3 the statement below. Is it possible to >>> have GPFS do RDMA over EDR infiniband and non-RDMA communication over >>> omnipath (IP over fabric) when each NSD server has an EDR card and a >>> OPA card installed? >>> >>> >>> >>> RDMA is not supported on a node when both Mellanox HCAs and Intel >>> Omni-Path HFIs are enabled for RDMA. 
>>> >>> _______________________________________________ >>> gpfsug-discuss mailing list >>> gpfsug-discuss at spectrumscale.org >>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From aaron.s.knister at nasa.gov Mon Sep 19 22:03:51 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 19 Sep 2016 17:03:51 -0400 Subject: [gpfsug-discuss] EDR and omnipath In-Reply-To: References: Message-ID: <99103c73-baf0-f421-f64d-1d5ee916d340@nasa.gov> Here's where I read about the inter-operability of the two: http://www.intel.com/content/dam/www/public/us/en/documents/white-papers/omni-path-storage-white-paper.pdf This is what Intel says: > In a multi-homed file system server, or in a Lustre Networking (LNet) or IP router, a single OpenFabrics Al- liance (OFA) software environment supporting both an Intel OPA HFI and a Mellanox* InfiniBand HCA is required. The OFA software stack is architected to support multiple tar- geted network types. Currently, the OFA stack simultaneously supports iWARP for Ethernet, RDMA over Converged Ethernet (RoCE), and InfiniBand networks, and the Intel OPA network has been added to that list. As the OS distributions implement their OFA stacks, it will be validated to simultaneously support both Intel OPA Host > Intel is working closely with the major Linux distributors, including Red Hat* and SUSE*, to ensure that Intel OPA support is integrated into their OFA implementation. Once this is accomplished, then simultaneous Mellanox InfiniBand and Intel OPA support will be present in the standard Linux distributions. So it seems as though Intel is relying on the OS vendors to bridge the support gap between them and Mellanox. -Aaron On 9/19/16 4:55 PM, Aaron Knister wrote: > Ah, that makes complete sense. Thanks! > > I had been doing some reading about OmniPath and for some reason was > under the impression the OmniPath adapter could load itself as a driver > under the verbs stack of OFED. Even so, that raises support concerns as > you say. > > I wonder what folks are doing who have IB-based block storage fabrics > but wanting to connect to OmniPath-based fabrics? > > I'm also curious how GNR customers would be able to serve both IB-based > and an OmniPath-based fabrics over RDMA where performance is best? This > is is along the lines of my GPFS protocol router question from the other > day. > > -Aaron > > On 9/19/16 4:43 PM, Sven Oehme wrote: >> Because they both require a different distribution of OFED, which are >> mutual exclusive to install. >> in theory if you deploy plain OFED it might work, but that will be hard >> to find somebody to support. 
>> >> >> Sent from IBM Verse >> >> Aaron Knister --- Re: [gpfsug-discuss] EDR and omnipath --- >> >> From: "Aaron Knister" >> To: gpfsug-discuss at spectrumscale.org >> Date: Mon, Sep 19, 2016 1:35 PM >> Subject: Re: [gpfsug-discuss] EDR and omnipath >> >> ------------------------------------------------------------------------ >> >> I must admit, I'm curious as to why one cannot use GPFS with IB and OPA >> both in RDMA mode. Granted, I know very little about OPA but if it just >> presents as another verbs device I wonder why it wouldn't "Just work" as >> long as GPFS is configured correctly. >> >> -Aaron >> >> On 9/19/16 3:11 AM, Vic Cornell wrote: >>> Bump >>> >>> I can see no reason why that wouldn't work. But it would be nice to a >>> have an official answer or evidence that it works. >>> >>> Vic >>> >>> >>>> On 15 Sep 2016, at 5:49 pm, Brian Marshall >>> > wrote: >>>> >>>> All, >>>> >>>> I see in the GPFS FAQ A6.3 the statement below. Is it possible to >>>> have GPFS do RDMA over EDR infiniband and non-RDMA communication over >>>> omnipath (IP over fabric) when each NSD server has an EDR card and a >>>> OPA card installed? >>>> >>>> >>>> >>>> RDMA is not supported on a node when both Mellanox HCAs and Intel >>>> Omni-Path HFIs are enabled for RDMA. >>>> >>>> _______________________________________________ >>>> gpfsug-discuss mailing list >>>> gpfsug-discuss at spectrumscale.org >>>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>> >>> >>> >>> _______________________________________________ >>> gpfsug-discuss mailing list >>> gpfsug-discuss at spectrumscale.org >>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>> >> >> -- >> Aaron Knister >> NASA Center for Climate Simulation (Code 606.2) >> Goddard Space Flight Center >> (301) 286-2776 >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From aaron.s.knister at nasa.gov Tue Sep 20 14:22:51 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Tue, 20 Sep 2016 09:22:51 -0400 Subject: [gpfsug-discuss] GPFS Routers In-Reply-To: References: <5F910253243E6A47B81A9A2EB424BBA101D137D1@NDMSMBX404.ndc.nasa.gov> <712e5024-e6eb-9195-d9cd-f59a9b145e60@nasa.gov> Message-ID: Hi Marc, Currently we serve three disparate infiniband fabrics with three separate sets of NSD servers all connected via FC to backend storage. I was exploring the idea of flipping that on its head and having one set of NSD servers but would like something akin to Lustre LNET routers to connect each fabric to the back-end NSD servers over IB. I know there's IB routers out there now but I'm quite drawn to the idea of a GPFS equivalent of Lustre LNET routers, having used them in the past. I suppose I could always smush some extra HCAs in the NSD servers and do it that way but that got really ugly when I started factoring in omnipath. Something like an LNET router would also be useful for GNR users who would like to present to both an IB and an OmniPath fabric over RDMA. 
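For reference, the "extra HCAs in the NSD servers" workaround can at least be expressed with today's knobs by giving each InfiniBand fabric its own fabric number in verbsPorts, so RDMA stays within a fabric; the node classes and device names below are placeholders, the change normally needs a daemon restart, and per the earlier note in this archive it does not help with mixing Mellanox and Omni-Path RDMA on one node:

# NSD servers carry one HCA per fabric; tag each port with a fabric number
mmchconfig verbsPorts="mlx5_0/1/1 mlx5_1/1/2" -N nsdServerClass
# clients advertise only the fabric they actually sit on
mmchconfig verbsPorts="mlx5_0/1/1" -N fabric1Clients
mmchconfig verbsPorts="mlx5_0/1/2" -N fabric2Clients
mmchconfig verbsRdma=enable -N nsdServerClass,fabric1Clients,fabric2Clients
# RDMA is only attempted between ports sharing a fabric number; everything
# else falls back to TCP/IP over whatever routed network exists

It still gives every server a foot on every fabric rather than the LNET-style indirection being asked for here, which is really the gap in question.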
-Aaron On 9/12/16 10:48 AM, Marc A Kaplan wrote: > Perhaps if you clearly describe what equipment and connections you have > in place and what you're trying to accomplish, someone on this board can > propose a solution. > > In principle, it's always possible to insert proxies/routers to "fake" > any two endpoints into "believing" they are communicating directly. > > > > > > From: Aaron Knister > To: > Date: 09/11/2016 08:01 PM > Subject: Re: [gpfsug-discuss] GPFS Routers > Sent by: gpfsug-discuss-bounces at spectrumscale.org > ------------------------------------------------------------------------ > > > > After some googling around, I wonder if perhaps what I'm thinking of was > an I/O forwarding layer that I understood was being developed for x86_64 > type machines rather than some type of GPFS protocol router or proxy. > > -Aaron > > On 9/11/16 5:02 PM, Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE > CORP] wrote: >> Hi Everyone, >> >> A while back I seem to recall hearing about a mechanism being developed >> that would function similarly to Lustre's LNET routers and effectively >> allow a single set of NSD servers to talk to multiple RDMA fabrics >> without requiring the NSD servers to have infiniband interfaces on each >> RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each >> fabric that would in effect proxy the RDMA requests to the NSD server. >> Does anyone know what I'm talking about? Just curious if it's still on >> the roadmap. >> >> -Aaron >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From makaplan at us.ibm.com Tue Sep 20 15:01:49 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Tue, 20 Sep 2016 10:01:49 -0400 Subject: [gpfsug-discuss] GPFS Routers In-Reply-To: References: <5F910253243E6A47B81A9A2EB424BBA101D137D1@NDMSMBX404.ndc.nasa.gov><712e5024-e6eb-9195-d9cd-f59a9b145e60@nasa.gov> Message-ID: Thanks for spelling out the situation more clearly. This is beyond my knowledge and expertise. But perhaps some other participants on this forum will chime in! I may be missing something, but asking "What is Lustre LNET?" via google does not yield good answers. It would be helpful to have some graphics (pictures!) of typical, useful configurations. Limiting myself to a few minutes of searching, I couldn't find any. I "get" that Lustre users/admin with lots of nodes and several switching fabrics find it useful, but beyond that... I guess the answer will be "Performance!" -- but the obvious question is: Why not "just" use IP - that is the Internetworking Protocol! So rather than sweat over LNET, why not improve IP to work better over several IBs? >From a user/customer point of view where "I needed this yesterday", short of having an "LNET for GPFS", I suggest considering reconfiguring your nodes, switches, storage to get better performance. 
If you need to buy some more hardware, so be it. --marc From: Aaron Knister To: Date: 09/20/2016 09:23 AM Subject: Re: [gpfsug-discuss] GPFS Routers Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi Marc, Currently we serve three disparate infiniband fabrics with three separate sets of NSD servers all connected via FC to backend storage. I was exploring the idea of flipping that on its head and having one set of NSD servers but would like something akin to Lustre LNET routers to connect each fabric to the back-end NSD servers over IB. I know there's IB routers out there now but I'm quite drawn to the idea of a GPFS equivalent of Lustre LNET routers, having used them in the past. I suppose I could always smush some extra HCAs in the NSD servers and do it that way but that got really ugly when I started factoring in omnipath. Something like an LNET router would also be useful for GNR users who would like to present to both an IB and an OmniPath fabric over RDMA. -Aaron On 9/12/16 10:48 AM, Marc A Kaplan wrote: > Perhaps if you clearly describe what equipment and connections you have > in place and what you're trying to accomplish, someone on this board can > propose a solution. > > In principle, it's always possible to insert proxies/routers to "fake" > any two endpoints into "believing" they are communicating directly. > > > > > > From: Aaron Knister > To: > Date: 09/11/2016 08:01 PM > Subject: Re: [gpfsug-discuss] GPFS Routers > Sent by: gpfsug-discuss-bounces at spectrumscale.org > ------------------------------------------------------------------------ > > > > After some googling around, I wonder if perhaps what I'm thinking of was > an I/O forwarding layer that I understood was being developed for x86_64 > type machines rather than some type of GPFS protocol router or proxy. > > -Aaron > > On 9/11/16 5:02 PM, Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE > CORP] wrote: >> Hi Everyone, >> >> A while back I seem to recall hearing about a mechanism being developed >> that would function similarly to Lustre's LNET routers and effectively >> allow a single set of NSD servers to talk to multiple RDMA fabrics >> without requiring the NSD servers to have infiniband interfaces on each >> RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each >> fabric that would in effect proxy the RDMA requests to the NSD server. >> Does anyone know what I'm talking about? Just curious if it's still on >> the roadmap. >> >> -Aaron >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From aaron.s.knister at nasa.gov Tue Sep 20 15:07:38 2016 From: aaron.s.knister at nasa.gov (Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]) Date: Tue, 20 Sep 2016 14:07:38 +0000 Subject: [gpfsug-discuss] GPFS Routers References: [gpfsug-discuss] GPFS Routers Message-ID: <5F910253243E6A47B81A9A2EB424BBA101D24844@NDMSMBX404.ndc.nasa.gov> Not sure if this image will go through but here's one I found: [X] The "Routers" are LNET routers. LNET is just the name of lustre's network stack. The LNET routers "route" the Lustre protocol between disparate network types (quadrics, Ethernet, myrinet, carrier pigeon). Packet loss on carrier pigeon is particularly brutal, though. From: Marc A Kaplan Sent: 9/20/16, 10:02 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] GPFS Routers Thanks for spelling out the situation more clearly. This is beyond my knowledge and expertise. But perhaps some other participants on this forum will chime in! I may be missing something, but asking "What is Lustre LNET?" via google does not yield good answers. It would be helpful to have some graphics (pictures!) of typical, useful configurations. Limiting myself to a few minutes of searching, I couldn't find any. I "get" that Lustre users/admin with lots of nodes and several switching fabrics find it useful, but beyond that... I guess the answer will be "Performance!" -- but the obvious question is: Why not "just" use IP - that is the Internetworking Protocol! So rather than sweat over LNET, why not improve IP to work better over several IBs? >From a user/customer point of view where "I needed this yesterday", short of having an "LNET for GPFS", I suggest considering reconfiguring your nodes, switches, storage to get better performance. If you need to buy some more hardware, so be it. --marc From: Aaron Knister To: Date: 09/20/2016 09:23 AM Subject: Re: [gpfsug-discuss] GPFS Routers Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi Marc, Currently we serve three disparate infiniband fabrics with three separate sets of NSD servers all connected via FC to backend storage. I was exploring the idea of flipping that on its head and having one set of NSD servers but would like something akin to Lustre LNET routers to connect each fabric to the back-end NSD servers over IB. I know there's IB routers out there now but I'm quite drawn to the idea of a GPFS equivalent of Lustre LNET routers, having used them in the past. I suppose I could always smush some extra HCAs in the NSD servers and do it that way but that got really ugly when I started factoring in omnipath. Something like an LNET router would also be useful for GNR users who would like to present to both an IB and an OmniPath fabric over RDMA. -Aaron On 9/12/16 10:48 AM, Marc A Kaplan wrote: > Perhaps if you clearly describe what equipment and connections you have > in place and what you're trying to accomplish, someone on this board can > propose a solution. > > In principle, it's always possible to insert proxies/routers to "fake" > any two endpoints into "believing" they are communicating directly. 
> > > > > > From: Aaron Knister > To: > Date: 09/11/2016 08:01 PM > Subject: Re: [gpfsug-discuss] GPFS Routers > Sent by: gpfsug-discuss-bounces at spectrumscale.org > ------------------------------------------------------------------------ > > > > After some googling around, I wonder if perhaps what I'm thinking of was > an I/O forwarding layer that I understood was being developed for x86_64 > type machines rather than some type of GPFS protocol router or proxy. > > -Aaron > > On 9/11/16 5:02 PM, Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE > CORP] wrote: >> Hi Everyone, >> >> A while back I seem to recall hearing about a mechanism being developed >> that would function similarly to Lustre's LNET routers and effectively >> allow a single set of NSD servers to talk to multiple RDMA fabrics >> without requiring the NSD servers to have infiniband interfaces on each >> RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each >> fabric that would in effect proxy the RDMA requests to the NSD server. >> Does anyone know what I'm talking about? Just curious if it's still on >> the roadmap. >> >> -Aaron >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Tue Sep 20 15:08:46 2016 From: aaron.s.knister at nasa.gov (Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]) Date: Tue, 20 Sep 2016 14:08:46 +0000 Subject: [gpfsug-discuss] GPFS Routers References: [gpfsug-discuss] GPFS Routers Message-ID: <5F910253243E6A47B81A9A2EB424BBA101D24881@NDMSMBX404.ndc.nasa.gov> Looks like the attachment got scrubbed. Here's the link http://docplayer.net/docs-images/39/19199001/images/7-0.png[X] From: aaron.s.knister at nasa.gov Sent: 9/20/16, 10:07 AM To: gpfsug main discussion list, gpfsug main discussion list Subject: Re: [gpfsug-discuss] GPFS Routers Not sure if this image will go through but here's one I found: [X] The "Routers" are LNET routers. LNET is just the name of lustre's network stack. The LNET routers "route" the Lustre protocol between disparate network types (quadrics, Ethernet, myrinet, carrier pigeon). Packet loss on carrier pigeon is particularly brutal, though. From: Marc A Kaplan Sent: 9/20/16, 10:02 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] GPFS Routers Thanks for spelling out the situation more clearly. This is beyond my knowledge and expertise. But perhaps some other participants on this forum will chime in! I may be missing something, but asking "What is Lustre LNET?" via google does not yield good answers. It would be helpful to have some graphics (pictures!) 
of typical, useful configurations. Limiting myself to a few minutes of searching, I couldn't find any. I "get" that Lustre users/admin with lots of nodes and several switching fabrics find it useful, but beyond that... I guess the answer will be "Performance!" -- but the obvious question is: Why not "just" use IP - that is the Internetworking Protocol! So rather than sweat over LNET, why not improve IP to work better over several IBs? >From a user/customer point of view where "I needed this yesterday", short of having an "LNET for GPFS", I suggest considering reconfiguring your nodes, switches, storage to get better performance. If you need to buy some more hardware, so be it. --marc From: Aaron Knister To: Date: 09/20/2016 09:23 AM Subject: Re: [gpfsug-discuss] GPFS Routers Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi Marc, Currently we serve three disparate infiniband fabrics with three separate sets of NSD servers all connected via FC to backend storage. I was exploring the idea of flipping that on its head and having one set of NSD servers but would like something akin to Lustre LNET routers to connect each fabric to the back-end NSD servers over IB. I know there's IB routers out there now but I'm quite drawn to the idea of a GPFS equivalent of Lustre LNET routers, having used them in the past. I suppose I could always smush some extra HCAs in the NSD servers and do it that way but that got really ugly when I started factoring in omnipath. Something like an LNET router would also be useful for GNR users who would like to present to both an IB and an OmniPath fabric over RDMA. -Aaron On 9/12/16 10:48 AM, Marc A Kaplan wrote: > Perhaps if you clearly describe what equipment and connections you have > in place and what you're trying to accomplish, someone on this board can > propose a solution. > > In principle, it's always possible to insert proxies/routers to "fake" > any two endpoints into "believing" they are communicating directly. > > > > > > From: Aaron Knister > To: > Date: 09/11/2016 08:01 PM > Subject: Re: [gpfsug-discuss] GPFS Routers > Sent by: gpfsug-discuss-bounces at spectrumscale.org > ------------------------------------------------------------------------ > > > > After some googling around, I wonder if perhaps what I'm thinking of was > an I/O forwarding layer that I understood was being developed for x86_64 > type machines rather than some type of GPFS protocol router or proxy. > > -Aaron > > On 9/11/16 5:02 PM, Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE > CORP] wrote: >> Hi Everyone, >> >> A while back I seem to recall hearing about a mechanism being developed >> that would function similarly to Lustre's LNET routers and effectively >> allow a single set of NSD servers to talk to multiple RDMA fabrics >> without requiring the NSD servers to have infiniband interfaces on each >> RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each >> fabric that would in effect proxy the RDMA requests to the NSD server. >> Does anyone know what I'm talking about? Just curious if it's still on >> the roadmap. 
>> >> -Aaron >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Tue Sep 20 15:30:43 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Tue, 20 Sep 2016 10:30:43 -0400 Subject: [gpfsug-discuss] GPFS Routers In-Reply-To: <5F910253243E6A47B81A9A2EB424BBA101D24881@NDMSMBX404.ndc.nasa.gov> References: [gpfsug-discuss] GPFS Routers <5F910253243E6A47B81A9A2EB424BBA101D24881@NDMSMBX404.ndc.nasa.gov> Message-ID: Thanks. That example is simpler than I imagined. Question: If that was indeed your situation and you could afford it, why not just go totally with infiniband switching/routing? Are not the routers just a hack to connect Intel OPA to IB? Ref: https://community.mellanox.com/docs/DOC-2384#jive_content_id_Network_Topology_Design -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhejtman at ics.muni.cz Tue Sep 20 16:07:12 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Tue, 20 Sep 2016 17:07:12 +0200 Subject: [gpfsug-discuss] CES and nfs pseudo root Message-ID: <20160920150712.2v73hsf7pzrqb3g4@ics.muni.cz> Hello, ganesha allows to specify pseudo root for each export using Pseudo="path". mmnfs export sets pseudo path the same as export dir, e.g., I want to export /mnt/nfs, Pseudo is set to '/mnt/nfs' as well. Can I set somehow Pseudo to '/'? -- Luk?? Hejtm?nek From stef.coene at docum.org Tue Sep 20 18:42:57 2016 From: stef.coene at docum.org (Stef Coene) Date: Tue, 20 Sep 2016 19:42:57 +0200 Subject: [gpfsug-discuss] Ubuntu client Message-ID: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> Hi, I just installed 4.2.1 on 2 RHEL 7.2 servers without any issue. But I also need 2 clients on Ubuntu 14.04. I installed the GPFS client on the Ubuntu server and used mmbuildgpl to build the required kernel modules. ssh keys are exchanged between GPFS servers and the client. But I can't add the node: [root at gpfs01 ~]# mmaddnode -N client1 Tue Sep 20 19:40:09 CEST 2016: mmaddnode: Processing node client1 mmremote: The CCR environment could not be initialized on node client1. mmaddnode: The CCR environment could not be initialized on node client1. mmaddnode: mmaddnode quitting. None of the specified nodes are valid. mmaddnode: Command failed. Examine previous error messages to determine cause. I don't see any error in /var/mmfs on client and server. What can I try to debug this error? 
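A quick pre-check worth running before mmaddnode is to confirm that every node resolves every other node's name the same way, and that passwordless ssh works in both directions. A minimal sketch, with gpfs01, gpfs02 and client1 standing in for your own node names:

  # from the node where mmaddnode is run
  for h in gpfs01 gpfs02 client1; do
    echo "== $h =="
    getent hosts "$h"                                            # local forward lookup
    ssh "$h" 'hostname -f; getent hosts gpfs01 gpfs02 client1'   # remote view of the same names
  done

Any node where these lookups disagree with /etc/hosts or DNS is worth fixing before retrying mmaddnode.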
Stef From stef.coene at docum.org Tue Sep 20 18:47:47 2016 From: stef.coene at docum.org (Stef Coene) Date: Tue, 20 Sep 2016 19:47:47 +0200 Subject: [gpfsug-discuss] Ubuntu client In-Reply-To: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> References: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> Message-ID: <3727524d-aa94-a09e-ebf7-a5d4e1c6f301@docum.org> On 09/20/2016 07:42 PM, Stef Coene wrote: > Hi, > > I just installed 4.2.1 on 2 RHEL 7.2 servers without any issue. > But I also need 2 clients on Ubuntu 14.04. > I installed the GPFS client on the Ubuntu server and used mmbuildgpl to > build the required kernel modules. > ssh keys are exchanged between GPFS servers and the client. > > But I can't add the node: > [root at gpfs01 ~]# mmaddnode -N client1 > Tue Sep 20 19:40:09 CEST 2016: mmaddnode: Processing node client1 > mmremote: The CCR environment could not be initialized on node client1. > mmaddnode: The CCR environment could not be initialized on node client1. > mmaddnode: mmaddnode quitting. None of the specified nodes are valid. > mmaddnode: Command failed. Examine previous error messages to determine > cause. > > I don't see any error in /var/mmfs on client and server. > > What can I try to debug this error? Pfff, problem solved. I tailed the logs in /var/adm/ras and found out there was a type in /etc/hosts so the hostname of the client was unresolvable. Stef From YARD at il.ibm.com Tue Sep 20 20:03:39 2016 From: YARD at il.ibm.com (Yaron Daniel) Date: Tue, 20 Sep 2016 22:03:39 +0300 Subject: [gpfsug-discuss] Ubuntu client In-Reply-To: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> References: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> Message-ID: Hi Check that kernel symbols are installed too Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services - Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Stef Coene To: gpfsug main discussion list Date: 09/20/2016 08:43 PM Subject: [gpfsug-discuss] Ubuntu client Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi, I just installed 4.2.1 on 2 RHEL 7.2 servers without any issue. But I also need 2 clients on Ubuntu 14.04. I installed the GPFS client on the Ubuntu server and used mmbuildgpl to build the required kernel modules. ssh keys are exchanged between GPFS servers and the client. But I can't add the node: [root at gpfs01 ~]# mmaddnode -N client1 Tue Sep 20 19:40:09 CEST 2016: mmaddnode: Processing node client1 mmremote: The CCR environment could not be initialized on node client1. mmaddnode: The CCR environment could not be initialized on node client1. mmaddnode: mmaddnode quitting. None of the specified nodes are valid. mmaddnode: Command failed. Examine previous error messages to determine cause. I don't see any error in /var/mmfs on client and server. What can I try to debug this error? Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: not available Type: image/gif Size: 1851 bytes Desc: not available URL: From olaf.weiser at de.ibm.com Wed Sep 21 04:35:57 2016 From: olaf.weiser at de.ibm.com (Olaf Weiser) Date: Wed, 21 Sep 2016 05:35:57 +0200 Subject: [gpfsug-discuss] Ubuntu client In-Reply-To: References: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> Message-ID: An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 1851 bytes Desc: not available URL: From stef.coene at docum.org Wed Sep 21 07:03:05 2016 From: stef.coene at docum.org (Stef Coene) Date: Wed, 21 Sep 2016 08:03:05 +0200 Subject: [gpfsug-discuss] Ubuntu client In-Reply-To: References: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> Message-ID: <01a37d7a-b5ef-cb3e-5ccb-d5f942df6487@docum.org> On 09/21/2016 05:35 AM, Olaf Weiser wrote: > CCR issues are often related to DNS issues, so check, that you Ubuntu > nodes can resolve the existing nodes accordingly and vise versa > in one line: .. all nodes must be resolvable on every node It was a type in the hostname and /etc/hosts. So problem solved. Stef From xhejtman at ics.muni.cz Wed Sep 21 20:09:32 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Wed, 21 Sep 2016 21:09:32 +0200 Subject: [gpfsug-discuss] CES NFS with Kerberos Message-ID: <20160921190932.fibmmccccs5kit6x@ics.muni.cz> Hello, does nfs server (ganesha) work for someone with Kerberos authentication? I got random permission denied: :/mnt/nfs-test/tmp# for i in `seq 1 20`; do rm testf; dd if=/dev/zero of=testf bs=1M count=100000; done 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 642.849 s, 163 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 925.326 s, 113 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 762.749 s, 137 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 860.608 s, 122 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 788.62 s, 133 MB/s dd: error writing ?testf?: Permission denied 51949+0 records in 51948+0 records out 54471426048 bytes (54 GB) copied, 566.667 s, 96.1 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 1082.63 s, 96.9 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 1080.65 s, 97.0 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 949.683 s, 110 MB/s dd: error writing ?testf?: Permission denied 30076+0 records in 30075+0 records out 31535923200 bytes (32 GB) copied, 308.009 s, 102 MB/s dd: error writing ?testf?: Permission denied 89837+0 records in 89836+0 records out 94199873536 bytes (94 GB) copied, 976.368 s, 96.5 MB/s It seems that it is a bug in ganesha: http://permalink.gmane.org/gmane.comp.file-systems.nfs.ganesha.devel/2000 but it is still not resolved. -- Luk?? Hejtm?nek From Greg.Lehmann at csiro.au Wed Sep 21 23:34:09 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Wed, 21 Sep 2016 22:34:09 +0000 Subject: [gpfsug-discuss] CES NFS with Kerberos In-Reply-To: <20160921190932.fibmmccccs5kit6x@ics.muni.cz> References: <20160921190932.fibmmccccs5kit6x@ics.muni.cz> Message-ID: <94bedafe10a9473c93b0bcc5d34cbea6@exch1-cdc.nexus.csiro.au> It may not be NFS. Check your GPFS logs too. 
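When the failures are intermittent like this, it is also worth checking whether they line up with Kerberos ticket or GSS context expiry rather than with anything in the filesystem. A rough sketch of what to compare; the Ganesha log path is an assumption and may differ on your CES nodes:

  # on the NFS client: note the ticket and renew lifetimes, then compare with when the writes fail
  klist
  # on the CES node serving the export: look for GSS/credential errors around the failure time
  grep -Ei 'gss|krb|expire' /var/log/ganesha.log
  tail -n 100 /var/adm/ras/mmfs.log.latest

If the permission denied errors land roughly one ticket lifetime into each long write, that points at the Ganesha credential-expiry issue referenced above rather than at GPFS itself.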
-----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Lukas Hejtmanek Sent: Thursday, 22 September 2016 5:10 AM To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] CES NFS with Kerberos Hello, does nfs server (ganesha) work for someone with Kerberos authentication? I got random permission denied: :/mnt/nfs-test/tmp# for i in `seq 1 20`; do rm testf; dd if=/dev/zero of=testf bs=1M count=100000; done 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 642.849 s, 163 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 925.326 s, 113 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 762.749 s, 137 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 860.608 s, 122 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 788.62 s, 133 MB/s dd: error writing ?testf?: Permission denied 51949+0 records in 51948+0 records out 54471426048 bytes (54 GB) copied, 566.667 s, 96.1 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 1082.63 s, 96.9 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 1080.65 s, 97.0 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 949.683 s, 110 MB/s dd: error writing ?testf?: Permission denied 30076+0 records in 30075+0 records out 31535923200 bytes (32 GB) copied, 308.009 s, 102 MB/s dd: error writing ?testf?: Permission denied 89837+0 records in 89836+0 records out 94199873536 bytes (94 GB) copied, 976.368 s, 96.5 MB/s It seems that it is a bug in ganesha: http://permalink.gmane.org/gmane.comp.file-systems.nfs.ganesha.devel/2000 but it is still not resolved. -- Luk?? Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From xhejtman at ics.muni.cz Thu Sep 22 09:25:09 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Thu, 22 Sep 2016 10:25:09 +0200 Subject: [gpfsug-discuss] CES NFS with Kerberos In-Reply-To: <94bedafe10a9473c93b0bcc5d34cbea6@exch1-cdc.nexus.csiro.au> References: <20160921190932.fibmmccccs5kit6x@ics.muni.cz> <94bedafe10a9473c93b0bcc5d34cbea6@exch1-cdc.nexus.csiro.au> Message-ID: <20160922082509.rc53tseeovjnixtz@ics.muni.cz> Hello, thanks, I do not see any error in GPFS logs. The link, I posted below is not related to GPFS at all, it seems that it is bug in ganesha. On Wed, Sep 21, 2016 at 10:34:09PM +0000, Greg.Lehmann at csiro.au wrote: > It may not be NFS. Check your GPFS logs too. > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Lukas Hejtmanek > Sent: Thursday, 22 September 2016 5:10 AM > To: gpfsug-discuss at spectrumscale.org > Subject: [gpfsug-discuss] CES NFS with Kerberos > > Hello, > > does nfs server (ganesha) work for someone with Kerberos authentication? 
> > I got random permission denied: > :/mnt/nfs-test/tmp# for i in `seq 1 20`; do rm testf; dd if=/dev/zero of=testf bs=1M count=100000; done > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 642.849 s, 163 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 925.326 s, 113 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 762.749 s, 137 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 860.608 s, 122 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 788.62 s, 133 MB/s > dd: error writing ?testf?: Permission denied > 51949+0 records in > 51948+0 records out > 54471426048 bytes (54 GB) copied, 566.667 s, 96.1 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 1082.63 s, 96.9 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 1080.65 s, 97.0 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 949.683 s, 110 MB/s > dd: error writing ?testf?: Permission denied > 30076+0 records in > 30075+0 records out > 31535923200 bytes (32 GB) copied, 308.009 s, 102 MB/s > dd: error writing ?testf?: Permission denied > 89837+0 records in > 89836+0 records out > 94199873536 bytes (94 GB) copied, 976.368 s, 96.5 MB/s > > It seems that it is a bug in ganesha: > http://permalink.gmane.org/gmane.comp.file-systems.nfs.ganesha.devel/2000 > > but it is still not resolved. > > -- > Luk?? Hejtm?nek > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Luk?? Hejtm?nek From stef.coene at docum.org Thu Sep 22 19:36:48 2016 From: stef.coene at docum.org (Stef Coene) Date: Thu, 22 Sep 2016 20:36:48 +0200 Subject: [gpfsug-discuss] Blocksize Message-ID: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Hi, Is it needed to specify a different blocksize for the system pool that holds the metadata? IBM recommends a 1 MB blocksize for the file system. But I wonder a smaller blocksize (256 KB or so) for metadata is a good idea or not... Stef From eric.wonderley at vt.edu Thu Sep 22 20:07:30 2016 From: eric.wonderley at vt.edu (J. Eric Wonderley) Date: Thu, 22 Sep 2016 15:07:30 -0400 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: It defaults to 4k: mmlsfs testbs8M -i flag value description ------------------- ------------------------ ----------------------------------- -i 4096 Inode size in bytes I think you can make as small as 512b. Gpfs will store very small files in the inode. Typically you want your average file size to be your blocksize and your filesystem has one blocksize and one inodesize. On Thu, Sep 22, 2016 at 2:36 PM, Stef Coene wrote: > Hi, > > Is it needed to specify a different blocksize for the system pool that > holds the metadata? > > IBM recommends a 1 MB blocksize for the file system. > But I wonder a smaller blocksize (256 KB or so) for metadata is a good > idea or not... 
> > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ewahl at osc.edu Thu Sep 22 20:19:00 2016 From: ewahl at osc.edu (Wahl, Edward) Date: Thu, 22 Sep 2016 19:19:00 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: <9DA9EC7A281AC7428A9618AFDC49049958EFBB06@CIO-KRC-D1MBX02.osuad.osu.edu> This is a great idea. However there are quite a few other things to consider: -max file count? If you need say a couple of billion files, this will affect things. -wish to store small files in the system pool in late model SS/GPFS? -encryption? No data will be stored in the system pool so large blocks for small file storage in system is pointless. -system pool replication? -HDD vs SSD for system pool? -xxD or array tuning recommendations from your vendor? -streaming vs random IO? Do you have a single dedicated app that has performance like xxx? -probably more I can't think of off the top of my head. etc etc Ed ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Stef Coene [stef.coene at docum.org] Sent: Thursday, September 22, 2016 2:36 PM To: gpfsug main discussion list Subject: [gpfsug-discuss] Blocksize Hi, Is it needed to specify a different blocksize for the system pool that holds the metadata? IBM recommends a 1 MB blocksize for the file system. But I wonder a smaller blocksize (256 KB or so) for metadata is a good idea or not... Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From janfrode at tanso.net Thu Sep 22 20:25:03 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Thu, 22 Sep 2016 21:25:03 +0200 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: https://www.ibm.com/developerworks/community/forums/html/topic?id=77777777-0000-0000-0000-000014774266 "Use 256K. Anything smaller makes allocation blocks for the inode file inefficient. Anything larger wastes space for directories. These are the two largest consumers of metadata space." --dlmcnabb A bit old, but I would assume it still applies. -jf On Thu, Sep 22, 2016 at 8:36 PM, Stef Coene wrote: > Hi, > > Is it needed to specify a different blocksize for the system pool that > holds the metadata? > > IBM recommends a 1 MB blocksize for the file system. > But I wonder a smaller blocksize (256 KB or so) for metadata is a good > idea or not... > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From stef.coene at docum.org Thu Sep 22 20:29:43 2016 From: stef.coene at docum.org (Stef Coene) Date: Thu, 22 Sep 2016 21:29:43 +0200 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: On 09/22/2016 09:07 PM, J. 
Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. So I calculated with IBM that a 1 MB or 2 MB block size is best. But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef From luis.bolinches at fi.ibm.com Thu Sep 22 20:37:02 2016 From: luis.bolinches at fi.ibm.com (Luis Bolinches) Date: Thu, 22 Sep 2016 19:37:02 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: , <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: An HTML attachment was scrubbed... URL: From S.J.Thompson at bham.ac.uk Thu Sep 22 21:02:24 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Thu, 22 Sep 2016 20:02:24 +0000 Subject: [gpfsug-discuss] SSUG Meet the Devs - Cloud Workshop In-Reply-To: References: Message-ID: We are down to our last few places, so if you do intend to attend, I encourage you to register now! Simon ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Secretary GPFS UG [secretary at gpfsug.org] Sent: 15 September 2016 09:42 To: gpfsug main discussion list Subject: [gpfsug-discuss] SSUG Meet the Devs - Cloud Workshop Hi everyone, Back by popular demand! We are holding a UK 'Meet the Developers' event to focus on Cloud topics. We are very lucky to have Dean Hildebrand, Master Inventor, Cloud Storage Software from IBM over in the UK to lead this session. IS IT FOR ME? Slightly different to the past meet the devs format, this is a cloud workshop aimed at looking how Spectrum Scale fits in the world of cloud. Rather than being a series of presentations and discussions led by IBM, this workshop aims to look at how Spectrum Scale can be used in cloud environments. This will include using Spectrum Scale as an infrastructure tool to power private cloud deployments. We will also look at the challenges of accessing data from cloud deployments and discuss ways in which this might be accomplished. If you are currently deploying OpenStack on Spectrum Scale, or plan to in the near future, then this workshop is for you. Also if you currently have Spectrum Scale and are wondering how you might get that data into cloud-enabled workloads or are currently doing so, then again you should attend. To ensure that the workshop is focused, numbers are limited and we will initially be limiting to 2 people per organisation/project/site. WHAT WILL BE DISCUSSED? Our topics for the day will include on-premise (private) clouds, on-premise self-service (public) clouds, off-premise clouds (Amazon etc.) as well as covering technologies including OpenStack, Docker, Kubernetes and security requirements around multi-tenancy. We probably don't have all the answers for these, but we'd like to understand the requirements and hear people's ideas. Please let us know what you would like to discuss when you register. Arrival is from 10:00 with discussion kicking off from 10:30. The agenda is open discussion though we do aim to talk over a number of key topics. 
We hope to have the ever popular (though usually late!) pizza for lunch. WHEN Thursday 20th October 2016 from 10:00 AM to 3:30 PM WHERE IT Services, University of Birmingham - Elms Road Edgbaston, Birmingham, B15 2TT REGISTER Please register for the event in advance: https://www.eventbrite.com/e/ssug-meet-the-devs-cloud-workshop-tickets-27725390389 Numbers are limited and we will initially be limiting to 2 people per organisation/project/site. We look forward to seeing you there! -- Claire O'Toole Spectrum Scale/GPFS User Group Secretary +44 (0)7508 033896 www.spectrumscaleug.org -------------- next part -------------- An HTML attachment was scrubbed... URL:
From makaplan at us.ibm.com Thu Sep 22 21:25:10 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Thu, 22 Sep 2016 16:25:10 -0400 Subject: [gpfsug-discuss] Blocksize and space and performance for Metadata, release 4.2.x In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: There have been a few changes over the years that may invalidate some of the old advice about metadata and disk allocations there for. These have been phased in over the last few years, I am discussing the present situation for release 4.2.x 1) Inode size. Used to be 512. Now you can set the inodesize at mmcrfs time. Defaults to 4096. 2) Data in inode. If it fits, then the inode holds the data. Since a 512 byte inode still works, you can have more than 3.5KB of data in a 4KB inode. 3) Extended Attributes in Inode. Again, if it fits... Extended attributes used to be stored in a separate file of metadata. So extended attributes performance is way better than the old days. 4) (small) Directories in Inode. If it fits, the inode of a directory can hold the directory entries. That gives you about 2x performance on directory reads, for smallish directories. 5) Big directory blocks. Directories used to use a maximum of 32KB per block, potentially wasting a lot of space and yielding poor performance for large directories. Now directory blocks are the lesser of metadata-blocksize and 256KB. 6) Big directories are shrinkable. Used to be directories would grow in 32KB chunks but never shrink. Yup, even an almost(?) "empty" directory would remain the size the directory had to be at its lifetime maximum. That means just a few remaining entries could be "sprinkled" over many directory blocks. (See also 5.) But now directories autoshrink to avoid wasteful sparsity. Last I looked, the implementation just stopped short of "pushing" tiny directories back into the inode. But a huge directory can be shrunk down to a single (meta)data block. (See --compact in the docs.) --marc of GPFS -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Thu Sep 22 21:49:32 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Thu, 22 Sep 2016 13:49:32 -0700 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: The current (V4.2+) levels of code support bigger directory block sizes, so it's no longer an issue with something like 1M metadata block size. In fact, there isn't a whole lot of difference between 256K and 1M metadata block sizes, either would work fine. There isn't really a downside in selecting a different block size for metadata though. Inode size (mmcrfs -i option) is orthogonal to the metadata block size selection. We do strongly recommend using 4K inodes to anyone. There's the obvious downside of needing more metadata storage for inodes, but the advantages are significant.
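For reference, those two independent choices are both made at file system creation time. A sketch only, with the device name, stanza file and sizes illustrative rather than a recommendation:

  # 1M data blocks, a metadata-only system pool with 256K blocks, and 4K inodes
  mmcrfs fs1 -F nsd.stanza -B 1M --metadata-block-size 256K -i 4096
  # confirm what was actually set
  mmlsfs fs1 -B -i

Note that --metadata-block-size only applies when the system pool holds metadata only, i.e. the NSD stanzas for that pool are marked usage=metadataOnly.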
yuri From: Jan-Frode Myklebust To: gpfsug main discussion list , Date: 09/22/2016 12:25 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org https://www.ibm.com/developerworks/community/forums/html/topic?id=77777777-0000-0000-0000-000014774266 "Use 256K. Anything smaller makes allocation blocks for the inode file inefficient. Anything larger wastes space for directories. These are the two largest consumers of metadata space." --dlmcnabb A bit old, but I would assume it still applies. ? -jf On Thu, Sep 22, 2016 at 8:36 PM, Stef Coene wrote: Hi, Is it needed to specify a different blocksize for the system pool that holds the metadata? IBM recommends a 1 MB blocksize for the file system. But I wonder a smaller blocksize (256 KB or so) for metadata is a good idea or not... Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From Mark.Bush at siriuscom.com Fri Sep 23 02:48:44 2016 From: Mark.Bush at siriuscom.com (Mark.Bush at siriuscom.com) Date: Fri, 23 Sep 2016 01:48:44 +0000 Subject: [gpfsug-discuss] Learn a new cluster Message-ID: <7E08F93E-F018-4C00-A78E-71EFDBEAC87C@siriuscom.com> What commands would you run to learn all you need to know about a cluster you?ve never seen before? Captain Obvious (me) says: mmlscluster mmlsconfig mmlsnode mmlsnsd mmlsfs all What others? Mark R. Bush | Solutions Architect This message (including any attachments) is intended only for the use of the individual or entity to which it is addressed and may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law. If you are not the intended recipient, you are hereby notified that any use, dissemination, distribution, or copying of this communication is strictly prohibited. This message may be viewed by parties at Sirius Computer Solutions other than those named in the message header. This message does not contain an official representation of Sirius Computer Solutions. If you have received this communication in error, notify Sirius Computer Solutions immediately and (i) destroy this message if a facsimile or (ii) delete this message immediately if this is an electronic communication. Thank you. Sirius Computer Solutions -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Fri Sep 23 02:50:52 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Thu, 22 Sep 2016 21:50:52 -0400 Subject: [gpfsug-discuss] Learn a new cluster In-Reply-To: <7E08F93E-F018-4C00-A78E-71EFDBEAC87C@siriuscom.com> References: <7E08F93E-F018-4C00-A78E-71EFDBEAC87C@siriuscom.com> Message-ID: <1ff27b42-aead-fafa-5415-520334d299c1@nasa.gov> Perhaps a gpfs.snap? This could tell you a *lot* about a cluster. -Aaron On 9/22/16 9:48 PM, Mark.Bush at siriuscom.com wrote: > What commands would you run to learn all you need to know about a > cluster you?ve never seen before? 
> > Captain Obvious (me) says: > > mmlscluster > > mmlsconfig > > mmlsnode > > mmlsnsd > > mmlsfs all > > > > What others? > > > > > > Mark R. Bush | Solutions Architect > > > > This message (including any attachments) is intended only for the use of > the individual or entity to which it is addressed and may contain > information that is non-public, proprietary, privileged, confidential, > and exempt from disclosure under applicable law. If you are not the > intended recipient, you are hereby notified that any use, dissemination, > distribution, or copying of this communication is strictly prohibited. > This message may be viewed by parties at Sirius Computer Solutions other > than those named in the message header. This message does not contain an > official representation of Sirius Computer Solutions. If you have > received this communication in error, notify Sirius Computer Solutions > immediately and (i) destroy this message if a facsimile or (ii) delete > this message immediately if this is an electronic communication. Thank you. > > Sirius Computer Solutions > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From Greg.Lehmann at csiro.au Fri Sep 23 02:53:14 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Fri, 23 Sep 2016 01:53:14 +0000 Subject: [gpfsug-discuss] Learn a new cluster In-Reply-To: <7E08F93E-F018-4C00-A78E-71EFDBEAC87C@siriuscom.com> References: <7E08F93E-F018-4C00-A78E-71EFDBEAC87C@siriuscom.com> Message-ID: <40b22b40d6ed4e38be115e9f6ae8d48d@exch1-cdc.nexus.csiro.au> Nice question. I?d also look at the non-GPFS settings IBM recommend in various places like the FAQ for things like ssh, network, etc. The importance of these is variable depending on cluster size/network configuration etc. Cheers, Greg From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Mark.Bush at siriuscom.com Sent: Friday, 23 September 2016 11:49 AM To: gpfsug main discussion list Subject: [gpfsug-discuss] Learn a new cluster What commands would you run to learn all you need to know about a cluster you?ve never seen before? Captain Obvious (me) says: mmlscluster mmlsconfig mmlsnode mmlsnsd mmlsfs all What others? Mark R. Bush | Solutions Architect This message (including any attachments) is intended only for the use of the individual or entity to which it is addressed and may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law. If you are not the intended recipient, you are hereby notified that any use, dissemination, distribution, or copying of this communication is strictly prohibited. This message may be viewed by parties at Sirius Computer Solutions other than those named in the message header. This message does not contain an official representation of Sirius Computer Solutions. If you have received this communication in error, notify Sirius Computer Solutions immediately and (i) destroy this message if a facsimile or (ii) delete this message immediately if this is an electronic communication. Thank you. Sirius Computer Solutions -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From ulmer at ulmer.org Fri Sep 23 17:31:59 2016 From: ulmer at ulmer.org (Stephen Ulmer) Date: Fri, 23 Sep 2016 12:31:59 -0400 Subject: [gpfsug-discuss] Learn a new cluster In-Reply-To: <1ff27b42-aead-fafa-5415-520334d299c1@nasa.gov> References: <7E08F93E-F018-4C00-A78E-71EFDBEAC87C@siriuscom.com> <1ff27b42-aead-fafa-5415-520334d299c1@nasa.gov> Message-ID: <078081B8-E50E-46BE-B3AC-4C1DB6D963E1@ulmer.org> This was going to be my exact suggestion. My short to-learn list includes learn how to look inside a gpfs.snap for what I want to know. I?ve found the ability to do this with other snapshot bundles very useful in the past (for example I?ve used snap on AIX rather than my own scripts in some cases). Do be aware the gpfs.snap (and actually most ?create a bundle for support? commands on most platforms) are a little heavy. Liberty, -- Stephen > On Sep 22, 2016, at 9:50 PM, Aaron Knister wrote: > > Perhaps a gpfs.snap? This could tell you a *lot* about a cluster. > > -Aaron > > On 9/22/16 9:48 PM, Mark.Bush at siriuscom.com wrote: >> What commands would you run to learn all you need to know about a >> cluster you?ve never seen before? >> >> Captain Obvious (me) says: >> >> mmlscluster >> >> mmlsconfig >> >> mmlsnode >> >> mmlsnsd >> >> mmlsfs all >> >> >> >> What others? >> >> >> >> >> >> Mark R. Bush | Solutions Architect >> >> >> >> This message (including any attachments) is intended only for the use of >> the individual or entity to which it is addressed and may contain >> information that is non-public, proprietary, privileged, confidential, >> and exempt from disclosure under applicable law. If you are not the >> intended recipient, you are hereby notified that any use, dissemination, >> distribution, or copying of this communication is strictly prohibited. >> This message may be viewed by parties at Sirius Computer Solutions other >> than those named in the message header. This message does not contain an >> official representation of Sirius Computer Solutions. If you have >> received this communication in error, notify Sirius Computer Solutions >> immediately and (i) destroy this message if a facsimile or (ii) delete >> this message immediately if this is an electronic communication. Thank you. >> >> Sirius Computer Solutions > >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From ulmer at ulmer.org Fri Sep 23 20:16:06 2016 From: ulmer at ulmer.org (Stephen Ulmer) Date: Fri, 23 Sep 2016 15:16:06 -0400 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). 
So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. -- Stephen > On Sep 22, 2016, at 3:37 PM, Luis Bolinches wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. > > No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) > > POSIX > find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. > > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very small > > files in the inode. > > > > Typically you want your average file size to be your blocksize and your > > filesystem has one blocksize and one inodesize. 
> > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. > > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From oehmes at us.ibm.com Fri Sep 23 22:35:12 2016 From: oehmes at us.ibm.com (Sven Oehme) Date: Fri, 23 Sep 2016 14:35:12 -0700 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. Sven ------------------------------------------ Sven Oehme Scalable Storage Research email: oehmes at us.ibm.com Phone: +1 (408) 824-8904 IBM Almaden Research Lab ------------------------------------------ From: Stephen Ulmer To: gpfsug main discussion list Date: 09/23/2016 12:16 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. -- Stephen On Sep 22, 2016, at 3:37 PM, Luis Bolinches < luis.bolinches at fi.ibm.com> wrote: Hi My 2 cents. 
Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) POSIX find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out GPFS cd /usr/lpp/mmfs/samples/ilm gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile ./mmfind /gpfs/shared -ls -type f > find_ls_files.out CONVERT to CSV POSIX cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv GPFS cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv LOAD in octave FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); Clean the second column (OPTIONAL as the next clean up will do the same) FILESIZE(:,[2]) = []; If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) FILESIZE(FILESIZE<=3584) =[]; If we are not we need to clean the 0 size files FILESIZE(FILESIZE==0) =[]; Median FILESIZEMEDIAN = int32 (median (FILESIZE)) Mean FILESIZEMEAN = int32 (mean (FILESIZE)) Variance int32 (var (FILESIZE)) iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. int32 (iqr (FILESIZE)) Standard deviation For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. -- Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations Luis Bolinches Lab Services http://www-03.ibm.com/systems/services/labservices/ IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland Phone: +358 503112585 "If you continually give you will continually have." Anonymous ----- Original message ----- From: Stef Coene Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list Cc: Subject: Re: [gpfsug-discuss] Blocksize Date: Thu, Sep 22, 2016 10:30 PM On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. So I calculated with IBM that a 1 MB or 2 MB block size is best. But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From stef.coene at docum.org Fri Sep 23 23:34:49 2016 From: stef.coene at docum.org (Stef Coene) Date: Sat, 24 Sep 2016 00:34:49 +0200 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: <1d7481bd-c08f-df14-4708-1c2e2a4ac1c0@docum.org> On 09/22/2016 08:36 PM, Stef Coene wrote: > Hi, > > Is it needed to specify a different blocksize for the system pool that > holds the metadata? > > IBM recommends a 1 MB blocksize for the file system. > But I wonder a smaller blocksize (256 KB or so) for metadata is a good > idea or not... I have read the replies and at the end, this is what we will do: Since the back-end storage will be V5000 with a default stripe size of 256KB and we use 8 data disk in an array, this means that 256KB * 8 = 2M is the best choice for block size. So 2 MB block size for data is the best choice. Since the block size for metadata is not that important in the latest releases, we will also go for 2 MB block size for metadata. Inode size will be left at the default: 4 KB. Stef From mimarsh2 at vt.edu Sat Sep 24 02:21:30 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Fri, 23 Sep 2016 21:21:30 -0400 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <1d7481bd-c08f-df14-4708-1c2e2a4ac1c0@docum.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <1d7481bd-c08f-df14-4708-1c2e2a4ac1c0@docum.org> Message-ID: To keep this great chain going: If my metadata is on FLASH, would having a smaller blocksize for the system pool (metadata only) be helpful. My filesystem blocksize is 8MB On Fri, Sep 23, 2016 at 6:34 PM, Stef Coene wrote: > On 09/22/2016 08:36 PM, Stef Coene wrote: > >> Hi, >> >> Is it needed to specify a different blocksize for the system pool that >> holds the metadata? >> >> IBM recommends a 1 MB blocksize for the file system. >> But I wonder a smaller blocksize (256 KB or so) for metadata is a good >> idea or not... >> > I have read the replies and at the end, this is what we will do: > Since the back-end storage will be V5000 with a default stripe size of > 256KB and we use 8 data disk in an array, this means that 256KB * 8 = 2M is > the best choice for block size. > So 2 MB block size for data is the best choice. > > Since the block size for metadata is not that important in the latest > releases, we will also go for 2 MB block size for metadata. > > Inode size will be left at the default: 4 KB. > > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From luis.bolinches at fi.ibm.com Sat Sep 24 05:07:02 2016 From: luis.bolinches at fi.ibm.com (Luis Bolinches) Date: Sat, 24 Sep 2016 04:07:02 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: Not pendant but correct I flip there it is 1/32 -- Cheers > On 23 Sep 2016, at 22.16, Stephen Ulmer wrote: > > Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. > > There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. > > -- > Stephen > > > >> On Sep 22, 2016, at 3:37 PM, Luis Bolinches wrote: >> >> Hi >> >> My 2 cents. >> >> Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) >> >> About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. >> >> No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) >> >> POSIX >> find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out >> GPFS >> cd /usr/lpp/mmfs/samples/ilm >> gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile >> ./mmfind /gpfs/shared -ls -type f > find_ls_files.out >> CONVERT to CSV >> >> POSIX >> cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv >> GPFS >> cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv >> LOAD in octave >> >> FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); >> Clean the second column (OPTIONAL as the next clean up will do the same) >> >> FILESIZE(:,[2]) = []; >> If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) >> >> FILESIZE(FILESIZE<=3584) =[]; >> If we are not we need to clean the 0 size files >> >> FILESIZE(FILESIZE==0) =[]; >> Median >> >> FILESIZEMEDIAN = int32 (median (FILESIZE)) >> Mean >> >> FILESIZEMEAN = int32 (mean (FILESIZE)) >> Variance >> >> int32 (var (FILESIZE)) >> iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. >> >> int32 (iqr (FILESIZE)) >> Standard deviation >> >> >> For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. 
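For anyone without octave at hand, a rough shell-only sketch of the same mean/median calculation can be run against the find_ls_files.out.csv built above (the 3584-byte cut-off mirrors the in-inode filter; drop it if you are not on 4K-aligned inodes):

  # strip the trailing commas, drop in-inode sizes, sort numerically for the median
  tr -d ',' < find_ls_files.out.csv | awk '$1 > 3584' | sort -n > sizes.txt
  # mean and median of the sorted sizes
  awk '{ sum += $1; v[NR] = $1 }
       END { n = NR
             mean = sum / n
             median = (n % 2) ? v[(n + 1) / 2] : (v[n / 2] + v[n / 2 + 1]) / 2
             printf "files %d  mean %.0f bytes  median %.0f bytes\n", n, mean, median }' sizes.txt

The numbers should land close to the octave FILESIZEMEAN / FILESIZEMEDIAN values.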
>> >> >> >> -- >> Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations >> >> Luis Bolinches >> Lab Services >> http://www-03.ibm.com/systems/services/labservices/ >> >> IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland >> Phone: +358 503112585 >> >> "If you continually give you will continually have." Anonymous >> >> >> ----- Original message ----- >> From: Stef Coene >> Sent by: gpfsug-discuss-bounces at spectrumscale.org >> To: gpfsug main discussion list >> Cc: >> Subject: Re: [gpfsug-discuss] Blocksize >> Date: Thu, Sep 22, 2016 10:30 PM >> >> On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: >> > It defaults to 4k: >> > mmlsfs testbs8M -i >> > flag value description >> > ------------------- ------------------------ >> > ----------------------------------- >> > -i 4096 Inode size in bytes >> > >> > I think you can make as small as 512b. Gpfs will store very small >> > files in the inode. >> > >> > Typically you want your average file size to be your blocksize and your >> > filesystem has one blocksize and one inodesize. >> >> The files are not small, but around 20 MB on average. >> So I calculated with IBM that a 1 MB or 2 MB block size is best. >> >> But I'm not sure if it's better to use a smaller block size for the >> metadata. >> >> The file system is not that large (400 TB) and will hold backup data >> from CommVault. >> >> >> Stef >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> >> Ellei edell? ole toisin mainittu: / Unless stated otherwise above: >> Oy IBM Finland Ab >> PL 265, 00101 Helsinki, Finland >> Business ID, Y-tunnus: 0195876-3 >> Registered in Finland >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > Ellei edell? ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland -------------- next part -------------- An HTML attachment was scrubbed... URL: From Kevin.Buterbaugh at Vanderbilt.Edu Sat Sep 24 15:18:38 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Sat, 24 Sep 2016 14:18:38 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very interested in learning the rationale behind this as I am currently looking at all aspects of our current GPFS configuration and the possibility of making major changes. If you have a filesystem with only metadataOnly disks in the system pool and the default size of an inode is 4K (which we would do, since we have recently discovered that even on our scratch filesystem we have a bazillion files that are 4K or smaller and could therefore have their data stored in the inode, right?), then why would you set the metadata block size to anything larger than 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block size for metadata wouldn?t you be wasting a massive amount of space? What am I missing / confused about there? Oh, and here?s a related question ? let?s just say I have the above configuration ? my system pool is metadata only and is on SSD?s. 
Then I have two other dataOnly pools that are spinning disk. One is for ?regular? access and the other is the ?capacity? pool ? i.e. a pool of slower storage where we move files with large access times. I have a policy that says something like ?move all files with an access time > 6 months to the capacity pool.? Of those bazillion files less than 4K in size that are fitting in the inode currently, probably half a bazillion () of them would be subject to that rule. Will they get moved to the spinning disk capacity pool or will they stay in the inode?? Thanks! This is a very timely and interesting discussion for me as well... Kevin On Sep 23, 2016, at 4:35 PM, Sven Oehme > wrote: your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. Sven ------------------------------------------ Sven Oehme Scalable Storage Research email: oehmes at us.ibm.com Phone: +1 (408) 824-8904 IBM Almaden Research Lab ------------------------------------------ Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengt From: Stephen Ulmer > To: gpfsug main discussion list > Date: 09/23/2016 12:16 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. -- Stephen On Sep 22, 2016, at 3:37 PM, Luis Bolinches > wrote: Hi My 2 cents. Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) POSIX find . 
-type f -name '*' -exec ls -l {} \; > find_ls_files.out GPFS cd /usr/lpp/mmfs/samples/ilm gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile ./mmfind /gpfs/shared -ls -type f > find_ls_files.out CONVERT to CSV POSIX cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv GPFS cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv LOAD in octave FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); Clean the second column (OPTIONAL as the next clean up will do the same) FILESIZE(:,[2]) = []; If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) FILESIZE(FILESIZE<=3584) =[]; If we are not we need to clean the 0 size files FILESIZE(FILESIZE==0) =[]; Median FILESIZEMEDIAN = int32 (median (FILESIZE)) Mean FILESIZEMEAN = int32 (mean (FILESIZE)) Variance int32 (var (FILESIZE)) iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. int32 (iqr (FILESIZE)) Standard deviation For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. -- Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations Luis Bolinches Lab Services http://www-03.ibm.com/systems/services/labservices/ IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland Phone: +358 503112585 "If you continually give you will continually have." Anonymous ----- Original message ----- From: Stef Coene > Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list > Cc: Subject: Re: [gpfsug-discuss] Blocksize Date: Thu, Sep 22, 2016 10:30 PM On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. So I calculated with IBM that a 1 MB or 2 MB block size is best. But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Ellei edell? ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From makaplan at us.ibm.com Sat Sep 24 17:18:11 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Sat, 24 Sep 2016 12:18:11 -0400 Subject: [gpfsug-discuss] Blocksize and MetaData Blocksizes - FORGET the old advice In-Reply-To: References: <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: Metadata is inodes, directories, indirect blocks (indices). Spectrum Scale (GPFS) Version 4.1 introduced significant improvements to the data structures used to represent directories. Larger inodes supporting data and extended attributes in the inode are other significant relatively recent improvements. Now small directories are stored in the inode, while for large directories blocks can be bigger than 32MB, and any and all directory blocks that are smaller than the metadata-blocksize, are allocated just like "fragments" - so directories are now space efficient. SO MUCH SO, that THE OLD ADVICE, about using smallish blocksizes for metadata, GOES "OUT THE WINDOW". Period. FORGET most of what you thought you knew about "best" or "optimal" metadata-blocksize. The new advice is, as Sven wrote: Use a blocksize that optimizes IO transfer efficiency and speed. This is true for BOTH data and metadata. Now, IF you have system pool set up as metadata only AND system pool is on devices that have a different "optimal" block size than your other pools, THEN, it may make sense to use two different blocksizes, one for data and another for metadata. For example, maybe you have massively striped RAID or RAID-LIKE (GSS or ESS)) storage for huge files - so maybe 8MB is a good blocksize for that. But maybe you have your metadata on SSD devices and maybe 1MB is the "best" blocksize for that. -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Sat Sep 24 18:31:37 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Sat, 24 Sep 2016 13:31:37 -0400 Subject: [gpfsug-discuss] Blocksize - consider IO transfer efficiency above your other prejudices In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org><17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: (I can answer your basic questions, Sven has more experience with tuning very large file systems, so perhaps he will have more to say...) 1. Inodes are packed into the file of inodes. (There is one file of all the inodes in a filesystem). If you have metadata-blocksize 1MB you will have 256 of 4KB inodes per block. Forget about sub-blocks when it comes to the file of inodes. 2. IF a file's data fits in its inode, then migrating that file from one pool to another just changes the preferred pool name in the inode. No data movement. Should the file later "grow" to require a data block, that data block will be allocated from whatever pool is named in the inode at that time. See the email I posted earlier today. Basically: FORGET what you thought you knew about optimal metadata blocksize (perhaps based on how you thought metadata was laid out on disk) and just stick to optimal IO transfer blocksizes. Yes, there may be contrived scenarios or even a few real live special cases, but those would be few and far between. Try following the newer general, easier, rule and see how well it works. 
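To make point 2 concrete, here is a minimal sketch in the spirit of Kevin's scenario (the device gpfs0, the pool names 'data' and 'capacity', and the file path are placeholders, not names taken from the thread):

  cat > capacity.pol <<'EOF'
  /* move files not accessed for roughly six months to the capacity pool */
  RULE 'toCapacity' MIGRATE FROM POOL 'data' TO POOL 'capacity'
       WHERE (DAYS(CURRENT_TIMESTAMP) - DAYS(ACCESS_TIME)) > 180
  EOF

  # -I test evaluates the rule and reports candidates without moving anything
  mmapplypolicy gpfs0 -P capacity.pol -I test -L 2

  # for a small file whose data lives in its 4K inode, a real run only changes the
  # assigned pool name shown here; no data block is allocated until the file
  # outgrows the inode
  mmlsattr -L /gpfs0/some/small/file

That is, the "migration" of an in-inode file is just the pool-name update described in point 2.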
From: "Buterbaugh, Kevin L" To: gpfsug main discussion list Date: 09/24/2016 10:19 AM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very interested in learning the rationale behind this as I am currently looking at all aspects of our current GPFS configuration and the possibility of making major changes. If you have a filesystem with only metadataOnly disks in the system pool and the default size of an inode is 4K (which we would do, since we have recently discovered that even on our scratch filesystem we have a bazillion files that are 4K or smaller and could therefore have their data stored in the inode, right?), then why would you set the metadata block size to anything larger than 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block size for metadata wouldn?t you be wasting a massive amount of space? What am I missing / confused about there? Oh, and here?s a related question ? let?s just say I have the above configuration ? my system pool is metadata only and is on SSD?s. Then I have two other dataOnly pools that are spinning disk. One is for ?regular? access and the other is the ?capacity? pool ? i.e. a pool of slower storage where we move files with large access times. I have a policy that says something like ?move all files with an access time > 6 months to the capacity pool.? Of those bazillion files less than 4K in size that are fitting in the inode currently, probably half a bazillion () of them would be subject to that rule. Will they get moved to the spinning disk capacity pool or will they stay in the inode?? Thanks! This is a very timely and interesting discussion for me as well... Kevin On Sep 23, 2016, at 4:35 PM, Sven Oehme wrote: your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. Sven ------------------------------------------ Sven Oehme Scalable Storage Research email: oehmes at us.ibm.com Phone: +1 (408) 824-8904 IBM Almaden Research Lab ------------------------------------------ Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengt From: Stephen Ulmer To: gpfsug main discussion list Date: 09/23/2016 12:16 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). 
So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. -- Stephen On Sep 22, 2016, at 3:37 PM, Luis Bolinches wrote: Hi My 2 cents. Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) POSIX find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out GPFS cd /usr/lpp/mmfs/samples/ilm gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile ./mmfind /gpfs/shared -ls -type f > find_ls_files.out CONVERT to CSV POSIX cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv GPFS cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv LOAD in octave FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); Clean the second column (OPTIONAL as the next clean up will do the same) FILESIZE(:,[2]) = []; If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) FILESIZE(FILESIZE<=3584) =[]; If we are not we need to clean the 0 size files FILESIZE(FILESIZE==0) =[]; Median FILESIZEMEDIAN = int32 (median (FILESIZE)) Mean FILESIZEMEAN = int32 (mean (FILESIZE)) Variance int32 (var (FILESIZE)) iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. int32 (iqr (FILESIZE)) Standard deviation For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. -- Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations Luis Bolinches Lab Services http://www-03.ibm.com/systems/services/labservices/ IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland Phone: +358 503112585 "If you continually give you will continually have." Anonymous ----- Original message ----- From: Stef Coene Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list Cc: Subject: Re: [gpfsug-discuss] Blocksize Date: Thu, Sep 22, 2016 10:30 PM On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. So I calculated with IBM that a 1 MB or 2 MB block size is best. 
But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Ellei edell? ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From stef.coene at docum.org Sat Sep 24 19:16:49 2016 From: stef.coene at docum.org (Stef Coene) Date: Sat, 24 Sep 2016 20:16:49 +0200 Subject: [gpfsug-discuss] Maximum NSD size Message-ID: <239fd428-3544-917d-5439-d40ea36f0668@docum.org> Hi, When formatting the NDS for a new file system, I noticed a warning about a maximum size: Formatting file system ... Disks up to size 8.8 TB can be added to storage pool system. Disks up to size 9.0 TB can be added to storage pool V5000. I searched the docs, but I couldn't find any reference regarding the maximum size of NSDs? Stef From oehmes at gmail.com Sun Sep 25 17:25:40 2016 From: oehmes at gmail.com (Sven Oehme) Date: Sun, 25 Sep 2016 16:25:40 +0000 Subject: [gpfsug-discuss] Maximum NSD size In-Reply-To: <239fd428-3544-917d-5439-d40ea36f0668@docum.org> References: <239fd428-3544-917d-5439-d40ea36f0668@docum.org> Message-ID: the limit you see above is NOT the max NSD limit for Scale/GPFS, its rather the limit of the NSD size you can add to this Filesystems pool. depending on which version of code you are running, we limit the maximum size of a NSD that can be added to a pool so you don't have mixtures of lets say 1 TB and 100 TB disks in one pool as this will negatively affect performance. in older versions we where more restrictive than in newer versions. Sven On Sat, Sep 24, 2016 at 11:16 AM Stef Coene wrote: > Hi, > > When formatting the NDS for a new file system, I noticed a warning about > a maximum size: > > Formatting file system ... > Disks up to size 8.8 TB can be added to storage pool system. > Disks up to size 9.0 TB can be added to storage pool V5000. > > I searched the docs, but I couldn't find any reference regarding the > maximum size of NSDs? > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From oehmes at gmail.com Sun Sep 25 18:11:12 2016 From: oehmes at gmail.com (Sven Oehme) Date: Sun, 25 Sep 2016 17:11:12 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: well, its not that easy and there is no perfect answer here. so lets start with some data points that might help decide: inodes, directory blocks, allocation maps for data as well as metadata don't follow the same restrictions as data 'fragments' or subblocks, means they are not bond to the 1/32 of the blocksize. they rather get organized on calculated sized blocks which can be very small (significant smaller than 1/32th) or close to the max of the blocksize for a single object. therefore the space waste concern doesn't really apply here. policy scans loves larger blocks as the blocks will be randomly scattered across the NSD's and therefore larger contiguous blocks for inode scan will perform significantly faster on larger metadata blocksizes than on smaller (assuming this is disk, with SSD's this doesn't matter that much) so for disk based systems it is advantageous to use larger blocks , for SSD based its less of an issue. you shouldn't choose on the other hand too large blocks even for disk drive based systems as there is one catch to all this. small updates on metadata typically end up writing the whole metadata block e.g. 256k for a directory block which now need to be destaged and read back from another node changing the same block. hope this helps. Sven On Sat, Sep 24, 2016 at 7:18 AM Buterbaugh, Kevin L < Kevin.Buterbaugh at vanderbilt.edu> wrote: > Hi Sven, > > I am confused by your statement that the metadata block size should be 1 > MB and am very interested in learning the rationale behind this as I am > currently looking at all aspects of our current GPFS configuration and the > possibility of making major changes. > > If you have a filesystem with only metadataOnly disks in the system pool > and the default size of an inode is 4K (which we would do, since we have > recently discovered that even on our scratch filesystem we have a bazillion > files that are 4K or smaller and could therefore have their data stored in > the inode, right?), then why would you set the metadata block size to > anything larger than 128K when a sub-block is 1/32nd of a block? I.e., > with a 1 MB block size for metadata wouldn?t you be wasting a massive > amount of space? > > What am I missing / confused about there? > > Oh, and here?s a related question ? let?s just say I have the above > configuration ? my system pool is metadata only and is on SSD?s. Then I > have two other dataOnly pools that are spinning disk. One is for ?regular? > access and the other is the ?capacity? pool ? i.e. a pool of slower storage > where we move files with large access times. I have a policy that says > something like ?move all files with an access time > 6 months to the > capacity pool.? Of those bazillion files less than 4K in size that are > fitting in the inode currently, probably half a bazillion () of them > would be subject to that rule. Will they get moved to the spinning disk > capacity pool or will they stay in the inode?? > > Thanks! This is a very timely and interesting discussion for me as well... 
> > Kevin > > On Sep 23, 2016, at 4:35 PM, Sven Oehme wrote: > > your metadata block size these days should be 1 MB and there are only very > few workloads for which you should run with a filesystem blocksize below 1 > MB. so if you don't know exactly what to pick, 1 MB is a good starting > point. > the general rule still applies that your filesystem blocksize (metadata or > data pool) should match your raid controller (or GNR vdisk) stripe size of > the particular pool. > > so if you use a 128k strip size(defaut in many midrange storage > controllers) in a 8+2p raid array, your stripe or track size is 1 MB and > therefore the blocksize of this pool should be 1 MB. i see many customers > in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or > above and your performance will be significant impacted by that. > > Sven > > ------------------------------------------ > Sven Oehme > Scalable Storage Research > email: oehmes at us.ibm.com > Phone: +1 (408) 824-8904 > IBM Almaden Research Lab > ------------------------------------------ > > Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too > pedantic, but I believe the the subblock size is 1/32 of the block size > (which strengt > > > > From: Stephen Ulmer > To: gpfsug main discussion list > Date: 09/23/2016 12:16 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------ > > > > Not to be too pedantic, but I believe the the subblock size is 1/32 of the > block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but about > metadata block size. You can specify that the system pool have a different > block size from the rest of the filesystem, providing that it ONLY holds > metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems without > some counter-indication), I would think that we?d want to use a metadata > block size of 4K*32=128K. This is independent of the regular block size, > which you can calculate based on the workload if you?re lucky. > > There could be a great reason NOT to use 128K metadata block size, but I > don?t know what it is. I?d be happy to be corrected about this if it?s out > of whack. > > -- > Stephen > > > > On Sep 22, 2016, at 3:37 PM, Luis Bolinches < > *luis.bolinches at fi.ibm.com* > wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small > files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest > that you will actually benefit from smaller than 1MB block, leave there. > GPFS uses sublocks where 1/16th of the BS can be allocated to different > files, so the "waste" is much less than you think on 1MB and you get the > throughput and less structures of much more data blocks. > > No* warranty at all* but I try to do this when the BS talk comes > in: (might need some clean up it could not be last note but you get the > idea) > > POSIX > find . 
-type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the > same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to > inodes (WELL not exactly ... extended attributes! so maybe use a lower > number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and > lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful > machine to run the calculations on octave, I never hit anything could not > manage on a 64GB RAM Power box. Most of the times it is enough with my > laptop. > > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / > Salutations > > Luis Bolinches > Lab Services > *http://www-03.ibm.com/systems/services/labservices/* > > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene <*stef.coene at docum.org* > > Sent by: *gpfsug-discuss-bounces at spectrumscale.org* > > To: gpfsug main discussion list <*gpfsug-discuss at spectrumscale.org* > > > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very > small > > files in the inode. > > > > Typically you want your average file size to be your blocksize > and your > > filesystem has one blocksize and one inodesize. > > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. > > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at *spectrumscale.org* > *http://gpfsug.org/mailman/listinfo/gpfsug-discuss* > > > > > Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at *spectrumscale.org* > *http://gpfsug.org/mailman/listinfo/gpfsug-discuss* > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From alandhae at gmx.de Mon Sep 26 08:53:48 2016 From: alandhae at gmx.de (=?ISO-8859-15?Q?Andreas_Landh=E4u=DFer?=) Date: Mon, 26 Sep 2016 09:53:48 +0200 (CEST) Subject: [gpfsug-discuss] File-Access Reporting Message-ID: hello all GPFS 'ehmm Spectrum Scale experts out there, we are using GPFS as a Filesystem for a new Data Application. They have defined the need to get reports about: Transfer volume [or file access]: by user, ..., by service, by product type ... at least on a daily basis. they need a report about: fileopen, fileclose, or requestEndTime, requestDuration, fileProductName [path and filename], dataSize. userId. I could think of, using sysstat (sar) for getting some of the numbers, but not being sure, if the numbers we will be receiving are correct. Andreas -- Andreas Landh?u?er +49 151 12133027 (mobile) alandhae at gmx.de From alandhae at gmx.de Mon Sep 26 13:12:18 2016 From: alandhae at gmx.de (=?ISO-8859-15?Q?Andreas_Landh=E4u=DFer?=) Date: Mon, 26 Sep 2016 14:12:18 +0200 (CEST) Subject: [gpfsug-discuss] File_heat for GPFS File Systems Questions over Questions ... Message-ID: Hello GPFS experts, customer wanting a report about the usage of the usage including file_heat in a large Filesystem. The report should be taken every month. mmchconfig fileHeatLossPercent=10,fileHeatPeriodMinutes=30240 -i fileHeatPeriodMinutes=30240 equals to 21 days. I#m wondering about the behavior of fileHeatLossPercent. - If it is set to 10, will file_heat decrease from 1 to 0 in 10 steps? - Or does file_heat have an asymptotic behavior, and heat 0 will never be reached? Anyways the results will be similar ;-) latter taking longer. We want to achieve following file lists: - File_Heat > 50% -> rather hot data - File_Heat 50% < x < 20 -> lukewarm data - File_Heat 20% <= x <= 0% -> ice cold data We will have to work on the limits between the File_Heat classes, depending on customers wishes. Are there better parameter settings for achieving this? Do any scripts/programs exist for analyzing the file_heat data? We have observed when taking policy runs on a large GPFS file system, the meta data performance significantly dropped, until job was finished. It took about 15 minutes on a 880 TB with 150 Mio entries GPFS file system. How is the behavior, when file_heat is being switched on? Do all files in the GPFS have the same temperature? 
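One way to get those three buckets is a set of LIST rules keyed on FILE_HEAT. Note that FILE_HEAT is an absolute number rather than a percentage, so the cut-offs in this sketch are invented values that would have to be calibrated against your own file system (for instance from a first run that only SHOWs the heat values); gpfs0 is a placeholder device:

  cat > heat.pol <<'EOF'
  RULE 'hot'  LIST 'hot'      SHOW('heat=' || varchar(FILE_HEAT)) WHERE FILE_HEAT >= 5.0
  RULE 'warm' LIST 'lukewarm' SHOW('heat=' || varchar(FILE_HEAT)) WHERE FILE_HEAT >= 1.0 AND FILE_HEAT < 5.0
  RULE 'cold' LIST 'cold'     SHOW('heat=' || varchar(FILE_HEAT)) WHERE FILE_HEAT < 1.0
  EOF

  # dry run: evaluate the rules and print the matching files with their heat values
  mmapplypolicy gpfs0 -P heat.pol -I test -L 2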
Thanks for your help Ciao Andreas -- Andreas Landh?u?er +49 151 12133027 (mobile) alandhae at gmx.de From makaplan at us.ibm.com Mon Sep 26 16:11:52 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Mon, 26 Sep 2016 11:11:52 -0400 Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: References: Message-ID: fileHeatLossPercent=10, fileHeatPeriodMinutes=1440 means any file that has not been accessed for 1440 minutes (24 hours = 1 day) will lose 10% of its Heat. So if it's heat was X at noon today, tomorrow 0.90 X, the next day 0.81X, on the k'th day (.90)**k * X. After 63 fileHeatPeriods, we always round down and compute file heat as 0.0. The computation (in floating point with some approximations) is done "on demand" based on a heat value stored in the Inode the last time the unix access "atime" and the current time. So the cost of maintaining FILE_HEAT for a file is some bit twiddling, but only when the file is accessed and the atime would be updated in the inode anyway. File heat increases by approximately 1.0 each time the entire file is read from disk. This is done proportionately so if you read in half of the blocks the increase is 0.5. If you read all the blocks twice FROM DISK the file heat is increased by 2. And so on. But only IOPs are charged. If you repeatedly do posix read()s but the data is in cache, no heat is added. The easiest way to observe FILE_HEAT is with the mmapplypolicy directory -I test -L 2 -P fileheatrule.policy RULE 'fileheatrule' LIST 'hot' SHOW('Heat=' || varchar(FILE_HEAT)) /* in file fileheatfule.policy */ Because policy reads metadata from inodes as stored on disk, when experimenting/testing you may need to mmfsctl fs suspend-write; mmfsctl fs resume to see results immediately. From: Andreas Landh?u?er To: gpfsug-discuss at spectrumscale.org Date: 09/26/2016 08:12 AM Subject: [gpfsug-discuss] File_heat for GPFS File Systems Questions over Questions ... Sent by: gpfsug-discuss-bounces at spectrumscale.org Hello GPFS experts, customer wanting a report about the usage of the usage including file_heat in a large Filesystem. The report should be taken every month. mmchconfig fileHeatLossPercent=10,fileHeatPeriodMinutes=30240 -i fileHeatPeriodMinutes=30240 equals to 21 days. I#m wondering about the behavior of fileHeatLossPercent. - If it is set to 10, will file_heat decrease from 1 to 0 in 10 steps? - Or does file_heat have an asymptotic behavior, and heat 0 will never be reached? Anyways the results will be similar ;-) latter taking longer. We want to achieve following file lists: - File_Heat > 50% -> rather hot data - File_Heat 50% < x < 20 -> lukewarm data - File_Heat 20% <= x <= 0% -> ice cold data We will have to work on the limits between the File_Heat classes, depending on customers wishes. Are there better parameter settings for achieving this? Do any scripts/programs exist for analyzing the file_heat data? We have observed when taking policy runs on a large GPFS file system, the meta data performance significantly dropped, until job was finished. It took about 15 minutes on a 880 TB with 150 Mio entries GPFS file system. How is the behavior, when file_heat is being switched on? Do all files in the GPFS have the same temperature? 
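The decay described above is easy to tabulate; a quick sketch with the 10%-per-period example (pure arithmetic, the starting heat of 1.0 is arbitrary):

  awk 'BEGIN { h = 1.0
               for (k = 0; k <= 21; k++) { printf "period %2d  heat %.4f\n", k, h; h *= 0.90 } }'

So the value decays geometrically rather than in ten linear steps, and per the note above it is rounded down to 0.0 after enough periods (about 63 with a 10% loss) instead of approaching zero forever.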
Thanks for your help Ciao Andreas -- Andreas Landh?u?er +49 151 12133027 (mobile) alandhae at gmx.de _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Mon Sep 26 19:18:15 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Mon, 26 Sep 2016 11:18:15 -0700 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org><17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: It's important to understand the differences between different metadata types, in particular where it comes to space allocation. System metadata files (inode file, inode and block allocation maps, ACL file, fileset metadata file, EA file in older versions) are allocated at well-defined moments (file system format, new storage pool creation in the case of block allocation map, etc), and those contain multiple records packed into a single block. From the block allocator point of view, the individual metadata record size is invisible, only larger blocks get actually allocated, and space usage efficiency generally isn't an issue. For user metadata (indirect blocks, directory blocks, EA overflow blocks) the situation is different. Those get allocated as the need arises, generally one at a time. So the size of an individual metadata structure matters, a lot. The smallest unit of allocation in GPFS is a subblock (1/32nd of a block). If an IB or a directory block is smaller than a subblock, the unused space in the subblock is wasted. So if one chooses to use, say, 16 MiB block size for metadata, the smallest unit of space that can be allocated is 512 KiB. If one chooses 1 MiB block size, the smallest allocation unit is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with any reasonable data block size); directory blocks used to be limited to 32 KiB, but in the current code can be as large as 256 KiB. As one can observe, using 16 MiB metadata block size would lead to a considerable amount of wasted space for IBs and large directories (small directories can live in inodes). On the other hand, with 1 MiB block size, there'll be no wasted metadata space. Does any of this actually make a practical difference? That depends on the file system composition, namely the number of IBs (which is a function of the number of large files) and larger directories. Calculating this scientifically can be pretty involved, and really should be the job of a tool that ought to exist, but doesn't (yet). A more practical approach is doing a ballpark estimate using local file counts and typical fractions of large files and directories, using statistics available from published papers. The performance implications of a given metadata block size choice is a subject of nearly infinite depth, and this question ultimately can only be answered by doing experiments with a specific workload on specific hardware. The metadata space utilization efficiency is something that can be answered conclusively though. 
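That space argument can be sanity-checked with a quick back-of-the-envelope loop (the candidate block sizes are arbitrary; 32 KiB is the indirect-block size quoted above, and gpfs0 is a placeholder device):

  for bs_kib in 256 1024 4096 16384; do
      sub=$(( bs_kib / 32 ))                  # subblock = 1/32 of the block size, in KiB
      waste=$(( sub > 32 ? sub - 32 : 0 ))    # unused KiB when a 32 KiB indirect block occupies one subblock
      echo "metadata blocksize ${bs_kib} KiB -> subblock ${sub} KiB -> ~${waste} KiB wasted per indirect block"
  done

  # the actual minimum fragment (subblock) size of an existing file system:
  mmlsfs gpfs0 -f

With a 1 MiB metadata block the 32 KiB indirect block fills its subblock exactly (no waste); at 16 MiB roughly 480 KiB per indirect block goes unused, which is the 512 KiB minimum-allocation case described above.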
yuri From: "Buterbaugh, Kevin L" To: gpfsug main discussion list , Date: 09/24/2016 07:19 AM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very interested in learning the rationale behind this as I am currently looking at all aspects of our current GPFS configuration and the possibility of making major changes. If you have a filesystem with only metadataOnly disks in the system pool and the default size of an inode is 4K (which we would do, since we have recently discovered that even on our scratch filesystem we have a bazillion files that are 4K or smaller and could therefore have their data stored in the inode, right?), then why would you set the metadata block size to anything larger than 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block size for metadata wouldn?t you be wasting a massive amount of space? What am I missing / confused about there? Oh, and here?s a related question ? let?s just say I have the above configuration ? my system pool is metadata only and is on SSD?s. Then I have two other dataOnly pools that are spinning disk. One is for ?regular? access and the other is the ?capacity? pool ? i.e. a pool of slower storage where we move files with large access times. I have a policy that says something like ?move all files with an access time > 6 months to the capacity pool.? Of those bazillion files less than 4K in size that are fitting in the inode currently, probably half a bazillion () of them would be subject to that rule. Will they get moved to the spinning disk capacity pool or will they stay in the inode?? Thanks! This is a very timely and interesting discussion for me as well... Kevin On Sep 23, 2016, at 4:35 PM, Sven Oehme wrote: your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. Sven ------------------------------------------ Sven Oehme Scalable Storage Research email: oehmes at us.ibm.com Phone: +1 (408) 824-8904 IBM Almaden Research Lab ------------------------------------------ Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengt From: Stephen Ulmer To: gpfsug main discussion list Date: 09/23/2016 12:16 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). 
So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. -- Stephen On Sep 22, 2016, at 3:37 PM, Luis Bolinches < luis.bolinches at fi.ibm.com> wrote: Hi My 2 cents. Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) POSIX find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out GPFS cd /usr/lpp/mmfs/samples/ilm gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile ./mmfind /gpfs/shared -ls -type f > find_ls_files.out CONVERT to CSV POSIX cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv GPFS cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv LOAD in octave FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); Clean the second column (OPTIONAL as the next clean up will do the same) FILESIZE(:,[2]) = []; If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) FILESIZE(FILESIZE<=3584) =[]; If we are not we need to clean the 0 size files FILESIZE(FILESIZE==0) =[]; Median FILESIZEMEDIAN = int32 (median (FILESIZE)) Mean FILESIZEMEAN = int32 (mean (FILESIZE)) Variance int32 (var (FILESIZE)) iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. int32 (iqr (FILESIZE)) Standard deviation For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. -- Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations Luis Bolinches Lab Services http://www-03.ibm.com/systems/services/labservices/ IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland Phone: +358 503112585 "If you continually give you will continually have." Anonymous ----- Original message ----- From: Stef Coene Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list < gpfsug-discuss at spectrumscale.org> Cc: Subject: Re: [gpfsug-discuss] Blocksize Date: Thu, Sep 22, 2016 10:30 PM On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. 
So I calculated with IBM that a 1 MB or 2 MB block size is best. But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Ellei edell? ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From ulmer at ulmer.org Mon Sep 26 20:01:56 2016 From: ulmer at ulmer.org (Stephen Ulmer) Date: Mon, 26 Sep 2016 15:01:56 -0400 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> Now I?ve got anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: There are items OTHER THAN INODES stored as metadata in GPFS. These items have a VARIETY OF SIZES, but are packed in such a way that we should just not worry about wasted space unless we pick a LARGE metadata block size ? or if we don?t pick a ?reasonable? metadata block size after picking a ?large? file system block size that applies to both. Performance is hard, and the gain from calculating exactly the best metadata block size is much smaller than performance gains attained through code optimization. If we were to try and calculate the appropriate metadata block size we would likely be wrong anyway, since none of us get our data at the idealized physics shop that sells massless rulers and frictionless pulleys. We should probably all use a metadata block size around 1MB. Nobody has said this outright, but it?s been the example as the ?good? size at least three times in this thread. Under no circumstances should we do what many of us would have done and pick 128K, which made sense based on all of our previous education that is no longer applicable. Did I miss anything? :) Liberty, -- Stephen > On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev wrote: > > It's important to understand the differences between different metadata types, in particular where it comes to space allocation. > > System metadata files (inode file, inode and block allocation maps, ACL file, fileset metadata file, EA file in older versions) are allocated at well-defined moments (file system format, new storage pool creation in the case of block allocation map, etc), and those contain multiple records packed into a single block. 
From the block allocator point of view, the individual metadata record size is invisible, only larger blocks get actually allocated, and space usage efficiency generally isn't an issue. > > For user metadata (indirect blocks, directory blocks, EA overflow blocks) the situation is different. Those get allocated as the need arises, generally one at a time. So the size of an individual metadata structure matters, a lot. The smallest unit of allocation in GPFS is a subblock (1/32nd of a block). If an IB or a directory block is smaller than a subblock, the unused space in the subblock is wasted. So if one chooses to use, say, 16 MiB block size for metadata, the smallest unit of space that can be allocated is 512 KiB. If one chooses 1 MiB block size, the smallest allocation unit is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with any reasonable data block size); directory blocks used to be limited to 32 KiB, but in the current code can be as large as 256 KiB. As one can observe, using 16 MiB metadata block size would lead to a considerable amount of wasted space for IBs and large directories (small directories can live in inodes). On the other hand, with 1 MiB block size, there'll be no wasted metadata space. Does any of this actually make a practical difference? That depends on the file system composition, namely the number of IBs (which is a function of the number of large files) and larger directories. Calculating this scientifically can be pretty involved, and really should be the job of a tool that ought to exist, but doesn't (yet). A more practical approach is doing a ballpark estimate using local file counts and typical fractions of large files and directories, using statistics available from published papers. > > The performance implications of a given metadata block size choice is a subject of nearly infinite depth, and this question ultimately can only be answered by doing experiments with a specific workload on specific hardware. The metadata space utilization efficiency is something that can be answered conclusively though. > > yuri > > "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very int > > From: "Buterbaugh, Kevin L" > To: gpfsug main discussion list , > Date: 09/24/2016 07:19 AM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Hi Sven, > > I am confused by your statement that the metadata block size should be 1 MB and am very interested in learning the rationale behind this as I am currently looking at all aspects of our current GPFS configuration and the possibility of making major changes. > > If you have a filesystem with only metadataOnly disks in the system pool and the default size of an inode is 4K (which we would do, since we have recently discovered that even on our scratch filesystem we have a bazillion files that are 4K or smaller and could therefore have their data stored in the inode, right?), then why would you set the metadata block size to anything larger than 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block size for metadata wouldn?t you be wasting a massive amount of space? > > What am I missing / confused about there? > > Oh, and here?s a related question ? let?s just say I have the above configuration ? my system pool is metadata only and is on SSD?s. Then I have two other dataOnly pools that are spinning disk. One is for ?regular? access and the other is the ?capacity? 
pool ? i.e. a pool of slower storage where we move files with large access times. I have a policy that says something like ?move all files with an access time > 6 months to the capacity pool.? Of those bazillion files less than 4K in size that are fitting in the inode currently, probably half a bazillion () of them would be subject to that rule. Will they get moved to the spinning disk capacity pool or will they stay in the inode?? > > Thanks! This is a very timely and interesting discussion for me as well... > > Kevin > On Sep 23, 2016, at 4:35 PM, Sven Oehme > wrote: > your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. > the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. > > so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. > > Sven > > ------------------------------------------ > Sven Oehme > Scalable Storage Research > email: oehmes at us.ibm.com > Phone: +1 (408) 824-8904 > IBM Almaden Research Lab > ------------------------------------------ > > Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengt > > From: Stephen Ulmer > > To: gpfsug main discussion list > > Date: 09/23/2016 12:16 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. > > There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. > > -- > Stephen > > On Sep 22, 2016, at 3:37 PM, Luis Bolinches > wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. > > No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) > > POSIX > find . 
-type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. > > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene > > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very small > > files in the inode. > > > > Typically you want your average file size to be your blocksize and your > > filesystem has one blocksize and one inodesize. > > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. > > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Mon Sep 26 20:29:18 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Mon, 26 Sep 2016 12:29:18 -0700 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org><17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> Message-ID: I would put the net summary this way: in GPFS, the "Goldilocks zone" for metadata block size is 256K - 1M. If one plans to create a new file system using GPFS V4.2+, 1M is a sound choice. In an ideal world, block size choice shouldn't really be a choice. It's a low-level implementation detail that one day should go the way of the manual ignition timing adjustment -- something that used to be necessary in the olden days, and something that select enthusiasts like to tweak to this day, but something that's irrelevant for the overwhelming majority of the folks who just want the engine to run. There's work being done in that general direction in GPFS, but we aren't there yet. yuri From: Stephen Ulmer To: gpfsug main discussion list , Date: 09/26/2016 12:02 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Now I?ve got anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: There are items OTHER THAN INODES stored as metadata in GPFS. These items have a VARIETY OF SIZES, but are packed in such a way that we should just not worry about wasted space unless we pick a LARGE metadata block size ? or if we don?t pick a ?reasonable? metadata block size after picking a ?large? file system block size that applies to both. Performance is hard, and the gain from calculating exactly the best metadata block size is much smaller than performance gains attained through code optimization. If we were to try and calculate the appropriate metadata block size we would likely be wrong anyway, since none of us get our data at the idealized physics shop that sells massless rulers and frictionless pulleys. We should probably all use a metadata block size around 1MB. Nobody has said this outright, but it?s been the example as the ?good? size at least three times in this thread. Under no circumstances should we do what many of us would have done and pick 128K, which made sense based on all of our previous education that is no longer applicable. Did I miss anything? 
:) Liberty, -- Stephen On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev wrote: It's important to understand the differences between different metadata types, in particular where it comes to space allocation. System metadata files (inode file, inode and block allocation maps, ACL file, fileset metadata file, EA file in older versions) are allocated at well-defined moments (file system format, new storage pool creation in the case of block allocation map, etc), and those contain multiple records packed into a single block. From the block allocator point of view, the individual metadata record size is invisible, only larger blocks get actually allocated, and space usage efficiency generally isn't an issue. For user metadata (indirect blocks, directory blocks, EA overflow blocks) the situation is different. Those get allocated as the need arises, generally one at a time. So the size of an individual metadata structure matters, a lot. The smallest unit of allocation in GPFS is a subblock (1/32nd of a block). If an IB or a directory block is smaller than a subblock, the unused space in the subblock is wasted. So if one chooses to use, say, 16 MiB block size for metadata, the smallest unit of space that can be allocated is 512 KiB. If one chooses 1 MiB block size, the smallest allocation unit is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with any reasonable data block size); directory blocks used to be limited to 32 KiB, but in the current code can be as large as 256 KiB. As one can observe, using 16 MiB metadata block size would lead to a considerable amount of wasted space for IBs and large directories (small directories can live in inodes). On the other hand, with 1 MiB block size, there'll be no wasted metadata space. Does any of this actually make a practical difference? That depends on the file system composition, namely the number of IBs (which is a function of the number of large files) and larger directories. Calculating this scientifically can be pretty involved, and really should be the job of a tool that ought to exist, but doesn't (yet). A more practical approach is doing a ballpark estimate using local file counts and typical fractions of large files and directories, using statistics available from published papers. The performance implications of a given metadata block size choice is a subject of nearly infinite depth, and this question ultimately can only be answered by doing experiments with a specific workload on specific hardware. The metadata space utilization efficiency is something that can be answered conclusively though. yuri "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very int From: "Buterbaugh, Kevin L" To: gpfsug main discussion list , Date: 09/24/2016 07:19 AM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very interested in learning the rationale behind this as I am currently looking at all aspects of our current GPFS configuration and the possibility of making major changes. 
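Pulling the guidance in this thread together into one hedged sketch -- a 1 MiB "Goldilocks" metadata block size plus 4 KiB inodes -- an mmcrfs invocation might look roughly like the lines below. The file system name, stanza file and mount point are placeholders, and the option list should be checked against the mmcrfs documentation for your release:

# Sketch only. nsd.stanza is assumed to put metadataOnly NSDs in the system pool
# (see the stanza example later in the thread), which is what allows a separate
# --metadata-block-size. -B is the data block size, often matched to the RAID
# full-stripe size; -i 4096 gives 4K inodes so small files and xattrs can live
# in the inode; -m/-r set the default metadata/data replica counts (site choice).
mmcrfs gpfs1 -F /tmp/nsd.stanza -B 2M --metadata-block-size 1M -i 4096 -m 2 -r 2 -T /gpfs/gpfs1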
If you have a filesystem with only metadataOnly disks in the system pool and the default size of an inode is 4K (which we would do, since we have recently discovered that even on our scratch filesystem we have a bazillion files that are 4K or smaller and could therefore have their data stored in the inode, right?), then why would you set the metadata block size to anything larger than 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block size for metadata wouldn?t you be wasting a massive amount of space? What am I missing / confused about there? Oh, and here?s a related question ? let?s just say I have the above configuration ? my system pool is metadata only and is on SSD?s. Then I have two other dataOnly pools that are spinning disk. One is for ?regular? access and the other is the ?capacity? pool ? i.e. a pool of slower storage where we move files with large access times. I have a policy that says something like ?move all files with an access time > 6 months to the capacity pool.? Of those bazillion files less than 4K in size that are fitting in the inode currently, probably half a bazillion () of them would be subject to that rule. Will they get moved to the spinning disk capacity pool or will they stay in the inode?? Thanks! This is a very timely and interesting discussion for me as well... Kevin On Sep 23, 2016, at 4:35 PM, Sven Oehme < oehmes at us.ibm.com> wrote: your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. Sven ------------------------------------------ Sven Oehme Scalable Storage Research email: oehmes at us.ibm.com Phone: +1 (408) 824-8904 IBM Almaden Research Lab ------------------------------------------ Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengt From: Stephen Ulmer To: gpfsug main discussion list < gpfsug-discuss at spectrumscale.org> Date: 09/23/2016 12:16 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. 
I?d be happy to be corrected about this if it?s out of whack. -- Stephen On Sep 22, 2016, at 3:37 PM, Luis Bolinches < luis.bolinches at fi.ibm.com> wrote: Hi My 2 cents. Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) POSIX find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out GPFS cd /usr/lpp/mmfs/samples/ilm gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile ./mmfind /gpfs/shared -ls -type f > find_ls_files.out CONVERT to CSV POSIX cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv GPFS cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv LOAD in octave FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); Clean the second column (OPTIONAL as the next clean up will do the same) FILESIZE(:,[2]) = []; If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) FILESIZE(FILESIZE<=3584) =[]; If we are not we need to clean the 0 size files FILESIZE(FILESIZE==0) =[]; Median FILESIZEMEDIAN = int32 (median (FILESIZE)) Mean FILESIZEMEAN = int32 (mean (FILESIZE)) Variance int32 (var (FILESIZE)) iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. int32 (iqr (FILESIZE)) Standard deviation For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. -- Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations Luis Bolinches Lab Services http://www-03.ibm.com/systems/services/labservices/ IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland Phone: +358 503112585 "If you continually give you will continually have." Anonymous ----- Original message ----- From: Stef Coene < stef.coene at docum.org> Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list < gpfsug-discuss at spectrumscale.org> Cc: Subject: Re: [gpfsug-discuss] Blocksize Date: Thu, Sep 22, 2016 10:30 PM On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. So I calculated with IBM that a 1 MB or 2 MB block size is best. But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From alandhae at gmx.de Tue Sep 27 10:04:02 2016 From: alandhae at gmx.de (=?ISO-8859-15?Q?Andreas_Landh=E4u=DFer?=) Date: Tue, 27 Sep 2016 11:04:02 +0200 (CEST) Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: References: Message-ID: On Mon, 26 Sep 2016, Marc A Kaplan wrote: Marc, thanks for your explanation, > fileHeatLossPercent=10, fileHeatPeriodMinutes=1440 > > means any file that has not been accessed for 1440 minutes (24 hours = 1 > day) will lose 10% of its Heat. > > So if it's heat was X at noon today, tomorrow 0.90 X, the next day 0.81X, > on the k'th day (.90)**k * X. > After 63 fileHeatPeriods, we always round down and compute file heat as > 0.0. > > The computation (in floating point with some approximations) is done "on > demand" based on a heat value stored in the Inode the last time the unix > access "atime" and the current time. So the cost of maintaining > FILE_HEAT for a file is some bit twiddling, but only when the file is > accessed and the atime would be updated in the inode anyway. > > File heat increases by approximately 1.0 each time the entire file is read > from disk. This is done proportionately so if you read in half of the > blocks the increase is 0.5. > If you read all the blocks twice FROM DISK the file heat is increased by > 2. And so on. But only IOPs are charged. If you repeatedly do posix > read()s but the data is in cache, no heat is added. with the above definition file heat >= 0.0 e.g. any positive floating point value is valid. I need to categorize the files into categories hot, warm, lukewarm and cold. How do I achieve this, since the maximum heat is varying and need to be defined every time when requesting the report. 
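One pragmatic way to turn the open-ended FILE_HEAT value into hot/warm/lukewarm/cold buckets is a set of LIST rules with hand-picked cut-offs, run report-only in the same way as the fileheatrule example quoted a little further on. This is only a hedged sketch -- the numeric thresholds and paths are invented and would need tuning against the heat values actually observed on the file system:

# File heat must already be enabled, e.g. (parameter names as in Marc's note above):
#   mmchconfig fileHeatPeriodMinutes=1440,fileHeatLossPercent=10
cat > /tmp/heat-buckets.policy <<'EOF'
/* sketch: the cut-offs below are arbitrary examples, not recommendations */
RULE 'hot'      LIST 'hot'      SHOW('heat=' || varchar(FILE_HEAT)) WHERE FILE_HEAT >= 5.0
RULE 'warm'     LIST 'warm'     SHOW('heat=' || varchar(FILE_HEAT)) WHERE FILE_HEAT >= 1.0 AND FILE_HEAT < 5.0
RULE 'lukewarm' LIST 'lukewarm' SHOW('heat=' || varchar(FILE_HEAT)) WHERE FILE_HEAT >= 0.1 AND FILE_HEAT < 1.0
RULE 'cold'     LIST 'cold'     SHOW('heat=' || varchar(FILE_HEAT)) WHERE FILE_HEAT <  0.1
EOF
mmapplypolicy /gpfs/shared -P /tmp/heat-buckets.policy -I test -L 2

Because the absolute heat numbers drift as files cool, the alternative is to skip fixed cut-offs entirely and simply order files by WEIGHT(FILE_HEAT) (or -FILE_HEAT), which is what the THRESHOLD and GROUP POOL migration rules in the replies that follow do.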
We are wishing to migrate data according to the heat onto different storage categories (expensive --> cheap devices) > The easiest way to observe FILE_HEAT is with the mmapplypolicy directory > -I test -L 2 -P fileheatrule.policy > > RULE 'fileheatrule' LIST 'hot' SHOW('Heat=' || varchar(FILE_HEAT)) /* in > file fileheatfule.policy */ > > Because policy reads metadata from inodes as stored on disk, when > experimenting/testing you may need to > > mmfsctl fs suspend-write; mmfsctl fs resume Doing this on a production file system, a valid change request need to be filed, and description of the risks for customers data and so on have to be defined (ITIL) ... Any help and ideas will be appreciated Andreas -- Andreas Landh?u?er +49 151 12133027 (mobile) alandhae at gmx.de From makaplan at us.ibm.com Tue Sep 27 15:25:04 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Tue, 27 Sep 2016 10:25:04 -0400 Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: References: Message-ID: You asked ... "We are wishing to migrate data according to the heat onto different storage categories (expensive --> cheap devices)" We suggest a policy rule like this: Rule 'm' Migrate From Pool 'Expensive' To Pool 'Thrifty' Threshold(90,75) Weight(-FILE_HEAT) /* minus sign! */ Which you can interpret as: When The 'Expensive' pool is 90% or more full, Migrate the lowest heat (coldest!) files to pool 'Thrifty', until the occupancy of 'Expensive' has been reduced to 75%. The concepts of Threshold and Weight have been in the produce since the MIGRATE rule was introduced. Another concept we introduced at the same time as FILE_HEAT was GROUP POOL. We've had little feedback and very few questions about this, so either it works great or is not being used much. (Maybe both are true ;-) ) GROUP POOL migration is documented in the Information Lifecycle Management chapter along with the other elements of the policy rules. In the 4.2.1 doc we suggest you can "repack" several pools with one GROUP POOL rule and one MIGRATE rule like this: You can ?repack? a group pool by WEIGHT. Migrate files of higher weight to preferred disk pools by specifying a group pool as both the source and the target of a MIGRATE rule. rule ?grpdef? GROUP POOL ?gpool? IS ?ssd? LIMIT(90) THEN ?fast? LIMIT(85) THEN ?sata? rule ?repack? MIGRATE FROM POOL ?gpool? TO POOL ?gpool? WEIGHT(FILE_HEAT) This should rank all the files in the three pools from hottest to coldest, and migrate them as necessary (if feasible) so that 'ssd' is up to 90% full of the hottest, 'fast' is up to 85% full of the next most hot, and the coolest files will be migrated to 'sata'. -------------- next part -------------- An HTML attachment was scrubbed... URL: From Kevin.Buterbaugh at Vanderbilt.Edu Tue Sep 27 18:02:45 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Tue, 27 Sep 2016 17:02:45 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> Message-ID: <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> Yuri / Sven / anyone else who wants to jump in, First off, thank you very much for your answers. I?d like to follow up with a couple of more questions. 1) Let?s assume that our overarching goal in configuring the block size for metadata is performance from the user perspective ? i.e. how fast is an ?ls -l? on my directory? 
Space savings aren?t important, and how long policy scans or other ?administrative? type tasks take is not nearly as important as that directory listing. Does that change the recommended metadata block size? 2) Let?s assume we have 3 filesystems, /home, /scratch (traditional HPC use for those two) and /data (project space). Our storage arrays are 24-bay units with two 8+2P RAID 6 LUNs, one RAID 1 mirror, and two hot spare drives. The RAID 1 mirrors are for /home, the RAID 6 LUNs are for /scratch or /data. /home has tons of small files - so small that a 64K block size is currently used. /scratch and /data have a mixture, but a 1 MB block size is the ?sweet spot? there. If you could ?start all over? with the same hardware being the only restriction, would you: a) merge /scratch and /data into one filesystem but keep /home separate since the LUN sizes are so very different, or b) merge all three into one filesystem and use storage pools so that /home is just a separate pool within the one filesystem? And if you chose this option would you assign different block sizes to the pools? Again, I?m asking these questions because I may have the opportunity to effectively ?start all over? and want to make sure I?m doing things as optimally as possible. Thanks? Kevin On Sep 26, 2016, at 2:29 PM, Yuri L Volobuev > wrote: I would put the net summary this way: in GPFS, the "Goldilocks zone" for metadata block size is 256K - 1M. If one plans to create a new file system using GPFS V4.2+, 1M is a sound choice. In an ideal world, block size choice shouldn't really be a choice. It's a low-level implementation detail that one day should go the way of the manual ignition timing adjustment -- something that used to be necessary in the olden days, and something that select enthusiasts like to tweak to this day, but something that's irrelevant for the overwhelming majority of the folks who just want the engine to run. There's work being done in that general direction in GPFS, but we aren't there yet. yuri Stephen Ulmer ---09/26/2016 12:02:25 PM---Now I?ve got anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: From: Stephen Ulmer > To: gpfsug main discussion list >, Date: 09/26/2016 12:02 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Now I?ve got anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: * There are items OTHER THAN INODES stored as metadata in GPFS. * These items have a VARIETY OF SIZES, but are packed in such a way that we should just not worry about wasted space unless we pick a LARGE metadata block size ? or if we don?t pick a ?reasonable? metadata block size after picking a ?large? file system block size that applies to both. * Performance is hard, and the gain from calculating exactly the best metadata block size is much smaller than performance gains attained through code optimization. * If we were to try and calculate the appropriate metadata block size we would likely be wrong anyway, since none of us get our data at the idealized physics shop that sells massless rulers and frictionless pulleys. * We should probably all use a metadata block size around 1MB. Nobody has said this outright, but it?s been the example as the ?good? size at least three times in this thread. * Under no circumstances should we do what many of us would have done and pick 128K, which made sense based on all of our previous education that is no longer applicable. Did I miss anything? 
:) Liberty, -- Stephen On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev > wrote: It's important to understand the differences between different metadata types, in particular where it comes to space allocation. System metadata files (inode file, inode and block allocation maps, ACL file, fileset metadata file, EA file in older versions) are allocated at well-defined moments (file system format, new storage pool creation in the case of block allocation map, etc), and those contain multiple records packed into a single block. From the block allocator point of view, the individual metadata record size is invisible, only larger blocks get actually allocated, and space usage efficiency generally isn't an issue. For user metadata (indirect blocks, directory blocks, EA overflow blocks) the situation is different. Those get allocated as the need arises, generally one at a time. So the size of an individual metadata structure matters, a lot. The smallest unit of allocation in GPFS is a subblock (1/32nd of a block). If an IB or a directory block is smaller than a subblock, the unused space in the subblock is wasted. So if one chooses to use, say, 16 MiB block size for metadata, the smallest unit of space that can be allocated is 512 KiB. If one chooses 1 MiB block size, the smallest allocation unit is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with any reasonable data block size); directory blocks used to be limited to 32 KiB, but in the current code can be as large as 256 KiB. As one can observe, using 16 MiB metadata block size would lead to a considerable amount of wasted space for IBs and large directories (small directories can live in inodes). On the other hand, with 1 MiB block size, there'll be no wasted metadata space. Does any of this actually make a practical difference? That depends on the file system composition, namely the number of IBs (which is a function of the number of large files) and larger directories. Calculating this scientifically can be pretty involved, and really should be the job of a tool that ought to exist, but doesn't (yet). A more practical approach is doing a ballpark estimate using local file counts and typical fractions of large files and directories, using statistics available from published papers. The performance implications of a given metadata block size choice is a subject of nearly infinite depth, and this question ultimately can only be answered by doing experiments with a specific workload on specific hardware. The metadata space utilization efficiency is something that can be answered conclusively though. yuri "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very int From: "Buterbaugh, Kevin L" > To: gpfsug main discussion list >, Date: 09/24/2016 07:19 AM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very interested in learning the rationale behind this as I am currently looking at all aspects of our current GPFS configuration and the possibility of making major changes. 
If you have a filesystem with only metadataOnly disks in the system pool and the default size of an inode is 4K (which we would do, since we have recently discovered that even on our scratch filesystem we have a bazillion files that are 4K or smaller and could therefore have their data stored in the inode, right?), then why would you set the metadata block size to anything larger than 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block size for metadata wouldn?t you be wasting a massive amount of space? What am I missing / confused about there? Oh, and here?s a related question ? let?s just say I have the above configuration ? my system pool is metadata only and is on SSD?s. Then I have two other dataOnly pools that are spinning disk. One is for ?regular? access and the other is the ?capacity? pool ? i.e. a pool of slower storage where we move files with large access times. I have a policy that says something like ?move all files with an access time > 6 months to the capacity pool.? Of those bazillion files less than 4K in size that are fitting in the inode currently, probably half a bazillion () of them would be subject to that rule. Will they get moved to the spinning disk capacity pool or will they stay in the inode?? Thanks! This is a very timely and interesting discussion for me as well... Kevin On Sep 23, 2016, at 4:35 PM, Sven Oehme > wrote: your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. Sven ------------------------------------------ Sven Oehme Scalable Storage Research email: oehmes at us.ibm.com Phone: +1 (408) 824-8904 IBM Almaden Research Lab ------------------------------------------ Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengt From: Stephen Ulmer > To: gpfsug main discussion list > Date: 09/23/2016 12:16 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. 
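To spell out the RAID-stripe arithmetic behind Sven's rule of thumb quoted above:

# segment ("strip") size per data disk : 128 KiB  (a common midrange controller default)
# data disks in an 8+2P RAID-6 LUN     : 8
# full stripe = 8 x 128 KiB            = 1 MiB  -> a 1 MiB file system block fills
#                                                  exactly one stripe
# a 2 MiB stripe (say 256 KiB strips x 8) would by the same logic suggest a 2 MiB block size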
-- Stephen On Sep 22, 2016, at 3:37 PM, Luis Bolinches > wrote: Hi My 2 cents. Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) POSIX find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out GPFS cd /usr/lpp/mmfs/samples/ilm gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile ./mmfind /gpfs/shared -ls -type f > find_ls_files.out CONVERT to CSV POSIX cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv GPFS cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv LOAD in octave FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); Clean the second column (OPTIONAL as the next clean up will do the same) FILESIZE(:,[2]) = []; If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) FILESIZE(FILESIZE<=3584) =[]; If we are not we need to clean the 0 size files FILESIZE(FILESIZE==0) =[]; Median FILESIZEMEDIAN = int32 (median (FILESIZE)) Mean FILESIZEMEAN = int32 (mean (FILESIZE)) Variance int32 (var (FILESIZE)) iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. int32 (iqr (FILESIZE)) Standard deviation For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. -- Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations Luis Bolinches Lab Services http://www-03.ibm.com/systems/services/labservices/ IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland Phone: +358 503112585 "If you continually give you will continually have." Anonymous ----- Original message ----- From: Stef Coene > Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list > Cc: Subject: Re: [gpfsug-discuss] Blocksize Date: Thu, Sep 22, 2016 10:30 PM On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. So I calculated with IBM that a 1 MB or 2 MB block size is best. But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Tue Sep 27 18:16:52 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Tue, 27 Sep 2016 13:16:52 -0400 Subject: [gpfsug-discuss] Blocksize, yea, inode size! In-Reply-To: <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org><17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org><13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> Message-ID: inode size will be a crucial choice in the scenario you describe. Consider the conflict: A large inode can hold a complete file or a complete directory. But the bigger the inode size, the less that fit in any given block size -- so when you have to read several inodes ... more IO, less likely that inodes you want are in the same block. -------------- next part -------------- An HTML attachment was scrubbed... URL: From chekh at stanford.edu Tue Sep 27 18:23:34 2016 From: chekh at stanford.edu (Alex Chekholko) Date: Tue, 27 Sep 2016 10:23:34 -0700 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> Message-ID: On 09/27/2016 10:02 AM, Buterbaugh, Kevin L wrote: > 1) Let?s assume that our overarching goal in configuring the block size > for metadata is performance from the user perspective ? i.e. how fast is > an ?ls -l? on my directory? Space savings aren?t important, and how > long policy scans or other ?administrative? type tasks take is not > nearly as important as that directory listing. Does that change the > recommended metadata block size? You need to put your metadata on SSDs. Make your SSDs the only members in your 'system' pool and put your other devices into another pool, and make that pool 'dataOnly'. 
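A minimal sketch of how that metadata/data split is usually expressed in an NSD stanza file; every NSD, device and server name below is a placeholder, and failure groups need to follow the actual site layout:

# nsd.stanza -- hypothetical devices and servers
%nsd: nsd=md_ssd_01 device=/dev/mapper/ssd01 servers=nsd01,nsd02 usage=metadataOnly failureGroup=1 pool=system
%nsd: nsd=md_ssd_02 device=/dev/mapper/ssd02 servers=nsd02,nsd01 usage=metadataOnly failureGroup=2 pool=system
%nsd: nsd=data_01   device=/dev/mapper/lun01 servers=nsd01,nsd02 usage=dataOnly     failureGroup=1 pool=data
%nsd: nsd=data_02   device=/dev/mapper/lun02 servers=nsd02,nsd01 usage=dataOnly     failureGroup=2 pool=data

mmcrnsd -F nsd.stanza
# the same stanza file is then passed to mmcrfs -F for a new file system,
# or to mmadddisk for an existing one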
If your SSDs are large enough to also hold some data, that's great; I typically do a migration policy to copy files smaller than filesystem block size (or definitely smaller than sub-block size) to the SSDs. Also, files smaller than 4k will usually fit into the inode (if you are using the 4k inode size). I have a system where the SSDs are regularly doing 6-7k IOPS for metadata stuff. If those same 7k IOPS were spread out over the slow data LUNs... which only have like 100 IOPS per 8+2P LUN... I'd be consuming 700 disks just for metadata IOPS. -- Alex Chekholko chekh at stanford.edu From kevindjo at us.ibm.com Tue Sep 27 18:33:29 2016 From: kevindjo at us.ibm.com (Kevin D Johnson) Date: Tue, 27 Sep 2016 17:33:29 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> Message-ID: An HTML attachment was scrubbed... URL: From alandhae at gmx.de Tue Sep 27 19:04:06 2016 From: alandhae at gmx.de (=?UTF-8?Q?Andreas_Landh=c3=a4u=c3=9fer?=) Date: Tue, 27 Sep 2016 20:04:06 +0200 Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: References: Message-ID: as far as I understand, if a file gets hot again, there is no rule for putting the file back into a faster storage device? We would like having something like a storage elevator depending on the fileheat. In our setup, customer likes to migrate/move data even when the the threshold is not hit, just because it's cold and the price of the storage is less. On 27.09.2016 16:25, Marc A Kaplan wrote: > > You asked ... "We are wishing to migrate data according to the heat > onto different > storage categories (expensive --> cheap devices)" > > > We suggest a policy rule like this: > > Rule 'm' Migrate From Pool 'Expensive' To Pool 'Thrifty' > Threshold(90,75) Weight(-FILE_HEAT) /* minus sign! */ > > > Which you can interpret as: > > When The 'Expensive' pool is 90% or more full, Migrate the lowest heat > (coldest!) files to pool 'Thrifty', until > the occupancy of 'Expensive' has been reduced to 75%. > > The concepts of Threshold and Weight have been in the produce since > the MIGRATE rule was introduced. > > Another concept we introduced at the same time as FILE_HEAT was GROUP > POOL. We've had little feedback and very > few questions about this, so either it works great or is not being > used much. (Maybe both are true ;-) ) > > GROUP POOL migration is documented in the Information Lifecycle > Management chapter along with the other elements of the policy rules. > > In the 4.2.1 doc we suggest you can "repack" several pools with one > GROUP POOL rule and one MIGRATE rule like this: > > You can ?repack? a group pool by *WEIGHT*. Migrate files of higher > weight to preferred disk pools > by specifying a group pool as both the source and the target of a > *MIGRATE *rule. > > rule ?grpdef? GROUP POOL ?gpool? IS ?ssd? LIMIT(90) THEN ?fast? > LIMIT(85) THEN ?sata? > rule ?repack? MIGRATE FROM POOL ?gpool? TO POOL ?gpool? WEIGHT(FILE_HEAT) > > > This should rank all the files in the three pools from hottest to > coldest, and migrate them > as necessary (if feasible) so that 'ssd' is up to 90% full of the > hottest, 'fast' is up to 85% full of the next > most hot, and the coolest files will be migrated to 'sata'. > > > > -- Andreas Landh?u?er +49 151 12133027 (mobile) alandhae at gmx.de -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From Robert.Oesterlin at nuance.com Tue Sep 27 19:12:19 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Tue, 27 Sep 2016 18:12:19 +0000 Subject: [gpfsug-discuss] File_heat for GPFS File Systems Message-ID: <0217AC60-11F0-4CEB-AE91-22D25E4649DC@nuance.com> Sure, if you use a policy to migrate between two tiers, it will move files up or down based on heat. Something like this (flas and disk pools): rule grpdef GROUP POOL gpool IS flash LIMIT(75) THEN Disk rule repack MIGRATE FROM POOL gpool TO POOL gpool WEIGHT(FILE_HEAT) Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid 507-269-0413 From: on behalf of Andreas Landh?u?er Reply-To: gpfsug main discussion list Date: Tuesday, September 27, 2016 at 1:04 PM To: Marc A Kaplan , gpfsug main discussion list Subject: [EXTERNAL] Re: [gpfsug-discuss] File_heat for GPFS File Systems as far as I understand, if a file gets hot again, there is no rule for putting the file back into a faster storage device? -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Tue Sep 27 19:26:46 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Tue, 27 Sep 2016 11:26:46 -0700 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org><17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org><13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> Message-ID: > 1) Let?s assume that our overarching goal in configuring the block > size for metadata is performance from the user perspective ? i.e. > how fast is an ?ls -l? on my directory? Space savings aren?t > important, and how long policy scans or other ?administrative? type > tasks take is not nearly as important as that directory listing. > Does that change the recommended metadata block size? The performance challenges for the "ls -l" scenario are quite different from the policy scan scenario, so the same rules do not necessarily apply. During "ls -l" the code has to read inodes one by one (there's some prefetching going on, to take the edge off for the actual 'ls' thread, but prefetching is still one inode at a time). Metadata block size doesn't really come into the picture in this case, but inode size can be important -- depending on the storage performance characteristics. Does the storage you use for metadata exhibit a meaningfully different latency for 4K random reads vs 512 byte random reads? In my personal experience, on any modern storage device the difference is non-existent; in fact many devices (like all flash-based storage) use 4K native physical block size, and merely emulate 512 byte "sectors", so there's no way to read less than 4K. So from the inode read latency point of view 4K vs 512B is most likely a wash, but then 4K inodes can help improve performance of other operations, e.g. readdir of a small directory which fits entirely into the inode. If you use xattrs (e.g. as a side effect of using HSM), 4K inodes definitely help, but allowing xattrs to be stored in the inode. Policy scans reads inodes in full blocks, and there both metadata block size and inode size matter. Larger blocks could improve the inode read performance, while larger inodes mean that the same number of blocks hold fewer inodes and thus more blocks need to be read. So the policy inode scan performance is benefited by larger metadata block size and smaller inodes. 
However, policy scans also have to perform a directory traversal step, and that step tends to dominate the runtime of the overall run, and using larger inodes actually helps to speed up traversal of smaller directories. So whether larger inodes help or hurt the policy scan performance depends, yet again, on your file system composition. Overall, I believe that with all angles considered, larger inodes help with performance, and that was one of the considerations for making 4K inodes the default in V4.2+ versions. > 2) Let?s assume we have 3 filesystems, /home, /scratch (traditional > HPC use for those two) and /data (project space). Our storage > arrays are 24-bay units with two 8+2P RAID 6 LUNs, one RAID 1 > mirror, and two hot spare drives. The RAID 1 mirrors are for /home, > the RAID 6 LUNs are for /scratch or /data. /home has tons of small > files - so small that a 64K block size is currently used. /scratch > and /data have a mixture, but a 1 MB block size is the ?sweet spot? there. > > If you could ?start all over? with the same hardware being the only > restriction, would you: > > a) merge /scratch and /data into one filesystem but keep /home > separate since the LUN sizes are so very different, or > b) merge all three into one filesystem and use storage pools so that > /home is just a separate pool within the one filesystem? And if you > chose this option would you assign different block sizes to the pools? It's not possible to have different block sizes for different data pools. We are very aware that many people would like to be able to do just that, but this is counter to where the product is going. Supporting different block sizes for different pools is actually pretty hard: it's tricky to describe a large file that has some blocks in poolA and some in poolB where poolB has a different block size (perhaps during a migration) with the existing inode/indirect block format where each disk address pointer addresses a block of fixed size. With some effort, and some changes to how block addressing works, it would be possible to implement the support for this. However, as I mentioned in another post in this thread, we don't really want to glorify manual block size selection any further, we want to move away from it, by addressing the reasons that drive different block size selection today (like disk space utilization and performance). I'd recommend calculating a file size distribution histogram for your file systems. You may, for example, discover that 80% of the small files you have in /home would fit into 4K inodes, and then the storage space efficiency gains for the remaining 20% don't justify the complexity of managing an extra file system with a small block size. We don't recommend using block sizes smaller than 256K, because smaller block size is not good for disk space allocation code efficiency. It's a quadratic dependency: with a smaller block size, one block worth of the block allocation map covers that much less disk space, because each bit in the map covers fewer disk sectors, and fewer bits fit into a block. This means having to create a lot more block allocation map segments than what is needed for an ample level of parallelism. This hurts performance of many block allocation-related operations. I don't see a reason for /scratch and /data to be separate file systems, aside from perhaps failure domain considerations. 
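For the file-size distribution Yuri recommends, even a crude GNU find + awk pass over a representative subtree (a hedged sketch, with a made-up path) shows what fraction of files could live in a 4 KiB inode and where the bulk of the data sits; on very large trees the inode-scan approaches mentioned earlier in the thread (mmapplypolicy LIST rules, or the mmfind sample) get the same numbers much faster:

# power-of-two size histogram; 3584 bytes approximates the in-inode limit for 4K inodes
find /gpfs/home -xdev -type f -printf '%s\n' | awk '
  { n++
    if ($1 <= 3584) small++
    b = 0; while ((2 ^ b) < $1) b++      # smallest power of two >= file size
    if (b > maxb) maxb = b
    hist[b]++ }
  END {
    if (n == 0) { print "no files found"; exit }
    printf("files: %d   <= 3584 bytes (in-inode candidates): %d (%.1f%%)\n", n, small, 100*small/n)
    for (b = 0; b <= maxb; b++)
      if (b in hist) printf("<= %10d bytes: %d\n", 2 ^ b, hist[b])
  }'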
yuri > On Sep 26, 2016, at 2:29 PM, Yuri L Volobuev wrote: > > I would put the net summary this way: in GPFS, the "Goldilocks zone" > for metadata block size is 256K - 1M. If one plans to create a new > file system using GPFS V4.2+, 1M is a sound choice. > > In an ideal world, block size choice shouldn't really be a choice. > It's a low-level implementation detail that one day should go the > way of the manual ignition timing adjustment -- something that used > to be necessary in the olden days, and something that select > enthusiasts like to tweak to this day, but something that's > irrelevant for the overwhelming majority of the folks who just want > the engine to run. There's work being done in that general direction > in GPFS, but we aren't there yet. > > yuri > > Stephen Ulmer ---09/26/2016 12:02:25 PM---Now I?ve got > anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: > > From: Stephen Ulmer > To: gpfsug main discussion list , > Date: 09/26/2016 12:02 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Now I?ve got anther question? which I?ll let bake for a while. > > Okay, to (poorly) summarize: > There are items OTHER THAN INODES stored as metadata in GPFS. > These items have a VARIETY OF SIZES, but are packed in such a way > that we should just not worry about wasted space unless we pick a > LARGE metadata block size ? or if we don?t pick a ?reasonable? > metadata block size after picking a ?large? file system block size > that applies to both. > Performance is hard, and the gain from calculating exactly the best > metadata block size is much smaller than performance gains attained > through code optimization. > If we were to try and calculate the appropriate metadata block size > we would likely be wrong anyway, since none of us get our data at > the idealized physics shop that sells massless rulers and > frictionless pulleys. > We should probably all use a metadata block size around 1MB. Nobody > has said this outright, but it?s been the example as the ?good? size > at least three times in this thread. > Under no circumstances should we do what many of us would have done > and pick 128K, which made sense based on all of our previous > education that is no longer applicable. > > Did I miss anything? :) > > Liberty, > > -- > Stephen > > On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev wrote: > It's important to understand the differences between different > metadata types, in particular where it comes to space allocation. > > System metadata files (inode file, inode and block allocation maps, > ACL file, fileset metadata file, EA file in older versions) are > allocated at well-defined moments (file system format, new storage > pool creation in the case of block allocation map, etc), and those > contain multiple records packed into a single block. From the block > allocator point of view, the individual metadata record size is > invisible, only larger blocks get actually allocated, and space > usage efficiency generally isn't an issue. > > For user metadata (indirect blocks, directory blocks, EA overflow > blocks) the situation is different. Those get allocated as the need > arises, generally one at a time. So the size of an individual > metadata structure matters, a lot. The smallest unit of allocation > in GPFS is a subblock (1/32nd of a block). If an IB or a directory > block is smaller than a subblock, the unused space in the subblock > is wasted. 
So if one chooses to use, say, 16 MiB block size for > metadata, the smallest unit of space that can be allocated is 512 > KiB. If one chooses 1 MiB block size, the smallest allocation unit > is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with > any reasonable data block size); directory blocks used to be limited > to 32 KiB, but in the current code can be as large as 256 KiB. As > one can observe, using 16 MiB metadata block size would lead to a > considerable amount of wasted space for IBs and large directories > (small directories can live in inodes). On the other hand, with 1 > MiB block size, there'll be no wasted metadata space. Does any of > this actually make a practical difference? That depends on the file > system composition, namely the number of IBs (which is a function of > the number of large files) and larger directories. Calculating this > scientifically can be pretty involved, and really should be the job > of a tool that ought to exist, but doesn't (yet). A more practical > approach is doing a ballpark estimate using local file counts and > typical fractions of large files and directories, using statistics > available from published papers. > > The performance implications of a given metadata block size choice > is a subject of nearly infinite depth, and this question ultimately > can only be answered by doing experiments with a specific workload > on specific hardware. The metadata space utilization efficiency is > something that can be answered conclusively though. > > yuri > > "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi > Sven, I am confused by your statement that the metadata block size > should be 1 MB and am very int > > From: "Buterbaugh, Kevin L" > To: gpfsug main discussion list , > Date: 09/24/2016 07:19 AM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Hi Sven, > > I am confused by your statement that the metadata block size should > be 1 MB and am very interested in learning the rationale behind this > as I am currently looking at all aspects of our current GPFS > configuration and the possibility of making major changes. > > If you have a filesystem with only metadataOnly disks in the system > pool and the default size of an inode is 4K (which we would do, > since we have recently discovered that even on our scratch > filesystem we have a bazillion files that are 4K or smaller and > could therefore have their data stored in the inode, right?), then > why would you set the metadata block size to anything larger than > 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block > size for metadata wouldn?t you be wasting a massive amount of space? > > What am I missing / confused about there? > > Oh, and here?s a related question ? let?s just say I have the above > configuration ? my system pool is metadata only and is on SSD?s. > Then I have two other dataOnly pools that are spinning disk. One is > for ?regular? access and the other is the ?capacity? pool ? i.e. a > pool of slower storage where we move files with large access times. > I have a policy that says something like ?move all files with an > access time > 6 months to the capacity pool.? Of those bazillion > files less than 4K in size that are fitting in the inode currently, > probably half a bazillion () of them would be subject to that > rule. Will they get moved to the spinning disk capacity pool or will > they stay in the inode?? > > Thanks! This is a very timely and interesting discussion for me as well... 
> > Kevin > On Sep 23, 2016, at 4:35 PM, Sven Oehme wrote: > your metadata block size these days should be 1 MB and there are > only very few workloads for which you should run with a filesystem > blocksize below 1 MB. so if you don't know exactly what to pick, 1 > MB is a good starting point. > the general rule still applies that your filesystem blocksize > (metadata or data pool) should match your raid controller (or GNR > vdisk) stripe size of the particular pool. > > so if you use a 128k strip size(defaut in many midrange storage > controllers) in a 8+2p raid array, your stripe or track size is 1 MB > and therefore the blocksize of this pool should be 1 MB. i see many > customers in the field using 1MB or even smaller blocksize on RAID > stripes of 2 MB or above and your performance will be significant > impacted by that. > > Sven > > ------------------------------------------ > Sven Oehme > Scalable Storage Research > email: oehmes at us.ibm.com > Phone: +1 (408) 824-8904 > IBM Almaden Research Lab > ------------------------------------------ > > Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too > pedantic, but I believe the the subblock size is 1/32 of the block > size (which strengt > > From: Stephen Ulmer > To: gpfsug main discussion list > Date: 09/23/2016 12:16 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Not to be too pedantic, but I believe the the subblock size is 1/32 > of the block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but > about metadata block size. You can specify that the system pool have > a different block size from the rest of the filesystem, providing > that it ONLY holds metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems > without some counter-indication), I would think that we?d want to > use a metadata block size of 4K*32=128K. This is independent of the > regular block size, which you can calculate based on the workload if > you?re lucky. > > There could be a great reason NOT to use 128K metadata block size, > but I don?t know what it is. I?d be happy to be corrected about this > if it?s out of whack. > > -- > Stephen > On Sep 22, 2016, at 3:37 PM, Luis Bolinches wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small > files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest > that you will actually benefit from smaller than 1MB block, leave > there. GPFS uses sublocks where 1/16th of the BS can be allocated to > different files, so the "waste" is much less than you think on 1MB > and you get the throughput and less structures of much more data blocks. > > No warranty at all but I try to do this when the BS talk comes in: > (might need some clean up it could not be last note but you get the idea) > > POSIX > find . 
-type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to > inodes (WELL not exactly ... extended attributes! so maybe use a > lower number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and > lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful > machine to run the calculations on octave, I never hit anything > could not manage on a 64GB RAM Power box. Most of the times it is > enough with my laptop. > > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very small > > files in the inode. > > > > Typically you want your average file size to be your blocksize and your > > filesystem has one blocksize and one inodesize. > > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. > > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From chekh at stanford.edu Tue Sep 27 19:51:50 2016 From: chekh at stanford.edu (Alex Chekholko) Date: Tue, 27 Sep 2016 11:51:50 -0700 Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: References: Message-ID: <8c7fd395-1efc-7197-4a98-763ba784cafd@stanford.edu> On 09/27/2016 11:04 AM, Andreas Landh?u?er wrote: > if a file gets hot again, there is no rule for putting the file back > into a faster storage device? The file will get moved when you run the policy again. You can run the policy as often as you like. There is also a way to use a GPFS hook to trigger policy run. Check 'mmaddcallback' But I think you have to be careful and think through the complexity. e.g. load spikes and pool fills up and your callback kicks in and starts a migration which increases the I/O load further, etc... Regards, -- Alex Chekholko chekh at stanford.edu From makaplan at us.ibm.com Tue Sep 27 20:27:47 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Tue, 27 Sep 2016 15:27:47 -0400 Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: References: Message-ID: Read about GROUP POOL - you can call as often as you like to "repack" the files into several pools from hot to cold. Of course, there is a cost to running mmapplypolicy... So maybe you'd just run it once every day or so... -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhejtman at ics.muni.cz Tue Sep 27 20:38:16 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Tue, 27 Sep 2016 21:38:16 +0200 Subject: [gpfsug-discuss] Samba via CES Message-ID: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> Hello, does CES offer high availability of SMB? I.e., does used samba server provide cluster wide persistent handles? Or failover from node to node is currently not supported without the client disruption? -- Luk?? 
Hejtm?nek From erich at uw.edu Tue Sep 27 21:56:20 2016 From: erich at uw.edu (Eric Horst) Date: Tue, 27 Sep 2016 13:56:20 -0700 Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: <8c7fd395-1efc-7197-4a98-763ba784cafd@stanford.edu> References: <8c7fd395-1efc-7197-4a98-763ba784cafd@stanford.edu> Message-ID: >> >> if a file gets hot again, there is no rule for putting the file back >> into a faster storage device? > > > The file will get moved when you run the policy again. You can run the > policy as often as you like. I think its worth stating clearly that if a file is in the Thrifty slow pool and a user opens and reads/writes the file there is nothing that moves this file to a different tier. A policy run is the only action that relocates files. So if you apply the policy daily and over the course of the day users access many cold files, the performance accessing those cold files may not be ideal until the next day when they are repacked by heat. A file is not automatically moved to the fast tier on access read or write. I mention this because this aspect of tiering was not immediately clear from the docs when I was a neophyte GPFS admin and I had to learn by observation. It is easy for one to make an assumption that it is a more dynamic tiering system than it is. -Eric -- Eric Horst University of Washington From Kevin.Buterbaugh at Vanderbilt.Edu Tue Sep 27 22:21:23 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Tue, 27 Sep 2016 21:21:23 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> Message-ID: <4BDC0E5A-176A-48CC-8DE6-93C7B5A3F138@vanderbilt.edu> Hi All, Again, my thanks to all who responded to my last post. Let me begin by stating something I unintentionally omitted in my last post ? we already use SSDs for our metadata. Which leads me to yet another question ? of my three filesystems, two (/home and /scratch) are much older (created in 2010) and therefore currently have a 512 byte inode size. /data is newer and has a 4K inode size. Now if I combine /scratch and /data into one filesystem with a 4K inode size, the amount of space used by all the inodes coming over from /scratch is going to grow by a factor of eight unless I?m horribly confused. And I would assume I need to count the amount of space taken up by allocated inodes, not just used inodes. Therefore ? how much space my metadata takes up just grew significantly in importance since: 1) metadata is on very expensive enterprise class, vendor certified SSDs, 2) we use RAID 1 mirrors of those SSDs, and 3) we have metadata replication set to two. Some of the information presented by Sven and Yuri seems to contradict each other in regards to how much space inodes take up ? or I?m misunderstanding one or both of them! Leaving aside replication, if I use a 256K block size for my metadata and I use 4K inodes, are those inodes going to take up 4K each or are they going to take up 8K each (1/32nd of a 256K block)? By the way, I do have a file size / file age spreadsheet for each of my filesystems (which I would be willing to share with interested parties) and while I was not surprised to learn that I have over 10 million sub-1K files on /home, I was a bit surprised to find that I have almost as many sub-1K files on /scratch (and a few million more on /data). 
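To put rough numbers on the 512-byte-versus-4K worry, a back-of-the-envelope sketch (the inode count below is made up purely for illustration; mmdf reports the used, free and allocated inode counts at the end of its output, and it is the allocated inodes that consume the space):

awk 'BEGIN {
  inodes = 250 * 1000 * 1000      # allocated inodes -- made-up figure, substitute your own
  repl   = 2                      # metadata replication factor
  printf "512B inodes: %6.2f TB of metadata NSD space\n", inodes *  512 * repl / 1e12
  printf "  4K inodes: %6.2f TB of metadata NSD space\n", inodes * 4096 * repl / 1e12
}'

With RAID 1 underneath the metadata NSDs the raw SSD purchase is double those figures again, which is where the factor-of-eight growth really starts to hurt.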
So there?s a huge potential win in having those files in the inode on SSD as opposed to on spinning disk, but there?s also a huge potential $$$ cost. Thanks again ? I hope others are gaining useful information from this thread. I sure am! Kevin On Sep 27, 2016, at 1:26 PM, Yuri L Volobuev > wrote: > 1) Let?s assume that our overarching goal in configuring the block > size for metadata is performance from the user perspective ? i.e. > how fast is an ?ls -l? on my directory? Space savings aren?t > important, and how long policy scans or other ?administrative? type > tasks take is not nearly as important as that directory listing. > Does that change the recommended metadata block size? The performance challenges for the "ls -l" scenario are quite different from the policy scan scenario, so the same rules do not necessarily apply. During "ls -l" the code has to read inodes one by one (there's some prefetching going on, to take the edge off for the actual 'ls' thread, but prefetching is still one inode at a time). Metadata block size doesn't really come into the picture in this case, but inode size can be important -- depending on the storage performance characteristics. Does the storage you use for metadata exhibit a meaningfully different latency for 4K random reads vs 512 byte random reads? In my personal experience, on any modern storage device the difference is non-existent; in fact many devices (like all flash-based storage) use 4K native physical block size, and merely emulate 512 byte "sectors", so there's no way to read less than 4K. So from the inode read latency point of view 4K vs 512B is most likely a wash, but then 4K inodes can help improve performance of other operations, e.g. readdir of a small directory which fits entirely into the inode. If you use xattrs (e.g. as a side effect of using HSM), 4K inodes definitely help, but allowing xattrs to be stored in the inode. Policy scans reads inodes in full blocks, and there both metadata block size and inode size matter. Larger blocks could improve the inode read performance, while larger inodes mean that the same number of blocks hold fewer inodes and thus more blocks need to be read. So the policy inode scan performance is benefited by larger metadata block size and smaller inodes. However, policy scans also have to perform a directory traversal step, and that step tends to dominate the runtime of the overall run, and using larger inodes actually helps to speed up traversal of smaller directories. So whether larger inodes help or hurt the policy scan performance depends, yet again, on your file system composition. Overall, I believe that with all angles considered, larger inodes help with performance, and that was one of the considerations for making 4K inodes the default in V4.2+ versions. > 2) Let?s assume we have 3 filesystems, /home, /scratch (traditional > HPC use for those two) and /data (project space). Our storage > arrays are 24-bay units with two 8+2P RAID 6 LUNs, one RAID 1 > mirror, and two hot spare drives. The RAID 1 mirrors are for /home, > the RAID 6 LUNs are for /scratch or /data. /home has tons of small > files - so small that a 64K block size is currently used. /scratch > and /data have a mixture, but a 1 MB block size is the ?sweet spot? there. > > If you could ?start all over? 
with the same hardware being the only
> restriction, would you:
> a) merge /scratch and /data into one filesystem but keep /home
> separate since the LUN sizes are so very different, or
> b) merge all three into one filesystem and use storage pools?
> [...]
_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss
-------------- next part --------------
An HTML attachment was scrubbed...
URL: From christof.schmitt at us.ibm.com Tue Sep 27 22:36:37 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Tue, 27 Sep 2016 14:36:37 -0700 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> References: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> Message-ID: When a CES node fails, protocol clients have to reconnect to one of the remaining nodes. Samba in CES does not support persistent handles. This is indicated in the documentation: http://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_smbexportlimits.htm#bl1adm_smbexportlimits "Only mandatory SMB3 protocol features are supported. " Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Lukas Hejtmanek To: gpfsug-discuss at spectrumscale.org Date: 09/27/2016 12:38 PM Subject: [gpfsug-discuss] Samba via CES Sent by: gpfsug-discuss-bounces at spectrumscale.org Hello, does CES offer high availability of SMB? I.e., does used samba server provide cluster wide persistent handles? Or failover from node to node is currently not supported without the client disruption? -- Luk?? Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From xhejtman at ics.muni.cz Tue Sep 27 22:42:57 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Tue, 27 Sep 2016 23:42:57 +0200 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: References: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> Message-ID: <20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> On Tue, Sep 27, 2016 at 02:36:37PM -0700, Christof Schmitt wrote: > When a CES node fails, protocol clients have to reconnect to one of the > remaining nodes. > > Samba in CES does not support persistent handles. This is indicated in the > documentation: > > http://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_smbexportlimits.htm#bl1adm_smbexportlimits > > "Only mandatory SMB3 protocol features are supported. " well, but in this case, HA feature is a bit pointless as node fail results in a client failure as well as reconnect does not seem to be automatic if there is on going traffic.. more precisely reconnect is automatic but without persistent handles, the client receives write protect error immediately. -- Luk?? Hejtm?nek From Greg.Lehmann at csiro.au Wed Sep 28 08:40:35 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Wed, 28 Sep 2016 07:40:35 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <4BDC0E5A-176A-48CC-8DE6-93C7B5A3F138@vanderbilt.edu> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> <4BDC0E5A-176A-48CC-8DE6-93C7B5A3F138@vanderbilt.edu> Message-ID: <428599f3d6cb47ebb74d05178eeba2b8@exch1-cdc.nexus.csiro.au> I am wondering what people use to produce a file size distribution report for their filesystems. Has everyone rolled their own or is there some goto app to use. 
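For what it's worth, one roll-your-own route is a policy LIST rule plus a little awk; the sketch below is untested boilerplate (the file system name, file paths and the awk field number are placeholders to adapt, and the column the SHOW() text lands in can differ a little between releases):

cat > /tmp/listsizes.pol <<'EOF'
RULE EXTERNAL LIST 'sizes' EXEC ''
RULE 'allfiles' LIST 'sizes' SHOW(VARCHAR(FILE_SIZE))
EOF

mmapplypolicy gpfs0 -P /tmp/listsizes.pol -I defer -f /tmp/gpfs0
# With EXEC '' and -I defer the candidate list should be left behind as
# /tmp/gpfs0.list.sizes (prefix.list.<listname>), one line per file with the
# SHOW() text -- here FILE_SIZE -- in one of the leading columns.

awk '{ sz = $4; n++; tot += sz; if (sz <= 3584) small++ }    # adjust $4 to wherever SHOW() lands
     END { printf "%d files, %.1f TB, %.0f%% would fit in a 4K inode\n",
                  n, tot/1e12, 100*small/n }' /tmp/gpfs0.list.sizes

The policy scan can be spread across helper nodes with -N (plus a shared work directory via -g), so it finishes in a fraction of the time a serial find takes on a few hundred million files; the size column can also be fed into the octave recipe posted earlier in the Blocksize thread.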
Cheers, Greg From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: Wednesday, 28 September 2016 7:21 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Blocksize Hi All, Again, my thanks to all who responded to my last post. Let me begin by stating something I unintentionally omitted in my last post ? we already use SSDs for our metadata. Which leads me to yet another question ? of my three filesystems, two (/home and /scratch) are much older (created in 2010) and therefore currently have a 512 byte inode size. /data is newer and has a 4K inode size. Now if I combine /scratch and /data into one filesystem with a 4K inode size, the amount of space used by all the inodes coming over from /scratch is going to grow by a factor of eight unless I?m horribly confused. And I would assume I need to count the amount of space taken up by allocated inodes, not just used inodes. Therefore ? how much space my metadata takes up just grew significantly in importance since: 1) metadata is on very expensive enterprise class, vendor certified SSDs, 2) we use RAID 1 mirrors of those SSDs, and 3) we have metadata replication set to two. Some of the information presented by Sven and Yuri seems to contradict each other in regards to how much space inodes take up ? or I?m misunderstanding one or both of them! Leaving aside replication, if I use a 256K block size for my metadata and I use 4K inodes, are those inodes going to take up 4K each or are they going to take up 8K each (1/32nd of a 256K block)? By the way, I do have a file size / file age spreadsheet for each of my filesystems (which I would be willing to share with interested parties) and while I was not surprised to learn that I have over 10 million sub-1K files on /home, I was a bit surprised to find that I have almost as many sub-1K files on /scratch (and a few million more on /data). So there?s a huge potential win in having those files in the inode on SSD as opposed to on spinning disk, but there?s also a huge potential $$$ cost. Thanks again ? I hope others are gaining useful information from this thread. I sure am! Kevin On Sep 27, 2016, at 1:26 PM, Yuri L Volobuev > wrote: > 1) Let?s assume that our overarching goal in configuring the block > size for metadata is performance from the user perspective ? i.e. > how fast is an ?ls -l? on my directory? Space savings aren?t > important, and how long policy scans or other ?administrative? type > tasks take is not nearly as important as that directory listing. > Does that change the recommended metadata block size? The performance challenges for the "ls -l" scenario are quite different from the policy scan scenario, so the same rules do not necessarily apply. During "ls -l" the code has to read inodes one by one (there's some prefetching going on, to take the edge off for the actual 'ls' thread, but prefetching is still one inode at a time). Metadata block size doesn't really come into the picture in this case, but inode size can be important -- depending on the storage performance characteristics. Does the storage you use for metadata exhibit a meaningfully different latency for 4K random reads vs 512 byte random reads? In my personal experience, on any modern storage device the difference is non-existent; in fact many devices (like all flash-based storage) use 4K native physical block size, and merely emulate 512 byte "sectors", so there's no way to read less than 4K. 
So from the inode read latency point of view 4K vs 512B is most likely a wash, but then 4K inodes can help improve performance of other operations, e.g. readdir of a small directory which fits entirely into the inode. If you use xattrs (e.g. as a side effect of using HSM), 4K inodes definitely help, but allowing xattrs to be stored in the inode. Policy scans reads inodes in full blocks, and there both metadata block size and inode size matter. Larger blocks could improve the inode read performance, while larger inodes mean that the same number of blocks hold fewer inodes and thus more blocks need to be read. So the policy inode scan performance is benefited by larger metadata block size and smaller inodes. However, policy scans also have to perform a directory traversal step, and that step tends to dominate the runtime of the overall run, and using larger inodes actually helps to speed up traversal of smaller directories. So whether larger inodes help or hurt the policy scan performance depends, yet again, on your file system composition. Overall, I believe that with all angles considered, larger inodes help with performance, and that was one of the considerations for making 4K inodes the default in V4.2+ versions. > 2) Let?s assume we have 3 filesystems, /home, /scratch (traditional > HPC use for those two) and /data (project space). Our storage > arrays are 24-bay units with two 8+2P RAID 6 LUNs, one RAID 1 > mirror, and two hot spare drives. The RAID 1 mirrors are for /home, > the RAID 6 LUNs are for /scratch or /data. /home has tons of small > files - so small that a 64K block size is currently used. /scratch > and /data have a mixture, but a 1 MB block size is the ?sweet spot? there. > > If you could ?start all over? with the same hardware being the only > restriction, would you: > > a) merge /scratch and /data into one filesystem but keep /home > separate since the LUN sizes are so very different, or > b) merge all three into one filesystem and use storage pools so that > /home is just a separate pool within the one filesystem? And if you > chose this option would you assign different block sizes to the pools? It's not possible to have different block sizes for different data pools. We are very aware that many people would like to be able to do just that, but this is counter to where the product is going. Supporting different block sizes for different pools is actually pretty hard: it's tricky to describe a large file that has some blocks in poolA and some in poolB where poolB has a different block size (perhaps during a migration) with the existing inode/indirect block format where each disk address pointer addresses a block of fixed size. With some effort, and some changes to how block addressing works, it would be possible to implement the support for this. However, as I mentioned in another post in this thread, we don't really want to glorify manual block size selection any further, we want to move away from it, by addressing the reasons that drive different block size selection today (like disk space utilization and performance). I'd recommend calculating a file size distribution histogram for your file systems. You may, for example, discover that 80% of the small files you have in /home would fit into 4K inodes, and then the storage space efficiency gains for the remaining 20% don't justify the complexity of managing an extra file system with a small block size. 
We don't recommend using block sizes smaller than 256K, because smaller block size is not good for disk space allocation code efficiency. It's a quadratic dependency: with a smaller block size, one block worth of the block allocation map covers that much less disk space, because each bit in the map covers fewer disk sectors, and fewer bits fit into a block. This means having to create a lot more block allocation map segments than what is needed for an ample level of parallelism. This hurts performance of many block allocation-related operations. I don't see a reason for /scratch and /data to be separate file systems, aside from perhaps failure domain considerations. yuri > On Sep 26, 2016, at 2:29 PM, Yuri L Volobuev > wrote: > > I would put the net summary this way: in GPFS, the "Goldilocks zone" > for metadata block size is 256K - 1M. If one plans to create a new > file system using GPFS V4.2+, 1M is a sound choice. > > In an ideal world, block size choice shouldn't really be a choice. > It's a low-level implementation detail that one day should go the > way of the manual ignition timing adjustment -- something that used > to be necessary in the olden days, and something that select > enthusiasts like to tweak to this day, but something that's > irrelevant for the overwhelming majority of the folks who just want > the engine to run. There's work being done in that general direction > in GPFS, but we aren't there yet. > > yuri > > Stephen Ulmer ---09/26/2016 12:02:25 PM---Now I?ve got > anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: > > From: Stephen Ulmer > > To: gpfsug main discussion list >, > Date: 09/26/2016 12:02 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Now I?ve got anther question? which I?ll let bake for a while. > > Okay, to (poorly) summarize: > There are items OTHER THAN INODES stored as metadata in GPFS. > These items have a VARIETY OF SIZES, but are packed in such a way > that we should just not worry about wasted space unless we pick a > LARGE metadata block size ? or if we don?t pick a ?reasonable? > metadata block size after picking a ?large? file system block size > that applies to both. > Performance is hard, and the gain from calculating exactly the best > metadata block size is much smaller than performance gains attained > through code optimization. > If we were to try and calculate the appropriate metadata block size > we would likely be wrong anyway, since none of us get our data at > the idealized physics shop that sells massless rulers and > frictionless pulleys. > We should probably all use a metadata block size around 1MB. Nobody > has said this outright, but it?s been the example as the ?good? size > at least three times in this thread. > Under no circumstances should we do what many of us would have done > and pick 128K, which made sense based on all of our previous > education that is no longer applicable. > > Did I miss anything? :) > > Liberty, > > -- > Stephen > > On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev > wrote: > It's important to understand the differences between different > metadata types, in particular where it comes to space allocation. 
> > System metadata files (inode file, inode and block allocation maps, > ACL file, fileset metadata file, EA file in older versions) are > allocated at well-defined moments (file system format, new storage > pool creation in the case of block allocation map, etc), and those > contain multiple records packed into a single block. From the block > allocator point of view, the individual metadata record size is > invisible, only larger blocks get actually allocated, and space > usage efficiency generally isn't an issue. > > For user metadata (indirect blocks, directory blocks, EA overflow > blocks) the situation is different. Those get allocated as the need > arises, generally one at a time. So the size of an individual > metadata structure matters, a lot. The smallest unit of allocation > in GPFS is a subblock (1/32nd of a block). If an IB or a directory > block is smaller than a subblock, the unused space in the subblock > is wasted. So if one chooses to use, say, 16 MiB block size for > metadata, the smallest unit of space that can be allocated is 512 > KiB. If one chooses 1 MiB block size, the smallest allocation unit > is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with > any reasonable data block size); directory blocks used to be limited > to 32 KiB, but in the current code can be as large as 256 KiB. As > one can observe, using 16 MiB metadata block size would lead to a > considerable amount of wasted space for IBs and large directories > (small directories can live in inodes). On the other hand, with 1 > MiB block size, there'll be no wasted metadata space. Does any of > this actually make a practical difference? That depends on the file > system composition, namely the number of IBs (which is a function of > the number of large files) and larger directories. Calculating this > scientifically can be pretty involved, and really should be the job > of a tool that ought to exist, but doesn't (yet). A more practical > approach is doing a ballpark estimate using local file counts and > typical fractions of large files and directories, using statistics > available from published papers. > > The performance implications of a given metadata block size choice > is a subject of nearly infinite depth, and this question ultimately > can only be answered by doing experiments with a specific workload > on specific hardware. The metadata space utilization efficiency is > something that can be answered conclusively though. > > yuri > > "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi > Sven, I am confused by your statement that the metadata block size > should be 1 MB and am very int > > From: "Buterbaugh, Kevin L" > > To: gpfsug main discussion list >, > Date: 09/24/2016 07:19 AM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Hi Sven, > > I am confused by your statement that the metadata block size should > be 1 MB and am very interested in learning the rationale behind this > as I am currently looking at all aspects of our current GPFS > configuration and the possibility of making major changes. 
> > If you have a filesystem with only metadataOnly disks in the system > pool and the default size of an inode is 4K (which we would do, > since we have recently discovered that even on our scratch > filesystem we have a bazillion files that are 4K or smaller and > could therefore have their data stored in the inode, right?), then > why would you set the metadata block size to anything larger than > 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block > size for metadata wouldn?t you be wasting a massive amount of space? > > What am I missing / confused about there? > > Oh, and here?s a related question ? let?s just say I have the above > configuration ? my system pool is metadata only and is on SSD?s. > Then I have two other dataOnly pools that are spinning disk. One is > for ?regular? access and the other is the ?capacity? pool ? i.e. a > pool of slower storage where we move files with large access times. > I have a policy that says something like ?move all files with an > access time > 6 months to the capacity pool.? Of those bazillion > files less than 4K in size that are fitting in the inode currently, > probably half a bazillion () of them would be subject to that > rule. Will they get moved to the spinning disk capacity pool or will > they stay in the inode?? > > Thanks! This is a very timely and interesting discussion for me as well... > > Kevin > On Sep 23, 2016, at 4:35 PM, Sven Oehme > wrote: > your metadata block size these days should be 1 MB and there are > only very few workloads for which you should run with a filesystem > blocksize below 1 MB. so if you don't know exactly what to pick, 1 > MB is a good starting point. > the general rule still applies that your filesystem blocksize > (metadata or data pool) should match your raid controller (or GNR > vdisk) stripe size of the particular pool. > > so if you use a 128k strip size(defaut in many midrange storage > controllers) in a 8+2p raid array, your stripe or track size is 1 MB > and therefore the blocksize of this pool should be 1 MB. i see many > customers in the field using 1MB or even smaller blocksize on RAID > stripes of 2 MB or above and your performance will be significant > impacted by that. > > Sven > > ------------------------------------------ > Sven Oehme > Scalable Storage Research > email: oehmes at us.ibm.com > Phone: +1 (408) 824-8904 > IBM Almaden Research Lab > ------------------------------------------ > > Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too > pedantic, but I believe the the subblock size is 1/32 of the block > size (which strengt > > From: Stephen Ulmer > > To: gpfsug main discussion list > > Date: 09/23/2016 12:16 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Not to be too pedantic, but I believe the the subblock size is 1/32 > of the block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but > about metadata block size. You can specify that the system pool have > a different block size from the rest of the filesystem, providing > that it ONLY holds metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems > without some counter-indication), I would think that we?d want to > use a metadata block size of 4K*32=128K. This is independent of the > regular block size, which you can calculate based on the workload if > you?re lucky. 
> > There could be a great reason NOT to use 128K metadata block size, > but I don?t know what it is. I?d be happy to be corrected about this > if it?s out of whack. > > -- > Stephen > On Sep 22, 2016, at 3:37 PM, Luis Bolinches > wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small > files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest > that you will actually benefit from smaller than 1MB block, leave > there. GPFS uses sublocks where 1/16th of the BS can be allocated to > different files, so the "waste" is much less than you think on 1MB > and you get the throughput and less structures of much more data blocks. > > No warranty at all but I try to do this when the BS talk comes in: > (might need some clean up it could not be last note but you get the idea) > > POSIX > find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to > inodes (WELL not exactly ... extended attributes! so maybe use a > lower number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and > lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful > machine to run the calculations on octave, I never hit anything > could not manage on a 64GB RAM Power box. Most of the times it is > enough with my laptop. > > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene > > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very small > > files in the inode. > > > > Typically you want your average file size to be your blocksize and your > > filesystem has one blocksize and one inodesize. > > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. 
> > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From alandhae at gmx.de Wed Sep 28 10:13:55 2016 From: alandhae at gmx.de (=?ISO-8859-15?Q?Andreas_Landh=E4u=DFer?=) Date: Wed, 28 Sep 2016 11:13:55 +0200 (CEST) Subject: [gpfsug-discuss] Proposal for dynamic heat assisted file tiering Message-ID: On Tue, 27 Sep 2016, Eric Horst wrote: Thanks Eric for the hint, shouldn't we as the users define a requirement for such a dynamic heat assisted file tiering option (DHAFTO). Keeping track which files have increased heat and triggering a transparent move to a faster tier. Since I haven't tested it on a GPFS FS, I would like to know about the performance penalties being observed, when frequently running the policies, just a rough estimate. Of course its depending on the speed of the Metadata disks (yes, we use different devices for Metadata) we are also running GPFS on various GSS Systems. IBM might also want bundling this option together with GSS/ESS hardware for better performance. Just my 2? Andreas >>> >>> if a file gets hot again, there is no rule for putting the file back >>> into a faster storage device? >> >> >> The file will get moved when you run the policy again. You can run the >> policy as often as you like. > > I think its worth stating clearly that if a file is in the Thrifty > slow pool and a user opens and reads/writes the file there is nothing > that moves this file to a different tier. A policy run is the only > action that relocates files. 
So if you apply the policy daily and over > the course of the day users access many cold files, the performance > accessing those cold files may not be ideal until the next day when > they are repacked by heat. A file is not automatically moved to the > fast tier on access read or write. I mention this because this aspect > of tiering was not immediately clear from the docs when I was a > neophyte GPFS admin and I had to learn by observation. It is easy for > one to make an assumption that it is a more dynamic tiering system > than it is. -- Andreas Landh?u?er +49 151 12133027 (mobile) alandhae at gmx.de From Robert.Oesterlin at nuance.com Wed Sep 28 11:56:51 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Wed, 28 Sep 2016 10:56:51 +0000 Subject: [gpfsug-discuss] Blocksize - file size distribution Message-ID: /usr/lpp/mmfs/samples/debugtools/filehist Look at the README in that directory. Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: on behalf of "Greg.Lehmann at csiro.au" Reply-To: gpfsug main discussion list Date: Wednesday, September 28, 2016 at 2:40 AM To: "gpfsug-discuss at spectrumscale.org" Subject: [EXTERNAL] Re: [gpfsug-discuss] Blocksize I am wondering what people use to produce a file size distribution report for their filesystems. Has everyone rolled their own or is there some goto app to use. Cheers, Greg From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: Wednesday, 28 September 2016 7:21 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Blocksize Hi All, Again, my thanks to all who responded to my last post. Let me begin by stating something I unintentionally omitted in my last post ? we already use SSDs for our metadata. Which leads me to yet another question ? of my three filesystems, two (/home and /scratch) are much older (created in 2010) and therefore currently have a 512 byte inode size. /data is newer and has a 4K inode size. Now if I combine /scratch and /data into one filesystem with a 4K inode size, the amount of space used by all the inodes coming over from /scratch is going to grow by a factor of eight unless I?m horribly confused. And I would assume I need to count the amount of space taken up by allocated inodes, not just used inodes. Therefore ? how much space my metadata takes up just grew significantly in importance since: 1) metadata is on very expensive enterprise class, vendor certified SSDs, 2) we use RAID 1 mirrors of those SSDs, and 3) we have metadata replication set to two. Some of the information presented by Sven and Yuri seems to contradict each other in regards to how much space inodes take up ? or I?m misunderstanding one or both of them! Leaving aside replication, if I use a 256K block size for my metadata and I use 4K inodes, are those inodes going to take up 4K each or are they going to take up 8K each (1/32nd of a 256K block)? By the way, I do have a file size / file age spreadsheet for each of my filesystems (which I would be willing to share with interested parties) and while I was not surprised to learn that I have over 10 million sub-1K files on /home, I was a bit surprised to find that I have almost as many sub-1K files on /scratch (and a few million more on /data). So there?s a huge potential win in having those files in the inode on SSD as opposed to on spinning disk, but there?s also a huge potential $$$ cost. Thanks again ? I hope others are gaining useful information from this thread. I sure am! 
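For a rough feel for those numbers, here is a back-of-the-envelope sketch (the figures below are made-up placeholders and the flags in the comments are from memory -- substitute whatever your cluster actually reports; this also only covers the inode file itself, not indirect blocks, directories or the rest of the metadata):

  # hypothetical values -- replace with your own
  ALLOC_INODES=50000000      # allocated (not just used) inodes, e.g. from mmdf <fs> -F
  INODE_SIZE=4096            # bytes per inode, e.g. from mmlsfs <fs> -i
  META_REPLICAS=2            # metadata replication factor, e.g. from mmlsfs <fs> -m
  echo "inode file: $(( ALLOC_INODES * INODE_SIZE * META_REPLICAS / 1024 / 1024 / 1024 )) GiB"

With those invented numbers the inode file alone comes to roughly 381 GiB of metadata space before the RAID-1 mirroring underneath, which is why counting allocated rather than used inodes matters when pricing out the metadata disks.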
Kevin On Sep 27, 2016, at 1:26 PM, Yuri L Volobuev > wrote: > 1) Let?s assume that our overarching goal in configuring the block > size for metadata is performance from the user perspective ? i.e. > how fast is an ?ls -l? on my directory? Space savings aren?t > important, and how long policy scans or other ?administrative? type > tasks take is not nearly as important as that directory listing. > Does that change the recommended metadata block size? The performance challenges for the "ls -l" scenario are quite different from the policy scan scenario, so the same rules do not necessarily apply. During "ls -l" the code has to read inodes one by one (there's some prefetching going on, to take the edge off for the actual 'ls' thread, but prefetching is still one inode at a time). Metadata block size doesn't really come into the picture in this case, but inode size can be important -- depending on the storage performance characteristics. Does the storage you use for metadata exhibit a meaningfully different latency for 4K random reads vs 512 byte random reads? In my personal experience, on any modern storage device the difference is non-existent; in fact many devices (like all flash-based storage) use 4K native physical block size, and merely emulate 512 byte "sectors", so there's no way to read less than 4K. So from the inode read latency point of view 4K vs 512B is most likely a wash, but then 4K inodes can help improve performance of other operations, e.g. readdir of a small directory which fits entirely into the inode. If you use xattrs (e.g. as a side effect of using HSM), 4K inodes definitely help, but allowing xattrs to be stored in the inode. Policy scans reads inodes in full blocks, and there both metadata block size and inode size matter. Larger blocks could improve the inode read performance, while larger inodes mean that the same number of blocks hold fewer inodes and thus more blocks need to be read. So the policy inode scan performance is benefited by larger metadata block size and smaller inodes. However, policy scans also have to perform a directory traversal step, and that step tends to dominate the runtime of the overall run, and using larger inodes actually helps to speed up traversal of smaller directories. So whether larger inodes help or hurt the policy scan performance depends, yet again, on your file system composition. Overall, I believe that with all angles considered, larger inodes help with performance, and that was one of the considerations for making 4K inodes the default in V4.2+ versions. > 2) Let?s assume we have 3 filesystems, /home, /scratch (traditional > HPC use for those two) and /data (project space). Our storage > arrays are 24-bay units with two 8+2P RAID 6 LUNs, one RAID 1 > mirror, and two hot spare drives. The RAID 1 mirrors are for /home, > the RAID 6 LUNs are for /scratch or /data. /home has tons of small > files - so small that a 64K block size is currently used. /scratch > and /data have a mixture, but a 1 MB block size is the ?sweet spot? there. > > If you could ?start all over? with the same hardware being the only > restriction, would you: > > a) merge /scratch and /data into one filesystem but keep /home > separate since the LUN sizes are so very different, or > b) merge all three into one filesystem and use storage pools so that > /home is just a separate pool within the one filesystem? And if you > chose this option would you assign different block sizes to the pools? It's not possible to have different block sizes for different data pools. 
We are very aware that many people would like to be able to do just that, but this is counter to where the product is going. Supporting different block sizes for different pools is actually pretty hard: it's tricky to describe a large file that has some blocks in poolA and some in poolB where poolB has a different block size (perhaps during a migration) with the existing inode/indirect block format where each disk address pointer addresses a block of fixed size. With some effort, and some changes to how block addressing works, it would be possible to implement the support for this. However, as I mentioned in another post in this thread, we don't really want to glorify manual block size selection any further, we want to move away from it, by addressing the reasons that drive different block size selection today (like disk space utilization and performance). I'd recommend calculating a file size distribution histogram for your file systems. You may, for example, discover that 80% of the small files you have in /home would fit into 4K inodes, and then the storage space efficiency gains for the remaining 20% don't justify the complexity of managing an extra file system with a small block size. We don't recommend using block sizes smaller than 256K, because smaller block size is not good for disk space allocation code efficiency. It's a quadratic dependency: with a smaller block size, one block worth of the block allocation map covers that much less disk space, because each bit in the map covers fewer disk sectors, and fewer bits fit into a block. This means having to create a lot more block allocation map segments than what is needed for an ample level of parallelism. This hurts performance of many block allocation-related operations. I don't see a reason for /scratch and /data to be separate file systems, aside from perhaps failure domain considerations. yuri > On Sep 26, 2016, at 2:29 PM, Yuri L Volobuev > wrote: > > I would put the net summary this way: in GPFS, the "Goldilocks zone" > for metadata block size is 256K - 1M. If one plans to create a new > file system using GPFS V4.2+, 1M is a sound choice. > > In an ideal world, block size choice shouldn't really be a choice. > It's a low-level implementation detail that one day should go the > way of the manual ignition timing adjustment -- something that used > to be necessary in the olden days, and something that select > enthusiasts like to tweak to this day, but something that's > irrelevant for the overwhelming majority of the folks who just want > the engine to run. There's work being done in that general direction > in GPFS, but we aren't there yet. > > yuri > > Stephen Ulmer ---09/26/2016 12:02:25 PM---Now I?ve got > anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: > > From: Stephen Ulmer > > To: gpfsug main discussion list >, > Date: 09/26/2016 12:02 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Now I?ve got anther question? which I?ll let bake for a while. > > Okay, to (poorly) summarize: > There are items OTHER THAN INODES stored as metadata in GPFS. > These items have a VARIETY OF SIZES, but are packed in such a way > that we should just not worry about wasted space unless we pick a > LARGE metadata block size ? or if we don?t pick a ?reasonable? > metadata block size after picking a ?large? file system block size > that applies to both. 
> Performance is hard, and the gain from calculating exactly the best > metadata block size is much smaller than performance gains attained > through code optimization. > If we were to try and calculate the appropriate metadata block size > we would likely be wrong anyway, since none of us get our data at > the idealized physics shop that sells massless rulers and > frictionless pulleys. > We should probably all use a metadata block size around 1MB. Nobody > has said this outright, but it?s been the example as the ?good? size > at least three times in this thread. > Under no circumstances should we do what many of us would have done > and pick 128K, which made sense based on all of our previous > education that is no longer applicable. > > Did I miss anything? :) > > Liberty, > > -- > Stephen > > On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev > wrote: > It's important to understand the differences between different > metadata types, in particular where it comes to space allocation. > > System metadata files (inode file, inode and block allocation maps, > ACL file, fileset metadata file, EA file in older versions) are > allocated at well-defined moments (file system format, new storage > pool creation in the case of block allocation map, etc), and those > contain multiple records packed into a single block. From the block > allocator point of view, the individual metadata record size is > invisible, only larger blocks get actually allocated, and space > usage efficiency generally isn't an issue. > > For user metadata (indirect blocks, directory blocks, EA overflow > blocks) the situation is different. Those get allocated as the need > arises, generally one at a time. So the size of an individual > metadata structure matters, a lot. The smallest unit of allocation > in GPFS is a subblock (1/32nd of a block). If an IB or a directory > block is smaller than a subblock, the unused space in the subblock > is wasted. So if one chooses to use, say, 16 MiB block size for > metadata, the smallest unit of space that can be allocated is 512 > KiB. If one chooses 1 MiB block size, the smallest allocation unit > is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with > any reasonable data block size); directory blocks used to be limited > to 32 KiB, but in the current code can be as large as 256 KiB. As > one can observe, using 16 MiB metadata block size would lead to a > considerable amount of wasted space for IBs and large directories > (small directories can live in inodes). On the other hand, with 1 > MiB block size, there'll be no wasted metadata space. Does any of > this actually make a practical difference? That depends on the file > system composition, namely the number of IBs (which is a function of > the number of large files) and larger directories. Calculating this > scientifically can be pretty involved, and really should be the job > of a tool that ought to exist, but doesn't (yet). A more practical > approach is doing a ballpark estimate using local file counts and > typical fractions of large files and directories, using statistics > available from published papers. > > The performance implications of a given metadata block size choice > is a subject of nearly infinite depth, and this question ultimately > can only be answered by doing experiments with a specific workload > on specific hardware. The metadata space utilization efficiency is > something that can be answered conclusively though. 
> > yuri > > "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi > Sven, I am confused by your statement that the metadata block size > should be 1 MB and am very int > > From: "Buterbaugh, Kevin L" > > To: gpfsug main discussion list >, > Date: 09/24/2016 07:19 AM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Hi Sven, > > I am confused by your statement that the metadata block size should > be 1 MB and am very interested in learning the rationale behind this > as I am currently looking at all aspects of our current GPFS > configuration and the possibility of making major changes. > > If you have a filesystem with only metadataOnly disks in the system > pool and the default size of an inode is 4K (which we would do, > since we have recently discovered that even on our scratch > filesystem we have a bazillion files that are 4K or smaller and > could therefore have their data stored in the inode, right?), then > why would you set the metadata block size to anything larger than > 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block > size for metadata wouldn?t you be wasting a massive amount of space? > > What am I missing / confused about there? > > Oh, and here?s a related question ? let?s just say I have the above > configuration ? my system pool is metadata only and is on SSD?s. > Then I have two other dataOnly pools that are spinning disk. One is > for ?regular? access and the other is the ?capacity? pool ? i.e. a > pool of slower storage where we move files with large access times. > I have a policy that says something like ?move all files with an > access time > 6 months to the capacity pool.? Of those bazillion > files less than 4K in size that are fitting in the inode currently, > probably half a bazillion () of them would be subject to that > rule. Will they get moved to the spinning disk capacity pool or will > they stay in the inode?? > > Thanks! This is a very timely and interesting discussion for me as well... > > Kevin > On Sep 23, 2016, at 4:35 PM, Sven Oehme > wrote: > your metadata block size these days should be 1 MB and there are > only very few workloads for which you should run with a filesystem > blocksize below 1 MB. so if you don't know exactly what to pick, 1 > MB is a good starting point. > the general rule still applies that your filesystem blocksize > (metadata or data pool) should match your raid controller (or GNR > vdisk) stripe size of the particular pool. > > so if you use a 128k strip size(defaut in many midrange storage > controllers) in a 8+2p raid array, your stripe or track size is 1 MB > and therefore the blocksize of this pool should be 1 MB. i see many > customers in the field using 1MB or even smaller blocksize on RAID > stripes of 2 MB or above and your performance will be significant > impacted by that. 
> > Sven > > ------------------------------------------ > Sven Oehme > Scalable Storage Research > email: oehmes at us.ibm.com > Phone: +1 (408) 824-8904 > IBM Almaden Research Lab > ------------------------------------------ > > Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too > pedantic, but I believe the the subblock size is 1/32 of the block > size (which strengt > > From: Stephen Ulmer > > To: gpfsug main discussion list > > Date: 09/23/2016 12:16 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Not to be too pedantic, but I believe the the subblock size is 1/32 > of the block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but > about metadata block size. You can specify that the system pool have > a different block size from the rest of the filesystem, providing > that it ONLY holds metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems > without some counter-indication), I would think that we?d want to > use a metadata block size of 4K*32=128K. This is independent of the > regular block size, which you can calculate based on the workload if > you?re lucky. > > There could be a great reason NOT to use 128K metadata block size, > but I don?t know what it is. I?d be happy to be corrected about this > if it?s out of whack. > > -- > Stephen > On Sep 22, 2016, at 3:37 PM, Luis Bolinches > wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small > files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest > that you will actually benefit from smaller than 1MB block, leave > there. GPFS uses sublocks where 1/16th of the BS can be allocated to > different files, so the "waste" is much less than you think on 1MB > and you get the throughput and less structures of much more data blocks. > > No warranty at all but I try to do this when the BS talk comes in: > (might need some clean up it could not be last note but you get the idea) > > POSIX > find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to > inodes (WELL not exactly ... extended attributes! so maybe use a > lower number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and > lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful > machine to run the calculations on octave, I never hit anything > could not manage on a 64GB RAM Power box. Most of the times it is > enough with my laptop. 
> > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene > > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very small > > files in the inode. > > > > Typically you want your average file size to be your blocksize and your > > filesystem has one blocksize and one inodesize. > > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. > > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From Kevin.Buterbaugh at Vanderbilt.Edu Wed Sep 28 14:45:14 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 28 Sep 2016 13:45:14 +0000 Subject: [gpfsug-discuss] Blocksize - file size distribution In-Reply-To: References: Message-ID: Greg, Not saying this is the right way to go, but I rolled my own. I wrote a very simple Perl script that essentially does the Perl equivalent of a find on my GPFS filesystems, then stat?s files and directories and writes the output to a text file. I run that one overnight or on the weekends. Takes about 6 hours to run across our 3 GPFS filesystems with metadata on SSDs. Then I?ve written a couple of different Perl scripts to analyze that data in different ways (the idea being that collecting the data is ?expensive? ? but once you?ve got it it?s cheap to analyze it in different ways). But the one I?ve been using for this project just breaks down the number of files and directories by size and age and produces a table. Rather than try to describe this, here?s sample output: For input file: gpfsFileInfo_20160915.txt <1 day | <1 wk | <1 mo | <2 mo | <3 mo | <4 mo | <5 mo | <6 mo | <1 yr | >1 year | Total Files <1 KB 29538 111364 458260 634398 150199 305715 4388443 93733 966618 3499535 10637803 <2 KB 9875 20580 119414 295167 35961 67761 80462 33688 269595 851641 1784144 <4 KB 9212 45282 168678 496796 27771 23887 105135 23161 259242 1163327 2322491 <8 KB 4992 29284 105836 303349 28341 20346 246430 28394 216061 1148459 2131492 <16 KB 3987 18391 92492 218639 20513 19698 675097 30976 190691 851533 2122017 <32 KB 4844 12479 50235 265222 24830 18789 1058433 18030 196729 1066287 2715878 <64 KB 6358 24259 29474 222134 17493 10744 1381445 11358 240528 1123540 3067333 <128 KB 6531 59107 206269 186213 71823 114235 1008724 36722 186357 845921 2721902 <256 KB 1995 17638 19355 436611 8505 7554 3582738 7519 249510 744885 5076310 <512 KB 20645 12401 24700 111463 5659 22132 1121269 10774 273010 725155 2327208 <1 MB 2681 6482 37447 58459 6998 14945 305108 5857 160360 386152 984489 <4 MB 4554 84551 23320 100407 6818 32833 129758 22774 210935 528458 1144408 <1 GB 56652 33538 99667 87778 24313 68372 118928 42554 251528 916493 1699823 <10 GB 1245 2482 4524 3184 1043 1794 2733 1694 8731 20462 47892 <100 GB 47 230 470 265 92 198 172 122 1276 2061 4933 >100 GB 2 3 12 1 14 4 5 1 37 165 244 Total TB: 6.49 13.22 30.56 18.00 10.22 15.69 19.87 12.48 73.47 187.44 Grand Total: 387.46 TB Everything other than the total space lines at the bottom are counts of number of files meeting that criteria. I?ve got another variation on the same script that we used when we were trying to determine how many old files we have and therefore how much data was an excellent candidate for moving to a slower, less expensive ?capacity? pool. I?m not sure how useful my tools would be to others ? I?m certainly not a professional programmer by any stretch of the imagination (and yes, I do hear those of you who are saying, ?Yeah, he?s barely a professional SysAdmin!? ). But others of you have been so helpful to me ? I?d like to try in some small way to help someone else. Kevin On Sep 28, 2016, at 5:56 AM, Oesterlin, Robert > wrote: /usr/lpp/mmfs/samples/debugtools/filehist Look at the README in that directory. 
Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Greg.Lehmann at csiro.au" > Reply-To: gpfsug main discussion list > Date: Wednesday, September 28, 2016 at 2:40 AM To: "gpfsug-discuss at spectrumscale.org" > Subject: [EXTERNAL] Re: [gpfsug-discuss] Blocksize I am wondering what people use to produce a file size distribution report for their filesystems. Has everyone rolled their own or is there some goto app to use. Cheers, Greg From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: Wednesday, 28 September 2016 7:21 AM To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Blocksize Hi All, Again, my thanks to all who responded to my last post. Let me begin by stating something I unintentionally omitted in my last post ? we already use SSDs for our metadata. Which leads me to yet another question ? of my three filesystems, two (/home and /scratch) are much older (created in 2010) and therefore currently have a 512 byte inode size. /data is newer and has a 4K inode size. Now if I combine /scratch and /data into one filesystem with a 4K inode size, the amount of space used by all the inodes coming over from /scratch is going to grow by a factor of eight unless I?m horribly confused. And I would assume I need to count the amount of space taken up by allocated inodes, not just used inodes. Therefore ? how much space my metadata takes up just grew significantly in importance since: 1) metadata is on very expensive enterprise class, vendor certified SSDs, 2) we use RAID 1 mirrors of those SSDs, and 3) we have metadata replication set to two. Some of the information presented by Sven and Yuri seems to contradict each other in regards to how much space inodes take up ? or I?m misunderstanding one or both of them! Leaving aside replication, if I use a 256K block size for my metadata and I use 4K inodes, are those inodes going to take up 4K each or are they going to take up 8K each (1/32nd of a 256K block)? By the way, I do have a file size / file age spreadsheet for each of my filesystems (which I would be willing to share with interested parties) and while I was not surprised to learn that I have over 10 million sub-1K files on /home, I was a bit surprised to find that I have almost as many sub-1K files on /scratch (and a few million more on /data). So there?s a huge potential win in having those files in the inode on SSD as opposed to on spinning disk, but there?s also a huge potential $$$ cost. Thanks again ? I hope others are gaining useful information from this thread. I sure am! Kevin On Sep 27, 2016, at 1:26 PM, Yuri L Volobuev > wrote: > 1) Let?s assume that our overarching goal in configuring the block > size for metadata is performance from the user perspective ? i.e. > how fast is an ?ls -l? on my directory? Space savings aren?t > important, and how long policy scans or other ?administrative? type > tasks take is not nearly as important as that directory listing. > Does that change the recommended metadata block size? The performance challenges for the "ls -l" scenario are quite different from the policy scan scenario, so the same rules do not necessarily apply. During "ls -l" the code has to read inodes one by one (there's some prefetching going on, to take the edge off for the actual 'ls' thread, but prefetching is still one inode at a time). 
Metadata block size doesn't really come into the picture in this case, but inode size can be important -- depending on the storage performance characteristics. Does the storage you use for metadata exhibit a meaningfully different latency for 4K random reads vs 512 byte random reads? In my personal experience, on any modern storage device the difference is non-existent; in fact many devices (like all flash-based storage) use 4K native physical block size, and merely emulate 512 byte "sectors", so there's no way to read less than 4K. So from the inode read latency point of view 4K vs 512B is most likely a wash, but then 4K inodes can help improve performance of other operations, e.g. readdir of a small directory which fits entirely into the inode. If you use xattrs (e.g. as a side effect of using HSM), 4K inodes definitely help, but allowing xattrs to be stored in the inode. Policy scans reads inodes in full blocks, and there both metadata block size and inode size matter. Larger blocks could improve the inode read performance, while larger inodes mean that the same number of blocks hold fewer inodes and thus more blocks need to be read. So the policy inode scan performance is benefited by larger metadata block size and smaller inodes. However, policy scans also have to perform a directory traversal step, and that step tends to dominate the runtime of the overall run, and using larger inodes actually helps to speed up traversal of smaller directories. So whether larger inodes help or hurt the policy scan performance depends, yet again, on your file system composition. Overall, I believe that with all angles considered, larger inodes help with performance, and that was one of the considerations for making 4K inodes the default in V4.2+ versions. > 2) Let?s assume we have 3 filesystems, /home, /scratch (traditional > HPC use for those two) and /data (project space). Our storage > arrays are 24-bay units with two 8+2P RAID 6 LUNs, one RAID 1 > mirror, and two hot spare drives. The RAID 1 mirrors are for /home, > the RAID 6 LUNs are for /scratch or /data. /home has tons of small > files - so small that a 64K block size is currently used. /scratch > and /data have a mixture, but a 1 MB block size is the ?sweet spot? there. > > If you could ?start all over? with the same hardware being the only > restriction, would you: > > a) merge /scratch and /data into one filesystem but keep /home > separate since the LUN sizes are so very different, or > b) merge all three into one filesystem and use storage pools so that > /home is just a separate pool within the one filesystem? And if you > chose this option would you assign different block sizes to the pools? It's not possible to have different block sizes for different data pools. We are very aware that many people would like to be able to do just that, but this is counter to where the product is going. Supporting different block sizes for different pools is actually pretty hard: it's tricky to describe a large file that has some blocks in poolA and some in poolB where poolB has a different block size (perhaps during a migration) with the existing inode/indirect block format where each disk address pointer addresses a block of fixed size. With some effort, and some changes to how block addressing works, it would be possible to implement the support for this. 
However, as I mentioned in another post in this thread, we don't really want to glorify manual block size selection any further, we want to move away from it, by addressing the reasons that drive different block size selection today (like disk space utilization and performance). I'd recommend calculating a file size distribution histogram for your file systems. You may, for example, discover that 80% of the small files you have in /home would fit into 4K inodes, and then the storage space efficiency gains for the remaining 20% don't justify the complexity of managing an extra file system with a small block size. We don't recommend using block sizes smaller than 256K, because smaller block size is not good for disk space allocation code efficiency. It's a quadratic dependency: with a smaller block size, one block worth of the block allocation map covers that much less disk space, because each bit in the map covers fewer disk sectors, and fewer bits fit into a block. This means having to create a lot more block allocation map segments than what is needed for an ample level of parallelism. This hurts performance of many block allocation-related operations. I don't see a reason for /scratch and /data to be separate file systems, aside from perhaps failure domain considerations. yuri > On Sep 26, 2016, at 2:29 PM, Yuri L Volobuev > wrote: > > I would put the net summary this way: in GPFS, the "Goldilocks zone" > for metadata block size is 256K - 1M. If one plans to create a new > file system using GPFS V4.2+, 1M is a sound choice. > > In an ideal world, block size choice shouldn't really be a choice. > It's a low-level implementation detail that one day should go the > way of the manual ignition timing adjustment -- something that used > to be necessary in the olden days, and something that select > enthusiasts like to tweak to this day, but something that's > irrelevant for the overwhelming majority of the folks who just want > the engine to run. There's work being done in that general direction > in GPFS, but we aren't there yet. > > yuri > > Stephen Ulmer ---09/26/2016 12:02:25 PM---Now I?ve got > anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: > > From: Stephen Ulmer > > To: gpfsug main discussion list >, > Date: 09/26/2016 12:02 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Now I?ve got anther question? which I?ll let bake for a while. > > Okay, to (poorly) summarize: > There are items OTHER THAN INODES stored as metadata in GPFS. > These items have a VARIETY OF SIZES, but are packed in such a way > that we should just not worry about wasted space unless we pick a > LARGE metadata block size ? or if we don?t pick a ?reasonable? > metadata block size after picking a ?large? file system block size > that applies to both. > Performance is hard, and the gain from calculating exactly the best > metadata block size is much smaller than performance gains attained > through code optimization. > If we were to try and calculate the appropriate metadata block size > we would likely be wrong anyway, since none of us get our data at > the idealized physics shop that sells massless rulers and > frictionless pulleys. > We should probably all use a metadata block size around 1MB. Nobody > has said this outright, but it?s been the example as the ?good? size > at least three times in this thread. 
> Under no circumstances should we do what many of us would have done > and pick 128K, which made sense based on all of our previous > education that is no longer applicable. > > Did I miss anything? :) > > Liberty, > > -- > Stephen > > On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev > wrote: > It's important to understand the differences between different > metadata types, in particular where it comes to space allocation. > > System metadata files (inode file, inode and block allocation maps, > ACL file, fileset metadata file, EA file in older versions) are > allocated at well-defined moments (file system format, new storage > pool creation in the case of block allocation map, etc), and those > contain multiple records packed into a single block. From the block > allocator point of view, the individual metadata record size is > invisible, only larger blocks get actually allocated, and space > usage efficiency generally isn't an issue. > > For user metadata (indirect blocks, directory blocks, EA overflow > blocks) the situation is different. Those get allocated as the need > arises, generally one at a time. So the size of an individual > metadata structure matters, a lot. The smallest unit of allocation > in GPFS is a subblock (1/32nd of a block). If an IB or a directory > block is smaller than a subblock, the unused space in the subblock > is wasted. So if one chooses to use, say, 16 MiB block size for > metadata, the smallest unit of space that can be allocated is 512 > KiB. If one chooses 1 MiB block size, the smallest allocation unit > is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with > any reasonable data block size); directory blocks used to be limited > to 32 KiB, but in the current code can be as large as 256 KiB. As > one can observe, using 16 MiB metadata block size would lead to a > considerable amount of wasted space for IBs and large directories > (small directories can live in inodes). On the other hand, with 1 > MiB block size, there'll be no wasted metadata space. Does any of > this actually make a practical difference? That depends on the file > system composition, namely the number of IBs (which is a function of > the number of large files) and larger directories. Calculating this > scientifically can be pretty involved, and really should be the job > of a tool that ought to exist, but doesn't (yet). A more practical > approach is doing a ballpark estimate using local file counts and > typical fractions of large files and directories, using statistics > available from published papers. > > The performance implications of a given metadata block size choice > is a subject of nearly infinite depth, and this question ultimately > can only be answered by doing experiments with a specific workload > on specific hardware. The metadata space utilization efficiency is > something that can be answered conclusively though. > > yuri > > "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi > Sven, I am confused by your statement that the metadata block size > should be 1 MB and am very int > > From: "Buterbaugh, Kevin L" > > To: gpfsug main discussion list >, > Date: 09/24/2016 07:19 AM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Hi Sven, > > I am confused by your statement that the metadata block size should > be 1 MB and am very interested in learning the rationale behind this > as I am currently looking at all aspects of our current GPFS > configuration and the possibility of making major changes. 
> > If you have a filesystem with only metadataOnly disks in the system > pool and the default size of an inode is 4K (which we would do, > since we have recently discovered that even on our scratch > filesystem we have a bazillion files that are 4K or smaller and > could therefore have their data stored in the inode, right?), then > why would you set the metadata block size to anything larger than > 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block > size for metadata wouldn?t you be wasting a massive amount of space? > > What am I missing / confused about there? > > Oh, and here?s a related question ? let?s just say I have the above > configuration ? my system pool is metadata only and is on SSD?s. > Then I have two other dataOnly pools that are spinning disk. One is > for ?regular? access and the other is the ?capacity? pool ? i.e. a > pool of slower storage where we move files with large access times. > I have a policy that says something like ?move all files with an > access time > 6 months to the capacity pool.? Of those bazillion > files less than 4K in size that are fitting in the inode currently, > probably half a bazillion () of them would be subject to that > rule. Will they get moved to the spinning disk capacity pool or will > they stay in the inode?? > > Thanks! This is a very timely and interesting discussion for me as well... > > Kevin > On Sep 23, 2016, at 4:35 PM, Sven Oehme > wrote: > your metadata block size these days should be 1 MB and there are > only very few workloads for which you should run with a filesystem > blocksize below 1 MB. so if you don't know exactly what to pick, 1 > MB is a good starting point. > the general rule still applies that your filesystem blocksize > (metadata or data pool) should match your raid controller (or GNR > vdisk) stripe size of the particular pool. > > so if you use a 128k strip size(defaut in many midrange storage > controllers) in a 8+2p raid array, your stripe or track size is 1 MB > and therefore the blocksize of this pool should be 1 MB. i see many > customers in the field using 1MB or even smaller blocksize on RAID > stripes of 2 MB or above and your performance will be significant > impacted by that. > > Sven > > ------------------------------------------ > Sven Oehme > Scalable Storage Research > email: oehmes at us.ibm.com > Phone: +1 (408) 824-8904 > IBM Almaden Research Lab > ------------------------------------------ > > Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too > pedantic, but I believe the the subblock size is 1/32 of the block > size (which strengt > > From: Stephen Ulmer > > To: gpfsug main discussion list > > Date: 09/23/2016 12:16 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Not to be too pedantic, but I believe the the subblock size is 1/32 > of the block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but > about metadata block size. You can specify that the system pool have > a different block size from the rest of the filesystem, providing > that it ONLY holds metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems > without some counter-indication), I would think that we?d want to > use a metadata block size of 4K*32=128K. This is independent of the > regular block size, which you can calculate based on the workload if > you?re lucky. 
> > There could be a great reason NOT to use 128K metadata block size, > but I don?t know what it is. I?d be happy to be corrected about this > if it?s out of whack. > > -- > Stephen > On Sep 22, 2016, at 3:37 PM, Luis Bolinches > wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small > files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest > that you will actually benefit from smaller than 1MB block, leave > there. GPFS uses sublocks where 1/16th of the BS can be allocated to > different files, so the "waste" is much less than you think on 1MB > and you get the throughput and less structures of much more data blocks. > > No warranty at all but I try to do this when the BS talk comes in: > (might need some clean up it could not be last note but you get the idea) > > POSIX > find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to > inodes (WELL not exactly ... extended attributes! so maybe use a > lower number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and > lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful > machine to run the calculations on octave, I never hit anything > could not manage on a 64GB RAM Power box. Most of the times it is > enough with my laptop. > > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene > > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very small > > files in the inode. > > > > Typically you want your average file size to be your blocksize and your > > filesystem has one blocksize and one inodesize. > > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. 
> > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Wed Sep 28 15:34:05 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Wed, 28 Sep 2016 10:34:05 -0400 Subject: [gpfsug-discuss] Blocksize - file size distribution In-Reply-To: References: Message-ID: Consider using samples/ilm/mmfind (or mmapplypolicy with a LIST ... SHOW rule) to gather the stats much faster. Should be minutes, not hours. -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Wed Sep 28 16:23:12 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Wed, 28 Sep 2016 11:23:12 -0400 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <4BDC0E5A-176A-48CC-8DE6-93C7B5A3F138@vanderbilt.edu> References: <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org><13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org><6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> <4BDC0E5A-176A-48CC-8DE6-93C7B5A3F138@vanderbilt.edu> Message-ID: OKAY, I'll say it again. inodes are PACKED into a single inode file. So a 4KB inode takes 4KB, REGARDLESS of metadata blocksize. There is no wasted space. (Of course if you have metadata replication = 2, then yes, double that. And yes, there overhead for indirect blocks (indices), allocation maps, etc, etc.) And your choice is not just 512 or 4096. 
Maybe 1KB or 2KB is a good choice for your data distribution, to optimize packing of data and/or directories into inodes... Hmmm... I don't know why the doc leaves out 2048, perhaps a typo... mmcrfs x2K -i 2048 [root at n2 charts]# mmlsfs x2K -i flag value description ------------------- ------------------------ ----------------------------------- -i 2048 Inode size in bytes Works for me! -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Wed Sep 28 16:33:29 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Wed, 28 Sep 2016 11:33:29 -0400 Subject: [gpfsug-discuss] Proposal for dynamic heat assisted file tiering In-Reply-To: References: Message-ID: Suppose, we could "dynamically" change the pool assignment of a file. How/when would you have us do that? When will that generate unnecessary, "wasteful" IOPs? How do we know if/when/how often you will access a file in the future? This is similar to other classical caching policies, but there the choice is usually just which pages to flush from the cache when we need space ... The usual compromise is "LRU" but maybe some systems allow hints. When there are multiple pools, it seems more complicated, more degrees of freedom ... Would you be willing and able to write some new policy rules to provide directions to Spectrum Scale for dynamic tiering? What would that look like? Would it be worth the time and effort over what we have now? -------------- next part -------------- An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Wed Sep 28 19:13:35 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Wed, 28 Sep 2016 18:13:35 +0000 Subject: [gpfsug-discuss] Biggest file that will fit inside an inode? Message-ID: <0D55AB1D-DB9D-45CF-AB27-157CDA1172D9@nuance.com> What the largest file that will fit inside a 1K, 2K, or 4K inode? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid -------------- next part -------------- An HTML attachment was scrubbed... URL: From ewahl at osc.edu Wed Sep 28 21:18:55 2016 From: ewahl at osc.edu (Edward Wahl) Date: Wed, 28 Sep 2016 16:18:55 -0400 Subject: [gpfsug-discuss] Blocksize - file size distribution In-Reply-To: References: Message-ID: <20160928161855.1df32434@osc.edu> On Wed, 28 Sep 2016 10:34:05 -0400 Marc A Kaplan wrote: > Consider using samples/ilm/mmfind (or mmapplypolicy with a LIST ... > SHOW rule) to gather the stats much faster. Should be minutes, not > hours. > I'll agree with the policy engine. Runs like a beast if you tune it a little for nodes and threads. Only takes a couple of minutes to collect info on over a hundred million files. Show where the data is now by pool and sort it by age with queries? quick hack up example. you could sort the mess on the front end fairly quickly. (use fileset or pool, etc as your storage needs) RULE '2yrold_files' LIST '2yrold_filelist.txt' SHOW (varchar(file_size) || ' ' || varchar(USER_ID) || ' ' || varchar(POOL_NAME)) WHERE DAYS(CURRENT_TIMESTAMP) - DAYS(ACCESS_TIME) >= 730 AND DAYS(CURRENT_TIMESTAMP) - DAYS(ACCESS_TIME) < 1095 don't forget to run the engine with the -I defer for this kind of list/show policy. 
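For illustration, a run of a list/show rule like the one above, tuned for a bit of parallelism, could look roughly like this (policy file name, node names, thread count and the work/output directories are placeholders only):

mmapplypolicy gpfs0 -P filelist.pol -I defer -f /gpfs/gpfs0/tmp/policylists -N node01,node02,node03 -m 8 -g /gpfs/gpfs0/tmp/policywork -L 1

-N and -m spread the directory walk and inode scan over helper nodes and threads, -g points the shared work directory somewhere every helper node can reach, and -f sets the prefix for the generated list files.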
Ed -- Ed Wahl Ohio Supercomputer Center 614-292-9302 From christof.schmitt at us.ibm.com Wed Sep 28 21:33:45 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Wed, 28 Sep 2016 13:33:45 -0700 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: <20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> References: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> <20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> Message-ID: The client has to reconnect, open the file again and reissue request that have not been completed. Without persistent handles, the main risk is that another client can step in and access the same file in the meantime. With persistent handles, access from other clients would be prevented for a defined amount of time. Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Lukas Hejtmanek To: gpfsug main discussion list Date: 09/27/2016 02:43 PM Subject: Re: [gpfsug-discuss] Samba via CES Sent by: gpfsug-discuss-bounces at spectrumscale.org On Tue, Sep 27, 2016 at 02:36:37PM -0700, Christof Schmitt wrote: > When a CES node fails, protocol clients have to reconnect to one of the > remaining nodes. > > Samba in CES does not support persistent handles. This is indicated in the > documentation: > > http://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_smbexportlimits.htm#bl1adm_smbexportlimits > > "Only mandatory SMB3 protocol features are supported. " well, but in this case, HA feature is a bit pointless as node fail results in a client failure as well as reconnect does not seem to be automatic if there is on going traffic.. more precisely reconnect is automatic but without persistent handles, the client receives write protect error immediately. -- Luk?? Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From bbanister at jumptrading.com Wed Sep 28 21:56:47 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Wed, 28 Sep 2016 20:56:47 +0000 Subject: [gpfsug-discuss] Biggest file that will fit inside an inode? In-Reply-To: <0D55AB1D-DB9D-45CF-AB27-157CDA1172D9@nuance.com> References: <0D55AB1D-DB9D-45CF-AB27-157CDA1172D9@nuance.com> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB0633CA80@CHI-EXCHANGEW1.w2k.jumptrading.com> I think the guideline for 4K inodes is roughly 3.5KB depending on use of extended attributes, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Oesterlin, Robert Sent: Wednesday, September 28, 2016 1:14 PM To: gpfsug main discussion list Subject: [gpfsug-discuss] Biggest file that will fit inside an inode? What the largest file that will fit inside a 1K, 2K, or 4K inode? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. 
This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhejtman at ics.muni.cz Wed Sep 28 23:03:36 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Thu, 29 Sep 2016 00:03:36 +0200 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: References: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> <20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> Message-ID: <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz> On Wed, Sep 28, 2016 at 01:33:45PM -0700, Christof Schmitt wrote: > The client has to reconnect, open the file again and reissue request that > have not been completed. Without persistent handles, the main risk is that > another client can step in and access the same file in the meantime. With > persistent handles, access from other clients would be prevented for a > defined amount of time. well, I guess I cannot reconfigure the client so that reissuing request is done by OS and not rised up to the user? E.g., if user runs video encoding directly to Samba share and encoding runs for several hours, reissuing request, i.e., restart encoding, is not exactly what user accepts. -- Luk?? Hejtm?nek From abeattie at au1.ibm.com Wed Sep 28 23:25:01 2016 From: abeattie at au1.ibm.com (Andrew Beattie) Date: Wed, 28 Sep 2016 22:25:01 +0000 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz> References: <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz>, <20160927193815.kpppiy76wpudg6cj@ics.muni.cz><20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> Message-ID: An HTML attachment was scrubbed... URL: From Greg.Lehmann at csiro.au Wed Sep 28 23:49:31 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Wed, 28 Sep 2016 22:49:31 +0000 Subject: [gpfsug-discuss] Blocksize - file size distribution In-Reply-To: References: Message-ID: <2ed56fe8c9c34eb5a1da25800b2951e0@exch1-cdc.nexus.csiro.au> Kevin, Thanks for the offer of help. I am capable of writing my own, but it looks like the best approach is to use mmapplypolicy, something I had not thought of. This is precisely the reason I asked what looks like a silly question. You don?t know what you don?t know! The quality of content on this list has been exceptional of late! Cheers, Greg From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: Wednesday, 28 September 2016 11:45 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Blocksize - file size distribution Greg, Not saying this is the right way to go, but I rolled my own. I wrote a very simple Perl script that essentially does the Perl equivalent of a find on my GPFS filesystems, then stat?s files and directories and writes the output to a text file. I run that one overnight or on the weekends. Takes about 6 hours to run across our 3 GPFS filesystems with metadata on SSDs. Then I?ve written a couple of different Perl scripts to analyze that data in different ways (the idea being that collecting the data is ?expensive? ? but once you?ve got it it?s cheap to analyze it in different ways). But the one I?ve been using for this project just breaks down the number of files and directories by size and age and produces a table. 
Rather than try to describe this, here?s sample output: For input file: gpfsFileInfo_20160915.txt <1 day | <1 wk | <1 mo | <2 mo | <3 mo | <4 mo | <5 mo | <6 mo | <1 yr | >1 year | Total Files <1 KB 29538 111364 458260 634398 150199 305715 4388443 93733 966618 3499535 10637803 <2 KB 9875 20580 119414 295167 35961 67761 80462 33688 269595 851641 1784144 <4 KB 9212 45282 168678 496796 27771 23887 105135 23161 259242 1163327 2322491 <8 KB 4992 29284 105836 303349 28341 20346 246430 28394 216061 1148459 2131492 <16 KB 3987 18391 92492 218639 20513 19698 675097 30976 190691 851533 2122017 <32 KB 4844 12479 50235 265222 24830 18789 1058433 18030 196729 1066287 2715878 <64 KB 6358 24259 29474 222134 17493 10744 1381445 11358 240528 1123540 3067333 <128 KB 6531 59107 206269 186213 71823 114235 1008724 36722 186357 845921 2721902 <256 KB 1995 17638 19355 436611 8505 7554 3582738 7519 249510 744885 5076310 <512 KB 20645 12401 24700 111463 5659 22132 1121269 10774 273010 725155 2327208 <1 MB 2681 6482 37447 58459 6998 14945 305108 5857 160360 386152 984489 <4 MB 4554 84551 23320 100407 6818 32833 129758 22774 210935 528458 1144408 <1 GB 56652 33538 99667 87778 24313 68372 118928 42554 251528 916493 1699823 <10 GB 1245 2482 4524 3184 1043 1794 2733 1694 8731 20462 47892 <100 GB 47 230 470 265 92 198 172 122 1276 2061 4933 >100 GB 2 3 12 1 14 4 5 1 37 165 244 Total TB: 6.49 13.22 30.56 18.00 10.22 15.69 19.87 12.48 73.47 187.44 Grand Total: 387.46 TB Everything other than the total space lines at the bottom are counts of number of files meeting that criteria. I?ve got another variation on the same script that we used when we were trying to determine how many old files we have and therefore how much data was an excellent candidate for moving to a slower, less expensive ?capacity? pool. I?m not sure how useful my tools would be to others ? I?m certainly not a professional programmer by any stretch of the imagination (and yes, I do hear those of you who are saying, ?Yeah, he?s barely a professional SysAdmin!? ). But others of you have been so helpful to me ? I?d like to try in some small way to help someone else. Kevin On Sep 28, 2016, at 5:56 AM, Oesterlin, Robert > wrote: /usr/lpp/mmfs/samples/debugtools/filehist Look at the README in that directory. Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Greg.Lehmann at csiro.au" > Reply-To: gpfsug main discussion list > Date: Wednesday, September 28, 2016 at 2:40 AM To: "gpfsug-discuss at spectrumscale.org" > Subject: [EXTERNAL] Re: [gpfsug-discuss] Blocksize I am wondering what people use to produce a file size distribution report for their filesystems. Has everyone rolled their own or is there some goto app to use. Cheers, Greg From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: Wednesday, 28 September 2016 7:21 AM To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Blocksize Hi All, Again, my thanks to all who responded to my last post. Let me begin by stating something I unintentionally omitted in my last post ? we already use SSDs for our metadata. Which leads me to yet another question ? of my three filesystems, two (/home and /scratch) are much older (created in 2010) and therefore currently have a 512 byte inode size. /data is newer and has a 4K inode size. 
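(Quick aside for anyone checking their own file systems: the inode size a file system was created with can be read back with mmlsfs <device> -i, the same 'Inode size in bytes' line quoted elsewhere in this thread, and mmlsfs <device> -B reports the block size.)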
Now if I combine /scratch and /data into one filesystem with a 4K inode size, the amount of space used by all the inodes coming over from /scratch is going to grow by a factor of eight unless I?m horribly confused. And I would assume I need to count the amount of space taken up by allocated inodes, not just used inodes. Therefore ? how much space my metadata takes up just grew significantly in importance since: 1) metadata is on very expensive enterprise class, vendor certified SSDs, 2) we use RAID 1 mirrors of those SSDs, and 3) we have metadata replication set to two. Some of the information presented by Sven and Yuri seems to contradict each other in regards to how much space inodes take up ? or I?m misunderstanding one or both of them! Leaving aside replication, if I use a 256K block size for my metadata and I use 4K inodes, are those inodes going to take up 4K each or are they going to take up 8K each (1/32nd of a 256K block)? By the way, I do have a file size / file age spreadsheet for each of my filesystems (which I would be willing to share with interested parties) and while I was not surprised to learn that I have over 10 million sub-1K files on /home, I was a bit surprised to find that I have almost as many sub-1K files on /scratch (and a few million more on /data). So there?s a huge potential win in having those files in the inode on SSD as opposed to on spinning disk, but there?s also a huge potential $$$ cost. Thanks again ? I hope others are gaining useful information from this thread. I sure am! Kevin On Sep 27, 2016, at 1:26 PM, Yuri L Volobuev > wrote: > 1) Let?s assume that our overarching goal in configuring the block > size for metadata is performance from the user perspective ? i.e. > how fast is an ?ls -l? on my directory? Space savings aren?t > important, and how long policy scans or other ?administrative? type > tasks take is not nearly as important as that directory listing. > Does that change the recommended metadata block size? The performance challenges for the "ls -l" scenario are quite different from the policy scan scenario, so the same rules do not necessarily apply. During "ls -l" the code has to read inodes one by one (there's some prefetching going on, to take the edge off for the actual 'ls' thread, but prefetching is still one inode at a time). Metadata block size doesn't really come into the picture in this case, but inode size can be important -- depending on the storage performance characteristics. Does the storage you use for metadata exhibit a meaningfully different latency for 4K random reads vs 512 byte random reads? In my personal experience, on any modern storage device the difference is non-existent; in fact many devices (like all flash-based storage) use 4K native physical block size, and merely emulate 512 byte "sectors", so there's no way to read less than 4K. So from the inode read latency point of view 4K vs 512B is most likely a wash, but then 4K inodes can help improve performance of other operations, e.g. readdir of a small directory which fits entirely into the inode. If you use xattrs (e.g. as a side effect of using HSM), 4K inodes definitely help, but allowing xattrs to be stored in the inode. Policy scans reads inodes in full blocks, and there both metadata block size and inode size matter. Larger blocks could improve the inode read performance, while larger inodes mean that the same number of blocks hold fewer inodes and thus more blocks need to be read. 
So the policy inode scan performance is benefited by larger metadata block size and smaller inodes. However, policy scans also have to perform a directory traversal step, and that step tends to dominate the runtime of the overall run, and using larger inodes actually helps to speed up traversal of smaller directories. So whether larger inodes help or hurt the policy scan performance depends, yet again, on your file system composition. Overall, I believe that with all angles considered, larger inodes help with performance, and that was one of the considerations for making 4K inodes the default in V4.2+ versions. > 2) Let?s assume we have 3 filesystems, /home, /scratch (traditional > HPC use for those two) and /data (project space). Our storage > arrays are 24-bay units with two 8+2P RAID 6 LUNs, one RAID 1 > mirror, and two hot spare drives. The RAID 1 mirrors are for /home, > the RAID 6 LUNs are for /scratch or /data. /home has tons of small > files - so small that a 64K block size is currently used. /scratch > and /data have a mixture, but a 1 MB block size is the ?sweet spot? there. > > If you could ?start all over? with the same hardware being the only > restriction, would you: > > a) merge /scratch and /data into one filesystem but keep /home > separate since the LUN sizes are so very different, or > b) merge all three into one filesystem and use storage pools so that > /home is just a separate pool within the one filesystem? And if you > chose this option would you assign different block sizes to the pools? It's not possible to have different block sizes for different data pools. We are very aware that many people would like to be able to do just that, but this is counter to where the product is going. Supporting different block sizes for different pools is actually pretty hard: it's tricky to describe a large file that has some blocks in poolA and some in poolB where poolB has a different block size (perhaps during a migration) with the existing inode/indirect block format where each disk address pointer addresses a block of fixed size. With some effort, and some changes to how block addressing works, it would be possible to implement the support for this. However, as I mentioned in another post in this thread, we don't really want to glorify manual block size selection any further, we want to move away from it, by addressing the reasons that drive different block size selection today (like disk space utilization and performance). I'd recommend calculating a file size distribution histogram for your file systems. You may, for example, discover that 80% of the small files you have in /home would fit into 4K inodes, and then the storage space efficiency gains for the remaining 20% don't justify the complexity of managing an extra file system with a small block size. We don't recommend using block sizes smaller than 256K, because smaller block size is not good for disk space allocation code efficiency. It's a quadratic dependency: with a smaller block size, one block worth of the block allocation map covers that much less disk space, because each bit in the map covers fewer disk sectors, and fewer bits fit into a block. This means having to create a lot more block allocation map segments than what is needed for an ample level of parallelism. This hurts performance of many block allocation-related operations. I don't see a reason for /scratch and /data to be separate file systems, aside from perhaps failure domain considerations. 
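To put rough numbers on that quadratic dependency, here is a simplified back-of-the-envelope model; it assumes one allocation-map bit per subblock and ignores the real on-disk map layout, so treat it as illustrative only. With a 64 KiB block size the subblock is 2 KiB, and one 64 KiB map block holds 512 Ki bits, so a single map block covers about 1 GiB of disk. With a 1 MiB block size the subblock is 32 KiB, one 1 MiB map block holds 8 Mi bits, and a single map block covers about 256 GiB. A 16x larger block size therefore buys roughly 16 x 16 = 256x more coverage per map block, which is why very small block sizes force the allocation map to be carved into so many more segments.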
yuri > On Sep 26, 2016, at 2:29 PM, Yuri L Volobuev > wrote: > > I would put the net summary this way: in GPFS, the "Goldilocks zone" > for metadata block size is 256K - 1M. If one plans to create a new > file system using GPFS V4.2+, 1M is a sound choice. > > In an ideal world, block size choice shouldn't really be a choice. > It's a low-level implementation detail that one day should go the > way of the manual ignition timing adjustment -- something that used > to be necessary in the olden days, and something that select > enthusiasts like to tweak to this day, but something that's > irrelevant for the overwhelming majority of the folks who just want > the engine to run. There's work being done in that general direction > in GPFS, but we aren't there yet. > > yuri > > Stephen Ulmer ---09/26/2016 12:02:25 PM---Now I?ve got > anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: > > From: Stephen Ulmer > > To: gpfsug main discussion list >, > Date: 09/26/2016 12:02 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Now I?ve got anther question? which I?ll let bake for a while. > > Okay, to (poorly) summarize: > There are items OTHER THAN INODES stored as metadata in GPFS. > These items have a VARIETY OF SIZES, but are packed in such a way > that we should just not worry about wasted space unless we pick a > LARGE metadata block size ? or if we don?t pick a ?reasonable? > metadata block size after picking a ?large? file system block size > that applies to both. > Performance is hard, and the gain from calculating exactly the best > metadata block size is much smaller than performance gains attained > through code optimization. > If we were to try and calculate the appropriate metadata block size > we would likely be wrong anyway, since none of us get our data at > the idealized physics shop that sells massless rulers and > frictionless pulleys. > We should probably all use a metadata block size around 1MB. Nobody > has said this outright, but it?s been the example as the ?good? size > at least three times in this thread. > Under no circumstances should we do what many of us would have done > and pick 128K, which made sense based on all of our previous > education that is no longer applicable. > > Did I miss anything? :) > > Liberty, > > -- > Stephen > > On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev > wrote: > It's important to understand the differences between different > metadata types, in particular where it comes to space allocation. > > System metadata files (inode file, inode and block allocation maps, > ACL file, fileset metadata file, EA file in older versions) are > allocated at well-defined moments (file system format, new storage > pool creation in the case of block allocation map, etc), and those > contain multiple records packed into a single block. From the block > allocator point of view, the individual metadata record size is > invisible, only larger blocks get actually allocated, and space > usage efficiency generally isn't an issue. > > For user metadata (indirect blocks, directory blocks, EA overflow > blocks) the situation is different. Those get allocated as the need > arises, generally one at a time. So the size of an individual > metadata structure matters, a lot. The smallest unit of allocation > in GPFS is a subblock (1/32nd of a block). If an IB or a directory > block is smaller than a subblock, the unused space in the subblock > is wasted. 
So if one chooses to use, say, 16 MiB block size for > metadata, the smallest unit of space that can be allocated is 512 > KiB. If one chooses 1 MiB block size, the smallest allocation unit > is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with > any reasonable data block size); directory blocks used to be limited > to 32 KiB, but in the current code can be as large as 256 KiB. As > one can observe, using 16 MiB metadata block size would lead to a > considerable amount of wasted space for IBs and large directories > (small directories can live in inodes). On the other hand, with 1 > MiB block size, there'll be no wasted metadata space. Does any of > this actually make a practical difference? That depends on the file > system composition, namely the number of IBs (which is a function of > the number of large files) and larger directories. Calculating this > scientifically can be pretty involved, and really should be the job > of a tool that ought to exist, but doesn't (yet). A more practical > approach is doing a ballpark estimate using local file counts and > typical fractions of large files and directories, using statistics > available from published papers. > > The performance implications of a given metadata block size choice > is a subject of nearly infinite depth, and this question ultimately > can only be answered by doing experiments with a specific workload > on specific hardware. The metadata space utilization efficiency is > something that can be answered conclusively though. > > yuri > > "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi > Sven, I am confused by your statement that the metadata block size > should be 1 MB and am very int > > From: "Buterbaugh, Kevin L" > > To: gpfsug main discussion list >, > Date: 09/24/2016 07:19 AM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Hi Sven, > > I am confused by your statement that the metadata block size should > be 1 MB and am very interested in learning the rationale behind this > as I am currently looking at all aspects of our current GPFS > configuration and the possibility of making major changes. > > If you have a filesystem with only metadataOnly disks in the system > pool and the default size of an inode is 4K (which we would do, > since we have recently discovered that even on our scratch > filesystem we have a bazillion files that are 4K or smaller and > could therefore have their data stored in the inode, right?), then > why would you set the metadata block size to anything larger than > 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block > size for metadata wouldn?t you be wasting a massive amount of space? > > What am I missing / confused about there? > > Oh, and here?s a related question ? let?s just say I have the above > configuration ? my system pool is metadata only and is on SSD?s. > Then I have two other dataOnly pools that are spinning disk. One is > for ?regular? access and the other is the ?capacity? pool ? i.e. a > pool of slower storage where we move files with large access times. > I have a policy that says something like ?move all files with an > access time > 6 months to the capacity pool.? Of those bazillion > files less than 4K in size that are fitting in the inode currently, > probably half a bazillion () of them would be subject to that > rule. Will they get moved to the spinning disk capacity pool or will > they stay in the inode?? > > Thanks! 
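(For concreteness, the kind of rule described in that quoted question would normally be written along these lines, with the pool names and the 180-day threshold as placeholders:

RULE 'oldToCapacity' MIGRATE FROM POOL 'regular' TO POOL 'capacity' WHERE (DAYS(CURRENT_TIMESTAMP) - DAYS(ACCESS_TIME)) > 180

and applied with mmapplypolicy, like the other policy examples in this thread.)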
This is a very timely and interesting discussion for me as well... > > Kevin > On Sep 23, 2016, at 4:35 PM, Sven Oehme > wrote: > your metadata block size these days should be 1 MB and there are > only very few workloads for which you should run with a filesystem > blocksize below 1 MB. so if you don't know exactly what to pick, 1 > MB is a good starting point. > the general rule still applies that your filesystem blocksize > (metadata or data pool) should match your raid controller (or GNR > vdisk) stripe size of the particular pool. > > so if you use a 128k strip size(defaut in many midrange storage > controllers) in a 8+2p raid array, your stripe or track size is 1 MB > and therefore the blocksize of this pool should be 1 MB. i see many > customers in the field using 1MB or even smaller blocksize on RAID > stripes of 2 MB or above and your performance will be significant > impacted by that. > > Sven > > ------------------------------------------ > Sven Oehme > Scalable Storage Research > email: oehmes at us.ibm.com > Phone: +1 (408) 824-8904 > IBM Almaden Research Lab > ------------------------------------------ > > Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too > pedantic, but I believe the the subblock size is 1/32 of the block > size (which strengt > > From: Stephen Ulmer > > To: gpfsug main discussion list > > Date: 09/23/2016 12:16 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Not to be too pedantic, but I believe the the subblock size is 1/32 > of the block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but > about metadata block size. You can specify that the system pool have > a different block size from the rest of the filesystem, providing > that it ONLY holds metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems > without some counter-indication), I would think that we?d want to > use a metadata block size of 4K*32=128K. This is independent of the > regular block size, which you can calculate based on the workload if > you?re lucky. > > There could be a great reason NOT to use 128K metadata block size, > but I don?t know what it is. I?d be happy to be corrected about this > if it?s out of whack. > > -- > Stephen > On Sep 22, 2016, at 3:37 PM, Luis Bolinches > wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small > files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest > that you will actually benefit from smaller than 1MB block, leave > there. GPFS uses sublocks where 1/16th of the BS can be allocated to > different files, so the "waste" is much less than you think on 1MB > and you get the throughput and less structures of much more data blocks. > > No warranty at all but I try to do this when the BS talk comes in: > (might need some clean up it could not be last note but you get the idea) > > POSIX > find . 
-type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to > inodes (WELL not exactly ... extended attributes! so maybe use a > lower number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and > lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful > machine to run the calculations on octave, I never hit anything > could not manage on a 64GB RAM Power box. Most of the times it is > enough with my laptop. > > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene > > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very small > > files in the inode. > > > > Typically you want your average file size to be your blocksize and your > > filesystem has one blocksize and one inodesize. > > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. > > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From Greg.Lehmann at csiro.au Wed Sep 28 23:54:36 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Wed, 28 Sep 2016 22:54:36 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org><13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org><6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> <4BDC0E5A-176A-48CC-8DE6-93C7B5A3F138@vanderbilt.edu> Message-ID: Are there any presentation available online that provide diagrams of the directory/file creation process and modifications in terms of how the blocks/inodes and indirect blocks etc are used. I would guess there are a few different cases that would need to be shown. This is the sort of thing that would great in a decent text book on GPFS (doesn't exist as far as I am aware.) Cheers, Greg From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Marc A Kaplan Sent: Thursday, 29 September 2016 1:23 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Blocksize OKAY, I'll say it again. inodes are PACKED into a single inode file. So a 4KB inode takes 4KB, REGARDLESS of metadata blocksize. There is no wasted space. (Of course if you have metadata replication = 2, then yes, double that. And yes, there overhead for indirect blocks (indices), allocation maps, etc, etc.) And your choice is not just 512 or 4096. Maybe 1KB or 2KB is a good choice for your data distribution, to optimize packing of data and/or directories into inodes... Hmmm... I don't know why the doc leaves out 2048, perhaps a typo... 
mmcrfs x2K -i 2048 [root at n2 charts]# mmlsfs x2K -i flag value description ------------------- ------------------------ ----------------------------------- -i 2048 Inode size in bytes Works for me! -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhejtman at ics.muni.cz Wed Sep 28 23:58:15 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Thu, 29 Sep 2016 00:58:15 +0200 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: References: <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz> <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> <20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> Message-ID: <20160928225815.rpzdzjevro37ur7b@ics.muni.cz> On Wed, Sep 28, 2016 at 10:25:01PM +0000, Andrew Beattie wrote: > In that scenario, would you not be better off using a native Spectrum > Scale client installed on the workstation that the video editor is using > with a local mapped drive, rather than a SMB share? > ? > This would prevent this the scenario you have proposed occurring. indeed, it would be better, but why one would have CES at all? I would like to use CES but it seems that it is not quite ready yet for such a scenario. -- Luk?? Hejtm?nek From christof.schmitt at us.ibm.com Thu Sep 29 00:06:59 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Wed, 28 Sep 2016 16:06:59 -0700 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz> References: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz><20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz> Message-ID: The exact behavior depends on the client and the application. I would suggest explicit testing of the protocol failover if that is a concern. Samba does not support persistent handles, so that would be a completely new feature. There is some support available for durable handles which have weaker guarantees, and which are also disabled in CES Samba due to known issues in large deployments. In cases where SMB protocol failover becomes an issue and durable handles might help, that might be an approach to improve the failover behavior. Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Lukas Hejtmanek To: gpfsug main discussion list Date: 09/28/2016 03:04 PM Subject: Re: [gpfsug-discuss] Samba via CES Sent by: gpfsug-discuss-bounces at spectrumscale.org On Wed, Sep 28, 2016 at 01:33:45PM -0700, Christof Schmitt wrote: > The client has to reconnect, open the file again and reissue request that > have not been completed. Without persistent handles, the main risk is that > another client can step in and access the same file in the meantime. With > persistent handles, access from other clients would be prevented for a > defined amount of time. well, I guess I cannot reconfigure the client so that reissuing request is done by OS and not rised up to the user? E.g., if user runs video encoding directly to Samba share and encoding runs for several hours, reissuing request, i.e., restart encoding, is not exactly what user accepts. -- Luk?? 
Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From abeattie at au1.ibm.com Thu Sep 29 00:37:25 2016 From: abeattie at au1.ibm.com (Andrew Beattie) Date: Wed, 28 Sep 2016 23:37:25 +0000 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: <20160928225815.rpzdzjevro37ur7b@ics.muni.cz> References: <20160928225815.rpzdzjevro37ur7b@ics.muni.cz>, <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz><20160927193815.kpppiy76wpudg6cj@ics.muni.cz><20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> Message-ID: An HTML attachment was scrubbed... URL: From aaron.knister at gmail.com Thu Sep 29 02:43:52 2016 From: aaron.knister at gmail.com (Aaron Knister) Date: Wed, 28 Sep 2016 21:43:52 -0400 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: <96282850-6bfa-73ae-8502-9e8df3a56390@nasa.gov> Message-ID: Thanks Everyone for your replies! (Quick disclaimer, these opinions are my own, and not those of my employer or NASA). Not knowing what's coming at the NDA session, it seems to boil down to "it ain't gonna happen" because of: - Perceived difficulty in supporting whatever creative hardware solutions customers may throw at it. I understand the support concerns, but I naively thought that assuming the hardware meets a basic set of requirements (e.g. redundant sas paths, x type of drives) it would be fairly supportable with GNR. The DS3700 shelves are re-branded NetApp E-series shelves and pretty vanilla I thought. - IBM would like to monetize the product and compete with the likes of DDN/Seagate This is admittedly a little disappointing. GPFS as long as I've known it has been largely hardware vendor agnostic. To see even a slight shift towards hardware vendor lockin and certain features only being supported and available on IBM hardware is concerning. It's not like the software itself is free. Perhaps GNR could be a paid add-on license for non-IBM hardware? Just thinking out-loud. The big things I was looking to GNR for are: - end-to-end checksums - implementing a software RAID layer on (in my case enterprise class) JBODs I can find a way to do the second thing, but the former I cannot. Requiring IBM hardware to get end-to-end checksums is a huge red flag for me. That's something Lustre will do today with ZFS on any hardware ZFS will run on (and for free, I might add). I would think GNR being openly available to customers would be important for GPFS to compete with Lustre. Furthermore, I had opened an RFE (#84523) a while back to implement checksumming of data for non-GNR environments. The RFE was declined because essentially it would be too hard and it already exists for GNR. Well, considering I don't have a GNR environment, and hardware vendor lock in is something many sites are not interested in, that's somewhat of a problem. I really hope IBM reconsiders their stance on opening up GNR. The current direction, while somewhat understandable, leaves a really bad taste in my mouth and is one of the (very few, in my opinion) features Lustre has over GPFS. -Aaron On 9/1/16 9:59 AM, Marc A Kaplan wrote: > I've been told that it is a big leap to go from supporting GSS and ESS > to allowing and supporting native raid for customers who may throw > together "any" combination of hardware they might choose. > > In particular the GNR "disk hospital" functions... 
> https://www.ibm.com/support/knowledgecenter/SSFKCN_3.5.0/com.ibm.cluster.gpfs.v3r5.gpfs200.doc/bl1adv_introdiskhospital.htm > will be tricky to support on umpteen different vendor boxes -- and keep > in mind, those will be from IBM competitors! > > That said, ESS and GSS show that IBM has some good tech in this area and > IBM has shown with the Spectrum Scale product (sans GNR) it can support > just about any semi-reasonable hardware configuration and a good slew of > OS versions and architectures... Heck I have a demo/test version of GPFS > running on a 5 year old Thinkpad laptop.... And we have some GSSs in the > lab... Not to mention Power hardware and mainframe System Z (think 360, > 370, 290, Z) > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > From oehmes at us.ibm.com Thu Sep 29 03:28:03 2016 From: oehmes at us.ibm.com (Sven Oehme) Date: Wed, 28 Sep 2016 19:28:03 -0700 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: <96282850-6bfa-73ae-8502-9e8df3a56390@nasa.gov> Message-ID: Hi Aaron, the best way to express this 'need' is to vote and leave comments in the RFE's : this is an RFE for GNR as SW : http://www.ibm.com/developerworks/rfe/execute?use_case=viewRfe&CR_ID=95090 everybody who wants this to be one should vote for it and leave comments on what they expect. Sven From: Aaron Knister To: gpfsug-discuss at spectrumscale.org Date: 09/28/2016 06:44 PM Subject: Re: [gpfsug-discuss] gpfs native raid Sent by: gpfsug-discuss-bounces at spectrumscale.org Thanks Everyone for your replies! (Quick disclaimer, these opinions are my own, and not those of my employer or NASA). Not knowing what's coming at the NDA session, it seems to boil down to "it ain't gonna happen" because of: - Perceived difficulty in supporting whatever creative hardware solutions customers may throw at it. I understand the support concerns, but I naively thought that assuming the hardware meets a basic set of requirements (e.g. redundant sas paths, x type of drives) it would be fairly supportable with GNR. The DS3700 shelves are re-branded NetApp E-series shelves and pretty vanilla I thought. - IBM would like to monetize the product and compete with the likes of DDN/Seagate This is admittedly a little disappointing. GPFS as long as I've known it has been largely hardware vendor agnostic. To see even a slight shift towards hardware vendor lockin and certain features only being supported and available on IBM hardware is concerning. It's not like the software itself is free. Perhaps GNR could be a paid add-on license for non-IBM hardware? Just thinking out-loud. The big things I was looking to GNR for are: - end-to-end checksums - implementing a software RAID layer on (in my case enterprise class) JBODs I can find a way to do the second thing, but the former I cannot. Requiring IBM hardware to get end-to-end checksums is a huge red flag for me. That's something Lustre will do today with ZFS on any hardware ZFS will run on (and for free, I might add). I would think GNR being openly available to customers would be important for GPFS to compete with Lustre. Furthermore, I had opened an RFE (#84523) a while back to implement checksumming of data for non-GNR environments. The RFE was declined because essentially it would be too hard and it already exists for GNR. 
Well, considering I don't have a GNR environment, and hardware vendor lock in is something many sites are not interested in, that's somewhat of a problem. I really hope IBM reconsiders their stance on opening up GNR. The current direction, while somewhat understandable, leaves a really bad taste in my mouth and is one of the (very few, in my opinion) features Lustre has over GPFS. -Aaron On 9/1/16 9:59 AM, Marc A Kaplan wrote: > I've been told that it is a big leap to go from supporting GSS and ESS > to allowing and supporting native raid for customers who may throw > together "any" combination of hardware they might choose. > > In particular the GNR "disk hospital" functions... > https://www.ibm.com/support/knowledgecenter/SSFKCN_3.5.0/com.ibm.cluster.gpfs.v3r5.gpfs200.doc/bl1adv_introdiskhospital.htm > will be tricky to support on umpteen different vendor boxes -- and keep > in mind, those will be from IBM competitors! > > That said, ESS and GSS show that IBM has some good tech in this area and > IBM has shown with the Spectrum Scale product (sans GNR) it can support > just about any semi-reasonable hardware configuration and a good slew of > OS versions and architectures... Heck I have a demo/test version of GPFS > running on a 5 year old Thinkpad laptop.... And we have some GSSs in the > lab... Not to mention Power hardware and mainframe System Z (think 360, > 370, 290, Z) > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From daniel.kidger at uk.ibm.com Thu Sep 29 10:04:03 2016 From: daniel.kidger at uk.ibm.com (Daniel Kidger) Date: Thu, 29 Sep 2016 09:04:03 +0000 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: Message-ID: An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT1-graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From daniel.kidger at uk.ibm.com Thu Sep 29 10:25:59 2016 From: daniel.kidger at uk.ibm.com (Daniel Kidger) Date: Thu, 29 Sep 2016 09:25:59 +0000 Subject: [gpfsug-discuss] AFM cacheset mounting from the same GPFS cluster ? Message-ID: An HTML attachment was scrubbed... URL: From Kevin.Buterbaugh at Vanderbilt.Edu Thu Sep 29 16:03:08 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Thu, 29 Sep 2016 15:03:08 +0000 Subject: [gpfsug-discuss] Fwd: Blocksize References: Message-ID: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> Resending from the right e-mail address... Begin forwarded message: From: gpfsug-discuss-owner at spectrumscale.org Subject: Re: [gpfsug-discuss] Blocksize Date: September 29, 2016 at 10:00:36 AM CDT To: klb at accre.vanderbilt.edu You are not allowed to post to this mailing list, and your message has been automatically rejected. If you think that your messages are being rejected in error, contact the mailing list owner at gpfsug-discuss-owner at spectrumscale.org. From: "Kevin L. 
Buterbaugh" > Subject: Re: [gpfsug-discuss] Blocksize Date: September 29, 2016 at 10:00:29 AM CDT To: gpfsug main discussion list > Hi Marc and others, I understand ? I guess I did a poor job of wording my question, so I?ll try again. The IBM recommendation for metadata block size seems to be somewhere between 256K - 1 MB, depending on who responds to the question. If I were to hypothetically use a 256K metadata block size, does the ?1/32nd of a block? come into play like it does for ?not metadata?? I.e. 256 / 32 = 8K, so am I reading / writing *2* inodes (assuming 4K inode size) minimum? And here?s a really off the wall question ? yesterday we were discussing the fact that there is now a single inode file. Historically, we have always used RAID 1 mirrors (first with spinning disk, as of last fall now on SSD) for metadata and then use GPFS replication on top of that. But given that there is a single inode file is that ?old way? of doing things still the right way? In other words, could we potentially be better off by using a couple of 8+2P RAID 6 LUNs? One potential downside of that would be that we would then only have two NSD servers serving up metadata, so we discussed the idea of taking each RAID 6 LUN and splitting it up into multiple logical volumes (all that done on the storage array, of course) and then presenting those to GPFS as NSDs??? Or have I gone from merely asking stupid questions to Trump-level craziness???? ;-) Kevin On Sep 28, 2016, at 10:23 AM, Marc A Kaplan > wrote: OKAY, I'll say it again. inodes are PACKED into a single inode file. So a 4KB inode takes 4KB, REGARDLESS of metadata blocksize. There is no wasted space. (Of course if you have metadata replication = 2, then yes, double that. And yes, there overhead for indirect blocks (indices), allocation maps, etc, etc.) And your choice is not just 512 or 4096. Maybe 1KB or 2KB is a good choice for your data distribution, to optimize packing of data and/or directories into inodes... Hmmm... I don't know why the doc leaves out 2048, perhaps a typo... mmcrfs x2K -i 2048 [root at n2 charts]# mmlsfs x2K -i flag value description ------------------- ------------------------ ----------------------------------- -i 2048 Inode size in bytes Works for me! _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Thu Sep 29 16:32:47 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Thu, 29 Sep 2016 11:32:47 -0400 Subject: [gpfsug-discuss] Fwd: Blocksize In-Reply-To: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> References: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> Message-ID: Frankly, I just don't "get" what it is you seem not to be "getting" - perhaps someone else who does "get" it can rephrase: FORGET about Subblocks when thinking about inodes being packed into the file of all inodes. Additional facts that may address some of the other concerns: I started working on GPFS at version 3.1 or so. AFAIK GPFS always had and has one file of inodes, "packed", with no wasted space between inodes. Period. Full Stop. RAID! Now we come to a mistake that I've seen made by more than a handful of customers! It is generally a mistake to use RAID with parity (such as classic RAID5) to store metadata. Why? 
Because metadata is often updated with "small writes" - for example suppose we have to update some fields in an inode, or an indirect block, or append a log record... For RAID with parity and large stripe sizes -- this means that updating just one disk sector can cost a full stripe read + writing the changed data and parity sectors. SO, if you want protection against storage failures for your metadata, use either RAID mirroring/replication and/or GPFS metadata replication. (belt and/or suspenders) (Arguments against relying solely on RAID mirroring: single enclosure/box failure (fire!), single hardware design (bugs or defects), single firmware/microcode(bugs.)) Yes, GPFS is part of "the cyber." We're making it stronger everyday. But it already is great. --marc From: "Buterbaugh, Kevin L" To: gpfsug main discussion list Date: 09/29/2016 11:03 AM Subject: [gpfsug-discuss] Fwd: Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Resending from the right e-mail address... Begin forwarded message: From: gpfsug-discuss-owner at spectrumscale.org Subject: Re: [gpfsug-discuss] Blocksize Date: September 29, 2016 at 10:00:36 AM CDT To: klb at accre.vanderbilt.edu You are not allowed to post to this mailing list, and your message has been automatically rejected. If you think that your messages are being rejected in error, contact the mailing list owner at gpfsug-discuss-owner at spectrumscale.org. From: "Kevin L. Buterbaugh" Subject: Re: [gpfsug-discuss] Blocksize Date: September 29, 2016 at 10:00:29 AM CDT To: gpfsug main discussion list Hi Marc and others, I understand ? I guess I did a poor job of wording my question, so I?ll try again. The IBM recommendation for metadata block size seems to be somewhere between 256K - 1 MB, depending on who responds to the question. If I were to hypothetically use a 256K metadata block size, does the ?1/32nd of a block? come into play like it does for ?not metadata?? I.e. 256 / 32 = 8K, so am I reading / writing *2* inodes (assuming 4K inode size) minimum? And here?s a really off the wall question ? yesterday we were discussing the fact that there is now a single inode file. Historically, we have always used RAID 1 mirrors (first with spinning disk, as of last fall now on SSD) for metadata and then use GPFS replication on top of that. But given that there is a single inode file is that ?old way? of doing things still the right way? In other words, could we potentially be better off by using a couple of 8+2P RAID 6 LUNs? One potential downside of that would be that we would then only have two NSD servers serving up metadata, so we discussed the idea of taking each RAID 6 LUN and splitting it up into multiple logical volumes (all that done on the storage array, of course) and then presenting those to GPFS as NSDs??? Or have I gone from merely asking stupid questions to Trump-level craziness???? ;-) Kevin On Sep 28, 2016, at 10:23 AM, Marc A Kaplan wrote: OKAY, I'll say it again. inodes are PACKED into a single inode file. So a 4KB inode takes 4KB, REGARDLESS of metadata blocksize. There is no wasted space. (Of course if you have metadata replication = 2, then yes, double that. And yes, there overhead for indirect blocks (indices), allocation maps, etc, etc.) And your choice is not just 512 or 4096. Maybe 1KB or 2KB is a good choice for your data distribution, to optimize packing of data and/or directories into inodes... Hmmm... I don't know why the doc leaves out 2048, perhaps a typo... 
mmcrfs x2K -i 2048 [root at n2 charts]# mmlsfs x2K -i flag value description ------------------- ------------------------ ----------------------------------- -i 2048 Inode size in bytes Works for me! _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From olaf.weiser at de.ibm.com Thu Sep 29 16:38:56 2016 From: olaf.weiser at de.ibm.com (Olaf Weiser) Date: Thu, 29 Sep 2016 17:38:56 +0200 Subject: [gpfsug-discuss] Fwd: Blocksize In-Reply-To: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> References: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> Message-ID: An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Thu Sep 29 19:00:40 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Thu, 29 Sep 2016 11:00:40 -0700 Subject: [gpfsug-discuss] Fwd: Blocksize In-Reply-To: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> References: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> Message-ID: > to the question. If I were to hypothetically use a 256K metadata > block size, does the ?1/32nd of a block? come into play like it does > for ?not metadata?? I.e. 256 / 32 = 8K, so am I reading / writing > *2* inodes (assuming 4K inode size) minimum? I think the point of confusion here is minimum allocation size vs minimum IO size -- those two are not one and the same. In fact in GPFS those are largely unrelated values. For low-level metadata files where multiple records are packed into the same block, it is possible to read/write either an individual record (such as an inode), or an entire block of records (which is what happens, for example, during inode copy-on-write). The minimum IO size in GPFS is 512 bytes. On a "4K-aligned" file system, GPFS vows to only do IOs in multiples of 4KiB. For data, GPFS tracks what portion of a given block is valid/dirty using an in-memory bitmap, and if 4K in the middle of a 16M block are modified, only 4K get written, not 16M (although this is more complicated for sparse file writes and appends, when some areas need to be zeroed out). For metadata writes, entire metadata objects are written, using the actual object size, rounded up to the nearest 512B or 4K boundary, as needed. So a single modified inode results in a single inode write, regardless of the metadata block size. If you have snapshots, and the inode being modified needs to be copied to the previous snapshot, and happens to be the first inode in the block that needs a COW, an entire block of inodes is copied to the latest snapshot, as an optimization. > And here?s a really off the wall question ? yesterday we were > discussing the fact that there is now a single inode file. > Historically, we have always used RAID 1 mirrors (first with > spinning disk, as of last fall now on SSD) for metadata and then use > GPFS replication on top of that. But given that there is a single > inode file is that ?old way? of doing things still the right way? > In other words, could we potentially be better off by using a couple > of 8+2P RAID 6 LUNs? The old way is also the modern way in this case. Using RAID1 LUNs for GPFS metadata is still the right approach. 
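A minimal sketch of that layout -- RAID1-backed SSD LUNs carrying metadata only, split across two failure groups so that both metadata replicas can be placed -- assuming a 4.x code level where --metadata-block-size is available. All device, NSD, server and filesystem names below are made up:

# nsd.stanza
%nsd:
  device=/dev/mapper/ssd_mirror_01
  nsd=md_ssd_01
  servers=nsd01,nsd02
  usage=metadataOnly
  failureGroup=10
  pool=system
%nsd:
  device=/dev/mapper/ssd_mirror_02
  nsd=md_ssd_02
  servers=nsd02,nsd01
  usage=metadataOnly
  failureGroup=20
  pool=system
# ...plus dataOnly stanzas for the capacity LUNs in a separate pool

mmcrnsd -F nsd.stanza
# 4K inodes, a smaller block size for the metadata-only system pool,
# and two metadata replicas by default
mmcrfs fs1 -F nsd.stanza -B 1M --metadata-block-size 256K -i 4096 -m 2 -M 2 -r 1 -R 2
mmlsfs fs1 -i -B -m -M
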
You don't want to use RAID erasure codes that trigger read-modify-write for small IOs, which are typical for metadata (unless your RAID array has so much cache as to make RMW a moot point). > One potential downside of that would be that we would then only have > two NSD servers serving up metadata, so we discussed the idea of > taking each RAID 6 LUN and splitting it up into multiple logical > volumes (all that done on the storage array, of course) and then > presenting those to GPFS as NSDs??? Like most performance questions, this one can ultimately only be answered definitively by running tests, but offhand I would suspect that the performance impact of RAID6, combined with extra contention for physical disks, is going to more than offset the benefits of using more NSD servers. Keep in mind that you aren't limited to 2 NSD servers per LUN. If you actually have the connectivity for more than 2 nodes on your RAID controller, GPFS allows up to 8 simultaneously active NSD servers per NSD. yuri > On Sep 28, 2016, at 10:23 AM, Marc A Kaplan wrote: > > OKAY, I'll say it again. inodes are PACKED into a single inode > file. So a 4KB inode takes 4KB, REGARDLESS of metadata blocksize. > There is no wasted space. > > (Of course if you have metadata replication = 2, then yes, double > that. And yes, there overhead for indirect blocks (indices), > allocation maps, etc, etc.) > > And your choice is not just 512 or 4096. Maybe 1KB or 2KB is a good > choice for your data distribution, to optimize packing of data and/ > or directories into inodes... > > Hmmm... I don't know why the doc leaves out 2048, perhaps a typo... > > mmcrfs x2K -i 2048 > > [root at n2 charts]# mmlsfs x2K -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 2048 Inode size in bytes > > Works for me! > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Fri Sep 30 06:43:53 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Thu, 29 Sep 2016 22:43:53 -0700 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: <96282850-6bfa-73ae-8502-9e8df3a56390@nasa.gov> Message-ID: The issue of "GNR as software" is a pretty convoluted mixture of technical, business, and resource constraints issues. While some of the technical issues can be discussed here, obviously the other considerations cannot be discussed in a public forum. So you won't be able to get a complete understanding of the situation by discussing it here. > I understand the support concerns, but I naively thought that assuming > the hardware meets a basic set of requirements (e.g. redundant sas > paths, x type of drives) it would be fairly supportable with GNR. The > DS3700 shelves are re-branded NetApp E-series shelves and pretty vanilla > I thought. Setting business issues aside, this is more complicated on the technical level than one may think. At present, GNR requires a set of twin-tailed external disk enclosures. This is not a particularly exotic kind of hardware, but it turns out that this corner of the storage world is quite insular. 
GNR has a very close relationship with physical disk devices, much more so than regular GPFS. In an ideal world, SCSI and SES standards are supposed to provide a framework which would allow software like GNR to operate on an arbitrary disk enclosure. In the real world, the actual SES implementations on various enclosures that we've been dealing with are, well, peculiar. Apparently SES is one of those standards where vendors feel a lot of freedom in "re-interpreting" the standard, and since typically enclosures talk to a small set of RAID controllers, there aren't bad enough consequences to force vendors to be religious about SES standard compliance. Furthermore, the SAS fabric topology in configurations with an external disk enclosures is surprisingly complex, and that complexity predictably leads to complex failures which don't exist in simpler configurations. Thus far, every single one of the five enclosures we've had a chance to run GNR on required some adjustments, workarounds, hacks, etc. And the consequences of a misbehaving SAS fabric can be quite dire. There are various approaches to dealing with those complications, from running a massive 3rd party hardware qualification program to basically declaring any complications from an unknown enclosure to be someone else's problem (how would ZFS deal with a SCSI reset storm due to a bad SAS expander?), but there's much debate on what is the right path to take. Customer input/feedback is obviously very valuable in tilting such discussions in the right direction. yuri From: Aaron Knister To: gpfsug-discuss at spectrumscale.org, Date: 09/28/2016 06:44 PM Subject: Re: [gpfsug-discuss] gpfs native raid Sent by: gpfsug-discuss-bounces at spectrumscale.org Thanks Everyone for your replies! (Quick disclaimer, these opinions are my own, and not those of my employer or NASA). Not knowing what's coming at the NDA session, it seems to boil down to "it ain't gonna happen" because of: - Perceived difficulty in supporting whatever creative hardware solutions customers may throw at it. I understand the support concerns, but I naively thought that assuming the hardware meets a basic set of requirements (e.g. redundant sas paths, x type of drives) it would be fairly supportable with GNR. The DS3700 shelves are re-branded NetApp E-series shelves and pretty vanilla I thought. - IBM would like to monetize the product and compete with the likes of DDN/Seagate This is admittedly a little disappointing. GPFS as long as I've known it has been largely hardware vendor agnostic. To see even a slight shift towards hardware vendor lockin and certain features only being supported and available on IBM hardware is concerning. It's not like the software itself is free. Perhaps GNR could be a paid add-on license for non-IBM hardware? Just thinking out-loud. The big things I was looking to GNR for are: - end-to-end checksums - implementing a software RAID layer on (in my case enterprise class) JBODs I can find a way to do the second thing, but the former I cannot. Requiring IBM hardware to get end-to-end checksums is a huge red flag for me. That's something Lustre will do today with ZFS on any hardware ZFS will run on (and for free, I might add). I would think GNR being openly available to customers would be important for GPFS to compete with Lustre. Furthermore, I had opened an RFE (#84523) a while back to implement checksumming of data for non-GNR environments. The RFE was declined because essentially it would be too hard and it already exists for GNR. 
Well, considering I don't have a GNR environment, and hardware vendor lock in is something many sites are not interested in, that's somewhat of a problem. I really hope IBM reconsiders their stance on opening up GNR. The current direction, while somewhat understandable, leaves a really bad taste in my mouth and is one of the (very few, in my opinion) features Lustre has over GPFS. -Aaron On 9/1/16 9:59 AM, Marc A Kaplan wrote: > I've been told that it is a big leap to go from supporting GSS and ESS > to allowing and supporting native raid for customers who may throw > together "any" combination of hardware they might choose. > > In particular the GNR "disk hospital" functions... > https://www.ibm.com/support/knowledgecenter/SSFKCN_3.5.0/com.ibm.cluster.gpfs.v3r5.gpfs200.doc/bl1adv_introdiskhospital.htm > will be tricky to support on umpteen different vendor boxes -- and keep > in mind, those will be from IBM competitors! > > That said, ESS and GSS show that IBM has some good tech in this area and > IBM has shown with the Spectrum Scale product (sans GNR) it can support > just about any semi-reasonable hardware configuration and a good slew of > OS versions and architectures... Heck I have a demo/test version of GPFS > running on a 5 year old Thinkpad laptop.... And we have some GSSs in the > lab... Not to mention Power hardware and mainframe System Z (think 360, > 370, 290, Z) > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From stef.coene at docum.org Fri Sep 30 14:03:01 2016 From: stef.coene at docum.org (Stef Coene) Date: Fri, 30 Sep 2016 15:03:01 +0200 Subject: [gpfsug-discuss] Toolkit Message-ID: Hi, When using the toolkit, all config data is stored in clusterdefinition.txt When you modify the cluster with mm* commands, the toolkit is unaware of these changes. Is it possible to recreate the clusterdefinition.txt based on the current configuration? Stef From matthew at ellexus.com Fri Sep 30 16:30:11 2016 From: matthew at ellexus.com (Matthew Harris) Date: Fri, 30 Sep 2016 16:30:11 +0100 Subject: [gpfsug-discuss] Introduction from Ellexus Message-ID: Hello everyone, Ellexus is the IO profiling company with tools for load balancing shared storage, solving IO performance issues and detecting rogue jobs that have bad IO patterns. We have a good number of customers who use Spectrum Scale so we do a lot of work to support it. We have clients and partners working across the HPC space including semiconductor, life sciences, oil and gas, automotive and finance. We're based in Cambridge, England. Some of you will have already met our CEO, Rosemary Francis. Looking forward to meeting you at SC16. Matthew Harris Account Manager & Business Development - Ellexus Ltd *www.ellexus.com * *Ellexus Ltd is a limited company registered in England & Wales* *Company registration no. 
07166034* *Registered address: 198 High Street, Tonbridge, Kent TN9 1BE, UK* *Operating address: St John's Innovation Centre, Cowley Road, Cambridge CB4 0WS, UK* -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Fri Sep 30 21:56:29 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Fri, 30 Sep 2016 16:56:29 -0400 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: <96282850-6bfa-73ae-8502-9e8df3a56390@nasa.gov> Message-ID: <2f59d32a-fc0f-3f03-dd95-3465611dc841@nasa.gov> Thanks, Yuri. Your replies are always quite enjoyable to read. I didn't realize SES was such a loosely interpreted standard, I just assumed it was fairly straightforward. We've got a number of JBODs here we manage via SES using the linux enclosure module (e.g. /sys/class/enclosure) and they seem to "just work" but we're not doing anything terribly advanced, mostly just turning on/off various status LEDs. I should clarify, the newer SAS enclosures I've encountered seem quite good, some of the older enclosures (e.g. in particular the Xyratex enclosure used by DDN in it's S2A units) were a real treat to interact with and didn't seem to follow the SES standard in spirit. I can certainly accept the complexity argument here. I think for our purposes a "reasonable level" of support would be all we're after. I'm not sure how ZFS would deal with a SCSI reset storm, I suspect the pool would just offline itself if enough paths seemed to disappear or timeout. If I could make GPFS work well with ZFS as the underlying storage target I would be quite happy. So far I have struggled to make it performant. GPFS seems to assume once a block device accepts the write that it's committed to stable storage. With ZFS ZVOL's this isn't the case by default. Making it the case (setting the sync=always paremter) causes a *massive* degradation in performance. If GPFS were to issue sync commands at appropriate intervals then I think we could make this work well. I'm not sure how to go about this, though, and given frequent enough scsi sync commands to a given lun its performance would likely degrade to the current state of zfs with sync=always. At any rate, we'll see how things go. Thanks again. -Aaron On 9/30/16 1:43 AM, Yuri L Volobuev wrote: > The issue of "GNR as software" is a pretty convoluted mixture of > technical, business, and resource constraints issues. While some of the > technical issues can be discussed here, obviously the other > considerations cannot be discussed in a public forum. So you won't be > able to get a complete understanding of the situation by discussing it here. > >> I understand the support concerns, but I naively thought that assuming >> the hardware meets a basic set of requirements (e.g. redundant sas >> paths, x type of drives) it would be fairly supportable with GNR. The >> DS3700 shelves are re-branded NetApp E-series shelves and pretty vanilla >> I thought. > > Setting business issues aside, this is more complicated on the technical > level than one may think. At present, GNR requires a set of twin-tailed > external disk enclosures. This is not a particularly exotic kind of > hardware, but it turns out that this corner of the storage world is > quite insular. GNR has a very close relationship with physical disk > devices, much more so than regular GPFS. In an ideal world, SCSI and > SES standards are supposed to provide a framework which would allow > software like GNR to operate on an arbitrary disk enclosure. 
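For reference, the basic LED poking Aaron describes through the Linux enclosure class looks roughly like this; enclosure IDs and slot names vary by vendor (and by how literally the vendor read the SES spec), so the paths below are only an example:

ls /sys/class/enclosure/
# each drive-slot component exposes attributes such as status, fault,
# locate and active
ls /sys/class/enclosure/0:0:12:0/Slot01/
cat /sys/class/enclosure/0:0:12:0/Slot01/status
# turn the locate LED on and off again for that slot
echo 1 > /sys/class/enclosure/0:0:12:0/Slot01/locate
echo 0 > /sys/class/enclosure/0:0:12:0/Slot01/locate
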
In the > real world, the actual SES implementations on various enclosures that > we've been dealing with are, well, peculiar. Apparently SES is one of > those standards where vendors feel a lot of freedom in "re-interpreting" > the standard, and since typically enclosures talk to a small set of RAID > controllers, there aren't bad enough consequences to force vendors to be > religious about SES standard compliance. Furthermore, the SAS fabric > topology in configurations with an external disk enclosures is > surprisingly complex, and that complexity predictably leads to complex > failures which don't exist in simpler configurations. Thus far, every > single one of the five enclosures we've had a chance to run GNR on > required some adjustments, workarounds, hacks, etc. And the > consequences of a misbehaving SAS fabric can be quite dire. There are > various approaches to dealing with those complications, from running a > massive 3rd party hardware qualification program to basically declaring > any complications from an unknown enclosure to be someone else's problem > (how would ZFS deal with a SCSI reset storm due to a bad SAS expander?), > but there's much debate on what is the right path to take. Customer > input/feedback is obviously very valuable in tilting such discussions in > the right direction. > > yuri > > Inactive hide details for Aaron Knister ---09/28/2016 06:44:23 > PM---Thanks Everyone for your replies! (Quick disclaimer, these Aaron > Knister ---09/28/2016 06:44:23 PM---Thanks Everyone for your replies! > (Quick disclaimer, these opinions are my own, and not those of my > > From: Aaron Knister > To: gpfsug-discuss at spectrumscale.org, > Date: 09/28/2016 06:44 PM > Subject: Re: [gpfsug-discuss] gpfs native raid > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------------------------------------------------ > > > > Thanks Everyone for your replies! (Quick disclaimer, these opinions are > my own, and not those of my employer or NASA). > > Not knowing what's coming at the NDA session, it seems to boil down to > "it ain't gonna happen" because of: > > - Perceived difficulty in supporting whatever creative hardware > solutions customers may throw at it. > > I understand the support concerns, but I naively thought that assuming > the hardware meets a basic set of requirements (e.g. redundant sas > paths, x type of drives) it would be fairly supportable with GNR. The > DS3700 shelves are re-branded NetApp E-series shelves and pretty vanilla > I thought. > > - IBM would like to monetize the product and compete with the likes of > DDN/Seagate > > This is admittedly a little disappointing. GPFS as long as I've known it > has been largely hardware vendor agnostic. To see even a slight shift > towards hardware vendor lockin and certain features only being supported > and available on IBM hardware is concerning. It's not like the software > itself is free. Perhaps GNR could be a paid add-on license for non-IBM > hardware? Just thinking out-loud. > > The big things I was looking to GNR for are: > > - end-to-end checksums > - implementing a software RAID layer on (in my case enterprise class) JBODs > > I can find a way to do the second thing, but the former I cannot. > Requiring IBM hardware to get end-to-end checksums is a huge red flag > for me. That's something Lustre will do today with ZFS on any hardware > ZFS will run on (and for free, I might add). 
I would think GNR being > openly available to customers would be important for GPFS to compete > with Lustre. Furthermore, I had opened an RFE (#84523) a while back to > implement checksumming of data for non-GNR environments. The RFE was > declined because essentially it would be too hard and it already exists > for GNR. Well, considering I don't have a GNR environment, and hardware > vendor lock in is something many sites are not interested in, that's > somewhat of a problem. > > I really hope IBM reconsiders their stance on opening up GNR. The > current direction, while somewhat understandable, leaves a really bad > taste in my mouth and is one of the (very few, in my opinion) features > Lustre has over GPFS. > > -Aaron > > > On 9/1/16 9:59 AM, Marc A Kaplan wrote: >> I've been told that it is a big leap to go from supporting GSS and ESS >> to allowing and supporting native raid for customers who may throw >> together "any" combination of hardware they might choose. >> >> In particular the GNR "disk hospital" functions... >> https://www.ibm.com/support/knowledgecenter/SSFKCN_3.5.0/com.ibm.cluster.gpfs.v3r5.gpfs200.doc/bl1adv_introdiskhospital.htm >> will be tricky to support on umpteen different vendor boxes -- and keep >> in mind, those will be from IBM competitors! >> >> That said, ESS and GSS show that IBM has some good tech in this area and >> IBM has shown with the Spectrum Scale product (sans GNR) it can support >> just about any semi-reasonable hardware configuration and a good slew of >> OS versions and architectures... Heck I have a demo/test version of GPFS >> running on a 5 year old Thinkpad laptop.... And we have some GSSs in the >> lab... Not to mention Power hardware and mainframe System Z (think 360, >> 370, 290, Z) >> >> >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From makaplan at us.ibm.com Thu Sep 1 00:40:13 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Wed, 31 Aug 2016 19:40:13 -0400 Subject: [gpfsug-discuss] Data Replication In-Reply-To: References: Message-ID: You can leave out the WHERE ... AND POOL_NAME LIKE 'deep' - that is redundant with the FROM POOL 'deep' clause. In fact at a slight additional overhead in mmapplypolicy processing due to begin checked a little later in the game, you can leave out MISC_ATTRIBUTES NOT LIKE '%2%' since the code is smart enough to not operate on files already marked as replicate(2). I believe mmapplypolicy .... -I yes means do any necessary data movement and/or replication "now" Alternatively you can say -I defer, which will leave the files "ill-replicated" and then fix them up with mmrestripefs later. The -I yes vs -I defer choice is the same as for mmchattr. Think of mmapplypolicy as a fast, parallel way to do find ... | xargs mmchattr ... 
Advert: see also samples/ilm/mmfind -- the latest version should have an -xargs option From: Jan-Frode Myklebust To: gpfsug main discussion list Date: 08/31/2016 04:44 PM Subject: Re: [gpfsug-discuss] Data Replication Sent by: gpfsug-discuss-bounces at spectrumscale.org Assuming your DeepFlash pool is named "deep", something like the following should work: RULE 'deepreplicate' migrate from pool 'deep' to pool 'deep' replicate(2) where MISC_ATTRIBUTES NOT LIKE '%2%' and POOL_NAME LIKE 'deep' "mmapplypolicy gpfs0 -P replicate-policy.pol -I yes" and possibly "mmrestripefs gpfs0 -r" afterwards. -jf On Wed, Aug 31, 2016 at 8:01 PM, Brian Marshall wrote: Daniel, So here's my use case: I have a Sandisk IF150 (branded as DeepFlash recently) with 128TB of flash acting as a "fast tier" storage pool in our HPC scratch file system. Can I set the filesystem replication level to 1 then write a policy engine rule to send small and/or recent files to the IF150 with a replication of 2? Any other comments on the proposed usage strategy are helpful. Thank you, Brian Marshall On Wed, Aug 31, 2016 at 10:32 AM, Daniel Kidger wrote: The other 'Exception' is when a rule is used to convert a 1 way replicated file to 2 way, or when only one failure group is up due to HW problems. It that case the (re-replication) is done by whatever nodes are used for the rule or command-line, which may include an NSD server. Daniel IBM Spectrum Storage Software +44 (0)7818 522266 Sent from my iPad using IBM Verse On 30 Aug 2016, 19:53:31, mimarsh2 at vt.edu wrote: From: mimarsh2 at vt.edu To: gpfsug-discuss at spectrumscale.org Cc: Date: 30 Aug 2016 19:53:31 Subject: Re: [gpfsug-discuss] Data Replication Thanks. This confirms the numbers that I am seeing. Brian On Tue, Aug 30, 2016 at 2:50 PM, Laurence Horrocks-Barlow < laurence at qsplace.co.uk> wrote: Its the client that does all the synchronous replication, this way the cluster is able to scale as the clients do the leg work (so to speak). The somewhat "exception" is if a GPFS NSD server (or client with direct NSD) access uses a server bases protocol such as SMB, in this case the SMB server will do the replication as the SMB client doesn't know about GPFS or its replication; essentially the SMB server is the GPFS client. -- Lauz On 30 August 2016 17:03:38 CEST, Bryan Banister wrote: The NSD Client handles the replication and will, as you stated, write one copy to one NSD (using the primary server for this NSD) and one to a different NSD in a different GPFS failure group (using quite likely, but not necessarily, a different NSD server that is the primary server for this alternate NSD). Cheers, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto: gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Brian Marshall Sent: Tuesday, August 30, 2016 9:59 AM To: gpfsug main discussion list Subject: [gpfsug-discuss] Data Replication All, If I setup a filesystem to have data replication of 2 (2 copies of data), does the data get replicated at the NSD Server or at the client? i.e. Does the client send 2 copies over the network or does the NSD Server get a single copy and then replicate on storage NSDs? I couldn't find a place in the docs that talked about this specific point. Thank you, Brian Marshall Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. 
If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Sent from my Android device with K-9 Mail. Please excuse my brevity. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From daniel.kidger at uk.ibm.com Thu Sep 1 11:29:48 2016 From: daniel.kidger at uk.ibm.com (Daniel Kidger) Date: Thu, 1 Sep 2016 10:29:48 +0000 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: <96282850-6bfa-73ae-8502-9e8df3a56390@nasa.gov> Message-ID: Aaron, GNR is a key differentiator for IBM's (and Lenovo's) Storage hardware appliance. ESS and GSS are otherwise commodity storage arrays connected to commodity NSD servers, albeit with a high degree of tuning and rigorous testing and validation. This competes with equivalent DDN and Seagate appliances as well other non s/w Raid offerings from other IBM partners. GNR only works for a small number of disk arrays and then only in certain configurations. GNR then might be thought of as 'firmware' for the hardware rather than part of a software defined products at is Spectrum Scale. If you beleive the viewpoint that hardware Raid 'is dead' then GNR will not be the only s/w Raid that will be used to underly Spectrum Scale. As well as vendor specific offerings from DDN, Seagate, etc. ZFS is likely to be a popular choice but is today not well understood or tested. This will change as more 3rd parties publish their experiences and tuning optimisations, and also as storage solution vendors bidding Spectrum Scale find they can't compete without a software Raid component in their offering. Disclaimer: the above are my own views and not necessarily an IBM official viewpoint. Daniel IBM Spectrum Storage Software +44 (0)7818 522266 Sent from my iPad using IBM Verse On 30 Aug 2016, 18:17:01, aaron.s.knister at nasa.gov wrote: From: aaron.s.knister at nasa.gov To: gpfsug-discuss at spectrumscale.org Cc: Date: 30 Aug 2016 18:17:01 Subject: Re: [gpfsug-discuss] gpfs native raid Thanks Christopher. 
I've tried GPFS on zvols a couple times and the write throughput I get is terrible because of the required sync=always parameter. Perhaps a couple of SSD's could help get the number up, though. -Aaron On 8/30/16 12:47 PM, Christopher Maestas wrote: > Interestingly enough, Spectrum Scale can run on zvols. Check out: > > http://files.gpfsug.org/presentations/2016/anl-june/LANL_GPFS_ZFS.pdf > > -cdm > > ------------------------------------------------------------------------ > On Aug 30, 2016, 9:17:05 AM, aaron.s.knister at nasa.gov wrote: > > From: aaron.s.knister at nasa.gov > To: gpfsug-discuss at spectrumscale.org > Cc: > Date: Aug 30, 2016 9:17:05 AM > Subject: [gpfsug-discuss] gpfs native raid > > Does anyone know if/when we might see gpfs native raid opened up for the > masses on non-IBM hardware? It's hard to answer the question of "why > can't GPFS do this? Lustre can" in regards to Lustre's integration with > ZFS and support for RAID on commodity hardware. > -Aaron > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discussUnless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU -------------- next part -------------- An HTML attachment was scrubbed... URL: From daniel.kidger at uk.ibm.com Thu Sep 1 12:22:47 2016 From: daniel.kidger at uk.ibm.com (Daniel Kidger) Date: Thu, 1 Sep 2016 11:22:47 +0000 Subject: [gpfsug-discuss] Maximum value for data replication? In-Reply-To: References: , Message-ID: An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.1__=0ABB0AB3DFD67DBA8f9e8a93df938 at us.ibm.com.gif Type: image/gif Size: 105 bytes Desc: not available URL: From bauer at cesnet.cz Thu Sep 1 14:30:23 2016 From: bauer at cesnet.cz (Miroslav Bauer) Date: Thu, 1 Sep 2016 15:30:23 +0200 Subject: [gpfsug-discuss] Migration to separate metadata and data disks Message-ID: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> Hello, I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the filesystem metadata from state: -m = 2 (default metadata replicas) - SATA disks (dataAndMetadata, failGroup=1) - SSDs (metadataOnly, failGroup=3) to the desired state: -m = 1 - SATA disks (dataOnly, failGroup=1) - SSDs (metadataOnly, failGroup=3) I have done the following steps in the following order: 1) change SATA disks to dataOnly (stanza file modifies the 'usage' attribute only): # mmchdisk fs1 change -F dataOnly_disks.stanza Attention: Disk parameters were changed. Use the mmrestripefs command with the -r option to relocate data and metadata. Verifying file system configuration information ... mmchdisk: Propagating the cluster configuration data to all affected nodes. This is an asynchronous process. 
2) change default metadata replicas number 2->1 # mmchfs fs1 -m 1 3) run mmrestripefs as suggested by output of 1) # mmrestripefs fs1 -r Scanning file system metadata, phase 1 ... Error processing inodes. No space left on device mmrestripefs: Command failed. Examine previous error messages to determine cause. It is, however, still possible to create new files on the filesystem. When I return one of the SATA disks as a dataAndMetadata disk, the mmrestripefs command stops complaining about No space left on device. Both df and mmdf say that there is enough space both for data (SATA) and metadata (SSDs). Does anyone have an idea why is it complaining? Thanks, -- Miroslav Bauer -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 3716 bytes Desc: S/MIME Cryptographic Signature URL: From aaron.s.knister at nasa.gov Thu Sep 1 14:36:32 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Thu, 1 Sep 2016 09:36:32 -0400 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> Message-ID: I must admit, I'm curious as to the reason you're dropping the replication factor from 2 down to 1. There are some serious advantages we've seen to having multiple metadata replicas, as far as error recovery is concerned. Could you paste an output of mmlsdisk for the filesystem? -Aaron On 9/1/16 9:30 AM, Miroslav Bauer wrote: > Hello, > > I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the > filesystem metadata from state: > -m = 2 (default metadata replicas) > - SATA disks (dataAndMetadata, failGroup=1) > - SSDs (metadataOnly, failGroup=3) > to the desired state: > -m = 1 > - SATA disks (dataOnly, failGroup=1) > - SSDs (metadataOnly, failGroup=3) > > I have done the following steps in the following order: > 1) change SATA disks to dataOnly (stanza file modifies the 'usage' > attribute only): > # mmchdisk fs1 change -F dataOnly_disks.stanza > Attention: Disk parameters were changed. > Use the mmrestripefs command with the -r option to relocate data and > metadata. > Verifying file system configuration information ... > mmchdisk: Propagating the cluster configuration data to all > affected nodes. This is an asynchronous process. > > 2) change default metadata replicas number 2->1 > # mmchfs fs1 -m 1 > > 3) run mmrestripefs as suggested by output of 1) > # mmrestripefs fs1 -r > Scanning file system metadata, phase 1 ... > Error processing inodes. > No space left on device > mmrestripefs: Command failed. Examine previous error messages to > determine cause. > > It is, however, still possible to create new files on the filesystem. > When I return one of the SATA disks as a dataAndMetadata disk, the > mmrestripefs > command stops complaining about No space left on device. Both df and mmdf > say that there is enough space both for data (SATA) and metadata (SSDs). > Does anyone have an idea why is it complaining? 
> > Thanks, > > -- > Miroslav Bauer > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From aaron.s.knister at nasa.gov Thu Sep 1 14:39:17 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Thu, 1 Sep 2016 09:39:17 -0400 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> Message-ID: By the way, I suspect the no space on device errors are because GPFS believes for some reason that it is unable to maintain the metadata replication factor of 2 that's likely set on all previously created inodes. On 9/1/16 9:36 AM, Aaron Knister wrote: > I must admit, I'm curious as to the reason you're dropping the > replication factor from 2 down to 1. There are some serious advantages > we've seen to having multiple metadata replicas, as far as error > recovery is concerned. > > Could you paste an output of mmlsdisk for the filesystem? > > -Aaron > > On 9/1/16 9:30 AM, Miroslav Bauer wrote: >> Hello, >> >> I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the >> filesystem metadata from state: >> -m = 2 (default metadata replicas) >> - SATA disks (dataAndMetadata, failGroup=1) >> - SSDs (metadataOnly, failGroup=3) >> to the desired state: >> -m = 1 >> - SATA disks (dataOnly, failGroup=1) >> - SSDs (metadataOnly, failGroup=3) >> >> I have done the following steps in the following order: >> 1) change SATA disks to dataOnly (stanza file modifies the 'usage' >> attribute only): >> # mmchdisk fs1 change -F dataOnly_disks.stanza >> Attention: Disk parameters were changed. >> Use the mmrestripefs command with the -r option to relocate data and >> metadata. >> Verifying file system configuration information ... >> mmchdisk: Propagating the cluster configuration data to all >> affected nodes. This is an asynchronous process. >> >> 2) change default metadata replicas number 2->1 >> # mmchfs fs1 -m 1 >> >> 3) run mmrestripefs as suggested by output of 1) >> # mmrestripefs fs1 -r >> Scanning file system metadata, phase 1 ... >> Error processing inodes. >> No space left on device >> mmrestripefs: Command failed. Examine previous error messages to >> determine cause. >> >> It is, however, still possible to create new files on the filesystem. >> When I return one of the SATA disks as a dataAndMetadata disk, the >> mmrestripefs >> command stops complaining about No space left on device. Both df and mmdf >> say that there is enough space both for data (SATA) and metadata (SSDs). >> Does anyone have an idea why is it complaining? 
>> >> Thanks, >> >> -- >> Miroslav Bauer >> >> >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From jonathan at buzzard.me.uk Thu Sep 1 14:49:11 2016 From: jonathan at buzzard.me.uk (Jonathan Buzzard) Date: Thu, 01 Sep 2016 14:49:11 +0100 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> Message-ID: <1472737751.25479.22.camel@buzzard.phy.strath.ac.uk> On Thu, 2016-09-01 at 09:39 -0400, Aaron Knister wrote: > By the way, I suspect the no space on device errors are because GPFS > believes for some reason that it is unable to maintain the metadata > replication factor of 2 that's likely set on all previously created inodes. > Hazarding a guess, but there is only one SSD NSD, so if all the metadata is going to go on SSD there is no point in replicating. It would also explain why it would believe it can't maintain the metadata replication factor. Though it could just be a simple metadata size is larger than the available SSD size. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. From makaplan at us.ibm.com Thu Sep 1 14:59:28 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Thu, 1 Sep 2016 09:59:28 -0400 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: <96282850-6bfa-73ae-8502-9e8df3a56390@nasa.gov> Message-ID: I've been told that it is a big leap to go from supporting GSS and ESS to allowing and supporting native raid for customers who may throw together "any" combination of hardware they might choose. In particular the GNR "disk hospital" functions... https://www.ibm.com/support/knowledgecenter/SSFKCN_3.5.0/com.ibm.cluster.gpfs.v3r5.gpfs200.doc/bl1adv_introdiskhospital.htm will be tricky to support on umpteen different vendor boxes -- and keep in mind, those will be from IBM competitors! That said, ESS and GSS show that IBM has some good tech in this area and IBM has shown with the Spectrum Scale product (sans GNR) it can support just about any semi-reasonable hardware configuration and a good slew of OS versions and architectures... Heck I have a demo/test version of GPFS running on a 5 year old Thinkpad laptop.... And we have some GSSs in the lab... Not to mention Power hardware and mainframe System Z (think 360, 370, 290, Z) -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Thu Sep 1 15:02:50 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Thu, 1 Sep 2016 10:02:50 -0400 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> Message-ID: <2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> Oh! I think you've already provided the info I was looking for :) I thought that failGroup=3 meant there were 3 failure groups within the SSDs. I suspect that's not at all what you meant and that actually is the failure group of all of those disks. That I think explains what's going on-- there's only one failure group's worth of metadata-capable disks available and as such GPFS can't place the 2nd replica for existing files. 
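A quick way to confirm that diagnosis (the file path below is only an example) is to compare the failure groups of the metadata-capable disks with the replication recorded in the inodes and in the filesystem defaults:

# the failure group column: disks holding metadata must span at least
# two failure groups for two metadata replicas to be placeable
mmlsdisk fs1
# per-file replication factors stored in the inode; files created while
# -m 2 was in effect still expect two metadata copies
mmlsattr -L /fs1/some/existing/file
# current default and maximum replication for the filesystem
mmlsfs fs1 -m -M -r -R
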
Here's what I would suggest: - Create at least 2 failure groups within the SSDs - Put the default metadata replication factor back to 2 - Run a restripefs -R to shuffle files around and restore the metadata replication factor of 2 to any files created while it was set to 1 If you're not interested in replication for metadata then perhaps all you need to do is the mmrestripefs -R. I think that should un-replicate the file from the SATA disks leaving the copy on the SSDs. Hope that helps. -Aaron On 9/1/16 9:39 AM, Aaron Knister wrote: > By the way, I suspect the no space on device errors are because GPFS > believes for some reason that it is unable to maintain the metadata > replication factor of 2 that's likely set on all previously created inodes. > > On 9/1/16 9:36 AM, Aaron Knister wrote: >> I must admit, I'm curious as to the reason you're dropping the >> replication factor from 2 down to 1. There are some serious advantages >> we've seen to having multiple metadata replicas, as far as error >> recovery is concerned. >> >> Could you paste an output of mmlsdisk for the filesystem? >> >> -Aaron >> >> On 9/1/16 9:30 AM, Miroslav Bauer wrote: >>> Hello, >>> >>> I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the >>> filesystem metadata from state: >>> -m = 2 (default metadata replicas) >>> - SATA disks (dataAndMetadata, failGroup=1) >>> - SSDs (metadataOnly, failGroup=3) >>> to the desired state: >>> -m = 1 >>> - SATA disks (dataOnly, failGroup=1) >>> - SSDs (metadataOnly, failGroup=3) >>> >>> I have done the following steps in the following order: >>> 1) change SATA disks to dataOnly (stanza file modifies the 'usage' >>> attribute only): >>> # mmchdisk fs1 change -F dataOnly_disks.stanza >>> Attention: Disk parameters were changed. >>> Use the mmrestripefs command with the -r option to relocate data and >>> metadata. >>> Verifying file system configuration information ... >>> mmchdisk: Propagating the cluster configuration data to all >>> affected nodes. This is an asynchronous process. >>> >>> 2) change default metadata replicas number 2->1 >>> # mmchfs fs1 -m 1 >>> >>> 3) run mmrestripefs as suggested by output of 1) >>> # mmrestripefs fs1 -r >>> Scanning file system metadata, phase 1 ... >>> Error processing inodes. >>> No space left on device >>> mmrestripefs: Command failed. Examine previous error messages to >>> determine cause. >>> >>> It is, however, still possible to create new files on the filesystem. >>> When I return one of the SATA disks as a dataAndMetadata disk, the >>> mmrestripefs >>> command stops complaining about No space left on device. Both df and >>> mmdf >>> say that there is enough space both for data (SATA) and metadata (SSDs). >>> Does anyone have an idea why is it complaining? >>> >>> Thanks, >>> >>> -- >>> Miroslav Bauer >>> >>> >>> >>> >>> _______________________________________________ >>> gpfsug-discuss mailing list >>> gpfsug-discuss at spectrumscale.org >>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>> >> > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From makaplan at us.ibm.com Thu Sep 1 15:14:18 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Thu, 1 Sep 2016 10:14:18 -0400 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: <2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> <2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> Message-ID: I believe the OP left out a step. 
I am not saying this is a good idea, but ... One must change the replication factors marked in each inode for each file... This could be done using an mmapplypolicy rule: RULE 'one' MIGRATE FROM POOL 'yourdatapool' TO POOL 'yourdatapool' REPLICATE(1,1) (repeat rule for each POOL you have) Put that (those) rules in a file and do a "one time" run like mmapplypolicy yourfilesystem -P /path/to/rule -N nodelist-to-do-this-work -g /filesystem/bigtemp -I defer Then try your restripe again. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 21994 bytes Desc: not available URL: From bauer at cesnet.cz Thu Sep 1 15:28:36 2016 From: bauer at cesnet.cz (Miroslav Bauer) Date: Thu, 1 Sep 2016 16:28:36 +0200 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: <2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> <2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> Message-ID: <505ff859-d49a-04cc-bd9d-50f7b2a8df0b@cesnet.cz> Yes, failure group id is exactly what I meant :). Unfortunately, mmrestripefs with -R behaves the same as with -r. I also believed that mmrestripefs -R is the correct tool for fixing the replication settings on inodes (according to manpages), but I will try possible solutions you and Marc suggested and let you know how it went. Thank you, -- Miroslav Bauer On 09/01/2016 04:02 PM, Aaron Knister wrote: > Oh! I think you've already provided the info I was looking for :) I > thought that failGroup=3 meant there were 3 failure groups within the > SSDs. I suspect that's not at all what you meant and that actually is > the failure group of all of those disks. That I think explains what's > going on-- there's only one failure group's worth of metadata-capable > disks available and as such GPFS can't place the 2nd replica for > existing files. > > Here's what I would suggest: > > - Create at least 2 failure groups within the SSDs > - Put the default metadata replication factor back to 2 > - Run a restripefs -R to shuffle files around and restore the metadata > replication factor of 2 to any files created while it was set to 1 > > If you're not interested in replication for metadata then perhaps all > you need to do is the mmrestripefs -R. I think that should > un-replicate the file from the SATA disks leaving the copy on the SSDs. > > Hope that helps. > > -Aaron > > On 9/1/16 9:39 AM, Aaron Knister wrote: >> By the way, I suspect the no space on device errors are because GPFS >> believes for some reason that it is unable to maintain the metadata >> replication factor of 2 that's likely set on all previously created >> inodes. >> >> On 9/1/16 9:36 AM, Aaron Knister wrote: >>> I must admit, I'm curious as to the reason you're dropping the >>> replication factor from 2 down to 1. There are some serious advantages >>> we've seen to having multiple metadata replicas, as far as error >>> recovery is concerned. >>> >>> Could you paste an output of mmlsdisk for the filesystem? 
>>> >>> -Aaron >>> >>> On 9/1/16 9:30 AM, Miroslav Bauer wrote: >>>> Hello, >>>> >>>> I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the >>>> filesystem metadata from state: >>>> -m = 2 (default metadata replicas) >>>> - SATA disks (dataAndMetadata, failGroup=1) >>>> - SSDs (metadataOnly, failGroup=3) >>>> to the desired state: >>>> -m = 1 >>>> - SATA disks (dataOnly, failGroup=1) >>>> - SSDs (metadataOnly, failGroup=3) >>>> >>>> I have done the following steps in the following order: >>>> 1) change SATA disks to dataOnly (stanza file modifies the 'usage' >>>> attribute only): >>>> # mmchdisk fs1 change -F dataOnly_disks.stanza >>>> Attention: Disk parameters were changed. >>>> Use the mmrestripefs command with the -r option to relocate data and >>>> metadata. >>>> Verifying file system configuration information ... >>>> mmchdisk: Propagating the cluster configuration data to all >>>> affected nodes. This is an asynchronous process. >>>> >>>> 2) change default metadata replicas number 2->1 >>>> # mmchfs fs1 -m 1 >>>> >>>> 3) run mmrestripefs as suggested by output of 1) >>>> # mmrestripefs fs1 -r >>>> Scanning file system metadata, phase 1 ... >>>> Error processing inodes. >>>> No space left on device >>>> mmrestripefs: Command failed. Examine previous error messages to >>>> determine cause. >>>> >>>> It is, however, still possible to create new files on the filesystem. >>>> When I return one of the SATA disks as a dataAndMetadata disk, the >>>> mmrestripefs >>>> command stops complaining about No space left on device. Both df and >>>> mmdf >>>> say that there is enough space both for data (SATA) and metadata >>>> (SSDs). >>>> Does anyone have an idea why is it complaining? >>>> >>>> Thanks, >>>> >>>> -- >>>> Miroslav Bauer >>>> >>>> >>>> >>>> >>>> _______________________________________________ >>>> gpfsug-discuss mailing list >>>> gpfsug-discuss at spectrumscale.org >>>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>>> >>> >> > -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 3716 bytes Desc: S/MIME Cryptographic Signature URL: From S.J.Thompson at bham.ac.uk Thu Sep 1 22:06:44 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Thu, 1 Sep 2016 21:06:44 +0000 Subject: [gpfsug-discuss] Maximum value for data replication? In-Reply-To: References: , , Message-ID: I have two protocol node in each of two data centres. So four protocol nodes in the cluster. Plus I also have a quorum vm which is lockstep/ha so guaranteed to survive in one of the data centres should we lose power. The protocol servers being protocol servers don't have access to the fibre channel storage. And we've seen ces do bad things when the storage cluster it is remotely mounting (and the ces root is on) fails/is under load etc. So the four full copies is about guaranteeing there are two full copies in both data centres. And remember this is only for the cesroot, so lock data for the ces ips, the smb registry I think as well. I was hoping that by making the cesroot in the protocol node cluster rather than a fileset on a remote mounted filesysyem, that it would fix the ces weirdness we see as it would become a local gpfs file system. I guess three copies would maybe work. But also in another cluster, we have been thinking about adding NVMe into NSD servers for metadata and system.log and so I can se there are cases there where having higher numbers of copies would be useful. 
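A sketch of the small local CES-root filesystem being described, staying within the limit of three replicas mentioned later in this thread; the node names, NSD names, mount point and the cesSharedRoot step are all assumptions here:

# one local SSD per protocol node, each in its own failure group
%nsd:
  device=/dev/nvme0n1
  nsd=ces_ssd_pn1
  servers=pn1
  usage=dataAndMetadata
  failureGroup=1
  pool=system
# ...matching stanzas for pn2 (failureGroup=2) and pn3 (failureGroup=3)

mmcrnsd -F cesroot.stanza
mmcrfs cesfs -F cesroot.stanza -m 3 -M 3 -r 3 -R 3 -T /cesfs
# with protocol services stopped and the data copied across,
# point CES at the new location
mmchconfig cesSharedRoot=/cesfs/ces
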
Yes I take the point that more copies means more load for the client, but in these cases, we aren't thinking about gpfs as the fastest possible hpc file system, but for other infrastructure purposes, which is one of the ways the product seems to be moving. Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Daniel Kidger [daniel.kidger at uk.ibm.com] Sent: 01 September 2016 12:22 To: gpfsug-discuss at spectrumscale.org Cc: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] Maximum value for data replication? Simon, Hi. Can you explain why you would like a full copy of all blocks on all 4 NSD servers ? Is there a particular use case, and hence an interest from product development? Otherwise remember that with 4 NSD servers, with one failure group per (storage rich) NSD server, then all 4 disk arrays will be loaded equally, as new files will get written to any 3 (or 2 or 1) of the 4 failure groups. Also remember that as you add more replication then there is more network load on the gpfs client as it has to perform all the writes itself. Perhaps someone technical can comment on the logic that determines which '3' out of 4 failure groups, a particular block is written to. Daniel [/spectrum_storage-banne] [Spectrum Scale Logo] Dr Daniel Kidger IBM Technical Sales Specialist Software Defined Solution Sales +44-07818 522 266 daniel.kidger at uk.ibm.com ----- Original message ----- From: Steve Duersch Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Cc: Subject: Re: [gpfsug-discuss] Maximum value for data replication? Date: Wed, Aug 31, 2016 1:45 PM >>Is there a maximum value for data replication in Spectrum Scale? The maximum value for replication is 3. Steve Duersch Spectrum Scale RAID 845-433-7902 IBM Poughkeepsie, New York [Inactive hide details for gpfsug-discuss-request---08/30/2016 07:25:24 PM---Send gpfsug-discuss mailing list submissions to gp]gpfsug-discuss-request---08/30/2016 07:25:24 PM---Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org From: gpfsug-discuss-request at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Date: 08/30/2016 07:25 PM Subject: gpfsug-discuss Digest, Vol 55, Issue 55 Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Maximum value for data replication? (Simon Thompson (Research Computing - IT Services)) 2. greetings (Kevin D Johnson) 3. GPFS 3.5.0 on RHEL 6.8 (Lukas Hejtmanek) 4. Re: GPFS 3.5.0 on RHEL 6.8 (Kevin D Johnson) 5. Re: GPFS 3.5.0 on RHEL 6.8 (mark.bergman at uphs.upenn.edu) 6. Re: *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Lukas Hejtmanek) 7. 
Re: *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Sven Oehme) ---------------------------------------------------------------------- Message: 1 Date: Tue, 30 Aug 2016 19:09:05 +0000 From: "Simon Thompson (Research Computing - IT Services)" To: "gpfsug-discuss at spectrumscale.org" Subject: [gpfsug-discuss] Maximum value for data replication? Message-ID: Content-Type: text/plain; charset="us-ascii" Is there a maximum value for data replication in Spectrum Scale? I have a number of nsd servers which have local storage and Id like each node to have a full copy of all the data in the file-system, say this value is 4, can I set replication to 4 for data and metadata and have each server have a full copy? These are protocol nodes and multi cluster mount another file system (yes I know not supported) and the cesroot is in the remote file system. On several occasions where GPFS has wibbled a bit, this has caused issues with ces locks, so I was thinking of moving the cesroot to a local filesysyem which is replicated on the local ssds in the protocol nodes. I.e. Its a generally quiet file system as its only ces cluster config. I assume if I stop protocols, rsync the data and then change to the new ces root, I should be able to get this working? Thanks Simon ------------------------------ Message: 2 Date: Tue, 30 Aug 2016 19:43:39 +0000 From: "Kevin D Johnson" To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] greetings Message-ID: Content-Type: text/plain; charset="us-ascii" An HTML attachment was scrubbed... URL: ------------------------------ Message: 3 Date: Tue, 30 Aug 2016 22:39:18 +0200 From: Lukas Hejtmanek To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: <20160830203917.qptfgqvlmdbzu6wr at ics.muni.cz> Content-Type: text/plain; charset=iso-8859-2 Hello, does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the latest patch 32) does start but does not mount and file system. The internal mount cmd gets stucked. -- Luk?? Hejtm?nek ------------------------------ Message: 4 Date: Tue, 30 Aug 2016 20:51:39 +0000 From: "Kevin D Johnson" To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: Content-Type: text/plain; charset="us-ascii" An HTML attachment was scrubbed... URL: ------------------------------ Message: 5 Date: Tue, 30 Aug 2016 17:07:21 -0400 From: mark.bergman at uphs.upenn.edu To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: <24437-1472591241.445832 at bR6O.TofS.917u> Content-Type: text/plain; charset="UTF-8" In the message dated: Tue, 30 Aug 2016 22:39:18 +0200, The pithy ruminations from Lukas Hejtmanek on <[gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8> were: => Hello, GPFS 3.5.0.[23..3-0] work for me under [CentOS|ScientificLinux] 6.8, but at kernel 2.6.32-573 and lower. I've found kernel bugs in blk_cloned_rq_check_limits() in later kernel revs that caused multipath errors, resulting in GPFS being unable to find all NSDs and mount the filesystem. I am not updating to a newer kernel until I'm certain this is resolved. I opened a bug with CentOS: https://bugs.centos.org/view.php?id=10997 and began an extended discussion with the (RH & SUSE) developers of that chunk of kernel code. I don't know if an upstream bug has been opened by RH, but see: https://patchwork.kernel.org/patch/9140337/ => => does it work for anyone? 
As of kernel 2.6.32-642, GPFS 3.5.0 (including the => latest patch 32) does start but does not mount and file system. The internal => mount cmd gets stucked. => => -- => Luk?? Hejtm?nek -- Mark Bergman voice: 215-746-4061 mark.bergman at uphs.upenn.edu fax: 215-614-0266 http://www.cbica.upenn.edu/ IT Technical Director, Center for Biomedical Image Computing and Analytics Department of Radiology University of Pennsylvania PGP Key: http://www.cbica.upenn.edu/sbia/bergman ------------------------------ Message: 6 Date: Wed, 31 Aug 2016 00:02:50 +0200 From: Lukas Hejtmanek To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" Message-ID: <20160830220250.yt6r7gvfq7rlvtcs at ics.muni.cz> Content-Type: text/plain; charset=iso-8859-2 Hello, On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > Find the paper here: > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection thank you for the paper, I appreciate it. However, I wonder whether it could be extended a little. As it has the title Petascale Data Protection, I think that in Peta scale, you have to deal with millions (well rather hundreds of millions) of files you store in and this is something where TSM does not scale well. Could you give some hints: On the backup site: mmbackup takes ages for: a) scan (try to scan 500M files even in parallel) b) backup - what if 10 % of files get changed - backup process can be blocked several days as mmbackup cannot run in several instances on the same file system, so you have to wait until one run of mmbackup finishes. How long could it take at petascale? On the restore site: how can I restore e.g. 40 millions of file efficiently? dsmc restore '/path/*' runs into serious troubles after say 20M files (maybe wrong internal structures used), however, scanning 1000 more files takes several minutes resulting the dsmc restore never reaches that 40M files. using filelists the situation is even worse. I run dsmc restore -filelist with a filelist consisting of 2.4M files. Running for *two* days without restoring even a single file. dsmc is consuming 100 % CPU. So any hints addressing these issues with really large number of files would be even more appreciated. -- Luk?? Hejtm?nek ------------------------------ Message: 7 Date: Tue, 30 Aug 2016 16:24:59 -0700 From: Sven Oehme To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" Message-ID: Content-Type: text/plain; charset="utf-8" so lets start with some simple questions. when you say mmbackup takes ages, what version of gpfs code are you running ? how do you execute the mmbackup command ? exact parameters would be useful . what HW are you using for the metadata disks ? how much capacity (df -h) and how many inodes (df -i) do you have in the filesystem you try to backup ? sven On Tue, Aug 30, 2016 at 3:02 PM, Lukas Hejtmanek wrote: > Hello, > > On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > > Find the paper here: > > > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/ > Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection > > thank you for the paper, I appreciate it. > > However, I wonder whether it could be extended a little. 
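To make those questions concrete, the sort of data being asked for could be collected roughly like this (a sketch; fs1 is a placeholder filesystem name):

mmdiag --version
mmlsfs fs1 -V
df -h /fs1
df -i /fs1

plus the exact mmbackup command line that was used, for example something of the form mmbackup /fs1 -t incremental -N backupnodes, together with whatever -g/-s work directories are set. Hardware details for the metadata NSDs would come from mmlsnsd and mmlsdisk output.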
As it has the > title > Petascale Data Protection, I think that in Peta scale, you have to deal > with > millions (well rather hundreds of millions) of files you store in and this > is > something where TSM does not scale well. > > Could you give some hints: > > On the backup site: > mmbackup takes ages for: > a) scan (try to scan 500M files even in parallel) > b) backup - what if 10 % of files get changed - backup process can be > blocked > several days as mmbackup cannot run in several instances on the same file > system, so you have to wait until one run of mmbackup finishes. How long > could > it take at petascale? > > On the restore site: > how can I restore e.g. 40 millions of file efficiently? dsmc restore > '/path/*' > runs into serious troubles after say 20M files (maybe wrong internal > structures used), however, scanning 1000 more files takes several minutes > resulting the dsmc restore never reaches that 40M files. > > using filelists the situation is even worse. I run dsmc restore -filelist > with a filelist consisting of 2.4M files. Running for *two* days without > restoring even a single file. dsmc is consuming 100 % CPU. > > So any hints addressing these issues with really large number of files > would > be even more appreciated. > > -- > Luk?? Hejtm?nek > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: ------------------------------ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss End of gpfsug-discuss Digest, Vol 55, Issue 55 ********************************************** _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.1__=0ABB0AB3DFD67DBA8f9e8a93df938 at us.ibm.com.gif Type: image/gif Size: 105 bytes Desc: Image.1__=0ABB0AB3DFD67DBA8f9e8a93df938 at us.ibm.com.gif URL: From r.sobey at imperial.ac.uk Fri Sep 2 14:37:26 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 2 Sep 2016 13:37:26 +0000 Subject: [gpfsug-discuss] CES node responding on system IP address Message-ID: Hi all, *Should* a CES node, 4.2.0 OR 4.2.1, be responding on its system IP address? The nodes in my cluster, seemingly randomly, either give me a list of shares, or prompt me to enter a username and password. For example, Start > Run \\cesnode.fqdn I get a prompt for a username and password. If I add the system IP into my hosts file and call it clustername.fqdn it responds normally i.e. no prompt for username or password. Should I be worried about the inconsistencies here? Richard Sobey Storage Area Network (SAN) Analyst Technical Operations, ICT Imperial College London South Kensington 403, City & Guilds Building London SW7 2AZ Tel: +44 (0)20 7594 6915 Email: r.sobey at imperial.ac.uk http://www.imperial.ac.uk/admin-services/ict/ -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From r.sobey at imperial.ac.uk Fri Sep 2 16:10:59 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 2 Sep 2016 15:10:59 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: Message-ID: I?ve verified the upgrade has fixed this issue so thanks again. However I?ve noticed that stopping SMB doesn?t trigger an IP address failover. I think it should. mmces node suspend (or rebooting, or mmces address move, or etc..) seems to trigger the failover. Richard From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Danny Alexander Calderon Rodriguez Sent: 27 August 2016 13:53 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Hi Richard This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly Just do this. edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py Change if authType == 'ad' and not nodeState.nfsStopped: to nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: You need to stop the gpfs service in each node where you apply the change " after change the lines please use tap key" Enviado desde mi iPhone El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) 2. Re: CES and mmuserauth command (Christof Schmitt) ---------------------------------------------------------------------- Message: 1 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Message-ID: > Content-Type: text/plain; charset="UTF-8" That would be the case when Active Directory is configured for authentication. In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: "Sobey, Richard A" > To: "'gpfsug-discuss at spectrumscale.org'" > Date: 08/26/2016 04:48 AM Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Sent by: gpfsug-discuss-bounces at spectrumscale.org Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. I?m trying to stop SMB on a node, but getting the following output: [root at cesnode ~]# mmces service stop smb smb: Request denied. Please stop NFS first [root at cesnode ~]# mmces service list Enabled services: SMB SMB is running As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. It?s happening on all the nodes in the cluster. SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. 
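On the failover point: if the goal is to move the addresses off a node deliberately rather than rely on the health monitor, a sketch of the manual route (node names and the address are placeholders):

mmces node suspend -N cesnode1
mmces address list
mmces address move --ces-ip 10.0.0.10 --ces-node cesnode2
mmces node resume -N cesnode1

Suspending marks the node out of the pool so its CES addresses are redistributed, address list confirms where they landed, an explicit address move pins one IP to a chosen node, and resume puts the node back into service.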
Richard_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ Message: 2 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] CES and mmuserauth command Message-ID: > Content-Type: text/plain; charset="ISO-2022-JP" The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Jan-Frode Myklebust > To: gpfsug main discussion list > Date: 08/26/2016 05:59 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: ------------------------------------------------------------------------------------------------------ ??user?name userName Specifies the user name to be used to perform operations against the authentication server. The specified user name must have sufficient permissions to read user and group attributes from the authentication server. ------------------------------------------------------------------------------------------------------- Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? -jf_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss End of gpfsug-discuss Digest, Vol 55, Issue 44 ********************************************** -------------- next part -------------- An HTML attachment was scrubbed... URL: From S.J.Thompson at bham.ac.uk Fri Sep 2 16:15:30 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Fri, 2 Sep 2016 15:15:30 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: , Message-ID: Should it? If you were running nfs and smb, would you necessarily want to fail the ip over? 
Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sobey, Richard A [r.sobey at imperial.ac.uk] Sent: 02 September 2016 16:10 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? I?ve verified the upgrade has fixed this issue so thanks again. However I?ve noticed that stopping SMB doesn?t trigger an IP address failover. I think it should. mmces node suspend (or rebooting, or mmces address move, or etc..) seems to trigger the failover. Richard From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Danny Alexander Calderon Rodriguez Sent: 27 August 2016 13:53 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Hi Richard This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly Just do this. edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py Change if authType == 'ad' and not nodeState.nfsStopped: to nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: You need to stop the gpfs service in each node where you apply the change " after change the lines please use tap key" Enviado desde mi iPhone El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) 2. Re: CES and mmuserauth command (Christof Schmitt) ---------------------------------------------------------------------- Message: 1 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Message-ID: > Content-Type: text/plain; charset="UTF-8" That would be the case when Active Directory is configured for authentication. In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: "Sobey, Richard A" > To: "'gpfsug-discuss at spectrumscale.org'" > Date: 08/26/2016 04:48 AM Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Sent by: gpfsug-discuss-bounces at spectrumscale.org Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. I?m trying to stop SMB on a node, but getting the following output: [root at cesnode ~]# mmces service stop smb smb: Request denied. Please stop NFS first [root at cesnode ~]# mmces service list Enabled services: SMB SMB is running As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. 
It?s happening on all the nodes in the cluster. SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. Richard_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ Message: 2 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] CES and mmuserauth command Message-ID: > Content-Type: text/plain; charset="ISO-2022-JP" The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Jan-Frode Myklebust > To: gpfsug main discussion list > Date: 08/26/2016 05:59 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: ------------------------------------------------------------------------------------------------------ ??user?name userName Specifies the user name to be used to perform operations against the authentication server. The specified user name must have sufficient permissions to read user and group attributes from the authentication server. ------------------------------------------------------------------------------------------------------- Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? -jf_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss End of gpfsug-discuss Digest, Vol 55, Issue 44 ********************************************** From r.sobey at imperial.ac.uk Fri Sep 2 16:23:28 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 2 Sep 2016 15:23:28 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? 
In-Reply-To: References: , Message-ID: A fair point, but since we're not running NFS, a failure of the only other service [SMB], whether it stops through user input or some other means, should cause the node to go unhealthy (in CTDB parlance) and trigger a failover. That would be my preference. Otoh if you were running NFS and SMB and one of those services crashed, do you still want a node in the cluster that could potentially respond and fails to do so? I guess it's a question for each organisation to answer themselves. -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Simon Thompson (Research Computing - IT Services) Sent: 02 September 2016 16:16 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Should it? If you were running nfs and smb, would you necessarily want to fail the ip over? Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sobey, Richard A [r.sobey at imperial.ac.uk] Sent: 02 September 2016 16:10 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? I've verified the upgrade has fixed this issue so thanks again. However I've noticed that stopping SMB doesn't trigger an IP address failover. I think it should. mmces node suspend (or rebooting, or mmces address move, or etc..) seems to trigger the failover. Richard From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Danny Alexander Calderon Rodriguez Sent: 27 August 2016 13:53 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Hi Richard This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly Just do this. edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py Change if authType == 'ad' and not nodeState.nfsStopped: to nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: You need to stop the gpfs service in each node where you apply the change " after change the lines please use tap key" Enviado desde mi iPhone El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) 2. Re: CES and mmuserauth command (Christof Schmitt) ---------------------------------------------------------------------- Message: 1 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Message-ID: > Content-Type: text/plain; charset="UTF-8" That would be the case when Active Directory is configured for authentication. 
In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: "Sobey, Richard A" > To: "'gpfsug-discuss at spectrumscale.org'" > Date: 08/26/2016 04:48 AM Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Sent by: gpfsug-discuss-bounces at spectrumscale.org Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. I?m trying to stop SMB on a node, but getting the following output: [root at cesnode ~]# mmces service stop smb smb: Request denied. Please stop NFS first [root at cesnode ~]# mmces service list Enabled services: SMB SMB is running As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. It?s happening on all the nodes in the cluster. SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. Richard_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ Message: 2 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] CES and mmuserauth command Message-ID: > Content-Type: text/plain; charset="ISO-2022-JP" The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Jan-Frode Myklebust > To: gpfsug main discussion list > Date: 08/26/2016 05:59 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: ------------------------------------------------------------------------------------------------------ ??user?name userName Specifies the user name to be used to perform operations against the authentication server. The specified user name must have sufficient permissions to read user and group attributes from the authentication server. 
------------------------------------------------------------------------------------------------------- Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? -jf_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss End of gpfsug-discuss Digest, Vol 55, Issue 44 ********************************************** _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From ulmer at ulmer.org Fri Sep 2 17:02:44 2016 From: ulmer at ulmer.org (Stephen Ulmer) Date: Fri, 2 Sep 2016 12:02:44 -0400 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: Message-ID: I think that stopping SMB is an explicitly different assertion than suspending the node, et cetera. When you ask the service to stop, it should stop -- not start a game of whack-a-mole. If you wanted to move the service there are other other ways. If it fails, clearly it the IP address should move. Liberty, -- Stephen > On Sep 2, 2016, at 11:23 AM, Sobey, Richard A wrote: > > A fair point, but since we're not running NFS, a failure of the only other service [SMB], whether it stops through user input or some other means, should cause the node to go unhealthy (in CTDB parlance) and trigger a failover. That would be my preference. > > Otoh if you were running NFS and SMB and one of those services crashed, do you still want a node in the cluster that could potentially respond and fails to do so? I guess it's a question for each organisation to answer themselves. > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Simon Thompson (Research Computing - IT Services) > Sent: 02 September 2016 16:16 > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > > > Should it? > > If you were running nfs and smb, would you necessarily want to fail the ip over? > > Simon > ________________________________________ > From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sobey, Richard A [r.sobey at imperial.ac.uk] > Sent: 02 September 2016 16:10 > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > > I've verified the upgrade has fixed this issue so thanks again. > > However I've noticed that stopping SMB doesn't trigger an IP address failover. I think it should. mmces node suspend (or rebooting, or mmces address move, or etc..) seems to trigger the failover. > > Richard > > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Danny Alexander Calderon Rodriguez > Sent: 27 August 2016 13:53 > To: gpfsug-discuss at spectrumscale.org > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > > Hi Richard > > This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly > > > Just do this. 
> > edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py > > > > Change > > if authType == 'ad' and not nodeState.nfsStopped: > > to > > > > nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) > if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: > > > You need to stop the gpfs service in each node where you apply the change > > > " after change the lines please use tap key" > > > > Enviado desde mi iPhone > > El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: > Send gpfsug-discuss mailing list submissions to > gpfsug-discuss at spectrumscale.org > > To subscribe or unsubscribe via the World Wide Web, visit > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > or, via email, send a message with subject or body 'help' to > gpfsug-discuss-request at spectrumscale.org > > You can reach the person managing the list at > gpfsug-discuss-owner at spectrumscale.org > > When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." > > > Today's Topics: > > 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) > 2. Re: CES and mmuserauth command (Christof Schmitt) > > > ---------------------------------------------------------------------- > > Message: 1 > Date: Fri, 26 Aug 2016 12:29:31 -0400 > From: "Christof Schmitt" > > To: gpfsug main discussion list > > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > Message-ID: > > > > Content-Type: text/plain; charset="UTF-8" > > That would be the case when Active Directory is configured for authentication. In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. > > Regards, > > Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ > christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) > > > > From: "Sobey, Richard A" > > To: "'gpfsug-discuss at spectrumscale.org'" > > > Date: 08/26/2016 04:48 AM > Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. > > I?m trying to stop SMB on a node, but getting the following output: > > [root at cesnode ~]# mmces service stop smb > smb: Request denied. Please stop NFS first > > [root at cesnode ~]# mmces service list > Enabled services: SMB > SMB is running > > As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. It?s happening on all the nodes in the cluster. > > SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. > > Richard_______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > ------------------------------ > > Message: 2 > Date: Fri, 26 Aug 2016 12:29:31 -0400 > From: "Christof Schmitt" > > To: gpfsug main discussion list > > Subject: Re: [gpfsug-discuss] CES and mmuserauth command > Message-ID: > > > > Content-Type: text/plain; charset="ISO-2022-JP" > > The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. > > The same applies to the information shown in "service list". 
There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and > LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. > > Regards, > > Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ > christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) > > > > From: Jan-Frode Myklebust > > To: gpfsug main discussion list > > Date: 08/26/2016 05:59 AM > Subject: Re: [gpfsug-discuss] CES and mmuserauth command > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: > > When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). > > > That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: > > ------------------------------------------------------------------------------------------------------ > ??user?name userName > Specifies the user name to be used to perform operations > against the authentication server. The specified user > name must have sufficient permissions to read user and > group attributes from the authentication server. > ------------------------------------------------------------------------------------------------------- > > Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? > > > > -jf_______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > > ------------------------------ > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > End of gpfsug-discuss Digest, Vol 55, Issue 44 > ********************************************** > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss From laurence at qsplace.co.uk Fri Sep 2 18:54:02 2016 From: laurence at qsplace.co.uk (Laurence Horrors-Barlow) Date: Fri, 2 Sep 2016 19:54:02 +0200 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: Message-ID: <721250E5-767B-4C44-A9E1-5DD255FD4F7D@qsplace.co.uk> I believe the services auto restart on a crash (or kill), a change I noticed between 4.1.1 and 4.2 hence no IP fail over. Suspending a node to force a fail over is possible the most sensible approach. -- Lauz Sent from my iPad > On 2 Sep 2016, at 18:02, Stephen Ulmer wrote: > > I think that stopping SMB is an explicitly different assertion than suspending the node, et cetera. 
When you ask the service to stop, it should stop -- not start a game of whack-a-mole. > > If you wanted to move the service there are other other ways. If it fails, clearly it the IP address should move. > > Liberty, > > -- > Stephen > > > >> On Sep 2, 2016, at 11:23 AM, Sobey, Richard A wrote: >> >> A fair point, but since we're not running NFS, a failure of the only other service [SMB], whether it stops through user input or some other means, should cause the node to go unhealthy (in CTDB parlance) and trigger a failover. That would be my preference. >> >> Otoh if you were running NFS and SMB and one of those services crashed, do you still want a node in the cluster that could potentially respond and fails to do so? I guess it's a question for each organisation to answer themselves. >> >> -----Original Message----- >> From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Simon Thompson (Research Computing - IT Services) >> Sent: 02 September 2016 16:16 >> To: gpfsug main discussion list >> Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? >> >> >> Should it? >> >> If you were running nfs and smb, would you necessarily want to fail the ip over? >> >> Simon >> ________________________________________ >> From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sobey, Richard A [r.sobey at imperial.ac.uk] >> Sent: 02 September 2016 16:10 >> To: gpfsug main discussion list >> Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? >> >> I've verified the upgrade has fixed this issue so thanks again. >> >> However I've noticed that stopping SMB doesn't trigger an IP address failover. I think it should. mmces node suspend (or rebooting, or mmces address move, or etc..) seems to trigger the failover. >> >> Richard >> >> From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Danny Alexander Calderon Rodriguez >> Sent: 27 August 2016 13:53 >> To: gpfsug-discuss at spectrumscale.org >> Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? >> >> Hi Richard >> >> This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly >> >> >> Just do this. >> >> edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py >> >> >> >> Change >> >> if authType == 'ad' and not nodeState.nfsStopped: >> >> to >> >> >> >> nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) >> if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: >> >> >> You need to stop the gpfs service in each node where you apply the change >> >> >> " after change the lines please use tap key" >> >> >> >> Enviado desde mi iPhone >> >> El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: >> Send gpfsug-discuss mailing list submissions to >> gpfsug-discuss at spectrumscale.org >> >> To subscribe or unsubscribe via the World Wide Web, visit >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> or, via email, send a message with subject or body 'help' to >> gpfsug-discuss-request at spectrumscale.org >> >> You can reach the person managing the list at >> gpfsug-discuss-owner at spectrumscale.org >> >> When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." >> >> >> Today's Topics: >> >> 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) >> 2. 
Re: CES and mmuserauth command (Christof Schmitt) >> >> >> ---------------------------------------------------------------------- >> >> Message: 1 >> Date: Fri, 26 Aug 2016 12:29:31 -0400 >> From: "Christof Schmitt" > >> To: gpfsug main discussion list > >> Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? >> Message-ID: >> > >> >> Content-Type: text/plain; charset="UTF-8" >> >> That would be the case when Active Directory is configured for authentication. In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. >> >> Regards, >> >> Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ >> christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) >> >> >> >> From: "Sobey, Richard A" > >> To: "'gpfsug-discuss at spectrumscale.org'" >> > >> Date: 08/26/2016 04:48 AM >> Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? >> Sent by: gpfsug-discuss-bounces at spectrumscale.org >> >> >> >> Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. >> >> I?m trying to stop SMB on a node, but getting the following output: >> >> [root at cesnode ~]# mmces service stop smb >> smb: Request denied. Please stop NFS first >> >> [root at cesnode ~]# mmces service list >> Enabled services: SMB >> SMB is running >> >> As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. It?s happening on all the nodes in the cluster. >> >> SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. >> >> Richard_______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> >> >> >> ------------------------------ >> >> Message: 2 >> Date: Fri, 26 Aug 2016 12:29:31 -0400 >> From: "Christof Schmitt" > >> To: gpfsug main discussion list > >> Subject: Re: [gpfsug-discuss] CES and mmuserauth command >> Message-ID: >> > >> >> Content-Type: text/plain; charset="ISO-2022-JP" >> >> The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. >> >> The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and >> LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. >> >> Regards, >> >> Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ >> christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) >> >> >> >> From: Jan-Frode Myklebust > >> To: gpfsug main discussion list > >> Date: 08/26/2016 05:59 AM >> Subject: Re: [gpfsug-discuss] CES and mmuserauth command >> Sent by: gpfsug-discuss-bounces at spectrumscale.org >> >> >> >> >> On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: >> >> When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. 
the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). >> >> >> That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: >> >> ------------------------------------------------------------------------------------------------------ >> ??user?name userName >> Specifies the user name to be used to perform operations >> against the authentication server. The specified user >> name must have sufficient permissions to read user and >> group attributes from the authentication server. >> ------------------------------------------------------------------------------------------------------- >> >> Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? >> >> >> >> -jf_______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> >> >> >> >> ------------------------------ >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> End of gpfsug-discuss Digest, Vol 55, Issue 44 >> ********************************************** >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > From christof.schmitt at us.ibm.com Fri Sep 2 19:20:45 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Fri, 2 Sep 2016 11:20:45 -0700 Subject: [gpfsug-discuss] CES and mmuserauth command In-Reply-To: References: Message-ID: After looking into this again, the source of confusion is probably from the fact that there are three different authentication schemes present here: When configuring a LDAP server for file or object authentication, then the specified server, user and password are used during normal operations for querying user data. The same applies for configuring object authentication with AD; AD is here treated as a LDAP server. Configuring AD for file authentication is different in that during the "mmuserauth service create", the machine account is created, and then that account is used to connect to a DC that is chosen from the DCs discovered through DNS and not necessarily the one used for the initial configuration. I submitted an internal request to explain this better in the mmuserauth manpage. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Christof Schmitt/Tucson/IBM at IBMUS To: gpfsug main discussion list Date: 08/26/2016 09:30 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. 
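For completeness, a sketch of what the AD file-authentication setup being described here typically looks like; every value below is a placeholder for the local domain, and the exact options should be checked against the mmuserauth man page for the installed release:

mmuserauth service create --type ad --data-access-method file \
    --netbios-name cescluster --user-name ad-join-account \
    --servers dc1.example.com --idmap-role master \
    --unixmap-domains 'EXAMPLE(10000-299999)'
mmuserauth service list

After the join, the machine account created for the cluster is what talks to whichever DC is discovered via DNS, which is why the USER_NAME shown by service list is only a record of the account used at configuration time.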
I will try to get some clarification added for the AD case. The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Jan-Frode Myklebust To: gpfsug main discussion list Date: 08/26/2016 05:59 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: ------------------------------------------------------------------------------------------------------ ??user?name userName Specifies the user name to be used to perform operations against the authentication server. The specified user name must have sufficient permissions to read user and group attributes from the authentication server. ------------------------------------------------------------------------------------------------------- Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? -jf_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From r.sobey at imperial.ac.uk Fri Sep 2 22:02:03 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 2 Sep 2016 21:02:03 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: , Message-ID: That makes more sense putting it that way. Cheers Richard Get Outlook for Android On Fri, Sep 2, 2016 at 5:04 PM +0100, "Stephen Ulmer" > wrote: I think that stopping SMB is an explicitly different assertion than suspending the node, et cetera. When you ask the service to stop, it should stop -- not start a game of whack-a-mole. If you wanted to move the service there are other other ways. If it fails, clearly it the IP address should move. Liberty, -- Stephen > On Sep 2, 2016, at 11:23 AM, Sobey, Richard A wrote: > > A fair point, but since we're not running NFS, a failure of the only other service [SMB], whether it stops through user input or some other means, should cause the node to go unhealthy (in CTDB parlance) and trigger a failover. That would be my preference. > > Otoh if you were running NFS and SMB and one of those services crashed, do you still want a node in the cluster that could potentially respond and fails to do so? 
I guess it's a question for each organisation to answer themselves. > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Simon Thompson (Research Computing - IT Services) > Sent: 02 September 2016 16:16 > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > > > Should it? > > If you were running nfs and smb, would you necessarily want to fail the ip over? > > Simon > ________________________________________ > From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sobey, Richard A [r.sobey at imperial.ac.uk] > Sent: 02 September 2016 16:10 > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > > I've verified the upgrade has fixed this issue so thanks again. > > However I've noticed that stopping SMB doesn't trigger an IP address failover. I think it should. mmces node suspend (or rebooting, or mmces address move, or etc..) seems to trigger the failover. > > Richard > > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Danny Alexander Calderon Rodriguez > Sent: 27 August 2016 13:53 > To: gpfsug-discuss at spectrumscale.org > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > > Hi Richard > > This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly > > > Just do this. > > edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py > > > > Change > > if authType == 'ad' and not nodeState.nfsStopped: > > to > > > > nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) > if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: > > > You need to stop the gpfs service in each node where you apply the change > > > " after change the lines please use tap key" > > > > Enviado desde mi iPhone > > El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: > Send gpfsug-discuss mailing list submissions to > gpfsug-discuss at spectrumscale.org > > To subscribe or unsubscribe via the World Wide Web, visit > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > or, via email, send a message with subject or body 'help' to > gpfsug-discuss-request at spectrumscale.org > > You can reach the person managing the list at > gpfsug-discuss-owner at spectrumscale.org > > When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." > > > Today's Topics: > > 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) > 2. Re: CES and mmuserauth command (Christof Schmitt) > > > ---------------------------------------------------------------------- > > Message: 1 > Date: Fri, 26 Aug 2016 12:29:31 -0400 > From: "Christof Schmitt" > > To: gpfsug main discussion list > > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > Message-ID: > > > > Content-Type: text/plain; charset="UTF-8" > > That would be the case when Active Directory is configured for authentication. In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. 
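In practical terms, that dependency dictates the stop order on an AD-configured protocol node: take NFS down first, then SMB, because SMB is also carrying the authentication and ID-mapping service. A minimal sketch of the sequence (check the mmces man page for exact options; on levels before the 4.2.1 fix described above, the denial appears even when NFS is not enabled at all, which is the bug being discussed):

[root at cesnode ~]# mmces service list        # confirm what is enabled and running on this node
[root at cesnode ~]# mmces service stop nfs    # bring NFS down first, since it relies on SMB for auth/ID mapping
[root at cesnode ~]# mmces service stop smb    # should now succeed instead of "smb: Request denied. Please stop NFS first"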
> > Regards, > > Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ > christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) > > > > From: "Sobey, Richard A" > > To: "'gpfsug-discuss at spectrumscale.org'" > > > Date: 08/26/2016 04:48 AM > Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. > > I?m trying to stop SMB on a node, but getting the following output: > > [root at cesnode ~]# mmces service stop smb > smb: Request denied. Please stop NFS first > > [root at cesnode ~]# mmces service list > Enabled services: SMB > SMB is running > > As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. It?s happening on all the nodes in the cluster. > > SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. > > Richard_______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > ------------------------------ > > Message: 2 > Date: Fri, 26 Aug 2016 12:29:31 -0400 > From: "Christof Schmitt" > > To: gpfsug main discussion list > > Subject: Re: [gpfsug-discuss] CES and mmuserauth command > Message-ID: > > > > Content-Type: text/plain; charset="ISO-2022-JP" > > The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. > > The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and > LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. > > Regards, > > Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ > christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) > > > > From: Jan-Frode Myklebust > > To: gpfsug main discussion list > > Date: 08/26/2016 05:59 AM > Subject: Re: [gpfsug-discuss] CES and mmuserauth command > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: > > When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). > > > That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: > > ------------------------------------------------------------------------------------------------------ > ??user?name userName > Specifies the user name to be used to perform operations > against the authentication server. The specified user > name must have sufficient permissions to read user and > group attributes from the authentication server. 
> ------------------------------------------------------------------------------------------------------- > > Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? > > > > -jf_______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > > ------------------------------ > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > End of gpfsug-discuss Digest, Vol 55, Issue 44 > ********************************************** > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From bauer at cesnet.cz Mon Sep 5 14:30:54 2016 From: bauer at cesnet.cz (Miroslav Bauer) Date: Mon, 5 Sep 2016 15:30:54 +0200 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state Message-ID: <9bfb2882-10de-5f2c-98c1-35ac2ac958a2@cesnet.cz> Hello, is there any way to recall a migrated file back to a regular state (other than renaming a file)? I would like to free some space on an external pool (TSM), that is being used by migrated files. And it would be desirable to prevent repeated backups of an already backed-up data (due to changed ctime/inode). I guess that you can acheive only premigrated state with dsmrecall tool (two copies of file data - one on GPFS pool and one on external pool). Maybe deleting 'dmapi.IBMPMig' xattr will do the trick but I don't think it's safe, nor clean :). Thank you in advance, -- Miroslav Bauer -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 3716 bytes Desc: S/MIME Cryptographic Signature URL: From janfrode at tanso.net Mon Sep 5 14:51:44 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Mon, 05 Sep 2016 13:51:44 +0000 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: <9bfb2882-10de-5f2c-98c1-35ac2ac958a2@cesnet.cz> References: <9bfb2882-10de-5f2c-98c1-35ac2ac958a2@cesnet.cz> Message-ID: I believe what you're looking for is dsmrecall -RESident. Plus reconcile on tsm-server to free up the space. Ref: http://www.ibm.com/support/knowledgecenter/SSSR2R_7.1.2/com.ibm.itsm.hsmul.doc/r_cmd_dsmrecall.html -jf man. 5. sep. 2016 kl. 15.30 skrev Miroslav Bauer : > Hello, > > is there any way to recall a migrated file back to a regular state > (other than renaming a file)? I would like to free some space > on an external pool (TSM), that is being used by migrated files. > And it would be desirable to prevent repeated backups of an > already backed-up data (due to changed ctime/inode). > > I guess that you can acheive only premigrated state with dsmrecall tool > (two copies of file data - one on GPFS pool and one on external pool). > Maybe deleting 'dmapi.IBMPMig' xattr will do the trick but I don't think > it's safe, nor clean :). 
> > Thank you in advance, > > -- > Miroslav Bauer > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From bauer at cesnet.cz Mon Sep 5 15:13:42 2016 From: bauer at cesnet.cz (Miroslav Bauer) Date: Mon, 5 Sep 2016 16:13:42 +0200 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: References: <9bfb2882-10de-5f2c-98c1-35ac2ac958a2@cesnet.cz> Message-ID: <55108548-0515-49c8-0e76-fca9b247d337@cesnet.cz> That's right, I must have totally overlooked that! Many thanks! :) -- Miroslav Bauer On 09/05/2016 03:51 PM, Jan-Frode Myklebust wrote: > I believe what you're looking for is dsmrecall -RESident. Plus > reconcile on tsm-server to free up the space. > > Ref: > > http://www.ibm.com/support/knowledgecenter/SSSR2R_7.1.2/com.ibm.itsm.hsmul.doc/r_cmd_dsmrecall.html > > > -jf > man. 5. sep. 2016 kl. 15.30 skrev Miroslav Bauer >: > > Hello, > > is there any way to recall a migrated file back to a regular state > (other than renaming a file)? I would like to free some space > on an external pool (TSM), that is being used by migrated files. > And it would be desirable to prevent repeated backups of an > already backed-up data (due to changed ctime/inode). > > I guess that you can acheive only premigrated state with dsmrecall > tool > (two copies of file data - one on GPFS pool and one on external pool). > Maybe deleting 'dmapi.IBMPMig' xattr will do the trick but I don't > think > it's safe, nor clean :). > > Thank you in advance, > > -- > Miroslav Bauer > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 3716 bytes Desc: S/MIME Cryptographic Signature URL: From mark.birmingham at stfc.ac.uk Mon Sep 5 15:27:29 2016 From: mark.birmingham at stfc.ac.uk (mark.birmingham at stfc.ac.uk) Date: Mon, 5 Sep 2016 14:27:29 +0000 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: <55108548-0515-49c8-0e76-fca9b247d337@cesnet.cz> References: <9bfb2882-10de-5f2c-98c1-35ac2ac958a2@cesnet.cz> <55108548-0515-49c8-0e76-fca9b247d337@cesnet.cz> Message-ID: <47B8D67E32CC2D44A587CD18636BECC82BB3A610@exchmbx01> Yes, that's fine. Just submit the request through SBS. Mark Mark Birmingham Development Team Leader High Performance Systems Group STFC Daresbury Laboratory Phone: +44 (0)1925 603381 Email: mark.birmingham at stfc.ac.uk From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Miroslav Bauer Sent: 05 September 2016 15:14 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state That's right, I must have totally overlooked that! Many thanks! :) -- Miroslav Bauer On 09/05/2016 03:51 PM, Jan-Frode Myklebust wrote: I believe what you're looking for is dsmrecall -RESident. Plus reconcile on tsm-server to free up the space. 
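To make that concrete, a small sketch of the recall-to-resident flow. The path is invented for illustration, and option spellings such as -filelist should be checked against your Spectrum Protect for Space Management client level; see also the dsmrecall reference linked below:

dsmrecall -resident -detail /gpfs/fs1/project/bigfile.dat    # recall a single file back to resident state
dsmrecall -resident -filelist=/tmp/recall.list               # or drive it from a file list, e.g. built by a policy LIST rule
dsmls /gpfs/fs1/project/bigfile.dat                          # dsmls reports whether the file is now resident rather than migrated/premigrated

Space on the Spectrum Protect server side is only reclaimed once reconcile/expiration runs for that file system, as noted above.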
Ref: http://www.ibm.com/support/knowledgecenter/SSSR2R_7.1.2/com.ibm.itsm.hsmul.doc/r_cmd_dsmrecall.html -jf man. 5. sep. 2016 kl. 15.30 skrev Miroslav Bauer >: Hello, is there any way to recall a migrated file back to a regular state (other than renaming a file)? I would like to free some space on an external pool (TSM), that is being used by migrated files. And it would be desirable to prevent repeated backups of an already backed-up data (due to changed ctime/inode). I guess that you can acheive only premigrated state with dsmrecall tool (two copies of file data - one on GPFS pool and one on external pool). Maybe deleting 'dmapi.IBMPMig' xattr will do the trick but I don't think it's safe, nor clean :). Thank you in advance, -- Miroslav Bauer _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From mark.birmingham at stfc.ac.uk Mon Sep 5 15:30:53 2016 From: mark.birmingham at stfc.ac.uk (mark.birmingham at stfc.ac.uk) Date: Mon, 5 Sep 2016 14:30:53 +0000 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: <47B8D67E32CC2D44A587CD18636BECC82BB3A610@exchmbx01> References: <9bfb2882-10de-5f2c-98c1-35ac2ac958a2@cesnet.cz> <55108548-0515-49c8-0e76-fca9b247d337@cesnet.cz> <47B8D67E32CC2D44A587CD18636BECC82BB3A610@exchmbx01> Message-ID: <47B8D67E32CC2D44A587CD18636BECC82BB3A62A@exchmbx01> Sorry All! Noob error - replied to the wrong email!!! Mark Mark Birmingham Development Team Leader High Performance Systems Group STFC Daresbury Laboratory Phone: +44 (0)1925 603381 Email: mark.birmingham at stfc.ac.uk From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of mark.birmingham at stfc.ac.uk Sent: 05 September 2016 15:27 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state Yes, that's fine. Just submit the request through SBS. Mark Mark Birmingham Development Team Leader High Performance Systems Group STFC Daresbury Laboratory Phone: +44 (0)1925 603381 Email: mark.birmingham at stfc.ac.uk From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Miroslav Bauer Sent: 05 September 2016 15:14 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state That's right, I must have totally overlooked that! Many thanks! :) -- Miroslav Bauer On 09/05/2016 03:51 PM, Jan-Frode Myklebust wrote: I believe what you're looking for is dsmrecall -RESident. Plus reconcile on tsm-server to free up the space. Ref: http://www.ibm.com/support/knowledgecenter/SSSR2R_7.1.2/com.ibm.itsm.hsmul.doc/r_cmd_dsmrecall.html -jf man. 5. sep. 2016 kl. 15.30 skrev Miroslav Bauer >: Hello, is there any way to recall a migrated file back to a regular state (other than renaming a file)? I would like to free some space on an external pool (TSM), that is being used by migrated files. And it would be desirable to prevent repeated backups of an already backed-up data (due to changed ctime/inode). I guess that you can acheive only premigrated state with dsmrecall tool (two copies of file data - one on GPFS pool and one on external pool). 
Maybe deleting 'dmapi.IBMPMig' xattr will do the trick but I don't think it's safe, nor clean :). Thank you in advance, -- Miroslav Bauer _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From dominic.mueller at de.ibm.com Tue Sep 6 13:04:36 2016 From: dominic.mueller at de.ibm.com (Dominic Mueller-Wicke01) Date: Tue, 6 Sep 2016 14:04:36 +0200 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: References: Message-ID: Hi Miroslav, please use the command: > dsmrecall -resident -detail or use it with file lists Greetings, Dominic. From: gpfsug-discuss-request at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Date: 06.09.2016 13:00 Subject: gpfsug-discuss Digest, Vol 56, Issue 10 Sent by: gpfsug-discuss-bounces at spectrumscale.org Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: DMAPI - Unmigrate file to Regular state (mark.birmingham at stfc.ac.uk) ----- Message from on Mon, 5 Sep 2016 14:30:53 +0000 ----- To: Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state Sorry All! Noob error ? replied to the wrong email!!! Mark Mark Birmingham Development Team Leader High Performance Systems Group STFC Daresbury Laboratory Phone: +44 (0)1925 603381 Email: mark.birmingham at stfc.ac.uk From: gpfsug-discuss-bounces at spectrumscale.org [ mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of mark.birmingham at stfc.ac.uk Sent: 05 September 2016 15:27 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state Yes, that?s fine. Just submit the request through SBS. Mark Mark Birmingham Development Team Leader High Performance Systems Group STFC Daresbury Laboratory Phone: +44 (0)1925 603381 Email: mark.birmingham at stfc.ac.uk From: gpfsug-discuss-bounces at spectrumscale.org [ mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Miroslav Bauer Sent: 05 September 2016 15:14 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state That's right, I must have totally overlooked that! Many thanks! :) -- Miroslav Bauer On 09/05/2016 03:51 PM, Jan-Frode Myklebust wrote: I believe what you're looking for is dsmrecall -RESident. Plus reconcile on tsm-server to free up the space. Ref: http://www.ibm.com/support/knowledgecenter/SSSR2R_7.1.2/com.ibm.itsm.hsmul.doc/r_cmd_dsmrecall.html -jf man. 5. sep. 2016 kl. 15.30 skrev Miroslav Bauer : Hello, is there any way to recall a migrated file back to a regular state (other than renaming a file)? I would like to free some space on an external pool (TSM), that is being used by migrated files. 
And it would be desirable to prevent repeated backups of an already backed-up data (due to changed ctime/inode). I guess that you can acheive only premigrated state with dsmrecall tool (two copies of file data - one on GPFS pool and one on external pool). Maybe deleting 'dmapi.IBMPMig' xattr will do the trick but I don't think it's safe, nor clean :). Thank you in advance, -- Miroslav Bauer _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From volobuev at us.ibm.com Tue Sep 6 20:06:32 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Tue, 6 Sep 2016 12:06:32 -0700 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: <505ff859-d49a-04cc-bd9d-50f7b2a8df0b@cesnet.cz> References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz><2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> <505ff859-d49a-04cc-bd9d-50f7b2a8df0b@cesnet.cz> Message-ID: The correct way to accomplish what you're looking for (in particular, changing the fs-wide level of replication) is mmrestripefs -R. This command also takes care of moving data off disks now marked metadataOnly. The restripe job hits an error trying to move blocks of the inode file, i.e. before it gets to actual user data blocks. Note that at this point the metadata replication factor is still 2. This suggests one of two possibilities: (1) there isn't enough actual free space on the remaining metadataOnly disks, (2) there isn't enough space in some failure groups to allocate two replicas. All of this assumes you're operating within a single storage pool. If multiple storage pools are in play, there are other possibilities. 'mmdf' output would be helpful in providing more helpful advice. With the information at hand, I can only suggest trying to accomplish the task in two phases: (a) deallocated extra metadata replicas, by doing mmchfs -m 1 + mmrestripefs -R (b) move metadata off SATA disks. I do want to point out that metadata replication is a highly recommended insurance policy to have for your file system. As with other kinds of insurance, you may or may not need it, but if you do end up needing it, you'll be very glad you have it. The costs, in terms of extra metadata space and performance overhead, are very reasonable. yuri From: Miroslav Bauer To: gpfsug-discuss at spectrumscale.org, Date: 09/01/2016 07:29 AM Subject: Re: [gpfsug-discuss] Migration to separate metadata and data disks Sent by: gpfsug-discuss-bounces at spectrumscale.org Yes, failure group id is exactly what I meant :). Unfortunately, mmrestripefs with -R behaves the same as with -r. I also believed that mmrestripefs -R is the correct tool for fixing the replication settings on inodes (according to manpages), but I will try possible solutions you and Marc suggested and let you know how it went. Thank you, -- Miroslav Bauer On 09/01/2016 04:02 PM, Aaron Knister wrote: > Oh! 
I think you've already provided the info I was looking for :) I > thought that failGroup=3 meant there were 3 failure groups within the > SSDs. I suspect that's not at all what you meant and that actually is > the failure group of all of those disks. That I think explains what's > going on-- there's only one failure group's worth of metadata-capable > disks available and as such GPFS can't place the 2nd replica for > existing files. > > Here's what I would suggest: > > - Create at least 2 failure groups within the SSDs > - Put the default metadata replication factor back to 2 > - Run a restripefs -R to shuffle files around and restore the metadata > replication factor of 2 to any files created while it was set to 1 > > If you're not interested in replication for metadata then perhaps all > you need to do is the mmrestripefs -R. I think that should > un-replicate the file from the SATA disks leaving the copy on the SSDs. > > Hope that helps. > > -Aaron > > On 9/1/16 9:39 AM, Aaron Knister wrote: >> By the way, I suspect the no space on device errors are because GPFS >> believes for some reason that it is unable to maintain the metadata >> replication factor of 2 that's likely set on all previously created >> inodes. >> >> On 9/1/16 9:36 AM, Aaron Knister wrote: >>> I must admit, I'm curious as to the reason you're dropping the >>> replication factor from 2 down to 1. There are some serious advantages >>> we've seen to having multiple metadata replicas, as far as error >>> recovery is concerned. >>> >>> Could you paste an output of mmlsdisk for the filesystem? >>> >>> -Aaron >>> >>> On 9/1/16 9:30 AM, Miroslav Bauer wrote: >>>> Hello, >>>> >>>> I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the >>>> filesystem metadata from state: >>>> -m = 2 (default metadata replicas) >>>> - SATA disks (dataAndMetadata, failGroup=1) >>>> - SSDs (metadataOnly, failGroup=3) >>>> to the desired state: >>>> -m = 1 >>>> - SATA disks (dataOnly, failGroup=1) >>>> - SSDs (metadataOnly, failGroup=3) >>>> >>>> I have done the following steps in the following order: >>>> 1) change SATA disks to dataOnly (stanza file modifies the 'usage' >>>> attribute only): >>>> # mmchdisk fs1 change -F dataOnly_disks.stanza >>>> Attention: Disk parameters were changed. >>>> Use the mmrestripefs command with the -r option to relocate data and >>>> metadata. >>>> Verifying file system configuration information ... >>>> mmchdisk: Propagating the cluster configuration data to all >>>> affected nodes. This is an asynchronous process. >>>> >>>> 2) change default metadata replicas number 2->1 >>>> # mmchfs fs1 -m 1 >>>> >>>> 3) run mmrestripefs as suggested by output of 1) >>>> # mmrestripefs fs1 -r >>>> Scanning file system metadata, phase 1 ... >>>> Error processing inodes. >>>> No space left on device >>>> mmrestripefs: Command failed. Examine previous error messages to >>>> determine cause. >>>> >>>> It is, however, still possible to create new files on the filesystem. >>>> When I return one of the SATA disks as a dataAndMetadata disk, the >>>> mmrestripefs >>>> command stops complaining about No space left on device. Both df and >>>> mmdf >>>> say that there is enough space both for data (SATA) and metadata >>>> (SSDs). >>>> Does anyone have an idea why is it complaining? 
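As an aside, Aaron's first suggestion (two failure groups within the SSDs plus -m 2) would look roughly like the sketch below. The stanza syntax, and the idea of changing failureGroup through mmchdisk change, are assumptions to verify against the man pages; the NSD names are the ssd_* disks that appear in the mmdf output later in the thread:

%nsd: nsd=ssd_1_1 usage=metadataOnly failureGroup=3
%nsd: nsd=ssd_2_2 usage=metadataOnly failureGroup=3
%nsd: nsd=ssd_3_3 usage=metadataOnly failureGroup=4
%nsd: nsd=ssd_4_4 usage=metadataOnly failureGroup=4
%nsd: nsd=ssd_5_5 usage=metadataOnly failureGroup=4

# mmchdisk fs1 change -F ssd_failgroups.stanza
# mmchfs fs1 -m 2
# mmrestripefs fs1 -R

The thread ultimately goes the other way (a single metadata replica), so treat this as the alternative path only.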
>>>> >>>> Thanks, >>>> >>>> -- >>>> Miroslav Bauer >>>> >>>> >>>> >>>> >>>> _______________________________________________ >>>> gpfsug-discuss mailing list >>>> gpfsug-discuss at spectrumscale.org >>>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>>> >>> >> > [attachment "smime.p7s" deleted by Yuri L Volobuev/Austin/IBM] _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From bauer at cesnet.cz Wed Sep 7 10:40:19 2016 From: bauer at cesnet.cz (Miroslav Bauer) Date: Wed, 7 Sep 2016 11:40:19 +0200 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> <2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> <505ff859-d49a-04cc-bd9d-50f7b2a8df0b@cesnet.cz> Message-ID: Hello Yuri, here goes the actual mmdf output of filesystem in question: disk disk size failure holds holds free free name group metadata data in full blocks in fragments --------------- ------------- -------- -------- ----- -------------------- ------------------- Disks in storage pool: system (Maximum disk size allowed is 40 TB) dcsh_10C 5T 1 Yes Yes 1.661T ( 33%) 68.48G ( 1%) dcsh_10D 6.828T 1 Yes Yes 2.809T ( 41%) 83.82G ( 1%) dcsh_11C 5T 1 Yes Yes 1.659T ( 33%) 69.01G ( 1%) dcsh_11D 6.828T 1 Yes Yes 2.81T ( 41%) 83.33G ( 1%) dcsh_12C 5T 1 Yes Yes 1.659T ( 33%) 69.48G ( 1%) dcsh_12D 6.828T 1 Yes Yes 2.807T ( 41%) 83.14G ( 1%) dcsh_13C 5T 1 Yes Yes 1.659T ( 33%) 69.35G ( 1%) dcsh_13D 6.828T 1 Yes Yes 2.81T ( 41%) 82.97G ( 1%) dcsh_14C 5T 1 Yes Yes 1.66T ( 33%) 69.06G ( 1%) dcsh_14D 6.828T 1 Yes Yes 2.811T ( 41%) 83.61G ( 1%) dcsh_15C 5T 1 Yes Yes 1.658T ( 33%) 69.38G ( 1%) dcsh_15D 6.828T 1 Yes Yes 2.814T ( 41%) 83.69G ( 1%) dcsd_15D 6.828T 1 Yes Yes 2.811T ( 41%) 83.98G ( 1%) dcsd_15C 5T 1 Yes Yes 1.66T ( 33%) 68.66G ( 1%) dcsd_14D 6.828T 1 Yes Yes 2.81T ( 41%) 84.18G ( 1%) dcsd_14C 5T 1 Yes Yes 1.659T ( 33%) 69.43G ( 1%) dcsd_13D 6.828T 1 Yes Yes 2.81T ( 41%) 83.27G ( 1%) dcsd_13C 5T 1 Yes Yes 1.66T ( 33%) 69.1G ( 1%) dcsd_12D 6.828T 1 Yes Yes 2.81T ( 41%) 83.61G ( 1%) dcsd_12C 5T 1 Yes Yes 1.66T ( 33%) 69.42G ( 1%) dcsd_11D 6.828T 1 Yes Yes 2.811T ( 41%) 83.59G ( 1%) dcsh_10B 5T 1 Yes Yes 1.633T ( 33%) 76.97G ( 2%) dcsh_11A 5T 1 Yes Yes 1.632T ( 33%) 77.29G ( 2%) dcsh_11B 5T 1 Yes Yes 1.633T ( 33%) 76.73G ( 1%) dcsh_12A 5T 1 Yes Yes 1.634T ( 33%) 76.49G ( 1%) dcsd_11C 5T 1 Yes Yes 1.66T ( 33%) 69.25G ( 1%) dcsd_10D 6.828T 1 Yes Yes 2.811T ( 41%) 83.39G ( 1%) dcsh_10A 5T 1 Yes Yes 1.633T ( 33%) 77.06G ( 2%) dcsd_10C 5T 1 Yes Yes 1.66T ( 33%) 69.83G ( 1%) dcsd_15B 5T 1 Yes Yes 1.635T ( 33%) 76.52G ( 1%) dcsd_15A 5T 1 Yes Yes 1.634T ( 33%) 76.24G ( 1%) dcsd_14B 5T 1 Yes Yes 1.634T ( 33%) 76.31G ( 1%) dcsd_14A 5T 1 Yes Yes 1.634T ( 33%) 76.23G ( 1%) dcsd_13B 5T 1 Yes Yes 1.634T ( 33%) 76.13G ( 1%) dcsd_13A 5T 1 Yes Yes 1.634T ( 33%) 76.22G ( 1%) dcsd_12B 5T 1 Yes Yes 1.635T ( 33%) 77.49G ( 2%) dcsd_12A 5T 1 Yes Yes 1.633T ( 33%) 77.13G ( 2%) dcsd_11B 5T 1 Yes Yes 1.633T ( 33%) 76.86G ( 2%) dcsd_11A 5T 1 Yes Yes 1.632T ( 33%) 76.22G ( 1%) dcsd_10B 5T 1 Yes Yes 1.633T ( 33%) 76.79G ( 1%) dcsd_10A 5T 1 Yes Yes 1.633T ( 33%) 77.21G ( 2%) dcsh_15B 5T 1 Yes Yes 1.635T ( 33%) 76.04G ( 1%) dcsh_15A 5T 1 Yes Yes 
1.634T ( 33%) 76.84G ( 2%) dcsh_14B 5T 1 Yes Yes 1.635T ( 33%) 76.75G ( 1%) dcsh_14A 5T 1 Yes Yes 1.633T ( 33%) 76.05G ( 1%) dcsh_13B 5T 1 Yes Yes 1.634T ( 33%) 76.35G ( 1%) dcsh_13A 5T 1 Yes Yes 1.634T ( 33%) 76.68G ( 1%) dcsh_12B 5T 1 Yes Yes 1.635T ( 33%) 76.74G ( 1%) ssd_5_5 80G 3 Yes No 22.31G ( 28%) 7.155G ( 9%) ssd_4_4 80G 3 Yes No 22.21G ( 28%) 7.196G ( 9%) ssd_3_3 80G 3 Yes No 22.2G ( 28%) 7.239G ( 9%) ssd_2_2 80G 3 Yes No 22.24G ( 28%) 7.146G ( 9%) ssd_1_1 80G 3 Yes No 22.29G ( 28%) 7.134G ( 9%) ------------- -------------------- ------------------- (pool total) 262.3T 92.96T ( 35%) 3.621T ( 1%) Disks in storage pool: maid4 (Maximum disk size allowed is 466 TB) ...... ------------- -------------------- ------------------- (pool total) 291T 126.5T ( 43%) 562.6G ( 0%) Disks in storage pool: maid5 (Maximum disk size allowed is 466 TB) ...... ------------- -------------------- ------------------- (pool total) 436.6T 120.8T ( 28%) 25.23G ( 0%) Disks in storage pool: maid6 (Maximum disk size allowed is 466 TB) ....... ------------- -------------------- ------------------- (pool total) 582.1T 358.7T ( 62%) 9.458G ( 0%) ============= ==================== =================== (data) 1.535P 698.9T ( 44%) 4.17T ( 0%) (metadata) 262.3T 92.96T ( 35%) 3.621T ( 1%) ============= ==================== =================== (total) 1.535P 699T ( 44%) 4.205T ( 0%) Inode Information ----------------- Number of used inodes: 79607225 Number of free inodes: 82340423 Number of allocated inodes: 161947648 Maximum number of inodes: 1342177280 I have a smaller testing FS with the same setup (with plenty of free space), and the actual sequence of commands that worked for me was: mmchfs fs1 -m1 mmrestripefs fs1 -R mmrestripefs fs1 -b mmchdisk fs1 change -F ~/nsd_metadata_test (dataAndMetadata -> dataOnly) mmrestripefs fs1 -r Could you please evaluate more on the performance overhead with having metadata on SSD+SATA? Are the read operations automatically directed to faster disks by GPFS? Is each write operation waiting for write to be finished by SATA disks? Thank you, -- Miroslav Bauer On 09/06/2016 09:06 PM, Yuri L Volobuev wrote: > > The correct way to accomplish what you're looking for (in particular, > changing the fs-wide level of replication) is mmrestripefs -R. This > command also takes care of moving data off disks now marked metadataOnly. > > The restripe job hits an error trying to move blocks of the inode > file, i.e. before it gets to actual user data blocks. Note that at > this point the metadata replication factor is still 2. This suggests > one of two possibilities: (1) there isn't enough actual free space on > the remaining metadataOnly disks, (2) there isn't enough space in some > failure groups to allocate two replicas. > > All of this assumes you're operating within a single storage pool. If > multiple storage pools are in play, there are other possibilities. > > 'mmdf' output would be helpful in providing more helpful advice. With > the information at hand, I can only suggest trying to accomplish the > task in two phases: (a) deallocated extra metadata replicas, by doing > mmchfs -m 1 + mmrestripefs -R (b) move metadata off SATA disks. I do > want to point out that metadata replication is a highly recommended > insurance policy to have for your file system. As with other kinds of > insurance, you may or may not need it, but if you do end up needing > it, you'll be very glad you have it. The costs, in terms of extra > metadata space and performance overhead, are very reasonable. 
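Condensing the advice above, and the sequence that worked on the test filesystem, into one sketch for the production case (the filesystem name fs1 and the stanza file name are illustrative; the stanza only flips the SATA NSDs' usage to dataOnly):

# phase 1: drop the second metadata replica everywhere
mmchfs fs1 -m 1
mmrestripefs fs1 -R

# phase 2: stop placing metadata on the SATA disks, then migrate what is already there
mmchdisk fs1 change -F dataOnly_disks.stanza
mmrestripefs fs1 -r

# check free metadata space in the system pool before and after each phase
mmdf fs1 -P system

Whether this completes still depends on the SSDs having room for all of the now single-copy metadata, which is exactly the concern raised in the follow-up later in the thread.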
> > yuri > > > Miroslav Bauer ---09/01/2016 07:29:06 AM---Yes, failure group id is > exactly what I meant :). Unfortunately, mmrestripefs with -R > > From: Miroslav Bauer > To: gpfsug-discuss at spectrumscale.org, > Date: 09/01/2016 07:29 AM > Subject: Re: [gpfsug-discuss] Migration to separate metadata and data > disks > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------------------------------------------------ > > > > Yes, failure group id is exactly what I meant :). Unfortunately, > mmrestripefs with -R > behaves the same as with -r. I also believed that mmrestripefs -R is the > correct tool for > fixing the replication settings on inodes (according to manpages), but I > will try possible > solutions you and Marc suggested and let you know how it went. > > Thank you, > -- > Miroslav Bauer > > On 09/01/2016 04:02 PM, Aaron Knister wrote: > > Oh! I think you've already provided the info I was looking for :) I > > thought that failGroup=3 meant there were 3 failure groups within the > > SSDs. I suspect that's not at all what you meant and that actually is > > the failure group of all of those disks. That I think explains what's > > going on-- there's only one failure group's worth of metadata-capable > > disks available and as such GPFS can't place the 2nd replica for > > existing files. > > > > Here's what I would suggest: > > > > - Create at least 2 failure groups within the SSDs > > - Put the default metadata replication factor back to 2 > > - Run a restripefs -R to shuffle files around and restore the metadata > > replication factor of 2 to any files created while it was set to 1 > > > > If you're not interested in replication for metadata then perhaps all > > you need to do is the mmrestripefs -R. I think that should > > un-replicate the file from the SATA disks leaving the copy on the SSDs. > > > > Hope that helps. > > > > -Aaron > > > > On 9/1/16 9:39 AM, Aaron Knister wrote: > >> By the way, I suspect the no space on device errors are because GPFS > >> believes for some reason that it is unable to maintain the metadata > >> replication factor of 2 that's likely set on all previously created > >> inodes. > >> > >> On 9/1/16 9:36 AM, Aaron Knister wrote: > >>> I must admit, I'm curious as to the reason you're dropping the > >>> replication factor from 2 down to 1. There are some serious advantages > >>> we've seen to having multiple metadata replicas, as far as error > >>> recovery is concerned. > >>> > >>> Could you paste an output of mmlsdisk for the filesystem? > >>> > >>> -Aaron > >>> > >>> On 9/1/16 9:30 AM, Miroslav Bauer wrote: > >>>> Hello, > >>>> > >>>> I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the > >>>> filesystem metadata from state: > >>>> -m = 2 (default metadata replicas) > >>>> - SATA disks (dataAndMetadata, failGroup=1) > >>>> - SSDs (metadataOnly, failGroup=3) > >>>> to the desired state: > >>>> -m = 1 > >>>> - SATA disks (dataOnly, failGroup=1) > >>>> - SSDs (metadataOnly, failGroup=3) > >>>> > >>>> I have done the following steps in the following order: > >>>> 1) change SATA disks to dataOnly (stanza file modifies the 'usage' > >>>> attribute only): > >>>> # mmchdisk fs1 change -F dataOnly_disks.stanza > >>>> Attention: Disk parameters were changed. > >>>> Use the mmrestripefs command with the -r option to relocate > data and > >>>> metadata. > >>>> Verifying file system configuration information ... > >>>> mmchdisk: Propagating the cluster configuration data to all > >>>> affected nodes. 
This is an asynchronous process. > >>>> > >>>> 2) change default metadata replicas number 2->1 > >>>> # mmchfs fs1 -m 1 > >>>> > >>>> 3) run mmrestripefs as suggested by output of 1) > >>>> # mmrestripefs fs1 -r > >>>> Scanning file system metadata, phase 1 ... > >>>> Error processing inodes. > >>>> No space left on device > >>>> mmrestripefs: Command failed. Examine previous error messages to > >>>> determine cause. > >>>> > >>>> It is, however, still possible to create new files on the filesystem. > >>>> When I return one of the SATA disks as a dataAndMetadata disk, the > >>>> mmrestripefs > >>>> command stops complaining about No space left on device. Both df and > >>>> mmdf > >>>> say that there is enough space both for data (SATA) and metadata > >>>> (SSDs). > >>>> Does anyone have an idea why is it complaining? > >>>> > >>>> Thanks, > >>>> > >>>> -- > >>>> Miroslav Bauer > >>>> > >>>> > >>>> > >>>> > >>>> _______________________________________________ > >>>> gpfsug-discuss mailing list > >>>> gpfsug-discuss at spectrumscale.org > >>>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > >>>> > >>> > >> > > > > > [attachment "smime.p7s" deleted by Yuri L Volobuev/Austin/IBM] > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 3716 bytes Desc: S/MIME Cryptographic Signature URL: From S.J.Thompson at bham.ac.uk Wed Sep 7 13:36:48 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Wed, 7 Sep 2016 12:36:48 +0000 Subject: [gpfsug-discuss] Remote cluster mount failing Message-ID: Hi All, I'm trying to get some multi cluster thing working between two of our GPFS clusters. In the "client" cluster, when trying to mount the "remote" cluster, I get: # mmmount gpfs Wed 7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ... mount: mount /dev/gpfs on /gpfs failed: Connection timed out mmmount: Command failed. Examine previous error messages to determine cause. And in the log file: Wed Sep 7 13:33:07.481 2016: [N] The client side TLS handshake with node 10.0.0.182 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.486 2016: [N] The client side TLS handshake with node 10.0.0.181 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.487 2016: [E] Failed to join remote cluster GPFS_STORAGE.CLUSTER Wed Sep 7 13:33:07.488 2016: [W] Command: err 78: mount GPFS_STORAGE.CLUSTER:gpfs Wed Sep 7 13:33:07.489 2016: Connection timed out In the remote cluster, I see: Wed Sep 7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222 failed with error 447 (server side). Wed Sep 7 13:33:07.488 2016: [X] Connection from 10.10.0.35 refused, authentication failed Wed Sep 7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err 703 Wed Sep 7 13:33:07.490 2016: Operation not permitted Weirdly though on other nodes in the client cluster this succeeds fine and can mount, so I think I got all the bits in the mmauth and mmremotecluster configured correctly. Any suggestions? 
Thanks Simon From volobuev at us.ibm.com Wed Sep 7 17:38:03 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Wed, 7 Sep 2016 09:38:03 -0700 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz><2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov><505ff859-d49a-04cc-bd9d-50f7b2a8df0b@cesnet.cz> Message-ID: Hi Miroslav, The mmdf output is very helpful. It suggests very strongly what the problem is: > ssd_5_5?????????????????? 80G??????? 3 Yes????? No?????????? 22.31G ( 28%)??????? 7.155G ( 9%) > ssd_4_4?????????????????? 80G??????? 3 Yes????? No?????????? 22.21G ( 28%)??????? 7.196G ( 9%) > ssd_3_3?????????????????? 80G??????? 3 Yes????? No??????????? 22.2G ( 28%)??????? 7.239G ( 9%) > ssd_2_2?????????????????? 80G??????? 3 Yes????? No?????????? 22.24G ( 28%)??????? 7.146G ( 9%) > ssd_1_1?????????????????? 80G??????? 3 Yes????? No?????????? 22.29G ( 28%)??????? 7.134G ( 9%) >... > ==================== =================== > (data)???????????????? 1.535P??????????????????????????????? 698.9T ( 44%)???????? 4.17T ( 0%) > (metadata)???????????? 262.3T??????????????????????????????? 92.96T ( 35%)??????? 3.621T ( 1%) >... > Number of allocated inodes:? 161947648 > Maximum number of inodes:?? 1342177280 You have 5 80G SSDs. That's not enough. Even with metadata spread across a couple dozen more SATA disks, SSDs are over 3/4 full. There's no way to accurately estimate the amount of metadata in this file system with the data at hand, but if we (very conservatively) assume that each SATA disk has only as much metadata as each SSD, i.e. ~57G, that would greatly exceed the amount of free space available on your SSDs. You need more free metadata space. Another way to look at this: you got 1.5PB of data under management. A reasonable rule-of-thumb estimate for the amount of metadata is 1-2% of the data (this is a typical ratio, but of course every file system is different, and large deviations are possible. A degenerate case is an fs containing nothing but directories, and in this case metadata usage is 100%). So you have to have at least a few TB of metadata storage. 5 80G SSDs aren't enough for an fs of this size. > Could you please evaluate more on the performance overhead with > having metadata > on SSD+SATA? Are the read operations automatically directed to > faster disks by GPFS? > Is each write operation waiting for write to be finished by SATA disks? Mixing disks with sharply different performance characteristics within a single storage pool is detrimental to performance. GPFS stripes blocks across all disks in a storage pool, expecting all of them to be equally suitable. If SSDs are mixed with SATA disks, the overall metadata write performance is going to be bottlenecked by SATA drives. On reads, given a choice of two replicas, GPFS V4.1.1+ picks the the replica residing on the fastest disk, but given that SSDs represent only a small fraction of your total metadata usage, this likely doesn't help a whole lot. You're on the right track in trying to shift all metadata to SSDs and away from SATA, the overall file system performance will improve as the result. yuri > > Thank you, > -- > Miroslav Bauer > On 09/06/2016 09:06 PM, Yuri L Volobuev wrote: > The correct way to accomplish what you're looking for (in > particular, changing the fs-wide level of replication) is > mmrestripefs -R. This command also takes care of moving data off > disks now marked metadataOnly. 
> > The restripe job hits an error trying to move blocks of the inode > file, i.e. before it gets to actual user data blocks. Note that at > this point the metadata replication factor is still 2. This suggests > one of two possibilities: (1) there isn't enough actual free space > on the remaining metadataOnly disks, (2) there isn't enough space in > some failure groups to allocate two replicas. > > All of this assumes you're operating within a single storage pool. > If multiple storage pools are in play, there are other possibilities. > > 'mmdf' output would be helpful in providing more helpful advice. > With the information at hand, I can only suggest trying to > accomplish the task in two phases: (a) deallocated extra metadata > replicas, by doing mmchfs -m 1 + mmrestripefs -R (b) move metadata > off SATA disks. I do want to point out that metadata replication is > a highly recommended insurance policy to have for your file system. > As with other kinds of insurance, you may or may not need it, but if > you do end up needing it, you'll be very glad you have it. The > costs, in terms of extra metadata space and performance overhead, > are very reasonable. > > yuri > > > Miroslav Bauer ---09/01/2016 07:29:06 AM---Yes, failure group id is > exactly what I meant :). Unfortunately, mmrestripefs with -R > > From: Miroslav Bauer > To: gpfsug-discuss at spectrumscale.org, > Date: 09/01/2016 07:29 AM > Subject: Re: [gpfsug-discuss] Migration to separate metadata and data disks > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Yes, failure group id is exactly what I meant :). Unfortunately, > mmrestripefs with -R > behaves the same as with -r. I also believed that mmrestripefs -R is the > correct tool for > fixing the replication settings on inodes (according to manpages), but I > will try possible > solutions you and Marc suggested and let you know how it went. > > Thank you, > -- > Miroslav Bauer > > On 09/01/2016 04:02 PM, Aaron Knister wrote: > > Oh! I think you've already provided the info I was looking for :) I > > thought that failGroup=3 meant there were 3 failure groups within the > > SSDs. I suspect that's not at all what you meant and that actually is > > the failure group of all of those disks. That I think explains what's > > going on-- there's only one failure group's worth of metadata-capable > > disks available and as such GPFS can't place the 2nd replica for > > existing files. > > > > Here's what I would suggest: > > > > - Create at least 2 failure groups within the SSDs > > - Put the default metadata replication factor back to 2 > > - Run a restripefs -R to shuffle files around and restore the metadata > > replication factor of 2 to any files created while it was set to 1 > > > > If you're not interested in replication for metadata then perhaps all > > you need to do is the mmrestripefs -R. I think that should > > un-replicate the file from the SATA disks leaving the copy on the SSDs. > > > > Hope that helps. > > > > -Aaron > > > > On 9/1/16 9:39 AM, Aaron Knister wrote: > >> By the way, I suspect the no space on device errors are because GPFS > >> believes for some reason that it is unable to maintain the metadata > >> replication factor of 2 that's likely set on all previously created > >> inodes. > >> > >> On 9/1/16 9:36 AM, Aaron Knister wrote: > >>> I must admit, I'm curious as to the reason you're dropping the > >>> replication factor from 2 down to 1. 
There are some serious advantages > >>> we've seen to having multiple metadata replicas, as far as error > >>> recovery is concerned. > >>> > >>> Could you paste an output of mmlsdisk for the filesystem? > >>> > >>> -Aaron > >>> > >>> On 9/1/16 9:30 AM, Miroslav Bauer wrote: > >>>> Hello, > >>>> > >>>> I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the > >>>> filesystem metadata from state: > >>>> -m = 2 (default metadata replicas) > >>>> - SATA disks (dataAndMetadata, failGroup=1) > >>>> - SSDs (metadataOnly, failGroup=3) > >>>> to the desired state: > >>>> -m = 1 > >>>> - SATA disks (dataOnly, failGroup=1) > >>>> - SSDs (metadataOnly, failGroup=3) > >>>> > >>>> I have done the following steps in the following order: > >>>> 1) change SATA disks to dataOnly (stanza file modifies the 'usage' > >>>> attribute only): > >>>> # mmchdisk fs1 change -F dataOnly_disks.stanza > >>>> Attention: Disk parameters were changed. > >>>> ? Use the mmrestripefs command with the -r option to relocate data and > >>>> metadata. > >>>> Verifying file system configuration information ... > >>>> mmchdisk: Propagating the cluster configuration data to all > >>>> ? affected nodes. ?This is an asynchronous process. > >>>> > >>>> 2) change default metadata replicas number 2->1 > >>>> # mmchfs fs1 -m 1 > >>>> > >>>> 3) run mmrestripefs as suggested by output of 1) > >>>> # mmrestripefs fs1 -r > >>>> Scanning file system metadata, phase 1 ... > >>>> Error processing inodes. > >>>> No space left on device > >>>> mmrestripefs: Command failed. ?Examine previous error messages to > >>>> determine cause. > >>>> > >>>> It is, however, still possible to create new files on the filesystem. > >>>> When I return one of the SATA disks as a dataAndMetadata disk, the > >>>> mmrestripefs > >>>> command stops complaining about No space left on device. Both df and > >>>> mmdf > >>>> say that there is enough space both for data (SATA) and metadata > >>>> (SSDs). > >>>> Does anyone have an idea why is it complaining? > >>>> > >>>> Thanks, > >>>> > >>>> -- > >>>> Miroslav Bauer > >>>> > >>>> > >>>> > >>>> > >>>> _______________________________________________ > >>>> gpfsug-discuss mailing list > >>>> gpfsug-discuss at spectrumscale.org > >>>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > >>>> > >>> > >> > > > > > [attachment "smime.p7s" deleted by Yuri L Volobuev/Austin/IBM] > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > [attachment "smime.p7s" deleted by Yuri L Volobuev/Austin/IBM] > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Wed Sep 7 17:58:07 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Wed, 7 Sep 2016 09:58:07 -0700 Subject: [gpfsug-discuss] Remote cluster mount failing In-Reply-To: References: Message-ID: It's unclear what's wrong. I'd have two main suspects: (1) TLS protocol version confusion, due to a difference in GSKit version and/or configuration (e.g. NIST SP800 compliance) on two sides (2) firewall. TLS issues are usually messy and tedious to work though. 
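A few quick comparisons between a client node that mounts successfully and the one that fails can help narrow down those two suspects before (or alongside) the PMR; this is only a first-pass checklist, not a full diagnosis:

# on the failing and a working client node: are the security settings identical?
mmlsconfig cipherList         # security mode / cipher in use
mmlsconfig nistCompliance     # a mismatch here (or in GSKit level) is a classic cause of handshake failures

# on the storage cluster: which client clusters and keys have actually been granted access?
mmauth show all

# on the client cluster: contact nodes and key file being used for the remote cluster
mmremotecluster show all

# also confirm the failing node can reach the storage cluster's daemon port (1191 by default)

If the working and failing client nodes differ in GPFS/GSKit level or nistCompliance setting, or a firewall between the failing node and the contact nodes drops port 1191, either would produce this kind of TLS handshake failure.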
I'd recommend opening a PMR to facilitate debug data collection and analysis. A lot of gory detail may be needed to figure out what's going on. yuri From: "Simon Thompson (Research Computing - IT Services)" To: "gpfsug-discuss at spectrumscale.org" , Date: 09/07/2016 05:37 AM Subject: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi All, I'm trying to get some multi cluster thing working between two of our GPFS clusters. In the "client" cluster, when trying to mount the "remote" cluster, I get: # mmmount gpfs Wed 7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ... mount: mount /dev/gpfs on /gpfs failed: Connection timed out mmmount: Command failed. Examine previous error messages to determine cause. And in the log file: Wed Sep 7 13:33:07.481 2016: [N] The client side TLS handshake with node 10.0.0.182 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.486 2016: [N] The client side TLS handshake with node 10.0.0.181 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.487 2016: [E] Failed to join remote cluster GPFS_STORAGE.CLUSTER Wed Sep 7 13:33:07.488 2016: [W] Command: err 78: mount GPFS_STORAGE.CLUSTER:gpfs Wed Sep 7 13:33:07.489 2016: Connection timed out In the remote cluster, I see: Wed Sep 7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222 failed with error 447 (server side). Wed Sep 7 13:33:07.488 2016: [X] Connection from 10.10.0.35 refused, authentication failed Wed Sep 7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err 703 Wed Sep 7 13:33:07.490 2016: Operation not permitted Weirdly though on other nodes in the client cluster this succeeds fine and can mount, so I think I got all the bits in the mmauth and mmremotecluster configured correctly. Any suggestions? Thanks Simon _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From Valdis.Kletnieks at vt.edu Wed Sep 7 19:45:43 2016 From: Valdis.Kletnieks at vt.edu (Valdis Kletnieks) Date: Wed, 07 Sep 2016 14:45:43 -0400 Subject: [gpfsug-discuss] Weirdness with 'mmces address add' Message-ID: <27691.1473273943@turing-police.cc.vt.edu> We're in the middle of deploying Spectrum Archive, and I've hit a snag. We assigned some floating IP addresses, which now need to be changed. So I look at the mmces manpage, and it looks like I need to add the new addresses, and delete the old ones. We're on GPFS 4.2.1.0, if that matters... What 'man mmces' says: 1. To add an address to a specified node, issue this command: mmces address add --ces-node node1 --ces-ip 10.1.2.3 (and at least 6 or 8 more uses of an IP address). What happens when I try it: (And yes, we have an 'isb' ces-group defined with addresses in it already) # mmces address add --ces-group isb --ces-ip 172.28.45.72 Cannot resolve 172.28.45.72; Name or service not known mmces address add: Incorrect value for --ces-ip option Usage: mmces address add [--ces-node Node] [--attribute Attribute] [--ces-group Group] {--ces-ip {IP[,IP...]} Am I missing some special sauce? 
(My first guess is that it's complaining because there's no PTR in the DNS for that address yet - but if it was going to do DNS lookups, it should be valid to give a hostname rather than an IP address (and nowhere in the manpage does it even *hint* that --ces-ip can be anything other than a list of IP addresses). Or is it time for me to file a PMR? From xhejtman at ics.muni.cz Wed Sep 7 21:11:11 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Wed, 7 Sep 2016 22:11:11 +0200 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: References: Message-ID: <20160907201111.xmksazqjekk2ihsy@ics.muni.cz> On Tue, Sep 06, 2016 at 02:04:36PM +0200, Dominic Mueller-Wicke01 wrote: > Hi Miroslav, > > please use the command: > dsmrecall -resident -detail > or use it with file lists well, it looks like Client Version 7, Release 1, Level 4.4 leaks file descriptors: 09/07/2016 21:03:07 ANS1587W Unable to read extended attributes for object /exports/tape_tape/VO_metacentrum/home/jfeit/atlases/atlases/novo3/atlases/images/.svn/prop-base due to errno: 24, reason: Too many open files after about 15 minutes of run, I can see 88 opened files in /proc/$PID/fd when using: dsmrecall -R -RESid -D /path/* is it something known fixed in newer versions? -- Luk?? Hejtm?nek From taylorm at us.ibm.com Wed Sep 7 21:40:13 2016 From: taylorm at us.ibm.com (Michael L Taylor) Date: Wed, 7 Sep 2016 13:40:13 -0700 Subject: [gpfsug-discuss] Weirdness with 'mmces address add' In-Reply-To: References: Message-ID: Can't be for certain this is what you're hitting but reverse DNS lookup is documented the KC: http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1ins_protocolnodeipfurtherconfig.htm Note: All CES IPs must have an associated hostname and reverse DNS lookup must be configured for each. For more information, see Adding export IPs in Deploying protocols. http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1ins_deployingprotocolstasks.htm Note: Export IPs must have an associated hostname and reverse DNS lookup must be configured for each. Can you make sure the IPs have reverse DNS lookup and try again? Will get the mmces man page updated for address add -------------- next part -------------- An HTML attachment was scrubbed... URL: From Valdis.Kletnieks at vt.edu Wed Sep 7 22:23:30 2016 From: Valdis.Kletnieks at vt.edu (Valdis.Kletnieks at vt.edu) Date: Wed, 07 Sep 2016 17:23:30 -0400 Subject: [gpfsug-discuss] Weirdness with 'mmces address add' In-Reply-To: References: Message-ID: <41089.1473283410@turing-police.cc.vt.edu> On Wed, 07 Sep 2016 13:40:13 -0700, "Michael L Taylor" said: > Can't be for certain this is what you're hitting but reverse DNS lookup is > documented the KC: > Note: All CES IPs must have an associated hostname and reverse DNS lookup > must be configured for each. For more information, see Adding export IPs in > Deploying protocols. Bingo. That was it. Since the DNS will take a while to fix, I fed the appropriate entries to /etc/hosts and it worked fine. I got thrown for a loop because if there is enough code to do that checking, it should be able to accept a hostname as well (RFE time? 
:) From ulmer at ulmer.org Wed Sep 7 22:34:07 2016 From: ulmer at ulmer.org (Stephen Ulmer) Date: Wed, 7 Sep 2016 17:34:07 -0400 Subject: [gpfsug-discuss] Weirdness with 'mmces address add' In-Reply-To: <41089.1473283410@turing-police.cc.vt.edu> References: <41089.1473283410@turing-police.cc.vt.edu> Message-ID: Hostnames can have many A records. IPs *generally* only have one PTR (though it?s not restricted, multiple PTRs is not recommended). Just knowing that you can see why allowing names would create more questions than it answers. So if it did take names instead of IP addresses, it would usually only do what you meant part of the time -- and sometimes none of the time. :) -- Stephen > On Sep 7, 2016, at 5:23 PM, Valdis.Kletnieks at vt.edu wrote: > > On Wed, 07 Sep 2016 13:40:13 -0700, "Michael L Taylor" said: > >> Can't be for certain this is what you're hitting but reverse DNS lookup is >> documented the KC: > >> Note: All CES IPs must have an associated hostname and reverse DNS lookup >> must be configured for each. For more information, see Adding export IPs in >> Deploying protocols. > > Bingo. That was it. Since the DNS will take a while to fix, I fed > the appropriate entries to /etc/hosts and it worked fine. > > I got thrown for a loop because if there is enough code to do that checking, > it should be able to accept a hostname as well (RFE time? :) > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss From Valdis.Kletnieks at vt.edu Wed Sep 7 22:54:05 2016 From: Valdis.Kletnieks at vt.edu (Valdis.Kletnieks at vt.edu) Date: Wed, 07 Sep 2016 17:54:05 -0400 Subject: [gpfsug-discuss] Weirdness with 'mmces address add' In-Reply-To: References: <41089.1473283410@turing-police.cc.vt.edu> Message-ID: <43934.1473285245@turing-police.cc.vt.edu> On Wed, 07 Sep 2016 17:34:07 -0400, Stephen Ulmer said: > Hostnames can have many A records. And quad-A records. :) (Despite our best efforts, we're still one of the 100 biggest IPv6 deployments according to http://www.worldipv6launch.org/measurements/ - were's sitting at 84th in traffic volume and 18th by percent penetration, mostly because we deployed it in production literally last century...) From janfrode at tanso.net Thu Sep 8 06:08:47 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Thu, 08 Sep 2016 05:08:47 +0000 Subject: [gpfsug-discuss] Weirdness with 'mmces address add' In-Reply-To: <27691.1473273943@turing-police.cc.vt.edu> References: <27691.1473273943@turing-police.cc.vt.edu> Message-ID: I believe your first guess is correct. The ces-ip needs to be resolvable for some reason... Just put a name for it in /etc/hosts, if you can't add it to your dns. -jf ons. 7. sep. 2016 kl. 20.45 skrev Valdis Kletnieks : > We're in the middle of deploying Spectrum Archive, and I've hit a > snag. We assigned some floating IP addresses, which now need to > be changed. So I look at the mmces manpage, and it looks like I need > to add the new addresses, and delete the old ones. > > We're on GPFS 4.2.1.0, if that matters... > > What 'man mmces' says: > > 1. To add an address to a specified node, issue this command: > > mmces address add --ces-node node1 --ces-ip 10.1.2.3 > > (and at least 6 or 8 more uses of an IP address). 
> > What happens when I try it: (And yes, we have an 'isb' ces-group defined > with > addresses in it already) > > # mmces address add --ces-group isb --ces-ip 172.28.45.72 > Cannot resolve 172.28.45.72; Name or service not known > mmces address add: Incorrect value for --ces-ip option > Usage: > mmces address add [--ces-node Node] [--attribute Attribute] [--ces-group > Group] > {--ces-ip {IP[,IP...]} > > Am I missing some special sauce? (My first guess is that it's complaining > because there's no PTR in the DNS for that address yet - but if it was > going > to do DNS lookups, it should be valid to give a hostname rather than an IP > address (and nowhere in the manpage does it even *hint* that --ces-ip can > be anything other than a list of IP addresses). > > Or is it time for me to file a PMR? > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From dominic.mueller at de.ibm.com Thu Sep 8 06:35:55 2016 From: dominic.mueller at de.ibm.com (Dominic Mueller-Wicke01) Date: Thu, 8 Sep 2016 07:35:55 +0200 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: References: Message-ID: Please open a PMR for the not working "recall to resident". Some investigation is needed here. Thanks. Greetings, Dominic. From: gpfsug-discuss-request at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Date: 07.09.2016 23:23 Subject: gpfsug-discuss Digest, Vol 56, Issue 14 Sent by: gpfsug-discuss-bounces at spectrumscale.org Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: Remote cluster mount failing (Yuri L Volobuev) 2. Weirdness with 'mmces address add' (Valdis Kletnieks) 3. Re: DMAPI - Unmigrate file to Regular state (Lukas Hejtmanek) 4. Weirdness with 'mmces address add' (Michael L Taylor) 5. Re: Weirdness with 'mmces address add' (Valdis.Kletnieks at vt.edu) ----- Message from "Yuri L Volobuev" on Wed, 7 Sep 2016 09:58:07 -0700 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Remote cluster mount failing It's unclear what's wrong. I'd have two main suspects: (1) TLS protocol version confusion, due to a difference in GSKit version and/or configuration (e.g. NIST SP800 compliance) on two sides (2) firewall. TLS issues are usually messy and tedious to work though. I'd recommend opening a PMR to facilitate debug data collection and analysis. A lot of gory detail may be needed to figure out what's going on. 
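For the debug data collection itself, the usual starting point on the GPFS side is a gpfs.snap from the nodes involved, plus a quick check that the daemon port is open between the clusters. A minimal sketch (the address 10.0.0.181 is one of the storage-cluster nodes from the log excerpts above; everything else here is generic):

  gpfs.snap                 # collect support data on the failing client node and on a storage-cluster contact node
  nc -zv 10.0.0.181 1191    # verify the GPFS daemon port (1191 by default) is reachable, to rule out a firewall

The resulting archives are what typically get attached to the PMR.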
yuri Inactive hide details for "Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to"Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to get some multi cluster thing working between two of our GPFS From: "Simon Thompson (Research Computing - IT Services)" To: "gpfsug-discuss at spectrumscale.org" , Date: 09/07/2016 05:37 AM Subject: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi All, I'm trying to get some multi cluster thing working between two of our GPFS clusters. In the "client" cluster, when trying to mount the "remote" cluster, I get: # mmmount gpfs Wed 7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ... mount: mount /dev/gpfs on /gpfs failed: Connection timed out mmmount: Command failed. Examine previous error messages to determine cause. And in the log file: Wed Sep 7 13:33:07.481 2016: [N] The client side TLS handshake with node 10.0.0.182 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.486 2016: [N] The client side TLS handshake with node 10.0.0.181 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.487 2016: [E] Failed to join remote cluster GPFS_STORAGE.CLUSTER Wed Sep 7 13:33:07.488 2016: [W] Command: err 78: mount GPFS_STORAGE.CLUSTER:gpfs Wed Sep 7 13:33:07.489 2016: Connection timed out In the remote cluster, I see: Wed Sep 7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222 failed with error 447 (server side). Wed Sep 7 13:33:07.488 2016: [X] Connection from 10.10.0.35 refused, authentication failed Wed Sep 7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err 703 Wed Sep 7 13:33:07.490 2016: Operation not permitted Weirdly though on other nodes in the client cluster this succeeds fine and can mount, so I think I got all the bits in the mmauth and mmremotecluster configured correctly. Any suggestions? Thanks Simon _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ----- Message from Valdis Kletnieks on Wed, 07 Sep 2016 14:45:43 -0400 ----- To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] Weirdness with 'mmces address add' We're in the middle of deploying Spectrum Archive, and I've hit a snag. We assigned some floating IP addresses, which now need to be changed. So I look at the mmces manpage, and it looks like I need to add the new addresses, and delete the old ones. We're on GPFS 4.2.1.0, if that matters... What 'man mmces' says: 1. To add an address to a specified node, issue this command: mmces address add --ces-node node1 --ces-ip 10.1.2.3 (and at least 6 or 8 more uses of an IP address). What happens when I try it: (And yes, we have an 'isb' ces-group defined with addresses in it already) # mmces address add --ces-group isb --ces-ip 172.28.45.72 Cannot resolve 172.28.45.72; Name or service not known mmces address add: Incorrect value for --ces-ip option Usage: mmces address add [--ces-node Node] [--attribute Attribute] [--ces-group Group] {--ces-ip {IP[,IP...]} Am I missing some special sauce? (My first guess is that it's complaining because there's no PTR in the DNS for that address yet - but if it was going to do DNS lookups, it should be valid to give a hostname rather than an IP address (and nowhere in the manpage does it even *hint* that --ces-ip can be anything other than a list of IP addresses). 
Or is it time for me to file a PMR? ----- Message from Lukas Hejtmanek on Wed, 7 Sep 2016 22:11:11 +0200 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state On Tue, Sep 06, 2016 at 02:04:36PM +0200, Dominic Mueller-Wicke01 wrote: > Hi Miroslav, > > please use the command: > dsmrecall -resident -detail > or use it with file lists well, it looks like Client Version 7, Release 1, Level 4.4 leaks file descriptors: 09/07/2016 21:03:07 ANS1587W Unable to read extended attributes for object /exports/tape_tape/VO_metacentrum/home/jfeit/atlases/atlases/novo3/atlases/images/.svn/prop-base due to errno: 24, reason: Too many open files after about 15 minutes of run, I can see 88 opened files in /proc/$PID/fd when using: dsmrecall -R -RESid -D /path/* is it something known fixed in newer versions? -- Luk?? Hejtm?nek ----- Message from "Michael L Taylor" on Wed, 7 Sep 2016 13:40:13 -0700 ----- To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] Weirdness with 'mmces address add' Can't be for certain this is what you're hitting but reverse DNS lookup is documented the KC: http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1ins_protocolnodeipfurtherconfig.htm Note: All CES IPs must have an associated hostname and reverse DNS lookup must be configured for each. For more information, see Adding export IPs in Deploying protocols. http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1ins_deployingprotocolstasks.htm Note: Export IPs must have an associated hostname and reverse DNS lookup must be configured for each. Can you make sure the IPs have reverse DNS lookup and try again? Will get the mmces man page updated for address add ----- Message from Valdis.Kletnieks at vt.edu on Wed, 07 Sep 2016 17:23:30 -0400 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Weirdness with 'mmces address add' On Wed, 07 Sep 2016 13:40:13 -0700, "Michael L Taylor" said: > Can't be for certain this is what you're hitting but reverse DNS lookup is > documented the KC: > Note: All CES IPs must have an associated hostname and reverse DNS lookup > must be configured for each. For more information, see Adding export IPs in > Deploying protocols. Bingo. That was it. Since the DNS will take a while to fix, I fed the appropriate entries to /etc/hosts and it worked fine. I got thrown for a loop because if there is enough code to do that checking, it should be able to accept a hostname as well (RFE time? :) _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From S.J.Thompson at bham.ac.uk Fri Sep 9 15:37:28 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Fri, 9 Sep 2016 14:37:28 +0000 Subject: [gpfsug-discuss] Remote cluster mount failing In-Reply-To: References: Message-ID: That?s sorta what I was expecting. Though I was hoping someone might have said 'oh just run mmchconfig ....' or something easy. PMR on its way in. Thanks! 
Simon From: > on behalf of Yuri L Volobuev > Reply-To: "gpfsug-discuss at spectrumscale.org" > Date: Wednesday, 7 September 2016 at 17:58 To: "gpfsug-discuss at spectrumscale.org" > Subject: Re: [gpfsug-discuss] Remote cluster mount failing It's unclear what's wrong. I'd have two main suspects: (1) TLS protocol version confusion, due to a difference in GSKit version and/or configuration (e.g. NIST SP800 compliance) on two sides (2) firewall. TLS issues are usually messy and tedious to work though. I'd recommend opening a PMR to facilitate debug data collection and analysis. A lot of gory detail may be needed to figure out what's going on. yuri [Inactive hide details for "Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to]"Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to get some multi cluster thing working between two of our GPFS From: "Simon Thompson (Research Computing - IT Services)" > To: "gpfsug-discuss at spectrumscale.org" >, Date: 09/07/2016 05:37 AM Subject: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi All, I'm trying to get some multi cluster thing working between two of our GPFS clusters. In the "client" cluster, when trying to mount the "remote" cluster, I get: # mmmount gpfs Wed 7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ... mount: mount /dev/gpfs on /gpfs failed: Connection timed out mmmount: Command failed. Examine previous error messages to determine cause. And in the log file: Wed Sep 7 13:33:07.481 2016: [N] The client side TLS handshake with node 10.0.0.182 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.486 2016: [N] The client side TLS handshake with node 10.0.0.181 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.487 2016: [E] Failed to join remote cluster GPFS_STORAGE.CLUSTER Wed Sep 7 13:33:07.488 2016: [W] Command: err 78: mount GPFS_STORAGE.CLUSTER:gpfs Wed Sep 7 13:33:07.489 2016: Connection timed out In the remote cluster, I see: Wed Sep 7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222 failed with error 447 (server side). Wed Sep 7 13:33:07.488 2016: [X] Connection from 10.10.0.35 refused, authentication failed Wed Sep 7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err 703 Wed Sep 7 13:33:07.490 2016: Operation not permitted Weirdly though on other nodes in the client cluster this succeeds fine and can mount, so I think I got all the bits in the mmauth and mmremotecluster configured correctly. Any suggestions? Thanks Simon _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: graycol.gif URL: From volobuev at us.ibm.com Fri Sep 9 17:29:35 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Fri, 9 Sep 2016 09:29:35 -0700 Subject: [gpfsug-discuss] Remote cluster mount failing In-Reply-To: References: Message-ID: It could be "easy" in the end, e.g. regenerating the key ("mmauth genkey new") may fix the issue. 
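A minimal sketch of that key-regeneration sequence, in case it helps (the cluster name GPFS_STORAGE.CLUSTER is the one from the log excerpts; the key file path used on the client side is illustrative):

  # on the storage cluster: generate a new key pair; the old key remains in effect until committed
  mmauth genkey new
  # copy the new public key (/var/mmfs/ssl/id_rsa.pub) to the client cluster, then on the client cluster:
  mmremotecluster update GPFS_STORAGE.CLUSTER -k /tmp/storage_id_rsa.pub
  # back on the storage cluster, once every remote cluster has picked up the new key:
  mmauth genkey commit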
Figuring out exactly what is going wrong is messy though, and requires looking at a number of debug data points, something that's awkward to do on a public mailing list. I don't think you want to post certificates et al on a mailing list. The PMR channel is more appropriate for this kind of thing. yuri From: "Simon Thompson (Research Computing - IT Services)" To: gpfsug main discussion list , Date: 09/09/2016 07:37 AM Subject: Re: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org That?s sorta what I was expecting. Though I was hoping someone might have said 'oh just run mmchconfig ....' or something easy. PMR on its way in. Thanks! Simon From: on behalf of Yuri L Volobuev Reply-To: "gpfsug-discuss at spectrumscale.org" < gpfsug-discuss at spectrumscale.org> Date: Wednesday, 7 September 2016 at 17:58 To: "gpfsug-discuss at spectrumscale.org" Subject: Re: [gpfsug-discuss] Remote cluster mount failing It's unclear what's wrong. I'd have two main suspects: (1) TLS protocol version confusion, due to a difference in GSKit version and/or configuration (e.g. NIST SP800 compliance) on two sides (2) firewall. TLS issues are usually messy and tedious to work though. I'd recommend opening a PMR to facilitate debug data collection and analysis. A lot of gory detail may be needed to figure out what's going on. yuri Inactive hide details for "Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to"Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to get some multi cluster thing working between two of our GPFS From: "Simon Thompson (Research Computing - IT Services)" < S.J.Thompson at bham.ac.uk> To: "gpfsug-discuss at spectrumscale.org" , Date: 09/07/2016 05:37 AM Subject: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi All, I'm trying to get some multi cluster thing working between two of our GPFS clusters. In the "client" cluster, when trying to mount the "remote" cluster, I get: # mmmount gpfs Wed 7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ... mount: mount /dev/gpfs on /gpfs failed: Connection timed out mmmount: Command failed. Examine previous error messages to determine cause. And in the log file: Wed Sep 7 13:33:07.481 2016: [N] The client side TLS handshake with node 10.0.0.182 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.486 2016: [N] The client side TLS handshake with node 10.0.0.181 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.487 2016: [E] Failed to join remote cluster GPFS_STORAGE.CLUSTER Wed Sep 7 13:33:07.488 2016: [W] Command: err 78: mount GPFS_STORAGE.CLUSTER:gpfs Wed Sep 7 13:33:07.489 2016: Connection timed out In the remote cluster, I see: Wed Sep 7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222 failed with error 447 (server side). Wed Sep 7 13:33:07.488 2016: [X] Connection from 10.10.0.35 refused, authentication failed Wed Sep 7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err 703 Wed Sep 7 13:33:07.490 2016: Operation not permitted Weirdly though on other nodes in the client cluster this succeeds fine and can mount, so I think I got all the bits in the mmauth and mmremotecluster configured correctly. Any suggestions? 
Thanks Simon _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss [attachment "graycol.gif" deleted by Yuri L Volobuev/Austin/IBM] -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From bbanister at jumptrading.com Sat Sep 10 22:50:25 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Sat, 10 Sep 2016 21:50:25 +0000 Subject: [gpfsug-discuss] Edge Attendees In-Reply-To: References: Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB063297AB@CHI-EXCHANGEW1.w2k.jumptrading.com> Hi Doug, Found out that I get to attend this year. Please put me down for the SS NDA round-table, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Douglas O'flaherty Sent: Monday, August 29, 2016 12:34 AM To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] Edge Attendees Greetings: I am organizing an NDA round-table with the IBM Offering Managers at IBM Edge on Tuesday, September 20th at 1pm. The subject will be "The Future of IBM Spectrum Scale." IBM Offering Managers are the Product Owners at IBM. There will be discussions covering licensing, the roadmap for IBM Spectrum Scale RAID (aka GNR), new hardware platforms, etc. This is a unique opportunity to get feedback to the drivers of the IBM Spectrum Scale business plans. It should be a great companion to the content we get from Engineering and Research at most User Group meetings. To get an invitation, please email me privately at douglasof us.ibm.com. All who have a valid NDA are invited. I only need an approximate headcount of attendees. Try not to spam the mailing list. I am pushing to get the Offering Managers to have a similar session at SC16 as an IBM Multi-client Briefing. You can add your voice to that call on this thread, or email me directly. Spectrum Scale User Group at SC16 will once again take place on Sunday afternoon with cocktails to follow. I hope we can blow out the attendance numbers and the number of site speakers we had last year! I know Simon, Bob, and Kristy are already working the agenda. Get your ideas in to them or to me. See you in Vegas, Vegas, SLC, Vegas this Fall... Maybe Australia in between? doug Douglas O'Flaherty IBM Spectrum Storage Marketing ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. 
-------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Sun Sep 11 22:02:48 2016 From: aaron.s.knister at nasa.gov (Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]) Date: Sun, 11 Sep 2016 21:02:48 +0000 Subject: [gpfsug-discuss] GPFS Routers Message-ID: <5F910253243E6A47B81A9A2EB424BBA101D137D1@NDMSMBX404.ndc.nasa.gov> Hi Everyone, A while back I seem to recall hearing about a mechanism being developed that would function similarly to Lustre's LNET routers and effectively allow a single set of NSD servers to talk to multiple RDMA fabrics without requiring the NSD servers to have infiniband interfaces on each RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each fabric that would in effect proxy the RDMA requests to the NSD server. Does anyone know what I'm talking about? Just curious if it's still on the roadmap. -Aaron -------------- next part -------------- An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Sun Sep 11 23:31:56 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Sun, 11 Sep 2016 22:31:56 +0000 Subject: [gpfsug-discuss] Grafana Bridge Code - for GPFS Performance Sensors - Now on the IBM Wiki Message-ID: <2B003708-B2E3-474B-8035-F3A080CB2EAF@nuance.com> IBM has formally published this bridge code - and you can get the details and download it here: IBM Spectrum Scale Performance Monitoring Bridge https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/General%20Parallel%20File%20System%20(GPFS)/page/IBM%20Spectrum%20Scale%20Performance Monitoring%20Bridge Also, see this Storage Community Blog Post (it references version 4.2.2, but I think they mean 4.2.1) http://storagecommunity.org/easyblog/entry/performance-data-graphical-visualization-for-ibm-spectrum-scale-environment I've been using it for a while - if you have any questions, let me know. Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Mon Sep 12 01:00:32 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Sun, 11 Sep 2016 20:00:32 -0400 Subject: [gpfsug-discuss] GPFS Routers In-Reply-To: <5F910253243E6A47B81A9A2EB424BBA101D137D1@NDMSMBX404.ndc.nasa.gov> References: <5F910253243E6A47B81A9A2EB424BBA101D137D1@NDMSMBX404.ndc.nasa.gov> Message-ID: <712e5024-e6eb-9195-d9cd-f59a9b145e60@nasa.gov> After some googling around, I wonder if perhaps what I'm thinking of was an I/O forwarding layer that I understood was being developed for x86_64 type machines rather than some type of GPFS protocol router or proxy. -Aaron On 9/11/16 5:02 PM, Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP] wrote: > Hi Everyone, > > A while back I seem to recall hearing about a mechanism being developed > that would function similarly to Lustre's LNET routers and effectively > allow a single set of NSD servers to talk to multiple RDMA fabrics > without requiring the NSD servers to have infiniband interfaces on each > RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each > fabric that would in effect proxy the RDMA requests to the NSD server. > Does anyone know what I'm talking about? Just curious if it's still on > the roadmap. 
> > -Aaron > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From douglasof at us.ibm.com Mon Sep 12 02:38:08 2016 From: douglasof at us.ibm.com (Douglas O'flaherty) Date: Sun, 11 Sep 2016 21:38:08 -0400 Subject: [gpfsug-discuss] gpfsug-discuss Digest, Vol 56, Issue 17 In-Reply-To: References: Message-ID: See you... and anyone else who can make it in Vegas in a couple weeks! From: gpfsug-discuss-request at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Date: 09/11/2016 07:00 AM Subject: gpfsug-discuss Digest, Vol 56, Issue 17 Sent by: gpfsug-discuss-bounces at spectrumscale.org Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: Edge Attendees (Bryan Banister) ----- Message from Bryan Banister on Sat, 10 Sep 2016 21:50:25 +0000 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Edge Attendees Hi Doug, Found out that I get to attend this year. Please put me down for the SS NDA round-table, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [ mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Douglas O'flaherty Sent: Monday, August 29, 2016 12:34 AM To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] Edge Attendees Greetings: I am organizing an NDA round-table with the IBM Offering Managers at IBM Edge on Tuesday, September 20th at 1pm. The subject will be "The Future of IBM Spectrum Scale." IBM Offering Managers are the Product Owners at IBM. There will be discussions covering licensing, the roadmap for IBM Spectrum Scale RAID (aka GNR), new hardware platforms, etc. This is a unique opportunity to get feedback to the drivers of the IBM Spectrum Scale business plans. It should be a great companion to the content we get from Engineering and Research at most User Group meetings. To get an invitation, please email me privately at douglasof us.ibm.com. All who have a valid NDA are invited. I only need an approximate headcount of attendees. Try not to spam the mailing list. I am pushing to get the Offering Managers to have a similar session at SC16 as an IBM Multi-client Briefing. You can add your voice to that call on this thread, or email me directly. Spectrum Scale User Group at SC16 will once again take place on Sunday afternoon with cocktails to follow. I hope we can blow out the attendance numbers and the number of site speakers we had last year! I know Simon, Bob, and Kristy are already working the agenda. Get your ideas in to them or to me. See you in Vegas, Vegas, SLC, Vegas this Fall... Maybe Australia in between? doug Douglas O'Flaherty IBM Spectrum Storage Marketing Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. 
If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From knop at us.ibm.com Mon Sep 12 06:17:05 2016 From: knop at us.ibm.com (Felipe Knop) Date: Mon, 12 Sep 2016 01:17:05 -0400 Subject: [gpfsug-discuss] Remote cluster mount failing In-Reply-To: References: Message-ID: There is a chance the problem might be related to an upgrade from 3.5 to 4.1, or perhaps a remote mount between versions 3.5 and 4.1. It would be useful to know details related to any such migration and different releases when the PMR is opened. Thanks, Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 From: Yuri L Volobuev/Austin/IBM at IBMUS To: gpfsug main discussion list Date: 09/09/2016 12:30 PM Subject: Re: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org It could be "easy" in the end, e.g. regenerating the key ("mmauth genkey new") may fix the issue. Figuring out exactly what is going wrong is messy though, and requires looking at a number of debug data points, something that's awkward to do on a public mailing list. I don't think you want to post certificates et al on a mailing list. The PMR channel is more appropriate for this kind of thing. yuri "Simon Thompson (Research Computing - IT Services)" ---09/09/2016 07:37:52 AM---That?s sorta what I was expecting. Though I was hoping someone might have said 'oh just run mmchconf From: "Simon Thompson (Research Computing - IT Services)" To: gpfsug main discussion list , Date: 09/09/2016 07:37 AM Subject: Re: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org That?s sorta what I was expecting. Though I was hoping someone might have said 'oh just run mmchconfig ....' or something easy. PMR on its way in. Thanks! Simon From: on behalf of Yuri L Volobuev Reply-To: "gpfsug-discuss at spectrumscale.org" < gpfsug-discuss at spectrumscale.org> Date: Wednesday, 7 September 2016 at 17:58 To: "gpfsug-discuss at spectrumscale.org" Subject: Re: [gpfsug-discuss] Remote cluster mount failing It's unclear what's wrong. I'd have two main suspects: (1) TLS protocol version confusion, due to a difference in GSKit version and/or configuration (e.g. NIST SP800 compliance) on two sides (2) firewall. TLS issues are usually messy and tedious to work though. I'd recommend opening a PMR to facilitate debug data collection and analysis. A lot of gory detail may be needed to figure out what's going on. 
yuri "Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to get some multi cluster thing working between two of our GPFS From: "Simon Thompson (Research Computing - IT Services)" < S.J.Thompson at bham.ac.uk> To: "gpfsug-discuss at spectrumscale.org" , Date: 09/07/2016 05:37 AM Subject: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi All, I'm trying to get some multi cluster thing working between two of our GPFS clusters. In the "client" cluster, when trying to mount the "remote" cluster, I get: # mmmount gpfs Wed 7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ... mount: mount /dev/gpfs on /gpfs failed: Connection timed out mmmount: Command failed. Examine previous error messages to determine cause. And in the log file: Wed Sep 7 13:33:07.481 2016: [N] The client side TLS handshake with node 10.0.0.182 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.486 2016: [N] The client side TLS handshake with node 10.0.0.181 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.487 2016: [E] Failed to join remote cluster GPFS_STORAGE.CLUSTER Wed Sep 7 13:33:07.488 2016: [W] Command: err 78: mount GPFS_STORAGE.CLUSTER:gpfs Wed Sep 7 13:33:07.489 2016: Connection timed out In the remote cluster, I see: Wed Sep 7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222 failed with error 447 (server side). Wed Sep 7 13:33:07.488 2016: [X] Connection from 10.10.0.35 refused, authentication failed Wed Sep 7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err 703 Wed Sep 7 13:33:07.490 2016: Operation not permitted Weirdly though on other nodes in the client cluster this succeeds fine and can mount, so I think I got all the bits in the mmauth and mmremotecluster configured correctly. Any suggestions? Thanks Simon _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss [attachment "graycol.gif" deleted by Yuri L Volobuev/Austin/IBM] _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 105 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 105 bytes Desc: not available URL: From makaplan at us.ibm.com Mon Sep 12 15:48:56 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Mon, 12 Sep 2016 10:48:56 -0400 Subject: [gpfsug-discuss] GPFS Routers In-Reply-To: <712e5024-e6eb-9195-d9cd-f59a9b145e60@nasa.gov> References: <5F910253243E6A47B81A9A2EB424BBA101D137D1@NDMSMBX404.ndc.nasa.gov> <712e5024-e6eb-9195-d9cd-f59a9b145e60@nasa.gov> Message-ID: Perhaps if you clearly describe what equipment and connections you have in place and what you're trying to accomplish, someone on this board can propose a solution. In principle, it's always possible to insert proxies/routers to "fake" any two endpoints into "believing" they are communicating directly. 
From: Aaron Knister To: Date: 09/11/2016 08:01 PM Subject: Re: [gpfsug-discuss] GPFS Routers Sent by: gpfsug-discuss-bounces at spectrumscale.org After some googling around, I wonder if perhaps what I'm thinking of was an I/O forwarding layer that I understood was being developed for x86_64 type machines rather than some type of GPFS protocol router or proxy. -Aaron On 9/11/16 5:02 PM, Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP] wrote: > Hi Everyone, > > A while back I seem to recall hearing about a mechanism being developed > that would function similarly to Lustre's LNET routers and effectively > allow a single set of NSD servers to talk to multiple RDMA fabrics > without requiring the NSD servers to have infiniband interfaces on each > RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each > fabric that would in effect proxy the RDMA requests to the NSD server. > Does anyone know what I'm talking about? Just curious if it's still on > the roadmap. > > -Aaron > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhejtman at ics.muni.cz Mon Sep 12 15:57:55 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Mon, 12 Sep 2016 16:57:55 +0200 Subject: [gpfsug-discuss] gpfs 4.2.1 and samba export Message-ID: <20160912145755.xhx2du4c3aimkkxt@ics.muni.cz> Hello, I have GPFS version 4.2.1 on Centos 7.2 (kernel 3.10.0-327.22.2.el7.x86_64) and I have got some weird behavior of samba. Windows clients get stucked for almost 1 minute when copying files. I traced down the problematic syscall: 27887 16:39:28.000401 utimensat(AT_FDCWD, "000000-My_Documents/Windows/InfusedApps/Packages/Microsoft.Messaging_1.10.22012.0_x86__8wekyb3d8bbwe/SkypeApp/View/HomePage.xaml", {{1473691167, 940424000}, {1473691168, 295355}}, 0) = 0 <74.999775> [...] 27887 16:44:24.000310 utimensat(AT_FDCWD, "000000-My_Documents/Windows/InfusedApps/Packages/Microsoft.Windows.Photos_15.1001.16470.0_x64__8wekyb3d8bbwe/Assets/PhotosAppList.contrast-white_targetsize-16.png", {{1473691463, 931319000}, {1473691464, 96608}}, 0) = 0 <74.999841> [...] 27887 16:50:34.002274 utimensat(AT_FDCWD, "000000-My_Documents/Windows/InfusedApps/Packages/Microsoft.XboxApp_9.9.30030.0_x64__8wekyb3d8bbwe/_Resources/50.rsrc", {{1473691833, 952166000}, {1473691834, 2166223}}, 0) = 0 <74.997877> [...] 27887 16:53:11.000240 utimensat(AT_FDCWD, "000000-My_Documents/Windows/InfusedApps/Packages/Microsoft.ZuneVideo_3.6.13251.0_x64__8wekyb3d8bbwe/Styles/CommonBrushes.xbf", {{1473691990, 948668000}, {1473691991, 131221}}, 0) = 0 <74.999540> it seems that from time to time, utimensat(2) call takes over 70 (!!) seconds. Normal utimensat syscall looks like: 27887 16:55:16.238132 utimensat(AT_FDCWD, "000000-My_Documents/Windows/Installer/$PatchCache$/Managed/00004109210000000000000000F01FEC/14.0.7015/ACEODDBS.DLL", {{1473692116, 196458000}, {1351702318, 0}}, 0) = 0 <0.000065> At the same time, there is untar running. When samba freezes at utimensat call, untar continues to write data to GPFS (same fs as samba), so it does not seem to me as buffers flush. 
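For reference, per-call timings like the <74.999775> figures above can be captured with something along these lines (27887 is the smbd PID from the trace; the output path is arbitrary):

  strace -f -tt -T -e trace=utimensat -p 27887 -o /tmp/smbd-utimensat.trace

The -T flag prints the time spent inside each syscall, which is what exposes the ~75 second stalls.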
When the syscall is stucked, I/O utilization of all GPFS disks is below 10 %. mmfsadm dump waiters shows nothing waiting and any cluster node. So any ideas? Or should I just fire PMR? This is cluster config: clusterId 2745894253048382857 autoload no dmapiFileHandleSize 32 minReleaseLevel 4.2.1.0 ccrEnabled yes maxMBpS 20000 maxblocksize 8M cipherList AUTHONLY maxFilesToCache 10000 nsdSmallThreadRatio 1 nsdMaxWorkerThreads 480 ignorePrefetchLUNCount yes pagepool 48G prefetchThreads 320 worker1Threads 320 writebehindThreshhold 10485760 cifsBypassShareLocksOnRename yes cifsBypassTraversalChecking yes allowWriteWithDeleteChild yes adminMode central And this is file system config: flag value description ------------------- ------------------------ ----------------------------------- -f 65536 Minimum fragment size in bytes -i 4096 Inode size in bytes -I 32768 Indirect block size in bytes -m 1 Default number of metadata replicas -M 2 Maximum number of metadata replicas -r 1 Default number of data replicas -R 2 Maximum number of data replicas -j cluster Block allocation type -D nfs4 File locking semantics in effect -k all ACL semantics in effect -n 32 Estimated number of nodes that will mount file system -B 2097152 Block size -Q user;group;fileset Quotas accounting enabled user;group;fileset Quotas enforced none Default quotas enabled --perfileset-quota Yes Per-fileset quota enforcement --filesetdf Yes Fileset df enabled? -V 15.01 (4.2.0.0) File system version --create-time Wed Aug 24 17:38:40 2016 File system creation time -z No Is DMAPI enabled? -L 4194304 Logfile size -E Yes Exact mtime mount option -S No Suppress atime mount option -K whenpossible Strict replica allocation option --fastea Yes Fast external attributes enabled? --encryption No Encryption enabled? --inode-limit 402653184 Maximum number of inodes in all inode spaces --log-replicas 0 Number of log replicas --is4KAligned Yes is4KAligned? --rapid-repair Yes rapidRepair enabled? --write-cache-threshold 0 HAWC Threshold (max 65536) -P system Disk storage pools in file system -d nsd_A_m;nsd_B_m;nsd_C_m;nsd_D_m;nsd_A_LV1_d;nsd_A_LV2_d;nsd_A_LV3_d;nsd_A_LV4_d;nsd_B_LV1_d;nsd_B_LV2_d;nsd_B_LV3_d;nsd_B_LV4_d;nsd_C_LV1_d;nsd_C_LV2_d;nsd_C_LV3_d; -d nsd_C_LV4_d;nsd_D_LV1_d;nsd_D_LV2_d;nsd_D_LV3_d;nsd_D_LV4_d Disks in file system -A yes Automatic mount option -o none Additional mount options -T /gpfs/vol1 Default mount point --mount-priority 1 Mount priority -- Luk?? Hejtm?nek From chekh at stanford.edu Mon Sep 12 20:03:15 2016 From: chekh at stanford.edu (Alex Chekholko) Date: Mon, 12 Sep 2016 12:03:15 -0700 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? Message-ID: Hi, For a fileset with a quota on it, we have mmlsquota reporting 39TB utilization (out of 50TB quota), with 0 in_doubt. Running a 'du' on the same directory (where the fileset is junctioned) shows 21TB usage. I looked for sparse files (files that report different size via ls vs du). I looked at 'du --apparent-size ...' https://en.wikipedia.org/wiki/Sparse_file What else could it be? Is there some attribute I can scan for inside GPFS? Maybe where FILE_SIZE does not equal KB_ALLOCATED? 
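One way to do that scan is an mmapplypolicy list rule along these lines (a sketch: the rule name, list name, output prefix and the 1 GiB threshold are made up; the path is the fileset junction from the thread):

  RULE EXTERNAL LIST 'sizecheck' EXEC ''
  RULE 'mismatch' LIST 'sizecheck'
       SHOW(VARCHAR(KB_ALLOCATED) || ' KB allocated, ' || VARCHAR(FILE_SIZE) || ' bytes')
       WHERE (KB_ALLOCATED * 1024) > (FILE_SIZE + 1073741824)

  mmapplypolicy /srv/gsfs0/projects/gbsc -P sizecheck.pol -I defer -f /tmp/sizecheck

That should write out the files whose allocation exceeds their apparent size by more than 1 GiB to a list file under the /tmp/sizecheck prefix; flipping the comparison would catch the sparse-file direction instead.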
https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_usngfileattrbts.htm [root at scg-gs0 ~]# du -sm --apparent-size /srv/gsfs0/projects/gbsc/* 3977 /srv/gsfs0/projects/gbsc/Backups 1 /srv/gsfs0/projects/gbsc/benchmark 13109 /srv/gsfs0/projects/gbsc/Billing 198719 /srv/gsfs0/projects/gbsc/Clinical 1 /srv/gsfs0/projects/gbsc/Clinical_Vendors 1206523 /srv/gsfs0/projects/gbsc/Data 1 /srv/gsfs0/projects/gbsc/iPoP 123165 /srv/gsfs0/projects/gbsc/Macrogen 58676 /srv/gsfs0/projects/gbsc/Misc 6625890 /srv/gsfs0/projects/gbsc/mva 1 /srv/gsfs0/projects/gbsc/Proj 17 /srv/gsfs0/projects/gbsc/Projects 3290502 /srv/gsfs0/projects/gbsc/Resources 1 /srv/gsfs0/projects/gbsc/SeqCenter 1 /srv/gsfs0/projects/gbsc/share 514041 /srv/gsfs0/projects/gbsc/SNAP_Scoring 1 /srv/gsfs0/projects/gbsc/TCGA_Variants 267873 /srv/gsfs0/projects/gbsc/tools 9597797 /srv/gsfs0/projects/gbsc/workspace (adds up to about 21TB) [root at scg-gs0 ~]# mmlsquota -j projects.gbsc --block-size=G gsfs0 Block Limits | File Limits Filesystem type GB quota limit in_doubt grace | files quota limit in_doubt grace Remarks gsfs0 FILESET 39889 51200 51200 0 none | 1663212 0 0 4 none [root at scg-gs0 ~]# mmlsfileset gsfs0 |grep gbsc projects.gbsc Linked /srv/gsfs0/projects/gbsc Regards, -- Alex Chekholko chekh at stanford.edu From bbanister at jumptrading.com Mon Sep 12 20:06:59 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Mon, 12 Sep 2016 19:06:59 +0000 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? In-Reply-To: References: Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB0632A645@CHI-EXCHANGEW1.w2k.jumptrading.com> I'd recommend running a mmcheckquota and then check mmlsquota again, -Bryan -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Alex Chekholko Sent: Monday, September 12, 2016 2:03 PM To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? Hi, For a fileset with a quota on it, we have mmlsquota reporting 39TB utilization (out of 50TB quota), with 0 in_doubt. Running a 'du' on the same directory (where the fileset is junctioned) shows 21TB usage. I looked for sparse files (files that report different size via ls vs du). I looked at 'du --apparent-size ...' https://en.wikipedia.org/wiki/Sparse_file What else could it be? Is there some attribute I can scan for inside GPFS? Maybe where FILE_SIZE does not equal KB_ALLOCATED? 
https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_usngfileattrbts.htm [root at scg-gs0 ~]# du -sm --apparent-size /srv/gsfs0/projects/gbsc/* 3977 /srv/gsfs0/projects/gbsc/Backups 1 /srv/gsfs0/projects/gbsc/benchmark 13109 /srv/gsfs0/projects/gbsc/Billing 198719 /srv/gsfs0/projects/gbsc/Clinical 1 /srv/gsfs0/projects/gbsc/Clinical_Vendors 1206523 /srv/gsfs0/projects/gbsc/Data 1 /srv/gsfs0/projects/gbsc/iPoP 123165 /srv/gsfs0/projects/gbsc/Macrogen 58676 /srv/gsfs0/projects/gbsc/Misc 6625890 /srv/gsfs0/projects/gbsc/mva 1 /srv/gsfs0/projects/gbsc/Proj 17 /srv/gsfs0/projects/gbsc/Projects 3290502 /srv/gsfs0/projects/gbsc/Resources 1 /srv/gsfs0/projects/gbsc/SeqCenter 1 /srv/gsfs0/projects/gbsc/share 514041 /srv/gsfs0/projects/gbsc/SNAP_Scoring 1 /srv/gsfs0/projects/gbsc/TCGA_Variants 267873 /srv/gsfs0/projects/gbsc/tools 9597797 /srv/gsfs0/projects/gbsc/workspace (adds up to about 21TB) [root at scg-gs0 ~]# mmlsquota -j projects.gbsc --block-size=G gsfs0 Block Limits | File Limits Filesystem type GB quota limit in_doubt grace | files quota limit in_doubt grace Remarks gsfs0 FILESET 39889 51200 51200 0 none | 1663212 0 0 4 none [root at scg-gs0 ~]# mmlsfileset gsfs0 |grep gbsc projects.gbsc Linked /srv/gsfs0/projects/gbsc Regards, -- Alex Chekholko chekh at stanford.edu _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. From Kevin.Buterbaugh at Vanderbilt.Edu Mon Sep 12 20:08:28 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Mon, 12 Sep 2016 19:08:28 +0000 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? In-Reply-To: References: Message-ID: Hi Alex, While the numbers don?t match exactly, they?re close enough to prompt me to ask if data replication is possibly set to two? Thanks? Kevin On Sep 12, 2016, at 2:03 PM, Alex Chekholko > wrote: Hi, For a fileset with a quota on it, we have mmlsquota reporting 39TB utilization (out of 50TB quota), with 0 in_doubt. Running a 'du' on the same directory (where the fileset is junctioned) shows 21TB usage. I looked for sparse files (files that report different size via ls vs du). I looked at 'du --apparent-size ...' https://en.wikipedia.org/wiki/Sparse_file What else could it be? Is there some attribute I can scan for inside GPFS? Maybe where FILE_SIZE does not equal KB_ALLOCATED? 
https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_usngfileattrbts.htm [root at scg-gs0 ~]# du -sm --apparent-size /srv/gsfs0/projects/gbsc/* 3977 /srv/gsfs0/projects/gbsc/Backups 1 /srv/gsfs0/projects/gbsc/benchmark 13109 /srv/gsfs0/projects/gbsc/Billing 198719 /srv/gsfs0/projects/gbsc/Clinical 1 /srv/gsfs0/projects/gbsc/Clinical_Vendors 1206523 /srv/gsfs0/projects/gbsc/Data 1 /srv/gsfs0/projects/gbsc/iPoP 123165 /srv/gsfs0/projects/gbsc/Macrogen 58676 /srv/gsfs0/projects/gbsc/Misc 6625890 /srv/gsfs0/projects/gbsc/mva 1 /srv/gsfs0/projects/gbsc/Proj 17 /srv/gsfs0/projects/gbsc/Projects 3290502 /srv/gsfs0/projects/gbsc/Resources 1 /srv/gsfs0/projects/gbsc/SeqCenter 1 /srv/gsfs0/projects/gbsc/share 514041 /srv/gsfs0/projects/gbsc/SNAP_Scoring 1 /srv/gsfs0/projects/gbsc/TCGA_Variants 267873 /srv/gsfs0/projects/gbsc/tools 9597797 /srv/gsfs0/projects/gbsc/workspace (adds up to about 21TB) [root at scg-gs0 ~]# mmlsquota -j projects.gbsc --block-size=G gsfs0 Block Limits | File Limits Filesystem type GB quota limit in_doubt grace | files quota limit in_doubt grace Remarks gsfs0 FILESET 39889 51200 51200 0 none | 1663212 0 0 4 none [root at scg-gs0 ~]# mmlsfileset gsfs0 |grep gbsc projects.gbsc Linked /srv/gsfs0/projects/gbsc Regards, -- Alex Chekholko chekh at stanford.edu _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Mon Sep 12 21:26:51 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Mon, 12 Sep 2016 20:26:51 +0000 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? In-Reply-To: References: Message-ID: My thoughts exactly. Richard From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: 12 September 2016 20:08 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? Hi Alex, While the numbers don?t match exactly, they?re close enough to prompt me to ask if data replication is possibly set to two? Thanks? Kevin On Sep 12, 2016, at 2:03 PM, Alex Chekholko > wrote: Hi, For a fileset with a quota on it, we have mmlsquota reporting 39TB utilization (out of 50TB quota), with 0 in_doubt. Running a 'du' on the same directory (where the fileset is junctioned) shows 21TB usage. I looked for sparse files (files that report different size via ls vs du). I looked at 'du --apparent-size ...' https://en.wikipedia.org/wiki/Sparse_file What else could it be? Is there some attribute I can scan for inside GPFS? Maybe where FILE_SIZE does not equal KB_ALLOCATED? 
https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_usngfileattrbts.htm [root at scg-gs0 ~]# du -sm --apparent-size /srv/gsfs0/projects/gbsc/* 3977 /srv/gsfs0/projects/gbsc/Backups 1 /srv/gsfs0/projects/gbsc/benchmark 13109 /srv/gsfs0/projects/gbsc/Billing 198719 /srv/gsfs0/projects/gbsc/Clinical 1 /srv/gsfs0/projects/gbsc/Clinical_Vendors 1206523 /srv/gsfs0/projects/gbsc/Data 1 /srv/gsfs0/projects/gbsc/iPoP 123165 /srv/gsfs0/projects/gbsc/Macrogen 58676 /srv/gsfs0/projects/gbsc/Misc 6625890 /srv/gsfs0/projects/gbsc/mva 1 /srv/gsfs0/projects/gbsc/Proj 17 /srv/gsfs0/projects/gbsc/Projects 3290502 /srv/gsfs0/projects/gbsc/Resources 1 /srv/gsfs0/projects/gbsc/SeqCenter 1 /srv/gsfs0/projects/gbsc/share 514041 /srv/gsfs0/projects/gbsc/SNAP_Scoring 1 /srv/gsfs0/projects/gbsc/TCGA_Variants 267873 /srv/gsfs0/projects/gbsc/tools 9597797 /srv/gsfs0/projects/gbsc/workspace (adds up to about 21TB) [root at scg-gs0 ~]# mmlsquota -j projects.gbsc --block-size=G gsfs0 Block Limits | File Limits Filesystem type GB quota limit in_doubt grace | files quota limit in_doubt grace Remarks gsfs0 FILESET 39889 51200 51200 0 none | 1663212 0 0 4 none [root at scg-gs0 ~]# mmlsfileset gsfs0 |grep gbsc projects.gbsc Linked /srv/gsfs0/projects/gbsc Regards, -- Alex Chekholko chekh at stanford.edu _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From laurence at qsplace.co.uk Mon Sep 12 21:46:55 2016 From: laurence at qsplace.co.uk (Laurence Horrocks-Barlow) Date: Mon, 12 Sep 2016 21:46:55 +0100 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? In-Reply-To: References: Message-ID: <2C38B1C8-66DB-45C6-AA5D-E612F5BFE935@qsplace.co.uk> However replicated files should show up with ls as taking about double the space. I.e. "ls -lash" 49G -r-------- 1 root root 25G Sep 12 21:11 Somefile I know you've said you checked ls vs du for allocated space it might be worth a double check. Also check that you haven't got a load of snapshots, especially if you have high file churn which will create new blocks; although with your figures it'd have to be very high file churn. -- Lauz On 12 September 2016 21:26:51 BST, "Sobey, Richard A" wrote: >My thoughts exactly. > >Richard > >From: gpfsug-discuss-bounces at spectrumscale.org >[mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of >Buterbaugh, Kevin L >Sent: 12 September 2016 20:08 >To: gpfsug main discussion list >Subject: Re: [gpfsug-discuss] big difference between output of >'mmlsquota' and 'du'? > >Hi Alex, > >While the numbers don?t match exactly, they?re close enough to prompt >me to ask if data replication is possibly set to two? Thanks? > >Kevin > >On Sep 12, 2016, at 2:03 PM, Alex Chekholko >> wrote: > >Hi, > >For a fileset with a quota on it, we have mmlsquota reporting 39TB >utilization (out of 50TB quota), with 0 in_doubt. > >Running a 'du' on the same directory (where the fileset is junctioned) >shows 21TB usage. > >I looked for sparse files (files that report different size via ls vs >du). I looked at 'du --apparent-size ...' > >https://en.wikipedia.org/wiki/Sparse_file > >What else could it be? 
> >Is there some attribute I can scan for inside GPFS? >Maybe where FILE_SIZE does not equal KB_ALLOCATED? >https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_usngfileattrbts.htm > > >[root at scg-gs0 ~]# du -sm --apparent-size /srv/gsfs0/projects/gbsc/* >3977 /srv/gsfs0/projects/gbsc/Backups >1 /srv/gsfs0/projects/gbsc/benchmark >13109 /srv/gsfs0/projects/gbsc/Billing >198719 /srv/gsfs0/projects/gbsc/Clinical >1 /srv/gsfs0/projects/gbsc/Clinical_Vendors >1206523 /srv/gsfs0/projects/gbsc/Data >1 /srv/gsfs0/projects/gbsc/iPoP >123165 /srv/gsfs0/projects/gbsc/Macrogen >58676 /srv/gsfs0/projects/gbsc/Misc >6625890 /srv/gsfs0/projects/gbsc/mva >1 /srv/gsfs0/projects/gbsc/Proj >17 /srv/gsfs0/projects/gbsc/Projects >3290502 /srv/gsfs0/projects/gbsc/Resources >1 /srv/gsfs0/projects/gbsc/SeqCenter >1 /srv/gsfs0/projects/gbsc/share >514041 /srv/gsfs0/projects/gbsc/SNAP_Scoring >1 /srv/gsfs0/projects/gbsc/TCGA_Variants >267873 /srv/gsfs0/projects/gbsc/tools >9597797 /srv/gsfs0/projects/gbsc/workspace > >(adds up to about 21TB) > >[root at scg-gs0 ~]# mmlsquota -j projects.gbsc --block-size=G gsfs0 > Block Limits | File Limits >Filesystem type GB quota limit in_doubt >grace | files quota limit in_doubt grace Remarks >gsfs0 FILESET 39889 51200 51200 0 >none | 1663212 0 0 4 none > > >[root at scg-gs0 ~]# mmlsfileset gsfs0 |grep gbsc >projects.gbsc Linked /srv/gsfs0/projects/gbsc > >Regards, >-- >Alex Chekholko chekh at stanford.edu > >_______________________________________________ >gpfsug-discuss mailing list >gpfsug-discuss at spectrumscale.org >http://gpfsug.org/mailman/listinfo/gpfsug-discuss > >? >Kevin Buterbaugh - Senior System Administrator >Vanderbilt University - Advanced Computing Center for Research and >Education >Kevin.Buterbaugh at vanderbilt.edu >- (615)875-9633 > > > > > >------------------------------------------------------------------------ > >_______________________________________________ >gpfsug-discuss mailing list >gpfsug-discuss at spectrumscale.org >http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Sent from my Android device with K-9 Mail. Please excuse my brevity. -------------- next part -------------- An HTML attachment was scrubbed... URL: From janfrode at tanso.net Mon Sep 12 22:37:08 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Mon, 12 Sep 2016 21:37:08 +0000 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? In-Reply-To: References: Message-ID: Maybe you have a huge file open, that's been unlinked and still growing? -jf -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Mon Sep 12 22:59:36 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Mon, 12 Sep 2016 14:59:36 -0700 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and'du'? In-Reply-To: References: Message-ID: 'du' tallies up 'blocks allocated', not file sizes. So it shouldn't matter whether any sparse files are present. GPFS doesn't charge quota for data in snapshots (whether it should is a separate question). The observed discrepancy has two plausible causes: 1) Inaccuracy in quota accounting (more likely) 2) Artefacts of data replication (less likely) Running mmcheckquota in this situation would be a good idea. yuri From: Alex Chekholko To: gpfsug-discuss at spectrumscale.org, Date: 09/12/2016 12:04 PM Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? 
Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi, For a fileset with a quota on it, we have mmlsquota reporting 39TB utilization (out of 50TB quota), with 0 in_doubt. Running a 'du' on the same directory (where the fileset is junctioned) shows 21TB usage. I looked for sparse files (files that report different size via ls vs du). I looked at 'du --apparent-size ...' https://en.wikipedia.org/wiki/Sparse_file What else could it be? Is there some attribute I can scan for inside GPFS? Maybe where FILE_SIZE does not equal KB_ALLOCATED? https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_usngfileattrbts.htm [root at scg-gs0 ~]# du -sm --apparent-size /srv/gsfs0/projects/gbsc/* 3977 /srv/gsfs0/projects/gbsc/Backups 1 /srv/gsfs0/projects/gbsc/benchmark 13109 /srv/gsfs0/projects/gbsc/Billing 198719 /srv/gsfs0/projects/gbsc/Clinical 1 /srv/gsfs0/projects/gbsc/Clinical_Vendors 1206523 /srv/gsfs0/projects/gbsc/Data 1 /srv/gsfs0/projects/gbsc/iPoP 123165 /srv/gsfs0/projects/gbsc/Macrogen 58676 /srv/gsfs0/projects/gbsc/Misc 6625890 /srv/gsfs0/projects/gbsc/mva 1 /srv/gsfs0/projects/gbsc/Proj 17 /srv/gsfs0/projects/gbsc/Projects 3290502 /srv/gsfs0/projects/gbsc/Resources 1 /srv/gsfs0/projects/gbsc/SeqCenter 1 /srv/gsfs0/projects/gbsc/share 514041 /srv/gsfs0/projects/gbsc/SNAP_Scoring 1 /srv/gsfs0/projects/gbsc/TCGA_Variants 267873 /srv/gsfs0/projects/gbsc/tools 9597797 /srv/gsfs0/projects/gbsc/workspace (adds up to about 21TB) [root at scg-gs0 ~]# mmlsquota -j projects.gbsc --block-size=G gsfs0 Block Limits | File Limits Filesystem type GB quota limit in_doubt grace | files quota limit in_doubt grace Remarks gsfs0 FILESET 39889 51200 51200 0 none | 1663212 0 0 4 none [root at scg-gs0 ~]# mmlsfileset gsfs0 |grep gbsc projects.gbsc Linked /srv/gsfs0/projects/gbsc Regards, -- Alex Chekholko chekh at stanford.edu _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From chekh at stanford.edu Mon Sep 12 23:11:12 2016 From: chekh at stanford.edu (Alex Chekholko) Date: Mon, 12 Sep 2016 15:11:12 -0700 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? In-Reply-To: References: Message-ID: Thanks for all the responses. I will look through the filesystem clients for open file handles; we have definitely had deleted open log files of multi-TB size before. The filesystem has replication set to 1. We don't use snapshots. I'm running a 'mmrestripefs -r' (some files were ill-placed from aborted pool migrations) and then I will run an 'mmcheckquota'. On 9/12/16 2:37 PM, Jan-Frode Myklebust wrote: > Maybe you have a huge file open, that's been unlinked and still growing? 
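(A rough, untested sketch of that kind of scan -- the fileset name, filesystem name and scratch paths below are assumptions. A policy LIST rule can print KB_ALLOCATED next to FILE_SIZE for everything in the fileset, so the allocated total can be compared against what the quota subsystem reports; the last line is one way to check the unlinked-but-still-open theory on a client.)

cat > /tmp/alloc.pol <<'EOF'
RULE EXTERNAL LIST 'alloc' EXEC ''
RULE 'listalloc' LIST 'alloc'
     SHOW(varchar(KB_ALLOCATED) || ' ' || varchar(FILE_SIZE))
     FOR FILESET ('projects.gbsc')
EOF
mmapplypolicy gsfs0 -P /tmp/alloc.pol -I defer -f /tmp/scan
# assumed layout of the generated list: inode gen snapid KB_ALLOCATED FILE_SIZE -- path
awk '{kb += $4} END {printf "%.1f TiB allocated\n", kb/2^30}' /tmp/scan.list.alloc
# on each client, large files that are unlinked but still held open:
lsof +L1 /srv/gsfs0 2>/dev/null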
> > > > -jf > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Alex Chekholko chekh at stanford.edu From xhejtman at ics.muni.cz Mon Sep 12 23:30:19 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Tue, 13 Sep 2016 00:30:19 +0200 Subject: [gpfsug-discuss] gpfs snapshots Message-ID: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> Hello, using gpfs 4.2.1, I do about 60 snapshots per day (one snapshot per 15 minutes during working hours). It seems that snapid is increasing only number. Should I be fine with such a number of snapshots per day? I guess we could reach snapid 100,000. I remove all these snapshots during night so I do not keep huge number of snapshots. -- Luk?? Hejtm?nek From volobuev at us.ibm.com Mon Sep 12 23:42:00 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Mon, 12 Sep 2016 15:42:00 -0700 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> Message-ID: The increasing value of snapId is not a problem. Creating snapshots every 15 min is somewhat more frequent than what is customary, but as long as you're able to delete filesets at the same rate you're creating them, this should work OK. yuri From: Lukas Hejtmanek To: gpfsug-discuss at spectrumscale.org, Date: 09/12/2016 03:30 PM Subject: [gpfsug-discuss] gpfs snapshots Sent by: gpfsug-discuss-bounces at spectrumscale.org Hello, using gpfs 4.2.1, I do about 60 snapshots per day (one snapshot per 15 minutes during working hours). It seems that snapid is increasing only number. Should I be fine with such a number of snapshots per day? I guess we could reach snapid 100,000. I remove all these snapshots during night so I do not keep huge number of snapshots. -- Luk?? Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From r.sobey at imperial.ac.uk Tue Sep 13 04:19:30 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Tue, 13 Sep 2016 03:19:30 +0000 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> Message-ID: Don't worry. We do 400+ snapshots every 4 hours and that number is only getting bigger. Don't know what our current snapid count is mind you, can find out when in the office. Get Outlook for Android On Mon, Sep 12, 2016 at 11:30 PM +0100, "Lukas Hejtmanek" > wrote: Hello, using gpfs 4.2.1, I do about 60 snapshots per day (one snapshot per 15 minutes during working hours). It seems that snapid is increasing only number. Should I be fine with such a number of snapshots per day? I guess we could reach snapid 100,000. I remove all these snapshots during night so I do not keep huge number of snapshots. -- Luk?? Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From laurence at qsplace.co.uk Tue Sep 13 05:06:42 2016 From: laurence at qsplace.co.uk (Laurence Horrocks-Barlow) Date: Tue, 13 Sep 2016 05:06:42 +0100 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> Message-ID: <7EAC0DD4-6FC1-4DF5-825E-9E2DD966BA4E@qsplace.co.uk> There are many people doing the same thing so nothing to worry about. As your using 4.2.1 you can at least bulk delete the snapshots using a comma separated list, making life just that little bit easier. -- Lauz On 13 September 2016 04:19:30 BST, "Sobey, Richard A" wrote: >Don't worry. We do 400+ snapshots every 4 hours and that number is only >getting bigger. Don't know what our current snapid count is mind you, >can find out when in the office. > >Get Outlook for Android > > > >On Mon, Sep 12, 2016 at 11:30 PM +0100, "Lukas Hejtmanek" >> wrote: > >Hello, > >using gpfs 4.2.1, I do about 60 snapshots per day (one snapshot per 15 >minutes >during working hours). It seems that snapid is increasing only number. >Should >I be fine with such a number of snapshots per day? I guess we could >reach >snapid 100,000. I remove all these snapshots during night so I do not >keep >huge number of snapshots. > >-- >Luk?? Hejtm?nek >_______________________________________________ >gpfsug-discuss mailing list >gpfsug-discuss at spectrumscale.org >http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > >------------------------------------------------------------------------ > >_______________________________________________ >gpfsug-discuss mailing list >gpfsug-discuss at spectrumscale.org >http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Sent from my Android device with K-9 Mail. Please excuse my brevity. -------------- next part -------------- An HTML attachment was scrubbed... URL: From Valdis.Kletnieks at vt.edu Tue Sep 13 05:32:24 2016 From: Valdis.Kletnieks at vt.edu (Valdis.Kletnieks at vt.edu) Date: Tue, 13 Sep 2016 00:32:24 -0400 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> Message-ID: <20635.1473741144@turing-police.cc.vt.edu> On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: > I guess we could reach snapid 100,000. It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. What you *do* want to do is make the snap *name* meaningful, using a timestamp or something to keep your sanity. mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. From jtucker at pixitmedia.com Tue Sep 13 10:10:02 2016 From: jtucker at pixitmedia.com (Jez Tucker) Date: Tue, 13 Sep 2016 10:10:02 +0100 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: <20635.1473741144@turing-police.cc.vt.edu> References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> <20635.1473741144@turing-police.cc.vt.edu> Message-ID: <00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com> Hey Yuri, Perhaps an RFE here, but could I suggest there is much value in adding a -c option to mmcrsnapshot? Use cases: mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "Before phase 2" and mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "expire:GMT-2017.04.21-16.00.00" Ideally also: mmcrsnapshot fs1 fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr Then it's easy to iterate over snapshots and subsequently mmdelsnapshot snaps which are no longer required. 
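(A minimal sketch of that iterate-and-expire loop, keyed purely off timestamped snapshot names rather than the proposed comment field -- the filesystem name and retention window are assumptions, and fileset-level snapshots would need the fileset qualifier added to mmdelsnapshot.)

#!/bin/bash
FS=mmfs1
KEEP_DAYS=7
# cutoff in the same fixed-width, year-first format as @GMT- snapshot names,
# so a plain string comparison is enough
cutoff=$(date -u -d "$KEEP_DAYS days ago" +%Y.%m.%d-%H.%M.%S)
/usr/lpp/mmfs/bin/mmlssnapshot $FS | awk '/^@GMT-/ {print $1}' | while read snap; do
    stamp=${snap#@GMT-}
    if [[ "$stamp" < "$cutoff" ]]; then
        /usr/lpp/mmfs/bin/mmdelsnapshot $FS "$snap"
    fi
done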
There are lots of methods to achieve this, but without external databases / suchlike, this is rather simple and effective for end users. Alternatively a second comment like -expire flag as user metadata may be preferential. Thoughts? Jez On 13/09/16 05:32, Valdis.Kletnieks at vt.edu wrote: > On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: >> I guess we could reach snapid 100,000. > It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. > > What you *do* want to do is make the snap *name* meaningful, using > a timestamp or something to keep your sanity. > > mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Jez Tucker Head of Research & Product Development Pixit Media www.pixitmedia.com -- This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email. -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Tue Sep 13 21:51:16 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Tue, 13 Sep 2016 13:51:16 -0700 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: <00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com> References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz><20635.1473741144@turing-police.cc.vt.edu> <00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com> Message-ID: Hi Jez, It sounds to me like the functionality that you're _really_ looking for is an ability to to do automated snapshot management, similar to what's available on other storage systems. For example, "create a new snapshot of filesets X, Y, Z every 30 min, keep the last 16 snapshots". I've seen many examples of sysadmins rolling their own snapshot management system along those lines, and an ability to add an expiration string as a snapshot "comment" appears to be merely an aid in keeping such DIY snapshot management scripts a bit simpler -- not by much though. The end user would still be on the hook for some heavy lifting, in particular figuring out a way to run an equivalent of a cluster-aware cron with acceptable fault tolerance semantics. That is, if a snapshot creation is scheduled, only one node in the cluster should attempt to create the snapshot, but if that node fails, another node needs to step in (as opposed to skipping the scheduled snapshot creation). This is doable outside of GPFS, of course, but is not trivial. Architecturally, the right place to implement a fault-tolerant cluster-aware scheduling framework is GPFS itself, as the most complex pieces are already there. We have some plans for work along those lines, but if you want to reinforce the point with an RFE, that would be fine, too. yuri From: Jez Tucker To: gpfsug-discuss at spectrumscale.org, Date: 09/13/2016 02:10 AM Subject: Re: [gpfsug-discuss] gpfs snapshots Sent by: gpfsug-discuss-bounces at spectrumscale.org Hey Yuri, ? 
Perhaps an RFE here, but could I suggest there is much value in adding a -c option to mmcrsnapshot? Use cases: mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "Before phase 2" and mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "expire:GMT-2017.04.21-16.00.00" Ideally also: mmcrsnapshot fs1 fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr Then it's easy to iterate over snapshots and subsequently mmdelsnapshot snaps which are no longer required. There are lots of methods to achieve this, but without external databases / suchlike, this is rather simple and effective for end users. Alternatively a second comment like -expire flag as user metadata may be preferential. Thoughts? Jez On 13/09/16 05:32, Valdis.Kletnieks at vt.edu wrote: On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: I guess we could reach snapid 100,000. It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. What you *do* want to do is make the snap *name* meaningful, using a timestamp or something to keep your sanity. mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Jez Tucker Head of Research & Product Development Pixit Media www.pixitmedia.com This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From xhejtman at ics.muni.cz Tue Sep 13 21:57:52 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Tue, 13 Sep 2016 22:57:52 +0200 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> Message-ID: <20160913205752.3lmmfbhm25mu77j4@ics.muni.cz> Yuri et al. thank you for answers, I should be fine with snapshots as you suggest. On Mon, Sep 12, 2016 at 03:42:00PM -0700, Yuri L Volobuev wrote: > The increasing value of snapId is not a problem. Creating snapshots every > 15 min is somewhat more frequent than what is customary, but as long as > you're able to delete filesets at the same rate you're creating them, this > should work OK. > > yuri > > > > From: Lukas Hejtmanek > To: gpfsug-discuss at spectrumscale.org, > Date: 09/12/2016 03:30 PM > Subject: [gpfsug-discuss] gpfs snapshots > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Hello, > > using gpfs 4.2.1, I do about 60 snapshots per day (one snapshot per 15 > minutes > during working hours). It seems that snapid is increasing only number. > Should > I be fine with such a number of snapshots per day? I guess we could reach > snapid 100,000. 
I remove all these snapshots during night so I do not keep > huge number of snapshots. > > -- > Luk?? Hejtm?nek > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Luk?? Hejtm?nek From S.J.Thompson at bham.ac.uk Tue Sep 13 22:21:59 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Tue, 13 Sep 2016 21:21:59 +0000 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz><20635.1473741144@turing-police.cc.vt.edu> <00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com>, Message-ID: I thought the GUI implemented some form of snapshot scheduler. Personal opinion is that is the wrong place and I agree that is should be core functionality to ensure that the scheduler is running properly. But I would suggest that it might be more than just snapshots people might want to schedule. E.g. An ilm pool flush. Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Yuri L Volobuev [volobuev at us.ibm.com] Sent: 13 September 2016 21:51 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] gpfs snapshots Hi Jez, It sounds to me like the functionality that you're _really_ looking for is an ability to to do automated snapshot management, similar to what's available on other storage systems. For example, "create a new snapshot of filesets X, Y, Z every 30 min, keep the last 16 snapshots". I've seen many examples of sysadmins rolling their own snapshot management system along those lines, and an ability to add an expiration string as a snapshot "comment" appears to be merely an aid in keeping such DIY snapshot management scripts a bit simpler -- not by much though. The end user would still be on the hook for some heavy lifting, in particular figuring out a way to run an equivalent of a cluster-aware cron with acceptable fault tolerance semantics. That is, if a snapshot creation is scheduled, only one node in the cluster should attempt to create the snapshot, but if that node fails, another node needs to step in (as opposed to skipping the scheduled snapshot creation). This is doable outside of GPFS, of course, but is not trivial. Architecturally, the right place to implement a fault-tolerant cluster-aware scheduling framework is GPFS itself, as the most complex pieces are already there. We have some plans for work along those lines, but if you want to reinforce the point with an RFE, that would be fine, too. yuri [Inactive hide details for Jez Tucker ---09/13/2016 02:10:31 AM---Hey Yuri, Perhaps an RFE here, but could I suggest there is]Jez Tucker ---09/13/2016 02:10:31 AM---Hey Yuri, Perhaps an RFE here, but could I suggest there is much value in From: Jez Tucker To: gpfsug-discuss at spectrumscale.org, Date: 09/13/2016 02:10 AM Subject: Re: [gpfsug-discuss] gpfs snapshots Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hey Yuri, Perhaps an RFE here, but could I suggest there is much value in adding a -c option to mmcrsnapshot? 
Use cases: mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "Before phase 2" and mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "expire:GMT-2017.04.21-16.00.00" Ideally also: mmcrsnapshot fs1 fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr Then it's easy to iterate over snapshots and subsequently mmdelsnapshot snaps which are no longer required. There are lots of methods to achieve this, but without external databases / suchlike, this is rather simple and effective for end users. Alternatively a second comment like -expire flag as user metadata may be preferential. Thoughts? Jez On 13/09/16 05:32, Valdis.Kletnieks at vt.edu wrote: On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: I guess we could reach snapid 100,000. It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. What you *do* want to do is make the snap *name* meaningful, using a timestamp or something to keep your sanity. mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Jez Tucker Head of Research & Product Development Pixit Media www.pixitmedia.com This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: graycol.gif URL: From mark.bergman at uphs.upenn.edu Tue Sep 13 22:23:57 2016 From: mark.bergman at uphs.upenn.edu (mark.bergman at uphs.upenn.edu) Date: Tue, 13 Sep 2016 17:23:57 -0400 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: Your message of "Tue, 13 Sep 2016 13:51:16 -0700." References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz><20635.1473741144@turing-police.cc.vt.edu> <00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com> Message-ID: <19294-1473801837.563347@J_5h.TM7K.YXzn> In the message dated: Tue, 13 Sep 2016 13:51:16 -0700, The pithy ruminations from Yuri L Volobuev on were: => => Hi Jez, => => It sounds to me like the functionality that you're _really_ looking for is => an ability to to do automated snapshot management, similar to what's Yep. => available on other storage systems. For example, "create a new snapshot of => filesets X, Y, Z every 30 min, keep the last 16 snapshots". I've seen many Or, take a snapshot every 15min, keep the 4 most recent, expire all except 4 that were created within 6hrs, only 4 created between 6:01-24:00 hh:mm ago, and expire all-but-2 created between 24:01-48:00, etc, as we do. => examples of sysadmins rolling their own snapshot management system along => those lines, and an ability to add an expiration string as a snapshot I'd be glad to distribute our local example of this exercise. 
=> "comment" appears to be merely an aid in keeping such DIY snapshot => management scripts a bit simpler -- not by much though. The end user would => still be on the hook for some heavy lifting, in particular figuring out a => way to run an equivalent of a cluster-aware cron with acceptable fault => tolerance semantics. That is, if a snapshot creation is scheduled, only => one node in the cluster should attempt to create the snapshot, but if that => node fails, another node needs to step in (as opposed to skipping the => scheduled snapshot creation). This is doable outside of GPFS, of course, => but is not trivial. Architecturally, the right place to implement a Ah, that part really is trivial....In our case, the snapshot program takes the filesystem name as an argument... we simply rely on the GPFS fault detection/failover. The job itself runs (via cron) on every GPFS server node, but only creates the snapshot on the server that is the active manager for the specified filesystem: ############################################################################## # Check if the node where this script is running is the GPFS manager node for the # specified filesystem manager=`/usr/lpp/mmfs/bin/mmlsmgr $filesys | grep -w "^$filesys" |awk '{print $2}'` ip addr list | grep -qw "$manager" if [ $? != 0 ] ; then # This node is not the manager...exit exit fi # else ... continue and create the snapshot ################################################################################################### => => yuri => => -- Mark Bergman voice: 215-746-4061 mark.bergman at uphs.upenn.edu fax: 215-614-0266 http://www.cbica.upenn.edu/ IT Technical Director, Center for Biomedical Image Computing and Analytics Department of Radiology University of Pennsylvania PGP Key: http://www.cbica.upenn.edu/sbia/bergman From jtolson at us.ibm.com Tue Sep 13 22:47:02 2016 From: jtolson at us.ibm.com (John T Olson) Date: Tue, 13 Sep 2016 14:47:02 -0700 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz><20635.1473741144@turing-police.cc.vt.edu><00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com>, Message-ID: We do have a general-purpose scheduler on the books as an item that is needed for a future release and as Yuri mentioned it would be cluster wide to avoid the single point of failure with tools like Cron. However, it's one of many things we want to try to get into the product and so we don't have any definite timeline yet. Thanks, John John T. Olson, Ph.D., MI.C., K.EY. Master Inventor, Software Defined Storage 957/9032-1 Tucson, AZ, 85744 (520) 799-5185, tie 321-5185 (FAX: 520-799-4237) Email: jtolson at us.ibm.com "Do or do not. There is no try." - Yoda Olson's Razor: Any situation that we, as humans, can encounter in life can be modeled by either an episode of The Simpsons or Seinfeld. From: "Simon Thompson (Research Computing - IT Services)" To: gpfsug main discussion list Date: 09/13/2016 02:22 PM Subject: Re: [gpfsug-discuss] gpfs snapshots Sent by: gpfsug-discuss-bounces at spectrumscale.org I thought the GUI implemented some form of snapshot scheduler. Personal opinion is that is the wrong place and I agree that is should be core functionality to ensure that the scheduler is running properly. But I would suggest that it might be more than just snapshots people might want to schedule. E.g. An ilm pool flush. 
Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Yuri L Volobuev [volobuev at us.ibm.com] Sent: 13 September 2016 21:51 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] gpfs snapshots Hi Jez, It sounds to me like the functionality that you're _really_ looking for is an ability to to do automated snapshot management, similar to what's available on other storage systems. For example, "create a new snapshot of filesets X, Y, Z every 30 min, keep the last 16 snapshots". I've seen many examples of sysadmins rolling their own snapshot management system along those lines, and an ability to add an expiration string as a snapshot "comment" appears to be merely an aid in keeping such DIY snapshot management scripts a bit simpler -- not by much though. The end user would still be on the hook for some heavy lifting, in particular figuring out a way to run an equivalent of a cluster-aware cron with acceptable fault tolerance semantics. That is, if a snapshot creation is scheduled, only one node in the cluster should attempt to create the snapshot, but if that node fails, another node needs to step in (as opposed to skipping the scheduled snapshot creation). This is doable outside of GPFS, of course, but is not trivial. Architecturally, the right place to implement a fault-tolerant cluster-aware scheduling framework is GPFS itself, as the most complex pieces are already there. We have some plans for work along those lines, but if you want to reinforce the point with an RFE, that would be fine, too. yuri [Inactive hide details for Jez Tucker ---09/13/2016 02:10:31 AM---Hey Yuri, Perhaps an RFE here, but could I suggest there is]Jez Tucker ---09/13/2016 02:10:31 AM---Hey Yuri, Perhaps an RFE here, but could I suggest there is much value in From: Jez Tucker To: gpfsug-discuss at spectrumscale.org, Date: 09/13/2016 02:10 AM Subject: Re: [gpfsug-discuss] gpfs snapshots Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hey Yuri, Perhaps an RFE here, but could I suggest there is much value in adding a -c option to mmcrsnapshot? Use cases: mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "Before phase 2" and mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "expire:GMT-2017.04.21-16.00.00" Ideally also: mmcrsnapshot fs1 fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr Then it's easy to iterate over snapshots and subsequently mmdelsnapshot snaps which are no longer required. There are lots of methods to achieve this, but without external databases / suchlike, this is rather simple and effective for end users. Alternatively a second comment like -expire flag as user metadata may be preferential. Thoughts? Jez On 13/09/16 05:32, Valdis.Kletnieks at vt.edu wrote: On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: I guess we could reach snapid 100,000. It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. What you *do* want to do is make the snap *name* meaningful, using a timestamp or something to keep your sanity. mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. 
_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Jez Tucker Head of Research & Product Development Pixit Media www.pixitmedia.com This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss (See attached file: graycol.gif) _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From jtucker at pixitmedia.com Tue Sep 13 23:28:22 2016 From: jtucker at pixitmedia.com (Jez Tucker) Date: Tue, 13 Sep 2016 23:28:22 +0100 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> <20635.1473741144@turing-police.cc.vt.edu> <00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com> Message-ID: <2336bbd5-39ca-dc0d-e1b4-7a301c6b9f2e@pixitmedia.com> Hey So yes, you're quite right - we have higher order fault tolerant cluster wide methods of dealing with such requirements already. However, I still think the end user should be empowered to be able construct such methods themselves if needs be. Yes, the comment is merely an aid [but also useful as a generic comment field] and as such could be utilised to encode basic metadata into the comment field. I'll log an RFE and see where we go from here. Cheers Jez On 13/09/16 21:51, Yuri L Volobuev wrote: > > Hi Jez, > > It sounds to me like the functionality that you're _really_ looking > for is an ability to to do automated snapshot management, similar to > what's available on other storage systems. For example, "create a new > snapshot of filesets X, Y, Z every 30 min, keep the last 16 > snapshots". I've seen many examples of sysadmins rolling their own > snapshot management system along those lines, and an ability to add an > expiration string as a snapshot "comment" appears to be merely an aid > in keeping such DIY snapshot management scripts a bit simpler -- not > by much though. The end user would still be on the hook for some heavy > lifting, in particular figuring out a way to run an equivalent of a > cluster-aware cron with acceptable fault tolerance semantics. That is, > if a snapshot creation is scheduled, only one node in the cluster > should attempt to create the snapshot, but if that node fails, another > node needs to step in (as opposed to skipping the scheduled snapshot > creation). 
This is doable outside of GPFS, of course, but is not > trivial. Architecturally, the right place to implement a > fault-tolerant cluster-aware scheduling framework is GPFS itself, as > the most complex pieces are already there. We have some plans for work > along those lines, but if you want to reinforce the point with an RFE, > that would be fine, too. > > yuri > > Inactive hide details for Jez Tucker ---09/13/2016 02:10:31 AM---Hey > Yuri, Perhaps an RFE here, but could I suggest there isJez Tucker > ---09/13/2016 02:10:31 AM---Hey Yuri, Perhaps an RFE here, but could I > suggest there is much value in > > From: Jez Tucker > To: gpfsug-discuss at spectrumscale.org, > Date: 09/13/2016 02:10 AM > Subject: Re: [gpfsug-discuss] gpfs snapshots > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------------------------------------------------ > > > > Hey Yuri, > > Perhaps an RFE here, but could I suggest there is much value in > adding a -c option to mmcrsnapshot? > > Use cases: > > mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c > "Before phase 2" > > and > > mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c > "expire:GMT-2017.04.21-16.00.00" > > Ideally also: mmcrsnapshot fs1 > fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr > > Then it's easy to iterate over snapshots and subsequently > mmdelsnapshot snaps which are no longer required. > There are lots of methods to achieve this, but without external > databases / suchlike, this is rather simple and effective for end users. > > Alternatively a second comment like -expire flag as user metadata may > be preferential. > > Thoughts? > > Jez > > > On 13/09/16 05:32, _Valdis.Kletnieks at vt.edu_ > wrote: > > On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: > I guess we could reach snapid 100,000. > > It probably stores the snap ID as a 32 or 64 bit int, so 100K > is peanuts. > > What you *do* want to do is make the snap *name* meaningful, using > a timestamp or something to keep your sanity. > > mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > _http://gpfsug.org/mailman/listinfo/gpfsug-discuss_ > > > > -- > Jez Tucker > Head of Research & Product Development > Pixit Media_ > __www.pixitmedia.com_ > > > This email is confidential in that it is intended for the exclusive > attention of the addressee(s) indicated. If you are not the intended > recipient, this email should not be read or disclosed to any other > person. Please notify the sender immediately and delete this email > from your computer system. 
Any opinions expressed are not necessarily > those of the company from which this email was sent and, whilst to the > best of our knowledge no viruses or defects exist, no responsibility > can be accepted for any loss or damage arising from its receipt or > subsequent use of this > email._______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Jez Tucker Head of Research & Product Development Pixit Media Mobile: +44 (0) 776 419 3820 www.pixitmedia.com -- This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 105 bytes Desc: not available URL: From service at metamodul.com Wed Sep 14 19:10:37 2016 From: service at metamodul.com (service at metamodul.com) Date: Wed, 14 Sep 2016 20:10:37 +0200 Subject: [gpfsug-discuss] gpfs snapshots Message-ID: Why not use a GPFS user extented attribut for that ? In a certain way i see GPFS as a database. ^_^ Hajo Von Samsung Mobile gesendet
-------- Original Message --------
From: Jez Tucker
Date: 2016.09.13 11:10 (GMT+01:00)
To: gpfsug-discuss at spectrumscale.org
Subject: Re: [gpfsug-discuss] gpfs snapshots
Hey Yuri, Perhaps an RFE here, but could I suggest there is much value in adding a -c option to mmcrsnapshot? Use cases: mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "Before phase 2" and mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "expire:GMT-2017.04.21-16.00.00" Ideally also: mmcrsnapshot fs1 fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr Then it's easy to iterate over snapshots and subsequently mmdelsnapshot snaps which are no longer required. There are lots of methods to achieve this, but without external databases / suchlike, this is rather simple and effective for end users. Alternatively a second comment like -expire flag as user metadata may be preferential. Thoughts? Jez On 13/09/16 05:32, Valdis.Kletnieks at vt.edu wrote: On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: I guess we could reach snapid 100,000. It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. What you *do* want to do is make the snap *name* meaningful, using a timestamp or something to keep your sanity. mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Jez Tucker Head of Research & Product Development Pixit Media www.pixitmedia.com This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email. -------------- next part -------------- An HTML attachment was scrubbed... URL: From service at metamodul.com Wed Sep 14 19:21:20 2016 From: service at metamodul.com (service at metamodul.com) Date: Wed, 14 Sep 2016 20:21:20 +0200 Subject: [gpfsug-discuss] gpfs snapshots Message-ID: <4fojjlpuwqoalkffaahy7snf.1473877280415@email.android.com> I am missing since ages such a framework. I had my simple one devoloped on the gpfs callbacks which allowed me to have a centralized cron (HA) up to oracle also ?high available and ha nfs on Aix. Hajo Universal Inventor? -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From jtucker at pixitmedia.com Wed Sep 14 19:49:36 2016 From: jtucker at pixitmedia.com (Jez Tucker) Date: Wed, 14 Sep 2016 19:49:36 +0100 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: References: Message-ID: Hi I still think I'm coming down on the side of simplistic ease of use: Example: [jtucker at pixstor ~]# mmlssnapshot mmfs1 Snapshots in file system mmfs1: Directory SnapId Status Created Fileset Comment @GMT-2016.09.13-23.00.14 551 Valid Wed Sep 14 00:00:02 2016 myproject Prior to phase 1 @GMT-2016.09.14-05.00.14 552 Valid Wed Sep 14 06:00:01 2016 myproject Added this and that @GMT-2016.09.14-11.00.14 553 Valid Wed Sep 14 12:00:01 2016 myproject Merged project2 @GMT-2016.09.14-17.00.14 554 Valid Wed Sep 14 18:00:02 2016 myproject Before clean of .xmp @GMT-2016.09.14-17.05.30 555 Valid Wed Sep 14 18:05:03 2016 myproject Archival Jez On 14/09/16 19:10, service at metamodul.com wrote: > Why not use a GPFS user extented attribut for that ? > In a certain way i see GPFS as a database. ^_^ > Hajo > > > > Von Samsung Mobile gesendet > > > -------- Urspr?ngliche Nachricht -------- > Von: Jez Tucker > Datum:2016.09.13 11:10 (GMT+01:00) > An: gpfsug-discuss at spectrumscale.org > Betreff: Re: [gpfsug-discuss] gpfs snapshots > > Hey Yuri, > > Perhaps an RFE here, but could I suggest there is much value in > adding a -c option to mmcrsnapshot? > > Use cases: > > mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c > "Before phase 2" > > and > > mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c > "expire:GMT-2017.04.21-16.00.00" > > Ideally also: mmcrsnapshot fs1 > fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr > > Then it's easy to iterate over snapshots and subsequently > mmdelsnapshot snaps which are no longer required. > There are lots of methods to achieve this, but without external > databases / suchlike, this is rather simple and effective for end users. > > Alternatively a second comment like -expire flag as user metadata may > be preferential. > > Thoughts? > > Jez > > > On 13/09/16 05:32, Valdis.Kletnieks at vt.edu wrote: >> On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: >>> I guess we could reach snapid 100,000. >> It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. >> >> What you *do* want to do is make the snap *name* meaningful, using >> a timestamp or something to keep your sanity. >> >> mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > -- > Jez Tucker > Head of Research & Product Development > Pixit Media > www.pixitmedia.com > -- Jez Tucker Head of Research & Product Development Pixit Media www.pixitmedia.com -- This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email. -------------- next part -------------- An HTML attachment was scrubbed... 
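(For what it's worth, the extended-attribute idea could be prototyped today with ordinary user xattrs, which GPFS exposes through the standard Linux tools. A very rough sketch follows; since snapshots themselves are read-only, the attribute has to live on a marker file in the live filesystem, and every path and attribute name below is made up.)

REG=/gpfs/mmfs1/.snapmeta          # writable registry directory, one marker file per snapshot
snap=@GMT-2016.09.14-17.05.30
mkdir -p "$REG" && touch "$REG/$snap"
setfattr -n user.snap.expire -v "GMT-2017.04.21-16.00.00" "$REG/$snap"
# an expiry sweep can later read it back and decide whether to mmdelsnapshot:
getfattr --only-values -n user.snap.expire "$REG/$snap"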
URL: From secretary at gpfsug.org Thu Sep 15 09:42:54 2016 From: secretary at gpfsug.org (Secretary GPFS UG) Date: Thu, 15 Sep 2016 09:42:54 +0100 Subject: [gpfsug-discuss] SSUG Meet the Devs - Cloud Workshop Message-ID: Hi everyone, Back by popular demand! We are holding a UK 'Meet the Developers' event to focus on Cloud topics. We are very lucky to have Dean Hildebrand, Master Inventor, Cloud Storage Software from IBM over in the UK to lead this session. IS IT FOR ME? Slightly different to the past meet the devs format, this is a cloud workshop aimed at looking how Spectrum Scale fits in the world of cloud. Rather than being a series of presentations and discussions led by IBM, this workshop aims to look at how Spectrum Scale can be used in cloud environments. This will include using Spectrum Scale as an infrastructure tool to power private cloud deployments. We will also look at the challenges of accessing data from cloud deployments and discuss ways in which this might be accomplished. If you are currently deploying OpenStack on Spectrum Scale, or plan to in the near future, then this workshop is for you. Also if you currently have Spectrum Scale and are wondering how you might get that data into cloud-enabled workloads or are currently doing so, then again you should attend. To ensure that the workshop is focused, numbers are limited and we will initially be limiting to 2 people per organisation/project/site. WHAT WILL BE DISCUSSED? Our topics for the day will include on-premise (private) clouds, on-premise self-service (public) clouds, off-premise clouds (Amazon etc.) as well as covering technologies including OpenStack, Docker, Kubernetes and security requirements around multi-tenancy. We probably don't have all the answers for these, but we'd like to understand the requirements and hear people's ideas. Please let us know what you would like to discuss when you register. Arrival is from 10:00 with discussion kicking off from 10:30. The agenda is open discussion though we do aim to talk over a number of key topics. We hope to have the ever popular (though usually late!) pizza for lunch. WHEN Thursday 20th October 2016 from 10:00 AM to 3:30 PM WHERE IT Services, University of Birmingham - Elms Road Edgbaston, Birmingham, B15 2TT REGISTER Please register for the event in advance: https://www.eventbrite.com/e/ssug-meet-the-devs-cloud-workshop-tickets-27725390389 [1] Numbers are limited and we will initially be limiting to 2 people per organisation/project/site. We look forward to seeing you there! -- Claire O'Toole Spectrum Scale/GPFS User Group Secretary +44 (0)7508 033896 www.spectrumscaleug.org Links: ------ [1] https://www.eventbrite.com/e/ssug-meet-the-devs-cloud-workshop-tickets-27725390389 -------------- next part -------------- An HTML attachment was scrubbed... URL: From peter.botcherby at kcl.ac.uk Thu Sep 15 09:45:47 2016 From: peter.botcherby at kcl.ac.uk (Botcherby, Peter) Date: Thu, 15 Sep 2016 08:45:47 +0000 Subject: [gpfsug-discuss] SSUG Meet the Devs - Cloud Workshop In-Reply-To: References: Message-ID: Hi Claire, Hope you are well - I will be away for this as going to Indonesia on the 18th October for my nephew?s wedding. Regards Peter From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Secretary GPFS UG Sent: 15 September 2016 09:43 To: gpfsug main discussion list Subject: [gpfsug-discuss] SSUG Meet the Devs - Cloud Workshop Hi everyone, Back by popular demand! 
We are holding a UK 'Meet the Developers' event to focus on Cloud topics. We are very lucky to have Dean Hildebrand, Master Inventor, Cloud Storage Software from IBM over in the UK to lead this session. IS IT FOR ME? Slightly different to the past meet the devs format, this is a cloud workshop aimed at looking how Spectrum Scale fits in the world of cloud. Rather than being a series of presentations and discussions led by IBM, this workshop aims to look at how Spectrum Scale can be used in cloud environments. This will include using Spectrum Scale as an infrastructure tool to power private cloud deployments. We will also look at the challenges of accessing data from cloud deployments and discuss ways in which this might be accomplished. If you are currently deploying OpenStack on Spectrum Scale, or plan to in the near future, then this workshop is for you. Also if you currently have Spectrum Scale and are wondering how you might get that data into cloud-enabled workloads or are currently doing so, then again you should attend. To ensure that the workshop is focused, numbers are limited and we will initially be limiting to 2 people per organisation/project/site. WHAT WILL BE DISCUSSED? Our topics for the day will include on-premise (private) clouds, on-premise self-service (public) clouds, off-premise clouds (Amazon etc.) as well as covering technologies including OpenStack, Docker, Kubernetes and security requirements around multi-tenancy. We probably don't have all the answers for these, but we'd like to understand the requirements and hear people's ideas. Please let us know what you would like to discuss when you register. Arrival is from 10:00 with discussion kicking off from 10:30. The agenda is open discussion though we do aim to talk over a number of key topics. We hope to have the ever popular (though usually late!) pizza for lunch. WHEN Thursday 20th October 2016 from 10:00 AM to 3:30 PM WHERE IT Services, University of Birmingham - Elms Road Edgbaston, Birmingham, B15 2TT REGISTER Please register for the event in advance: https://www.eventbrite.com/e/ssug-meet-the-devs-cloud-workshop-tickets-27725390389 Numbers are limited and we will initially be limiting to 2 people per organisation/project/site. We look forward to seeing you there! -- Claire O'Toole Spectrum Scale/GPFS User Group Secretary +44 (0)7508 033896 www.spectrumscaleug.org -------------- next part -------------- An HTML attachment was scrubbed... URL: From mimarsh2 at vt.edu Thu Sep 15 17:49:27 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Thu, 15 Sep 2016 12:49:27 -0400 Subject: [gpfsug-discuss] EDR and omnipath Message-ID: All, I see in the GPFS FAQ A6.3 the statement below. Is it possible to have GPFS do RDMA over EDR infiniband and non-RDMA communication over omnipath (IP over fabric) when each NSD server has an EDR card and a OPA card installed? RDMA is not supported on a node when both Mellanox HCAs and Intel Omni-Path HFIs are enabled for RDMA. -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Thu Sep 15 16:33:17 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Thu, 15 Sep 2016 15:33:17 +0000 Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes Message-ID: Why was mmdelsnapshot (and possibly other snapshot related commands) changed from: Mmdelsnapshot device snapshotname -j filesetname TO Mmdelsnapshot device filesetname:snapshotname ..between 4.2.0 and 4.2.1? It's mildly irritating to say the least! 
Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Fri Sep 16 15:21:58 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Fri, 16 Sep 2016 10:21:58 -0400 Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes In-Reply-To: References: Message-ID: I think at least the most popular old forms still work, even if the documentation and usage messages were scrubbed. So generally, for example, neither your scripts nor your fingers will break. ;-) From: "Sobey, Richard A" To: "'gpfsug-discuss at spectrumscale.org'" Date: 09/16/2016 07:02 AM Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes Sent by: gpfsug-discuss-bounces at spectrumscale.org Why was mmdelsnapshot (and possibly other snapshot related commands) changed from: Mmdelsnapshot device snapshotname ?j filesetname TO Mmdelsnapshot device filesetname:snapshotname ..between 4.2.0 and 4.2.1? It?s mildly irritating to say the least! Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Fri Sep 16 15:40:52 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 16 Sep 2016 14:40:52 +0000 Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes In-Reply-To: References: Message-ID: Thanks Marc. Regrettably in this case, the only way I knew to delete a snapshot (listed below) has broken going from 3.5 to 4.2.1. Creating snaps has suffered the same fate. From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Marc A Kaplan Sent: 16 September 2016 15:22 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes I think at least the most popular old forms still work, even if the documentation and usage messages were scrubbed. So generally, for example, neither your scripts nor your fingers will break. ;-) From: "Sobey, Richard A" > To: "'gpfsug-discuss at spectrumscale.org'" > Date: 09/16/2016 07:02 AM Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Why was mmdelsnapshot (and possibly other snapshot related commands) changed from: Mmdelsnapshot device snapshotname ?j filesetname TO Mmdelsnapshot device filesetname:snapshotname ..between 4.2.0 and 4.2.1? It?s mildly irritating to say the least! Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From Paul.Sanchez at deshaw.com Fri Sep 16 20:49:14 2016 From: Paul.Sanchez at deshaw.com (Sanchez, Paul) Date: Fri, 16 Sep 2016 19:49:14 +0000 Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes In-Reply-To: References: Message-ID: <3e1f02b30e1a49ef950de7910801f5d1@mbxtoa1.winmail.deshaw.com> The old syntax works unless have a colon in your snapshot names. In that case, the portion before the first colon will be interpreted as a fileset name. 
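(For scripts that have to survive the transition, a small compatibility wrapper along these lines -- the function and variable names are just examples, untested -- can try the new fileset:snapshot form first and fall back to the old -j form.)

del_fileset_snap() {
    local fs=$1 fileset=$2 snap=$3
    /usr/lpp/mmfs/bin/mmdelsnapshot "$fs" "${fileset}:${snap}" 2>/dev/null \
        || /usr/lpp/mmfs/bin/mmdelsnapshot "$fs" "$snap" -j "$fileset"
}
# e.g. del_fileset_snap gpfs0 myfileset @GMT-2016.09.16-12.00.00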
So if you use RFC 3339/ISO 8601 date/times, that?s a problem: The syntax for creating and deleting snapshots goes from this: mm{cr|del}snapshot fs100 SNAP at 2016-07-31T13:00:07Z ?j 1000466 to this: mm{cr|del}snapshot fs100 1000466:SNAP at 2016-07-31T13:00:07Z If you are dealing with filesystem level snapshots then you just need a leading colon: mm{cr|del}snapshot fs100 :SNAP at 2016-07-31T13:00:07Z Thx Paul From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Marc A Kaplan Sent: Friday, September 16, 2016 10:22 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes I think at least the most popular old forms still work, even if the documentation and usage messages were scrubbed. So generally, for example, neither your scripts nor your fingers will break. ;-) From: "Sobey, Richard A" > To: "'gpfsug-discuss at spectrumscale.org'" > Date: 09/16/2016 07:02 AM Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Why was mmdelsnapshot (and possibly other snapshot related commands) changed from: Mmdelsnapshot device snapshotname ?j filesetname TO Mmdelsnapshot device filesetname:snapshotname ..between 4.2.0 and 4.2.1? It?s mildly irritating to say the least! Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From viccornell at gmail.com Mon Sep 19 08:11:38 2016 From: viccornell at gmail.com (Vic Cornell) Date: Mon, 19 Sep 2016 08:11:38 +0100 Subject: [gpfsug-discuss] EDR and omnipath In-Reply-To: References: Message-ID: <87E46193-4D65-41A3-AB0E-B12987F6FFC3@gmail.com> Bump I can see no reason why that wouldn't work. But it would be nice to a have an official answer or evidence that it works. Vic > On 15 Sep 2016, at 5:49 pm, Brian Marshall wrote: > > All, > > I see in the GPFS FAQ A6.3 the statement below. Is it possible to have GPFS do RDMA over EDR infiniband and non-RDMA communication over omnipath (IP over fabric) when each NSD server has an EDR card and a OPA card installed? > > > > RDMA is not supported on a node when both Mellanox HCAs and Intel Omni-Path HFIs are enabled for RDMA. > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From mweil at wustl.edu Mon Sep 19 20:18:18 2016 From: mweil at wustl.edu (Matt Weil) Date: Mon, 19 Sep 2016 14:18:18 -0500 Subject: [gpfsug-discuss] increasing inode Message-ID: All, What exactly happens that makes the clients hang when a file set inodes are increased? ________________________________ The materials in this message are private and may contain Protected Healthcare Information or other information of a sensitive nature. If you are not the intended recipient, be advised that any unauthorized use, disclosure, copying or the taking of any action in reliance on the contents of this information is strictly prohibited. If you have received this email in error, please immediately notify the sender via telephone or return mail. 
From aaron.s.knister at nasa.gov Mon Sep 19 21:34:53 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 19 Sep 2016 16:34:53 -0400 Subject: [gpfsug-discuss] EDR and omnipath In-Reply-To: <87E46193-4D65-41A3-AB0E-B12987F6FFC3@gmail.com> References: <87E46193-4D65-41A3-AB0E-B12987F6FFC3@gmail.com> Message-ID: <6f33c20f-ff8f-9ccc-4609-920b35058138@nasa.gov> I must admit, I'm curious as to why one cannot use GPFS with IB and OPA both in RDMA mode. Granted, I know very little about OPA but if it just presents as another verbs device I wonder why it wouldn't "Just work" as long as GPFS is configured correctly. -Aaron On 9/19/16 3:11 AM, Vic Cornell wrote: > Bump > > I can see no reason why that wouldn't work. But it would be nice to a > have an official answer or evidence that it works. > > Vic > > >> On 15 Sep 2016, at 5:49 pm, Brian Marshall > > wrote: >> >> All, >> >> I see in the GPFS FAQ A6.3 the statement below. Is it possible to >> have GPFS do RDMA over EDR infiniband and non-RDMA communication over >> omnipath (IP over fabric) when each NSD server has an EDR card and a >> OPA card installed? >> >> >> >> RDMA is not supported on a node when both Mellanox HCAs and Intel >> Omni-Path HFIs are enabled for RDMA. >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From oehmes at us.ibm.com Mon Sep 19 21:43:31 2016 From: oehmes at us.ibm.com (Sven Oehme) Date: Mon, 19 Sep 2016 20:43:31 +0000 Subject: [gpfsug-discuss] EDR and omnipath In-Reply-To: <6f33c20f-ff8f-9ccc-4609-920b35058138@nasa.gov> Message-ID: Because they both require a different distribution of OFED, which are mutual exclusive to install. in theory if you deploy plain OFED it might work, but that will be hard to find somebody to support. Sent from IBM Verse Aaron Knister --- Re: [gpfsug-discuss] EDR and omnipath --- From:"Aaron Knister" To:gpfsug-discuss at spectrumscale.orgDate:Mon, Sep 19, 2016 1:35 PMSubject:Re: [gpfsug-discuss] EDR and omnipath I must admit, I'm curious as to why one cannot use GPFS with IB and OPA both in RDMA mode. Granted, I know very little about OPA but if it just presents as another verbs device I wonder why it wouldn't "Just work" as long as GPFS is configured correctly.-AaronOn 9/19/16 3:11 AM, Vic Cornell wrote:> Bump>> I can see no reason why that wouldn't work. But it would be nice to a> have an official answer or evidence that it works.>> Vic>>>> On 15 Sep 2016, at 5:49 pm, Brian Marshall > > wrote:>>>> All,>>>> I see in the GPFS FAQ A6.3 the statement below. 
Is it possible to>> have GPFS do RDMA over EDR infiniband and non-RDMA communication over>> omnipath (IP over fabric) when each NSD server has an EDR card and a>> OPA card installed?>>>>>>>> RDMA is not supported on a node when both Mellanox HCAs and Intel>> Omni-Path HFIs are enabled for RDMA.>>>> _______________________________________________>> gpfsug-discuss mailing list>> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss>>>> _______________________________________________> gpfsug-discuss mailing list> gpfsug-discuss at spectrumscale.org> http://gpfsug.org/mailman/listinfo/gpfsug-discuss>-- Aaron KnisterNASA Center for Climate Simulation (Code 606.2)Goddard Space Flight Center(301) 286-2776_______________________________________________gpfsug-discuss mailing listgpfsug-discuss at spectrumscale.orghttp://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Mon Sep 19 21:55:32 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 19 Sep 2016 16:55:32 -0400 Subject: [gpfsug-discuss] EDR and omnipath In-Reply-To: References: Message-ID: Ah, that makes complete sense. Thanks! I had been doing some reading about OmniPath and for some reason was under the impression the OmniPath adapter could load itself as a driver under the verbs stack of OFED. Even so, that raises support concerns as you say. I wonder what folks are doing who have IB-based block storage fabrics but wanting to connect to OmniPath-based fabrics? I'm also curious how GNR customers would be able to serve both IB-based and an OmniPath-based fabrics over RDMA where performance is best? This is is along the lines of my GPFS protocol router question from the other day. -Aaron On 9/19/16 4:43 PM, Sven Oehme wrote: > Because they both require a different distribution of OFED, which are > mutual exclusive to install. > in theory if you deploy plain OFED it might work, but that will be hard > to find somebody to support. > > > Sent from IBM Verse > > Aaron Knister --- Re: [gpfsug-discuss] EDR and omnipath --- > > From: "Aaron Knister" > To: gpfsug-discuss at spectrumscale.org > Date: Mon, Sep 19, 2016 1:35 PM > Subject: Re: [gpfsug-discuss] EDR and omnipath > > ------------------------------------------------------------------------ > > I must admit, I'm curious as to why one cannot use GPFS with IB and OPA > both in RDMA mode. Granted, I know very little about OPA but if it just > presents as another verbs device I wonder why it wouldn't "Just work" as > long as GPFS is configured correctly. > > -Aaron > > On 9/19/16 3:11 AM, Vic Cornell wrote: >> Bump >> >> I can see no reason why that wouldn't work. But it would be nice to a >> have an official answer or evidence that it works. >> >> Vic >> >> >>> On 15 Sep 2016, at 5:49 pm, Brian Marshall >> > wrote: >>> >>> All, >>> >>> I see in the GPFS FAQ A6.3 the statement below. Is it possible to >>> have GPFS do RDMA over EDR infiniband and non-RDMA communication over >>> omnipath (IP over fabric) when each NSD server has an EDR card and a >>> OPA card installed? >>> >>> >>> >>> RDMA is not supported on a node when both Mellanox HCAs and Intel >>> Omni-Path HFIs are enabled for RDMA. 
>>> >>> _______________________________________________ >>> gpfsug-discuss mailing list >>> gpfsug-discuss at spectrumscale.org >>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From aaron.s.knister at nasa.gov Mon Sep 19 22:03:51 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 19 Sep 2016 17:03:51 -0400 Subject: [gpfsug-discuss] EDR and omnipath In-Reply-To: References: Message-ID: <99103c73-baf0-f421-f64d-1d5ee916d340@nasa.gov> Here's where I read about the inter-operability of the two: http://www.intel.com/content/dam/www/public/us/en/documents/white-papers/omni-path-storage-white-paper.pdf This is what Intel says: > In a multi-homed file system server, or in a Lustre Networking (LNet) or IP router, a single OpenFabrics Al- liance (OFA) software environment supporting both an Intel OPA HFI and a Mellanox* InfiniBand HCA is required. The OFA software stack is architected to support multiple tar- geted network types. Currently, the OFA stack simultaneously supports iWARP for Ethernet, RDMA over Converged Ethernet (RoCE), and InfiniBand networks, and the Intel OPA network has been added to that list. As the OS distributions implement their OFA stacks, it will be validated to simultaneously support both Intel OPA Host > Intel is working closely with the major Linux distributors, including Red Hat* and SUSE*, to ensure that Intel OPA support is integrated into their OFA implementation. Once this is accomplished, then simultaneous Mellanox InfiniBand and Intel OPA support will be present in the standard Linux distributions. So it seems as though Intel is relying on the OS vendors to bridge the support gap between them and Mellanox. -Aaron On 9/19/16 4:55 PM, Aaron Knister wrote: > Ah, that makes complete sense. Thanks! > > I had been doing some reading about OmniPath and for some reason was > under the impression the OmniPath adapter could load itself as a driver > under the verbs stack of OFED. Even so, that raises support concerns as > you say. > > I wonder what folks are doing who have IB-based block storage fabrics > but wanting to connect to OmniPath-based fabrics? > > I'm also curious how GNR customers would be able to serve both IB-based > and an OmniPath-based fabrics over RDMA where performance is best? This > is is along the lines of my GPFS protocol router question from the other > day. > > -Aaron > > On 9/19/16 4:43 PM, Sven Oehme wrote: >> Because they both require a different distribution of OFED, which are >> mutual exclusive to install. >> in theory if you deploy plain OFED it might work, but that will be hard >> to find somebody to support. 
>> >> >> Sent from IBM Verse >> >> Aaron Knister --- Re: [gpfsug-discuss] EDR and omnipath --- >> >> From: "Aaron Knister" >> To: gpfsug-discuss at spectrumscale.org >> Date: Mon, Sep 19, 2016 1:35 PM >> Subject: Re: [gpfsug-discuss] EDR and omnipath >> >> ------------------------------------------------------------------------ >> >> I must admit, I'm curious as to why one cannot use GPFS with IB and OPA >> both in RDMA mode. Granted, I know very little about OPA but if it just >> presents as another verbs device I wonder why it wouldn't "Just work" as >> long as GPFS is configured correctly. >> >> -Aaron >> >> On 9/19/16 3:11 AM, Vic Cornell wrote: >>> Bump >>> >>> I can see no reason why that wouldn't work. But it would be nice to a >>> have an official answer or evidence that it works. >>> >>> Vic >>> >>> >>>> On 15 Sep 2016, at 5:49 pm, Brian Marshall >>> > wrote: >>>> >>>> All, >>>> >>>> I see in the GPFS FAQ A6.3 the statement below. Is it possible to >>>> have GPFS do RDMA over EDR infiniband and non-RDMA communication over >>>> omnipath (IP over fabric) when each NSD server has an EDR card and a >>>> OPA card installed? >>>> >>>> >>>> >>>> RDMA is not supported on a node when both Mellanox HCAs and Intel >>>> Omni-Path HFIs are enabled for RDMA. >>>> >>>> _______________________________________________ >>>> gpfsug-discuss mailing list >>>> gpfsug-discuss at spectrumscale.org >>>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>> >>> >>> >>> _______________________________________________ >>> gpfsug-discuss mailing list >>> gpfsug-discuss at spectrumscale.org >>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>> >> >> -- >> Aaron Knister >> NASA Center for Climate Simulation (Code 606.2) >> Goddard Space Flight Center >> (301) 286-2776 >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From aaron.s.knister at nasa.gov Tue Sep 20 14:22:51 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Tue, 20 Sep 2016 09:22:51 -0400 Subject: [gpfsug-discuss] GPFS Routers In-Reply-To: References: <5F910253243E6A47B81A9A2EB424BBA101D137D1@NDMSMBX404.ndc.nasa.gov> <712e5024-e6eb-9195-d9cd-f59a9b145e60@nasa.gov> Message-ID: Hi Marc, Currently we serve three disparate infiniband fabrics with three separate sets of NSD servers all connected via FC to backend storage. I was exploring the idea of flipping that on its head and having one set of NSD servers but would like something akin to Lustre LNET routers to connect each fabric to the back-end NSD servers over IB. I know there's IB routers out there now but I'm quite drawn to the idea of a GPFS equivalent of Lustre LNET routers, having used them in the past. I suppose I could always smush some extra HCAs in the NSD servers and do it that way but that got really ugly when I started factoring in omnipath. Something like an LNET router would also be useful for GNR users who would like to present to both an IB and an OmniPath fabric over RDMA. 
-Aaron On 9/12/16 10:48 AM, Marc A Kaplan wrote: > Perhaps if you clearly describe what equipment and connections you have > in place and what you're trying to accomplish, someone on this board can > propose a solution. > > In principle, it's always possible to insert proxies/routers to "fake" > any two endpoints into "believing" they are communicating directly. > > > > > > From: Aaron Knister > To: > Date: 09/11/2016 08:01 PM > Subject: Re: [gpfsug-discuss] GPFS Routers > Sent by: gpfsug-discuss-bounces at spectrumscale.org > ------------------------------------------------------------------------ > > > > After some googling around, I wonder if perhaps what I'm thinking of was > an I/O forwarding layer that I understood was being developed for x86_64 > type machines rather than some type of GPFS protocol router or proxy. > > -Aaron > > On 9/11/16 5:02 PM, Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE > CORP] wrote: >> Hi Everyone, >> >> A while back I seem to recall hearing about a mechanism being developed >> that would function similarly to Lustre's LNET routers and effectively >> allow a single set of NSD servers to talk to multiple RDMA fabrics >> without requiring the NSD servers to have infiniband interfaces on each >> RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each >> fabric that would in effect proxy the RDMA requests to the NSD server. >> Does anyone know what I'm talking about? Just curious if it's still on >> the roadmap. >> >> -Aaron >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From makaplan at us.ibm.com Tue Sep 20 15:01:49 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Tue, 20 Sep 2016 10:01:49 -0400 Subject: [gpfsug-discuss] GPFS Routers In-Reply-To: References: <5F910253243E6A47B81A9A2EB424BBA101D137D1@NDMSMBX404.ndc.nasa.gov><712e5024-e6eb-9195-d9cd-f59a9b145e60@nasa.gov> Message-ID: Thanks for spelling out the situation more clearly. This is beyond my knowledge and expertise. But perhaps some other participants on this forum will chime in! I may be missing something, but asking "What is Lustre LNET?" via google does not yield good answers. It would be helpful to have some graphics (pictures!) of typical, useful configurations. Limiting myself to a few minutes of searching, I couldn't find any. I "get" that Lustre users/admin with lots of nodes and several switching fabrics find it useful, but beyond that... I guess the answer will be "Performance!" -- but the obvious question is: Why not "just" use IP - that is the Internetworking Protocol! So rather than sweat over LNET, why not improve IP to work better over several IBs? >From a user/customer point of view where "I needed this yesterday", short of having an "LNET for GPFS", I suggest considering reconfiguring your nodes, switches, storage to get better performance. 
If you need to buy some more hardware, so be it. --marc From: Aaron Knister To: Date: 09/20/2016 09:23 AM Subject: Re: [gpfsug-discuss] GPFS Routers Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi Marc, Currently we serve three disparate infiniband fabrics with three separate sets of NSD servers all connected via FC to backend storage. I was exploring the idea of flipping that on its head and having one set of NSD servers but would like something akin to Lustre LNET routers to connect each fabric to the back-end NSD servers over IB. I know there's IB routers out there now but I'm quite drawn to the idea of a GPFS equivalent of Lustre LNET routers, having used them in the past. I suppose I could always smush some extra HCAs in the NSD servers and do it that way but that got really ugly when I started factoring in omnipath. Something like an LNET router would also be useful for GNR users who would like to present to both an IB and an OmniPath fabric over RDMA. -Aaron On 9/12/16 10:48 AM, Marc A Kaplan wrote: > Perhaps if you clearly describe what equipment and connections you have > in place and what you're trying to accomplish, someone on this board can > propose a solution. > > In principle, it's always possible to insert proxies/routers to "fake" > any two endpoints into "believing" they are communicating directly. > > > > > > From: Aaron Knister > To: > Date: 09/11/2016 08:01 PM > Subject: Re: [gpfsug-discuss] GPFS Routers > Sent by: gpfsug-discuss-bounces at spectrumscale.org > ------------------------------------------------------------------------ > > > > After some googling around, I wonder if perhaps what I'm thinking of was > an I/O forwarding layer that I understood was being developed for x86_64 > type machines rather than some type of GPFS protocol router or proxy. > > -Aaron > > On 9/11/16 5:02 PM, Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE > CORP] wrote: >> Hi Everyone, >> >> A while back I seem to recall hearing about a mechanism being developed >> that would function similarly to Lustre's LNET routers and effectively >> allow a single set of NSD servers to talk to multiple RDMA fabrics >> without requiring the NSD servers to have infiniband interfaces on each >> RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each >> fabric that would in effect proxy the RDMA requests to the NSD server. >> Does anyone know what I'm talking about? Just curious if it's still on >> the roadmap. >> >> -Aaron >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From aaron.s.knister at nasa.gov Tue Sep 20 15:07:38 2016 From: aaron.s.knister at nasa.gov (Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]) Date: Tue, 20 Sep 2016 14:07:38 +0000 Subject: [gpfsug-discuss] GPFS Routers References: [gpfsug-discuss] GPFS Routers Message-ID: <5F910253243E6A47B81A9A2EB424BBA101D24844@NDMSMBX404.ndc.nasa.gov> Not sure if this image will go through but here's one I found: [X] The "Routers" are LNET routers. LNET is just the name of lustre's network stack. The LNET routers "route" the Lustre protocol between disparate network types (quadrics, Ethernet, myrinet, carrier pigeon). Packet loss on carrier pigeon is particularly brutal, though. From: Marc A Kaplan Sent: 9/20/16, 10:02 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] GPFS Routers Thanks for spelling out the situation more clearly. This is beyond my knowledge and expertise. But perhaps some other participants on this forum will chime in! I may be missing something, but asking "What is Lustre LNET?" via google does not yield good answers. It would be helpful to have some graphics (pictures!) of typical, useful configurations. Limiting myself to a few minutes of searching, I couldn't find any. I "get" that Lustre users/admin with lots of nodes and several switching fabrics find it useful, but beyond that... I guess the answer will be "Performance!" -- but the obvious question is: Why not "just" use IP - that is the Internetworking Protocol! So rather than sweat over LNET, why not improve IP to work better over several IBs? >From a user/customer point of view where "I needed this yesterday", short of having an "LNET for GPFS", I suggest considering reconfiguring your nodes, switches, storage to get better performance. If you need to buy some more hardware, so be it. --marc From: Aaron Knister To: Date: 09/20/2016 09:23 AM Subject: Re: [gpfsug-discuss] GPFS Routers Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi Marc, Currently we serve three disparate infiniband fabrics with three separate sets of NSD servers all connected via FC to backend storage. I was exploring the idea of flipping that on its head and having one set of NSD servers but would like something akin to Lustre LNET routers to connect each fabric to the back-end NSD servers over IB. I know there's IB routers out there now but I'm quite drawn to the idea of a GPFS equivalent of Lustre LNET routers, having used them in the past. I suppose I could always smush some extra HCAs in the NSD servers and do it that way but that got really ugly when I started factoring in omnipath. Something like an LNET router would also be useful for GNR users who would like to present to both an IB and an OmniPath fabric over RDMA. -Aaron On 9/12/16 10:48 AM, Marc A Kaplan wrote: > Perhaps if you clearly describe what equipment and connections you have > in place and what you're trying to accomplish, someone on this board can > propose a solution. > > In principle, it's always possible to insert proxies/routers to "fake" > any two endpoints into "believing" they are communicating directly. 
> > > > > > From: Aaron Knister > To: > Date: 09/11/2016 08:01 PM > Subject: Re: [gpfsug-discuss] GPFS Routers > Sent by: gpfsug-discuss-bounces at spectrumscale.org > ------------------------------------------------------------------------ > > > > After some googling around, I wonder if perhaps what I'm thinking of was > an I/O forwarding layer that I understood was being developed for x86_64 > type machines rather than some type of GPFS protocol router or proxy. > > -Aaron > > On 9/11/16 5:02 PM, Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE > CORP] wrote: >> Hi Everyone, >> >> A while back I seem to recall hearing about a mechanism being developed >> that would function similarly to Lustre's LNET routers and effectively >> allow a single set of NSD servers to talk to multiple RDMA fabrics >> without requiring the NSD servers to have infiniband interfaces on each >> RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each >> fabric that would in effect proxy the RDMA requests to the NSD server. >> Does anyone know what I'm talking about? Just curious if it's still on >> the roadmap. >> >> -Aaron >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Tue Sep 20 15:08:46 2016 From: aaron.s.knister at nasa.gov (Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]) Date: Tue, 20 Sep 2016 14:08:46 +0000 Subject: [gpfsug-discuss] GPFS Routers References: [gpfsug-discuss] GPFS Routers Message-ID: <5F910253243E6A47B81A9A2EB424BBA101D24881@NDMSMBX404.ndc.nasa.gov> Looks like the attachment got scrubbed. Here's the link http://docplayer.net/docs-images/39/19199001/images/7-0.png[X] From: aaron.s.knister at nasa.gov Sent: 9/20/16, 10:07 AM To: gpfsug main discussion list, gpfsug main discussion list Subject: Re: [gpfsug-discuss] GPFS Routers Not sure if this image will go through but here's one I found: [X] The "Routers" are LNET routers. LNET is just the name of lustre's network stack. The LNET routers "route" the Lustre protocol between disparate network types (quadrics, Ethernet, myrinet, carrier pigeon). Packet loss on carrier pigeon is particularly brutal, though. From: Marc A Kaplan Sent: 9/20/16, 10:02 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] GPFS Routers Thanks for spelling out the situation more clearly. This is beyond my knowledge and expertise. But perhaps some other participants on this forum will chime in! I may be missing something, but asking "What is Lustre LNET?" via google does not yield good answers. It would be helpful to have some graphics (pictures!) 
of typical, useful configurations. Limiting myself to a few minutes of searching, I couldn't find any. I "get" that Lustre users/admin with lots of nodes and several switching fabrics find it useful, but beyond that... I guess the answer will be "Performance!" -- but the obvious question is: Why not "just" use IP - that is the Internetworking Protocol! So rather than sweat over LNET, why not improve IP to work better over several IBs? >From a user/customer point of view where "I needed this yesterday", short of having an "LNET for GPFS", I suggest considering reconfiguring your nodes, switches, storage to get better performance. If you need to buy some more hardware, so be it. --marc From: Aaron Knister To: Date: 09/20/2016 09:23 AM Subject: Re: [gpfsug-discuss] GPFS Routers Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi Marc, Currently we serve three disparate infiniband fabrics with three separate sets of NSD servers all connected via FC to backend storage. I was exploring the idea of flipping that on its head and having one set of NSD servers but would like something akin to Lustre LNET routers to connect each fabric to the back-end NSD servers over IB. I know there's IB routers out there now but I'm quite drawn to the idea of a GPFS equivalent of Lustre LNET routers, having used them in the past. I suppose I could always smush some extra HCAs in the NSD servers and do it that way but that got really ugly when I started factoring in omnipath. Something like an LNET router would also be useful for GNR users who would like to present to both an IB and an OmniPath fabric over RDMA. -Aaron On 9/12/16 10:48 AM, Marc A Kaplan wrote: > Perhaps if you clearly describe what equipment and connections you have > in place and what you're trying to accomplish, someone on this board can > propose a solution. > > In principle, it's always possible to insert proxies/routers to "fake" > any two endpoints into "believing" they are communicating directly. > > > > > > From: Aaron Knister > To: > Date: 09/11/2016 08:01 PM > Subject: Re: [gpfsug-discuss] GPFS Routers > Sent by: gpfsug-discuss-bounces at spectrumscale.org > ------------------------------------------------------------------------ > > > > After some googling around, I wonder if perhaps what I'm thinking of was > an I/O forwarding layer that I understood was being developed for x86_64 > type machines rather than some type of GPFS protocol router or proxy. > > -Aaron > > On 9/11/16 5:02 PM, Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE > CORP] wrote: >> Hi Everyone, >> >> A while back I seem to recall hearing about a mechanism being developed >> that would function similarly to Lustre's LNET routers and effectively >> allow a single set of NSD servers to talk to multiple RDMA fabrics >> without requiring the NSD servers to have infiniband interfaces on each >> RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each >> fabric that would in effect proxy the RDMA requests to the NSD server. >> Does anyone know what I'm talking about? Just curious if it's still on >> the roadmap. 
>> >> -Aaron >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Tue Sep 20 15:30:43 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Tue, 20 Sep 2016 10:30:43 -0400 Subject: [gpfsug-discuss] GPFS Routers In-Reply-To: <5F910253243E6A47B81A9A2EB424BBA101D24881@NDMSMBX404.ndc.nasa.gov> References: [gpfsug-discuss] GPFS Routers <5F910253243E6A47B81A9A2EB424BBA101D24881@NDMSMBX404.ndc.nasa.gov> Message-ID: Thanks. That example is simpler than I imagined. Question: If that was indeed your situation and you could afford it, why not just go totally with infiniband switching/routing? Are not the routers just a hack to connect Intel OPA to IB? Ref: https://community.mellanox.com/docs/DOC-2384#jive_content_id_Network_Topology_Design -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhejtman at ics.muni.cz Tue Sep 20 16:07:12 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Tue, 20 Sep 2016 17:07:12 +0200 Subject: [gpfsug-discuss] CES and nfs pseudo root Message-ID: <20160920150712.2v73hsf7pzrqb3g4@ics.muni.cz> Hello, ganesha allows to specify pseudo root for each export using Pseudo="path". mmnfs export sets pseudo path the same as export dir, e.g., I want to export /mnt/nfs, Pseudo is set to '/mnt/nfs' as well. Can I set somehow Pseudo to '/'? -- Luk?? Hejtm?nek From stef.coene at docum.org Tue Sep 20 18:42:57 2016 From: stef.coene at docum.org (Stef Coene) Date: Tue, 20 Sep 2016 19:42:57 +0200 Subject: [gpfsug-discuss] Ubuntu client Message-ID: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> Hi, I just installed 4.2.1 on 2 RHEL 7.2 servers without any issue. But I also need 2 clients on Ubuntu 14.04. I installed the GPFS client on the Ubuntu server and used mmbuildgpl to build the required kernel modules. ssh keys are exchanged between GPFS servers and the client. But I can't add the node: [root at gpfs01 ~]# mmaddnode -N client1 Tue Sep 20 19:40:09 CEST 2016: mmaddnode: Processing node client1 mmremote: The CCR environment could not be initialized on node client1. mmaddnode: The CCR environment could not be initialized on node client1. mmaddnode: mmaddnode quitting. None of the specified nodes are valid. mmaddnode: Command failed. Examine previous error messages to determine cause. I don't see any error in /var/mmfs on client and server. What can I try to debug this error? 
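A quick sanity check for this kind of mmaddnode/CCR failure is name resolution and passwordless root ssh in both directions before anything else. A minimal sketch, reusing the node names gpfs01 and client1 from this thread (adjust for your own hosts):

# forward resolution of the new client from an existing cluster node, and of the cluster node from the client
getent hosts client1
ssh client1 'getent hosts client1; getent hosts gpfs01'

# passwordless root ssh has to work in both directions for mmaddnode and CCR
ssh client1 /bin/true && echo "gpfs01 -> client1 OK"
ssh client1 "ssh gpfs01 /bin/true && echo 'client1 -> gpfs01 OK'"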
Stef From stef.coene at docum.org Tue Sep 20 18:47:47 2016 From: stef.coene at docum.org (Stef Coene) Date: Tue, 20 Sep 2016 19:47:47 +0200 Subject: [gpfsug-discuss] Ubuntu client In-Reply-To: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> References: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> Message-ID: <3727524d-aa94-a09e-ebf7-a5d4e1c6f301@docum.org> On 09/20/2016 07:42 PM, Stef Coene wrote: > Hi, > > I just installed 4.2.1 on 2 RHEL 7.2 servers without any issue. > But I also need 2 clients on Ubuntu 14.04. > I installed the GPFS client on the Ubuntu server and used mmbuildgpl to > build the required kernel modules. > ssh keys are exchanged between GPFS servers and the client. > > But I can't add the node: > [root at gpfs01 ~]# mmaddnode -N client1 > Tue Sep 20 19:40:09 CEST 2016: mmaddnode: Processing node client1 > mmremote: The CCR environment could not be initialized on node client1. > mmaddnode: The CCR environment could not be initialized on node client1. > mmaddnode: mmaddnode quitting. None of the specified nodes are valid. > mmaddnode: Command failed. Examine previous error messages to determine > cause. > > I don't see any error in /var/mmfs on client and server. > > What can I try to debug this error? Pfff, problem solved. I tailed the logs in /var/adm/ras and found out there was a type in /etc/hosts so the hostname of the client was unresolvable. Stef From YARD at il.ibm.com Tue Sep 20 20:03:39 2016 From: YARD at il.ibm.com (Yaron Daniel) Date: Tue, 20 Sep 2016 22:03:39 +0300 Subject: [gpfsug-discuss] Ubuntu client In-Reply-To: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> References: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> Message-ID: Hi Check that kernel symbols are installed too Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services - Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Stef Coene To: gpfsug main discussion list Date: 09/20/2016 08:43 PM Subject: [gpfsug-discuss] Ubuntu client Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi, I just installed 4.2.1 on 2 RHEL 7.2 servers without any issue. But I also need 2 clients on Ubuntu 14.04. I installed the GPFS client on the Ubuntu server and used mmbuildgpl to build the required kernel modules. ssh keys are exchanged between GPFS servers and the client. But I can't add the node: [root at gpfs01 ~]# mmaddnode -N client1 Tue Sep 20 19:40:09 CEST 2016: mmaddnode: Processing node client1 mmremote: The CCR environment could not be initialized on node client1. mmaddnode: The CCR environment could not be initialized on node client1. mmaddnode: mmaddnode quitting. None of the specified nodes are valid. mmaddnode: Command failed. Examine previous error messages to determine cause. I don't see any error in /var/mmfs on client and server. What can I try to debug this error? Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: not available Type: image/gif Size: 1851 bytes Desc: not available URL: From olaf.weiser at de.ibm.com Wed Sep 21 04:35:57 2016 From: olaf.weiser at de.ibm.com (Olaf Weiser) Date: Wed, 21 Sep 2016 05:35:57 +0200 Subject: [gpfsug-discuss] Ubuntu client In-Reply-To: References: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> Message-ID: An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 1851 bytes Desc: not available URL: From stef.coene at docum.org Wed Sep 21 07:03:05 2016 From: stef.coene at docum.org (Stef Coene) Date: Wed, 21 Sep 2016 08:03:05 +0200 Subject: [gpfsug-discuss] Ubuntu client In-Reply-To: References: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> Message-ID: <01a37d7a-b5ef-cb3e-5ccb-d5f942df6487@docum.org> On 09/21/2016 05:35 AM, Olaf Weiser wrote: > CCR issues are often related to DNS issues, so check, that you Ubuntu > nodes can resolve the existing nodes accordingly and vise versa > in one line: .. all nodes must be resolvable on every node It was a type in the hostname and /etc/hosts. So problem solved. Stef From xhejtman at ics.muni.cz Wed Sep 21 20:09:32 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Wed, 21 Sep 2016 21:09:32 +0200 Subject: [gpfsug-discuss] CES NFS with Kerberos Message-ID: <20160921190932.fibmmccccs5kit6x@ics.muni.cz> Hello, does nfs server (ganesha) work for someone with Kerberos authentication? I got random permission denied: :/mnt/nfs-test/tmp# for i in `seq 1 20`; do rm testf; dd if=/dev/zero of=testf bs=1M count=100000; done 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 642.849 s, 163 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 925.326 s, 113 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 762.749 s, 137 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 860.608 s, 122 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 788.62 s, 133 MB/s dd: error writing ?testf?: Permission denied 51949+0 records in 51948+0 records out 54471426048 bytes (54 GB) copied, 566.667 s, 96.1 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 1082.63 s, 96.9 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 1080.65 s, 97.0 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 949.683 s, 110 MB/s dd: error writing ?testf?: Permission denied 30076+0 records in 30075+0 records out 31535923200 bytes (32 GB) copied, 308.009 s, 102 MB/s dd: error writing ?testf?: Permission denied 89837+0 records in 89836+0 records out 94199873536 bytes (94 GB) copied, 976.368 s, 96.5 MB/s It seems that it is a bug in ganesha: http://permalink.gmane.org/gmane.comp.file-systems.nfs.ganesha.devel/2000 but it is still not resolved. -- Luk?? Hejtm?nek From Greg.Lehmann at csiro.au Wed Sep 21 23:34:09 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Wed, 21 Sep 2016 22:34:09 +0000 Subject: [gpfsug-discuss] CES NFS with Kerberos In-Reply-To: <20160921190932.fibmmccccs5kit6x@ics.muni.cz> References: <20160921190932.fibmmccccs5kit6x@ics.muni.cz> Message-ID: <94bedafe10a9473c93b0bcc5d34cbea6@exch1-cdc.nexus.csiro.au> It may not be NFS. Check your GPFS logs too. 
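For a reproducible case like the dd loop above, a few quick checks on the CES protocol node while the copy is running can help narrow down which layer is failing. A sketch, assuming the usual default log locations (they may differ on your install):

# GPFS daemon log on the protocol node serving the client
tail -f /var/adm/ras/mmfs.log.latest

# Ganesha log (default location on CES protocol nodes)
tail -f /var/log/ganesha.log

# confirm the export definition and its options
mmnfs export list

# on the NFS client: do the permission-denied errors line up with Kerberos
# ticket or context expiry during the long-running writes?
klist -e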
-----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Lukas Hejtmanek Sent: Thursday, 22 September 2016 5:10 AM To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] CES NFS with Kerberos Hello, does nfs server (ganesha) work for someone with Kerberos authentication? I got random permission denied: :/mnt/nfs-test/tmp# for i in `seq 1 20`; do rm testf; dd if=/dev/zero of=testf bs=1M count=100000; done 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 642.849 s, 163 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 925.326 s, 113 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 762.749 s, 137 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 860.608 s, 122 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 788.62 s, 133 MB/s dd: error writing ?testf?: Permission denied 51949+0 records in 51948+0 records out 54471426048 bytes (54 GB) copied, 566.667 s, 96.1 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 1082.63 s, 96.9 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 1080.65 s, 97.0 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 949.683 s, 110 MB/s dd: error writing ?testf?: Permission denied 30076+0 records in 30075+0 records out 31535923200 bytes (32 GB) copied, 308.009 s, 102 MB/s dd: error writing ?testf?: Permission denied 89837+0 records in 89836+0 records out 94199873536 bytes (94 GB) copied, 976.368 s, 96.5 MB/s It seems that it is a bug in ganesha: http://permalink.gmane.org/gmane.comp.file-systems.nfs.ganesha.devel/2000 but it is still not resolved. -- Luk?? Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From xhejtman at ics.muni.cz Thu Sep 22 09:25:09 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Thu, 22 Sep 2016 10:25:09 +0200 Subject: [gpfsug-discuss] CES NFS with Kerberos In-Reply-To: <94bedafe10a9473c93b0bcc5d34cbea6@exch1-cdc.nexus.csiro.au> References: <20160921190932.fibmmccccs5kit6x@ics.muni.cz> <94bedafe10a9473c93b0bcc5d34cbea6@exch1-cdc.nexus.csiro.au> Message-ID: <20160922082509.rc53tseeovjnixtz@ics.muni.cz> Hello, thanks, I do not see any error in GPFS logs. The link, I posted below is not related to GPFS at all, it seems that it is bug in ganesha. On Wed, Sep 21, 2016 at 10:34:09PM +0000, Greg.Lehmann at csiro.au wrote: > It may not be NFS. Check your GPFS logs too. > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Lukas Hejtmanek > Sent: Thursday, 22 September 2016 5:10 AM > To: gpfsug-discuss at spectrumscale.org > Subject: [gpfsug-discuss] CES NFS with Kerberos > > Hello, > > does nfs server (ganesha) work for someone with Kerberos authentication? 
> > I got random permission denied: > :/mnt/nfs-test/tmp# for i in `seq 1 20`; do rm testf; dd if=/dev/zero of=testf bs=1M count=100000; done > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 642.849 s, 163 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 925.326 s, 113 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 762.749 s, 137 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 860.608 s, 122 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 788.62 s, 133 MB/s > dd: error writing ?testf?: Permission denied > 51949+0 records in > 51948+0 records out > 54471426048 bytes (54 GB) copied, 566.667 s, 96.1 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 1082.63 s, 96.9 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 1080.65 s, 97.0 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 949.683 s, 110 MB/s > dd: error writing ?testf?: Permission denied > 30076+0 records in > 30075+0 records out > 31535923200 bytes (32 GB) copied, 308.009 s, 102 MB/s > dd: error writing ?testf?: Permission denied > 89837+0 records in > 89836+0 records out > 94199873536 bytes (94 GB) copied, 976.368 s, 96.5 MB/s > > It seems that it is a bug in ganesha: > http://permalink.gmane.org/gmane.comp.file-systems.nfs.ganesha.devel/2000 > > but it is still not resolved. > > -- > Luk?? Hejtm?nek > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Luk?? Hejtm?nek From stef.coene at docum.org Thu Sep 22 19:36:48 2016 From: stef.coene at docum.org (Stef Coene) Date: Thu, 22 Sep 2016 20:36:48 +0200 Subject: [gpfsug-discuss] Blocksize Message-ID: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Hi, Is it needed to specify a different blocksize for the system pool that holds the metadata? IBM recommends a 1 MB blocksize for the file system. But I wonder a smaller blocksize (256 KB or so) for metadata is a good idea or not... Stef From eric.wonderley at vt.edu Thu Sep 22 20:07:30 2016 From: eric.wonderley at vt.edu (J. Eric Wonderley) Date: Thu, 22 Sep 2016 15:07:30 -0400 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: It defaults to 4k: mmlsfs testbs8M -i flag value description ------------------- ------------------------ ----------------------------------- -i 4096 Inode size in bytes I think you can make as small as 512b. Gpfs will store very small files in the inode. Typically you want your average file size to be your blocksize and your filesystem has one blocksize and one inodesize. On Thu, Sep 22, 2016 at 2:36 PM, Stef Coene wrote: > Hi, > > Is it needed to specify a different blocksize for the system pool that > holds the metadata? > > IBM recommends a 1 MB blocksize for the file system. > But I wonder a smaller blocksize (256 KB or so) for metadata is a good > idea or not... 
> > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ewahl at osc.edu Thu Sep 22 20:19:00 2016 From: ewahl at osc.edu (Wahl, Edward) Date: Thu, 22 Sep 2016 19:19:00 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: <9DA9EC7A281AC7428A9618AFDC49049958EFBB06@CIO-KRC-D1MBX02.osuad.osu.edu> This is a great idea. However there are quite a few other things to consider: -max file count? If you need say a couple of billion files, this will affect things. -wish to store small files in the system pool in late model SS/GPFS? -encryption? No data will be stored in the system pool so large blocks for small file storage in system is pointless. -system pool replication? -HDD vs SSD for system pool? -xxD or array tuning recommendations from your vendor? -streaming vs random IO? Do you have a single dedicated app that has performance like xxx? -probably more I can't think of off the top of my head. etc etc Ed ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Stef Coene [stef.coene at docum.org] Sent: Thursday, September 22, 2016 2:36 PM To: gpfsug main discussion list Subject: [gpfsug-discuss] Blocksize Hi, Is it needed to specify a different blocksize for the system pool that holds the metadata? IBM recommends a 1 MB blocksize for the file system. But I wonder a smaller blocksize (256 KB or so) for metadata is a good idea or not... Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From janfrode at tanso.net Thu Sep 22 20:25:03 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Thu, 22 Sep 2016 21:25:03 +0200 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: https://www.ibm.com/developerworks/community/forums/html/topic?id=77777777-0000-0000-0000-000014774266 "Use 256K. Anything smaller makes allocation blocks for the inode file inefficient. Anything larger wastes space for directories. These are the two largest consumers of metadata space." --dlmcnabb A bit old, but I would assume it still applies. -jf On Thu, Sep 22, 2016 at 8:36 PM, Stef Coene wrote: > Hi, > > Is it needed to specify a different blocksize for the system pool that > holds the metadata? > > IBM recommends a 1 MB blocksize for the file system. > But I wonder a smaller blocksize (256 KB or so) for metadata is a good > idea or not... > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From stef.coene at docum.org Thu Sep 22 20:29:43 2016 From: stef.coene at docum.org (Stef Coene) Date: Thu, 22 Sep 2016 21:29:43 +0200 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: On 09/22/2016 09:07 PM, J. 
Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. So I calculated with IBM that a 1 MB or 2 MB block size is best. But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef From luis.bolinches at fi.ibm.com Thu Sep 22 20:37:02 2016 From: luis.bolinches at fi.ibm.com (Luis Bolinches) Date: Thu, 22 Sep 2016 19:37:02 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: , <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: An HTML attachment was scrubbed... URL: From S.J.Thompson at bham.ac.uk Thu Sep 22 21:02:24 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Thu, 22 Sep 2016 20:02:24 +0000 Subject: [gpfsug-discuss] SSUG Meet the Devs - Cloud Workshop In-Reply-To: References: Message-ID: We are down to our last few places, so if you do intend to attend, I encourage you to register now! Simon ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Secretary GPFS UG [secretary at gpfsug.org] Sent: 15 September 2016 09:42 To: gpfsug main discussion list Subject: [gpfsug-discuss] SSUG Meet the Devs - Cloud Workshop Hi everyone, Back by popular demand! We are holding a UK 'Meet the Developers' event to focus on Cloud topics. We are very lucky to have Dean Hildebrand, Master Inventor, Cloud Storage Software from IBM over in the UK to lead this session. IS IT FOR ME? Slightly different to the past meet the devs format, this is a cloud workshop aimed at looking how Spectrum Scale fits in the world of cloud. Rather than being a series of presentations and discussions led by IBM, this workshop aims to look at how Spectrum Scale can be used in cloud environments. This will include using Spectrum Scale as an infrastructure tool to power private cloud deployments. We will also look at the challenges of accessing data from cloud deployments and discuss ways in which this might be accomplished. If you are currently deploying OpenStack on Spectrum Scale, or plan to in the near future, then this workshop is for you. Also if you currently have Spectrum Scale and are wondering how you might get that data into cloud-enabled workloads or are currently doing so, then again you should attend. To ensure that the workshop is focused, numbers are limited and we will initially be limiting to 2 people per organisation/project/site. WHAT WILL BE DISCUSSED? Our topics for the day will include on-premise (private) clouds, on-premise self-service (public) clouds, off-premise clouds (Amazon etc.) as well as covering technologies including OpenStack, Docker, Kubernetes and security requirements around multi-tenancy. We probably don't have all the answers for these, but we'd like to understand the requirements and hear people's ideas. Please let us know what you would like to discuss when you register. Arrival is from 10:00 with discussion kicking off from 10:30. The agenda is open discussion though we do aim to talk over a number of key topics. 
We hope to have the ever popular (though usually late!) pizza for lunch. WHEN Thursday 20th October 2016 from 10:00 AM to 3:30 PM WHERE IT Services, University of Birmingham - Elms Road Edgbaston, Birmingham, B15 2TT REGISTER Please register for the event in advance: https://www.eventbrite.com/e/ssug-meet-the-devs-cloud-workshop-tickets-27725390389 Numbers are limited and we will initially be limiting to 2 people per organisation/project/site. We look forward to seeing you there! -- Claire O'Toole Spectrum Scale/GPFS User Group Secretary +44 (0)7508 033896 www.spectrumscaleug.org
-------------- next part -------------- An HTML attachment was scrubbed... URL:

From makaplan at us.ibm.com Thu Sep 22 21:25:10 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Thu, 22 Sep 2016 16:25:10 -0400 Subject: [gpfsug-discuss] Blocksize and space and performance for Metadata, release 4.2.x In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID:

There have been a few changes over the years that may invalidate some of the old advice about metadata and disk allocations there for. These have been phased in over the last few years, I am discussing the present situation for release 4.2.x

1) Inode size. Used to be 512. Now you can set the inodesize at mmcrfs time. Defaults to 4096.
2) Data in inode. If it fits, then the inode holds the data. Since a 512 byte inode still works, you can have more than 3.5KB of data in a 4KB inode.
3) Extended Attributes in Inode. Again, if it fits... Extended attributes used to be stored in a separate file of metadata. So extended attributes performance is way better than the old days.
4) (small) Directories in Inode. If it fits, the inode of a directory can hold the directory entries. That gives you about 2x performance on directory reads, for smallish directories.
5) Big directory blocks. Directories used to use a maximum of 32KB per block, potentially wasting a lot of space and yielding poor performance for large directories. Now directory blocks are the lesser of metadata-blocksize and 256KB.
6) Big directories are shrinkable. Used to be directories would grow in 32KB chunks but never shrink. Yup, even an almost(?) "empty" directory would remain the size the directory had to be at its lifetime maximum. That means just a few remaining entries could be "sprinkled" over many directory blocks. (See also 5.) But now directories autoshrink to avoid wasteful sparsity. Last I looked, the implementation just stopped short of "pushing" tiny directories back into the inode. But a huge directory can be shrunk down to a single (meta)data block. (See --compact in the docs.)

--marc of GPFS
-------------- next part -------------- An HTML attachment was scrubbed... URL:

From volobuev at us.ibm.com Thu Sep 22 21:49:32 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Thu, 22 Sep 2016 13:49:32 -0700 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID:

The current (V4.2+) levels of code support bigger directory block sizes, so it's no longer an issue with something like 1M metadata block size. In fact, there isn't a whole lot of difference between 256K and 1M metadata block sizes, either would work fine. There isn't really a downside in selecting a different block size for metadata though. Inode size (mmcrfs -i option) is orthogonal to the metadata block size selection. We do strongly recommend using 4K inodes to anyone. There's the obvious downside of needing more metadata storage for inodes, but the advantages are significant.
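To make that concrete for a case like the CommVault file system discussed above (~20 MB average files), a minimal mmcrfs sketch with a 2M data block size, 256K metadata block size and 4K inodes might look like the following. The device name, NSD names, stanza file and replication settings are illustrative placeholders, not a recommendation for any particular storage:

# backupfs.stanza -- metadata NSDs in the system pool, data NSDs in a separate pool
%nsd: nsd=md01 usage=metadataOnly failureGroup=1 pool=system
%nsd: nsd=da01 usage=dataOnly failureGroup=2 pool=data

mmcrfs backupfs -F backupfs.stanza -B 2M --metadata-block-size 256K -i 4096 -m 1 -M 2 -r 1 -R 2

# a default placement rule (installed with mmchpolicy) is still needed to land file data in the 'data' pool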
yuri From: Jan-Frode Myklebust To: gpfsug main discussion list , Date: 09/22/2016 12:25 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org https://www.ibm.com/developerworks/community/forums/html/topic?id=77777777-0000-0000-0000-000014774266 "Use 256K. Anything smaller makes allocation blocks for the inode file inefficient. Anything larger wastes space for directories. These are the two largest consumers of metadata space." --dlmcnabb A bit old, but I would assume it still applies. ? -jf On Thu, Sep 22, 2016 at 8:36 PM, Stef Coene wrote: Hi, Is it needed to specify a different blocksize for the system pool that holds the metadata? IBM recommends a 1 MB blocksize for the file system. But I wonder a smaller blocksize (256 KB or so) for metadata is a good idea or not... Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From Mark.Bush at siriuscom.com Fri Sep 23 02:48:44 2016 From: Mark.Bush at siriuscom.com (Mark.Bush at siriuscom.com) Date: Fri, 23 Sep 2016 01:48:44 +0000 Subject: [gpfsug-discuss] Learn a new cluster Message-ID: <7E08F93E-F018-4C00-A78E-71EFDBEAC87C@siriuscom.com> What commands would you run to learn all you need to know about a cluster you?ve never seen before? Captain Obvious (me) says: mmlscluster mmlsconfig mmlsnode mmlsnsd mmlsfs all What others? Mark R. Bush | Solutions Architect This message (including any attachments) is intended only for the use of the individual or entity to which it is addressed and may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law. If you are not the intended recipient, you are hereby notified that any use, dissemination, distribution, or copying of this communication is strictly prohibited. This message may be viewed by parties at Sirius Computer Solutions other than those named in the message header. This message does not contain an official representation of Sirius Computer Solutions. If you have received this communication in error, notify Sirius Computer Solutions immediately and (i) destroy this message if a facsimile or (ii) delete this message immediately if this is an electronic communication. Thank you. Sirius Computer Solutions -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Fri Sep 23 02:50:52 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Thu, 22 Sep 2016 21:50:52 -0400 Subject: [gpfsug-discuss] Learn a new cluster In-Reply-To: <7E08F93E-F018-4C00-A78E-71EFDBEAC87C@siriuscom.com> References: <7E08F93E-F018-4C00-A78E-71EFDBEAC87C@siriuscom.com> Message-ID: <1ff27b42-aead-fafa-5415-520334d299c1@nasa.gov> Perhaps a gpfs.snap? This could tell you a *lot* about a cluster. -Aaron On 9/22/16 9:48 PM, Mark.Bush at siriuscom.com wrote: > What commands would you run to learn all you need to know about a > cluster you?ve never seen before? 
> > Captain Obvious (me) says: > > mmlscluster > > mmlsconfig > > mmlsnode > > mmlsnsd > > mmlsfs all > > > > What others? > > > > > > Mark R. Bush | Solutions Architect > > > > This message (including any attachments) is intended only for the use of > the individual or entity to which it is addressed and may contain > information that is non-public, proprietary, privileged, confidential, > and exempt from disclosure under applicable law. If you are not the > intended recipient, you are hereby notified that any use, dissemination, > distribution, or copying of this communication is strictly prohibited. > This message may be viewed by parties at Sirius Computer Solutions other > than those named in the message header. This message does not contain an > official representation of Sirius Computer Solutions. If you have > received this communication in error, notify Sirius Computer Solutions > immediately and (i) destroy this message if a facsimile or (ii) delete > this message immediately if this is an electronic communication. Thank you. > > Sirius Computer Solutions > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From Greg.Lehmann at csiro.au Fri Sep 23 02:53:14 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Fri, 23 Sep 2016 01:53:14 +0000 Subject: [gpfsug-discuss] Learn a new cluster In-Reply-To: <7E08F93E-F018-4C00-A78E-71EFDBEAC87C@siriuscom.com> References: <7E08F93E-F018-4C00-A78E-71EFDBEAC87C@siriuscom.com> Message-ID: <40b22b40d6ed4e38be115e9f6ae8d48d@exch1-cdc.nexus.csiro.au> Nice question. I?d also look at the non-GPFS settings IBM recommend in various places like the FAQ for things like ssh, network, etc. The importance of these is variable depending on cluster size/network configuration etc. Cheers, Greg From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Mark.Bush at siriuscom.com Sent: Friday, 23 September 2016 11:49 AM To: gpfsug main discussion list Subject: [gpfsug-discuss] Learn a new cluster What commands would you run to learn all you need to know about a cluster you?ve never seen before? Captain Obvious (me) says: mmlscluster mmlsconfig mmlsnode mmlsnsd mmlsfs all What others? Mark R. Bush | Solutions Architect This message (including any attachments) is intended only for the use of the individual or entity to which it is addressed and may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law. If you are not the intended recipient, you are hereby notified that any use, dissemination, distribution, or copying of this communication is strictly prohibited. This message may be viewed by parties at Sirius Computer Solutions other than those named in the message header. This message does not contain an official representation of Sirius Computer Solutions. If you have received this communication in error, notify Sirius Computer Solutions immediately and (i) destroy this message if a facsimile or (ii) delete this message immediately if this is an electronic communication. Thank you. Sirius Computer Solutions -------------- next part -------------- An HTML attachment was scrubbed... 
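Pulling the commands suggested in this thread together, a rough read-only survey script might look like the sketch below (the output path is arbitrary, and mmgetstate -a and mmlsmount all -L are additions beyond the list above; gpfs.snap is left commented out since it collects far more and takes correspondingly longer):

#!/bin/bash
# quick, read-only survey of an unfamiliar Spectrum Scale cluster
out=/tmp/gpfs-survey-$(date +%Y%m%d)
mkdir -p "$out"
for cmd in "mmlscluster" "mmlsconfig" "mmlsnode" "mmgetstate -a" \
           "mmlsnsd" "mmlsfs all" "mmlsmount all -L"; do
    echo "### $cmd" >> "$out/summary.txt"
    $cmd           >> "$out/summary.txt" 2>&1
done
# gpfs.snap    # far more thorough, but heavyweight -- run it separately if needed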
URL: From ulmer at ulmer.org Fri Sep 23 17:31:59 2016 From: ulmer at ulmer.org (Stephen Ulmer) Date: Fri, 23 Sep 2016 12:31:59 -0400 Subject: [gpfsug-discuss] Learn a new cluster In-Reply-To: <1ff27b42-aead-fafa-5415-520334d299c1@nasa.gov> References: <7E08F93E-F018-4C00-A78E-71EFDBEAC87C@siriuscom.com> <1ff27b42-aead-fafa-5415-520334d299c1@nasa.gov> Message-ID: <078081B8-E50E-46BE-B3AC-4C1DB6D963E1@ulmer.org> This was going to be my exact suggestion. My short to-learn list includes learn how to look inside a gpfs.snap for what I want to know. I?ve found the ability to do this with other snapshot bundles very useful in the past (for example I?ve used snap on AIX rather than my own scripts in some cases). Do be aware the gpfs.snap (and actually most ?create a bundle for support? commands on most platforms) are a little heavy. Liberty, -- Stephen > On Sep 22, 2016, at 9:50 PM, Aaron Knister wrote: > > Perhaps a gpfs.snap? This could tell you a *lot* about a cluster. > > -Aaron > > On 9/22/16 9:48 PM, Mark.Bush at siriuscom.com wrote: >> What commands would you run to learn all you need to know about a >> cluster you?ve never seen before? >> >> Captain Obvious (me) says: >> >> mmlscluster >> >> mmlsconfig >> >> mmlsnode >> >> mmlsnsd >> >> mmlsfs all >> >> >> >> What others? >> >> >> >> >> >> Mark R. Bush | Solutions Architect >> >> >> >> This message (including any attachments) is intended only for the use of >> the individual or entity to which it is addressed and may contain >> information that is non-public, proprietary, privileged, confidential, >> and exempt from disclosure under applicable law. If you are not the >> intended recipient, you are hereby notified that any use, dissemination, >> distribution, or copying of this communication is strictly prohibited. >> This message may be viewed by parties at Sirius Computer Solutions other >> than those named in the message header. This message does not contain an >> official representation of Sirius Computer Solutions. If you have >> received this communication in error, notify Sirius Computer Solutions >> immediately and (i) destroy this message if a facsimile or (ii) delete >> this message immediately if this is an electronic communication. Thank you. >> >> Sirius Computer Solutions > >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From ulmer at ulmer.org Fri Sep 23 20:16:06 2016 From: ulmer at ulmer.org (Stephen Ulmer) Date: Fri, 23 Sep 2016 15:16:06 -0400 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). 
So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. -- Stephen > On Sep 22, 2016, at 3:37 PM, Luis Bolinches wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. > > No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) > > POSIX > find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. > > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very small > > files in the inode. > > > > Typically you want your average file size to be your blocksize and your > > filesystem has one blocksize and one inodesize. 
> > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. > > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From oehmes at us.ibm.com Fri Sep 23 22:35:12 2016 From: oehmes at us.ibm.com (Sven Oehme) Date: Fri, 23 Sep 2016 14:35:12 -0700 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. Sven ------------------------------------------ Sven Oehme Scalable Storage Research email: oehmes at us.ibm.com Phone: +1 (408) 824-8904 IBM Almaden Research Lab ------------------------------------------ From: Stephen Ulmer To: gpfsug main discussion list Date: 09/23/2016 12:16 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. -- Stephen On Sep 22, 2016, at 3:37 PM, Luis Bolinches < luis.bolinches at fi.ibm.com> wrote: Hi My 2 cents. 
Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) POSIX find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out GPFS cd /usr/lpp/mmfs/samples/ilm gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile ./mmfind /gpfs/shared -ls -type f > find_ls_files.out CONVERT to CSV POSIX cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv GPFS cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv LOAD in octave FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); Clean the second column (OPTIONAL as the next clean up will do the same) FILESIZE(:,[2]) = []; If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) FILESIZE(FILESIZE<=3584) =[]; If we are not we need to clean the 0 size files FILESIZE(FILESIZE==0) =[]; Median FILESIZEMEDIAN = int32 (median (FILESIZE)) Mean FILESIZEMEAN = int32 (mean (FILESIZE)) Variance int32 (var (FILESIZE)) iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. int32 (iqr (FILESIZE)) Standard deviation For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. -- Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations Luis Bolinches Lab Services http://www-03.ibm.com/systems/services/labservices/ IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland Phone: +358 503112585 "If you continually give you will continually have." Anonymous ----- Original message ----- From: Stef Coene Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list Cc: Subject: Re: [gpfsug-discuss] Blocksize Date: Thu, Sep 22, 2016 10:30 PM On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. So I calculated with IBM that a 1 MB or 2 MB block size is best. But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From stef.coene at docum.org Fri Sep 23 23:34:49 2016 From: stef.coene at docum.org (Stef Coene) Date: Sat, 24 Sep 2016 00:34:49 +0200 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: <1d7481bd-c08f-df14-4708-1c2e2a4ac1c0@docum.org> On 09/22/2016 08:36 PM, Stef Coene wrote: > Hi, > > Is it needed to specify a different blocksize for the system pool that > holds the metadata? > > IBM recommends a 1 MB blocksize for the file system. > But I wonder a smaller blocksize (256 KB or so) for metadata is a good > idea or not... I have read the replies and at the end, this is what we will do: Since the back-end storage will be V5000 with a default stripe size of 256KB and we use 8 data disk in an array, this means that 256KB * 8 = 2M is the best choice for block size. So 2 MB block size for data is the best choice. Since the block size for metadata is not that important in the latest releases, we will also go for 2 MB block size for metadata. Inode size will be left at the default: 4 KB. Stef From mimarsh2 at vt.edu Sat Sep 24 02:21:30 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Fri, 23 Sep 2016 21:21:30 -0400 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <1d7481bd-c08f-df14-4708-1c2e2a4ac1c0@docum.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <1d7481bd-c08f-df14-4708-1c2e2a4ac1c0@docum.org> Message-ID: To keep this great chain going: If my metadata is on FLASH, would having a smaller blocksize for the system pool (metadata only) be helpful. My filesystem blocksize is 8MB On Fri, Sep 23, 2016 at 6:34 PM, Stef Coene wrote: > On 09/22/2016 08:36 PM, Stef Coene wrote: > >> Hi, >> >> Is it needed to specify a different blocksize for the system pool that >> holds the metadata? >> >> IBM recommends a 1 MB blocksize for the file system. >> But I wonder a smaller blocksize (256 KB or so) for metadata is a good >> idea or not... >> > I have read the replies and at the end, this is what we will do: > Since the back-end storage will be V5000 with a default stripe size of > 256KB and we use 8 data disk in an array, this means that 256KB * 8 = 2M is > the best choice for block size. > So 2 MB block size for data is the best choice. > > Since the block size for metadata is not that important in the latest > releases, we will also go for 2 MB block size for metadata. > > Inode size will be left at the default: 4 KB. > > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... 
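To spell out the arithmetic behind that choice: with the V5000 numbers above, one full stripe is 8 data disks x 256 KB strip = 2048 KB = 2 MB, so a 2 MB file system block size writes exactly one full RAID stripe per block. With the 128 KB strip size mentioned earlier as a common mid-range default, 8 x 128 KB = 1 MB, and 1 MB would be the matching block size.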
URL: From luis.bolinches at fi.ibm.com Sat Sep 24 05:07:02 2016 From: luis.bolinches at fi.ibm.com (Luis Bolinches) Date: Sat, 24 Sep 2016 04:07:02 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: Not pendant but correct I flip there it is 1/32 -- Cheers > On 23 Sep 2016, at 22.16, Stephen Ulmer wrote: > > Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. > > There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. > > -- > Stephen > > > >> On Sep 22, 2016, at 3:37 PM, Luis Bolinches wrote: >> >> Hi >> >> My 2 cents. >> >> Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) >> >> About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. >> >> No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) >> >> POSIX >> find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out >> GPFS >> cd /usr/lpp/mmfs/samples/ilm >> gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile >> ./mmfind /gpfs/shared -ls -type f > find_ls_files.out >> CONVERT to CSV >> >> POSIX >> cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv >> GPFS >> cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv >> LOAD in octave >> >> FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); >> Clean the second column (OPTIONAL as the next clean up will do the same) >> >> FILESIZE(:,[2]) = []; >> If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) >> >> FILESIZE(FILESIZE<=3584) =[]; >> If we are not we need to clean the 0 size files >> >> FILESIZE(FILESIZE==0) =[]; >> Median >> >> FILESIZEMEDIAN = int32 (median (FILESIZE)) >> Mean >> >> FILESIZEMEAN = int32 (mean (FILESIZE)) >> Variance >> >> int32 (var (FILESIZE)) >> iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. >> >> int32 (iqr (FILESIZE)) >> Standard deviation >> >> >> For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. 
>> >> >> >> -- >> Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations >> >> Luis Bolinches >> Lab Services >> http://www-03.ibm.com/systems/services/labservices/ >> >> IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland >> Phone: +358 503112585 >> >> "If you continually give you will continually have." Anonymous >> >> >> ----- Original message ----- >> From: Stef Coene >> Sent by: gpfsug-discuss-bounces at spectrumscale.org >> To: gpfsug main discussion list >> Cc: >> Subject: Re: [gpfsug-discuss] Blocksize >> Date: Thu, Sep 22, 2016 10:30 PM >> >> On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: >> > It defaults to 4k: >> > mmlsfs testbs8M -i >> > flag value description >> > ------------------- ------------------------ >> > ----------------------------------- >> > -i 4096 Inode size in bytes >> > >> > I think you can make as small as 512b. Gpfs will store very small >> > files in the inode. >> > >> > Typically you want your average file size to be your blocksize and your >> > filesystem has one blocksize and one inodesize. >> >> The files are not small, but around 20 MB on average. >> So I calculated with IBM that a 1 MB or 2 MB block size is best. >> >> But I'm not sure if it's better to use a smaller block size for the >> metadata. >> >> The file system is not that large (400 TB) and will hold backup data >> from CommVault. >> >> >> Stef >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> >> Ellei edell? ole toisin mainittu: / Unless stated otherwise above: >> Oy IBM Finland Ab >> PL 265, 00101 Helsinki, Finland >> Business ID, Y-tunnus: 0195876-3 >> Registered in Finland >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > Ellei edell? ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland -------------- next part -------------- An HTML attachment was scrubbed... URL: From Kevin.Buterbaugh at Vanderbilt.Edu Sat Sep 24 15:18:38 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Sat, 24 Sep 2016 14:18:38 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very interested in learning the rationale behind this as I am currently looking at all aspects of our current GPFS configuration and the possibility of making major changes. If you have a filesystem with only metadataOnly disks in the system pool and the default size of an inode is 4K (which we would do, since we have recently discovered that even on our scratch filesystem we have a bazillion files that are 4K or smaller and could therefore have their data stored in the inode, right?), then why would you set the metadata block size to anything larger than 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block size for metadata wouldn?t you be wasting a massive amount of space? What am I missing / confused about there? Oh, and here?s a related question ? let?s just say I have the above configuration ? my system pool is metadata only and is on SSD?s. 
Then I have two other dataOnly pools that are spinning disk. One is for ?regular? access and the other is the ?capacity? pool ? i.e. a pool of slower storage where we move files with large access times. I have a policy that says something like ?move all files with an access time > 6 months to the capacity pool.? Of those bazillion files less than 4K in size that are fitting in the inode currently, probably half a bazillion () of them would be subject to that rule. Will they get moved to the spinning disk capacity pool or will they stay in the inode?? Thanks! This is a very timely and interesting discussion for me as well... Kevin On Sep 23, 2016, at 4:35 PM, Sven Oehme > wrote: your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. Sven ------------------------------------------ Sven Oehme Scalable Storage Research email: oehmes at us.ibm.com Phone: +1 (408) 824-8904 IBM Almaden Research Lab ------------------------------------------ Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengt From: Stephen Ulmer > To: gpfsug main discussion list > Date: 09/23/2016 12:16 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. -- Stephen On Sep 22, 2016, at 3:37 PM, Luis Bolinches > wrote: Hi My 2 cents. Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) POSIX find . 
-type f -name '*' -exec ls -l {} \; > find_ls_files.out GPFS cd /usr/lpp/mmfs/samples/ilm gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile ./mmfind /gpfs/shared -ls -type f > find_ls_files.out CONVERT to CSV POSIX cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv GPFS cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv LOAD in octave FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); Clean the second column (OPTIONAL as the next clean up will do the same) FILESIZE(:,[2]) = []; If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) FILESIZE(FILESIZE<=3584) =[]; If we are not we need to clean the 0 size files FILESIZE(FILESIZE==0) =[]; Median FILESIZEMEDIAN = int32 (median (FILESIZE)) Mean FILESIZEMEAN = int32 (mean (FILESIZE)) Variance int32 (var (FILESIZE)) iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. int32 (iqr (FILESIZE)) Standard deviation For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. -- Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations Luis Bolinches Lab Services http://www-03.ibm.com/systems/services/labservices/ IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland Phone: +358 503112585 "If you continually give you will continually have." Anonymous ----- Original message ----- From: Stef Coene > Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list > Cc: Subject: Re: [gpfsug-discuss] Blocksize Date: Thu, Sep 22, 2016 10:30 PM On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. So I calculated with IBM that a 1 MB or 2 MB block size is best. But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Ellei edell? ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From makaplan at us.ibm.com Sat Sep 24 17:18:11 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Sat, 24 Sep 2016 12:18:11 -0400 Subject: [gpfsug-discuss] Blocksize and MetaData Blocksizes - FORGET the old advice In-Reply-To: References: <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: Metadata is inodes, directories, indirect blocks (indices). Spectrum Scale (GPFS) Version 4.1 introduced significant improvements to the data structures used to represent directories. Larger inodes supporting data and extended attributes in the inode are other significant relatively recent improvements. Now small directories are stored in the inode, while for large directories blocks can be bigger than 32MB, and any and all directory blocks that are smaller than the metadata-blocksize, are allocated just like "fragments" - so directories are now space efficient. SO MUCH SO, that THE OLD ADVICE, about using smallish blocksizes for metadata, GOES "OUT THE WINDOW". Period. FORGET most of what you thought you knew about "best" or "optimal" metadata-blocksize. The new advice is, as Sven wrote: Use a blocksize that optimizes IO transfer efficiency and speed. This is true for BOTH data and metadata. Now, IF you have system pool set up as metadata only AND system pool is on devices that have a different "optimal" block size than your other pools, THEN, it may make sense to use two different blocksizes, one for data and another for metadata. For example, maybe you have massively striped RAID or RAID-LIKE (GSS or ESS)) storage for huge files - so maybe 8MB is a good blocksize for that. But maybe you have your metadata on SSD devices and maybe 1MB is the "best" blocksize for that. -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Sat Sep 24 18:31:37 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Sat, 24 Sep 2016 13:31:37 -0400 Subject: [gpfsug-discuss] Blocksize - consider IO transfer efficiency above your other prejudices In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org><17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: (I can answer your basic questions, Sven has more experience with tuning very large file systems, so perhaps he will have more to say...) 1. Inodes are packed into the file of inodes. (There is one file of all the inodes in a filesystem). If you have metadata-blocksize 1MB you will have 256 of 4KB inodes per block. Forget about sub-blocks when it comes to the file of inodes. 2. IF a file's data fits in its inode, then migrating that file from one pool to another just changes the preferred pool name in the inode. No data movement. Should the file later "grow" to require a data block, that data block will be allocated from whatever pool is named in the inode at that time. See the email I posted earlier today. Basically: FORGET what you thought you knew about optimal metadata blocksize (perhaps based on how you thought metadata was laid out on disk) and just stick to optimal IO transfer blocksizes. Yes, there may be contrived scenarios or even a few real live special cases, but those would be few and far between. Try following the newer general, easier, rule and see how well it works. 
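For reference, the kind of rule Kevin describes could be sketched roughly as follows (the pool names data and capacity and the file system name gpfs0 are placeholders; -I test does a dry run first):

cat > migrate-cold.pol <<'EOF'
/* move files not accessed for roughly 6 months to the capacity pool */
RULE 'cold' MIGRATE FROM POOL 'data' TO POOL 'capacity'
  WHERE (DAYS(CURRENT_TIMESTAMP) - DAYS(ACCESS_TIME)) > 180
EOF

mmapplypolicy gpfs0 -P migrate-cold.pol -I test    # show what would be selected, move nothing
# mmapplypolicy gpfs0 -P migrate-cold.pol -I yes   # actually perform the migration

Per point 2 above, small files whose data lives in the inode are "moved" only in the sense that the preferred pool name recorded in the inode changes; no data blocks are relocated unless the file later grows.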
From: "Buterbaugh, Kevin L" To: gpfsug main discussion list Date: 09/24/2016 10:19 AM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very interested in learning the rationale behind this as I am currently looking at all aspects of our current GPFS configuration and the possibility of making major changes. If you have a filesystem with only metadataOnly disks in the system pool and the default size of an inode is 4K (which we would do, since we have recently discovered that even on our scratch filesystem we have a bazillion files that are 4K or smaller and could therefore have their data stored in the inode, right?), then why would you set the metadata block size to anything larger than 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block size for metadata wouldn?t you be wasting a massive amount of space? What am I missing / confused about there? Oh, and here?s a related question ? let?s just say I have the above configuration ? my system pool is metadata only and is on SSD?s. Then I have two other dataOnly pools that are spinning disk. One is for ?regular? access and the other is the ?capacity? pool ? i.e. a pool of slower storage where we move files with large access times. I have a policy that says something like ?move all files with an access time > 6 months to the capacity pool.? Of those bazillion files less than 4K in size that are fitting in the inode currently, probably half a bazillion () of them would be subject to that rule. Will they get moved to the spinning disk capacity pool or will they stay in the inode?? Thanks! This is a very timely and interesting discussion for me as well... Kevin On Sep 23, 2016, at 4:35 PM, Sven Oehme wrote: your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. Sven ------------------------------------------ Sven Oehme Scalable Storage Research email: oehmes at us.ibm.com Phone: +1 (408) 824-8904 IBM Almaden Research Lab ------------------------------------------ Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengt From: Stephen Ulmer To: gpfsug main discussion list Date: 09/23/2016 12:16 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). 
So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. -- Stephen On Sep 22, 2016, at 3:37 PM, Luis Bolinches wrote: Hi My 2 cents. Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) POSIX find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out GPFS cd /usr/lpp/mmfs/samples/ilm gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile ./mmfind /gpfs/shared -ls -type f > find_ls_files.out CONVERT to CSV POSIX cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv GPFS cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv LOAD in octave FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); Clean the second column (OPTIONAL as the next clean up will do the same) FILESIZE(:,[2]) = []; If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) FILESIZE(FILESIZE<=3584) =[]; If we are not we need to clean the 0 size files FILESIZE(FILESIZE==0) =[]; Median FILESIZEMEDIAN = int32 (median (FILESIZE)) Mean FILESIZEMEAN = int32 (mean (FILESIZE)) Variance int32 (var (FILESIZE)) iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. int32 (iqr (FILESIZE)) Standard deviation For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. -- Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations Luis Bolinches Lab Services http://www-03.ibm.com/systems/services/labservices/ IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland Phone: +358 503112585 "If you continually give you will continually have." Anonymous ----- Original message ----- From: Stef Coene Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list Cc: Subject: Re: [gpfsug-discuss] Blocksize Date: Thu, Sep 22, 2016 10:30 PM On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. So I calculated with IBM that a 1 MB or 2 MB block size is best. 
But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Ellei edell? ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From stef.coene at docum.org Sat Sep 24 19:16:49 2016 From: stef.coene at docum.org (Stef Coene) Date: Sat, 24 Sep 2016 20:16:49 +0200 Subject: [gpfsug-discuss] Maximum NSD size Message-ID: <239fd428-3544-917d-5439-d40ea36f0668@docum.org> Hi, When formatting the NDS for a new file system, I noticed a warning about a maximum size: Formatting file system ... Disks up to size 8.8 TB can be added to storage pool system. Disks up to size 9.0 TB can be added to storage pool V5000. I searched the docs, but I couldn't find any reference regarding the maximum size of NSDs? Stef From oehmes at gmail.com Sun Sep 25 17:25:40 2016 From: oehmes at gmail.com (Sven Oehme) Date: Sun, 25 Sep 2016 16:25:40 +0000 Subject: [gpfsug-discuss] Maximum NSD size In-Reply-To: <239fd428-3544-917d-5439-d40ea36f0668@docum.org> References: <239fd428-3544-917d-5439-d40ea36f0668@docum.org> Message-ID: the limit you see above is NOT the max NSD limit for Scale/GPFS, its rather the limit of the NSD size you can add to this Filesystems pool. depending on which version of code you are running, we limit the maximum size of a NSD that can be added to a pool so you don't have mixtures of lets say 1 TB and 100 TB disks in one pool as this will negatively affect performance. in older versions we where more restrictive than in newer versions. Sven On Sat, Sep 24, 2016 at 11:16 AM Stef Coene wrote: > Hi, > > When formatting the NDS for a new file system, I noticed a warning about > a maximum size: > > Formatting file system ... > Disks up to size 8.8 TB can be added to storage pool system. > Disks up to size 9.0 TB can be added to storage pool V5000. > > I searched the docs, but I couldn't find any reference regarding the > maximum size of NSDs? > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From oehmes at gmail.com Sun Sep 25 18:11:12 2016 From: oehmes at gmail.com (Sven Oehme) Date: Sun, 25 Sep 2016 17:11:12 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: well, its not that easy and there is no perfect answer here. so lets start with some data points that might help decide: inodes, directory blocks, allocation maps for data as well as metadata don't follow the same restrictions as data 'fragments' or subblocks, means they are not bond to the 1/32 of the blocksize. they rather get organized on calculated sized blocks which can be very small (significant smaller than 1/32th) or close to the max of the blocksize for a single object. therefore the space waste concern doesn't really apply here. policy scans loves larger blocks as the blocks will be randomly scattered across the NSD's and therefore larger contiguous blocks for inode scan will perform significantly faster on larger metadata blocksizes than on smaller (assuming this is disk, with SSD's this doesn't matter that much) so for disk based systems it is advantageous to use larger blocks , for SSD based its less of an issue. you shouldn't choose on the other hand too large blocks even for disk drive based systems as there is one catch to all this. small updates on metadata typically end up writing the whole metadata block e.g. 256k for a directory block which now need to be destaged and read back from another node changing the same block. hope this helps. Sven On Sat, Sep 24, 2016 at 7:18 AM Buterbaugh, Kevin L < Kevin.Buterbaugh at vanderbilt.edu> wrote: > Hi Sven, > > I am confused by your statement that the metadata block size should be 1 > MB and am very interested in learning the rationale behind this as I am > currently looking at all aspects of our current GPFS configuration and the > possibility of making major changes. > > If you have a filesystem with only metadataOnly disks in the system pool > and the default size of an inode is 4K (which we would do, since we have > recently discovered that even on our scratch filesystem we have a bazillion > files that are 4K or smaller and could therefore have their data stored in > the inode, right?), then why would you set the metadata block size to > anything larger than 128K when a sub-block is 1/32nd of a block? I.e., > with a 1 MB block size for metadata wouldn?t you be wasting a massive > amount of space? > > What am I missing / confused about there? > > Oh, and here?s a related question ? let?s just say I have the above > configuration ? my system pool is metadata only and is on SSD?s. Then I > have two other dataOnly pools that are spinning disk. One is for ?regular? > access and the other is the ?capacity? pool ? i.e. a pool of slower storage > where we move files with large access times. I have a policy that says > something like ?move all files with an access time > 6 months to the > capacity pool.? Of those bazillion files less than 4K in size that are > fitting in the inode currently, probably half a bazillion () of them > would be subject to that rule. Will they get moved to the spinning disk > capacity pool or will they stay in the inode?? > > Thanks! This is a very timely and interesting discussion for me as well... 
> > Kevin > > On Sep 23, 2016, at 4:35 PM, Sven Oehme wrote: > > your metadata block size these days should be 1 MB and there are only very > few workloads for which you should run with a filesystem blocksize below 1 > MB. so if you don't know exactly what to pick, 1 MB is a good starting > point. > the general rule still applies that your filesystem blocksize (metadata or > data pool) should match your raid controller (or GNR vdisk) stripe size of > the particular pool. > > so if you use a 128k strip size(defaut in many midrange storage > controllers) in a 8+2p raid array, your stripe or track size is 1 MB and > therefore the blocksize of this pool should be 1 MB. i see many customers > in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or > above and your performance will be significant impacted by that. > > Sven > > ------------------------------------------ > Sven Oehme > Scalable Storage Research > email: oehmes at us.ibm.com > Phone: +1 (408) 824-8904 > IBM Almaden Research Lab > ------------------------------------------ > > Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too > pedantic, but I believe the the subblock size is 1/32 of the block size > (which strengt > > > > From: Stephen Ulmer > To: gpfsug main discussion list > Date: 09/23/2016 12:16 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------ > > > > Not to be too pedantic, but I believe the the subblock size is 1/32 of the > block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but about > metadata block size. You can specify that the system pool have a different > block size from the rest of the filesystem, providing that it ONLY holds > metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems without > some counter-indication), I would think that we?d want to use a metadata > block size of 4K*32=128K. This is independent of the regular block size, > which you can calculate based on the workload if you?re lucky. > > There could be a great reason NOT to use 128K metadata block size, but I > don?t know what it is. I?d be happy to be corrected about this if it?s out > of whack. > > -- > Stephen > > > > On Sep 22, 2016, at 3:37 PM, Luis Bolinches < > *luis.bolinches at fi.ibm.com* > wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small > files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest > that you will actually benefit from smaller than 1MB block, leave there. > GPFS uses sublocks where 1/16th of the BS can be allocated to different > files, so the "waste" is much less than you think on 1MB and you get the > throughput and less structures of much more data blocks. > > No* warranty at all* but I try to do this when the BS talk comes > in: (might need some clean up it could not be last note but you get the > idea) > > POSIX > find . 
-type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the > same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to > inodes (WELL not exactly ... extended attributes! so maybe use a lower > number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and > lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful > machine to run the calculations on octave, I never hit anything could not > manage on a 64GB RAM Power box. Most of the times it is enough with my > laptop. > > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / > Salutations > > Luis Bolinches > Lab Services > *http://www-03.ibm.com/systems/services/labservices/* > > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene <*stef.coene at docum.org* > > Sent by: *gpfsug-discuss-bounces at spectrumscale.org* > > To: gpfsug main discussion list <*gpfsug-discuss at spectrumscale.org* > > > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very > small > > files in the inode. > > > > Typically you want your average file size to be your blocksize > and your > > filesystem has one blocksize and one inodesize. > > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. > > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at *spectrumscale.org* > *http://gpfsug.org/mailman/listinfo/gpfsug-discuss* > > > > > Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL:

From alandhae at gmx.de Mon Sep 26 08:53:48 2016 From: alandhae at gmx.de (=?ISO-8859-15?Q?Andreas_Landh=E4u=DFer?=) Date: Mon, 26 Sep 2016 09:53:48 +0200 (CEST) Subject: [gpfsug-discuss] File-Access Reporting Message-ID:

Hello all GPFS, ehmm, Spectrum Scale experts out there,

we are using GPFS as the filesystem for a new data application. The project has defined a need for reports on transfer volume [or file access]: by user, ..., by service, by product type ... at least on a daily basis. They need a report covering: file open, file close, or requestEndTime, requestDuration, fileProductName [path and filename], dataSize, userId.

I could think of using sysstat (sar) to get some of the numbers, but I am not sure whether the numbers we would receive that way are correct.

Andreas -- Andreas Landhäußer +49 151 12133027 (mobile) alandhae at gmx.de

From alandhae at gmx.de Mon Sep 26 13:12:18 2016 From: alandhae at gmx.de (=?ISO-8859-15?Q?Andreas_Landh=E4u=DFer?=) Date: Mon, 26 Sep 2016 14:12:18 +0200 (CEST) Subject: [gpfsug-discuss] File_heat for GPFS File Systems Questions over Questions ... Message-ID:

Hello GPFS experts,

a customer wants a usage report, including file heat, for a large filesystem. The report should be produced every month.

mmchconfig fileHeatLossPercent=10,fileHeatPeriodMinutes=30240 -i

fileHeatPeriodMinutes=30240 equals 21 days. I'm wondering about the behavior of fileHeatLossPercent: - If it is set to 10, will file heat decrease from 1 to 0 in 10 steps? - Or does file heat decay asymptotically, so that a heat of 0 is never reached? Either way the results will be similar ;-) the latter just takes longer.

We want to produce the following file lists: - File_Heat > 50% -> rather hot data - 20% < File_Heat <= 50% -> lukewarm data - 0% <= File_Heat <= 20% -> ice cold data

We will have to tune the limits between the File_Heat classes according to the customer's wishes. Are there better parameter settings for achieving this? Do any scripts/programs exist for analyzing the file heat data?

We have also observed that policy runs on a large GPFS file system significantly reduce metadata performance until the job is finished; a run took about 15 minutes on an 880 TB GPFS file system with 150 million entries. How does the system behave when file heat is first switched on? Do all files in the GPFS then have the same temperature?
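Just to illustrate what we have in mind for the monthly snapshot (a rough, untested sketch only; the rule name, list name, file names and SHOW() fields are placeholders we made up): a policy list rule such as

RULE 'heatreport' LIST 'allheat' SHOW('heat=' || varchar(FILE_HEAT) || ' uid=' || varchar(USER_ID) || ' kb=' || varchar(KB_ALLOCATED))

driven by something along the lines of

mmapplypolicy gpfs0 -P heatreport.pol -I defer -f /some/scratch/dir/heatreport -L 0

and then post-processing the generated list file(s) with sort/awk to bin the entries into the heat classes above. Whether that is the sensible way to do it is exactly what we would like to know.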
Thanks for your help Ciao Andreas -- Andreas Landh?u?er +49 151 12133027 (mobile) alandhae at gmx.de From makaplan at us.ibm.com Mon Sep 26 16:11:52 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Mon, 26 Sep 2016 11:11:52 -0400 Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: References: Message-ID: fileHeatLossPercent=10, fileHeatPeriodMinutes=1440 means any file that has not been accessed for 1440 minutes (24 hours = 1 day) will lose 10% of its Heat. So if it's heat was X at noon today, tomorrow 0.90 X, the next day 0.81X, on the k'th day (.90)**k * X. After 63 fileHeatPeriods, we always round down and compute file heat as 0.0. The computation (in floating point with some approximations) is done "on demand" based on a heat value stored in the Inode the last time the unix access "atime" and the current time. So the cost of maintaining FILE_HEAT for a file is some bit twiddling, but only when the file is accessed and the atime would be updated in the inode anyway. File heat increases by approximately 1.0 each time the entire file is read from disk. This is done proportionately so if you read in half of the blocks the increase is 0.5. If you read all the blocks twice FROM DISK the file heat is increased by 2. And so on. But only IOPs are charged. If you repeatedly do posix read()s but the data is in cache, no heat is added. The easiest way to observe FILE_HEAT is with the mmapplypolicy directory -I test -L 2 -P fileheatrule.policy RULE 'fileheatrule' LIST 'hot' SHOW('Heat=' || varchar(FILE_HEAT)) /* in file fileheatfule.policy */ Because policy reads metadata from inodes as stored on disk, when experimenting/testing you may need to mmfsctl fs suspend-write; mmfsctl fs resume to see results immediately. From: Andreas Landh?u?er To: gpfsug-discuss at spectrumscale.org Date: 09/26/2016 08:12 AM Subject: [gpfsug-discuss] File_heat for GPFS File Systems Questions over Questions ... Sent by: gpfsug-discuss-bounces at spectrumscale.org Hello GPFS experts, customer wanting a report about the usage of the usage including file_heat in a large Filesystem. The report should be taken every month. mmchconfig fileHeatLossPercent=10,fileHeatPeriodMinutes=30240 -i fileHeatPeriodMinutes=30240 equals to 21 days. I#m wondering about the behavior of fileHeatLossPercent. - If it is set to 10, will file_heat decrease from 1 to 0 in 10 steps? - Or does file_heat have an asymptotic behavior, and heat 0 will never be reached? Anyways the results will be similar ;-) latter taking longer. We want to achieve following file lists: - File_Heat > 50% -> rather hot data - File_Heat 50% < x < 20 -> lukewarm data - File_Heat 20% <= x <= 0% -> ice cold data We will have to work on the limits between the File_Heat classes, depending on customers wishes. Are there better parameter settings for achieving this? Do any scripts/programs exist for analyzing the file_heat data? We have observed when taking policy runs on a large GPFS file system, the meta data performance significantly dropped, until job was finished. It took about 15 minutes on a 880 TB with 150 Mio entries GPFS file system. How is the behavior, when file_heat is being switched on? Do all files in the GPFS have the same temperature? 
Thanks for your help Ciao Andreas -- Andreas Landh?u?er +49 151 12133027 (mobile) alandhae at gmx.de _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Mon Sep 26 19:18:15 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Mon, 26 Sep 2016 11:18:15 -0700 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org><17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: It's important to understand the differences between different metadata types, in particular where it comes to space allocation. System metadata files (inode file, inode and block allocation maps, ACL file, fileset metadata file, EA file in older versions) are allocated at well-defined moments (file system format, new storage pool creation in the case of block allocation map, etc), and those contain multiple records packed into a single block. From the block allocator point of view, the individual metadata record size is invisible, only larger blocks get actually allocated, and space usage efficiency generally isn't an issue. For user metadata (indirect blocks, directory blocks, EA overflow blocks) the situation is different. Those get allocated as the need arises, generally one at a time. So the size of an individual metadata structure matters, a lot. The smallest unit of allocation in GPFS is a subblock (1/32nd of a block). If an IB or a directory block is smaller than a subblock, the unused space in the subblock is wasted. So if one chooses to use, say, 16 MiB block size for metadata, the smallest unit of space that can be allocated is 512 KiB. If one chooses 1 MiB block size, the smallest allocation unit is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with any reasonable data block size); directory blocks used to be limited to 32 KiB, but in the current code can be as large as 256 KiB. As one can observe, using 16 MiB metadata block size would lead to a considerable amount of wasted space for IBs and large directories (small directories can live in inodes). On the other hand, with 1 MiB block size, there'll be no wasted metadata space. Does any of this actually make a practical difference? That depends on the file system composition, namely the number of IBs (which is a function of the number of large files) and larger directories. Calculating this scientifically can be pretty involved, and really should be the job of a tool that ought to exist, but doesn't (yet). A more practical approach is doing a ballpark estimate using local file counts and typical fractions of large files and directories, using statistics available from published papers. The performance implications of a given metadata block size choice is a subject of nearly infinite depth, and this question ultimately can only be answered by doing experiments with a specific workload on specific hardware. The metadata space utilization efficiency is something that can be answered conclusively though. 
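As a purely illustrative back-of-envelope example (the file counts are made up, not taken from any measured system): suppose a file system holds 50 million files each large enough to need one 32 KiB indirect block, plus a few hundred thousand directories too large to fit in their inodes. With a 1 MiB metadata block size the subblock is 32 KiB, each IB fits exactly, and the IBs occupy roughly 50M x 32 KiB, about 1.5 TiB. With a 16 MiB metadata block size the subblock is 512 KiB, so the same IBs occupy roughly 50M x 512 KiB, about 24 TiB, of which about 22.5 TiB is pure padding. The large directories add to the difference in the same proportion. Whether a real installation lands anywhere near these numbers depends entirely on its file and directory size distribution, which is why the estimate has to be redone per system.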
yuri From: "Buterbaugh, Kevin L" To: gpfsug main discussion list , Date: 09/24/2016 07:19 AM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very interested in learning the rationale behind this as I am currently looking at all aspects of our current GPFS configuration and the possibility of making major changes. If you have a filesystem with only metadataOnly disks in the system pool and the default size of an inode is 4K (which we would do, since we have recently discovered that even on our scratch filesystem we have a bazillion files that are 4K or smaller and could therefore have their data stored in the inode, right?), then why would you set the metadata block size to anything larger than 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block size for metadata wouldn?t you be wasting a massive amount of space? What am I missing / confused about there? Oh, and here?s a related question ? let?s just say I have the above configuration ? my system pool is metadata only and is on SSD?s. Then I have two other dataOnly pools that are spinning disk. One is for ?regular? access and the other is the ?capacity? pool ? i.e. a pool of slower storage where we move files with large access times. I have a policy that says something like ?move all files with an access time > 6 months to the capacity pool.? Of those bazillion files less than 4K in size that are fitting in the inode currently, probably half a bazillion () of them would be subject to that rule. Will they get moved to the spinning disk capacity pool or will they stay in the inode?? Thanks! This is a very timely and interesting discussion for me as well... Kevin On Sep 23, 2016, at 4:35 PM, Sven Oehme wrote: your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. Sven ------------------------------------------ Sven Oehme Scalable Storage Research email: oehmes at us.ibm.com Phone: +1 (408) 824-8904 IBM Almaden Research Lab ------------------------------------------ Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengt From: Stephen Ulmer To: gpfsug main discussion list Date: 09/23/2016 12:16 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). 
So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. -- Stephen On Sep 22, 2016, at 3:37 PM, Luis Bolinches < luis.bolinches at fi.ibm.com> wrote: Hi My 2 cents. Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) POSIX find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out GPFS cd /usr/lpp/mmfs/samples/ilm gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile ./mmfind /gpfs/shared -ls -type f > find_ls_files.out CONVERT to CSV POSIX cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv GPFS cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv LOAD in octave FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); Clean the second column (OPTIONAL as the next clean up will do the same) FILESIZE(:,[2]) = []; If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) FILESIZE(FILESIZE<=3584) =[]; If we are not we need to clean the 0 size files FILESIZE(FILESIZE==0) =[]; Median FILESIZEMEDIAN = int32 (median (FILESIZE)) Mean FILESIZEMEAN = int32 (mean (FILESIZE)) Variance int32 (var (FILESIZE)) iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. int32 (iqr (FILESIZE)) Standard deviation For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. -- Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations Luis Bolinches Lab Services http://www-03.ibm.com/systems/services/labservices/ IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland Phone: +358 503112585 "If you continually give you will continually have." Anonymous ----- Original message ----- From: Stef Coene Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list < gpfsug-discuss at spectrumscale.org> Cc: Subject: Re: [gpfsug-discuss] Blocksize Date: Thu, Sep 22, 2016 10:30 PM On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. 
So I calculated with IBM that a 1 MB or 2 MB block size is best. But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Ellei edell? ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From ulmer at ulmer.org Mon Sep 26 20:01:56 2016 From: ulmer at ulmer.org (Stephen Ulmer) Date: Mon, 26 Sep 2016 15:01:56 -0400 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> Now I?ve got anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: There are items OTHER THAN INODES stored as metadata in GPFS. These items have a VARIETY OF SIZES, but are packed in such a way that we should just not worry about wasted space unless we pick a LARGE metadata block size ? or if we don?t pick a ?reasonable? metadata block size after picking a ?large? file system block size that applies to both. Performance is hard, and the gain from calculating exactly the best metadata block size is much smaller than performance gains attained through code optimization. If we were to try and calculate the appropriate metadata block size we would likely be wrong anyway, since none of us get our data at the idealized physics shop that sells massless rulers and frictionless pulleys. We should probably all use a metadata block size around 1MB. Nobody has said this outright, but it?s been the example as the ?good? size at least three times in this thread. Under no circumstances should we do what many of us would have done and pick 128K, which made sense based on all of our previous education that is no longer applicable. Did I miss anything? :) Liberty, -- Stephen > On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev wrote: > > It's important to understand the differences between different metadata types, in particular where it comes to space allocation. > > System metadata files (inode file, inode and block allocation maps, ACL file, fileset metadata file, EA file in older versions) are allocated at well-defined moments (file system format, new storage pool creation in the case of block allocation map, etc), and those contain multiple records packed into a single block. 
From the block allocator point of view, the individual metadata record size is invisible, only larger blocks get actually allocated, and space usage efficiency generally isn't an issue. > > For user metadata (indirect blocks, directory blocks, EA overflow blocks) the situation is different. Those get allocated as the need arises, generally one at a time. So the size of an individual metadata structure matters, a lot. The smallest unit of allocation in GPFS is a subblock (1/32nd of a block). If an IB or a directory block is smaller than a subblock, the unused space in the subblock is wasted. So if one chooses to use, say, 16 MiB block size for metadata, the smallest unit of space that can be allocated is 512 KiB. If one chooses 1 MiB block size, the smallest allocation unit is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with any reasonable data block size); directory blocks used to be limited to 32 KiB, but in the current code can be as large as 256 KiB. As one can observe, using 16 MiB metadata block size would lead to a considerable amount of wasted space for IBs and large directories (small directories can live in inodes). On the other hand, with 1 MiB block size, there'll be no wasted metadata space. Does any of this actually make a practical difference? That depends on the file system composition, namely the number of IBs (which is a function of the number of large files) and larger directories. Calculating this scientifically can be pretty involved, and really should be the job of a tool that ought to exist, but doesn't (yet). A more practical approach is doing a ballpark estimate using local file counts and typical fractions of large files and directories, using statistics available from published papers. > > The performance implications of a given metadata block size choice is a subject of nearly infinite depth, and this question ultimately can only be answered by doing experiments with a specific workload on specific hardware. The metadata space utilization efficiency is something that can be answered conclusively though. > > yuri > > "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very int > > From: "Buterbaugh, Kevin L" > To: gpfsug main discussion list , > Date: 09/24/2016 07:19 AM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Hi Sven, > > I am confused by your statement that the metadata block size should be 1 MB and am very interested in learning the rationale behind this as I am currently looking at all aspects of our current GPFS configuration and the possibility of making major changes. > > If you have a filesystem with only metadataOnly disks in the system pool and the default size of an inode is 4K (which we would do, since we have recently discovered that even on our scratch filesystem we have a bazillion files that are 4K or smaller and could therefore have their data stored in the inode, right?), then why would you set the metadata block size to anything larger than 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block size for metadata wouldn?t you be wasting a massive amount of space? > > What am I missing / confused about there? > > Oh, and here?s a related question ? let?s just say I have the above configuration ? my system pool is metadata only and is on SSD?s. Then I have two other dataOnly pools that are spinning disk. One is for ?regular? access and the other is the ?capacity? 
pool ? i.e. a pool of slower storage where we move files with large access times. I have a policy that says something like ?move all files with an access time > 6 months to the capacity pool.? Of those bazillion files less than 4K in size that are fitting in the inode currently, probably half a bazillion () of them would be subject to that rule. Will they get moved to the spinning disk capacity pool or will they stay in the inode?? > > Thanks! This is a very timely and interesting discussion for me as well... > > Kevin > On Sep 23, 2016, at 4:35 PM, Sven Oehme > wrote: > your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. > the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. > > so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. > > Sven > > ------------------------------------------ > Sven Oehme > Scalable Storage Research > email: oehmes at us.ibm.com > Phone: +1 (408) 824-8904 > IBM Almaden Research Lab > ------------------------------------------ > > Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengt > > From: Stephen Ulmer > > To: gpfsug main discussion list > > Date: 09/23/2016 12:16 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. > > There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. > > -- > Stephen > > On Sep 22, 2016, at 3:37 PM, Luis Bolinches > wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. > > No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) > > POSIX > find . 
-type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. > > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene > > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very small > > files in the inode. > > > > Typically you want your average file size to be your blocksize and your > > filesystem has one blocksize and one inodesize. > > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. > > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Mon Sep 26 20:29:18 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Mon, 26 Sep 2016 12:29:18 -0700 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org><17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> Message-ID: I would put the net summary this way: in GPFS, the "Goldilocks zone" for metadata block size is 256K - 1M. If one plans to create a new file system using GPFS V4.2+, 1M is a sound choice. In an ideal world, block size choice shouldn't really be a choice. It's a low-level implementation detail that one day should go the way of the manual ignition timing adjustment -- something that used to be necessary in the olden days, and something that select enthusiasts like to tweak to this day, but something that's irrelevant for the overwhelming majority of the folks who just want the engine to run. There's work being done in that general direction in GPFS, but we aren't there yet. yuri From: Stephen Ulmer To: gpfsug main discussion list , Date: 09/26/2016 12:02 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Now I?ve got anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: There are items OTHER THAN INODES stored as metadata in GPFS. These items have a VARIETY OF SIZES, but are packed in such a way that we should just not worry about wasted space unless we pick a LARGE metadata block size ? or if we don?t pick a ?reasonable? metadata block size after picking a ?large? file system block size that applies to both. Performance is hard, and the gain from calculating exactly the best metadata block size is much smaller than performance gains attained through code optimization. If we were to try and calculate the appropriate metadata block size we would likely be wrong anyway, since none of us get our data at the idealized physics shop that sells massless rulers and frictionless pulleys. We should probably all use a metadata block size around 1MB. Nobody has said this outright, but it?s been the example as the ?good? size at least three times in this thread. Under no circumstances should we do what many of us would have done and pick 128K, which made sense based on all of our previous education that is no longer applicable. Did I miss anything? 
:) Liberty, -- Stephen On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev wrote: It's important to understand the differences between different metadata types, in particular where it comes to space allocation. System metadata files (inode file, inode and block allocation maps, ACL file, fileset metadata file, EA file in older versions) are allocated at well-defined moments (file system format, new storage pool creation in the case of block allocation map, etc), and those contain multiple records packed into a single block. From the block allocator point of view, the individual metadata record size is invisible, only larger blocks get actually allocated, and space usage efficiency generally isn't an issue. For user metadata (indirect blocks, directory blocks, EA overflow blocks) the situation is different. Those get allocated as the need arises, generally one at a time. So the size of an individual metadata structure matters, a lot. The smallest unit of allocation in GPFS is a subblock (1/32nd of a block). If an IB or a directory block is smaller than a subblock, the unused space in the subblock is wasted. So if one chooses to use, say, 16 MiB block size for metadata, the smallest unit of space that can be allocated is 512 KiB. If one chooses 1 MiB block size, the smallest allocation unit is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with any reasonable data block size); directory blocks used to be limited to 32 KiB, but in the current code can be as large as 256 KiB. As one can observe, using 16 MiB metadata block size would lead to a considerable amount of wasted space for IBs and large directories (small directories can live in inodes). On the other hand, with 1 MiB block size, there'll be no wasted metadata space. Does any of this actually make a practical difference? That depends on the file system composition, namely the number of IBs (which is a function of the number of large files) and larger directories. Calculating this scientifically can be pretty involved, and really should be the job of a tool that ought to exist, but doesn't (yet). A more practical approach is doing a ballpark estimate using local file counts and typical fractions of large files and directories, using statistics available from published papers. The performance implications of a given metadata block size choice is a subject of nearly infinite depth, and this question ultimately can only be answered by doing experiments with a specific workload on specific hardware. The metadata space utilization efficiency is something that can be answered conclusively though. yuri "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very int From: "Buterbaugh, Kevin L" To: gpfsug main discussion list , Date: 09/24/2016 07:19 AM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very interested in learning the rationale behind this as I am currently looking at all aspects of our current GPFS configuration and the possibility of making major changes. 
If you have a filesystem with only metadataOnly disks in the system pool and the default size of an inode is 4K (which we would do, since we have recently discovered that even on our scratch filesystem we have a bazillion files that are 4K or smaller and could therefore have their data stored in the inode, right?), then why would you set the metadata block size to anything larger than 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block size for metadata wouldn?t you be wasting a massive amount of space? What am I missing / confused about there? Oh, and here?s a related question ? let?s just say I have the above configuration ? my system pool is metadata only and is on SSD?s. Then I have two other dataOnly pools that are spinning disk. One is for ?regular? access and the other is the ?capacity? pool ? i.e. a pool of slower storage where we move files with large access times. I have a policy that says something like ?move all files with an access time > 6 months to the capacity pool.? Of those bazillion files less than 4K in size that are fitting in the inode currently, probably half a bazillion () of them would be subject to that rule. Will they get moved to the spinning disk capacity pool or will they stay in the inode?? Thanks! This is a very timely and interesting discussion for me as well... Kevin On Sep 23, 2016, at 4:35 PM, Sven Oehme < oehmes at us.ibm.com> wrote: your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. Sven ------------------------------------------ Sven Oehme Scalable Storage Research email: oehmes at us.ibm.com Phone: +1 (408) 824-8904 IBM Almaden Research Lab ------------------------------------------ Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengt From: Stephen Ulmer To: gpfsug main discussion list < gpfsug-discuss at spectrumscale.org> Date: 09/23/2016 12:16 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. 
I?d be happy to be corrected about this if it?s out of whack. -- Stephen On Sep 22, 2016, at 3:37 PM, Luis Bolinches < luis.bolinches at fi.ibm.com> wrote: Hi My 2 cents. Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) POSIX find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out GPFS cd /usr/lpp/mmfs/samples/ilm gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile ./mmfind /gpfs/shared -ls -type f > find_ls_files.out CONVERT to CSV POSIX cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv GPFS cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv LOAD in octave FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); Clean the second column (OPTIONAL as the next clean up will do the same) FILESIZE(:,[2]) = []; If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) FILESIZE(FILESIZE<=3584) =[]; If we are not we need to clean the 0 size files FILESIZE(FILESIZE==0) =[]; Median FILESIZEMEDIAN = int32 (median (FILESIZE)) Mean FILESIZEMEAN = int32 (mean (FILESIZE)) Variance int32 (var (FILESIZE)) iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. int32 (iqr (FILESIZE)) Standard deviation For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. -- Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations Luis Bolinches Lab Services http://www-03.ibm.com/systems/services/labservices/ IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland Phone: +358 503112585 "If you continually give you will continually have." Anonymous ----- Original message ----- From: Stef Coene < stef.coene at docum.org> Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list < gpfsug-discuss at spectrumscale.org> Cc: Subject: Re: [gpfsug-discuss] Blocksize Date: Thu, Sep 22, 2016 10:30 PM On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. So I calculated with IBM that a 1 MB or 2 MB block size is best. But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From alandhae at gmx.de Tue Sep 27 10:04:02 2016 From: alandhae at gmx.de (=?ISO-8859-15?Q?Andreas_Landh=E4u=DFer?=) Date: Tue, 27 Sep 2016 11:04:02 +0200 (CEST) Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: References: Message-ID: On Mon, 26 Sep 2016, Marc A Kaplan wrote: Marc, thanks for your explanation, > fileHeatLossPercent=10, fileHeatPeriodMinutes=1440 > > means any file that has not been accessed for 1440 minutes (24 hours = 1 > day) will lose 10% of its Heat. > > So if it's heat was X at noon today, tomorrow 0.90 X, the next day 0.81X, > on the k'th day (.90)**k * X. > After 63 fileHeatPeriods, we always round down and compute file heat as > 0.0. > > The computation (in floating point with some approximations) is done "on > demand" based on a heat value stored in the Inode the last time the unix > access "atime" and the current time. So the cost of maintaining > FILE_HEAT for a file is some bit twiddling, but only when the file is > accessed and the atime would be updated in the inode anyway. > > File heat increases by approximately 1.0 each time the entire file is read > from disk. This is done proportionately so if you read in half of the > blocks the increase is 0.5. > If you read all the blocks twice FROM DISK the file heat is increased by > 2. And so on. But only IOPs are charged. If you repeatedly do posix > read()s but the data is in cache, no heat is added. with the above definition file heat >= 0.0 e.g. any positive floating point value is valid. I need to categorize the files into categories hot, warm, lukewarm and cold. How do I achieve this, since the maximum heat is varying and need to be defined every time when requesting the report. 
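One approach we are considering (only an untested sketch; the file names and the 50%/20% cut-offs are placeholders, and the exact naming and format of the mmapplypolicy list output would have to be checked first) is a two-pass report: first dump FILE_HEAT for every file with a LIST rule as sketched earlier, then scale against the maximum observed in that run, e.g.:

# extract the heat values from the generated list file (assumes SHOW() wrote a "heat=" token)
sed -n 's/.*heat=\([0-9.eE+-]*\).*/\1/p' /tmp/heatreport.list.allheat > /tmp/heat.values
# current maximum heat in this run
MAXHEAT=$(sort -g /tmp/heat.values | tail -1)
# bin files relative to the maximum: >50% hot, 20-50% lukewarm, the rest cold
awk -v max="$MAXHEAT" '{ r = (max > 0) ? $1 / max : 0;
  if (r > 0.5) hot++; else if (r > 0.2) warm++; else cold++ }
  END { printf "hot %d lukewarm %d cold %d\n", hot, warm, cold }' /tmp/heat.values

That would at least give class boundaries that follow the changing maximum without a second pass over the filesystem, but if there is a cleaner way we would prefer that.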
We are wishing to migrate data according to the heat onto different storage categories (expensive --> cheap devices) > The easiest way to observe FILE_HEAT is with the mmapplypolicy directory > -I test -L 2 -P fileheatrule.policy > > RULE 'fileheatrule' LIST 'hot' SHOW('Heat=' || varchar(FILE_HEAT)) /* in > file fileheatfule.policy */ > > Because policy reads metadata from inodes as stored on disk, when > experimenting/testing you may need to > > mmfsctl fs suspend-write; mmfsctl fs resume Doing this on a production file system, a valid change request need to be filed, and description of the risks for customers data and so on have to be defined (ITIL) ... Any help and ideas will be appreciated Andreas -- Andreas Landh?u?er +49 151 12133027 (mobile) alandhae at gmx.de From makaplan at us.ibm.com Tue Sep 27 15:25:04 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Tue, 27 Sep 2016 10:25:04 -0400 Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: References: Message-ID: You asked ... "We are wishing to migrate data according to the heat onto different storage categories (expensive --> cheap devices)" We suggest a policy rule like this: Rule 'm' Migrate From Pool 'Expensive' To Pool 'Thrifty' Threshold(90,75) Weight(-FILE_HEAT) /* minus sign! */ Which you can interpret as: When The 'Expensive' pool is 90% or more full, Migrate the lowest heat (coldest!) files to pool 'Thrifty', until the occupancy of 'Expensive' has been reduced to 75%. The concepts of Threshold and Weight have been in the produce since the MIGRATE rule was introduced. Another concept we introduced at the same time as FILE_HEAT was GROUP POOL. We've had little feedback and very few questions about this, so either it works great or is not being used much. (Maybe both are true ;-) ) GROUP POOL migration is documented in the Information Lifecycle Management chapter along with the other elements of the policy rules. In the 4.2.1 doc we suggest you can "repack" several pools with one GROUP POOL rule and one MIGRATE rule like this: You can ?repack? a group pool by WEIGHT. Migrate files of higher weight to preferred disk pools by specifying a group pool as both the source and the target of a MIGRATE rule. rule ?grpdef? GROUP POOL ?gpool? IS ?ssd? LIMIT(90) THEN ?fast? LIMIT(85) THEN ?sata? rule ?repack? MIGRATE FROM POOL ?gpool? TO POOL ?gpool? WEIGHT(FILE_HEAT) This should rank all the files in the three pools from hottest to coldest, and migrate them as necessary (if feasible) so that 'ssd' is up to 90% full of the hottest, 'fast' is up to 85% full of the next most hot, and the coolest files will be migrated to 'sata'. -------------- next part -------------- An HTML attachment was scrubbed... URL: From Kevin.Buterbaugh at Vanderbilt.Edu Tue Sep 27 18:02:45 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Tue, 27 Sep 2016 17:02:45 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> Message-ID: <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> Yuri / Sven / anyone else who wants to jump in, First off, thank you very much for your answers. I?d like to follow up with a couple of more questions. 1) Let?s assume that our overarching goal in configuring the block size for metadata is performance from the user perspective ? i.e. how fast is an ?ls -l? on my directory? 
Space savings aren?t important, and how long policy scans or other ?administrative? type tasks take is not nearly as important as that directory listing. Does that change the recommended metadata block size? 2) Let?s assume we have 3 filesystems, /home, /scratch (traditional HPC use for those two) and /data (project space). Our storage arrays are 24-bay units with two 8+2P RAID 6 LUNs, one RAID 1 mirror, and two hot spare drives. The RAID 1 mirrors are for /home, the RAID 6 LUNs are for /scratch or /data. /home has tons of small files - so small that a 64K block size is currently used. /scratch and /data have a mixture, but a 1 MB block size is the ?sweet spot? there. If you could ?start all over? with the same hardware being the only restriction, would you: a) merge /scratch and /data into one filesystem but keep /home separate since the LUN sizes are so very different, or b) merge all three into one filesystem and use storage pools so that /home is just a separate pool within the one filesystem? And if you chose this option would you assign different block sizes to the pools? Again, I?m asking these questions because I may have the opportunity to effectively ?start all over? and want to make sure I?m doing things as optimally as possible. Thanks? Kevin On Sep 26, 2016, at 2:29 PM, Yuri L Volobuev > wrote: I would put the net summary this way: in GPFS, the "Goldilocks zone" for metadata block size is 256K - 1M. If one plans to create a new file system using GPFS V4.2+, 1M is a sound choice. In an ideal world, block size choice shouldn't really be a choice. It's a low-level implementation detail that one day should go the way of the manual ignition timing adjustment -- something that used to be necessary in the olden days, and something that select enthusiasts like to tweak to this day, but something that's irrelevant for the overwhelming majority of the folks who just want the engine to run. There's work being done in that general direction in GPFS, but we aren't there yet. yuri Stephen Ulmer ---09/26/2016 12:02:25 PM---Now I?ve got anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: From: Stephen Ulmer > To: gpfsug main discussion list >, Date: 09/26/2016 12:02 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Now I?ve got anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: * There are items OTHER THAN INODES stored as metadata in GPFS. * These items have a VARIETY OF SIZES, but are packed in such a way that we should just not worry about wasted space unless we pick a LARGE metadata block size ? or if we don?t pick a ?reasonable? metadata block size after picking a ?large? file system block size that applies to both. * Performance is hard, and the gain from calculating exactly the best metadata block size is much smaller than performance gains attained through code optimization. * If we were to try and calculate the appropriate metadata block size we would likely be wrong anyway, since none of us get our data at the idealized physics shop that sells massless rulers and frictionless pulleys. * We should probably all use a metadata block size around 1MB. Nobody has said this outright, but it?s been the example as the ?good? size at least three times in this thread. * Under no circumstances should we do what many of us would have done and pick 128K, which made sense based on all of our previous education that is no longer applicable. Did I miss anything? 
:) Liberty, -- Stephen On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev > wrote: It's important to understand the differences between different metadata types, in particular where it comes to space allocation. System metadata files (inode file, inode and block allocation maps, ACL file, fileset metadata file, EA file in older versions) are allocated at well-defined moments (file system format, new storage pool creation in the case of block allocation map, etc), and those contain multiple records packed into a single block. From the block allocator point of view, the individual metadata record size is invisible, only larger blocks get actually allocated, and space usage efficiency generally isn't an issue. For user metadata (indirect blocks, directory blocks, EA overflow blocks) the situation is different. Those get allocated as the need arises, generally one at a time. So the size of an individual metadata structure matters, a lot. The smallest unit of allocation in GPFS is a subblock (1/32nd of a block). If an IB or a directory block is smaller than a subblock, the unused space in the subblock is wasted. So if one chooses to use, say, 16 MiB block size for metadata, the smallest unit of space that can be allocated is 512 KiB. If one chooses 1 MiB block size, the smallest allocation unit is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with any reasonable data block size); directory blocks used to be limited to 32 KiB, but in the current code can be as large as 256 KiB. As one can observe, using 16 MiB metadata block size would lead to a considerable amount of wasted space for IBs and large directories (small directories can live in inodes). On the other hand, with 1 MiB block size, there'll be no wasted metadata space. Does any of this actually make a practical difference? That depends on the file system composition, namely the number of IBs (which is a function of the number of large files) and larger directories. Calculating this scientifically can be pretty involved, and really should be the job of a tool that ought to exist, but doesn't (yet). A more practical approach is doing a ballpark estimate using local file counts and typical fractions of large files and directories, using statistics available from published papers. The performance implications of a given metadata block size choice is a subject of nearly infinite depth, and this question ultimately can only be answered by doing experiments with a specific workload on specific hardware. The metadata space utilization efficiency is something that can be answered conclusively though. yuri "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very int From: "Buterbaugh, Kevin L" > To: gpfsug main discussion list >, Date: 09/24/2016 07:19 AM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very interested in learning the rationale behind this as I am currently looking at all aspects of our current GPFS configuration and the possibility of making major changes. 
If you have a filesystem with only metadataOnly disks in the system pool and the default size of an inode is 4K (which we would do, since we have recently discovered that even on our scratch filesystem we have a bazillion files that are 4K or smaller and could therefore have their data stored in the inode, right?), then why would you set the metadata block size to anything larger than 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block size for metadata wouldn?t you be wasting a massive amount of space? What am I missing / confused about there? Oh, and here?s a related question ? let?s just say I have the above configuration ? my system pool is metadata only and is on SSD?s. Then I have two other dataOnly pools that are spinning disk. One is for ?regular? access and the other is the ?capacity? pool ? i.e. a pool of slower storage where we move files with large access times. I have a policy that says something like ?move all files with an access time > 6 months to the capacity pool.? Of those bazillion files less than 4K in size that are fitting in the inode currently, probably half a bazillion () of them would be subject to that rule. Will they get moved to the spinning disk capacity pool or will they stay in the inode?? Thanks! This is a very timely and interesting discussion for me as well... Kevin On Sep 23, 2016, at 4:35 PM, Sven Oehme > wrote: your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. Sven ------------------------------------------ Sven Oehme Scalable Storage Research email: oehmes at us.ibm.com Phone: +1 (408) 824-8904 IBM Almaden Research Lab ------------------------------------------ Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengt From: Stephen Ulmer > To: gpfsug main discussion list > Date: 09/23/2016 12:16 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. 
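For concreteness, all of the knobs being debated here (data block size, metadata block size, inode size) are fixed when the file system is created; a rough sketch of where they get set, with purely illustrative names and values that just echo the numbers discussed above, would be:

mmcrfs gpfs1 -F nsd_stanzas.txt -B 1M --metadata-block-size 256K -i 4096 -m 2 -M 2 -r 1 -R 2

None of -B, --metadata-block-size or -i can be changed afterwards, which is why this thread keeps coming back to "if I could start all over".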
-- Stephen On Sep 22, 2016, at 3:37 PM, Luis Bolinches > wrote: Hi My 2 cents. Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) POSIX find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out GPFS cd /usr/lpp/mmfs/samples/ilm gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile ./mmfind /gpfs/shared -ls -type f > find_ls_files.out CONVERT to CSV POSIX cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv GPFS cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv LOAD in octave FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); Clean the second column (OPTIONAL as the next clean up will do the same) FILESIZE(:,[2]) = []; If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) FILESIZE(FILESIZE<=3584) =[]; If we are not we need to clean the 0 size files FILESIZE(FILESIZE==0) =[]; Median FILESIZEMEDIAN = int32 (median (FILESIZE)) Mean FILESIZEMEAN = int32 (mean (FILESIZE)) Variance int32 (var (FILESIZE)) iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. int32 (iqr (FILESIZE)) Standard deviation For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. -- Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations Luis Bolinches Lab Services http://www-03.ibm.com/systems/services/labservices/ IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland Phone: +358 503112585 "If you continually give you will continually have." Anonymous ----- Original message ----- From: Stef Coene > Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list > Cc: Subject: Re: [gpfsug-discuss] Blocksize Date: Thu, Sep 22, 2016 10:30 PM On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. So I calculated with IBM that a 1 MB or 2 MB block size is best. But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland

_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss

-------------- next part -------------- An HTML attachment was scrubbed... URL:

From makaplan at us.ibm.com Tue Sep 27 18:16:52 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Tue, 27 Sep 2016 13:16:52 -0400 Subject: [gpfsug-discuss] Blocksize, yea, inode size! In-Reply-To: <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org><17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org><13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> Message-ID:

Inode size will be a crucial choice in the scenario you describe. Consider the conflict: a large inode can hold a complete file or a complete directory. But the bigger the inode size, the fewer inodes fit in any given block size -- so when you have to read several inodes ... more IO, and it is less likely that the inodes you want are in the same block.

-------------- next part -------------- An HTML attachment was scrubbed... URL:

From chekh at stanford.edu Tue Sep 27 18:23:34 2016 From: chekh at stanford.edu (Alex Chekholko) Date: Tue, 27 Sep 2016 10:23:34 -0700 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> Message-ID:

On 09/27/2016 10:02 AM, Buterbaugh, Kevin L wrote:
> 1) Let's assume that our overarching goal in configuring the block size
> for metadata is performance from the user perspective - i.e. how fast is
> an "ls -l" on my directory? Space savings aren't important, and how
> long policy scans or other "administrative" type tasks take is not
> nearly as important as that directory listing. Does that change the
> recommended metadata block size?

You need to put your metadata on SSDs. Make your SSDs the only members in your 'system' pool and put your other devices into another pool, and make that pool 'dataOnly'.
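In NSD-stanza terms that split looks roughly like the sketch below (NSD, device and server names are invented; the usage= and pool= columns are the point):

%nsd: nsd=md_ssd_01 device=/dev/sdb servers=nsd01,nsd02 usage=metadataOnly failureGroup=1 pool=system
%nsd: nsd=md_ssd_02 device=/dev/sdc servers=nsd02,nsd01 usage=metadataOnly failureGroup=2 pool=system
%nsd: nsd=data_sas_01 device=/dev/sdd servers=nsd01,nsd02 usage=dataOnly failureGroup=1 pool=data
%nsd: nsd=data_sas_02 device=/dev/sde servers=nsd02,nsd01 usage=dataOnly failureGroup=2 pool=data

Feed that file to mmcrnsd -F and then to mmcrfs -F. Once a data pool other than 'system' exists you also need a default placement rule installed with mmchpolicy, e.g. RULE 'default' SET POOL 'data', otherwise new file data will try to land in the (metadata-only) system pool.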
If your SSDs are large enough to also hold some data, that's great; I typically do a migration policy to copy files smaller than filesystem block size (or definitely smaller than sub-block size) to the SSDs. Also, files smaller than 4k will usually fit into the inode (if you are using the 4k inode size). I have a system where the SSDs are regularly doing 6-7k IOPS for metadata stuff. If those same 7k IOPS were spread out over the slow data LUNs... which only have like 100 IOPS per 8+2P LUN... I'd be consuming 700 disks just for metadata IOPS. -- Alex Chekholko chekh at stanford.edu From kevindjo at us.ibm.com Tue Sep 27 18:33:29 2016 From: kevindjo at us.ibm.com (Kevin D Johnson) Date: Tue, 27 Sep 2016 17:33:29 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> Message-ID: An HTML attachment was scrubbed... URL: From alandhae at gmx.de Tue Sep 27 19:04:06 2016 From: alandhae at gmx.de (=?UTF-8?Q?Andreas_Landh=c3=a4u=c3=9fer?=) Date: Tue, 27 Sep 2016 20:04:06 +0200 Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: References: Message-ID: as far as I understand, if a file gets hot again, there is no rule for putting the file back into a faster storage device? We would like having something like a storage elevator depending on the fileheat. In our setup, customer likes to migrate/move data even when the the threshold is not hit, just because it's cold and the price of the storage is less. On 27.09.2016 16:25, Marc A Kaplan wrote: > > You asked ... "We are wishing to migrate data according to the heat > onto different > storage categories (expensive --> cheap devices)" > > > We suggest a policy rule like this: > > Rule 'm' Migrate From Pool 'Expensive' To Pool 'Thrifty' > Threshold(90,75) Weight(-FILE_HEAT) /* minus sign! */ > > > Which you can interpret as: > > When The 'Expensive' pool is 90% or more full, Migrate the lowest heat > (coldest!) files to pool 'Thrifty', until > the occupancy of 'Expensive' has been reduced to 75%. > > The concepts of Threshold and Weight have been in the produce since > the MIGRATE rule was introduced. > > Another concept we introduced at the same time as FILE_HEAT was GROUP > POOL. We've had little feedback and very > few questions about this, so either it works great or is not being > used much. (Maybe both are true ;-) ) > > GROUP POOL migration is documented in the Information Lifecycle > Management chapter along with the other elements of the policy rules. > > In the 4.2.1 doc we suggest you can "repack" several pools with one > GROUP POOL rule and one MIGRATE rule like this: > > You can ?repack? a group pool by *WEIGHT*. Migrate files of higher > weight to preferred disk pools > by specifying a group pool as both the source and the target of a > *MIGRATE *rule. > > rule ?grpdef? GROUP POOL ?gpool? IS ?ssd? LIMIT(90) THEN ?fast? > LIMIT(85) THEN ?sata? > rule ?repack? MIGRATE FROM POOL ?gpool? TO POOL ?gpool? WEIGHT(FILE_HEAT) > > > This should rank all the files in the three pools from hottest to > coldest, and migrate them > as necessary (if feasible) so that 'ssd' is up to 90% full of the > hottest, 'fast' is up to 85% full of the next > most hot, and the coolest files will be migrated to 'sata'. > > > > -- Andreas Landh?u?er +49 151 12133027 (mobile) alandhae at gmx.de -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From Robert.Oesterlin at nuance.com Tue Sep 27 19:12:19 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Tue, 27 Sep 2016 18:12:19 +0000 Subject: [gpfsug-discuss] File_heat for GPFS File Systems Message-ID: <0217AC60-11F0-4CEB-AE91-22D25E4649DC@nuance.com> Sure, if you use a policy to migrate between two tiers, it will move files up or down based on heat. Something like this (flas and disk pools): rule grpdef GROUP POOL gpool IS flash LIMIT(75) THEN Disk rule repack MIGRATE FROM POOL gpool TO POOL gpool WEIGHT(FILE_HEAT) Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid 507-269-0413 From: on behalf of Andreas Landh?u?er Reply-To: gpfsug main discussion list Date: Tuesday, September 27, 2016 at 1:04 PM To: Marc A Kaplan , gpfsug main discussion list Subject: [EXTERNAL] Re: [gpfsug-discuss] File_heat for GPFS File Systems as far as I understand, if a file gets hot again, there is no rule for putting the file back into a faster storage device? -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Tue Sep 27 19:26:46 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Tue, 27 Sep 2016 11:26:46 -0700 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org><17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org><13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> Message-ID: > 1) Let?s assume that our overarching goal in configuring the block > size for metadata is performance from the user perspective ? i.e. > how fast is an ?ls -l? on my directory? Space savings aren?t > important, and how long policy scans or other ?administrative? type > tasks take is not nearly as important as that directory listing. > Does that change the recommended metadata block size? The performance challenges for the "ls -l" scenario are quite different from the policy scan scenario, so the same rules do not necessarily apply. During "ls -l" the code has to read inodes one by one (there's some prefetching going on, to take the edge off for the actual 'ls' thread, but prefetching is still one inode at a time). Metadata block size doesn't really come into the picture in this case, but inode size can be important -- depending on the storage performance characteristics. Does the storage you use for metadata exhibit a meaningfully different latency for 4K random reads vs 512 byte random reads? In my personal experience, on any modern storage device the difference is non-existent; in fact many devices (like all flash-based storage) use 4K native physical block size, and merely emulate 512 byte "sectors", so there's no way to read less than 4K. So from the inode read latency point of view 4K vs 512B is most likely a wash, but then 4K inodes can help improve performance of other operations, e.g. readdir of a small directory which fits entirely into the inode. If you use xattrs (e.g. as a side effect of using HSM), 4K inodes definitely help, but allowing xattrs to be stored in the inode. Policy scans reads inodes in full blocks, and there both metadata block size and inode size matter. Larger blocks could improve the inode read performance, while larger inodes mean that the same number of blocks hold fewer inodes and thus more blocks need to be read. So the policy inode scan performance is benefited by larger metadata block size and smaller inodes. 
However, policy scans also have to perform a directory traversal step, and that step tends to dominate the runtime of the overall run, and using larger inodes actually helps to speed up traversal of smaller directories. So whether larger inodes help or hurt the policy scan performance depends, yet again, on your file system composition. Overall, I believe that with all angles considered, larger inodes help with performance, and that was one of the considerations for making 4K inodes the default in V4.2+ versions. > 2) Let?s assume we have 3 filesystems, /home, /scratch (traditional > HPC use for those two) and /data (project space). Our storage > arrays are 24-bay units with two 8+2P RAID 6 LUNs, one RAID 1 > mirror, and two hot spare drives. The RAID 1 mirrors are for /home, > the RAID 6 LUNs are for /scratch or /data. /home has tons of small > files - so small that a 64K block size is currently used. /scratch > and /data have a mixture, but a 1 MB block size is the ?sweet spot? there. > > If you could ?start all over? with the same hardware being the only > restriction, would you: > > a) merge /scratch and /data into one filesystem but keep /home > separate since the LUN sizes are so very different, or > b) merge all three into one filesystem and use storage pools so that > /home is just a separate pool within the one filesystem? And if you > chose this option would you assign different block sizes to the pools? It's not possible to have different block sizes for different data pools. We are very aware that many people would like to be able to do just that, but this is counter to where the product is going. Supporting different block sizes for different pools is actually pretty hard: it's tricky to describe a large file that has some blocks in poolA and some in poolB where poolB has a different block size (perhaps during a migration) with the existing inode/indirect block format where each disk address pointer addresses a block of fixed size. With some effort, and some changes to how block addressing works, it would be possible to implement the support for this. However, as I mentioned in another post in this thread, we don't really want to glorify manual block size selection any further, we want to move away from it, by addressing the reasons that drive different block size selection today (like disk space utilization and performance). I'd recommend calculating a file size distribution histogram for your file systems. You may, for example, discover that 80% of the small files you have in /home would fit into 4K inodes, and then the storage space efficiency gains for the remaining 20% don't justify the complexity of managing an extra file system with a small block size. We don't recommend using block sizes smaller than 256K, because smaller block size is not good for disk space allocation code efficiency. It's a quadratic dependency: with a smaller block size, one block worth of the block allocation map covers that much less disk space, because each bit in the map covers fewer disk sectors, and fewer bits fit into a block. This means having to create a lot more block allocation map segments than what is needed for an ample level of parallelism. This hurts performance of many block allocation-related operations. I don't see a reason for /scratch and /data to be separate file systems, aside from perhaps failure domain considerations. 
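On the file-size-histogram suggestion a few paragraphs up: a quick-and-dirty way to get one is to build on the mmfind recipe Luis posted earlier in this thread, after building the sample tools as described there (paths here are examples, and "column 7 is the file size" assumes the mmfind -ls output format shown in that post):

cd /usr/lpp/mmfs/samples/ilm
./mmfind /gpfs/home -type f -ls > /tmp/home_ls.out
# bucket sizes (column 7) into powers of two and count files per bucket
awk '{ s=$7+0; b=4096; while (s>b) b*=2; h[b]++ } END { for (b in h) printf "%15d %d\n", b, h[b] }' /tmp/home_ls.out | sort -n

The first column of the result is the bucket ceiling in bytes, the second the number of files at or below it (and above the previous bucket); comparing that against your planned inode size and subblock size shows quickly how much data would live in inodes versus partially-filled subblocks.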
yuri > On Sep 26, 2016, at 2:29 PM, Yuri L Volobuev wrote: > > I would put the net summary this way: in GPFS, the "Goldilocks zone" > for metadata block size is 256K - 1M. If one plans to create a new > file system using GPFS V4.2+, 1M is a sound choice. > > In an ideal world, block size choice shouldn't really be a choice. > It's a low-level implementation detail that one day should go the > way of the manual ignition timing adjustment -- something that used > to be necessary in the olden days, and something that select > enthusiasts like to tweak to this day, but something that's > irrelevant for the overwhelming majority of the folks who just want > the engine to run. There's work being done in that general direction > in GPFS, but we aren't there yet. > > yuri > > Stephen Ulmer ---09/26/2016 12:02:25 PM---Now I?ve got > anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: > > From: Stephen Ulmer > To: gpfsug main discussion list , > Date: 09/26/2016 12:02 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Now I?ve got anther question? which I?ll let bake for a while. > > Okay, to (poorly) summarize: > There are items OTHER THAN INODES stored as metadata in GPFS. > These items have a VARIETY OF SIZES, but are packed in such a way > that we should just not worry about wasted space unless we pick a > LARGE metadata block size ? or if we don?t pick a ?reasonable? > metadata block size after picking a ?large? file system block size > that applies to both. > Performance is hard, and the gain from calculating exactly the best > metadata block size is much smaller than performance gains attained > through code optimization. > If we were to try and calculate the appropriate metadata block size > we would likely be wrong anyway, since none of us get our data at > the idealized physics shop that sells massless rulers and > frictionless pulleys. > We should probably all use a metadata block size around 1MB. Nobody > has said this outright, but it?s been the example as the ?good? size > at least three times in this thread. > Under no circumstances should we do what many of us would have done > and pick 128K, which made sense based on all of our previous > education that is no longer applicable. > > Did I miss anything? :) > > Liberty, > > -- > Stephen > > On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev wrote: > It's important to understand the differences between different > metadata types, in particular where it comes to space allocation. > > System metadata files (inode file, inode and block allocation maps, > ACL file, fileset metadata file, EA file in older versions) are > allocated at well-defined moments (file system format, new storage > pool creation in the case of block allocation map, etc), and those > contain multiple records packed into a single block. From the block > allocator point of view, the individual metadata record size is > invisible, only larger blocks get actually allocated, and space > usage efficiency generally isn't an issue. > > For user metadata (indirect blocks, directory blocks, EA overflow > blocks) the situation is different. Those get allocated as the need > arises, generally one at a time. So the size of an individual > metadata structure matters, a lot. The smallest unit of allocation > in GPFS is a subblock (1/32nd of a block). If an IB or a directory > block is smaller than a subblock, the unused space in the subblock > is wasted. 
So if one chooses to use, say, 16 MiB block size for > metadata, the smallest unit of space that can be allocated is 512 > KiB. If one chooses 1 MiB block size, the smallest allocation unit > is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with > any reasonable data block size); directory blocks used to be limited > to 32 KiB, but in the current code can be as large as 256 KiB. As > one can observe, using 16 MiB metadata block size would lead to a > considerable amount of wasted space for IBs and large directories > (small directories can live in inodes). On the other hand, with 1 > MiB block size, there'll be no wasted metadata space. Does any of > this actually make a practical difference? That depends on the file > system composition, namely the number of IBs (which is a function of > the number of large files) and larger directories. Calculating this > scientifically can be pretty involved, and really should be the job > of a tool that ought to exist, but doesn't (yet). A more practical > approach is doing a ballpark estimate using local file counts and > typical fractions of large files and directories, using statistics > available from published papers. > > The performance implications of a given metadata block size choice > is a subject of nearly infinite depth, and this question ultimately > can only be answered by doing experiments with a specific workload > on specific hardware. The metadata space utilization efficiency is > something that can be answered conclusively though. > > yuri > > "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi > Sven, I am confused by your statement that the metadata block size > should be 1 MB and am very int > > From: "Buterbaugh, Kevin L" > To: gpfsug main discussion list , > Date: 09/24/2016 07:19 AM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Hi Sven, > > I am confused by your statement that the metadata block size should > be 1 MB and am very interested in learning the rationale behind this > as I am currently looking at all aspects of our current GPFS > configuration and the possibility of making major changes. > > If you have a filesystem with only metadataOnly disks in the system > pool and the default size of an inode is 4K (which we would do, > since we have recently discovered that even on our scratch > filesystem we have a bazillion files that are 4K or smaller and > could therefore have their data stored in the inode, right?), then > why would you set the metadata block size to anything larger than > 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block > size for metadata wouldn?t you be wasting a massive amount of space? > > What am I missing / confused about there? > > Oh, and here?s a related question ? let?s just say I have the above > configuration ? my system pool is metadata only and is on SSD?s. > Then I have two other dataOnly pools that are spinning disk. One is > for ?regular? access and the other is the ?capacity? pool ? i.e. a > pool of slower storage where we move files with large access times. > I have a policy that says something like ?move all files with an > access time > 6 months to the capacity pool.? Of those bazillion > files less than 4K in size that are fitting in the inode currently, > probably half a bazillion () of them would be subject to that > rule. Will they get moved to the spinning disk capacity pool or will > they stay in the inode?? > > Thanks! This is a very timely and interesting discussion for me as well... 
> > Kevin > On Sep 23, 2016, at 4:35 PM, Sven Oehme wrote: > your metadata block size these days should be 1 MB and there are > only very few workloads for which you should run with a filesystem > blocksize below 1 MB. so if you don't know exactly what to pick, 1 > MB is a good starting point. > the general rule still applies that your filesystem blocksize > (metadata or data pool) should match your raid controller (or GNR > vdisk) stripe size of the particular pool. > > so if you use a 128k strip size(defaut in many midrange storage > controllers) in a 8+2p raid array, your stripe or track size is 1 MB > and therefore the blocksize of this pool should be 1 MB. i see many > customers in the field using 1MB or even smaller blocksize on RAID > stripes of 2 MB or above and your performance will be significant > impacted by that. > > Sven > > ------------------------------------------ > Sven Oehme > Scalable Storage Research > email: oehmes at us.ibm.com > Phone: +1 (408) 824-8904 > IBM Almaden Research Lab > ------------------------------------------ > > Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too > pedantic, but I believe the the subblock size is 1/32 of the block > size (which strengt > > From: Stephen Ulmer > To: gpfsug main discussion list > Date: 09/23/2016 12:16 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Not to be too pedantic, but I believe the the subblock size is 1/32 > of the block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but > about metadata block size. You can specify that the system pool have > a different block size from the rest of the filesystem, providing > that it ONLY holds metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems > without some counter-indication), I would think that we?d want to > use a metadata block size of 4K*32=128K. This is independent of the > regular block size, which you can calculate based on the workload if > you?re lucky. > > There could be a great reason NOT to use 128K metadata block size, > but I don?t know what it is. I?d be happy to be corrected about this > if it?s out of whack. > > -- > Stephen > On Sep 22, 2016, at 3:37 PM, Luis Bolinches wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small > files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest > that you will actually benefit from smaller than 1MB block, leave > there. GPFS uses sublocks where 1/16th of the BS can be allocated to > different files, so the "waste" is much less than you think on 1MB > and you get the throughput and less structures of much more data blocks. > > No warranty at all but I try to do this when the BS talk comes in: > (might need some clean up it could not be last note but you get the idea) > > POSIX > find . 
-type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to > inodes (WELL not exactly ... extended attributes! so maybe use a > lower number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and > lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful > machine to run the calculations on octave, I never hit anything > could not manage on a 64GB RAM Power box. Most of the times it is > enough with my laptop. > > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very small > > files in the inode. > > > > Typically you want your average file size to be your blocksize and your > > filesystem has one blocksize and one inodesize. > > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. > > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From chekh at stanford.edu Tue Sep 27 19:51:50 2016 From: chekh at stanford.edu (Alex Chekholko) Date: Tue, 27 Sep 2016 11:51:50 -0700 Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: References: Message-ID: <8c7fd395-1efc-7197-4a98-763ba784cafd@stanford.edu> On 09/27/2016 11:04 AM, Andreas Landh?u?er wrote: > if a file gets hot again, there is no rule for putting the file back > into a faster storage device? The file will get moved when you run the policy again. You can run the policy as often as you like. There is also a way to use a GPFS hook to trigger policy run. Check 'mmaddcallback' But I think you have to be careful and think through the complexity. e.g. load spikes and pool fills up and your callback kicks in and starts a migration which increases the I/O load further, etc... Regards, -- Alex Chekholko chekh at stanford.edu From makaplan at us.ibm.com Tue Sep 27 20:27:47 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Tue, 27 Sep 2016 15:27:47 -0400 Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: References: Message-ID: Read about GROUP POOL - you can call as often as you like to "repack" the files into several pools from hot to cold. Of course, there is a cost to running mmapplypolicy... So maybe you'd just run it once every day or so... -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhejtman at ics.muni.cz Tue Sep 27 20:38:16 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Tue, 27 Sep 2016 21:38:16 +0200 Subject: [gpfsug-discuss] Samba via CES Message-ID: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> Hello, does CES offer high availability of SMB? I.e., does used samba server provide cluster wide persistent handles? Or failover from node to node is currently not supported without the client disruption? -- Luk?? 
Hejtm?nek From erich at uw.edu Tue Sep 27 21:56:20 2016 From: erich at uw.edu (Eric Horst) Date: Tue, 27 Sep 2016 13:56:20 -0700 Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: <8c7fd395-1efc-7197-4a98-763ba784cafd@stanford.edu> References: <8c7fd395-1efc-7197-4a98-763ba784cafd@stanford.edu> Message-ID: >> >> if a file gets hot again, there is no rule for putting the file back >> into a faster storage device? > > > The file will get moved when you run the policy again. You can run the > policy as often as you like. I think its worth stating clearly that if a file is in the Thrifty slow pool and a user opens and reads/writes the file there is nothing that moves this file to a different tier. A policy run is the only action that relocates files. So if you apply the policy daily and over the course of the day users access many cold files, the performance accessing those cold files may not be ideal until the next day when they are repacked by heat. A file is not automatically moved to the fast tier on access read or write. I mention this because this aspect of tiering was not immediately clear from the docs when I was a neophyte GPFS admin and I had to learn by observation. It is easy for one to make an assumption that it is a more dynamic tiering system than it is. -Eric -- Eric Horst University of Washington From Kevin.Buterbaugh at Vanderbilt.Edu Tue Sep 27 22:21:23 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Tue, 27 Sep 2016 21:21:23 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> Message-ID: <4BDC0E5A-176A-48CC-8DE6-93C7B5A3F138@vanderbilt.edu> Hi All, Again, my thanks to all who responded to my last post. Let me begin by stating something I unintentionally omitted in my last post ? we already use SSDs for our metadata. Which leads me to yet another question ? of my three filesystems, two (/home and /scratch) are much older (created in 2010) and therefore currently have a 512 byte inode size. /data is newer and has a 4K inode size. Now if I combine /scratch and /data into one filesystem with a 4K inode size, the amount of space used by all the inodes coming over from /scratch is going to grow by a factor of eight unless I?m horribly confused. And I would assume I need to count the amount of space taken up by allocated inodes, not just used inodes. Therefore ? how much space my metadata takes up just grew significantly in importance since: 1) metadata is on very expensive enterprise class, vendor certified SSDs, 2) we use RAID 1 mirrors of those SSDs, and 3) we have metadata replication set to two. Some of the information presented by Sven and Yuri seems to contradict each other in regards to how much space inodes take up ? or I?m misunderstanding one or both of them! Leaving aside replication, if I use a 256K block size for my metadata and I use 4K inodes, are those inodes going to take up 4K each or are they going to take up 8K each (1/32nd of a 256K block)? By the way, I do have a file size / file age spreadsheet for each of my filesystems (which I would be willing to share with interested parties) and while I was not surprised to learn that I have over 10 million sub-1K files on /home, I was a bit surprised to find that I have almost as many sub-1K files on /scratch (and a few million more on /data). 
So there?s a huge potential win in having those files in the inode on SSD as opposed to on spinning disk, but there?s also a huge potential $$$ cost. Thanks again ? I hope others are gaining useful information from this thread. I sure am! Kevin On Sep 27, 2016, at 1:26 PM, Yuri L Volobuev > wrote: > 1) Let?s assume that our overarching goal in configuring the block > size for metadata is performance from the user perspective ? i.e. > how fast is an ?ls -l? on my directory? Space savings aren?t > important, and how long policy scans or other ?administrative? type > tasks take is not nearly as important as that directory listing. > Does that change the recommended metadata block size? The performance challenges for the "ls -l" scenario are quite different from the policy scan scenario, so the same rules do not necessarily apply. During "ls -l" the code has to read inodes one by one (there's some prefetching going on, to take the edge off for the actual 'ls' thread, but prefetching is still one inode at a time). Metadata block size doesn't really come into the picture in this case, but inode size can be important -- depending on the storage performance characteristics. Does the storage you use for metadata exhibit a meaningfully different latency for 4K random reads vs 512 byte random reads? In my personal experience, on any modern storage device the difference is non-existent; in fact many devices (like all flash-based storage) use 4K native physical block size, and merely emulate 512 byte "sectors", so there's no way to read less than 4K. So from the inode read latency point of view 4K vs 512B is most likely a wash, but then 4K inodes can help improve performance of other operations, e.g. readdir of a small directory which fits entirely into the inode. If you use xattrs (e.g. as a side effect of using HSM), 4K inodes definitely help, but allowing xattrs to be stored in the inode. Policy scans reads inodes in full blocks, and there both metadata block size and inode size matter. Larger blocks could improve the inode read performance, while larger inodes mean that the same number of blocks hold fewer inodes and thus more blocks need to be read. So the policy inode scan performance is benefited by larger metadata block size and smaller inodes. However, policy scans also have to perform a directory traversal step, and that step tends to dominate the runtime of the overall run, and using larger inodes actually helps to speed up traversal of smaller directories. So whether larger inodes help or hurt the policy scan performance depends, yet again, on your file system composition. Overall, I believe that with all angles considered, larger inodes help with performance, and that was one of the considerations for making 4K inodes the default in V4.2+ versions. > 2) Let?s assume we have 3 filesystems, /home, /scratch (traditional > HPC use for those two) and /data (project space). Our storage > arrays are 24-bay units with two 8+2P RAID 6 LUNs, one RAID 1 > mirror, and two hot spare drives. The RAID 1 mirrors are for /home, > the RAID 6 LUNs are for /scratch or /data. /home has tons of small > files - so small that a 64K block size is currently used. /scratch > and /data have a mixture, but a 1 MB block size is the ?sweet spot? there. > > If you could ?start all over? 
with the same hardware being the only > restriction, would you: > > a) merge /scratch and /data into one filesystem but keep /home > separate since the LUN sizes are so very different, or > b) merge all three into one filesystem and use storage pools so that > /home is just a separate pool within the one filesystem? And if you > chose this option would you assign different block sizes to the pools? It's not possible to have different block sizes for different data pools. We are very aware that many people would like to be able to do just that, but this is counter to where the product is going. Supporting different block sizes for different pools is actually pretty hard: it's tricky to describe a large file that has some blocks in poolA and some in poolB where poolB has a different block size (perhaps during a migration) with the existing inode/indirect block format where each disk address pointer addresses a block of fixed size. With some effort, and some changes to how block addressing works, it would be possible to implement the support for this. However, as I mentioned in another post in this thread, we don't really want to glorify manual block size selection any further, we want to move away from it, by addressing the reasons that drive different block size selection today (like disk space utilization and performance). I'd recommend calculating a file size distribution histogram for your file systems. You may, for example, discover that 80% of the small files you have in /home would fit into 4K inodes, and then the storage space efficiency gains for the remaining 20% don't justify the complexity of managing an extra file system with a small block size. We don't recommend using block sizes smaller than 256K, because smaller block size is not good for disk space allocation code efficiency. It's a quadratic dependency: with a smaller block size, one block worth of the block allocation map covers that much less disk space, because each bit in the map covers fewer disk sectors, and fewer bits fit into a block. This means having to create a lot more block allocation map segments than what is needed for an ample level of parallelism. This hurts performance of many block allocation-related operations. I don't see a reason for /scratch and /data to be separate file systems, aside from perhaps failure domain considerations. yuri > On Sep 26, 2016, at 2:29 PM, Yuri L Volobuev > wrote: > > I would put the net summary this way: in GPFS, the "Goldilocks zone" > for metadata block size is 256K - 1M. If one plans to create a new > file system using GPFS V4.2+, 1M is a sound choice. > > In an ideal world, block size choice shouldn't really be a choice. > It's a low-level implementation detail that one day should go the > way of the manual ignition timing adjustment -- something that used > to be necessary in the olden days, and something that select > enthusiasts like to tweak to this day, but something that's > irrelevant for the overwhelming majority of the folks who just want > the engine to run. There's work being done in that general direction > in GPFS, but we aren't there yet. > > yuri > > Stephen Ulmer ---09/26/2016 12:02:25 PM---Now I?ve got > anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: > > From: Stephen Ulmer > > To: gpfsug main discussion list >, > Date: 09/26/2016 12:02 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Now I?ve got anther question? which I?ll let bake for a while. 
> > Okay, to (poorly) summarize: > There are items OTHER THAN INODES stored as metadata in GPFS. > These items have a VARIETY OF SIZES, but are packed in such a way > that we should just not worry about wasted space unless we pick a > LARGE metadata block size ? or if we don?t pick a ?reasonable? > metadata block size after picking a ?large? file system block size > that applies to both. > Performance is hard, and the gain from calculating exactly the best > metadata block size is much smaller than performance gains attained > through code optimization. > If we were to try and calculate the appropriate metadata block size > we would likely be wrong anyway, since none of us get our data at > the idealized physics shop that sells massless rulers and > frictionless pulleys. > We should probably all use a metadata block size around 1MB. Nobody > has said this outright, but it?s been the example as the ?good? size > at least three times in this thread. > Under no circumstances should we do what many of us would have done > and pick 128K, which made sense based on all of our previous > education that is no longer applicable. > > Did I miss anything? :) > > Liberty, > > -- > Stephen > > On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev > wrote: > It's important to understand the differences between different > metadata types, in particular where it comes to space allocation. > > System metadata files (inode file, inode and block allocation maps, > ACL file, fileset metadata file, EA file in older versions) are > allocated at well-defined moments (file system format, new storage > pool creation in the case of block allocation map, etc), and those > contain multiple records packed into a single block. From the block > allocator point of view, the individual metadata record size is > invisible, only larger blocks get actually allocated, and space > usage efficiency generally isn't an issue. > > For user metadata (indirect blocks, directory blocks, EA overflow > blocks) the situation is different. Those get allocated as the need > arises, generally one at a time. So the size of an individual > metadata structure matters, a lot. The smallest unit of allocation > in GPFS is a subblock (1/32nd of a block). If an IB or a directory > block is smaller than a subblock, the unused space in the subblock > is wasted. So if one chooses to use, say, 16 MiB block size for > metadata, the smallest unit of space that can be allocated is 512 > KiB. If one chooses 1 MiB block size, the smallest allocation unit > is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with > any reasonable data block size); directory blocks used to be limited > to 32 KiB, but in the current code can be as large as 256 KiB. As > one can observe, using 16 MiB metadata block size would lead to a > considerable amount of wasted space for IBs and large directories > (small directories can live in inodes). On the other hand, with 1 > MiB block size, there'll be no wasted metadata space. Does any of > this actually make a practical difference? That depends on the file > system composition, namely the number of IBs (which is a function of > the number of large files) and larger directories. Calculating this > scientifically can be pretty involved, and really should be the job > of a tool that ought to exist, but doesn't (yet). A more practical > approach is doing a ballpark estimate using local file counts and > typical fractions of large files and directories, using statistics > available from published papers. 
> > The performance implications of a given metadata block size choice > is a subject of nearly infinite depth, and this question ultimately > can only be answered by doing experiments with a specific workload > on specific hardware. The metadata space utilization efficiency is > something that can be answered conclusively though. > > yuri > > "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi > Sven, I am confused by your statement that the metadata block size > should be 1 MB and am very int > > From: "Buterbaugh, Kevin L" > > To: gpfsug main discussion list >, > Date: 09/24/2016 07:19 AM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Hi Sven, > > I am confused by your statement that the metadata block size should > be 1 MB and am very interested in learning the rationale behind this > as I am currently looking at all aspects of our current GPFS > configuration and the possibility of making major changes. > > If you have a filesystem with only metadataOnly disks in the system > pool and the default size of an inode is 4K (which we would do, > since we have recently discovered that even on our scratch > filesystem we have a bazillion files that are 4K or smaller and > could therefore have their data stored in the inode, right?), then > why would you set the metadata block size to anything larger than > 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block > size for metadata wouldn?t you be wasting a massive amount of space? > > What am I missing / confused about there? > > Oh, and here?s a related question ? let?s just say I have the above > configuration ? my system pool is metadata only and is on SSD?s. > Then I have two other dataOnly pools that are spinning disk. One is > for ?regular? access and the other is the ?capacity? pool ? i.e. a > pool of slower storage where we move files with large access times. > I have a policy that says something like ?move all files with an > access time > 6 months to the capacity pool.? Of those bazillion > files less than 4K in size that are fitting in the inode currently, > probably half a bazillion () of them would be subject to that > rule. Will they get moved to the spinning disk capacity pool or will > they stay in the inode?? > > Thanks! This is a very timely and interesting discussion for me as well... > > Kevin > On Sep 23, 2016, at 4:35 PM, Sven Oehme > wrote: > your metadata block size these days should be 1 MB and there are > only very few workloads for which you should run with a filesystem > blocksize below 1 MB. so if you don't know exactly what to pick, 1 > MB is a good starting point. > the general rule still applies that your filesystem blocksize > (metadata or data pool) should match your raid controller (or GNR > vdisk) stripe size of the particular pool. > > so if you use a 128k strip size(defaut in many midrange storage > controllers) in a 8+2p raid array, your stripe or track size is 1 MB > and therefore the blocksize of this pool should be 1 MB. i see many > customers in the field using 1MB or even smaller blocksize on RAID > stripes of 2 MB or above and your performance will be significant > impacted by that. 
> > Sven > > ------------------------------------------ > Sven Oehme > Scalable Storage Research > email: oehmes at us.ibm.com > Phone: +1 (408) 824-8904 > IBM Almaden Research Lab > ------------------------------------------ > > Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too > pedantic, but I believe the the subblock size is 1/32 of the block > size (which strengt > > From: Stephen Ulmer > > To: gpfsug main discussion list > > Date: 09/23/2016 12:16 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Not to be too pedantic, but I believe the the subblock size is 1/32 > of the block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but > about metadata block size. You can specify that the system pool have > a different block size from the rest of the filesystem, providing > that it ONLY holds metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems > without some counter-indication), I would think that we?d want to > use a metadata block size of 4K*32=128K. This is independent of the > regular block size, which you can calculate based on the workload if > you?re lucky. > > There could be a great reason NOT to use 128K metadata block size, > but I don?t know what it is. I?d be happy to be corrected about this > if it?s out of whack. > > -- > Stephen > On Sep 22, 2016, at 3:37 PM, Luis Bolinches > wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small > files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest > that you will actually benefit from smaller than 1MB block, leave > there. GPFS uses sublocks where 1/16th of the BS can be allocated to > different files, so the "waste" is much less than you think on 1MB > and you get the throughput and less structures of much more data blocks. > > No warranty at all but I try to do this when the BS talk comes in: > (might need some clean up it could not be last note but you get the idea) > > POSIX > find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to > inodes (WELL not exactly ... extended attributes! so maybe use a > lower number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and > lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful > machine to run the calculations on octave, I never hit anything > could not manage on a 64GB RAM Power box. Most of the times it is > enough with my laptop. 
> > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene > > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very small > > files in the inode. > > > > Typically you want your average file size to be your blocksize and your > > filesystem has one blocksize and one inodesize. > > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. > > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From christof.schmitt at us.ibm.com Tue Sep 27 22:36:37 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Tue, 27 Sep 2016 14:36:37 -0700 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> References: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> Message-ID: When a CES node fails, protocol clients have to reconnect to one of the remaining nodes. Samba in CES does not support persistent handles. This is indicated in the documentation: http://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_smbexportlimits.htm#bl1adm_smbexportlimits "Only mandatory SMB3 protocol features are supported. " Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Lukas Hejtmanek To: gpfsug-discuss at spectrumscale.org Date: 09/27/2016 12:38 PM Subject: [gpfsug-discuss] Samba via CES Sent by: gpfsug-discuss-bounces at spectrumscale.org Hello, does CES offer high availability of SMB? I.e., does used samba server provide cluster wide persistent handles? Or failover from node to node is currently not supported without the client disruption? -- Luk?? Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From xhejtman at ics.muni.cz Tue Sep 27 22:42:57 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Tue, 27 Sep 2016 23:42:57 +0200 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: References: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> Message-ID: <20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> On Tue, Sep 27, 2016 at 02:36:37PM -0700, Christof Schmitt wrote: > When a CES node fails, protocol clients have to reconnect to one of the > remaining nodes. > > Samba in CES does not support persistent handles. This is indicated in the > documentation: > > http://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_smbexportlimits.htm#bl1adm_smbexportlimits > > "Only mandatory SMB3 protocol features are supported. " well, but in this case, HA feature is a bit pointless as node fail results in a client failure as well as reconnect does not seem to be automatic if there is on going traffic.. more precisely reconnect is automatic but without persistent handles, the client receives write protect error immediately. -- Luk?? Hejtm?nek From Greg.Lehmann at csiro.au Wed Sep 28 08:40:35 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Wed, 28 Sep 2016 07:40:35 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <4BDC0E5A-176A-48CC-8DE6-93C7B5A3F138@vanderbilt.edu> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> <4BDC0E5A-176A-48CC-8DE6-93C7B5A3F138@vanderbilt.edu> Message-ID: <428599f3d6cb47ebb74d05178eeba2b8@exch1-cdc.nexus.csiro.au> I am wondering what people use to produce a file size distribution report for their filesystems. Has everyone rolled their own or is there some goto app to use. 
Cheers, Greg From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: Wednesday, 28 September 2016 7:21 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Blocksize Hi All, Again, my thanks to all who responded to my last post. Let me begin by stating something I unintentionally omitted in my last post ? we already use SSDs for our metadata. Which leads me to yet another question ? of my three filesystems, two (/home and /scratch) are much older (created in 2010) and therefore currently have a 512 byte inode size. /data is newer and has a 4K inode size. Now if I combine /scratch and /data into one filesystem with a 4K inode size, the amount of space used by all the inodes coming over from /scratch is going to grow by a factor of eight unless I?m horribly confused. And I would assume I need to count the amount of space taken up by allocated inodes, not just used inodes. Therefore ? how much space my metadata takes up just grew significantly in importance since: 1) metadata is on very expensive enterprise class, vendor certified SSDs, 2) we use RAID 1 mirrors of those SSDs, and 3) we have metadata replication set to two. Some of the information presented by Sven and Yuri seems to contradict each other in regards to how much space inodes take up ? or I?m misunderstanding one or both of them! Leaving aside replication, if I use a 256K block size for my metadata and I use 4K inodes, are those inodes going to take up 4K each or are they going to take up 8K each (1/32nd of a 256K block)? By the way, I do have a file size / file age spreadsheet for each of my filesystems (which I would be willing to share with interested parties) and while I was not surprised to learn that I have over 10 million sub-1K files on /home, I was a bit surprised to find that I have almost as many sub-1K files on /scratch (and a few million more on /data). So there?s a huge potential win in having those files in the inode on SSD as opposed to on spinning disk, but there?s also a huge potential $$$ cost. Thanks again ? I hope others are gaining useful information from this thread. I sure am! Kevin On Sep 27, 2016, at 1:26 PM, Yuri L Volobuev > wrote: > 1) Let?s assume that our overarching goal in configuring the block > size for metadata is performance from the user perspective ? i.e. > how fast is an ?ls -l? on my directory? Space savings aren?t > important, and how long policy scans or other ?administrative? type > tasks take is not nearly as important as that directory listing. > Does that change the recommended metadata block size? The performance challenges for the "ls -l" scenario are quite different from the policy scan scenario, so the same rules do not necessarily apply. During "ls -l" the code has to read inodes one by one (there's some prefetching going on, to take the edge off for the actual 'ls' thread, but prefetching is still one inode at a time). Metadata block size doesn't really come into the picture in this case, but inode size can be important -- depending on the storage performance characteristics. Does the storage you use for metadata exhibit a meaningfully different latency for 4K random reads vs 512 byte random reads? In my personal experience, on any modern storage device the difference is non-existent; in fact many devices (like all flash-based storage) use 4K native physical block size, and merely emulate 512 byte "sectors", so there's no way to read less than 4K. 
So from the inode read latency point of view 4K vs 512B is most likely a wash, but then 4K inodes can help improve performance of other operations, e.g. readdir of a small directory which fits entirely into the inode. If you use xattrs (e.g. as a side effect of using HSM), 4K inodes definitely help, but allowing xattrs to be stored in the inode. Policy scans reads inodes in full blocks, and there both metadata block size and inode size matter. Larger blocks could improve the inode read performance, while larger inodes mean that the same number of blocks hold fewer inodes and thus more blocks need to be read. So the policy inode scan performance is benefited by larger metadata block size and smaller inodes. However, policy scans also have to perform a directory traversal step, and that step tends to dominate the runtime of the overall run, and using larger inodes actually helps to speed up traversal of smaller directories. So whether larger inodes help or hurt the policy scan performance depends, yet again, on your file system composition. Overall, I believe that with all angles considered, larger inodes help with performance, and that was one of the considerations for making 4K inodes the default in V4.2+ versions. > 2) Let?s assume we have 3 filesystems, /home, /scratch (traditional > HPC use for those two) and /data (project space). Our storage > arrays are 24-bay units with two 8+2P RAID 6 LUNs, one RAID 1 > mirror, and two hot spare drives. The RAID 1 mirrors are for /home, > the RAID 6 LUNs are for /scratch or /data. /home has tons of small > files - so small that a 64K block size is currently used. /scratch > and /data have a mixture, but a 1 MB block size is the ?sweet spot? there. > > If you could ?start all over? with the same hardware being the only > restriction, would you: > > a) merge /scratch and /data into one filesystem but keep /home > separate since the LUN sizes are so very different, or > b) merge all three into one filesystem and use storage pools so that > /home is just a separate pool within the one filesystem? And if you > chose this option would you assign different block sizes to the pools? It's not possible to have different block sizes for different data pools. We are very aware that many people would like to be able to do just that, but this is counter to where the product is going. Supporting different block sizes for different pools is actually pretty hard: it's tricky to describe a large file that has some blocks in poolA and some in poolB where poolB has a different block size (perhaps during a migration) with the existing inode/indirect block format where each disk address pointer addresses a block of fixed size. With some effort, and some changes to how block addressing works, it would be possible to implement the support for this. However, as I mentioned in another post in this thread, we don't really want to glorify manual block size selection any further, we want to move away from it, by addressing the reasons that drive different block size selection today (like disk space utilization and performance). I'd recommend calculating a file size distribution histogram for your file systems. You may, for example, discover that 80% of the small files you have in /home would fit into 4K inodes, and then the storage space efficiency gains for the remaining 20% don't justify the complexity of managing an extra file system with a small block size. 
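A rough way to act on that suggestion with data most sites already have: count how many files would fit in a 4K inode (roughly 3.5 KiB of data space once the inode header and any extended attributes are accounted for, per the earlier posts), and estimate what the inode file itself costs with two metadata replicas. A sketch only; the size list is the find output from earlier in the thread, column 5 is the ls -l size, and the allocated-inode count is a placeholder to replace with the figure from the Inode Information section of mmdf:

# 1) Fraction of files small enough to live entirely in a 4K inode.
awk '{ n++; if ($5 <= 3584) small++ }
     END { printf "%d of %d files (%.1f%%) would fit in a 4K inode\n", small, n, 100 * small / n }' find_ls_files.out

# 2) Raw space the allocated inodes occupy at 512-byte vs 4K inodes,
#    with metadata replication of 2 (RAID mirroring sits on top of this).
allocated_inodes=100000000    # placeholder: use the real allocated-inode count
for inode_bytes in 512 4096; do
    echo "inode size ${inode_bytes}: $(( allocated_inodes * inode_bytes * 2 / 1024 / 1024 / 1024 )) GiB"
done

With those placeholder numbers the jump from 512-byte to 4K inodes is roughly 95 GiB to 762 GiB of replicated inode space, before any RAID overhead: the factor of eight Kevin mentions.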
We don't recommend using block sizes smaller than 256K, because smaller block size is not good for disk space allocation code efficiency. It's a quadratic dependency: with a smaller block size, one block worth of the block allocation map covers that much less disk space, because each bit in the map covers fewer disk sectors, and fewer bits fit into a block. This means having to create a lot more block allocation map segments than what is needed for an ample level of parallelism. This hurts performance of many block allocation-related operations. I don't see a reason for /scratch and /data to be separate file systems, aside from perhaps failure domain considerations. yuri > On Sep 26, 2016, at 2:29 PM, Yuri L Volobuev > wrote: > > I would put the net summary this way: in GPFS, the "Goldilocks zone" > for metadata block size is 256K - 1M. If one plans to create a new > file system using GPFS V4.2+, 1M is a sound choice. > > In an ideal world, block size choice shouldn't really be a choice. > It's a low-level implementation detail that one day should go the > way of the manual ignition timing adjustment -- something that used > to be necessary in the olden days, and something that select > enthusiasts like to tweak to this day, but something that's > irrelevant for the overwhelming majority of the folks who just want > the engine to run. There's work being done in that general direction > in GPFS, but we aren't there yet. > > yuri > > Stephen Ulmer ---09/26/2016 12:02:25 PM---Now I?ve got > anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: > > From: Stephen Ulmer > > To: gpfsug main discussion list >, > Date: 09/26/2016 12:02 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Now I?ve got anther question? which I?ll let bake for a while. > > Okay, to (poorly) summarize: > There are items OTHER THAN INODES stored as metadata in GPFS. > These items have a VARIETY OF SIZES, but are packed in such a way > that we should just not worry about wasted space unless we pick a > LARGE metadata block size ? or if we don?t pick a ?reasonable? > metadata block size after picking a ?large? file system block size > that applies to both. > Performance is hard, and the gain from calculating exactly the best > metadata block size is much smaller than performance gains attained > through code optimization. > If we were to try and calculate the appropriate metadata block size > we would likely be wrong anyway, since none of us get our data at > the idealized physics shop that sells massless rulers and > frictionless pulleys. > We should probably all use a metadata block size around 1MB. Nobody > has said this outright, but it?s been the example as the ?good? size > at least three times in this thread. > Under no circumstances should we do what many of us would have done > and pick 128K, which made sense based on all of our previous > education that is no longer applicable. > > Did I miss anything? :) > > Liberty, > > -- > Stephen > > On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev > wrote: > It's important to understand the differences between different > metadata types, in particular where it comes to space allocation. 
> > System metadata files (inode file, inode and block allocation maps, > ACL file, fileset metadata file, EA file in older versions) are > allocated at well-defined moments (file system format, new storage > pool creation in the case of block allocation map, etc), and those > contain multiple records packed into a single block. From the block > allocator point of view, the individual metadata record size is > invisible, only larger blocks get actually allocated, and space > usage efficiency generally isn't an issue. > > For user metadata (indirect blocks, directory blocks, EA overflow > blocks) the situation is different. Those get allocated as the need > arises, generally one at a time. So the size of an individual > metadata structure matters, a lot. The smallest unit of allocation > in GPFS is a subblock (1/32nd of a block). If an IB or a directory > block is smaller than a subblock, the unused space in the subblock > is wasted. So if one chooses to use, say, 16 MiB block size for > metadata, the smallest unit of space that can be allocated is 512 > KiB. If one chooses 1 MiB block size, the smallest allocation unit > is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with > any reasonable data block size); directory blocks used to be limited > to 32 KiB, but in the current code can be as large as 256 KiB. As > one can observe, using 16 MiB metadata block size would lead to a > considerable amount of wasted space for IBs and large directories > (small directories can live in inodes). On the other hand, with 1 > MiB block size, there'll be no wasted metadata space. Does any of > this actually make a practical difference? That depends on the file > system composition, namely the number of IBs (which is a function of > the number of large files) and larger directories. Calculating this > scientifically can be pretty involved, and really should be the job > of a tool that ought to exist, but doesn't (yet). A more practical > approach is doing a ballpark estimate using local file counts and > typical fractions of large files and directories, using statistics > available from published papers. > > The performance implications of a given metadata block size choice > is a subject of nearly infinite depth, and this question ultimately > can only be answered by doing experiments with a specific workload > on specific hardware. The metadata space utilization efficiency is > something that can be answered conclusively though. > > yuri > > "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi > Sven, I am confused by your statement that the metadata block size > should be 1 MB and am very int > > From: "Buterbaugh, Kevin L" > > To: gpfsug main discussion list >, > Date: 09/24/2016 07:19 AM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Hi Sven, > > I am confused by your statement that the metadata block size should > be 1 MB and am very interested in learning the rationale behind this > as I am currently looking at all aspects of our current GPFS > configuration and the possibility of making major changes. 
> > If you have a filesystem with only metadataOnly disks in the system > pool and the default size of an inode is 4K (which we would do, > since we have recently discovered that even on our scratch > filesystem we have a bazillion files that are 4K or smaller and > could therefore have their data stored in the inode, right?), then > why would you set the metadata block size to anything larger than > 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block > size for metadata wouldn?t you be wasting a massive amount of space? > > What am I missing / confused about there? > > Oh, and here?s a related question ? let?s just say I have the above > configuration ? my system pool is metadata only and is on SSD?s. > Then I have two other dataOnly pools that are spinning disk. One is > for ?regular? access and the other is the ?capacity? pool ? i.e. a > pool of slower storage where we move files with large access times. > I have a policy that says something like ?move all files with an > access time > 6 months to the capacity pool.? Of those bazillion > files less than 4K in size that are fitting in the inode currently, > probably half a bazillion () of them would be subject to that > rule. Will they get moved to the spinning disk capacity pool or will > they stay in the inode?? > > Thanks! This is a very timely and interesting discussion for me as well... > > Kevin > On Sep 23, 2016, at 4:35 PM, Sven Oehme > wrote: > your metadata block size these days should be 1 MB and there are > only very few workloads for which you should run with a filesystem > blocksize below 1 MB. so if you don't know exactly what to pick, 1 > MB is a good starting point. > the general rule still applies that your filesystem blocksize > (metadata or data pool) should match your raid controller (or GNR > vdisk) stripe size of the particular pool. > > so if you use a 128k strip size(defaut in many midrange storage > controllers) in a 8+2p raid array, your stripe or track size is 1 MB > and therefore the blocksize of this pool should be 1 MB. i see many > customers in the field using 1MB or even smaller blocksize on RAID > stripes of 2 MB or above and your performance will be significant > impacted by that. > > Sven > > ------------------------------------------ > Sven Oehme > Scalable Storage Research > email: oehmes at us.ibm.com > Phone: +1 (408) 824-8904 > IBM Almaden Research Lab > ------------------------------------------ > > Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too > pedantic, but I believe the the subblock size is 1/32 of the block > size (which strengt > > From: Stephen Ulmer > > To: gpfsug main discussion list > > Date: 09/23/2016 12:16 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Not to be too pedantic, but I believe the the subblock size is 1/32 > of the block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but > about metadata block size. You can specify that the system pool have > a different block size from the rest of the filesystem, providing > that it ONLY holds metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems > without some counter-indication), I would think that we?d want to > use a metadata block size of 4K*32=128K. This is independent of the > regular block size, which you can calculate based on the workload if > you?re lucky. 
> > There could be a great reason NOT to use 128K metadata block size, > but I don?t know what it is. I?d be happy to be corrected about this > if it?s out of whack. > > -- > Stephen > On Sep 22, 2016, at 3:37 PM, Luis Bolinches > wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small > files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest > that you will actually benefit from smaller than 1MB block, leave > there. GPFS uses sublocks where 1/16th of the BS can be allocated to > different files, so the "waste" is much less than you think on 1MB > and you get the throughput and less structures of much more data blocks. > > No warranty at all but I try to do this when the BS talk comes in: > (might need some clean up it could not be last note but you get the idea) > > POSIX > find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to > inodes (WELL not exactly ... extended attributes! so maybe use a > lower number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and > lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful > machine to run the calculations on octave, I never hit anything > could not manage on a 64GB RAM Power box. Most of the times it is > enough with my laptop. > > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene > > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very small > > files in the inode. > > > > Typically you want your average file size to be your blocksize and your > > filesystem has one blocksize and one inodesize. > > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. 
> > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From alandhae at gmx.de Wed Sep 28 10:13:55 2016 From: alandhae at gmx.de (=?ISO-8859-15?Q?Andreas_Landh=E4u=DFer?=) Date: Wed, 28 Sep 2016 11:13:55 +0200 (CEST) Subject: [gpfsug-discuss] Proposal for dynamic heat assisted file tiering Message-ID: On Tue, 27 Sep 2016, Eric Horst wrote: Thanks Eric for the hint, shouldn't we as the users define a requirement for such a dynamic heat assisted file tiering option (DHAFTO). Keeping track which files have increased heat and triggering a transparent move to a faster tier. Since I haven't tested it on a GPFS FS, I would like to know about the performance penalties being observed, when frequently running the policies, just a rough estimate. Of course its depending on the speed of the Metadata disks (yes, we use different devices for Metadata) we are also running GPFS on various GSS Systems. IBM might also want bundling this option together with GSS/ESS hardware for better performance. Just my 2? Andreas >>> >>> if a file gets hot again, there is no rule for putting the file back >>> into a faster storage device? >> >> >> The file will get moved when you run the policy again. You can run the >> policy as often as you like. > > I think its worth stating clearly that if a file is in the Thrifty > slow pool and a user opens and reads/writes the file there is nothing > that moves this file to a different tier. A policy run is the only > action that relocates files. 
So if you apply the policy daily and over > the course of the day users access many cold files, the performance > accessing those cold files may not be ideal until the next day when > they are repacked by heat. A file is not automatically moved to the > fast tier on access read or write. I mention this because this aspect > of tiering was not immediately clear from the docs when I was a > neophyte GPFS admin and I had to learn by observation. It is easy for > one to make an assumption that it is a more dynamic tiering system > than it is. -- Andreas Landh?u?er +49 151 12133027 (mobile) alandhae at gmx.de From Robert.Oesterlin at nuance.com Wed Sep 28 11:56:51 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Wed, 28 Sep 2016 10:56:51 +0000 Subject: [gpfsug-discuss] Blocksize - file size distribution Message-ID: /usr/lpp/mmfs/samples/debugtools/filehist Look at the README in that directory. Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: on behalf of "Greg.Lehmann at csiro.au" Reply-To: gpfsug main discussion list Date: Wednesday, September 28, 2016 at 2:40 AM To: "gpfsug-discuss at spectrumscale.org" Subject: [EXTERNAL] Re: [gpfsug-discuss] Blocksize I am wondering what people use to produce a file size distribution report for their filesystems. Has everyone rolled their own or is there some goto app to use. Cheers, Greg From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: Wednesday, 28 September 2016 7:21 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Blocksize Hi All, Again, my thanks to all who responded to my last post. Let me begin by stating something I unintentionally omitted in my last post ? we already use SSDs for our metadata. Which leads me to yet another question ? of my three filesystems, two (/home and /scratch) are much older (created in 2010) and therefore currently have a 512 byte inode size. /data is newer and has a 4K inode size. Now if I combine /scratch and /data into one filesystem with a 4K inode size, the amount of space used by all the inodes coming over from /scratch is going to grow by a factor of eight unless I?m horribly confused. And I would assume I need to count the amount of space taken up by allocated inodes, not just used inodes. Therefore ? how much space my metadata takes up just grew significantly in importance since: 1) metadata is on very expensive enterprise class, vendor certified SSDs, 2) we use RAID 1 mirrors of those SSDs, and 3) we have metadata replication set to two. Some of the information presented by Sven and Yuri seems to contradict each other in regards to how much space inodes take up ? or I?m misunderstanding one or both of them! Leaving aside replication, if I use a 256K block size for my metadata and I use 4K inodes, are those inodes going to take up 4K each or are they going to take up 8K each (1/32nd of a 256K block)? By the way, I do have a file size / file age spreadsheet for each of my filesystems (which I would be willing to share with interested parties) and while I was not surprised to learn that I have over 10 million sub-1K files on /home, I was a bit surprised to find that I have almost as many sub-1K files on /scratch (and a few million more on /data). So there?s a huge potential win in having those files in the inode on SSD as opposed to on spinning disk, but there?s also a huge potential $$$ cost. Thanks again ? I hope others are gaining useful information from this thread. I sure am! 
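On the heat-based repacking question Andreas raises earlier in this digest: the pieces that exist today are file heat tracking plus a scheduled policy run, as Eric describes. A minimal sketch of how that could be wired up, with heavy caveats: the pool names, thresholds, file system name and decay period are all placeholders, and this is untested illustration rather than a recommendation:

# Turn on file heat tracking (values are examples; heat decays 10% per 24h here).
mmchconfig fileHeatPeriodMinutes=1440,fileHeatLossPercent=10

# Policy that pulls the hottest files back up, hottest first, until the fast
# pool reaches about 90% occupancy.
cat > /tmp/repack-by-heat.pol <<'EOF'
RULE 'tier_up' MIGRATE FROM POOL 'capacity' TO POOL 'fast'
     WEIGHT(FILE_HEAT) LIMIT(90)
EOF

# Run it as often as the metadata scan cost allows, e.g. nightly from cron.
mmapplypolicy fsname -P /tmp/repack-by-heat.pol -I yes

How often this is affordable is exactly the performance question Andreas asks: the candidate-selection scan scales with file count and metadata disk speed, while the data movement on top of it depends on how much actually changes tier.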
Kevin On Sep 27, 2016, at 1:26 PM, Yuri L Volobuev > wrote: > 1) Let?s assume that our overarching goal in configuring the block > size for metadata is performance from the user perspective ? i.e. > how fast is an ?ls -l? on my directory? Space savings aren?t > important, and how long policy scans or other ?administrative? type > tasks take is not nearly as important as that directory listing. > Does that change the recommended metadata block size? The performance challenges for the "ls -l" scenario are quite different from the policy scan scenario, so the same rules do not necessarily apply. During "ls -l" the code has to read inodes one by one (there's some prefetching going on, to take the edge off for the actual 'ls' thread, but prefetching is still one inode at a time). Metadata block size doesn't really come into the picture in this case, but inode size can be important -- depending on the storage performance characteristics. Does the storage you use for metadata exhibit a meaningfully different latency for 4K random reads vs 512 byte random reads? In my personal experience, on any modern storage device the difference is non-existent; in fact many devices (like all flash-based storage) use 4K native physical block size, and merely emulate 512 byte "sectors", so there's no way to read less than 4K. So from the inode read latency point of view 4K vs 512B is most likely a wash, but then 4K inodes can help improve performance of other operations, e.g. readdir of a small directory which fits entirely into the inode. If you use xattrs (e.g. as a side effect of using HSM), 4K inodes definitely help, but allowing xattrs to be stored in the inode. Policy scans reads inodes in full blocks, and there both metadata block size and inode size matter. Larger blocks could improve the inode read performance, while larger inodes mean that the same number of blocks hold fewer inodes and thus more blocks need to be read. So the policy inode scan performance is benefited by larger metadata block size and smaller inodes. However, policy scans also have to perform a directory traversal step, and that step tends to dominate the runtime of the overall run, and using larger inodes actually helps to speed up traversal of smaller directories. So whether larger inodes help or hurt the policy scan performance depends, yet again, on your file system composition. Overall, I believe that with all angles considered, larger inodes help with performance, and that was one of the considerations for making 4K inodes the default in V4.2+ versions. > 2) Let?s assume we have 3 filesystems, /home, /scratch (traditional > HPC use for those two) and /data (project space). Our storage > arrays are 24-bay units with two 8+2P RAID 6 LUNs, one RAID 1 > mirror, and two hot spare drives. The RAID 1 mirrors are for /home, > the RAID 6 LUNs are for /scratch or /data. /home has tons of small > files - so small that a 64K block size is currently used. /scratch > and /data have a mixture, but a 1 MB block size is the ?sweet spot? there. > > If you could ?start all over? with the same hardware being the only > restriction, would you: > > a) merge /scratch and /data into one filesystem but keep /home > separate since the LUN sizes are so very different, or > b) merge all three into one filesystem and use storage pools so that > /home is just a separate pool within the one filesystem? And if you > chose this option would you assign different block sizes to the pools? It's not possible to have different block sizes for different data pools. 
URL:

From Kevin.Buterbaugh at Vanderbilt.Edu Wed Sep 28 14:45:14 2016
From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L)
Date: Wed, 28 Sep 2016 13:45:14 +0000
Subject: [gpfsug-discuss] Blocksize - file size distribution
In-Reply-To: References: Message-ID:

Greg,

Not saying this is the right way to go, but I rolled my own. I wrote a very simple Perl script that essentially does the Perl equivalent of a find on my GPFS filesystems, then stat's files and directories and writes the output to a text file. I run that one overnight or on the weekends. Takes about 6 hours to run across our 3 GPFS filesystems with metadata on SSDs.

Then I've written a couple of different Perl scripts to analyze that data in different ways (the idea being that collecting the data is "expensive" - but once you've got it it's cheap to analyze it in different ways). But the one I've been using for this project just breaks down the number of files and directories by size and age and produces a table. Rather than try to describe this, here's sample output:

For input file: gpfsFileInfo_20160915.txt

          <1 day | <1 wk  | <1 mo  | <2 mo  | <3 mo  | <4 mo  | <5 mo   | <6 mo  | <1 yr  | >1 year | Total Files
<1 KB     29538    111364   458260   634398   150199   305715   4388443   93733    966618   3499535   10637803
<2 KB     9875     20580    119414   295167   35961    67761    80462     33688    269595   851641    1784144
<4 KB     9212     45282    168678   496796   27771    23887    105135    23161    259242   1163327   2322491
<8 KB     4992     29284    105836   303349   28341    20346    246430    28394    216061   1148459   2131492
<16 KB    3987     18391    92492    218639   20513    19698    675097    30976    190691   851533    2122017
<32 KB    4844     12479    50235    265222   24830    18789    1058433   18030    196729   1066287   2715878
<64 KB    6358     24259    29474    222134   17493    10744    1381445   11358    240528   1123540   3067333
<128 KB   6531     59107    206269   186213   71823    114235   1008724   36722    186357   845921    2721902
<256 KB   1995     17638    19355    436611   8505     7554     3582738   7519     249510   744885    5076310
<512 KB   20645    12401    24700    111463   5659     22132    1121269   10774    273010   725155    2327208
<1 MB     2681     6482     37447    58459    6998     14945    305108    5857     160360   386152    984489
<4 MB     4554     84551    23320    100407   6818     32833    129758    22774    210935   528458    1144408
<1 GB     56652    33538    99667    87778    24313    68372    118928    42554    251528   916493    1699823
<10 GB    1245     2482     4524     3184     1043     1794     2733      1694     8731     20462     47892
<100 GB   47       230      470      265      92       198      172       122      1276     2061      4933
>100 GB   2        3        12       1        14       4        5         1        37       165       244

Total TB:  6.49    13.22    30.56    18.00    10.22    15.69    19.87     12.48    73.47    187.44
Grand Total: 387.46 TB

Everything other than the total space lines at the bottom are counts of number of files meeting that criteria. I've got another variation on the same script that we used when we were trying to determine how many old files we have and therefore how much data was an excellent candidate for moving to a slower, less expensive "capacity" pool.

I'm not sure how useful my tools would be to others - I'm certainly not a professional programmer by any stretch of the imagination (and yes, I do hear those of you who are saying, "Yeah, he's barely a professional SysAdmin!" ). But others of you have been so helpful to me - I'd like to try in some small way to help someone else.

Kevin

On Sep 28, 2016, at 5:56 AM, Oesterlin, Robert > wrote:

/usr/lpp/mmfs/samples/debugtools/filehist

Look at the README in that directory.
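(As an aside, not from the original posts: the same raw size/age data can also be gathered in a single policy scan, along the lines Marc and Ed describe further down in this thread. The rule below is only a sketch - the rule name and list name are arbitrary, and FILE_SIZE and ACCESS_TIME are standard policy-language attributes - and the resulting list still has to be binned into a table like the one above by a small post-processing step:)

RULE 'sizeage' LIST 'sizeage'
  SHOW (varchar(FILE_SIZE) || ' ' ||
        varchar(DAYS(CURRENT_TIMESTAMP) - DAYS(ACCESS_TIME)))

Run it through mmapplypolicy with -I defer (see Ed's note further down) so the list files are written out for post-processing.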
Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Greg.Lehmann at csiro.au" > Reply-To: gpfsug main discussion list > Date: Wednesday, September 28, 2016 at 2:40 AM To: "gpfsug-discuss at spectrumscale.org" > Subject: [EXTERNAL] Re: [gpfsug-discuss] Blocksize I am wondering what people use to produce a file size distribution report for their filesystems. Has everyone rolled their own or is there some goto app to use. Cheers, Greg From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: Wednesday, 28 September 2016 7:21 AM To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Blocksize Hi All, Again, my thanks to all who responded to my last post. Let me begin by stating something I unintentionally omitted in my last post ? we already use SSDs for our metadata. Which leads me to yet another question ? of my three filesystems, two (/home and /scratch) are much older (created in 2010) and therefore currently have a 512 byte inode size. /data is newer and has a 4K inode size. Now if I combine /scratch and /data into one filesystem with a 4K inode size, the amount of space used by all the inodes coming over from /scratch is going to grow by a factor of eight unless I?m horribly confused. And I would assume I need to count the amount of space taken up by allocated inodes, not just used inodes. Therefore ? how much space my metadata takes up just grew significantly in importance since: 1) metadata is on very expensive enterprise class, vendor certified SSDs, 2) we use RAID 1 mirrors of those SSDs, and 3) we have metadata replication set to two. Some of the information presented by Sven and Yuri seems to contradict each other in regards to how much space inodes take up ? or I?m misunderstanding one or both of them! Leaving aside replication, if I use a 256K block size for my metadata and I use 4K inodes, are those inodes going to take up 4K each or are they going to take up 8K each (1/32nd of a 256K block)? By the way, I do have a file size / file age spreadsheet for each of my filesystems (which I would be willing to share with interested parties) and while I was not surprised to learn that I have over 10 million sub-1K files on /home, I was a bit surprised to find that I have almost as many sub-1K files on /scratch (and a few million more on /data). So there?s a huge potential win in having those files in the inode on SSD as opposed to on spinning disk, but there?s also a huge potential $$$ cost. Thanks again ? I hope others are gaining useful information from this thread. I sure am! Kevin On Sep 27, 2016, at 1:26 PM, Yuri L Volobuev > wrote: > 1) Let?s assume that our overarching goal in configuring the block > size for metadata is performance from the user perspective ? i.e. > how fast is an ?ls -l? on my directory? Space savings aren?t > important, and how long policy scans or other ?administrative? type > tasks take is not nearly as important as that directory listing. > Does that change the recommended metadata block size? The performance challenges for the "ls -l" scenario are quite different from the policy scan scenario, so the same rules do not necessarily apply. During "ls -l" the code has to read inodes one by one (there's some prefetching going on, to take the edge off for the actual 'ls' thread, but prefetching is still one inode at a time). 
Metadata block size doesn't really come into the picture in this case, but inode size can be important -- depending on the storage performance characteristics. Does the storage you use for metadata exhibit a meaningfully different latency for 4K random reads vs 512 byte random reads? In my personal experience, on any modern storage device the difference is non-existent; in fact many devices (like all flash-based storage) use 4K native physical block size, and merely emulate 512 byte "sectors", so there's no way to read less than 4K. So from the inode read latency point of view 4K vs 512B is most likely a wash, but then 4K inodes can help improve performance of other operations, e.g. readdir of a small directory which fits entirely into the inode. If you use xattrs (e.g. as a side effect of using HSM), 4K inodes definitely help, but allowing xattrs to be stored in the inode. Policy scans reads inodes in full blocks, and there both metadata block size and inode size matter. Larger blocks could improve the inode read performance, while larger inodes mean that the same number of blocks hold fewer inodes and thus more blocks need to be read. So the policy inode scan performance is benefited by larger metadata block size and smaller inodes. However, policy scans also have to perform a directory traversal step, and that step tends to dominate the runtime of the overall run, and using larger inodes actually helps to speed up traversal of smaller directories. So whether larger inodes help or hurt the policy scan performance depends, yet again, on your file system composition. Overall, I believe that with all angles considered, larger inodes help with performance, and that was one of the considerations for making 4K inodes the default in V4.2+ versions. > 2) Let?s assume we have 3 filesystems, /home, /scratch (traditional > HPC use for those two) and /data (project space). Our storage > arrays are 24-bay units with two 8+2P RAID 6 LUNs, one RAID 1 > mirror, and two hot spare drives. The RAID 1 mirrors are for /home, > the RAID 6 LUNs are for /scratch or /data. /home has tons of small > files - so small that a 64K block size is currently used. /scratch > and /data have a mixture, but a 1 MB block size is the ?sweet spot? there. > > If you could ?start all over? with the same hardware being the only > restriction, would you: > > a) merge /scratch and /data into one filesystem but keep /home > separate since the LUN sizes are so very different, or > b) merge all three into one filesystem and use storage pools so that > /home is just a separate pool within the one filesystem? And if you > chose this option would you assign different block sizes to the pools? It's not possible to have different block sizes for different data pools. We are very aware that many people would like to be able to do just that, but this is counter to where the product is going. Supporting different block sizes for different pools is actually pretty hard: it's tricky to describe a large file that has some blocks in poolA and some in poolB where poolB has a different block size (perhaps during a migration) with the existing inode/indirect block format where each disk address pointer addresses a block of fixed size. With some effort, and some changes to how block addressing works, it would be possible to implement the support for this. 
However, as I mentioned in another post in this thread, we don't really want to glorify manual block size selection any further, we want to move away from it, by addressing the reasons that drive different block size selection today (like disk space utilization and performance). I'd recommend calculating a file size distribution histogram for your file systems. You may, for example, discover that 80% of the small files you have in /home would fit into 4K inodes, and then the storage space efficiency gains for the remaining 20% don't justify the complexity of managing an extra file system with a small block size. We don't recommend using block sizes smaller than 256K, because smaller block size is not good for disk space allocation code efficiency. It's a quadratic dependency: with a smaller block size, one block worth of the block allocation map covers that much less disk space, because each bit in the map covers fewer disk sectors, and fewer bits fit into a block. This means having to create a lot more block allocation map segments than what is needed for an ample level of parallelism. This hurts performance of many block allocation-related operations. I don't see a reason for /scratch and /data to be separate file systems, aside from perhaps failure domain considerations. yuri > On Sep 26, 2016, at 2:29 PM, Yuri L Volobuev > wrote: > > I would put the net summary this way: in GPFS, the "Goldilocks zone" > for metadata block size is 256K - 1M. If one plans to create a new > file system using GPFS V4.2+, 1M is a sound choice. > > In an ideal world, block size choice shouldn't really be a choice. > It's a low-level implementation detail that one day should go the > way of the manual ignition timing adjustment -- something that used > to be necessary in the olden days, and something that select > enthusiasts like to tweak to this day, but something that's > irrelevant for the overwhelming majority of the folks who just want > the engine to run. There's work being done in that general direction > in GPFS, but we aren't there yet. > > yuri > > Stephen Ulmer ---09/26/2016 12:02:25 PM---Now I?ve got > anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: > > From: Stephen Ulmer > > To: gpfsug main discussion list >, > Date: 09/26/2016 12:02 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Now I?ve got anther question? which I?ll let bake for a while. > > Okay, to (poorly) summarize: > There are items OTHER THAN INODES stored as metadata in GPFS. > These items have a VARIETY OF SIZES, but are packed in such a way > that we should just not worry about wasted space unless we pick a > LARGE metadata block size ? or if we don?t pick a ?reasonable? > metadata block size after picking a ?large? file system block size > that applies to both. > Performance is hard, and the gain from calculating exactly the best > metadata block size is much smaller than performance gains attained > through code optimization. > If we were to try and calculate the appropriate metadata block size > we would likely be wrong anyway, since none of us get our data at > the idealized physics shop that sells massless rulers and > frictionless pulleys. > We should probably all use a metadata block size around 1MB. Nobody > has said this outright, but it?s been the example as the ?good? size > at least three times in this thread. 
> Under no circumstances should we do what many of us would have done > and pick 128K, which made sense based on all of our previous > education that is no longer applicable. > > Did I miss anything? :) > > Liberty, > > -- > Stephen > > On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev > wrote: > It's important to understand the differences between different > metadata types, in particular where it comes to space allocation. > > System metadata files (inode file, inode and block allocation maps, > ACL file, fileset metadata file, EA file in older versions) are > allocated at well-defined moments (file system format, new storage > pool creation in the case of block allocation map, etc), and those > contain multiple records packed into a single block. From the block > allocator point of view, the individual metadata record size is > invisible, only larger blocks get actually allocated, and space > usage efficiency generally isn't an issue. > > For user metadata (indirect blocks, directory blocks, EA overflow > blocks) the situation is different. Those get allocated as the need > arises, generally one at a time. So the size of an individual > metadata structure matters, a lot. The smallest unit of allocation > in GPFS is a subblock (1/32nd of a block). If an IB or a directory > block is smaller than a subblock, the unused space in the subblock > is wasted. So if one chooses to use, say, 16 MiB block size for > metadata, the smallest unit of space that can be allocated is 512 > KiB. If one chooses 1 MiB block size, the smallest allocation unit > is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with > any reasonable data block size); directory blocks used to be limited > to 32 KiB, but in the current code can be as large as 256 KiB. As > one can observe, using 16 MiB metadata block size would lead to a > considerable amount of wasted space for IBs and large directories > (small directories can live in inodes). On the other hand, with 1 > MiB block size, there'll be no wasted metadata space. Does any of > this actually make a practical difference? That depends on the file > system composition, namely the number of IBs (which is a function of > the number of large files) and larger directories. Calculating this > scientifically can be pretty involved, and really should be the job > of a tool that ought to exist, but doesn't (yet). A more practical > approach is doing a ballpark estimate using local file counts and > typical fractions of large files and directories, using statistics > available from published papers. > > The performance implications of a given metadata block size choice > is a subject of nearly infinite depth, and this question ultimately > can only be answered by doing experiments with a specific workload > on specific hardware. The metadata space utilization efficiency is > something that can be answered conclusively though. > > yuri > > "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi > Sven, I am confused by your statement that the metadata block size > should be 1 MB and am very int > > From: "Buterbaugh, Kevin L" > > To: gpfsug main discussion list >, > Date: 09/24/2016 07:19 AM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Hi Sven, > > I am confused by your statement that the metadata block size should > be 1 MB and am very interested in learning the rationale behind this > as I am currently looking at all aspects of our current GPFS > configuration and the possibility of making major changes. 
> > If you have a filesystem with only metadataOnly disks in the system > pool and the default size of an inode is 4K (which we would do, > since we have recently discovered that even on our scratch > filesystem we have a bazillion files that are 4K or smaller and > could therefore have their data stored in the inode, right?), then > why would you set the metadata block size to anything larger than > 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block > size for metadata wouldn?t you be wasting a massive amount of space? > > What am I missing / confused about there? > > Oh, and here?s a related question ? let?s just say I have the above > configuration ? my system pool is metadata only and is on SSD?s. > Then I have two other dataOnly pools that are spinning disk. One is > for ?regular? access and the other is the ?capacity? pool ? i.e. a > pool of slower storage where we move files with large access times. > I have a policy that says something like ?move all files with an > access time > 6 months to the capacity pool.? Of those bazillion > files less than 4K in size that are fitting in the inode currently, > probably half a bazillion () of them would be subject to that > rule. Will they get moved to the spinning disk capacity pool or will > they stay in the inode?? > > Thanks! This is a very timely and interesting discussion for me as well... > > Kevin > On Sep 23, 2016, at 4:35 PM, Sven Oehme > wrote: > your metadata block size these days should be 1 MB and there are > only very few workloads for which you should run with a filesystem > blocksize below 1 MB. so if you don't know exactly what to pick, 1 > MB is a good starting point. > the general rule still applies that your filesystem blocksize > (metadata or data pool) should match your raid controller (or GNR > vdisk) stripe size of the particular pool. > > so if you use a 128k strip size(defaut in many midrange storage > controllers) in a 8+2p raid array, your stripe or track size is 1 MB > and therefore the blocksize of this pool should be 1 MB. i see many > customers in the field using 1MB or even smaller blocksize on RAID > stripes of 2 MB or above and your performance will be significant > impacted by that. > > Sven > > ------------------------------------------ > Sven Oehme > Scalable Storage Research > email: oehmes at us.ibm.com > Phone: +1 (408) 824-8904 > IBM Almaden Research Lab > ------------------------------------------ > > Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too > pedantic, but I believe the the subblock size is 1/32 of the block > size (which strengt > > From: Stephen Ulmer > > To: gpfsug main discussion list > > Date: 09/23/2016 12:16 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Not to be too pedantic, but I believe the the subblock size is 1/32 > of the block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but > about metadata block size. You can specify that the system pool have > a different block size from the rest of the filesystem, providing > that it ONLY holds metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems > without some counter-indication), I would think that we?d want to > use a metadata block size of 4K*32=128K. This is independent of the > regular block size, which you can calculate based on the workload if > you?re lucky. 
> > There could be a great reason NOT to use 128K metadata block size, > but I don?t know what it is. I?d be happy to be corrected about this > if it?s out of whack. > > -- > Stephen > On Sep 22, 2016, at 3:37 PM, Luis Bolinches > wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small > files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest > that you will actually benefit from smaller than 1MB block, leave > there. GPFS uses sublocks where 1/16th of the BS can be allocated to > different files, so the "waste" is much less than you think on 1MB > and you get the throughput and less structures of much more data blocks. > > No warranty at all but I try to do this when the BS talk comes in: > (might need some clean up it could not be last note but you get the idea) > > POSIX > find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to > inodes (WELL not exactly ... extended attributes! so maybe use a > lower number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and > lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful > machine to run the calculations on octave, I never hit anything > could not manage on a 64GB RAM Power box. Most of the times it is > enough with my laptop. > > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene > > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very small > > files in the inode. > > > > Typically you want your average file size to be your blocksize and your > > filesystem has one blocksize and one inodesize. > > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. 
> > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Wed Sep 28 15:34:05 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Wed, 28 Sep 2016 10:34:05 -0400 Subject: [gpfsug-discuss] Blocksize - file size distribution In-Reply-To: References: Message-ID: Consider using samples/ilm/mmfind (or mmapplypolicy with a LIST ... SHOW rule) to gather the stats much faster. Should be minutes, not hours. -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Wed Sep 28 16:23:12 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Wed, 28 Sep 2016 11:23:12 -0400 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <4BDC0E5A-176A-48CC-8DE6-93C7B5A3F138@vanderbilt.edu> References: <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org><13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org><6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> <4BDC0E5A-176A-48CC-8DE6-93C7B5A3F138@vanderbilt.edu> Message-ID: OKAY, I'll say it again. inodes are PACKED into a single inode file. So a 4KB inode takes 4KB, REGARDLESS of metadata blocksize. There is no wasted space. (Of course if you have metadata replication = 2, then yes, double that. And yes, there overhead for indirect blocks (indices), allocation maps, etc, etc.) And your choice is not just 512 or 4096. 
Maybe 1KB or 2KB is a good choice for your data distribution, to optimize packing of data and/or directories into inodes...

Hmmm... I don't know why the doc leaves out 2048, perhaps a typo...

mmcrfs x2K -i 2048

[root at n2 charts]# mmlsfs x2K -i
flag                value                    description
------------------- ------------------------ -----------------------------------
 -i                 2048                     Inode size in bytes

Works for me!

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From makaplan at us.ibm.com Wed Sep 28 16:33:29 2016
From: makaplan at us.ibm.com (Marc A Kaplan)
Date: Wed, 28 Sep 2016 11:33:29 -0400
Subject: [gpfsug-discuss] Proposal for dynamic heat assisted file tiering
In-Reply-To: References: Message-ID:

Suppose we could "dynamically" change the pool assignment of a file. How/when would you have us do that? When will that generate unnecessary, "wasteful" IOPs? How do we know if/when/how often you will access a file in the future?

This is similar to other classical caching policies, but there the choice is usually just which pages to flush from the cache when we need space ... The usual compromise is "LRU", but maybe some systems allow hints. When there are multiple pools, it seems more complicated, more degrees of freedom ...

Would you be willing and able to write some new policy rules to provide directions to Spectrum Scale for dynamic tiering? What would that look like? Would it be worth the time and effort over what we have now?

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From Robert.Oesterlin at nuance.com Wed Sep 28 19:13:35 2016
From: Robert.Oesterlin at nuance.com (Oesterlin, Robert)
Date: Wed, 28 Sep 2016 18:13:35 +0000
Subject: [gpfsug-discuss] Biggest file that will fit inside an inode?
Message-ID: <0D55AB1D-DB9D-45CF-AB27-157CDA1172D9@nuance.com>

What's the largest file that will fit inside a 1K, 2K, or 4K inode?

Bob Oesterlin
Sr Storage Engineer, Nuance HPC Grid

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From ewahl at osc.edu Wed Sep 28 21:18:55 2016
From: ewahl at osc.edu (Edward Wahl)
Date: Wed, 28 Sep 2016 16:18:55 -0400
Subject: [gpfsug-discuss] Blocksize - file size distribution
In-Reply-To: References: Message-ID: <20160928161855.1df32434@osc.edu>

On Wed, 28 Sep 2016 10:34:05 -0400 Marc A Kaplan wrote:
> Consider using samples/ilm/mmfind (or mmapplypolicy with a LIST ...
> SHOW rule) to gather the stats much faster. Should be minutes, not
> hours.
>

I'll agree with the policy engine. Runs like a beast if you tune it a little for nodes and threads. Only takes a couple of minutes to collect info on over a hundred million files.

Show where the data is now by pool and sort it by age with queries? Quick hack-up example; you could sort the mess on the front end fairly quickly. (Use fileset or pool, etc. as your storage needs dictate.)

RULE '2yrold_files' LIST '2yrold_filelist.txt'
  SHOW (varchar(file_size) || ' ' || varchar(USER_ID) || ' ' || varchar(POOL_NAME))
  WHERE DAYS(CURRENT_TIMESTAMP) - DAYS(ACCESS_TIME) >= 730
    AND DAYS(CURRENT_TIMESTAMP) - DAYS(ACCESS_TIME) < 1095

Don't forget to run the engine with -I defer for this kind of list/show policy.
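(Again only an illustrative sketch, not from the original post: a typical deferred run of a rule like the one above might look like the following, where gpfs0 is a placeholder file system name, policy.rules holds the rule, and the -f prefix, -N helper nodes and -g global work directory are all placeholders to adapt to your own cluster:)

mmapplypolicy gpfs0 -P policy.rules -I defer -f /tmp/filesize -N node1,node2 -g /gpfs/gpfs0/tmp

With -I defer the candidate list files are generated under the -f prefix without any further action being taken, so they can be sorted and binned on the front end however you like.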
Ed

--
Ed Wahl
Ohio Supercomputer Center
614-292-9302

From christof.schmitt at us.ibm.com Wed Sep 28 21:33:45 2016
From: christof.schmitt at us.ibm.com (Christof Schmitt)
Date: Wed, 28 Sep 2016 13:33:45 -0700
Subject: [gpfsug-discuss] Samba via CES
In-Reply-To: <20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz>
References: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> <20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz>
Message-ID:

The client has to reconnect, open the file again and reissue requests that have not been completed. Without persistent handles, the main risk is that another client can step in and access the same file in the meantime. With persistent handles, access from other clients would be prevented for a defined amount of time.

Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ
christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469)

From: Lukas Hejtmanek
To: gpfsug main discussion list
Date: 09/27/2016 02:43 PM
Subject: Re: [gpfsug-discuss] Samba via CES
Sent by: gpfsug-discuss-bounces at spectrumscale.org

On Tue, Sep 27, 2016 at 02:36:37PM -0700, Christof Schmitt wrote:
> When a CES node fails, protocol clients have to reconnect to one of the
> remaining nodes.
>
> Samba in CES does not support persistent handles. This is indicated in the
> documentation:
>
> http://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_smbexportlimits.htm#bl1adm_smbexportlimits
>
> "Only mandatory SMB3 protocol features are supported. "

Well, but in this case the HA feature is a bit pointless, as a node failure results in a client failure as well, since reconnect does not seem to be automatic if there is ongoing traffic.. more precisely, reconnect is automatic, but without persistent handles the client receives a write-protect error immediately.

--
Lukáš Hejtmánek
_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss

From bbanister at jumptrading.com Wed Sep 28 21:56:47 2016
From: bbanister at jumptrading.com (Bryan Banister)
Date: Wed, 28 Sep 2016 20:56:47 +0000
Subject: [gpfsug-discuss] Biggest file that will fit inside an inode?
In-Reply-To: <0D55AB1D-DB9D-45CF-AB27-157CDA1172D9@nuance.com>
References: <0D55AB1D-DB9D-45CF-AB27-157CDA1172D9@nuance.com>
Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB0633CA80@CHI-EXCHANGEW1.w2k.jumptrading.com>

I think the guideline for 4K inodes is roughly 3.5 KB, depending on the use of extended attributes,
-Bryan

From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Oesterlin, Robert
Sent: Wednesday, September 28, 2016 1:14 PM
To: gpfsug main discussion list
Subject: [gpfsug-discuss] Biggest file that will fit inside an inode?

What's the largest file that will fit inside a 1K, 2K, or 4K inode?

Bob Oesterlin
Sr Storage Engineer, Nuance HPC Grid

________________________________
Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments.
This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhejtman at ics.muni.cz Wed Sep 28 23:03:36 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Thu, 29 Sep 2016 00:03:36 +0200 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: References: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> <20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> Message-ID: <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz> On Wed, Sep 28, 2016 at 01:33:45PM -0700, Christof Schmitt wrote: > The client has to reconnect, open the file again and reissue request that > have not been completed. Without persistent handles, the main risk is that > another client can step in and access the same file in the meantime. With > persistent handles, access from other clients would be prevented for a > defined amount of time. well, I guess I cannot reconfigure the client so that reissuing request is done by OS and not rised up to the user? E.g., if user runs video encoding directly to Samba share and encoding runs for several hours, reissuing request, i.e., restart encoding, is not exactly what user accepts. -- Luk?? Hejtm?nek From abeattie at au1.ibm.com Wed Sep 28 23:25:01 2016 From: abeattie at au1.ibm.com (Andrew Beattie) Date: Wed, 28 Sep 2016 22:25:01 +0000 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz> References: <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz>, <20160927193815.kpppiy76wpudg6cj@ics.muni.cz><20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> Message-ID: An HTML attachment was scrubbed... URL: From Greg.Lehmann at csiro.au Wed Sep 28 23:49:31 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Wed, 28 Sep 2016 22:49:31 +0000 Subject: [gpfsug-discuss] Blocksize - file size distribution In-Reply-To: References: Message-ID: <2ed56fe8c9c34eb5a1da25800b2951e0@exch1-cdc.nexus.csiro.au> Kevin, Thanks for the offer of help. I am capable of writing my own, but it looks like the best approach is to use mmapplypolicy, something I had not thought of. This is precisely the reason I asked what looks like a silly question. You don?t know what you don?t know! The quality of content on this list has been exceptional of late! Cheers, Greg From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: Wednesday, 28 September 2016 11:45 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Blocksize - file size distribution Greg, Not saying this is the right way to go, but I rolled my own. I wrote a very simple Perl script that essentially does the Perl equivalent of a find on my GPFS filesystems, then stat?s files and directories and writes the output to a text file. I run that one overnight or on the weekends. Takes about 6 hours to run across our 3 GPFS filesystems with metadata on SSDs. Then I?ve written a couple of different Perl scripts to analyze that data in different ways (the idea being that collecting the data is ?expensive? ? but once you?ve got it it?s cheap to analyze it in different ways). But the one I?ve been using for this project just breaks down the number of files and directories by size and age and produces a table. 
Rather than try to describe this, here?s sample output: For input file: gpfsFileInfo_20160915.txt <1 day | <1 wk | <1 mo | <2 mo | <3 mo | <4 mo | <5 mo | <6 mo | <1 yr | >1 year | Total Files <1 KB 29538 111364 458260 634398 150199 305715 4388443 93733 966618 3499535 10637803 <2 KB 9875 20580 119414 295167 35961 67761 80462 33688 269595 851641 1784144 <4 KB 9212 45282 168678 496796 27771 23887 105135 23161 259242 1163327 2322491 <8 KB 4992 29284 105836 303349 28341 20346 246430 28394 216061 1148459 2131492 <16 KB 3987 18391 92492 218639 20513 19698 675097 30976 190691 851533 2122017 <32 KB 4844 12479 50235 265222 24830 18789 1058433 18030 196729 1066287 2715878 <64 KB 6358 24259 29474 222134 17493 10744 1381445 11358 240528 1123540 3067333 <128 KB 6531 59107 206269 186213 71823 114235 1008724 36722 186357 845921 2721902 <256 KB 1995 17638 19355 436611 8505 7554 3582738 7519 249510 744885 5076310 <512 KB 20645 12401 24700 111463 5659 22132 1121269 10774 273010 725155 2327208 <1 MB 2681 6482 37447 58459 6998 14945 305108 5857 160360 386152 984489 <4 MB 4554 84551 23320 100407 6818 32833 129758 22774 210935 528458 1144408 <1 GB 56652 33538 99667 87778 24313 68372 118928 42554 251528 916493 1699823 <10 GB 1245 2482 4524 3184 1043 1794 2733 1694 8731 20462 47892 <100 GB 47 230 470 265 92 198 172 122 1276 2061 4933 >100 GB 2 3 12 1 14 4 5 1 37 165 244 Total TB: 6.49 13.22 30.56 18.00 10.22 15.69 19.87 12.48 73.47 187.44 Grand Total: 387.46 TB Everything other than the total space lines at the bottom are counts of number of files meeting that criteria. I?ve got another variation on the same script that we used when we were trying to determine how many old files we have and therefore how much data was an excellent candidate for moving to a slower, less expensive ?capacity? pool. I?m not sure how useful my tools would be to others ? I?m certainly not a professional programmer by any stretch of the imagination (and yes, I do hear those of you who are saying, ?Yeah, he?s barely a professional SysAdmin!? ). But others of you have been so helpful to me ? I?d like to try in some small way to help someone else. Kevin On Sep 28, 2016, at 5:56 AM, Oesterlin, Robert > wrote: /usr/lpp/mmfs/samples/debugtools/filehist Look at the README in that directory. Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Greg.Lehmann at csiro.au" > Reply-To: gpfsug main discussion list > Date: Wednesday, September 28, 2016 at 2:40 AM To: "gpfsug-discuss at spectrumscale.org" > Subject: [EXTERNAL] Re: [gpfsug-discuss] Blocksize I am wondering what people use to produce a file size distribution report for their filesystems. Has everyone rolled their own or is there some goto app to use. Cheers, Greg From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: Wednesday, 28 September 2016 7:21 AM To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Blocksize Hi All, Again, my thanks to all who responded to my last post. Let me begin by stating something I unintentionally omitted in my last post ? we already use SSDs for our metadata. Which leads me to yet another question ? of my three filesystems, two (/home and /scratch) are much older (created in 2010) and therefore currently have a 512 byte inode size. /data is newer and has a 4K inode size. 
Now if I combine /scratch and /data into one filesystem with a 4K inode size, the amount of space used by all the inodes coming over from /scratch is going to grow by a factor of eight unless I?m horribly confused. And I would assume I need to count the amount of space taken up by allocated inodes, not just used inodes. Therefore ? how much space my metadata takes up just grew significantly in importance since: 1) metadata is on very expensive enterprise class, vendor certified SSDs, 2) we use RAID 1 mirrors of those SSDs, and 3) we have metadata replication set to two. Some of the information presented by Sven and Yuri seems to contradict each other in regards to how much space inodes take up ? or I?m misunderstanding one or both of them! Leaving aside replication, if I use a 256K block size for my metadata and I use 4K inodes, are those inodes going to take up 4K each or are they going to take up 8K each (1/32nd of a 256K block)? By the way, I do have a file size / file age spreadsheet for each of my filesystems (which I would be willing to share with interested parties) and while I was not surprised to learn that I have over 10 million sub-1K files on /home, I was a bit surprised to find that I have almost as many sub-1K files on /scratch (and a few million more on /data). So there?s a huge potential win in having those files in the inode on SSD as opposed to on spinning disk, but there?s also a huge potential $$$ cost. Thanks again ? I hope others are gaining useful information from this thread. I sure am! Kevin On Sep 27, 2016, at 1:26 PM, Yuri L Volobuev > wrote: > 1) Let?s assume that our overarching goal in configuring the block > size for metadata is performance from the user perspective ? i.e. > how fast is an ?ls -l? on my directory? Space savings aren?t > important, and how long policy scans or other ?administrative? type > tasks take is not nearly as important as that directory listing. > Does that change the recommended metadata block size? The performance challenges for the "ls -l" scenario are quite different from the policy scan scenario, so the same rules do not necessarily apply. During "ls -l" the code has to read inodes one by one (there's some prefetching going on, to take the edge off for the actual 'ls' thread, but prefetching is still one inode at a time). Metadata block size doesn't really come into the picture in this case, but inode size can be important -- depending on the storage performance characteristics. Does the storage you use for metadata exhibit a meaningfully different latency for 4K random reads vs 512 byte random reads? In my personal experience, on any modern storage device the difference is non-existent; in fact many devices (like all flash-based storage) use 4K native physical block size, and merely emulate 512 byte "sectors", so there's no way to read less than 4K. So from the inode read latency point of view 4K vs 512B is most likely a wash, but then 4K inodes can help improve performance of other operations, e.g. readdir of a small directory which fits entirely into the inode. If you use xattrs (e.g. as a side effect of using HSM), 4K inodes definitely help, but allowing xattrs to be stored in the inode. Policy scans reads inodes in full blocks, and there both metadata block size and inode size matter. Larger blocks could improve the inode read performance, while larger inodes mean that the same number of blocks hold fewer inodes and thus more blocks need to be read. 
So the policy inode scan performance is benefited by larger metadata block size and smaller inodes. However, policy scans also have to perform a directory traversal step, and that step tends to dominate the runtime of the overall run, and using larger inodes actually helps to speed up traversal of smaller directories. So whether larger inodes help or hurt the policy scan performance depends, yet again, on your file system composition. Overall, I believe that with all angles considered, larger inodes help with performance, and that was one of the considerations for making 4K inodes the default in V4.2+ versions. > 2) Let?s assume we have 3 filesystems, /home, /scratch (traditional > HPC use for those two) and /data (project space). Our storage > arrays are 24-bay units with two 8+2P RAID 6 LUNs, one RAID 1 > mirror, and two hot spare drives. The RAID 1 mirrors are for /home, > the RAID 6 LUNs are for /scratch or /data. /home has tons of small > files - so small that a 64K block size is currently used. /scratch > and /data have a mixture, but a 1 MB block size is the ?sweet spot? there. > > If you could ?start all over? with the same hardware being the only > restriction, would you: > > a) merge /scratch and /data into one filesystem but keep /home > separate since the LUN sizes are so very different, or > b) merge all three into one filesystem and use storage pools so that > /home is just a separate pool within the one filesystem? And if you > chose this option would you assign different block sizes to the pools? It's not possible to have different block sizes for different data pools. We are very aware that many people would like to be able to do just that, but this is counter to where the product is going. Supporting different block sizes for different pools is actually pretty hard: it's tricky to describe a large file that has some blocks in poolA and some in poolB where poolB has a different block size (perhaps during a migration) with the existing inode/indirect block format where each disk address pointer addresses a block of fixed size. With some effort, and some changes to how block addressing works, it would be possible to implement the support for this. However, as I mentioned in another post in this thread, we don't really want to glorify manual block size selection any further, we want to move away from it, by addressing the reasons that drive different block size selection today (like disk space utilization and performance). I'd recommend calculating a file size distribution histogram for your file systems. You may, for example, discover that 80% of the small files you have in /home would fit into 4K inodes, and then the storage space efficiency gains for the remaining 20% don't justify the complexity of managing an extra file system with a small block size. We don't recommend using block sizes smaller than 256K, because smaller block size is not good for disk space allocation code efficiency. It's a quadratic dependency: with a smaller block size, one block worth of the block allocation map covers that much less disk space, because each bit in the map covers fewer disk sectors, and fewer bits fit into a block. This means having to create a lot more block allocation map segments than what is needed for an ample level of parallelism. This hurts performance of many block allocation-related operations. I don't see a reason for /scratch and /data to be separate file systems, aside from perhaps failure domain considerations. 
yuri > On Sep 26, 2016, at 2:29 PM, Yuri L Volobuev > wrote: > > I would put the net summary this way: in GPFS, the "Goldilocks zone" > for metadata block size is 256K - 1M. If one plans to create a new > file system using GPFS V4.2+, 1M is a sound choice. > > In an ideal world, block size choice shouldn't really be a choice. > It's a low-level implementation detail that one day should go the > way of the manual ignition timing adjustment -- something that used > to be necessary in the olden days, and something that select > enthusiasts like to tweak to this day, but something that's > irrelevant for the overwhelming majority of the folks who just want > the engine to run. There's work being done in that general direction > in GPFS, but we aren't there yet. > > yuri > > Stephen Ulmer ---09/26/2016 12:02:25 PM---Now I?ve got > anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: > > From: Stephen Ulmer > > To: gpfsug main discussion list >, > Date: 09/26/2016 12:02 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Now I?ve got anther question? which I?ll let bake for a while. > > Okay, to (poorly) summarize: > There are items OTHER THAN INODES stored as metadata in GPFS. > These items have a VARIETY OF SIZES, but are packed in such a way > that we should just not worry about wasted space unless we pick a > LARGE metadata block size ? or if we don?t pick a ?reasonable? > metadata block size after picking a ?large? file system block size > that applies to both. > Performance is hard, and the gain from calculating exactly the best > metadata block size is much smaller than performance gains attained > through code optimization. > If we were to try and calculate the appropriate metadata block size > we would likely be wrong anyway, since none of us get our data at > the idealized physics shop that sells massless rulers and > frictionless pulleys. > We should probably all use a metadata block size around 1MB. Nobody > has said this outright, but it?s been the example as the ?good? size > at least three times in this thread. > Under no circumstances should we do what many of us would have done > and pick 128K, which made sense based on all of our previous > education that is no longer applicable. > > Did I miss anything? :) > > Liberty, > > -- > Stephen > > On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev > wrote: > It's important to understand the differences between different > metadata types, in particular where it comes to space allocation. > > System metadata files (inode file, inode and block allocation maps, > ACL file, fileset metadata file, EA file in older versions) are > allocated at well-defined moments (file system format, new storage > pool creation in the case of block allocation map, etc), and those > contain multiple records packed into a single block. From the block > allocator point of view, the individual metadata record size is > invisible, only larger blocks get actually allocated, and space > usage efficiency generally isn't an issue. > > For user metadata (indirect blocks, directory blocks, EA overflow > blocks) the situation is different. Those get allocated as the need > arises, generally one at a time. So the size of an individual > metadata structure matters, a lot. The smallest unit of allocation > in GPFS is a subblock (1/32nd of a block). If an IB or a directory > block is smaller than a subblock, the unused space in the subblock > is wasted. 
So if one chooses to use, say, 16 MiB block size for > metadata, the smallest unit of space that can be allocated is 512 > KiB. If one chooses 1 MiB block size, the smallest allocation unit > is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with > any reasonable data block size); directory blocks used to be limited > to 32 KiB, but in the current code can be as large as 256 KiB. As > one can observe, using 16 MiB metadata block size would lead to a > considerable amount of wasted space for IBs and large directories > (small directories can live in inodes). On the other hand, with 1 > MiB block size, there'll be no wasted metadata space. Does any of > this actually make a practical difference? That depends on the file > system composition, namely the number of IBs (which is a function of > the number of large files) and larger directories. Calculating this > scientifically can be pretty involved, and really should be the job > of a tool that ought to exist, but doesn't (yet). A more practical > approach is doing a ballpark estimate using local file counts and > typical fractions of large files and directories, using statistics > available from published papers. > > The performance implications of a given metadata block size choice > is a subject of nearly infinite depth, and this question ultimately > can only be answered by doing experiments with a specific workload > on specific hardware. The metadata space utilization efficiency is > something that can be answered conclusively though. > > yuri > > "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi > Sven, I am confused by your statement that the metadata block size > should be 1 MB and am very int > > From: "Buterbaugh, Kevin L" > > To: gpfsug main discussion list >, > Date: 09/24/2016 07:19 AM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Hi Sven, > > I am confused by your statement that the metadata block size should > be 1 MB and am very interested in learning the rationale behind this > as I am currently looking at all aspects of our current GPFS > configuration and the possibility of making major changes. > > If you have a filesystem with only metadataOnly disks in the system > pool and the default size of an inode is 4K (which we would do, > since we have recently discovered that even on our scratch > filesystem we have a bazillion files that are 4K or smaller and > could therefore have their data stored in the inode, right?), then > why would you set the metadata block size to anything larger than > 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block > size for metadata wouldn?t you be wasting a massive amount of space? > > What am I missing / confused about there? > > Oh, and here?s a related question ? let?s just say I have the above > configuration ? my system pool is metadata only and is on SSD?s. > Then I have two other dataOnly pools that are spinning disk. One is > for ?regular? access and the other is the ?capacity? pool ? i.e. a > pool of slower storage where we move files with large access times. > I have a policy that says something like ?move all files with an > access time > 6 months to the capacity pool.? Of those bazillion > files less than 4K in size that are fitting in the inode currently, > probably half a bazillion () of them would be subject to that > rule. Will they get moved to the spinning disk capacity pool or will > they stay in the inode?? > > Thanks! 
This is a very timely and interesting discussion for me as well... > > Kevin > On Sep 23, 2016, at 4:35 PM, Sven Oehme > wrote: > your metadata block size these days should be 1 MB and there are > only very few workloads for which you should run with a filesystem > blocksize below 1 MB. so if you don't know exactly what to pick, 1 > MB is a good starting point. > the general rule still applies that your filesystem blocksize > (metadata or data pool) should match your raid controller (or GNR > vdisk) stripe size of the particular pool. > > so if you use a 128k strip size(defaut in many midrange storage > controllers) in a 8+2p raid array, your stripe or track size is 1 MB > and therefore the blocksize of this pool should be 1 MB. i see many > customers in the field using 1MB or even smaller blocksize on RAID > stripes of 2 MB or above and your performance will be significant > impacted by that. > > Sven > > ------------------------------------------ > Sven Oehme > Scalable Storage Research > email: oehmes at us.ibm.com > Phone: +1 (408) 824-8904 > IBM Almaden Research Lab > ------------------------------------------ > > Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too > pedantic, but I believe the the subblock size is 1/32 of the block > size (which strengt > > From: Stephen Ulmer > > To: gpfsug main discussion list > > Date: 09/23/2016 12:16 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Not to be too pedantic, but I believe the the subblock size is 1/32 > of the block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but > about metadata block size. You can specify that the system pool have > a different block size from the rest of the filesystem, providing > that it ONLY holds metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems > without some counter-indication), I would think that we?d want to > use a metadata block size of 4K*32=128K. This is independent of the > regular block size, which you can calculate based on the workload if > you?re lucky. > > There could be a great reason NOT to use 128K metadata block size, > but I don?t know what it is. I?d be happy to be corrected about this > if it?s out of whack. > > -- > Stephen > On Sep 22, 2016, at 3:37 PM, Luis Bolinches > wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small > files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest > that you will actually benefit from smaller than 1MB block, leave > there. GPFS uses sublocks where 1/16th of the BS can be allocated to > different files, so the "waste" is much less than you think on 1MB > and you get the throughput and less structures of much more data blocks. > > No warranty at all but I try to do this when the BS talk comes in: > (might need some clean up it could not be last note but you get the idea) > > POSIX > find . 
-type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to > inodes (WELL not exactly ... extended attributes! so maybe use a > lower number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and > lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful > machine to run the calculations on octave, I never hit anything > could not manage on a 64GB RAM Power box. Most of the times it is > enough with my laptop. > > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene > > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very small > > files in the inode. > > > > Typically you want your average file size to be your blocksize and your > > filesystem has one blocksize and one inodesize. > > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. > > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From Greg.Lehmann at csiro.au Wed Sep 28 23:54:36 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Wed, 28 Sep 2016 22:54:36 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org><13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org><6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> <4BDC0E5A-176A-48CC-8DE6-93C7B5A3F138@vanderbilt.edu> Message-ID: Are there any presentation available online that provide diagrams of the directory/file creation process and modifications in terms of how the blocks/inodes and indirect blocks etc are used. I would guess there are a few different cases that would need to be shown. This is the sort of thing that would great in a decent text book on GPFS (doesn't exist as far as I am aware.) Cheers, Greg From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Marc A Kaplan Sent: Thursday, 29 September 2016 1:23 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Blocksize OKAY, I'll say it again. inodes are PACKED into a single inode file. So a 4KB inode takes 4KB, REGARDLESS of metadata blocksize. There is no wasted space. (Of course if you have metadata replication = 2, then yes, double that. And yes, there overhead for indirect blocks (indices), allocation maps, etc, etc.) And your choice is not just 512 or 4096. Maybe 1KB or 2KB is a good choice for your data distribution, to optimize packing of data and/or directories into inodes... Hmmm... I don't know why the doc leaves out 2048, perhaps a typo... 
mmcrfs x2K -i 2048 [root at n2 charts]# mmlsfs x2K -i flag value description ------------------- ------------------------ ----------------------------------- -i 2048 Inode size in bytes Works for me! -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhejtman at ics.muni.cz Wed Sep 28 23:58:15 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Thu, 29 Sep 2016 00:58:15 +0200 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: References: <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz> <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> <20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> Message-ID: <20160928225815.rpzdzjevro37ur7b@ics.muni.cz> On Wed, Sep 28, 2016 at 10:25:01PM +0000, Andrew Beattie wrote: > In that scenario, would you not be better off using a native Spectrum > Scale client installed on the workstation that the video editor is using > with a local mapped drive, rather than a SMB share? > ? > This would prevent this the scenario you have proposed occurring. indeed, it would be better, but why one would have CES at all? I would like to use CES but it seems that it is not quite ready yet for such a scenario. -- Luk?? Hejtm?nek From christof.schmitt at us.ibm.com Thu Sep 29 00:06:59 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Wed, 28 Sep 2016 16:06:59 -0700 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz> References: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz><20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz> Message-ID: The exact behavior depends on the client and the application. I would suggest explicit testing of the protocol failover if that is a concern. Samba does not support persistent handles, so that would be a completely new feature. There is some support available for durable handles which have weaker guarantees, and which are also disabled in CES Samba due to known issues in large deployments. In cases where SMB protocol failover becomes an issue and durable handles might help, that might be an approach to improve the failover behavior. Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Lukas Hejtmanek To: gpfsug main discussion list Date: 09/28/2016 03:04 PM Subject: Re: [gpfsug-discuss] Samba via CES Sent by: gpfsug-discuss-bounces at spectrumscale.org On Wed, Sep 28, 2016 at 01:33:45PM -0700, Christof Schmitt wrote: > The client has to reconnect, open the file again and reissue request that > have not been completed. Without persistent handles, the main risk is that > another client can step in and access the same file in the meantime. With > persistent handles, access from other clients would be prevented for a > defined amount of time. well, I guess I cannot reconfigure the client so that reissuing request is done by OS and not rised up to the user? E.g., if user runs video encoding directly to Samba share and encoding runs for several hours, reissuing request, i.e., restart encoding, is not exactly what user accepts. -- Luk?? 
Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From abeattie at au1.ibm.com Thu Sep 29 00:37:25 2016 From: abeattie at au1.ibm.com (Andrew Beattie) Date: Wed, 28 Sep 2016 23:37:25 +0000 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: <20160928225815.rpzdzjevro37ur7b@ics.muni.cz> References: <20160928225815.rpzdzjevro37ur7b@ics.muni.cz>, <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz><20160927193815.kpppiy76wpudg6cj@ics.muni.cz><20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> Message-ID: An HTML attachment was scrubbed... URL: From aaron.knister at gmail.com Thu Sep 29 02:43:52 2016 From: aaron.knister at gmail.com (Aaron Knister) Date: Wed, 28 Sep 2016 21:43:52 -0400 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: <96282850-6bfa-73ae-8502-9e8df3a56390@nasa.gov> Message-ID: Thanks Everyone for your replies! (Quick disclaimer, these opinions are my own, and not those of my employer or NASA). Not knowing what's coming at the NDA session, it seems to boil down to "it ain't gonna happen" because of: - Perceived difficulty in supporting whatever creative hardware solutions customers may throw at it. I understand the support concerns, but I naively thought that assuming the hardware meets a basic set of requirements (e.g. redundant sas paths, x type of drives) it would be fairly supportable with GNR. The DS3700 shelves are re-branded NetApp E-series shelves and pretty vanilla I thought. - IBM would like to monetize the product and compete with the likes of DDN/Seagate This is admittedly a little disappointing. GPFS as long as I've known it has been largely hardware vendor agnostic. To see even a slight shift towards hardware vendor lockin and certain features only being supported and available on IBM hardware is concerning. It's not like the software itself is free. Perhaps GNR could be a paid add-on license for non-IBM hardware? Just thinking out-loud. The big things I was looking to GNR for are: - end-to-end checksums - implementing a software RAID layer on (in my case enterprise class) JBODs I can find a way to do the second thing, but the former I cannot. Requiring IBM hardware to get end-to-end checksums is a huge red flag for me. That's something Lustre will do today with ZFS on any hardware ZFS will run on (and for free, I might add). I would think GNR being openly available to customers would be important for GPFS to compete with Lustre. Furthermore, I had opened an RFE (#84523) a while back to implement checksumming of data for non-GNR environments. The RFE was declined because essentially it would be too hard and it already exists for GNR. Well, considering I don't have a GNR environment, and hardware vendor lock in is something many sites are not interested in, that's somewhat of a problem. I really hope IBM reconsiders their stance on opening up GNR. The current direction, while somewhat understandable, leaves a really bad taste in my mouth and is one of the (very few, in my opinion) features Lustre has over GPFS. -Aaron On 9/1/16 9:59 AM, Marc A Kaplan wrote: > I've been told that it is a big leap to go from supporting GSS and ESS > to allowing and supporting native raid for customers who may throw > together "any" combination of hardware they might choose. > > In particular the GNR "disk hospital" functions... 
> https://www.ibm.com/support/knowledgecenter/SSFKCN_3.5.0/com.ibm.cluster.gpfs.v3r5.gpfs200.doc/bl1adv_introdiskhospital.htm > will be tricky to support on umpteen different vendor boxes -- and keep > in mind, those will be from IBM competitors! > > That said, ESS and GSS show that IBM has some good tech in this area and > IBM has shown with the Spectrum Scale product (sans GNR) it can support > just about any semi-reasonable hardware configuration and a good slew of > OS versions and architectures... Heck I have a demo/test version of GPFS > running on a 5 year old Thinkpad laptop.... And we have some GSSs in the > lab... Not to mention Power hardware and mainframe System Z (think 360, > 370, 290, Z) > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > From oehmes at us.ibm.com Thu Sep 29 03:28:03 2016 From: oehmes at us.ibm.com (Sven Oehme) Date: Wed, 28 Sep 2016 19:28:03 -0700 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: <96282850-6bfa-73ae-8502-9e8df3a56390@nasa.gov> Message-ID: Hi Aaron, the best way to express this 'need' is to vote and leave comments in the RFE's : this is an RFE for GNR as SW : http://www.ibm.com/developerworks/rfe/execute?use_case=viewRfe&CR_ID=95090 everybody who wants this to be one should vote for it and leave comments on what they expect. Sven From: Aaron Knister To: gpfsug-discuss at spectrumscale.org Date: 09/28/2016 06:44 PM Subject: Re: [gpfsug-discuss] gpfs native raid Sent by: gpfsug-discuss-bounces at spectrumscale.org Thanks Everyone for your replies! (Quick disclaimer, these opinions are my own, and not those of my employer or NASA). Not knowing what's coming at the NDA session, it seems to boil down to "it ain't gonna happen" because of: - Perceived difficulty in supporting whatever creative hardware solutions customers may throw at it. I understand the support concerns, but I naively thought that assuming the hardware meets a basic set of requirements (e.g. redundant sas paths, x type of drives) it would be fairly supportable with GNR. The DS3700 shelves are re-branded NetApp E-series shelves and pretty vanilla I thought. - IBM would like to monetize the product and compete with the likes of DDN/Seagate This is admittedly a little disappointing. GPFS as long as I've known it has been largely hardware vendor agnostic. To see even a slight shift towards hardware vendor lockin and certain features only being supported and available on IBM hardware is concerning. It's not like the software itself is free. Perhaps GNR could be a paid add-on license for non-IBM hardware? Just thinking out-loud. The big things I was looking to GNR for are: - end-to-end checksums - implementing a software RAID layer on (in my case enterprise class) JBODs I can find a way to do the second thing, but the former I cannot. Requiring IBM hardware to get end-to-end checksums is a huge red flag for me. That's something Lustre will do today with ZFS on any hardware ZFS will run on (and for free, I might add). I would think GNR being openly available to customers would be important for GPFS to compete with Lustre. Furthermore, I had opened an RFE (#84523) a while back to implement checksumming of data for non-GNR environments. The RFE was declined because essentially it would be too hard and it already exists for GNR. 
Well, considering I don't have a GNR environment, and hardware vendor lock in is something many sites are not interested in, that's somewhat of a problem. I really hope IBM reconsiders their stance on opening up GNR. The current direction, while somewhat understandable, leaves a really bad taste in my mouth and is one of the (very few, in my opinion) features Lustre has over GPFS. -Aaron On 9/1/16 9:59 AM, Marc A Kaplan wrote: > I've been told that it is a big leap to go from supporting GSS and ESS > to allowing and supporting native raid for customers who may throw > together "any" combination of hardware they might choose. > > In particular the GNR "disk hospital" functions... > https://www.ibm.com/support/knowledgecenter/SSFKCN_3.5.0/com.ibm.cluster.gpfs.v3r5.gpfs200.doc/bl1adv_introdiskhospital.htm > will be tricky to support on umpteen different vendor boxes -- and keep > in mind, those will be from IBM competitors! > > That said, ESS and GSS show that IBM has some good tech in this area and > IBM has shown with the Spectrum Scale product (sans GNR) it can support > just about any semi-reasonable hardware configuration and a good slew of > OS versions and architectures... Heck I have a demo/test version of GPFS > running on a 5 year old Thinkpad laptop.... And we have some GSSs in the > lab... Not to mention Power hardware and mainframe System Z (think 360, > 370, 290, Z) > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From daniel.kidger at uk.ibm.com Thu Sep 29 10:04:03 2016 From: daniel.kidger at uk.ibm.com (Daniel Kidger) Date: Thu, 29 Sep 2016 09:04:03 +0000 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: Message-ID: An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT1-graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From daniel.kidger at uk.ibm.com Thu Sep 29 10:25:59 2016 From: daniel.kidger at uk.ibm.com (Daniel Kidger) Date: Thu, 29 Sep 2016 09:25:59 +0000 Subject: [gpfsug-discuss] AFM cacheset mounting from the same GPFS cluster ? Message-ID: An HTML attachment was scrubbed... URL: From Kevin.Buterbaugh at Vanderbilt.Edu Thu Sep 29 16:03:08 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Thu, 29 Sep 2016 15:03:08 +0000 Subject: [gpfsug-discuss] Fwd: Blocksize References: Message-ID: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> Resending from the right e-mail address... Begin forwarded message: From: gpfsug-discuss-owner at spectrumscale.org Subject: Re: [gpfsug-discuss] Blocksize Date: September 29, 2016 at 10:00:36 AM CDT To: klb at accre.vanderbilt.edu You are not allowed to post to this mailing list, and your message has been automatically rejected. If you think that your messages are being rejected in error, contact the mailing list owner at gpfsug-discuss-owner at spectrumscale.org. From: "Kevin L. 
Buterbaugh" > Subject: Re: [gpfsug-discuss] Blocksize Date: September 29, 2016 at 10:00:29 AM CDT To: gpfsug main discussion list > Hi Marc and others, I understand ? I guess I did a poor job of wording my question, so I?ll try again. The IBM recommendation for metadata block size seems to be somewhere between 256K - 1 MB, depending on who responds to the question. If I were to hypothetically use a 256K metadata block size, does the ?1/32nd of a block? come into play like it does for ?not metadata?? I.e. 256 / 32 = 8K, so am I reading / writing *2* inodes (assuming 4K inode size) minimum? And here?s a really off the wall question ? yesterday we were discussing the fact that there is now a single inode file. Historically, we have always used RAID 1 mirrors (first with spinning disk, as of last fall now on SSD) for metadata and then use GPFS replication on top of that. But given that there is a single inode file is that ?old way? of doing things still the right way? In other words, could we potentially be better off by using a couple of 8+2P RAID 6 LUNs? One potential downside of that would be that we would then only have two NSD servers serving up metadata, so we discussed the idea of taking each RAID 6 LUN and splitting it up into multiple logical volumes (all that done on the storage array, of course) and then presenting those to GPFS as NSDs??? Or have I gone from merely asking stupid questions to Trump-level craziness???? ;-) Kevin On Sep 28, 2016, at 10:23 AM, Marc A Kaplan > wrote: OKAY, I'll say it again. inodes are PACKED into a single inode file. So a 4KB inode takes 4KB, REGARDLESS of metadata blocksize. There is no wasted space. (Of course if you have metadata replication = 2, then yes, double that. And yes, there overhead for indirect blocks (indices), allocation maps, etc, etc.) And your choice is not just 512 or 4096. Maybe 1KB or 2KB is a good choice for your data distribution, to optimize packing of data and/or directories into inodes... Hmmm... I don't know why the doc leaves out 2048, perhaps a typo... mmcrfs x2K -i 2048 [root at n2 charts]# mmlsfs x2K -i flag value description ------------------- ------------------------ ----------------------------------- -i 2048 Inode size in bytes Works for me! _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Thu Sep 29 16:32:47 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Thu, 29 Sep 2016 11:32:47 -0400 Subject: [gpfsug-discuss] Fwd: Blocksize In-Reply-To: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> References: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> Message-ID: Frankly, I just don't "get" what it is you seem not to be "getting" - perhaps someone else who does "get" it can rephrase: FORGET about Subblocks when thinking about inodes being packed into the file of all inodes. Additional facts that may address some of the other concerns: I started working on GPFS at version 3.1 or so. AFAIK GPFS always had and has one file of inodes, "packed", with no wasted space between inodes. Period. Full Stop. RAID! Now we come to a mistake that I've seen made by more than a handful of customers! It is generally a mistake to use RAID with parity (such as classic RAID5) to store metadata. Why? 
Because metadata is often updated with "small writes" - for example suppose we have to update some fields in an inode, or an indirect block, or append a log record... For RAID with parity and large stripe sizes -- this means that updating just one disk sector can cost a full stripe read + writing the changed data and parity sectors. SO, if you want protection against storage failures for your metadata, use either RAID mirroring/replication and/or GPFS metadata replication. (belt and/or suspenders) (Arguments against relying solely on RAID mirroring: single enclosure/box failure (fire!), single hardware design (bugs or defects), single firmware/microcode(bugs.)) Yes, GPFS is part of "the cyber." We're making it stronger everyday. But it already is great. --marc From: "Buterbaugh, Kevin L" To: gpfsug main discussion list Date: 09/29/2016 11:03 AM Subject: [gpfsug-discuss] Fwd: Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Resending from the right e-mail address... Begin forwarded message: From: gpfsug-discuss-owner at spectrumscale.org Subject: Re: [gpfsug-discuss] Blocksize Date: September 29, 2016 at 10:00:36 AM CDT To: klb at accre.vanderbilt.edu You are not allowed to post to this mailing list, and your message has been automatically rejected. If you think that your messages are being rejected in error, contact the mailing list owner at gpfsug-discuss-owner at spectrumscale.org. From: "Kevin L. Buterbaugh" Subject: Re: [gpfsug-discuss] Blocksize Date: September 29, 2016 at 10:00:29 AM CDT To: gpfsug main discussion list Hi Marc and others, I understand ? I guess I did a poor job of wording my question, so I?ll try again. The IBM recommendation for metadata block size seems to be somewhere between 256K - 1 MB, depending on who responds to the question. If I were to hypothetically use a 256K metadata block size, does the ?1/32nd of a block? come into play like it does for ?not metadata?? I.e. 256 / 32 = 8K, so am I reading / writing *2* inodes (assuming 4K inode size) minimum? And here?s a really off the wall question ? yesterday we were discussing the fact that there is now a single inode file. Historically, we have always used RAID 1 mirrors (first with spinning disk, as of last fall now on SSD) for metadata and then use GPFS replication on top of that. But given that there is a single inode file is that ?old way? of doing things still the right way? In other words, could we potentially be better off by using a couple of 8+2P RAID 6 LUNs? One potential downside of that would be that we would then only have two NSD servers serving up metadata, so we discussed the idea of taking each RAID 6 LUN and splitting it up into multiple logical volumes (all that done on the storage array, of course) and then presenting those to GPFS as NSDs??? Or have I gone from merely asking stupid questions to Trump-level craziness???? ;-) Kevin On Sep 28, 2016, at 10:23 AM, Marc A Kaplan wrote: OKAY, I'll say it again. inodes are PACKED into a single inode file. So a 4KB inode takes 4KB, REGARDLESS of metadata blocksize. There is no wasted space. (Of course if you have metadata replication = 2, then yes, double that. And yes, there overhead for indirect blocks (indices), allocation maps, etc, etc.) And your choice is not just 512 or 4096. Maybe 1KB or 2KB is a good choice for your data distribution, to optimize packing of data and/or directories into inodes... Hmmm... I don't know why the doc leaves out 2048, perhaps a typo... 
mmcrfs x2K -i 2048 [root at n2 charts]# mmlsfs x2K -i flag value description ------------------- ------------------------ ----------------------------------- -i 2048 Inode size in bytes Works for me! _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From olaf.weiser at de.ibm.com Thu Sep 29 16:38:56 2016 From: olaf.weiser at de.ibm.com (Olaf Weiser) Date: Thu, 29 Sep 2016 17:38:56 +0200 Subject: [gpfsug-discuss] Fwd: Blocksize In-Reply-To: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> References: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> Message-ID: An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Thu Sep 29 19:00:40 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Thu, 29 Sep 2016 11:00:40 -0700 Subject: [gpfsug-discuss] Fwd: Blocksize In-Reply-To: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> References: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> Message-ID: > to the question. If I were to hypothetically use a 256K metadata > block size, does the ?1/32nd of a block? come into play like it does > for ?not metadata?? I.e. 256 / 32 = 8K, so am I reading / writing > *2* inodes (assuming 4K inode size) minimum? I think the point of confusion here is minimum allocation size vs minimum IO size -- those two are not one and the same. In fact in GPFS those are largely unrelated values. For low-level metadata files where multiple records are packed into the same block, it is possible to read/write either an individual record (such as an inode), or an entire block of records (which is what happens, for example, during inode copy-on-write). The minimum IO size in GPFS is 512 bytes. On a "4K-aligned" file system, GPFS vows to only do IOs in multiples of 4KiB. For data, GPFS tracks what portion of a given block is valid/dirty using an in-memory bitmap, and if 4K in the middle of a 16M block are modified, only 4K get written, not 16M (although this is more complicated for sparse file writes and appends, when some areas need to be zeroed out). For metadata writes, entire metadata objects are written, using the actual object size, rounded up to the nearest 512B or 4K boundary, as needed. So a single modified inode results in a single inode write, regardless of the metadata block size. If you have snapshots, and the inode being modified needs to be copied to the previous snapshot, and happens to be the first inode in the block that needs a COW, an entire block of inodes is copied to the latest snapshot, as an optimization. > And here?s a really off the wall question ? yesterday we were > discussing the fact that there is now a single inode file. > Historically, we have always used RAID 1 mirrors (first with > spinning disk, as of last fall now on SSD) for metadata and then use > GPFS replication on top of that. But given that there is a single > inode file is that ?old way? of doing things still the right way? > In other words, could we potentially be better off by using a couple > of 8+2P RAID 6 LUNs? The old way is also the modern way in this case. Using RAID1 LUNs for GPFS metadata is still the right approach. 
You don't want to use RAID erasure codes that trigger read-modify-write for small IOs, which are typical for metadata (unless your RAID array has so much cache as to make RMW a moot point). > One potential downside of that would be that we would then only have > two NSD servers serving up metadata, so we discussed the idea of > taking each RAID 6 LUN and splitting it up into multiple logical > volumes (all that done on the storage array, of course) and then > presenting those to GPFS as NSDs??? Like most performance questions, this one can ultimately only be answered definitively by running tests, but offhand I would suspect that the performance impact of RAID6, combined with extra contention for physical disks, is going to more than offset the benefits of using more NSD servers. Keep in mind that you aren't limited to 2 NSD servers per LUN. If you actually have the connectivity for more than 2 nodes on your RAID controller, GPFS allows up to 8 simultaneously active NSD servers per NSD. yuri > On Sep 28, 2016, at 10:23 AM, Marc A Kaplan wrote: > > OKAY, I'll say it again. inodes are PACKED into a single inode > file. So a 4KB inode takes 4KB, REGARDLESS of metadata blocksize. > There is no wasted space. > > (Of course if you have metadata replication = 2, then yes, double > that. And yes, there overhead for indirect blocks (indices), > allocation maps, etc, etc.) > > And your choice is not just 512 or 4096. Maybe 1KB or 2KB is a good > choice for your data distribution, to optimize packing of data and/ > or directories into inodes... > > Hmmm... I don't know why the doc leaves out 2048, perhaps a typo... > > mmcrfs x2K -i 2048 > > [root at n2 charts]# mmlsfs x2K -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 2048 Inode size in bytes > > Works for me! > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Fri Sep 30 06:43:53 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Thu, 29 Sep 2016 22:43:53 -0700 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: <96282850-6bfa-73ae-8502-9e8df3a56390@nasa.gov> Message-ID: The issue of "GNR as software" is a pretty convoluted mixture of technical, business, and resource constraints issues. While some of the technical issues can be discussed here, obviously the other considerations cannot be discussed in a public forum. So you won't be able to get a complete understanding of the situation by discussing it here. > I understand the support concerns, but I naively thought that assuming > the hardware meets a basic set of requirements (e.g. redundant sas > paths, x type of drives) it would be fairly supportable with GNR. The > DS3700 shelves are re-branded NetApp E-series shelves and pretty vanilla > I thought. Setting business issues aside, this is more complicated on the technical level than one may think. At present, GNR requires a set of twin-tailed external disk enclosures. This is not a particularly exotic kind of hardware, but it turns out that this corner of the storage world is quite insular. 
GNR has a very close relationship with physical disk devices, much more so than regular GPFS. In an ideal world, SCSI and SES standards are supposed to provide a framework which would allow software like GNR to operate on an arbitrary disk enclosure. In the real world, the actual SES implementations on various enclosures that we've been dealing with are, well, peculiar. Apparently SES is one of those standards where vendors feel a lot of freedom in "re-interpreting" the standard, and since typically enclosures talk to a small set of RAID controllers, there aren't bad enough consequences to force vendors to be religious about SES standard compliance. Furthermore, the SAS fabric topology in configurations with an external disk enclosures is surprisingly complex, and that complexity predictably leads to complex failures which don't exist in simpler configurations. Thus far, every single one of the five enclosures we've had a chance to run GNR on required some adjustments, workarounds, hacks, etc. And the consequences of a misbehaving SAS fabric can be quite dire. There are various approaches to dealing with those complications, from running a massive 3rd party hardware qualification program to basically declaring any complications from an unknown enclosure to be someone else's problem (how would ZFS deal with a SCSI reset storm due to a bad SAS expander?), but there's much debate on what is the right path to take. Customer input/feedback is obviously very valuable in tilting such discussions in the right direction. yuri From: Aaron Knister To: gpfsug-discuss at spectrumscale.org, Date: 09/28/2016 06:44 PM Subject: Re: [gpfsug-discuss] gpfs native raid Sent by: gpfsug-discuss-bounces at spectrumscale.org Thanks Everyone for your replies! (Quick disclaimer, these opinions are my own, and not those of my employer or NASA). Not knowing what's coming at the NDA session, it seems to boil down to "it ain't gonna happen" because of: - Perceived difficulty in supporting whatever creative hardware solutions customers may throw at it. I understand the support concerns, but I naively thought that assuming the hardware meets a basic set of requirements (e.g. redundant sas paths, x type of drives) it would be fairly supportable with GNR. The DS3700 shelves are re-branded NetApp E-series shelves and pretty vanilla I thought. - IBM would like to monetize the product and compete with the likes of DDN/Seagate This is admittedly a little disappointing. GPFS as long as I've known it has been largely hardware vendor agnostic. To see even a slight shift towards hardware vendor lockin and certain features only being supported and available on IBM hardware is concerning. It's not like the software itself is free. Perhaps GNR could be a paid add-on license for non-IBM hardware? Just thinking out-loud. The big things I was looking to GNR for are: - end-to-end checksums - implementing a software RAID layer on (in my case enterprise class) JBODs I can find a way to do the second thing, but the former I cannot. Requiring IBM hardware to get end-to-end checksums is a huge red flag for me. That's something Lustre will do today with ZFS on any hardware ZFS will run on (and for free, I might add). I would think GNR being openly available to customers would be important for GPFS to compete with Lustre. Furthermore, I had opened an RFE (#84523) a while back to implement checksumming of data for non-GNR environments. The RFE was declined because essentially it would be too hard and it already exists for GNR. 
Well, considering I don't have a GNR environment, and hardware vendor lock in is something many sites are not interested in, that's somewhat of a problem. I really hope IBM reconsiders their stance on opening up GNR. The current direction, while somewhat understandable, leaves a really bad taste in my mouth and is one of the (very few, in my opinion) features Lustre has over GPFS. -Aaron On 9/1/16 9:59 AM, Marc A Kaplan wrote: > I've been told that it is a big leap to go from supporting GSS and ESS > to allowing and supporting native raid for customers who may throw > together "any" combination of hardware they might choose. > > In particular the GNR "disk hospital" functions... > https://www.ibm.com/support/knowledgecenter/SSFKCN_3.5.0/com.ibm.cluster.gpfs.v3r5.gpfs200.doc/bl1adv_introdiskhospital.htm > will be tricky to support on umpteen different vendor boxes -- and keep > in mind, those will be from IBM competitors! > > That said, ESS and GSS show that IBM has some good tech in this area and > IBM has shown with the Spectrum Scale product (sans GNR) it can support > just about any semi-reasonable hardware configuration and a good slew of > OS versions and architectures... Heck I have a demo/test version of GPFS > running on a 5 year old Thinkpad laptop.... And we have some GSSs in the > lab... Not to mention Power hardware and mainframe System Z (think 360, > 370, 290, Z) > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From stef.coene at docum.org Fri Sep 30 14:03:01 2016 From: stef.coene at docum.org (Stef Coene) Date: Fri, 30 Sep 2016 15:03:01 +0200 Subject: [gpfsug-discuss] Toolkit Message-ID: Hi, When using the toolkit, all config data is stored in clusterdefinition.txt When you modify the cluster with mm* commands, the toolkit is unaware of these changes. Is it possible to recreate the clusterdefinition.txt based on the current configuration? Stef From matthew at ellexus.com Fri Sep 30 16:30:11 2016 From: matthew at ellexus.com (Matthew Harris) Date: Fri, 30 Sep 2016 16:30:11 +0100 Subject: [gpfsug-discuss] Introduction from Ellexus Message-ID: Hello everyone, Ellexus is the IO profiling company with tools for load balancing shared storage, solving IO performance issues and detecting rogue jobs that have bad IO patterns. We have a good number of customers who use Spectrum Scale so we do a lot of work to support it. We have clients and partners working across the HPC space including semiconductor, life sciences, oil and gas, automotive and finance. We're based in Cambridge, England. Some of you will have already met our CEO, Rosemary Francis. Looking forward to meeting you at SC16. Matthew Harris Account Manager & Business Development - Ellexus Ltd *www.ellexus.com * *Ellexus Ltd is a limited company registered in England & Wales* *Company registration no. 
07166034* *Registered address: 198 High Street, Tonbridge, Kent TN9 1BE, UK* *Operating address: St John's Innovation Centre, Cowley Road, Cambridge CB4 0WS, UK* -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Fri Sep 30 21:56:29 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Fri, 30 Sep 2016 16:56:29 -0400 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: <96282850-6bfa-73ae-8502-9e8df3a56390@nasa.gov> Message-ID: <2f59d32a-fc0f-3f03-dd95-3465611dc841@nasa.gov> Thanks, Yuri. Your replies are always quite enjoyable to read. I didn't realize SES was such a loosely interpreted standard, I just assumed it was fairly straightforward. We've got a number of JBODs here we manage via SES using the linux enclosure module (e.g. /sys/class/enclosure) and they seem to "just work" but we're not doing anything terribly advanced, mostly just turning on/off various status LEDs. I should clarify, the newer SAS enclosures I've encountered seem quite good, some of the older enclosures (e.g. in particular the Xyratex enclosure used by DDN in it's S2A units) were a real treat to interact with and didn't seem to follow the SES standard in spirit. I can certainly accept the complexity argument here. I think for our purposes a "reasonable level" of support would be all we're after. I'm not sure how ZFS would deal with a SCSI reset storm, I suspect the pool would just offline itself if enough paths seemed to disappear or timeout. If I could make GPFS work well with ZFS as the underlying storage target I would be quite happy. So far I have struggled to make it performant. GPFS seems to assume once a block device accepts the write that it's committed to stable storage. With ZFS ZVOL's this isn't the case by default. Making it the case (setting the sync=always paremter) causes a *massive* degradation in performance. If GPFS were to issue sync commands at appropriate intervals then I think we could make this work well. I'm not sure how to go about this, though, and given frequent enough scsi sync commands to a given lun its performance would likely degrade to the current state of zfs with sync=always. At any rate, we'll see how things go. Thanks again. -Aaron On 9/30/16 1:43 AM, Yuri L Volobuev wrote: > The issue of "GNR as software" is a pretty convoluted mixture of > technical, business, and resource constraints issues. While some of the > technical issues can be discussed here, obviously the other > considerations cannot be discussed in a public forum. So you won't be > able to get a complete understanding of the situation by discussing it here. > >> I understand the support concerns, but I naively thought that assuming >> the hardware meets a basic set of requirements (e.g. redundant sas >> paths, x type of drives) it would be fairly supportable with GNR. The >> DS3700 shelves are re-branded NetApp E-series shelves and pretty vanilla >> I thought. > > Setting business issues aside, this is more complicated on the technical > level than one may think. At present, GNR requires a set of twin-tailed > external disk enclosures. This is not a particularly exotic kind of > hardware, but it turns out that this corner of the storage world is > quite insular. GNR has a very close relationship with physical disk > devices, much more so than regular GPFS. In an ideal world, SCSI and > SES standards are supposed to provide a framework which would allow > software like GNR to operate on an arbitrary disk enclosure. 
In the > real world, the actual SES implementations on various enclosures that > we've been dealing with are, well, peculiar. Apparently SES is one of > those standards where vendors feel a lot of freedom in "re-interpreting" > the standard, and since typically enclosures talk to a small set of RAID > controllers, there aren't bad enough consequences to force vendors to be > religious about SES standard compliance. Furthermore, the SAS fabric > topology in configurations with an external disk enclosures is > surprisingly complex, and that complexity predictably leads to complex > failures which don't exist in simpler configurations. Thus far, every > single one of the five enclosures we've had a chance to run GNR on > required some adjustments, workarounds, hacks, etc. And the > consequences of a misbehaving SAS fabric can be quite dire. There are > various approaches to dealing with those complications, from running a > massive 3rd party hardware qualification program to basically declaring > any complications from an unknown enclosure to be someone else's problem > (how would ZFS deal with a SCSI reset storm due to a bad SAS expander?), > but there's much debate on what is the right path to take. Customer > input/feedback is obviously very valuable in tilting such discussions in > the right direction. > > yuri > > Inactive hide details for Aaron Knister ---09/28/2016 06:44:23 > PM---Thanks Everyone for your replies! (Quick disclaimer, these Aaron > Knister ---09/28/2016 06:44:23 PM---Thanks Everyone for your replies! > (Quick disclaimer, these opinions are my own, and not those of my > > From: Aaron Knister > To: gpfsug-discuss at spectrumscale.org, > Date: 09/28/2016 06:44 PM > Subject: Re: [gpfsug-discuss] gpfs native raid > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------------------------------------------------ > > > > Thanks Everyone for your replies! (Quick disclaimer, these opinions are > my own, and not those of my employer or NASA). > > Not knowing what's coming at the NDA session, it seems to boil down to > "it ain't gonna happen" because of: > > - Perceived difficulty in supporting whatever creative hardware > solutions customers may throw at it. > > I understand the support concerns, but I naively thought that assuming > the hardware meets a basic set of requirements (e.g. redundant sas > paths, x type of drives) it would be fairly supportable with GNR. The > DS3700 shelves are re-branded NetApp E-series shelves and pretty vanilla > I thought. > > - IBM would like to monetize the product and compete with the likes of > DDN/Seagate > > This is admittedly a little disappointing. GPFS as long as I've known it > has been largely hardware vendor agnostic. To see even a slight shift > towards hardware vendor lockin and certain features only being supported > and available on IBM hardware is concerning. It's not like the software > itself is free. Perhaps GNR could be a paid add-on license for non-IBM > hardware? Just thinking out-loud. > > The big things I was looking to GNR for are: > > - end-to-end checksums > - implementing a software RAID layer on (in my case enterprise class) JBODs > > I can find a way to do the second thing, but the former I cannot. > Requiring IBM hardware to get end-to-end checksums is a huge red flag > for me. That's something Lustre will do today with ZFS on any hardware > ZFS will run on (and for free, I might add). 
I would think GNR being > openly available to customers would be important for GPFS to compete > with Lustre. Furthermore, I had opened an RFE (#84523) a while back to > implement checksumming of data for non-GNR environments. The RFE was > declined because essentially it would be too hard and it already exists > for GNR. Well, considering I don't have a GNR environment, and hardware > vendor lock in is something many sites are not interested in, that's > somewhat of a problem. > > I really hope IBM reconsiders their stance on opening up GNR. The > current direction, while somewhat understandable, leaves a really bad > taste in my mouth and is one of the (very few, in my opinion) features > Lustre has over GPFS. > > -Aaron > > > On 9/1/16 9:59 AM, Marc A Kaplan wrote: >> I've been told that it is a big leap to go from supporting GSS and ESS >> to allowing and supporting native raid for customers who may throw >> together "any" combination of hardware they might choose. >> >> In particular the GNR "disk hospital" functions... >> https://www.ibm.com/support/knowledgecenter/SSFKCN_3.5.0/com.ibm.cluster.gpfs.v3r5.gpfs200.doc/bl1adv_introdiskhospital.htm >> will be tricky to support on umpteen different vendor boxes -- and keep >> in mind, those will be from IBM competitors! >> >> That said, ESS and GSS show that IBM has some good tech in this area and >> IBM has shown with the Spectrum Scale product (sans GNR) it can support >> just about any semi-reasonable hardware configuration and a good slew of >> OS versions and architectures... Heck I have a demo/test version of GPFS >> running on a 5 year old Thinkpad laptop.... And we have some GSSs in the >> lab... Not to mention Power hardware and mainframe System Z (think 360, >> 370, 290, Z) >> >> >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From makaplan at us.ibm.com Thu Sep 1 00:40:13 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Wed, 31 Aug 2016 19:40:13 -0400 Subject: [gpfsug-discuss] Data Replication In-Reply-To: References: Message-ID: You can leave out the WHERE ... AND POOL_NAME LIKE 'deep' - that is redundant with the FROM POOL 'deep' clause. In fact at a slight additional overhead in mmapplypolicy processing due to begin checked a little later in the game, you can leave out MISC_ATTRIBUTES NOT LIKE '%2%' since the code is smart enough to not operate on files already marked as replicate(2). I believe mmapplypolicy .... -I yes means do any necessary data movement and/or replication "now" Alternatively you can say -I defer, which will leave the files "ill-replicated" and then fix them up with mmrestripefs later. The -I yes vs -I defer choice is the same as for mmchattr. Think of mmapplypolicy as a fast, parallel way to do find ... | xargs mmchattr ... 
I've tried GPFS on zvols a couple times and the write throughput I get is terrible because of the required sync=always parameter. Perhaps a couple of SSD's could help get the number up, though. -Aaron On 8/30/16 12:47 PM, Christopher Maestas wrote: > Interestingly enough, Spectrum Scale can run on zvols. Check out: > > http://files.gpfsug.org/presentations/2016/anl-june/LANL_GPFS_ZFS.pdf > > -cdm > > ------------------------------------------------------------------------ > On Aug 30, 2016, 9:17:05 AM, aaron.s.knister at nasa.gov wrote: > > From: aaron.s.knister at nasa.gov > To: gpfsug-discuss at spectrumscale.org > Cc: > Date: Aug 30, 2016 9:17:05 AM > Subject: [gpfsug-discuss] gpfs native raid > > Does anyone know if/when we might see gpfs native raid opened up for the > masses on non-IBM hardware? It's hard to answer the question of "why > can't GPFS do this? Lustre can" in regards to Lustre's integration with > ZFS and support for RAID on commodity hardware. > -Aaron > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discussUnless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU -------------- next part -------------- An HTML attachment was scrubbed... URL: From daniel.kidger at uk.ibm.com Thu Sep 1 12:22:47 2016 From: daniel.kidger at uk.ibm.com (Daniel Kidger) Date: Thu, 1 Sep 2016 11:22:47 +0000 Subject: [gpfsug-discuss] Maximum value for data replication? In-Reply-To: References: , Message-ID: An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.1__=0ABB0AB3DFD67DBA8f9e8a93df938 at us.ibm.com.gif Type: image/gif Size: 105 bytes Desc: not available URL: From bauer at cesnet.cz Thu Sep 1 14:30:23 2016 From: bauer at cesnet.cz (Miroslav Bauer) Date: Thu, 1 Sep 2016 15:30:23 +0200 Subject: [gpfsug-discuss] Migration to separate metadata and data disks Message-ID: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> Hello, I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the filesystem metadata from state: -m = 2 (default metadata replicas) - SATA disks (dataAndMetadata, failGroup=1) - SSDs (metadataOnly, failGroup=3) to the desired state: -m = 1 - SATA disks (dataOnly, failGroup=1) - SSDs (metadataOnly, failGroup=3) I have done the following steps in the following order: 1) change SATA disks to dataOnly (stanza file modifies the 'usage' attribute only): # mmchdisk fs1 change -F dataOnly_disks.stanza Attention: Disk parameters were changed. Use the mmrestripefs command with the -r option to relocate data and metadata. Verifying file system configuration information ... mmchdisk: Propagating the cluster configuration data to all affected nodes. This is an asynchronous process. 
2) change default metadata replicas number 2->1 # mmchfs fs1 -m 1 3) run mmrestripefs as suggested by output of 1) # mmrestripefs fs1 -r Scanning file system metadata, phase 1 ... Error processing inodes. No space left on device mmrestripefs: Command failed. Examine previous error messages to determine cause. It is, however, still possible to create new files on the filesystem. When I return one of the SATA disks as a dataAndMetadata disk, the mmrestripefs command stops complaining about No space left on device. Both df and mmdf say that there is enough space both for data (SATA) and metadata (SSDs). Does anyone have an idea why is it complaining? Thanks, -- Miroslav Bauer -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 3716 bytes Desc: S/MIME Cryptographic Signature URL: From aaron.s.knister at nasa.gov Thu Sep 1 14:36:32 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Thu, 1 Sep 2016 09:36:32 -0400 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> Message-ID: I must admit, I'm curious as to the reason you're dropping the replication factor from 2 down to 1. There are some serious advantages we've seen to having multiple metadata replicas, as far as error recovery is concerned. Could you paste an output of mmlsdisk for the filesystem? -Aaron On 9/1/16 9:30 AM, Miroslav Bauer wrote: > Hello, > > I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the > filesystem metadata from state: > -m = 2 (default metadata replicas) > - SATA disks (dataAndMetadata, failGroup=1) > - SSDs (metadataOnly, failGroup=3) > to the desired state: > -m = 1 > - SATA disks (dataOnly, failGroup=1) > - SSDs (metadataOnly, failGroup=3) > > I have done the following steps in the following order: > 1) change SATA disks to dataOnly (stanza file modifies the 'usage' > attribute only): > # mmchdisk fs1 change -F dataOnly_disks.stanza > Attention: Disk parameters were changed. > Use the mmrestripefs command with the -r option to relocate data and > metadata. > Verifying file system configuration information ... > mmchdisk: Propagating the cluster configuration data to all > affected nodes. This is an asynchronous process. > > 2) change default metadata replicas number 2->1 > # mmchfs fs1 -m 1 > > 3) run mmrestripefs as suggested by output of 1) > # mmrestripefs fs1 -r > Scanning file system metadata, phase 1 ... > Error processing inodes. > No space left on device > mmrestripefs: Command failed. Examine previous error messages to > determine cause. > > It is, however, still possible to create new files on the filesystem. > When I return one of the SATA disks as a dataAndMetadata disk, the > mmrestripefs > command stops complaining about No space left on device. Both df and mmdf > say that there is enough space both for data (SATA) and metadata (SSDs). > Does anyone have an idea why is it complaining? 
> > Thanks, > > -- > Miroslav Bauer > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From aaron.s.knister at nasa.gov Thu Sep 1 14:39:17 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Thu, 1 Sep 2016 09:39:17 -0400 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> Message-ID: By the way, I suspect the no space on device errors are because GPFS believes for some reason that it is unable to maintain the metadata replication factor of 2 that's likely set on all previously created inodes. On 9/1/16 9:36 AM, Aaron Knister wrote: > I must admit, I'm curious as to the reason you're dropping the > replication factor from 2 down to 1. There are some serious advantages > we've seen to having multiple metadata replicas, as far as error > recovery is concerned. > > Could you paste an output of mmlsdisk for the filesystem? > > -Aaron > > On 9/1/16 9:30 AM, Miroslav Bauer wrote: >> Hello, >> >> I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the >> filesystem metadata from state: >> -m = 2 (default metadata replicas) >> - SATA disks (dataAndMetadata, failGroup=1) >> - SSDs (metadataOnly, failGroup=3) >> to the desired state: >> -m = 1 >> - SATA disks (dataOnly, failGroup=1) >> - SSDs (metadataOnly, failGroup=3) >> >> I have done the following steps in the following order: >> 1) change SATA disks to dataOnly (stanza file modifies the 'usage' >> attribute only): >> # mmchdisk fs1 change -F dataOnly_disks.stanza >> Attention: Disk parameters were changed. >> Use the mmrestripefs command with the -r option to relocate data and >> metadata. >> Verifying file system configuration information ... >> mmchdisk: Propagating the cluster configuration data to all >> affected nodes. This is an asynchronous process. >> >> 2) change default metadata replicas number 2->1 >> # mmchfs fs1 -m 1 >> >> 3) run mmrestripefs as suggested by output of 1) >> # mmrestripefs fs1 -r >> Scanning file system metadata, phase 1 ... >> Error processing inodes. >> No space left on device >> mmrestripefs: Command failed. Examine previous error messages to >> determine cause. >> >> It is, however, still possible to create new files on the filesystem. >> When I return one of the SATA disks as a dataAndMetadata disk, the >> mmrestripefs >> command stops complaining about No space left on device. Both df and mmdf >> say that there is enough space both for data (SATA) and metadata (SSDs). >> Does anyone have an idea why is it complaining? 
>> >> Thanks, >> >> -- >> Miroslav Bauer >> >> >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From jonathan at buzzard.me.uk Thu Sep 1 14:49:11 2016 From: jonathan at buzzard.me.uk (Jonathan Buzzard) Date: Thu, 01 Sep 2016 14:49:11 +0100 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> Message-ID: <1472737751.25479.22.camel@buzzard.phy.strath.ac.uk> On Thu, 2016-09-01 at 09:39 -0400, Aaron Knister wrote: > By the way, I suspect the no space on device errors are because GPFS > believes for some reason that it is unable to maintain the metadata > replication factor of 2 that's likely set on all previously created inodes. > Hazarding a guess, but there is only one SSD NSD, so if all the metadata is going to go on SSD there is no point in replicating. It would also explain why it would believe it can't maintain the metadata replication factor. Though it could just be a simple metadata size is larger than the available SSD size. JAB. -- Jonathan A. Buzzard Email: jonathan (at) buzzard.me.uk Fife, United Kingdom. From makaplan at us.ibm.com Thu Sep 1 14:59:28 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Thu, 1 Sep 2016 09:59:28 -0400 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: <96282850-6bfa-73ae-8502-9e8df3a56390@nasa.gov> Message-ID: I've been told that it is a big leap to go from supporting GSS and ESS to allowing and supporting native raid for customers who may throw together "any" combination of hardware they might choose. In particular the GNR "disk hospital" functions... https://www.ibm.com/support/knowledgecenter/SSFKCN_3.5.0/com.ibm.cluster.gpfs.v3r5.gpfs200.doc/bl1adv_introdiskhospital.htm will be tricky to support on umpteen different vendor boxes -- and keep in mind, those will be from IBM competitors! That said, ESS and GSS show that IBM has some good tech in this area and IBM has shown with the Spectrum Scale product (sans GNR) it can support just about any semi-reasonable hardware configuration and a good slew of OS versions and architectures... Heck I have a demo/test version of GPFS running on a 5 year old Thinkpad laptop.... And we have some GSSs in the lab... Not to mention Power hardware and mainframe System Z (think 360, 370, 290, Z) -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Thu Sep 1 15:02:50 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Thu, 1 Sep 2016 10:02:50 -0400 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> Message-ID: <2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> Oh! I think you've already provided the info I was looking for :) I thought that failGroup=3 meant there were 3 failure groups within the SSDs. I suspect that's not at all what you meant and that actually is the failure group of all of those disks. That I think explains what's going on-- there's only one failure group's worth of metadata-capable disks available and as such GPFS can't place the 2nd replica for existing files. 
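For illustration only (the NSD names below are invented and the right split depends on the actual SSD hardware), the suggestions that follow would translate into a stanza-file change of roughly this shape, reusing the same mmchdisk change -F mechanism already used above to flip the usage attribute:

%nsd: nsd=ssd_nsd_01 usage=metadataOnly failureGroup=3
%nsd: nsd=ssd_nsd_02 usage=metadataOnly failureGroup=4

# mmchdisk fs1 change -F ssd_failuregroups.stanza
# mmchfs fs1 -m 2
# mmrestripefs fs1 -R

Half of the SSD NSDs keep failure group 3 and the other half move to failure group 4, which gives the metadata placement two independent failure groups to put the second replica in.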
Here's what I would suggest: - Create at least 2 failure groups within the SSDs - Put the default metadata replication factor back to 2 - Run a restripefs -R to shuffle files around and restore the metadata replication factor of 2 to any files created while it was set to 1 If you're not interested in replication for metadata then perhaps all you need to do is the mmrestripefs -R. I think that should un-replicate the file from the SATA disks leaving the copy on the SSDs. Hope that helps. -Aaron On 9/1/16 9:39 AM, Aaron Knister wrote: > By the way, I suspect the no space on device errors are because GPFS > believes for some reason that it is unable to maintain the metadata > replication factor of 2 that's likely set on all previously created inodes. > > On 9/1/16 9:36 AM, Aaron Knister wrote: >> I must admit, I'm curious as to the reason you're dropping the >> replication factor from 2 down to 1. There are some serious advantages >> we've seen to having multiple metadata replicas, as far as error >> recovery is concerned. >> >> Could you paste an output of mmlsdisk for the filesystem? >> >> -Aaron >> >> On 9/1/16 9:30 AM, Miroslav Bauer wrote: >>> Hello, >>> >>> I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the >>> filesystem metadata from state: >>> -m = 2 (default metadata replicas) >>> - SATA disks (dataAndMetadata, failGroup=1) >>> - SSDs (metadataOnly, failGroup=3) >>> to the desired state: >>> -m = 1 >>> - SATA disks (dataOnly, failGroup=1) >>> - SSDs (metadataOnly, failGroup=3) >>> >>> I have done the following steps in the following order: >>> 1) change SATA disks to dataOnly (stanza file modifies the 'usage' >>> attribute only): >>> # mmchdisk fs1 change -F dataOnly_disks.stanza >>> Attention: Disk parameters were changed. >>> Use the mmrestripefs command with the -r option to relocate data and >>> metadata. >>> Verifying file system configuration information ... >>> mmchdisk: Propagating the cluster configuration data to all >>> affected nodes. This is an asynchronous process. >>> >>> 2) change default metadata replicas number 2->1 >>> # mmchfs fs1 -m 1 >>> >>> 3) run mmrestripefs as suggested by output of 1) >>> # mmrestripefs fs1 -r >>> Scanning file system metadata, phase 1 ... >>> Error processing inodes. >>> No space left on device >>> mmrestripefs: Command failed. Examine previous error messages to >>> determine cause. >>> >>> It is, however, still possible to create new files on the filesystem. >>> When I return one of the SATA disks as a dataAndMetadata disk, the >>> mmrestripefs >>> command stops complaining about No space left on device. Both df and >>> mmdf >>> say that there is enough space both for data (SATA) and metadata (SSDs). >>> Does anyone have an idea why is it complaining? >>> >>> Thanks, >>> >>> -- >>> Miroslav Bauer >>> >>> >>> >>> >>> _______________________________________________ >>> gpfsug-discuss mailing list >>> gpfsug-discuss at spectrumscale.org >>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>> >> > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From makaplan at us.ibm.com Thu Sep 1 15:14:18 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Thu, 1 Sep 2016 10:14:18 -0400 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: <2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> <2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> Message-ID: I believe the OP left out a step. 
I am not saying this is a good idea, but ... One must change the replication factors marked in each inode for each file... This could be done using an mmapplypolicy rule: RULE 'one' MIGRATE FROM POOL 'yourdatapool' TO POOL 'yourdatapool' REPLICATE(1,1) (repeat rule for each POOL you have) Put that (those) rules in a file and do a "one time" run like mmapplypolicy yourfilesystem -P /path/to/rule -N nodelist-to-do-this-work -g /filesystem/bigtemp -I defer Then try your restripe again. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 21994 bytes Desc: not available URL: From bauer at cesnet.cz Thu Sep 1 15:28:36 2016 From: bauer at cesnet.cz (Miroslav Bauer) Date: Thu, 1 Sep 2016 16:28:36 +0200 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: <2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> <2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> Message-ID: <505ff859-d49a-04cc-bd9d-50f7b2a8df0b@cesnet.cz> Yes, failure group id is exactly what I meant :). Unfortunately, mmrestripefs with -R behaves the same as with -r. I also believed that mmrestripefs -R is the correct tool for fixing the replication settings on inodes (according to manpages), but I will try possible solutions you and Marc suggested and let you know how it went. Thank you, -- Miroslav Bauer On 09/01/2016 04:02 PM, Aaron Knister wrote: > Oh! I think you've already provided the info I was looking for :) I > thought that failGroup=3 meant there were 3 failure groups within the > SSDs. I suspect that's not at all what you meant and that actually is > the failure group of all of those disks. That I think explains what's > going on-- there's only one failure group's worth of metadata-capable > disks available and as such GPFS can't place the 2nd replica for > existing files. > > Here's what I would suggest: > > - Create at least 2 failure groups within the SSDs > - Put the default metadata replication factor back to 2 > - Run a restripefs -R to shuffle files around and restore the metadata > replication factor of 2 to any files created while it was set to 1 > > If you're not interested in replication for metadata then perhaps all > you need to do is the mmrestripefs -R. I think that should > un-replicate the file from the SATA disks leaving the copy on the SSDs. > > Hope that helps. > > -Aaron > > On 9/1/16 9:39 AM, Aaron Knister wrote: >> By the way, I suspect the no space on device errors are because GPFS >> believes for some reason that it is unable to maintain the metadata >> replication factor of 2 that's likely set on all previously created >> inodes. >> >> On 9/1/16 9:36 AM, Aaron Knister wrote: >>> I must admit, I'm curious as to the reason you're dropping the >>> replication factor from 2 down to 1. There are some serious advantages >>> we've seen to having multiple metadata replicas, as far as error >>> recovery is concerned. >>> >>> Could you paste an output of mmlsdisk for the filesystem? 
>>> >>> -Aaron >>> On 9/1/16 9:30 AM, Miroslav Bauer wrote: >>>> Hello, >>>> >>>> I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the >>>> filesystem metadata from state: >>>> -m = 2 (default metadata replicas) >>>> - SATA disks (dataAndMetadata, failGroup=1) >>>> - SSDs (metadataOnly, failGroup=3) >>>> to the desired state: >>>> -m = 1 >>>> - SATA disks (dataOnly, failGroup=1) >>>> - SSDs (metadataOnly, failGroup=3) >>>> >>>> I have done the following steps in the following order: >>>> 1) change SATA disks to dataOnly (stanza file modifies the 'usage' >>>> attribute only): >>>> # mmchdisk fs1 change -F dataOnly_disks.stanza >>>> Attention: Disk parameters were changed. >>>> Use the mmrestripefs command with the -r option to relocate data and >>>> metadata. >>>> Verifying file system configuration information ... >>>> mmchdisk: Propagating the cluster configuration data to all >>>> affected nodes. This is an asynchronous process. >>>> >>>> 2) change default metadata replicas number 2->1 >>>> # mmchfs fs1 -m 1 >>>> >>>> 3) run mmrestripefs as suggested by output of 1) >>>> # mmrestripefs fs1 -r >>>> Scanning file system metadata, phase 1 ... >>>> Error processing inodes. >>>> No space left on device >>>> mmrestripefs: Command failed. Examine previous error messages to >>>> determine cause. >>>> >>>> It is, however, still possible to create new files on the filesystem. >>>> When I return one of the SATA disks as a dataAndMetadata disk, the >>>> mmrestripefs >>>> command stops complaining about No space left on device. Both df and >>>> mmdf >>>> say that there is enough space both for data (SATA) and metadata >>>> (SSDs). >>>> Does anyone have an idea why is it complaining? >>>> >>>> Thanks, >>>> >>>> -- >>>> Miroslav Bauer >>>> >>>> >>>> >>>> >>>> _______________________________________________ >>>> gpfsug-discuss mailing list >>>> gpfsug-discuss at spectrumscale.org >>>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>>> >>> >> > -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 3716 bytes Desc: S/MIME Cryptographic Signature URL: From S.J.Thompson at bham.ac.uk Thu Sep 1 22:06:44 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Thu, 1 Sep 2016 21:06:44 +0000 Subject: [gpfsug-discuss] Maximum value for data replication? In-Reply-To: References: , , Message-ID: I have two protocol nodes in each of two data centres. So four protocol nodes in the cluster. Plus I also have a quorum vm which is lockstep/ha so guaranteed to survive in one of the data centres should we lose power. The protocol servers being protocol servers don't have access to the fibre channel storage. And we've seen ces do bad things when the storage cluster it is remotely mounting (and the ces root is on) fails/is under load etc. So the four full copies is about guaranteeing there are two full copies in both data centres. And remember this is only for the cesroot, so lock data for the ces ips, the smb registry I think as well. I was hoping that by making the cesroot in the protocol node cluster rather than a fileset on a remote mounted filesystem, that it would fix the ces weirdness we see as it would become a local gpfs file system. I guess three copies would maybe work. But also in another cluster, we have been thinking about adding NVMe into NSD servers for metadata and system.log and so I can see there are cases there where having higher numbers of copies would be useful.
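Purely as a sketch of the three-copy variant of that cesroot idea (the device, NSD and server names and the paths below are invented, and the exact procedure for changing cesSharedRoot should be checked against the documentation for the release in use), it might look roughly like:

%nsd: device=/dev/sdb nsd=ces_dc1_a servers=proto-dc1-a usage=dataAndMetadata failureGroup=1 pool=system
%nsd: device=/dev/sdb nsd=ces_dc1_b servers=proto-dc1-b usage=dataAndMetadata failureGroup=2 pool=system
%nsd: device=/dev/sdb nsd=ces_dc2_a servers=proto-dc2-a usage=dataAndMetadata failureGroup=3 pool=system
%nsd: device=/dev/sdb nsd=ces_dc2_b servers=proto-dc2-b usage=dataAndMetadata failureGroup=4 pool=system

# mmcrnsd -F ces_nsds.stanza
# mmcrfs cesfs -F ces_nsds.stanza -m 3 -M 3 -r 3 -R 3 -T /gpfs/cesfs
# mmmount cesfs -a
(stop the protocol services, rsync the existing cesroot across, then)
# mmchconfig cesSharedRoot=/gpfs/cesfs/cesroot
(and start the protocol services again)

With one failure group per protocol node and replication of 3, every block lands in three of the four failure groups, so each data centre always holds at least one copy; guaranteeing two full copies per site is exactly what would need a replication factor of 4.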
Yes I take the point that more copies means more load for the client, but in these cases, we aren't thinking about gpfs as the fastest possible hpc file system, but for other infrastructure purposes, which is one of the ways the product seems to be moving. Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Daniel Kidger [daniel.kidger at uk.ibm.com] Sent: 01 September 2016 12:22 To: gpfsug-discuss at spectrumscale.org Cc: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] Maximum value for data replication? Simon, Hi. Can you explain why you would like a full copy of all blocks on all 4 NSD servers ? Is there a particular use case, and hence an interest from product development? Otherwise remember that with 4 NSD servers, with one failure group per (storage rich) NSD server, then all 4 disk arrays will be loaded equally, as new files will get written to any 3 (or 2 or 1) of the 4 failure groups. Also remember that as you add more replication then there is more network load on the gpfs client as it has to perform all the writes itself. Perhaps someone technical can comment on the logic that determines which '3' out of 4 failure groups, a particular block is written to. Daniel [/spectrum_storage-banne] [Spectrum Scale Logo] Dr Daniel Kidger IBM Technical Sales Specialist Software Defined Solution Sales +44-07818 522 266 daniel.kidger at uk.ibm.com ----- Original message ----- From: Steve Duersch Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Cc: Subject: Re: [gpfsug-discuss] Maximum value for data replication? Date: Wed, Aug 31, 2016 1:45 PM >>Is there a maximum value for data replication in Spectrum Scale? The maximum value for replication is 3. Steve Duersch Spectrum Scale RAID 845-433-7902 IBM Poughkeepsie, New York [Inactive hide details for gpfsug-discuss-request---08/30/2016 07:25:24 PM---Send gpfsug-discuss mailing list submissions to gp]gpfsug-discuss-request---08/30/2016 07:25:24 PM---Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org From: gpfsug-discuss-request at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Date: 08/30/2016 07:25 PM Subject: gpfsug-discuss Digest, Vol 55, Issue 55 Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Maximum value for data replication? (Simon Thompson (Research Computing - IT Services)) 2. greetings (Kevin D Johnson) 3. GPFS 3.5.0 on RHEL 6.8 (Lukas Hejtmanek) 4. Re: GPFS 3.5.0 on RHEL 6.8 (Kevin D Johnson) 5. Re: GPFS 3.5.0 on RHEL 6.8 (mark.bergman at uphs.upenn.edu) 6. Re: *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Lukas Hejtmanek) 7. 
Re: *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" (Sven Oehme) ---------------------------------------------------------------------- Message: 1 Date: Tue, 30 Aug 2016 19:09:05 +0000 From: "Simon Thompson (Research Computing - IT Services)" To: "gpfsug-discuss at spectrumscale.org" Subject: [gpfsug-discuss] Maximum value for data replication? Message-ID: Content-Type: text/plain; charset="us-ascii" Is there a maximum value for data replication in Spectrum Scale? I have a number of nsd servers which have local storage and Id like each node to have a full copy of all the data in the file-system, say this value is 4, can I set replication to 4 for data and metadata and have each server have a full copy? These are protocol nodes and multi cluster mount another file system (yes I know not supported) and the cesroot is in the remote file system. On several occasions where GPFS has wibbled a bit, this has caused issues with ces locks, so I was thinking of moving the cesroot to a local filesysyem which is replicated on the local ssds in the protocol nodes. I.e. Its a generally quiet file system as its only ces cluster config. I assume if I stop protocols, rsync the data and then change to the new ces root, I should be able to get this working? Thanks Simon ------------------------------ Message: 2 Date: Tue, 30 Aug 2016 19:43:39 +0000 From: "Kevin D Johnson" To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] greetings Message-ID: Content-Type: text/plain; charset="us-ascii" An HTML attachment was scrubbed... URL: ------------------------------ Message: 3 Date: Tue, 30 Aug 2016 22:39:18 +0200 From: Lukas Hejtmanek To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: <20160830203917.qptfgqvlmdbzu6wr at ics.muni.cz> Content-Type: text/plain; charset=iso-8859-2 Hello, does it work for anyone? As of kernel 2.6.32-642, GPFS 3.5.0 (including the latest patch 32) does start but does not mount and file system. The internal mount cmd gets stucked. -- Luk?? Hejtm?nek ------------------------------ Message: 4 Date: Tue, 30 Aug 2016 20:51:39 +0000 From: "Kevin D Johnson" To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: Content-Type: text/plain; charset="us-ascii" An HTML attachment was scrubbed... URL: ------------------------------ Message: 5 Date: Tue, 30 Aug 2016 17:07:21 -0400 From: mark.bergman at uphs.upenn.edu To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8 Message-ID: <24437-1472591241.445832 at bR6O.TofS.917u> Content-Type: text/plain; charset="UTF-8" In the message dated: Tue, 30 Aug 2016 22:39:18 +0200, The pithy ruminations from Lukas Hejtmanek on <[gpfsug-discuss] GPFS 3.5.0 on RHEL 6.8> were: => Hello, GPFS 3.5.0.[23..3-0] work for me under [CentOS|ScientificLinux] 6.8, but at kernel 2.6.32-573 and lower. I've found kernel bugs in blk_cloned_rq_check_limits() in later kernel revs that caused multipath errors, resulting in GPFS being unable to find all NSDs and mount the filesystem. I am not updating to a newer kernel until I'm certain this is resolved. I opened a bug with CentOS: https://bugs.centos.org/view.php?id=10997 and began an extended discussion with the (RH & SUSE) developers of that chunk of kernel code. I don't know if an upstream bug has been opened by RH, but see: https://patchwork.kernel.org/patch/9140337/ => => does it work for anyone? 
As of kernel 2.6.32-642, GPFS 3.5.0 (including the => latest patch 32) does start but does not mount and file system. The internal => mount cmd gets stucked. => => -- => Luk?? Hejtm?nek -- Mark Bergman voice: 215-746-4061 mark.bergman at uphs.upenn.edu fax: 215-614-0266 http://www.cbica.upenn.edu/ IT Technical Director, Center for Biomedical Image Computing and Analytics Department of Radiology University of Pennsylvania PGP Key: http://www.cbica.upenn.edu/sbia/bergman ------------------------------ Message: 6 Date: Wed, 31 Aug 2016 00:02:50 +0200 From: Lukas Hejtmanek To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" Message-ID: <20160830220250.yt6r7gvfq7rlvtcs at ics.muni.cz> Content-Type: text/plain; charset=iso-8859-2 Hello, On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > Find the paper here: > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection thank you for the paper, I appreciate it. However, I wonder whether it could be extended a little. As it has the title Petascale Data Protection, I think that in Peta scale, you have to deal with millions (well rather hundreds of millions) of files you store in and this is something where TSM does not scale well. Could you give some hints: On the backup site: mmbackup takes ages for: a) scan (try to scan 500M files even in parallel) b) backup - what if 10 % of files get changed - backup process can be blocked several days as mmbackup cannot run in several instances on the same file system, so you have to wait until one run of mmbackup finishes. How long could it take at petascale? On the restore site: how can I restore e.g. 40 millions of file efficiently? dsmc restore '/path/*' runs into serious troubles after say 20M files (maybe wrong internal structures used), however, scanning 1000 more files takes several minutes resulting the dsmc restore never reaches that 40M files. using filelists the situation is even worse. I run dsmc restore -filelist with a filelist consisting of 2.4M files. Running for *two* days without restoring even a single file. dsmc is consuming 100 % CPU. So any hints addressing these issues with really large number of files would be even more appreciated. -- Luk?? Hejtm?nek ------------------------------ Message: 7 Date: Tue, 30 Aug 2016 16:24:59 -0700 From: Sven Oehme To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] *New* IBM Spectrum Protect Whitepaper "Petascale Data Protection" Message-ID: Content-Type: text/plain; charset="utf-8" so lets start with some simple questions. when you say mmbackup takes ages, what version of gpfs code are you running ? how do you execute the mmbackup command ? exact parameters would be useful . what HW are you using for the metadata disks ? how much capacity (df -h) and how many inodes (df -i) do you have in the filesystem you try to backup ? sven On Tue, Aug 30, 2016 at 3:02 PM, Lukas Hejtmanek wrote: > Hello, > > On Mon, Aug 29, 2016 at 09:20:46AM +0200, Frank Kraemer wrote: > > Find the paper here: > > > > https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/ > Tivoli%20Storage%20Manager/page/Petascale%20Data%20Protection > > thank you for the paper, I appreciate it. > > However, I wonder whether it could be extended a little. 
As it has the > title > Petascale Data Protection, I think that in Peta scale, you have to deal > with > millions (well rather hundreds of millions) of files you store in and this > is > something where TSM does not scale well. > > Could you give some hints: > > On the backup site: > mmbackup takes ages for: > a) scan (try to scan 500M files even in parallel) > b) backup - what if 10 % of files get changed - backup process can be > blocked > several days as mmbackup cannot run in several instances on the same file > system, so you have to wait until one run of mmbackup finishes. How long > could > it take at petascale? > > On the restore site: > how can I restore e.g. 40 millions of file efficiently? dsmc restore > '/path/*' > runs into serious troubles after say 20M files (maybe wrong internal > structures used), however, scanning 1000 more files takes several minutes > resulting the dsmc restore never reaches that 40M files. > > using filelists the situation is even worse. I run dsmc restore -filelist > with a filelist consisting of 2.4M files. Running for *two* days without > restoring even a single file. dsmc is consuming 100 % CPU. > > So any hints addressing these issues with really large number of files > would > be even more appreciated. > > -- > Luk?? Hejtm?nek > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: ------------------------------ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss End of gpfsug-discuss Digest, Vol 55, Issue 55 ********************************************** _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Unless stated otherwise above: IBM United Kingdom Limited - Registered in England and Wales with number 741598. Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU -------------- next part -------------- A non-text attachment was scrubbed... Name: Image.1__=0ABB0AB3DFD67DBA8f9e8a93df938 at us.ibm.com.gif Type: image/gif Size: 105 bytes Desc: Image.1__=0ABB0AB3DFD67DBA8f9e8a93df938 at us.ibm.com.gif URL: From r.sobey at imperial.ac.uk Fri Sep 2 14:37:26 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 2 Sep 2016 13:37:26 +0000 Subject: [gpfsug-discuss] CES node responding on system IP address Message-ID: Hi all, *Should* a CES node, 4.2.0 OR 4.2.1, be responding on its system IP address? The nodes in my cluster, seemingly randomly, either give me a list of shares, or prompt me to enter a username and password. For example, Start > Run \\cesnode.fqdn I get a prompt for a username and password. If I add the system IP into my hosts file and call it clustername.fqdn it responds normally i.e. no prompt for username or password. Should I be worried about the inconsistencies here? Richard Sobey Storage Area Network (SAN) Analyst Technical Operations, ICT Imperial College London South Kensington 403, City & Guilds Building London SW7 2AZ Tel: +44 (0)20 7594 6915 Email: r.sobey at imperial.ac.uk http://www.imperial.ac.uk/admin-services/ict/ -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From r.sobey at imperial.ac.uk Fri Sep 2 16:10:59 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 2 Sep 2016 15:10:59 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: Message-ID: I?ve verified the upgrade has fixed this issue so thanks again. However I?ve noticed that stopping SMB doesn?t trigger an IP address failover. I think it should. mmces node suspend (or rebooting, or mmces address move, or etc..) seems to trigger the failover. Richard From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Danny Alexander Calderon Rodriguez Sent: 27 August 2016 13:53 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Hi Richard This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly Just do this. edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py Change if authType == 'ad' and not nodeState.nfsStopped: to nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: You need to stop the gpfs service in each node where you apply the change " after change the lines please use tap key" Enviado desde mi iPhone El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) 2. Re: CES and mmuserauth command (Christof Schmitt) ---------------------------------------------------------------------- Message: 1 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Message-ID: > Content-Type: text/plain; charset="UTF-8" That would be the case when Active Directory is configured for authentication. In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: "Sobey, Richard A" > To: "'gpfsug-discuss at spectrumscale.org'" > Date: 08/26/2016 04:48 AM Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Sent by: gpfsug-discuss-bounces at spectrumscale.org Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. I?m trying to stop SMB on a node, but getting the following output: [root at cesnode ~]# mmces service stop smb smb: Request denied. Please stop NFS first [root at cesnode ~]# mmces service list Enabled services: SMB SMB is running As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. It?s happening on all the nodes in the cluster. SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. 
Richard_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ Message: 2 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] CES and mmuserauth command Message-ID: > Content-Type: text/plain; charset="ISO-2022-JP" The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Jan-Frode Myklebust > To: gpfsug main discussion list > Date: 08/26/2016 05:59 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: ------------------------------------------------------------------------------------------------------ ??user?name userName Specifies the user name to be used to perform operations against the authentication server. The specified user name must have sufficient permissions to read user and group attributes from the authentication server. ------------------------------------------------------------------------------------------------------- Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? -jf_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss End of gpfsug-discuss Digest, Vol 55, Issue 44 ********************************************** -------------- next part -------------- An HTML attachment was scrubbed... URL: From S.J.Thompson at bham.ac.uk Fri Sep 2 16:15:30 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Fri, 2 Sep 2016 15:15:30 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: , Message-ID: Should it? If you were running nfs and smb, would you necessarily want to fail the ip over? 
Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sobey, Richard A [r.sobey at imperial.ac.uk] Sent: 02 September 2016 16:10 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? I?ve verified the upgrade has fixed this issue so thanks again. However I?ve noticed that stopping SMB doesn?t trigger an IP address failover. I think it should. mmces node suspend (or rebooting, or mmces address move, or etc..) seems to trigger the failover. Richard From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Danny Alexander Calderon Rodriguez Sent: 27 August 2016 13:53 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Hi Richard This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly Just do this. edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py Change if authType == 'ad' and not nodeState.nfsStopped: to nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: You need to stop the gpfs service in each node where you apply the change " after change the lines please use tap key" Enviado desde mi iPhone El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) 2. Re: CES and mmuserauth command (Christof Schmitt) ---------------------------------------------------------------------- Message: 1 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Message-ID: > Content-Type: text/plain; charset="UTF-8" That would be the case when Active Directory is configured for authentication. In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: "Sobey, Richard A" > To: "'gpfsug-discuss at spectrumscale.org'" > Date: 08/26/2016 04:48 AM Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Sent by: gpfsug-discuss-bounces at spectrumscale.org Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. I?m trying to stop SMB on a node, but getting the following output: [root at cesnode ~]# mmces service stop smb smb: Request denied. Please stop NFS first [root at cesnode ~]# mmces service list Enabled services: SMB SMB is running As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. 
It?s happening on all the nodes in the cluster. SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. Richard_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ Message: 2 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] CES and mmuserauth command Message-ID: > Content-Type: text/plain; charset="ISO-2022-JP" The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Jan-Frode Myklebust > To: gpfsug main discussion list > Date: 08/26/2016 05:59 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: ------------------------------------------------------------------------------------------------------ ??user?name userName Specifies the user name to be used to perform operations against the authentication server. The specified user name must have sufficient permissions to read user and group attributes from the authentication server. ------------------------------------------------------------------------------------------------------- Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? -jf_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss End of gpfsug-discuss Digest, Vol 55, Issue 44 ********************************************** From r.sobey at imperial.ac.uk Fri Sep 2 16:23:28 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 2 Sep 2016 15:23:28 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? 
In-Reply-To: References: , Message-ID: A fair point, but since we're not running NFS, a failure of the only other service [SMB], whether it stops through user input or some other means, should cause the node to go unhealthy (in CTDB parlance) and trigger a failover. That would be my preference. Otoh if you were running NFS and SMB and one of those services crashed, do you still want a node in the cluster that could potentially respond and fails to do so? I guess it's a question for each organisation to answer themselves. -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Simon Thompson (Research Computing - IT Services) Sent: 02 September 2016 16:16 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Should it? If you were running nfs and smb, would you necessarily want to fail the ip over? Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sobey, Richard A [r.sobey at imperial.ac.uk] Sent: 02 September 2016 16:10 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? I've verified the upgrade has fixed this issue so thanks again. However I've noticed that stopping SMB doesn't trigger an IP address failover. I think it should. mmces node suspend (or rebooting, or mmces address move, or etc..) seems to trigger the failover. Richard From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Danny Alexander Calderon Rodriguez Sent: 27 August 2016 13:53 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Hi Richard This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly Just do this. edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py Change if authType == 'ad' and not nodeState.nfsStopped: to nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: You need to stop the gpfs service in each node where you apply the change " after change the lines please use tap key" Enviado desde mi iPhone El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) 2. Re: CES and mmuserauth command (Christof Schmitt) ---------------------------------------------------------------------- Message: 1 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Message-ID: > Content-Type: text/plain; charset="UTF-8" That would be the case when Active Directory is configured for authentication. 
In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: "Sobey, Richard A" > To: "'gpfsug-discuss at spectrumscale.org'" > Date: 08/26/2016 04:48 AM Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? Sent by: gpfsug-discuss-bounces at spectrumscale.org Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. I?m trying to stop SMB on a node, but getting the following output: [root at cesnode ~]# mmces service stop smb smb: Request denied. Please stop NFS first [root at cesnode ~]# mmces service list Enabled services: SMB SMB is running As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. It?s happening on all the nodes in the cluster. SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. Richard_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ Message: 2 Date: Fri, 26 Aug 2016 12:29:31 -0400 From: "Christof Schmitt" > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] CES and mmuserauth command Message-ID: > Content-Type: text/plain; charset="ISO-2022-JP" The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Jan-Frode Myklebust > To: gpfsug main discussion list > Date: 08/26/2016 05:59 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: ------------------------------------------------------------------------------------------------------ ??user?name userName Specifies the user name to be used to perform operations against the authentication server. The specified user name must have sufficient permissions to read user and group attributes from the authentication server. 
------------------------------------------------------------------------------------------------------- Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? -jf_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ------------------------------ _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss End of gpfsug-discuss Digest, Vol 55, Issue 44 ********************************************** _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From ulmer at ulmer.org Fri Sep 2 17:02:44 2016 From: ulmer at ulmer.org (Stephen Ulmer) Date: Fri, 2 Sep 2016 12:02:44 -0400 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: Message-ID: I think that stopping SMB is an explicitly different assertion than suspending the node, et cetera. When you ask the service to stop, it should stop -- not start a game of whack-a-mole. If you wanted to move the service there are other other ways. If it fails, clearly it the IP address should move. Liberty, -- Stephen > On Sep 2, 2016, at 11:23 AM, Sobey, Richard A wrote: > > A fair point, but since we're not running NFS, a failure of the only other service [SMB], whether it stops through user input or some other means, should cause the node to go unhealthy (in CTDB parlance) and trigger a failover. That would be my preference. > > Otoh if you were running NFS and SMB and one of those services crashed, do you still want a node in the cluster that could potentially respond and fails to do so? I guess it's a question for each organisation to answer themselves. > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Simon Thompson (Research Computing - IT Services) > Sent: 02 September 2016 16:16 > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > > > Should it? > > If you were running nfs and smb, would you necessarily want to fail the ip over? > > Simon > ________________________________________ > From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sobey, Richard A [r.sobey at imperial.ac.uk] > Sent: 02 September 2016 16:10 > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > > I've verified the upgrade has fixed this issue so thanks again. > > However I've noticed that stopping SMB doesn't trigger an IP address failover. I think it should. mmces node suspend (or rebooting, or mmces address move, or etc..) seems to trigger the failover. > > Richard > > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Danny Alexander Calderon Rodriguez > Sent: 27 August 2016 13:53 > To: gpfsug-discuss at spectrumscale.org > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > > Hi Richard > > This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly > > > Just do this. 
> > edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py > > > > Change > > if authType == 'ad' and not nodeState.nfsStopped: > > to > > > > nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) > if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: > > > You need to stop the gpfs service in each node where you apply the change > > > " after change the lines please use tap key" > > > > Enviado desde mi iPhone > > El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: > Send gpfsug-discuss mailing list submissions to > gpfsug-discuss at spectrumscale.org > > To subscribe or unsubscribe via the World Wide Web, visit > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > or, via email, send a message with subject or body 'help' to > gpfsug-discuss-request at spectrumscale.org > > You can reach the person managing the list at > gpfsug-discuss-owner at spectrumscale.org > > When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." > > > Today's Topics: > > 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) > 2. Re: CES and mmuserauth command (Christof Schmitt) > > > ---------------------------------------------------------------------- > > Message: 1 > Date: Fri, 26 Aug 2016 12:29:31 -0400 > From: "Christof Schmitt" > > To: gpfsug main discussion list > > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > Message-ID: > > > > Content-Type: text/plain; charset="UTF-8" > > That would be the case when Active Directory is configured for authentication. In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. > > Regards, > > Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ > christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) > > > > From: "Sobey, Richard A" > > To: "'gpfsug-discuss at spectrumscale.org'" > > > Date: 08/26/2016 04:48 AM > Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. > > I?m trying to stop SMB on a node, but getting the following output: > > [root at cesnode ~]# mmces service stop smb > smb: Request denied. Please stop NFS first > > [root at cesnode ~]# mmces service list > Enabled services: SMB > SMB is running > > As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. It?s happening on all the nodes in the cluster. > > SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. > > Richard_______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > ------------------------------ > > Message: 2 > Date: Fri, 26 Aug 2016 12:29:31 -0400 > From: "Christof Schmitt" > > To: gpfsug main discussion list > > Subject: Re: [gpfsug-discuss] CES and mmuserauth command > Message-ID: > > > > Content-Type: text/plain; charset="ISO-2022-JP" > > The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. > > The same applies to the information shown in "service list". 
There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and > LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. > > Regards, > > Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ > christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) > > > > From: Jan-Frode Myklebust > > To: gpfsug main discussion list > > Date: 08/26/2016 05:59 AM > Subject: Re: [gpfsug-discuss] CES and mmuserauth command > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: > > When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). > > > That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: > > ------------------------------------------------------------------------------------------------------ > ??user?name userName > Specifies the user name to be used to perform operations > against the authentication server. The specified user > name must have sufficient permissions to read user and > group attributes from the authentication server. > ------------------------------------------------------------------------------------------------------- > > Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? > > > > -jf_______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > > ------------------------------ > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > End of gpfsug-discuss Digest, Vol 55, Issue 44 > ********************************************** > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss From laurence at qsplace.co.uk Fri Sep 2 18:54:02 2016 From: laurence at qsplace.co.uk (Laurence Horrors-Barlow) Date: Fri, 2 Sep 2016 19:54:02 +0200 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: Message-ID: <721250E5-767B-4C44-A9E1-5DD255FD4F7D@qsplace.co.uk> I believe the services auto restart on a crash (or kill), a change I noticed between 4.1.1 and 4.2 hence no IP fail over. Suspending a node to force a fail over is possible the most sensible approach. -- Lauz Sent from my iPad > On 2 Sep 2016, at 18:02, Stephen Ulmer wrote: > > I think that stopping SMB is an explicitly different assertion than suspending the node, et cetera. 
When you ask the service to stop, it should stop -- not start a game of whack-a-mole. > > If you wanted to move the service there are other other ways. If it fails, clearly it the IP address should move. > > Liberty, > > -- > Stephen > > > >> On Sep 2, 2016, at 11:23 AM, Sobey, Richard A wrote: >> >> A fair point, but since we're not running NFS, a failure of the only other service [SMB], whether it stops through user input or some other means, should cause the node to go unhealthy (in CTDB parlance) and trigger a failover. That would be my preference. >> >> Otoh if you were running NFS and SMB and one of those services crashed, do you still want a node in the cluster that could potentially respond and fails to do so? I guess it's a question for each organisation to answer themselves. >> >> -----Original Message----- >> From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Simon Thompson (Research Computing - IT Services) >> Sent: 02 September 2016 16:16 >> To: gpfsug main discussion list >> Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? >> >> >> Should it? >> >> If you were running nfs and smb, would you necessarily want to fail the ip over? >> >> Simon >> ________________________________________ >> From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sobey, Richard A [r.sobey at imperial.ac.uk] >> Sent: 02 September 2016 16:10 >> To: gpfsug main discussion list >> Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? >> >> I've verified the upgrade has fixed this issue so thanks again. >> >> However I've noticed that stopping SMB doesn't trigger an IP address failover. I think it should. mmces node suspend (or rebooting, or mmces address move, or etc..) seems to trigger the failover. >> >> Richard >> >> From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Danny Alexander Calderon Rodriguez >> Sent: 27 August 2016 13:53 >> To: gpfsug-discuss at spectrumscale.org >> Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? >> >> Hi Richard >> >> This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly >> >> >> Just do this. >> >> edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py >> >> >> >> Change >> >> if authType == 'ad' and not nodeState.nfsStopped: >> >> to >> >> >> >> nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) >> if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: >> >> >> You need to stop the gpfs service in each node where you apply the change >> >> >> " after change the lines please use tap key" >> >> >> >> Enviado desde mi iPhone >> >> El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: >> Send gpfsug-discuss mailing list submissions to >> gpfsug-discuss at spectrumscale.org >> >> To subscribe or unsubscribe via the World Wide Web, visit >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> or, via email, send a message with subject or body 'help' to >> gpfsug-discuss-request at spectrumscale.org >> >> You can reach the person managing the list at >> gpfsug-discuss-owner at spectrumscale.org >> >> When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." >> >> >> Today's Topics: >> >> 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) >> 2. 
Re: CES and mmuserauth command (Christof Schmitt) >> >> >> ---------------------------------------------------------------------- >> >> Message: 1 >> Date: Fri, 26 Aug 2016 12:29:31 -0400 >> From: "Christof Schmitt" > >> To: gpfsug main discussion list > >> Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? >> Message-ID: >> > >> >> Content-Type: text/plain; charset="UTF-8" >> >> That would be the case when Active Directory is configured for authentication. In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. >> >> Regards, >> >> Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ >> christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) >> >> >> >> From: "Sobey, Richard A" > >> To: "'gpfsug-discuss at spectrumscale.org'" >> > >> Date: 08/26/2016 04:48 AM >> Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? >> Sent by: gpfsug-discuss-bounces at spectrumscale.org >> >> >> >> Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. >> >> I?m trying to stop SMB on a node, but getting the following output: >> >> [root at cesnode ~]# mmces service stop smb >> smb: Request denied. Please stop NFS first >> >> [root at cesnode ~]# mmces service list >> Enabled services: SMB >> SMB is running >> >> As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. It?s happening on all the nodes in the cluster. >> >> SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. >> >> Richard_______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> >> >> >> ------------------------------ >> >> Message: 2 >> Date: Fri, 26 Aug 2016 12:29:31 -0400 >> From: "Christof Schmitt" > >> To: gpfsug main discussion list > >> Subject: Re: [gpfsug-discuss] CES and mmuserauth command >> Message-ID: >> > >> >> Content-Type: text/plain; charset="ISO-2022-JP" >> >> The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. >> >> The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and >> LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. >> >> Regards, >> >> Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ >> christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) >> >> >> >> From: Jan-Frode Myklebust > >> To: gpfsug main discussion list > >> Date: 08/26/2016 05:59 AM >> Subject: Re: [gpfsug-discuss] CES and mmuserauth command >> Sent by: gpfsug-discuss-bounces at spectrumscale.org >> >> >> >> >> On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: >> >> When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. 
the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). >> >> >> That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: >> >> ------------------------------------------------------------------------------------------------------ >> ??user?name userName >> Specifies the user name to be used to perform operations >> against the authentication server. The specified user >> name must have sufficient permissions to read user and >> group attributes from the authentication server. >> ------------------------------------------------------------------------------------------------------- >> >> Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? >> >> >> >> -jf_______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> >> >> >> >> ------------------------------ >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> End of gpfsug-discuss Digest, Vol 55, Issue 44 >> ********************************************** >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > From christof.schmitt at us.ibm.com Fri Sep 2 19:20:45 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Fri, 2 Sep 2016 11:20:45 -0700 Subject: [gpfsug-discuss] CES and mmuserauth command In-Reply-To: References: Message-ID: After looking into this again, the source of confusion is probably from the fact that there are three different authentication schemes present here: When configuring a LDAP server for file or object authentication, then the specified server, user and password are used during normal operations for querying user data. The same applies for configuring object authentication with AD; AD is here treated as a LDAP server. Configuring AD for file authentication is different in that during the "mmuserauth service create", the machine account is created, and then that account is used to connect to a DC that is chosen from the DCs discovered through DNS and not necessarily the one used for the initial configuration. I submitted an internal request to explain this better in the mmuserauth manpage. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Christof Schmitt/Tucson/IBM at IBMUS To: gpfsug main discussion list Date: 08/26/2016 09:30 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. 
I will try to get some clarification added for the AD case. The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Jan-Frode Myklebust To: gpfsug main discussion list Date: 08/26/2016 05:59 AM Subject: Re: [gpfsug-discuss] CES and mmuserauth command Sent by: gpfsug-discuss-bounces at spectrumscale.org On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: ------------------------------------------------------------------------------------------------------ ??user?name userName Specifies the user name to be used to perform operations against the authentication server. The specified user name must have sufficient permissions to read user and group attributes from the authentication server. ------------------------------------------------------------------------------------------------------- Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? -jf_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From r.sobey at imperial.ac.uk Fri Sep 2 22:02:03 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 2 Sep 2016 21:02:03 +0000 Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? In-Reply-To: References: , Message-ID: That makes more sense putting it that way. Cheers Richard Get Outlook for Android On Fri, Sep 2, 2016 at 5:04 PM +0100, "Stephen Ulmer" > wrote: I think that stopping SMB is an explicitly different assertion than suspending the node, et cetera. When you ask the service to stop, it should stop -- not start a game of whack-a-mole. If you wanted to move the service there are other other ways. If it fails, clearly it the IP address should move. Liberty, -- Stephen > On Sep 2, 2016, at 11:23 AM, Sobey, Richard A wrote: > > A fair point, but since we're not running NFS, a failure of the only other service [SMB], whether it stops through user input or some other means, should cause the node to go unhealthy (in CTDB parlance) and trigger a failover. That would be my preference. > > Otoh if you were running NFS and SMB and one of those services crashed, do you still want a node in the cluster that could potentially respond and fails to do so? 
I guess it's a question for each organisation to answer themselves. > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Simon Thompson (Research Computing - IT Services) > Sent: 02 September 2016 16:16 > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > > > Should it? > > If you were running nfs and smb, would you necessarily want to fail the ip over? > > Simon > ________________________________________ > From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Sobey, Richard A [r.sobey at imperial.ac.uk] > Sent: 02 September 2016 16:10 > To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > > I've verified the upgrade has fixed this issue so thanks again. > > However I've noticed that stopping SMB doesn't trigger an IP address failover. I think it should. mmces node suspend (or rebooting, or mmces address move, or etc..) seems to trigger the failover. > > Richard > > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Danny Alexander Calderon Rodriguez > Sent: 27 August 2016 13:53 > To: gpfsug-discuss at spectrumscale.org > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > > Hi Richard > > This is fixed in release 4.2.1, if you cant upgrade now, you can fix this manuallly > > > Just do this. > > edit file /usr/lpp/mmfs/lib/mmcesmon/SMBService.py > > > > Change > > if authType == 'ad' and not nodeState.nfsStopped: > > to > > > > nfsEnabled = utils.isProtocolEnabled("NFS", self.logger) > if authType == 'ad' and not nodeState.nfsStopped and nfsEnabled: > > > You need to stop the gpfs service in each node where you apply the change > > > " after change the lines please use tap key" > > > > Enviado desde mi iPhone > > El 27/08/2016, a las 6:00 a.m., gpfsug-discuss-request at spectrumscale.org escribi?: > Send gpfsug-discuss mailing list submissions to > gpfsug-discuss at spectrumscale.org > > To subscribe or unsubscribe via the World Wide Web, visit > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > or, via email, send a message with subject or body 'help' to > gpfsug-discuss-request at spectrumscale.org > > You can reach the person managing the list at > gpfsug-discuss-owner at spectrumscale.org > > When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." > > > Today's Topics: > > 1. Re: Cannot stop SMB... stop NFS first?(Christof Schmitt) > 2. Re: CES and mmuserauth command (Christof Schmitt) > > > ---------------------------------------------------------------------- > > Message: 1 > Date: Fri, 26 Aug 2016 12:29:31 -0400 > From: "Christof Schmitt" > > To: gpfsug main discussion list > > Subject: Re: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > Message-ID: > > > > Content-Type: text/plain; charset="UTF-8" > > That would be the case when Active Directory is configured for authentication. In that case the SMB service includes two aspects: One is the actual SMB file server, and the second one is the service for the Active Directory integration. Since NFS depends on authentication and id mapping services, it requires SMB to be running. 
> > Regards, > > Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ > christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) > > > > From: "Sobey, Richard A" > > To: "'gpfsug-discuss at spectrumscale.org'" > > > Date: 08/26/2016 04:48 AM > Subject: [gpfsug-discuss] Cannot stop SMB... stop NFS first? > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Sorry all, prepare for a deluge of emails like this, hopefully it?ll help other people implementing CES in the future. > > I?m trying to stop SMB on a node, but getting the following output: > > [root at cesnode ~]# mmces service stop smb > smb: Request denied. Please stop NFS first > > [root at cesnode ~]# mmces service list > Enabled services: SMB > SMB is running > > As you can see there is no way to stop NFS when it?s not running but it seems to be blocking me. It?s happening on all the nodes in the cluster. > > SS version is 4.2.0 running on a fully up to date RHEL 7.1 server. > > Richard_______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > ------------------------------ > > Message: 2 > Date: Fri, 26 Aug 2016 12:29:31 -0400 > From: "Christof Schmitt" > > To: gpfsug main discussion list > > Subject: Re: [gpfsug-discuss] CES and mmuserauth command > Message-ID: > > > > Content-Type: text/plain; charset="ISO-2022-JP" > > The --user-name option applies to both, AD and LDAP authentication. In the LDAP case, this information is correct. I will try to get some clarification added for the AD case. > > The same applies to the information shown in "service list". There is a common field that holds the information and the parameter from the initial "service create" is stored there. The meaning is different for AD and > LDAP: For LDAP it is the username being used to access the LDAP server, while in the AD case it was only the user initially used until the machine account was created. > > Regards, > > Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ > christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) > > > > From: Jan-Frode Myklebust > > To: gpfsug main discussion list > > Date: 08/26/2016 05:59 AM > Subject: Re: [gpfsug-discuss] CES and mmuserauth command > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > On Fri, Aug 26, 2016 at 1:49 AM, Christof Schmitt < christof.schmitt at us.ibm.com> wrote: > > When joinging the AD domain, --user-name, --password and --server are only used to initially identify and logon to the AD and to create the machine account for the cluster. Once that is done, that information is no longer used, and e.g. the account from --user-name could be deleted, the password changed or the specified DC could be removed from the domain (as long as other DCs are remaining). > > > That was my initial understanding of the --user-name, but when reading the man-page I get the impression that it's also used to do connect to AD to do user and group lookups: > > ------------------------------------------------------------------------------------------------------ > ??user?name userName > Specifies the user name to be used to perform operations > against the authentication server. The specified user > name must have sufficient permissions to read user and > group attributes from the authentication server. 
> ------------------------------------------------------------------------------------------------------- > > Also it's strange that "mmuserauth service list" would list the USER_NAME if it was only somthing that was used at configuration time..? > > > > -jf_______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > > ------------------------------ > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > End of gpfsug-discuss Digest, Vol 55, Issue 44 > ********************************************** > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From bauer at cesnet.cz Mon Sep 5 14:30:54 2016 From: bauer at cesnet.cz (Miroslav Bauer) Date: Mon, 5 Sep 2016 15:30:54 +0200 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state Message-ID: <9bfb2882-10de-5f2c-98c1-35ac2ac958a2@cesnet.cz> Hello, is there any way to recall a migrated file back to a regular state (other than renaming a file)? I would like to free some space on an external pool (TSM), that is being used by migrated files. And it would be desirable to prevent repeated backups of an already backed-up data (due to changed ctime/inode). I guess that you can acheive only premigrated state with dsmrecall tool (two copies of file data - one on GPFS pool and one on external pool). Maybe deleting 'dmapi.IBMPMig' xattr will do the trick but I don't think it's safe, nor clean :). Thank you in advance, -- Miroslav Bauer -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 3716 bytes Desc: S/MIME Cryptographic Signature URL: From janfrode at tanso.net Mon Sep 5 14:51:44 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Mon, 05 Sep 2016 13:51:44 +0000 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: <9bfb2882-10de-5f2c-98c1-35ac2ac958a2@cesnet.cz> References: <9bfb2882-10de-5f2c-98c1-35ac2ac958a2@cesnet.cz> Message-ID: I believe what you're looking for is dsmrecall -RESident. Plus reconcile on tsm-server to free up the space. Ref: http://www.ibm.com/support/knowledgecenter/SSSR2R_7.1.2/com.ibm.itsm.hsmul.doc/r_cmd_dsmrecall.html -jf man. 5. sep. 2016 kl. 15.30 skrev Miroslav Bauer : > Hello, > > is there any way to recall a migrated file back to a regular state > (other than renaming a file)? I would like to free some space > on an external pool (TSM), that is being used by migrated files. > And it would be desirable to prevent repeated backups of an > already backed-up data (due to changed ctime/inode). > > I guess that you can acheive only premigrated state with dsmrecall tool > (two copies of file data - one on GPFS pool and one on external pool). > Maybe deleting 'dmapi.IBMPMig' xattr will do the trick but I don't think > it's safe, nor clean :). 
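For reference, the recall-to-resident flow suggested above looks roughly like this (a sketch with placeholder paths; the file-list option spelling is an assumption on my part, so check the dsmrecall documentation linked above for the exact syntax in your TSM for Space Management release):

# Recall a single migrated file back to resident state, printing per-file detail:
dsmrecall -Resident -Detail /gpfs/fs1/somedir/largefile.dat

# Or drive the recall from a list of paths (one per line); the -filelist option
# name is an assumption here -- confirm it against your dsmrecall version:
dsmrecall -Resident -Detail -filelist=/tmp/recall.list

# Afterwards, reconcile the file system against the TSM server so the space
# held by the now-resident files can be reclaimed:
dsmreconcile /gpfs/fs1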
> > Thank you in advance, > > -- > Miroslav Bauer > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From bauer at cesnet.cz Mon Sep 5 15:13:42 2016 From: bauer at cesnet.cz (Miroslav Bauer) Date: Mon, 5 Sep 2016 16:13:42 +0200 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: References: <9bfb2882-10de-5f2c-98c1-35ac2ac958a2@cesnet.cz> Message-ID: <55108548-0515-49c8-0e76-fca9b247d337@cesnet.cz> That's right, I must have totally overlooked that! Many thanks! :) -- Miroslav Bauer On 09/05/2016 03:51 PM, Jan-Frode Myklebust wrote: > I believe what you're looking for is dsmrecall -RESident. Plus > reconcile on tsm-server to free up the space. > > Ref: > > http://www.ibm.com/support/knowledgecenter/SSSR2R_7.1.2/com.ibm.itsm.hsmul.doc/r_cmd_dsmrecall.html > > > -jf > man. 5. sep. 2016 kl. 15.30 skrev Miroslav Bauer >: > > Hello, > > is there any way to recall a migrated file back to a regular state > (other than renaming a file)? I would like to free some space > on an external pool (TSM), that is being used by migrated files. > And it would be desirable to prevent repeated backups of an > already backed-up data (due to changed ctime/inode). > > I guess that you can acheive only premigrated state with dsmrecall > tool > (two copies of file data - one on GPFS pool and one on external pool). > Maybe deleting 'dmapi.IBMPMig' xattr will do the trick but I don't > think > it's safe, nor clean :). > > Thank you in advance, > > -- > Miroslav Bauer > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 3716 bytes Desc: S/MIME Cryptographic Signature URL: From mark.birmingham at stfc.ac.uk Mon Sep 5 15:27:29 2016 From: mark.birmingham at stfc.ac.uk (mark.birmingham at stfc.ac.uk) Date: Mon, 5 Sep 2016 14:27:29 +0000 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: <55108548-0515-49c8-0e76-fca9b247d337@cesnet.cz> References: <9bfb2882-10de-5f2c-98c1-35ac2ac958a2@cesnet.cz> <55108548-0515-49c8-0e76-fca9b247d337@cesnet.cz> Message-ID: <47B8D67E32CC2D44A587CD18636BECC82BB3A610@exchmbx01> Yes, that's fine. Just submit the request through SBS. Mark Mark Birmingham Development Team Leader High Performance Systems Group STFC Daresbury Laboratory Phone: +44 (0)1925 603381 Email: mark.birmingham at stfc.ac.uk From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Miroslav Bauer Sent: 05 September 2016 15:14 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state That's right, I must have totally overlooked that! Many thanks! :) -- Miroslav Bauer On 09/05/2016 03:51 PM, Jan-Frode Myklebust wrote: I believe what you're looking for is dsmrecall -RESident. Plus reconcile on tsm-server to free up the space. 
Ref: http://www.ibm.com/support/knowledgecenter/SSSR2R_7.1.2/com.ibm.itsm.hsmul.doc/r_cmd_dsmrecall.html -jf man. 5. sep. 2016 kl. 15.30 skrev Miroslav Bauer >: Hello, is there any way to recall a migrated file back to a regular state (other than renaming a file)? I would like to free some space on an external pool (TSM), that is being used by migrated files. And it would be desirable to prevent repeated backups of an already backed-up data (due to changed ctime/inode). I guess that you can acheive only premigrated state with dsmrecall tool (two copies of file data - one on GPFS pool and one on external pool). Maybe deleting 'dmapi.IBMPMig' xattr will do the trick but I don't think it's safe, nor clean :). Thank you in advance, -- Miroslav Bauer _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From mark.birmingham at stfc.ac.uk Mon Sep 5 15:30:53 2016 From: mark.birmingham at stfc.ac.uk (mark.birmingham at stfc.ac.uk) Date: Mon, 5 Sep 2016 14:30:53 +0000 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: <47B8D67E32CC2D44A587CD18636BECC82BB3A610@exchmbx01> References: <9bfb2882-10de-5f2c-98c1-35ac2ac958a2@cesnet.cz> <55108548-0515-49c8-0e76-fca9b247d337@cesnet.cz> <47B8D67E32CC2D44A587CD18636BECC82BB3A610@exchmbx01> Message-ID: <47B8D67E32CC2D44A587CD18636BECC82BB3A62A@exchmbx01> Sorry All! Noob error - replied to the wrong email!!! Mark Mark Birmingham Development Team Leader High Performance Systems Group STFC Daresbury Laboratory Phone: +44 (0)1925 603381 Email: mark.birmingham at stfc.ac.uk From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of mark.birmingham at stfc.ac.uk Sent: 05 September 2016 15:27 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state Yes, that's fine. Just submit the request through SBS. Mark Mark Birmingham Development Team Leader High Performance Systems Group STFC Daresbury Laboratory Phone: +44 (0)1925 603381 Email: mark.birmingham at stfc.ac.uk From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Miroslav Bauer Sent: 05 September 2016 15:14 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state That's right, I must have totally overlooked that! Many thanks! :) -- Miroslav Bauer On 09/05/2016 03:51 PM, Jan-Frode Myklebust wrote: I believe what you're looking for is dsmrecall -RESident. Plus reconcile on tsm-server to free up the space. Ref: http://www.ibm.com/support/knowledgecenter/SSSR2R_7.1.2/com.ibm.itsm.hsmul.doc/r_cmd_dsmrecall.html -jf man. 5. sep. 2016 kl. 15.30 skrev Miroslav Bauer >: Hello, is there any way to recall a migrated file back to a regular state (other than renaming a file)? I would like to free some space on an external pool (TSM), that is being used by migrated files. And it would be desirable to prevent repeated backups of an already backed-up data (due to changed ctime/inode). I guess that you can acheive only premigrated state with dsmrecall tool (two copies of file data - one on GPFS pool and one on external pool). 
Maybe deleting 'dmapi.IBMPMig' xattr will do the trick but I don't think it's safe, nor clean :). Thank you in advance, -- Miroslav Bauer _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From dominic.mueller at de.ibm.com Tue Sep 6 13:04:36 2016 From: dominic.mueller at de.ibm.com (Dominic Mueller-Wicke01) Date: Tue, 6 Sep 2016 14:04:36 +0200 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: References: Message-ID: Hi Miroslav, please use the command: > dsmrecall -resident -detail or use it with file lists Greetings, Dominic. From: gpfsug-discuss-request at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Date: 06.09.2016 13:00 Subject: gpfsug-discuss Digest, Vol 56, Issue 10 Sent by: gpfsug-discuss-bounces at spectrumscale.org Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: DMAPI - Unmigrate file to Regular state (mark.birmingham at stfc.ac.uk) ----- Message from on Mon, 5 Sep 2016 14:30:53 +0000 ----- To: Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state Sorry All! Noob error ? replied to the wrong email!!! Mark Mark Birmingham Development Team Leader High Performance Systems Group STFC Daresbury Laboratory Phone: +44 (0)1925 603381 Email: mark.birmingham at stfc.ac.uk From: gpfsug-discuss-bounces at spectrumscale.org [ mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of mark.birmingham at stfc.ac.uk Sent: 05 September 2016 15:27 To: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state Yes, that?s fine. Just submit the request through SBS. Mark Mark Birmingham Development Team Leader High Performance Systems Group STFC Daresbury Laboratory Phone: +44 (0)1925 603381 Email: mark.birmingham at stfc.ac.uk From: gpfsug-discuss-bounces at spectrumscale.org [ mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Miroslav Bauer Sent: 05 September 2016 15:14 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state That's right, I must have totally overlooked that! Many thanks! :) -- Miroslav Bauer On 09/05/2016 03:51 PM, Jan-Frode Myklebust wrote: I believe what you're looking for is dsmrecall -RESident. Plus reconcile on tsm-server to free up the space. Ref: http://www.ibm.com/support/knowledgecenter/SSSR2R_7.1.2/com.ibm.itsm.hsmul.doc/r_cmd_dsmrecall.html -jf man. 5. sep. 2016 kl. 15.30 skrev Miroslav Bauer : Hello, is there any way to recall a migrated file back to a regular state (other than renaming a file)? I would like to free some space on an external pool (TSM), that is being used by migrated files. 
And it would be desirable to prevent repeated backups of an already backed-up data (due to changed ctime/inode). I guess that you can acheive only premigrated state with dsmrecall tool (two copies of file data - one on GPFS pool and one on external pool). Maybe deleting 'dmapi.IBMPMig' xattr will do the trick but I don't think it's safe, nor clean :). Thank you in advance, -- Miroslav Bauer _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From volobuev at us.ibm.com Tue Sep 6 20:06:32 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Tue, 6 Sep 2016 12:06:32 -0700 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: <505ff859-d49a-04cc-bd9d-50f7b2a8df0b@cesnet.cz> References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz><2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> <505ff859-d49a-04cc-bd9d-50f7b2a8df0b@cesnet.cz> Message-ID: The correct way to accomplish what you're looking for (in particular, changing the fs-wide level of replication) is mmrestripefs -R. This command also takes care of moving data off disks now marked metadataOnly. The restripe job hits an error trying to move blocks of the inode file, i.e. before it gets to actual user data blocks. Note that at this point the metadata replication factor is still 2. This suggests one of two possibilities: (1) there isn't enough actual free space on the remaining metadataOnly disks, (2) there isn't enough space in some failure groups to allocate two replicas. All of this assumes you're operating within a single storage pool. If multiple storage pools are in play, there are other possibilities. 'mmdf' output would be helpful in providing more helpful advice. With the information at hand, I can only suggest trying to accomplish the task in two phases: (a) deallocated extra metadata replicas, by doing mmchfs -m 1 + mmrestripefs -R (b) move metadata off SATA disks. I do want to point out that metadata replication is a highly recommended insurance policy to have for your file system. As with other kinds of insurance, you may or may not need it, but if you do end up needing it, you'll be very glad you have it. The costs, in terms of extra metadata space and performance overhead, are very reasonable. yuri From: Miroslav Bauer To: gpfsug-discuss at spectrumscale.org, Date: 09/01/2016 07:29 AM Subject: Re: [gpfsug-discuss] Migration to separate metadata and data disks Sent by: gpfsug-discuss-bounces at spectrumscale.org Yes, failure group id is exactly what I meant :). Unfortunately, mmrestripefs with -R behaves the same as with -r. I also believed that mmrestripefs -R is the correct tool for fixing the replication settings on inodes (according to manpages), but I will try possible solutions you and Marc suggested and let you know how it went. Thank you, -- Miroslav Bauer On 09/01/2016 04:02 PM, Aaron Knister wrote: > Oh! 
I think you've already provided the info I was looking for :) I > thought that failGroup=3 meant there were 3 failure groups within the > SSDs. I suspect that's not at all what you meant and that actually is > the failure group of all of those disks. That I think explains what's > going on-- there's only one failure group's worth of metadata-capable > disks available and as such GPFS can't place the 2nd replica for > existing files. > > Here's what I would suggest: > > - Create at least 2 failure groups within the SSDs > - Put the default metadata replication factor back to 2 > - Run a restripefs -R to shuffle files around and restore the metadata > replication factor of 2 to any files created while it was set to 1 > > If you're not interested in replication for metadata then perhaps all > you need to do is the mmrestripefs -R. I think that should > un-replicate the file from the SATA disks leaving the copy on the SSDs. > > Hope that helps. > > -Aaron > > On 9/1/16 9:39 AM, Aaron Knister wrote: >> By the way, I suspect the no space on device errors are because GPFS >> believes for some reason that it is unable to maintain the metadata >> replication factor of 2 that's likely set on all previously created >> inodes. >> >> On 9/1/16 9:36 AM, Aaron Knister wrote: >>> I must admit, I'm curious as to the reason you're dropping the >>> replication factor from 2 down to 1. There are some serious advantages >>> we've seen to having multiple metadata replicas, as far as error >>> recovery is concerned. >>> >>> Could you paste an output of mmlsdisk for the filesystem? >>> >>> -Aaron >>> >>> On 9/1/16 9:30 AM, Miroslav Bauer wrote: >>>> Hello, >>>> >>>> I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the >>>> filesystem metadata from state: >>>> -m = 2 (default metadata replicas) >>>> - SATA disks (dataAndMetadata, failGroup=1) >>>> - SSDs (metadataOnly, failGroup=3) >>>> to the desired state: >>>> -m = 1 >>>> - SATA disks (dataOnly, failGroup=1) >>>> - SSDs (metadataOnly, failGroup=3) >>>> >>>> I have done the following steps in the following order: >>>> 1) change SATA disks to dataOnly (stanza file modifies the 'usage' >>>> attribute only): >>>> # mmchdisk fs1 change -F dataOnly_disks.stanza >>>> Attention: Disk parameters were changed. >>>> Use the mmrestripefs command with the -r option to relocate data and >>>> metadata. >>>> Verifying file system configuration information ... >>>> mmchdisk: Propagating the cluster configuration data to all >>>> affected nodes. This is an asynchronous process. >>>> >>>> 2) change default metadata replicas number 2->1 >>>> # mmchfs fs1 -m 1 >>>> >>>> 3) run mmrestripefs as suggested by output of 1) >>>> # mmrestripefs fs1 -r >>>> Scanning file system metadata, phase 1 ... >>>> Error processing inodes. >>>> No space left on device >>>> mmrestripefs: Command failed. Examine previous error messages to >>>> determine cause. >>>> >>>> It is, however, still possible to create new files on the filesystem. >>>> When I return one of the SATA disks as a dataAndMetadata disk, the >>>> mmrestripefs >>>> command stops complaining about No space left on device. Both df and >>>> mmdf >>>> say that there is enough space both for data (SATA) and metadata >>>> (SSDs). >>>> Does anyone have an idea why is it complaining? 
>>>> >>>> Thanks, >>>> >>>> -- >>>> Miroslav Bauer >>>> >>>> >>>> >>>> >>>> _______________________________________________ >>>> gpfsug-discuss mailing list >>>> gpfsug-discuss at spectrumscale.org >>>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>>> >>> >> > [attachment "smime.p7s" deleted by Yuri L Volobuev/Austin/IBM] _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From bauer at cesnet.cz Wed Sep 7 10:40:19 2016 From: bauer at cesnet.cz (Miroslav Bauer) Date: Wed, 7 Sep 2016 11:40:19 +0200 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz> <2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov> <505ff859-d49a-04cc-bd9d-50f7b2a8df0b@cesnet.cz> Message-ID: Hello Yuri, here goes the actual mmdf output of filesystem in question: disk disk size failure holds holds free free name group metadata data in full blocks in fragments --------------- ------------- -------- -------- ----- -------------------- ------------------- Disks in storage pool: system (Maximum disk size allowed is 40 TB) dcsh_10C 5T 1 Yes Yes 1.661T ( 33%) 68.48G ( 1%) dcsh_10D 6.828T 1 Yes Yes 2.809T ( 41%) 83.82G ( 1%) dcsh_11C 5T 1 Yes Yes 1.659T ( 33%) 69.01G ( 1%) dcsh_11D 6.828T 1 Yes Yes 2.81T ( 41%) 83.33G ( 1%) dcsh_12C 5T 1 Yes Yes 1.659T ( 33%) 69.48G ( 1%) dcsh_12D 6.828T 1 Yes Yes 2.807T ( 41%) 83.14G ( 1%) dcsh_13C 5T 1 Yes Yes 1.659T ( 33%) 69.35G ( 1%) dcsh_13D 6.828T 1 Yes Yes 2.81T ( 41%) 82.97G ( 1%) dcsh_14C 5T 1 Yes Yes 1.66T ( 33%) 69.06G ( 1%) dcsh_14D 6.828T 1 Yes Yes 2.811T ( 41%) 83.61G ( 1%) dcsh_15C 5T 1 Yes Yes 1.658T ( 33%) 69.38G ( 1%) dcsh_15D 6.828T 1 Yes Yes 2.814T ( 41%) 83.69G ( 1%) dcsd_15D 6.828T 1 Yes Yes 2.811T ( 41%) 83.98G ( 1%) dcsd_15C 5T 1 Yes Yes 1.66T ( 33%) 68.66G ( 1%) dcsd_14D 6.828T 1 Yes Yes 2.81T ( 41%) 84.18G ( 1%) dcsd_14C 5T 1 Yes Yes 1.659T ( 33%) 69.43G ( 1%) dcsd_13D 6.828T 1 Yes Yes 2.81T ( 41%) 83.27G ( 1%) dcsd_13C 5T 1 Yes Yes 1.66T ( 33%) 69.1G ( 1%) dcsd_12D 6.828T 1 Yes Yes 2.81T ( 41%) 83.61G ( 1%) dcsd_12C 5T 1 Yes Yes 1.66T ( 33%) 69.42G ( 1%) dcsd_11D 6.828T 1 Yes Yes 2.811T ( 41%) 83.59G ( 1%) dcsh_10B 5T 1 Yes Yes 1.633T ( 33%) 76.97G ( 2%) dcsh_11A 5T 1 Yes Yes 1.632T ( 33%) 77.29G ( 2%) dcsh_11B 5T 1 Yes Yes 1.633T ( 33%) 76.73G ( 1%) dcsh_12A 5T 1 Yes Yes 1.634T ( 33%) 76.49G ( 1%) dcsd_11C 5T 1 Yes Yes 1.66T ( 33%) 69.25G ( 1%) dcsd_10D 6.828T 1 Yes Yes 2.811T ( 41%) 83.39G ( 1%) dcsh_10A 5T 1 Yes Yes 1.633T ( 33%) 77.06G ( 2%) dcsd_10C 5T 1 Yes Yes 1.66T ( 33%) 69.83G ( 1%) dcsd_15B 5T 1 Yes Yes 1.635T ( 33%) 76.52G ( 1%) dcsd_15A 5T 1 Yes Yes 1.634T ( 33%) 76.24G ( 1%) dcsd_14B 5T 1 Yes Yes 1.634T ( 33%) 76.31G ( 1%) dcsd_14A 5T 1 Yes Yes 1.634T ( 33%) 76.23G ( 1%) dcsd_13B 5T 1 Yes Yes 1.634T ( 33%) 76.13G ( 1%) dcsd_13A 5T 1 Yes Yes 1.634T ( 33%) 76.22G ( 1%) dcsd_12B 5T 1 Yes Yes 1.635T ( 33%) 77.49G ( 2%) dcsd_12A 5T 1 Yes Yes 1.633T ( 33%) 77.13G ( 2%) dcsd_11B 5T 1 Yes Yes 1.633T ( 33%) 76.86G ( 2%) dcsd_11A 5T 1 Yes Yes 1.632T ( 33%) 76.22G ( 1%) dcsd_10B 5T 1 Yes Yes 1.633T ( 33%) 76.79G ( 1%) dcsd_10A 5T 1 Yes Yes 1.633T ( 33%) 77.21G ( 2%) dcsh_15B 5T 1 Yes Yes 1.635T ( 33%) 76.04G ( 1%) dcsh_15A 5T 1 Yes Yes 
1.634T ( 33%) 76.84G ( 2%) dcsh_14B 5T 1 Yes Yes 1.635T ( 33%) 76.75G ( 1%) dcsh_14A 5T 1 Yes Yes 1.633T ( 33%) 76.05G ( 1%) dcsh_13B 5T 1 Yes Yes 1.634T ( 33%) 76.35G ( 1%) dcsh_13A 5T 1 Yes Yes 1.634T ( 33%) 76.68G ( 1%) dcsh_12B 5T 1 Yes Yes 1.635T ( 33%) 76.74G ( 1%) ssd_5_5 80G 3 Yes No 22.31G ( 28%) 7.155G ( 9%) ssd_4_4 80G 3 Yes No 22.21G ( 28%) 7.196G ( 9%) ssd_3_3 80G 3 Yes No 22.2G ( 28%) 7.239G ( 9%) ssd_2_2 80G 3 Yes No 22.24G ( 28%) 7.146G ( 9%) ssd_1_1 80G 3 Yes No 22.29G ( 28%) 7.134G ( 9%) ------------- -------------------- ------------------- (pool total) 262.3T 92.96T ( 35%) 3.621T ( 1%) Disks in storage pool: maid4 (Maximum disk size allowed is 466 TB) ...... ------------- -------------------- ------------------- (pool total) 291T 126.5T ( 43%) 562.6G ( 0%) Disks in storage pool: maid5 (Maximum disk size allowed is 466 TB) ...... ------------- -------------------- ------------------- (pool total) 436.6T 120.8T ( 28%) 25.23G ( 0%) Disks in storage pool: maid6 (Maximum disk size allowed is 466 TB) ....... ------------- -------------------- ------------------- (pool total) 582.1T 358.7T ( 62%) 9.458G ( 0%) ============= ==================== =================== (data) 1.535P 698.9T ( 44%) 4.17T ( 0%) (metadata) 262.3T 92.96T ( 35%) 3.621T ( 1%) ============= ==================== =================== (total) 1.535P 699T ( 44%) 4.205T ( 0%) Inode Information ----------------- Number of used inodes: 79607225 Number of free inodes: 82340423 Number of allocated inodes: 161947648 Maximum number of inodes: 1342177280 I have a smaller testing FS with the same setup (with plenty of free space), and the actual sequence of commands that worked for me was: mmchfs fs1 -m1 mmrestripefs fs1 -R mmrestripefs fs1 -b mmchdisk fs1 change -F ~/nsd_metadata_test (dataAndMetadata -> dataOnly) mmrestripefs fs1 -r Could you please evaluate more on the performance overhead with having metadata on SSD+SATA? Are the read operations automatically directed to faster disks by GPFS? Is each write operation waiting for write to be finished by SATA disks? Thank you, -- Miroslav Bauer On 09/06/2016 09:06 PM, Yuri L Volobuev wrote: > > The correct way to accomplish what you're looking for (in particular, > changing the fs-wide level of replication) is mmrestripefs -R. This > command also takes care of moving data off disks now marked metadataOnly. > > The restripe job hits an error trying to move blocks of the inode > file, i.e. before it gets to actual user data blocks. Note that at > this point the metadata replication factor is still 2. This suggests > one of two possibilities: (1) there isn't enough actual free space on > the remaining metadataOnly disks, (2) there isn't enough space in some > failure groups to allocate two replicas. > > All of this assumes you're operating within a single storage pool. If > multiple storage pools are in play, there are other possibilities. > > 'mmdf' output would be helpful in providing more helpful advice. With > the information at hand, I can only suggest trying to accomplish the > task in two phases: (a) deallocated extra metadata replicas, by doing > mmchfs -m 1 + mmrestripefs -R (b) move metadata off SATA disks. I do > want to point out that metadata replication is a highly recommended > insurance policy to have for your file system. As with other kinds of > insurance, you may or may not need it, but if you do end up needing > it, you'll be very glad you have it. The costs, in terms of extra > metadata space and performance overhead, are very reasonable. 
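For readers following the thread, here is the same command sequence annotated step by step (a sketch using the fs1 device and the stanza file named in the thread; it reflects what was reported to work on the small test file system, so adapt and test it before touching production, and keep the caveat above about giving up metadata replication in mind):

# 1. Lower the default metadata replication from 2 to 1:
mmchfs fs1 -m 1

# 2. Rewrite the replication settings on existing files so the extra metadata
#    replicas are deallocated (phase "a" of the two-phase suggestion):
mmrestripefs fs1 -R

# 3. Rebalance blocks across the disks (part of the sequence reported to work
#    on the test file system):
mmrestripefs fs1 -b

# 4. Mark the SATA NSDs as dataOnly via a stanza file, then migrate metadata
#    off them onto the metadataOnly SSDs (phase "b"):
mmchdisk fs1 change -F dataOnly_disks.stanza
mmrestripefs fs1 -r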
> > yuri > > > Miroslav Bauer ---09/01/2016 07:29:06 AM---Yes, failure group id is > exactly what I meant :). Unfortunately, mmrestripefs with -R > > From: Miroslav Bauer > To: gpfsug-discuss at spectrumscale.org, > Date: 09/01/2016 07:29 AM > Subject: Re: [gpfsug-discuss] Migration to separate metadata and data > disks > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------------------------------------------------ > > > > Yes, failure group id is exactly what I meant :). Unfortunately, > mmrestripefs with -R > behaves the same as with -r. I also believed that mmrestripefs -R is the > correct tool for > fixing the replication settings on inodes (according to manpages), but I > will try possible > solutions you and Marc suggested and let you know how it went. > > Thank you, > -- > Miroslav Bauer > > On 09/01/2016 04:02 PM, Aaron Knister wrote: > > Oh! I think you've already provided the info I was looking for :) I > > thought that failGroup=3 meant there were 3 failure groups within the > > SSDs. I suspect that's not at all what you meant and that actually is > > the failure group of all of those disks. That I think explains what's > > going on-- there's only one failure group's worth of metadata-capable > > disks available and as such GPFS can't place the 2nd replica for > > existing files. > > > > Here's what I would suggest: > > > > - Create at least 2 failure groups within the SSDs > > - Put the default metadata replication factor back to 2 > > - Run a restripefs -R to shuffle files around and restore the metadata > > replication factor of 2 to any files created while it was set to 1 > > > > If you're not interested in replication for metadata then perhaps all > > you need to do is the mmrestripefs -R. I think that should > > un-replicate the file from the SATA disks leaving the copy on the SSDs. > > > > Hope that helps. > > > > -Aaron > > > > On 9/1/16 9:39 AM, Aaron Knister wrote: > >> By the way, I suspect the no space on device errors are because GPFS > >> believes for some reason that it is unable to maintain the metadata > >> replication factor of 2 that's likely set on all previously created > >> inodes. > >> > >> On 9/1/16 9:36 AM, Aaron Knister wrote: > >>> I must admit, I'm curious as to the reason you're dropping the > >>> replication factor from 2 down to 1. There are some serious advantages > >>> we've seen to having multiple metadata replicas, as far as error > >>> recovery is concerned. > >>> > >>> Could you paste an output of mmlsdisk for the filesystem? > >>> > >>> -Aaron > >>> > >>> On 9/1/16 9:30 AM, Miroslav Bauer wrote: > >>>> Hello, > >>>> > >>>> I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the > >>>> filesystem metadata from state: > >>>> -m = 2 (default metadata replicas) > >>>> - SATA disks (dataAndMetadata, failGroup=1) > >>>> - SSDs (metadataOnly, failGroup=3) > >>>> to the desired state: > >>>> -m = 1 > >>>> - SATA disks (dataOnly, failGroup=1) > >>>> - SSDs (metadataOnly, failGroup=3) > >>>> > >>>> I have done the following steps in the following order: > >>>> 1) change SATA disks to dataOnly (stanza file modifies the 'usage' > >>>> attribute only): > >>>> # mmchdisk fs1 change -F dataOnly_disks.stanza > >>>> Attention: Disk parameters were changed. > >>>> Use the mmrestripefs command with the -r option to relocate > data and > >>>> metadata. > >>>> Verifying file system configuration information ... > >>>> mmchdisk: Propagating the cluster configuration data to all > >>>> affected nodes. 
This is an asynchronous process. > >>>> > >>>> 2) change default metadata replicas number 2->1 > >>>> # mmchfs fs1 -m 1 > >>>> > >>>> 3) run mmrestripefs as suggested by output of 1) > >>>> # mmrestripefs fs1 -r > >>>> Scanning file system metadata, phase 1 ... > >>>> Error processing inodes. > >>>> No space left on device > >>>> mmrestripefs: Command failed. Examine previous error messages to > >>>> determine cause. > >>>> > >>>> It is, however, still possible to create new files on the filesystem. > >>>> When I return one of the SATA disks as a dataAndMetadata disk, the > >>>> mmrestripefs > >>>> command stops complaining about No space left on device. Both df and > >>>> mmdf > >>>> say that there is enough space both for data (SATA) and metadata > >>>> (SSDs). > >>>> Does anyone have an idea why is it complaining? > >>>> > >>>> Thanks, > >>>> > >>>> -- > >>>> Miroslav Bauer > >>>> > >>>> > >>>> > >>>> > >>>> _______________________________________________ > >>>> gpfsug-discuss mailing list > >>>> gpfsug-discuss at spectrumscale.org > >>>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > >>>> > >>> > >> > > > > > [attachment "smime.p7s" deleted by Yuri L Volobuev/Austin/IBM] > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 3716 bytes Desc: S/MIME Cryptographic Signature URL: From S.J.Thompson at bham.ac.uk Wed Sep 7 13:36:48 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Wed, 7 Sep 2016 12:36:48 +0000 Subject: [gpfsug-discuss] Remote cluster mount failing Message-ID: Hi All, I'm trying to get some multi cluster thing working between two of our GPFS clusters. In the "client" cluster, when trying to mount the "remote" cluster, I get: # mmmount gpfs Wed 7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ... mount: mount /dev/gpfs on /gpfs failed: Connection timed out mmmount: Command failed. Examine previous error messages to determine cause. And in the log file: Wed Sep 7 13:33:07.481 2016: [N] The client side TLS handshake with node 10.0.0.182 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.486 2016: [N] The client side TLS handshake with node 10.0.0.181 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.487 2016: [E] Failed to join remote cluster GPFS_STORAGE.CLUSTER Wed Sep 7 13:33:07.488 2016: [W] Command: err 78: mount GPFS_STORAGE.CLUSTER:gpfs Wed Sep 7 13:33:07.489 2016: Connection timed out In the remote cluster, I see: Wed Sep 7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222 failed with error 447 (server side). Wed Sep 7 13:33:07.488 2016: [X] Connection from 10.10.0.35 refused, authentication failed Wed Sep 7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err 703 Wed Sep 7 13:33:07.490 2016: Operation not permitted Weirdly though on other nodes in the client cluster this succeeds fine and can mount, so I think I got all the bits in the mmauth and mmremotecluster configured correctly. Any suggestions? 
Thanks Simon From volobuev at us.ibm.com Wed Sep 7 17:38:03 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Wed, 7 Sep 2016 09:38:03 -0700 Subject: [gpfsug-discuss] Migration to separate metadata and data disks In-Reply-To: References: <7927f34a-28e5-6fc2-a55d-62b2066a08da@cesnet.cz><2ce7334b-28c1-7a14-814a-fbcf99d8049e@nasa.gov><505ff859-d49a-04cc-bd9d-50f7b2a8df0b@cesnet.cz> Message-ID: Hi Miroslav, The mmdf output is very helpful. It strongly suggests what the problem is: > ssd_5_5 80G 3 Yes No 22.31G ( 28%) 7.155G ( 9%) > ssd_4_4 80G 3 Yes No 22.21G ( 28%) 7.196G ( 9%) > ssd_3_3 80G 3 Yes No 22.2G ( 28%) 7.239G ( 9%) > ssd_2_2 80G 3 Yes No 22.24G ( 28%) 7.146G ( 9%) > ssd_1_1 80G 3 Yes No 22.29G ( 28%) 7.134G ( 9%) >... > ==================== =================== > (data) 1.535P 698.9T ( 44%) 4.17T ( 0%) > (metadata) 262.3T 92.96T ( 35%) 3.621T ( 1%) >... > Number of allocated inodes: 161947648 > Maximum number of inodes: 1342177280 You have 5 80G SSDs. That's not enough. Even with metadata spread across a couple dozen more SATA disks, the SSDs are over 3/4 full. There's no way to accurately estimate the amount of metadata in this file system with the data at hand, but if we (very conservatively) assume that each SATA disk holds only as much metadata as each SSD, i.e. ~57G, that would greatly exceed the amount of free space available on your SSDs. You need more free metadata space. Another way to look at this: you have 1.5PB of data under management. A reasonable rule-of-thumb estimate for the amount of metadata is 1-2% of the data (this is a typical ratio, but of course every file system is different, and large deviations are possible; a degenerate case is an fs containing nothing but directories, where metadata usage is 100%). So you have to have at least a few TB of metadata storage: at the 1-2% ratio, 1.5PB of data implies roughly 15-30TB of metadata, while your five 80G SSDs add up to only 400G. 5 80G SSDs aren't enough for an fs of this size. > Could you please evaluate more on the performance overhead with > having metadata > on SSD+SATA? Are the read operations automatically directed to > faster disks by GPFS? > Is each write operation waiting for write to be finished by SATA disks? Mixing disks with sharply different performance characteristics within a single storage pool is detrimental to performance. GPFS stripes blocks across all disks in a storage pool, expecting all of them to be equally suitable. If SSDs are mixed with SATA disks, the overall metadata write performance is going to be bottlenecked by the SATA drives. On reads, given a choice of two replicas, GPFS V4.1.1+ picks the replica residing on the fastest disk, but given that SSDs represent only a small fraction of your total metadata usage, this likely doesn't help a whole lot. You're on the right track in trying to shift all metadata to SSDs and away from SATA; the overall file system performance will improve as a result. yuri > > Thank you, > -- > Miroslav Bauer > On 09/06/2016 09:06 PM, Yuri L Volobuev wrote: > The correct way to accomplish what you're looking for (in > particular, changing the fs-wide level of replication) is > mmrestripefs -R. This command also takes care of moving data off > disks now marked metadataOnly.
> > The restripe job hits an error trying to move blocks of the inode > file, i.e. before it gets to actual user data blocks. Note that at > this point the metadata replication factor is still 2. This suggests > one of two possibilities: (1) there isn't enough actual free space > on the remaining metadataOnly disks, (2) there isn't enough space in > some failure groups to allocate two replicas. > > All of this assumes you're operating within a single storage pool. > If multiple storage pools are in play, there are other possibilities. > > 'mmdf' output would be helpful in providing more helpful advice. > With the information at hand, I can only suggest trying to > accomplish the task in two phases: (a) deallocated extra metadata > replicas, by doing mmchfs -m 1 + mmrestripefs -R (b) move metadata > off SATA disks. I do want to point out that metadata replication is > a highly recommended insurance policy to have for your file system. > As with other kinds of insurance, you may or may not need it, but if > you do end up needing it, you'll be very glad you have it. The > costs, in terms of extra metadata space and performance overhead, > are very reasonable. > > yuri > > > Miroslav Bauer ---09/01/2016 07:29:06 AM---Yes, failure group id is > exactly what I meant :). Unfortunately, mmrestripefs with -R > > From: Miroslav Bauer > To: gpfsug-discuss at spectrumscale.org, > Date: 09/01/2016 07:29 AM > Subject: Re: [gpfsug-discuss] Migration to separate metadata and data disks > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Yes, failure group id is exactly what I meant :). Unfortunately, > mmrestripefs with -R > behaves the same as with -r. I also believed that mmrestripefs -R is the > correct tool for > fixing the replication settings on inodes (according to manpages), but I > will try possible > solutions you and Marc suggested and let you know how it went. > > Thank you, > -- > Miroslav Bauer > > On 09/01/2016 04:02 PM, Aaron Knister wrote: > > Oh! I think you've already provided the info I was looking for :) I > > thought that failGroup=3 meant there were 3 failure groups within the > > SSDs. I suspect that's not at all what you meant and that actually is > > the failure group of all of those disks. That I think explains what's > > going on-- there's only one failure group's worth of metadata-capable > > disks available and as such GPFS can't place the 2nd replica for > > existing files. > > > > Here's what I would suggest: > > > > - Create at least 2 failure groups within the SSDs > > - Put the default metadata replication factor back to 2 > > - Run a restripefs -R to shuffle files around and restore the metadata > > replication factor of 2 to any files created while it was set to 1 > > > > If you're not interested in replication for metadata then perhaps all > > you need to do is the mmrestripefs -R. I think that should > > un-replicate the file from the SATA disks leaving the copy on the SSDs. > > > > Hope that helps. > > > > -Aaron > > > > On 9/1/16 9:39 AM, Aaron Knister wrote: > >> By the way, I suspect the no space on device errors are because GPFS > >> believes for some reason that it is unable to maintain the metadata > >> replication factor of 2 that's likely set on all previously created > >> inodes. > >> > >> On 9/1/16 9:36 AM, Aaron Knister wrote: > >>> I must admit, I'm curious as to the reason you're dropping the > >>> replication factor from 2 down to 1. 
There are some serious advantages > >>> we've seen to having multiple metadata replicas, as far as error > >>> recovery is concerned. > >>> > >>> Could you paste an output of mmlsdisk for the filesystem? > >>> > >>> -Aaron > >>> > >>> On 9/1/16 9:30 AM, Miroslav Bauer wrote: > >>>> Hello, > >>>> > >>>> I have a GPFS 3.5 filesystem (fs1) and I'm trying to migrate the > >>>> filesystem metadata from state: > >>>> -m = 2 (default metadata replicas) > >>>> - SATA disks (dataAndMetadata, failGroup=1) > >>>> - SSDs (metadataOnly, failGroup=3) > >>>> to the desired state: > >>>> -m = 1 > >>>> - SATA disks (dataOnly, failGroup=1) > >>>> - SSDs (metadataOnly, failGroup=3) > >>>> > >>>> I have done the following steps in the following order: > >>>> 1) change SATA disks to dataOnly (stanza file modifies the 'usage' > >>>> attribute only): > >>>> # mmchdisk fs1 change -F dataOnly_disks.stanza > >>>> Attention: Disk parameters were changed. > >>>> ? Use the mmrestripefs command with the -r option to relocate data and > >>>> metadata. > >>>> Verifying file system configuration information ... > >>>> mmchdisk: Propagating the cluster configuration data to all > >>>> ? affected nodes. ?This is an asynchronous process. > >>>> > >>>> 2) change default metadata replicas number 2->1 > >>>> # mmchfs fs1 -m 1 > >>>> > >>>> 3) run mmrestripefs as suggested by output of 1) > >>>> # mmrestripefs fs1 -r > >>>> Scanning file system metadata, phase 1 ... > >>>> Error processing inodes. > >>>> No space left on device > >>>> mmrestripefs: Command failed. ?Examine previous error messages to > >>>> determine cause. > >>>> > >>>> It is, however, still possible to create new files on the filesystem. > >>>> When I return one of the SATA disks as a dataAndMetadata disk, the > >>>> mmrestripefs > >>>> command stops complaining about No space left on device. Both df and > >>>> mmdf > >>>> say that there is enough space both for data (SATA) and metadata > >>>> (SSDs). > >>>> Does anyone have an idea why is it complaining? > >>>> > >>>> Thanks, > >>>> > >>>> -- > >>>> Miroslav Bauer > >>>> > >>>> > >>>> > >>>> > >>>> _______________________________________________ > >>>> gpfsug-discuss mailing list > >>>> gpfsug-discuss at spectrumscale.org > >>>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > >>>> > >>> > >> > > > > > [attachment "smime.p7s" deleted by Yuri L Volobuev/Austin/IBM] > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > [attachment "smime.p7s" deleted by Yuri L Volobuev/Austin/IBM] > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Wed Sep 7 17:58:07 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Wed, 7 Sep 2016 09:58:07 -0700 Subject: [gpfsug-discuss] Remote cluster mount failing In-Reply-To: References: Message-ID: It's unclear what's wrong. I'd have two main suspects: (1) TLS protocol version confusion, due to a difference in GSKit version and/or configuration (e.g. NIST SP800 compliance) on two sides (2) firewall. TLS issues are usually messy and tedious to work though. 
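A few quick sanity checks on both clusters sometimes expose the mismatch before any deep debugging; treat the following as rough suggestions rather than an official procedure, and adjust names to your environment:

# mmauth show all
# mmlsconfig nistCompliance

The cipher list and key SHA digest each cluster reports for the other should line up, and the nistCompliance setting (on releases that have it) should be compatible on both sides. It is also worth confirming that the failing client node can actually reach the storage cluster's contact nodes on the GPFS daemon port, 1191/tcp by default, since a partially open firewall can produce exactly this pattern of some nodes mounting fine while others are refused.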
I'd recommend opening a PMR to facilitate debug data collection and analysis. A lot of gory detail may be needed to figure out what's going on. yuri From: "Simon Thompson (Research Computing - IT Services)" To: "gpfsug-discuss at spectrumscale.org" , Date: 09/07/2016 05:37 AM Subject: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi All, I'm trying to get some multi cluster thing working between two of our GPFS clusters. In the "client" cluster, when trying to mount the "remote" cluster, I get: # mmmount gpfs Wed 7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ... mount: mount /dev/gpfs on /gpfs failed: Connection timed out mmmount: Command failed. Examine previous error messages to determine cause. And in the log file: Wed Sep 7 13:33:07.481 2016: [N] The client side TLS handshake with node 10.0.0.182 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.486 2016: [N] The client side TLS handshake with node 10.0.0.181 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.487 2016: [E] Failed to join remote cluster GPFS_STORAGE.CLUSTER Wed Sep 7 13:33:07.488 2016: [W] Command: err 78: mount GPFS_STORAGE.CLUSTER:gpfs Wed Sep 7 13:33:07.489 2016: Connection timed out In the remote cluster, I see: Wed Sep 7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222 failed with error 447 (server side). Wed Sep 7 13:33:07.488 2016: [X] Connection from 10.10.0.35 refused, authentication failed Wed Sep 7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err 703 Wed Sep 7 13:33:07.490 2016: Operation not permitted Weirdly though on other nodes in the client cluster this succeeds fine and can mount, so I think I got all the bits in the mmauth and mmremotecluster configured correctly. Any suggestions? Thanks Simon _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From Valdis.Kletnieks at vt.edu Wed Sep 7 19:45:43 2016 From: Valdis.Kletnieks at vt.edu (Valdis Kletnieks) Date: Wed, 07 Sep 2016 14:45:43 -0400 Subject: [gpfsug-discuss] Weirdness with 'mmces address add' Message-ID: <27691.1473273943@turing-police.cc.vt.edu> We're in the middle of deploying Spectrum Archive, and I've hit a snag. We assigned some floating IP addresses, which now need to be changed. So I look at the mmces manpage, and it looks like I need to add the new addresses, and delete the old ones. We're on GPFS 4.2.1.0, if that matters... What 'man mmces' says: 1. To add an address to a specified node, issue this command: mmces address add --ces-node node1 --ces-ip 10.1.2.3 (and at least 6 or 8 more uses of an IP address). What happens when I try it: (And yes, we have an 'isb' ces-group defined with addresses in it already) # mmces address add --ces-group isb --ces-ip 172.28.45.72 Cannot resolve 172.28.45.72; Name or service not known mmces address add: Incorrect value for --ces-ip option Usage: mmces address add [--ces-node Node] [--attribute Attribute] [--ces-group Group] {--ces-ip {IP[,IP...]} Am I missing some special sauce? 
(My first guess is that it's complaining because there's no PTR in the DNS for that address yet - but if it was going to do DNS lookups, it should be valid to give a hostname rather than an IP address (and nowhere in the manpage does it even *hint* that --ces-ip can be anything other than a list of IP addresses). Or is it time for me to file a PMR? From xhejtman at ics.muni.cz Wed Sep 7 21:11:11 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Wed, 7 Sep 2016 22:11:11 +0200 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: References: Message-ID: <20160907201111.xmksazqjekk2ihsy@ics.muni.cz> On Tue, Sep 06, 2016 at 02:04:36PM +0200, Dominic Mueller-Wicke01 wrote: > Hi Miroslav, > > please use the command: > dsmrecall -resident -detail > or use it with file lists well, it looks like Client Version 7, Release 1, Level 4.4 leaks file descriptors: 09/07/2016 21:03:07 ANS1587W Unable to read extended attributes for object /exports/tape_tape/VO_metacentrum/home/jfeit/atlases/atlases/novo3/atlases/images/.svn/prop-base due to errno: 24, reason: Too many open files after about 15 minutes of run, I can see 88 opened files in /proc/$PID/fd when using: dsmrecall -R -RESid -D /path/* is it something known fixed in newer versions? -- Luk?? Hejtm?nek From taylorm at us.ibm.com Wed Sep 7 21:40:13 2016 From: taylorm at us.ibm.com (Michael L Taylor) Date: Wed, 7 Sep 2016 13:40:13 -0700 Subject: [gpfsug-discuss] Weirdness with 'mmces address add' In-Reply-To: References: Message-ID: Can't be for certain this is what you're hitting but reverse DNS lookup is documented the KC: http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1ins_protocolnodeipfurtherconfig.htm Note: All CES IPs must have an associated hostname and reverse DNS lookup must be configured for each. For more information, see Adding export IPs in Deploying protocols. http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1ins_deployingprotocolstasks.htm Note: Export IPs must have an associated hostname and reverse DNS lookup must be configured for each. Can you make sure the IPs have reverse DNS lookup and try again? Will get the mmces man page updated for address add -------------- next part -------------- An HTML attachment was scrubbed... URL: From Valdis.Kletnieks at vt.edu Wed Sep 7 22:23:30 2016 From: Valdis.Kletnieks at vt.edu (Valdis.Kletnieks at vt.edu) Date: Wed, 07 Sep 2016 17:23:30 -0400 Subject: [gpfsug-discuss] Weirdness with 'mmces address add' In-Reply-To: References: Message-ID: <41089.1473283410@turing-police.cc.vt.edu> On Wed, 07 Sep 2016 13:40:13 -0700, "Michael L Taylor" said: > Can't be for certain this is what you're hitting but reverse DNS lookup is > documented the KC: > Note: All CES IPs must have an associated hostname and reverse DNS lookup > must be configured for each. For more information, see Adding export IPs in > Deploying protocols. Bingo. That was it. Since the DNS will take a while to fix, I fed the appropriate entries to /etc/hosts and it worked fine. I got thrown for a loop because if there is enough code to do that checking, it should be able to accept a hostname as well (RFE time? 
:) From ulmer at ulmer.org Wed Sep 7 22:34:07 2016 From: ulmer at ulmer.org (Stephen Ulmer) Date: Wed, 7 Sep 2016 17:34:07 -0400 Subject: [gpfsug-discuss] Weirdness with 'mmces address add' In-Reply-To: <41089.1473283410@turing-police.cc.vt.edu> References: <41089.1473283410@turing-police.cc.vt.edu> Message-ID: Hostnames can have many A records. IPs *generally* only have one PTR (though it?s not restricted, multiple PTRs is not recommended). Just knowing that you can see why allowing names would create more questions than it answers. So if it did take names instead of IP addresses, it would usually only do what you meant part of the time -- and sometimes none of the time. :) -- Stephen > On Sep 7, 2016, at 5:23 PM, Valdis.Kletnieks at vt.edu wrote: > > On Wed, 07 Sep 2016 13:40:13 -0700, "Michael L Taylor" said: > >> Can't be for certain this is what you're hitting but reverse DNS lookup is >> documented the KC: > >> Note: All CES IPs must have an associated hostname and reverse DNS lookup >> must be configured for each. For more information, see Adding export IPs in >> Deploying protocols. > > Bingo. That was it. Since the DNS will take a while to fix, I fed > the appropriate entries to /etc/hosts and it worked fine. > > I got thrown for a loop because if there is enough code to do that checking, > it should be able to accept a hostname as well (RFE time? :) > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss From Valdis.Kletnieks at vt.edu Wed Sep 7 22:54:05 2016 From: Valdis.Kletnieks at vt.edu (Valdis.Kletnieks at vt.edu) Date: Wed, 07 Sep 2016 17:54:05 -0400 Subject: [gpfsug-discuss] Weirdness with 'mmces address add' In-Reply-To: References: <41089.1473283410@turing-police.cc.vt.edu> Message-ID: <43934.1473285245@turing-police.cc.vt.edu> On Wed, 07 Sep 2016 17:34:07 -0400, Stephen Ulmer said: > Hostnames can have many A records. And quad-A records. :) (Despite our best efforts, we're still one of the 100 biggest IPv6 deployments according to http://www.worldipv6launch.org/measurements/ - were's sitting at 84th in traffic volume and 18th by percent penetration, mostly because we deployed it in production literally last century...) From janfrode at tanso.net Thu Sep 8 06:08:47 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Thu, 08 Sep 2016 05:08:47 +0000 Subject: [gpfsug-discuss] Weirdness with 'mmces address add' In-Reply-To: <27691.1473273943@turing-police.cc.vt.edu> References: <27691.1473273943@turing-police.cc.vt.edu> Message-ID: I believe your first guess is correct. The ces-ip needs to be resolvable for some reason... Just put a name for it in /etc/hosts, if you can't add it to your dns. -jf ons. 7. sep. 2016 kl. 20.45 skrev Valdis Kletnieks : > We're in the middle of deploying Spectrum Archive, and I've hit a > snag. We assigned some floating IP addresses, which now need to > be changed. So I look at the mmces manpage, and it looks like I need > to add the new addresses, and delete the old ones. > > We're on GPFS 4.2.1.0, if that matters... > > What 'man mmces' says: > > 1. To add an address to a specified node, issue this command: > > mmces address add --ces-node node1 --ces-ip 10.1.2.3 > > (and at least 6 or 8 more uses of an IP address). 
> > What happens when I try it: (And yes, we have an 'isb' ces-group defined > with > addresses in it already) > > # mmces address add --ces-group isb --ces-ip 172.28.45.72 > Cannot resolve 172.28.45.72; Name or service not known > mmces address add: Incorrect value for --ces-ip option > Usage: > mmces address add [--ces-node Node] [--attribute Attribute] [--ces-group > Group] > {--ces-ip {IP[,IP...]} > > Am I missing some special sauce? (My first guess is that it's complaining > because there's no PTR in the DNS for that address yet - but if it was > going > to do DNS lookups, it should be valid to give a hostname rather than an IP > address (and nowhere in the manpage does it even *hint* that --ces-ip can > be anything other than a list of IP addresses). > > Or is it time for me to file a PMR? > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From dominic.mueller at de.ibm.com Thu Sep 8 06:35:55 2016 From: dominic.mueller at de.ibm.com (Dominic Mueller-Wicke01) Date: Thu, 8 Sep 2016 07:35:55 +0200 Subject: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state In-Reply-To: References: Message-ID: Please open a PMR for the not working "recall to resident". Some investigation is needed here. Thanks. Greetings, Dominic. From: gpfsug-discuss-request at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Date: 07.09.2016 23:23 Subject: gpfsug-discuss Digest, Vol 56, Issue 14 Sent by: gpfsug-discuss-bounces at spectrumscale.org Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: Remote cluster mount failing (Yuri L Volobuev) 2. Weirdness with 'mmces address add' (Valdis Kletnieks) 3. Re: DMAPI - Unmigrate file to Regular state (Lukas Hejtmanek) 4. Weirdness with 'mmces address add' (Michael L Taylor) 5. Re: Weirdness with 'mmces address add' (Valdis.Kletnieks at vt.edu) ----- Message from "Yuri L Volobuev" on Wed, 7 Sep 2016 09:58:07 -0700 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Remote cluster mount failing It's unclear what's wrong. I'd have two main suspects: (1) TLS protocol version confusion, due to a difference in GSKit version and/or configuration (e.g. NIST SP800 compliance) on two sides (2) firewall. TLS issues are usually messy and tedious to work though. I'd recommend opening a PMR to facilitate debug data collection and analysis. A lot of gory detail may be needed to figure out what's going on. 
yuri Inactive hide details for "Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to"Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to get some multi cluster thing working between two of our GPFS From: "Simon Thompson (Research Computing - IT Services)" To: "gpfsug-discuss at spectrumscale.org" , Date: 09/07/2016 05:37 AM Subject: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi All, I'm trying to get some multi cluster thing working between two of our GPFS clusters. In the "client" cluster, when trying to mount the "remote" cluster, I get: # mmmount gpfs Wed 7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ... mount: mount /dev/gpfs on /gpfs failed: Connection timed out mmmount: Command failed. Examine previous error messages to determine cause. And in the log file: Wed Sep 7 13:33:07.481 2016: [N] The client side TLS handshake with node 10.0.0.182 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.486 2016: [N] The client side TLS handshake with node 10.0.0.181 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.487 2016: [E] Failed to join remote cluster GPFS_STORAGE.CLUSTER Wed Sep 7 13:33:07.488 2016: [W] Command: err 78: mount GPFS_STORAGE.CLUSTER:gpfs Wed Sep 7 13:33:07.489 2016: Connection timed out In the remote cluster, I see: Wed Sep 7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222 failed with error 447 (server side). Wed Sep 7 13:33:07.488 2016: [X] Connection from 10.10.0.35 refused, authentication failed Wed Sep 7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err 703 Wed Sep 7 13:33:07.490 2016: Operation not permitted Weirdly though on other nodes in the client cluster this succeeds fine and can mount, so I think I got all the bits in the mmauth and mmremotecluster configured correctly. Any suggestions? Thanks Simon _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ----- Message from Valdis Kletnieks on Wed, 07 Sep 2016 14:45:43 -0400 ----- To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] Weirdness with 'mmces address add' We're in the middle of deploying Spectrum Archive, and I've hit a snag. We assigned some floating IP addresses, which now need to be changed. So I look at the mmces manpage, and it looks like I need to add the new addresses, and delete the old ones. We're on GPFS 4.2.1.0, if that matters... What 'man mmces' says: 1. To add an address to a specified node, issue this command: mmces address add --ces-node node1 --ces-ip 10.1.2.3 (and at least 6 or 8 more uses of an IP address). What happens when I try it: (And yes, we have an 'isb' ces-group defined with addresses in it already) # mmces address add --ces-group isb --ces-ip 172.28.45.72 Cannot resolve 172.28.45.72; Name or service not known mmces address add: Incorrect value for --ces-ip option Usage: mmces address add [--ces-node Node] [--attribute Attribute] [--ces-group Group] {--ces-ip {IP[,IP...]} Am I missing some special sauce? (My first guess is that it's complaining because there's no PTR in the DNS for that address yet - but if it was going to do DNS lookups, it should be valid to give a hostname rather than an IP address (and nowhere in the manpage does it even *hint* that --ces-ip can be anything other than a list of IP addresses). 
Or is it time for me to file a PMR? ----- Message from Lukas Hejtmanek on Wed, 7 Sep 2016 22:11:11 +0200 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] DMAPI - Unmigrate file to Regular state On Tue, Sep 06, 2016 at 02:04:36PM +0200, Dominic Mueller-Wicke01 wrote: > Hi Miroslav, > > please use the command: > dsmrecall -resident -detail > or use it with file lists well, it looks like Client Version 7, Release 1, Level 4.4 leaks file descriptors: 09/07/2016 21:03:07 ANS1587W Unable to read extended attributes for object /exports/tape_tape/VO_metacentrum/home/jfeit/atlases/atlases/novo3/atlases/images/.svn/prop-base due to errno: 24, reason: Too many open files after about 15 minutes of run, I can see 88 opened files in /proc/$PID/fd when using: dsmrecall -R -RESid -D /path/* is it something known fixed in newer versions? -- Luk?? Hejtm?nek ----- Message from "Michael L Taylor" on Wed, 7 Sep 2016 13:40:13 -0700 ----- To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] Weirdness with 'mmces address add' Can't be for certain this is what you're hitting but reverse DNS lookup is documented the KC: http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1ins_protocolnodeipfurtherconfig.htm Note: All CES IPs must have an associated hostname and reverse DNS lookup must be configured for each. For more information, see Adding export IPs in Deploying protocols. http://www.ibm.com/support/knowledgecenter/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1ins_deployingprotocolstasks.htm Note: Export IPs must have an associated hostname and reverse DNS lookup must be configured for each. Can you make sure the IPs have reverse DNS lookup and try again? Will get the mmces man page updated for address add ----- Message from Valdis.Kletnieks at vt.edu on Wed, 07 Sep 2016 17:23:30 -0400 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Weirdness with 'mmces address add' On Wed, 07 Sep 2016 13:40:13 -0700, "Michael L Taylor" said: > Can't be for certain this is what you're hitting but reverse DNS lookup is > documented the KC: > Note: All CES IPs must have an associated hostname and reverse DNS lookup > must be configured for each. For more information, see Adding export IPs in > Deploying protocols. Bingo. That was it. Since the DNS will take a while to fix, I fed the appropriate entries to /etc/hosts and it worked fine. I got thrown for a loop because if there is enough code to do that checking, it should be able to accept a hostname as well (RFE time? :) _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From S.J.Thompson at bham.ac.uk Fri Sep 9 15:37:28 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Fri, 9 Sep 2016 14:37:28 +0000 Subject: [gpfsug-discuss] Remote cluster mount failing In-Reply-To: References: Message-ID: That?s sorta what I was expecting. Though I was hoping someone might have said 'oh just run mmchconfig ....' or something easy. PMR on its way in. Thanks! 
Simon From: > on behalf of Yuri L Volobuev > Reply-To: "gpfsug-discuss at spectrumscale.org" > Date: Wednesday, 7 September 2016 at 17:58 To: "gpfsug-discuss at spectrumscale.org" > Subject: Re: [gpfsug-discuss] Remote cluster mount failing It's unclear what's wrong. I'd have two main suspects: (1) TLS protocol version confusion, due to a difference in GSKit version and/or configuration (e.g. NIST SP800 compliance) on two sides (2) firewall. TLS issues are usually messy and tedious to work though. I'd recommend opening a PMR to facilitate debug data collection and analysis. A lot of gory detail may be needed to figure out what's going on. yuri [Inactive hide details for "Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to]"Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to get some multi cluster thing working between two of our GPFS From: "Simon Thompson (Research Computing - IT Services)" > To: "gpfsug-discuss at spectrumscale.org" >, Date: 09/07/2016 05:37 AM Subject: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi All, I'm trying to get some multi cluster thing working between two of our GPFS clusters. In the "client" cluster, when trying to mount the "remote" cluster, I get: # mmmount gpfs Wed 7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ... mount: mount /dev/gpfs on /gpfs failed: Connection timed out mmmount: Command failed. Examine previous error messages to determine cause. And in the log file: Wed Sep 7 13:33:07.481 2016: [N] The client side TLS handshake with node 10.0.0.182 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.486 2016: [N] The client side TLS handshake with node 10.0.0.181 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.487 2016: [E] Failed to join remote cluster GPFS_STORAGE.CLUSTER Wed Sep 7 13:33:07.488 2016: [W] Command: err 78: mount GPFS_STORAGE.CLUSTER:gpfs Wed Sep 7 13:33:07.489 2016: Connection timed out In the remote cluster, I see: Wed Sep 7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222 failed with error 447 (server side). Wed Sep 7 13:33:07.488 2016: [X] Connection from 10.10.0.35 refused, authentication failed Wed Sep 7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err 703 Wed Sep 7 13:33:07.490 2016: Operation not permitted Weirdly though on other nodes in the client cluster this succeeds fine and can mount, so I think I got all the bits in the mmauth and mmremotecluster configured correctly. Any suggestions? Thanks Simon _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: graycol.gif URL: From volobuev at us.ibm.com Fri Sep 9 17:29:35 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Fri, 9 Sep 2016 09:29:35 -0700 Subject: [gpfsug-discuss] Remote cluster mount failing In-Reply-To: References: Message-ID: It could be "easy" in the end, e.g. regenerating the key ("mmauth genkey new") may fix the issue. 
Figuring out exactly what is going wrong is messy though, and requires looking at a number of debug data points, something that's awkward to do on a public mailing list. I don't think you want to post certificates et al on a mailing list. The PMR channel is more appropriate for this kind of thing. yuri From: "Simon Thompson (Research Computing - IT Services)" To: gpfsug main discussion list , Date: 09/09/2016 07:37 AM Subject: Re: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org That?s sorta what I was expecting. Though I was hoping someone might have said 'oh just run mmchconfig ....' or something easy. PMR on its way in. Thanks! Simon From: on behalf of Yuri L Volobuev Reply-To: "gpfsug-discuss at spectrumscale.org" < gpfsug-discuss at spectrumscale.org> Date: Wednesday, 7 September 2016 at 17:58 To: "gpfsug-discuss at spectrumscale.org" Subject: Re: [gpfsug-discuss] Remote cluster mount failing It's unclear what's wrong. I'd have two main suspects: (1) TLS protocol version confusion, due to a difference in GSKit version and/or configuration (e.g. NIST SP800 compliance) on two sides (2) firewall. TLS issues are usually messy and tedious to work though. I'd recommend opening a PMR to facilitate debug data collection and analysis. A lot of gory detail may be needed to figure out what's going on. yuri Inactive hide details for "Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to"Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to get some multi cluster thing working between two of our GPFS From: "Simon Thompson (Research Computing - IT Services)" < S.J.Thompson at bham.ac.uk> To: "gpfsug-discuss at spectrumscale.org" , Date: 09/07/2016 05:37 AM Subject: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi All, I'm trying to get some multi cluster thing working between two of our GPFS clusters. In the "client" cluster, when trying to mount the "remote" cluster, I get: # mmmount gpfs Wed 7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ... mount: mount /dev/gpfs on /gpfs failed: Connection timed out mmmount: Command failed. Examine previous error messages to determine cause. And in the log file: Wed Sep 7 13:33:07.481 2016: [N] The client side TLS handshake with node 10.0.0.182 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.486 2016: [N] The client side TLS handshake with node 10.0.0.181 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.487 2016: [E] Failed to join remote cluster GPFS_STORAGE.CLUSTER Wed Sep 7 13:33:07.488 2016: [W] Command: err 78: mount GPFS_STORAGE.CLUSTER:gpfs Wed Sep 7 13:33:07.489 2016: Connection timed out In the remote cluster, I see: Wed Sep 7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222 failed with error 447 (server side). Wed Sep 7 13:33:07.488 2016: [X] Connection from 10.10.0.35 refused, authentication failed Wed Sep 7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err 703 Wed Sep 7 13:33:07.490 2016: Operation not permitted Weirdly though on other nodes in the client cluster this succeeds fine and can mount, so I think I got all the bits in the mmauth and mmremotecluster configured correctly. Any suggestions? 
Thanks Simon _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss [attachment "graycol.gif" deleted by Yuri L Volobuev/Austin/IBM] -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From bbanister at jumptrading.com Sat Sep 10 22:50:25 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Sat, 10 Sep 2016 21:50:25 +0000 Subject: [gpfsug-discuss] Edge Attendees In-Reply-To: References: Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB063297AB@CHI-EXCHANGEW1.w2k.jumptrading.com> Hi Doug, Found out that I get to attend this year. Please put me down for the SS NDA round-table, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Douglas O'flaherty Sent: Monday, August 29, 2016 12:34 AM To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] Edge Attendees Greetings: I am organizing an NDA round-table with the IBM Offering Managers at IBM Edge on Tuesday, September 20th at 1pm. The subject will be "The Future of IBM Spectrum Scale." IBM Offering Managers are the Product Owners at IBM. There will be discussions covering licensing, the roadmap for IBM Spectrum Scale RAID (aka GNR), new hardware platforms, etc. This is a unique opportunity to get feedback to the drivers of the IBM Spectrum Scale business plans. It should be a great companion to the content we get from Engineering and Research at most User Group meetings. To get an invitation, please email me privately at douglasof us.ibm.com. All who have a valid NDA are invited. I only need an approximate headcount of attendees. Try not to spam the mailing list. I am pushing to get the Offering Managers to have a similar session at SC16 as an IBM Multi-client Briefing. You can add your voice to that call on this thread, or email me directly. Spectrum Scale User Group at SC16 will once again take place on Sunday afternoon with cocktails to follow. I hope we can blow out the attendance numbers and the number of site speakers we had last year! I know Simon, Bob, and Kristy are already working the agenda. Get your ideas in to them or to me. See you in Vegas, Vegas, SLC, Vegas this Fall... Maybe Australia in between? doug Douglas O'Flaherty IBM Spectrum Storage Marketing ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. 
-------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Sun Sep 11 22:02:48 2016 From: aaron.s.knister at nasa.gov (Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]) Date: Sun, 11 Sep 2016 21:02:48 +0000 Subject: [gpfsug-discuss] GPFS Routers Message-ID: <5F910253243E6A47B81A9A2EB424BBA101D137D1@NDMSMBX404.ndc.nasa.gov> Hi Everyone, A while back I seem to recall hearing about a mechanism being developed that would function similarly to Lustre's LNET routers and effectively allow a single set of NSD servers to talk to multiple RDMA fabrics without requiring the NSD servers to have infiniband interfaces on each RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each fabric that would in effect proxy the RDMA requests to the NSD server. Does anyone know what I'm talking about? Just curious if it's still on the roadmap. -Aaron -------------- next part -------------- An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Sun Sep 11 23:31:56 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Sun, 11 Sep 2016 22:31:56 +0000 Subject: [gpfsug-discuss] Grafana Bridge Code - for GPFS Performance Sensors - Now on the IBM Wiki Message-ID: <2B003708-B2E3-474B-8035-F3A080CB2EAF@nuance.com> IBM has formally published this bridge code - and you can get the details and download it here: IBM Spectrum Scale Performance Monitoring Bridge https://www.ibm.com/developerworks/community/wikis/home?lang=en#!/wiki/General%20Parallel%20File%20System%20(GPFS)/page/IBM%20Spectrum%20Scale%20Performance Monitoring%20Bridge Also, see this Storage Community Blog Post (it references version 4.2.2, but I think they mean 4.2.1) http://storagecommunity.org/easyblog/entry/performance-data-graphical-visualization-for-ibm-spectrum-scale-environment I've been using it for a while - if you have any questions, let me know. Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Mon Sep 12 01:00:32 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Sun, 11 Sep 2016 20:00:32 -0400 Subject: [gpfsug-discuss] GPFS Routers In-Reply-To: <5F910253243E6A47B81A9A2EB424BBA101D137D1@NDMSMBX404.ndc.nasa.gov> References: <5F910253243E6A47B81A9A2EB424BBA101D137D1@NDMSMBX404.ndc.nasa.gov> Message-ID: <712e5024-e6eb-9195-d9cd-f59a9b145e60@nasa.gov> After some googling around, I wonder if perhaps what I'm thinking of was an I/O forwarding layer that I understood was being developed for x86_64 type machines rather than some type of GPFS protocol router or proxy. -Aaron On 9/11/16 5:02 PM, Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP] wrote: > Hi Everyone, > > A while back I seem to recall hearing about a mechanism being developed > that would function similarly to Lustre's LNET routers and effectively > allow a single set of NSD servers to talk to multiple RDMA fabrics > without requiring the NSD servers to have infiniband interfaces on each > RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each > fabric that would in effect proxy the RDMA requests to the NSD server. > Does anyone know what I'm talking about? Just curious if it's still on > the roadmap. 
> > -Aaron > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From douglasof at us.ibm.com Mon Sep 12 02:38:08 2016 From: douglasof at us.ibm.com (Douglas O'flaherty) Date: Sun, 11 Sep 2016 21:38:08 -0400 Subject: [gpfsug-discuss] gpfsug-discuss Digest, Vol 56, Issue 17 In-Reply-To: References: Message-ID: See you... and anyone else who can make it in Vegas in a couple weeks! From: gpfsug-discuss-request at spectrumscale.org To: gpfsug-discuss at spectrumscale.org Date: 09/11/2016 07:00 AM Subject: gpfsug-discuss Digest, Vol 56, Issue 17 Sent by: gpfsug-discuss-bounces at spectrumscale.org Send gpfsug-discuss mailing list submissions to gpfsug-discuss at spectrumscale.org To subscribe or unsubscribe via the World Wide Web, visit http://gpfsug.org/mailman/listinfo/gpfsug-discuss or, via email, send a message with subject or body 'help' to gpfsug-discuss-request at spectrumscale.org You can reach the person managing the list at gpfsug-discuss-owner at spectrumscale.org When replying, please edit your Subject line so it is more specific than "Re: Contents of gpfsug-discuss digest..." Today's Topics: 1. Re: Edge Attendees (Bryan Banister) ----- Message from Bryan Banister on Sat, 10 Sep 2016 21:50:25 +0000 ----- To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Edge Attendees Hi Doug, Found out that I get to attend this year. Please put me down for the SS NDA round-table, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [ mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Douglas O'flaherty Sent: Monday, August 29, 2016 12:34 AM To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] Edge Attendees Greetings: I am organizing an NDA round-table with the IBM Offering Managers at IBM Edge on Tuesday, September 20th at 1pm. The subject will be "The Future of IBM Spectrum Scale." IBM Offering Managers are the Product Owners at IBM. There will be discussions covering licensing, the roadmap for IBM Spectrum Scale RAID (aka GNR), new hardware platforms, etc. This is a unique opportunity to get feedback to the drivers of the IBM Spectrum Scale business plans. It should be a great companion to the content we get from Engineering and Research at most User Group meetings. To get an invitation, please email me privately at douglasof us.ibm.com. All who have a valid NDA are invited. I only need an approximate headcount of attendees. Try not to spam the mailing list. I am pushing to get the Offering Managers to have a similar session at SC16 as an IBM Multi-client Briefing. You can add your voice to that call on this thread, or email me directly. Spectrum Scale User Group at SC16 will once again take place on Sunday afternoon with cocktails to follow. I hope we can blow out the attendance numbers and the number of site speakers we had last year! I know Simon, Bob, and Kristy are already working the agenda. Get your ideas in to them or to me. See you in Vegas, Vegas, SLC, Vegas this Fall... Maybe Australia in between? doug Douglas O'Flaherty IBM Spectrum Storage Marketing Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. 
If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From knop at us.ibm.com Mon Sep 12 06:17:05 2016 From: knop at us.ibm.com (Felipe Knop) Date: Mon, 12 Sep 2016 01:17:05 -0400 Subject: [gpfsug-discuss] Remote cluster mount failing In-Reply-To: References: Message-ID: There is a chance the problem might be related to an upgrade from 3.5 to 4.1, or perhaps a remote mount between versions 3.5 and 4.1. It would be useful to know details related to any such migration and different releases when the PMR is opened. Thanks, Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 From: Yuri L Volobuev/Austin/IBM at IBMUS To: gpfsug main discussion list Date: 09/09/2016 12:30 PM Subject: Re: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org It could be "easy" in the end, e.g. regenerating the key ("mmauth genkey new") may fix the issue. Figuring out exactly what is going wrong is messy though, and requires looking at a number of debug data points, something that's awkward to do on a public mailing list. I don't think you want to post certificates et al on a mailing list. The PMR channel is more appropriate for this kind of thing. yuri "Simon Thompson (Research Computing - IT Services)" ---09/09/2016 07:37:52 AM---That?s sorta what I was expecting. Though I was hoping someone might have said 'oh just run mmchconf From: "Simon Thompson (Research Computing - IT Services)" To: gpfsug main discussion list , Date: 09/09/2016 07:37 AM Subject: Re: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org That?s sorta what I was expecting. Though I was hoping someone might have said 'oh just run mmchconfig ....' or something easy. PMR on its way in. Thanks! Simon From: on behalf of Yuri L Volobuev Reply-To: "gpfsug-discuss at spectrumscale.org" < gpfsug-discuss at spectrumscale.org> Date: Wednesday, 7 September 2016 at 17:58 To: "gpfsug-discuss at spectrumscale.org" Subject: Re: [gpfsug-discuss] Remote cluster mount failing It's unclear what's wrong. I'd have two main suspects: (1) TLS protocol version confusion, due to a difference in GSKit version and/or configuration (e.g. NIST SP800 compliance) on two sides (2) firewall. TLS issues are usually messy and tedious to work though. I'd recommend opening a PMR to facilitate debug data collection and analysis. A lot of gory detail may be needed to figure out what's going on. 
yuri "Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 AM---Hi All, I'm trying to get some multi cluster thing working between two of our GPFS From: "Simon Thompson (Research Computing - IT Services)" < S.J.Thompson at bham.ac.uk> To: "gpfsug-discuss at spectrumscale.org" , Date: 09/07/2016 05:37 AM Subject: [gpfsug-discuss] Remote cluster mount failing Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi All, I'm trying to get some multi cluster thing working between two of our GPFS clusters. In the "client" cluster, when trying to mount the "remote" cluster, I get: # mmmount gpfs Wed 7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ... mount: mount /dev/gpfs on /gpfs failed: Connection timed out mmmount: Command failed. Examine previous error messages to determine cause. And in the log file: Wed Sep 7 13:33:07.481 2016: [N] The client side TLS handshake with node 10.0.0.182 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.486 2016: [N] The client side TLS handshake with node 10.0.0.181 was cancelled: connection reset by peer (return code 420). Wed Sep 7 13:33:07.487 2016: [E] Failed to join remote cluster GPFS_STORAGE.CLUSTER Wed Sep 7 13:33:07.488 2016: [W] Command: err 78: mount GPFS_STORAGE.CLUSTER:gpfs Wed Sep 7 13:33:07.489 2016: Connection timed out In the remote cluster, I see: Wed Sep 7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222 failed with error 447 (server side). Wed Sep 7 13:33:07.488 2016: [X] Connection from 10.10.0.35 refused, authentication failed Wed Sep 7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err 703 Wed Sep 7 13:33:07.490 2016: Operation not permitted Weirdly though on other nodes in the client cluster this succeeds fine and can mount, so I think I got all the bits in the mmauth and mmremotecluster configured correctly. Any suggestions? Thanks Simon _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss [attachment "graycol.gif" deleted by Yuri L Volobuev/Austin/IBM] _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 105 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 105 bytes Desc: not available URL: From makaplan at us.ibm.com Mon Sep 12 15:48:56 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Mon, 12 Sep 2016 10:48:56 -0400 Subject: [gpfsug-discuss] GPFS Routers In-Reply-To: <712e5024-e6eb-9195-d9cd-f59a9b145e60@nasa.gov> References: <5F910253243E6A47B81A9A2EB424BBA101D137D1@NDMSMBX404.ndc.nasa.gov> <712e5024-e6eb-9195-d9cd-f59a9b145e60@nasa.gov> Message-ID: Perhaps if you clearly describe what equipment and connections you have in place and what you're trying to accomplish, someone on this board can propose a solution. In principle, it's always possible to insert proxies/routers to "fake" any two endpoints into "believing" they are communicating directly. 
From: Aaron Knister To: Date: 09/11/2016 08:01 PM Subject: Re: [gpfsug-discuss] GPFS Routers Sent by: gpfsug-discuss-bounces at spectrumscale.org After some googling around, I wonder if perhaps what I'm thinking of was an I/O forwarding layer that I understood was being developed for x86_64 type machines rather than some type of GPFS protocol router or proxy. -Aaron On 9/11/16 5:02 PM, Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP] wrote: > Hi Everyone, > > A while back I seem to recall hearing about a mechanism being developed > that would function similarly to Lustre's LNET routers and effectively > allow a single set of NSD servers to talk to multiple RDMA fabrics > without requiring the NSD servers to have infiniband interfaces on each > RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each > fabric that would in effect proxy the RDMA requests to the NSD server. > Does anyone know what I'm talking about? Just curious if it's still on > the roadmap. > > -Aaron > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhejtman at ics.muni.cz Mon Sep 12 15:57:55 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Mon, 12 Sep 2016 16:57:55 +0200 Subject: [gpfsug-discuss] gpfs 4.2.1 and samba export Message-ID: <20160912145755.xhx2du4c3aimkkxt@ics.muni.cz> Hello, I have GPFS version 4.2.1 on Centos 7.2 (kernel 3.10.0-327.22.2.el7.x86_64) and I have got some weird behavior of samba. Windows clients get stucked for almost 1 minute when copying files. I traced down the problematic syscall: 27887 16:39:28.000401 utimensat(AT_FDCWD, "000000-My_Documents/Windows/InfusedApps/Packages/Microsoft.Messaging_1.10.22012.0_x86__8wekyb3d8bbwe/SkypeApp/View/HomePage.xaml", {{1473691167, 940424000}, {1473691168, 295355}}, 0) = 0 <74.999775> [...] 27887 16:44:24.000310 utimensat(AT_FDCWD, "000000-My_Documents/Windows/InfusedApps/Packages/Microsoft.Windows.Photos_15.1001.16470.0_x64__8wekyb3d8bbwe/Assets/PhotosAppList.contrast-white_targetsize-16.png", {{1473691463, 931319000}, {1473691464, 96608}}, 0) = 0 <74.999841> [...] 27887 16:50:34.002274 utimensat(AT_FDCWD, "000000-My_Documents/Windows/InfusedApps/Packages/Microsoft.XboxApp_9.9.30030.0_x64__8wekyb3d8bbwe/_Resources/50.rsrc", {{1473691833, 952166000}, {1473691834, 2166223}}, 0) = 0 <74.997877> [...] 27887 16:53:11.000240 utimensat(AT_FDCWD, "000000-My_Documents/Windows/InfusedApps/Packages/Microsoft.ZuneVideo_3.6.13251.0_x64__8wekyb3d8bbwe/Styles/CommonBrushes.xbf", {{1473691990, 948668000}, {1473691991, 131221}}, 0) = 0 <74.999540> it seems that from time to time, utimensat(2) call takes over 70 (!!) seconds. Normal utimensat syscall looks like: 27887 16:55:16.238132 utimensat(AT_FDCWD, "000000-My_Documents/Windows/Installer/$PatchCache$/Managed/00004109210000000000000000F01FEC/14.0.7015/ACEODDBS.DLL", {{1473692116, 196458000}, {1351702318, 0}}, 0) = 0 <0.000065> At the same time, there is untar running. When samba freezes at utimensat call, untar continues to write data to GPFS (same fs as samba), so it does not seem to me as buffers flush. 
When the syscall is stucked, I/O utilization of all GPFS disks is below 10 %. mmfsadm dump waiters shows nothing waiting and any cluster node. So any ideas? Or should I just fire PMR? This is cluster config: clusterId 2745894253048382857 autoload no dmapiFileHandleSize 32 minReleaseLevel 4.2.1.0 ccrEnabled yes maxMBpS 20000 maxblocksize 8M cipherList AUTHONLY maxFilesToCache 10000 nsdSmallThreadRatio 1 nsdMaxWorkerThreads 480 ignorePrefetchLUNCount yes pagepool 48G prefetchThreads 320 worker1Threads 320 writebehindThreshhold 10485760 cifsBypassShareLocksOnRename yes cifsBypassTraversalChecking yes allowWriteWithDeleteChild yes adminMode central And this is file system config: flag value description ------------------- ------------------------ ----------------------------------- -f 65536 Minimum fragment size in bytes -i 4096 Inode size in bytes -I 32768 Indirect block size in bytes -m 1 Default number of metadata replicas -M 2 Maximum number of metadata replicas -r 1 Default number of data replicas -R 2 Maximum number of data replicas -j cluster Block allocation type -D nfs4 File locking semantics in effect -k all ACL semantics in effect -n 32 Estimated number of nodes that will mount file system -B 2097152 Block size -Q user;group;fileset Quotas accounting enabled user;group;fileset Quotas enforced none Default quotas enabled --perfileset-quota Yes Per-fileset quota enforcement --filesetdf Yes Fileset df enabled? -V 15.01 (4.2.0.0) File system version --create-time Wed Aug 24 17:38:40 2016 File system creation time -z No Is DMAPI enabled? -L 4194304 Logfile size -E Yes Exact mtime mount option -S No Suppress atime mount option -K whenpossible Strict replica allocation option --fastea Yes Fast external attributes enabled? --encryption No Encryption enabled? --inode-limit 402653184 Maximum number of inodes in all inode spaces --log-replicas 0 Number of log replicas --is4KAligned Yes is4KAligned? --rapid-repair Yes rapidRepair enabled? --write-cache-threshold 0 HAWC Threshold (max 65536) -P system Disk storage pools in file system -d nsd_A_m;nsd_B_m;nsd_C_m;nsd_D_m;nsd_A_LV1_d;nsd_A_LV2_d;nsd_A_LV3_d;nsd_A_LV4_d;nsd_B_LV1_d;nsd_B_LV2_d;nsd_B_LV3_d;nsd_B_LV4_d;nsd_C_LV1_d;nsd_C_LV2_d;nsd_C_LV3_d; -d nsd_C_LV4_d;nsd_D_LV1_d;nsd_D_LV2_d;nsd_D_LV3_d;nsd_D_LV4_d Disks in file system -A yes Automatic mount option -o none Additional mount options -T /gpfs/vol1 Default mount point --mount-priority 1 Mount priority -- Luk?? Hejtm?nek From chekh at stanford.edu Mon Sep 12 20:03:15 2016 From: chekh at stanford.edu (Alex Chekholko) Date: Mon, 12 Sep 2016 12:03:15 -0700 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? Message-ID: Hi, For a fileset with a quota on it, we have mmlsquota reporting 39TB utilization (out of 50TB quota), with 0 in_doubt. Running a 'du' on the same directory (where the fileset is junctioned) shows 21TB usage. I looked for sparse files (files that report different size via ls vs du). I looked at 'du --apparent-size ...' https://en.wikipedia.org/wiki/Sparse_file What else could it be? Is there some attribute I can scan for inside GPFS? Maybe where FILE_SIZE does not equal KB_ALLOCATED? 
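(For what it's worth, one way to run such a scan is an mmapplypolicy LIST rule comparing the two attributes. The rule below is only a sketch: the policy file name, list name and output prefix are invented, and with -I defer the matches should land in list files named after the -f prefix.)

/* sparse.pol: files whose allocated space is smaller than their apparent size */
RULE 'sparse' LIST 'sparsefiles'
     SHOW(VARCHAR(FILE_SIZE) || ' ' || VARCHAR(KB_ALLOCATED))
     WHERE (KB_ALLOCATED * 1024) < FILE_SIZE

# Scan just the fileset path and write the candidate lists to /tmp
mmapplypolicy /srv/gsfs0/projects/gbsc -P sparse.pol -f /tmp/gbsc -I defer
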
https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_usngfileattrbts.htm [root at scg-gs0 ~]# du -sm --apparent-size /srv/gsfs0/projects/gbsc/* 3977 /srv/gsfs0/projects/gbsc/Backups 1 /srv/gsfs0/projects/gbsc/benchmark 13109 /srv/gsfs0/projects/gbsc/Billing 198719 /srv/gsfs0/projects/gbsc/Clinical 1 /srv/gsfs0/projects/gbsc/Clinical_Vendors 1206523 /srv/gsfs0/projects/gbsc/Data 1 /srv/gsfs0/projects/gbsc/iPoP 123165 /srv/gsfs0/projects/gbsc/Macrogen 58676 /srv/gsfs0/projects/gbsc/Misc 6625890 /srv/gsfs0/projects/gbsc/mva 1 /srv/gsfs0/projects/gbsc/Proj 17 /srv/gsfs0/projects/gbsc/Projects 3290502 /srv/gsfs0/projects/gbsc/Resources 1 /srv/gsfs0/projects/gbsc/SeqCenter 1 /srv/gsfs0/projects/gbsc/share 514041 /srv/gsfs0/projects/gbsc/SNAP_Scoring 1 /srv/gsfs0/projects/gbsc/TCGA_Variants 267873 /srv/gsfs0/projects/gbsc/tools 9597797 /srv/gsfs0/projects/gbsc/workspace (adds up to about 21TB) [root at scg-gs0 ~]# mmlsquota -j projects.gbsc --block-size=G gsfs0 Block Limits | File Limits Filesystem type GB quota limit in_doubt grace | files quota limit in_doubt grace Remarks gsfs0 FILESET 39889 51200 51200 0 none | 1663212 0 0 4 none [root at scg-gs0 ~]# mmlsfileset gsfs0 |grep gbsc projects.gbsc Linked /srv/gsfs0/projects/gbsc Regards, -- Alex Chekholko chekh at stanford.edu From bbanister at jumptrading.com Mon Sep 12 20:06:59 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Mon, 12 Sep 2016 19:06:59 +0000 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? In-Reply-To: References: Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB0632A645@CHI-EXCHANGEW1.w2k.jumptrading.com> I'd recommend running a mmcheckquota and then check mmlsquota again, -Bryan -----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Alex Chekholko Sent: Monday, September 12, 2016 2:03 PM To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? Hi, For a fileset with a quota on it, we have mmlsquota reporting 39TB utilization (out of 50TB quota), with 0 in_doubt. Running a 'du' on the same directory (where the fileset is junctioned) shows 21TB usage. I looked for sparse files (files that report different size via ls vs du). I looked at 'du --apparent-size ...' https://en.wikipedia.org/wiki/Sparse_file What else could it be? Is there some attribute I can scan for inside GPFS? Maybe where FILE_SIZE does not equal KB_ALLOCATED? 
https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_usngfileattrbts.htm [root at scg-gs0 ~]# du -sm --apparent-size /srv/gsfs0/projects/gbsc/* 3977 /srv/gsfs0/projects/gbsc/Backups 1 /srv/gsfs0/projects/gbsc/benchmark 13109 /srv/gsfs0/projects/gbsc/Billing 198719 /srv/gsfs0/projects/gbsc/Clinical 1 /srv/gsfs0/projects/gbsc/Clinical_Vendors 1206523 /srv/gsfs0/projects/gbsc/Data 1 /srv/gsfs0/projects/gbsc/iPoP 123165 /srv/gsfs0/projects/gbsc/Macrogen 58676 /srv/gsfs0/projects/gbsc/Misc 6625890 /srv/gsfs0/projects/gbsc/mva 1 /srv/gsfs0/projects/gbsc/Proj 17 /srv/gsfs0/projects/gbsc/Projects 3290502 /srv/gsfs0/projects/gbsc/Resources 1 /srv/gsfs0/projects/gbsc/SeqCenter 1 /srv/gsfs0/projects/gbsc/share 514041 /srv/gsfs0/projects/gbsc/SNAP_Scoring 1 /srv/gsfs0/projects/gbsc/TCGA_Variants 267873 /srv/gsfs0/projects/gbsc/tools 9597797 /srv/gsfs0/projects/gbsc/workspace (adds up to about 21TB) [root at scg-gs0 ~]# mmlsquota -j projects.gbsc --block-size=G gsfs0 Block Limits | File Limits Filesystem type GB quota limit in_doubt grace | files quota limit in_doubt grace Remarks gsfs0 FILESET 39889 51200 51200 0 none | 1663212 0 0 4 none [root at scg-gs0 ~]# mmlsfileset gsfs0 |grep gbsc projects.gbsc Linked /srv/gsfs0/projects/gbsc Regards, -- Alex Chekholko chekh at stanford.edu _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. From Kevin.Buterbaugh at Vanderbilt.Edu Mon Sep 12 20:08:28 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Mon, 12 Sep 2016 19:08:28 +0000 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? In-Reply-To: References: Message-ID: Hi Alex, While the numbers don?t match exactly, they?re close enough to prompt me to ask if data replication is possibly set to two? Thanks? Kevin On Sep 12, 2016, at 2:03 PM, Alex Chekholko > wrote: Hi, For a fileset with a quota on it, we have mmlsquota reporting 39TB utilization (out of 50TB quota), with 0 in_doubt. Running a 'du' on the same directory (where the fileset is junctioned) shows 21TB usage. I looked for sparse files (files that report different size via ls vs du). I looked at 'du --apparent-size ...' https://en.wikipedia.org/wiki/Sparse_file What else could it be? Is there some attribute I can scan for inside GPFS? Maybe where FILE_SIZE does not equal KB_ALLOCATED? 
https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_usngfileattrbts.htm [root at scg-gs0 ~]# du -sm --apparent-size /srv/gsfs0/projects/gbsc/* 3977 /srv/gsfs0/projects/gbsc/Backups 1 /srv/gsfs0/projects/gbsc/benchmark 13109 /srv/gsfs0/projects/gbsc/Billing 198719 /srv/gsfs0/projects/gbsc/Clinical 1 /srv/gsfs0/projects/gbsc/Clinical_Vendors 1206523 /srv/gsfs0/projects/gbsc/Data 1 /srv/gsfs0/projects/gbsc/iPoP 123165 /srv/gsfs0/projects/gbsc/Macrogen 58676 /srv/gsfs0/projects/gbsc/Misc 6625890 /srv/gsfs0/projects/gbsc/mva 1 /srv/gsfs0/projects/gbsc/Proj 17 /srv/gsfs0/projects/gbsc/Projects 3290502 /srv/gsfs0/projects/gbsc/Resources 1 /srv/gsfs0/projects/gbsc/SeqCenter 1 /srv/gsfs0/projects/gbsc/share 514041 /srv/gsfs0/projects/gbsc/SNAP_Scoring 1 /srv/gsfs0/projects/gbsc/TCGA_Variants 267873 /srv/gsfs0/projects/gbsc/tools 9597797 /srv/gsfs0/projects/gbsc/workspace (adds up to about 21TB) [root at scg-gs0 ~]# mmlsquota -j projects.gbsc --block-size=G gsfs0 Block Limits | File Limits Filesystem type GB quota limit in_doubt grace | files quota limit in_doubt grace Remarks gsfs0 FILESET 39889 51200 51200 0 none | 1663212 0 0 4 none [root at scg-gs0 ~]# mmlsfileset gsfs0 |grep gbsc projects.gbsc Linked /srv/gsfs0/projects/gbsc Regards, -- Alex Chekholko chekh at stanford.edu _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Mon Sep 12 21:26:51 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Mon, 12 Sep 2016 20:26:51 +0000 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? In-Reply-To: References: Message-ID: My thoughts exactly. Richard From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: 12 September 2016 20:08 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? Hi Alex, While the numbers don?t match exactly, they?re close enough to prompt me to ask if data replication is possibly set to two? Thanks? Kevin On Sep 12, 2016, at 2:03 PM, Alex Chekholko > wrote: Hi, For a fileset with a quota on it, we have mmlsquota reporting 39TB utilization (out of 50TB quota), with 0 in_doubt. Running a 'du' on the same directory (where the fileset is junctioned) shows 21TB usage. I looked for sparse files (files that report different size via ls vs du). I looked at 'du --apparent-size ...' https://en.wikipedia.org/wiki/Sparse_file What else could it be? Is there some attribute I can scan for inside GPFS? Maybe where FILE_SIZE does not equal KB_ALLOCATED? 
https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_usngfileattrbts.htm [root at scg-gs0 ~]# du -sm --apparent-size /srv/gsfs0/projects/gbsc/* 3977 /srv/gsfs0/projects/gbsc/Backups 1 /srv/gsfs0/projects/gbsc/benchmark 13109 /srv/gsfs0/projects/gbsc/Billing 198719 /srv/gsfs0/projects/gbsc/Clinical 1 /srv/gsfs0/projects/gbsc/Clinical_Vendors 1206523 /srv/gsfs0/projects/gbsc/Data 1 /srv/gsfs0/projects/gbsc/iPoP 123165 /srv/gsfs0/projects/gbsc/Macrogen 58676 /srv/gsfs0/projects/gbsc/Misc 6625890 /srv/gsfs0/projects/gbsc/mva 1 /srv/gsfs0/projects/gbsc/Proj 17 /srv/gsfs0/projects/gbsc/Projects 3290502 /srv/gsfs0/projects/gbsc/Resources 1 /srv/gsfs0/projects/gbsc/SeqCenter 1 /srv/gsfs0/projects/gbsc/share 514041 /srv/gsfs0/projects/gbsc/SNAP_Scoring 1 /srv/gsfs0/projects/gbsc/TCGA_Variants 267873 /srv/gsfs0/projects/gbsc/tools 9597797 /srv/gsfs0/projects/gbsc/workspace (adds up to about 21TB) [root at scg-gs0 ~]# mmlsquota -j projects.gbsc --block-size=G gsfs0 Block Limits | File Limits Filesystem type GB quota limit in_doubt grace | files quota limit in_doubt grace Remarks gsfs0 FILESET 39889 51200 51200 0 none | 1663212 0 0 4 none [root at scg-gs0 ~]# mmlsfileset gsfs0 |grep gbsc projects.gbsc Linked /srv/gsfs0/projects/gbsc Regards, -- Alex Chekholko chekh at stanford.edu _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss ? Kevin Buterbaugh - Senior System Administrator Vanderbilt University - Advanced Computing Center for Research and Education Kevin.Buterbaugh at vanderbilt.edu - (615)875-9633 -------------- next part -------------- An HTML attachment was scrubbed... URL: From laurence at qsplace.co.uk Mon Sep 12 21:46:55 2016 From: laurence at qsplace.co.uk (Laurence Horrocks-Barlow) Date: Mon, 12 Sep 2016 21:46:55 +0100 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? In-Reply-To: References: Message-ID: <2C38B1C8-66DB-45C6-AA5D-E612F5BFE935@qsplace.co.uk> However replicated files should show up with ls as taking about double the space. I.e. "ls -lash" 49G -r-------- 1 root root 25G Sep 12 21:11 Somefile I know you've said you checked ls vs du for allocated space it might be worth a double check. Also check that you haven't got a load of snapshots, especially if you have high file churn which will create new blocks; although with your figures it'd have to be very high file churn. -- Lauz On 12 September 2016 21:26:51 BST, "Sobey, Richard A" wrote: >My thoughts exactly. > >Richard > >From: gpfsug-discuss-bounces at spectrumscale.org >[mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of >Buterbaugh, Kevin L >Sent: 12 September 2016 20:08 >To: gpfsug main discussion list >Subject: Re: [gpfsug-discuss] big difference between output of >'mmlsquota' and 'du'? > >Hi Alex, > >While the numbers don?t match exactly, they?re close enough to prompt >me to ask if data replication is possibly set to two? Thanks? > >Kevin > >On Sep 12, 2016, at 2:03 PM, Alex Chekholko >> wrote: > >Hi, > >For a fileset with a quota on it, we have mmlsquota reporting 39TB >utilization (out of 50TB quota), with 0 in_doubt. > >Running a 'du' on the same directory (where the fileset is junctioned) >shows 21TB usage. > >I looked for sparse files (files that report different size via ls vs >du). I looked at 'du --apparent-size ...' > >https://en.wikipedia.org/wiki/Sparse_file > >What else could it be? 
> >Is there some attribute I can scan for inside GPFS? >Maybe where FILE_SIZE does not equal KB_ALLOCATED? >https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_usngfileattrbts.htm > > >[root at scg-gs0 ~]# du -sm --apparent-size /srv/gsfs0/projects/gbsc/* >3977 /srv/gsfs0/projects/gbsc/Backups >1 /srv/gsfs0/projects/gbsc/benchmark >13109 /srv/gsfs0/projects/gbsc/Billing >198719 /srv/gsfs0/projects/gbsc/Clinical >1 /srv/gsfs0/projects/gbsc/Clinical_Vendors >1206523 /srv/gsfs0/projects/gbsc/Data >1 /srv/gsfs0/projects/gbsc/iPoP >123165 /srv/gsfs0/projects/gbsc/Macrogen >58676 /srv/gsfs0/projects/gbsc/Misc >6625890 /srv/gsfs0/projects/gbsc/mva >1 /srv/gsfs0/projects/gbsc/Proj >17 /srv/gsfs0/projects/gbsc/Projects >3290502 /srv/gsfs0/projects/gbsc/Resources >1 /srv/gsfs0/projects/gbsc/SeqCenter >1 /srv/gsfs0/projects/gbsc/share >514041 /srv/gsfs0/projects/gbsc/SNAP_Scoring >1 /srv/gsfs0/projects/gbsc/TCGA_Variants >267873 /srv/gsfs0/projects/gbsc/tools >9597797 /srv/gsfs0/projects/gbsc/workspace > >(adds up to about 21TB) > >[root at scg-gs0 ~]# mmlsquota -j projects.gbsc --block-size=G gsfs0 > Block Limits | File Limits >Filesystem type GB quota limit in_doubt >grace | files quota limit in_doubt grace Remarks >gsfs0 FILESET 39889 51200 51200 0 >none | 1663212 0 0 4 none > > >[root at scg-gs0 ~]# mmlsfileset gsfs0 |grep gbsc >projects.gbsc Linked /srv/gsfs0/projects/gbsc > >Regards, >-- >Alex Chekholko chekh at stanford.edu > >_______________________________________________ >gpfsug-discuss mailing list >gpfsug-discuss at spectrumscale.org >http://gpfsug.org/mailman/listinfo/gpfsug-discuss > >? >Kevin Buterbaugh - Senior System Administrator >Vanderbilt University - Advanced Computing Center for Research and >Education >Kevin.Buterbaugh at vanderbilt.edu >- (615)875-9633 > > > > > >------------------------------------------------------------------------ > >_______________________________________________ >gpfsug-discuss mailing list >gpfsug-discuss at spectrumscale.org >http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Sent from my Android device with K-9 Mail. Please excuse my brevity. -------------- next part -------------- An HTML attachment was scrubbed... URL: From janfrode at tanso.net Mon Sep 12 22:37:08 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Mon, 12 Sep 2016 21:37:08 +0000 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? In-Reply-To: References: Message-ID: Maybe you have a huge file open, that's been unlinked and still growing? -jf -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Mon Sep 12 22:59:36 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Mon, 12 Sep 2016 14:59:36 -0700 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and'du'? In-Reply-To: References: Message-ID: 'du' tallies up 'blocks allocated', not file sizes. So it shouldn't matter whether any sparse files are present. GPFS doesn't charge quota for data in snapshots (whether it should is a separate question). The observed discrepancy has two plausible causes: 1) Inaccuracy in quota accounting (more likely) 2) Artefacts of data replication (less likely) Running mmcheckquota in this situation would be a good idea. yuri From: Alex Chekholko To: gpfsug-discuss at spectrumscale.org, Date: 09/12/2016 12:04 PM Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? 
Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi, For a fileset with a quota on it, we have mmlsquota reporting 39TB utilization (out of 50TB quota), with 0 in_doubt. Running a 'du' on the same directory (where the fileset is junctioned) shows 21TB usage. I looked for sparse files (files that report different size via ls vs du). I looked at 'du --apparent-size ...' https://en.wikipedia.org/wiki/Sparse_file What else could it be? Is there some attribute I can scan for inside GPFS? Maybe where FILE_SIZE does not equal KB_ALLOCATED? https://www.ibm.com/support/knowledgecenter/STXKQY_4.2.0/com.ibm.spectrum.scale.v4r2.adv.doc/bl1adv_usngfileattrbts.htm [root at scg-gs0 ~]# du -sm --apparent-size /srv/gsfs0/projects/gbsc/* 3977 /srv/gsfs0/projects/gbsc/Backups 1 /srv/gsfs0/projects/gbsc/benchmark 13109 /srv/gsfs0/projects/gbsc/Billing 198719 /srv/gsfs0/projects/gbsc/Clinical 1 /srv/gsfs0/projects/gbsc/Clinical_Vendors 1206523 /srv/gsfs0/projects/gbsc/Data 1 /srv/gsfs0/projects/gbsc/iPoP 123165 /srv/gsfs0/projects/gbsc/Macrogen 58676 /srv/gsfs0/projects/gbsc/Misc 6625890 /srv/gsfs0/projects/gbsc/mva 1 /srv/gsfs0/projects/gbsc/Proj 17 /srv/gsfs0/projects/gbsc/Projects 3290502 /srv/gsfs0/projects/gbsc/Resources 1 /srv/gsfs0/projects/gbsc/SeqCenter 1 /srv/gsfs0/projects/gbsc/share 514041 /srv/gsfs0/projects/gbsc/SNAP_Scoring 1 /srv/gsfs0/projects/gbsc/TCGA_Variants 267873 /srv/gsfs0/projects/gbsc/tools 9597797 /srv/gsfs0/projects/gbsc/workspace (adds up to about 21TB) [root at scg-gs0 ~]# mmlsquota -j projects.gbsc --block-size=G gsfs0 Block Limits | File Limits Filesystem type GB quota limit in_doubt grace | files quota limit in_doubt grace Remarks gsfs0 FILESET 39889 51200 51200 0 none | 1663212 0 0 4 none [root at scg-gs0 ~]# mmlsfileset gsfs0 |grep gbsc projects.gbsc Linked /srv/gsfs0/projects/gbsc Regards, -- Alex Chekholko chekh at stanford.edu _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From chekh at stanford.edu Mon Sep 12 23:11:12 2016 From: chekh at stanford.edu (Alex Chekholko) Date: Mon, 12 Sep 2016 15:11:12 -0700 Subject: [gpfsug-discuss] big difference between output of 'mmlsquota' and 'du'? In-Reply-To: References: Message-ID: Thanks for all the responses. I will look through the filesystem clients for open file handles; we have definitely had deleted open log files of multi-TB size before. The filesystem has replication set to 1. We don't use snapshots. I'm running a 'mmrestripefs -r' (some files were ill-placed from aborted pool migrations) and then I will run an 'mmcheckquota'. On 9/12/16 2:37 PM, Jan-Frode Myklebust wrote: > Maybe you have a huge file open, that's been unlinked and still growing? 
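(As an aside, unlinked-but-still-open files can be hunted down with ordinary Linux tooling on each node that mounts the filesystem; nothing here is GPFS-specific, and the mount point is assumed from the paths above.)

# Open files on the mount point whose link count has dropped below 1 (i.e. deleted)
lsof +L1 /srv/gsfs0

# Per-process view of file descriptors that still point at deleted files
ls -l /proc/*/fd 2>/dev/null | grep '(deleted)'
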
> > > > -jf > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Alex Chekholko chekh at stanford.edu From xhejtman at ics.muni.cz Mon Sep 12 23:30:19 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Tue, 13 Sep 2016 00:30:19 +0200 Subject: [gpfsug-discuss] gpfs snapshots Message-ID: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> Hello, using gpfs 4.2.1, I do about 60 snapshots per day (one snapshot per 15 minutes during working hours). It seems that snapid is increasing only number. Should I be fine with such a number of snapshots per day? I guess we could reach snapid 100,000. I remove all these snapshots during night so I do not keep huge number of snapshots. -- Luk?? Hejtm?nek From volobuev at us.ibm.com Mon Sep 12 23:42:00 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Mon, 12 Sep 2016 15:42:00 -0700 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> Message-ID: The increasing value of snapId is not a problem. Creating snapshots every 15 min is somewhat more frequent than what is customary, but as long as you're able to delete filesets at the same rate you're creating them, this should work OK. yuri From: Lukas Hejtmanek To: gpfsug-discuss at spectrumscale.org, Date: 09/12/2016 03:30 PM Subject: [gpfsug-discuss] gpfs snapshots Sent by: gpfsug-discuss-bounces at spectrumscale.org Hello, using gpfs 4.2.1, I do about 60 snapshots per day (one snapshot per 15 minutes during working hours). It seems that snapid is increasing only number. Should I be fine with such a number of snapshots per day? I guess we could reach snapid 100,000. I remove all these snapshots during night so I do not keep huge number of snapshots. -- Luk?? Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From r.sobey at imperial.ac.uk Tue Sep 13 04:19:30 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Tue, 13 Sep 2016 03:19:30 +0000 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> Message-ID: Don't worry. We do 400+ snapshots every 4 hours and that number is only getting bigger. Don't know what our current snapid count is mind you, can find out when in the office. Get Outlook for Android On Mon, Sep 12, 2016 at 11:30 PM +0100, "Lukas Hejtmanek" > wrote: Hello, using gpfs 4.2.1, I do about 60 snapshots per day (one snapshot per 15 minutes during working hours). It seems that snapid is increasing only number. Should I be fine with such a number of snapshots per day? I guess we could reach snapid 100,000. I remove all these snapshots during night so I do not keep huge number of snapshots. -- Luk?? Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From laurence at qsplace.co.uk Tue Sep 13 05:06:42 2016 From: laurence at qsplace.co.uk (Laurence Horrocks-Barlow) Date: Tue, 13 Sep 2016 05:06:42 +0100 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> Message-ID: <7EAC0DD4-6FC1-4DF5-825E-9E2DD966BA4E@qsplace.co.uk> There are many people doing the same thing so nothing to worry about. As your using 4.2.1 you can at least bulk delete the snapshots using a comma separated list, making life just that little bit easier. -- Lauz On 13 September 2016 04:19:30 BST, "Sobey, Richard A" wrote: >Don't worry. We do 400+ snapshots every 4 hours and that number is only >getting bigger. Don't know what our current snapid count is mind you, >can find out when in the office. > >Get Outlook for Android > > > >On Mon, Sep 12, 2016 at 11:30 PM +0100, "Lukas Hejtmanek" >> wrote: > >Hello, > >using gpfs 4.2.1, I do about 60 snapshots per day (one snapshot per 15 >minutes >during working hours). It seems that snapid is increasing only number. >Should >I be fine with such a number of snapshots per day? I guess we could >reach >snapid 100,000. I remove all these snapshots during night so I do not >keep >huge number of snapshots. > >-- >Luk?? Hejtm?nek >_______________________________________________ >gpfsug-discuss mailing list >gpfsug-discuss at spectrumscale.org >http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > >------------------------------------------------------------------------ > >_______________________________________________ >gpfsug-discuss mailing list >gpfsug-discuss at spectrumscale.org >http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Sent from my Android device with K-9 Mail. Please excuse my brevity. -------------- next part -------------- An HTML attachment was scrubbed... URL: From Valdis.Kletnieks at vt.edu Tue Sep 13 05:32:24 2016 From: Valdis.Kletnieks at vt.edu (Valdis.Kletnieks at vt.edu) Date: Tue, 13 Sep 2016 00:32:24 -0400 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> Message-ID: <20635.1473741144@turing-police.cc.vt.edu> On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: > I guess we could reach snapid 100,000. It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. What you *do* want to do is make the snap *name* meaningful, using a timestamp or something to keep your sanity. mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. From jtucker at pixitmedia.com Tue Sep 13 10:10:02 2016 From: jtucker at pixitmedia.com (Jez Tucker) Date: Tue, 13 Sep 2016 10:10:02 +0100 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: <20635.1473741144@turing-police.cc.vt.edu> References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> <20635.1473741144@turing-police.cc.vt.edu> Message-ID: <00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com> Hey Yuri, Perhaps an RFE here, but could I suggest there is much value in adding a -c option to mmcrsnapshot? Use cases: mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "Before phase 2" and mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "expire:GMT-2017.04.21-16.00.00" Ideally also: mmcrsnapshot fs1 fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr Then it's easy to iterate over snapshots and subsequently mmdelsnapshot snaps which are no longer required. 
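(A rough sketch of that iteration, assuming global snapshots whose names carry the @GMT-YYYY.MM.DD-HH.MM.SS timestamp used above and parsing mmlssnapshot output naively; the filesystem name, retention window and parsing are illustrative only.)

#!/bin/bash
# Delete global snapshots of one filesystem whose @GMT- name is older than 7 days
FS=gpfs0
CUTOFF=$(date -u -d '7 days ago' +%s)

mmlssnapshot "$FS" | awk '$1 ~ /^@GMT-/ {print $1}' | while read -r snap; do
    ts=${snap#@GMT-}                 # e.g. 2016.09.13-10.00.00
    day=${ts:0:10}; day=${day//./-}  # 2016-09-13
    tod=${ts:11};   tod=${tod//./:}  # 10:00:00
    when=$(date -u -d "$day $tod" +%s 2>/dev/null) || continue
    if [ "$when" -lt "$CUTOFF" ]; then
        mmdelsnapshot "$FS" "$snap"
    fi
done
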
There are lots of methods to achieve this, but without external databases / suchlike, this is rather simple and effective for end users. Alternatively a second comment like -expire flag as user metadata may be preferential. Thoughts? Jez On 13/09/16 05:32, Valdis.Kletnieks at vt.edu wrote: > On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: >> I guess we could reach snapid 100,000. > It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. > > What you *do* want to do is make the snap *name* meaningful, using > a timestamp or something to keep your sanity. > > mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Jez Tucker Head of Research & Product Development Pixit Media www.pixitmedia.com -- This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email. -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Tue Sep 13 21:51:16 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Tue, 13 Sep 2016 13:51:16 -0700 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: <00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com> References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz><20635.1473741144@turing-police.cc.vt.edu> <00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com> Message-ID: Hi Jez, It sounds to me like the functionality that you're _really_ looking for is an ability to to do automated snapshot management, similar to what's available on other storage systems. For example, "create a new snapshot of filesets X, Y, Z every 30 min, keep the last 16 snapshots". I've seen many examples of sysadmins rolling their own snapshot management system along those lines, and an ability to add an expiration string as a snapshot "comment" appears to be merely an aid in keeping such DIY snapshot management scripts a bit simpler -- not by much though. The end user would still be on the hook for some heavy lifting, in particular figuring out a way to run an equivalent of a cluster-aware cron with acceptable fault tolerance semantics. That is, if a snapshot creation is scheduled, only one node in the cluster should attempt to create the snapshot, but if that node fails, another node needs to step in (as opposed to skipping the scheduled snapshot creation). This is doable outside of GPFS, of course, but is not trivial. Architecturally, the right place to implement a fault-tolerant cluster-aware scheduling framework is GPFS itself, as the most complex pieces are already there. We have some plans for work along those lines, but if you want to reinforce the point with an RFE, that would be fine, too. yuri From: Jez Tucker To: gpfsug-discuss at spectrumscale.org, Date: 09/13/2016 02:10 AM Subject: Re: [gpfsug-discuss] gpfs snapshots Sent by: gpfsug-discuss-bounces at spectrumscale.org Hey Yuri, ? 
Perhaps an RFE here, but could I suggest there is much value in adding a -c option to mmcrsnapshot? Use cases: mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "Before phase 2" and mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "expire:GMT-2017.04.21-16.00.00" Ideally also: mmcrsnapshot fs1 fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr Then it's easy to iterate over snapshots and subsequently mmdelsnapshot snaps which are no longer required. There are lots of methods to achieve this, but without external databases / suchlike, this is rather simple and effective for end users. Alternatively a second comment like -expire flag as user metadata may be preferential. Thoughts? Jez On 13/09/16 05:32, Valdis.Kletnieks at vt.edu wrote: On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: I guess we could reach snapid 100,000. It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. What you *do* want to do is make the snap *name* meaningful, using a timestamp or something to keep your sanity. mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Jez Tucker Head of Research & Product Development Pixit Media www.pixitmedia.com This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From xhejtman at ics.muni.cz Tue Sep 13 21:57:52 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Tue, 13 Sep 2016 22:57:52 +0200 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> Message-ID: <20160913205752.3lmmfbhm25mu77j4@ics.muni.cz> Yuri et al. thank you for answers, I should be fine with snapshots as you suggest. On Mon, Sep 12, 2016 at 03:42:00PM -0700, Yuri L Volobuev wrote: > The increasing value of snapId is not a problem. Creating snapshots every > 15 min is somewhat more frequent than what is customary, but as long as > you're able to delete filesets at the same rate you're creating them, this > should work OK. > > yuri > > > > From: Lukas Hejtmanek > To: gpfsug-discuss at spectrumscale.org, > Date: 09/12/2016 03:30 PM > Subject: [gpfsug-discuss] gpfs snapshots > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Hello, > > using gpfs 4.2.1, I do about 60 snapshots per day (one snapshot per 15 > minutes > during working hours). It seems that snapid is increasing only number. > Should > I be fine with such a number of snapshots per day? I guess we could reach > snapid 100,000. 
I remove all these snapshots during night so I do not keep > huge number of snapshots. > > -- > Luk?? Hejtm?nek > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Luk?? Hejtm?nek From S.J.Thompson at bham.ac.uk Tue Sep 13 22:21:59 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Tue, 13 Sep 2016 21:21:59 +0000 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz><20635.1473741144@turing-police.cc.vt.edu> <00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com>, Message-ID: I thought the GUI implemented some form of snapshot scheduler. Personal opinion is that is the wrong place and I agree that is should be core functionality to ensure that the scheduler is running properly. But I would suggest that it might be more than just snapshots people might want to schedule. E.g. An ilm pool flush. Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Yuri L Volobuev [volobuev at us.ibm.com] Sent: 13 September 2016 21:51 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] gpfs snapshots Hi Jez, It sounds to me like the functionality that you're _really_ looking for is an ability to to do automated snapshot management, similar to what's available on other storage systems. For example, "create a new snapshot of filesets X, Y, Z every 30 min, keep the last 16 snapshots". I've seen many examples of sysadmins rolling their own snapshot management system along those lines, and an ability to add an expiration string as a snapshot "comment" appears to be merely an aid in keeping such DIY snapshot management scripts a bit simpler -- not by much though. The end user would still be on the hook for some heavy lifting, in particular figuring out a way to run an equivalent of a cluster-aware cron with acceptable fault tolerance semantics. That is, if a snapshot creation is scheduled, only one node in the cluster should attempt to create the snapshot, but if that node fails, another node needs to step in (as opposed to skipping the scheduled snapshot creation). This is doable outside of GPFS, of course, but is not trivial. Architecturally, the right place to implement a fault-tolerant cluster-aware scheduling framework is GPFS itself, as the most complex pieces are already there. We have some plans for work along those lines, but if you want to reinforce the point with an RFE, that would be fine, too. yuri [Inactive hide details for Jez Tucker ---09/13/2016 02:10:31 AM---Hey Yuri, Perhaps an RFE here, but could I suggest there is]Jez Tucker ---09/13/2016 02:10:31 AM---Hey Yuri, Perhaps an RFE here, but could I suggest there is much value in From: Jez Tucker To: gpfsug-discuss at spectrumscale.org, Date: 09/13/2016 02:10 AM Subject: Re: [gpfsug-discuss] gpfs snapshots Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hey Yuri, Perhaps an RFE here, but could I suggest there is much value in adding a -c option to mmcrsnapshot? 
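(By way of illustration, the sort of job such a scheduler would run for a pool flush is a threshold-driven migration; the pool names, thresholds and policy file below are invented.)

/* flush.pol: when 'fast' passes 85% full, push the largest files to 'capacity'
   until occupancy drops back below 60% */
RULE 'flushfast' MIGRATE FROM POOL 'fast'
     THRESHOLD(85,60)
     WEIGHT(KB_ALLOCATED)
     TO POOL 'capacity'

# Run periodically, or wire it to a lowDiskSpace callback via mmaddcallback
mmapplypolicy gpfs0 -P flush.pol -I yes
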
Use cases: mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "Before phase 2" and mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "expire:GMT-2017.04.21-16.00.00" Ideally also: mmcrsnapshot fs1 fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr Then it's easy to iterate over snapshots and subsequently mmdelsnapshot snaps which are no longer required. There are lots of methods to achieve this, but without external databases / suchlike, this is rather simple and effective for end users. Alternatively a second comment like -expire flag as user metadata may be preferential. Thoughts? Jez On 13/09/16 05:32, Valdis.Kletnieks at vt.edu wrote: On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: I guess we could reach snapid 100,000. It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. What you *do* want to do is make the snap *name* meaningful, using a timestamp or something to keep your sanity. mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Jez Tucker Head of Research & Product Development Pixit Media www.pixitmedia.com This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: graycol.gif URL: From mark.bergman at uphs.upenn.edu Tue Sep 13 22:23:57 2016 From: mark.bergman at uphs.upenn.edu (mark.bergman at uphs.upenn.edu) Date: Tue, 13 Sep 2016 17:23:57 -0400 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: Your message of "Tue, 13 Sep 2016 13:51:16 -0700." References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz><20635.1473741144@turing-police.cc.vt.edu> <00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com> Message-ID: <19294-1473801837.563347@J_5h.TM7K.YXzn> In the message dated: Tue, 13 Sep 2016 13:51:16 -0700, The pithy ruminations from Yuri L Volobuev on were: => => Hi Jez, => => It sounds to me like the functionality that you're _really_ looking for is => an ability to to do automated snapshot management, similar to what's Yep. => available on other storage systems. For example, "create a new snapshot of => filesets X, Y, Z every 30 min, keep the last 16 snapshots". I've seen many Or, take a snapshot every 15min, keep the 4 most recent, expire all except 4 that were created within 6hrs, only 4 created between 6:01-24:00 hh:mm ago, and expire all-but-2 created between 24:01-48:00, etc, as we do. => examples of sysadmins rolling their own snapshot management system along => those lines, and an ability to add an expiration string as a snapshot I'd be glad to distribute our local example of this exercise. 
=> "comment" appears to be merely an aid in keeping such DIY snapshot => management scripts a bit simpler -- not by much though. The end user would => still be on the hook for some heavy lifting, in particular figuring out a => way to run an equivalent of a cluster-aware cron with acceptable fault => tolerance semantics. That is, if a snapshot creation is scheduled, only => one node in the cluster should attempt to create the snapshot, but if that => node fails, another node needs to step in (as opposed to skipping the => scheduled snapshot creation). This is doable outside of GPFS, of course, => but is not trivial. Architecturally, the right place to implement a Ah, that part really is trivial....In our case, the snapshot program takes the filesystem name as an argument... we simply rely on the GPFS fault detection/failover. The job itself runs (via cron) on every GPFS server node, but only creates the snapshot on the server that is the active manager for the specified filesystem: ############################################################################## # Check if the node where this script is running is the GPFS manager node for the # specified filesystem manager=`/usr/lpp/mmfs/bin/mmlsmgr $filesys | grep -w "^$filesys" |awk '{print $2}'` ip addr list | grep -qw "$manager" if [ $? != 0 ] ; then # This node is not the manager...exit exit fi # else ... continue and create the snapshot ################################################################################################### => => yuri => => -- Mark Bergman voice: 215-746-4061 mark.bergman at uphs.upenn.edu fax: 215-614-0266 http://www.cbica.upenn.edu/ IT Technical Director, Center for Biomedical Image Computing and Analytics Department of Radiology University of Pennsylvania PGP Key: http://www.cbica.upenn.edu/sbia/bergman From jtolson at us.ibm.com Tue Sep 13 22:47:02 2016 From: jtolson at us.ibm.com (John T Olson) Date: Tue, 13 Sep 2016 14:47:02 -0700 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz><20635.1473741144@turing-police.cc.vt.edu><00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com>, Message-ID: We do have a general-purpose scheduler on the books as an item that is needed for a future release and as Yuri mentioned it would be cluster wide to avoid the single point of failure with tools like Cron. However, it's one of many things we want to try to get into the product and so we don't have any definite timeline yet. Thanks, John John T. Olson, Ph.D., MI.C., K.EY. Master Inventor, Software Defined Storage 957/9032-1 Tucson, AZ, 85744 (520) 799-5185, tie 321-5185 (FAX: 520-799-4237) Email: jtolson at us.ibm.com "Do or do not. There is no try." - Yoda Olson's Razor: Any situation that we, as humans, can encounter in life can be modeled by either an episode of The Simpsons or Seinfeld. From: "Simon Thompson (Research Computing - IT Services)" To: gpfsug main discussion list Date: 09/13/2016 02:22 PM Subject: Re: [gpfsug-discuss] gpfs snapshots Sent by: gpfsug-discuss-bounces at spectrumscale.org I thought the GUI implemented some form of snapshot scheduler. Personal opinion is that is the wrong place and I agree that is should be core functionality to ensure that the scheduler is running properly. But I would suggest that it might be more than just snapshots people might want to schedule. E.g. An ilm pool flush. 
Simon ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Yuri L Volobuev [volobuev at us.ibm.com] Sent: 13 September 2016 21:51 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] gpfs snapshots Hi Jez, It sounds to me like the functionality that you're _really_ looking for is an ability to to do automated snapshot management, similar to what's available on other storage systems. For example, "create a new snapshot of filesets X, Y, Z every 30 min, keep the last 16 snapshots". I've seen many examples of sysadmins rolling their own snapshot management system along those lines, and an ability to add an expiration string as a snapshot "comment" appears to be merely an aid in keeping such DIY snapshot management scripts a bit simpler -- not by much though. The end user would still be on the hook for some heavy lifting, in particular figuring out a way to run an equivalent of a cluster-aware cron with acceptable fault tolerance semantics. That is, if a snapshot creation is scheduled, only one node in the cluster should attempt to create the snapshot, but if that node fails, another node needs to step in (as opposed to skipping the scheduled snapshot creation). This is doable outside of GPFS, of course, but is not trivial. Architecturally, the right place to implement a fault-tolerant cluster-aware scheduling framework is GPFS itself, as the most complex pieces are already there. We have some plans for work along those lines, but if you want to reinforce the point with an RFE, that would be fine, too. yuri [Inactive hide details for Jez Tucker ---09/13/2016 02:10:31 AM---Hey Yuri, Perhaps an RFE here, but could I suggest there is]Jez Tucker ---09/13/2016 02:10:31 AM---Hey Yuri, Perhaps an RFE here, but could I suggest there is much value in From: Jez Tucker To: gpfsug-discuss at spectrumscale.org, Date: 09/13/2016 02:10 AM Subject: Re: [gpfsug-discuss] gpfs snapshots Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hey Yuri, Perhaps an RFE here, but could I suggest there is much value in adding a -c option to mmcrsnapshot? Use cases: mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "Before phase 2" and mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "expire:GMT-2017.04.21-16.00.00" Ideally also: mmcrsnapshot fs1 fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr Then it's easy to iterate over snapshots and subsequently mmdelsnapshot snaps which are no longer required. There are lots of methods to achieve this, but without external databases / suchlike, this is rather simple and effective for end users. Alternatively a second comment like -expire flag as user metadata may be preferential. Thoughts? Jez On 13/09/16 05:32, Valdis.Kletnieks at vt.edu wrote: On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: I guess we could reach snapid 100,000. It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. What you *do* want to do is make the snap *name* meaningful, using a timestamp or something to keep your sanity. mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. 
_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Jez Tucker Head of Research & Product Development Pixit Media www.pixitmedia.com This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email._______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss (See attached file: graycol.gif) _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From jtucker at pixitmedia.com Tue Sep 13 23:28:22 2016 From: jtucker at pixitmedia.com (Jez Tucker) Date: Tue, 13 Sep 2016 23:28:22 +0100 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: References: <20160912223019.s665s3ccoltdkq3l@ics.muni.cz> <20635.1473741144@turing-police.cc.vt.edu> <00e74006-8f99-280f-8832-1cee019c4b07@pixitmedia.com> Message-ID: <2336bbd5-39ca-dc0d-e1b4-7a301c6b9f2e@pixitmedia.com> Hey So yes, you're quite right - we have higher order fault tolerant cluster wide methods of dealing with such requirements already. However, I still think the end user should be empowered to be able construct such methods themselves if needs be. Yes, the comment is merely an aid [but also useful as a generic comment field] and as such could be utilised to encode basic metadata into the comment field. I'll log an RFE and see where we go from here. Cheers Jez On 13/09/16 21:51, Yuri L Volobuev wrote: > > Hi Jez, > > It sounds to me like the functionality that you're _really_ looking > for is an ability to to do automated snapshot management, similar to > what's available on other storage systems. For example, "create a new > snapshot of filesets X, Y, Z every 30 min, keep the last 16 > snapshots". I've seen many examples of sysadmins rolling their own > snapshot management system along those lines, and an ability to add an > expiration string as a snapshot "comment" appears to be merely an aid > in keeping such DIY snapshot management scripts a bit simpler -- not > by much though. The end user would still be on the hook for some heavy > lifting, in particular figuring out a way to run an equivalent of a > cluster-aware cron with acceptable fault tolerance semantics. That is, > if a snapshot creation is scheduled, only one node in the cluster > should attempt to create the snapshot, but if that node fails, another > node needs to step in (as opposed to skipping the scheduled snapshot > creation). 
This is doable outside of GPFS, of course, but is not > trivial. Architecturally, the right place to implement a > fault-tolerant cluster-aware scheduling framework is GPFS itself, as > the most complex pieces are already there. We have some plans for work > along those lines, but if you want to reinforce the point with an RFE, > that would be fine, too. > > yuri > > Inactive hide details for Jez Tucker ---09/13/2016 02:10:31 AM---Hey > Yuri, Perhaps an RFE here, but could I suggest there isJez Tucker > ---09/13/2016 02:10:31 AM---Hey Yuri, Perhaps an RFE here, but could I > suggest there is much value in > > From: Jez Tucker > To: gpfsug-discuss at spectrumscale.org, > Date: 09/13/2016 02:10 AM > Subject: Re: [gpfsug-discuss] gpfs snapshots > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------------------------------------------------ > > > > Hey Yuri, > > Perhaps an RFE here, but could I suggest there is much value in > adding a -c option to mmcrsnapshot? > > Use cases: > > mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c > "Before phase 2" > > and > > mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c > "expire:GMT-2017.04.21-16.00.00" > > Ideally also: mmcrsnapshot fs1 > fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr > > Then it's easy to iterate over snapshots and subsequently > mmdelsnapshot snaps which are no longer required. > There are lots of methods to achieve this, but without external > databases / suchlike, this is rather simple and effective for end users. > > Alternatively a second comment like -expire flag as user metadata may > be preferential. > > Thoughts? > > Jez > > > On 13/09/16 05:32, _Valdis.Kletnieks at vt.edu_ > wrote: > > On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: > I guess we could reach snapid 100,000. > > It probably stores the snap ID as a 32 or 64 bit int, so 100K > is peanuts. > > What you *do* want to do is make the snap *name* meaningful, using > a timestamp or something to keep your sanity. > > mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > _http://gpfsug.org/mailman/listinfo/gpfsug-discuss_ > > > > -- > Jez Tucker > Head of Research & Product Development > Pixit Media_ > __www.pixitmedia.com_ > > > This email is confidential in that it is intended for the exclusive > attention of the addressee(s) indicated. If you are not the intended > recipient, this email should not be read or disclosed to any other > person. Please notify the sender immediately and delete this email > from your computer system. 
Any opinions expressed are not necessarily > those of the company from which this email was sent and, whilst to the > best of our knowledge no viruses or defects exist, no responsibility > can be accepted for any loss or damage arising from its receipt or > subsequent use of this > email._______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Jez Tucker Head of Research & Product Development Pixit Media Mobile: +44 (0) 776 419 3820 www.pixitmedia.com -- This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 105 bytes Desc: not available URL: From service at metamodul.com Wed Sep 14 19:10:37 2016 From: service at metamodul.com (service at metamodul.com) Date: Wed, 14 Sep 2016 20:10:37 +0200 Subject: [gpfsug-discuss] gpfs snapshots Message-ID: Why not use a GPFS user extented attribut for that ? In a certain way i see GPFS as a database. ^_^ Hajo Von Samsung Mobile gesendet
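To picture that suggestion: GPFS exposes user extended attributes through the standard Linux tools, and since snapshots themselves are read-only the attribute would have to live on the live fileset (for example its junction directory). A hypothetical illustration, with the attribute name, value format and path all invented, and assuming the installed release accepts user.* attributes via setfattr:

  # Record the intended expiry of the most recent snapshot as a user xattr
  # on the fileset junction (placeholder names throughout):
  setfattr -n user.snapexpire -v "GMT-2017.04.21-16.00.00" /gpfs/gpfs0/myfilesetname

  # ...and read it back when deciding what to delete:
  getfattr -n user.snapexpire /gpfs/gpfs0/myfilesetname

Tracking more than one snapshot this way would mean encoding the snapshot name into the attribute name, at which point a comment field on the snapshot itself starts to look simpler.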

-------- Original message --------
From: Jez Tucker
Date: 2016.09.13 11:10 (GMT+01:00)
To: gpfsug-discuss at spectrumscale.org
Subject: Re: [gpfsug-discuss] gpfs snapshots
Hey Yuri, Perhaps an RFE here, but could I suggest there is much value in adding a -c option to mmcrsnapshot? Use cases: mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "Before phase 2" and mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c "expire:GMT-2017.04.21-16.00.00" Ideally also: mmcrsnapshot fs1 fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr Then it's easy to iterate over snapshots and subsequently mmdelsnapshot snaps which are no longer required. There are lots of methods to achieve this, but without external databases / suchlike, this is rather simple and effective for end users. Alternatively a second comment like -expire flag as user metadata may be preferential. Thoughts? Jez On 13/09/16 05:32, Valdis.Kletnieks at vt.edu wrote: On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: I guess we could reach snapid 100,000. It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. What you *do* want to do is make the snap *name* meaningful, using a timestamp or something to keep your sanity. mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Jez Tucker Head of Research & Product Development Pixit Media www.pixitmedia.com This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email. -------------- next part -------------- An HTML attachment was scrubbed... URL: From service at metamodul.com Wed Sep 14 19:21:20 2016 From: service at metamodul.com (service at metamodul.com) Date: Wed, 14 Sep 2016 20:21:20 +0200 Subject: [gpfsug-discuss] gpfs snapshots Message-ID: <4fojjlpuwqoalkffaahy7snf.1473877280415@email.android.com> I am missing since ages such a framework. I had my simple one devoloped on the gpfs callbacks which allowed me to have a centralized cron (HA) up to oracle also ?high available and ha nfs on Aix. Hajo Universal Inventor? -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From jtucker at pixitmedia.com Wed Sep 14 19:49:36 2016 From: jtucker at pixitmedia.com (Jez Tucker) Date: Wed, 14 Sep 2016 19:49:36 +0100 Subject: [gpfsug-discuss] gpfs snapshots In-Reply-To: References: Message-ID: Hi I still think I'm coming down on the side of simplistic ease of use: Example: [jtucker at pixstor ~]# mmlssnapshot mmfs1 Snapshots in file system mmfs1: Directory SnapId Status Created Fileset Comment @GMT-2016.09.13-23.00.14 551 Valid Wed Sep 14 00:00:02 2016 myproject Prior to phase 1 @GMT-2016.09.14-05.00.14 552 Valid Wed Sep 14 06:00:01 2016 myproject Added this and that @GMT-2016.09.14-11.00.14 553 Valid Wed Sep 14 12:00:01 2016 myproject Merged project2 @GMT-2016.09.14-17.00.14 554 Valid Wed Sep 14 18:00:02 2016 myproject Before clean of .xmp @GMT-2016.09.14-17.05.30 555 Valid Wed Sep 14 18:05:03 2016 myproject Archival Jez On 14/09/16 19:10, service at metamodul.com wrote: > Why not use a GPFS user extented attribut for that ? > In a certain way i see GPFS as a database. ^_^ > Hajo > > > > Von Samsung Mobile gesendet > > > -------- Urspr?ngliche Nachricht -------- > Von: Jez Tucker > Datum:2016.09.13 11:10 (GMT+01:00) > An: gpfsug-discuss at spectrumscale.org > Betreff: Re: [gpfsug-discuss] gpfs snapshots > > Hey Yuri, > > Perhaps an RFE here, but could I suggest there is much value in > adding a -c option to mmcrsnapshot? > > Use cases: > > mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c > "Before phase 2" > > and > > mmcrsnapshot myfsname @GMT-2016.09.13-10.00.00 -j myfilesetname -c > "expire:GMT-2017.04.21-16.00.00" > > Ideally also: mmcrsnapshot fs1 > fset1:snapA:expirestr,fset2:snapB:expirestr,fset3:snapC:expirestr > > Then it's easy to iterate over snapshots and subsequently > mmdelsnapshot snaps which are no longer required. > There are lots of methods to achieve this, but without external > databases / suchlike, this is rather simple and effective for end users. > > Alternatively a second comment like -expire flag as user metadata may > be preferential. > > Thoughts? > > Jez > > > On 13/09/16 05:32, Valdis.Kletnieks at vt.edu wrote: >> On Tue, 13 Sep 2016 00:30:19 +0200, Lukas Hejtmanek said: >>> I guess we could reach snapid 100,000. >> It probably stores the snap ID as a 32 or 64 bit int, so 100K is peanuts. >> >> What you *do* want to do is make the snap *name* meaningful, using >> a timestamp or something to keep your sanity. >> >> mmcrsnapshot fs923 `date +%y%m%d-%H%M` or similar. >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > -- > Jez Tucker > Head of Research & Product Development > Pixit Media > www.pixitmedia.com > -- Jez Tucker Head of Research & Product Development Pixit Media www.pixitmedia.com -- This email is confidential in that it is intended for the exclusive attention of the addressee(s) indicated. If you are not the intended recipient, this email should not be read or disclosed to any other person. Please notify the sender immediately and delete this email from your computer system. Any opinions expressed are not necessarily those of the company from which this email was sent and, whilst to the best of our knowledge no viruses or defects exist, no responsibility can be accepted for any loss or damage arising from its receipt or subsequent use of this email. -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From secretary at gpfsug.org Thu Sep 15 09:42:54 2016 From: secretary at gpfsug.org (Secretary GPFS UG) Date: Thu, 15 Sep 2016 09:42:54 +0100 Subject: [gpfsug-discuss] SSUG Meet the Devs - Cloud Workshop Message-ID: Hi everyone, Back by popular demand! We are holding a UK 'Meet the Developers' event to focus on Cloud topics. We are very lucky to have Dean Hildebrand, Master Inventor, Cloud Storage Software from IBM over in the UK to lead this session. IS IT FOR ME? Slightly different to the past meet the devs format, this is a cloud workshop aimed at looking how Spectrum Scale fits in the world of cloud. Rather than being a series of presentations and discussions led by IBM, this workshop aims to look at how Spectrum Scale can be used in cloud environments. This will include using Spectrum Scale as an infrastructure tool to power private cloud deployments. We will also look at the challenges of accessing data from cloud deployments and discuss ways in which this might be accomplished. If you are currently deploying OpenStack on Spectrum Scale, or plan to in the near future, then this workshop is for you. Also if you currently have Spectrum Scale and are wondering how you might get that data into cloud-enabled workloads or are currently doing so, then again you should attend. To ensure that the workshop is focused, numbers are limited and we will initially be limiting to 2 people per organisation/project/site. WHAT WILL BE DISCUSSED? Our topics for the day will include on-premise (private) clouds, on-premise self-service (public) clouds, off-premise clouds (Amazon etc.) as well as covering technologies including OpenStack, Docker, Kubernetes and security requirements around multi-tenancy. We probably don't have all the answers for these, but we'd like to understand the requirements and hear people's ideas. Please let us know what you would like to discuss when you register. Arrival is from 10:00 with discussion kicking off from 10:30. The agenda is open discussion though we do aim to talk over a number of key topics. We hope to have the ever popular (though usually late!) pizza for lunch. WHEN Thursday 20th October 2016 from 10:00 AM to 3:30 PM WHERE IT Services, University of Birmingham - Elms Road Edgbaston, Birmingham, B15 2TT REGISTER Please register for the event in advance: https://www.eventbrite.com/e/ssug-meet-the-devs-cloud-workshop-tickets-27725390389 [1] Numbers are limited and we will initially be limiting to 2 people per organisation/project/site. We look forward to seeing you there! -- Claire O'Toole Spectrum Scale/GPFS User Group Secretary +44 (0)7508 033896 www.spectrumscaleug.org Links: ------ [1] https://www.eventbrite.com/e/ssug-meet-the-devs-cloud-workshop-tickets-27725390389 -------------- next part -------------- An HTML attachment was scrubbed... URL: From peter.botcherby at kcl.ac.uk Thu Sep 15 09:45:47 2016 From: peter.botcherby at kcl.ac.uk (Botcherby, Peter) Date: Thu, 15 Sep 2016 08:45:47 +0000 Subject: [gpfsug-discuss] SSUG Meet the Devs - Cloud Workshop In-Reply-To: References: Message-ID: Hi Claire, Hope you are well - I will be away for this as going to Indonesia on the 18th October for my nephew?s wedding. Regards Peter From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Secretary GPFS UG Sent: 15 September 2016 09:43 To: gpfsug main discussion list Subject: [gpfsug-discuss] SSUG Meet the Devs - Cloud Workshop Hi everyone, Back by popular demand! 
We are holding a UK 'Meet the Developers' event to focus on Cloud topics. We are very lucky to have Dean Hildebrand, Master Inventor, Cloud Storage Software from IBM over in the UK to lead this session. IS IT FOR ME? Slightly different to the past meet the devs format, this is a cloud workshop aimed at looking how Spectrum Scale fits in the world of cloud. Rather than being a series of presentations and discussions led by IBM, this workshop aims to look at how Spectrum Scale can be used in cloud environments. This will include using Spectrum Scale as an infrastructure tool to power private cloud deployments. We will also look at the challenges of accessing data from cloud deployments and discuss ways in which this might be accomplished. If you are currently deploying OpenStack on Spectrum Scale, or plan to in the near future, then this workshop is for you. Also if you currently have Spectrum Scale and are wondering how you might get that data into cloud-enabled workloads or are currently doing so, then again you should attend. To ensure that the workshop is focused, numbers are limited and we will initially be limiting to 2 people per organisation/project/site. WHAT WILL BE DISCUSSED? Our topics for the day will include on-premise (private) clouds, on-premise self-service (public) clouds, off-premise clouds (Amazon etc.) as well as covering technologies including OpenStack, Docker, Kubernetes and security requirements around multi-tenancy. We probably don't have all the answers for these, but we'd like to understand the requirements and hear people's ideas. Please let us know what you would like to discuss when you register. Arrival is from 10:00 with discussion kicking off from 10:30. The agenda is open discussion though we do aim to talk over a number of key topics. We hope to have the ever popular (though usually late!) pizza for lunch. WHEN Thursday 20th October 2016 from 10:00 AM to 3:30 PM WHERE IT Services, University of Birmingham - Elms Road Edgbaston, Birmingham, B15 2TT REGISTER Please register for the event in advance: https://www.eventbrite.com/e/ssug-meet-the-devs-cloud-workshop-tickets-27725390389 Numbers are limited and we will initially be limiting to 2 people per organisation/project/site. We look forward to seeing you there! -- Claire O'Toole Spectrum Scale/GPFS User Group Secretary +44 (0)7508 033896 www.spectrumscaleug.org -------------- next part -------------- An HTML attachment was scrubbed... URL: From mimarsh2 at vt.edu Thu Sep 15 17:49:27 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Thu, 15 Sep 2016 12:49:27 -0400 Subject: [gpfsug-discuss] EDR and omnipath Message-ID: All, I see in the GPFS FAQ A6.3 the statement below. Is it possible to have GPFS do RDMA over EDR infiniband and non-RDMA communication over omnipath (IP over fabric) when each NSD server has an EDR card and a OPA card installed? RDMA is not supported on a node when both Mellanox HCAs and Intel Omni-Path HFIs are enabled for RDMA. -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Thu Sep 15 16:33:17 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Thu, 15 Sep 2016 15:33:17 +0000 Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes Message-ID: Why was mmdelsnapshot (and possibly other snapshot related commands) changed from: Mmdelsnapshot device snapshotname -j filesetname TO Mmdelsnapshot device filesetname:snapshotname ..between 4.2.0 and 4.2.1? It's mildly irritating to say the least! 
Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Fri Sep 16 15:21:58 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Fri, 16 Sep 2016 10:21:58 -0400 Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes In-Reply-To: References: Message-ID: I think at least the most popular old forms still work, even if the documentation and usage messages were scrubbed. So generally, for example, neither your scripts nor your fingers will break. ;-) From: "Sobey, Richard A" To: "'gpfsug-discuss at spectrumscale.org'" Date: 09/16/2016 07:02 AM Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes Sent by: gpfsug-discuss-bounces at spectrumscale.org Why was mmdelsnapshot (and possibly other snapshot related commands) changed from: Mmdelsnapshot device snapshotname ?j filesetname TO Mmdelsnapshot device filesetname:snapshotname ..between 4.2.0 and 4.2.1? It?s mildly irritating to say the least! Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From r.sobey at imperial.ac.uk Fri Sep 16 15:40:52 2016 From: r.sobey at imperial.ac.uk (Sobey, Richard A) Date: Fri, 16 Sep 2016 14:40:52 +0000 Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes In-Reply-To: References: Message-ID: Thanks Marc. Regrettably in this case, the only way I knew to delete a snapshot (listed below) has broken going from 3.5 to 4.2.1. Creating snaps has suffered the same fate. From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Marc A Kaplan Sent: 16 September 2016 15:22 To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes I think at least the most popular old forms still work, even if the documentation and usage messages were scrubbed. So generally, for example, neither your scripts nor your fingers will break. ;-) From: "Sobey, Richard A" > To: "'gpfsug-discuss at spectrumscale.org'" > Date: 09/16/2016 07:02 AM Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Why was mmdelsnapshot (and possibly other snapshot related commands) changed from: Mmdelsnapshot device snapshotname ?j filesetname TO Mmdelsnapshot device filesetname:snapshotname ..between 4.2.0 and 4.2.1? It?s mildly irritating to say the least! Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From Paul.Sanchez at deshaw.com Fri Sep 16 20:49:14 2016 From: Paul.Sanchez at deshaw.com (Sanchez, Paul) Date: Fri, 16 Sep 2016 19:49:14 +0000 Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes In-Reply-To: References: Message-ID: <3e1f02b30e1a49ef950de7910801f5d1@mbxtoa1.winmail.deshaw.com> The old syntax works unless have a colon in your snapshot names. In that case, the portion before the first colon will be interpreted as a fileset name. 
So if you use RFC 3339/ISO 8601 date/times, that?s a problem: The syntax for creating and deleting snapshots goes from this: mm{cr|del}snapshot fs100 SNAP at 2016-07-31T13:00:07Z ?j 1000466 to this: mm{cr|del}snapshot fs100 1000466:SNAP at 2016-07-31T13:00:07Z If you are dealing with filesystem level snapshots then you just need a leading colon: mm{cr|del}snapshot fs100 :SNAP at 2016-07-31T13:00:07Z Thx Paul From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Marc A Kaplan Sent: Friday, September 16, 2016 10:22 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes I think at least the most popular old forms still work, even if the documentation and usage messages were scrubbed. So generally, for example, neither your scripts nor your fingers will break. ;-) From: "Sobey, Richard A" > To: "'gpfsug-discuss at spectrumscale.org'" > Date: 09/16/2016 07:02 AM Subject: [gpfsug-discuss] Bit of a rant about snapshot command syntax changes Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Why was mmdelsnapshot (and possibly other snapshot related commands) changed from: Mmdelsnapshot device snapshotname ?j filesetname TO Mmdelsnapshot device filesetname:snapshotname ..between 4.2.0 and 4.2.1? It?s mildly irritating to say the least! Richard _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From viccornell at gmail.com Mon Sep 19 08:11:38 2016 From: viccornell at gmail.com (Vic Cornell) Date: Mon, 19 Sep 2016 08:11:38 +0100 Subject: [gpfsug-discuss] EDR and omnipath In-Reply-To: References: Message-ID: <87E46193-4D65-41A3-AB0E-B12987F6FFC3@gmail.com> Bump I can see no reason why that wouldn't work. But it would be nice to a have an official answer or evidence that it works. Vic > On 15 Sep 2016, at 5:49 pm, Brian Marshall wrote: > > All, > > I see in the GPFS FAQ A6.3 the statement below. Is it possible to have GPFS do RDMA over EDR infiniband and non-RDMA communication over omnipath (IP over fabric) when each NSD server has an EDR card and a OPA card installed? > > > > RDMA is not supported on a node when both Mellanox HCAs and Intel Omni-Path HFIs are enabled for RDMA. > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From mweil at wustl.edu Mon Sep 19 20:18:18 2016 From: mweil at wustl.edu (Matt Weil) Date: Mon, 19 Sep 2016 14:18:18 -0500 Subject: [gpfsug-discuss] increasing inode Message-ID: All, What exactly happens that makes the clients hang when a file set inodes are increased? ________________________________ The materials in this message are private and may contain Protected Healthcare Information or other information of a sensitive nature. If you are not the intended recipient, be advised that any unauthorized use, disclosure, copying or the taking of any action in reliance on the contents of this information is strictly prohibited. If you have received this email in error, please immediately notify the sender via telephone or return mail. 
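On the inode question just above: rather than explaining the mechanism, one commonly suggested mitigation is to move the expansion out of the busy window by pre-allocating inodes ahead of time. A hedged sketch, with the device and fileset names as placeholders:

  # Show the fileset's inode space, including maximum and allocated inodes:
  mmlsfileset gpfs0 scratchfset -L

  # Raise the limit and pre-allocate in one step during a quiet period, so
  # clients are not waiting on on-demand inode expansion later.
  # Format is MaxInodes[:InodesToPreallocate].
  mmchfileset gpfs0 scratchfset --inode-limit 4000000:2000000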
From aaron.s.knister at nasa.gov Mon Sep 19 21:34:53 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 19 Sep 2016 16:34:53 -0400 Subject: [gpfsug-discuss] EDR and omnipath In-Reply-To: <87E46193-4D65-41A3-AB0E-B12987F6FFC3@gmail.com> References: <87E46193-4D65-41A3-AB0E-B12987F6FFC3@gmail.com> Message-ID: <6f33c20f-ff8f-9ccc-4609-920b35058138@nasa.gov> I must admit, I'm curious as to why one cannot use GPFS with IB and OPA both in RDMA mode. Granted, I know very little about OPA but if it just presents as another verbs device I wonder why it wouldn't "Just work" as long as GPFS is configured correctly. -Aaron On 9/19/16 3:11 AM, Vic Cornell wrote: > Bump > > I can see no reason why that wouldn't work. But it would be nice to a > have an official answer or evidence that it works. > > Vic > > >> On 15 Sep 2016, at 5:49 pm, Brian Marshall > > wrote: >> >> All, >> >> I see in the GPFS FAQ A6.3 the statement below. Is it possible to >> have GPFS do RDMA over EDR infiniband and non-RDMA communication over >> omnipath (IP over fabric) when each NSD server has an EDR card and a >> OPA card installed? >> >> >> >> RDMA is not supported on a node when both Mellanox HCAs and Intel >> Omni-Path HFIs are enabled for RDMA. >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From oehmes at us.ibm.com Mon Sep 19 21:43:31 2016 From: oehmes at us.ibm.com (Sven Oehme) Date: Mon, 19 Sep 2016 20:43:31 +0000 Subject: [gpfsug-discuss] EDR and omnipath In-Reply-To: <6f33c20f-ff8f-9ccc-4609-920b35058138@nasa.gov> Message-ID: Because they both require a different distribution of OFED, which are mutual exclusive to install. in theory if you deploy plain OFED it might work, but that will be hard to find somebody to support. Sent from IBM Verse Aaron Knister --- Re: [gpfsug-discuss] EDR and omnipath --- From:"Aaron Knister" To:gpfsug-discuss at spectrumscale.orgDate:Mon, Sep 19, 2016 1:35 PMSubject:Re: [gpfsug-discuss] EDR and omnipath I must admit, I'm curious as to why one cannot use GPFS with IB and OPA both in RDMA mode. Granted, I know very little about OPA but if it just presents as another verbs device I wonder why it wouldn't "Just work" as long as GPFS is configured correctly.-AaronOn 9/19/16 3:11 AM, Vic Cornell wrote:> Bump>> I can see no reason why that wouldn't work. But it would be nice to a> have an official answer or evidence that it works.>> Vic>>>> On 15 Sep 2016, at 5:49 pm, Brian Marshall > > wrote:>>>> All,>>>> I see in the GPFS FAQ A6.3 the statement below. 
Is it possible to>> have GPFS do RDMA over EDR infiniband and non-RDMA communication over>> omnipath (IP over fabric) when each NSD server has an EDR card and a>> OPA card installed?>>>>>>>> RDMA is not supported on a node when both Mellanox HCAs and Intel>> Omni-Path HFIs are enabled for RDMA.>>>> _______________________________________________>> gpfsug-discuss mailing list>> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss>>>> _______________________________________________> gpfsug-discuss mailing list> gpfsug-discuss at spectrumscale.org> http://gpfsug.org/mailman/listinfo/gpfsug-discuss>-- Aaron KnisterNASA Center for Climate Simulation (Code 606.2)Goddard Space Flight Center(301) 286-2776_______________________________________________gpfsug-discuss mailing listgpfsug-discuss at spectrumscale.orghttp://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Mon Sep 19 21:55:32 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 19 Sep 2016 16:55:32 -0400 Subject: [gpfsug-discuss] EDR and omnipath In-Reply-To: References: Message-ID: Ah, that makes complete sense. Thanks! I had been doing some reading about OmniPath and for some reason was under the impression the OmniPath adapter could load itself as a driver under the verbs stack of OFED. Even so, that raises support concerns as you say. I wonder what folks are doing who have IB-based block storage fabrics but wanting to connect to OmniPath-based fabrics? I'm also curious how GNR customers would be able to serve both IB-based and an OmniPath-based fabrics over RDMA where performance is best? This is is along the lines of my GPFS protocol router question from the other day. -Aaron On 9/19/16 4:43 PM, Sven Oehme wrote: > Because they both require a different distribution of OFED, which are > mutual exclusive to install. > in theory if you deploy plain OFED it might work, but that will be hard > to find somebody to support. > > > Sent from IBM Verse > > Aaron Knister --- Re: [gpfsug-discuss] EDR and omnipath --- > > From: "Aaron Knister" > To: gpfsug-discuss at spectrumscale.org > Date: Mon, Sep 19, 2016 1:35 PM > Subject: Re: [gpfsug-discuss] EDR and omnipath > > ------------------------------------------------------------------------ > > I must admit, I'm curious as to why one cannot use GPFS with IB and OPA > both in RDMA mode. Granted, I know very little about OPA but if it just > presents as another verbs device I wonder why it wouldn't "Just work" as > long as GPFS is configured correctly. > > -Aaron > > On 9/19/16 3:11 AM, Vic Cornell wrote: >> Bump >> >> I can see no reason why that wouldn't work. But it would be nice to a >> have an official answer or evidence that it works. >> >> Vic >> >> >>> On 15 Sep 2016, at 5:49 pm, Brian Marshall >> > wrote: >>> >>> All, >>> >>> I see in the GPFS FAQ A6.3 the statement below. Is it possible to >>> have GPFS do RDMA over EDR infiniband and non-RDMA communication over >>> omnipath (IP over fabric) when each NSD server has an EDR card and a >>> OPA card installed? >>> >>> >>> >>> RDMA is not supported on a node when both Mellanox HCAs and Intel >>> Omni-Path HFIs are enabled for RDMA. 
>>> >>> _______________________________________________ >>> gpfsug-discuss mailing list >>> gpfsug-discuss at spectrumscale.org >>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From aaron.s.knister at nasa.gov Mon Sep 19 22:03:51 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Mon, 19 Sep 2016 17:03:51 -0400 Subject: [gpfsug-discuss] EDR and omnipath In-Reply-To: References: Message-ID: <99103c73-baf0-f421-f64d-1d5ee916d340@nasa.gov> Here's where I read about the inter-operability of the two: http://www.intel.com/content/dam/www/public/us/en/documents/white-papers/omni-path-storage-white-paper.pdf This is what Intel says: > In a multi-homed file system server, or in a Lustre Networking (LNet) or IP router, a single OpenFabrics Al- liance (OFA) software environment supporting both an Intel OPA HFI and a Mellanox* InfiniBand HCA is required. The OFA software stack is architected to support multiple tar- geted network types. Currently, the OFA stack simultaneously supports iWARP for Ethernet, RDMA over Converged Ethernet (RoCE), and InfiniBand networks, and the Intel OPA network has been added to that list. As the OS distributions implement their OFA stacks, it will be validated to simultaneously support both Intel OPA Host > Intel is working closely with the major Linux distributors, including Red Hat* and SUSE*, to ensure that Intel OPA support is integrated into their OFA implementation. Once this is accomplished, then simultaneous Mellanox InfiniBand and Intel OPA support will be present in the standard Linux distributions. So it seems as though Intel is relying on the OS vendors to bridge the support gap between them and Mellanox. -Aaron On 9/19/16 4:55 PM, Aaron Knister wrote: > Ah, that makes complete sense. Thanks! > > I had been doing some reading about OmniPath and for some reason was > under the impression the OmniPath adapter could load itself as a driver > under the verbs stack of OFED. Even so, that raises support concerns as > you say. > > I wonder what folks are doing who have IB-based block storage fabrics > but wanting to connect to OmniPath-based fabrics? > > I'm also curious how GNR customers would be able to serve both IB-based > and an OmniPath-based fabrics over RDMA where performance is best? This > is is along the lines of my GPFS protocol router question from the other > day. > > -Aaron > > On 9/19/16 4:43 PM, Sven Oehme wrote: >> Because they both require a different distribution of OFED, which are >> mutual exclusive to install. >> in theory if you deploy plain OFED it might work, but that will be hard >> to find somebody to support. 
>> >> >> Sent from IBM Verse >> >> Aaron Knister --- Re: [gpfsug-discuss] EDR and omnipath --- >> >> From: "Aaron Knister" >> To: gpfsug-discuss at spectrumscale.org >> Date: Mon, Sep 19, 2016 1:35 PM >> Subject: Re: [gpfsug-discuss] EDR and omnipath >> >> ------------------------------------------------------------------------ >> >> I must admit, I'm curious as to why one cannot use GPFS with IB and OPA >> both in RDMA mode. Granted, I know very little about OPA but if it just >> presents as another verbs device I wonder why it wouldn't "Just work" as >> long as GPFS is configured correctly. >> >> -Aaron >> >> On 9/19/16 3:11 AM, Vic Cornell wrote: >>> Bump >>> >>> I can see no reason why that wouldn't work. But it would be nice to a >>> have an official answer or evidence that it works. >>> >>> Vic >>> >>> >>>> On 15 Sep 2016, at 5:49 pm, Brian Marshall >>> > wrote: >>>> >>>> All, >>>> >>>> I see in the GPFS FAQ A6.3 the statement below. Is it possible to >>>> have GPFS do RDMA over EDR infiniband and non-RDMA communication over >>>> omnipath (IP over fabric) when each NSD server has an EDR card and a >>>> OPA card installed? >>>> >>>> >>>> >>>> RDMA is not supported on a node when both Mellanox HCAs and Intel >>>> Omni-Path HFIs are enabled for RDMA. >>>> >>>> _______________________________________________ >>>> gpfsug-discuss mailing list >>>> gpfsug-discuss at spectrumscale.org >>>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>> >>> >>> >>> _______________________________________________ >>> gpfsug-discuss mailing list >>> gpfsug-discuss at spectrumscale.org >>> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >>> >> >> -- >> Aaron Knister >> NASA Center for Climate Simulation (Code 606.2) >> Goddard Space Flight Center >> (301) 286-2776 >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From aaron.s.knister at nasa.gov Tue Sep 20 14:22:51 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Tue, 20 Sep 2016 09:22:51 -0400 Subject: [gpfsug-discuss] GPFS Routers In-Reply-To: References: <5F910253243E6A47B81A9A2EB424BBA101D137D1@NDMSMBX404.ndc.nasa.gov> <712e5024-e6eb-9195-d9cd-f59a9b145e60@nasa.gov> Message-ID: Hi Marc, Currently we serve three disparate infiniband fabrics with three separate sets of NSD servers all connected via FC to backend storage. I was exploring the idea of flipping that on its head and having one set of NSD servers but would like something akin to Lustre LNET routers to connect each fabric to the back-end NSD servers over IB. I know there's IB routers out there now but I'm quite drawn to the idea of a GPFS equivalent of Lustre LNET routers, having used them in the past. I suppose I could always smush some extra HCAs in the NSD servers and do it that way but that got really ugly when I started factoring in omnipath. Something like an LNET router would also be useful for GNR users who would like to present to both an IB and an OmniPath fabric over RDMA. 
-Aaron On 9/12/16 10:48 AM, Marc A Kaplan wrote: > Perhaps if you clearly describe what equipment and connections you have > in place and what you're trying to accomplish, someone on this board can > propose a solution. > > In principle, it's always possible to insert proxies/routers to "fake" > any two endpoints into "believing" they are communicating directly. > > > > > > From: Aaron Knister > To: > Date: 09/11/2016 08:01 PM > Subject: Re: [gpfsug-discuss] GPFS Routers > Sent by: gpfsug-discuss-bounces at spectrumscale.org > ------------------------------------------------------------------------ > > > > After some googling around, I wonder if perhaps what I'm thinking of was > an I/O forwarding layer that I understood was being developed for x86_64 > type machines rather than some type of GPFS protocol router or proxy. > > -Aaron > > On 9/11/16 5:02 PM, Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE > CORP] wrote: >> Hi Everyone, >> >> A while back I seem to recall hearing about a mechanism being developed >> that would function similarly to Lustre's LNET routers and effectively >> allow a single set of NSD servers to talk to multiple RDMA fabrics >> without requiring the NSD servers to have infiniband interfaces on each >> RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each >> fabric that would in effect proxy the RDMA requests to the NSD server. >> Does anyone know what I'm talking about? Just curious if it's still on >> the roadmap. >> >> -Aaron >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From makaplan at us.ibm.com Tue Sep 20 15:01:49 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Tue, 20 Sep 2016 10:01:49 -0400 Subject: [gpfsug-discuss] GPFS Routers In-Reply-To: References: <5F910253243E6A47B81A9A2EB424BBA101D137D1@NDMSMBX404.ndc.nasa.gov><712e5024-e6eb-9195-d9cd-f59a9b145e60@nasa.gov> Message-ID: Thanks for spelling out the situation more clearly. This is beyond my knowledge and expertise. But perhaps some other participants on this forum will chime in! I may be missing something, but asking "What is Lustre LNET?" via google does not yield good answers. It would be helpful to have some graphics (pictures!) of typical, useful configurations. Limiting myself to a few minutes of searching, I couldn't find any. I "get" that Lustre users/admin with lots of nodes and several switching fabrics find it useful, but beyond that... I guess the answer will be "Performance!" -- but the obvious question is: Why not "just" use IP - that is the Internetworking Protocol! So rather than sweat over LNET, why not improve IP to work better over several IBs? >From a user/customer point of view where "I needed this yesterday", short of having an "LNET for GPFS", I suggest considering reconfiguring your nodes, switches, storage to get better performance. 
If you need to buy some more hardware, so be it. --marc From: Aaron Knister To: Date: 09/20/2016 09:23 AM Subject: Re: [gpfsug-discuss] GPFS Routers Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi Marc, Currently we serve three disparate infiniband fabrics with three separate sets of NSD servers all connected via FC to backend storage. I was exploring the idea of flipping that on its head and having one set of NSD servers but would like something akin to Lustre LNET routers to connect each fabric to the back-end NSD servers over IB. I know there's IB routers out there now but I'm quite drawn to the idea of a GPFS equivalent of Lustre LNET routers, having used them in the past. I suppose I could always smush some extra HCAs in the NSD servers and do it that way but that got really ugly when I started factoring in omnipath. Something like an LNET router would also be useful for GNR users who would like to present to both an IB and an OmniPath fabric over RDMA. -Aaron On 9/12/16 10:48 AM, Marc A Kaplan wrote: > Perhaps if you clearly describe what equipment and connections you have > in place and what you're trying to accomplish, someone on this board can > propose a solution. > > In principle, it's always possible to insert proxies/routers to "fake" > any two endpoints into "believing" they are communicating directly. > > > > > > From: Aaron Knister > To: > Date: 09/11/2016 08:01 PM > Subject: Re: [gpfsug-discuss] GPFS Routers > Sent by: gpfsug-discuss-bounces at spectrumscale.org > ------------------------------------------------------------------------ > > > > After some googling around, I wonder if perhaps what I'm thinking of was > an I/O forwarding layer that I understood was being developed for x86_64 > type machines rather than some type of GPFS protocol router or proxy. > > -Aaron > > On 9/11/16 5:02 PM, Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE > CORP] wrote: >> Hi Everyone, >> >> A while back I seem to recall hearing about a mechanism being developed >> that would function similarly to Lustre's LNET routers and effectively >> allow a single set of NSD servers to talk to multiple RDMA fabrics >> without requiring the NSD servers to have infiniband interfaces on each >> RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each >> fabric that would in effect proxy the RDMA requests to the NSD server. >> Does anyone know what I'm talking about? Just curious if it's still on >> the roadmap. >> >> -Aaron >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From aaron.s.knister at nasa.gov Tue Sep 20 15:07:38 2016 From: aaron.s.knister at nasa.gov (Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]) Date: Tue, 20 Sep 2016 14:07:38 +0000 Subject: [gpfsug-discuss] GPFS Routers References: [gpfsug-discuss] GPFS Routers Message-ID: <5F910253243E6A47B81A9A2EB424BBA101D24844@NDMSMBX404.ndc.nasa.gov> Not sure if this image will go through but here's one I found: [X] The "Routers" are LNET routers. LNET is just the name of lustre's network stack. The LNET routers "route" the Lustre protocol between disparate network types (quadrics, Ethernet, myrinet, carrier pigeon). Packet loss on carrier pigeon is particularly brutal, though. From: Marc A Kaplan Sent: 9/20/16, 10:02 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] GPFS Routers Thanks for spelling out the situation more clearly. This is beyond my knowledge and expertise. But perhaps some other participants on this forum will chime in! I may be missing something, but asking "What is Lustre LNET?" via google does not yield good answers. It would be helpful to have some graphics (pictures!) of typical, useful configurations. Limiting myself to a few minutes of searching, I couldn't find any. I "get" that Lustre users/admin with lots of nodes and several switching fabrics find it useful, but beyond that... I guess the answer will be "Performance!" -- but the obvious question is: Why not "just" use IP - that is the Internetworking Protocol! So rather than sweat over LNET, why not improve IP to work better over several IBs? >From a user/customer point of view where "I needed this yesterday", short of having an "LNET for GPFS", I suggest considering reconfiguring your nodes, switches, storage to get better performance. If you need to buy some more hardware, so be it. --marc From: Aaron Knister To: Date: 09/20/2016 09:23 AM Subject: Re: [gpfsug-discuss] GPFS Routers Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi Marc, Currently we serve three disparate infiniband fabrics with three separate sets of NSD servers all connected via FC to backend storage. I was exploring the idea of flipping that on its head and having one set of NSD servers but would like something akin to Lustre LNET routers to connect each fabric to the back-end NSD servers over IB. I know there's IB routers out there now but I'm quite drawn to the idea of a GPFS equivalent of Lustre LNET routers, having used them in the past. I suppose I could always smush some extra HCAs in the NSD servers and do it that way but that got really ugly when I started factoring in omnipath. Something like an LNET router would also be useful for GNR users who would like to present to both an IB and an OmniPath fabric over RDMA. -Aaron On 9/12/16 10:48 AM, Marc A Kaplan wrote: > Perhaps if you clearly describe what equipment and connections you have > in place and what you're trying to accomplish, someone on this board can > propose a solution. > > In principle, it's always possible to insert proxies/routers to "fake" > any two endpoints into "believing" they are communicating directly. 
> > > > > > From: Aaron Knister > To: > Date: 09/11/2016 08:01 PM > Subject: Re: [gpfsug-discuss] GPFS Routers > Sent by: gpfsug-discuss-bounces at spectrumscale.org > ------------------------------------------------------------------------ > > > > After some googling around, I wonder if perhaps what I'm thinking of was > an I/O forwarding layer that I understood was being developed for x86_64 > type machines rather than some type of GPFS protocol router or proxy. > > -Aaron > > On 9/11/16 5:02 PM, Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE > CORP] wrote: >> Hi Everyone, >> >> A while back I seem to recall hearing about a mechanism being developed >> that would function similarly to Lustre's LNET routers and effectively >> allow a single set of NSD servers to talk to multiple RDMA fabrics >> without requiring the NSD servers to have infiniband interfaces on each >> RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each >> fabric that would in effect proxy the RDMA requests to the NSD server. >> Does anyone know what I'm talking about? Just curious if it's still on >> the roadmap. >> >> -Aaron >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Tue Sep 20 15:08:46 2016 From: aaron.s.knister at nasa.gov (Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE CORP]) Date: Tue, 20 Sep 2016 14:08:46 +0000 Subject: [gpfsug-discuss] GPFS Routers References: [gpfsug-discuss] GPFS Routers Message-ID: <5F910253243E6A47B81A9A2EB424BBA101D24881@NDMSMBX404.ndc.nasa.gov> Looks like the attachment got scrubbed. Here's the link http://docplayer.net/docs-images/39/19199001/images/7-0.png[X] From: aaron.s.knister at nasa.gov Sent: 9/20/16, 10:07 AM To: gpfsug main discussion list, gpfsug main discussion list Subject: Re: [gpfsug-discuss] GPFS Routers Not sure if this image will go through but here's one I found: [X] The "Routers" are LNET routers. LNET is just the name of lustre's network stack. The LNET routers "route" the Lustre protocol between disparate network types (quadrics, Ethernet, myrinet, carrier pigeon). Packet loss on carrier pigeon is particularly brutal, though. From: Marc A Kaplan Sent: 9/20/16, 10:02 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] GPFS Routers Thanks for spelling out the situation more clearly. This is beyond my knowledge and expertise. But perhaps some other participants on this forum will chime in! I may be missing something, but asking "What is Lustre LNET?" via google does not yield good answers. It would be helpful to have some graphics (pictures!) 
of typical, useful configurations. Limiting myself to a few minutes of searching, I couldn't find any. I "get" that Lustre users/admin with lots of nodes and several switching fabrics find it useful, but beyond that... I guess the answer will be "Performance!" -- but the obvious question is: Why not "just" use IP - that is the Internetworking Protocol! So rather than sweat over LNET, why not improve IP to work better over several IBs? >From a user/customer point of view where "I needed this yesterday", short of having an "LNET for GPFS", I suggest considering reconfiguring your nodes, switches, storage to get better performance. If you need to buy some more hardware, so be it. --marc From: Aaron Knister To: Date: 09/20/2016 09:23 AM Subject: Re: [gpfsug-discuss] GPFS Routers Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi Marc, Currently we serve three disparate infiniband fabrics with three separate sets of NSD servers all connected via FC to backend storage. I was exploring the idea of flipping that on its head and having one set of NSD servers but would like something akin to Lustre LNET routers to connect each fabric to the back-end NSD servers over IB. I know there's IB routers out there now but I'm quite drawn to the idea of a GPFS equivalent of Lustre LNET routers, having used them in the past. I suppose I could always smush some extra HCAs in the NSD servers and do it that way but that got really ugly when I started factoring in omnipath. Something like an LNET router would also be useful for GNR users who would like to present to both an IB and an OmniPath fabric over RDMA. -Aaron On 9/12/16 10:48 AM, Marc A Kaplan wrote: > Perhaps if you clearly describe what equipment and connections you have > in place and what you're trying to accomplish, someone on this board can > propose a solution. > > In principle, it's always possible to insert proxies/routers to "fake" > any two endpoints into "believing" they are communicating directly. > > > > > > From: Aaron Knister > To: > Date: 09/11/2016 08:01 PM > Subject: Re: [gpfsug-discuss] GPFS Routers > Sent by: gpfsug-discuss-bounces at spectrumscale.org > ------------------------------------------------------------------------ > > > > After some googling around, I wonder if perhaps what I'm thinking of was > an I/O forwarding layer that I understood was being developed for x86_64 > type machines rather than some type of GPFS protocol router or proxy. > > -Aaron > > On 9/11/16 5:02 PM, Knister, Aaron S. (GSFC-606.2)[COMPUTER SCIENCE > CORP] wrote: >> Hi Everyone, >> >> A while back I seem to recall hearing about a mechanism being developed >> that would function similarly to Lustre's LNET routers and effectively >> allow a single set of NSD servers to talk to multiple RDMA fabrics >> without requiring the NSD servers to have infiniband interfaces on each >> RDMA fabric. Rather, one would have a set of GPFS gateway nodes on each >> fabric that would in effect proxy the RDMA requests to the NSD server. >> Does anyone know what I'm talking about? Just curious if it's still on >> the roadmap. 
>> >> -Aaron >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Tue Sep 20 15:30:43 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Tue, 20 Sep 2016 10:30:43 -0400 Subject: [gpfsug-discuss] GPFS Routers In-Reply-To: <5F910253243E6A47B81A9A2EB424BBA101D24881@NDMSMBX404.ndc.nasa.gov> References: [gpfsug-discuss] GPFS Routers <5F910253243E6A47B81A9A2EB424BBA101D24881@NDMSMBX404.ndc.nasa.gov> Message-ID: Thanks. That example is simpler than I imagined. Question: If that was indeed your situation and you could afford it, why not just go totally with infiniband switching/routing? Are not the routers just a hack to connect Intel OPA to IB? Ref: https://community.mellanox.com/docs/DOC-2384#jive_content_id_Network_Topology_Design -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhejtman at ics.muni.cz Tue Sep 20 16:07:12 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Tue, 20 Sep 2016 17:07:12 +0200 Subject: [gpfsug-discuss] CES and nfs pseudo root Message-ID: <20160920150712.2v73hsf7pzrqb3g4@ics.muni.cz> Hello, ganesha allows to specify pseudo root for each export using Pseudo="path". mmnfs export sets pseudo path the same as export dir, e.g., I want to export /mnt/nfs, Pseudo is set to '/mnt/nfs' as well. Can I set somehow Pseudo to '/'? -- Luk?? Hejtm?nek From stef.coene at docum.org Tue Sep 20 18:42:57 2016 From: stef.coene at docum.org (Stef Coene) Date: Tue, 20 Sep 2016 19:42:57 +0200 Subject: [gpfsug-discuss] Ubuntu client Message-ID: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> Hi, I just installed 4.2.1 on 2 RHEL 7.2 servers without any issue. But I also need 2 clients on Ubuntu 14.04. I installed the GPFS client on the Ubuntu server and used mmbuildgpl to build the required kernel modules. ssh keys are exchanged between GPFS servers and the client. But I can't add the node: [root at gpfs01 ~]# mmaddnode -N client1 Tue Sep 20 19:40:09 CEST 2016: mmaddnode: Processing node client1 mmremote: The CCR environment could not be initialized on node client1. mmaddnode: The CCR environment could not be initialized on node client1. mmaddnode: mmaddnode quitting. None of the specified nodes are valid. mmaddnode: Command failed. Examine previous error messages to determine cause. I don't see any error in /var/mmfs on client and server. What can I try to debug this error? 
Stef From stef.coene at docum.org Tue Sep 20 18:47:47 2016 From: stef.coene at docum.org (Stef Coene) Date: Tue, 20 Sep 2016 19:47:47 +0200 Subject: [gpfsug-discuss] Ubuntu client In-Reply-To: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> References: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> Message-ID: <3727524d-aa94-a09e-ebf7-a5d4e1c6f301@docum.org> On 09/20/2016 07:42 PM, Stef Coene wrote: > Hi, > > I just installed 4.2.1 on 2 RHEL 7.2 servers without any issue. > But I also need 2 clients on Ubuntu 14.04. > I installed the GPFS client on the Ubuntu server and used mmbuildgpl to > build the required kernel modules. > ssh keys are exchanged between GPFS servers and the client. > > But I can't add the node: > [root at gpfs01 ~]# mmaddnode -N client1 > Tue Sep 20 19:40:09 CEST 2016: mmaddnode: Processing node client1 > mmremote: The CCR environment could not be initialized on node client1. > mmaddnode: The CCR environment could not be initialized on node client1. > mmaddnode: mmaddnode quitting. None of the specified nodes are valid. > mmaddnode: Command failed. Examine previous error messages to determine > cause. > > I don't see any error in /var/mmfs on client and server. > > What can I try to debug this error? Pfff, problem solved. I tailed the logs in /var/adm/ras and found out there was a type in /etc/hosts so the hostname of the client was unresolvable. Stef From YARD at il.ibm.com Tue Sep 20 20:03:39 2016 From: YARD at il.ibm.com (Yaron Daniel) Date: Tue, 20 Sep 2016 22:03:39 +0300 Subject: [gpfsug-discuss] Ubuntu client In-Reply-To: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> References: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> Message-ID: Hi Check that kernel symbols are installed too Regards Yaron Daniel 94 Em Ha'Moshavot Rd Server, Storage and Data Services - Team Leader Petach Tiqva, 49527 Global Technology Services Israel Phone: +972-3-916-5672 Fax: +972-3-916-5672 Mobile: +972-52-8395593 e-mail: yard at il.ibm.com IBM Israel From: Stef Coene To: gpfsug main discussion list Date: 09/20/2016 08:43 PM Subject: [gpfsug-discuss] Ubuntu client Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi, I just installed 4.2.1 on 2 RHEL 7.2 servers without any issue. But I also need 2 clients on Ubuntu 14.04. I installed the GPFS client on the Ubuntu server and used mmbuildgpl to build the required kernel modules. ssh keys are exchanged between GPFS servers and the client. But I can't add the node: [root at gpfs01 ~]# mmaddnode -N client1 Tue Sep 20 19:40:09 CEST 2016: mmaddnode: Processing node client1 mmremote: The CCR environment could not be initialized on node client1. mmaddnode: The CCR environment could not be initialized on node client1. mmaddnode: mmaddnode quitting. None of the specified nodes are valid. mmaddnode: Command failed. Examine previous error messages to determine cause. I don't see any error in /var/mmfs on client and server. What can I try to debug this error? Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: not available Type: image/gif Size: 1851 bytes Desc: not available URL: From olaf.weiser at de.ibm.com Wed Sep 21 04:35:57 2016 From: olaf.weiser at de.ibm.com (Olaf Weiser) Date: Wed, 21 Sep 2016 05:35:57 +0200 Subject: [gpfsug-discuss] Ubuntu client In-Reply-To: References: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> Message-ID: An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 1851 bytes Desc: not available URL: From stef.coene at docum.org Wed Sep 21 07:03:05 2016 From: stef.coene at docum.org (Stef Coene) Date: Wed, 21 Sep 2016 08:03:05 +0200 Subject: [gpfsug-discuss] Ubuntu client In-Reply-To: References: <5985d614-ebe5-8c85-ec4b-02961e074502@docum.org> Message-ID: <01a37d7a-b5ef-cb3e-5ccb-d5f942df6487@docum.org> On 09/21/2016 05:35 AM, Olaf Weiser wrote: > CCR issues are often related to DNS issues, so check, that you Ubuntu > nodes can resolve the existing nodes accordingly and vise versa > in one line: .. all nodes must be resolvable on every node It was a type in the hostname and /etc/hosts. So problem solved. Stef From xhejtman at ics.muni.cz Wed Sep 21 20:09:32 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Wed, 21 Sep 2016 21:09:32 +0200 Subject: [gpfsug-discuss] CES NFS with Kerberos Message-ID: <20160921190932.fibmmccccs5kit6x@ics.muni.cz> Hello, does nfs server (ganesha) work for someone with Kerberos authentication? I got random permission denied: :/mnt/nfs-test/tmp# for i in `seq 1 20`; do rm testf; dd if=/dev/zero of=testf bs=1M count=100000; done 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 642.849 s, 163 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 925.326 s, 113 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 762.749 s, 137 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 860.608 s, 122 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 788.62 s, 133 MB/s dd: error writing ?testf?: Permission denied 51949+0 records in 51948+0 records out 54471426048 bytes (54 GB) copied, 566.667 s, 96.1 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 1082.63 s, 96.9 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 1080.65 s, 97.0 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 949.683 s, 110 MB/s dd: error writing ?testf?: Permission denied 30076+0 records in 30075+0 records out 31535923200 bytes (32 GB) copied, 308.009 s, 102 MB/s dd: error writing ?testf?: Permission denied 89837+0 records in 89836+0 records out 94199873536 bytes (94 GB) copied, 976.368 s, 96.5 MB/s It seems that it is a bug in ganesha: http://permalink.gmane.org/gmane.comp.file-systems.nfs.ganesha.devel/2000 but it is still not resolved. -- Luk?? Hejtm?nek From Greg.Lehmann at csiro.au Wed Sep 21 23:34:09 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Wed, 21 Sep 2016 22:34:09 +0000 Subject: [gpfsug-discuss] CES NFS with Kerberos In-Reply-To: <20160921190932.fibmmccccs5kit6x@ics.muni.cz> References: <20160921190932.fibmmccccs5kit6x@ics.muni.cz> Message-ID: <94bedafe10a9473c93b0bcc5d34cbea6@exch1-cdc.nexus.csiro.au> It may not be NFS. Check your GPFS logs too. 
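For example, something along these lines on the CES node serving the mount (a sketch only; mmfs.log.latest under /var/adm/ras is the usual spot):

tail -100 /var/adm/ras/mmfs.log.latest
# if the log shows nothing useful, a gpfs.snap from that node collects far more detail
gpfs.snap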
-----Original Message----- From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Lukas Hejtmanek Sent: Thursday, 22 September 2016 5:10 AM To: gpfsug-discuss at spectrumscale.org Subject: [gpfsug-discuss] CES NFS with Kerberos Hello, does nfs server (ganesha) work for someone with Kerberos authentication? I got random permission denied: :/mnt/nfs-test/tmp# for i in `seq 1 20`; do rm testf; dd if=/dev/zero of=testf bs=1M count=100000; done 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 642.849 s, 163 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 925.326 s, 113 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 762.749 s, 137 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 860.608 s, 122 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 788.62 s, 133 MB/s dd: error writing ?testf?: Permission denied 51949+0 records in 51948+0 records out 54471426048 bytes (54 GB) copied, 566.667 s, 96.1 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 1082.63 s, 96.9 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 1080.65 s, 97.0 MB/s 100000+0 records in 100000+0 records out 104857600000 bytes (105 GB) copied, 949.683 s, 110 MB/s dd: error writing ?testf?: Permission denied 30076+0 records in 30075+0 records out 31535923200 bytes (32 GB) copied, 308.009 s, 102 MB/s dd: error writing ?testf?: Permission denied 89837+0 records in 89836+0 records out 94199873536 bytes (94 GB) copied, 976.368 s, 96.5 MB/s It seems that it is a bug in ganesha: http://permalink.gmane.org/gmane.comp.file-systems.nfs.ganesha.devel/2000 but it is still not resolved. -- Luk?? Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From xhejtman at ics.muni.cz Thu Sep 22 09:25:09 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Thu, 22 Sep 2016 10:25:09 +0200 Subject: [gpfsug-discuss] CES NFS with Kerberos In-Reply-To: <94bedafe10a9473c93b0bcc5d34cbea6@exch1-cdc.nexus.csiro.au> References: <20160921190932.fibmmccccs5kit6x@ics.muni.cz> <94bedafe10a9473c93b0bcc5d34cbea6@exch1-cdc.nexus.csiro.au> Message-ID: <20160922082509.rc53tseeovjnixtz@ics.muni.cz> Hello, thanks, I do not see any error in GPFS logs. The link, I posted below is not related to GPFS at all, it seems that it is bug in ganesha. On Wed, Sep 21, 2016 at 10:34:09PM +0000, Greg.Lehmann at csiro.au wrote: > It may not be NFS. Check your GPFS logs too. > > -----Original Message----- > From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Lukas Hejtmanek > Sent: Thursday, 22 September 2016 5:10 AM > To: gpfsug-discuss at spectrumscale.org > Subject: [gpfsug-discuss] CES NFS with Kerberos > > Hello, > > does nfs server (ganesha) work for someone with Kerberos authentication? 
> > I got random permission denied: > :/mnt/nfs-test/tmp# for i in `seq 1 20`; do rm testf; dd if=/dev/zero of=testf bs=1M count=100000; done > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 642.849 s, 163 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 925.326 s, 113 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 762.749 s, 137 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 860.608 s, 122 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 788.62 s, 133 MB/s > dd: error writing ?testf?: Permission denied > 51949+0 records in > 51948+0 records out > 54471426048 bytes (54 GB) copied, 566.667 s, 96.1 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 1082.63 s, 96.9 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 1080.65 s, 97.0 MB/s > 100000+0 records in > 100000+0 records out > 104857600000 bytes (105 GB) copied, 949.683 s, 110 MB/s > dd: error writing ?testf?: Permission denied > 30076+0 records in > 30075+0 records out > 31535923200 bytes (32 GB) copied, 308.009 s, 102 MB/s > dd: error writing ?testf?: Permission denied > 89837+0 records in > 89836+0 records out > 94199873536 bytes (94 GB) copied, 976.368 s, 96.5 MB/s > > It seems that it is a bug in ganesha: > http://permalink.gmane.org/gmane.comp.file-systems.nfs.ganesha.devel/2000 > > but it is still not resolved. > > -- > Luk?? Hejtm?nek > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -- Luk?? Hejtm?nek From stef.coene at docum.org Thu Sep 22 19:36:48 2016 From: stef.coene at docum.org (Stef Coene) Date: Thu, 22 Sep 2016 20:36:48 +0200 Subject: [gpfsug-discuss] Blocksize Message-ID: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Hi, Is it needed to specify a different blocksize for the system pool that holds the metadata? IBM recommends a 1 MB blocksize for the file system. But I wonder a smaller blocksize (256 KB or so) for metadata is a good idea or not... Stef From eric.wonderley at vt.edu Thu Sep 22 20:07:30 2016 From: eric.wonderley at vt.edu (J. Eric Wonderley) Date: Thu, 22 Sep 2016 15:07:30 -0400 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: It defaults to 4k: mmlsfs testbs8M -i flag value description ------------------- ------------------------ ----------------------------------- -i 4096 Inode size in bytes I think you can make as small as 512b. Gpfs will store very small files in the inode. Typically you want your average file size to be your blocksize and your filesystem has one blocksize and one inodesize. On Thu, Sep 22, 2016 at 2:36 PM, Stef Coene wrote: > Hi, > > Is it needed to specify a different blocksize for the system pool that > holds the metadata? > > IBM recommends a 1 MB blocksize for the file system. > But I wonder a smaller blocksize (256 KB or so) for metadata is a good > idea or not... 
> > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ewahl at osc.edu Thu Sep 22 20:19:00 2016 From: ewahl at osc.edu (Wahl, Edward) Date: Thu, 22 Sep 2016 19:19:00 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: <9DA9EC7A281AC7428A9618AFDC49049958EFBB06@CIO-KRC-D1MBX02.osuad.osu.edu> This is a great idea. However there are quite a few other things to consider: -max file count? If you need say a couple of billion files, this will affect things. -wish to store small files in the system pool in late model SS/GPFS? -encryption? No data will be stored in the system pool so large blocks for small file storage in system is pointless. -system pool replication? -HDD vs SSD for system pool? -xxD or array tuning recommendations from your vendor? -streaming vs random IO? Do you have a single dedicated app that has performance like xxx? -probably more I can't think of off the top of my head. etc etc Ed ________________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Stef Coene [stef.coene at docum.org] Sent: Thursday, September 22, 2016 2:36 PM To: gpfsug main discussion list Subject: [gpfsug-discuss] Blocksize Hi, Is it needed to specify a different blocksize for the system pool that holds the metadata? IBM recommends a 1 MB blocksize for the file system. But I wonder a smaller blocksize (256 KB or so) for metadata is a good idea or not... Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From janfrode at tanso.net Thu Sep 22 20:25:03 2016 From: janfrode at tanso.net (Jan-Frode Myklebust) Date: Thu, 22 Sep 2016 21:25:03 +0200 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: https://www.ibm.com/developerworks/community/forums/html/topic?id=77777777-0000-0000-0000-000014774266 "Use 256K. Anything smaller makes allocation blocks for the inode file inefficient. Anything larger wastes space for directories. These are the two largest consumers of metadata space." --dlmcnabb A bit old, but I would assume it still applies. -jf On Thu, Sep 22, 2016 at 8:36 PM, Stef Coene wrote: > Hi, > > Is it needed to specify a different blocksize for the system pool that > holds the metadata? > > IBM recommends a 1 MB blocksize for the file system. > But I wonder a smaller blocksize (256 KB or so) for metadata is a good > idea or not... > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From stef.coene at docum.org Thu Sep 22 20:29:43 2016 From: stef.coene at docum.org (Stef Coene) Date: Thu, 22 Sep 2016 21:29:43 +0200 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: On 09/22/2016 09:07 PM, J. 
Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. So I calculated with IBM that a 1 MB or 2 MB block size is best. But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef From luis.bolinches at fi.ibm.com Thu Sep 22 20:37:02 2016 From: luis.bolinches at fi.ibm.com (Luis Bolinches) Date: Thu, 22 Sep 2016 19:37:02 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: , <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: An HTML attachment was scrubbed... URL: From S.J.Thompson at bham.ac.uk Thu Sep 22 21:02:24 2016 From: S.J.Thompson at bham.ac.uk (Simon Thompson (Research Computing - IT Services)) Date: Thu, 22 Sep 2016 20:02:24 +0000 Subject: [gpfsug-discuss] SSUG Meet the Devs - Cloud Workshop In-Reply-To: References: Message-ID: We are down to our last few places, so if you do intend to attend, I encourage you to register now! Simon ________________________________ From: gpfsug-discuss-bounces at spectrumscale.org [gpfsug-discuss-bounces at spectrumscale.org] on behalf of Secretary GPFS UG [secretary at gpfsug.org] Sent: 15 September 2016 09:42 To: gpfsug main discussion list Subject: [gpfsug-discuss] SSUG Meet the Devs - Cloud Workshop Hi everyone, Back by popular demand! We are holding a UK 'Meet the Developers' event to focus on Cloud topics. We are very lucky to have Dean Hildebrand, Master Inventor, Cloud Storage Software from IBM over in the UK to lead this session. IS IT FOR ME? Slightly different to the past meet the devs format, this is a cloud workshop aimed at looking how Spectrum Scale fits in the world of cloud. Rather than being a series of presentations and discussions led by IBM, this workshop aims to look at how Spectrum Scale can be used in cloud environments. This will include using Spectrum Scale as an infrastructure tool to power private cloud deployments. We will also look at the challenges of accessing data from cloud deployments and discuss ways in which this might be accomplished. If you are currently deploying OpenStack on Spectrum Scale, or plan to in the near future, then this workshop is for you. Also if you currently have Spectrum Scale and are wondering how you might get that data into cloud-enabled workloads or are currently doing so, then again you should attend. To ensure that the workshop is focused, numbers are limited and we will initially be limiting to 2 people per organisation/project/site. WHAT WILL BE DISCUSSED? Our topics for the day will include on-premise (private) clouds, on-premise self-service (public) clouds, off-premise clouds (Amazon etc.) as well as covering technologies including OpenStack, Docker, Kubernetes and security requirements around multi-tenancy. We probably don't have all the answers for these, but we'd like to understand the requirements and hear people's ideas. Please let us know what you would like to discuss when you register. Arrival is from 10:00 with discussion kicking off from 10:30. The agenda is open discussion though we do aim to talk over a number of key topics. 
We hope to have the ever popular (though usually late!) pizza for lunch. WHEN Thursday 20th October 2016 from 10:00 AM to 3:30 PM WHERE IT Services, University of Birmingham - Elms Road Edgbaston, Birmingham, B15 2TT REGISTER Please register for the event in advance: https://www.eventbrite.com/e/ssug-meet-the-devs-cloud-workshop-tickets-27725390389 Numbers are limited and we will initially be limiting to 2 people per organisation/project/site. We look forward to seeing you there! -- Claire O'Toole Spectrum Scale/GPFS User Group Secretary +44 (0)7508 033896 www.spectrumscaleug.org -------------- next part -------------- An HTML attachment was scrubbed... URL:
From makaplan at us.ibm.com Thu Sep 22 21:25:10 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Thu, 22 Sep 2016 16:25:10 -0400 Subject: [gpfsug-discuss] Blocksize and space and performance for Metadata, release 4.2.x In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: There have been a few changes over the years that may invalidate some of the old advice about metadata and disk allocations there for. These have been phased in over the last few years, I am discussing the present situation for release 4.2.x 1) Inode size. Used to be 512. Now you can set the inodesize at mmcrfs time. Defaults to 4096. 2) Data in inode. If it fits, then the inode holds the data. Since a 512 byte inode still works, you can have more than 3.5KB of data in a 4KB inode. 3) Extended Attributes in Inode. Again, if it fits... Extended attributes used to be stored in a separate file of metadata. So extended attributes performance is way better than the old days. 4) (small) Directories in Inode. If it fits, the inode of a directory can hold the directory entries. That gives you about 2x performance on directory reads, for smallish directories. 5) Big directory blocks. Directories used to use a maximum of 32KB per block, potentially wasting a lot of space and yielding poor performance for large directories. Now directory blocks are the lesser of metadata-blocksize and 256KB. 6) Big directories are shrinkable. Used to be directories would grow in 32KB chunks but never shrink. Yup, even an almost(?) "empty" directory would remain the size the directory had to be at its lifetime maximum. That means just a few remaining entries could be "sprinkled" over many directory blocks. (See also 5.) But now directories autoshrink to avoid wasteful sparsity. Last I looked, the implementation just stopped short of "pushing" tiny directories back into the inode. But a huge directory can be shrunk down to a single (meta)data block. (See --compact in the docs.) --marc of GPFS -------------- next part -------------- An HTML attachment was scrubbed... URL:
From volobuev at us.ibm.com Thu Sep 22 21:49:32 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Thu, 22 Sep 2016 13:49:32 -0700 Subject: Re: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: The current (V4.2+) levels of code support bigger directory block sizes, so it's no longer an issue with something like 1M metadata block size. In fact, there isn't a whole lot of difference between 256K and 1M metadata block sizes, either would work fine. There isn't really a downside in selecting a different block size for metadata though. Inode size (mmcrfs -i option) is orthogonal to the metadata block size selection. We do strongly recommend using 4K inodes to anyone. There's the obvious downside of needing more metadata storage for inodes, but the advantages are significant.
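As a minimal sketch only (device name, stanza file and block sizes are placeholders, not a recommendation for any particular array):

# create with an independent metadata block size and 4K inodes
mmcrfs gpfs0 -F nsd.stanza -B 1M --metadata-block-size 1M -i 4096
# confirm what was actually set
mmlsfs gpfs0 -B -i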
yuri From: Jan-Frode Myklebust To: gpfsug main discussion list , Date: 09/22/2016 12:25 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org https://www.ibm.com/developerworks/community/forums/html/topic?id=77777777-0000-0000-0000-000014774266 "Use 256K. Anything smaller makes allocation blocks for the inode file inefficient. Anything larger wastes space for directories. These are the two largest consumers of metadata space." --dlmcnabb A bit old, but I would assume it still applies. ? -jf On Thu, Sep 22, 2016 at 8:36 PM, Stef Coene wrote: Hi, Is it needed to specify a different blocksize for the system pool that holds the metadata? IBM recommends a 1 MB blocksize for the file system. But I wonder a smaller blocksize (256 KB or so) for metadata is a good idea or not... Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From Mark.Bush at siriuscom.com Fri Sep 23 02:48:44 2016 From: Mark.Bush at siriuscom.com (Mark.Bush at siriuscom.com) Date: Fri, 23 Sep 2016 01:48:44 +0000 Subject: [gpfsug-discuss] Learn a new cluster Message-ID: <7E08F93E-F018-4C00-A78E-71EFDBEAC87C@siriuscom.com> What commands would you run to learn all you need to know about a cluster you?ve never seen before? Captain Obvious (me) says: mmlscluster mmlsconfig mmlsnode mmlsnsd mmlsfs all What others? Mark R. Bush | Solutions Architect This message (including any attachments) is intended only for the use of the individual or entity to which it is addressed and may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law. If you are not the intended recipient, you are hereby notified that any use, dissemination, distribution, or copying of this communication is strictly prohibited. This message may be viewed by parties at Sirius Computer Solutions other than those named in the message header. This message does not contain an official representation of Sirius Computer Solutions. If you have received this communication in error, notify Sirius Computer Solutions immediately and (i) destroy this message if a facsimile or (ii) delete this message immediately if this is an electronic communication. Thank you. Sirius Computer Solutions -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Fri Sep 23 02:50:52 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Thu, 22 Sep 2016 21:50:52 -0400 Subject: [gpfsug-discuss] Learn a new cluster In-Reply-To: <7E08F93E-F018-4C00-A78E-71EFDBEAC87C@siriuscom.com> References: <7E08F93E-F018-4C00-A78E-71EFDBEAC87C@siriuscom.com> Message-ID: <1ff27b42-aead-fafa-5415-520334d299c1@nasa.gov> Perhaps a gpfs.snap? This could tell you a *lot* about a cluster. -Aaron On 9/22/16 9:48 PM, Mark.Bush at siriuscom.com wrote: > What commands would you run to learn all you need to know about a > cluster you?ve never seen before? 
> > Captain Obvious (me) says: > > mmlscluster > > mmlsconfig > > mmlsnode > > mmlsnsd > > mmlsfs all > > > > What others? > > > > > > Mark R. Bush | Solutions Architect > > > > This message (including any attachments) is intended only for the use of > the individual or entity to which it is addressed and may contain > information that is non-public, proprietary, privileged, confidential, > and exempt from disclosure under applicable law. If you are not the > intended recipient, you are hereby notified that any use, dissemination, > distribution, or copying of this communication is strictly prohibited. > This message may be viewed by parties at Sirius Computer Solutions other > than those named in the message header. This message does not contain an > official representation of Sirius Computer Solutions. If you have > received this communication in error, notify Sirius Computer Solutions > immediately and (i) destroy this message if a facsimile or (ii) delete > this message immediately if this is an electronic communication. Thank you. > > Sirius Computer Solutions > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776 From Greg.Lehmann at csiro.au Fri Sep 23 02:53:14 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Fri, 23 Sep 2016 01:53:14 +0000 Subject: [gpfsug-discuss] Learn a new cluster In-Reply-To: <7E08F93E-F018-4C00-A78E-71EFDBEAC87C@siriuscom.com> References: <7E08F93E-F018-4C00-A78E-71EFDBEAC87C@siriuscom.com> Message-ID: <40b22b40d6ed4e38be115e9f6ae8d48d@exch1-cdc.nexus.csiro.au> Nice question. I?d also look at the non-GPFS settings IBM recommend in various places like the FAQ for things like ssh, network, etc. The importance of these is variable depending on cluster size/network configuration etc. Cheers, Greg From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Mark.Bush at siriuscom.com Sent: Friday, 23 September 2016 11:49 AM To: gpfsug main discussion list Subject: [gpfsug-discuss] Learn a new cluster What commands would you run to learn all you need to know about a cluster you?ve never seen before? Captain Obvious (me) says: mmlscluster mmlsconfig mmlsnode mmlsnsd mmlsfs all What others? Mark R. Bush | Solutions Architect This message (including any attachments) is intended only for the use of the individual or entity to which it is addressed and may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law. If you are not the intended recipient, you are hereby notified that any use, dissemination, distribution, or copying of this communication is strictly prohibited. This message may be viewed by parties at Sirius Computer Solutions other than those named in the message header. This message does not contain an official representation of Sirius Computer Solutions. If you have received this communication in error, notify Sirius Computer Solutions immediately and (i) destroy this message if a facsimile or (ii) delete this message immediately if this is an electronic communication. Thank you. Sirius Computer Solutions -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From ulmer at ulmer.org Fri Sep 23 17:31:59 2016 From: ulmer at ulmer.org (Stephen Ulmer) Date: Fri, 23 Sep 2016 12:31:59 -0400 Subject: [gpfsug-discuss] Learn a new cluster In-Reply-To: <1ff27b42-aead-fafa-5415-520334d299c1@nasa.gov> References: <7E08F93E-F018-4C00-A78E-71EFDBEAC87C@siriuscom.com> <1ff27b42-aead-fafa-5415-520334d299c1@nasa.gov> Message-ID: <078081B8-E50E-46BE-B3AC-4C1DB6D963E1@ulmer.org> This was going to be my exact suggestion. My short to-learn list includes learn how to look inside a gpfs.snap for what I want to know. I?ve found the ability to do this with other snapshot bundles very useful in the past (for example I?ve used snap on AIX rather than my own scripts in some cases). Do be aware the gpfs.snap (and actually most ?create a bundle for support? commands on most platforms) are a little heavy. Liberty, -- Stephen > On Sep 22, 2016, at 9:50 PM, Aaron Knister wrote: > > Perhaps a gpfs.snap? This could tell you a *lot* about a cluster. > > -Aaron > > On 9/22/16 9:48 PM, Mark.Bush at siriuscom.com wrote: >> What commands would you run to learn all you need to know about a >> cluster you?ve never seen before? >> >> Captain Obvious (me) says: >> >> mmlscluster >> >> mmlsconfig >> >> mmlsnode >> >> mmlsnsd >> >> mmlsfs all >> >> >> >> What others? >> >> >> >> >> >> Mark R. Bush | Solutions Architect >> >> >> >> This message (including any attachments) is intended only for the use of >> the individual or entity to which it is addressed and may contain >> information that is non-public, proprietary, privileged, confidential, >> and exempt from disclosure under applicable law. If you are not the >> intended recipient, you are hereby notified that any use, dissemination, >> distribution, or copying of this communication is strictly prohibited. >> This message may be viewed by parties at Sirius Computer Solutions other >> than those named in the message header. This message does not contain an >> official representation of Sirius Computer Solutions. If you have >> received this communication in error, notify Sirius Computer Solutions >> immediately and (i) destroy this message if a facsimile or (ii) delete >> this message immediately if this is an electronic communication. Thank you. >> >> Sirius Computer Solutions > >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > > -- > Aaron Knister > NASA Center for Climate Simulation (Code 606.2) > Goddard Space Flight Center > (301) 286-2776 > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From ulmer at ulmer.org Fri Sep 23 20:16:06 2016 From: ulmer at ulmer.org (Stephen Ulmer) Date: Fri, 23 Sep 2016 15:16:06 -0400 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). 
So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. -- Stephen > On Sep 22, 2016, at 3:37 PM, Luis Bolinches wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. > > No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) > > POSIX > find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. > > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very small > > files in the inode. > > > > Typically you want your average file size to be your blocksize and your > > filesystem has one blocksize and one inodesize. 
> > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. > > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From oehmes at us.ibm.com Fri Sep 23 22:35:12 2016 From: oehmes at us.ibm.com (Sven Oehme) Date: Fri, 23 Sep 2016 14:35:12 -0700 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. Sven ------------------------------------------ Sven Oehme Scalable Storage Research email: oehmes at us.ibm.com Phone: +1 (408) 824-8904 IBM Almaden Research Lab ------------------------------------------ From: Stephen Ulmer To: gpfsug main discussion list Date: 09/23/2016 12:16 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. -- Stephen On Sep 22, 2016, at 3:37 PM, Luis Bolinches < luis.bolinches at fi.ibm.com> wrote: Hi My 2 cents. 
Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) POSIX find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out GPFS cd /usr/lpp/mmfs/samples/ilm gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile ./mmfind /gpfs/shared -ls -type f > find_ls_files.out CONVERT to CSV POSIX cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv GPFS cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv LOAD in octave FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); Clean the second column (OPTIONAL as the next clean up will do the same) FILESIZE(:,[2]) = []; If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) FILESIZE(FILESIZE<=3584) =[]; If we are not we need to clean the 0 size files FILESIZE(FILESIZE==0) =[]; Median FILESIZEMEDIAN = int32 (median (FILESIZE)) Mean FILESIZEMEAN = int32 (mean (FILESIZE)) Variance int32 (var (FILESIZE)) iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. int32 (iqr (FILESIZE)) Standard deviation For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. -- Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations Luis Bolinches Lab Services http://www-03.ibm.com/systems/services/labservices/ IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland Phone: +358 503112585 "If you continually give you will continually have." Anonymous ----- Original message ----- From: Stef Coene Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list Cc: Subject: Re: [gpfsug-discuss] Blocksize Date: Thu, Sep 22, 2016 10:30 PM On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. So I calculated with IBM that a 1 MB or 2 MB block size is best. But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From stef.coene at docum.org Fri Sep 23 23:34:49 2016 From: stef.coene at docum.org (Stef Coene) Date: Sat, 24 Sep 2016 00:34:49 +0200 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> Message-ID: <1d7481bd-c08f-df14-4708-1c2e2a4ac1c0@docum.org> On 09/22/2016 08:36 PM, Stef Coene wrote: > Hi, > > Is it needed to specify a different blocksize for the system pool that > holds the metadata? > > IBM recommends a 1 MB blocksize for the file system. > But I wonder a smaller blocksize (256 KB or so) for metadata is a good > idea or not... I have read the replies and at the end, this is what we will do: Since the back-end storage will be V5000 with a default stripe size of 256KB and we use 8 data disk in an array, this means that 256KB * 8 = 2M is the best choice for block size. So 2 MB block size for data is the best choice. Since the block size for metadata is not that important in the latest releases, we will also go for 2 MB block size for metadata. Inode size will be left at the default: 4 KB. Stef From mimarsh2 at vt.edu Sat Sep 24 02:21:30 2016 From: mimarsh2 at vt.edu (Brian Marshall) Date: Fri, 23 Sep 2016 21:21:30 -0400 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <1d7481bd-c08f-df14-4708-1c2e2a4ac1c0@docum.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <1d7481bd-c08f-df14-4708-1c2e2a4ac1c0@docum.org> Message-ID: To keep this great chain going: If my metadata is on FLASH, would having a smaller blocksize for the system pool (metadata only) be helpful. My filesystem blocksize is 8MB On Fri, Sep 23, 2016 at 6:34 PM, Stef Coene wrote: > On 09/22/2016 08:36 PM, Stef Coene wrote: > >> Hi, >> >> Is it needed to specify a different blocksize for the system pool that >> holds the metadata? >> >> IBM recommends a 1 MB blocksize for the file system. >> But I wonder a smaller blocksize (256 KB or so) for metadata is a good >> idea or not... >> > I have read the replies and at the end, this is what we will do: > Since the back-end storage will be V5000 with a default stripe size of > 256KB and we use 8 data disk in an array, this means that 256KB * 8 = 2M is > the best choice for block size. > So 2 MB block size for data is the best choice. > > Since the block size for metadata is not that important in the latest > releases, we will also go for 2 MB block size for metadata. > > Inode size will be left at the default: 4 KB. > > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From luis.bolinches at fi.ibm.com Sat Sep 24 05:07:02 2016 From: luis.bolinches at fi.ibm.com (Luis Bolinches) Date: Sat, 24 Sep 2016 04:07:02 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: Not pendant but correct I flip there it is 1/32 -- Cheers > On 23 Sep 2016, at 22.16, Stephen Ulmer wrote: > > Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. > > There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. > > -- > Stephen > > > >> On Sep 22, 2016, at 3:37 PM, Luis Bolinches wrote: >> >> Hi >> >> My 2 cents. >> >> Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) >> >> About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. >> >> No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) >> >> POSIX >> find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out >> GPFS >> cd /usr/lpp/mmfs/samples/ilm >> gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile >> ./mmfind /gpfs/shared -ls -type f > find_ls_files.out >> CONVERT to CSV >> >> POSIX >> cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv >> GPFS >> cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv >> LOAD in octave >> >> FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); >> Clean the second column (OPTIONAL as the next clean up will do the same) >> >> FILESIZE(:,[2]) = []; >> If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) >> >> FILESIZE(FILESIZE<=3584) =[]; >> If we are not we need to clean the 0 size files >> >> FILESIZE(FILESIZE==0) =[]; >> Median >> >> FILESIZEMEDIAN = int32 (median (FILESIZE)) >> Mean >> >> FILESIZEMEAN = int32 (mean (FILESIZE)) >> Variance >> >> int32 (var (FILESIZE)) >> iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. >> >> int32 (iqr (FILESIZE)) >> Standard deviation >> >> >> For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. 
>> >> >> >> -- >> Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations >> >> Luis Bolinches >> Lab Services >> http://www-03.ibm.com/systems/services/labservices/ >> >> IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland >> Phone: +358 503112585 >> >> "If you continually give you will continually have." Anonymous >> >> >> ----- Original message ----- >> From: Stef Coene >> Sent by: gpfsug-discuss-bounces at spectrumscale.org >> To: gpfsug main discussion list >> Cc: >> Subject: Re: [gpfsug-discuss] Blocksize >> Date: Thu, Sep 22, 2016 10:30 PM >> >> On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: >> > It defaults to 4k: >> > mmlsfs testbs8M -i >> > flag value description >> > ------------------- ------------------------ >> > ----------------------------------- >> > -i 4096 Inode size in bytes >> > >> > I think you can make as small as 512b. Gpfs will store very small >> > files in the inode. >> > >> > Typically you want your average file size to be your blocksize and your >> > filesystem has one blocksize and one inodesize. >> >> The files are not small, but around 20 MB on average. >> So I calculated with IBM that a 1 MB or 2 MB block size is best. >> >> But I'm not sure if it's better to use a smaller block size for the >> metadata. >> >> The file system is not that large (400 TB) and will hold backup data >> from CommVault. >> >> >> Stef >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> >> >> >> Ellei edell? ole toisin mainittu: / Unless stated otherwise above: >> Oy IBM Finland Ab >> PL 265, 00101 Helsinki, Finland >> Business ID, Y-tunnus: 0195876-3 >> Registered in Finland >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss > Ellei edell? ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland -------------- next part -------------- An HTML attachment was scrubbed... URL: From Kevin.Buterbaugh at Vanderbilt.Edu Sat Sep 24 15:18:38 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Sat, 24 Sep 2016 14:18:38 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very interested in learning the rationale behind this as I am currently looking at all aspects of our current GPFS configuration and the possibility of making major changes. If you have a filesystem with only metadataOnly disks in the system pool and the default size of an inode is 4K (which we would do, since we have recently discovered that even on our scratch filesystem we have a bazillion files that are 4K or smaller and could therefore have their data stored in the inode, right?), then why would you set the metadata block size to anything larger than 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block size for metadata wouldn?t you be wasting a massive amount of space? What am I missing / confused about there? Oh, and here?s a related question ? let?s just say I have the above configuration ? my system pool is metadata only and is on SSD?s. 
Then I have two other dataOnly pools that are spinning disk. One is for ?regular? access and the other is the ?capacity? pool ? i.e. a pool of slower storage where we move files with large access times. I have a policy that says something like ?move all files with an access time > 6 months to the capacity pool.? Of those bazillion files less than 4K in size that are fitting in the inode currently, probably half a bazillion () of them would be subject to that rule. Will they get moved to the spinning disk capacity pool or will they stay in the inode?? Thanks! This is a very timely and interesting discussion for me as well... Kevin On Sep 23, 2016, at 4:35 PM, Sven Oehme > wrote: your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. Sven ------------------------------------------ Sven Oehme Scalable Storage Research email: oehmes at us.ibm.com Phone: +1 (408) 824-8904 IBM Almaden Research Lab ------------------------------------------ Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengt From: Stephen Ulmer > To: gpfsug main discussion list > Date: 09/23/2016 12:16 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. -- Stephen On Sep 22, 2016, at 3:37 PM, Luis Bolinches > wrote: Hi My 2 cents. Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) POSIX find . 
-type f -name '*' -exec ls -l {} \; > find_ls_files.out GPFS cd /usr/lpp/mmfs/samples/ilm gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile ./mmfind /gpfs/shared -ls -type f > find_ls_files.out CONVERT to CSV POSIX cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv GPFS cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv LOAD in octave FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); Clean the second column (OPTIONAL as the next clean up will do the same) FILESIZE(:,[2]) = []; If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) FILESIZE(FILESIZE<=3584) =[]; If we are not we need to clean the 0 size files FILESIZE(FILESIZE==0) =[]; Median FILESIZEMEDIAN = int32 (median (FILESIZE)) Mean FILESIZEMEAN = int32 (mean (FILESIZE)) Variance int32 (var (FILESIZE)) iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. int32 (iqr (FILESIZE)) Standard deviation For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. -- Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations Luis Bolinches Lab Services http://www-03.ibm.com/systems/services/labservices/ IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland Phone: +358 503112585 "If you continually give you will continually have." Anonymous ----- Original message ----- From: Stef Coene > Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list > Cc: Subject: Re: [gpfsug-discuss] Blocksize Date: Thu, Sep 22, 2016 10:30 PM On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. So I calculated with IBM that a 1 MB or 2 MB block size is best. But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Ellei edell? ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From makaplan at us.ibm.com Sat Sep 24 17:18:11 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Sat, 24 Sep 2016 12:18:11 -0400 Subject: [gpfsug-discuss] Blocksize and MetaData Blocksizes - FORGET the old advice In-Reply-To: References: <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: Metadata is inodes, directories, indirect blocks (indices). Spectrum Scale (GPFS) Version 4.1 introduced significant improvements to the data structures used to represent directories. Larger inodes supporting data and extended attributes in the inode are other significant relatively recent improvements. Now small directories are stored in the inode, while for large directories blocks can be bigger than 32MB, and any and all directory blocks that are smaller than the metadata-blocksize, are allocated just like "fragments" - so directories are now space efficient. SO MUCH SO, that THE OLD ADVICE, about using smallish blocksizes for metadata, GOES "OUT THE WINDOW". Period. FORGET most of what you thought you knew about "best" or "optimal" metadata-blocksize. The new advice is, as Sven wrote: Use a blocksize that optimizes IO transfer efficiency and speed. This is true for BOTH data and metadata. Now, IF you have system pool set up as metadata only AND system pool is on devices that have a different "optimal" block size than your other pools, THEN, it may make sense to use two different blocksizes, one for data and another for metadata. For example, maybe you have massively striped RAID or RAID-LIKE (GSS or ESS)) storage for huge files - so maybe 8MB is a good blocksize for that. But maybe you have your metadata on SSD devices and maybe 1MB is the "best" blocksize for that. -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Sat Sep 24 18:31:37 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Sat, 24 Sep 2016 13:31:37 -0400 Subject: [gpfsug-discuss] Blocksize - consider IO transfer efficiency above your other prejudices In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org><17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: (I can answer your basic questions, Sven has more experience with tuning very large file systems, so perhaps he will have more to say...) 1. Inodes are packed into the file of inodes. (There is one file of all the inodes in a filesystem). If you have metadata-blocksize 1MB you will have 256 of 4KB inodes per block. Forget about sub-blocks when it comes to the file of inodes. 2. IF a file's data fits in its inode, then migrating that file from one pool to another just changes the preferred pool name in the inode. No data movement. Should the file later "grow" to require a data block, that data block will be allocated from whatever pool is named in the inode at that time. See the email I posted earlier today. Basically: FORGET what you thought you knew about optimal metadata blocksize (perhaps based on how you thought metadata was laid out on disk) and just stick to optimal IO transfer blocksizes. Yes, there may be contrived scenarios or even a few real live special cases, but those would be few and far between. Try following the newer general, easier, rule and see how well it works. 
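For illustration only, here is a minimal sketch of the kind of migration rule being discussed. The pool names 'fast' and 'capacity', the policy file name age-out.pol, the device name gpfs0 and the 180-day threshold are assumptions for the example, not anyone's actual configuration:

RULE 'age_out' MIGRATE FROM POOL 'fast' TO POOL 'capacity'
     WHERE (DAYS(CURRENT_TIMESTAMP) - DAYS(ACCESS_TIME)) > 180    /* in file age-out.pol */

mmapplypolicy gpfs0 -P age-out.pol -I yes

Per point 2 above, any file selected by such a rule whose data still fits in its inode only gets the preferred pool name in its inode updated; no data blocks move unless the file later grows.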
From: "Buterbaugh, Kevin L" To: gpfsug main discussion list Date: 09/24/2016 10:19 AM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very interested in learning the rationale behind this as I am currently looking at all aspects of our current GPFS configuration and the possibility of making major changes. If you have a filesystem with only metadataOnly disks in the system pool and the default size of an inode is 4K (which we would do, since we have recently discovered that even on our scratch filesystem we have a bazillion files that are 4K or smaller and could therefore have their data stored in the inode, right?), then why would you set the metadata block size to anything larger than 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block size for metadata wouldn?t you be wasting a massive amount of space? What am I missing / confused about there? Oh, and here?s a related question ? let?s just say I have the above configuration ? my system pool is metadata only and is on SSD?s. Then I have two other dataOnly pools that are spinning disk. One is for ?regular? access and the other is the ?capacity? pool ? i.e. a pool of slower storage where we move files with large access times. I have a policy that says something like ?move all files with an access time > 6 months to the capacity pool.? Of those bazillion files less than 4K in size that are fitting in the inode currently, probably half a bazillion () of them would be subject to that rule. Will they get moved to the spinning disk capacity pool or will they stay in the inode?? Thanks! This is a very timely and interesting discussion for me as well... Kevin On Sep 23, 2016, at 4:35 PM, Sven Oehme wrote: your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. Sven ------------------------------------------ Sven Oehme Scalable Storage Research email: oehmes at us.ibm.com Phone: +1 (408) 824-8904 IBM Almaden Research Lab ------------------------------------------ Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengt From: Stephen Ulmer To: gpfsug main discussion list Date: 09/23/2016 12:16 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). 
So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. -- Stephen On Sep 22, 2016, at 3:37 PM, Luis Bolinches wrote: Hi My 2 cents. Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) POSIX find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out GPFS cd /usr/lpp/mmfs/samples/ilm gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile ./mmfind /gpfs/shared -ls -type f > find_ls_files.out CONVERT to CSV POSIX cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv GPFS cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv LOAD in octave FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); Clean the second column (OPTIONAL as the next clean up will do the same) FILESIZE(:,[2]) = []; If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) FILESIZE(FILESIZE<=3584) =[]; If we are not we need to clean the 0 size files FILESIZE(FILESIZE==0) =[]; Median FILESIZEMEDIAN = int32 (median (FILESIZE)) Mean FILESIZEMEAN = int32 (mean (FILESIZE)) Variance int32 (var (FILESIZE)) iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. int32 (iqr (FILESIZE)) Standard deviation For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. -- Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations Luis Bolinches Lab Services http://www-03.ibm.com/systems/services/labservices/ IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland Phone: +358 503112585 "If you continually give you will continually have." Anonymous ----- Original message ----- From: Stef Coene Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list Cc: Subject: Re: [gpfsug-discuss] Blocksize Date: Thu, Sep 22, 2016 10:30 PM On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. So I calculated with IBM that a 1 MB or 2 MB block size is best. 
But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Ellei edellä ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL:
From stef.coene at docum.org Sat Sep 24 19:16:49 2016 From: stef.coene at docum.org (Stef Coene) Date: Sat, 24 Sep 2016 20:16:49 +0200 Subject: [gpfsug-discuss] Maximum NSD size Message-ID: <239fd428-3544-917d-5439-d40ea36f0668@docum.org> Hi, When formatting the NSDs for a new file system, I noticed a warning about a maximum size: Formatting file system ... Disks up to size 8.8 TB can be added to storage pool system. Disks up to size 9.0 TB can be added to storage pool V5000. I searched the docs, but I couldn't find any reference regarding the maximum size of NSDs? Stef
From oehmes at gmail.com Sun Sep 25 17:25:40 2016 From: oehmes at gmail.com (Sven Oehme) Date: Sun, 25 Sep 2016 16:25:40 +0000 Subject: [gpfsug-discuss] Maximum NSD size In-Reply-To: <239fd428-3544-917d-5439-d40ea36f0668@docum.org> References: <239fd428-3544-917d-5439-d40ea36f0668@docum.org> Message-ID: The limit you see above is NOT the max NSD limit for Scale/GPFS; it's rather the limit on the NSD size you can add to this file system's pool. Depending on which version of code you are running, we limit the maximum size of an NSD that can be added to a pool so you don't have mixtures of, let's say, 1 TB and 100 TB disks in one pool, as this will negatively affect performance. In older versions we were more restrictive than in newer versions. Sven On Sat, Sep 24, 2016 at 11:16 AM Stef Coene wrote: > Hi, > > When formatting the NSDs for a new file system, I noticed a warning about > a maximum size: > > Formatting file system ... > Disks up to size 8.8 TB can be added to storage pool system. > Disks up to size 9.0 TB can be added to storage pool V5000. > > I searched the docs, but I couldn't find any reference regarding the > maximum size of NSDs? > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... 
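As a quick sanity check before adding disks (a sketch only; gpfs0 is a placeholder device name):

mmdf gpfs0

mmdf lists each NSD with its size, grouped by storage pool, so you can see how far a new, larger NSD would be from the disks already in a pool; I believe recent releases also print the maximum disk size currently allowed for each pool in the mmdf header, which is the same limit Sven describes above.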
URL: From oehmes at gmail.com Sun Sep 25 18:11:12 2016 From: oehmes at gmail.com (Sven Oehme) Date: Sun, 25 Sep 2016 17:11:12 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: well, its not that easy and there is no perfect answer here. so lets start with some data points that might help decide: inodes, directory blocks, allocation maps for data as well as metadata don't follow the same restrictions as data 'fragments' or subblocks, means they are not bond to the 1/32 of the blocksize. they rather get organized on calculated sized blocks which can be very small (significant smaller than 1/32th) or close to the max of the blocksize for a single object. therefore the space waste concern doesn't really apply here. policy scans loves larger blocks as the blocks will be randomly scattered across the NSD's and therefore larger contiguous blocks for inode scan will perform significantly faster on larger metadata blocksizes than on smaller (assuming this is disk, with SSD's this doesn't matter that much) so for disk based systems it is advantageous to use larger blocks , for SSD based its less of an issue. you shouldn't choose on the other hand too large blocks even for disk drive based systems as there is one catch to all this. small updates on metadata typically end up writing the whole metadata block e.g. 256k for a directory block which now need to be destaged and read back from another node changing the same block. hope this helps. Sven On Sat, Sep 24, 2016 at 7:18 AM Buterbaugh, Kevin L < Kevin.Buterbaugh at vanderbilt.edu> wrote: > Hi Sven, > > I am confused by your statement that the metadata block size should be 1 > MB and am very interested in learning the rationale behind this as I am > currently looking at all aspects of our current GPFS configuration and the > possibility of making major changes. > > If you have a filesystem with only metadataOnly disks in the system pool > and the default size of an inode is 4K (which we would do, since we have > recently discovered that even on our scratch filesystem we have a bazillion > files that are 4K or smaller and could therefore have their data stored in > the inode, right?), then why would you set the metadata block size to > anything larger than 128K when a sub-block is 1/32nd of a block? I.e., > with a 1 MB block size for metadata wouldn?t you be wasting a massive > amount of space? > > What am I missing / confused about there? > > Oh, and here?s a related question ? let?s just say I have the above > configuration ? my system pool is metadata only and is on SSD?s. Then I > have two other dataOnly pools that are spinning disk. One is for ?regular? > access and the other is the ?capacity? pool ? i.e. a pool of slower storage > where we move files with large access times. I have a policy that says > something like ?move all files with an access time > 6 months to the > capacity pool.? Of those bazillion files less than 4K in size that are > fitting in the inode currently, probably half a bazillion () of them > would be subject to that rule. Will they get moved to the spinning disk > capacity pool or will they stay in the inode?? > > Thanks! This is a very timely and interesting discussion for me as well... 
> > Kevin > > On Sep 23, 2016, at 4:35 PM, Sven Oehme wrote: > > your metadata block size these days should be 1 MB and there are only very > few workloads for which you should run with a filesystem blocksize below 1 > MB. so if you don't know exactly what to pick, 1 MB is a good starting > point. > the general rule still applies that your filesystem blocksize (metadata or > data pool) should match your raid controller (or GNR vdisk) stripe size of > the particular pool. > > so if you use a 128k strip size(defaut in many midrange storage > controllers) in a 8+2p raid array, your stripe or track size is 1 MB and > therefore the blocksize of this pool should be 1 MB. i see many customers > in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or > above and your performance will be significant impacted by that. > > Sven > > ------------------------------------------ > Sven Oehme > Scalable Storage Research > email: oehmes at us.ibm.com > Phone: +1 (408) 824-8904 > IBM Almaden Research Lab > ------------------------------------------ > > Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too > pedantic, but I believe the the subblock size is 1/32 of the block size > (which strengt > > > > From: Stephen Ulmer > To: gpfsug main discussion list > Date: 09/23/2016 12:16 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------ > > > > Not to be too pedantic, but I believe the the subblock size is 1/32 of the > block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but about > metadata block size. You can specify that the system pool have a different > block size from the rest of the filesystem, providing that it ONLY holds > metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems without > some counter-indication), I would think that we?d want to use a metadata > block size of 4K*32=128K. This is independent of the regular block size, > which you can calculate based on the workload if you?re lucky. > > There could be a great reason NOT to use 128K metadata block size, but I > don?t know what it is. I?d be happy to be corrected about this if it?s out > of whack. > > -- > Stephen > > > > On Sep 22, 2016, at 3:37 PM, Luis Bolinches < > *luis.bolinches at fi.ibm.com* > wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small > files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest > that you will actually benefit from smaller than 1MB block, leave there. > GPFS uses sublocks where 1/16th of the BS can be allocated to different > files, so the "waste" is much less than you think on 1MB and you get the > throughput and less structures of much more data blocks. > > No* warranty at all* but I try to do this when the BS talk comes > in: (might need some clean up it could not be last note but you get the > idea) > > POSIX > find . 
-type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the > same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to > inodes (WELL not exactly ... extended attributes! so maybe use a lower > number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and > lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful > machine to run the calculations on octave, I never hit anything could not > manage on a 64GB RAM Power box. Most of the times it is enough with my > laptop. > > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / > Salutations > > Luis Bolinches > Lab Services > *http://www-03.ibm.com/systems/services/labservices/* > > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene <*stef.coene at docum.org* > > Sent by: *gpfsug-discuss-bounces at spectrumscale.org* > > To: gpfsug main discussion list <*gpfsug-discuss at spectrumscale.org* > > > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very > small > > files in the inode. > > > > Typically you want your average file size to be your blocksize > and your > > filesystem has one blocksize and one inodesize. > > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. > > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at *spectrumscale.org* > *http://gpfsug.org/mailman/listinfo/gpfsug-discuss* > > > > > Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL:
From alandhae at gmx.de Mon Sep 26 08:53:48 2016 From: alandhae at gmx.de (Andreas Landhäußer) Date: Mon, 26 Sep 2016 09:53:48 +0200 (CEST) Subject: [gpfsug-discuss] File-Access Reporting Message-ID: Hello all GPFS, ehm, Spectrum Scale experts out there, we are using GPFS as the file system for a new data application. They have defined the need for reports on transfer volume [or file access]: by user, ..., by service, by product type ... at least on a daily basis. They need a report covering: file open, file close, or requestEndTime, requestDuration, fileProductName [path and filename], dataSize, userId. I could think of using sysstat (sar) to get some of the numbers, but I am not sure whether the numbers we would receive are correct. Andreas -- Andreas Landhäußer +49 151 12133027 (mobile) alandhae at gmx.de
From alandhae at gmx.de Mon Sep 26 13:12:18 2016 From: alandhae at gmx.de (Andreas Landhäußer) Date: Mon, 26 Sep 2016 14:12:18 +0200 (CEST) Subject: [gpfsug-discuss] File_heat for GPFS File Systems Questions over Questions ... Message-ID: Hello GPFS experts, a customer wants a report about usage, including file_heat, in a large file system. The report should be produced every month. mmchconfig fileHeatLossPercent=10,fileHeatPeriodMinutes=30240 -i fileHeatPeriodMinutes=30240 equals 21 days. I'm wondering about the behavior of fileHeatLossPercent. - If it is set to 10, will file_heat decrease from 1 to 0 in 10 steps? - Or does file_heat behave asymptotically, so that heat 0 is never reached? Either way the results will be similar ;-) the latter just takes longer. We want to produce the following file lists: - File_Heat > 50% -> rather hot data - File_Heat between 20% and 50% -> lukewarm data - File_Heat between 0% and 20% -> ice cold data We will have to work on the limits between the File_Heat classes, depending on the customer's wishes. Are there better parameter settings for achieving this? Do any scripts/programs exist for analyzing the file_heat data? We have observed that during policy runs on a large GPFS file system the metadata performance drops significantly until the job is finished. A run took about 15 minutes on an 880 TB GPFS file system with 150 million entries. What is the behavior when file_heat is first switched on? Do all files in the GPFS have the same temperature? 
Thanks for your help Ciao Andreas -- Andreas Landh?u?er +49 151 12133027 (mobile) alandhae at gmx.de From makaplan at us.ibm.com Mon Sep 26 16:11:52 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Mon, 26 Sep 2016 11:11:52 -0400 Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: References: Message-ID: fileHeatLossPercent=10, fileHeatPeriodMinutes=1440 means any file that has not been accessed for 1440 minutes (24 hours = 1 day) will lose 10% of its Heat. So if it's heat was X at noon today, tomorrow 0.90 X, the next day 0.81X, on the k'th day (.90)**k * X. After 63 fileHeatPeriods, we always round down and compute file heat as 0.0. The computation (in floating point with some approximations) is done "on demand" based on a heat value stored in the Inode the last time the unix access "atime" and the current time. So the cost of maintaining FILE_HEAT for a file is some bit twiddling, but only when the file is accessed and the atime would be updated in the inode anyway. File heat increases by approximately 1.0 each time the entire file is read from disk. This is done proportionately so if you read in half of the blocks the increase is 0.5. If you read all the blocks twice FROM DISK the file heat is increased by 2. And so on. But only IOPs are charged. If you repeatedly do posix read()s but the data is in cache, no heat is added. The easiest way to observe FILE_HEAT is with the mmapplypolicy directory -I test -L 2 -P fileheatrule.policy RULE 'fileheatrule' LIST 'hot' SHOW('Heat=' || varchar(FILE_HEAT)) /* in file fileheatfule.policy */ Because policy reads metadata from inodes as stored on disk, when experimenting/testing you may need to mmfsctl fs suspend-write; mmfsctl fs resume to see results immediately. From: Andreas Landh?u?er To: gpfsug-discuss at spectrumscale.org Date: 09/26/2016 08:12 AM Subject: [gpfsug-discuss] File_heat for GPFS File Systems Questions over Questions ... Sent by: gpfsug-discuss-bounces at spectrumscale.org Hello GPFS experts, customer wanting a report about the usage of the usage including file_heat in a large Filesystem. The report should be taken every month. mmchconfig fileHeatLossPercent=10,fileHeatPeriodMinutes=30240 -i fileHeatPeriodMinutes=30240 equals to 21 days. I#m wondering about the behavior of fileHeatLossPercent. - If it is set to 10, will file_heat decrease from 1 to 0 in 10 steps? - Or does file_heat have an asymptotic behavior, and heat 0 will never be reached? Anyways the results will be similar ;-) latter taking longer. We want to achieve following file lists: - File_Heat > 50% -> rather hot data - File_Heat 50% < x < 20 -> lukewarm data - File_Heat 20% <= x <= 0% -> ice cold data We will have to work on the limits between the File_Heat classes, depending on customers wishes. Are there better parameter settings for achieving this? Do any scripts/programs exist for analyzing the file_heat data? We have observed when taking policy runs on a large GPFS file system, the meta data performance significantly dropped, until job was finished. It took about 15 minutes on a 880 TB with 150 Mio entries GPFS file system. How is the behavior, when file_heat is being switched on? Do all files in the GPFS have the same temperature? 
Thanks for your help Ciao Andreas -- Andreas Landh?u?er +49 151 12133027 (mobile) alandhae at gmx.de _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Mon Sep 26 19:18:15 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Mon, 26 Sep 2016 11:18:15 -0700 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org><17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: It's important to understand the differences between different metadata types, in particular where it comes to space allocation. System metadata files (inode file, inode and block allocation maps, ACL file, fileset metadata file, EA file in older versions) are allocated at well-defined moments (file system format, new storage pool creation in the case of block allocation map, etc), and those contain multiple records packed into a single block. From the block allocator point of view, the individual metadata record size is invisible, only larger blocks get actually allocated, and space usage efficiency generally isn't an issue. For user metadata (indirect blocks, directory blocks, EA overflow blocks) the situation is different. Those get allocated as the need arises, generally one at a time. So the size of an individual metadata structure matters, a lot. The smallest unit of allocation in GPFS is a subblock (1/32nd of a block). If an IB or a directory block is smaller than a subblock, the unused space in the subblock is wasted. So if one chooses to use, say, 16 MiB block size for metadata, the smallest unit of space that can be allocated is 512 KiB. If one chooses 1 MiB block size, the smallest allocation unit is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with any reasonable data block size); directory blocks used to be limited to 32 KiB, but in the current code can be as large as 256 KiB. As one can observe, using 16 MiB metadata block size would lead to a considerable amount of wasted space for IBs and large directories (small directories can live in inodes). On the other hand, with 1 MiB block size, there'll be no wasted metadata space. Does any of this actually make a practical difference? That depends on the file system composition, namely the number of IBs (which is a function of the number of large files) and larger directories. Calculating this scientifically can be pretty involved, and really should be the job of a tool that ought to exist, but doesn't (yet). A more practical approach is doing a ballpark estimate using local file counts and typical fractions of large files and directories, using statistics available from published papers. The performance implications of a given metadata block size choice is a subject of nearly infinite depth, and this question ultimately can only be answered by doing experiments with a specific workload on specific hardware. The metadata space utilization efficiency is something that can be answered conclusively though. 
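To put rough numbers on the space side (illustrative figures, not measurements from any system in this thread): with a 16 MiB metadata block size the smallest allocation is 16 MiB / 32 = 512 KiB, so each 32 KiB indirect block strands 480 KiB, about 94% of its subblock; with a 1 MiB block size the same 32 KiB indirect block fills its 32 KiB subblock exactly. Ten million such indirect blocks would then waste roughly 10,000,000 x 480 KiB, or about 4.5 TiB, in the first case and nothing in the second.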
yuri From: "Buterbaugh, Kevin L" To: gpfsug main discussion list , Date: 09/24/2016 07:19 AM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very interested in learning the rationale behind this as I am currently looking at all aspects of our current GPFS configuration and the possibility of making major changes. If you have a filesystem with only metadataOnly disks in the system pool and the default size of an inode is 4K (which we would do, since we have recently discovered that even on our scratch filesystem we have a bazillion files that are 4K or smaller and could therefore have their data stored in the inode, right?), then why would you set the metadata block size to anything larger than 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block size for metadata wouldn?t you be wasting a massive amount of space? What am I missing / confused about there? Oh, and here?s a related question ? let?s just say I have the above configuration ? my system pool is metadata only and is on SSD?s. Then I have two other dataOnly pools that are spinning disk. One is for ?regular? access and the other is the ?capacity? pool ? i.e. a pool of slower storage where we move files with large access times. I have a policy that says something like ?move all files with an access time > 6 months to the capacity pool.? Of those bazillion files less than 4K in size that are fitting in the inode currently, probably half a bazillion () of them would be subject to that rule. Will they get moved to the spinning disk capacity pool or will they stay in the inode?? Thanks! This is a very timely and interesting discussion for me as well... Kevin On Sep 23, 2016, at 4:35 PM, Sven Oehme wrote: your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. Sven ------------------------------------------ Sven Oehme Scalable Storage Research email: oehmes at us.ibm.com Phone: +1 (408) 824-8904 IBM Almaden Research Lab ------------------------------------------ Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengt From: Stephen Ulmer To: gpfsug main discussion list Date: 09/23/2016 12:16 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). 
So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. -- Stephen On Sep 22, 2016, at 3:37 PM, Luis Bolinches < luis.bolinches at fi.ibm.com> wrote: Hi My 2 cents. Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) POSIX find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out GPFS cd /usr/lpp/mmfs/samples/ilm gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile ./mmfind /gpfs/shared -ls -type f > find_ls_files.out CONVERT to CSV POSIX cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv GPFS cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv LOAD in octave FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); Clean the second column (OPTIONAL as the next clean up will do the same) FILESIZE(:,[2]) = []; If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) FILESIZE(FILESIZE<=3584) =[]; If we are not we need to clean the 0 size files FILESIZE(FILESIZE==0) =[]; Median FILESIZEMEDIAN = int32 (median (FILESIZE)) Mean FILESIZEMEAN = int32 (mean (FILESIZE)) Variance int32 (var (FILESIZE)) iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. int32 (iqr (FILESIZE)) Standard deviation For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. -- Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations Luis Bolinches Lab Services http://www-03.ibm.com/systems/services/labservices/ IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland Phone: +358 503112585 "If you continually give you will continually have." Anonymous ----- Original message ----- From: Stef Coene Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list < gpfsug-discuss at spectrumscale.org> Cc: Subject: Re: [gpfsug-discuss] Blocksize Date: Thu, Sep 22, 2016 10:30 PM On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. 
So I calculated with IBM that a 1 MB or 2 MB block size is best. But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Ellei edell? ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From ulmer at ulmer.org Mon Sep 26 20:01:56 2016 From: ulmer at ulmer.org (Stephen Ulmer) Date: Mon, 26 Sep 2016 15:01:56 -0400 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> Message-ID: <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> Now I?ve got anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: There are items OTHER THAN INODES stored as metadata in GPFS. These items have a VARIETY OF SIZES, but are packed in such a way that we should just not worry about wasted space unless we pick a LARGE metadata block size ? or if we don?t pick a ?reasonable? metadata block size after picking a ?large? file system block size that applies to both. Performance is hard, and the gain from calculating exactly the best metadata block size is much smaller than performance gains attained through code optimization. If we were to try and calculate the appropriate metadata block size we would likely be wrong anyway, since none of us get our data at the idealized physics shop that sells massless rulers and frictionless pulleys. We should probably all use a metadata block size around 1MB. Nobody has said this outright, but it?s been the example as the ?good? size at least three times in this thread. Under no circumstances should we do what many of us would have done and pick 128K, which made sense based on all of our previous education that is no longer applicable. Did I miss anything? :) Liberty, -- Stephen > On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev wrote: > > It's important to understand the differences between different metadata types, in particular where it comes to space allocation. > > System metadata files (inode file, inode and block allocation maps, ACL file, fileset metadata file, EA file in older versions) are allocated at well-defined moments (file system format, new storage pool creation in the case of block allocation map, etc), and those contain multiple records packed into a single block. 
From the block allocator point of view, the individual metadata record size is invisible, only larger blocks get actually allocated, and space usage efficiency generally isn't an issue. > > For user metadata (indirect blocks, directory blocks, EA overflow blocks) the situation is different. Those get allocated as the need arises, generally one at a time. So the size of an individual metadata structure matters, a lot. The smallest unit of allocation in GPFS is a subblock (1/32nd of a block). If an IB or a directory block is smaller than a subblock, the unused space in the subblock is wasted. So if one chooses to use, say, 16 MiB block size for metadata, the smallest unit of space that can be allocated is 512 KiB. If one chooses 1 MiB block size, the smallest allocation unit is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with any reasonable data block size); directory blocks used to be limited to 32 KiB, but in the current code can be as large as 256 KiB. As one can observe, using 16 MiB metadata block size would lead to a considerable amount of wasted space for IBs and large directories (small directories can live in inodes). On the other hand, with 1 MiB block size, there'll be no wasted metadata space. Does any of this actually make a practical difference? That depends on the file system composition, namely the number of IBs (which is a function of the number of large files) and larger directories. Calculating this scientifically can be pretty involved, and really should be the job of a tool that ought to exist, but doesn't (yet). A more practical approach is doing a ballpark estimate using local file counts and typical fractions of large files and directories, using statistics available from published papers. > > The performance implications of a given metadata block size choice is a subject of nearly infinite depth, and this question ultimately can only be answered by doing experiments with a specific workload on specific hardware. The metadata space utilization efficiency is something that can be answered conclusively though. > > yuri > > "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very int > > From: "Buterbaugh, Kevin L" > To: gpfsug main discussion list , > Date: 09/24/2016 07:19 AM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Hi Sven, > > I am confused by your statement that the metadata block size should be 1 MB and am very interested in learning the rationale behind this as I am currently looking at all aspects of our current GPFS configuration and the possibility of making major changes. > > If you have a filesystem with only metadataOnly disks in the system pool and the default size of an inode is 4K (which we would do, since we have recently discovered that even on our scratch filesystem we have a bazillion files that are 4K or smaller and could therefore have their data stored in the inode, right?), then why would you set the metadata block size to anything larger than 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block size for metadata wouldn?t you be wasting a massive amount of space? > > What am I missing / confused about there? > > Oh, and here?s a related question ? let?s just say I have the above configuration ? my system pool is metadata only and is on SSD?s. Then I have two other dataOnly pools that are spinning disk. One is for ?regular? access and the other is the ?capacity? 
pool ? i.e. a pool of slower storage where we move files with large access times. I have a policy that says something like ?move all files with an access time > 6 months to the capacity pool.? Of those bazillion files less than 4K in size that are fitting in the inode currently, probably half a bazillion () of them would be subject to that rule. Will they get moved to the spinning disk capacity pool or will they stay in the inode?? > > Thanks! This is a very timely and interesting discussion for me as well... > > Kevin > On Sep 23, 2016, at 4:35 PM, Sven Oehme > wrote: > your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. > the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. > > so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. > > Sven > > ------------------------------------------ > Sven Oehme > Scalable Storage Research > email: oehmes at us.ibm.com > Phone: +1 (408) 824-8904 > IBM Almaden Research Lab > ------------------------------------------ > > Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengt > > From: Stephen Ulmer > > To: gpfsug main discussion list > > Date: 09/23/2016 12:16 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. > > There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. > > -- > Stephen > > On Sep 22, 2016, at 3:37 PM, Luis Bolinches > wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. > > No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) > > POSIX > find . 
-type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. > > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene > > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very small > > files in the inode. > > > > Typically you want your average file size to be your blocksize and your > > filesystem has one blocksize and one inodesize. > > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. > > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Mon Sep 26 20:29:18 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Mon, 26 Sep 2016 12:29:18 -0700 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org><17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> Message-ID: I would put the net summary this way: in GPFS, the "Goldilocks zone" for metadata block size is 256K - 1M. If one plans to create a new file system using GPFS V4.2+, 1M is a sound choice. In an ideal world, block size choice shouldn't really be a choice. It's a low-level implementation detail that one day should go the way of the manual ignition timing adjustment -- something that used to be necessary in the olden days, and something that select enthusiasts like to tweak to this day, but something that's irrelevant for the overwhelming majority of the folks who just want the engine to run. There's work being done in that general direction in GPFS, but we aren't there yet. yuri From: Stephen Ulmer To: gpfsug main discussion list , Date: 09/26/2016 12:02 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Now I?ve got anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: There are items OTHER THAN INODES stored as metadata in GPFS. These items have a VARIETY OF SIZES, but are packed in such a way that we should just not worry about wasted space unless we pick a LARGE metadata block size ? or if we don?t pick a ?reasonable? metadata block size after picking a ?large? file system block size that applies to both. Performance is hard, and the gain from calculating exactly the best metadata block size is much smaller than performance gains attained through code optimization. If we were to try and calculate the appropriate metadata block size we would likely be wrong anyway, since none of us get our data at the idealized physics shop that sells massless rulers and frictionless pulleys. We should probably all use a metadata block size around 1MB. Nobody has said this outright, but it?s been the example as the ?good? size at least three times in this thread. Under no circumstances should we do what many of us would have done and pick 128K, which made sense based on all of our previous education that is no longer applicable. Did I miss anything? 
:) Liberty, -- Stephen On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev wrote: It's important to understand the differences between different metadata types, in particular where it comes to space allocation. System metadata files (inode file, inode and block allocation maps, ACL file, fileset metadata file, EA file in older versions) are allocated at well-defined moments (file system format, new storage pool creation in the case of block allocation map, etc), and those contain multiple records packed into a single block. From the block allocator point of view, the individual metadata record size is invisible, only larger blocks get actually allocated, and space usage efficiency generally isn't an issue. For user metadata (indirect blocks, directory blocks, EA overflow blocks) the situation is different. Those get allocated as the need arises, generally one at a time. So the size of an individual metadata structure matters, a lot. The smallest unit of allocation in GPFS is a subblock (1/32nd of a block). If an IB or a directory block is smaller than a subblock, the unused space in the subblock is wasted. So if one chooses to use, say, 16 MiB block size for metadata, the smallest unit of space that can be allocated is 512 KiB. If one chooses 1 MiB block size, the smallest allocation unit is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with any reasonable data block size); directory blocks used to be limited to 32 KiB, but in the current code can be as large as 256 KiB. As one can observe, using 16 MiB metadata block size would lead to a considerable amount of wasted space for IBs and large directories (small directories can live in inodes). On the other hand, with 1 MiB block size, there'll be no wasted metadata space. Does any of this actually make a practical difference? That depends on the file system composition, namely the number of IBs (which is a function of the number of large files) and larger directories. Calculating this scientifically can be pretty involved, and really should be the job of a tool that ought to exist, but doesn't (yet). A more practical approach is doing a ballpark estimate using local file counts and typical fractions of large files and directories, using statistics available from published papers. The performance implications of a given metadata block size choice is a subject of nearly infinite depth, and this question ultimately can only be answered by doing experiments with a specific workload on specific hardware. The metadata space utilization efficiency is something that can be answered conclusively though. yuri "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very int From: "Buterbaugh, Kevin L" To: gpfsug main discussion list , Date: 09/24/2016 07:19 AM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very interested in learning the rationale behind this as I am currently looking at all aspects of our current GPFS configuration and the possibility of making major changes. 
If you have a filesystem with only metadataOnly disks in the system pool and the default size of an inode is 4K (which we would do, since we have recently discovered that even on our scratch filesystem we have a bazillion files that are 4K or smaller and could therefore have their data stored in the inode, right?), then why would you set the metadata block size to anything larger than 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block size for metadata wouldn?t you be wasting a massive amount of space? What am I missing / confused about there? Oh, and here?s a related question ? let?s just say I have the above configuration ? my system pool is metadata only and is on SSD?s. Then I have two other dataOnly pools that are spinning disk. One is for ?regular? access and the other is the ?capacity? pool ? i.e. a pool of slower storage where we move files with large access times. I have a policy that says something like ?move all files with an access time > 6 months to the capacity pool.? Of those bazillion files less than 4K in size that are fitting in the inode currently, probably half a bazillion () of them would be subject to that rule. Will they get moved to the spinning disk capacity pool or will they stay in the inode?? Thanks! This is a very timely and interesting discussion for me as well... Kevin On Sep 23, 2016, at 4:35 PM, Sven Oehme < oehmes at us.ibm.com> wrote: your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. Sven ------------------------------------------ Sven Oehme Scalable Storage Research email: oehmes at us.ibm.com Phone: +1 (408) 824-8904 IBM Almaden Research Lab ------------------------------------------ Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengt From: Stephen Ulmer To: gpfsug main discussion list < gpfsug-discuss at spectrumscale.org> Date: 09/23/2016 12:16 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. 
I?d be happy to be corrected about this if it?s out of whack. -- Stephen On Sep 22, 2016, at 3:37 PM, Luis Bolinches < luis.bolinches at fi.ibm.com> wrote: Hi My 2 cents. Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) POSIX find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out GPFS cd /usr/lpp/mmfs/samples/ilm gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile ./mmfind /gpfs/shared -ls -type f > find_ls_files.out CONVERT to CSV POSIX cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv GPFS cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv LOAD in octave FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); Clean the second column (OPTIONAL as the next clean up will do the same) FILESIZE(:,[2]) = []; If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) FILESIZE(FILESIZE<=3584) =[]; If we are not we need to clean the 0 size files FILESIZE(FILESIZE==0) =[]; Median FILESIZEMEDIAN = int32 (median (FILESIZE)) Mean FILESIZEMEAN = int32 (mean (FILESIZE)) Variance int32 (var (FILESIZE)) iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. int32 (iqr (FILESIZE)) Standard deviation For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. -- Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations Luis Bolinches Lab Services http://www-03.ibm.com/systems/services/labservices/ IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland Phone: +358 503112585 "If you continually give you will continually have." Anonymous ----- Original message ----- From: Stef Coene < stef.coene at docum.org> Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list < gpfsug-discuss at spectrumscale.org> Cc: Subject: Re: [gpfsug-discuss] Blocksize Date: Thu, Sep 22, 2016 10:30 PM On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. So I calculated with IBM that a 1 MB or 2 MB block size is best. But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From alandhae at gmx.de Tue Sep 27 10:04:02 2016 From: alandhae at gmx.de (Andreas Landhäußer) Date: Tue, 27 Sep 2016 11:04:02 +0200 (CEST) Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: References: Message-ID: On Mon, 26 Sep 2016, Marc A Kaplan wrote: Marc, thanks for your explanation, > fileHeatLossPercent=10, fileHeatPeriodMinutes=1440 > > means any file that has not been accessed for 1440 minutes (24 hours = 1 day) will lose 10% of its Heat. > > So if its heat was X at noon today, it is 0.90 X tomorrow, 0.81 X the next day, and (0.90)**k * X on the k'th day. > After 63 fileHeatPeriods, we always round down and compute file heat as 0.0. > > The computation (in floating point, with some approximations) is done "on demand", based on the heat value stored in the inode, the time of the last unix access ("atime") and the current time. So the cost of maintaining FILE_HEAT for a file is some bit twiddling, but only when the file is accessed and the atime would be updated in the inode anyway. > > File heat increases by approximately 1.0 each time the entire file is read from disk. This is done proportionately, so if you read in half of the blocks the increase is 0.5. > If you read all the blocks twice FROM DISK the file heat is increased by 2. And so on. But only IOPs are charged. If you repeatedly do posix read()s but the data is in cache, no heat is added. With the above definition file heat >= 0.0, i.e. any non-negative floating point value is valid. I need to categorize the files into hot, warm, lukewarm and cold. How do I achieve this, since the maximum heat varies and would need to be redefined every time a report is requested?
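(A minimal sketch of such a report, patterned on the LIST rule from Marc that is quoted just below: the cut-off values here are hypothetical, since FILE_HEAT has no fixed upper bound, so the boundaries have to be tuned per site -- for instance after first listing files ordered by WEIGHT(FILE_HEAT) and inspecting the distribution.)

/* hypothetical heat boundaries -- adjust after inspecting a ranked listing */
RULE 'hot' LIST 'hot' SHOW('Heat=' || varchar(FILE_HEAT)) WHERE FILE_HEAT >= 5.0
RULE 'warm' LIST 'warm' SHOW('Heat=' || varchar(FILE_HEAT)) WHERE FILE_HEAT >= 1.0 AND FILE_HEAT < 5.0
RULE 'lukewarm' LIST 'lukewarm' SHOW('Heat=' || varchar(FILE_HEAT)) WHERE FILE_HEAT >= 0.1 AND FILE_HEAT < 1.0
RULE 'cold' LIST 'cold' SHOW('Heat=' || varchar(FILE_HEAT)) WHERE FILE_HEAT < 0.1

Run with something like "mmapplypolicy gpfs0 -P heatreport.pol -I test -L 2" (file system and policy file names are placeholders); -I test only reports the matches, it moves nothing.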
We are wishing to migrate data according to the heat onto different storage categories (expensive --> cheap devices) > The easiest way to observe FILE_HEAT is with the mmapplypolicy directory > -I test -L 2 -P fileheatrule.policy > > RULE 'fileheatrule' LIST 'hot' SHOW('Heat=' || varchar(FILE_HEAT)) /* in > file fileheatfule.policy */ > > Because policy reads metadata from inodes as stored on disk, when > experimenting/testing you may need to > > mmfsctl fs suspend-write; mmfsctl fs resume Doing this on a production file system, a valid change request need to be filed, and description of the risks for customers data and so on have to be defined (ITIL) ... Any help and ideas will be appreciated Andreas -- Andreas Landh?u?er +49 151 12133027 (mobile) alandhae at gmx.de From makaplan at us.ibm.com Tue Sep 27 15:25:04 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Tue, 27 Sep 2016 10:25:04 -0400 Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: References: Message-ID: You asked ... "We are wishing to migrate data according to the heat onto different storage categories (expensive --> cheap devices)" We suggest a policy rule like this: Rule 'm' Migrate From Pool 'Expensive' To Pool 'Thrifty' Threshold(90,75) Weight(-FILE_HEAT) /* minus sign! */ Which you can interpret as: When The 'Expensive' pool is 90% or more full, Migrate the lowest heat (coldest!) files to pool 'Thrifty', until the occupancy of 'Expensive' has been reduced to 75%. The concepts of Threshold and Weight have been in the produce since the MIGRATE rule was introduced. Another concept we introduced at the same time as FILE_HEAT was GROUP POOL. We've had little feedback and very few questions about this, so either it works great or is not being used much. (Maybe both are true ;-) ) GROUP POOL migration is documented in the Information Lifecycle Management chapter along with the other elements of the policy rules. In the 4.2.1 doc we suggest you can "repack" several pools with one GROUP POOL rule and one MIGRATE rule like this: You can ?repack? a group pool by WEIGHT. Migrate files of higher weight to preferred disk pools by specifying a group pool as both the source and the target of a MIGRATE rule. rule ?grpdef? GROUP POOL ?gpool? IS ?ssd? LIMIT(90) THEN ?fast? LIMIT(85) THEN ?sata? rule ?repack? MIGRATE FROM POOL ?gpool? TO POOL ?gpool? WEIGHT(FILE_HEAT) This should rank all the files in the three pools from hottest to coldest, and migrate them as necessary (if feasible) so that 'ssd' is up to 90% full of the hottest, 'fast' is up to 85% full of the next most hot, and the coolest files will be migrated to 'sata'. -------------- next part -------------- An HTML attachment was scrubbed... URL: From Kevin.Buterbaugh at Vanderbilt.Edu Tue Sep 27 18:02:45 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Tue, 27 Sep 2016 17:02:45 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> Message-ID: <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> Yuri / Sven / anyone else who wants to jump in, First off, thank you very much for your answers. I?d like to follow up with a couple of more questions. 1) Let?s assume that our overarching goal in configuring the block size for metadata is performance from the user perspective ? i.e. how fast is an ?ls -l? on my directory? 
Space savings aren?t important, and how long policy scans or other ?administrative? type tasks take is not nearly as important as that directory listing. Does that change the recommended metadata block size? 2) Let?s assume we have 3 filesystems, /home, /scratch (traditional HPC use for those two) and /data (project space). Our storage arrays are 24-bay units with two 8+2P RAID 6 LUNs, one RAID 1 mirror, and two hot spare drives. The RAID 1 mirrors are for /home, the RAID 6 LUNs are for /scratch or /data. /home has tons of small files - so small that a 64K block size is currently used. /scratch and /data have a mixture, but a 1 MB block size is the ?sweet spot? there. If you could ?start all over? with the same hardware being the only restriction, would you: a) merge /scratch and /data into one filesystem but keep /home separate since the LUN sizes are so very different, or b) merge all three into one filesystem and use storage pools so that /home is just a separate pool within the one filesystem? And if you chose this option would you assign different block sizes to the pools? Again, I?m asking these questions because I may have the opportunity to effectively ?start all over? and want to make sure I?m doing things as optimally as possible. Thanks? Kevin On Sep 26, 2016, at 2:29 PM, Yuri L Volobuev > wrote: I would put the net summary this way: in GPFS, the "Goldilocks zone" for metadata block size is 256K - 1M. If one plans to create a new file system using GPFS V4.2+, 1M is a sound choice. In an ideal world, block size choice shouldn't really be a choice. It's a low-level implementation detail that one day should go the way of the manual ignition timing adjustment -- something that used to be necessary in the olden days, and something that select enthusiasts like to tweak to this day, but something that's irrelevant for the overwhelming majority of the folks who just want the engine to run. There's work being done in that general direction in GPFS, but we aren't there yet. yuri Stephen Ulmer ---09/26/2016 12:02:25 PM---Now I?ve got anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: From: Stephen Ulmer > To: gpfsug main discussion list >, Date: 09/26/2016 12:02 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Now I?ve got anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: * There are items OTHER THAN INODES stored as metadata in GPFS. * These items have a VARIETY OF SIZES, but are packed in such a way that we should just not worry about wasted space unless we pick a LARGE metadata block size ? or if we don?t pick a ?reasonable? metadata block size after picking a ?large? file system block size that applies to both. * Performance is hard, and the gain from calculating exactly the best metadata block size is much smaller than performance gains attained through code optimization. * If we were to try and calculate the appropriate metadata block size we would likely be wrong anyway, since none of us get our data at the idealized physics shop that sells massless rulers and frictionless pulleys. * We should probably all use a metadata block size around 1MB. Nobody has said this outright, but it?s been the example as the ?good? size at least three times in this thread. * Under no circumstances should we do what many of us would have done and pick 128K, which made sense based on all of our previous education that is no longer applicable. Did I miss anything? 
:) Liberty, -- Stephen On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev > wrote: It's important to understand the differences between different metadata types, in particular where it comes to space allocation. System metadata files (inode file, inode and block allocation maps, ACL file, fileset metadata file, EA file in older versions) are allocated at well-defined moments (file system format, new storage pool creation in the case of block allocation map, etc), and those contain multiple records packed into a single block. From the block allocator point of view, the individual metadata record size is invisible, only larger blocks get actually allocated, and space usage efficiency generally isn't an issue. For user metadata (indirect blocks, directory blocks, EA overflow blocks) the situation is different. Those get allocated as the need arises, generally one at a time. So the size of an individual metadata structure matters, a lot. The smallest unit of allocation in GPFS is a subblock (1/32nd of a block). If an IB or a directory block is smaller than a subblock, the unused space in the subblock is wasted. So if one chooses to use, say, 16 MiB block size for metadata, the smallest unit of space that can be allocated is 512 KiB. If one chooses 1 MiB block size, the smallest allocation unit is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with any reasonable data block size); directory blocks used to be limited to 32 KiB, but in the current code can be as large as 256 KiB. As one can observe, using 16 MiB metadata block size would lead to a considerable amount of wasted space for IBs and large directories (small directories can live in inodes). On the other hand, with 1 MiB block size, there'll be no wasted metadata space. Does any of this actually make a practical difference? That depends on the file system composition, namely the number of IBs (which is a function of the number of large files) and larger directories. Calculating this scientifically can be pretty involved, and really should be the job of a tool that ought to exist, but doesn't (yet). A more practical approach is doing a ballpark estimate using local file counts and typical fractions of large files and directories, using statistics available from published papers. The performance implications of a given metadata block size choice is a subject of nearly infinite depth, and this question ultimately can only be answered by doing experiments with a specific workload on specific hardware. The metadata space utilization efficiency is something that can be answered conclusively though. yuri "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very int From: "Buterbaugh, Kevin L" > To: gpfsug main discussion list >, Date: 09/24/2016 07:19 AM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Hi Sven, I am confused by your statement that the metadata block size should be 1 MB and am very interested in learning the rationale behind this as I am currently looking at all aspects of our current GPFS configuration and the possibility of making major changes. 
If you have a filesystem with only metadataOnly disks in the system pool and the default size of an inode is 4K (which we would do, since we have recently discovered that even on our scratch filesystem we have a bazillion files that are 4K or smaller and could therefore have their data stored in the inode, right?), then why would you set the metadata block size to anything larger than 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block size for metadata wouldn?t you be wasting a massive amount of space? What am I missing / confused about there? Oh, and here?s a related question ? let?s just say I have the above configuration ? my system pool is metadata only and is on SSD?s. Then I have two other dataOnly pools that are spinning disk. One is for ?regular? access and the other is the ?capacity? pool ? i.e. a pool of slower storage where we move files with large access times. I have a policy that says something like ?move all files with an access time > 6 months to the capacity pool.? Of those bazillion files less than 4K in size that are fitting in the inode currently, probably half a bazillion () of them would be subject to that rule. Will they get moved to the spinning disk capacity pool or will they stay in the inode?? Thanks! This is a very timely and interesting discussion for me as well... Kevin On Sep 23, 2016, at 4:35 PM, Sven Oehme > wrote: your metadata block size these days should be 1 MB and there are only very few workloads for which you should run with a filesystem blocksize below 1 MB. so if you don't know exactly what to pick, 1 MB is a good starting point. the general rule still applies that your filesystem blocksize (metadata or data pool) should match your raid controller (or GNR vdisk) stripe size of the particular pool. so if you use a 128k strip size(defaut in many midrange storage controllers) in a 8+2p raid array, your stripe or track size is 1 MB and therefore the blocksize of this pool should be 1 MB. i see many customers in the field using 1MB or even smaller blocksize on RAID stripes of 2 MB or above and your performance will be significant impacted by that. Sven ------------------------------------------ Sven Oehme Scalable Storage Research email: oehmes at us.ibm.com Phone: +1 (408) 824-8904 IBM Almaden Research Lab ------------------------------------------ Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengt From: Stephen Ulmer > To: gpfsug main discussion list > Date: 09/23/2016 12:16 PM Subject: Re: [gpfsug-discuss] Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Not to be too pedantic, but I believe the the subblock size is 1/32 of the block size (which strengthens Luis?s arguments below). I thought the the original question was NOT about inode size, but about metadata block size. You can specify that the system pool have a different block size from the rest of the filesystem, providing that it ONLY holds metadata (?metadata-block-size option to mmcrfs). So with 4K inodes (which should be used for all new filesystems without some counter-indication), I would think that we?d want to use a metadata block size of 4K*32=128K. This is independent of the regular block size, which you can calculate based on the workload if you?re lucky. There could be a great reason NOT to use 128K metadata block size, but I don?t know what it is. I?d be happy to be corrected about this if it?s out of whack. 
-- Stephen On Sep 22, 2016, at 3:37 PM, Luis Bolinches > wrote: Hi My 2 cents. Leave at least 4K inodes, then you get massive improvement on small files (less 3.5K minus whatever you use on xattr) About blocksize for data, unless you have actual data that suggest that you will actually benefit from smaller than 1MB block, leave there. GPFS uses sublocks where 1/16th of the BS can be allocated to different files, so the "waste" is much less than you think on 1MB and you get the throughput and less structures of much more data blocks. No warranty at all but I try to do this when the BS talk comes in: (might need some clean up it could not be last note but you get the idea) POSIX find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out GPFS cd /usr/lpp/mmfs/samples/ilm gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile ./mmfind /gpfs/shared -ls -type f > find_ls_files.out CONVERT to CSV POSIX cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv GPFS cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv LOAD in octave FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); Clean the second column (OPTIONAL as the next clean up will do the same) FILESIZE(:,[2]) = []; If we are on 4K aligment we need to clean the files that go to inodes (WELL not exactly ... extended attributes! so maybe use a lower number!) FILESIZE(FILESIZE<=3584) =[]; If we are not we need to clean the 0 size files FILESIZE(FILESIZE==0) =[]; Median FILESIZEMEDIAN = int32 (median (FILESIZE)) Mean FILESIZEMEAN = int32 (mean (FILESIZE)) Variance int32 (var (FILESIZE)) iqr interquartile range, i.e., the difference between the upper and lower quartile, of the input data. int32 (iqr (FILESIZE)) Standard deviation For some FS with lots of files you might need a rather powerful machine to run the calculations on octave, I never hit anything could not manage on a 64GB RAM Power box. Most of the times it is enough with my laptop. -- Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations Luis Bolinches Lab Services http://www-03.ibm.com/systems/services/labservices/ IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland Phone: +358 503112585 "If you continually give you will continually have." Anonymous ----- Original message ----- From: Stef Coene > Sent by: gpfsug-discuss-bounces at spectrumscale.org To: gpfsug main discussion list > Cc: Subject: Re: [gpfsug-discuss] Blocksize Date: Thu, Sep 22, 2016 10:30 PM On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > It defaults to 4k: > mmlsfs testbs8M -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 4096 Inode size in bytes > > I think you can make as small as 512b. Gpfs will store very small > files in the inode. > > Typically you want your average file size to be your blocksize and your > filesystem has one blocksize and one inodesize. The files are not small, but around 20 MB on average. So I calculated with IBM that a 1 MB or 2 MB block size is best. But I'm not sure if it's better to use a smaller block size for the metadata. The file system is not that large (400 TB) and will hold backup data from CommVault. Stef _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: Oy IBM Finland Ab PL 265, 00101 Helsinki, Finland Business ID, Y-tunnus: 0195876-3 Registered in Finland _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From makaplan at us.ibm.com Tue Sep 27 18:16:52 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Tue, 27 Sep 2016 13:16:52 -0400 Subject: [gpfsug-discuss] Blocksize, yea, inode size! In-Reply-To: <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> Message-ID: Inode size will be a crucial choice in the scenario you describe. Consider the conflict: a large inode can hold a complete file or a complete directory. But the bigger the inode size, the fewer inodes fit in any given block -- so when you have to read several inodes you need more I/O, and it is less likely that the inodes you want are in the same block. From chekh at stanford.edu Tue Sep 27 18:23:34 2016 From: chekh at stanford.edu (Alex Chekholko) Date: Tue, 27 Sep 2016 10:23:34 -0700 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> Message-ID: On 09/27/2016 10:02 AM, Buterbaugh, Kevin L wrote: > 1) Let's assume that our overarching goal in configuring the block size > for metadata is performance from the user perspective -- i.e. how fast is > an "ls -l" on my directory? Space savings aren't important, and how > long policy scans or other "administrative" type tasks take is not > nearly as important as that directory listing. Does that change the > recommended metadata block size? You need to put your metadata on SSDs. Make your SSDs the only members in your 'system' pool and put your other devices into another pool, and make that pool 'dataOnly'.
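(A minimal stanza-file sketch of that layout -- the NSD names, server names and the data pool name below are placeholders, not anything from this thread:)

# SSD NSDs: metadata only, in the system pool
%nsd: nsd=ssd_md_01 servers=nsdsrv01,nsdsrv02 usage=metadataOnly failureGroup=1 pool=system
%nsd: nsd=ssd_md_02 servers=nsdsrv02,nsdsrv01 usage=metadataOnly failureGroup=2 pool=system
# spinning-disk NSDs: data only, in a separate pool
%nsd: nsd=hdd_data_01 servers=nsdsrv01,nsdsrv02 usage=dataOnly failureGroup=1 pool=data
%nsd: nsd=hdd_data_02 servers=nsdsrv02,nsdsrv01 usage=dataOnly failureGroup=2 pool=data

fed to something like "mmcrfs gpfs0 -F nsd.stanzas -B <data block size> --metadata-block-size <metadata block size> -i 4096", so the system pool ends up metadata-only on the SSDs (which is also what permits a separate --metadata-block-size) and everything else lands in the dataOnly pool.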
If your SSDs are large enough to also hold some data, that's great; I typically do a migration policy to copy files smaller than filesystem block size (or definitely smaller than sub-block size) to the SSDs. Also, files smaller than 4k will usually fit into the inode (if you are using the 4k inode size). I have a system where the SSDs are regularly doing 6-7k IOPS for metadata stuff. If those same 7k IOPS were spread out over the slow data LUNs... which only have like 100 IOPS per 8+2P LUN... I'd be consuming 700 disks just for metadata IOPS. -- Alex Chekholko chekh at stanford.edu From kevindjo at us.ibm.com Tue Sep 27 18:33:29 2016 From: kevindjo at us.ibm.com (Kevin D Johnson) Date: Tue, 27 Sep 2016 17:33:29 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> Message-ID: An HTML attachment was scrubbed... URL: From alandhae at gmx.de Tue Sep 27 19:04:06 2016 From: alandhae at gmx.de (=?UTF-8?Q?Andreas_Landh=c3=a4u=c3=9fer?=) Date: Tue, 27 Sep 2016 20:04:06 +0200 Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: References: Message-ID: as far as I understand, if a file gets hot again, there is no rule for putting the file back into a faster storage device? We would like having something like a storage elevator depending on the fileheat. In our setup, customer likes to migrate/move data even when the the threshold is not hit, just because it's cold and the price of the storage is less. On 27.09.2016 16:25, Marc A Kaplan wrote: > > You asked ... "We are wishing to migrate data according to the heat > onto different > storage categories (expensive --> cheap devices)" > > > We suggest a policy rule like this: > > Rule 'm' Migrate From Pool 'Expensive' To Pool 'Thrifty' > Threshold(90,75) Weight(-FILE_HEAT) /* minus sign! */ > > > Which you can interpret as: > > When The 'Expensive' pool is 90% or more full, Migrate the lowest heat > (coldest!) files to pool 'Thrifty', until > the occupancy of 'Expensive' has been reduced to 75%. > > The concepts of Threshold and Weight have been in the produce since > the MIGRATE rule was introduced. > > Another concept we introduced at the same time as FILE_HEAT was GROUP > POOL. We've had little feedback and very > few questions about this, so either it works great or is not being > used much. (Maybe both are true ;-) ) > > GROUP POOL migration is documented in the Information Lifecycle > Management chapter along with the other elements of the policy rules. > > In the 4.2.1 doc we suggest you can "repack" several pools with one > GROUP POOL rule and one MIGRATE rule like this: > > You can ?repack? a group pool by *WEIGHT*. Migrate files of higher > weight to preferred disk pools > by specifying a group pool as both the source and the target of a > *MIGRATE *rule. > > rule ?grpdef? GROUP POOL ?gpool? IS ?ssd? LIMIT(90) THEN ?fast? > LIMIT(85) THEN ?sata? > rule ?repack? MIGRATE FROM POOL ?gpool? TO POOL ?gpool? WEIGHT(FILE_HEAT) > > > This should rank all the files in the three pools from hottest to > coldest, and migrate them > as necessary (if feasible) so that 'ssd' is up to 90% full of the > hottest, 'fast' is up to 85% full of the next > most hot, and the coolest files will be migrated to 'sata'. > > > > -- Andreas Landh?u?er +49 151 12133027 (mobile) alandhae at gmx.de -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From Robert.Oesterlin at nuance.com Tue Sep 27 19:12:19 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Tue, 27 Sep 2016 18:12:19 +0000 Subject: [gpfsug-discuss] File_heat for GPFS File Systems Message-ID: <0217AC60-11F0-4CEB-AE91-22D25E4649DC@nuance.com> Sure, if you use a policy to migrate between two tiers, it will move files up or down based on heat. Something like this (flas and disk pools): rule grpdef GROUP POOL gpool IS flash LIMIT(75) THEN Disk rule repack MIGRATE FROM POOL gpool TO POOL gpool WEIGHT(FILE_HEAT) Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid 507-269-0413 From: on behalf of Andreas Landh?u?er Reply-To: gpfsug main discussion list Date: Tuesday, September 27, 2016 at 1:04 PM To: Marc A Kaplan , gpfsug main discussion list Subject: [EXTERNAL] Re: [gpfsug-discuss] File_heat for GPFS File Systems as far as I understand, if a file gets hot again, there is no rule for putting the file back into a faster storage device? -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Tue Sep 27 19:26:46 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Tue, 27 Sep 2016 11:26:46 -0700 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org><17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org><13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> Message-ID: > 1) Let?s assume that our overarching goal in configuring the block > size for metadata is performance from the user perspective ? i.e. > how fast is an ?ls -l? on my directory? Space savings aren?t > important, and how long policy scans or other ?administrative? type > tasks take is not nearly as important as that directory listing. > Does that change the recommended metadata block size? The performance challenges for the "ls -l" scenario are quite different from the policy scan scenario, so the same rules do not necessarily apply. During "ls -l" the code has to read inodes one by one (there's some prefetching going on, to take the edge off for the actual 'ls' thread, but prefetching is still one inode at a time). Metadata block size doesn't really come into the picture in this case, but inode size can be important -- depending on the storage performance characteristics. Does the storage you use for metadata exhibit a meaningfully different latency for 4K random reads vs 512 byte random reads? In my personal experience, on any modern storage device the difference is non-existent; in fact many devices (like all flash-based storage) use 4K native physical block size, and merely emulate 512 byte "sectors", so there's no way to read less than 4K. So from the inode read latency point of view 4K vs 512B is most likely a wash, but then 4K inodes can help improve performance of other operations, e.g. readdir of a small directory which fits entirely into the inode. If you use xattrs (e.g. as a side effect of using HSM), 4K inodes definitely help, but allowing xattrs to be stored in the inode. Policy scans reads inodes in full blocks, and there both metadata block size and inode size matter. Larger blocks could improve the inode read performance, while larger inodes mean that the same number of blocks hold fewer inodes and thus more blocks need to be read. So the policy inode scan performance is benefited by larger metadata block size and smaller inodes. 
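(As a back-of-the-envelope illustration of that trade-off -- simple arithmetic, not a measurement, ignoring partially populated blocks and prefetch:)

inodes per full-block read = metadata block size / inode size
1 MiB / 4 KiB = 256 inodes per block read
1 MiB / 512 B = 2048 inodes per block read

So for, say, 100 million allocated inodes the inode-scan phase needs on the order of 390,000 block reads with 4K inodes versus roughly 49,000 with 512-byte inodes; the directory-traversal phase discussed next is a separate cost.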
However, policy scans also have to perform a directory traversal step, and that step tends to dominate the runtime of the overall run, and using larger inodes actually helps to speed up traversal of smaller directories. So whether larger inodes help or hurt the policy scan performance depends, yet again, on your file system composition. Overall, I believe that with all angles considered, larger inodes help with performance, and that was one of the considerations for making 4K inodes the default in V4.2+ versions. > 2) Let?s assume we have 3 filesystems, /home, /scratch (traditional > HPC use for those two) and /data (project space). Our storage > arrays are 24-bay units with two 8+2P RAID 6 LUNs, one RAID 1 > mirror, and two hot spare drives. The RAID 1 mirrors are for /home, > the RAID 6 LUNs are for /scratch or /data. /home has tons of small > files - so small that a 64K block size is currently used. /scratch > and /data have a mixture, but a 1 MB block size is the ?sweet spot? there. > > If you could ?start all over? with the same hardware being the only > restriction, would you: > > a) merge /scratch and /data into one filesystem but keep /home > separate since the LUN sizes are so very different, or > b) merge all three into one filesystem and use storage pools so that > /home is just a separate pool within the one filesystem? And if you > chose this option would you assign different block sizes to the pools? It's not possible to have different block sizes for different data pools. We are very aware that many people would like to be able to do just that, but this is counter to where the product is going. Supporting different block sizes for different pools is actually pretty hard: it's tricky to describe a large file that has some blocks in poolA and some in poolB where poolB has a different block size (perhaps during a migration) with the existing inode/indirect block format where each disk address pointer addresses a block of fixed size. With some effort, and some changes to how block addressing works, it would be possible to implement the support for this. However, as I mentioned in another post in this thread, we don't really want to glorify manual block size selection any further, we want to move away from it, by addressing the reasons that drive different block size selection today (like disk space utilization and performance). I'd recommend calculating a file size distribution histogram for your file systems. You may, for example, discover that 80% of the small files you have in /home would fit into 4K inodes, and then the storage space efficiency gains for the remaining 20% don't justify the complexity of managing an extra file system with a small block size. We don't recommend using block sizes smaller than 256K, because smaller block size is not good for disk space allocation code efficiency. It's a quadratic dependency: with a smaller block size, one block worth of the block allocation map covers that much less disk space, because each bit in the map covers fewer disk sectors, and fewer bits fit into a block. This means having to create a lot more block allocation map segments than what is needed for an ample level of parallelism. This hurts performance of many block allocation-related operations. I don't see a reason for /scratch and /data to be separate file systems, aside from perhaps failure domain considerations. 
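(To make the quadratic dependency concrete -- a rough sketch that assumes one allocation bit per subblock (1/32nd of a block) and ignores per-block header overhead:)

disk space covered by one allocation-map block
  ~= (bits per map block) x (space tracked per bit)
  ~= (blockSize x 8) x (blockSize / 32)
  =  blockSize^2 / 4

Going from a 256 KiB block size to 1 MiB (4x) therefore raises the coverage of each map block roughly 16x (on the order of 16 GiB versus 256 GiB per map block under these assumptions), which is why small block sizes multiply the number of allocation map segments.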
yuri > On Sep 26, 2016, at 2:29 PM, Yuri L Volobuev wrote: > > I would put the net summary this way: in GPFS, the "Goldilocks zone" > for metadata block size is 256K - 1M. If one plans to create a new > file system using GPFS V4.2+, 1M is a sound choice. > > In an ideal world, block size choice shouldn't really be a choice. > It's a low-level implementation detail that one day should go the > way of the manual ignition timing adjustment -- something that used > to be necessary in the olden days, and something that select > enthusiasts like to tweak to this day, but something that's > irrelevant for the overwhelming majority of the folks who just want > the engine to run. There's work being done in that general direction > in GPFS, but we aren't there yet. > > yuri > > Stephen Ulmer ---09/26/2016 12:02:25 PM---Now I?ve got > anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: > > From: Stephen Ulmer > To: gpfsug main discussion list , > Date: 09/26/2016 12:02 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Now I?ve got anther question? which I?ll let bake for a while. > > Okay, to (poorly) summarize: > There are items OTHER THAN INODES stored as metadata in GPFS. > These items have a VARIETY OF SIZES, but are packed in such a way > that we should just not worry about wasted space unless we pick a > LARGE metadata block size ? or if we don?t pick a ?reasonable? > metadata block size after picking a ?large? file system block size > that applies to both. > Performance is hard, and the gain from calculating exactly the best > metadata block size is much smaller than performance gains attained > through code optimization. > If we were to try and calculate the appropriate metadata block size > we would likely be wrong anyway, since none of us get our data at > the idealized physics shop that sells massless rulers and > frictionless pulleys. > We should probably all use a metadata block size around 1MB. Nobody > has said this outright, but it?s been the example as the ?good? size > at least three times in this thread. > Under no circumstances should we do what many of us would have done > and pick 128K, which made sense based on all of our previous > education that is no longer applicable. > > Did I miss anything? :) > > Liberty, > > -- > Stephen > > On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev wrote: > It's important to understand the differences between different > metadata types, in particular where it comes to space allocation. > > System metadata files (inode file, inode and block allocation maps, > ACL file, fileset metadata file, EA file in older versions) are > allocated at well-defined moments (file system format, new storage > pool creation in the case of block allocation map, etc), and those > contain multiple records packed into a single block. From the block > allocator point of view, the individual metadata record size is > invisible, only larger blocks get actually allocated, and space > usage efficiency generally isn't an issue. > > For user metadata (indirect blocks, directory blocks, EA overflow > blocks) the situation is different. Those get allocated as the need > arises, generally one at a time. So the size of an individual > metadata structure matters, a lot. The smallest unit of allocation > in GPFS is a subblock (1/32nd of a block). If an IB or a directory > block is smaller than a subblock, the unused space in the subblock > is wasted. 
So if one chooses to use, say, 16 MiB block size for > metadata, the smallest unit of space that can be allocated is 512 > KiB. If one chooses 1 MiB block size, the smallest allocation unit > is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with > any reasonable data block size); directory blocks used to be limited > to 32 KiB, but in the current code can be as large as 256 KiB. As > one can observe, using 16 MiB metadata block size would lead to a > considerable amount of wasted space for IBs and large directories > (small directories can live in inodes). On the other hand, with 1 > MiB block size, there'll be no wasted metadata space. Does any of > this actually make a practical difference? That depends on the file > system composition, namely the number of IBs (which is a function of > the number of large files) and larger directories. Calculating this > scientifically can be pretty involved, and really should be the job > of a tool that ought to exist, but doesn't (yet). A more practical > approach is doing a ballpark estimate using local file counts and > typical fractions of large files and directories, using statistics > available from published papers. > > The performance implications of a given metadata block size choice > is a subject of nearly infinite depth, and this question ultimately > can only be answered by doing experiments with a specific workload > on specific hardware. The metadata space utilization efficiency is > something that can be answered conclusively though. > > yuri > > "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi > Sven, I am confused by your statement that the metadata block size > should be 1 MB and am very int > > From: "Buterbaugh, Kevin L" > To: gpfsug main discussion list , > Date: 09/24/2016 07:19 AM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Hi Sven, > > I am confused by your statement that the metadata block size should > be 1 MB and am very interested in learning the rationale behind this > as I am currently looking at all aspects of our current GPFS > configuration and the possibility of making major changes. > > If you have a filesystem with only metadataOnly disks in the system > pool and the default size of an inode is 4K (which we would do, > since we have recently discovered that even on our scratch > filesystem we have a bazillion files that are 4K or smaller and > could therefore have their data stored in the inode, right?), then > why would you set the metadata block size to anything larger than > 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block > size for metadata wouldn?t you be wasting a massive amount of space? > > What am I missing / confused about there? > > Oh, and here?s a related question ? let?s just say I have the above > configuration ? my system pool is metadata only and is on SSD?s. > Then I have two other dataOnly pools that are spinning disk. One is > for ?regular? access and the other is the ?capacity? pool ? i.e. a > pool of slower storage where we move files with large access times. > I have a policy that says something like ?move all files with an > access time > 6 months to the capacity pool.? Of those bazillion > files less than 4K in size that are fitting in the inode currently, > probably half a bazillion () of them would be subject to that > rule. Will they get moved to the spinning disk capacity pool or will > they stay in the inode?? > > Thanks! This is a very timely and interesting discussion for me as well... 
> > Kevin > On Sep 23, 2016, at 4:35 PM, Sven Oehme wrote: > your metadata block size these days should be 1 MB and there are > only very few workloads for which you should run with a filesystem > blocksize below 1 MB. so if you don't know exactly what to pick, 1 > MB is a good starting point. > the general rule still applies that your filesystem blocksize > (metadata or data pool) should match your raid controller (or GNR > vdisk) stripe size of the particular pool. > > so if you use a 128k strip size(defaut in many midrange storage > controllers) in a 8+2p raid array, your stripe or track size is 1 MB > and therefore the blocksize of this pool should be 1 MB. i see many > customers in the field using 1MB or even smaller blocksize on RAID > stripes of 2 MB or above and your performance will be significant > impacted by that. > > Sven > > ------------------------------------------ > Sven Oehme > Scalable Storage Research > email: oehmes at us.ibm.com > Phone: +1 (408) 824-8904 > IBM Almaden Research Lab > ------------------------------------------ > > Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too > pedantic, but I believe the the subblock size is 1/32 of the block > size (which strengt > > From: Stephen Ulmer > To: gpfsug main discussion list > Date: 09/23/2016 12:16 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Not to be too pedantic, but I believe the the subblock size is 1/32 > of the block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but > about metadata block size. You can specify that the system pool have > a different block size from the rest of the filesystem, providing > that it ONLY holds metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems > without some counter-indication), I would think that we?d want to > use a metadata block size of 4K*32=128K. This is independent of the > regular block size, which you can calculate based on the workload if > you?re lucky. > > There could be a great reason NOT to use 128K metadata block size, > but I don?t know what it is. I?d be happy to be corrected about this > if it?s out of whack. > > -- > Stephen > On Sep 22, 2016, at 3:37 PM, Luis Bolinches wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small > files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest > that you will actually benefit from smaller than 1MB block, leave > there. GPFS uses sublocks where 1/16th of the BS can be allocated to > different files, so the "waste" is much less than you think on 1MB > and you get the throughput and less structures of much more data blocks. > > No warranty at all but I try to do this when the BS talk comes in: > (might need some clean up it could not be last note but you get the idea) > > POSIX > find . 
-type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to > inodes (WELL not exactly ... extended attributes! so maybe use a > lower number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and > lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful > machine to run the calculations on octave, I never hit anything > could not manage on a 64GB RAM Power box. Most of the times it is > enough with my laptop. > > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very small > > files in the inode. > > > > Typically you want your average file size to be your blocksize and your > > filesystem has one blocksize and one inodesize. > > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. > > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From chekh at stanford.edu Tue Sep 27 19:51:50 2016 From: chekh at stanford.edu (Alex Chekholko) Date: Tue, 27 Sep 2016 11:51:50 -0700 Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: References: Message-ID: <8c7fd395-1efc-7197-4a98-763ba784cafd@stanford.edu> On 09/27/2016 11:04 AM, Andreas Landh?u?er wrote: > if a file gets hot again, there is no rule for putting the file back > into a faster storage device? The file will get moved when you run the policy again. You can run the policy as often as you like. There is also a way to use a GPFS hook to trigger policy run. Check 'mmaddcallback' But I think you have to be careful and think through the complexity. e.g. load spikes and pool fills up and your callback kicks in and starts a migration which increases the I/O load further, etc... Regards, -- Alex Chekholko chekh at stanford.edu From makaplan at us.ibm.com Tue Sep 27 20:27:47 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Tue, 27 Sep 2016 15:27:47 -0400 Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: References: Message-ID: Read about GROUP POOL - you can call as often as you like to "repack" the files into several pools from hot to cold. Of course, there is a cost to running mmapplypolicy... So maybe you'd just run it once every day or so... -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhejtman at ics.muni.cz Tue Sep 27 20:38:16 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Tue, 27 Sep 2016 21:38:16 +0200 Subject: [gpfsug-discuss] Samba via CES Message-ID: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> Hello, does CES offer high availability of SMB? I.e., does used samba server provide cluster wide persistent handles? Or failover from node to node is currently not supported without the client disruption? -- Luk?? 
Hejtm?nek From erich at uw.edu Tue Sep 27 21:56:20 2016 From: erich at uw.edu (Eric Horst) Date: Tue, 27 Sep 2016 13:56:20 -0700 Subject: [gpfsug-discuss] File_heat for GPFS File Systems In-Reply-To: <8c7fd395-1efc-7197-4a98-763ba784cafd@stanford.edu> References: <8c7fd395-1efc-7197-4a98-763ba784cafd@stanford.edu> Message-ID: >> >> if a file gets hot again, there is no rule for putting the file back >> into a faster storage device? > > > The file will get moved when you run the policy again. You can run the > policy as often as you like. I think its worth stating clearly that if a file is in the Thrifty slow pool and a user opens and reads/writes the file there is nothing that moves this file to a different tier. A policy run is the only action that relocates files. So if you apply the policy daily and over the course of the day users access many cold files, the performance accessing those cold files may not be ideal until the next day when they are repacked by heat. A file is not automatically moved to the fast tier on access read or write. I mention this because this aspect of tiering was not immediately clear from the docs when I was a neophyte GPFS admin and I had to learn by observation. It is easy for one to make an assumption that it is a more dynamic tiering system than it is. -Eric -- Eric Horst University of Washington From Kevin.Buterbaugh at Vanderbilt.Edu Tue Sep 27 22:21:23 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Tue, 27 Sep 2016 21:21:23 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> Message-ID: <4BDC0E5A-176A-48CC-8DE6-93C7B5A3F138@vanderbilt.edu> Hi All, Again, my thanks to all who responded to my last post. Let me begin by stating something I unintentionally omitted in my last post ? we already use SSDs for our metadata. Which leads me to yet another question ? of my three filesystems, two (/home and /scratch) are much older (created in 2010) and therefore currently have a 512 byte inode size. /data is newer and has a 4K inode size. Now if I combine /scratch and /data into one filesystem with a 4K inode size, the amount of space used by all the inodes coming over from /scratch is going to grow by a factor of eight unless I?m horribly confused. And I would assume I need to count the amount of space taken up by allocated inodes, not just used inodes. Therefore ? how much space my metadata takes up just grew significantly in importance since: 1) metadata is on very expensive enterprise class, vendor certified SSDs, 2) we use RAID 1 mirrors of those SSDs, and 3) we have metadata replication set to two. Some of the information presented by Sven and Yuri seems to contradict each other in regards to how much space inodes take up ? or I?m misunderstanding one or both of them! Leaving aside replication, if I use a 256K block size for my metadata and I use 4K inodes, are those inodes going to take up 4K each or are they going to take up 8K each (1/32nd of a 256K block)? By the way, I do have a file size / file age spreadsheet for each of my filesystems (which I would be willing to share with interested parties) and while I was not surprised to learn that I have over 10 million sub-1K files on /home, I was a bit surprised to find that I have almost as many sub-1K files on /scratch (and a few million more on /data). 
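To put a rough number on that, counting allocated rather than just used inodes, a sketch along these lines works; the device name is illustrative and the awk pattern matches the "Number of allocated inodes" line of mmdf's Inode Information section as printed on the releases we see, so adjust it if your output differs:

fs=scratch                                            # illustrative device name
alloc=$(mmdf "$fs" | awk '/allocated inodes/ {print $NF}')
# allocated inodes x 4096-byte inodes x 2 metadata replicas, reported in GiB
echo "$alloc" | awk '{ printf "approx. inode metadata at 4K inodes, 2 replicas: %.1f GiB\n", $1*4096*2/2^30 }'

Whatever the exact figure, it is the allocated-inode count that drives the SSD and replication cost described above.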
So there?s a huge potential win in having those files in the inode on SSD as opposed to on spinning disk, but there?s also a huge potential $$$ cost. Thanks again ? I hope others are gaining useful information from this thread. I sure am! Kevin On Sep 27, 2016, at 1:26 PM, Yuri L Volobuev > wrote: > 1) Let?s assume that our overarching goal in configuring the block > size for metadata is performance from the user perspective ? i.e. > how fast is an ?ls -l? on my directory? Space savings aren?t > important, and how long policy scans or other ?administrative? type > tasks take is not nearly as important as that directory listing. > Does that change the recommended metadata block size? The performance challenges for the "ls -l" scenario are quite different from the policy scan scenario, so the same rules do not necessarily apply. During "ls -l" the code has to read inodes one by one (there's some prefetching going on, to take the edge off for the actual 'ls' thread, but prefetching is still one inode at a time). Metadata block size doesn't really come into the picture in this case, but inode size can be important -- depending on the storage performance characteristics. Does the storage you use for metadata exhibit a meaningfully different latency for 4K random reads vs 512 byte random reads? In my personal experience, on any modern storage device the difference is non-existent; in fact many devices (like all flash-based storage) use 4K native physical block size, and merely emulate 512 byte "sectors", so there's no way to read less than 4K. So from the inode read latency point of view 4K vs 512B is most likely a wash, but then 4K inodes can help improve performance of other operations, e.g. readdir of a small directory which fits entirely into the inode. If you use xattrs (e.g. as a side effect of using HSM), 4K inodes definitely help, but allowing xattrs to be stored in the inode. Policy scans reads inodes in full blocks, and there both metadata block size and inode size matter. Larger blocks could improve the inode read performance, while larger inodes mean that the same number of blocks hold fewer inodes and thus more blocks need to be read. So the policy inode scan performance is benefited by larger metadata block size and smaller inodes. However, policy scans also have to perform a directory traversal step, and that step tends to dominate the runtime of the overall run, and using larger inodes actually helps to speed up traversal of smaller directories. So whether larger inodes help or hurt the policy scan performance depends, yet again, on your file system composition. Overall, I believe that with all angles considered, larger inodes help with performance, and that was one of the considerations for making 4K inodes the default in V4.2+ versions. > 2) Let?s assume we have 3 filesystems, /home, /scratch (traditional > HPC use for those two) and /data (project space). Our storage > arrays are 24-bay units with two 8+2P RAID 6 LUNs, one RAID 1 > mirror, and two hot spare drives. The RAID 1 mirrors are for /home, > the RAID 6 LUNs are for /scratch or /data. /home has tons of small > files - so small that a 64K block size is currently used. /scratch > and /data have a mixture, but a 1 MB block size is the ?sweet spot? there. > > If you could ?start all over? 
with the same hardware being the only > restriction, would you: > > a) merge /scratch and /data into one filesystem but keep /home > separate since the LUN sizes are so very different, or > b) merge all three into one filesystem and use storage pools so that > /home is just a separate pool within the one filesystem? And if you > chose this option would you assign different block sizes to the pools? It's not possible to have different block sizes for different data pools. We are very aware that many people would like to be able to do just that, but this is counter to where the product is going. Supporting different block sizes for different pools is actually pretty hard: it's tricky to describe a large file that has some blocks in poolA and some in poolB where poolB has a different block size (perhaps during a migration) with the existing inode/indirect block format where each disk address pointer addresses a block of fixed size. With some effort, and some changes to how block addressing works, it would be possible to implement the support for this. However, as I mentioned in another post in this thread, we don't really want to glorify manual block size selection any further, we want to move away from it, by addressing the reasons that drive different block size selection today (like disk space utilization and performance). I'd recommend calculating a file size distribution histogram for your file systems. You may, for example, discover that 80% of the small files you have in /home would fit into 4K inodes, and then the storage space efficiency gains for the remaining 20% don't justify the complexity of managing an extra file system with a small block size. We don't recommend using block sizes smaller than 256K, because smaller block size is not good for disk space allocation code efficiency. It's a quadratic dependency: with a smaller block size, one block worth of the block allocation map covers that much less disk space, because each bit in the map covers fewer disk sectors, and fewer bits fit into a block. This means having to create a lot more block allocation map segments than what is needed for an ample level of parallelism. This hurts performance of many block allocation-related operations. I don't see a reason for /scratch and /data to be separate file systems, aside from perhaps failure domain considerations. yuri > On Sep 26, 2016, at 2:29 PM, Yuri L Volobuev > wrote: > > I would put the net summary this way: in GPFS, the "Goldilocks zone" > for metadata block size is 256K - 1M. If one plans to create a new > file system using GPFS V4.2+, 1M is a sound choice. > > In an ideal world, block size choice shouldn't really be a choice. > It's a low-level implementation detail that one day should go the > way of the manual ignition timing adjustment -- something that used > to be necessary in the olden days, and something that select > enthusiasts like to tweak to this day, but something that's > irrelevant for the overwhelming majority of the folks who just want > the engine to run. There's work being done in that general direction > in GPFS, but we aren't there yet. > > yuri > > Stephen Ulmer ---09/26/2016 12:02:25 PM---Now I?ve got > anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: > > From: Stephen Ulmer > > To: gpfsug main discussion list >, > Date: 09/26/2016 12:02 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Now I?ve got anther question? which I?ll let bake for a while. 
> > Okay, to (poorly) summarize: > There are items OTHER THAN INODES stored as metadata in GPFS. > These items have a VARIETY OF SIZES, but are packed in such a way > that we should just not worry about wasted space unless we pick a > LARGE metadata block size ? or if we don?t pick a ?reasonable? > metadata block size after picking a ?large? file system block size > that applies to both. > Performance is hard, and the gain from calculating exactly the best > metadata block size is much smaller than performance gains attained > through code optimization. > If we were to try and calculate the appropriate metadata block size > we would likely be wrong anyway, since none of us get our data at > the idealized physics shop that sells massless rulers and > frictionless pulleys. > We should probably all use a metadata block size around 1MB. Nobody > has said this outright, but it?s been the example as the ?good? size > at least three times in this thread. > Under no circumstances should we do what many of us would have done > and pick 128K, which made sense based on all of our previous > education that is no longer applicable. > > Did I miss anything? :) > > Liberty, > > -- > Stephen > > On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev > wrote: > It's important to understand the differences between different > metadata types, in particular where it comes to space allocation. > > System metadata files (inode file, inode and block allocation maps, > ACL file, fileset metadata file, EA file in older versions) are > allocated at well-defined moments (file system format, new storage > pool creation in the case of block allocation map, etc), and those > contain multiple records packed into a single block. From the block > allocator point of view, the individual metadata record size is > invisible, only larger blocks get actually allocated, and space > usage efficiency generally isn't an issue. > > For user metadata (indirect blocks, directory blocks, EA overflow > blocks) the situation is different. Those get allocated as the need > arises, generally one at a time. So the size of an individual > metadata structure matters, a lot. The smallest unit of allocation > in GPFS is a subblock (1/32nd of a block). If an IB or a directory > block is smaller than a subblock, the unused space in the subblock > is wasted. So if one chooses to use, say, 16 MiB block size for > metadata, the smallest unit of space that can be allocated is 512 > KiB. If one chooses 1 MiB block size, the smallest allocation unit > is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with > any reasonable data block size); directory blocks used to be limited > to 32 KiB, but in the current code can be as large as 256 KiB. As > one can observe, using 16 MiB metadata block size would lead to a > considerable amount of wasted space for IBs and large directories > (small directories can live in inodes). On the other hand, with 1 > MiB block size, there'll be no wasted metadata space. Does any of > this actually make a practical difference? That depends on the file > system composition, namely the number of IBs (which is a function of > the number of large files) and larger directories. Calculating this > scientifically can be pretty involved, and really should be the job > of a tool that ought to exist, but doesn't (yet). A more practical > approach is doing a ballpark estimate using local file counts and > typical fractions of large files and directories, using statistics > available from published papers. 
> > The performance implications of a given metadata block size choice > is a subject of nearly infinite depth, and this question ultimately > can only be answered by doing experiments with a specific workload > on specific hardware. The metadata space utilization efficiency is > something that can be answered conclusively though. > > yuri > > "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi > Sven, I am confused by your statement that the metadata block size > should be 1 MB and am very int > > From: "Buterbaugh, Kevin L" > > To: gpfsug main discussion list >, > Date: 09/24/2016 07:19 AM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Hi Sven, > > I am confused by your statement that the metadata block size should > be 1 MB and am very interested in learning the rationale behind this > as I am currently looking at all aspects of our current GPFS > configuration and the possibility of making major changes. > > If you have a filesystem with only metadataOnly disks in the system > pool and the default size of an inode is 4K (which we would do, > since we have recently discovered that even on our scratch > filesystem we have a bazillion files that are 4K or smaller and > could therefore have their data stored in the inode, right?), then > why would you set the metadata block size to anything larger than > 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block > size for metadata wouldn?t you be wasting a massive amount of space? > > What am I missing / confused about there? > > Oh, and here?s a related question ? let?s just say I have the above > configuration ? my system pool is metadata only and is on SSD?s. > Then I have two other dataOnly pools that are spinning disk. One is > for ?regular? access and the other is the ?capacity? pool ? i.e. a > pool of slower storage where we move files with large access times. > I have a policy that says something like ?move all files with an > access time > 6 months to the capacity pool.? Of those bazillion > files less than 4K in size that are fitting in the inode currently, > probably half a bazillion () of them would be subject to that > rule. Will they get moved to the spinning disk capacity pool or will > they stay in the inode?? > > Thanks! This is a very timely and interesting discussion for me as well... > > Kevin > On Sep 23, 2016, at 4:35 PM, Sven Oehme > wrote: > your metadata block size these days should be 1 MB and there are > only very few workloads for which you should run with a filesystem > blocksize below 1 MB. so if you don't know exactly what to pick, 1 > MB is a good starting point. > the general rule still applies that your filesystem blocksize > (metadata or data pool) should match your raid controller (or GNR > vdisk) stripe size of the particular pool. > > so if you use a 128k strip size(defaut in many midrange storage > controllers) in a 8+2p raid array, your stripe or track size is 1 MB > and therefore the blocksize of this pool should be 1 MB. i see many > customers in the field using 1MB or even smaller blocksize on RAID > stripes of 2 MB or above and your performance will be significant > impacted by that. 
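Expressed as a create command, and purely as an illustration (device name, stanza file and mount point are made up), Sven's sizing works out to something like:

# 8+2P RAID6 with a 128 KiB strip -> 1 MiB full stripe, so a 1 MiB data block size;
# metadataOnly system pool on SSD, 4 KiB inodes, metadata replicated twice.
mmcrfs gpfs1 -F /tmp/nsd.stanza -B 1M --metadata-block-size 1M \
       -i 4096 -m 2 -M 2 -r 1 -R 2 -A yes -T /gpfs/gpfs1

The --metadata-block-size option only applies when the system pool holds metadata only, which is the configuration being discussed here.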
> > Sven > > ------------------------------------------ > Sven Oehme > Scalable Storage Research > email: oehmes at us.ibm.com > Phone: +1 (408) 824-8904 > IBM Almaden Research Lab > ------------------------------------------ > > Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too > pedantic, but I believe the the subblock size is 1/32 of the block > size (which strengt > > From: Stephen Ulmer > > To: gpfsug main discussion list > > Date: 09/23/2016 12:16 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Not to be too pedantic, but I believe the the subblock size is 1/32 > of the block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but > about metadata block size. You can specify that the system pool have > a different block size from the rest of the filesystem, providing > that it ONLY holds metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems > without some counter-indication), I would think that we?d want to > use a metadata block size of 4K*32=128K. This is independent of the > regular block size, which you can calculate based on the workload if > you?re lucky. > > There could be a great reason NOT to use 128K metadata block size, > but I don?t know what it is. I?d be happy to be corrected about this > if it?s out of whack. > > -- > Stephen > On Sep 22, 2016, at 3:37 PM, Luis Bolinches > wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small > files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest > that you will actually benefit from smaller than 1MB block, leave > there. GPFS uses sublocks where 1/16th of the BS can be allocated to > different files, so the "waste" is much less than you think on 1MB > and you get the throughput and less structures of much more data blocks. > > No warranty at all but I try to do this when the BS talk comes in: > (might need some clean up it could not be last note but you get the idea) > > POSIX > find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to > inodes (WELL not exactly ... extended attributes! so maybe use a > lower number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and > lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful > machine to run the calculations on octave, I never hit anything > could not manage on a 64GB RAM Power box. Most of the times it is > enough with my laptop. 
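For what it is worth, the mean and median can also be pulled straight out of that CSV with awk, which avoids loading everything into octave; this assumes the find_ls_files.out.csv produced by the recipe above:

# strip the trailing commas, sort the byte sizes, then take mean and middle element
tr -d ',' < find_ls_files.out.csv | sort -n |
awk '{ a[NR] = $1; sum += $1 }
     END { if (NR) printf "files=%d  mean=%.0f  median=%d\n", NR, sum/NR, a[int((NR+1)/2)] }'

For an even file count this takes the lower of the two middle values, which is close enough for a block-size discussion.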
> > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene > > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very small > > files in the inode. > > > > Typically you want your average file size to be your blocksize and your > > filesystem has one blocksize and one inodesize. > > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. > > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From christof.schmitt at us.ibm.com Tue Sep 27 22:36:37 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Tue, 27 Sep 2016 14:36:37 -0700 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> References: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> Message-ID: When a CES node fails, protocol clients have to reconnect to one of the remaining nodes. Samba in CES does not support persistent handles. This is indicated in the documentation: http://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_smbexportlimits.htm#bl1adm_smbexportlimits "Only mandatory SMB3 protocol features are supported. " Regards, Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Lukas Hejtmanek To: gpfsug-discuss at spectrumscale.org Date: 09/27/2016 12:38 PM Subject: [gpfsug-discuss] Samba via CES Sent by: gpfsug-discuss-bounces at spectrumscale.org Hello, does CES offer high availability of SMB? I.e., does used samba server provide cluster wide persistent handles? Or failover from node to node is currently not supported without the client disruption? -- Luk?? Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From xhejtman at ics.muni.cz Tue Sep 27 22:42:57 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Tue, 27 Sep 2016 23:42:57 +0200 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: References: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> Message-ID: <20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> On Tue, Sep 27, 2016 at 02:36:37PM -0700, Christof Schmitt wrote: > When a CES node fails, protocol clients have to reconnect to one of the > remaining nodes. > > Samba in CES does not support persistent handles. This is indicated in the > documentation: > > http://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_smbexportlimits.htm#bl1adm_smbexportlimits > > "Only mandatory SMB3 protocol features are supported. " well, but in this case, HA feature is a bit pointless as node fail results in a client failure as well as reconnect does not seem to be automatic if there is on going traffic.. more precisely reconnect is automatic but without persistent handles, the client receives write protect error immediately. -- Luk?? Hejtm?nek From Greg.Lehmann at csiro.au Wed Sep 28 08:40:35 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Wed, 28 Sep 2016 07:40:35 +0000 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <4BDC0E5A-176A-48CC-8DE6-93C7B5A3F138@vanderbilt.edu> References: <4890d1b0-2f12-f8b6-684d-c98ca2b71ab7@docum.org> <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org> <13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org> <6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> <4BDC0E5A-176A-48CC-8DE6-93C7B5A3F138@vanderbilt.edu> Message-ID: <428599f3d6cb47ebb74d05178eeba2b8@exch1-cdc.nexus.csiro.au> I am wondering what people use to produce a file size distribution report for their filesystems. Has everyone rolled their own or is there some goto app to use. 
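One roll-your-own sketch, assuming GNU find and an illustrative mount path; on a big filesystem a policy-engine scan, or the filehist sample mentioned later in this digest, will be much quicker than walking the tree:

# bucket file sizes into powers of two and count each bucket
find /gpfs/scratch -type f -printf '%s\n' 2>/dev/null |
awk '{ b = 1; while ($1 > b) b *= 2; h[b]++ }
     END { for (k in h) printf "%14d %d\n", k, h[k] }' | sort -n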
Cheers,

Greg

From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L
Sent: Wednesday, 28 September 2016 7:21 AM
To: gpfsug main discussion list
Subject: Re: [gpfsug-discuss] Blocksize

[Kevin Buterbaugh's 27 September "Blocksize" message and the full quoted Volobuev / Oehme / Ulmer / Bolinches / Coene thread follow here, identical to the copies earlier in this digest.]

_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From alandhae at gmx.de Wed Sep 28 10:13:55 2016
From: alandhae at gmx.de (Andreas Landhäußer)
Date: Wed, 28 Sep 2016 11:13:55 +0200 (CEST)
Subject: [gpfsug-discuss] Proposal for dynamic heat assisted file tiering
Message-ID:
So if you apply the policy daily and over > the course of the day users access many cold files, the performance > accessing those cold files may not be ideal until the next day when > they are repacked by heat. A file is not automatically moved to the > fast tier on access read or write. I mention this because this aspect > of tiering was not immediately clear from the docs when I was a > neophyte GPFS admin and I had to learn by observation. It is easy for > one to make an assumption that it is a more dynamic tiering system > than it is. -- Andreas Landh?u?er +49 151 12133027 (mobile) alandhae at gmx.de From Robert.Oesterlin at nuance.com Wed Sep 28 11:56:51 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Wed, 28 Sep 2016 10:56:51 +0000 Subject: [gpfsug-discuss] Blocksize - file size distribution Message-ID: /usr/lpp/mmfs/samples/debugtools/filehist Look at the README in that directory. Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: on behalf of "Greg.Lehmann at csiro.au" Reply-To: gpfsug main discussion list Date: Wednesday, September 28, 2016 at 2:40 AM To: "gpfsug-discuss at spectrumscale.org" Subject: [EXTERNAL] Re: [gpfsug-discuss] Blocksize I am wondering what people use to produce a file size distribution report for their filesystems. Has everyone rolled their own or is there some goto app to use. Cheers, Greg From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: Wednesday, 28 September 2016 7:21 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Blocksize Hi All, Again, my thanks to all who responded to my last post. Let me begin by stating something I unintentionally omitted in my last post ? we already use SSDs for our metadata. Which leads me to yet another question ? of my three filesystems, two (/home and /scratch) are much older (created in 2010) and therefore currently have a 512 byte inode size. /data is newer and has a 4K inode size. Now if I combine /scratch and /data into one filesystem with a 4K inode size, the amount of space used by all the inodes coming over from /scratch is going to grow by a factor of eight unless I?m horribly confused. And I would assume I need to count the amount of space taken up by allocated inodes, not just used inodes. Therefore ? how much space my metadata takes up just grew significantly in importance since: 1) metadata is on very expensive enterprise class, vendor certified SSDs, 2) we use RAID 1 mirrors of those SSDs, and 3) we have metadata replication set to two. Some of the information presented by Sven and Yuri seems to contradict each other in regards to how much space inodes take up ? or I?m misunderstanding one or both of them! Leaving aside replication, if I use a 256K block size for my metadata and I use 4K inodes, are those inodes going to take up 4K each or are they going to take up 8K each (1/32nd of a 256K block)? By the way, I do have a file size / file age spreadsheet for each of my filesystems (which I would be willing to share with interested parties) and while I was not surprised to learn that I have over 10 million sub-1K files on /home, I was a bit surprised to find that I have almost as many sub-1K files on /scratch (and a few million more on /data). So there?s a huge potential win in having those files in the inode on SSD as opposed to on spinning disk, but there?s also a huge potential $$$ cost. Thanks again ? I hope others are gaining useful information from this thread. I sure am! 
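For a rough sense of scale, that inode-file growth can be sketched with simple arithmetic; the counts below are placeholders rather than real numbers (substitute the allocated-inode figures from mmdf -F or similar), and, as noted later in this thread, inodes are packed into the inode file, so the metadata block size does not multiply this estimate:

allocated_inodes=100000000   # allocated, not just used
inode_size=4096              # bytes
replicas=2                   # metadata replication factor
echo "$(( allocated_inodes * inode_size * replicas / 1024**3 )) GiB for the inode file alone"

With those placeholder values the answer is on the order of 760 GiB, which is why the allocated (not merely used) inode count is the number to watch when metadata lives on mirrored SSD.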
Kevin On Sep 27, 2016, at 1:26 PM, Yuri L Volobuev > wrote: > 1) Let?s assume that our overarching goal in configuring the block > size for metadata is performance from the user perspective ? i.e. > how fast is an ?ls -l? on my directory? Space savings aren?t > important, and how long policy scans or other ?administrative? type > tasks take is not nearly as important as that directory listing. > Does that change the recommended metadata block size? The performance challenges for the "ls -l" scenario are quite different from the policy scan scenario, so the same rules do not necessarily apply. During "ls -l" the code has to read inodes one by one (there's some prefetching going on, to take the edge off for the actual 'ls' thread, but prefetching is still one inode at a time). Metadata block size doesn't really come into the picture in this case, but inode size can be important -- depending on the storage performance characteristics. Does the storage you use for metadata exhibit a meaningfully different latency for 4K random reads vs 512 byte random reads? In my personal experience, on any modern storage device the difference is non-existent; in fact many devices (like all flash-based storage) use 4K native physical block size, and merely emulate 512 byte "sectors", so there's no way to read less than 4K. So from the inode read latency point of view 4K vs 512B is most likely a wash, but then 4K inodes can help improve performance of other operations, e.g. readdir of a small directory which fits entirely into the inode. If you use xattrs (e.g. as a side effect of using HSM), 4K inodes definitely help, but allowing xattrs to be stored in the inode. Policy scans reads inodes in full blocks, and there both metadata block size and inode size matter. Larger blocks could improve the inode read performance, while larger inodes mean that the same number of blocks hold fewer inodes and thus more blocks need to be read. So the policy inode scan performance is benefited by larger metadata block size and smaller inodes. However, policy scans also have to perform a directory traversal step, and that step tends to dominate the runtime of the overall run, and using larger inodes actually helps to speed up traversal of smaller directories. So whether larger inodes help or hurt the policy scan performance depends, yet again, on your file system composition. Overall, I believe that with all angles considered, larger inodes help with performance, and that was one of the considerations for making 4K inodes the default in V4.2+ versions. > 2) Let?s assume we have 3 filesystems, /home, /scratch (traditional > HPC use for those two) and /data (project space). Our storage > arrays are 24-bay units with two 8+2P RAID 6 LUNs, one RAID 1 > mirror, and two hot spare drives. The RAID 1 mirrors are for /home, > the RAID 6 LUNs are for /scratch or /data. /home has tons of small > files - so small that a 64K block size is currently used. /scratch > and /data have a mixture, but a 1 MB block size is the ?sweet spot? there. > > If you could ?start all over? with the same hardware being the only > restriction, would you: > > a) merge /scratch and /data into one filesystem but keep /home > separate since the LUN sizes are so very different, or > b) merge all three into one filesystem and use storage pools so that > /home is just a separate pool within the one filesystem? And if you > chose this option would you assign different block sizes to the pools? It's not possible to have different block sizes for different data pools. 
We are very aware that many people would like to be able to do just that, but this is counter to where the product is going. Supporting different block sizes for different pools is actually pretty hard: it's tricky to describe a large file that has some blocks in poolA and some in poolB where poolB has a different block size (perhaps during a migration) with the existing inode/indirect block format where each disk address pointer addresses a block of fixed size. With some effort, and some changes to how block addressing works, it would be possible to implement the support for this. However, as I mentioned in another post in this thread, we don't really want to glorify manual block size selection any further, we want to move away from it, by addressing the reasons that drive different block size selection today (like disk space utilization and performance). I'd recommend calculating a file size distribution histogram for your file systems. You may, for example, discover that 80% of the small files you have in /home would fit into 4K inodes, and then the storage space efficiency gains for the remaining 20% don't justify the complexity of managing an extra file system with a small block size. We don't recommend using block sizes smaller than 256K, because smaller block size is not good for disk space allocation code efficiency. It's a quadratic dependency: with a smaller block size, one block worth of the block allocation map covers that much less disk space, because each bit in the map covers fewer disk sectors, and fewer bits fit into a block. This means having to create a lot more block allocation map segments than what is needed for an ample level of parallelism. This hurts performance of many block allocation-related operations. I don't see a reason for /scratch and /data to be separate file systems, aside from perhaps failure domain considerations. yuri > On Sep 26, 2016, at 2:29 PM, Yuri L Volobuev > wrote: > > I would put the net summary this way: in GPFS, the "Goldilocks zone" > for metadata block size is 256K - 1M. If one plans to create a new > file system using GPFS V4.2+, 1M is a sound choice. > > In an ideal world, block size choice shouldn't really be a choice. > It's a low-level implementation detail that one day should go the > way of the manual ignition timing adjustment -- something that used > to be necessary in the olden days, and something that select > enthusiasts like to tweak to this day, but something that's > irrelevant for the overwhelming majority of the folks who just want > the engine to run. There's work being done in that general direction > in GPFS, but we aren't there yet. > > yuri > > Stephen Ulmer ---09/26/2016 12:02:25 PM---Now I?ve got > anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: > > From: Stephen Ulmer > > To: gpfsug main discussion list >, > Date: 09/26/2016 12:02 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Now I?ve got anther question? which I?ll let bake for a while. > > Okay, to (poorly) summarize: > There are items OTHER THAN INODES stored as metadata in GPFS. > These items have a VARIETY OF SIZES, but are packed in such a way > that we should just not worry about wasted space unless we pick a > LARGE metadata block size ? or if we don?t pick a ?reasonable? > metadata block size after picking a ?large? file system block size > that applies to both. 
> Performance is hard, and the gain from calculating exactly the best > metadata block size is much smaller than performance gains attained > through code optimization. > If we were to try and calculate the appropriate metadata block size > we would likely be wrong anyway, since none of us get our data at > the idealized physics shop that sells massless rulers and > frictionless pulleys. > We should probably all use a metadata block size around 1MB. Nobody > has said this outright, but it?s been the example as the ?good? size > at least three times in this thread. > Under no circumstances should we do what many of us would have done > and pick 128K, which made sense based on all of our previous > education that is no longer applicable. > > Did I miss anything? :) > > Liberty, > > -- > Stephen > > On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev > wrote: > It's important to understand the differences between different > metadata types, in particular where it comes to space allocation. > > System metadata files (inode file, inode and block allocation maps, > ACL file, fileset metadata file, EA file in older versions) are > allocated at well-defined moments (file system format, new storage > pool creation in the case of block allocation map, etc), and those > contain multiple records packed into a single block. From the block > allocator point of view, the individual metadata record size is > invisible, only larger blocks get actually allocated, and space > usage efficiency generally isn't an issue. > > For user metadata (indirect blocks, directory blocks, EA overflow > blocks) the situation is different. Those get allocated as the need > arises, generally one at a time. So the size of an individual > metadata structure matters, a lot. The smallest unit of allocation > in GPFS is a subblock (1/32nd of a block). If an IB or a directory > block is smaller than a subblock, the unused space in the subblock > is wasted. So if one chooses to use, say, 16 MiB block size for > metadata, the smallest unit of space that can be allocated is 512 > KiB. If one chooses 1 MiB block size, the smallest allocation unit > is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with > any reasonable data block size); directory blocks used to be limited > to 32 KiB, but in the current code can be as large as 256 KiB. As > one can observe, using 16 MiB metadata block size would lead to a > considerable amount of wasted space for IBs and large directories > (small directories can live in inodes). On the other hand, with 1 > MiB block size, there'll be no wasted metadata space. Does any of > this actually make a practical difference? That depends on the file > system composition, namely the number of IBs (which is a function of > the number of large files) and larger directories. Calculating this > scientifically can be pretty involved, and really should be the job > of a tool that ought to exist, but doesn't (yet). A more practical > approach is doing a ballpark estimate using local file counts and > typical fractions of large files and directories, using statistics > available from published papers. > > The performance implications of a given metadata block size choice > is a subject of nearly infinite depth, and this question ultimately > can only be answered by doing experiments with a specific workload > on specific hardware. The metadata space utilization efficiency is > something that can be answered conclusively though. 
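To make the 1/32nd arithmetic above concrete, a small sketch; the 32 KiB indirect block size is taken from the preceding paragraph and everything else is just the division:

# smallest allocatable unit is blocksize/32; compare it with a 32 KiB indirect block
for bs_kib in 256 1024 4096 16384; do
    sub_kib=$(( bs_kib / 32 ))
    waste_kib=$(( sub_kib > 32 ? sub_kib - 32 : 0 ))
    echo "${bs_kib} KiB block -> ${sub_kib} KiB subblock -> ${waste_kib} KiB wasted per 32 KiB IB"
done

At a 1 MiB block size the subblock is exactly 32 KiB, so nothing is stranded; at 16 MiB each indirect block wastes 480 KiB, which is the case being warned about.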
> > yuri > > "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi > Sven, I am confused by your statement that the metadata block size > should be 1 MB and am very int > > From: "Buterbaugh, Kevin L" > > To: gpfsug main discussion list >, > Date: 09/24/2016 07:19 AM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Hi Sven, > > I am confused by your statement that the metadata block size should > be 1 MB and am very interested in learning the rationale behind this > as I am currently looking at all aspects of our current GPFS > configuration and the possibility of making major changes. > > If you have a filesystem with only metadataOnly disks in the system > pool and the default size of an inode is 4K (which we would do, > since we have recently discovered that even on our scratch > filesystem we have a bazillion files that are 4K or smaller and > could therefore have their data stored in the inode, right?), then > why would you set the metadata block size to anything larger than > 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block > size for metadata wouldn?t you be wasting a massive amount of space? > > What am I missing / confused about there? > > Oh, and here?s a related question ? let?s just say I have the above > configuration ? my system pool is metadata only and is on SSD?s. > Then I have two other dataOnly pools that are spinning disk. One is > for ?regular? access and the other is the ?capacity? pool ? i.e. a > pool of slower storage where we move files with large access times. > I have a policy that says something like ?move all files with an > access time > 6 months to the capacity pool.? Of those bazillion > files less than 4K in size that are fitting in the inode currently, > probably half a bazillion () of them would be subject to that > rule. Will they get moved to the spinning disk capacity pool or will > they stay in the inode?? > > Thanks! This is a very timely and interesting discussion for me as well... > > Kevin > On Sep 23, 2016, at 4:35 PM, Sven Oehme > wrote: > your metadata block size these days should be 1 MB and there are > only very few workloads for which you should run with a filesystem > blocksize below 1 MB. so if you don't know exactly what to pick, 1 > MB is a good starting point. > the general rule still applies that your filesystem blocksize > (metadata or data pool) should match your raid controller (or GNR > vdisk) stripe size of the particular pool. > > so if you use a 128k strip size(defaut in many midrange storage > controllers) in a 8+2p raid array, your stripe or track size is 1 MB > and therefore the blocksize of this pool should be 1 MB. i see many > customers in the field using 1MB or even smaller blocksize on RAID > stripes of 2 MB or above and your performance will be significant > impacted by that. 
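As an illustration of that rule of thumb: with a 128 KiB strip on an 8+2P array, the full stripe is 8 x 128 KiB = 1 MiB, so a matching create might look like the sketch below. The device name and stanza file are placeholders, and --metadata-block-size only applies when the system pool holds metadata only:

# 8 data disks x 128 KiB strip = 1 MiB full stripe, so match the block size to it
mmcrfs gpfs1 -F /tmp/nsd.stanza -B 1M --metadata-block-size 1M -i 4096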
> > Sven > > ------------------------------------------ > Sven Oehme > Scalable Storage Research > email: oehmes at us.ibm.com > Phone: +1 (408) 824-8904 > IBM Almaden Research Lab > ------------------------------------------ > > Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too > pedantic, but I believe the the subblock size is 1/32 of the block > size (which strengt > > From: Stephen Ulmer > > To: gpfsug main discussion list > > Date: 09/23/2016 12:16 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Not to be too pedantic, but I believe the the subblock size is 1/32 > of the block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but > about metadata block size. You can specify that the system pool have > a different block size from the rest of the filesystem, providing > that it ONLY holds metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems > without some counter-indication), I would think that we?d want to > use a metadata block size of 4K*32=128K. This is independent of the > regular block size, which you can calculate based on the workload if > you?re lucky. > > There could be a great reason NOT to use 128K metadata block size, > but I don?t know what it is. I?d be happy to be corrected about this > if it?s out of whack. > > -- > Stephen > On Sep 22, 2016, at 3:37 PM, Luis Bolinches > wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small > files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest > that you will actually benefit from smaller than 1MB block, leave > there. GPFS uses sublocks where 1/16th of the BS can be allocated to > different files, so the "waste" is much less than you think on 1MB > and you get the throughput and less structures of much more data blocks. > > No warranty at all but I try to do this when the BS talk comes in: > (might need some clean up it could not be last note but you get the idea) > > POSIX > find . -type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to > inodes (WELL not exactly ... extended attributes! so maybe use a > lower number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and > lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful > machine to run the calculations on octave, I never hit anything > could not manage on a 64GB RAM Power box. Most of the times it is > enough with my laptop. 
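If octave is not to hand, the mean/median step can be approximated with a plain pipeline over the same CSV; a sketch only, and for an even count the median is taken as the lower middle value:

# strip the trailing commas, sort numerically (sort spills to disk, so memory use stays modest)
tr -d ',' < find_ls_files.out.csv | sort -n > /tmp/sizes.sorted
n=$(wc -l < /tmp/sizes.sorted)
awk '{ s += $1 } END { if (NR) printf "count=%d  mean=%.0f\n", NR, s/NR }' /tmp/sizes.sorted
echo "median=$(sed -n "$(( (n + 1) / 2 ))p" /tmp/sizes.sorted)"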
> > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene > > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very small > > files in the inode. > > > > Typically you want your average file size to be your blocksize and your > > filesystem has one blocksize and one inodesize. > > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. > > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From Kevin.Buterbaugh at Vanderbilt.Edu Wed Sep 28 14:45:14 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Wed, 28 Sep 2016 13:45:14 +0000 Subject: [gpfsug-discuss] Blocksize - file size distribution In-Reply-To: References: Message-ID: Greg, Not saying this is the right way to go, but I rolled my own. I wrote a very simple Perl script that essentially does the Perl equivalent of a find on my GPFS filesystems, then stat?s files and directories and writes the output to a text file. I run that one overnight or on the weekends. Takes about 6 hours to run across our 3 GPFS filesystems with metadata on SSDs. Then I?ve written a couple of different Perl scripts to analyze that data in different ways (the idea being that collecting the data is ?expensive? ? but once you?ve got it it?s cheap to analyze it in different ways). But the one I?ve been using for this project just breaks down the number of files and directories by size and age and produces a table. Rather than try to describe this, here?s sample output: For input file: gpfsFileInfo_20160915.txt <1 day | <1 wk | <1 mo | <2 mo | <3 mo | <4 mo | <5 mo | <6 mo | <1 yr | >1 year | Total Files <1 KB 29538 111364 458260 634398 150199 305715 4388443 93733 966618 3499535 10637803 <2 KB 9875 20580 119414 295167 35961 67761 80462 33688 269595 851641 1784144 <4 KB 9212 45282 168678 496796 27771 23887 105135 23161 259242 1163327 2322491 <8 KB 4992 29284 105836 303349 28341 20346 246430 28394 216061 1148459 2131492 <16 KB 3987 18391 92492 218639 20513 19698 675097 30976 190691 851533 2122017 <32 KB 4844 12479 50235 265222 24830 18789 1058433 18030 196729 1066287 2715878 <64 KB 6358 24259 29474 222134 17493 10744 1381445 11358 240528 1123540 3067333 <128 KB 6531 59107 206269 186213 71823 114235 1008724 36722 186357 845921 2721902 <256 KB 1995 17638 19355 436611 8505 7554 3582738 7519 249510 744885 5076310 <512 KB 20645 12401 24700 111463 5659 22132 1121269 10774 273010 725155 2327208 <1 MB 2681 6482 37447 58459 6998 14945 305108 5857 160360 386152 984489 <4 MB 4554 84551 23320 100407 6818 32833 129758 22774 210935 528458 1144408 <1 GB 56652 33538 99667 87778 24313 68372 118928 42554 251528 916493 1699823 <10 GB 1245 2482 4524 3184 1043 1794 2733 1694 8731 20462 47892 <100 GB 47 230 470 265 92 198 172 122 1276 2061 4933 >100 GB 2 3 12 1 14 4 5 1 37 165 244 Total TB: 6.49 13.22 30.56 18.00 10.22 15.69 19.87 12.48 73.47 187.44 Grand Total: 387.46 TB Everything other than the total space lines at the bottom are counts of number of files meeting that criteria. I?ve got another variation on the same script that we used when we were trying to determine how many old files we have and therefore how much data was an excellent candidate for moving to a slower, less expensive ?capacity? pool. I?m not sure how useful my tools would be to others ? I?m certainly not a professional programmer by any stretch of the imagination (and yes, I do hear those of you who are saying, ?Yeah, he?s barely a professional SysAdmin!? ). But others of you have been so helpful to me ? I?d like to try in some small way to help someone else. Kevin On Sep 28, 2016, at 5:56 AM, Oesterlin, Robert > wrote: /usr/lpp/mmfs/samples/debugtools/filehist Look at the README in that directory. 
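The same raw size/age data can also be gathered with the policy engine in minutes rather than hours, which is the LIST ... SHOW approach suggested further down the thread. A sketch, assuming a filesystem device named gpfs0; the list-file column layout varies a little between releases, so spot-check a line of the output before trusting the awk:

cat <<'EOF' > /tmp/sizes.pol
/* one record per file: size in bytes and age in days; run with -I defer so nothing is moved */
RULE 'allfiles' LIST 'sz'
  SHOW( VARCHAR(FILE_SIZE) || ' ' || VARCHAR(DAYS(CURRENT_TIMESTAMP) - DAYS(ACCESS_TIME)) )
EOF
mmapplypolicy gpfs0 -P /tmp/sizes.pol -I defer -f /tmp/scan

# records look roughly like: inode gen snapid size age -- /path/to/file; adjust $4 if yours differ
awk '{ sz = $4 + 0; b = 1024; while (sz > b) b *= 2; cnt[b]++ }
     END { for (x in cnt) print x, cnt[x] }' /tmp/scan.list.sz | sort -n

The awk at the end just buckets sizes into powers of two; adding the age dimension to reproduce the full table above is a straightforward extension.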
Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Greg.Lehmann at csiro.au" > Reply-To: gpfsug main discussion list > Date: Wednesday, September 28, 2016 at 2:40 AM To: "gpfsug-discuss at spectrumscale.org" > Subject: [EXTERNAL] Re: [gpfsug-discuss] Blocksize I am wondering what people use to produce a file size distribution report for their filesystems. Has everyone rolled their own or is there some goto app to use. Cheers, Greg From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: Wednesday, 28 September 2016 7:21 AM To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Blocksize Hi All, Again, my thanks to all who responded to my last post. Let me begin by stating something I unintentionally omitted in my last post ? we already use SSDs for our metadata. Which leads me to yet another question ? of my three filesystems, two (/home and /scratch) are much older (created in 2010) and therefore currently have a 512 byte inode size. /data is newer and has a 4K inode size. Now if I combine /scratch and /data into one filesystem with a 4K inode size, the amount of space used by all the inodes coming over from /scratch is going to grow by a factor of eight unless I?m horribly confused. And I would assume I need to count the amount of space taken up by allocated inodes, not just used inodes. Therefore ? how much space my metadata takes up just grew significantly in importance since: 1) metadata is on very expensive enterprise class, vendor certified SSDs, 2) we use RAID 1 mirrors of those SSDs, and 3) we have metadata replication set to two. Some of the information presented by Sven and Yuri seems to contradict each other in regards to how much space inodes take up ? or I?m misunderstanding one or both of them! Leaving aside replication, if I use a 256K block size for my metadata and I use 4K inodes, are those inodes going to take up 4K each or are they going to take up 8K each (1/32nd of a 256K block)? By the way, I do have a file size / file age spreadsheet for each of my filesystems (which I would be willing to share with interested parties) and while I was not surprised to learn that I have over 10 million sub-1K files on /home, I was a bit surprised to find that I have almost as many sub-1K files on /scratch (and a few million more on /data). So there?s a huge potential win in having those files in the inode on SSD as opposed to on spinning disk, but there?s also a huge potential $$$ cost. Thanks again ? I hope others are gaining useful information from this thread. I sure am! Kevin On Sep 27, 2016, at 1:26 PM, Yuri L Volobuev > wrote: > 1) Let?s assume that our overarching goal in configuring the block > size for metadata is performance from the user perspective ? i.e. > how fast is an ?ls -l? on my directory? Space savings aren?t > important, and how long policy scans or other ?administrative? type > tasks take is not nearly as important as that directory listing. > Does that change the recommended metadata block size? The performance challenges for the "ls -l" scenario are quite different from the policy scan scenario, so the same rules do not necessarily apply. During "ls -l" the code has to read inodes one by one (there's some prefetching going on, to take the edge off for the actual 'ls' thread, but prefetching is still one inode at a time). 
> > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Wed Sep 28 15:34:05 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Wed, 28 Sep 2016 10:34:05 -0400 Subject: [gpfsug-discuss] Blocksize - file size distribution In-Reply-To: References: Message-ID: Consider using samples/ilm/mmfind (or mmapplypolicy with a LIST ... SHOW rule) to gather the stats much faster. Should be minutes, not hours. -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Wed Sep 28 16:23:12 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Wed, 28 Sep 2016 11:23:12 -0400 Subject: [gpfsug-discuss] Blocksize In-Reply-To: <4BDC0E5A-176A-48CC-8DE6-93C7B5A3F138@vanderbilt.edu> References: <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org><13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org><6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> <4BDC0E5A-176A-48CC-8DE6-93C7B5A3F138@vanderbilt.edu> Message-ID: OKAY, I'll say it again. inodes are PACKED into a single inode file. So a 4KB inode takes 4KB, REGARDLESS of metadata blocksize. There is no wasted space. (Of course if you have metadata replication = 2, then yes, double that. And yes, there overhead for indirect blocks (indices), allocation maps, etc, etc.) And your choice is not just 512 or 4096. 
Maybe 1KB or 2KB is a good choice for your data distribution, to optimize packing of data and/or directories into inodes... Hmmm... I don't know why the doc leaves out 2048, perhaps a typo... mmcrfs x2K -i 2048 [root at n2 charts]# mmlsfs x2K -i flag value description ------------------- ------------------------ ----------------------------------- -i 2048 Inode size in bytes Works for me! -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Wed Sep 28 16:33:29 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Wed, 28 Sep 2016 11:33:29 -0400 Subject: [gpfsug-discuss] Proposal for dynamic heat assisted file tiering In-Reply-To: References: Message-ID: Suppose, we could "dynamically" change the pool assignment of a file. How/when would you have us do that? When will that generate unnecessary, "wasteful" IOPs? How do we know if/when/how often you will access a file in the future? This is similar to other classical caching policies, but there the choice is usually just which pages to flush from the cache when we need space ... The usual compromise is "LRU" but maybe some systems allow hints. When there are multiple pools, it seems more complicated, more degrees of freedom ... Would you be willing and able to write some new policy rules to provide directions to Spectrum Scale for dynamic tiering? What would that look like? Would it be worth the time and effort over what we have now? -------------- next part -------------- An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Wed Sep 28 19:13:35 2016 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Wed, 28 Sep 2016 18:13:35 +0000 Subject: [gpfsug-discuss] Biggest file that will fit inside an inode? Message-ID: <0D55AB1D-DB9D-45CF-AB27-157CDA1172D9@nuance.com> What the largest file that will fit inside a 1K, 2K, or 4K inode? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid -------------- next part -------------- An HTML attachment was scrubbed... URL: From ewahl at osc.edu Wed Sep 28 21:18:55 2016 From: ewahl at osc.edu (Edward Wahl) Date: Wed, 28 Sep 2016 16:18:55 -0400 Subject: [gpfsug-discuss] Blocksize - file size distribution In-Reply-To: References: Message-ID: <20160928161855.1df32434@osc.edu> On Wed, 28 Sep 2016 10:34:05 -0400 Marc A Kaplan wrote: > Consider using samples/ilm/mmfind (or mmapplypolicy with a LIST ... > SHOW rule) to gather the stats much faster. Should be minutes, not > hours. > I'll agree with the policy engine. Runs like a beast if you tune it a little for nodes and threads. Only takes a couple of minutes to collect info on over a hundred million files. Show where the data is now by pool and sort it by age with queries? quick hack up example. you could sort the mess on the front end fairly quickly. (use fileset or pool, etc as your storage needs) RULE '2yrold_files' LIST '2yrold_filelist.txt' SHOW (varchar(file_size) || ' ' || varchar(USER_ID) || ' ' || varchar(POOL_NAME)) WHERE DAYS(CURRENT_TIMESTAMP) - DAYS(ACCESS_TIME) >= 730 AND DAYS(CURRENT_TIMESTAMP) - DAYS(ACCESS_TIME) < 1095 don't forget to run the engine with the -I defer for this kind of list/show policy. 
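On the dynamic-tiering question raised above, the closest thing available today is a pair of MIGRATE rules driven by FILE_HEAT and run periodically. A sketch only: the pool names 'fast' and 'capacity', the thresholds, and the heat-tracking values are placeholders, not recommendations:

# file heat accounting must be enabled before FILE_HEAT means anything (values here are guesses)
mmchconfig fileHeatPeriodMinutes=1440,fileHeatLossPercent=10

cat <<'EOF' > /tmp/heat-tier.pol
/* pull the hottest files up until the fast pool is about 90% full ... */
RULE 'promote_hot' MIGRATE FROM POOL 'capacity' WEIGHT(FILE_HEAT) TO POOL 'fast' LIMIT(90)
/* ... and push the least recently used back down once it crosses 90%, stopping at 75% */
RULE 'demote_cold' MIGRATE FROM POOL 'fast' THRESHOLD(90,75)
     WEIGHT(DAYS(CURRENT_TIMESTAMP) - DAYS(ACCESS_TIME)) TO POOL 'capacity'
EOF

# run from cron as often as the metadata scan cost allows, e.g. nightly
mmapplypolicy gpfs0 -P /tmp/heat-tier.pol -I yes

This is still batch repacking rather than true dynamic tiering, exactly the limitation Eric described earlier in the thread, but run frequently enough it gets reasonably close.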
Ed -- Ed Wahl Ohio Supercomputer Center 614-292-9302 From christof.schmitt at us.ibm.com Wed Sep 28 21:33:45 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Wed, 28 Sep 2016 13:33:45 -0700 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: <20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> References: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> <20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> Message-ID: The client has to reconnect, open the file again and reissue request that have not been completed. Without persistent handles, the main risk is that another client can step in and access the same file in the meantime. With persistent handles, access from other clients would be prevented for a defined amount of time. Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Lukas Hejtmanek To: gpfsug main discussion list Date: 09/27/2016 02:43 PM Subject: Re: [gpfsug-discuss] Samba via CES Sent by: gpfsug-discuss-bounces at spectrumscale.org On Tue, Sep 27, 2016 at 02:36:37PM -0700, Christof Schmitt wrote: > When a CES node fails, protocol clients have to reconnect to one of the > remaining nodes. > > Samba in CES does not support persistent handles. This is indicated in the > documentation: > > http://www.ibm.com/support/knowledgecenter/en/STXKQY_4.2.1/com.ibm.spectrum.scale.v4r21.doc/bl1adm_smbexportlimits.htm#bl1adm_smbexportlimits > > "Only mandatory SMB3 protocol features are supported. " well, but in this case, HA feature is a bit pointless as node fail results in a client failure as well as reconnect does not seem to be automatic if there is on going traffic.. more precisely reconnect is automatic but without persistent handles, the client receives write protect error immediately. -- Luk?? Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From bbanister at jumptrading.com Wed Sep 28 21:56:47 2016 From: bbanister at jumptrading.com (Bryan Banister) Date: Wed, 28 Sep 2016 20:56:47 +0000 Subject: [gpfsug-discuss] Biggest file that will fit inside an inode? In-Reply-To: <0D55AB1D-DB9D-45CF-AB27-157CDA1172D9@nuance.com> References: <0D55AB1D-DB9D-45CF-AB27-157CDA1172D9@nuance.com> Message-ID: <21BC488F0AEA2245B2C3E83FC0B33DBB0633CA80@CHI-EXCHANGEW1.w2k.jumptrading.com> I think the guideline for 4K inodes is roughly 3.5KB depending on use of extended attributes, -Bryan From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Oesterlin, Robert Sent: Wednesday, September 28, 2016 1:14 PM To: gpfsug main discussion list Subject: [gpfsug-discuss] Biggest file that will fit inside an inode? What the largest file that will fit inside a 1K, 2K, or 4K inode? Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid ________________________________ Note: This email is for the confidential use of the named addressee(s) only and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you are hereby notified that any review, dissemination or copying of this email is strictly prohibited, and to please notify the sender immediately and destroy this email and any attachments. Email transmission cannot be guaranteed to be secure or error-free. The Company, therefore, does not make any guarantees as to the completeness or accuracy of this email or any attachments. 
This email is for informational purposes only and does not constitute a recommendation, offer, request or solicitation of any kind to buy, sell, subscribe, redeem or perform any type of transaction of a financial product. -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhejtman at ics.muni.cz Wed Sep 28 23:03:36 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Thu, 29 Sep 2016 00:03:36 +0200 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: References: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> <20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> Message-ID: <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz> On Wed, Sep 28, 2016 at 01:33:45PM -0700, Christof Schmitt wrote: > The client has to reconnect, open the file again and reissue request that > have not been completed. Without persistent handles, the main risk is that > another client can step in and access the same file in the meantime. With > persistent handles, access from other clients would be prevented for a > defined amount of time. well, I guess I cannot reconfigure the client so that reissuing request is done by OS and not rised up to the user? E.g., if user runs video encoding directly to Samba share and encoding runs for several hours, reissuing request, i.e., restart encoding, is not exactly what user accepts. -- Luk?? Hejtm?nek From abeattie at au1.ibm.com Wed Sep 28 23:25:01 2016 From: abeattie at au1.ibm.com (Andrew Beattie) Date: Wed, 28 Sep 2016 22:25:01 +0000 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz> References: <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz>, <20160927193815.kpppiy76wpudg6cj@ics.muni.cz><20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> Message-ID: An HTML attachment was scrubbed... URL: From Greg.Lehmann at csiro.au Wed Sep 28 23:49:31 2016 From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au) Date: Wed, 28 Sep 2016 22:49:31 +0000 Subject: [gpfsug-discuss] Blocksize - file size distribution In-Reply-To: References: Message-ID: <2ed56fe8c9c34eb5a1da25800b2951e0@exch1-cdc.nexus.csiro.au> Kevin, Thanks for the offer of help. I am capable of writing my own, but it looks like the best approach is to use mmapplypolicy, something I had not thought of. This is precisely the reason I asked what looks like a silly question. You don?t know what you don?t know! The quality of content on this list has been exceptional of late! Cheers, Greg From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: Wednesday, 28 September 2016 11:45 PM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Blocksize - file size distribution Greg, Not saying this is the right way to go, but I rolled my own. I wrote a very simple Perl script that essentially does the Perl equivalent of a find on my GPFS filesystems, then stat?s files and directories and writes the output to a text file. I run that one overnight or on the weekends. Takes about 6 hours to run across our 3 GPFS filesystems with metadata on SSDs. Then I?ve written a couple of different Perl scripts to analyze that data in different ways (the idea being that collecting the data is ?expensive? ? but once you?ve got it it?s cheap to analyze it in different ways). But the one I?ve been using for this project just breaks down the number of files and directories by size and age and produces a table. 
Rather than try to describe this, here?s sample output: For input file: gpfsFileInfo_20160915.txt <1 day | <1 wk | <1 mo | <2 mo | <3 mo | <4 mo | <5 mo | <6 mo | <1 yr | >1 year | Total Files <1 KB 29538 111364 458260 634398 150199 305715 4388443 93733 966618 3499535 10637803 <2 KB 9875 20580 119414 295167 35961 67761 80462 33688 269595 851641 1784144 <4 KB 9212 45282 168678 496796 27771 23887 105135 23161 259242 1163327 2322491 <8 KB 4992 29284 105836 303349 28341 20346 246430 28394 216061 1148459 2131492 <16 KB 3987 18391 92492 218639 20513 19698 675097 30976 190691 851533 2122017 <32 KB 4844 12479 50235 265222 24830 18789 1058433 18030 196729 1066287 2715878 <64 KB 6358 24259 29474 222134 17493 10744 1381445 11358 240528 1123540 3067333 <128 KB 6531 59107 206269 186213 71823 114235 1008724 36722 186357 845921 2721902 <256 KB 1995 17638 19355 436611 8505 7554 3582738 7519 249510 744885 5076310 <512 KB 20645 12401 24700 111463 5659 22132 1121269 10774 273010 725155 2327208 <1 MB 2681 6482 37447 58459 6998 14945 305108 5857 160360 386152 984489 <4 MB 4554 84551 23320 100407 6818 32833 129758 22774 210935 528458 1144408 <1 GB 56652 33538 99667 87778 24313 68372 118928 42554 251528 916493 1699823 <10 GB 1245 2482 4524 3184 1043 1794 2733 1694 8731 20462 47892 <100 GB 47 230 470 265 92 198 172 122 1276 2061 4933 >100 GB 2 3 12 1 14 4 5 1 37 165 244 Total TB: 6.49 13.22 30.56 18.00 10.22 15.69 19.87 12.48 73.47 187.44 Grand Total: 387.46 TB Everything other than the total space lines at the bottom are counts of number of files meeting that criteria. I?ve got another variation on the same script that we used when we were trying to determine how many old files we have and therefore how much data was an excellent candidate for moving to a slower, less expensive ?capacity? pool. I?m not sure how useful my tools would be to others ? I?m certainly not a professional programmer by any stretch of the imagination (and yes, I do hear those of you who are saying, ?Yeah, he?s barely a professional SysAdmin!? ). But others of you have been so helpful to me ? I?d like to try in some small way to help someone else. Kevin On Sep 28, 2016, at 5:56 AM, Oesterlin, Robert > wrote: /usr/lpp/mmfs/samples/debugtools/filehist Look at the README in that directory. Bob Oesterlin Sr Storage Engineer, Nuance HPC Grid From: > on behalf of "Greg.Lehmann at csiro.au" > Reply-To: gpfsug main discussion list > Date: Wednesday, September 28, 2016 at 2:40 AM To: "gpfsug-discuss at spectrumscale.org" > Subject: [EXTERNAL] Re: [gpfsug-discuss] Blocksize I am wondering what people use to produce a file size distribution report for their filesystems. Has everyone rolled their own or is there some goto app to use. Cheers, Greg From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Buterbaugh, Kevin L Sent: Wednesday, 28 September 2016 7:21 AM To: gpfsug main discussion list > Subject: Re: [gpfsug-discuss] Blocksize Hi All, Again, my thanks to all who responded to my last post. Let me begin by stating something I unintentionally omitted in my last post ? we already use SSDs for our metadata. Which leads me to yet another question ? of my three filesystems, two (/home and /scratch) are much older (created in 2010) and therefore currently have a 512 byte inode size. /data is newer and has a 4K inode size. 
Now if I combine /scratch and /data into one filesystem with a 4K inode size, the amount of space used by all the inodes coming over from /scratch is going to grow by a factor of eight unless I?m horribly confused. And I would assume I need to count the amount of space taken up by allocated inodes, not just used inodes. Therefore ? how much space my metadata takes up just grew significantly in importance since: 1) metadata is on very expensive enterprise class, vendor certified SSDs, 2) we use RAID 1 mirrors of those SSDs, and 3) we have metadata replication set to two. Some of the information presented by Sven and Yuri seems to contradict each other in regards to how much space inodes take up ? or I?m misunderstanding one or both of them! Leaving aside replication, if I use a 256K block size for my metadata and I use 4K inodes, are those inodes going to take up 4K each or are they going to take up 8K each (1/32nd of a 256K block)? By the way, I do have a file size / file age spreadsheet for each of my filesystems (which I would be willing to share with interested parties) and while I was not surprised to learn that I have over 10 million sub-1K files on /home, I was a bit surprised to find that I have almost as many sub-1K files on /scratch (and a few million more on /data). So there?s a huge potential win in having those files in the inode on SSD as opposed to on spinning disk, but there?s also a huge potential $$$ cost. Thanks again ? I hope others are gaining useful information from this thread. I sure am! Kevin On Sep 27, 2016, at 1:26 PM, Yuri L Volobuev > wrote: > 1) Let?s assume that our overarching goal in configuring the block > size for metadata is performance from the user perspective ? i.e. > how fast is an ?ls -l? on my directory? Space savings aren?t > important, and how long policy scans or other ?administrative? type > tasks take is not nearly as important as that directory listing. > Does that change the recommended metadata block size? The performance challenges for the "ls -l" scenario are quite different from the policy scan scenario, so the same rules do not necessarily apply. During "ls -l" the code has to read inodes one by one (there's some prefetching going on, to take the edge off for the actual 'ls' thread, but prefetching is still one inode at a time). Metadata block size doesn't really come into the picture in this case, but inode size can be important -- depending on the storage performance characteristics. Does the storage you use for metadata exhibit a meaningfully different latency for 4K random reads vs 512 byte random reads? In my personal experience, on any modern storage device the difference is non-existent; in fact many devices (like all flash-based storage) use 4K native physical block size, and merely emulate 512 byte "sectors", so there's no way to read less than 4K. So from the inode read latency point of view 4K vs 512B is most likely a wash, but then 4K inodes can help improve performance of other operations, e.g. readdir of a small directory which fits entirely into the inode. If you use xattrs (e.g. as a side effect of using HSM), 4K inodes definitely help, but allowing xattrs to be stored in the inode. Policy scans reads inodes in full blocks, and there both metadata block size and inode size matter. Larger blocks could improve the inode read performance, while larger inodes mean that the same number of blocks hold fewer inodes and thus more blocks need to be read. 
So the policy inode scan performance is benefited by larger metadata block size and smaller inodes. However, policy scans also have to perform a directory traversal step, and that step tends to dominate the runtime of the overall run, and using larger inodes actually helps to speed up traversal of smaller directories. So whether larger inodes help or hurt the policy scan performance depends, yet again, on your file system composition. Overall, I believe that with all angles considered, larger inodes help with performance, and that was one of the considerations for making 4K inodes the default in V4.2+ versions. > 2) Let?s assume we have 3 filesystems, /home, /scratch (traditional > HPC use for those two) and /data (project space). Our storage > arrays are 24-bay units with two 8+2P RAID 6 LUNs, one RAID 1 > mirror, and two hot spare drives. The RAID 1 mirrors are for /home, > the RAID 6 LUNs are for /scratch or /data. /home has tons of small > files - so small that a 64K block size is currently used. /scratch > and /data have a mixture, but a 1 MB block size is the ?sweet spot? there. > > If you could ?start all over? with the same hardware being the only > restriction, would you: > > a) merge /scratch and /data into one filesystem but keep /home > separate since the LUN sizes are so very different, or > b) merge all three into one filesystem and use storage pools so that > /home is just a separate pool within the one filesystem? And if you > chose this option would you assign different block sizes to the pools? It's not possible to have different block sizes for different data pools. We are very aware that many people would like to be able to do just that, but this is counter to where the product is going. Supporting different block sizes for different pools is actually pretty hard: it's tricky to describe a large file that has some blocks in poolA and some in poolB where poolB has a different block size (perhaps during a migration) with the existing inode/indirect block format where each disk address pointer addresses a block of fixed size. With some effort, and some changes to how block addressing works, it would be possible to implement the support for this. However, as I mentioned in another post in this thread, we don't really want to glorify manual block size selection any further, we want to move away from it, by addressing the reasons that drive different block size selection today (like disk space utilization and performance). I'd recommend calculating a file size distribution histogram for your file systems. You may, for example, discover that 80% of the small files you have in /home would fit into 4K inodes, and then the storage space efficiency gains for the remaining 20% don't justify the complexity of managing an extra file system with a small block size. We don't recommend using block sizes smaller than 256K, because smaller block size is not good for disk space allocation code efficiency. It's a quadratic dependency: with a smaller block size, one block worth of the block allocation map covers that much less disk space, because each bit in the map covers fewer disk sectors, and fewer bits fit into a block. This means having to create a lot more block allocation map segments than what is needed for an ample level of parallelism. This hurts performance of many block allocation-related operations. I don't see a reason for /scratch and /data to be separate file systems, aside from perhaps failure domain considerations. 
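A rough way to get that small-file fraction is to reuse the mmfind -ls approach Luis posted earlier in this thread (in that output the file size is column 7; 3584 bytes is the usual rule of thumb for what still fits in a 4K inode once some room is left for extended attributes; the paths below are just placeholders, and mmfind is assumed to have been built as described in Luis's note):

  cd /usr/lpp/mmfs/samples/ilm
  ./mmfind /home -type f -ls > /tmp/home_ls.out
  awk '{ if ($7 <= 3584) small++; total++ } END { printf("%d of %d files (%.1f%%) would fit in a 4K inode\n", small, total, 100*small/total) }' /tmp/home_ls.out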
yuri > On Sep 26, 2016, at 2:29 PM, Yuri L Volobuev > wrote: > > I would put the net summary this way: in GPFS, the "Goldilocks zone" > for metadata block size is 256K - 1M. If one plans to create a new > file system using GPFS V4.2+, 1M is a sound choice. > > In an ideal world, block size choice shouldn't really be a choice. > It's a low-level implementation detail that one day should go the > way of the manual ignition timing adjustment -- something that used > to be necessary in the olden days, and something that select > enthusiasts like to tweak to this day, but something that's > irrelevant for the overwhelming majority of the folks who just want > the engine to run. There's work being done in that general direction > in GPFS, but we aren't there yet. > > yuri > > Stephen Ulmer ---09/26/2016 12:02:25 PM---Now I?ve got > anther question? which I?ll let bake for a while. Okay, to (poorly) summarize: > > From: Stephen Ulmer > > To: gpfsug main discussion list >, > Date: 09/26/2016 12:02 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > Now I?ve got anther question? which I?ll let bake for a while. > > Okay, to (poorly) summarize: > There are items OTHER THAN INODES stored as metadata in GPFS. > These items have a VARIETY OF SIZES, but are packed in such a way > that we should just not worry about wasted space unless we pick a > LARGE metadata block size ? or if we don?t pick a ?reasonable? > metadata block size after picking a ?large? file system block size > that applies to both. > Performance is hard, and the gain from calculating exactly the best > metadata block size is much smaller than performance gains attained > through code optimization. > If we were to try and calculate the appropriate metadata block size > we would likely be wrong anyway, since none of us get our data at > the idealized physics shop that sells massless rulers and > frictionless pulleys. > We should probably all use a metadata block size around 1MB. Nobody > has said this outright, but it?s been the example as the ?good? size > at least three times in this thread. > Under no circumstances should we do what many of us would have done > and pick 128K, which made sense based on all of our previous > education that is no longer applicable. > > Did I miss anything? :) > > Liberty, > > -- > Stephen > > On Sep 26, 2016, at 2:18 PM, Yuri L Volobuev > wrote: > It's important to understand the differences between different > metadata types, in particular where it comes to space allocation. > > System metadata files (inode file, inode and block allocation maps, > ACL file, fileset metadata file, EA file in older versions) are > allocated at well-defined moments (file system format, new storage > pool creation in the case of block allocation map, etc), and those > contain multiple records packed into a single block. From the block > allocator point of view, the individual metadata record size is > invisible, only larger blocks get actually allocated, and space > usage efficiency generally isn't an issue. > > For user metadata (indirect blocks, directory blocks, EA overflow > blocks) the situation is different. Those get allocated as the need > arises, generally one at a time. So the size of an individual > metadata structure matters, a lot. The smallest unit of allocation > in GPFS is a subblock (1/32nd of a block). If an IB or a directory > block is smaller than a subblock, the unused space in the subblock > is wasted. 
So if one chooses to use, say, 16 MiB block size for > metadata, the smallest unit of space that can be allocated is 512 > KiB. If one chooses 1 MiB block size, the smallest allocation unit > is 32 KiB. IBs are generally 16 KiB or 32 KiB in size (32 KiB with > any reasonable data block size); directory blocks used to be limited > to 32 KiB, but in the current code can be as large as 256 KiB. As > one can observe, using 16 MiB metadata block size would lead to a > considerable amount of wasted space for IBs and large directories > (small directories can live in inodes). On the other hand, with 1 > MiB block size, there'll be no wasted metadata space. Does any of > this actually make a practical difference? That depends on the file > system composition, namely the number of IBs (which is a function of > the number of large files) and larger directories. Calculating this > scientifically can be pretty involved, and really should be the job > of a tool that ought to exist, but doesn't (yet). A more practical > approach is doing a ballpark estimate using local file counts and > typical fractions of large files and directories, using statistics > available from published papers. > > The performance implications of a given metadata block size choice > is a subject of nearly infinite depth, and this question ultimately > can only be answered by doing experiments with a specific workload > on specific hardware. The metadata space utilization efficiency is > something that can be answered conclusively though. > > yuri > > "Buterbaugh, Kevin L" ---09/24/2016 07:19:09 AM---Hi > Sven, I am confused by your statement that the metadata block size > should be 1 MB and am very int > > From: "Buterbaugh, Kevin L" > > To: gpfsug main discussion list >, > Date: 09/24/2016 07:19 AM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Hi Sven, > > I am confused by your statement that the metadata block size should > be 1 MB and am very interested in learning the rationale behind this > as I am currently looking at all aspects of our current GPFS > configuration and the possibility of making major changes. > > If you have a filesystem with only metadataOnly disks in the system > pool and the default size of an inode is 4K (which we would do, > since we have recently discovered that even on our scratch > filesystem we have a bazillion files that are 4K or smaller and > could therefore have their data stored in the inode, right?), then > why would you set the metadata block size to anything larger than > 128K when a sub-block is 1/32nd of a block? I.e., with a 1 MB block > size for metadata wouldn?t you be wasting a massive amount of space? > > What am I missing / confused about there? > > Oh, and here?s a related question ? let?s just say I have the above > configuration ? my system pool is metadata only and is on SSD?s. > Then I have two other dataOnly pools that are spinning disk. One is > for ?regular? access and the other is the ?capacity? pool ? i.e. a > pool of slower storage where we move files with large access times. > I have a policy that says something like ?move all files with an > access time > 6 months to the capacity pool.? Of those bazillion > files less than 4K in size that are fitting in the inode currently, > probably half a bazillion () of them would be subject to that > rule. Will they get moved to the spinning disk capacity pool or will > they stay in the inode?? > > Thanks! 
This is a very timely and interesting discussion for me as well... > > Kevin > On Sep 23, 2016, at 4:35 PM, Sven Oehme > wrote: > your metadata block size these days should be 1 MB and there are > only very few workloads for which you should run with a filesystem > blocksize below 1 MB. so if you don't know exactly what to pick, 1 > MB is a good starting point. > the general rule still applies that your filesystem blocksize > (metadata or data pool) should match your raid controller (or GNR > vdisk) stripe size of the particular pool. > > so if you use a 128k strip size(defaut in many midrange storage > controllers) in a 8+2p raid array, your stripe or track size is 1 MB > and therefore the blocksize of this pool should be 1 MB. i see many > customers in the field using 1MB or even smaller blocksize on RAID > stripes of 2 MB or above and your performance will be significant > impacted by that. > > Sven > > ------------------------------------------ > Sven Oehme > Scalable Storage Research > email: oehmes at us.ibm.com > Phone: +1 (408) 824-8904 > IBM Almaden Research Lab > ------------------------------------------ > > Stephen Ulmer ---09/23/2016 12:16:34 PM---Not to be too > pedantic, but I believe the the subblock size is 1/32 of the block > size (which strengt > > From: Stephen Ulmer > > To: gpfsug main discussion list > > Date: 09/23/2016 12:16 PM > Subject: Re: [gpfsug-discuss] Blocksize > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > > > > Not to be too pedantic, but I believe the the subblock size is 1/32 > of the block size (which strengthens Luis?s arguments below). > > I thought the the original question was NOT about inode size, but > about metadata block size. You can specify that the system pool have > a different block size from the rest of the filesystem, providing > that it ONLY holds metadata (?metadata-block-size option to mmcrfs). > > So with 4K inodes (which should be used for all new filesystems > without some counter-indication), I would think that we?d want to > use a metadata block size of 4K*32=128K. This is independent of the > regular block size, which you can calculate based on the workload if > you?re lucky. > > There could be a great reason NOT to use 128K metadata block size, > but I don?t know what it is. I?d be happy to be corrected about this > if it?s out of whack. > > -- > Stephen > On Sep 22, 2016, at 3:37 PM, Luis Bolinches > wrote: > > Hi > > My 2 cents. > > Leave at least 4K inodes, then you get massive improvement on small > files (less 3.5K minus whatever you use on xattr) > > About blocksize for data, unless you have actual data that suggest > that you will actually benefit from smaller than 1MB block, leave > there. GPFS uses sublocks where 1/16th of the BS can be allocated to > different files, so the "waste" is much less than you think on 1MB > and you get the throughput and less structures of much more data blocks. > > No warranty at all but I try to do this when the BS talk comes in: > (might need some clean up it could not be last note but you get the idea) > > POSIX > find . 
-type f -name '*' -exec ls -l {} \; > find_ls_files.out > GPFS > cd /usr/lpp/mmfs/samples/ilm > gcc mmfindUtil_processOutputFile.c -o mmfindUtil_processOutputFile > ./mmfind /gpfs/shared -ls -type f > find_ls_files.out > CONVERT to CSV > > POSIX > cat find_ls_files.out | awk '{print $5","}' > find_ls_files.out.csv > GPFS > cat find_ls_files.out | awk '{print $7","}' > find_ls_files.out.csv > LOAD in octave > > FILESIZE = int32 (dlmread ("find_ls_files.out.csv", ",")); > Clean the second column (OPTIONAL as the next clean up will do the same) > > FILESIZE(:,[2]) = []; > If we are on 4K aligment we need to clean the files that go to > inodes (WELL not exactly ... extended attributes! so maybe use a > lower number!) > > FILESIZE(FILESIZE<=3584) =[]; > If we are not we need to clean the 0 size files > > FILESIZE(FILESIZE==0) =[]; > Median > > FILESIZEMEDIAN = int32 (median (FILESIZE)) > Mean > > FILESIZEMEAN = int32 (mean (FILESIZE)) > Variance > > int32 (var (FILESIZE)) > iqr interquartile range, i.e., the difference between the upper and > lower quartile, of the input data. > > int32 (iqr (FILESIZE)) > Standard deviation > > > For some FS with lots of files you might need a rather powerful > machine to run the calculations on octave, I never hit anything > could not manage on a 64GB RAM Power box. Most of the times it is > enough with my laptop. > > > > -- > Yst?v?llisin terveisin / Kind regards / Saludos cordiales / Salutations > > Luis Bolinches > Lab Services > http://www-03.ibm.com/systems/services/labservices/ > > IBM Laajalahdentie 23 (main Entrance) Helsinki, 00330 Finland > Phone: +358 503112585 > > "If you continually give you will continually have." Anonymous > > > ----- Original message ----- > From: Stef Coene > > Sent by: gpfsug-discuss-bounces at spectrumscale.org > To: gpfsug main discussion list > > Cc: > Subject: Re: [gpfsug-discuss] Blocksize > Date: Thu, Sep 22, 2016 10:30 PM > > On 09/22/2016 09:07 PM, J. Eric Wonderley wrote: > > It defaults to 4k: > > mmlsfs testbs8M -i > > flag value description > > ------------------- ------------------------ > > ----------------------------------- > > -i 4096 Inode size in bytes > > > > I think you can make as small as 512b. Gpfs will store very small > > files in the inode. > > > > Typically you want your average file size to be your blocksize and your > > filesystem has one blocksize and one inodesize. > > The files are not small, but around 20 MB on average. > So I calculated with IBM that a 1 MB or 2 MB block size is best. > > But I'm not sure if it's better to use a smaller block size for the > metadata. > > The file system is not that large (400 TB) and will hold backup data > from CommVault. > > > Stef > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > Ellei edell? 
ole toisin mainittu: / Unless stated otherwise above: > Oy IBM Finland Ab > PL 265, 00101 Helsinki, Finland > Business ID, Y-tunnus: 0195876-3 > Registered in Finland

_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss

From Greg.Lehmann at csiro.au Wed Sep 28 23:54:36 2016
From: Greg.Lehmann at csiro.au (Greg.Lehmann at csiro.au)
Date: Wed, 28 Sep 2016 22:54:36 +0000
Subject: [gpfsug-discuss] Blocksize
In-Reply-To: References: <17781503-26B3-4448-B7B9-1EE27ABE6D1F@ulmer.org><13272A1B-425D-4DDD-A931-490604F92D61@ulmer.org><6C51B04A-9097-4598-8B4A-C484A3D98EE2@vanderbilt.edu> <4BDC0E5A-176A-48CC-8DE6-93C7B5A3F138@vanderbilt.edu>
Message-ID:

Are there any presentations available online that provide diagrams of the directory/file creation process and modifications, in terms of how the blocks/inodes and indirect blocks etc. are used? I would guess there are a few different cases that would need to be shown. This is the sort of thing that would be great in a decent text book on GPFS (which doesn't exist as far as I am aware).

Cheers, Greg

From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Marc A Kaplan Sent: Thursday, 29 September 2016 1:23 AM To: gpfsug main discussion list Subject: Re: [gpfsug-discuss] Blocksize

OKAY, I'll say it again. inodes are PACKED into a single inode file. So a 4KB inode takes 4KB, REGARDLESS of metadata blocksize. There is no wasted space. (Of course if you have metadata replication = 2, then yes, double that. And yes, there overhead for indirect blocks (indices), allocation maps, etc, etc.) And your choice is not just 512 or 4096. Maybe 1KB or 2KB is a good choice for your data distribution, to optimize packing of data and/or directories into inodes... Hmmm... I don't know why the doc leaves out 2048, perhaps a typo...
mmcrfs x2K -i 2048 [root at n2 charts]# mmlsfs x2K -i flag value description ------------------- ------------------------ ----------------------------------- -i 2048 Inode size in bytes Works for me! -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhejtman at ics.muni.cz Wed Sep 28 23:58:15 2016 From: xhejtman at ics.muni.cz (Lukas Hejtmanek) Date: Thu, 29 Sep 2016 00:58:15 +0200 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: References: <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz> <20160927193815.kpppiy76wpudg6cj@ics.muni.cz> <20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> Message-ID: <20160928225815.rpzdzjevro37ur7b@ics.muni.cz> On Wed, Sep 28, 2016 at 10:25:01PM +0000, Andrew Beattie wrote: > In that scenario, would you not be better off using a native Spectrum > Scale client installed on the workstation that the video editor is using > with a local mapped drive, rather than a SMB share? > ? > This would prevent this the scenario you have proposed occurring. indeed, it would be better, but why one would have CES at all? I would like to use CES but it seems that it is not quite ready yet for such a scenario. -- Luk?? Hejtm?nek From christof.schmitt at us.ibm.com Thu Sep 29 00:06:59 2016 From: christof.schmitt at us.ibm.com (Christof Schmitt) Date: Wed, 28 Sep 2016 16:06:59 -0700 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz> References: <20160927193815.kpppiy76wpudg6cj@ics.muni.cz><20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz> Message-ID: The exact behavior depends on the client and the application. I would suggest explicit testing of the protocol failover if that is a concern. Samba does not support persistent handles, so that would be a completely new feature. There is some support available for durable handles which have weaker guarantees, and which are also disabled in CES Samba due to known issues in large deployments. In cases where SMB protocol failover becomes an issue and durable handles might help, that might be an approach to improve the failover behavior. Christof Schmitt || IBM || Spectrum Scale Development || Tucson, AZ christof.schmitt at us.ibm.com || +1-520-799-2469 (T/L: 321-2469) From: Lukas Hejtmanek To: gpfsug main discussion list Date: 09/28/2016 03:04 PM Subject: Re: [gpfsug-discuss] Samba via CES Sent by: gpfsug-discuss-bounces at spectrumscale.org On Wed, Sep 28, 2016 at 01:33:45PM -0700, Christof Schmitt wrote: > The client has to reconnect, open the file again and reissue request that > have not been completed. Without persistent handles, the main risk is that > another client can step in and access the same file in the meantime. With > persistent handles, access from other clients would be prevented for a > defined amount of time. well, I guess I cannot reconfigure the client so that reissuing request is done by OS and not rised up to the user? E.g., if user runs video encoding directly to Samba share and encoding runs for several hours, reissuing request, i.e., restart encoding, is not exactly what user accepts. -- Luk?? 
Hejtm?nek _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss From abeattie at au1.ibm.com Thu Sep 29 00:37:25 2016 From: abeattie at au1.ibm.com (Andrew Beattie) Date: Wed, 28 Sep 2016 23:37:25 +0000 Subject: [gpfsug-discuss] Samba via CES In-Reply-To: <20160928225815.rpzdzjevro37ur7b@ics.muni.cz> References: <20160928225815.rpzdzjevro37ur7b@ics.muni.cz>, <20160928220336.d5vdwp7fejbj2bzf@ics.muni.cz><20160927193815.kpppiy76wpudg6cj@ics.muni.cz><20160927214257.z4ezmssnpwhmm4rk@ics.muni.cz> Message-ID: An HTML attachment was scrubbed... URL: From aaron.knister at gmail.com Thu Sep 29 02:43:52 2016 From: aaron.knister at gmail.com (Aaron Knister) Date: Wed, 28 Sep 2016 21:43:52 -0400 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: <96282850-6bfa-73ae-8502-9e8df3a56390@nasa.gov> Message-ID: Thanks Everyone for your replies! (Quick disclaimer, these opinions are my own, and not those of my employer or NASA). Not knowing what's coming at the NDA session, it seems to boil down to "it ain't gonna happen" because of: - Perceived difficulty in supporting whatever creative hardware solutions customers may throw at it. I understand the support concerns, but I naively thought that assuming the hardware meets a basic set of requirements (e.g. redundant sas paths, x type of drives) it would be fairly supportable with GNR. The DS3700 shelves are re-branded NetApp E-series shelves and pretty vanilla I thought. - IBM would like to monetize the product and compete with the likes of DDN/Seagate This is admittedly a little disappointing. GPFS as long as I've known it has been largely hardware vendor agnostic. To see even a slight shift towards hardware vendor lockin and certain features only being supported and available on IBM hardware is concerning. It's not like the software itself is free. Perhaps GNR could be a paid add-on license for non-IBM hardware? Just thinking out-loud. The big things I was looking to GNR for are: - end-to-end checksums - implementing a software RAID layer on (in my case enterprise class) JBODs I can find a way to do the second thing, but the former I cannot. Requiring IBM hardware to get end-to-end checksums is a huge red flag for me. That's something Lustre will do today with ZFS on any hardware ZFS will run on (and for free, I might add). I would think GNR being openly available to customers would be important for GPFS to compete with Lustre. Furthermore, I had opened an RFE (#84523) a while back to implement checksumming of data for non-GNR environments. The RFE was declined because essentially it would be too hard and it already exists for GNR. Well, considering I don't have a GNR environment, and hardware vendor lock in is something many sites are not interested in, that's somewhat of a problem. I really hope IBM reconsiders their stance on opening up GNR. The current direction, while somewhat understandable, leaves a really bad taste in my mouth and is one of the (very few, in my opinion) features Lustre has over GPFS. -Aaron On 9/1/16 9:59 AM, Marc A Kaplan wrote: > I've been told that it is a big leap to go from supporting GSS and ESS > to allowing and supporting native raid for customers who may throw > together "any" combination of hardware they might choose. > > In particular the GNR "disk hospital" functions... 
> https://www.ibm.com/support/knowledgecenter/SSFKCN_3.5.0/com.ibm.cluster.gpfs.v3r5.gpfs200.doc/bl1adv_introdiskhospital.htm > will be tricky to support on umpteen different vendor boxes -- and keep > in mind, those will be from IBM competitors! > > That said, ESS and GSS show that IBM has some good tech in this area and > IBM has shown with the Spectrum Scale product (sans GNR) it can support > just about any semi-reasonable hardware configuration and a good slew of > OS versions and architectures... Heck I have a demo/test version of GPFS > running on a 5 year old Thinkpad laptop.... And we have some GSSs in the > lab... Not to mention Power hardware and mainframe System Z (think 360, > 370, 290, Z) > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > From oehmes at us.ibm.com Thu Sep 29 03:28:03 2016 From: oehmes at us.ibm.com (Sven Oehme) Date: Wed, 28 Sep 2016 19:28:03 -0700 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: <96282850-6bfa-73ae-8502-9e8df3a56390@nasa.gov> Message-ID: Hi Aaron, the best way to express this 'need' is to vote and leave comments in the RFE's : this is an RFE for GNR as SW : http://www.ibm.com/developerworks/rfe/execute?use_case=viewRfe&CR_ID=95090 everybody who wants this to be one should vote for it and leave comments on what they expect. Sven From: Aaron Knister To: gpfsug-discuss at spectrumscale.org Date: 09/28/2016 06:44 PM Subject: Re: [gpfsug-discuss] gpfs native raid Sent by: gpfsug-discuss-bounces at spectrumscale.org Thanks Everyone for your replies! (Quick disclaimer, these opinions are my own, and not those of my employer or NASA). Not knowing what's coming at the NDA session, it seems to boil down to "it ain't gonna happen" because of: - Perceived difficulty in supporting whatever creative hardware solutions customers may throw at it. I understand the support concerns, but I naively thought that assuming the hardware meets a basic set of requirements (e.g. redundant sas paths, x type of drives) it would be fairly supportable with GNR. The DS3700 shelves are re-branded NetApp E-series shelves and pretty vanilla I thought. - IBM would like to monetize the product and compete with the likes of DDN/Seagate This is admittedly a little disappointing. GPFS as long as I've known it has been largely hardware vendor agnostic. To see even a slight shift towards hardware vendor lockin and certain features only being supported and available on IBM hardware is concerning. It's not like the software itself is free. Perhaps GNR could be a paid add-on license for non-IBM hardware? Just thinking out-loud. The big things I was looking to GNR for are: - end-to-end checksums - implementing a software RAID layer on (in my case enterprise class) JBODs I can find a way to do the second thing, but the former I cannot. Requiring IBM hardware to get end-to-end checksums is a huge red flag for me. That's something Lustre will do today with ZFS on any hardware ZFS will run on (and for free, I might add). I would think GNR being openly available to customers would be important for GPFS to compete with Lustre. Furthermore, I had opened an RFE (#84523) a while back to implement checksumming of data for non-GNR environments. The RFE was declined because essentially it would be too hard and it already exists for GNR. 
Well, considering I don't have a GNR environment, and hardware vendor lock in is something many sites are not interested in, that's somewhat of a problem. I really hope IBM reconsiders their stance on opening up GNR. The current direction, while somewhat understandable, leaves a really bad taste in my mouth and is one of the (very few, in my opinion) features Lustre has over GPFS. -Aaron On 9/1/16 9:59 AM, Marc A Kaplan wrote: > I've been told that it is a big leap to go from supporting GSS and ESS > to allowing and supporting native raid for customers who may throw > together "any" combination of hardware they might choose. > > In particular the GNR "disk hospital" functions... > https://www.ibm.com/support/knowledgecenter/SSFKCN_3.5.0/com.ibm.cluster.gpfs.v3r5.gpfs200.doc/bl1adv_introdiskhospital.htm > will be tricky to support on umpteen different vendor boxes -- and keep > in mind, those will be from IBM competitors! > > That said, ESS and GSS show that IBM has some good tech in this area and > IBM has shown with the Spectrum Scale product (sans GNR) it can support > just about any semi-reasonable hardware configuration and a good slew of > OS versions and architectures... Heck I have a demo/test version of GPFS > running on a 5 year old Thinkpad laptop.... And we have some GSSs in the > lab... Not to mention Power hardware and mainframe System Z (think 360, > 370, 290, Z) > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From daniel.kidger at uk.ibm.com Thu Sep 29 10:04:03 2016 From: daniel.kidger at uk.ibm.com (Daniel Kidger) Date: Thu, 29 Sep 2016 09:04:03 +0000 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: Message-ID: An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ATT1-graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From daniel.kidger at uk.ibm.com Thu Sep 29 10:25:59 2016 From: daniel.kidger at uk.ibm.com (Daniel Kidger) Date: Thu, 29 Sep 2016 09:25:59 +0000 Subject: [gpfsug-discuss] AFM cacheset mounting from the same GPFS cluster ? Message-ID: An HTML attachment was scrubbed... URL: From Kevin.Buterbaugh at Vanderbilt.Edu Thu Sep 29 16:03:08 2016 From: Kevin.Buterbaugh at Vanderbilt.Edu (Buterbaugh, Kevin L) Date: Thu, 29 Sep 2016 15:03:08 +0000 Subject: [gpfsug-discuss] Fwd: Blocksize References: Message-ID: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> Resending from the right e-mail address... Begin forwarded message: From: gpfsug-discuss-owner at spectrumscale.org Subject: Re: [gpfsug-discuss] Blocksize Date: September 29, 2016 at 10:00:36 AM CDT To: klb at accre.vanderbilt.edu You are not allowed to post to this mailing list, and your message has been automatically rejected. If you think that your messages are being rejected in error, contact the mailing list owner at gpfsug-discuss-owner at spectrumscale.org. From: "Kevin L. 
Buterbaugh" > Subject: Re: [gpfsug-discuss] Blocksize Date: September 29, 2016 at 10:00:29 AM CDT To: gpfsug main discussion list > Hi Marc and others, I understand ? I guess I did a poor job of wording my question, so I?ll try again. The IBM recommendation for metadata block size seems to be somewhere between 256K - 1 MB, depending on who responds to the question. If I were to hypothetically use a 256K metadata block size, does the ?1/32nd of a block? come into play like it does for ?not metadata?? I.e. 256 / 32 = 8K, so am I reading / writing *2* inodes (assuming 4K inode size) minimum? And here?s a really off the wall question ? yesterday we were discussing the fact that there is now a single inode file. Historically, we have always used RAID 1 mirrors (first with spinning disk, as of last fall now on SSD) for metadata and then use GPFS replication on top of that. But given that there is a single inode file is that ?old way? of doing things still the right way? In other words, could we potentially be better off by using a couple of 8+2P RAID 6 LUNs? One potential downside of that would be that we would then only have two NSD servers serving up metadata, so we discussed the idea of taking each RAID 6 LUN and splitting it up into multiple logical volumes (all that done on the storage array, of course) and then presenting those to GPFS as NSDs??? Or have I gone from merely asking stupid questions to Trump-level craziness???? ;-) Kevin On Sep 28, 2016, at 10:23 AM, Marc A Kaplan > wrote: OKAY, I'll say it again. inodes are PACKED into a single inode file. So a 4KB inode takes 4KB, REGARDLESS of metadata blocksize. There is no wasted space. (Of course if you have metadata replication = 2, then yes, double that. And yes, there overhead for indirect blocks (indices), allocation maps, etc, etc.) And your choice is not just 512 or 4096. Maybe 1KB or 2KB is a good choice for your data distribution, to optimize packing of data and/or directories into inodes... Hmmm... I don't know why the doc leaves out 2048, perhaps a typo... mmcrfs x2K -i 2048 [root at n2 charts]# mmlsfs x2K -i flag value description ------------------- ------------------------ ----------------------------------- -i 2048 Inode size in bytes Works for me! _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From makaplan at us.ibm.com Thu Sep 29 16:32:47 2016 From: makaplan at us.ibm.com (Marc A Kaplan) Date: Thu, 29 Sep 2016 11:32:47 -0400 Subject: [gpfsug-discuss] Fwd: Blocksize In-Reply-To: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> References: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> Message-ID: Frankly, I just don't "get" what it is you seem not to be "getting" - perhaps someone else who does "get" it can rephrase: FORGET about Subblocks when thinking about inodes being packed into the file of all inodes. Additional facts that may address some of the other concerns: I started working on GPFS at version 3.1 or so. AFAIK GPFS always had and has one file of inodes, "packed", with no wasted space between inodes. Period. Full Stop. RAID! Now we come to a mistake that I've seen made by more than a handful of customers! It is generally a mistake to use RAID with parity (such as classic RAID5) to store metadata. Why? 
Because metadata is often updated with "small writes" - for example suppose we have to update some fields in an inode, or an indirect block, or append a log record... For RAID with parity and large stripe sizes -- this means that updating just one disk sector can cost a full stripe read + writing the changed data and parity sectors. SO, if you want protection against storage failures for your metadata, use either RAID mirroring/replication and/or GPFS metadata replication. (belt and/or suspenders) (Arguments against relying solely on RAID mirroring: single enclosure/box failure (fire!), single hardware design (bugs or defects), single firmware/microcode(bugs.)) Yes, GPFS is part of "the cyber." We're making it stronger everyday. But it already is great. --marc From: "Buterbaugh, Kevin L" To: gpfsug main discussion list Date: 09/29/2016 11:03 AM Subject: [gpfsug-discuss] Fwd: Blocksize Sent by: gpfsug-discuss-bounces at spectrumscale.org Resending from the right e-mail address... Begin forwarded message: From: gpfsug-discuss-owner at spectrumscale.org Subject: Re: [gpfsug-discuss] Blocksize Date: September 29, 2016 at 10:00:36 AM CDT To: klb at accre.vanderbilt.edu You are not allowed to post to this mailing list, and your message has been automatically rejected. If you think that your messages are being rejected in error, contact the mailing list owner at gpfsug-discuss-owner at spectrumscale.org. From: "Kevin L. Buterbaugh" Subject: Re: [gpfsug-discuss] Blocksize Date: September 29, 2016 at 10:00:29 AM CDT To: gpfsug main discussion list Hi Marc and others, I understand ? I guess I did a poor job of wording my question, so I?ll try again. The IBM recommendation for metadata block size seems to be somewhere between 256K - 1 MB, depending on who responds to the question. If I were to hypothetically use a 256K metadata block size, does the ?1/32nd of a block? come into play like it does for ?not metadata?? I.e. 256 / 32 = 8K, so am I reading / writing *2* inodes (assuming 4K inode size) minimum? And here?s a really off the wall question ? yesterday we were discussing the fact that there is now a single inode file. Historically, we have always used RAID 1 mirrors (first with spinning disk, as of last fall now on SSD) for metadata and then use GPFS replication on top of that. But given that there is a single inode file is that ?old way? of doing things still the right way? In other words, could we potentially be better off by using a couple of 8+2P RAID 6 LUNs? One potential downside of that would be that we would then only have two NSD servers serving up metadata, so we discussed the idea of taking each RAID 6 LUN and splitting it up into multiple logical volumes (all that done on the storage array, of course) and then presenting those to GPFS as NSDs??? Or have I gone from merely asking stupid questions to Trump-level craziness???? ;-) Kevin On Sep 28, 2016, at 10:23 AM, Marc A Kaplan wrote: OKAY, I'll say it again. inodes are PACKED into a single inode file. So a 4KB inode takes 4KB, REGARDLESS of metadata blocksize. There is no wasted space. (Of course if you have metadata replication = 2, then yes, double that. And yes, there overhead for indirect blocks (indices), allocation maps, etc, etc.) And your choice is not just 512 or 4096. Maybe 1KB or 2KB is a good choice for your data distribution, to optimize packing of data and/or directories into inodes... Hmmm... I don't know why the doc leaves out 2048, perhaps a typo... 
mmcrfs x2K -i 2048 [root at n2 charts]# mmlsfs x2K -i flag value description ------------------- ------------------------ ----------------------------------- -i 2048 Inode size in bytes Works for me! _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From olaf.weiser at de.ibm.com Thu Sep 29 16:38:56 2016 From: olaf.weiser at de.ibm.com (Olaf Weiser) Date: Thu, 29 Sep 2016 17:38:56 +0200 Subject: [gpfsug-discuss] Fwd: Blocksize In-Reply-To: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> References: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> Message-ID: An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Thu Sep 29 19:00:40 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Thu, 29 Sep 2016 11:00:40 -0700 Subject: [gpfsug-discuss] Fwd: Blocksize In-Reply-To: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> References: <423B687F-0B03-4C6F-9F16-E05F68491D67@vanderbilt.edu> Message-ID: > to the question. If I were to hypothetically use a 256K metadata > block size, does the ?1/32nd of a block? come into play like it does > for ?not metadata?? I.e. 256 / 32 = 8K, so am I reading / writing > *2* inodes (assuming 4K inode size) minimum? I think the point of confusion here is minimum allocation size vs minimum IO size -- those two are not one and the same. In fact in GPFS those are largely unrelated values. For low-level metadata files where multiple records are packed into the same block, it is possible to read/write either an individual record (such as an inode), or an entire block of records (which is what happens, for example, during inode copy-on-write). The minimum IO size in GPFS is 512 bytes. On a "4K-aligned" file system, GPFS vows to only do IOs in multiples of 4KiB. For data, GPFS tracks what portion of a given block is valid/dirty using an in-memory bitmap, and if 4K in the middle of a 16M block are modified, only 4K get written, not 16M (although this is more complicated for sparse file writes and appends, when some areas need to be zeroed out). For metadata writes, entire metadata objects are written, using the actual object size, rounded up to the nearest 512B or 4K boundary, as needed. So a single modified inode results in a single inode write, regardless of the metadata block size. If you have snapshots, and the inode being modified needs to be copied to the previous snapshot, and happens to be the first inode in the block that needs a COW, an entire block of inodes is copied to the latest snapshot, as an optimization. > And here?s a really off the wall question ? yesterday we were > discussing the fact that there is now a single inode file. > Historically, we have always used RAID 1 mirrors (first with > spinning disk, as of last fall now on SSD) for metadata and then use > GPFS replication on top of that. But given that there is a single > inode file is that ?old way? of doing things still the right way? > In other words, could we potentially be better off by using a couple > of 8+2P RAID 6 LUNs? The old way is also the modern way in this case. Using RAID1 LUNs for GPFS metadata is still the right approach. 
You don't want to use RAID erasure codes that trigger read-modify-write for small IOs, which are typical for metadata (unless your RAID array has so much cache as to make RMW a moot point). > One potential downside of that would be that we would then only have > two NSD servers serving up metadata, so we discussed the idea of > taking each RAID 6 LUN and splitting it up into multiple logical > volumes (all that done on the storage array, of course) and then > presenting those to GPFS as NSDs??? Like most performance questions, this one can ultimately only be answered definitively by running tests, but offhand I would suspect that the performance impact of RAID6, combined with extra contention for physical disks, is going to more than offset the benefits of using more NSD servers. Keep in mind that you aren't limited to 2 NSD servers per LUN. If you actually have the connectivity for more than 2 nodes on your RAID controller, GPFS allows up to 8 simultaneously active NSD servers per NSD. yuri > On Sep 28, 2016, at 10:23 AM, Marc A Kaplan wrote: > > OKAY, I'll say it again. inodes are PACKED into a single inode > file. So a 4KB inode takes 4KB, REGARDLESS of metadata blocksize. > There is no wasted space. > > (Of course if you have metadata replication = 2, then yes, double > that. And yes, there overhead for indirect blocks (indices), > allocation maps, etc, etc.) > > And your choice is not just 512 or 4096. Maybe 1KB or 2KB is a good > choice for your data distribution, to optimize packing of data and/ > or directories into inodes... > > Hmmm... I don't know why the doc leaves out 2048, perhaps a typo... > > mmcrfs x2K -i 2048 > > [root at n2 charts]# mmlsfs x2K -i > flag value description > ------------------- ------------------------ > ----------------------------------- > -i 2048 Inode size in bytes > > Works for me! > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From volobuev at us.ibm.com Fri Sep 30 06:43:53 2016 From: volobuev at us.ibm.com (Yuri L Volobuev) Date: Thu, 29 Sep 2016 22:43:53 -0700 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: <96282850-6bfa-73ae-8502-9e8df3a56390@nasa.gov> Message-ID: The issue of "GNR as software" is a pretty convoluted mixture of technical, business, and resource constraints issues. While some of the technical issues can be discussed here, obviously the other considerations cannot be discussed in a public forum. So you won't be able to get a complete understanding of the situation by discussing it here. > I understand the support concerns, but I naively thought that assuming > the hardware meets a basic set of requirements (e.g. redundant sas > paths, x type of drives) it would be fairly supportable with GNR. The > DS3700 shelves are re-branded NetApp E-series shelves and pretty vanilla > I thought. Setting business issues aside, this is more complicated on the technical level than one may think. At present, GNR requires a set of twin-tailed external disk enclosures. This is not a particularly exotic kind of hardware, but it turns out that this corner of the storage world is quite insular. 
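For anyone who has not worked with this class of hardware, "twin-tailed" just means each enclosure (and therefore each drive) is SAS-attached to two server nodes, so either node can reach every disk. A rough way to see what that looks like from the host side on Linux is sketched below; the device names are made up, sg_ses comes from the sg3_utils package, and the exact output varies by distribution and enclosure:

lsscsi -g                 # list SCSI disks plus enclosure (SES) devices and their /dev/sg nodes
ls /sys/class/enclosure/  # enclosures the kernel's ses module has bound
multipath -ll             # with dm-multipath, each external LUN should show two live paths
sg_ses /dev/sg24          # query the enclosure's SES diagnostic pages (slots, LEDs, sensors)

None of this is GNR-specific; it is simply the plumbing that GNR, or any software RAID layer, has to reason about, which is where the per-enclosure quirks described below come in.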
GNR has a very close relationship with physical disk devices, much more so than regular GPFS. In an ideal world, SCSI and SES standards are supposed to provide a framework which would allow software like GNR to operate on an arbitrary disk enclosure. In the real world, the actual SES implementations on various enclosures that we've been dealing with are, well, peculiar. Apparently SES is one of those standards where vendors feel a lot of freedom in "re-interpreting" the standard, and since typically enclosures talk to a small set of RAID controllers, there aren't bad enough consequences to force vendors to be religious about SES standard compliance. Furthermore, the SAS fabric topology in configurations with an external disk enclosures is surprisingly complex, and that complexity predictably leads to complex failures which don't exist in simpler configurations. Thus far, every single one of the five enclosures we've had a chance to run GNR on required some adjustments, workarounds, hacks, etc. And the consequences of a misbehaving SAS fabric can be quite dire. There are various approaches to dealing with those complications, from running a massive 3rd party hardware qualification program to basically declaring any complications from an unknown enclosure to be someone else's problem (how would ZFS deal with a SCSI reset storm due to a bad SAS expander?), but there's much debate on what is the right path to take. Customer input/feedback is obviously very valuable in tilting such discussions in the right direction. yuri From: Aaron Knister To: gpfsug-discuss at spectrumscale.org, Date: 09/28/2016 06:44 PM Subject: Re: [gpfsug-discuss] gpfs native raid Sent by: gpfsug-discuss-bounces at spectrumscale.org Thanks Everyone for your replies! (Quick disclaimer, these opinions are my own, and not those of my employer or NASA). Not knowing what's coming at the NDA session, it seems to boil down to "it ain't gonna happen" because of: - Perceived difficulty in supporting whatever creative hardware solutions customers may throw at it. I understand the support concerns, but I naively thought that assuming the hardware meets a basic set of requirements (e.g. redundant sas paths, x type of drives) it would be fairly supportable with GNR. The DS3700 shelves are re-branded NetApp E-series shelves and pretty vanilla I thought. - IBM would like to monetize the product and compete with the likes of DDN/Seagate This is admittedly a little disappointing. GPFS as long as I've known it has been largely hardware vendor agnostic. To see even a slight shift towards hardware vendor lockin and certain features only being supported and available on IBM hardware is concerning. It's not like the software itself is free. Perhaps GNR could be a paid add-on license for non-IBM hardware? Just thinking out-loud. The big things I was looking to GNR for are: - end-to-end checksums - implementing a software RAID layer on (in my case enterprise class) JBODs I can find a way to do the second thing, but the former I cannot. Requiring IBM hardware to get end-to-end checksums is a huge red flag for me. That's something Lustre will do today with ZFS on any hardware ZFS will run on (and for free, I might add). I would think GNR being openly available to customers would be important for GPFS to compete with Lustre. Furthermore, I had opened an RFE (#84523) a while back to implement checksumming of data for non-GNR environments. The RFE was declined because essentially it would be too hard and it already exists for GNR. 
Well, considering I don't have a GNR environment, and hardware vendor lock in is something many sites are not interested in, that's somewhat of a problem. I really hope IBM reconsiders their stance on opening up GNR. The current direction, while somewhat understandable, leaves a really bad taste in my mouth and is one of the (very few, in my opinion) features Lustre has over GPFS. -Aaron On 9/1/16 9:59 AM, Marc A Kaplan wrote: > I've been told that it is a big leap to go from supporting GSS and ESS > to allowing and supporting native raid for customers who may throw > together "any" combination of hardware they might choose. > > In particular the GNR "disk hospital" functions... > https://www.ibm.com/support/knowledgecenter/SSFKCN_3.5.0/com.ibm.cluster.gpfs.v3r5.gpfs200.doc/bl1adv_introdiskhospital.htm > will be tricky to support on umpteen different vendor boxes -- and keep > in mind, those will be from IBM competitors! > > That said, ESS and GSS show that IBM has some good tech in this area and > IBM has shown with the Spectrum Scale product (sans GNR) it can support > just about any semi-reasonable hardware configuration and a good slew of > OS versions and architectures... Heck I have a demo/test version of GPFS > running on a 5 year old Thinkpad laptop.... And we have some GSSs in the > lab... Not to mention Power hardware and mainframe System Z (think 360, > 370, 290, Z) > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From stef.coene at docum.org Fri Sep 30 14:03:01 2016 From: stef.coene at docum.org (Stef Coene) Date: Fri, 30 Sep 2016 15:03:01 +0200 Subject: [gpfsug-discuss] Toolkit Message-ID: Hi, When using the toolkit, all config data is stored in clusterdefinition.txt When you modify the cluster with mm* commands, the toolkit is unaware of these changes. Is it possible to recreate the clusterdefinition.txt based on the current configuration? Stef From matthew at ellexus.com Fri Sep 30 16:30:11 2016 From: matthew at ellexus.com (Matthew Harris) Date: Fri, 30 Sep 2016 16:30:11 +0100 Subject: [gpfsug-discuss] Introduction from Ellexus Message-ID: Hello everyone, Ellexus is the IO profiling company with tools for load balancing shared storage, solving IO performance issues and detecting rogue jobs that have bad IO patterns. We have a good number of customers who use Spectrum Scale so we do a lot of work to support it. We have clients and partners working across the HPC space including semiconductor, life sciences, oil and gas, automotive and finance. We're based in Cambridge, England. Some of you will have already met our CEO, Rosemary Francis. Looking forward to meeting you at SC16. Matthew Harris Account Manager & Business Development - Ellexus Ltd *www.ellexus.com * *Ellexus Ltd is a limited company registered in England & Wales* *Company registration no. 
07166034* *Registered address: 198 High Street, Tonbridge, Kent TN9 1BE, UK* *Operating address: St John's Innovation Centre, Cowley Road, Cambridge CB4 0WS, UK* -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.s.knister at nasa.gov Fri Sep 30 21:56:29 2016 From: aaron.s.knister at nasa.gov (Aaron Knister) Date: Fri, 30 Sep 2016 16:56:29 -0400 Subject: [gpfsug-discuss] gpfs native raid In-Reply-To: References: <96282850-6bfa-73ae-8502-9e8df3a56390@nasa.gov> Message-ID: <2f59d32a-fc0f-3f03-dd95-3465611dc841@nasa.gov> Thanks, Yuri. Your replies are always quite enjoyable to read. I didn't realize SES was such a loosely interpreted standard, I just assumed it was fairly straightforward. We've got a number of JBODs here we manage via SES using the Linux enclosure module (e.g. /sys/class/enclosure) and they seem to "just work" but we're not doing anything terribly advanced, mostly just turning on/off various status LEDs. I should clarify: the newer SAS enclosures I've encountered seem quite good; some of the older enclosures (in particular the Xyratex enclosure used by DDN in its S2A units) were a real treat to interact with and didn't seem to follow the SES standard in spirit. I can certainly accept the complexity argument here. I think for our purposes a "reasonable level" of support would be all we're after. I'm not sure how ZFS would deal with a SCSI reset storm; I suspect the pool would just offline itself if enough paths seemed to disappear or time out. If I could make GPFS work well with ZFS as the underlying storage target I would be quite happy. So far I have struggled to make it performant. GPFS seems to assume that once a block device accepts a write, it is committed to stable storage. With ZFS ZVOLs this isn't the case by default. Making it the case (setting the sync=always parameter) causes a *massive* degradation in performance. If GPFS were to issue sync commands at appropriate intervals then I think we could make this work well. I'm not sure how to go about this, though, and given frequent enough SCSI sync commands to a given LUN, its performance would likely degrade to the current state of ZFS with sync=always. At any rate, we'll see how things go. Thanks again. -Aaron On 9/30/16 1:43 AM, Yuri L Volobuev wrote: > The issue of "GNR as software" is a pretty convoluted mixture of > technical, business, and resource constraints issues. While some of the > technical issues can be discussed here, obviously the other > considerations cannot be discussed in a public forum. So you won't be > able to get a complete understanding of the situation by discussing it here. > >> I understand the support concerns, but I naively thought that assuming >> the hardware meets a basic set of requirements (e.g. redundant sas >> paths, x type of drives) it would be fairly supportable with GNR. The >> DS3700 shelves are re-branded NetApp E-series shelves and pretty vanilla >> I thought. > > Setting business issues aside, this is more complicated on the technical > level than one may think. At present, GNR requires a set of twin-tailed > external disk enclosures. This is not a particularly exotic kind of > hardware, but it turns out that this corner of the storage world is > quite insular. GNR has a very close relationship with physical disk > devices, much more so than regular GPFS. In an ideal world, SCSI and > SES standards are supposed to provide a framework which would allow > software like GNR to operate on an arbitrary disk enclosure.
In the > real world, the actual SES implementations on various enclosures that > we've been dealing with are, well, peculiar. Apparently SES is one of > those standards where vendors feel a lot of freedom in "re-interpreting" > the standard, and since typically enclosures talk to a small set of RAID > controllers, there aren't bad enough consequences to force vendors to be > religious about SES standard compliance. Furthermore, the SAS fabric > topology in configurations with an external disk enclosures is > surprisingly complex, and that complexity predictably leads to complex > failures which don't exist in simpler configurations. Thus far, every > single one of the five enclosures we've had a chance to run GNR on > required some adjustments, workarounds, hacks, etc. And the > consequences of a misbehaving SAS fabric can be quite dire. There are > various approaches to dealing with those complications, from running a > massive 3rd party hardware qualification program to basically declaring > any complications from an unknown enclosure to be someone else's problem > (how would ZFS deal with a SCSI reset storm due to a bad SAS expander?), > but there's much debate on what is the right path to take. Customer > input/feedback is obviously very valuable in tilting such discussions in > the right direction. > > yuri > > Inactive hide details for Aaron Knister ---09/28/2016 06:44:23 > PM---Thanks Everyone for your replies! (Quick disclaimer, these Aaron > Knister ---09/28/2016 06:44:23 PM---Thanks Everyone for your replies! > (Quick disclaimer, these opinions are my own, and not those of my > > From: Aaron Knister > To: gpfsug-discuss at spectrumscale.org, > Date: 09/28/2016 06:44 PM > Subject: Re: [gpfsug-discuss] gpfs native raid > Sent by: gpfsug-discuss-bounces at spectrumscale.org > > ------------------------------------------------------------------------ > > > > Thanks Everyone for your replies! (Quick disclaimer, these opinions are > my own, and not those of my employer or NASA). > > Not knowing what's coming at the NDA session, it seems to boil down to > "it ain't gonna happen" because of: > > - Perceived difficulty in supporting whatever creative hardware > solutions customers may throw at it. > > I understand the support concerns, but I naively thought that assuming > the hardware meets a basic set of requirements (e.g. redundant sas > paths, x type of drives) it would be fairly supportable with GNR. The > DS3700 shelves are re-branded NetApp E-series shelves and pretty vanilla > I thought. > > - IBM would like to monetize the product and compete with the likes of > DDN/Seagate > > This is admittedly a little disappointing. GPFS as long as I've known it > has been largely hardware vendor agnostic. To see even a slight shift > towards hardware vendor lockin and certain features only being supported > and available on IBM hardware is concerning. It's not like the software > itself is free. Perhaps GNR could be a paid add-on license for non-IBM > hardware? Just thinking out-loud. > > The big things I was looking to GNR for are: > > - end-to-end checksums > - implementing a software RAID layer on (in my case enterprise class) JBODs > > I can find a way to do the second thing, but the former I cannot. > Requiring IBM hardware to get end-to-end checksums is a huge red flag > for me. That's something Lustre will do today with ZFS on any hardware > ZFS will run on (and for free, I might add). 
I would think GNR being > openly available to customers would be important for GPFS to compete > with Lustre. Furthermore, I had opened an RFE (#84523) a while back to > implement checksumming of data for non-GNR environments. The RFE was > declined because essentially it would be too hard and it already exists > for GNR. Well, considering I don't have a GNR environment, and hardware > vendor lock in is something many sites are not interested in, that's > somewhat of a problem. > > I really hope IBM reconsiders their stance on opening up GNR. The > current direction, while somewhat understandable, leaves a really bad > taste in my mouth and is one of the (very few, in my opinion) features > Lustre has over GPFS. > > -Aaron > > > On 9/1/16 9:59 AM, Marc A Kaplan wrote: >> I've been told that it is a big leap to go from supporting GSS and ESS >> to allowing and supporting native raid for customers who may throw >> together "any" combination of hardware they might choose. >> >> In particular the GNR "disk hospital" functions... >> https://www.ibm.com/support/knowledgecenter/SSFKCN_3.5.0/com.ibm.cluster.gpfs.v3r5.gpfs200.doc/bl1adv_introdiskhospital.htm >> will be tricky to support on umpteen different vendor boxes -- and keep >> in mind, those will be from IBM competitors! >> >> That said, ESS and GSS show that IBM has some good tech in this area and >> IBM has shown with the Spectrum Scale product (sans GNR) it can support >> just about any semi-reasonable hardware configuration and a good slew of >> OS versions and architectures... Heck I have a demo/test version of GPFS >> running on a 5 year old Thinkpad laptop.... And we have some GSSs in the >> lab... Not to mention Power hardware and mainframe System Z (think 360, >> 370, 290, Z) >> >> >> >> >> _______________________________________________ >> gpfsug-discuss mailing list >> gpfsug-discuss at spectrumscale.org >> http://gpfsug.org/mailman/listinfo/gpfsug-discuss >> > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -- Aaron Knister NASA Center for Climate Simulation (Code 606.2) Goddard Space Flight Center (301) 286-2776
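A short footnote on the sync=always point raised earlier in this message: the behaviour Aaron describes is controlled by ZFS's per-dataset sync property, so the trade-off can be reproduced with something like the following sketch (pool and volume names are made up, and this is an illustration, not a recommended configuration):

zfs create -V 1T tank/gpfs_nsd1     # create a zvol; it shows up as /dev/zvol/tank/gpfs_nsd1
zfs get sync tank/gpfs_nsd1         # default is sync=standard: only explicit flushes are honoured
zfs set sync=always tank/gpfs_nsd1  # commit every write to stable storage before acknowledging it

sync=always gives the durability GPFS expects from a block device, at the performance cost described above; sync=standard is fast but only safe if the consumer issues explicit flushes at the right times, which is exactly the gap being discussed.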