From abeattie at au1.ibm.com Sat Jun 1 11:11:42 2019 From: abeattie at au1.ibm.com (Andrew Beattie) Date: Sat, 1 Jun 2019 10:11:42 +0000 Subject: [gpfsug-discuss] Gateway role on a NSD server In-Reply-To: References: Message-ID: An HTML attachment was scrubbed... URL: From marc.caubet at psi.ch Mon Jun 3 09:51:53 2019 From: marc.caubet at psi.ch (Caubet Serrabou Marc (PSI)) Date: Mon, 3 Jun 2019 08:51:53 +0000 Subject: [gpfsug-discuss] About new Lenovo DSS Software Release Message-ID: <0081EB235765E14395278B9AE1DF34180FE897CC@MBX214.d.ethz.ch> Dear all, this question mostly targets Lenovo Engineers and customers. Is there any update about the release date for the new software for Lenovo DSS G-Series? Also, I would like to know which version of GPFS will come with this software. Thanks a lot, Marc _________________________________________________________ Paul Scherrer Institut High Performance Computing & Emerging Technologies Marc Caubet Serrabou Building/Room: OHSA/014 Forschungsstrasse, 111 5232 Villigen PSI Switzerland Telephone: +41 56 310 46 67 E-Mail: marc.caubet at psi.ch -------------- next part -------------- An HTML attachment was scrubbed... URL: From TROPPENS at de.ibm.com Wed Jun 5 09:42:15 2019 From: TROPPENS at de.ibm.com (Ulf Troppens) Date: Wed, 5 Jun 2019 10:42:15 +0200 Subject: [gpfsug-discuss] Agenda - User Meeting along ISC Frankfurt In-Reply-To: References: Message-ID: The agenda is now published: https://www.spectrumscaleug.org/event/spectrum-scale-user-group-meeting-isc/ Please use the registration link to attend. Looking forward to meet many of you there. -- IBM Spectrum Scale Development - Client Engagements & Solutions Delivery Consulting IT Specialist Author "Storage Networks Explained" IBM Deutschland Research & Development GmbH Vorsitzender des Aufsichtsrats: Matthias Hartmann Gesch?ftsf?hrung: Dirk Wittkopp Sitz der Gesellschaft: B?blingen / Registergericht: Amtsgericht Stuttgart, HRB 243294 From: "Ulf Troppens" To: "gpfsug main discussion list" Date: 22/05/2019 10:55 Subject: [EXTERNAL] [gpfsug-discuss] Save the date - User Meeting along ISC Frankfurt Sent by: gpfsug-discuss-bounces at spectrumscale.org Greetings: IBM will host a joint "IBM Spectrum Scale and IBM Spectrum LSF User Meeting" at ISC. As with other user group meetings, the agenda will include user stories, updates on IBM Spectrum Scale & IBM Spectrum LSF, and access to IBM experts and your peers. We are still looking for customers to talk about their experience with Spectrum Scale and/or Spectrum LSF. Please send me a personal mail, if you are interested to talk. The meeting is planned for: Monday June 17th, 2019 - 1pm-5.30pm ISC Frankfurt, Germany I will send more details later. Best, Ulf -- IBM Spectrum Scale Development - Client Engagements & Solutions Delivery Consulting IT Specialist Author "Storage Networks Explained" IBM Deutschland Research & Development GmbH Vorsitzender des Aufsichtsrats: Matthias Hartmann Gesch?ftsf?hrung: Dirk Wittkopp Sitz der Gesellschaft: B?blingen / Registergericht: Amtsgericht Stuttgart, HRB 243294 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=kZaabFheMr5-INuBtDMnDjxzZMuvvQ-K0cx1FAfh4lg&m=uUqyuk8-P-Ra6X6T7ReoLj3kWy-VUg53oU2RZpf8bbg&s=XCJDxns17Ixdyviy_nuN0pCJsTkAN6dxCU994sl33qo&e= -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From knop at us.ibm.com Fri Jun 7 22:45:31 2019 From: knop at us.ibm.com (Felipe Knop) Date: Fri, 7 Jun 2019 17:45:31 -0400 Subject: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 Message-ID: All, There have been reported issues (including kernel crashes) on Spectrum Scale with the latest RHEL7.6 kernel 3.10.0-957.21.2. Please consider delaying upgrades to this kernel until further information is provided. Thanks, Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 -------------- next part -------------- An HTML attachment was scrubbed... URL: From zmance at ucar.edu Fri Jun 7 22:51:13 2019 From: zmance at ucar.edu (Zachary Mance) Date: Fri, 7 Jun 2019 15:51:13 -0600 Subject: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 In-Reply-To: References: Message-ID: Which versions of Spectrum Scale versions are you referring to? 5.0.2-3? --------------------------------------------------------------------------------------------------------------- Zach Mance zmance at ucar.edu (303) 497-1883 HPC Data Infrastructure Group / CISL / NCAR --------------------------------------------------------------------------------------------------------------- On Fri, Jun 7, 2019 at 3:45 PM Felipe Knop wrote: > All, > > There have been reported issues (including kernel crashes) on Spectrum > Scale with the latest RHEL7.6 kernel 3.10.0-957.21.2. Please consider > delaying upgrades to this kernel until further information is provided. > > Thanks, > > Felipe > > ---- > Felipe Knop knop at us.ibm.com > GPFS Development and Security > IBM Systems > IBM Building 008 > 2455 South Rd, Poughkeepsie, NY 12601 > (845) 433-9314 T/L 293-9314 > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From knop at us.ibm.com Fri Jun 7 23:07:49 2019 From: knop at us.ibm.com (Felipe Knop) Date: Fri, 7 Jun 2019 18:07:49 -0400 Subject: [gpfsug-discuss] =?utf-8?q?Spectrum_Scale_with_RHEL7=2E6_kernel?= =?utf-8?b?CTMuMTAuMC05NTcuMjEuMg==?= In-Reply-To: References:

Message-ID: Zach, This appears to be affecting all Scale versions, including 5.0.2 -- but only when moving to the new 3.10.0-957.21.2 kernel. (3.10.0-957 is not impacted) Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 From: Zachary Mance To: gpfsug main discussion list Date: 06/07/2019 05:51 PM Subject: [EXTERNAL] Re: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 Sent by: gpfsug-discuss-bounces at spectrumscale.org Which versions of Spectrum Scale versions are?you referring to? 5.0.2-3? --------------------------------------------------------------------------------------------------------------- Zach Mance ?zmance at ucar.edu??(303) 497-1883 HPC Data Infrastructure Group?/ CISL / NCAR ---------------------------------------------------------------------------------------------------------------? On Fri, Jun 7, 2019 at 3:45 PM Felipe Knop wrote: All, There have been reported issues (including kernel crashes) on Spectrum Scale with the latest RHEL7.6 kernel 3.10.0-957.21.2. Please consider delaying upgrades to this kernel until further information is provided. Thanks, Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=oNT2koCZX0xmWlSlLblR9Q&m=ZcS98SBJVzdDsVcuu7KjSr64rfzEBaFDD86UkLkp8Vw&s=mjERh67H5DB6dfP0I1KES4-9Ku25AVoQxHoB5gArxR4&e= -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From Robert.Oesterlin at nuance.com Sat Jun 8 18:22:12 2019 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Sat, 8 Jun 2019 17:22:12 +0000 Subject: [gpfsug-discuss] Forcing an internal mount to complete Message-ID: I have a few file systems that are showing ?internal mount? on my NSD servers, even though they are not mounted. I?d like to force them, without have to restart GPFS on those nodes - any options? Not mounted on any other (local cluster) nodes. Bob Oesterlin Sr Principal Storage Engineer, Nuance -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaron.knister at gmail.com Sun Jun 9 02:16:08 2019 From: aaron.knister at gmail.com (Aaron Knister) Date: Sat, 8 Jun 2019 21:16:08 -0400 Subject: [gpfsug-discuss] Forcing an internal mount to complete In-Reply-To: References: Message-ID: <9892D4F1-2A0F-4D4E-BA63-F72A80442BEF@gmail.com> Bob, I wonder if something like an ?mmdf? or an ?mmchmgr? would trigger the internal mounts to release. Sent from my iPhone > On Jun 8, 2019, at 13:22, Oesterlin, Robert wrote: > > I have a few file systems that are showing ?internal mount? on my NSD servers, even though they are not mounted. I?d like to force them, without have to restart GPFS on those nodes - any options? > > Not mounted on any other (local cluster) nodes. > > > Bob Oesterlin > Sr Principal Storage Engineer, Nuance > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: From salut4tions at gmail.com Sun Jun 9 04:24:47 2019 From: salut4tions at gmail.com (Jordan Robertson) Date: Sat, 8 Jun 2019 23:24:47 -0400 Subject: [gpfsug-discuss] Forcing an internal mount to complete In-Reply-To: <9892D4F1-2A0F-4D4E-BA63-F72A80442BEF@gmail.com> References: <9892D4F1-2A0F-4D4E-BA63-F72A80442BEF@gmail.com> Message-ID: Hey Bob, Ditto on what Aaron said, it sounds as if the last fs manager might need a nudge. Things can get weird when a filesystem isn't mounted anywhere but a manager is needed for an operation though, so I would keep an eye on the ras logs of the cluster manager during the kick just to make sure the management duty isn't bouncing (which in turn can cause waiters). -Jordan On Sat, Jun 8, 2019 at 9:16 PM Aaron Knister wrote: > Bob, I wonder if something like an ?mmdf? or an ?mmchmgr? would trigger > the internal mounts to release. > > Sent from my iPhone > > On Jun 8, 2019, at 13:22, Oesterlin, Robert > wrote: > > I have a few file systems that are showing ?internal mount? on my NSD > servers, even though they are not mounted. I?d like to force them, without > have to restart GPFS on those nodes - any options? > > > > Not mounted on any other (local cluster) nodes. > > > > > > Bob Oesterlin > > Sr Principal Storage Engineer, Nuance > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From Robert.Oesterlin at nuance.com Sun Jun 9 13:18:39 2019 From: Robert.Oesterlin at nuance.com (Oesterlin, Robert) Date: Sun, 9 Jun 2019 12:18:39 +0000 Subject: [gpfsug-discuss] [EXTERNAL] Re: Forcing an internal mount to complete In-Reply-To: References: <9892D4F1-2A0F-4D4E-BA63-F72A80442BEF@gmail.com> Message-ID: Thanks for the suggestions - as it turns out, it was the *remote* mounts causing the issues - which surprises me. I wanted to do a ?mmchfs? on the local cluster, to change the default mount point. Why would GPFS care if it?s remote mounted? Oh - well? Bob Oesterlin Sr Principal Storage Engineer, Nuance -------------- next part -------------- An HTML attachment was scrubbed... URL: From salut4tions at gmail.com Sun Jun 9 14:20:28 2019 From: salut4tions at gmail.com (Jordan Robertson) Date: Sun, 9 Jun 2019 09:20:28 -0400 Subject: [gpfsug-discuss] [EXTERNAL] Re: Forcing an internal mount to complete In-Reply-To: References: <9892D4F1-2A0F-4D4E-BA63-F72A80442BEF@gmail.com>

Message-ID: If there's any I/O going to the filesystem at all, GPFS has to keep it internally mounted on at least a few nodes such as the token managers and fs manager. I *believe* that holds true even for remote clusters, in that they still need to reach back to the "local" cluster when operating on the filesystem. I could be wrong about that though. On Sun, Jun 9, 2019, 09:06 Oesterlin, Robert wrote: > Thanks for the suggestions - as it turns out, it was the **remote** > mounts causing the issues - which surprises me. I wanted to do a ?mmchfs? > on the local cluster, to change the default mount point. Why would GPFS > care if it?s remote mounted? > > > > Oh - well? > > > > > > Bob Oesterlin > > Sr Principal Storage Engineer, Nuance > > > > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: From spectrumscale at kiranghag.com Sun Jun 9 14:38:29 2019 From: spectrumscale at kiranghag.com (KG) Date: Sun, 9 Jun 2019 19:08:29 +0530 Subject: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 In-Reply-To: References:

Message-ID: One of my customer already upgraded their DR site. Is rollback advised? They will be running from DR site for a day in another week. On Sat, Jun 8, 2019, 03:37 Felipe Knop wrote: > Zach, > > This appears to be affecting all Scale versions, including 5.0.2 -- but > only when moving to the new 3.10.0-957.21.2 kernel. (3.10.0-957 is not > impacted) > > Felipe > > ---- > Felipe Knop knop at us.ibm.com > GPFS Development and Security > IBM Systems > IBM Building 008 > 2455 South Rd, Poughkeepsie, NY 12601 > (845) 433-9314 T/L 293-9314 > > > > [image: Inactive hide details for Zachary Mance ---06/07/2019 05:51:37 > PM---Which versions of Spectrum Scale versions are you referring]Zachary > Mance ---06/07/2019 05:51:37 PM---Which versions of Spectrum Scale versions > are you referring to? 5.0.2-3? --------------------------- > > From: Zachary Mance > To: gpfsug main discussion list > Date: 06/07/2019 05:51 PM > Subject: [EXTERNAL] Re: [gpfsug-discuss] Spectrum Scale with RHEL7.6 > kernel 3.10.0-957.21.2 > Sent by: gpfsug-discuss-bounces at spectrumscale.org > ------------------------------ > > > > Which versions of Spectrum Scale versions are you referring to? 5.0.2-3? > > --------------------------------------------------------------------------------------------------------------- > Zach Mance *zmance at ucar.edu* (303) 497-1883 > > HPC Data Infrastructure Group / CISL / NCAR > > --------------------------------------------------------------------------------------------------------------- > > > On Fri, Jun 7, 2019 at 3:45 PM Felipe Knop <*knop at us.ibm.com* > > wrote: > > All, > > There have been reported issues (including kernel crashes) on Spectrum > Scale with the latest RHEL7.6 kernel 3.10.0-957.21.2. Please consider > delaying upgrades to this kernel until further information is provided. > > Thanks, > > Felipe > > ---- > Felipe Knop *knop at us.ibm.com* > GPFS Development and Security > IBM Systems > IBM Building 008 > 2455 South Rd, Poughkeepsie, NY 12601 > (845) 433-9314 T/L 293-9314 > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at *spectrumscale.org* > *http://gpfsug.org/mailman/listinfo/gpfsug-discuss* > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > > > > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at spectrumscale.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From scottg at emailhosting.com Sun Jun 9 18:32:24 2019 From: scottg at emailhosting.com (Scott Goldman) Date: Sun, 09 Jun 2019 18:32:24 +0100 Subject: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 In-Reply-To: Message-ID: An HTML attachment was scrubbed... URL: From knop at us.ibm.com Mon Jun 10 05:29:14 2019 From: knop at us.ibm.com (Felipe Knop) Date: Mon, 10 Jun 2019 00:29:14 -0400 Subject: [gpfsug-discuss] =?utf-8?q?Spectrum_Scale_with_RHEL7=2E6=09kernel?= =?utf-8?b?CTMuMTAuMC05NTcuMjEuMg==?= In-Reply-To: References: Message-ID: Scott, Currently, we are only aware of the problem with 3.10.0-957.21.2 . We are not yet aware of the same problems also affecting 3.10.0-957.12.1, but hope to find out more shortly. Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 From: Scott Goldman To: gpfsug main discussion list Date: 06/09/2019 01:50 PM Subject: [EXTERNAL] Re: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 Sent by: gpfsug-discuss-bounces at spectrumscale.org And to be clear.. There is a .12 version: 3.10.0-957.12.1.el7.x86_64 Did you mean the .12 version or the .21? Conveniently, the kernel numbers are easily proposed! Sent from my BlackBerry - the most secure mobile device From: spectrumscale at kiranghag.com Sent: June 9, 2019 2:38 PM To: gpfsug-discuss at spectrumscale.org Reply-to: gpfsug-discuss at spectrumscale.org Subject: Re: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 One of my customer already upgraded their DR site. Is rollback advised? They will be running from DR site for a day in another week. On Sat, Jun 8, 2019, 03:37 Felipe Knop wrote: Zach, This appears to be affecting all Scale versions, including 5.0.2 -- but only when moving to the new 3.10.0-957.21.2 kernel. (3.10.0-957 is not impacted) Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 Zachary Mance ---06/07/2019 05:51:37 PM---Which versions of Spectrum Scale versions are you referring to? 5.0.2-3? --------------------------- From: Zachary Mance To: gpfsug main discussion list Date: 06/07/2019 05:51 PM Subject: [EXTERNAL] Re: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 Sent by: gpfsug-discuss-bounces at spectrumscale.org Which versions of Spectrum Scale versions are you referring to? 5.0.2-3? --------------------------------------------------------------------------------------------------------------- Zach Mance zmance at ucar.edu (303) 497-1883 HPC Data Infrastructure Group / CISL / NCAR --------------------------------------------------------------------------------------------------------------- On Fri, Jun 7, 2019 at 3:45 PM Felipe Knop wrote: All, There have been reported issues (including kernel crashes) on Spectrum Scale with the latest RHEL7.6 kernel 3.10.0-957.21.2. Please consider delaying upgrades to this kernel until further information is provided. Thanks, Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=oNT2koCZX0xmWlSlLblR9Q&m=fQfU5Pw8BtsrqD8JCFskfMdm8ZIGWtDY-gMtk_iljwU&s=vVEdtvFYxwXzh3n52YWo4_XJIh4IvWzRl3NaAkmA-9E&e= -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From knop at us.ibm.com Mon Jun 10 05:41:29 2019 From: knop at us.ibm.com (Felipe Knop) Date: Mon, 10 Jun 2019 00:41:29 -0400 Subject: [gpfsug-discuss] =?utf-8?q?Spectrum_Scale_with_RHEL7=2E6_kernel?= =?utf-8?b?CTMuMTAuMC05NTcuMjEuMg==?= In-Reply-To: References:

Message-ID: Hi, Though we are still learning what workload results in the problem, it appears that even minimal I/O on the file system may cause the OS to crash. One pattern that we saw was 'mkdir'. There is a chance that the DR site was not yet impacted because no I/O workload has been run there. In that case, rolling back to the prior kernel level (one which has been tested before) may be advisable. Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 From: KG To: gpfsug main discussion list Date: 06/09/2019 09:38 AM Subject: [EXTERNAL] Re: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 Sent by: gpfsug-discuss-bounces at spectrumscale.org One of my customer already upgraded their DR site. Is rollback advised? They will be running from DR site for a day in another?week. On Sat, Jun 8, 2019, 03:37 Felipe Knop wrote: Zach, This appears to be affecting all Scale versions, including 5.0.2 -- but only when moving to the new 3.10.0-957.21.2 kernel. (3.10.0-957 is not impacted) Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 Zachary Mance ---06/07/2019 05:51:37 PM---Which versions of Spectrum Scale versions are you referring to? 5.0.2-3? --------------------------- From: Zachary Mance To: gpfsug main discussion list Date: 06/07/2019 05:51 PM Subject: [EXTERNAL] Re: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 Sent by: gpfsug-discuss-bounces at spectrumscale.org Which versions of Spectrum Scale versions are?you referring to? 5.0.2-3? --------------------------------------------------------------------------------------------------------------- Zach Mance ?zmance at ucar.edu??(303) 497-1883 HPC Data Infrastructure Group?/ CISL / NCAR ---------------------------------------------------------------------------------------------------------------? On Fri, Jun 7, 2019 at 3:45 PM Felipe Knop wrote: All, There have been reported issues (including kernel crashes) on Spectrum Scale with the latest RHEL7.6 kernel 3.10.0-957.21.2. Please consider delaying upgrades to this kernel until further information is provided. Thanks, Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss[attachment "graycol.gif" deleted by Felipe Knop/Poughkeepsie/IBM] _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=oNT2koCZX0xmWlSlLblR9Q&m=7I4gXXVtdbnAsgAcK0NWr4-5d-a1bRr4578aC1wKRMo&s=jFJmGOvjWTjDfI_vI2pHOOvqzPw5rWbtLvrZdTEDtCg&e= -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From scottg at emailhosting.com Mon Jun 10 06:02:19 2019 From: scottg at emailhosting.com (Scott Goldman) Date: Mon, 10 Jun 2019 06:02:19 +0100 Subject: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 In-Reply-To: Message-ID: <3uok4eacuqj53g26epedg19j.1560142939257@emailhosting.com> An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From Renar.Grunenberg at huk-coburg.de Mon Jun 10 13:24:52 2019 From: Renar.Grunenberg at huk-coburg.de (Grunenberg, Renar) Date: Mon, 10 Jun 2019 12:24:52 +0000 Subject: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 In-Reply-To: References:

Message-ID: Hallo Felippe, here are the change list: RHBA-2019:1337 kernel bug fix update Summary: Updated kernel packages that fix various bugs are now available for Red Hat Enterprise Linux 7. The kernel packages contain the Linux kernel, the core of any Linux operating system. This update fixes the following bugs: * Mellanox CX-5 MAC learning with OVS H/W offload not working (BZ#1686292) * RHEL7.4 NFS4.1 client and server repeated SEQUENCE / TEST_STATEIDs with SEQUENCE Reply has SEQ4_STATUS_RECALLABLE_STATE_REVOKED set - NFS server should return NFS4ERR_DELEG_REVOKED or NFS4ERR_BAD_STATEID for revoked delegations (BZ#1689811) * PANIC: "BUG: unable to handle kernel paging request" in the mtip32xx mtip_init_cmd_header routine (BZ#1689929) * The nvme cli delete-ns command hangs indefinitely. (BZ#1690519) * drm/nouveau: nv50 - Graphics become sluggish or frozen for nvidia Pascal cards (Regression from 1584963) - Need to flush fb writes when rewinding push buffer (BZ#1690761) * [CEE/SD] Ceph+NFS server crashed and rebooted due to CephFS kernel client issue (BZ#1692266) * [Mellanox OVS offload] tc fails to calculate the checksum in case vlan trunk and header rewrite (BZ#1693110) * aio O_DIRECT writes to non-page-aligned file locations on ext4 can result in the overlapped portion of the page containing zeros (BZ#1693561) * [HP WS 7.6 bug] Audio driver does not recognize multi function audio jack microphone input (BZ#1693562) * XFS returns ENOSPC when using extent size hint with space still available (BZ#1693796) * OVN requires IPv6 to be enabled (BZ#1694981) * breaks DMA API for non-GPL drivers (BZ#1695511) * ovl_create can return positive retval and crash the host (BZ#1696292) * ceph: append mode is broken for sync/direct write (BZ#1696595) * Problem building module due to -EXPORT_SYMBOL_GPL/-EXPORT_SYMBOL (BZ#1697241) * Failed to load kpatch module after install the rpm package occasionally on ppc64le (BZ#1697867) * [Hyper-V][RHEL7] Stop suppressing PCID bit (BZ#1697940) * Resizing an online EXT4 filesystem on a loopback device hangs (BZ#1698110) * dm table: propagate BDI_CAP_STABLE_WRITES (BZ#1699722) * [ESXi][RHEL7.6]After upgrade to kernel-3.10.0-957.el7, system is unable to discover newly added VMware LSI Logic SAS virtual disks without a reboot. (BZ#1699723) * kernel: zcrypt: fix specification exception on z196 at ap probe (BZ#1700706) * XFS: Metadata corruption detected at xfs_attr3_leaf_write_verify() (BZ#1701293) * stime showed huge values related to wrong calculation of time deltas (L3:) (BZ#1701743) * Kernel panic due to NULL pointer dereference at sysfs_do_create_link_sd.isra.2+0x34 while loading [ipmi_si] module using hard-coded device (BZ#1701991) * IPv6 ECMP modulo N hashing inefficient when X^2 rt6i_nsiblings (BZ#1702282) * security: double-free attempted in security_inode_init_security() (BZ#1702286) * Missing wakeup leaves task stuck waiting in blk_queue_enter() (BZ#1702921) * Satellite Capsule sync triggers several XFS corruptions (BZ#1702922) * BUG: SELinux doesn't handle NFS crossmnt well (BZ#1702923) * md_clear flag missing from /proc/cpuinfo on late microcode update (BZ#1712993) * MDS mitigations are not enabled after double microcode update (BZ#1712998) * WARNING: CPU: 0 PID: 0 at kernel/jump_label.c:90 __static_key_slow_dec+0xa6/0xb0 (BZ#1713004) Users of kernel are advised to upgrade to these updated packages, which fix these bugs. Full details and references: https://access.redhat.com/errata/RHBA-2019:1337?sc_cid=701600000006NHXAA2 Revision History: Issue Date: 2019-06-04 Updated: 2019-06-04 Regards Renar Renar Grunenberg Abteilung Informatik - Betrieb HUK-COBURG Bahnhofsplatz 96444 Coburg Telefon: 09561 96-44110 Telefax: 09561 96-44104 E-Mail: Renar.Grunenberg at huk-coburg.de Internet: www.huk.de ________________________________ HUK-COBURG Haftpflicht-Unterst?tzungs-Kasse kraftfahrender Beamter Deutschlands a. G. in Coburg Reg.-Gericht Coburg HRB 100; St.-Nr. 9212/101/00021 Sitz der Gesellschaft: Bahnhofsplatz, 96444 Coburg Vorsitzender des Aufsichtsrats: Prof. Dr. Heinrich R. Schradin. Vorstand: Klaus-J?rgen Heitmann (Sprecher), Stefan Gronbach, Dr. Hans Olav Her?y, Dr. J?rg Rheinl?nder (stv.), Sarah R?ssler, Daniel Thomas. ________________________________ Diese Nachricht enth?lt vertrauliche und/oder rechtlich gesch?tzte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese Nachricht irrt?mlich erhalten haben, informieren Sie bitte sofort den Absender und vernichten Sie diese Nachricht. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Nachricht ist nicht gestattet. This information may contain confidential and/or privileged information. If you are not the intended recipient (or have received this information in error) please notify the sender immediately and destroy this information. Any unauthorized copying, disclosure or distribution of the material in this information is strictly forbidden. ________________________________ Von: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] Im Auftrag von Felipe Knop Gesendet: Montag, 10. Juni 2019 06:41 An: gpfsug main discussion list Betreff: Re: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 Hi, Though we are still learning what workload results in the problem, it appears that even minimal I/O on the file system may cause the OS to crash. One pattern that we saw was 'mkdir'. There is a chance that the DR site was not yet impacted because no I/O workload has been run there. In that case, rolling back to the prior kernel level (one which has been tested before) may be advisable. Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 [Inactive hide details for KG ---06/09/2019 09:38:55 AM---One of my customer already upgraded their DR site. Is rollback advised]KG ---06/09/2019 09:38:55 AM---One of my customer already upgraded their DR site. Is rollback advised? They will be running from DR From: KG > To: gpfsug main discussion list > Date: 06/09/2019 09:38 AM Subject: [EXTERNAL] Re: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ One of my customer already upgraded their DR site. Is rollback advised? They will be running from DR site for a day in another week. On Sat, Jun 8, 2019, 03:37 Felipe Knop > wrote: Zach, This appears to be affecting all Scale versions, including 5.0.2 -- but only when moving to the new 3.10.0-957.21.2 kernel. (3.10.0-957 is not impacted) Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 Zachary Mance ---06/07/2019 05:51:37 PM---Which versions of Spectrum Scale versions are you referring to? 5.0.2-3? --------------------------- From: Zachary Mance > To: gpfsug main discussion list > Date: 06/07/2019 05:51 PM Subject: [EXTERNAL] Re: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Which versions of Spectrum Scale versions are you referring to? 5.0.2-3? --------------------------------------------------------------------------------------------------------------- Zach Mance zmance at ucar.edu (303) 497-1883 HPC Data Infrastructure Group / CISL / NCAR --------------------------------------------------------------------------------------------------------------- On Fri, Jun 7, 2019 at 3:45 PM Felipe Knop > wrote: All, There have been reported issues (including kernel crashes) on Spectrum Scale with the latest RHEL7.6 kernel 3.10.0-957.21.2. Please consider delaying upgrades to this kernel until further information is provided. Thanks, Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss[attachment "graycol.gif" deleted by Felipe Knop/Poughkeepsie/IBM] _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.gif Type: image/gif Size: 105 bytes Desc: image001.gif URL: From Renar.Grunenberg at huk-coburg.de Mon Jun 10 13:43:02 2019 From: Renar.Grunenberg at huk-coburg.de (Grunenberg, Renar) Date: Mon, 10 Jun 2019 12:43:02 +0000 Subject: [gpfsug-discuss] WG: Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 In-Reply-To: <1848549064bc481fb5cb4dcc24c51376@SMXRF105.msg.hukrf.de> References:

<1848549064bc481fb5cb4dcc24c51376@SMXRF105.msg.hukrf.de> Message-ID: Hallo Felipe, here are the change list: RHBA-2019:1337 kernel bug fix update Summary: Updated kernel packages that fix various bugs are now available for Red Hat Enterprise Linux 7. The kernel packages contain the Linux kernel, the core of any Linux operating system. This update fixes the following bugs: * Mellanox CX-5 MAC learning with OVS H/W offload not working (BZ#1686292) * RHEL7.4 NFS4.1 client and server repeated SEQUENCE / TEST_STATEIDs with SEQUENCE Reply has SEQ4_STATUS_RECALLABLE_STATE_REVOKED set - NFS server should return NFS4ERR_DELEG_REVOKED or NFS4ERR_BAD_STATEID for revoked delegations (BZ#1689811) * PANIC: "BUG: unable to handle kernel paging request" in the mtip32xx mtip_init_cmd_header routine (BZ#1689929) * The nvme cli delete-ns command hangs indefinitely. (BZ#1690519) * drm/nouveau: nv50 - Graphics become sluggish or frozen for nvidia Pascal cards (Regression from 1584963) - Need to flush fb writes when rewinding push buffer (BZ#1690761) * [CEE/SD] Ceph+NFS server crashed and rebooted due to CephFS kernel client issue (BZ#1692266) * [Mellanox OVS offload] tc fails to calculate the checksum in case vlan trunk and header rewrite (BZ#1693110) * aio O_DIRECT writes to non-page-aligned file locations on ext4 can result in the overlapped portion of the page containing zeros (BZ#1693561) * [HP WS 7.6 bug] Audio driver does not recognize multi function audio jack microphone input (BZ#1693562) * XFS returns ENOSPC when using extent size hint with space still available (BZ#1693796) * OVN requires IPv6 to be enabled (BZ#1694981) * breaks DMA API for non-GPL drivers (BZ#1695511) * ovl_create can return positive retval and crash the host (BZ#1696292) * ceph: append mode is broken for sync/direct write (BZ#1696595) * Problem building module due to -EXPORT_SYMBOL_GPL/-EXPORT_SYMBOL (BZ#1697241) * Failed to load kpatch module after install the rpm package occasionally on ppc64le (BZ#1697867) * [Hyper-V][RHEL7] Stop suppressing PCID bit (BZ#1697940) * Resizing an online EXT4 filesystem on a loopback device hangs (BZ#1698110) * dm table: propagate BDI_CAP_STABLE_WRITES (BZ#1699722) * [ESXi][RHEL7.6]After upgrade to kernel-3.10.0-957.el7, system is unable to discover newly added VMware LSI Logic SAS virtual disks without a reboot. (BZ#1699723) * kernel: zcrypt: fix specification exception on z196 at ap probe (BZ#1700706) * XFS: Metadata corruption detected at xfs_attr3_leaf_write_verify() (BZ#1701293) * stime showed huge values related to wrong calculation of time deltas (L3:) (BZ#1701743) * Kernel panic due to NULL pointer dereference at sysfs_do_create_link_sd.isra.2+0x34 while loading [ipmi_si] module using hard-coded device (BZ#1701991) * IPv6 ECMP modulo N hashing inefficient when X^2 rt6i_nsiblings (BZ#1702282) * security: double-free attempted in security_inode_init_security() (BZ#1702286) * Missing wakeup leaves task stuck waiting in blk_queue_enter() (BZ#1702921) * Satellite Capsule sync triggers several XFS corruptions (BZ#1702922) * BUG: SELinux doesn't handle NFS crossmnt well (BZ#1702923) * md_clear flag missing from /proc/cpuinfo on late microcode update (BZ#1712993) * MDS mitigations are not enabled after double microcode update (BZ#1712998) * WARNING: CPU: 0 PID: 0 at kernel/jump_label.c:90 __static_key_slow_dec+0xa6/0xb0 (BZ#1713004) Users of kernel are advised to upgrade to these updated packages, which fix these bugs. Full details and references: https://access.redhat.com/errata/RHBA-2019:1337?sc_cid=701600000006NHXAA2 Revision History: Issue Date: 2019-06-04 Updated: 2019-06-04 Regards Renar Renar Grunenberg Abteilung Informatik - Betrieb HUK-COBURG Bahnhofsplatz 96444 Coburg Telefon: 09561 96-44110 Telefax: 09561 96-44104 E-Mail: Renar.Grunenberg at huk-coburg.de Internet: www.huk.de ________________________________ HUK-COBURG Haftpflicht-Unterst?tzungs-Kasse kraftfahrender Beamter Deutschlands a. G. in Coburg Reg.-Gericht Coburg HRB 100; St.-Nr. 9212/101/00021 Sitz der Gesellschaft: Bahnhofsplatz, 96444 Coburg Vorsitzender des Aufsichtsrats: Prof. Dr. Heinrich R. Schradin. Vorstand: Klaus-J?rgen Heitmann (Sprecher), Stefan Gronbach, Dr. Hans Olav Her?y, Dr. J?rg Rheinl?nder (stv.), Sarah R?ssler, Daniel Thomas. ________________________________ Diese Nachricht enth?lt vertrauliche und/oder rechtlich gesch?tzte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese Nachricht irrt?mlich erhalten haben, informieren Sie bitte sofort den Absender und vernichten Sie diese Nachricht. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Nachricht ist nicht gestattet. This information may contain confidential and/or privileged information. If you are not the intended recipient (or have received this information in error) please notify the sender immediately and destroy this information. Any unauthorized copying, disclosure or distribution of the material in this information is strictly forbidden. ________________________________ Von: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] Im Auftrag von Felipe Knop Gesendet: Montag, 10. Juni 2019 06:41 An: gpfsug main discussion list > Betreff: Re: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 Hi, Though we are still learning what workload results in the problem, it appears that even minimal I/O on the file system may cause the OS to crash. One pattern that we saw was 'mkdir'. There is a chance that the DR site was not yet impacted because no I/O workload has been run there. In that case, rolling back to the prior kernel level (one which has been tested before) may be advisable. Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 [Inactive hide details for KG ---06/09/2019 09:38:55 AM---One of my customer already upgraded their DR site. Is rollback advised]KG ---06/09/2019 09:38:55 AM---One of my customer already upgraded their DR site. Is rollback advised? They will be running from DR From: KG > To: gpfsug main discussion list > Date: 06/09/2019 09:38 AM Subject: [EXTERNAL] Re: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ One of my customer already upgraded their DR site. Is rollback advised? They will be running from DR site for a day in another week. On Sat, Jun 8, 2019, 03:37 Felipe Knop > wrote: Zach, This appears to be affecting all Scale versions, including 5.0.2 -- but only when moving to the new 3.10.0-957.21.2 kernel. (3.10.0-957 is not impacted) Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 Zachary Mance ---06/07/2019 05:51:37 PM---Which versions of Spectrum Scale versions are you referring to? 5.0.2-3? --------------------------- From: Zachary Mance > To: gpfsug main discussion list > Date: 06/07/2019 05:51 PM Subject: [EXTERNAL] Re: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 Sent by: gpfsug-discuss-bounces at spectrumscale.org ________________________________ Which versions of Spectrum Scale versions are you referring to? 5.0.2-3? --------------------------------------------------------------------------------------------------------------- Zach Mance zmance at ucar.edu (303) 497-1883 HPC Data Infrastructure Group / CISL / NCAR --------------------------------------------------------------------------------------------------------------- On Fri, Jun 7, 2019 at 3:45 PM Felipe Knop > wrote: All, There have been reported issues (including kernel crashes) on Spectrum Scale with the latest RHEL7.6 kernel 3.10.0-957.21.2. Please consider delaying upgrades to this kernel until further information is provided. Thanks, Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss[attachment "graycol.gif" deleted by Felipe Knop/Poughkeepsie/IBM] _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.gif Type: image/gif Size: 105 bytes Desc: image001.gif URL: From kraemerf at de.ibm.com Mon Jun 10 13:47:46 2019 From: kraemerf at de.ibm.com (Frank Kraemer) Date: Mon, 10 Jun 2019 14:47:46 +0200 Subject: [gpfsug-discuss] *NEWS* - IBM Spectrum Scale Erasure Code Edition v5.0.3 Message-ID: FYI - What is IBM Spectrum Scale Erasure Code Edition, and why should I consider it? IBM Spectrum Scale Erasure Code Edition provides all the functionality, reliability, scalability, and performance of IBM Spectrum Scale on the customer?s own choice of commodity hardware with the added benefit of network-dispersed IBM Spectrum Scale RAID, and all of its features providing data protection, storage efficiency, and the ability to manage storage in hyperscale environments. SAS, NL-SAS, and NVMe drives are supported right now. IBM Spectrum Scale Erasure Code Edition supports 4 different erasure codes: 4+2P, 4+3P, 8+2P, and 8+3P in addition to 3 and 4 way replication. Choosing an erasure code involves considering several factors. IBM Spectrum Scale Erasure Code Edition more details see section 18 in the Scale FAQ on the web https://www.ibm.com/support/knowledgecenter/STXKQY/gpfsclustersfaq.html Each IBM Spectrum Scale Erasure Code Edition recovery group can have 4 - 32 storage nodes, and there can be up to 128 storage nodes in an IBM Spectrum Scale cluster using IBM Spectrum Scale Erasure Code Edition. For more information, see Planning for erasure code selection in the IBM Spectrum Scale Erasure Code Edition Knowledge Center. https://www.ibm.com/support/knowledgecenter/en/STXKQY_ECE_5.0.3/ibmspectrumscaleece503_welcome.html Minimum requirements for IBM Spectrum Scale Erasure Code Edition see: https://www.ibm.com/support/knowledgecenter/STXKQY_ECE_5.0.3/com.ibm.spectrum.scale.ece.v5r03.doc/b1lece_min_hwrequirements.htm The hardware and network precheck tools can be downloaded from the following links: Hardware precheck: https://github.com/IBM/SpectrumScale_ECE_OS_READINESS Network precheck: https://github.com/IBM/SpectrumScale_NETWORK_READINESS The network can be either Ethernet or InfiniBand, and must be at least 25 Gbps bandwidth, with an average latency of 1.0 msec or less between any two storage nodes. -frank- -------------- next part -------------- An HTML attachment was scrubbed... URL: From knop at us.ibm.com Mon Jun 10 14:43:10 2019 From: knop at us.ibm.com (Felipe Knop) Date: Mon, 10 Jun 2019 09:43:10 -0400 Subject: [gpfsug-discuss] =?utf-8?q?WG=3A_Spectrum_Scale_with_RHEL7=2E6=09?= =?utf-8?q?kernel=093=2E10=2E0-957=2E21=2E2?= In-Reply-To: References:

<1848549064bc481fb5cb4dcc24c51376@SMXRF105.msg.hukrf.de> Message-ID: Renar, Thanks. Of the changes below, it appears that * security: double-free attempted in security_inode_init_security() (BZ#1702286) was the one that ended up triggering the problem. Our investigations now show that RHEL kernels >= 3.10.0-957.19.1 are impacted. Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 From: "Grunenberg, Renar" To: "'gpfsug-discuss at spectrumscale.org'" Date: 06/10/2019 08:43 AM Subject: [EXTERNAL] [gpfsug-discuss] WG: Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 Sent by: gpfsug-discuss-bounces at spectrumscale.org Hallo Felipe, here are the change list: RHBA-2019:1337 kernel bug fix update Summary: Updated kernel packages that fix various bugs are now available for Red Hat Enterprise Linux 7. The kernel packages contain the Linux kernel, the core of any Linux operating system. This update fixes the following bugs: * Mellanox CX-5 MAC learning with OVS H/W offload not working (BZ#1686292) * RHEL7.4 NFS4.1 client and server repeated SEQUENCE / TEST_STATEIDs with SEQUENCE Reply has SEQ4_STATUS_RECALLABLE_STATE_REVOKED set - NFS server should return NFS4ERR_DELEG_REVOKED or NFS4ERR_BAD_STATEID for revoked delegations (BZ#1689811) * PANIC: "BUG: unable to handle kernel paging request" in the mtip32xx mtip_init_cmd_header routine (BZ#1689929) * The nvme cli delete-ns command hangs indefinitely. (BZ#1690519) * drm/nouveau: nv50 - Graphics become sluggish or frozen for nvidia Pascal cards (Regression from 1584963) - Need to flush fb writes when rewinding push buffer (BZ#1690761) * [CEE/SD] Ceph+NFS server crashed and rebooted due to CephFS kernel client issue (BZ#1692266) * [Mellanox OVS offload] tc fails to calculate the checksum in case vlan trunk and header rewrite (BZ#1693110) * aio O_DIRECT writes to non-page-aligned file locations on ext4 can result in the overlapped portion of the page containing zeros (BZ#1693561) * [HP WS 7.6 bug] Audio driver does not recognize multi function audio jack microphone input (BZ#1693562) * XFS returns ENOSPC when using extent size hint with space still available (BZ#1693796) * OVN requires IPv6 to be enabled (BZ#1694981) * breaks DMA API for non-GPL drivers (BZ#1695511) * ovl_create can return positive retval and crash the host (BZ#1696292) * ceph: append mode is broken for sync/direct write (BZ#1696595) * Problem building module due to -EXPORT_SYMBOL_GPL/-EXPORT_SYMBOL (BZ#1697241) * Failed to load kpatch module after install the rpm package occasionally on ppc64le (BZ#1697867) * [Hyper-V][RHEL7] Stop suppressing PCID bit (BZ#1697940) * Resizing an online EXT4 filesystem on a loopback device hangs (BZ#1698110) * dm table: propagate BDI_CAP_STABLE_WRITES (BZ#1699722) * [ESXi][RHEL7.6]After upgrade to kernel-3.10.0-957.el7, system is unable to discover newly added VMware LSI Logic SAS virtual disks without a reboot. (BZ#1699723) * kernel: zcrypt: fix specification exception on z196 at ap probe (BZ#1700706) * XFS: Metadata corruption detected at xfs_attr3_leaf_write_verify() (BZ#1701293) * stime showed huge values related to wrong calculation of time deltas (L3:) (BZ#1701743) * Kernel panic due to NULL pointer dereference at sysfs_do_create_link_sd.isra.2+0x34 while loading [ipmi_si] module using hard-coded device (BZ#1701991) * IPv6 ECMP modulo N hashing inefficient when X^2 rt6i_nsiblings (BZ#1702282) * security: double-free attempted in security_inode_init_security() (BZ#1702286) * Missing wakeup leaves task stuck waiting in blk_queue_enter() (BZ#1702921) * Satellite Capsule sync triggers several XFS corruptions (BZ#1702922) * BUG: SELinux doesn't handle NFS crossmnt well (BZ#1702923) * md_clear flag missing from /proc/cpuinfo on late microcode update (BZ#1712993) * MDS mitigations are not enabled after double microcode update (BZ#1712998) * WARNING: CPU: 0 PID: 0 at kernel/jump_label.c:90 __static_key_slow_dec +0xa6/0xb0 (BZ#1713004) Users of kernel are advised to upgrade to these updated packages, which fix these bugs. Full details and references: https://access.redhat.com/errata/RHBA-2019:1337?sc_cid=701600000006NHXAA2 Revision History: Issue Date: 2019-06-04 Updated: 2019-06-04 Regards Renar Renar Grunenberg Abteilung Informatik - Betrieb HUK-COBURG Bahnhofsplatz 96444 Coburg Telefon: 09561 96-44110 Telefax: 09561 96-44104 E-Mail: Renar.Grunenberg at huk-coburg.de Internet: www.huk.de HUK-COBURG Haftpflicht-Unterst?tzungs-Kasse kraftfahrender Beamter Deutschlands a. G. in Coburg Reg.-Gericht Coburg HRB 100; St.-Nr. 9212/101/00021 Sitz der Gesellschaft: Bahnhofsplatz, 96444 Coburg Vorsitzender des Aufsichtsrats: Prof. Dr. Heinrich R. Schradin. Vorstand: Klaus-J?rgen Heitmann (Sprecher), Stefan Gronbach, Dr. Hans Olav Her?y, Dr. J?rg Rheinl?nder (stv.), Sarah R?ssler, Daniel Thomas. Diese Nachricht enth?lt vertrauliche und/oder rechtlich gesch?tzte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese Nachricht irrt?mlich erhalten haben, informieren Sie bitte sofort den Absender und vernichten Sie diese Nachricht. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Nachricht ist nicht gestattet. This information may contain confidential and/or privileged information. If you are not the intended recipient (or have received this information in error) please notify the sender immediately and destroy this information. Any unauthorized copying, disclosure or distribution of the material in this information is strictly forbidden. Von: gpfsug-discuss-bounces at spectrumscale.org [ mailto:gpfsug-discuss-bounces at spectrumscale.org] Im Auftrag von Felipe Knop Gesendet: Montag, 10. Juni 2019 06:41 An: gpfsug main discussion list Betreff: Re: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 Hi, Though we are still learning what workload results in the problem, it appears that even minimal I/O on the file system may cause the OS to crash. One pattern that we saw was 'mkdir'. There is a chance that the DR site was not yet impacted because no I/O workload has been run there. In that case, rolling back to the prior kernel level (one which has been tested before) may be advisable. Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 Inactive hide details for KG ---06/09/2019 09:38:55 AM---One of my customer already upgraded their DR site. Is rollback advisedKG ---06/09/2019 09:38:55 AM---One of my customer already upgraded their DR site. Is rollback advised? They will be running from DR From: KG To: gpfsug main discussion list Date: 06/09/2019 09:38 AM Subject: [EXTERNAL] Re: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 Sent by: gpfsug-discuss-bounces at spectrumscale.org One of my customer already upgraded their DR site. Is rollback advised? They will be running from DR site for a day in another week. On Sat, Jun 8, 2019, 03:37 Felipe Knop wrote: Zach, This appears to be affecting all Scale versions, including 5.0.2 -- but only when moving to the new 3.10.0-957.21.2 kernel. (3.10.0-957 is not impacted) Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 Zachary Mance ---06/07/2019 05:51:37 PM---Which versions of Spectrum Scale versions are you referring to? 5.0.2-3? --------------------------- From: Zachary Mance To: gpfsug main discussion list Date: 06/07/2019 05:51 PM Subject: [EXTERNAL] Re: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 Sent by: gpfsug-discuss-bounces at spectrumscale.org Which versions of Spectrum Scale versions are you referring to? 5.0.2-3? --------------------------------------------------------------------------------------------------------------- Zach Mance zmance at ucar.edu (303) 497-1883 HPC Data Infrastructure Group / CISL / NCAR --------------------------------------------------------------------------------------------------------------- On Fri, Jun 7, 2019 at 3:45 PM Felipe Knop wrote: All, There have been reported issues (including kernel crashes) on Spectrum Scale with the latest RHEL7.6 kernel 3.10.0-957.21.2. Please consider delaying upgrades to this kernel until further information is provided. Thanks, Felipe ---- Felipe Knop knop at us.ibm.com GPFS Development and Security IBM Systems IBM Building 008 2455 South Rd, Poughkeepsie, NY 12601 (845) 433-9314 T/L 293-9314 _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss[attachment "graycol.gif" deleted by Felipe Knop/Poughkeepsie/IBM] _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss _______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=oNT2koCZX0xmWlSlLblR9Q&m=yWFAPveNSlMNNB5WT9HWp-2gQFFcYeCEsQdME5UvoGw&s=xZFqiCTjE-2e_6gM6MkzBcALK0hp-3ZquA7bt2GIjt8&e= -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: graycol.gif Type: image/gif Size: 105 bytes Desc: not available URL: From Renar.Grunenberg at huk-coburg.de Tue Jun 11 13:27:46 2019 From: Renar.Grunenberg at huk-coburg.de (Grunenberg, Renar) Date: Tue, 11 Jun 2019 12:27:46 +0000 Subject: [gpfsug-discuss] WG: Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2 In-Reply-To: References: