[gpfsug-discuss] AFM - how to update directories with deleted files during prefetch
Venkateswara R Puvvada
vpuvvada at in.ibm.com
Fri Jun 30 13:35:18 BST 2017
What is the version of GPFS ?
>Mostly I want to update the directories to make AFM cache aware of file
deletions on home. On home I can use a policy run to find all directories
which changed since the last >update and pass them to prefetch on AFM
cache.
AFM prefetch has undocumented option to delete files from cache without
doing lookup from home. It supports all types of list files. Find all
deleted file from home and prefetch at cache to delete them.
mmafmctl device prefetch -j fileset --list-file
<list-of-files-to-be-deleted-from-cache> --delete
>Wish – maybe an RFE?
>I can use policy runs to collect all changed items on home since the last
update. I wish that I can pass this list to afm prefetch to do all updates
on AFM cache, too. Same >as backup tools use the list to do incremental
backups.
This feature support was already added but undocumented today. This
feature will be externalized in future releases.
~Venkat (vpuvvada at in.ibm.com)
From: "Billich Heinrich Rainer (PSI)" <heiner.billich at psi.ch>
To: "gpfsug-discuss at spectrumscale.org"
<gpfsug-discuss at spectrumscale.org>
Date: 06/30/2017 03:37 PM
Subject: [gpfsug-discuss] AFM - how to update directories with
deleted files during prefetch
Sent by: gpfsug-discuss-bounces at spectrumscale.org
Hello
I have a short question about AFM prefetch and some more remarks regarding
AFM and it’s use for data migration. I understand that many of you have
done this for very large amounts of data and number of files. I would
welcome an input, comments or remarks. Sorry if this is a bit too long for
a mailing list.
Short:
How can I tell an AFM cache to update a directory when I do prefetch? I
know about ‘find .’ or ‘ls –lsR’ but this really is no option for us as it
takes too long. Mostly I want to update the directories to make AFM cache
aware of file deletions on home. On home I can use a policy run to find
all directories which changed since the last update and pass them to
prefetch on AFM cache.
I know that I can find some workaround based on the directory list, like
an ‘ls –lsa’ just for those directories, but this doesn’t sound very
efficient. And depending on cache effects and timeout settings it may work
or not (o.k. – it will work most time).
We do regular file deletions and will accumulated millions of deleted
files on cache over time if we don’t update the directories to make AFM
cache aware of the deletion.
Background:
We will use AFM to migrate data on filesets to another cluster. We have to
do this several times in the next few months, hence I want to get a
reliable and easy to use procedure. The old system is home, the new system
is a read-only AFM cache. We want to use ‘mmafmctl prefetch’ to move the
data. Home will be in use while we run the migration. Once almost all data
is moved we do a (short) break for a last sync and make the read-only AFM
cache a ‘normal’ fileset. During the break I want to use policy runs and
prefetch only and no time consuming ‘ls –lsr’ or ‘find .’ I don’t want to
use this metadata intensive posix operation during operation, either.
More general:
AFM can be used for data migration. But I don’t see how to use it
efficiently. How to do incremental transfers, how to ensure that the we
really have identical copies before we switch and how to keep the switch
time short , i.e. the time when both old and new aren’t accessible for
clients,
Wish – maybe an RFE?
I can use policy runs to collect all changed items on home since the last
update. I wish that I can pass this list to afm prefetch to do all updates
on AFM cache, too. Same as backup tools use the list to do incremental
backups.
And a tool to create policy lists of home and cache and to compare the
lists would be nice, too. As you do this during the break/switch it should
be fast and reliable and leave no doubts.
Kind regards,
Heiner
_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://gpfsug.org/pipermail/gpfsug-discuss_gpfsug.org/attachments/20170630/17b621dc/attachment-0002.htm>
More information about the gpfsug-discuss
mailing list