[gpfsug-discuss] naive question about rsync: run it on a client or on NSD server?

Simon Thompson S.J.Thompson at bham.ac.uk
Fri Feb 14 14:56:30 GMT 2020


I wouldn't run it on an NSD server. Ideally you want to avoid running other workloads on those nodes.

If you are running on clients, you might also want to look at mpifileutils (https://github.com/hpc/mpifileutils), which uses MPI to parallelise the find and copy.
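
To illustrate the general idea of splitting the tree so that several copies run at once, here is a minimal Python sketch that launches one plain rsync per top-level entry from a client node. This is not how mpifileutils works internally (it uses MPI and walks the tree in parallel); it is just a simple stand-in. The function names, the worker count and the example paths are all hypothetical, and it assumes the top-level entries are reasonably balanced in size.

```python
import os
import subprocess
from concurrent.futures import ThreadPoolExecutor


def build_rsync_commands(src_root, dst_root, entries):
    """Return one rsync command (as an argv list) per top-level entry."""
    dst = dst_root.rstrip("/") + "/"
    cmds = []
    for name in sorted(entries):
        # No trailing slash on the source, so rsync recreates the entry
        # itself under dst. -a (archive) keeps the standard POSIX metadata.
        cmds.append(["rsync", "-a", os.path.join(src_root, name), dst])
    return cmds


def _run(cmd):
    """Run one command and return its exit code."""
    return subprocess.run(cmd).returncode


def run_parallel(cmds, workers=4, dry_run=True):
    """Run the commands concurrently, or just return them as strings."""
    if dry_run:
        return [" ".join(c) for c in cmds]
    # Threads are enough here: each one only waits on an rsync child process.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(_run, cmds))


if __name__ == "__main__":
    # Hypothetical mount points and directory names, for illustration only.
    demo = build_rsync_commands("/gpfs/old", "/gpfs/new", ["proj_b", "proj_a"])
    for line in run_parallel(demo, dry_run=True):
        print(line)
```

In real use you would pass os.listdir() of the source root as the entries and set dry_run=False, then check the returned exit codes. A final single rsync pass over the whole tree afterwards is a cheap way to catch anything that was missed.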

Simon

On 14/02/2020, 14:25, "gpfsug-discuss-bounces at spectrumscale.org on behalf of giovanni.bracco at enea.it" wrote:

    We must replicate about 100 TB of data between two filesystems backed by
    two different storage systems (DDN9900 and DDN7990), both connected to the
    same NSD servers (6 of them), and we plan to use rsync.
    
    No special GPFS attributes are involved, just the standard POSIX ones, so
    we plan to use standard rsync.
    
    The question:
    is there any advantage in running rsync on one of the NSD servers, or
    is it better to run it on a client?
    
    The environment:
    GPFS 4.2.3.19; NSD servers on CentOS 7.4; clients mostly CentOS 6.4
    (connected by IB QDR) and CentOS 7.3 (connected by OPA); connection
    between the NSD servers and storage over IB QDR.
    
    Giovanni
    
    -- 
    Giovanni Bracco
    phone  +39 351 8804788
    E-mail  giovanni.bracco at enea.it
    WWW http://www.afs.enea.it/bracco