[Linux-cluster] GFS2 mounts taking a *very* long time

Yevheniy Demchenko zheka at uvt.cz
Fri Jan 13 00:52:04 UTC 2012


Hi!
This patched version of dlm will probably resolve your issue; please try it:
http://www.bosson.eu/temp/dlm-kmod-1.0-1.el6.src.rpm
See the detailed description posted to the list earlier (Subject: [Linux-cluster] [PATCH] dlm: faster dlm recovery).
And yes, with unpatched dlm, mount and umount times are proportional to N*N, where N is the number of files.
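
To make the N*N relationship concrete, here is a rough back-of-the-envelope sketch in Python. It is not a measurement of dlm: it only assumes that cost grows as N*N, and the function name and the 10,000-file baseline are hypothetical, chosen for illustration.

```python
def relative_cost(n_files, baseline_files):
    """Estimated cost at n_files relative to baseline_files.

    Assumption: cost is proportional to N*N (quadratic in the file count).
    This is an illustration only, not derived from dlm timings.
    """
    return (n_files / baseline_files) ** 2


if __name__ == "__main__":
    baseline = 10_000  # hypothetical reference filesystem size
    for n in (10_000, 100_000, 1_000_000):
        ratio = relative_cost(n, baseline)
        print(f"{n:>9} files: about {ratio:,.0f}x the cost of {baseline} files")
```

Under that assumption, a filesystem with 100 times more files costs about 10,000 times as much, which would be consistent with only the filesystems holding >1M files showing very long delays.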

Sincerely,
Yevheniy Demchenko

On Jan 13, 2012, at 00:50 , Scooter Morris wrote:

> Greetings all,
>    We've got a 4-node cluster running RHEL 6.2.  As part of the cluster, we've got several gfs2 filesystems.  We've often noticed that when we reboot a single node in the cluster, the gfs2 mounts take a long time -- eventually getting the 120 second delay messages.  When we migrated to 6.2, the default mount script echoed the filesystem being mounted, and we discovered that the long delays were filesystem-dependent.  In particular, two filesystems were causing all of the problems, both of which had >1M files in them.  We also noticed that dlm_recoverd on one of the other nodes accumulates a lot of time when this is happening.  Is this expected?  Are there non-linear handshaking algorithms between the mounting node and the cluster that are dependent on the number of files?
> 
> Thanks in advance!
> 
> -- scooter
> 
