[Linux-cluster] Problem in clvmd/dlm_recoverd

Nuno Fernandes npf-mlists at eurotux.com
Fri Nov 14 21:53:13 UTC 2008


On Friday 14 November 2008 16:26:49 David Teigland wrote:
> On Fri, Nov 14, 2008 at 10:00:13AM +0000, Nuno Fernandes wrote:
> > 22236 [dlm_recoverd]              dlm_wait_function
> > 25097 [dlm_recoverd]              dlm_wait_function
>
> dlm recovery appears to be stuck; this is usually due to a problem at the
> network level.  The recovery seems to be caused by a node starting clvmd.
Hi,

I don't know if it helps, but groupd is using all available CPU, but only in 2 
of the nodes.

I don't know if it's required to be up.. but we've disabled IPV6..

snip of modprobe.conf:

alias net-pf-10 off

Best regards,
./npf

>
> sysrq-t backtraces from all the nodes could confirm some of this, and
> adding <dlm log_debug="1"/> to cluster.conf would give us more information
> the next time it happens.
>
> Dave

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20081114/871ed8e3/attachment.htm>


More information about the Linux-cluster mailing list