[Linux-cluster] Problem in clvmd/dlm_recoverd

David Teigland teigland at redhat.com
Fri Nov 14 16:26:49 UTC 2008


On Fri, Nov 14, 2008 at 10:00:13AM +0000, Nuno Fernandes wrote:
> 22236 [dlm_recoverd]              dlm_wait_function
> 25097 [dlm_recoverd]              dlm_wait_function

dlm recovery appears to be stuck; this is usually due to a problem at the
network level.  The recovery seems to be caused by a node starting clvmd.

sysrq-t backtraces from all the nodes could confirm some of this, and
adding <dlm log_debug="1"/> to cluster.conf would give us more information
the next time it happens.

Dave




More information about the Linux-cluster mailing list