[Linux-cluster] Problem in clvmd/dlm_recoverd

David Teigland teigland at redhat.com
Mon Nov 17 14:49:16 UTC 2008


On Mon, Nov 17, 2008 at 09:32:46AM +0000, Nuno Fernandes wrote:
> On Friday 14 November 2008 22:05:15 David Teigland wrote:
> > On Fri, Nov 14, 2008 at 09:53:13PM +0000, Nuno Fernandes wrote:
> > > > On Fri, Nov 14, 2008 at 10:00:13AM +0000, Nuno Fernandes wrote:
> > > > dlm recovery appears to be stuck; this is usually due to a problem at
> > > > the network level.  The recovery seems to be caused by a node starting
> > > > clvmd.
> > >
> > > Hi,
> > >
> > > I don't know if it helps, but groupd is using all available CPU, but
> > > only on 2 of the nodes.
> >
> > That sounds like https://bugzilla.redhat.com/show_bug.cgi?id=444529
> > which is fixed in 5.3.  I suspect that's the cause of your problems.
> >
> > Dave
> 
> Hi,
> 

> Is there any way I can get the servers unstuck without rebooting all of
> them at the same time?

Reboot just the nodes where groupd (or dlm_controld or gfs_controld) is
running at 100% CPU.
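
To find those nodes, something like the following could be run on each
one.  It is only a rough sketch and not part of any cluster tool: the
helper names (busy_daemons, sample_ps) and the 90% threshold are made up
for illustration.  It also assumes a procps-style ps, whose pcpu column
is an average over the life of the process, so a daemon that only
recently started spinning may read lower than what top shows.

```python
import subprocess

# Daemon names taken from the advice above.
DAEMONS = {"groupd", "dlm_controld", "gfs_controld"}


def busy_daemons(ps_output, threshold=90.0):
    """Return (name, cpu) pairs for cluster daemons at or above threshold.

    ps_output is the text printed by `ps -eo pcpu,comm`.
    threshold is an arbitrary cut-off (an assumption, tune as needed).
    """
    hits = []
    for line in ps_output.splitlines():
        parts = line.split(None, 1)
        if len(parts) != 2:
            continue
        try:
            cpu = float(parts[0])
        except ValueError:
            continue  # header line or anything else that is not a number
        name = parts[1].strip()
        if name in DAEMONS and cpu >= threshold:
            hits.append((name, cpu))
    return hits


def sample_ps():
    """Run ps on the local node and return its output as text."""
    result = subprocess.run(
        ["ps", "-eo", "pcpu,comm"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout
```

Calling busy_daemons(sample_ps()) on a node returns an empty list if
none of the three daemons is over the threshold; a non-empty list marks
that node as a candidate for reboot.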

Dave



