[Linux-cluster] What does FAIL_STOP_WAIT state mean for clvmd and rgmanager

Lon Hohberger lhh at redhat.com
Thu Sep 9 18:03:19 UTC 2010


On Mon, 2010-08-23 at 17:58 +1000, Joel Heenan wrote:
> Can someone please explain what this means and what you can do to get
> out of it:
> 
> [root@cluster-host ~]# group_tool -v
> type             level name       id       state node id local_done
> fence            0     default    00010003 JOIN_STOP_WAIT 1 100050001 1
> [1 1 2 3 4]
> dlm              1     clvmd      00020003 FAIL_STOP_WAIT 2 200030003 1
> [1 2 3 4]
> dlm              1     rgmanager  00030003 FAIL_STOP_WAIT 2 200030003 1
> [1 2 3 4]

It looks like fencing has not completed: the fence group is still in
JOIN_STOP_WAIT.  How do you have two node 1 entries in the fence group's
member list ([1 1 2 3 4])?
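
To spot this kind of thing quickly, here is a minimal Python sketch that
reads pasted group_tool -v output and reports each group's state plus any
node ID listed more than once.  It assumes the column layout shown in the
quoted output (type, level, name, id, state, ...) followed by a bracketed
member list; parse_groups and duplicate_members are hypothetical helper
names for illustration, not part of group_tool or the cluster tools.

```python
import re
from collections import Counter

# Sample input copied from the quoted output, with wrapped lines rejoined.
SAMPLE = """\
type             level name       id       state node id local_done
fence            0     default    00010003 JOIN_STOP_WAIT 1 100050001 1
[1 1 2 3 4]
dlm              1     clvmd      00020003 FAIL_STOP_WAIT 2 200030003 1
[1 2 3 4]
dlm              1     rgmanager  00030003 FAIL_STOP_WAIT 2 200030003 1
[1 2 3 4]
"""


def parse_groups(text):
    """Return one dict per group: type, name, state, members (assumed layout)."""
    groups = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("type"):
            continue  # skip blank lines and the header row
        member_match = re.fullmatch(r"\[([\d\s]*)\]", line)
        if member_match:
            # A bracketed line is the member list of the preceding group.
            if groups:
                groups[-1]["members"] = [int(x) for x in member_match.group(1).split()]
            continue
        fields = line.split()
        if len(fields) >= 5:
            groups.append({
                "type": fields[0],
                "name": fields[2],
                "state": fields[4],
                "members": [],
            })
        # Shorter lines (e.g. a wrapped trailing column) are ignored.
    return groups


def duplicate_members(group):
    """Return the sorted node IDs that appear more than once in a group."""
    counts = Counter(group["members"])
    return sorted(node for node, n in counts.items() if n > 1)


for g in parse_groups(SAMPLE):
    dups = duplicate_members(g)
    note = f" duplicate node IDs: {dups}" if dups else ""
    print(f"{g['type']}/{g['name']}: state={g['state']} members={g['members']}{note}")
```

This only flags the symptom (a repeated node ID and groups sitting in a
*_WAIT state); it does not say why the fence group ended up that way.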

-- Lon



