[Linux-cluster] node fenced by dlm_controld on a clean shutdown

Jacek Konieczny jajcus at jajcus.net
Mon Nov 19 09:39:20 UTC 2012


On Mon, Nov 19, 2012 at 10:16:48AM +0100, Jacek Konieczny wrote:
> It goes like that:
> - resources using the shared storage are properly stopped by Pacemaker.
> - DRBD is cleanly demoted and unconfigured by Pacemaker
> - Pacemaker cleanly exits
> - CLVMD is stopped.
> - dlm_controld is stopped
> - corosync is being stopped
> 
> and at this point the node is fenced (rebooted) by dlm_controld on
> the other node. I would expect it to continue with a clean shutdown.
> 
> Any idea how to debug/fix it?
> Is this '541 cpg_dispatch error 9' the problem?

I found a workaround: I added a 10-second pause between stopping
dlm_controld and stopping corosync. The node now shuts down cleanly (it is
not fenced). The '541 cpg_dispatch error 9' message still appears in the
logs, though.
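
A minimal sketch of that ordering, for illustration only. The "service ...
stop" commands and the names stop_in_order and SHUTDOWN_STEPS are
hypothetical placeholders, not taken from the actual init scripts; only the
10-second pause between the dlm_controld and corosync steps comes from the
workaround described above.

```python
import subprocess
import time


def stop_in_order(steps, run=subprocess.check_call, sleep=time.sleep):
    """Run each stop command in order, pausing after a step when requested.

    steps is a list of (argv, pause_seconds) tuples. The run and sleep
    callables are parameters so the ordering can be checked without
    touching real services.
    """
    for argv, pause_seconds in steps:
        run(argv)  # raises if the command fails, which aborts the sequence
        if pause_seconds > 0:
            sleep(pause_seconds)


# Hypothetical commands: substitute whatever stops these daemons on your
# distribution. Only the order and the pause reflect the workaround.
SHUTDOWN_STEPS = [
    (["service", "clvmd", "stop"], 0),
    (["service", "dlm_controld", "stop"], 10),  # wait before stopping corosync
    (["service", "corosync", "stop"], 0),
]
```

Calling stop_in_order(SHUTDOWN_STEPS) as root would run the three commands
in sequence, sleeping 10 seconds after the dlm_controld step. This only
illustrates the timing; it does not explain why the pause avoids the fence.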

Greets,
        Jacek



