[Cluster-devel] fence daemon problems
Dietmar Maurer
dietmar at proxmox.com
Wed Oct 3 16:55:55 UTC 2012
> The intention of that is to prevent an inquorate node/partition from killing a
> quorate group of nodes that are running normally. e.g. if a 5 node cluster is
> partitioned into 2/3 or 1/4. You don't want the 2 or 1 node group to fence
> the 3 or 4 nodes that are fine.
sure, I understand that.
> The difficult cases, which I think you're seeing, are partitions where no group
> has quorum, e.g. 2/2. In this case we do nothing, and the user has to resolve
> it by resetting some of the nodes
The problem with that is that those 'difficult' cases are very likely. For example
a switch reboot results in that state if you do not have redundant network (yes,
I know that this setup is simply wrong).
And things get worse, because it is not possible to reboot such nodes, because
rgmanager shutdown simply hangs. Is there any way to avoid that, so that it is at
least possible to reboot those nodes?
More information about the Cluster-devel
mailing list