[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]

[Linux-cluster] Graceful recover after connectivity failure



I am using Centos5.1 with GNBD and GNBD fencing.

Following the failure of a cluster member - eg a temporary
loss of connectivity - which results in the node being
fenced, is there a clean way to re-join the cluster without
having to reboot the affected node?

I am finding that it is impossible to shut down or restart the
cluster components on the affected node, and even trying to force
a reboot from a ssh session just hangs.

There seems to be a chicken-and-egg situation - a gfs filesystem
cannot be unmounted if the node is fenced, and cman/clvmd cannot
be stopped/restarted if a filesystem is mounted.   Forcibly
trying to kill the cluster processes also fails.

-- Cliff


[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]