Re: [Linux-cluster] Service Recovery Failure

On 06/30/2011 01:57 AM, Rahul Borate wrote:
Hi all,

I just performed a test which failed miserably. I have two nodes,
node-1 and node-2.

Global file system /gfs is on node-1.

You do not have fencing configured.

On a clean shutdown, the node withdraws from the cluster, so the other node knows that it's safe to take over services. When a node simply disappears, the survivor cannot know what state the lost node is in. The survivor's only safe action is to block I/O and fence the lost node (to put it in a known state). I/O resumes after a successful fence, and only then.
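
To make the ordering concrete, here is a rough Python sketch of the decision the surviving node makes. It is only an illustration of the logic, not how the cluster software is implemented: the names PeerState, recover, and fence_peer are made up for this example, and fence_peer stands in for whatever fence agent you end up configuring.

```python
from enum import Enum, auto


class PeerState(Enum):
    CLEAN_LEAVE = auto()  # peer announced a clean shutdown and withdrew
    UNKNOWN = auto()      # peer vanished without notice


def recover(peer_state, fence_peer, log):
    """Return True once it is safe to take over services.

    fence_peer is a hypothetical callable that returns True only when
    the lost node has been confirmed fenced. log collects the steps
    taken, in order, so the sequence can be inspected.
    """
    if peer_state is PeerState.CLEAN_LEAVE:
        # The peer withdrew cleanly, so its state is already known.
        log.append("take over services")
        return True

    # The peer's state is unknown: stop touching shared storage first.
    log.append("block I/O")

    if not fence_peer():
        # Without a confirmed fence, resuming would risk corruption.
        log.append("fence failed, I/O stays blocked")
        return False

    # Only after a successful fence is it safe to carry on.
    log.append("resume I/O")
    log.append("take over services")
    return True
```

With no fencing configured, the fence step can never succeed, which corresponds to the failed-fence branch above: I/O stays blocked and the service is never recovered.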


