------------------------------
Message: 8
Date: Mon, 01 Oct 2007 10:20:51 -0400
From: Lon Hohberger <lhh redhat com>
Subject: Re: [Linux-cluster] service can not be relocated
To: linux clustering <
linux-cluster redhat com>
Message-ID: <1191248451 4477 15 camel ayanami boston devel redhat com
>
Content-Type: text/plain
Now the service was on node01.
I did a test as follows:
I unplugged network cable of node01 for a while then plugged in again.
Service cman was terminated on node02 suddenly,
and it could not stop on node02.
logs on node02:
node02 openais[2813]: [CLM ] CLM CONFIGURATION CHANGE
node02 openais[2813]: [CLM ] New Configuration:
node02 openais[2813]: [CLM ] r(0) ip(192.168.0.221
)
node02 openais[2813]: [CLM ] Members Left:
node02 openais[2813]: [CLM ] Members Joined:
node02 openais[2813]: [SYNC ] This node is within the primary component and will provide service.
node02 openais[2813]: [CLM ] CLM CONFIGURATION CHANGE
node02 openais[2813]: [CLM ] New Configuration:
node02 openais[2813]: [CLM ] r(0) ip(192.168.0.219)
node02 openais[2813]: [CLM ] r(0) ip(
192.168.0.221)
node02 openais[2813]: [CLM ] Members Left:
node02 openais[2813]: [CLM ] Members Joined:
node02 openais[2813]: [CLM ] r(0) ip(192.168.0.219)
node02 openais[2813]: [SYNC ] This node is within the primary component and will provide service.
node02 openais[2813]: [TOTEM] entering OPERATIONAL state.
node02 openais[2813]: [MAIN ] Killing node node01 because it has rejoined the cluster without cman_tool join
node02 openais[2813]: [CMAN ] cman killed by node 2 for reason 3
node02 dlm_controld[2843]: groupd is down, exiting
node02 kernel: dlm: closing connection to node 1
node02 gfs_controld[2849]: groupd_dispatch error -1 errno 11
node02 gfs_controld[2849]: groupd connection died
node02 gfs_controld[2849]: cluster is down, exiting
node02 ccsd[2807]: Unable to connect to cluster infrastructure after 30 seconds.
node02 ccsd[2807]: Unable to connect to cluster infrastructure after 60 seconds.
node02 ccsd[2807]: Unable to connect to cluster infrastructure after 90 seconds.