[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]

Re: [Linux-cluster] quorum dissolved but resources are still alive



Hi,
 replying to your original email ...
the problem i can see in the logs is the line:
 openais[971]: [SYNC ] This node is within the primary component and will
provide service.

as you have expected_votes=2 and node votes=1 this shouldn't happen, so it
looks as a bug

P.S.
 If you had fencing configured - when node2 is back it would fence node1
and start the services

On Tue, 31 May 2011 18:22:01 +0200, Martin Claudio
<claudio martin abilene it> wrote:
> Hi,
> 
> i have a problem with a 2 node cluster with this conf:
> 
> 
>          <clusternodes>
>                  <clusternode name="TEST1" nodeid="1" votes="1">
>                          <fence/>
>                  </clusternode>
>                  <clusternode name="TEST2" nodeid="2" votes="2">
>                          <fence/>
>                  </clusternode>
>          </clusternodes>
>          <cman expected_votes="2"/>
> 
> 
> all is ok but when node 2 goes down quorum dissolved but resources is 
> not stopped, here log:
> 
> 
> clurgmgrd[1302]: <emerg> #1: Quorum Dissolved
> kernel: dlm: closing connection to node 2
> openais[971]: [CLM  ]       r(0) ip(10.1.1.11)
> openais[971]: [CLM  ] Members Left:
> openais[971]: [CLM  ]       r(0) ip(10.1.1.12)
> openais[971]: [CLM  ] Members Joined:
> openais[971]: [CMAN ] quorum lost, blocking activity
> openais[971]: [CLM  ] CLM CONFIGURATION CHANGE
> openais[971]: [CLM  ] New Configuration:
> openais[971]: [CLM  ]       r(0) ip(10.1.1.11)
> openais[971]: [CLM  ] Members Left:
> openais[971]: [CLM  ] Members Joined:
> openais[971]: [SYNC ] This node is within the primary component and will

> provide service.
> openais[971]: [TOTEM] entering OPERATIONAL state.
> openais[971]: [CLM  ] got nodejoin message 10.1.1.11
> openais[971]: [CPG  ] got joinlist message from node 1
> ccsd[964]: Cluster is not quorate.  Refusing connection.
> 
> 
> cluster recognized that quorum is dissolved but resource manager doesn't

> stop resource, ip address is still alive, filesystem is still mount, 
> i'll expect an emergency shutdown but it does not happen....


[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]