[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]

Re: [Linux-cluster] OpenAIS hangs at shutdown



On 08/23/2010 11:15 AM, Andrés Mauricio Mujica Zalamea wrote:

Hi, i'm having an issue with OpenAIS after an update from RHEL 5.3 to
5.5.

Since the update process the openais service hangs when i try to stop
it, openais gets stuck on the no connection error and the only way to
shutdown the server is by force.

These are some relevant logs...


Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] The token was lost in
the OPERATIONAL state.
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Receive multicast
socket recv buffer size (320000 bytes).
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Transmit multicast
socket send buffer size (262142 bytes).
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] entering GATHER state
from 2.
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Creating commit token
because I am the rep.
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Saving state aru 4d
high seq received 4d
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Storing new sequence id
for ring 240
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] entering COMMIT state.
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] entering RECOVERY
state.
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] position [0] member
10.117.157.135:
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] previous ring seq 572
rep 10.117.157.135
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] aru 4d high delivered
4d received flag 1
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] position [1] member
10.117.157.136:
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] previous ring seq 572
rep 10.117.157.135
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] aru 4d high delivered
4d received flag 1
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Did not need to
originate any messages in recovery.
Aug 18 23:45:39 cluster01 openais[3370]: [TOTEM] Sending initial ORF
token
Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] CLM CONFIGURATION
CHANGE
Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] New Configuration:
Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] r(0) ip(10.117.157.135)
Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] r(0) ip(10.117.157.136)
Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] Members Left:
Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] Members Joined:
Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] CLM CONFIGURATION
CHANGE
Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] New Configuration:
Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] r(0) ip(10.117.157.135)
Aug 18 23:45:39 cluster01 openais[3370]: [CLM ] r(0) ip(10.117.157.136)
Aug 18 23:45:40 cluster01 openais[3370]: [CLM ] Members Left:
Aug 18 23:45:40 cluster01 openais[3370]: [CLM ] Members Joined:
Aug 18 23:45:40 cluster01 openais[3370]: [SYNC ] This node is within the
primary component and will provide service.
Aug 18 23:45:40 cluster01 openais[3370]: [TOTEM] entering OPERATIONAL
state.
Aug 18 23:45:40 cluster01 openais[3370]: [CLM ] got nodejoin message
10.117.157.135
Aug 18 23:45:40 cluster01 openais[3370]: [CLM ] got nodejoin message
10.117.157.136
Aug 18 23:45:40 cluster01 openais[3370]: [CPG ] got joinlist message
from node 2
Aug 18 23:45:40 cluster01 openais[3370]: [CPG ] got joinlist message
from node 1



which rpm version are you using?

https://bugzilla.redhat.com/show_bug.cgi?id=566467 may be relevant here but was fixed in openais-0.80.6-16.el5.

Thanks
-steve



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]