[Linux-cluster] Nodes are not joining to the cluster
Srija
swap_project at yahoo.com
Thu Mar 3 15:37:12 UTC 2011
Thanks for your reply.
--- On Thu, 3/3/11, Seb <mailing.sr at gmail.com> wrote:
>
> There is no <quorumd> section in your config
> file?
No
> Have you been able to identify a quorum disk on the
> nodes?
There is no quorum disk allocated for this configuration. As mentioned,
only I know, quotum was alocated through command line etc.
>
> The host-priv.domain.org
> is in your /etc/hosts? on all nodes?
>
Yes.
> Why have they been rebooted? for
> maintenance/upgrade?
>
For maintenance. But before the reboot, the cluster service on that node was not shutdown.
> Any iptable used?
>
No.
> Could you please provide the logs showing the start
> of the cluster service?
>
I am mentioning here one of the server's log , when ccs started.
_______________________________________________________________________________________________________
Mar 1 20:20:39 host ccsd[5287]: Starting ccsd 2.0.115:
Mar 1 20:20:39 host ccsd[5287]: Built: May 25 2010 04:32:00
Mar 1 20:20:39 host ccsd[5287]: Copyright (C) Red Hat, Inc. 2004 All rights reserved.
Mar 1 20:20:39 host ccsd[5287]: cluster.conf (cluster name = xxxxxxx, version = 21) found.
Mar 1 20:20:40 host openais[5302]: [MAIN ] AIS Executive Service RELEASE 'subrev 1887 version 0.80.6'
Mar 1 20:20:40 host openais[5302]: [MAIN ] Copyright (C) 2002-2006 MontaVista Software, Inc and contributors.
Mar 1 20:20:40 host openais[5302]: [MAIN ] Copyright (C) 2006 Red Hat, Inc.
Mar 1 20:20:40 host openais[5302]: [MAIN ] AIS Executive Service: started and ready to provide service.
Mar 1 20:20:40 host openais[5302]: [MAIN ] Using default multicast address of xxx.xxx.xxx.xx
Mar 1 20:20:40 host openais[5302]: [TOTEM] Token Timeout (10000 ms) retransmit timeout (495 ms)
Mar 1 20:20:40 host openais[5302]: [TOTEM] token hold (386 ms) retransmits before loss (20 retrans)
Mar 1 20:20:40 host openais[5302]: [TOTEM] join (60 ms) send_join (0 ms) consensus (20000 ms) merge (200 ms)
Mar 1 20:20:40 host openais[5302]: [TOTEM] downcheck (1000 ms) fail to recv const (50 msgs)
Mar 1 20:20:40 host openais[5302]: [TOTEM] seqno unchanged const (30 rotations) Maximum network MTU 1402
Mar 1 20:20:40 host openais[5302]: [TOTEM] window size per rotation (50 messages) maximum messages per rotation (17 messages)
Mar 1 20:20:40 host openais[5302]: [TOTEM] send threads (0 threads)
Mar 1 20:20:40 host openais[5302]: [TOTEM] RRP token expired timeout (495 ms)
Mar 1 20:20:40 host openais[5302]: [TOTEM] RRP token problem counter (2000 ms)
Mar 1 20:20:40 host openais[5302]: [TOTEM] RRP threshold (10 problem count)
Mar 1 20:20:40 host openais[5302]: [TOTEM] RRP mode set to none.
Mar 1 20:20:40 host openais[5302]: [TOTEM] heartbeat_failures_allowed (0)
Mar 1 20:20:40 host openais[5302]: [TOTEM] max_network_delay (50 ms)
Mar 1 20:20:40 host openais[5302]: [TOTEM] HeartBeat is Disabled. To enable set heartbeat_failures_allowed > 0
Mar 1 20:20:40 host openais[5302]: [TOTEM] Receive multicast socket recv buffer size (262142 bytes).
Mar 1 20:20:40 host openais[5302]: [TOTEM] Transmit multicast socket send buffer size (262142 bytes).
Mar 1 20:20:40 host openais[5302]: [TOTEM] The network interface [192.168.xxx.x] is now up.
Mar 1 20:20:40 host openais[5302]: [TOTEM] Created or loaded sequence id 6160.192.168.xxx.x for this ring.
Mar 1 20:20:40 host openais[5302]: [TOTEM] entering GATHER state from 15.
Mar 1 20:20:40 host openais[5302]: [CMAN ] CMAN 2.0.115 (built May 25 2010 04:32:02) started
Mar 1 20:20:40 host openais[5302]: [MAIN ] Service initialized 'openais CMAN membership service 2.01'
Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais extended virtual synchrony service'
Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais cluster membership service B.01.01'
Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais availability management framework B.01.01'
Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais checkpoint service B.01.01'
Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais event service B.01.01'
Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais distributed locking service B.01.01'
Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais message service B.01.01'
Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais configuration service'
Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais cluster closed process group service v1.01'
Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais cluster config database access v1.01'
Mar 1 20:20:40 host openais[5302]: [SYNC ] Not using a virtual synchrony filter.
Mar 1 20:20:40 host openais[5302]: [TOTEM] Creating commit token because I am the rep.
Mar 1 20:20:40 host openais[5302]: [TOTEM] Saving state aru 0 high seq received 0
Mar 1 20:20:40 host openais[5302]: [TOTEM] Storing new sequence id for ring 1814
Mar 1 20:20:40 host openais[5302]: [TOTEM] entering COMMIT state.
Mar 1 20:20:40 host openais[5302]: [TOTEM] entering RECOVERY state.
Mar 1 20:20:40 host openais[5302]: [TOTEM] position [0] member 192.168.xxx.x:
Mar 1 20:20:40 host openais[5302]: [TOTEM] previous ring seq 6160 rep 192.168.xxx.x
Mar 1 20:20:40 host openais[5302]: [TOTEM] aru 0 high delivered 0 received flag 1
Mar 1 20:20:40 host openais[5302]: [TOTEM] Did not need to originate any messages in recovery.
Mar 1 20:20:40 host openais[5302]: [TOTEM] Sending initial ORF token
Mar 1 20:20:40 host openais[5302]: [CLM ] CLM CONFIGURATION CHANGE
Mar 1 20:20:40 host openais[5302]: [CLM ] New Configuration:
Mar 1 20:20:40 host openais[5302]: [CLM ] Members Left:
Mar 1 20:20:40 host openais[5302]: [CLM ] Members Joined:
Mar 1 20:20:40 host openais[5302]: [CLM ] CLM CONFIGURATION CHANGE
Mar 1 20:20:40 host openais[5302]: [CLM ] New Configuration:
Mar 1 20:20:40 host openais[5302]: [CLM ] r(0) ip(192.168.xxx.x)
Mar 1 20:20:40 host openais[5302]: [CLM ] Members Left:
Mar 1 20:20:40 host openais[5302]: [CLM ] Members Joined:
Mar 1 20:20:40 host openais[5302]: [CLM ] r(0) ip(192.168.xxx.x)
Mar 1 20:20:40 host openais[5302]: [SYNC ] This node is within the primary component and will provide service.
Mar 1 20:20:40 host openais[5302]: [TOTEM] entering OPERATIONAL state.
Mar 1 20:20:40 host openais[5302]: [CLM ] got nodejoin message 192.168.xxx.x
Mar 1 20:20:41 host ccsd[5287]: Initial status:: Inquorate
Mar 1 20:20:41 host ccsd[5287]: Cluster is not quorate. Refusing connection.
Mar 1 20:20:41 host ccsd[5287]: Error while processing connect: Connection refused
Mar 1 20:20:42 host ccsd[5287]: Cluster is not quorate. Refusing connection.
Mar 1 20:20:42 host ccsd[5287]: Error while processing connect: Connection refused
Mar 1 20:20:42 host ccsd[5287]: Cluster is not quorate. Refusing connection.
Mar 1 20:20:42 host ccsd[5287]: Error while processing connect: Connection refused
_______________________________________________________________________________________________________
Thanks again
More information about the Linux-cluster
mailing list