[Linux-cluster] Nodes are not joining to the cluster

Srija swap_project at yahoo.com
Thu Mar 3 15:37:12 UTC 2011


Thanks for your reply.

--- On Thu, 3/3/11, Seb <mailing.sr at gmail.com> wrote:
 >  
> There is no <quorumd> section in your config
> file?

No

> Have you been able to identify a quorum disk on the
> nodes?

There is no quorum disk allocated for this configuration. As mentioned,
all I know is that the quorum was allocated through the command line, etc.
>  
> The host-priv.domain.org
> is in your /etc/hosts? on all nodes?
>  

Yes.

> Why have they been rebooted? for
> maintenance/upgrade?
>  

For maintenance. But before the reboot, the cluster service on that node was not shut down.

> Any iptable used?
>  

No.

> Could you please provide the logs showing the start
> of the cluster service?
>  

Here is the log from one of the servers, from when ccs started.
_______________________________________________________________________________________________________

Mar  1 20:20:39 host ccsd[5287]: Starting ccsd 2.0.115:
Mar  1 20:20:39 host ccsd[5287]:  Built: May 25 2010 04:32:00
Mar  1 20:20:39 host ccsd[5287]:  Copyright (C) Red Hat, Inc.  2004  All rights reserved.
Mar  1 20:20:39 host ccsd[5287]: cluster.conf (cluster name = xxxxxxx, version = 21) found.
Mar  1 20:20:40 host openais[5302]: [MAIN ] AIS Executive Service RELEASE 'subrev 1887 version 0.80.6'
Mar  1 20:20:40 host openais[5302]: [MAIN ] Copyright (C) 2002-2006 MontaVista Software, Inc and contributors.
Mar  1 20:20:40 host openais[5302]: [MAIN ] Copyright (C) 2006 Red Hat, Inc.
Mar  1 20:20:40 host openais[5302]: [MAIN ] AIS Executive Service: started and ready to provide service.
Mar  1 20:20:40 host openais[5302]: [MAIN ] Using default multicast address of xxx.xxx.xxx.xx
Mar  1 20:20:40 host openais[5302]: [TOTEM] Token Timeout (10000 ms) retransmit timeout (495 ms)
Mar  1 20:20:40 host openais[5302]: [TOTEM] token hold (386 ms) retransmits before loss (20 retrans)
Mar  1 20:20:40 host openais[5302]: [TOTEM] join (60 ms) send_join (0 ms) consensus (20000 ms) merge (200 ms)
Mar  1 20:20:40 host openais[5302]: [TOTEM] downcheck (1000 ms) fail to recv const (50 msgs)
Mar  1 20:20:40 host openais[5302]: [TOTEM] seqno unchanged const (30 rotations) Maximum network MTU 1402
Mar  1 20:20:40 host openais[5302]: [TOTEM] window size per rotation (50 messages) maximum messages per rotation (17 messages)
Mar  1 20:20:40 host openais[5302]: [TOTEM] send threads (0 threads)
Mar  1 20:20:40 host openais[5302]: [TOTEM] RRP token expired timeout (495 ms)
Mar  1 20:20:40 host openais[5302]: [TOTEM] RRP token problem counter (2000 ms)
Mar  1 20:20:40 host openais[5302]: [TOTEM] RRP threshold (10 problem count)
Mar  1 20:20:40 host openais[5302]: [TOTEM] RRP mode set to none.
Mar  1 20:20:40 host openais[5302]: [TOTEM] heartbeat_failures_allowed (0)
Mar  1 20:20:40 host openais[5302]: [TOTEM] max_network_delay (50 ms)
Mar  1 20:20:40 host openais[5302]: [TOTEM] HeartBeat is Disabled. To enable set heartbeat_failures_allowed > 0
Mar  1 20:20:40 host openais[5302]: [TOTEM] Receive multicast socket recv buffer size (262142 bytes).
Mar  1 20:20:40 host openais[5302]: [TOTEM] Transmit multicast socket send buffer size (262142 bytes).
Mar  1 20:20:40 host openais[5302]: [TOTEM] The network interface [192.168.xxx.x] is now up.
Mar  1 20:20:40 host openais[5302]: [TOTEM] Created or loaded sequence id 6160.192.168.xxx.x for this ring.
Mar  1 20:20:40 host openais[5302]: [TOTEM] entering GATHER state from 15.
Mar  1 20:20:40 host openais[5302]: [CMAN ] CMAN 2.0.115 (built May 25 2010 04:32:02) started
Mar  1 20:20:40 host openais[5302]: [MAIN ] Service initialized 'openais CMAN membership service 2.01'
Mar  1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais extended virtual synchrony service'
Mar  1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais cluster membership service B.01.01'
Mar  1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais availability management framework B.01.01'
Mar  1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais checkpoint service B.01.01'
Mar  1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais event service B.01.01'
Mar  1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais distributed locking service B.01.01'
Mar  1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais message service B.01.01'
Mar  1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais configuration service'
Mar  1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais cluster closed process group service v1.01'
Mar  1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais cluster config database access v1.01'
Mar  1 20:20:40 host openais[5302]: [SYNC ] Not using a virtual synchrony filter.
Mar  1 20:20:40 host openais[5302]: [TOTEM] Creating commit token because I am the rep.
Mar  1 20:20:40 host openais[5302]: [TOTEM] Saving state aru 0 high seq received 0
Mar  1 20:20:40 host openais[5302]: [TOTEM] Storing new sequence id for ring 1814
Mar  1 20:20:40 host openais[5302]: [TOTEM] entering COMMIT state.
Mar  1 20:20:40 host openais[5302]: [TOTEM] entering RECOVERY state.
Mar  1 20:20:40 host openais[5302]: [TOTEM] position [0] member 192.168.xxx.x:
Mar  1 20:20:40 host openais[5302]: [TOTEM] previous ring seq 6160 rep 192.168.xxx.x
Mar  1 20:20:40 host openais[5302]: [TOTEM] aru 0 high delivered 0 received flag 1
Mar  1 20:20:40 host openais[5302]: [TOTEM] Did not need to originate any messages in recovery.
Mar  1 20:20:40 host openais[5302]: [TOTEM] Sending initial ORF token
Mar  1 20:20:40 host openais[5302]: [CLM  ] CLM CONFIGURATION CHANGE
Mar  1 20:20:40 host openais[5302]: [CLM  ] New Configuration:
Mar  1 20:20:40 host openais[5302]: [CLM  ] Members Left:
Mar  1 20:20:40 host openais[5302]: [CLM  ] Members Joined:
Mar  1 20:20:40 host openais[5302]: [CLM  ] CLM CONFIGURATION CHANGE
Mar  1 20:20:40 host openais[5302]: [CLM  ] New Configuration:
Mar  1 20:20:40 host openais[5302]: [CLM  ]         r(0) ip(192.168.xxx.x)
Mar  1 20:20:40 host openais[5302]: [CLM  ] Members Left:
Mar  1 20:20:40 host openais[5302]: [CLM  ] Members Joined:
Mar  1 20:20:40 host openais[5302]: [CLM  ]         r(0) ip(192.168.xxx.x)
Mar  1 20:20:40 host openais[5302]: [SYNC ] This node is within the primary component and will provide service.
Mar  1 20:20:40 host openais[5302]: [TOTEM] entering OPERATIONAL state.
Mar  1 20:20:40 host openais[5302]: [CLM  ] got nodejoin message 192.168.xxx.x
Mar  1 20:20:41 host ccsd[5287]: Initial status:: Inquorate
Mar  1 20:20:41 host ccsd[5287]: Cluster is not quorate.  Refusing connection.
Mar  1 20:20:41 host ccsd[5287]: Error while processing connect: Connection refused
Mar  1 20:20:42 host ccsd[5287]: Cluster is not quorate.  Refusing connection.
Mar  1 20:20:42 host ccsd[5287]: Error while processing connect: Connection refused
Mar  1 20:20:42 host ccsd[5287]: Cluster is not quorate.  Refusing connection.
Mar  1 20:20:42 host ccsd[5287]: Error while processing connect: Connection refused


_______________________________________________________________________________________________________
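
For anyone comparing this log against the other nodes, here is a minimal sketch of how the excerpt could be summarized. It assumes the syslog format shown above; the names `summarize` and `votes_needed` are hypothetical helpers, not part of any cluster tool. `votes_needed` assumes the usual simple-majority rule with one vote per node and no quorum disk, which may not match the actual cluster.conf.

```python
import re

# Patterns follow the log format in the excerpt above; all names here are hypothetical.
SECTION_RE = re.compile(r"\[CLM\s*\]\s+(New Configuration|Members Left|Members Joined):")
MEMBER_RE = re.compile(r"\[CLM\s*\]\s+r\(0\) ip\(([^)]+)\)")
NOT_QUORATE_RE = re.compile(r"ccsd\[\d+\]: Cluster is not quorate")


def summarize(lines):
    """Return (members of the last 'New Configuration' block, number of 'not quorate' refusals)."""
    members = []
    section = None
    refusals = 0
    for line in lines:
        m = SECTION_RE.search(line)
        if m:
            section = m.group(1)
            if section == "New Configuration":
                members = []  # a newer block supersedes the previous one
            continue
        m = MEMBER_RE.search(line)
        if m:
            # Only count addresses listed under 'New Configuration'
            if section == "New Configuration":
                members.append(m.group(1))
            continue
        if NOT_QUORATE_RE.search(line):
            refusals += 1
    return members, refusals


def votes_needed(expected_votes):
    """Assumed simple-majority rule: more than half of the expected votes."""
    return expected_votes // 2 + 1
```

Run over the excerpt above, `summarize` would report a single ring member and three refused connections, which fits a node that formed a ring by itself. With an assumed `expected_votes` of 3, `votes_needed(3)` gives 2, so one node alone would stay inquorate.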


Thanks again.


      



