[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]

RE: [Linux-cluster] two node cluster not coming up



Hi Bob,

Yeah the communication is all good, can ping each other, in fact I'm scp'ing
the cluster.conf file to svr01 and there's nothing else on that network,
they might as well be connected through a x-over cable as these two machines
are the only machines on the network. I got around the startup hang by
starting the OS with the "I" keypress and said No to all cluster services.
Once in the OS, I did

# service ccsd start
# service cman start

This works fine on svr00, on svr01 it comes back with [FAILED]. When I do,
'service cman status' it says the following

[root svr01 ~]# service cman status
Protocol version: 5.0.1
Config version: 4
Cluster name: testcluster
Cluster ID: 27453
Cluster Member: No
Membership state: Joining

When I do a manual: cman_tool join -w I get back a "cman_tool: Node is
already active"

Also, I load my modules automatically during system startup with S99local. 

BTW, I do now see the following messages in my /var/log/messages file

Jun 29 14:36:39 svr01 ccsd[3685]: Unable to perform sendto: Cannot assign
requested address
Jun 29 14:36:40 svr01 kernel: CMAN: sending membership request
Jun 29 14:36:41 svr01 ccsd[3685]: Unable to perform sendto: Cannot assign
requested address
Jun 29 14:36:45 svr01 last message repeated 2 times
Jun 29 14:36:45 svr01 kernel: CMAN: sending membership request
Jun 29 14:36:47 svr01 ccsd[3685]: Unable to perform sendto: Cannot assign
requested address
Jun 29 14:36:50 svr01 kernel: CMAN: sending membership request
Jun 29 14:37:25 svr01 last message repeated 7 times

Does this help? Can I have the application generate more detailed logging?

Thanks in advance
\R

-----Original Message-----
From: linux-cluster-bounces redhat com
[mailto:linux-cluster-bounces redhat com] On Behalf Of Robert Peterson
Sent: Friday, June 30, 2006 12:36 AM
To: linux clustering
Subject: Re: [Linux-cluster] two node cluster not coming up

RR wrote:
> Hi Bob,
>
> Attached is the cluster.conf file, and below is the status I get from 
> the command "service cman status" on the working node:
>
> Svr00# service cman status
> Protocol version: 5.0.1
> Config version: 4
> Cluster name: testcluster
> Cluster ID: 27453
> Cluster Member: Yes
> Membership state: Cluster-Member
> Nodes: 1
> Expected_votes: 1
> Total_votes: 1
> Quorum: 1
> Active subsystems: 0
> Node name: svr00
> Node addresses: 10.1.3.64
>
> svr00# clustat
> Member Status: Quorate
>
> Resource Group Manager not running; no service information available.
>
>   Member Name                              Status
>   ------ ----                              ------
>   svr00                                     Online, Local
>   svr01                                     Offline
>
>
> I rebooted svr01 and now it just sits there at Starting clvmd: during 
> bootup.
>
> Hope this helps in anyone understanding my issue? Do I need all the 
> other services configured for this to work properly? i.e. clvmd, fenced,
etc. etc.
> I just wanted to see two nodes in a cluster first before I configured 
> any resources, services, fencing etc etc.
>
> \R
>   
Hi RR,

Hm.  I didn't see anything obviously wrong with your cluster.conf file.
I guess I'd reboot svr01 and try to bring it into the cluster manually, and
see if it complains about anything along the way. 
(You may need to bring it up in single-user mode so that it doesn't hang at
the service script that starts clvmd) Something like this:

modprobe lock_dlm
modprobe gfs
ccsd
cman_tool join -w
fence_tool join -w
clvmd

I'd verify that your communications are sound, that you can ping svr00 from
svr01, and that multicast is working.  Any reason you went with multicast
rather than broadcast?  You could see if a broadcast ping (ping -b) would
work from svr01 to svr00.  Also, you could test to see if your firewall is
blocking the IO by temporarily doing "service iptables stop" on both nodes.
I'd hope that selinux isn't interfering either, but you could try doing
"setenforce 0" just as an experiment to make sure.  These are just some
ideas.

Regards,

Bob Peterson
Red Hat Cluster Suite

--
Linux-cluster mailing list
Linux-cluster redhat com
https://www.redhat.com/mailman/listinfo/linux-cluster


[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]