[Linux-cluster] Timed out waiting for a response from Resource Group Manager

Jacques Botha jacquesb at fnb.co.za
Wed Jul 25 14:43:53 UTC 2007


Here is the cluster.conf like you asked Lon.


<?xml version="1.0"?>
<cluster alias="fnbgw" config_version="12" name="fnbgw">
        <quorumd interval="2" label="fnbgw_qdisk" log_level="7"
min_score="3" status_file="/tmp/qdisk_status" tko="5" votes="4">
                <heuristic interval="10" program="ping 172.20.28.193 -c3
-t1" score="2"/>
                <heuristic interval="10" program="ping 172.20.28.195 -c3
-t1" score="1"/>
                <heuristic interval="10" program="ping 172.20.28.196 -c3
-t1" score="1"/>
                <heuristic interval="10" program="ping 172.20.28.197 -c3
-t1" score="1"/>
                <heuristic interval="10" program="ping 172.20.28.198 -c3
-t1" score="1"/>
        </quorumd>
        <fence_daemon clean_start="0" post_fail_delay="0"
post_join_delay="3"/>
        <clusternodes>
                <clusternode name="fnbgw01.fnb.co.za" nodeid="1"
votes="1">
                        <fence>
                                <method name="1">
                                        <device blade="11"
name="BSM_BLADE_CENTRE_BLUE"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="fnbgw02.fnb.co.za" nodeid="2"
votes="1">
                        <fence>
                                <method name="1">
                                        <device blade="12"
name="BSM_BLADE_CENTRE_BLUE"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="fnbgw03.fnb.co.za" nodeid="3"
votes="1">
                        <fence>
                                <method name="1">
                                        <device blade="11"
name="BSM_BLADE_CENTRE_RED3"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="fnbgw04.fnb.co.za" nodeid="4"
votes="1">
                        <fence>
                                <method name="1">
                                        <device blade="12"
name="BSM_BLADE_CENTRE_RED3"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman/>
        <fencedevices>
                <fencedevice agent="fence_bladecenter" ipaddr="1.1.1.1"
login="secretlogin" name="BSM_BLADE_CENTRE_BLUE"
passwd="verysecretpassword"/>
                <fencedevice agent="fence_bladecenter" ipaddr="1.1.1.1"
login="secretlogin" name="BSM_BLADE_CENTRE_RED"
passwd="verysecretpassword"/>
        </fencedevices>
        <rm>
                <resources>
                        <ip address="172.20.28.200" monitor_link="1"/>
                </resources>
                <failoverdomains>
                        <failoverdomain name="vmfail" ordered="0"
restricted="0">
                                <failoverdomainnode
name="fnbgw01.fnb.co.za" priority="1"/>
                                <failoverdomainnode
name="fnbgw02.fnb.co.za" priority="1"/>
                                <failoverdomainnode
name="fnbgw03.fnb.co.za" priority="1"/>
                                <failoverdomainnode
name="fnbgw04.fnb.co.za" priority="1"/>
                        </failoverdomain>
                </failoverdomains>
                <service autostart="1" domain="vmfail" exclusive="0"
name="ip200" recovery="restart">
                        <ip ref="172.20.28.200"/>
                </service>
        </rm>
</cluster>



On Wed, 2007-07-25 at 10:27 -0400, Lon Hohberger wrote:
> On Tue, Jul 24, 2007 at 03:19:11PM +0200, Jacques Beth wrote:
> > Hi All
> > 
> > I've got a quorate 4 node cluster, with a quorum disk.
> > 
> > _If_ I have _No_ fail over domain, resources or services defined,
> > executing clustat is _fast_, and gives the following:
> > 
> > [root at fnbgw02 ~]# clustat
> > Member Status: Quorate
> > 
> >   Member Name                        ID   Status
> >   ------ ----                        ---- ------
> >   fnbgw01.fnb.co.za                     1 Online
> >   fnbgw02.fnb.co.za                     2 Online, Local
> >   fnbgw03.fnb.co.za                     3 Online
> >   fnbgw04.fnb.co.za                     4 Online
> >   /dev/sdb1                             0 Online, Quorum Disk
> > 
> > 
> > _If_ I define a failover group, a resource and a service, executing
> > clustat is _slow_ and it fails to show the defined service:
> 
> That shouldn't matter (I'm not saying it doesn't matter, but that it
> shouldn't).  It's especially weird that it looks like you don't even
> have rgmanager running... (so there's no service output).
> 
> > 
> > [root at fnbgw02 ~]# clustat
> > Timed out waiting for a response from Resource Group Manager
> > Member Status: Quorate
> > 
> >   Member Name                        ID   Status
> >   ------ ----                        ---- ------
> >   fnbgw01.fnb.co.za                     1 Online, Local
> >   fnbgw02.fnb.co.za                     2 Online
> >   fnbgw03.fnb.co.za                     3 Online
> >   fnbgw04.fnb.co.za                     4 Online
> >   /dev/sdb1                             0 Online, Quorum Disk
> > 
> > This is on CentOS5, with all the latest updates from CentOS applied.
> 
> Could you paste your cluster.conf somewhere?
> 

To read FirstRand Bank's Disclaimer for this email click on the following address or copy into your Internet browser: 
https://www.fnb.co.za/disclaimer.html 

If you are unable to access the Disclaimer, send a blank e-mail to
firstrandbankdisclaimer at fnb.co.za and we will send you a copy of the Disclaimer.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20070725/1cf7781b/attachment.sig>


More information about the Linux-cluster mailing list