[Linux-cluster] CentOS5.x86_64 HA Cluster will not fail over to remaining node

Christian Ullman christian at zmcconsulting.com
Thu Oct 11 14:40:50 UTC 2007


Hi-

 

I have a 2 node cluster on CentOS5.x86_64.

 

The nodes are ccweb1 and ccweb2, and the cluster runs httpd only.

NOTE: no shared storage is employed, as the content is static and all
the dynamic data is stored on a separate database cluster.

 

When the service "ca1-ccweb", which consists of a floating IP and the
/etc/init.d/httpd script (see config file below), is running on node ccweb1
and that node is fenced via:

 

fence_ipmilan -P -v -i 10.1.2.13 -l ADMIN -p password -A password -o off

 

"clustat" shows the service stuck in the "stopping" state, and the service
is not restarted on the remaining node:

 

<begin>

[root@ccweb2 ~]# clustat
Member Status: Quorate

  Member Name                        ID   Status
  ------ ----                        ---- ------
  ccweb1                                1 Offline
  ccweb2                                2 Online, Local, rgmanager

  Service Name         Owner (Last)                   State
  ------- ----         ----- ------                   -----
  service:ca1-ccweb    ccweb1                         stopping

<end>

 

After running:

 

clusvcadm -d ca1-ccweb -m ca1-ccweb1.daz3d.com

 

clustat reports the same as above. To clear this state, I have to
restart rgmanager.

 

Any thoughts?

 

Christian

 

<begin cluster.conf>

<?xml version="1.0"?>
<cluster config_version="12" name="ccweb-clu">
        <fence_daemon post_fail_delay="0" post_join_delay="3"/>
        <clusternodes>
                <clusternode name="ccweb1 " nodeid="1" votes="1">
                        <fence>
                                <method name="1">
                                        <device lanplus="1" name="ccweb1-fence"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="ccweb2 " nodeid="2" votes="1">
                        <fence>
                                <method name="1">
                                        <device lanplus="1" name="ccweb2-fence"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman expected_votes="1" two_node="1"/>
        <fencedevices>
                <fencedevice agent="fence_ipmilan" auth="password" ipaddr="10.1.2.13" login="ADMIN" name="ccweb1-fence" passwd="password"/>
                <fencedevice agent="fence_ipmilan" auth="password" ipaddr="10.1.2.14" login="ADMIN" name="ccweb2-fence" passwd="password"/>
        </fencedevices>
        <rm>
                <failoverdomains>
                        <failoverdomain name="ca1-ccweb-fail-dom" ordered="0" restricted="0">
                                <failoverdomainnode name="ccweb2" priority="1"/>
                                <failoverdomainnode name="ccweb1" priority="1"/>
                        </failoverdomain>
                </failoverdomains>
                <resources/>
                <service autostart="1" domain="ca1-ccweb-fail-dom" name="ca1-ccweb">
                        <ip address="38.103.62.30" monitor_link="1">
                                <script file="/etc/init.d/httpd" name="httpd-init"/>
                        </ip>
                </service>
        </rm>
</cluster>

<end cluster.conf>
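
As a sanity check on the config above, the node names used in the
clusternode entries can be compared against those used in the failover
domain. The following is only a sketch: check_node_names and SAMPLE are
hypothetical names, not part of cman or rgmanager. The trailing spaces it
flags in the pasted clusternode names may simply be an artifact of how the
config was pasted, and the sketch does not establish whether such a
mismatch has anything to do with the failover problem.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the cluster.conf posted above (fencing and service
# sections omitted); node names are kept exactly as pasted.
SAMPLE = """
<cluster config_version="12" name="ccweb-clu">
  <clusternodes>
    <clusternode name="ccweb1 " nodeid="1" votes="1"/>
    <clusternode name="ccweb2 " nodeid="2" votes="1"/>
  </clusternodes>
  <rm>
    <failoverdomains>
      <failoverdomain name="ca1-ccweb-fail-dom" ordered="0" restricted="0">
        <failoverdomainnode name="ccweb2" priority="1"/>
        <failoverdomainnode name="ccweb1" priority="1"/>
      </failoverdomain>
    </failoverdomains>
  </rm>
</cluster>
"""


def check_node_names(conf_xml):
    """Return a list of warnings about node-name inconsistencies.

    Hypothetical helper: it only compares strings in the XML and knows
    nothing about how the cluster software itself resolves node names.
    """
    root = ET.fromstring(conf_xml.strip())
    warnings = []

    # Collect clusternode names exactly as written, whitespace included.
    cluster_nodes = [n.get("name", "") for n in root.iter("clusternode")]
    for name in cluster_nodes:
        if name != name.strip():
            warnings.append(
                f"clusternode name {name!r} has leading/trailing whitespace"
            )

    # Every failover-domain member should match a clusternode name exactly.
    known = set(cluster_nodes)
    for dom in root.iter("failoverdomain"):
        for member in dom.iter("failoverdomainnode"):
            name = member.get("name", "")
            if name not in known:
                warnings.append(
                    f"failoverdomainnode {name!r} in domain "
                    f"{dom.get('name')!r} does not match any clusternode "
                    f"name exactly"
                )
    return warnings


if __name__ == "__main__":
    for line in check_node_names(SAMPLE):
        print("WARNING:", line)
```

To run it against the real file instead of the trimmed sample, the contents
of /etc/cluster/cluster.conf can be read into a string and passed to
check_node_names in place of SAMPLE.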

 

 

"Those who would give up security for shorts, deserve neither" - Benament
Frankenhoffer