[Linux-cluster] CentOS5.x86_64 HA Cluster will not fail over to remaining node



I have a 2 node cluster on CentOS5.x86_64.


Nodes: ccweb1 and ccweb2. The cluster serves httpd only.

NOTE: No shared storage is employed, as the content is static and all the dynamic data is stored on a separate database cluster.


When the service "ca1-ccweb", which consists of a floating IP and the /etc/init.d/httpd script (see config file below), is running on the ccweb1 node and that node is fenced via:


fence_ipmilan -P -v -i -l ADMIN -p password -A password -o off


"clustat" shows the service stuck in the "stopping" state, and it is not restarted on the remaining node:



[root@ccweb2 ~]# clustat
Member Status: Quorate

  Member Name                        ID   Status
  ------ ----                        ---- ------
  ccweb1                                1 Offline
  ccweb2                                2 Online, Local, rgmanager

  Service Name         Owner (Last)                   State
  ------- ----         ----- ------                   -----
  service:ca1-ccweb    ccweb1                         stopping
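
For anyone who wants to catch this state from a script, here is a minimal sketch. It assumes clustat output in the layout shown above, where service rows begin with "service:" and the state is the last column. The function name stuck_services is hypothetical and not part of the cluster tools.

```shell
#!/bin/sh
# Hypothetical helper (not part of rgmanager): read clustat output on stdin
# and print the name of every service whose State column is "stopping".
stuck_services() {
    # Service rows start with "service:"; the state is the last field.
    awk '$1 ~ /^service:/ && $NF == "stopping" { print $1 }'
}

# Example use on a live node (assumes clustat is in PATH):
#   clustat | stuck_services
```

On the output above this would print service:ca1-ccweb. It only detects the condition and does nothing to clear it.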



After running:


clusvcadm -d ca1-ccweb -m ca1-ccweb1.daz3d.com


clustat reports the same as above.  In order to clear this out, I have to restart rgmanager.


Any thoughts?




<?xml version="1.0"?>
<cluster config_version="12" name="ccweb-clu">
        <fence_daemon post_fail_delay="0" post_join_delay="3"/>
        <clusternodes>
                <clusternode name="ccweb1 " nodeid="1" votes="1">
                        <fence>
                                <method name="1">
                                        <device lanplus="1" name="ccweb1-fence"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="ccweb2 " nodeid="2" votes="1">
                        <fence>
                                <method name="1">
                                        <device lanplus="1" name="ccweb2-fence"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman expected_votes="1" two_node="1"/>
        <fencedevices>
                <fencedevice agent="fence_ipmilan" auth="password" ipaddr="" login="ADMIN" name="ccweb1-fence" passwd="password"/>
                <fencedevice agent="fence_ipmilan" auth="password" ipaddr="" login="ADMIN" name="ccweb2-fence" passwd="password"/>
        </fencedevices>
        <rm>
                <failoverdomains>
                        <failoverdomain name="ca1-ccweb-fail-dom" ordered="0" restricted="0">
                                <failoverdomainnode name="ccweb2" priority="1"/>
                                <failoverdomainnode name="ccweb1" priority="1"/>
                        </failoverdomain>
                </failoverdomains>
                <service autostart="1" domain="ca1-ccweb-fail-dom" name="ca1-ccweb">
                        <ip address="" monitor_link="1">
                                <script file="/etc/init.d/httpd" name="httpd-init"/>
                        </ip>
                </service>
        </rm>
</cluster>

