Re: [Linux-cluster] Two nodes DRBD - Fail-Over Actif/Passif Cluster.

On 02/15/2011 08:50 PM, vincent blondel ing be wrote:

Below is the cluster.conf file:

<?xml version="1.0"?>
<cluster name="cluster" config_version="6">
   <!-- post_join_delay: number of seconds the daemon will wait before
                         fencing any victims after a node joins the domain
        post_fail_delay: number of seconds the daemon will wait before
                       fencing any victims after a domain member fails
        clean_start    : prevent any startup fencing the daemon might do.
                       It indicates that the daemon should assume all nodes
                       are in a clean state to start. -->
   <fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
   <clusternodes>
     <clusternode name="reporter1.lab.intranet" votes="1" nodeid="1">
       <fence>
         <!-- Handle fencing manually -->
         <method name="human">
           <device name="human" nodename="reporter1.lab.intranet"/>
         </method>
       </fence>
     </clusternode>
     <clusternode name="reporter2.lab.intranet" votes="1" nodeid="2">
       <fence>
         <!-- Handle fencing manually -->
         <method name="human">
           <device name="human" nodename="reporter2.lab.intranet"/>
         </method>
       </fence>
     </clusternode>
   </clusternodes>
   <!-- cman two nodes specification -->
   <cman expected_votes="1" two_node="1"/>
   <fencedevices>
     <!-- Define manual fencing -->
     <fencedevice name="human" agent="fence_manual"/>
   </fencedevices>
   <rm>
     <failoverdomains>
       <failoverdomain name="example_pri" nofailback="0" ordered="1" restricted="0">
         <failoverdomainnode name="reporter1.lab.intranet" priority="1"/>
         <failoverdomainnode name="reporter2.lab.intranet" priority="2"/>
       </failoverdomain>
     </failoverdomains>
     <resources>
       <ip address="" monitor_link="on" sleeptime="10"/>
       <apache config_file="conf/httpd.conf" name="example_server" server_root="/etc/httpd" shutdown_wait="0"/>
     </resources>
     <service autostart="1" domain="example_pri" exclusive="0" name="example_apache" recovery="relocate">
       <ip ref=""/>
       <apache ref="example_server"/>
     </service>
   </rm>
</cluster>

And this is the result I get on both servers:

[root reporter1 ~]# clustat
Cluster Status for cluster @ Mon Feb 14 22:22:53 2011
Member Status: Quorate

  Member Name                                      ID   Status
  ------ ----                                      ---- ------
  reporter1.lab.intranet                               1 Online, Local, rgmanager
  reporter2.lab.intranet                               2 Online, rgmanager

  Service Name                            Owner (Last)                            State
  ------- ----                            ----- ------                            -----
  service:example_apache                  (none)                                  stopped

As you can see, the service is stopped; in other words, nothing runs. So my questions are:

Reading through /var/log/messages for possible causes would be a good start.

Do I have to configure the load-balanced IP manually as an alias IP on both sides, or is it done automatically by Red Hat Cluster?

RHCS will automatically assign the IP to an interface that is on the same subnet. You most definitely shouldn't create the IP manually on any of the nodes.
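
As a minimal sketch of how the floating IP is usually declared in cluster.conf: the address 192.0.2.10 below is a placeholder from the documentation range, not a value from your setup. Substitute a free address on the same subnet as one of the nodes' interfaces.

```xml
<rm>
  <resources>
    <!-- Sketch only: 192.0.2.10 is a placeholder, not taken from the original config -->
    <ip address="192.0.2.10" monitor_link="on" sleeptime="10"/>
  </resources>
  <service autostart="1" domain="example_pri" exclusive="0" name="example_apache" recovery="relocate">
    <!-- ref must match the address attribute of the ip resource above -->
    <ip ref="192.0.2.10"/>
    <apache ref="example_server"/>
  </service>
</rm>
```

Once the service has started, the address should appear as a secondary address in the output of `ip addr show` on the node that owns the service; it is typically not listed by plain `ifconfig`.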

I just made a simple test with Apache, but I cannot find any reference to the start/stop script for Apache in the examples. Is that normal?
Do you have any best practices for this kind of setup?

I'm not familiar with the <apache> tag in cluster.conf; I usually configure most things as init script resources.
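
For illustration, an init script resource would look roughly like the sketch below. The resource name "httpd_init" is made up for this example, and the path assumes the stock httpd init script; the service and failover domain names are reused from your config.

```xml
<rm>
  <resources>
    <!-- "httpd_init" is an illustrative name; the path assumes the distribution's httpd init script -->
    <script file="/etc/init.d/httpd" name="httpd_init"/>
  </resources>
  <service autostart="1" domain="example_pri" exclusive="0" name="example_apache" recovery="relocate">
    <!-- rgmanager calls the script with start, stop and status -->
    <script ref="httpd_init"/>
  </service>
</rm>
```

With this approach httpd should normally be disabled at boot (chkconfig httpd off) on both nodes, so that only rgmanager starts and stops it.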


