
Re: [Linux-cluster] Manual Fencing problem

OK, I think that works, but now I have another problem.
In the messages log on node0 I see the following message:

fence_manual: Node node1 needs to be reset before recovery can procede.  Waiting for node1 to rejoin the cluster or for manual acknowledgement that it has been reset (i.e. fence_ack_manual -n node1)
So I tried to acknowledge the fence manually and received this message:
fence_ack_manual -n node1
Warning:  If the node "node1" has not been manually fenced
(i.e. power cycled or disconnected from shared storage devices)
the GFS file system may become corrupted and all its data
unrecoverable!  Please verify that the node shown above has
been reset or disconnected from storage.
Are you certain you want to continue? [yN] y
can't open /tmp/fence_manual.fifo: No such file or directory
Thank you for your help
Best Regards
Mauro Casiraghi
On Thu, Nov 13, 2008 at 3:58 PM, John Ruemker <jruemker redhat com> wrote:
Try adding the nodename attribute to each device as seen here:

Mauro Casiraghi wrote:
I have two cluster nodes with the following configuration.
For each node I have set up manual fencing:
<?xml version="1.0"?>
<cluster alias="rhcs" config_version="13" name="mauro">
        <fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
        <clusternodes>
                <clusternode name="node0" nodeid="1" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="Manual-0" nodename="node0"/>
                                </method>
                        </fence>
                </clusternode>
                <clusternode name="node1" nodeid="2" votes="1">
                        <fence>
                                <method name="1">
                                        <device name="Manual-1" nodename="node1"/>
                                </method>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman expected_votes="1" two_node="1"/>
        <fencedevices>
                <fencedevice agent="fence_manual" name="Manual-0"/>
                <fencedevice agent="fence_manual" name="Manual-1"/>
        </fencedevices>
        <rm>
                <failoverdomains>
                        <failoverdomain name="rhcs-domain" ordered="0" restricted="1">
                                <failoverdomainnode name="node0" priority="1"/>
                                <failoverdomainnode name="node1" priority="1"/>
                        </failoverdomain>
                </failoverdomains>
                <resources>
                        <ip address="xx.xxx.xx.78" monitor_link="1"/>
                </resources>
                <service autostart="1" domain="rhcs-domain" exclusive="0" name="rhcs-web" recovery="relocate">
                        <ip ref="xx.xxx.xx.78"/>
                </service>
        </rm>
</cluster>
In the messages log on node0 I received this message:
 Nov 13 12:06:34 lxxxxxxx fenced[2002]: fencing node "node1"
Nov 13 12:06:34 lxxxxxxx fenced[2002]: agent "fence_manual" reports: failed: fence_manual no node name
How can I fix this problem?
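
The "no node name" failure in the log above matches the advice earlier in the thread to give each fence_manual device a nodename attribute. As a rough way to spot devices that lack it, here is a minimal sketch in Python. The function name manual_devices_missing_nodename and the embedded SAMPLE_CONF are made up for illustration (the sample deliberately leaves nodename off node1's device); it assumes the usual cluster.conf layout of clusternode/fence/method/device and is not part of any cluster tooling.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample config: node1's device is missing nodename on purpose.
SAMPLE_CONF = """<?xml version="1.0"?>
<cluster name="mauro" config_version="13">
  <clusternodes>
    <clusternode name="node0" nodeid="1" votes="1">
      <fence><method name="1">
        <device name="Manual-0" nodename="node0"/>
      </method></fence>
    </clusternode>
    <clusternode name="node1" nodeid="2" votes="1">
      <fence><method name="1">
        <device name="Manual-1"/>
      </method></fence>
    </clusternode>
  </clusternodes>
  <fencedevices>
    <fencedevice agent="fence_manual" name="Manual-0"/>
    <fencedevice agent="fence_manual" name="Manual-1"/>
  </fencedevices>
</cluster>
"""


def manual_devices_missing_nodename(conf_text):
    """Return (clusternode, device) pairs whose fence_manual device has no nodename."""
    root = ET.fromstring(conf_text)

    # Names of all fence devices that use the fence_manual agent.
    manual = {
        fd.get("name")
        for fd in root.iter("fencedevice")
        if fd.get("agent") == "fence_manual"
    }

    missing = []
    for node in root.iter("clusternode"):
        for dev in node.iter("device"):
            # Flag per-node device entries that point at a manual fence
            # device but carry no (or an empty) nodename attribute.
            if dev.get("name") in manual and not dev.get("nodename"):
                missing.append((node.get("name"), dev.get("name")))
    return missing


if __name__ == "__main__":
    for node, dev in manual_devices_missing_nodename(SAMPLE_CONF):
        print(f"{node}: device {dev} has no nodename attribute")
```

To check a real configuration, read the file's text and pass it to the function instead of SAMPLE_CONF; the file location (commonly /etc/cluster/cluster.conf) is an assumption about your installation.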

