[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]

Re: [Linux-cluster] Manual Fencing problem



Ok i think that it works but now i have another problem
 
On the node0 messages i can see the follow message
 

fence_manual: Node node1 needs to be reset before recovery can procede.  Waiting for node1 to rejoin the cluster or for manual acknowledgement that it has been reset (i.e. fence_ack_manual -n node1)
so i try to fence_ack_manual -n node1
 
and i recived this message
 
fence_ack_manual -n node1
Warning:  If the node "node1" has not been manually fenced
(i.e. power cycled or disconnected from shared storage devices)
the GFS file system may become corrupted and all its data
unrecoverable!  Please verify that the node shown above has
been reset or disconnected from storage.
Are you certain you want to continue? [yN] y
can't open /tmp/fence_manual.fifo: No such file or directory
 
Thank you for your help
 
Best Regards
 
Mauro Casiraghi
 
On Thu, Nov 13, 2008 at 3:58 PM, John Ruemker <jruemker redhat com> wrote:
Try adding the nodename attribute to each device as seen here:


Mauro Casiraghi wrote:
I have two cluster nodes with the follow configuration
 For each node i had setup manual fencing
 <?xml version="1.0"?>
<cluster alias="rhcs" config_version="13" name="mauro">
       <fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
       <clusternodes>
               <clusternode name="node0" nodeid="1" votes="1">
                       <fence>
                               <method name="1">
                                              <device name="Manual-0" nodename="node0"/>

                               </method>
                       </fence>
               </clusternode>
               <clusternode name="node1" nodeid="2" votes="1">
                       <fence>
                               <method name="1">
                                             <device name="Manual-1" nodename="node1"/>

                               </method>
                       </fence>
               </clusternode>
       </clusternodes>
       <cman expected_votes="1" two_node="1"/>
       <fencedevices>
               <fencedevice agent="fence_manual" name="Manual-0" />
               <fencedevice agent="fence_manual" name="Manual-1" />
       </fencedevices>
       <rm>
               <failoverdomains>
                       <failoverdomain name="rhcs-domain" ordered="0" restricted="1">
                               <failoverdomainnode name="node0" priority="1"/>
                               <failoverdomainnode name="node1" priority="1"/>
                       </failoverdomain>
               </failoverdomains>
               <resources>
                       <ip address="xx.xxx.xx.78" monitor_link="1"/>
               </resources>
               <service autostart="1" domain="rhcs-domain" exclusive="0" name="rhcs-web" recovery="relocate">
                       <ip ref="xx.xxx.xx..78"/>
               </service>
       </rm>
</cluster>
 On my messages (node0) i had recived this message
 Nov 13 12:06:34 lxxxxxxx fenced[2002]: fencing node "node1"
Nov 13 12:06:34 lxxxxxxx fenced[2002]: agent "fence_manual" reports: failed: fence_manual no node name
 How can i fix this problem
 

-John

--
Linux-cluster mailing list
Linux-cluster redhat com
https://www.redhat.com/mailman/listinfo/linux-cluster


[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]