[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]

[Linux-cluster] force fencing

Hello list

I'm trying to setup a 3 nodes Cluster with 2 failover Domain for an HA
mail solution.
I want 1 run active for the Imap server in the Imap Failover domain , 1
node active for the Smtp in the Smtp Failover domain and the 3rd in the
2 failover domain as a backup node.

I run Centos 5.3
My fence device is a wti power switch

My cluster.conf is in attachement

My SMTP service is composed of:
	1 IP
	1 amavisd scritp
	1 postfix script
	2 NFS mount for postfix and amavis

If I manually kill the postfix master process (to simulate a crash), my
node is not fence and the logs said:

Jul  6 10:00:40 centos-smtp1 clurgmgrd: [4228]: <info> Executing
/etc/init.d/postfix status
Jul  6 10:00:40 centos-smtp1 clurgmgrd: [4228]: <err> script:postfix:
status of /etc/init.d/postfix failed (returned 3)
Jul  6 10:00:40 centos-smtp1 clurgmgrd[4228]: <notice> status on script
"postfix" returned 1 (generic error)
Jul  6 10:00:40 centos-smtp1 clurgmgrd[4228]: <notice> Stopping service
Jul  6 10:00:40 centos-smtp1 clurgmgrd: [4228]: <info> Executing
/etc/init.d/amavisd stop
Jul  6 10:00:40 centos-smtp1 kernel: do_vfs_lock: VFS is out of sync
with lock manager!
Jul  6 10:00:40 centos-smtp1 last message repeated 8 times
Jul  6 10:00:41 centos-smtp1 clurgmgrd: [4228]: <info> Executing
/etc/init.d/postfix stop
Jul  6 10:00:41 centos-smtp1 clurgmgrd: [4228]: <err> script:postfix:
stop of /etc/init.d/postfix failed (returned 1)
Jul  6 10:00:41 centos-smtp1 clurgmgrd[4228]: <notice> stop on script
"postfix" returned 1 (generic error)
Jul  6 10:00:41 centos-smtp1 clurgmgrd: [4228]: <info> Removing IPv4
address from bond0
Jul  6 10:00:41 centos-smtp1 avahi-daemon[3552]: Withdrawing address
record for on bond0.
Jul  6 10:00:51 centos-smtp1 clurgmgrd: [4228]: <info> unmounting
Jul  6 10:00:51 centos-smtp1 clurgmgrd: [4228]: <info> unmounting
Jul  6 10:00:51 centos-smtp1 clurgmgrd[4228]: <crit> #12: RG
service:Postfix failed to stop; intervention required
Jul  6 10:00:51 centos-smtp1 clurgmgrd[4228]: <notice> Service
service:Postfix is failed
Jul  6 10:00:52 centos-smtp1 ntpd[3322]: synchronized to,
stratum 1

Clustat said:

Cluster Status for cluster-test @ Mon Jul  6 10:02:39 2009
Member Status: Quorate

 Member Name                                                     ID   Status
 ------ ----                                                     ---- ------
 centos-imap1.ill.fr                                                 1
Online, Local, rgmanager
 centos-imap2.ill.fr                                                 2
Online, rgmanager
 centos-smtp1.ill.fr                                                 3
Online, rgmanager
 /dev/disk/by-id/scsi-360a98000567247514634507447594661-part1        0
Online, Quorum Disk

 Service Name                                                   Owner
(Last)                                                   State
 ------- ----                                                   -----
------                                                   -----
centos-imap2.ill.fr                                            started

(centos-smtp1.ill.fr)                                          failed

So I have to disable the Postfix servcie with:
	clusvcadm -d Postfix
and re-enable
	clusvcadm -e Postfix

Could you explain my why my original smtp node is not fenced and why my
service is not start on the 2nd node ???

Is there a way to force the fencing ???

ARMANET Stephane
Division Projet Technique
Service Informatique
  Groupe Infrastructure

Institut Laue langevin
<?xml version="1.0"?>
<cluster alias="cluster-test" config_version="57" name="cluster-test">
	<fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
		<clusternode name="centos-imap1.test.fr" nodeid="1" votes="1">
				<method name="1">
					<device name="psu2" option="off" port="1"/>
					<device name="psu2" option="off" port="5"/>
				<method name="2">
					<device name="centos_manual-fence" nodename="centos-imap1.test.fr"/>
		<clusternode name="centos-imap2.test.fr" nodeid="2" votes="1">
				<method name="1">
					<device name="psu2" option="off" port="2"/>
					<device name="psu2" option="off" port="6"/>
				<method name="2">
					<device name="centos_manual-fence" nodename="centos-imap2.test.fr"/>
		<clusternode name="centos-smtp1.test.fr" nodeid="3" votes="1">
				<method name="1">
					<device name="psu1" option="off" port="1"/>
					<device name="psu1" option="off" port="5"/>
				<method name="2">
					<device name="centos_manual-fence" nodename="centos-smtp1.test.fr"/>
		<fencedevice agent="fence_manual" name="centos_manual-fence"/>
		<fencedevice agent="fence_wti" ipaddr="" name="psu1" passwd="passwd"/>
		<fencedevice agent="fence_wti" ipaddr="" name="psu2" passwd="passwd"/>
	<rm log_facility="local4" log_level="7">
			<failoverdomain name="imap-FOD" nofailback="0" ordered="1" restricted="1">
				<failoverdomainnode name="centos-imap1.test.fr" priority="1"/>
				<failoverdomainnode name="centos-imap2.test.fr" priority="2"/>
			<failoverdomain name="smtp-FOD" ordered="1" restricted="1">
				<failoverdomainnode name="centos-smtp1.test.fr" priority="1"/>
				<failoverdomainnode name="centos-imap2.test.fr" priority="2"/>
			<netfs export="/vol/volSMTP/postfix" force_unmount="1" fstype="nfs" host="romulus.test.fr" mountpoint="/var/spool/postfix" name="NFS-postfix" options="rw,nolock"/>
			<fs device="/dev/mapper/vgMail-lvMailboxes" force_fsck="1" force_unmount="1" fsid="34650" fstype="ext3" mountpoint="/var/spool/imap" name="lvMailboxes" options="commit=1" self_fence="1"/>
			<fs device="/dev/mapper/vgMail-lvDBMail" force_fsck="1" force_unmount="1" fsid="4277" fstype="ext3" mountpoint="/var/lib/imap" name="lvDBMail" options="commit=1" self_fence="1"/>
			<netfs export="/vol/volSMTP/amavis" force_unmount="1" fstype="nfs" host="romulus.test.fr" mountpoint="/var/lib/amavis" name="NFS Amavis" options=""/>
		<service autostart="1" domain="imap-FOD" name="Imap" recovery="relocate">
			<ip address="" monitor_link="1">
				<script file="/etc/init.d/cyrus-imapd" name="Cyrus-imapd"/>
			<fs ref="lvMailboxes"/>
			<fs ref="lvDBMail"/>
		<service autostart="1" domain="smtp-FOD" name="Postfix" recovery="relocate">
			<ip address="" monitor_link="1">
				<script file="/etc/init.d/postfix" name="postfix"/>
				<script file="/etc/init.d/amavisd" name="amavisd"/>
			<netfs ref="NFS-postfix"/>
			<netfs ref="NFS Amavis"/>
	<quorumd interval="2" label="QDISK" min_score="1" tko="5" votes="2">
		<heuristic interval="2" program="/bin/ping -c 1 -t 1" score="1"/>
		<heuristic interval="5" program="/bin/ping -c 3 -t 1" score="1"/>
	<totem consensus="4800" join="60" token="25000" token_retransmits_before_loss_const="20"/>

[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]