[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]

Re: [Linux-cluster] Service won't relocate after yum updates



Correction: "but the nfs service will failover" should read "but the nfs service will not failover" Sorry.

Randy

Randy Brown wrote:
I just ran `yum update` on one of the nodes in my two node cluster and now the nfs service won't relocate to the updated node. Here are the versions of relevant packages on each node:

Node 1 (updated node)
[root nfs1-cluster ~]# rpm -qa |grep -e cman -e lvm -e gfs -e rgmanager -e kernel
kmod-gfs-0.1.16-6.2.6.18_8.1.15.el5
lvm2-2.02.26-3.el5
kmod-gfs-0.1.19-7.el5_1.1
gfs-utils-0.1.12-1.el5
system-config-lvm-1.0.22-1.0.el5
cman-2.0.73-1.el5_1.1
lvm2-cluster-2.02.26-1.el5
rgmanager-2.0.31-1.el5.centos
gfs2-utils-0.1.38-1.el5
kernel-2.6.18-53.1.4.el5
kernel-2.6.18-8.1.15.el5
kernel-headers-2.6.18-53.1.4.el5

Node 2
[root nfs2-cluster ~]# rpm -qa |grep -e cman -e lvm -e gfs -e rgmanager -e kernel
gfs2-utils-0.1.25-1.el5
kmod-gfs-0.1.16-5.2.6.18_8.1.14.el5
kmod-gfs-0.1.16-6.2.6.18_8.1.15.el5
system-config-lvm-1.0.22-1.0.el5
cman-2.0.64-1.0.1.el5
rgmanager-2.0.24-1.el5.centos
gfs-utils-0.1.11-3.el5
lvm2-2.02.16-3.el5
lvm2-cluster-2.02.16-3.el5
kernel-2.6.18-8.1.14.el5
kernel-2.6.18-8.1.15.el5
kernel-headers-2.6.18-8.1.15.el5

The cluster will start on the new machine but the nfs service will failover to it as it did prior to the upgrade. The messages I see in /var/log/messages are: Dec 6 10:14:08 nfs1-cluster clurgmgrd[4455]: <notice> Member 2 shutting down Dec 6 10:14:08 nfs1-cluster clurgmgrd[4455]: <notice> Starting stopped service service:nfs Dec 6 10:14:08 nfs1-cluster clurgmgrd: [4455]: <err> No export path specified. Dec 6 10:14:08 nfs1-cluster clurgmgrd[4455]: <notice> start on nfsclient "fs-shared-client" returned 2 (invalid argument(s)) Dec 6 10:14:08 nfs1-cluster clurgmgrd[4455]: <warning> #68: Failed to start service:nfs; return value: 1 Dec 6 10:14:08 nfs1-cluster clurgmgrd[4455]: <notice> Stopping service service:nfs Dec 6 10:14:08 nfs1-cluster clurgmgrd: [4455]: <err> No export path specified. Dec 6 10:14:08 nfs1-cluster clurgmgrd[4455]: <notice> stop on nfsclient "rfcdata-client" returned 2 (invalid argument(s)) Dec 6 10:14:08 nfs1-cluster clurgmgrd: [4455]: <err> No export path specified. Dec 6 10:14:08 nfs1-cluster clurgmgrd[4455]: <notice> stop on nfsclient "fs-shared-client" returned 2 (invalid argument(s)) Dec 6 10:14:09 nfs1-cluster clurgmgrd[4455]: <notice> Service service:nfs is recovering Dec 6 10:14:09 nfs1-cluster clurgmgrd[4455]: <warning> #71: Relocating failed service service:nfs Dec 6 10:14:09 nfs1-cluster clurgmgrd[4455]: <notice> Stopping service service:nfs Dec 6 10:14:09 nfs1-cluster clurgmgrd: [4455]: <err> No export path specified. Dec 6 10:14:09 nfs1-cluster clurgmgrd[4455]: <notice> stop on nfsclient "rfcdata-client" returned 2 (invalid argument(s)) Dec 6 10:14:09 nfs1-cluster clurgmgrd: [4455]: <err> No export path specified. Dec 6 10:14:09 nfs1-cluster clurgmgrd[4455]: <notice> stop on nfsclient "fs-shared-client" returned 2 (invalid argument(s)) Dec 6 10:14:09 nfs1-cluster clurgmgrd[4455]: <notice> Service service:nfs is stopped Dec 6 10:14:47 nfs1-cluster clurgmgrd[4455]: <notice> Starting stopped service service:nfs Dec 6 10:14:47 nfs1-cluster clurgmgrd: [4455]: <err> No export path specified. Dec 6 10:14:47 nfs1-cluster clurgmgrd[4455]: <notice> start on nfsclient "fs-shared-client" returned 2 (invalid argument(s)) Dec 6 10:14:47 nfs1-cluster clurgmgrd[4455]: <warning> #68: Failed to start service:nfs; return value: 1 Dec 6 10:14:47 nfs1-cluster clurgmgrd[4455]: <notice> Stopping service service:nfs Dec 6 10:14:47 nfs1-cluster clurgmgrd: [4455]: <err> No export path specified. Dec 6 10:14:47 nfs1-cluster clurgmgrd[4455]: <notice> stop on nfsclient "rfcdata-client" returned 2 (invalid argument(s)) Dec 6 10:14:47 nfs1-cluster clurgmgrd: [4455]: <err> No export path specified. Dec 6 10:14:47 nfs1-cluster clurgmgrd[4455]: <notice> stop on nfsclient "fs-shared-client" returned 2 (invalid argument(s)) Dec 6 10:14:47 nfs1-cluster clurgmgrd[4455]: <notice> Service service:nfs is recovering Dec 6 10:14:47 nfs1-cluster clurgmgrd[4455]: <warning> #71: Relocating failed service service:nfs Dec 6 10:14:49 nfs1-cluster clurgmgrd[4455]: <notice> Service service:nfs is now running on member 2

The export path in the nfsclient resource box, when using system-config-cluster, is marked optional and it has not been a problem leaving that blank in the past. Has something regarding this changed?

Thanks in advance for any assistance,

Randy

Cluster.conf:
[root nfs1-cluster cluster]# cat cluster.conf
<?xml version="1.0"?>
<cluster alias="ohd_cluster" config_version="120" name="ohd_cluster">
       <fence_daemon post_fail_delay="0" post_join_delay="60"/>
       <clusternodes>
<clusternode name="nfs1-cluster.nws.noaa.gov" nodeid="1" votes="1">
                       <fence>
                               <method name="1">
<device name="nfspower" port="8" switch="1"/>
                               </method>
                       </fence>
               </clusternode>
<clusternode name="nfs2-cluster.nws.noaa.gov" nodeid="2" votes="1">
                       <fence>
                               <method name="1">
<device name="nfspower" port="7" switch="1"/>
                               </method>
                       </fence>
               </clusternode>
       </clusternodes>
       <cman expected_votes="1" two_node="1"/>
       <rm>
               <failoverdomains>
<failoverdomain name="nfs-failover" ordered="0" restricted="1"> <failoverdomainnode name="nfs1-cluster.nws.noaa.gov" priority="1"/> <failoverdomainnode name="nfs2-cluster.nws.noaa.gov" priority="1"/>
                       </failoverdomain>
               </failoverdomains>
               <resources>
                       <ip address="140.90.91.244" monitor_link="1"/>
<clusterfs device="/dev/VolGroupFS/LogVol-shared" force_unmount="0" fsid="30647" fstype="gfs" mountpoint="/fs/shared" name="fs-shared" options="acl"/>
                       <nfsexport name="fs-shared-exp"/>
<nfsclient name="fs-shared-client" options="no_root_squash,rw" path="" target="140.90.91.0/24"/> <clusterfs device="/dev/VolGroupTemp/LogVol-rfcdata" force_unmount="0" fsid="54233" fstype="gfs" mountpoint="/rfcdata" name="rfcdata" options="acl"/>
                       <nfsexport name="rfcdata-exp"/>
<nfsclient name="rfcdata-client" options="no_root_squash,rw" path="" target="140.90.91.0/24"/>
               </resources>
               <service autostart="1" domain="nfs-failover" name="nfs">
                       <clusterfs ref="fs-shared">
                               <nfsexport ref="fs-shared-exp">
<nfsclient ref="fs-shared-client"/>
                               </nfsexport>
                       </clusterfs>
                       <ip ref="140.90.91.244"/>
                       <clusterfs ref="rfcdata">
                               <nfsexport ref="rfcdata-exp">
                                       <nfsclient ref="rfcdata-client"/>
                               </nfsexport>
                               <ip ref="140.90.91.244"/>
                       </clusterfs>
               </service>
       </rm>
       <fencedevices>
<fencedevice agent="fence_apc" ipaddr="192.168.42.30" login="rbrown" name="nfspower" passwd="Tele4m32"/>
       </fencedevices>
</cluster>
--
Linux-cluster mailing list
Linux-cluster redhat com
https://www.redhat.com/mailman/listinfo/linux-cluster
begin:vcard
fn:Randy Brown
n:Brown;Randy
org:National Weather Service;Office of hydrologic Development
adr:;;1325 East West Highway;Silver Spring;MD;20910;USA
email;internet:randy brown noaa gov
title:Senior Systems Administrator
tel;work:301-713-1669 x110
url:http://www.nws.noaa.gov/ohd/
version:2.1
end:vcard


[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]