[Linux-cluster] IP Relocate Error / IP Restart error

Dan Deshayes dan.deshayes at algitech.com
Fri Jul 6 14:36:15 UTC 2007


Hello,
I'm bumping this question since I'm experiencing a similar problem.
When one of my services fails and the cluster tries to restart it,
the node withdraws the IP and route.
It seems that it can't set up the IP again once it has been withdrawn. It can
fail over between nodes that hold
other IP addresses, but never back unless I manually put back
the IP and route.
I don't want to relocate the service just because sms-pixie fails, only
restart it (it stops when it loses its connection to a server).
I'm using bonding and my configuration looks like this:

                <resources>
                        <script file="/etc/init.d/mysqld" name="mysqld-db"/>
                        <script file="/etc/init.d/postgresql" name="psql-db"/>
                        <script file="/etc/init.d/sms-pixe" name="sms-pixie"/>
                        <ip address="<ip1>" interface="bond0" monitor_link="1"/>
                        <ip address="<ip2>" interface="bond0" monitor_link="1"/>
                </resources>
                <service autostart="1" domain="www-project1" name="www-project1" recovery="restart">
                        <ip ref="<ip1>"/>
                        <script ref="psql-db"/>
                </service>
                <service autostart="1" domain="www-project2" name="www-project2" recovery="restart">
                        <ip ref="<ip2>"/>
                        <script ref="mysqld-db"/>
                </service>
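
For reference, putting the IP and route back by hand amounts to something like the sketch below. The address, prefix length and gateway are placeholders from the documentation range, not values from this cluster, and treating the lost route as a default route via bond0 is only an assumption for illustration.

```shell
#!/bin/sh
# Sketch only: VIP, PREFIX and GATEWAY are placeholder values (documentation
# range), and treating the lost route as a default route is an assumption.
VIP="${VIP:-192.0.2.10}"
PREFIX="${PREFIX:-24}"
GATEWAY="${GATEWAY:-192.0.2.1}"
DEV="${DEV:-bond0}"

# Print each command by default; run it for real only when APPLY=1 (needs root).
run() {
    if [ "${APPLY:-0}" = "1" ]; then
        "$@"
    else
        echo "$*"
    fi
}

restore_vip() {
    # Re-add the service address on the bonded interface.
    run ip addr add "$VIP/$PREFIX" dev "$DEV"
    # Re-add the route that went away together with the address.
    run ip route add default via "$GATEWAY" dev "$DEV"
}

restore_vip
```

With APPLY unset the script only prints the two commands; setting APPLY=1 executes them.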

Any thoughts would be appreciated.

Regards, Dan


Robert Gil wrote:

> I have an IP address as a resource. I have the ip address in a 2 node 
> failover domain (total 4 nodes).
>  
> When i run ifconfig eth0:1 down
>  
> The service shows as stopped in clustat and the following errors show 
> in the logs
>  
> Jun  1 12:25:36 <host> clurgmgrd[5346]: <warning> #71: Relocating failed service mastervip
> Jun  1 12:25:36 <host> clurgmgrd[5346]: <warning> #70: Attempting to restart service mastervip locally.
> Jun  1 12:25:37 <host> clurgmgrd[5346]: <notice> Recovering failed service mastervip
> Jun  1 12:25:37 <host> clurgmgrd[5346]: <notice> start on ip:192.168.2.100 returned 1 (generic error)
> Jun  1 12:25:37 <host> clurgmgrd[5346]: <warning> #68: Failed to start mastervip; return value: 1
> Jun  1 12:25:37 <host> clurgmgrd[5346]: <notice> Stopping service mastervip
> Jun  1 12:25:37 <host> clurgmgrd[5346]: <notice> Service mastervip is stopped
>  
> The following is the resources in /etc/cluster.conf
>  
>                 <resources>
>                         <clusterfs device="/dev/mapper/mqdata-mqdata" force_unmount="0" fsid="22567" fstype="gfs" mountpoint="/mqdata" name="mqdata" options=""/>
>                         <ip address="192.168.2.100" interface="eth0" monitor_link="1"/>
>                 </resources>
>  
> The service in /etc/cluster.conf
>  
>                 <service autostart="1" domain="mysql" exclusive="1" name="mastervip" recovery="relocate">
>                         <ip ref="192.168.2.100"/>
>                 </service>
> Any ideas?
>  
> Thanks,
> Robert Gil