[Linux-cluster] IP Relocate Error / IP Restart error

dan.deshayes at algitech.com
Mon Jul 9 14:06:40 UTC 2007


Hi,
thanks for the reply, but I'm not sure that's my problem.
I couldn't find the syntax for disabling exclusivity (I'm not using the GUI),
but as far as I've understood it's disabled by default. I tried with
exclusive="0" (not sure if that's the right syntax though), but it didn't solve
my problem.
But if the cluster were running in exclusive mode, relocation
shouldn't work either, right?
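
For reference, this is roughly where I put the attribute: on the service
element, the same place Robert has it in his config quoted further down.
A sketch only, using my www-project1 service; the value "0" is my guess at
how to switch it off, and the other attributes are copied from my config below.

```xml
<!-- exclusive="0" is assumed to mean "not exclusive"; as far as I
     understand, that is also the behaviour when the attribute is omitted.
     <ip1> is a placeholder for the real address, as in the config below. -->
<service autostart="1" domain="www-project1" exclusive="0"
         name="www-project1" recovery="restart">
        <ip ref="<ip1>"/>
        <script ref="psql-db"/>
</service>
```
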
As stated earlier, the service restarts fine as long as the node already
has an external IP.
Does anyone have other ideas? Maybe it's related to the "IP monitor failing
periodically" issue? But I don't have any problems running the cluster as long as
the bond0 interface goes down, so maybe not.

Regards, Dan.

> I managed to figure out this problem in my environment. It had to do
> with the service being set to exclusive. When a service is set to
> exclusive, it is the only service allowed to run on that box. Which
> means no other services, such as mysql, can run on that server. So, if
> the ip is taken down, it will not start up again, because it does not
> have exclusivity to the server. I have no need for exclusivity in any
> way. If you're running multiple services, don't use it.
>
>
> Robert Gil
> Linux Systems Administrator
> American Home Mortgage
>
>
> -----Original Message-----
> From: linux-cluster-bounces at redhat.com
> [mailto:linux-cluster-bounces at redhat.com] On Behalf Of Dan Deshayes
> Sent: Friday, July 06, 2007 10:36 AM
> To: linux clustering
> Subject: Re: [Linux-cluster] IP Relocate Error / IP Restart error
>
> Hello,
> I'm bumping this question since I'm experiencing a similar problem.
> When one of my services fails and the cluster tries to restart it,
> the node withdraws the IP and route.
> It seems that it can't set up the IP again once it has been withdrawn. It can
> fail over between nodes which hold other IP numbers though, but never
> back, except when I manually put back the IP and route.
> I don't want to relocate the service just because sms-pixie fails, but only to
> restart it (it stops when it loses connection to a server).
> I'm using bond and my configuration looks like this:
>
>                 <resources>
>                         <script file="/etc/init.d/mysqld" name="mysqld-db"/>
>                         <script file="/etc/init.d/postgresql" name="psql-db"/>
>                         <script file="/etc/init.d/sms-pixe" name="sms-pixie"/>
>                         <ip address="<ip1>" interface="bond0" monitor_link="1"/>
>                         <ip address="<ip2>" interface="bond0" monitor_link="1"/>
>                 </resources>
>                 <service autostart="1" domain="www-project1" name="www-project1" recovery="restart">
>                         <ip ref="<ip1>"/>
>                         <script ref="psql-db"/>
>                 </service>
>                 <service autostart="1" domain="www-project2" name="www-project2" recovery="restart">
>                         <ip ref="<ip2>"/>
>                         <script ref="mysqld-db"/>
>                 </service>
>
> Any thoughts would be appreciated.
>
> Regards, Dan
>
>
> Robert Gil wrote:
>
>> I have an IP address as a resource. I have the ip address in a 2 node
>> failover domain (total 4 nodes).
>>
>> When I run ifconfig eth0:1 down
>>
>> The service shows as stopped in clustat and the following errors show
>> in the logs
>>
>> Jun  1 12:25:36 <host> clurgmgrd[5346]: <warning> #71: Relocating failed service mastervip
>> Jun  1 12:25:36 <host> clurgmgrd[5346]: <warning> #70: Attempting to restart service mastervip locally.
>> Jun  1 12:25:37 <host> clurgmgrd[5346]: <notice> Recovering failed service mastervip
>> Jun  1 12:25:37 <host> clurgmgrd[5346]: <notice> start on ip:192.168.2.100 returned 1 (generic error)
>> Jun  1 12:25:37 <host> clurgmgrd[5346]: <warning> #68: Failed to start mastervip; return value: 1
>> Jun  1 12:25:37 <host> clurgmgrd[5346]: <notice> Stopping service mastervip
>> Jun  1 12:25:37 <host> clurgmgrd[5346]: <notice> Service mastervip is stopped
>>
>> The following are the resources in /etc/cluster/cluster.conf
>>
>>                 <resources>
>>                         <clusterfs device="/dev/mapper/mqdata-mqdata" force_unmount="0" fsid="22567" fstype="gfs" mountpoint="/mqdata" name="mqdata" options=""/>
>>                         <ip address="192.168.2.100" interface="eth0" monitor_link="1"/>
>>                 </resources>
>>
>> The service in /etc/cluster/cluster.conf
>>
>>                 <service autostart="1" domain="mysql" exclusive="1" name="mastervip" recovery="relocate">
>>                         <ip ref="192.168.2.100"/>
>>                 </service>
>> Any ideas?
>>
>> Thanks,
>>
>>
>> Robert Gil
>> Linux Systems Administrator
>> American Home Mortgage
>> Phone: 631-622-8410
>> Cell: 631-827-5775
>> Fax: 516-495-5861
>>
>>
>>------------------------------------------------------------------------
>>
>>--
>>Linux-cluster mailing list
>>Linux-cluster at redhat.com
>>https://www.redhat.com/mailman/listinfo/linux-cluster
>>
>




