[Linux-cluster] nanny segfault problem

Christopher Barry christopher.barry at qlogic.com
Tue Nov 13 20:05:39 UTC 2007


Greetings All,

running RHEL4U5

I have a bunch of services on my cluster w/ access via redundant
directors.

I've created a generic service checking script, which I'm specifying in
lvs.cf's 'send_program' config parameter.

script is attached to this post. see that for how it works with the
symlinks described below.

I create symlinks to the script for every service I want to check, with
their name containing the port to hit, as in:
/sbin/lvs-<port>.sh

so the symlink name to check ssh availability, for instance, is:
/sbin/lvs-22.sh

The script works fine, and returns the first contiguous block of
[[:alnum:]] text data from the connection attempt for use with the
expect line of lvs.cf.


The problem is, when nanny is spawned by pulse, all of the nanny
processes segfault.

> Nov 13 14:40:44 kop-sds-dir-01 lvs[17740]: create_monitor for ssh_access/kop-sds-01 running as pid 17749
> Nov 13 14:40:44 kop-sds-dir-01 nanny[17749]: making 10.32.12.11:22 available
> Nov 13 14:40:44 kop-sds-dir-01 kernel: nanny[17749]: segfault at 000000000000006c rip 000000335e570810 rsp 0000007fbfffe978 error 4

this occurs almost instantly for every nanny process.

Can anyone venture a guess as to what is happening?

see my lvs.cf here:
http://nanny-error.pastebin.com/m592f7911



-- 
Regards,
-C

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071113/0bac5274/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: A_POLICY_VIOLATED_FILE_WAS_DETECTED_AND_REMOVED.TXT
Type: application/x-shellscript
Size: 150 bytes
Desc: A_POLICY_VIOLATED_FILE_WAS_DETECTED_AND_REMOVED.TXT
URL: <http://listman.redhat.com/archives/linux-cluster/attachments/20071113/0bac5274/attachment.bin>


More information about the Linux-cluster mailing list