[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]

Re: [Linux-cluster] fence_drac5 timeouts

$ fence_drac5 -V
3.1.5 (built Fri Feb 22 06:44:39 UTC 2013)
Copyright (C) Red Hat, Inc. 2004-2010 All rights reserved.

It's the version that came in the CentOS repos.

How do I get the fixed version?

On Thu, Aug 1, 2013 at 3:42 AM, Marek Grac <mgrac redhat com> wrote:
On 07/26/2013 04:29 AM, ch urnd wrote:
I'm trying to get fence_drac5 working on a cluster I'm setting up of two Dell R410's.  The primary issue I'm seeing are timeouts.  The fence does seem to work as the other node will get shut down, but the script always exits 1.

Please which version do you use?

This looks very likely to be bug resolved in https://git.fedorahosted.org/cgit/fence-agents.git/commit/?id=4bd62484e17cc63b27a103c744ec11fb00610b48
when autodetect of EOL was not working properly on DRAC devices when using ssh.


Here's the output:

# fence_drac5 -a --power-timeout 30 -x -l root -p calvin -c 'admin1->' -o reboot
Connection timed out

# fence_drac5 -a --power-timeout 30 -v -x -l root -p calvin -c 'admin1->' -o reboot
root 192 168 1 100's password: 
/admin1-> racadm serveraction powerstatus
Server power status: ON
/admin1-> racadm serveraction powerdown
Server power operation successful
/admin1->Traceback (most recent call last):
  File "/usr/sbin/fence_drac5", line 154, in <module>
  File "/usr/sbin/fence_drac5", line 137, in main
    result = fence_action(conn, options, set_power_status, get_power_status, get_list_devices)
  File "/usr/share/fence/fencing.py", line 838, in fence_action
    if wait_power_status(tn, options, get_power_fn) == 0:
  File "/usr/share/fence/fencing.py", line 744, in wait_power_status
    if get_power_fn(tn, options) != options["-o"]:
  File "/usr/sbin/fence_drac5", line 38, in get_power_status
    status = re.compile("(^|: )(ON|OFF|Powering ON|Powering OFF)\s*$", re.IGNORECASE | re.MULTILINE).search(conn.before).group(2)
AttributeError: 'NoneType' object has no attribute 'group'

Even though I pass "-o reboot", it still powers off.  It does the same even if I don't pass that option.

I added --power-timeout 30 in the latest test to see if that'd help but no dice.  Doesn't work without it either.

I have tried fence_ipmilan & it works great, but the iDRAC interfaces are somewhat exposed & need to use SSH for security reasons, which limits me to fence_drac5.


Linux-cluster mailing list
Linux-cluster redhat com

[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]