[dm-devel] Multipath not re-activating failed paths?
Darryl Dixon
esrever_otua at pythonhacker.is-a-geek.net
Thu Sep 14 23:53:40 UTC 2006
Hi All,
I have a working dm-multipath set up with a multipath root device. For
some reason, while multipath seems to correctly use both paths, and will
gracefully handle the failing of a path (uninterrupted IO works OK), it
does not seem to want to detect once the failed path has come back up
again. In other words, in my two-path setup, it will load balance between
the paths, continue successfully on one path when one fails, but it will
then be 'stuck' on that path forever until the next reboot, even if the
first path is back up and otherwise working fine.
>From what I can understand of the multipath.conf settings, the paths
should be tested every 5 seconds, and should be marked 'active' once they
come back up.
How can I best go about debugging/investigating this?
My setup details:
Machine: HP Blade BL25P with QLogic dual-ported HBA
Storage: Two paths to SUN 3510
OS: RHEL4 x86_64
DM package: device-mapper-multipath-0.4.5-16.1.RHEL4
uname -r: 2.6.9-42.0.2.ELsmp
contents of /etc/multipath.conf:
----------8<----------[cut]
devnode_blacklist {
devnode "^cciss!c[0-9]d[0-9]*"
}
defaults {
user_friendly_names yes
no_path_retry fail
path_grouping_policy multibus
failback immediate
}
multipaths {
multipath {
wwid 3500000e01190e340
alias os
}
}
----------8<----------[cut]
Output of multipath -l:
----------8<----------[cut]
3500000e01190e100
[size=68 GB][features="0"][hwhandler="0"]
\_ round-robin 0 [active]
\_ 0:0:3:0 sdd 8:48 [active]
\_ 1:0:3:0 sdh 8:112 [active]
3500000e01190e3f0
[size=68 GB][features="0"][hwhandler="0"]
\_ round-robin 0 [active]
\_ 0:0:1:0 sdb 8:16 [active]
\_ 1:0:0:0 sde 8:64 [active]
os (3500000e01190e340)
[size=68 GB][features="0"][hwhandler="0"]
\_ round-robin 0 [active]
\_ 0:0:0:0 sda 8:0 [active]
\_ 1:0:2:0 sdg 8:96 [active]
3500000e01190e310
[size=68 GB][features="0"][hwhandler="0"]
\_ round-robin 0 [active]
\_ 0:0:2:0 sdc 8:32 [active]
\_ 1:0:1:0 sdf 8:80 [active]
----------8<----------[cut]
Contents of /dev/mapper/:
----------8<----------[cut]
brw-rw---- 1 root disk 253, 3 Sep 15 2006 3500000e01190e100
brw-rw---- 1 root disk 253, 2 Sep 15 2006 3500000e01190e310
brw-rw---- 1 root disk 253, 1 Sep 15 2006 3500000e01190e3f0
crw------- 1 root root 10, 63 Sep 15 2006 control
brw-rw---- 1 root disk 253, 0 Sep 15 2006 os
brw-rw---- 1 root disk 253, 4 Sep 15 2006 os1
brw-rw---- 1 root disk 253, 5 Sep 15 2006 os2
brw-rw---- 1 root disk 253, 6 Sep 15 2006 os3
----------8<----------[cut]
Output of df -k:
----------8<----------[cut]
Filesystem 1K-blocks Used Available Use% Mounted on
/dev/mapper/os2 50394996 29944792 17890248 63% /
/dev/mapper/os1 101086 23801 72066 25% /boot
none 5036176 0 5036176 0% /dev/shm
----------8<----------[cut]
Any and all pointers or assistance appreciated.
regards,
Darryl Dixon
http://www.winterhouseconsulting.com
More information about the dm-devel
mailing list