[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]

Re: [dm-devel] RDAC path checker status change messages




On Wed, 24 Jun 2009, Charlie Brady wrote:

 If your concern is that the last message is "down", then we should be
 having a static variable and print up or ghost message only once (when
 we toggle from down to up/ghost).

You do have a static variable, so that messages are only printed when the status changes.

I think I need to retract that. The messages appear to be printed when the status is not changing, at least for the "path down" messages.

I've patched scsi_dh_rdac.c so that SUN/LCSM100_I is included in rdac_dev_list[], and restarted iscsi and multipathd. I've now provoked a path failure via iptables. Here are the logs I see (with unpatched multipathd) - the duplicate message suppression from the rdac path checker is not working:

Jun 25 12:35:02 sun4150node1 kernel: device-mapper: multipath: Using scsi_dh module scsi_dh_rdac for failover/failback and
 device management.
Jun 25 12:35:08 sun4150node1 multipathd: cannot open /sbin/dasd_id : No such file or directory Jun 25 12:35:08 sun4150node1 multipathd: cannot open /sbin/gnbd_import : No such file or directory Jun 25 12:35:08 sun4150node1 multipathd: [copy.c] cannot open /sbin/dasd_id Jun 25 12:35:08 sun4150node1 multipathd: cannot copy /sbin/dasd_id in ramfs : No such file or directory Jun 25 12:35:08 sun4150node1 multipathd: [copy.c] cannot open /sbin/gnbd_import Jun 25 12:35:08 sun4150node1 multipathd: cannot copy /sbin/gnbd_import in ramfs : No such file or directory
Jun 25 12:35:08 sun4150node1 multipathd: mpath0: event checker started
Jun 25 12:35:02 sun4150node1 kernel: device-mapper: multipath: Using scsi_dh module scsi_dh_rdac for failover/failback and
 device management.
Jun 25 12:46:55 sun4150node1 kernel: ping timeout of 5 secs expired, last rx 1703243, last ping 1708243, now 1713243 Jun 25 12:46:55 sun4150node1 kernel: connection3:0: iscsi: detected conn error (1011) Jun 25 12:46:55 sun4150node1 iscsid: Kernel reported iSCSI connection 3:0 error (1011) state (3) Jun 25 12:48:55 sun4150node1 kernel: session3: iscsi: session recovery timed out after 120 secs
Jun 25 12:48:55 sun4150node1 kernel: iscsi: cmd 0x12 is not queued (8)
Jun 25 12:48:55 sun4150node1 multipathd: path checkers start up
Jun 25 12:48:55 sun4150node1 multipathd: sdd: rdac checker reports path is down Jun 25 12:48:55 sun4150node1 kernel: device-mapper: multipath: Failing path 8:48. Jun 25 12:48:55 sun4150node1 multipathd: checker failed path 8:48 in map mpath0
Jun 25 12:48:55 sun4150node1 multipathd: mpath0: remaining active paths: 1
Jun 25 12:48:55 sun4150node1 multipathd: dm-2: add map (uevent)
Jun 25 12:48:55 sun4150node1 multipathd: dm-2: devmap already registered
Jun 25 12:49:00 sun4150node1 kernel: iscsi: cmd 0x12 is not queued (8)
Jun 25 12:49:00 sun4150node1 multipathd: sdd: rdac checker reports path is down
Jun 25 12:49:35 sun4150node1 last message repeated 7 times
Jun 25 12:49:35 sun4150node1 last message repeated 7 times
Jun 25 12:50:05 sun4150node1 last message repeated 6 times
Jun 25 12:50:05 sun4150node1 kernel: iscsi: cmd 0x28 is not queued (8)
Jun 25 12:50:05 sun4150node1 kernel: sd 14:0:0:0: SCSI error: return code = 0x00010000 Jun 25 12:50:05 sun4150node1 kernel: end_request: I/O error, dev sdd, sector 0 Jun 25 12:50:05 sun4150node1 kernel: Buffer I/O error on device sdd, logical block 0 Jun 25 12:50:05 sun4150node1 kernel: Buffer I/O error on device sdd, logical block 1 Jun 25 12:50:05 sun4150node1 kernel: Buffer I/O error on device sdd, logical block 2 Jun 25 12:50:05 sun4150node1 kernel: Buffer I/O error on device sdd, logical block 3
Jun 25 12:50:05 sun4150node1 kernel: iscsi: cmd 0x28 is not queued (8)


[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]