disk problems or false alarm??

Guolin Cheng guolin at alexa.com
Fri Apr 30 17:54:59 UTC 2004


Hi, 

 

 I run hundreds of Redhat 8.0 boxes and Fedora Core 1 boxes, both
Operation systems boxes give me some trouble reporting disk errors like
the following (collected from /var/log/messages of each linux boxes by
my own script). And a "badblocks" command on some of the related hard
drive reports that failed sectors found, while others reports no,
false-positive. Any one can give me suggestions or hints?

 

Thanks a lot.

 

......

Host:       arc242

arc242:     Apr 29 13:51:32 arc242 kernel: hdb: dma_intr: status=0x51 {
DriveReady SeekComplete Error }

arc242:     Apr 29 13:51:32 arc242 kernel: hdb: dma_intr: error=0x01 {
AddrMarkNotFound }, LBAsect=38613129, sector=38613064

arc242:     Apr 29 13:51:36 arc242 kernel: hdb: dma_intr: status=0x51 {
DriveReady SeekComplete Error }

arc242:     Apr 29 13:51:36 arc242 kernel: hdb: dma_intr: error=0x01 {
AddrMarkNotFound }, LBAsect=38613129, sector=38613064

arc242:     Apr 29 13:51:43 arc242 kernel: hdb: dma_intr: status=0x51 {
DriveReady SeekComplete Error }

arc242:     Apr 29 13:51:43 arc242 kernel: hdb: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=38613129, sector=38613064

arc242:     Apr 29 13:51:43 arc242 kernel: end_request: I/O error, dev
03:41 (hdb), sector 38613064

arc242:     Apr 29 13:51:49 arc242 kernel: hdb: dma_intr: status=0x51 {
DriveReady SeekComplete Error }

arc242:     Apr 29 13:51:49 arc242 kernel: hdb: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=38613129, sector=38613064

arc242:     Apr 29 13:51:49 arc242 kernel: end_request: I/O error, dev
03:41 (hdb), sector 38613064

 

Host:       arc292

arc292:     Apr 29 04:02:27 arc292 kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }

arc292:     Apr 29 04:02:27 arc292 kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=239379157, high=14, low=4498133,
sector=331888

arc292:     Apr 29 04:02:27 arc292 kernel: end_request: I/O error, dev
03:0b (hda), sector 331888

arc292:     Apr 29 04:02:29 arc292 kernel: hda: dma_intr: status=0x51 {
DriveReady SeekComplete Error }

arc292:     Apr 29 04:02:29 arc292 kernel: hda: dma_intr: error=0x40 {
UncorrectableError }, LBAsect=239379157, high=14, low=4498133,
sector=331888

arc292:     Apr 29 04:02:29 arc292 kernel: end_request: I/O error, dev
03:0b (hda), sector 331888

 

...... blahblah...

 

I tried to run "badblocks" on the boxes to test whether there are real
hardware problems, then I got some of them really reports problems, and
some of them NOT. Anyone know why?

 

 

[root at arc242 root]# badblocks -s -v -n -b 512 -c 4096 /dev/hdb 38620000
38600000

Checking for bad blocks in non-destructive read-write mode

>From block 38600000 to 38620000

Checking for bad blocks (non-destructive read-write test)

Testing with random pattern: done                        

Pass completed, 0 bad blocks found.

[root at arc242 root]#

 

[root at arc292 root]#  badblocks -s -v -n -b 512 -c 4096 /dev/hda
239400000 239300000

Checking for bad blocks in non-destructive read-write mode

>From block 239300000 to 239400000

Checking for bad blocks (non-destructive read-write test)

Testing with random pattern: 239379104/239400000

239379105

done                        

Pass completed, 2 bad blocks found.

[root at arc292 root]# 

[root at arc292 root]#

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://listman.redhat.com/archives/fedora-list/attachments/20040430/ba15ce69/attachment-0001.htm>


More information about the fedora-list mailing list