[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]

Re: Problems with ext3 fs



Just to add a "me too" data point, I've experienced similar issues.
I'm using ext3 on top of an IDE software raid 1 of two identical seagate
barracuda ATA-IV drives on a dell poweredge dual xeon workstation
running redhat 7.2 with the RedHat 2.4.9-31 kernel.

There are three data partitions on each drive, each of which are
mirrored. (/boot, /, and /home). Over time, depending on overall disk
use and NOT on use on a particular filesystem, the / filesystem becomes
corrupt. Strangely enough, I can run bonnie, dd tests, copies, etc all
day on the /home and /boot ext3 filesystem and they have never become
corrupt- only the / partition does.

I have a lot of data points for this behavior.. 8 of these machines, all
identical in configuration, exhibit the same symptoms.

The corruption can be repaired with regular fscks from a rescue CD, but
if left unchecked it garbles files and eventually progresses to the
point where the root partition will not even mount on bootup.

What's aggravating my problem is that for some reason the root
filesystem is only fsck'd on boot when a power-off event occurs. If I
manually set needs_** with hdparm it is ignored (or possibly reset upon
a clean shutdown). I posted these symptoms last month in hopes that
someone had seem them before. I got some hints to check my /etc/fstab
file to make sure / gets fsck'd, but that was ok.



On Thu, 2002-04-25 at 02:48, Dmitry N. Hramtsov wrote:
> Hi,
> 
> I have read this thread and want to say that I have expirienced the same 
> problem.
> 
> Once, when I was loged in, one of my filesystems ( root filesystem :( ) 
> became read-only with the same symptoms.
> But results was more tragic.  Almost in each directory avg. 10% of files 
> had incorrect attributes, owner, links, etc.
> But filenames was correct.  So, I was obliged to re-create root 
> filesystem :(
> 
> Maybe following details will help someone to solve this problem.
> 
> Best regards,
> Dmitry N. Hramtsov
> 
> p.s. Details:
> 
> Here is a part of my crash-logs:
> Apr 21 14:47:39 aurora kernel: VFS: dquota write failed on dev 09:03
> Apr 21 14:47:39 aurora last message repeated 4 times
> Apr 21 14:58:50 aurora -- MARK --
> Apr 21 15:02:14 aurora kernel: attempt to access beyond end of device
> Apr 21 15:02:14 aurora kernel: 09:03: rw=0, want=21033212, limit=10240000
> Apr 21 15:02:14 aurora kernel: attempt to access beyond end of device
> Apr 21 15:02:14 aurora kernel: 09:03: rw=0, want=488976648, limit=10240000
> Apr 21 15:02:14 aurora kernel: attempt to access beyond end of device
> Apr 21 15:02:14 aurora kernel: 09:03: rw=0, want=210302268, limit=10240000
> Apr 21 15:02:14 aurora kernel: attempt to access beyond end of device
> Apr 21 15:02:14 aurora kernel: 09:03: rw=0, want=357644568, limit=10240000
> Apr 21 15:02:14 aurora kernel: attempt to access beyond end of device
> Apr 21 15:02:14 aurora kernel: 09:03: rw=0, want=1090029900, limit=10240000
> Apr 21 15:02:14 aurora kernel: attempt to access beyond end of device
> Apr 21 15:02:14 aurora kernel: 09:03: rw=0, want=1022394500, limit=10240000
> Apr 21 15:02:14 aurora kernel: attempt to access beyond end of device
> Apr 21 15:02:14 aurora kernel: 09:03: rw=0, want=687411540, limit=10240000
> Apr 21 15:02:14 aurora kernel: attempt to access beyond end of device
> Apr 21 15:02:14 aurora kernel: 09:03: rw=0, want=17912052, limit=10240000
> Apr 21 15:02:14 aurora kernel: attempt to access beyond end of device
> Apr 21 15:02:14 aurora kernel: 09:03: rw=0, want=1360078176, limit=10240000
> Apr 21 15:02:14 aurora kernel: attempt to access beyond end of device
> Apr 21 15:02:14 aurora kernel: 09:03: rw=0, want=1149826340, limit=10240000
> Apr 21 15:02:14 aurora kernel: attempt to access beyond end of device
> Apr 21 15:02:14 aurora kernel: 09:03: rw=0, want=150288672, limit=10240000
> Apr 21 15:02:14 aurora kernel: attempt to access beyond end of device
> Apr 21 15:02:14 aurora kernel: 09:03: rw=2, want=623969628, limit=10240000
> Apr 21 15:07:35 aurora kernel: VFS: dquota write failed on dev 09:06
> Apr 21 15:07:35 aurora last message repeated 32 times
> 
> 
> Here is my configuration:
> hdn aurora:~$ uname -a
> Linux aurora 2.4.18 #1 SMP Sun Apr 21 17:20:25 NOVST 2002 i686 unknown
> 
> hdn aurora:~$ mount
> /dev/md/3 on / type ext3 (rw,errors=remount-ro,usrquota,grpquota)
> proc on /proc type proc (rw)
> /dev/hda2 on /boot type ext2 (rw,noexec,nosuid,nodev)
> /dev/md/5 on /var type ext3 (rw,nosuid,nodev,usrquota,grpquota)
> /dev/md/6 on /home type ext3 (rw,nosuid,nodev,usrquota,grpquota)
> /dev/md/7 on /tmp type ext3 (rw,nosuid,nodev,usrquota,grpquota)
> 
> hdn aurora:~$ cat /proc/mdstat
> Personalities : [raid5]
> read_ahead 1024 sectors
> md1 : active raid5 ide/host2/bus1/target0/lun0/part1[2] 
> ide/host2/bus0/target0/lun0/part1[3] 
> ide/host0/bus1/target0/lun0/part1[0] ide/host0/bus0/target0/lun0/part1[1]
>       1023744 blocks level 5, 128k chunk, algorithm 2 [3/3] [UUU]
> 
> md3 : active raid5 ide/host2/bus1/target0/lun0/part3[2] 
> ide/host2/bus0/target0/lun0/part3[1] 
> ide/host0/bus1/target0/lun0/part3[3] ide/host0/bus0/target0/lun0/part3[0]
>       10240000 blocks level 5, 128k chunk, algorithm 2 [3/3] [UUU]
> 
> md5 : active raid5 ide/host2/bus1/target0/lun0/part5[2] 
> ide/host2/bus0/target0/lun0/part5[0] ide/host0/bus0/target0/lun0/part5[1]
>       16383744 blocks level 5, 128k chunk, algorithm 2 [3/3] [UUU]
> 
> md6 : active raid5 ide/host2/bus1/target0/lun0/part6[2] 
> ide/host2/bus0/target0/lun0/part6[3] 
> ide/host0/bus1/target0/lun0/part6[0] ide/host0/bus0/target0/lun0/part6[1]
>       16383744 blocks level 5, 128k chunk, algorithm 2 [3/3] [UUU]
> 
> md7 : active raid5 ide/host2/bus1/target0/lun0/part7[4] 
> ide/host2/bus0/target0/lun0/part7[2] 
> ide/host0/bus1/target0/lun0/part7[0] ide/host0/bus0/target0/lun0/part7[1]
>       1023744 blocks level 5, 128k chunk, algorithm 2 [3/3] [UUU]
> 
> 
> My part of dmesg (on boot):
> Apr 22 08:51:10 aurora kernel: VFS: Diskquotas version dquot_6.4.0 
> initialized
> Apr 22 08:51:10 aurora kernel: Journalled Block Device driver loaded
> Apr 22 08:51:10 aurora kernel: devfs: v1.10 (20020120) Richard Gooch 
> (rgooch atnf csiro au)
> Apr 22 08:51:10 aurora kernel: devfs: boot_options: 0x1
> Apr 22 08:51:10 aurora kernel: pty: 256 Unix98 ptys configured
> Apr 22 08:51:10 aurora kernel: Serial driver version 5.05c (2001-07-08) 
> with MANY_PORTS SHARE_IRQ SERIAL_PCI enabled
> Apr 22 08:51:10 aurora kernel: ttyS00 at 0x03f8 (irq = 4) is a 16550A
> Apr 22 08:51:10 aurora kernel: ttyS01 at 0x02f8 (irq = 3) is a 16550A
> Apr 22 08:51:10 aurora kernel: block: 128 slots per queue, batch=32
> Apr 22 08:51:10 aurora kernel: RAMDISK driver initialized: 16 RAM disks 
> of 4096K size 1024 blocksize
> Apr 22 08:51:10 aurora kernel: Uniform Multi-Platform E-IDE driver 
> Revision: 6.31
> Apr 22 08:51:10 aurora kernel: ide: Assuming 33MHz system bus speed for 
> PIO modes; override with idebus=xx
> Apr 22 08:51:10 aurora kernel: VP_IDE: IDE controller on PCI bus 00 dev 39
> Apr 22 08:51:10 aurora kernel: VP_IDE: chipset revision 6
> Apr 22 08:51:10 aurora kernel: VP_IDE: not 100%% native mode: will probe 
> irqs later
> Apr 22 08:51:10 aurora kernel:     ide0: BM-DMA at 0xa000-0xa007, BIOS 
> settings: hda:DMA, hdb:DMA
> Apr 22 08:51:10 aurora kernel:     ide1: BM-DMA at 0xa008-0xa00f, BIOS 
> settings: hdc:DMA, hdd:pio
> Apr 22 08:51:10 aurora kernel: HPT370: IDE controller on PCI bus 00 dev 70
> Apr 22 08:51:10 aurora kernel: HPT370: chipset revision 3
> Apr 22 08:51:10 aurora kernel: HPT370: not 100%% native mode: will probe 
> irqs later
> Apr 22 08:51:10 aurora kernel:     ide2: BM-DMA at 0xcc00-0xcc07, BIOS 
> settings: hde:DMA, hdf:pio
> Apr 22 08:51:10 aurora kernel:     ide3: BM-DMA at 0xcc08-0xcc0f, BIOS 
> settings: hdg:DMA, hdh:pio
> Apr 22 08:51:10 aurora kernel: hda: C/H/S=36798/16/255 from BIOS ignored
> Apr 22 08:51:10 aurora kernel: hda: IBM-DTLA-307075, ATA DISK drive
> Apr 22 08:51:10 aurora kernel: hdb: IBM-DTLA-307045, ATA DISK drive
> Apr 22 08:51:10 aurora kernel: hdc: IBM-DTLA-307075, ATA DISK drive
> Apr 22 08:51:10 aurora kernel: hde: IBM-DTLA-307045, ATA DISK drive
> Apr 22 08:51:10 aurora kernel: hdg: IBM-DTLA-307075, ATA DISK drive
> Apr 22 08:51:10 aurora kernel: ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
> Apr 22 08:51:10 aurora kernel: ide1 at 0x170-0x177,0x376 on irq 15
> Apr 22 08:51:10 aurora kernel: ide2 at 0xbc00-0xbc07,0xc002 on irq 18
> Apr 22 08:51:10 aurora kernel: ide3 at 0xc400-0xc407,0xc802 on irq 18
> Apr 22 08:51:10 aurora kernel: hda: 150136560 sectors (76870 MB) 
> w/1916KiB Cache, CHS=148945/16/63
> Apr 22 08:51:10 aurora kernel: hdb: 90069840 sectors (46116 MB) 
> w/1916KiB Cache, CHS=5606/255/63
> Apr 22 08:51:10 aurora kernel: hdc: 150136560 sectors (76870 MB) 
> w/1916KiB Cache, CHS=148945/16/63
> Apr 22 08:51:10 aurora kernel: hde: 90069840 sectors (46116 MB) 
> w/1916KiB Cache, CHS=89355/16/63, UDMA(44)
> Apr 22 08:51:10 aurora kernel: hdg: 150136560 sectors (76870 MB) 
> w/1916KiB Cache, CHS=148945/16/63, UDMA(44)
> 
> 
> 
> 
> 
> 
> _______________________________________________
> Ext3-users mailing list
> Ext3-users redhat com
> https://listman.redhat.com/mailman/listinfo/ext3-users






[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]