[Techtalk] Faster badblock scans?

jas at spamcop.net jas at spamcop.net
Tue Aug 26 10:57:48 EST 2003


Quoting Julie <txjulie at austin.rr.com>:

> Maria Blackmore wrote:
> > On Mon, 25 Aug 2003, Julie wrote:
> >
> >
> >>Does anyone know of a faster way to scan IDE drives for bad blocks?
> >
> >
> > Hi,
> >
> > I'm sorry to say this, but don't bother.  Don't just delete this post as
> > being unhelpful, please read it and take note, it really is for your own
> > good.
> 
> While I appreciate the rant, and could quite easily have written
> it myself, I'd still like an answer to my question.
> 
> For what it's worth, the drives are still in excellent condition,
> with far fewer than 1/100th of 1% of the blocks bad.  Last nights
> scan was of 10,000,000 blocks, of which 9 came up bad.  One bad
> block in 1,000,000 is hardly proof of pending calamity.  Mostly
> the problem is that of the 60,000,000 blocks of IDE drive on this
> machine, 40 or 50 of them seem to have gone bad over the past 12
> to 18 months.

Why aren't they being mapped out by the drive's firmware, though? Is that 
somehow disabled (in which case, just enable it!) - or is that defect list 
already full, at which point the defects you're finding now are just the tip of 
the iceberg - and the disk's dying. (Someone said on another list that IBM 
drives don't do this - if it's an IBM drive, that could also explain the 
problem.)

> And while rants are often fun, in the same sense that a train wreck
> can be fun to watch, the proof that Linux isn't ready for enterprise
> computing is that Linux can't survive having a couple of bad spots
> crop up on a large disk array without barfing.  Seeing as the 240GB
> on this machine is slowly filling, and I expect to have 1TB on it
> before year's end, I'd like to be running an operating system that
> can survive the occasional non-recoverable disk error.

Linux just relies (quite reasonably, IMHO) on the disk's firmware handling that, 
as it's supposed to. Being more tolerant of defective drives would be nice, 
though - and both Windows NT/2k/XP and Linux handle this in the same way: you 
need to be using the fault-tolerant drivers (FTDISK.SYS under NT, software RAID 
under Linux) to get this extra layer of protection. The ordinary drivers just 
assume the drive does what it says on the tin. For 1 Tb of data, there will be 
several drives involved anyway: would adding one more to get RAID 5 be a 
problem? That would avoid the whole issue, it seems: you'd get the extra layer 
of remapping, and you'd be able to recover the lost data instead of just 
identifying it.


James.


More information about the Techtalk mailing list