Duplicated files in the pristine FC4t2 installation

Roland McGrath roland at redhat.com
Mon May 2 19:35:52 UTC 2005


> Roland McGrath wrote:
> > I think what one clearly wants is for rpm to maintain an installed file
> > indexed keyed by md5sum.  Then you can have a tool that just uses this
> > database to identify duplicates (and doesn't take forever), or have rpm do
> > so itself when installing new files.
> > 
> 
> Hmm, what about hash collisions, that would be really really BAD

If you are concerned about them you can still compare contents before
declaring two files identical.  But using the hashes as the main detector
makes it fast, since you only examine the data of files that are 99.999%
likely to be identical.




More information about the fedora-devel-list mailing list