+1, with comments :)

Is this just bytecode (i.e., class files), or is it a hash of the contents
of all files? The latter sounds good for arbitrary zip files (and other
archives when supported).
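
To make the two options concrete, here is a minimal sketch of a hash that
ignores how the archive was compressed. It is not the code from
truehash.tar.gz; the class name ContentHash, the classFilesOnly switch, the
zipOf demo helper, and the choice of SHA-1 are all assumptions made for
illustration. Each entry's uncompressed bytes are digested, and the per-entry
digests are combined in entry-name order, so recompressing or reordering the
archive should not change the result.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Map;
import java.util.TreeMap;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical illustration only; not part of MRM or the truehash project.
class ContentHash {

    /** Hashes the uncompressed entry contents of a zip/jar stream. */
    static String contentHash(InputStream archive, boolean classFilesOnly)
            throws IOException, NoSuchAlgorithmException {
        // TreeMap keeps entries sorted by name, so entry order is irrelevant.
        Map<String, byte[]> digests = new TreeMap<>();
        try (ZipInputStream zip = new ZipInputStream(archive)) {
            ZipEntry entry;
            byte[] buf = new byte[8192];
            while ((entry = zip.getNextEntry()) != null) {
                if (entry.isDirectory()) continue;
                if (classFilesOnly && !entry.getName().endsWith(".class")) continue;
                // Digest the decompressed bytes of this entry.
                MessageDigest md = MessageDigest.getInstance("SHA-1");
                int n;
                while ((n = zip.read(buf)) != -1) {
                    md.update(buf, 0, n);
                }
                digests.put(entry.getName(), md.digest());
            }
        }
        // Combine name + separator + fixed-length digest for every entry.
        MessageDigest all = MessageDigest.getInstance("SHA-1");
        for (Map.Entry<String, byte[]> e : digests.entrySet()) {
            all.update(e.getKey().getBytes(StandardCharsets.UTF_8));
            all.update((byte) 0);
            all.update(e.getValue());
        }
        StringBuilder hex = new StringBuilder();
        for (byte b : all.digest()) {
            hex.append(String.format("%02x", b));
        }
        return hex.toString();
    }

    /** Demo helper: builds an in-memory zip from {name, content} pairs. */
    static byte[] zipOf(int level, String[][] files) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(out)) {
            zip.setLevel(level); // 0 = no compression, 9 = best compression
            for (String[] f : files) {
                zip.putNextEntry(new ZipEntry(f[0]));
                zip.write(f[1].getBytes(StandardCharsets.UTF_8));
                zip.closeEntry();
            }
        }
        return out.toByteArray();
    }
}
```

With classFilesOnly set to true this approximates the "bytecode only" reading;
with false it covers every file in the archive, which is the variant that
would carry over to arbitrary zip files.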

Cheers,
Brett

Joakim Erdfelt wrote:
> I would like to see a new hash technique added to the index.
> 
> Here's the scenario.
> 
> You have an anonymous ant.jar, and want to find out what it is.
> Using the traditional hashing technique, the entire file is hashed and a
> unique hash identification string is calculated.
> 
> A problem occurs when the jar file being checked against the MRM index
> has had its contents recompressed without any real change to the bytecode:
> the whole-file hash changes, so the lookup fails even though the classes
> are identical.
> 
> I propose adding another key to the Lucene document for an Artifact
> called "Bytecode Hash".
> 
> I've mocked up a simple demonstration of this hash technique in a small
> maven project located at http://joakim.erdfelt.com/truehash.tar.gz
> 
> I am willing to implement a patch for this.
> 
> - Joakim
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
> 
