Re: WIP/POC archiva-jarinfo

Brett Porter Wed, 19 Mar 2008 05:02:04 -0700

I've only taken a quick look. I do like the idea of gathering moreinformation so this is a good avenue to pursue.


I have some really fundamental questions first though, so help me out:
- how would this replace scanning if it only deals with JARs?

- what are the assumptions that have been proven to be misguided thatyou refer to?- I haven't looked at the code, but my initial reaction was that iteither overlaps or copies a lot from current archiva classes or maven-shared-jar - is that correct or am I missing something?

When I think about collecting more information about artifacts, Ireally think about decorators that can be added (so you can add yourown). Wouldn't this be best done as a small set of those? (which couldbe integrated into trunk now - in fact it was something I wanted tolook at tomorrow to add the new archetype descriptor indexing now thatits released)


Thanks,
Brett

On 11/03/2008, at 3:25 PM, Joakim Erdfelt wrote:

I've been working on and off on a few different archiva relatedtools / tasks / libs.
Brett and Wendy convinced me to upload what I got and outline whatI've got in mind to let the creative juices flow. (besides, I'mrunning out of time to commit to archiva, so this work will be slowto progress if i do it alone).
Concept: archiva-jarinfo.
A library for jar indexing / searching / identification for localrepositories, arbitrary directories of jars, and even remoterepositories.
For use by ...
* Archiva itself as a possible replacement for repository scanning,indexing, and searching.(Searching on checksums, filenames, classnames, imports,identification fields, and even public / exposed methods)* Archiva RepoMan WebStart Tool - a tool I've been wanting to helpidentify and upload content to an Archiva repository.* Archiva Maven Plugin - imagine typing $ mvn archiva:search -Dquery=Logger and getting hits onlog4j, slf4j, commons-logging, plexus-logging, etc... found fromresults from local repository and remote repository.* Q4E integration - adding some ability to q4e to search localrepository and remote repositories for dependencies.
Some details.
(Some of this exists and works, Some of it does not, remember thisis a Work in Progress)
The existing repository scanning / indexing in Archiva server makessome assumptions that have proven to be misguided (such as onlysearching for new content based on timestamp). The new approachthat archiva-jarinfo takes is to mitigate the time consuming part ofthe scan that the new content timestamp check attempts to avoid, theprocessing of the jar file.This is done by checking for a new xml file with the contents of thejar file (called ${artifact}-${version}.jarinfo), if the fileexists, it's up to date, if it doesn't exist, the jar details arecollected and the jarinfo file is created.I've seen this useful if you sync or copy repository directoriestoo. as the jarinfo files come along for the ride and reduce therequirements for archiva to determine the jar details yet again.The scan creates a Jar Info Bundle (*.jib file) that is just a jarfile with all of the *.jarinfo xml files in it, for consumption byremote JarInfo clients to use for indexing purposes.
The JarInfo client uses the JarInfo lib to create an index forchecksums, jar content filenames, and public/exposed bytecodeinformation.
The JarInfo client can search local repos, remote repos, and evenarbitrary directories of jar files.
The JarInfo client can take an anonymous Jar file and perform aseries of identification checks in an attempt to identify the Jarfile based on jar file contents, and even similarity to jar filesfound in the JarInfo indexes.
That's all the info I can squeeze out tonite, hopefully someone elsewill find this useful.
Thanks,
- Joakim


--
Brett Porter
[EMAIL PROTECTED]
http://blogs.exist.com/bporter/

Re: WIP/POC archiva-jarinfo

Reply via email to