WIP/POC archiva-jarinfo

Joakim Erdfelt Mon, 10 Mar 2008 21:25:37 -0700

I've been working on and off on a few different archiva related tools /tasks / libs.

Brett and Wendy convinced me to upload what I got and outline what I'vegot in mind to let the creative juices flow. (besides, I'm running outof time to commit to archiva, so this work will be slow to progress if ido it alone).


Concept: archiva-jarinfo.

A library for jar indexing / searching / identification for localrepositories, arbitrary directories of jars, and even remote repositories.


For use by ...

* Archiva itself as a possible replacement for repository scanning,indexing, and searching.(Searching on checksums, filenames, classnames, imports,identification fields, and even public / exposed methods)* Archiva RepoMan WebStart Tool - a tool I've been wanting to helpidentify and upload content to an Archiva repository.* Archiva Maven Plugin - imagine typing $ mvn archiva:search-Dquery=Logger and getting hits onlog4j, slf4j, commons-logging, plexus-logging, etc... found fromresults from local repository and remote repository.* Q4E integration - adding some ability to q4e to search localrepository and remote repositories for dependencies.


Some details.

(Some of this exists and works, Some of it does not, remember this is aWork in Progress)

The existing repository scanning / indexing in Archiva server makes someassumptions that have proven to be misguided (such as only searching fornew content based on timestamp). The new approach that archiva-jarinfotakes is to mitigate the time consuming part of the scan that the newcontent timestamp check attempts to avoid, the processing of the jar file.This is done by checking for a new xml file with the contents of the jarfile (called ${artifact}-${version}.jarinfo), if the file exists, it'sup to date, if it doesn't exist, the jar details are collected and thejarinfo file is created.I've seen this useful if you sync or copy repository directories too. asthe jarinfo files come along for the ride and reduce the requirementsfor archiva to determine the jar details yet again.The scan creates a Jar Info Bundle (*.jib file) that is just a jar filewith all of the *.jarinfo xml files in it, for consumption by remoteJarInfo clients to use for indexing purposes.

The JarInfo client uses the JarInfo lib to create an index forchecksums, jar content filenames, and public/exposed bytecode information.

The JarInfo client can search local repos, remote repos, and evenarbitrary directories of jar files.

The JarInfo client can take an anonymous Jar file and perform a seriesof identification checks in an attempt to identify the Jar file based onjar file contents, and even similarity to jar files found in the JarInfoindexes.

That's all the info I can squeeze out tonite, hopefully someone elsewill find this useful.


Thanks,
- Joakim

WIP/POC archiva-jarinfo

Reply via email to