Hi Seb, On Thu, Aug 9, 2012 at 10:38 PM, Sebastian Nagel <[email protected]> wrote: > Hi, > > I just discovered that some jar files > in the bin package (1.5.1) and also in nutch.job > are packed twice:
OK so currently there is an open issue for something (similar) to this. I think the ticket is open for 2.x (I cannot confirm as Jira is temporarily down being migrated to dedicated slave). You raise a very valid point though. I would like to ask the following 1) Can we confirm that some classes are always loaded from /lib as oppose to plugins/parse-tika/tika-core.jar for example 2) My feeling is that this cannot always be the case. We have some plugins e.g. automaton and parse-swf where there is a dedicated jar file available in plugins/regex-automaton/lib for example. 3) Another problem as you highlight is that we have duplicate versions of vaious jar's which are pulled as transitive dependencies when we fetch deps with Ivy. To resolve this we need to open a dependency deduplication session and sort this out. I am doing this over at Gora atm and once I have an accurate and reasonable way to to do (maven-dependency-plugin?) then I will try a similar approach with Nutch. This should also address the open Jira issue. Lewis

