Jana, Kumar Raja
Thu, 17 Dec 2009 22:19:55 -0800
Hi Chris,

The tika-core jar does not contain the classes from tika-parsers.jar, nor those under the tika-app/src folder. Please correct me if I am looking at the wrong folder. I usually run "mvn dependency:copy-dependencies" after building Tika on my machine. In the resulting target/dependency folder, I see that the tika-core jar and the tika-parsers jar are dependencies of tika-app. Also, tika-app has tika-cli and tika-gui, which I find very useful.
Thanks,
Kumar

-----Original Message-----
From: Mattmann, Chris A (388J) [mailto:chris.a.mattm...@jpl.nasa.gov]
Sent: Friday, December 18, 2009 11:33 AM
To: tika-user@lucene.apache.org
Cc: tika-...@lucene.apache.org
Subject: Re: Tika jar without dependencies

....CC'ing to tika-user@, because this is probably more a question for the user list...

Hi Kumar,

I think what you want is the tika-core jar -- it contains only the core tika classes, no dependencies, and no parsing or app code. Give that a try. You can find that jar in your m2 repo, likely: $HOME/.m2/repository/org/apache/tika.

HTH,
Chris

On 12/17/09 9:58 PM, "Jana, Kumar Raja" <kj...@ptc.com> wrote:

> Hi,
>
> Is it possible to add one more target in Tika 0.6 build scripts to
> generate a Tika-classes-only jar along with the tika-standalone.jar
> (which contains the Tika classes + all the dependent jars)?
>
> This is 'coz we usually use most of the dependent jars in our
> application and adding the Tika standalone jar to the classpath is
> duplicating the code already present. Although, not a major concern in
> most cases but sometimes the dependent jars' code in Tika standalone
> overrides those in the actual patched jars (we used patched versions of
> log4j and a few other jars).
>
> I tried changing the scripts in Tika-app folder and built the Tika 0.5
> source to achieve this. The only change is to remove/comment the
> <Embed-Dependency> and <Embed-Transitive> tags (lines 55 and 56) in
> tika-app\pom.xml file. I would love to see if this gets integrated into
> the new release.
>
> Thanks,
>
> Kumar

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.mattm...@jpl.nasa.gov
WWW: http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
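
For anyone hitting the override problem described in the quoted message (classes embedded in the standalone jar shadowing a patched jar), one way to confirm which copy wins is to ask the JVM where a class was actually loaded from. The following is only a minimal sketch using the standard java.security.CodeSource API. The class name ClassOrigin and the helper locate are made up for illustration, and org.apache.log4j.Logger is just an example default, assuming log4j 1.x is on your classpath.

```java
import java.security.CodeSource;

// Hypothetical helper, not part of Tika: reports where a class was loaded from.
final class ClassOrigin {

    /** Returns the jar or directory a class came from, or a placeholder if the JVM reports none. */
    static String locate(Class<?> type) {
        CodeSource source = type.getProtectionDomain().getCodeSource();
        if (source == null || source.getLocation() == null) {
            // Typically the case for classes loaded by the bootstrap class loader.
            return "(bootstrap or unknown code source)";
        }
        return source.getLocation().toString();
    }

    public static void main(String[] args) {
        // Example default only; pass the class names you care about as arguments.
        String[] names = args.length > 0 ? args : new String[] {"org.apache.log4j.Logger"};
        ClassLoader loader = ClassOrigin.class.getClassLoader();
        for (String name : names) {
            try {
                // 'false' avoids running static initializers just to inspect the class.
                Class<?> type = Class.forName(name, false, loader);
                System.out.println(name + " <- " + locate(type));
            } catch (ClassNotFoundException e) {
                System.out.println(name + " is not on the classpath");
            }
        }
    }
}
```

Run it with the same classpath as your application. If the printed location is the standalone jar rather than your patched jar, the embedded copy is the one being picked up.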