Re: Major speed improvements in package parsing

2009-06-03 Thread ogjunk-tika
Nice, thanks for the clarification! :) Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Jukka Zitting > To: tika-dev@lucene.apache.org > Sent: Wednesday, June 3, 2009 8:05:36 AM > Subject: Re: Major speed improvements in pac

Re: Major speed improvements in package parsing

2009-06-03 Thread Jukka Zitting
Hi, On Wed, Jun 3, 2009 at 1:33 PM, wrote: > Nice, thanks for sharing!  You observed the same speed increase pattern > after running this several times to avoid any cold/hot cache side-effects? Yes. This wasn't a carefully crafted benchmark, but I did run a number of similar test using both the

Re: Major speed improvements in package parsing

2009-06-03 Thread ogjunk-tika
ka-dev@lucene.apache.org > Sent: Wednesday, June 3, 2009 6:18:02 AM > Subject: Major speed improvements in package parsing > > Hi, > > Inspired by TIKA-236, I ran the following ad-hoc test: > > $ time java -jar tika-0.3-standalone.jar --text lucene-2.0.0-src.zip > > outpu

Major speed improvements in package parsing

2009-06-03 Thread Jukka Zitting
Hi, Inspired by TIKA-236, I ran the following ad-hoc test: $ time java -jar tika-0.3-standalone.jar --text lucene-2.0.0-src.zip > output-0.3.txt real0m29.844s user0m39.686s sys 0m0.840s $ time java -jar tika-app-0.4-SNAPSHOT.jar --text lucene-2.0.0-src.zip > output-0.4.txt real0m1