Well, that's a high-level entry point into Tika... the question is, inside that method, which parser was invoked.
Were there any exceptions in your run? Mike McCandless http://blog.mikemccandless.com On Tue, Aug 30, 2011 at 12:20 PM, Mark Kerzner <[email protected]> wrote: > Yes, I know the precise line (from stepping through in the debugger) > String text = tika.parseToString(new FileInputStream(new File(fileName)), > metadata); > Thank you, > Mark > > On Tue, Aug 30, 2011 at 11:15 AM, Michael McCandless > <[email protected]> wrote: >> >> Hmm any idea which document types are leading to the open files? >> >> Or, did you hit any exceptions while parsing the docs? Might help us >> narrow down which parser isn't closing its temp file... >> >> Mike McCandless >> >> http://blog.mikemccandless.com >> >> On Tue, Aug 30, 2011 at 12:07 PM, Mark Kerzner <[email protected]> >> wrote: >> > Hi, >> > I am using the tika-app-1.0-SNAPSHOT.jar from 08/02, and it leaves some >> > files open, as you can see below. Once I parse enough files, I get a >> > "too >> > many files open" error. I used the snapshot because of a feature that I >> > had >> > there (don't remember which one right now). >> > Any advice? >> > Thank you, >> > Mark >> > >> > >> > mark@mark-desktop:/proc$ ls -l 27933/fd >> > total 0 >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 0 -> pipe:[19113179] >> > l-wx------ 1 mark mark 64 2011-08-30 10:58 1 -> pipe:[19113180] >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 10 -> >> > /home/mark/NetBeansProjects/FreeEed/lib/jackson-core-asl-1.5.2.jar >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 11 -> >> > /home/mark/NetBeansProjects/FreeEed/lib/jackson-mapper-asl-1.5.2.jar >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 12 -> >> > /home/mark/NetBeansProjects/FreeEed/lib/commons-configuration-1.6.jar >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 13 -> >> > /home/mark/NetBeansProjects/FreeEed/lib/tika-app-1.0-SNAPSHOT.jar >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 14 -> >> > /home/mark/NetBeansProjects/FreeEed/lib/commons-lang-2.6.jar >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 15 -> >> > /home/mark/NetBeansProjects/FreeEed/lib/commons-collections-3.2.1.jar >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 16 -> >> > /home/mark/NetBeansProjects/FreeEed/lib/commons-digester-2.1.jar >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 17 -> >> > /home/mark/NetBeansProjects/FreeEed/lib/lucene-core-3.0.3.jar >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 18 -> >> > /home/mark/NetBeansProjects/FreeEed/lib/junit-4.8.2.jar >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 19 -> >> > /home/mark/NetBeansProjects/FreeEed/lib/guava-r09.jar >> > l-wx------ 1 mark mark 64 2011-08-30 10:58 2 -> pipe:[19113181] >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 20 -> >> > >> > /home/mark/NetBeansProjects/FreeEed/lib/truezip-samples-7.3-rc-1-jar-with-dependencies.jar >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 21 -> >> > /usr/lib/jvm/java-6-sun-1.6.0.26/jre/lib/jce.jar >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 22 -> >> > /home/mark/NetBeansProjects/FreeEed/freeeed_output/staging/inventory >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 23 -> >> > >> > /home/mark/NetBeansProjects/FreeEed/freeeed_output/staging/input00001.zip >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 25 -> /dev/random >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 26 -> /dev/urandom >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 27 -> >> > /usr/lib/jvm/java-6-sun-1.6.0.26/jre/lib/ext/sunpkcs11.jar >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 28 -> >> > /tmp/apache-tika-363283955479395764.tmp (deleted) >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 29 -> >> > /tmp/apache-tika-363283955479395764.tmp (deleted) >> > l-wx------ 1 mark mark 64 2011-08-30 10:58 3 -> >> > /usr/lib/jvm/java-6-sun-1.6.0.26/jre/lib/rt.jar >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 30 -> socket:[19118543] >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 31 -> >> > /usr/lib/jvm/java-6-sun-1.6.0.26/jre/lib/resources.jar >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 35 -> >> > /usr/lib/jvm/java-6-sun-1.6.0.26/jre/lib/charsets.jar >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 4 -> socket:[19113557] >> > lr-x------ 1 mark mark 64 2011-08-30 10:59 5 -> >> > /home/mark/NetBeansProjects/FreeEed/lib/commons-cli-1.2.jar >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 6 -> >> > /home/mark/NetBeansProjects/FreeEed/lib/commons-httpclient-3.0.1.jar >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 7 -> >> > /home/mark/NetBeansProjects/FreeEed/lib/commons-logging-1.0.4.jar >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 8 -> >> > /home/mark/NetBeansProjects/FreeEed/lib/hadoop-core-0.20.2+737.jar >> > lr-x------ 1 mark mark 64 2011-08-30 10:58 9 -> >> > /home/mark/NetBeansProjects/FreeEed/lib/log4j-1.2.15.jar >> > mark@mark-desktop:/proc$ >> > > >
