I can't see any log output. Can you fetch and parse the pdfs with the parsechecker tool?
On Thursday, January 10, 2013, Bayu Widyasanyata <[email protected]> wrote: > For clarity, the log below is the about 4 of 5 my PDF docs that can't be > parsed by nutch. > > On Fri, Jan 11, 2013 at 8:29 AM, Bayu Widyasanyata > <[email protected]>wrote: > >> nutch parsing is still problem on pdf files. >> Only 1 pdf can be parsed successfully. >> >> 2013-01-11 08:11:23,679 WARN parse.ParseUtil - Unable to successfully >> parse content >> http://localhost/sapi/nospasi_Akhirat_Lebih_Utama_Daripada_Dunia.pdf of >> type application/pdf >> >> Even I had added on parse-plugins.xml explicitly: >> >> <mimeType name="application/pdf"> >> <plugin id="parse-tika" /> >> </mimeType> >> >> What the missed things? >> >> On Fri, Jan 11, 2013 at 7:55 AM, Lewis John Mcgibbney < >> [email protected]> wrote: >> >>> No problem at all. >>> >>> Better safe than sorry. >>> >>> Lewis >>> >>> On Thu, Jan 10, 2013 at 4:43 PM, Bayu Widyasanyata >>> <[email protected]>wrote: >>> >>> > Yes, I forgot that things even I already put on my notes on previous >>> > installation. >>> > I'm quite new on nutch and also Java developments :) >>> > >>> > Thanks! >>> > >>> > On Fri, Jan 11, 2013 at 7:01 AM, Lewis John Mcgibbney < >>> > [email protected]> wrote: >>> > >>> > > Hi, >>> > > >>> > > java.io.IOException: java.lang.ClassNotFoundException: >>> > > > com.mysql.jdbc.Driver >>> > > > >>> > > >>> > > If you look at ivy.xml [0] you will see that the mysql-connector-java >>> > > dependency is commented out. Please uncomment it, then build Nutch 2.x >>> > src >>> > > again. >>> > > >>> > > This will download the dependency and make it available on your >>> > classpath. >>> > > >>> > > Thank you >>> > > >>> > > Lewis >>> > > >>> > > [0] >>> > > >>> http://svn.apache.org/viewvc/nutch/branches/2.x/ivy/ivy.xml?view=markup >>> > > >>> > >>> > >>> > >>> > -- >>> > wassalam, >>> > [bayu] >>> > >>> >>> >>> >>> -- >>> *Lewis* >>> >> >> >> >> -- >> wassalam, >> [bayu] > > > > > -- > wassalam, > [bayu] > -- *Lewis*

