nutch parsing is still problem on pdf files.
Only 1 pdf can be parsed successfully.

2013-01-11 08:11:23,679 WARN  parse.ParseUtil - Unable to successfully
parse content
http://localhost/sapi/nospasi_Akhirat_Lebih_Utama_Daripada_Dunia.pdf of
type application/pdf

Even I had added on parse-plugins.xml explicitly:

    <mimeType name="application/pdf">
      <plugin id="parse-tika" />
    </mimeType>

What the missed things?

On Fri, Jan 11, 2013 at 7:55 AM, Lewis John Mcgibbney <
[email protected]> wrote:

> No problem at all.
>
> Better safe than sorry.
>
> Lewis
>
> On Thu, Jan 10, 2013 at 4:43 PM, Bayu Widyasanyata
> <[email protected]>wrote:
>
> > Yes, I forgot that things even I already put on my notes on previous
> > installation.
> > I'm quite new on nutch and also Java developments :)
> >
> > Thanks!
> >
> > On Fri, Jan 11, 2013 at 7:01 AM, Lewis John Mcgibbney <
> > [email protected]> wrote:
> >
> > > Hi,
> > >
> > > java.io.IOException: java.lang.ClassNotFoundException:
> > > > com.mysql.jdbc.Driver
> > > >
> > >
> > > If you look at ivy.xml [0] you will see that the mysql-connector-java
> > > dependency is commented out. Please uncomment it, then build Nutch 2.x
> > src
> > > again.
> > >
> > > This will download the dependency and make it available on your
> > classpath.
> > >
> > > Thank you
> > >
> > > Lewis
> > >
> > > [0]
> > >
> http://svn.apache.org/viewvc/nutch/branches/2.x/ivy/ivy.xml?view=markup
> > >
> >
> >
> >
> > --
> > wassalam,
> > [bayu]
> >
>
>
>
> --
> *Lewis*
>



-- 
wassalam,
[bayu]

Reply via email to