After Nutch 2.2 with gora 0.3 mysql will not be more supported? I want mean that crawl doesn't parsing many urls and I don't know why.
Weder On Thu, Jun 13, 2013 at 4:49 PM, Tejas Patil <[email protected]>wrote: > On Thu, Jun 13, 2013 at 12:41 PM, Weder Carlos Vieira < > [email protected]> wrote: > > > Hello everyone! > > > > This is my first mail here. > > > Welcome !! > > > > > I want to know more and more about nutch and share what a find out by > > myself with you. Thanks if someone can help me too. > > > > I was trying to use nutch. First I setup and test nutch 2.1 and its works > > fine, but many of crawled urls was saved on MySQL with null value, just > few > > url with status=2. I don't understand that but I go on... > > > > Next I try to setup and use (test) Nutch 2.2, in this case when I start > > command to initiate crawl I get this error below. > > > > Exception in thread "main" java.lang.ClassNotFoundException: > > org.apache.gora.sql.store.SqlStore > > at java.net.URLClassLoader$1.run(URLClassLoader.java:366) ..... > > > > The 'SqlStore' class was removed from nutch 2.2? because on Nutch 2.1 > this > > error doesn't appear. > > > Yes. Nutch 2.2 uses Apache Gora 0.3 which has deprecated their support for > mySQL as a data store. Follow https://wiki.apache.org/nutch/Nutch2Tutorial > > > > > > ----------------------------- > > > > In the other hand I want to ask a second question, How can I improve > > configuration of Nutch 2.1 (that Works fine) to fetch more and more url > > without 'null values'. > > > What do you mean by "w/o null values" ? > > > > > > > > > Thanks a lot. > > > > Weder > > >

