Hi Tom,

All you need to do is ensure that the gora-hbase dependency is uncommented in $NUTCH_HOME/ivy/ivy.xml
https://github.com/apache/nutch/blob/2.x/ivy/ivy.xml#L116
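
For reference, once uncommented, the entry in ivy.xml would look something like the fragment below. Treat it as a sketch rather than a copy from your checkout: the rev value (0.6.1) is an assumption based on the Gora version that appears in the build log further down this thread, so keep whatever revision your ivy.xml already lists.

```xml
<!-- ivy/ivy.xml: HBase backend for Gora. Remove the surrounding comment
     markers so that Ivy resolves it at build time.
     rev="0.6.1" is an assumption; keep the revision your file already has. -->
<dependency org="org.apache.gora" name="gora-hbase" rev="0.6.1" conf="*->default" />
```

After changing ivy.xml you would normally rebuild Nutch (for example with "ant runtime") so that the dependency is actually pulled in.
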
You then need to ensure that storage.data.store.class is set correctly in
$NUTCH_HOME/conf/nutch-default.xml. It needs to be set to
'org.apache.gora.hbase.store.HBaseStore'
https://github.com/apache/nutch/blob/2.x/conf/nutch-default.xml#L1333-L1371

Finally, you need to configure $NUTCH_HOME/conf/gora.properties
https://github.com/apache/nutch/blob/2.x/conf/gora.properties
Make sure that the correct gora-hbase configuration is included.

That is all you need to do.
Lewis

On Fri, Feb 19, 2016 at 10:29 PM, Tom Running <runningt...@gmail.com> wrote:

> Furkan,
>
> What you mentioned is exactly what I am trying to accomplish.
>
> Using Nutch to crawl websites and storing them at Hbase and indexing at
> Solr via Gora?
>
> I need a bit more help to ensure what I am about to do is correct.
>
> #1.
> After successfully building GORA, I have the following two .jar files in
> the /gora/gora-solr/lib/ directory. There are lots of .jar files in the
> /lib directory, but only two .jar files related to Solr:
> solr-solrj-4.10.3.jar
> solr-core-4.10.3.jar
>
> #2.
> In the Solr source distribution directory I also see the exact same
> .jar files. This is a source code download. I have not built this Solr
> yet.
>
> /home/solr/dist
> solr-solrj-4.10.3.jar
> solr-core-4.10.3.jar
> solr-4.10.3.war
>
> My question is: should I copy the two Solr files in #1 to
> /home/solr/dist/ and then build Solr?
>
> #3.
> Should I also do the same thing for HBase? Copy
> /gora/gora-hbase/lib/hbase-* into /hbase/lib/ and then build HBase?
>
> Thank you.
> Tom
>
> On Wed, Feb 17, 2016 at 5:31 PM, Furkan KAMACI <furkankam...@gmail.com>
> wrote:
>
>> Hi Tom,
>>
>> What is your aim? Using Nutch to crawl websites and storing them at Hbase
>> and indexing at Solr via Gora? Do you have any other use cases?
>>
>> Put simply, you can think of Gora as the Hibernate of the NoSQL
>> ecosystem in your use case. So it will not run as a service; it will be a
>> dependency.
>>
>> Kind Regards,
>> Furkan KAMACI
>> On 17 Feb 2016 22:13, "Lewis John Mcgibbney" <lewis.mcgibb...@gmail.com>
>> wrote:
>>
>>> Hi Tom,
>>> You can just follow this tutorial
>>> http://wiki.apache.org/nutch/Nutch2Tutorial
>>> replacing the gora-hbase configuration in your Nutch
>>> conf/nutch-default.xml and conf/gora.properties, and the relevant
>>> dependency in ivy/ivy.xml, with the gora-solr equivalent.
>>> If you have any more issues, please let us know. Gora does not run as a
>>> service, no; it is a dependency and is managed through your client
>>> dependency manager (which in Nutch 2.X is Ivy).
>>> Thanks
>>>
>>> On Wed, Feb 17, 2016 at 12:04 PM, Tom Running <runningt...@gmail.com>
>>> wrote:
>>>
>>>> Furkan and Lewis,
>>>>
>>>> Thank you for your response to my SOS. I tried various suggestions,
>>>> including editing the pom.xml file and downgrading the Java JDK to
>>>> version 1.7. I then removed the .m2 folder and ran mvn clean install
>>>> again, and it built successfully.
>>>>
>>>> Now that Gora is successfully built, I am trying to understand how to
>>>> run or start Gora in order to get the following three packages to work
>>>> together with GORA: Nutch, Solr and Hbase.
>>>> Does Gora start as a service?
>>>> Or,
>>>> to get the three packages to work with GORA, will I need to copy the
>>>> *.jar files into the lib folder of each package (Nutch, Solr and Hbase)?
>>>>
>>>> *I am a bit confused about how to get these packages to work with GORA.
>>>> I have read GORA's quickstart guide but am still not too clear on what
>>>> to do.*
>>>>
>>>> *Can you provide some direction?*
>>>>
>>>> *Thank you.*
>>>>
>>>> *Tom*
>>>>
>>>> On Wed, Feb 17, 2016 at 1:56 PM, Furkan KAMACI <furkankam...@gmail.com>
>>>> wrote:
>>>>
>>>>> Hi Tom,
>>>>>
>>>>> It seems that your Maven is in offline mode. There may be a problem
>>>>> with your settings.xml or with the environment variable for the Maven
>>>>> home. How do you build your project? Could you build it with the -X
>>>>> option and send the output?
>>>>>
>>>>> Kind Regards,
>>>>> Furkan KAMACI
>>>>> On 17 Feb 2016 20:51, "Tom Running" <runningt...@gmail.com> wrote:
>>>>>
>>>>> What should I do about the error below?
>>>>>
>>>>> [INFO] Building Apache Gora :: Accumulo 0.6.1
>>>>> [INFO] ------------------------------------------------------------------------
>>>>> [WARNING] The POM for org.apache.accumulo:accumulo-core:jar:1.5.1 is missing, no dependency information available
>>>>> [WARNING] The POM for org.apache.accumulo:accumulo-minicluster:jar:1.5.1 is missing, no dependency information available
>>>>> [WARNING] The POM for org.jboss.netty:netty:jar:3.2.2.Final is missing, no dependency information available
>>>>> [INFO] ------------------------------------------------------------------------
>>>>> [INFO] Reactor Summary:
>>>>> [INFO]
>>>>> [INFO] Apache Gora ........................................ SUCCESS [ 1.468 s]
>>>>> [INFO] Apache Gora :: Compiler ............................ SUCCESS [ 0.121 s]
>>>>> [INFO] Apache Gora :: Compiler-CLI ........................ SUCCESS [ 0.032 s]
>>>>> [INFO] Apache Gora :: Shims Hadoop ........................ SUCCESS [ 0.543 s]
>>>>> [INFO] Apache Gora :: Shims Hadoop 1.x .................... SUCCESS [ 0.190 s]
>>>>> [INFO] Apache Gora :: Shims Hadoop 2.x .................... SUCCESS [ 0.295 s]
>>>>> [INFO] Apache Gora :: Shims Distribution .................. SUCCESS [ 0.026 s]
>>>>> [INFO] Apache Gora :: Core ................................ SUCCESS [ 0.806 s]
>>>>> [INFO] Apache Gora :: Accumulo ............................ FAILURE [ 0.120 s]
>>>>> [INFO] Apache Gora :: Cassandra ........................... SKIPPED
>>>>> [INFO] Apache Gora :: GoraCI .............................. SKIPPED
>>>>> [INFO] Apache Gora :: HBase ............................... SKIPPED
>>>>> [INFO] Apache Gora :: MongoDB ............................. SKIPPED
>>>>> [INFO] Apache Gora :: Solr ................................ SKIPPED
>>>>> [INFO] Apache Gora :: Tutorial ............................ SKIPPED
>>>>> [INFO] Apache Gora :: Sources-Dist ........................ SKIPPED
>>>>> [INFO] ------------------------------------------------------------------------
>>>>> [INFO] BUILD FAILURE
>>>>> [INFO] ------------------------------------------------------------------------
>>>>> [INFO] Total time: 6.359 s
>>>>> [INFO] Finished at: 2016-02-17T02:00:39-05:00
>>>>> [INFO] Final Memory: 25M/61M
>>>>> [INFO] ------------------------------------------------------------------------
>>>>> [ERROR] Failed to execute goal on project gora-accumulo: Could not
>>>>> resolve dependencies for project
>>>>> org.apache.gora:gora-accumulo:bundle:0.6.1: The following artifacts could
>>>>> not be resolved: org.apache.gora:gora-core:jar:0.6.1,
>>>>> org.apache.gora:gora-core:jar:tests:0.6.1,
>>>>> org.apache.accumulo:accumulo-core:jar:1.5.1,
>>>>> org.apache.accumulo:accumulo-minicluster:jar:1.5.1, jline:jline:jar:0.9.1,
>>>>> org.jboss.netty:netty:jar:3.2.2.Final,
>>>>> org.codehaus.jackson:jackson-jaxrs:jar:1.8.3,
>>>>> org.codehaus.jackson:jackson-xc:jar:1.8.3: Cannot access central
>>>>> (https://repo.maven.apache.org/maven2) in offline mode and the
>>>>> artifact org.apache.gora:gora-core:jar:0.6.1 has not been downloaded from
>>>>> it before. -> [Help 1]
>>>>> [ERROR]
>>>>>
>>>
>>> --
>>> *Lewis*
>>>

--
*Lewis*