Hi Tom,
All you need to do is ensure that the gora-hbase dependency is uncommented
within $NUTCH_HOME/ivy/ivy.xml
https://github.com/apache/nutch/blob/2.x/ivy/ivy.xml#L116
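As a rough sketch, the uncommented entry would look something like the
following. The rev value is an assumption chosen to match the Gora 0.6.1
build mentioned further down this thread; keep whatever version your own
ivy.xml already lists.

```xml
<!-- $NUTCH_HOME/ivy/ivy.xml, inside the <dependencies> element -->
<!-- rev is an assumption; keep the value already present in your checkout -->
<dependency org="org.apache.gora" name="gora-hbase" rev="0.6.1" conf="*->default" />
```

After changing ivy.xml, rebuild Nutch (for example with ant runtime) so that
Ivy resolves the newly enabled dependency.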

You then need to ensure that the storage.data.store.class property is correct
in $NUTCH_HOME/conf/nutch-default.xml. This needs to be set to
'org.apache.gora.hbase.store.HBaseStore'
https://github.com/apache/nutch/blob/2.x/conf/nutch-default.xml#L1333-L1371
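For illustration, the property would look roughly like this. Many setups
place the override in conf/nutch-site.xml rather than editing
nutch-default.xml directly; that file choice is an assumption, not something
stated in this thread.

```xml
<!-- goes inside the <configuration> element -->
<property>
  <name>storage.data.store.class</name>
  <value>org.apache.gora.hbase.store.HBaseStore</value>
</property>
```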

Finally, you need to configure $NUTCH_HOME/conf/gora.properties
https://github.com/apache/nutch/blob/2.x/conf/gora.properties
Make sure that the correct gora-hbase configuration is included.

That is all you need to do.
Lewis

On Fri, Feb 19, 2016 at 10:29 PM, Tom Running <runningt...@gmail.com> wrote:

> Furkan,
>
> What you mentioned is exactly what I am trying to accomplish.
> > Using Nutch to crawl websites and storing them at Hbase and indexing at
> Solr via Gora?
>
>
> I need a bit more help to make sure what I am about to do is correct.
>
> #1.
> After successfully building Gora, I have the following two .jar files in
> the /gora/gora-solr/lib/ directory. There are a lot of .jar files in the
> /lib directory, but only two related to Solr:
> solr-solrj-4.10.3.jar
> solr-core-4.10.3.jar
>
>
> #2.
> In the Solr source distribution directory I also see the exact same
> .jar files. This is a source code download. I have not built this Solr
> yet.
>
> /home/solr/dist
> solr-solrj-4.10.3.jar
> solr-core-4.10.3.jar
> solr-4.10.3.war
>
>
> My question is: should I copy the two Solr files in #1 to
> /home/solr/dist/ and then build Solr?
>
>
> #3.
> Should I also do the same thing for HBase: copy
> /gora/gora-hbase/lib/hbase-* into /hbase/lib/ and then build HBase?
>
>
>
> Thank you.
> Tom
>
> On Wed, Feb 17, 2016 at 5:31 PM, Furkan KAMACI <furkankam...@gmail.com>
> wrote:
>
>> Hi Tom,
>>
>> What are you aiming for? Using Nutch to crawl websites and storing them at Hbase
>> and indexing at Solr via Gora? Do you have any other use cases?
>>
>> Put simply, you can think of Gora as the Hibernate of the NoSQL
>> ecosystem for your use case. It does not run as a service; it is a
>> dependency.
>>
>> Kind Regards,
>> Furkan KAMACI
>> On 17 Feb 2016 at 22:13, "Lewis John Mcgibbney" <
>> lewis.mcgibb...@gmail.com> wrote:
>>
>> Hi Tom,
>>> You can just follow this tutorial
>>> http://wiki.apache.org/nutch/Nutch2Tutorial
>>> replacing the gora-hbase configuration in your Nutch
>>> conf/nutch-default.xml and conf/gora.properties, and the relevant
>>> dependency in ivy/ivy.xml, with the gora-solr equivalent.
>>> If you have any more issues, please let us know. No, Gora does not run
>>> as a service; it is a dependency and is managed through your client
>>> dependency manager (which in Nutch 2.X is Ivy).
>>> Thanks
>>>
>>> On Wed, Feb 17, 2016 at 12:04 PM, Tom Running <runningt...@gmail.com>
>>> wrote:
>>>
>>>> Furkan and Lewis,
>>>>
>>>> Thank you for your response to my SOS.  I tried various suggestions,
>>>> including editing the pom.xml file, downgrading the Java JDK to version
>>>> 1.7, and removing the .m2 folder, then ran mvn clean install again, and
>>>> it built successfully.
>>>>
>>>> Now Gora is successfully built.  I am trying to understand how to get
>>>> Gora to run or start so that the following three packages work together
>>>> with Gora: Nutch, Solr and HBase.
>>>> Does Gora start as a service?
>>>> Or,
>>>> to get the other three packages to work with Gora, will I need to copy
>>>> the *.jar files into the lib folder of each one (Nutch, Solr and HBase)?
>>>>
>>>>
>>>> *I am a bit confused about how to get these packages to work with Gora.  I
>>>> have read Gora's quickstart guide but am still not too clear on what to do.*
>>>>
>>>>
>>>> *Can you provide some direction?*
>>>>
>>>> *Thank you.*
>>>>
>>>> *Tom*
>>>>
>>>> On Wed, Feb 17, 2016 at 1:56 PM, Furkan KAMACI <furkankam...@gmail.com>
>>>> wrote:
>>>>
>>>>> Hi Tom,
>>>>>
>>>>> It seems that your Maven is in offline mode. There may be a problem
>>>>> with your settings.xml or the environment variable for the Maven home.
>>>>> How do you build your project? Could you build it with the -X option
>>>>> and send the output?
>>>>>
>>>>> Kind Regards,
>>>>> Furkan KAMACI
>>>>> On 17 Feb 2016 at 20:51, "Tom Running" <runningt...@gmail.com>
>>>>> wrote:
>>>>>
>>>>> What should I do about the error below?
>>>>>
>>>>>
>>>>> [INFO] Building Apache Gora :: Accumulo 0.6.1
>>>>> [INFO]
>>>>> ------------------------------------------------------------------------
>>>>> [WARNING] The POM for org.apache.accumulo:accumulo-core:jar:1.5.1 is
>>>>> missing, no dependency information available
>>>>> [WARNING] The POM for
>>>>> org.apache.accumulo:accumulo-minicluster:jar:1.5.1 is missing, no
>>>>> dependency information available
>>>>> [WARNING] The POM for org.jboss.netty:netty:jar:3.2.2.Final is
>>>>> missing, no dependency information available
>>>>> [INFO]
>>>>> ------------------------------------------------------------------------
>>>>> [INFO] Reactor Summary:
>>>>> [INFO]
>>>>> [INFO] Apache Gora ........................................ SUCCESS [ 1.468 s]
>>>>> [INFO] Apache Gora :: Compiler ............................ SUCCESS [ 0.121 s]
>>>>> [INFO] Apache Gora :: Compiler-CLI ........................ SUCCESS [ 0.032 s]
>>>>> [INFO] Apache Gora :: Shims Hadoop ........................ SUCCESS [ 0.543 s]
>>>>> [INFO] Apache Gora :: Shims Hadoop 1.x .................... SUCCESS [ 0.190 s]
>>>>> [INFO] Apache Gora :: Shims Hadoop 2.x .................... SUCCESS [ 0.295 s]
>>>>> [INFO] Apache Gora :: Shims Distribution .................. SUCCESS [ 0.026 s]
>>>>> [INFO] Apache Gora :: Core ................................ SUCCESS [ 0.806 s]
>>>>> [INFO] Apache Gora :: Accumulo ............................ FAILURE [ 0.120 s]
>>>>> [INFO] Apache Gora :: Cassandra ........................... SKIPPED
>>>>> [INFO] Apache Gora :: GoraCI .............................. SKIPPED
>>>>> [INFO] Apache Gora :: HBase ............................... SKIPPED
>>>>> [INFO] Apache Gora :: MongoDB ............................. SKIPPED
>>>>> [INFO] Apache Gora :: Solr ................................ SKIPPED
>>>>> [INFO] Apache Gora :: Tutorial ............................ SKIPPED
>>>>> [INFO] Apache Gora :: Sources-Dist ........................ SKIPPED
>>>>> [INFO]
>>>>> ------------------------------------------------------------------------
>>>>> [INFO] BUILD FAILURE
>>>>> [INFO]
>>>>> ------------------------------------------------------------------------
>>>>> [INFO] Total time: 6.359 s
>>>>> [INFO] Finished at: 2016-02-17T02:00:39-05:00
>>>>> [INFO] Final Memory: 25M/61M
>>>>> [INFO]
>>>>> ------------------------------------------------------------------------
>>>>> [ERROR] Failed to execute goal on project gora-accumulo: Could not
>>>>> resolve dependencies for project
>>>>> org.apache.gora:gora-accumulo:bundle:0.6.1: The following artifacts could
>>>>> not be resolved: org.apache.gora:gora-core:jar:0.6.1,
>>>>> org.apache.gora:gora-core:jar:tests:0.6.1,
>>>>> org.apache.accumulo:accumulo-core:jar:1.5.1,
>>>>> org.apache.accumulo:accumulo-minicluster:jar:1.5.1, jline:jline:jar:0.9.1,
>>>>> org.jboss.netty:netty:jar:3.2.2.Final,
>>>>> org.codehaus.jackson:jackson-jaxrs:jar:1.8.3,
>>>>> org.codehaus.jackson:jackson-xc:jar:1.8.3: Cannot access central (
>>>>> https://repo.maven.apache.org/maven2) in offline mode and the
>>>>> artifact org.apache.gora:gora-core:jar:0.6.1 has not been downloaded from
>>>>> it before. -> [Help 1]
>>>>> [ERROR]
>>>>>
>>>>>
>>>>
>>>
>>>
>>> --
>>> *Lewis*
>>>
>>
>


-- 
*Lewis*
