Hi Radek,
CC'd [email protected]
Thanks for the quick response, I really appreciate it :)
Please see my responses inline.


On Tue, Jan 27, 2015 at 2:27 PM, Radoslaw Stankiewicz <[email protected]>
wrote:

>
> thanks for a comment. I’m glad you like it.
>

Yep, I love it :)


> Of course I can contribute my container - just tell me how I can do this.
>

OK, I have created two things to help drive this forward:

   - I've added a docker directory to the Nutch 2.X source code [0].
   Although it is currently empty, we will gather community contributions
   and make them available there.
   - I've created an issue on our Jira issue tracker [1] to make sure that
   we track the integration of the container into the Nutch codebase. I feel
   that if we can upgrade the Gora dependency to Gora 0.6 (which the Gora
   community is in the process of releasing) then we can upgrade your image
   to use Hadoop 2.X and HBase 0.98.X.

It would be excellent if you could check out a version of the Nutch 2.X
source code from [2], then create a patch file and submit it to the Jira
issue at [1]. I will then review and we can hopefully commit it to the
codebase with the aim of releasing Gora 0.6 and then working to upgrade the
container to the newer dependencies mentioned above.
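
As a rough illustration of the checkout-and-patch steps described above, here
is a minimal Python sketch that drives the svn command-line client. The helper
names, the working copy directory (nutch-2.x) and the patch file name
(NUTCH-1924.patch) are assumptions made for this example, not project
conventions. It also assumes that svn is on your PATH and that the container
files have already been copied under docker/ in the working copy.

```python
import subprocess

# Nutch 2.X branch URL, taken from reference [2].
NUTCH_2X_URL = "http://svn.apache.org/repos/asf/nutch/branches/2.x/"


def checkout_command(workdir="nutch-2.x"):
    """Command that checks out the Nutch 2.X branch into workdir (name assumed)."""
    return ["svn", "checkout", NUTCH_2X_URL, workdir]


def patch_commands(new_path="docker"):
    """Commands to run inside the working copy once the files are in place."""
    return [
        # --force recurses into already-versioned directories and schedules
        # any unversioned files found there for addition.
        ["svn", "add", "--force", new_path],
        # Prints a unified diff of all local changes; its stdout is the patch.
        ["svn", "diff"],
    ]


def write_patch(workdir="nutch-2.x", patch_file="NUTCH-1924.patch"):
    """Schedule the new files and write the diff to patch_file (name assumed)."""
    add, diff = patch_commands()
    subprocess.run(add, check=True, cwd=workdir)
    # The patch is written relative to the current directory, outside workdir,
    # so it does not end up inside the working copy.
    with open(patch_file, "w") as out:
        subprocess.run(diff, check=True, cwd=workdir, stdout=out)
    return patch_file
```

Typical use would be to run subprocess.run(checkout_command(), check=True),
copy the Dockerfile and related files into nutch-2.x/docker/, call
write_patch(), and then attach the resulting file to the Jira issue at [1].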

[0] http://svn.apache.org/repos/asf/nutch/branches/2.x/docker/

[1] https://issues.apache.org/jira/browse/NUTCH-1924
[2] http://svn.apache.org/repos/asf/nutch/branches/2.x/


>
> best regards,
> Radek Stankiewicz
>
> PS.
> I created this image because 2 months ago I had a project with
> Nutch 2.1 or later - I tried to find a working configuration with hadoop
> and a nosql database.
>

Unfortunately the gora-nosql module is now deprecated within Nutch 2.X.


> I tried to configure Nutch 2.X to work with latest hadoop (2.4 or 2.6) but
> I failed. I tried different versions of Gora, accumulo, hbase.
>

Yes, this can be very painful sometimes. The feedback the Nutch developers
got from ApacheCon EU in Budapest was that we needed a Docker container for
the various Nutch 2.X flavours.


> I only succeeded with the version you’ve commented on.
>

Fantastic, I am glad to hear that.


>
> Now I can see that 5 days ago there was a release and it may be possible to
> create a working docker image for hadoop 2.4, accumulo 1.5.1 and nutch 2.3.
>

Yes, I think that an Accumulo Docker image would be excellent as well.
Please feel free to sign up for Jira and create a child issue here [3].

[3] https://issues.apache.org/jira/browse/NUTCH-1900


> But the hbase version is quite old, major Hadoop distributions come with 0.98
> or later.
>

Yes, we can work on this once we have released Gora 0.6.
Thank you very much. It is great that you are willing to contribute to the
Nutch project and I am very much looking forward to everyone benefiting
from these containers.
Lewis


>
>
> Begin forwarded message:
>
>> On Tuesday, Jan 27, 2015 at 9:36 PM, [email protected] <
>> [email protected]>, wrote:
>>
>> Hi stankiewicz,
>>
>> We just wanted to let you know that someone left a comment on your
>> stankiewicz/hbase_hadoop_nutch repository on the docker registry.
>> The user lewismc wrote:
>>
>> "Hi @stankiewicz, I am Lewis McGibbney one of the Nutch PMC and
>> developers of the Nutch 2.X branch. I wonder if you would be interested in
>> formally contributing your container to the Nutch project?
>> Our 2.X roadmap has Docker containers as high priority
>> https://wiki.apache.org/nutch/Nutch2Roadmap
>> Please also see
>> https://issues.apache.org/jira/browse/NUTCH-1900
>> Thank you and great work on your container this looks excellent."
>
>
>


-- 
*Lewis*
