Moved the old nutchhadooptutorial page from the Nutch wiki "Front page" to
"Archive and Legacy".

~tejas


On Wed, Jan 22, 2014 at 5:09 PM, Tejas Patil <[email protected]> wrote:

> Thanks *Julien* for pointing me to the new "NutchHadoopSingleNodeTutorial"
> wiki page [0]. I will soon remove the old nutchhadooptutorial page from
> the wiki.
>
> [0] : http://wiki.apache.org/nutch/NutchHadoopSingleNodeTutorial
>
> *@d_k*, there are already tutorials for running Nutch 2.x. See [1] and
> [2]. They are not as extensive as the tutorial for 1.x [3], but they cover
> the steps that differ for 2.x. The remaining steps after datastore setup
> are similar; the only difference is in the command parameters, which can
> be figured out from each command's usage message, so they were not
> duplicated in the 2.x tutorials to avoid maintenance overhead. Do you
> think the 2.x tutorials are inadequate in some respect?
>
> [1] : http://wiki.apache.org/nutch/Nutch2Tutorial
> [2] : http://wiki.apache.org/nutch/Nutch2Cassandra
> [3] : http://wiki.apache.org/nutch/NutchTutorial
>
> Thanks,
> Tejas
>
>
> On Wed, Jan 22, 2014 at 2:47 AM, d_k <[email protected]> wrote:
>
>> Actually, what I would like to see is a Nutch 2.x tutorial at the same
>> level of detail as http://wiki.apache.org/nutch/NutchHadoopTutorial.
>> What is the process for contributing to that wiki page?
>>
>>
>> On Tue, Jan 21, 2014 at 9:33 PM, Julien Nioche <
>> [email protected]> wrote:
>>
>>> Hi
>>>
>>> The whole thing has been replaced with
>>> http://wiki.apache.org/nutch/NutchHadoopSingleNodeTutorial which
>>> does exactly what you described. +1 to remove the old
>>> nutchhadooptutorial page
>>>
>>> J.
>>>
>>>
>>> On 21 January 2014 17:44, Tejas Patil <[email protected]> wrote:
>>>
>>>> Hi nutch-dev,
>>>>
>>>> I was looking at [0] and realized that, with the massive number of
>>>> Hadoop setup tutorials out there on the internet, we need not repeat the
>>>> same material on the Nutch wiki page and can instead assume that the user
>>>> has already set up Hadoop. For convenience, we could direct users to the
>>>> Hadoop wiki page, which has the Hadoop setup details.
>>>> In addition, I propose the following:
>>>>
>>>> - Section "Downloading Hadoop and Nutch": Remove the Hadoop portions
>>>> and let the Nutch stuff stay.
>>>> - Section "Setting Up The Deployment Architecture" must be removed.
>>>> - Sections "Deploy Nutch to Single Machine" and "Deploy Nutch to
>>>> Multiple Machines" can be merged together.
>>>> - Sections "Performing a Nutch Crawl", "Testing the Crawl" and
>>>> "Performing a Search" must be merged and their contents updated.
>>>> - Sections "Rsyncing Code to Slaves" and "Updates" can be completely
>>>> removed.
>>>>
>>>> Any comments?
>>>>
>>>> [0] : http://wiki.apache.org/nutch/NutchHadoopTutorial
>>>>
>>>> Thanks,
>>>> Tejas
>>>>
>>>
>>>
>>>
>>> --
>>>
>>> Open Source Solutions for Text Engineering
>>>
>>> http://digitalpebble.blogspot.com/
>>> http://www.digitalpebble.com
>>> http://twitter.com/digitalpebble
>>>
>>
>>
>
