I have kept the crawl command but notified the users that it is deprecated.
I have added the crawl script in section 3.3 [0]

The wiki looks a bit updated and I hope all the basic questions by Nutch
Users can be redirected to wiki pointers.

*Few things still need to be updated:*
1. How to choose Nutch parameters for optimal configuration
2. A full tutorial for Nutch 2 with Hbase. Notify users of current bugs
with MySql and others stores.

Please add here if someone feels any section is updated

[0] - http://wiki.apache.org/nutch/NutchTutorial


On Thu, Mar 21, 2013 at 3:43 AM, kiran chitturi
<[email protected]>wrote:

> Hi Feng, I have created a wiki page for (bin/crawl) thinking about this.
> Please feel free to edit any of the wiki's and update the documentation.
>
>
>
> [0] http://wiki.apache.org/nutch/bin/crawl
>
>
> On Thu, Mar 21, 2013 at 1:18 AM, feng lu <[email protected]> wrote:
>
>> <<
>> Second, for a user running Nutch on a single node or local mode the
>> default size of topN (50,000) makes the crawl run for a long time. Can we
>> make the topN parameter configurable through the script ?
>> >>
>>
>> May be i agree with Tejas that let user to modify the parameters below to
>> their needs. But we can add some detail information into the bin/crawl
>> wiki to tell users how to modify these parameters and what is the meaning
>> of these parameters.
>>
>>
>> On Thu, Mar 21, 2013 at 3:01 AM, kiran chitturi <
>> [email protected]> wrote:
>>
>>> Hi!
>>>
>>> I want to update the Nutch tutorials in the wiki with the crawl script
>>> (./bin/crawl). The presence of the crawl command in the tutorials makes
>>> users use these crawl command run in to issues which makes us suggest them
>>> use the crawl script instead of the command.
>>>
>>> Can we make it uniform all over wiki that crawl command is deprecated
>>> and it is recommended to use crawl script ?
>>>
>>> Second, for a user running Nutch on a single node or local mode the
>>> default size of topN (50,000) makes the crawl run for a long time. Can we
>>> make the topN parameter configurable through the script ?
>>>
>>> Thank you,
>>>
>>> --
>>> Kiran Chitturi
>>>
>>> <http://www.linkedin.com/in/kiranchitturi>
>>>
>>>
>>>
>>
>>
>> --
>> Don't Grow Old, Grow Up... :-)
>>
>
>
>
> --
> Kiran Chitturi
>
> <http://www.linkedin.com/in/kiranchitturi>
>
>
>


-- 
Kiran Chitturi

<http://www.linkedin.com/in/kiranchitturi>

Reply via email to