Hi Tejas,

+1 for keeping the pages separate for 1.x and 2.x.

I was fixing only few versions issues and renaming page links until now in
wiki. You brought up a good point, that I have been intending to ask the
Nutch devs.

I feel the 2.x should have its own page with its own set of links regarding
the architecture and everything. The home wiki page looks like a mix of 1.x
and 2.x and it is easy to get confused with parameters and options in 1.x
and 2.x.

There are significant differences in the other commands too in 1.x and 2.x
and I think we need to take up the task of remaking the whole command line
argument page, the table.

The command line arguments page is quite important for users as you have
mentioned and I am up for keeping the pages separate for 1.x and 2.x.


On Wed, Mar 20, 2013 at 2:52 PM, Tejas Patil <[email protected]>wrote:

> Hi Kiran,
>
> The command line arguments to the fetch command shown on wiki page [2]
> doesn't seem to be in sync with what is implemented in [0] and [1].
>
> For 1.x [0]
> Usage: Fetcher <segment> [-threads n]
>
> For 2.x [1]
> Usage: FetcherJob (<batchId> | -all) [-crawlId <id>] [-threads N]
> [-resume] [-numTasks N]
>
> On wiki page [2]:
> Usage: bin/nutch fetch <segment> [-threads n] [-noParsing]
>
> I strongly feel that these params must be mentioned in the wiki page.
> Also, people have been pondering over @user for the differences wrt 1.x and
> 2.x. As the options are different for both these versions, providing usage
> for both these versions would make things easy for users. What say ?
>
> There were lot of updates for other wiki pages too which might also need
> similar change.
>
> [0]
> http://svn.apache.org/viewvc/nutch/trunk/src/java/org/apache/nutch/fetcher/Fetcher.java?view=markup
> [1]
> http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/fetcher/FetcherJob.java?view=markup
> [2] http://wiki.apache.org/nutch/bin/nutch%20fetch
>
> Thanks,
> Tejas
>



-- 
Kiran Chitturi

<http://www.linkedin.com/in/kiranchitturi>

Reply via email to