On Wed, Mar 20, 2013 at 12:46 PM, kiran chitturi <[email protected]> wrote:

> This is one way of updating the differences for commandLine options
> between 1.x and 2.x. Please check [0]
>
> We can maintain the difference between the 2 versions like this, but my
> question is about the top paragraph that details the fetcher: does it
> work the same way for 1.x and 2.x?
>
> If it is very different between 1.x and 2.x, we would be better off
> maintaining separate pages.
>

Maintaining a single page for a command, with info about both versions,
would be easier for users. Generally the overview of a command would be the
same across versions, so the upper portion would be common. When there are
considerable differences, the same page can highlight them. For
consistency, I would prefer to have a single page per command in such cases
too. Let's see what others have to say.

>
> [0] - http://wiki.apache.org/nutch/bin/nutch%20fetch#preview
>

+1. This is what I had in mind :)
For 2.x, the explanation of commands could be more verbose. It would take
time to get these changes into all the pages. Maybe over a period of time
we can achieve that.

Thanks Kiran for initiating this process!

>
> On Wed, Mar 20, 2013 at 3:09 PM, kiran chitturi <[email protected]> wrote:
>
>> Hi Tejas,
>>
>> +1 for keeping the pages separate for 1.x and 2.x.
>>
>> I was fixing only a few version issues and renaming page links in the
>> wiki until now. You brought up a good point that I have been intending
>> to ask the Nutch devs.
>>
>> I feel that 2.x should have its own page with its own set of links
>> regarding the architecture and everything. The home wiki page looks like
>> a mix of 1.x and 2.x, and it is easy to get confused by the parameters
>> and options in 1.x and 2.x.
>>
>> There are significant differences in the other commands too between 1.x
>> and 2.x, and I think we need to take up the task of remaking the whole
>> command line argument page, the table.
>>
>> The command line arguments page is quite important for users, as you
>> have mentioned, and I am up for keeping the pages separate for 1.x and
>> 2.x.
>>
>>
>> On Wed, Mar 20, 2013 at 2:52 PM, Tejas Patil <[email protected]> wrote:
>>
>>> Hi Kiran,
>>>
>>> The command line arguments to the fetch command shown on wiki page [2]
>>> don't seem to be in sync with what is implemented in [0] and [1].
>>>
>>> For 1.x [0]
>>> Usage: Fetcher <segment> [-threads n]
>>>
>>> For 2.x [1]
>>> Usage: FetcherJob (<batchId> | -all) [-crawlId <id>] [-threads N] [-resume] [-numTasks N]
>>>
>>> On wiki page [2]:
>>> Usage: bin/nutch fetch <segment> [-threads n] [-noParsing]
>>>
>>> I strongly feel that these params must be mentioned in the wiki page.
>>> Also, people on @user have been pondering over the differences between
>>> 1.x and 2.x. As the options are different for these versions, providing
>>> usage for both would make things easy for users. What say?
>>>
>>> There were a lot of updates to other wiki pages too, which might also
>>> need a similar change.
>>>
>>> [0] http://svn.apache.org/viewvc/nutch/trunk/src/java/org/apache/nutch/fetcher/Fetcher.java?view=markup
>>> [1] http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/fetcher/FetcherJob.java?view=markup
>>> [2] http://wiki.apache.org/nutch/bin/nutch%20fetch
>>>
>>> Thanks,
>>> Tejas
>>>
>>
>> --
>> Kiran Chitturi
>>
>> <http://www.linkedin.com/in/kiranchitturi>
>>
>
> --
> Kiran Chitturi
>
> <http://www.linkedin.com/in/kiranchitturi>
>

