Hi Feng, I have created a wiki page for (bin/crawl) thinking about this.
Please feel free to edit any of the wiki's and update the documentation.
[0] http://wiki.apache.org/nutch/bin/crawl
On Thu, Mar 21, 2013 at 1:18 AM, feng lu amuseme...@gmail.com wrote:
Second, for a user running
[
https://issues.apache.org/jira/browse/NUTCH-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13609430#comment-13609430
]
kiran commented on NUTCH-1406:
--
Hi Kristof,
Are there any updates or test for this patch ?
Dear Wiki user,
You have subscribed to a wiki page or wiki category on Nutch Wiki for change
notification.
The RunNutchInEclipse page has been changed by kiranchitturi:
http://wiki.apache.org/nutch/RunNutchInEclipse?action=diffrev1=38rev2=39
* Ubuntu Release 11.04 (natty)
. Kernel Linux
Dear Wiki user,
You have subscribed to a wiki page or wiki category on Nutch Wiki for change
notification.
The IndexMetatags page has been changed by kiranchitturi:
http://wiki.apache.org/nutch/IndexMetatags?action=diffrev1=3rev2=4
= Nutch - Parse Metatags =
'''Summary:''' When crawling
Dear Wiki user,
You have subscribed to a wiki page or wiki category on Nutch Wiki for change
notification.
The NutchTutorial page has been changed by kiranchitturi:
http://wiki.apache.org/nutch/NutchTutorial?action=diffrev1=61rev2=62
This will include any URL in the domain
I have kept the crawl command but notified the users that it is deprecated.
I have added the crawl script in section 3.3 [0]
The wiki looks a bit updated and I hope all the basic questions by Nutch
Users can be redirected to wiki pointers.
*Few things still need to be updated:*
1. How to choose
6 matches
Mail list logo