Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change
notification.
The following page has been changed by NicolásLichtmaier:
http://wiki.apache.org/nutch/Upgrading_from_0%2e8%2ex_to_0%2e9
The comment on the change is:
Removed backticks, which are wrong here.
------------------------------------------------------------------------------
* if you ran your fetcher in parsing mode, or if you already parsed the
segments, then for each segment you need to first remove directories containing
parsed content:
{{{
- for i in `segments/2006*`
+ for i in segments/2006* ; do
- do
(cd $i && rm -rf crawl_parse parse_data parse_text)
done
}}}
* then you can use 'mergesegs' command, which was modified to perform the
conversion of remaining segment parts, e.g. like this:
{{{
mkdir converted
- for i in `segments/2006*`
+ for i in segments/2006* ; do
- do
nutch mergesegs converted $i
done
}}}
@@ -40, +38 @@
* finally, you will need to re-parse converted segments:
{{{
- for i in `converted/*`
+ for i in converted/* ; do
- do
nutch parse $i
done
}}}
-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-cvs mailing list
Nutch-cvs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-cvs