nutch-site.xml is the only config file you should touch, by copying
the appropriate section from nutch-default.xml and customizing it.
Yes, you will need to write a custom plugin like the creativecommons
one.
Erik
On Aug 10, 2005, at 2:44 PM, Fuad Efendi wrote:
I probably need to work with plugins, and to modify config files... I
need to add additional field to Document, and to show it on a web-page
nutch-conf.xsl
nutch-default.xml
nutch-site.xml
Am I right?
Thanks
-----Original Message-----
From: Fuad Efendi [mailto:[EMAIL PROTECTED]
Sent: Wednesday, August 10, 2005 2:15 PM
To: [email protected]
Subject: RE: [Nutch-general] How to extend Nutch
So, I need to modify some existing classes, isn't it?
-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
Sent: Wednesday, August 10, 2005 1:48 PM
To: [EMAIL PROTECTED]
Subject: Re: [Nutch-general] How to extend Nutch
Probably IndexingFilter or HtmlParser for indexing and for indexing I
think there is something in org.apache.nutch.search.... some class
that
starts with Raw.... I just saw this in the Javadoc earlier.
Otis
--- Fuad Efendi <[EMAIL PROTECTED]> wrote:
I need specific pre-processing of a html-page, to add more fields to
Document before storing it in Index, and to modify web-interface
accordingly.
Where is the base point of extension?
Thanks!
-------------------------------------------------------
SF.Net email is Sponsored by the Better Software Conference & EXPO
September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices
Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA
Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general