k.g.kumare san wrote:
Dear all
I am a research fellow who uses nutch for our domain.i
want detail explanation about nutch plugins .In my view the
explanation is not clear.Actually on which part i have to add my
language analyser .please describe in detail.can any one suggest any
books ,websites
I also need to know how to call / use my language
analyser and how to test whether nutch uses my language
analyser.(tamil south indian language , belongs to Asia
contienent).When the crawler crawls the tamil website ,the nutch must
calls my language analyser instead of calling its default analyser.i
need assistance for the above problem
with regards
Hi
pl refer the following link:
http://wiki.apache.org/nutch/WritingPluginExample
To do after writing plugin and compiling it:
- enlist u r new plugin in "plugin.includes" property present in
nutch-site.xml
- make sure that plugin's jar is present in directory specified by the
property "plugins.folders" in nutch-site.xml
- property "plugins.exclude" specifies rules for omission of plugins.
Make sure u r plugin is not excluded by these rules. (Generally this
property is not there, so no worries)
- If u want to specify the alias of plugin , u will have to make an
entry in parse-plugins.xml
To check whether, the plugin is getting loaded or not:
- pl have a look at org.apache.nutch.plugin.PluginRepository.java
class. run this class with appropriate arguments
- In the logs you should be able to see the name/s of folders in
which plugins are looked
- In the logs you should be able to see a detail list of
extension-points and registered plugins . U shud see ur plugin in here :)
Bye
--
This message has been scanned for viruses and
dangerous content and is believed to be clean.