Thx for the quick response,
Well i wrote a very simple plugin that tryes to the the same "doc" twice and
if there is and error
then put it in the orniginal doc custom field:
public NutchDocument filter(NutchDocument doc, Parse parse, Text url,
CrawlDatum datum, Inlinks inlinks) throws Ind
Hi there,
Im new to the forum and nutch as well...
I wrote a plugin to nutch that implements the IndexingFilter...
Now i want to add a new Document to the index from the plugin (split the
current doc)
I tryed testing it from something like this
NutchIndexWriter[] Writers =
NutchIndexWriterFactory