Hi - I've not yet committed a fix for: https://issues.apache.org/jira/browse/NUTCH-1449
This will allow you to stop documents from being indexed from within your indexing filter. The order in which indexing filters run can be configured with the indexingfilter.order property (see nutch-default.xml).

-----Original message-----
> From: Sourajit Basak <[email protected]>
> Sent: Wed 23-Jan-2013 09:24
> To: [email protected]
> Subject: conditional indexing
>
> We have an implementation of Indexing filter that runs side-by-side the
> indexer-basic plugin. How is the order determined ?
> Also, how do I do conditional indexing i.e. stop certain urls from being
> indexed ? I think I can apply a filter but that approach will not work
> since we index based on the page contents.
>
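
On the ordering question, here is a rough sketch of what the override could look like in conf/nutch-site.xml. It assumes the property is named indexingfilter.order as in nutch-default.xml, and com.example.MyIndexingFilter is a hypothetical class name standing in for your own filter. As far as I know, filters are applied in the order listed, and when the property is non-empty only the filters named in it are applied, so please check the description in nutch-default.xml for your version.

```xml
<!-- Sketch only: goes inside <configuration> in conf/nutch-site.xml -->
<property>
  <name>indexingfilter.order</name>
  <!-- Space-separated filter class names, applied left to right.
       BasicIndexingFilter is the class behind the basic indexing plugin;
       com.example.MyIndexingFilter is a placeholder for your own filter. -->
  <value>org.apache.nutch.indexer.basic.BasicIndexingFilter com.example.MyIndexingFilter</value>
</property>
```

Listing your filter after the basic one means it sees the document with the basic fields already added, which is probably what you want if the decision to index depends on the page contents.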

