I think MIME-type filter is a fine method this, the only drawback is that you need to run the indexer twice.
Althouh a better solution would be to support JEXL expressions in IndexWriters and IndexerMapReduce to allow global filtering and per-IndexWriter filtering. This would not be very hard to patch in. -----Original message----- > From:Yossi Tamari <[email protected]> > Sent: Wednesday 23rd August 2017 19:40 > To: [email protected] > Subject: RE: Exchange documents in indexing job > > I don't see a good way to do it in configuration, but it should be very easy > to override the write method in the two plugins to have it check the mime > type and decide whether to call super.write or not. > (One terrible way to do it with configuration only would be to configure only > one of the indexers and use mimetype-filter to filter the matching type, and > then reconfigure for the other indexer and change mimetype-filter.txt to the > other mime type and index again...) > > -----Original Message----- > From: Roannel Fernández Hernández [mailto:[email protected]] > Sent: 23 August 2017 18:05 > To: [email protected] > Subject: Exchange documents in indexing job > > Hi folks: > > There is some way in Nutch to send some documents to a particular index > writer according to particular values of fields? > > I explain myself better. I have a document with a field called "mimetype" and > I want to send to Solr only the documents with value "text/plain" for this > field and send to RabbitMQ the documents with value "text/html". How can I do > that? > > Regards > > La @universidad_uci es Fidel. Los jóvenes no fallaremos. > #HastaSiempreComandante > #HastalaVictoriaSiempre > >

