On Sat, Aug 1, 2009 at 6:56 AM, Erik Hatcher<e...@ehatchersolutions.com> wrote: > Shouldn't DIH, I presume in either SolrWriter or DataImportHandler, call > processor.finish()? soon after commit DIH should call finish. > > Maybe DataImportHandler should subclass ContentStreamHandlerBase, which > calls #finish already. This would mean we implement a new > ContentStreamLoader. This would allow DIH to hand the streams off as either > data sources or data to entities, right? This is where we want to head with > Tika integration into DIH, methinks. If you wish to handle 'push' data DIH already has a ContentStreamDataSource. I guess Tika Integration would be easy with that > > Thoughts? > > Erik > >
-- ----------------------------------------------------- Noble Paul | Principal Engineer| AOL | http://aol.com