Vadim Gritsenko wrote:
Andy Armstrong wrote:
I'm currently working on integrating full text indexing and searching and as part of that I need to extend the o.a.x.c.i.Indexer interface to include
public void startDocument(Key key) throws DBException; public void endDocument(Key key) throws DBException;
Can you explain a bit why these methods are needed, and why existing Indexer interface is not enough?
I'm assuming (perhaps incorrectly) that Indexer.add() can be called more than once for each document which doesn't sit happily with Lucene's API.
My current approach is to have a one-to-one relationship between Xindice documents and Lucene's concept of a document but it's just occured to me that I should maybe have a Lucene document for each chunk of text in a Xindice document.
Sorry; I seem to be thinking out loud now. Let me go and experiment a little.
-- Andy Armstrong, Tagish