Has anyone any experience of merging indices of content both from a
web-crawl and from a series of database queries? This would mean using
Nutch to index the web content and Lucene to index the database content and
then merging the two. The difficulty is that, as I understand it, the Nutch
index holds references to segment indices rather than holding content
itself. Or is there some way in which Nutch could be used to index the
results of database queries as well as a normal web crawl?
Any ideas or assistance gratefully received.
Kelvin
_________________________________________________________________
MSN Messenger 7.5 is now out. Download it for FREE here.
http://messenger.msn.co.uk