On the implementation side, take a look at the index-basic and query-basic plugins to see how indexing and querying are done. If you would like to add more metadata for the documents being indexed, you can extend these or write separate plugins that each add specific pieces of metadata. On the query side, you might want to customize search.jsp to add additional search boxes if you want to let users query specific metadata fields.
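To make the idea concrete, here is a rough, self-contained sketch of the two halves: an extra indexing step that adds a newsgroup field, and a query step that routes "field:value" terms to that field. Everything in it is an assumption for illustration only. The class MetadataFieldSketch, its methods, the field names "newsgroup" and "content", and the map-based document are hypothetical stand-ins, not the actual Nutch or Lucene classes, so check index-basic and query-basic for the real interfaces.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

/** Hypothetical stand-ins only; none of this is the Nutch or Lucene API. */
class MetadataFieldSketch {

    /** Append a value to a multi-valued field (field name -> list of values). */
    static void addField(Map<String, List<String>> doc, String name, String value) {
        doc.computeIfAbsent(name, k -> new ArrayList<>()).add(value);
    }

    /** Baseline indexing step: the fields every protocol can supply. */
    static Map<String, List<String>> indexBasic(String url, String content) {
        Map<String, List<String>> doc = new LinkedHashMap<>();
        addField(doc, "url", url);
        for (String token : content.toLowerCase(Locale.ROOT).split("\\W+")) {
            if (!token.isEmpty()) {
                addField(doc, "content", token);
            }
        }
        return doc;
    }

    /** Extra, protocol-specific step: one "newsgroup" value per listed group. */
    static void addNewsgroupFields(Map<String, List<String>> doc, Map<String, String> headers) {
        String header = headers.get("Newsgroups");
        if (header == null) {
            return; // not an NNTP document, so nothing is added
        }
        for (String group : header.split(",")) {
            String name = group.trim().toLowerCase(Locale.ROOT);
            if (!name.isEmpty()) {
                addField(doc, "newsgroup", name);
            }
        }
    }

    /** Query side: "field:value" terms target that field, bare terms the default field. */
    static Map<String, List<String>> parseQuery(String query, String defaultField) {
        Map<String, List<String>> clauses = new LinkedHashMap<>();
        for (String term : query.trim().toLowerCase(Locale.ROOT).split("\\s+")) {
            if (term.isEmpty()) {
                continue;
            }
            int colon = term.indexOf(':');
            if (colon > 0 && colon < term.length() - 1) {
                addField(clauses, term.substring(0, colon), term.substring(colon + 1));
            } else {
                addField(clauses, defaultField, term);
            }
        }
        return clauses;
    }

    /** A document matches when every clause value is present in the named field. */
    static boolean matches(Map<String, List<String>> doc, Map<String, List<String>> clauses) {
        for (Map.Entry<String, List<String>> clause : clauses.entrySet()) {
            List<String> indexed = doc.get(clause.getKey());
            if (indexed == null || !indexed.containsAll(clause.getValue())) {
                return false;
            }
        }
        return true;
    }
}
```

In this sketch, documents from other protocols simply never get a newsgroup field, so ordinary queries behave the same for them, and only a query that names the field explicitly (for example "newsgroup:comp.lang.java nutch") is restricted by it.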
Praveen.

On Fri, 18 Feb 2005 11:14:17 -0800, David Spencer <[EMAIL PROTECTED]> wrote:
> I've used Lucene a lot, am new to Nutch, and have started fooling around
> w/ adding an NNTP and IMAP protocol handler to Nutch.
>
> Question is, what's the "policy" on adding more meta-data, or fields, to
> Documents e.g. for NNTP documents (newsgroup articles) there are lots of
> header fields one could index, esp, say, the newsgroup name.
>
> Is Nutch doing a kind of lowest common denominator approach, thus new
> protocol handlers should not add more fields as they can't be used
> across all protocols, and maybe this isn't as scalable as it increases
> the disk requirements...
>
> or...
>
> Is it ok to add more fields as then the users can do "advanced" queries
> if they want?
>
> thx,
> Dave

_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers
