Hello, Please have a look at index-more and query-more plugins for content-type handling. Regards Piotr Vacuum Joe wrote:
I have been looking through the API docs and I can't figure this out. Here is my question:Is there a way to search based on meta-information, such as content type, or even the value of header fields? For example, let's say I would like to find only PDFs, or perhaps put higher weight on PDFs vs. other kinds of documents. Can this be done? I looked at the query interface. It looks like NutchBean allows me to specify a Query, and a Query is basically made up of Strings which are in the content. I can't find any way to specify meta-information I'm looking for. Any ideas on this? Thanks ____________________________________________________Start your day with Yahoo! - make it your home page http://www.yahoo.com/r/hs
------------------------------------------------------- SF.Net email is sponsored by: Discover Easy Linux Migration Strategies from IBM. Find simple to follow Roadmaps, straightforward articles, informative Webcasts and more! Get everything you need to get up to speed, fast. http://ads.osdn.com/?ad_id=7477&alloc_id=16492&op=click _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
