Hi everybody, I need to query for documents not only for search terms but also for numeric values (or other general types). Let me try to explain with a hypothetical example.
Assuming there is a value for the number words in each document (or the number of person names, or whatever), I would want to formulate a query like "Give me documents containing 'jack johnson' AND with token_count > 250". I've been working with Lucene before and the keyword part is easy, but what would be a good solution to query for numbers etc.? One first idea I had was storing the numbers (which are basically a HashMap<String,Double>) in the index in some way or the other. But it is not at all obvious for me how to query them then. Another thing I could think of would be using a separate database of any type, but then how to bring those two together in a way that makes sense? Any pointers to useful resources and any types of hints are welcome! :-) Best, Niels -- Niels Ott Computational Linguist (B.A.) http://www.drni.de/niels/ --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]