I just tried the long query string as you suggested and it works great. Thanks, Shuai
On Jul 30, 2010, at 1:35 PM, Ian Lea wrote: > Yes, you can do that. Make a Query for the 30 papers and use that > with your main query in a BooleanQuery if doing it programatically. > Or with so few documents and papers to match, just in a long string > via QueryParser. See > http://lucene.apache.org/java/3_0_2/queryparsersyntax.html for details > on query parser syntax - that is also the answer to your other > question. > > If performance is a concern and you will be reusing the same set of > pubmed ids you should look at filters, specifically QueryWrapperFilter > and CachingWrapperFilter. > > > -- > Ian. > > > On Fri, Jul 30, 2010 at 6:19 PM, Shuai Weng <sh...@genome.stanford.edu> wrote: >> >> Sorry for the confusion.. >> >> Currently, we have total 7000 fulltext papers (with the pubmed IDs stored as >> the unique IDs) >> in the lucene index. We were wondering if we can search for a given term in >> a subset of these papers >> (eg, 30 papers; by providing a list of the pubmed IDs) instead of searching >> the term in these 7000 >> papers. In another word, we only care about the hits in the 30 given papers. >> How can we easily filter out >> the other papers? >> >> Thanks, >> Shuai >> >> >> On Jul 30, 2010, at 9:56 AM, Ian Lea wrote: >> >>> Depending on what exactly you mean by "subset" and "index pool", then yes. >>> >>> If you've got one lucene index containing docs >>> >>> docno: 1 >>> category: computers >>> text: some words about computers >>> >>> docno: 2 >>> category: computers >>> text: some more words about computers >>> >>> docno: 3 >>> category: finance >>> text: some words about finance >>> >>> then a search for "text:words" will match all 3 whereas a search for >>> "category:computers text:words" will only match 2. >>> >>> >>> If this isn't what you are asking about I suggest you provide more detail. >>> >>> >>> -- >>> Ian. >>> >>> On Fri, Jul 30, 2010 at 4:15 PM, Shuai Weng <sh...@genome.stanford.edu> >>> wrote: >>>> >>>> Hey, >>>> >>>> I was wondering if we can search info from a subset of papers >>>> instead of from the whole index pool. >>>> >>>> Thanks, >>>> Shuai >>> >>> --------------------------------------------------------------------- >>> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org >>> For additional commands, e-mail: java-user-h...@lucene.apache.org >> >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org >> For additional commands, e-mail: java-user-h...@lucene.apache.org >> >> > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org