Delete documents base on more than one condition?

2012-12-06 Thread rajashekar
Hi Is it possible to delete a set of documents where they match certain conditions? I would like to delete a set of articles that belong to a given user within a category. Thanks, - -- View this message in context: http://lucene.472066.n3.nabble.com/Delete-documents-base-on-more-

Re: Delete documents base on more than one condition?

2012-12-06 Thread parnab kumar
Hi Rajashekhar, yet it is possible . You can form a Boolean Query which will match the documents as per your required conditions . Then you can delete by the respective document ids by instantiating a indexReader. You can refer to Book Lucene in Action 2nd Edition for more details . Thanks, Parn

Re: Delete documents base on more than one condition?

2012-12-06 Thread Ian Lea
Or, easier, just pass the Query identifying the docs to IndexWriter.deleteDocuments(Query query). There are variants that take multiple queries and single or multiple terms. See the javadocs. You can't delete docs via IndexReader any more. -- Ian. On Thu, Dec 6, 2012 at 9:19 AM, parnab kuma

SpanQuery and Bits

2012-12-06 Thread Carsten Schnober
Hi, I have a problem understanding and applying the BitSets concept in Lucene 4.0. Unfortunately, there does not seem to be a lot of documentation about the topic. The general task is to extract Spans matching a SpanQuery which works with the following snippet: for (AtomicReaderContext atomic : r

Lucene 4.0.0 - find term position.

2012-12-06 Thread Vitaly_Artemov
Hi all, I am new with Lucene. I try to understand how can I find the term position. I use following code to index documents: ... IndexWriter writer = new IndexWriter(mIndexDir, mIwc); FileInputStream fis; fis = new FileInputStream(file); Document doc

Re: Lucene 4.0.0 - find term position.

2012-12-06 Thread lukai
terms = fileds.terms(...); termsEnum = terms.iterator(null); termsEnum.seekExat(...); docsAndPositionsEnum docsPosEnum = termsEnum.docsAndPositions(...); You can get the information in "docsPosEnum". On Thu, Dec 6, 2012 at 2:28 AM, wrote: > Hi all, > I am new with Lucene. > I try to understand

RE: Separating the document dataset and the index dataset

2012-12-06 Thread Jain Rahul
If you are using lucene 4.0 and afford to compress your document dataset while indexing, it will be a huge savings in terms of disk space and also in IO (resulting in indexing throughput). In our case, it has helped us a lot as compressed data size was roughly 3 times less than of original doc