Re: IndexSearcher and IndexWriter.rollback
Q1. Are you using a SearcherManager or a direct IndexSearcher? If you are using a SearcherManager, you could just call `maybeRefresh()` and then re-acquire a new `IndexSearcher`. The method docs <https://lucene.apache.org/core/9_0_0/core/org/apache/lucene/search/ReferenceManager.html#maybeRefresh()> also mention that it is fine to call `maybeRefresh` on multiple threads concurrently. Only the first thread will attempt the refresh; subsequent threads will see that another thread is already handling refresh and will return immediately. Q2. I don't think the IW exposes an interface to rollback to a commit without closing the writer. Hope this helps. Gautam Worah. On Thu, Apr 14, 2022 at 6:35 AM wrote: > I’m using an IndexSearcher created from an IndexWriter (NRT mode).
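A minimal sketch of the maybeRefresh/acquire/release cycle described above, assuming the Lucene 9.x API; the class and method names here are illustrative, not from the original thread:

import java.io.IOException;
import org.apache.lucene.index.IndexWriter;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.search.Query;
import org.apache.lucene.search.SearcherManager;

class NrtRefreshSketch {
    // One manager per writer, created once at startup (null = default SearcherFactory).
    static SearcherManager open(IndexWriter writer) throws IOException {
        return new SearcherManager(writer, null);
    }

    static void searchOnce(SearcherManager manager, Query query) throws IOException {
        manager.maybeRefresh();                     // safe to call from any thread
        IndexSearcher searcher = manager.acquire(); // pinned point-in-time view
        try {
            searcher.search(query, 10);
        } finally {
            manager.release(searcher);              // never close the searcher yourself
        }
    }
}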
IndexSearcher and IndexWriter.rollback
I’m using an IndexSearcher created from an IndexWriter (NRT mode). Up until now the IW was kept open forever. I want to properly handle cases where an indexing task failed and call IW.rollback to discard the incomplete changes. The problem I’m facing is that rollback also closes the writer. Q1: Can I somehow keep using the same IndexSearcher instance after the writer is closed? Q2: Can I rollback the changes without closing the writer? Creating a new IndexSearcher is possible but can be a bit fragile as it is used by many threads concurrently.
Re: Can an indexreader/indexsearcher survive index edits?
Hi, If you continue to use your code without any changes your searcher should still work but it won't return newly indexed documents or reflect deletes. You can consider using a SearcherManager in your searching process and periodically (use a thread maybe?) ask it to `maybeRefresh()`. Then the next time you call `acquire()` on this SearcherManager, you will get an updated Searcher that can reflect the new incremental changes the other thread has made on the index. Useful references: Search using a SearcherManager (has a code example similar to your situation) <https://blog.mikemccandless.com/2011/09/lucenes-searchermanager-simplifies.html> Near real time search with a SearcherManager (faster than the above approach) <https://blog.mikemccandless.com/2011/11/near-real-time-readers-with-lucenes.html> Similar stackoverflow question <https://stackoverflow.com/questions/45275557/lucene-near-real-time-search> - Gautam Worah. On Wed, Sep 22, 2021 at 1:10 PM Trevor Nicholls wrote: > Hi > > Lucene 8.6.3 > > In a prototype application I build a Lucene index with a single process and query it with another.
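A sketch of the periodic-refresh idea above, assuming the search service opens a SearcherManager on the index Directory (Lucene 8.x APIs; variable names are illustrative). Note that changes made by the separate indexing process only become visible after that process commits:

import java.io.IOException;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import org.apache.lucene.search.SearcherManager;
import org.apache.lucene.store.Directory;

class PeriodicRefreshSketch {
    static ScheduledExecutorService startRefresher(Directory dir) throws IOException {
        final SearcherManager manager = new SearcherManager(dir, null);
        ScheduledExecutorService refresher = Executors.newSingleThreadScheduledExecutor();
        refresher.scheduleWithFixedDelay(() -> {
            try {
                manager.maybeRefresh(); // picks up changes committed by the indexing process
            } catch (IOException e) {
                // log and carry on; previously acquired searchers stay valid
            }
        }, 1, 1, TimeUnit.SECONDS);
        return refresher; // caller shuts this down; acquire()/release() as usual elsewhere
    }
}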
Can an indexreader/indexsearcher survive index edits?
Hi Lucene 8.6.3 In a prototype application I build a Lucene index with a single process and query it with another. Every operation is a new process. When the data changes I simply recreate the index and future searches pick up the new index. Of course performance is sub-optimal. So I am changing this so that after the initial build subsequent data changes will update the index rather than rebuilding the entire index. I am also changing the search method so that I have a single service which creates an IndexReader and IndexSearcher at startup, and reads and responds to search requests through a socket. I know that an existing index can be maintained with selective deletions and additions, but I am not sure if the process holding the reader and searcher objects can continue running without having to close and recreate them when the index is modified. Is it safe to do that? cheers T
Re: Query on searchAfter API usage in IndexSearcher
Are you specifying a sort clause on your query? I'm not totally sure, but I think having a sort clause might be a requirement for efficient deep paging. I know Solr's cursorMark feature uses the searchAfter API, and a cursorMark is essentially the sort values of the last document from the previous result: https://github.com/apache/lucene-solr/blob/e30264b31400a147507aabd121b1152020b8aa6d/solr/core/src/java/org/apache/solr/search/SolrIndexSearcher.java#L1524-L1525 https://lucene.apache.org/solr/guide/7_3/pagination-of-results.html On Wed, May 9, 2018 at 4:56 AM, Jacky Li wrote: > I have encountered the same problem, I wonder if anyone knows the solution? > > Regards, > Jacky
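For reference, a hedged sketch of what sorted deep paging through the searchAfter API looks like; the "timestamp" field is hypothetical, and in recent Lucene versions a sort field needs doc values:

import java.io.IOException;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.search.Query;
import org.apache.lucene.search.ScoreDoc;
import org.apache.lucene.search.Sort;
import org.apache.lucene.search.SortField;
import org.apache.lucene.search.TopDocs;

class DeepPagingSketch {
    static void pageThrough(IndexSearcher searcher, Query query) throws IOException {
        Sort sort = new Sort(new SortField("timestamp", SortField.Type.LONG));
        TopDocs page = searcher.search(query, 10, sort);
        while (page.scoreDocs.length > 0) {
            // ... process page.scoreDocs ...
            ScoreDoc last = page.scoreDocs[page.scoreDocs.length - 1];
            // 'last' is a FieldDoc whose sort values act as the cursor
            page = searcher.searchAfter(last, query, 10, sort);
        }
    }
}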
Re: Query on searchAfter API usage in IndexSearcher
I have encountered the same problem, I wonder if anyone knows the solution? Regards, Jacky
Re: Query on searchAfter API usage in IndexSearcher
Hi Lucene Team, Can you please reply to my query? It's an urgent issue and we need to resolve it at the earliest. The Lucene version used is 6.3.0, but we even tried with the latest version, 7.3.0. Regards Manish Gupta
Query on searchAfter API usage in IndexSearcher
Hi Team, I am new to Lucene and I am trying to use Lucene for text search in my project to achieve better results in terms of query performance. Initially I was facing a lot of GC issues while using Lucene, as I was using the search API and passing the total document count. As my data size is around 4 billion, the number of documents created by Lucene was huge. Internally the search API uses TopScoreDocCollector, which internally creates a PriorityQueue of the given document count, thus causing a lot of GC. *To avoid this problem I am trying to query in a paginated way, wherein I query only 10 documents at a time and after that I use the searchAfter API to query further, passing the lastScoreDoc from the previous result. This has resolved the GC problem, but the query time has increased by a huge margin, from 3 sec to 600 sec.* *When I debugged I found that even though I use the searchAfter API, it is not avoiding the IO and every time it is reading the data from disk again. It is only skipping the results filled in the previous search. Is my understanding correct? If yes, please let me know if there is a better way to query the results in incremental order so as to avoid GC with minimal impact on query performance.* Regards Manish Gupta
Re: Implement an IndexSearcher which never returns any documents
Thanks, great solution. 2017-09-11 17:55 GMT+02:00 Adrien Grand <jpou...@gmail.com>: > You could create a `new IndexSearcher(new MultiReader());`
Re: Implement an IndexSearcher which never returns any documents
You could create a `new IndexSearcher(new MultiReader());` Le sam. 9 sept. 2017 à 19:40, Mitchell Stevenson <mitchell.stevenson...@gmail.com> a écrit : > I need to implement an IndexSearcher for Lucene 7 which never returns > any documents. > Is the following implementation suitable for this?
Implement an IndexSearcher which never returns any documents
I need to implement an IndexSearcher for Lucene 7 which never returns any documents. Is the following implementation suitable for this? The code seems to work nicely but I am not sure about it.

IndexSearcher noDocsSearcher = new IndexSearcher(new NoDocsReader());

public class NoDocsReader extends LeafReader {

    private final static Bits liveDocs = new Bits.MatchNoBits(0);

    public NoDocsReader() {
        tryIncRef(); // keep reader open
    }

    @Override
    public NumericDocValues getNumericDocValues(final String field) throws IOException {
        return new NumericDocValues() {
            @Override
            public long longValue() throws IOException { return 0; }

            @Override
            public boolean advanceExact(int target) throws IOException { return false; }

            @Override
            public int docID() { return 0; }

            @Override
            public int nextDoc() throws IOException { return 0; }

            @Override
            public int advance(int target) throws IOException { return 0; }

            @Override
            public long cost() { return 0; }
        };
    }

    @Override
    public BinaryDocValues getBinaryDocValues(final String field) throws IOException { return null; }

    @Override
    public SortedDocValues getSortedDocValues(final String field) throws IOException { return null; }

    @Override
    public SortedNumericDocValues getSortedNumericDocValues(final String field) throws IOException { return null; }

    @Override
    public SortedSetDocValues getSortedSetDocValues(final String field) throws IOException { return null; }

    @Override
    public NumericDocValues getNormValues(final String field) throws IOException { return null; }

    @Override
    public FieldInfos getFieldInfos() { return new FieldInfos(new FieldInfo[0]); }

    @Override
    public Bits getLiveDocs() { return liveDocs; }

    @Override
    public void checkIntegrity() throws IOException { }

    @Override
    public Fields getTermVectors(final int docID) throws IOException { return null; }

    @Override
    public int numDocs() { return 0; }

    @Override
    public int maxDoc() { return 0; }

    @Override
    public void document(final int docID, final StoredFieldVisitor visitor) throws IOException { }

    @Override
    protected void doClose() throws IOException { }

    @Override
    public boolean hasDeletions() { return false; }

    @Override
    public CacheHelper getCoreCacheHelper() { return null; }

    @Override
    public Terms terms(String field) throws IOException { return null; }

    @Override
    public PointValues getPointValues(String field) throws IOException { return null; }

    @Override
    public LeafMetaData getMetaData() { return null; }

    @Override
    public CacheHelper getReaderCacheHelper() { return null; }
}

Thanks Mitch
RE: Lucene IndexSearcher PrefixQuery search getting really slow after a while
Try to optimize your indexes. Sent securely from my iPhone From: Jason Wu Sent: Thursday, 3 November 2016 at 22:21:55 To: java-user@lucene.apache.org Subject: Lucene IndexSearcher PrefixQuery search getting really slow after a while
Lucene IndexSearcher PrefixQuery search getting really slow after a while
Hi Team, We have been using Lucene 4.8.1 to do info searches every day for years. However, recently we encountered some performance issues which greatly slow down the Lucene search. After the application has been running for a while, we face the issue below, where an IndexSearcher PrefixQuery takes much longer to search: [inline screenshot: PrefixQuery search timings] Our CPU and memory are fine, no leak found: [inline screenshot: CPU and memory charts] However, for exactly the same Java instance running on another box, the same search is very fast. I/O, memory and CPUs are all fine on both boxes. So, do you know any reasons that can cause this performance issue? Thank you, J.W
Re: Sorting IndexSearcher results by LongPoint with 6.0
Hi Jeremy, Yes. That's right. The question is whether you really need the stored field, but that's out of scope for this issue. Uwe Am 27. Mai 2016 01:21:48 MESZ, schrieb Jeremy Friesen: >Thanks for the help. So just to sum up, if I have a numeric field type that I want to be able to do a range query on, sort by, and also retrieve in the document as a stored value, I will need to add it to the document three times, as a NumericDocValuesField, as a LongPoint, and as a StoredField. >Does that sound correct? -- Uwe Schindler H.-H.-Meier-Allee 63, 28213 Bremen http://www.thetaphi.de
Re: Sorting IndexSearcher results by LongPoint with 6.0
Thanks for the help. So just to sum up, if I have a numeric field type that I want to be able to do a range query on, sort by, and also retrieve in the document as a stored value, I will need to add it to the document three times: as a NumericDocValuesField, as a LongPoint, and as a StoredField. Does that sound correct? On Thu, May 26, 2016 at 3:43 PM, Uwe Schindler wrote: > Hi > > Sorting does not work on indexed fields anymore (since Lucene 5), unless > you use UninvertingReader. Point values don't work with that because they > cannot be uninverted.
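A sketch of that three-field pattern against the Lucene 6.x document API; the field name is illustrative:

import java.io.IOException;
import org.apache.lucene.document.Document;
import org.apache.lucene.document.LongPoint;
import org.apache.lucene.document.NumericDocValuesField;
import org.apache.lucene.document.StoredField;
import org.apache.lucene.index.IndexWriter;

class TimestampFieldsSketch {
    static void add(IndexWriter writer, long millis) throws IOException {
        Document doc = new Document();
        doc.add(new LongPoint("timestamp", millis));             // range queries
        doc.add(new NumericDocValuesField("timestamp", millis)); // sorting
        doc.add(new StoredField("timestamp", millis));           // retrieving the value
        writer.addDocument(doc);
    }
}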
Re: Sorting IndexSearcher results by LongPoint with 6.0
Hi, Sorting does not work on indexed fields anymore (since Lucene 5), unless you use UninvertingReader. Point values don't work with that because they cannot be uninverted. For sorting it's the same rule for all field types: enable DocValues! You just have to add another field instance with same name using doc values (some numeric type). Uwe Am 26. Mai 2016 23:53:56 MESZ, schrieb Jeremy Friesen: >I'm attempting to upgrade my project to Lucene 6.0, and have run into >an >issue with sorting my results. -- Uwe Schindler H.-H.-Meier-Allee 63, 28213 Bremen http://www.thetaphi.de
Sorting IndexSearcher results by LongPoint with 6.0
I'm attempting to upgrade my project to Lucene 6.0, and have run into an issue with sorting my results. My documents have a timestamp field that was previously a StoredField with NumericType: Long. I've converted it to a LongPoint, which seems to work fine for range queries. My problem is that trying to sort search results with a SortField of type Long now doesn't seem to work with a LongPoint field. I get an IllegalStateException "unexpected docvalues type NONE for field 'timestamp' (expected=NUMERIC). Use UninvertingReader or index with docvalues." I'm guessing the sorter hasn't been updated to work with PointValues yet, but I just wanted to check with the mailing list to see if anyone else has found a way to do results sorting under 6.0.
Re: IndexReader returns all fields, but IndexSearcher does not
Hi - I suggest you narrow the problem down to a small self-contained example and if you still can't get it to work, show us the code. And tell us what version of Lucene you are using. -- Ian. On Mon, Jun 1, 2015 at 5:20 PM, Rahul Kotecha <kotecha.rahul...@gmail.com> wrote: > Hi All, I am trying to query an index.
IndexReader returns all fields, but IndexSearcher does not
Hi All, I am trying to query an index. When I try to read the index using IndexReader, I am able to print all the fields (close to 30 fields stored) in the index. However, when I run a query on the same index using IndexSearcher, I am able to get only a couple of fields instead of all the fields as returned by IndexReader. Any help would be greatly appreciated. Regards, Rahul Kotecha
IndexSearcher creation policy question
I've this scenario in a web application: 1. many users query a Lucene index concurrently (obvious) 2. one user can make several queries (she may have different browser windows open) 3. all those queries need to have a consistent paging behavior (next, previous buttons) 4. The index can be updated at any time by users. What I understand is that: - I need a fresh IndexSearcher for each initial query (DirectoryReader.open - reader - searcher) and cannot use Search(Lifetime)Manager's. - I cannot share IndexSearchers in the depicted scenario; even for the same user, a different IndexSearcher is needed for each window. Is my understanding true? What would be the best approach to handle this scenario? Kind regards, Rolf.
Re: IndexSearcher creation policy question
Your best bet is to use a searcher manager to manage the searcher instance, and only refresh the manager if writes are committed. This way the same searcher instances can be shared by multiple threads. For the paging, if you want to have a guaranteed consistent view, you have to keep around the searcher instance provided by the manager, and only release it once all the search/paging is done. But do remember to release it afterwards, otherwise you will quickly accumulate lots of unclosed old searcher instances. On Friday, August 22, 2014, Rolf Veen <rolf.v...@gmail.com> wrote: > I've this scenario in a web application: 1. many users query a Lucene index concurrently (obvious)
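A sketch of the acquire/page/release pattern just described; the names are illustrative and error handling is elided:

import java.io.IOException;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.search.Query;
import org.apache.lucene.search.ScoreDoc;
import org.apache.lucene.search.SearcherManager;
import org.apache.lucene.search.TopDocs;

class ConsistentPagingSketch {
    static void pageThrough(SearcherManager manager, Query query) throws IOException {
        IndexSearcher searcher = manager.acquire(); // pin one point-in-time view
        try {
            TopDocs page = searcher.search(query, 10);
            if (page.scoreDocs.length > 0) {
                ScoreDoc last = page.scoreDocs[page.scoreDocs.length - 1];
                // "next" must reuse the SAME pinned searcher for consistent paging
                TopDocs next = searcher.searchAfter(last, query, 10);
            }
        } finally {
            manager.release(searcher); // release only once all paging is done
        }
    }
}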
Re: absence of searchAfter method with Collector parameter in Lucene IndexSearcher
Thank you Hoss. I was exactly looking for something like TopFieldCollector.create(...). Basically my objective is to sort the documents by document number (I have a read-only index with only one segment, because of some other requirements). Here's what I did:

// create a sort field based on document number
SortField sortField = new SortField(null, Type.DOC);
// create a sort instance based on the sortField
Sort sort = new Sort(sortField);
// create a FieldDoc instance from the 'ScoreDoc after' instance
FieldDoc fieldDoc = new FieldDoc(after.doc, 0, new Object[] { after.doc });
// create a collector; this collector will be wrapped later on (but I am not showing that part here)
TopFieldCollector collector = TopFieldCollector.create(sort, numHits, fieldDoc, true, false, false, true);
// search the index
indexSearcher.search(query, collector);

Logically, everything should be working fine. But I get java.lang.ArrayIndexOutOfBoundsException: -1 all the time. The only part that looks problematic is the instance of FieldDoc. Since I have defined the sort to be based on document number in Lucene, my fieldDoc must contain the document number of the 'after' ScoreDoc, as per the documentation. But this is somehow not working. I would appreciate your suggestions. Best, -- Kailash Budhathoki On Sat, Jun 7, 2014 at 12:00 AM, Chris Hostetter <hossman_luc...@fucit.org> wrote: > : I was wondering why there is no search method in lucene Indexsearcher to > : search after last reference by passing collector.
absence of searchAfter method with Collector parameter in Lucene IndexSearcher
Hi, I was wondering why there is no search method in Lucene's IndexSearcher to search after a last reference by passing a collector; say a method with a signature like searchAfter(Query query, ScoreDoc after, Collector results). For a normal search there are two ways to search: one by passing a collector and one by passing the number of hits. But the searchAfter method only supports the number of hits. Is this done deliberately, for some architectural reason? Thanking you. Best, -- Kailash Budhathoki
Re: absence of searchAfter method with Collector parameter in Lucene IndexSearcher
: I was wondering why there is no search method in lucene Indexsearcher to : search after last reference by passing collector. Say a method with : signature like searchAfter(Query query, ScoreDoc after, Collector results). searchAfter only makes sense if there is a Sort involved -- either explicitly or implicitly on score. When you use a Collector, even if your collector produces ScoreDoc objects, a subsequent (hypothetical) call searchAfter(Query,ScoreDoc,Collector) would have no idea what the meaning of after was for that ScoreDoc. (Even if the ScoreDoc was an instance of FieldDoc that encapsulated the values for the sort fields, it doesn't know what the fieldNames are, or what the comparator/direction to use against those field+values are to know what is after them.) So from an API standpoint: it just doesn't make any sense. If you want searchAfter functionality along with custom Collector logic, take a look at things like TopFieldCollector.create(...) which you could then wrap in your own Collector. -Hoss http://www.lucidworks.com/
Re: How to make good use of the multithreaded IndexSearcher?
Hi Benson, On Mon, Sep 30, 2013 at 5:21 PM, Benson Margulies <ben...@basistech.com> wrote: > The multithreaded index searcher fans out across segments. How aggressively does 'optimize' reduce the number of segments? If the segment count goes way down, is there some other way to exploit multiple cores? forceMerge[1], formerly known as optimize, takes a parameter that configures how many segments should remain in the index. Regarding multi-core usage: if your query load is high enough to use all your CPUs (there are always #cores queries running in parallel), there is generally no need to use the multi-threaded IndexSearcher. The multi-threaded IndexSearcher can however help in case all CPU power is not in use, or if you care more about latency than throughput. It indeed leverages the fact that the index is split into segments to parallelize query execution, so a fully merged index will actually run the query in a single thread in any case. There is no way to make query execution efficiently use several cores on a single-segment index, so if you really want to parallelize query execution, you will have to shard the index, to do at the index level what the multi-threaded IndexSearcher does at the segment level. Side notes: - A single-segment index only runs terms-dictionary-intensive queries more efficiently; it is generally discouraged to run forceMerge on an index unless this index is read-only. - The multi-threaded IndexSearcher only parallelizes query execution in certain cases. In particular, it never parallelizes execution when the method takes a collector. This means that if you want to use TotalHitCountCollector to count matches, you will have to do the parallelization by yourself. [1] http://lucene.apache.org/core/4_4_0/core/org/apache/lucene/index/IndexWriter.html#forceMerge%28int%29 -- Adrien
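A sketch of how the multi-threaded IndexSearcher mentioned above is constructed (Lucene 4.x-era API; the pool sizing is illustrative):

import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import org.apache.lucene.index.IndexReader;
import org.apache.lucene.search.IndexSearcher;

class ParallelSearcherSketch {
    static IndexSearcher create(IndexReader reader) {
        ExecutorService pool =
            Executors.newFixedThreadPool(Runtime.getRuntime().availableProcessors());
        // Query execution fans out one task per segment; methods that take a
        // Collector still run single-threaded, as the side notes above explain.
        // The caller owns the pool and must shut it down; IndexSearcher won't.
        return new IndexSearcher(reader, pool);
    }
}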
Re: How to make good use of the multithreaded IndexSearcher?
You might want to set a smallish maxMergedSegmentMB in TieredMergePolicy to force enough segments in the index ... sort of the opposite of optimizing. Really, IndexSearcher's approach of using one thread per segment is rather silly, and it's annoying/bad to expose changes in behavior due to segment structure. I think it'd be better to carve up the overall docID space into N virtual shards. Ie, if you have 100M docs, then one thread searches docs 0-10M, another 10M-20M, etc. Nobody has created such a searcher impl but it should not be hard, and it would be agnostic to the segment structure. But then again, this need (using concurrent hardware to reduce the latency of a single query) is somewhat rare; most apps are fine using the concurrency across queries rather than within one query. Mike McCandless http://blog.mikemccandless.com
Re: How to make good use of the multithreaded IndexSearcher?
Benson, Rather than forcing a random number of small segments into the index using maxMergedSegmentMB, it might be better to split your index into multiple shards. You can create a specific number of balanced shards to control the parallelism and then forceMerge each shard down to 1 segment to avoid spawning extra threads per shard. Once that's done, you just open all of the shards with a MultiReader and use that with the IndexSearcher and an ExecutorService (a sketch follows below). The downside to this is that it doesn't play nicely with near real-time search, but if you have a relatively static index that gets pushed to slaves periodically it gets the job done. As Mike said, it'd be nicer if there was a way to split the docID space into virtual shards, but it's not currently available. I'm not sure if anyone is even looking into it. Regards, Matt On Tue, Oct 1, 2013 at 7:09 AM, Michael McCandless <luc...@mikemccandless.com> wrote: > You might want to set a smallish maxMergedSegmentMB in TieredMergePolicy to force enough segments in the index ... sort of the opposite of optimizing.
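A sketch of the sharded setup Matt describes, under Lucene 4.x APIs; the shard directory names and count are hypothetical:

import java.io.File;
import java.io.IOException;
import java.util.concurrent.ExecutorService;
import org.apache.lucene.index.DirectoryReader;
import org.apache.lucene.index.MultiReader;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.store.FSDirectory;

class ShardedSearcherSketch {
    static IndexSearcher open(int numShards, ExecutorService pool) throws IOException {
        DirectoryReader[] shards = new DirectoryReader[numShards];
        for (int i = 0; i < numShards; i++) {
            // each shard was forceMerged down to one segment, so one thread per shard
            shards[i] = DirectoryReader.open(FSDirectory.open(new File("shard-" + i)));
        }
        return new IndexSearcher(new MultiReader(shards), pool);
    }
}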
Re: How to make good use of the multithreaded IndexSearcher?
On Tue, Oct 1, 2013 at 3:58 PM, Desidero <desid...@gmail.com> wrote: > Benson, Rather than forcing a random number of small segments into the index using maxMergedSegmentMB, it might be better to split your index into multiple shards. Thanks, folks, for all the help. I'm musing about the top-level issue here, which is whether the important case is many independent queries or the latency of just one. In the case where it's just one, we'll follow the shard-related advice.
How to make good use of the multithreaded IndexSearcher?
The multithreaded index searcher fans out across segments. How aggressively does 'optimize' reduce the number of segments? If the segment count goes way down, is there some other way to exploit multiple cores?
IndexSearcher using Collector
Hi, I have multiple indexes that I want to search against, so I am using a MultiReader for that. Along with this I also want all the matches to the query, so I am using the Collector class for this. The issue I am facing is that I am not able to know when all the matches are done, i.e. for each matching doc the collect function on the Collector class will be called, but how can I come to know when all the matches are done? The search function doesn't block. Is there any way to get this done? Thanks Amit
RE: IndexSearcher using Collector
Hi, The search function does block. IndexSearcher.search(Query, Collector) returns when all collecting is done. You can do the after-collect work after it returns. Uwe - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de
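For example, with the built-in TotalHitCountCollector (a sketch; any Collector behaves the same way with respect to blocking):

import java.io.IOException;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.search.Query;
import org.apache.lucene.search.TotalHitCountCollector;

class BlockingCollectSketch {
    static int countMatches(IndexSearcher searcher, Query query) throws IOException {
        TotalHitCountCollector collector = new TotalHitCountCollector();
        searcher.search(query, collector); // returns only when collecting is done
        return collector.getTotalHits();   // safe to read: all matches have been seen
    }
}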
RE: Necessary to close() IndexSearcher in 4.X?
Hi, In Lucene before 4.0 there was a close method in IndexSearcher, because you were able to create an IndexSearcher using a Directory, which internally opened an IndexReader. This IndexReader had to be closed, so there was a need for IndexSearcher.close(). In 3.x this constructor (taking Directory/String/File) was deprecated and you now have to pass an already open IndexReader to the constructor. In 4.x this deprecated stuff was finally removed and IndexSearcher is only a thin wrapper around IndexReader, so you are responsible for opening/closing the IndexReader; IndexSearcher no longer does this. Your try-finally block must be around the IndexReader. But please note: keep the IndexReader open as long as possible, as it is very expensive to open/close them all the time. Uwe
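A minimal sketch of where the try-finally lands in 4.x, per the advice above:

import java.io.IOException;
import org.apache.lucene.index.DirectoryReader;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.search.Query;
import org.apache.lucene.store.Directory;

class ReaderLifecycleSketch {
    static void searchOnce(Directory dir, Query query) throws IOException {
        DirectoryReader reader = DirectoryReader.open(dir);
        try {
            IndexSearcher searcher = new IndexSearcher(reader); // thin wrapper, no close()
            searcher.search(query, 10);
        } finally {
            reader.close(); // the reader is what must be closed in 4.x
        }
    }
}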
Re: Necessary to close() IndexSearcher in 4.X?
Thanks for the feedback, Uwe. I'll not be looking at this again until tomorrow, so at least this gives me time to think it through. -- *Lewis*
Necessary to close() IndexSearcher in 4.X?
Hi, I am encountering many situations where searcher.close() is present in finally blocks such as:

} finally {
    if (searcher != null) {
        try {
            searcher.close();
        } catch (Exception ignore) {
        }
        searcher = null;
    }
}

Is some similar implementation still necessary in the 4.X API? Thank you very much Lewis -- *Lewis*
Re: How to get field names and types from an IndexSearcher
Just for the record, the solution that I adopted is as follows: - Create a setType(String field, String type) method and call it for any known numeric fields, before adding any document. This method saves the type definition in a file and also fills the Map<String,NumericConfig> that is passed to StandardQueryParser.setNumericConfigMap(). This approach has the drawback that the types must be known in advance (that is, it is a schema), but it's more robust than guessing the types from the documents themselves (as my initial request implied). Kind regards, Rolf. On Fri, Feb 1, 2013 at 3:22 PM, Rolf Veen <rolf.v...@gmail.com> wrote: On Fri, Feb 1, 2013 at 12:43 PM, Michael McCandless <luc...@mikemccandless.com> wrote: There is actually one way to check if a field was indexed numerically: you can seek to the first term in the field, and attempt to parse it as a long/float/etc., and if that throws a NumberFormatException, it was indexed numerically. Ie, numeric fields are indexed using the formats from oal.util.NumericUtils, which will not parse as normal numbers. This is what Lucene's FieldCache does to check how to decode numeric values when uninverting ... Very good info. Thank you, Mike.
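A hedged sketch of the NumericConfig wiring described above (Lucene 4.x flexible query parser; the field name "price" and the precision step are illustrative, and the exact NumericConfig constructor arguments should be checked against your version):

import java.text.NumberFormat;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import org.apache.lucene.analysis.Analyzer;
import org.apache.lucene.document.FieldType.NumericType;
import org.apache.lucene.queryparser.flexible.standard.StandardQueryParser;
import org.apache.lucene.queryparser.flexible.standard.config.NumericConfig;

class NumericConfigSketch {
    static StandardQueryParser build(Analyzer analyzer) {
        StandardQueryParser parser = new StandardQueryParser(analyzer);
        Map<String, NumericConfig> numerics = new HashMap<String, NumericConfig>();
        // "price" is typed as LONG, so price:1 and price:[1 TO 10] parse as numeric queries
        numerics.put("price", new NumericConfig(
            8, NumberFormat.getNumberInstance(Locale.ROOT), NumericType.LONG));
        parser.setNumericConfigMap(numerics);
        return parser;
    }
}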
Re: How to get field names and types from an IndexSearcher
On Thu, Jan 31, 2013 at 9:55 PM, Michael McCandless <luc...@mikemccandless.com> wrote: > But are you wanting to, eg, make a NumericRangeQuery if you detect the field was indexed numerically, and otherwise a TermRangeQuery, or something...? (Not easy) This is what I want, yes. But I begin to understand that this is not possible without storing additional metadata, as neither the index nor the documents preserve the type info (correct me if I'm wrong). On the other hand, once a field (name) has been typified (by auto-detection or a configuration file), in my case the field will maintain its type across documents and can thus be an index property. And since auto-detection is not very robust, I think I'll end up needing a schema or type definition after all (a field name to type mapping), which is not difficult to implement (or use Solr, I guess). Kind regards, Rolf
Re: How to get field names and types from an IndexSearcher
Getting the FieldInfos from each AtomicReader is the right approach! But, FieldInfos won't tell you which XXXField class was used for the indexing: that information is not fully preserved ... Mike McCandless http://blog.mikemccandless.com On Thu, Jan 31, 2013 at 6:33 AM, Rolf Veen <rolf.v...@gmail.com> wrote: > Hello, all. I want to get a list of field names and types out of an IndexSearcher or IndexReader (not necessarily Atomic). By type I mean if it was stored as StringField, LongField, etc. Is this possible? I could get the field names this way, probably not the simplest one to get a unified field list:

IndexReader reader = searcher.getIndexReader();
for (AtomicReaderContext rc : reader.leaves()) {
    AtomicReader ar = rc.reader();
    FieldInfos fis = ar.getFieldInfos();
    for (FieldInfo fi : fis)
        System.out.println(fi.name);
}

Kind regards, Rolf.
Re: How to get field names and types from an IndexSearcher
Thank you, Mike. I didn't state why I need this. I want to be able to send a query to some QueryParser that understands field:1 regardless of whether 'field' was added as StringField or LongField, for example. I do not want to rely on schema information if I can avoid it, and rather use a smart QueryParser. What would be the best approach to implement this? Kind regards, Rolf. On Thu, Jan 31, 2013 at 1:07 PM, Michael McCandless luc...@mikemccandless.com wrote: Getting the FieldInfos from each AtomicReader is the right approach! But, FieldInfos won't tell you which XXXField class was used for the indexing: that information is not fully preserved ... Mike McCandless http://blog.mikemccandless.com On Thu, Jan 31, 2013 at 6:33 AM, Rolf Veen rolf.v...@gmail.com wrote: Hello, all. I want to get a list of field names and types out of an IndexSearcher or IndexReader (not necessarily Atomic). By type I mean if it was stored as StringField, LongField, etc. Is this possible? I could get the field names this way, probably not the simplest one to get a unified field list: IndexReader reader = searcher.getIndexReader(); for (AtomicReaderContext rc : reader.leaves()) { AtomicReader ar = rc.reader(); FieldInfos fis = ar.getFieldInfos(); for (FieldInfo fi : fis) System.out.println(fi.name); } Kind regards, Rolf.
Re: How to get field names and types from an IndexSearcher
On Thu, Jan 31, 2013 at 7:31 AM, Rolf Veen rolf.v...@gmail.com wrote: Thank you, Mike. I didn't state why I need this. I want to be able to send a query to some QueryParser that understands field:1 regardless of whether 'field' was added as StringField or LongField, for example. I do not want to rely on schema information if I can avoid it, and rather use a smart QueryParser. What would be the best approach to implement this? But are you wanting to, eg, make a NumericRangeQuery if you detect the field was indexed numerically, and otherwise a TermRangeQuery, or something...? (Not easy) Or do you just want to recognize valid fields vs invalid ones? (Easy) Mike McCandless http://blog.mikemccandless.com
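As a reference point, here is a rough sketch of the first-term probe that Mike describes in the first message of this thread, assuming a Lucene 4.x reader. It is a heuristic only: any non-numeric term also fails to parse, and prefix-coded bytes may not even be valid UTF-8, so treat the result with care:

import java.io.IOException;
import org.apache.lucene.index.IndexReader;
import org.apache.lucene.index.MultiFields;
import org.apache.lucene.index.Terms;
import org.apache.lucene.index.TermsEnum;
import org.apache.lucene.util.BytesRef;

public final class NumericFieldSniffer {
    // Returns true if the field's first term does NOT parse as a plain number,
    // which is what NumericUtils-encoded (numeric) terms look like.
    public static boolean looksNumericallyIndexed(IndexReader reader, String field)
            throws IOException {
        Terms terms = MultiFields.getTerms(reader, field);
        if (terms == null) {
            return false; // field is not indexed at all
        }
        TermsEnum termsEnum = terms.iterator(null);
        BytesRef first = termsEnum.next();
        if (first == null) {
            return false; // no terms in this field
        }
        try {
            Long.parseLong(first.utf8ToString());
            return false; // parsed as a normal number, so not prefix-coded
        } catch (Exception e) {
            return true; // NumericUtils prefix-coded terms do not parse as normal numbers
        }
    }
}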
RE: How to properly refresh MultiReader IndexSearcher in Lucene 4.0-BETA
Hi, your code unfortunately will no longer work in later Lucene 4.0 releases. In general the simplest and correct way to do this is: - Manage your DirectoryReaders completely separately from each other in something like a pool of subindex readers (e.g. use some tool like SearcherManager to keep them alive, this is much easier than doing it yourself). Once you need to reopen one, just reopen it and save it. - On *every* search create a new MultiReader() [this costs nothing, as it is just a wrapper] and wrap it with a new IndexSearcher [this also costs you nothing, as it is also just a wrapper]. Uwe - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Mossaab Bagdouri [mailto:bagdouri_moss...@yahoo.fr] Sent: Monday, August 27, 2012 7:37 PM To: java-user@lucene.apache.org Subject: How to properly refresh MultiReader IndexSearcher in Lucene 4.0-BETA Hi, The context is that I've migrated from Lucene 3.6 to Lucene 4.0-BETA. Lucene 3.6 had the convenient method IndexSearcher.isCurrent() for any underlying IndexReader, including MultiReader. This is no longer the case for Lucene 4.0-BETA. I've been suffering for the last 48h until I came up with this solution. I just want to share, and get feedback if any. The idea is to create a new instance of MultiReader, add the old current SubReaders and the new changed ones, refresh the IndexSearcher, then close the old out-of-date SubReaders.

private IndexSearcher getIndexSearcher() {
    try {
        if (is == null || is.getIndexReader().getRefCount() == 0) {
            DirectoryReader newReaders[] = new DirectoryReader[2];
            for (int i = 0; i < 2; i++) {
                newReaders[i] = DirectoryReader.open(MyFSDirectories.get(i));
            }
            is = new IndexSearcher(new MultiReader(newReaders));
        } else {
            MultiReader mr = (MultiReader) is.getIndexReader();
            List<DirectoryReader> oldReaders = (List<DirectoryReader>) mr.getSequentialSubReaders();
            DirectoryReader newReaders[] = new DirectoryReader[oldReaders.size()];
            Set<Integer> toClose = new HashSet<Integer>();
            for (int i = 0; i < oldReaders.size(); i++) {
                DirectoryReader oldDirectoryReader = oldReaders.get(i);
                if (oldDirectoryReader.isCurrent()) {
                    newReaders[i] = oldDirectoryReader;
                } else {
                    toClose.add(i);
                    newReaders[i] = DirectoryReader.openIfChanged(oldReaders.get(i));
                }
            }
            is = new IndexSearcher(new MultiReader(newReaders));
            for (int i : toClose) {
                oldReaders.get(i).close();
            }
        }
    } catch (Exception e) {
        e.printStackTrace();
    }
    return is;
}

Regards, Mossaab
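A minimal sketch of the pattern Uwe recommends, assuming Lucene 4.0 APIs and two subindexes; real code should reference-count the pooled readers (as SearcherManager does) so an in-flight search is not cut off by a concurrent reopen:

import java.io.IOException;
import org.apache.lucene.index.DirectoryReader;
import org.apache.lucene.index.MultiReader;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.store.Directory;

public final class MultiIndexSearchPool {
    private DirectoryReader r1;
    private DirectoryReader r2;

    public MultiIndexSearchPool(Directory dir1, Directory dir2) throws IOException {
        r1 = DirectoryReader.open(dir1);
        r2 = DirectoryReader.open(dir2);
    }

    // Reopen any subreader that changed; cheap no-op otherwise.
    public synchronized void maybeRefresh() throws IOException {
        DirectoryReader n1 = DirectoryReader.openIfChanged(r1);
        if (n1 != null) { r1.close(); r1 = n1; }
        DirectoryReader n2 = DirectoryReader.openIfChanged(r2);
        if (n2 != null) { r2.close(); r2 = n2; }
    }

    // On every search: MultiReader and IndexSearcher are just wrappers, so
    // building fresh ones per request costs almost nothing.
    public synchronized IndexSearcher newSearcher() {
        MultiReader multi = new MultiReader(new DirectoryReader[] { r1, r2 }, false);
        return new IndexSearcher(multi);
    }
}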
Re: Hanging with fixed thread pool in the IndexSearcher multithread code
On Sun, Feb 19, 2012 at 10:39 PM, Trejkaz trej...@trypticon.org wrote: On Mon, Feb 20, 2012 at 12:07 PM, Uwe Schindler u...@thetaphi.de wrote: See my response. The problem is not in Lucene; its in general a problem of fixed thread pools that execute other callables from within a callable running at the moment in the same thread pool. Callables are simply waiting for each other. What we do to get around this issue is to have a utility class which you call to submit jobs to the executor, but instead of waiting after submitting them, it starts calling get() starting from the end of the list. So if there is no other thread available on the executor, the main thread ends up doing all the work and then returns like normal. The problem with this solution is that it requires all code in the system to go through this utility to avoid the issue, and obviously Lucene is one of those things which isn't written to defend against this. Java 7's solution seems to be ForkJoinPool but I gather there is no simple way to use that with Lucene... I take it that a pool which rejects too much work (instead of blocking for a slot) is just as bad from a Lucene standpoint. TX - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org
Hanging with fixed thread pool in the IndexSearcher multithread code
3.5.0: I passed a fixed size executor service with one thread, and then with two threads, to the IndexSearcher constructor. It hung. With three threads, it didn't work, but I got different results than when I don't pass in an executor service at all. Is this expected? Should the javadoc say something? (I can make a patch). - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org
Re: Hanging with fixed thread pool in the IndexSearcher multithread code
On Sun, Feb 19, 2012 at 9:08 AM, Benson Margulies bimargul...@gmail.com wrote: 3.5.0: I passed a fixed size executor service with one thread, and then with two threads, to the IndexSearcher constructor. It hung. With three threads, it didn't work, but I got different results than when I don't pass in an executor service at all. Is this expected? Should the javadoc say something? (I can make a patch). I'm not sure I understand the details here, but I don't like the sound of 'different results': is it possible you can work this down into a test case that can be attached to jira? -- lucidimagination.com - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org
Re: Hanging with fixed thread pool in the IndexSearcher multithread code
I should have been clearer; the hang I can make into a test case, but I wondered if it would just get closed as 'works as designed'. The result discrepancy needs some investigation; I should not have mentioned it yet. On Feb 19, 2012, at 10:40 AM, Robert Muir rcm...@gmail.com wrote: On Sun, Feb 19, 2012 at 9:08 AM, Benson Margulies bimargul...@gmail.com wrote: 3.5.0: I passed a fixed size executor service with one thread, and then with two threads, to the IndexSearcher constructor. It hung. With three threads, it didn't work, but I got different results than when I don't pass in an executor service at all. Is this expected? Should the javadoc say something? (I can make a patch). I'm not sure I understand the details here, but I don't like the sound of 'different results': is it possible you can work this down into a test case that can be attached to jira? -- lucidimagination.com
Re: Hanging with fixed thread pool in the IndexSearcher multithread code
and there was a dumb typo. 1 thread: hang 2 threads: hang 3 or more: no hang On Feb 19, 2012, at 10:40 AM, Robert Muir rcm...@gmail.com wrote: On Sun, Feb 19, 2012 at 9:08 AM, Benson Margulies bimargul...@gmail.com wrote: 3.5.0: I passed a fixed size executor service with one thread, and then with two threads, to the IndexSearcher constructor. It hung. With three threads, it didn't work, but I got different results than when I don't pass in an executor service at all. Is this expected? Should the javadoc say something? (I can make a patch). I'm not sure I understand the details here, but I don't like the sound of 'different results': is it possible you can work this down into a test case that can be attached to jira? -- lucidimagination.com - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org
Re: Hanging with fixed thread pool in the IndexSearcher multithread code
Conveniently, all the 'wrong-result' problems disappeared when I followed your advice about counting hits. On Sun, Feb 19, 2012 at 10:39 AM, Robert Muir rcm...@gmail.com wrote: On Sun, Feb 19, 2012 at 9:08 AM, Benson Margulies bimargul...@gmail.com wrote: 3.5.0: I passed a fixed size executor service with one thread, and then with two threads, to the IndexSearcher constructor. It hung. With three threads, it didn't work, but I got different results than when I don't pass in an executor service at all. Is this expected? Should the javadoc say something? (I can make a patch). I'm not sure I understand the details here, but I don't like the sound of 'different results': is it possible you can work this down into a test case that can be attached to jira? -- lucidimagination.com - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org
Re: Hanging with fixed thread pool in the IndexSearcher multithread code
See https://issues.apache.org/jira/browse/LUCENE-3803 for an example of the hang. I think this nets out to pilot error, but maybe Javadoc could protect the next person from making the same mistake. - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org
RE: Hanging with fixed thread pool in the IndexSearcher multithread code
See my response. The problem is not in Lucene; it's in general a problem of fixed thread pools that execute other callables from within a callable running at the moment in the same thread pool. Callables are simply waiting for each other. Use a separate thread pool for Lucene (or whenever you execute new callables from within another running callable). Uwe - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Benson Margulies [mailto:bimargul...@gmail.com] Sent: Monday, February 20, 2012 1:47 AM To: java-user@lucene.apache.org Subject: Re: Hanging with fixed thread pool in the IndexSearcher multithread code See https://issues.apache.org/jira/browse/LUCENE-3803 for an example of the hang. I think this nets out to pilot error, but maybe Javadoc could protect the next person from making the same mistake.
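A minimal sketch of that advice, assuming Lucene 3.5's IndexSearcher(IndexReader, ExecutorService) constructor; the pool size is arbitrary:

import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import org.apache.lucene.index.IndexReader;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.store.Directory;

public final class SearcherWithOwnPool {
    public static IndexSearcher open(Directory dir) throws Exception {
        // Give Lucene its own pool; never hand it the application pool whose
        // tasks are themselves blocked waiting on search results.
        ExecutorService lucenePool = Executors.newFixedThreadPool(4);
        IndexReader reader = IndexReader.open(dir);
        // Segment searches will fan out over lucenePool; remember to shut the
        // pool down when the searcher is closed.
        return new IndexSearcher(reader, lucenePool);
    }
}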
Re: Hanging with fixed thread pool in the IndexSearcher multithread code
On Sun, Feb 19, 2012 at 8:07 PM, Uwe Schindler u...@thetaphi.de wrote: See my response. The problem is not in Lucene; its in general a problem of fixed thread pools that execute other callables from within a callable running at the moment in the same thread pool. Callables are simply waiting for each other. Use a separate thread pool for Lucene (or whenever you execute new callables from within another running callable) Right. There's nothing like coding a test case to cast one's stupid errors into high relief. Sorry for all the noise. Uwe - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Benson Margulies [mailto:bimargul...@gmail.com] Sent: Monday, February 20, 2012 1:47 AM To: java-user@lucene.apache.org Subject: Re: Hanging with fixed thread pool in the IndexSearcher multithread code See https://issues.apache.org/jira/browse/LUCENE-3803 for an example of the hang. I think this nets out to pilot error, but maybe Javadoc could protect the next person from making the same mistake. - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org
Re: Hanging with fixed thread pool in the IndexSearcher multithread code
On Mon, Feb 20, 2012 at 12:07 PM, Uwe Schindler u...@thetaphi.de wrote: See my response. The problem is not in Lucene; its in general a problem of fixed thread pools that execute other callables from within a callable running at the moment in the same thread pool. Callables are simply waiting for each other. What we do to get around this issue is to have a utility class which you call to submit jobs to the executor, but instead of waiting after submitting them, it starts calling get() starting from the end of the list. So if there is no other thread available on the executor, the main thread ends up doing all the work and then returns like normal. The problem with this solution is that it requires all code in the system to go through this utility to avoid the issue, and obviously Lucene is one of those things which isn't written to defend against this. Java 7's solution seems to be ForkJoinPool but I gather there is no simple way to use that with Lucene... TX - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org
Re: Filter and IndexSearcher in Lucene 4.0 (trunk)
Hi, I apologise upfront for the trivial question. I have an IndexSearcher and I am applying a FieldCacheTermsFilter filter on it to only retrieve documents whose single docId is in a provided set of allowed docIds. I am particularly interested in the stats being estimated over the accepted set of documents. However, the filtering is not working. Am I missing something here? h. - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org
RE: Filter and IndexSearcher in Lucene 4.0 (trunk)
What's the problem? - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Hany Azzam [mailto:h...@eecs.qmul.ac.uk] Sent: Friday, February 10, 2012 6:43 PM To: java-user@lucene.apache.org Subject: Re: Filter and IndexSearcher in Lucene 4.0 (trunk) Hi, I apologise upfront for the trivial question. I have an IndexSearcher and I am applying a FieldCacheTermsFilter filter on it to only retrieve documents whose single docId is in a provided set of allowed docIds. I am particularly interested in the stats being estimated over the accepted set of documents. However, the filtering is not working. Am I missing something here? h.
Re: Filter and IndexSearcher in Lucene 4.0 (trunk)
See, the question was so trivial that you actually missed it :) The problem is that the docs are filtered (which is great) but the stats (BasicStats) aren't, i.e. the stats have been calculated over the whole index and not just a selected set of documents. For example: Filter filter = new FieldCacheTermsFilter(QNO, queryNumber); searcher.search(qq, filter, collector); stats.getNumberOfDocuments(); I only want to consider certain docs per query. The filter achieves that in terms of matching and the returned results. However, the score for each document has been calculated using the stats over the whole index and not just the filtered documents. Is there a way to calculate the stats only over the filtered documents? I hope the problem is a bit clearer now. Thank you. h. On 10 Feb 2012, at 18:27, Uwe Schindler wrote: What's the problem? - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Hany Azzam [mailto:h...@eecs.qmul.ac.uk] Sent: Friday, February 10, 2012 6:43 PM To: java-user@lucene.apache.org Subject: Re: Filter and IndexSearcher in Lucene 4.0 (trunk) Hi, I apologise upfront for the trivial question. I have an IndexSearcher and I am applying a FieldCacheTermsFilter filter on it to only retrieve documents whose single docId is in a provided set of allowed docIds. I am particularly interested in the stats being estimated over the accepted set of documents. However, the filtering is not working. Am I missing something here? h.
Re: IndexSearcher with two Indexes
Hi, I have two indexes. One that contains all the documents in the collection and the other contains only the relevant documents. I am using Lucene 4.0 and the new SimilarityBase class to build my retrieval models (similarity functions). One of the retrieval models requires statistics to be computed across both of the indexes. How can an IndexSearcher use the two indexes at the same time to compute different components of the retrieval model? Is that possible? Thank you very much, Hany
Re: IndexSearcher with two Indexes
On Fri, Jan 27, 2012 at 3:21 PM, Hany Azzam h...@eecs.qmul.ac.uk wrote: Hi, I have two indexes. One that contains all the documents in the collection and the other contains only the relevant documents. I am using Lucene 4.0 and the new SimilariyBase class to build my retrieval models (similarity functions). One of the retrieval models requires statistics to be computed across both of the indexes. How can an IndexSearcher use the two indexes at the same time to compute different components of the retrieval model? Is that possible? you can make a multireader over the two indexreaders, then make an indexsearcher over that multireader... or are you trying to do something else? -- lucidimagination.com - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org
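A short sketch of that suggestion, assuming Lucene 4.0 APIs; the directory paths are placeholders:

import java.io.File;
import org.apache.lucene.index.DirectoryReader;
import org.apache.lucene.index.IndexReader;
import org.apache.lucene.index.MultiReader;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.store.FSDirectory;

public final class TwoIndexSearcher {
    public static IndexSearcher open() throws Exception {
        IndexReader all = DirectoryReader.open(FSDirectory.open(new File("/indexes/all")));
        IndexReader relevant = DirectoryReader.open(FSDirectory.open(new File("/indexes/relevant")));
        // One logical index over both; term/collection statistics are then
        // aggregated across the two subreaders.
        return new IndexSearcher(new MultiReader(all, relevant));
    }
}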
Re: IndexSearcher with two Indexes
Hi Robert, Thanks for the reply. I am trying to do something different. If I use a multireader then the searching/scoring will take place over the two indexes at the same time. However, in my case the subcomponents of the retrieval model are calculated over separate evidence spaces. For example, the retrieval model calculates something like this: score := P(query_term | documents) * P(query_term | relevant_documents) The P(query_term | documents) can be estimated using the index over the whole collection of documents. The P(query_term | relevant_documents) can be estimated using the index over the relevant documents only (which are known prior to the execution of the query). The question is: can I do such a calculation, which uses two separate indexes, in one scoring function? Of course one option is to use the MultiSimilarity class and combine the scores somehow. However, the retrieval function is more complex than that and a simple combination using product or summation won't be feasible. Any ideas on how to resolve this problem (if possible :))? Thanks again, h. On 27 Jan 2012, at 20:29, Robert Muir wrote: On Fri, Jan 27, 2012 at 3:21 PM, Hany Azzam h...@eecs.qmul.ac.uk wrote: Hi, I have two indexes. One that contains all the documents in the collection and the other contains only the relevant documents. I am using Lucene 4.0 and the new SimilarityBase class to build my retrieval models (similarity functions). One of the retrieval models requires statistics to be computed across both of the indexes. How can an IndexSearcher use the two indexes at the same time to compute different components of the retrieval model? Is that possible? you can make a multireader over the two indexreaders, then make an indexsearcher over that multireader... or are you trying to do something else? -- lucidimagination.com
Re: IndexSearcher with two Indexes
On Fri, Jan 27, 2012 at 4:53 PM, Hany Azzam h...@eecs.qmul.ac.uk wrote: Hi Robert, Thanks for the reply. I am trying to do something different. If I use a mutireader then the searching/scoring will take place over the two indexes at the same time. However, in my case the subcomponents of the retrieval model are calculated over separate evidence spaces. For example, the retrieval model calculates something like that: score := P(query_term | documents) * P(query_term | relevant_documents) The P(query_term | documents) can be estimated using the index over the whole collection of documents. The P(query_term | relevant_documents) can be estimated using the index over the relevant documents only (which are known prior to the execution of the query). In this situation, if you want to combine the statistics from different indexes in your own way, you can look at IndexSearcher.termStatistics() and IndexSearcher.collectionStatistics(). These are intended for situations like distributed search, but maybe you can make use of them. here is some pseudocode: IndexReader relevant = IndexReader.open(relevantDirectory); IndexReader documents = IndexReader.open(documentsDirectory); final IndexSearcher relevantSearcher = new IndexSearcher(relevant); IndexSearcher documentsSearcher = new IndexSearcher(documents) { @Override public CollectionStatistics collectionStatistics(String field) throws IOException { CollectionStatistics documentStats = super.collectionStatistics(field); return new CollectionStatistics(... someCombinationOf(documentStats + stuff from relevantSearcher)); } // do a similar thing for termStatistics() }; documentsSearcher.search(...) -- lucidimagination.com - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org
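To make the pseudocode slightly more concrete, here is a hedged sketch that simply sums the statistics of the two indexes, assuming Lucene 4.0's CollectionStatistics constructor; relevantReader and documentsReader are placeholders, whether summing is the right combination depends entirely on the retrieval model, and real code must handle the -1 "statistic not available" sentinels:

import java.io.IOException;
import org.apache.lucene.search.CollectionStatistics;
import org.apache.lucene.search.IndexSearcher;

final IndexSearcher relevantSearcher = new IndexSearcher(relevantReader);
IndexSearcher documentsSearcher = new IndexSearcher(documentsReader) {
    @Override
    public CollectionStatistics collectionStatistics(String field) throws IOException {
        CollectionStatistics d = super.collectionStatistics(field);
        CollectionStatistics r = relevantSearcher.collectionStatistics(field);
        // naive combination: add the counts from both evidence spaces
        return new CollectionStatistics(field,
                d.maxDoc() + r.maxDoc(),
                d.docCount() + r.docCount(),
                d.sumTotalTermFreq() + r.sumTotalTermFreq(),
                d.sumDocFreq() + r.sumDocFreq());
    }
    // termStatistics(Term, TermContext) can be combined the same way
};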
Re: RE: Question about FilterIndexReader and IndexSearcher
Hi, I'm a student at Southeast University in China. Thank you for your help, but I still can't filter the deleted docs. I made a test demo; please tell me why the following procedure gives such a result. Why would IndexSearcher ignore the deleted docs cached in FilterIndexReader? zhouzhou 2011-06-27 From: Uwe Schindler Sent: 2011-06-26 19:05:11 To: java-user@lucene.apache.org CC: Subject: RE: Question about FilterIndexReader and IndexSearcher Hi, usage of FilterIndexReader is not always as easy as it seems. There are several problems that can easily lead to a situation where your FilterIndexReader implements all document filtering, but IndexSearcher does not respect it. I have no idea what you are doing, but the following things need to be done to correctly filter documents: - FilterIndexReader should implement the isDeleted() method too (I assume you did this) - FilterIndexReader should filter the postings returned: termPositions(...) and termDocs(...) to exclude deleted documents - return the correct number for numDocs() The biggest problem since Lucene 2.9 is one specific method that will circumvent all you had done above: getSequentialSubReaders() is used by IndexSearcher to directly pass the searches to all atomic segments of a MultiReader/DirectoryReader structure. As the subreaders returned by this method do not implement the above (they are passed as-is by the default impl), IndexSearcher will in fact only talk to them and so ignore the above methods on the top-level reader. To do this correctly, do one of the following: - easy: override getSequentialSubReaders() to return null; this will make the filtered IndexReader itself atomic, so IndexSearcher will use it during search. The downside: searches may get significantly slower - override getSequentialSubReaders() and also wrap each subreader returned by the delegate reader with your impl. If you implement the last option (but also the return-null option) you may also override reopen(), to correctly wrap reopened segments - you need to do this if you use reopen. If you are already using Lucene trunk (the coming version 4.0), you can follow this issue: https://issues.apache.org/jira/browse/LUCENE-3212 It will implement exactly the above once I have time to finally do it. I will post a first patch soon. This version will not work with Lucene 3.x, as it is lots of work to get all this running easily with Lucene 3.x (especially the above termPositions, termDocs methods). In Lucene 4.0 the filtering of documents is much easier: you only have to override getDeletedDocs() and numDocs(), everything else is handled automatically! Hope that helps. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: 周洲 [mailto:zhou518z...@gmail.com] Sent: Sunday, June 26, 2011 7:08 AM To: java-user Subject: Question about FilterIndexReader and IndexSearcher Hello, I want the IndexReader to pick up modifications in a timely way, so I use MyFilterIndexReader (which extends FilterIndexReader) to cache the deleted documents in RAM. When this FilterIndexReader is passed as the argument of an IndexSearcher, I found that the IndexSearcher does not filter the deleted documents. So I want to know how IndexSearcher and FilterIndexReader should be used so that deleted documents are filtered?
zhouzhou -- 2011-06-26
RE: Question about FilterIndexReader and IndexSearcher
Hi, usage of FilterIndexReader is not always as easy as it seems. There are several problems that can easily lead to a situation where your FilterIndexReader implements all document filtering, but IndexSearcher does not respect it. I have no idea what you are doing, but the following things need to be done to correctly filter documents: - FilterIndexReader should implement the isDeleted() method too (I assume you did this) - FilterIndexReader should filter the postings returned: termPositions(...) and termDocs(...) to exclude deleted documents - return the correct number for numDocs() The biggest problem since Lucene 2.9 is one specific method that will circumvent all you had done above: getSequentialSubReaders() is used by IndexSearcher to directly pass the searches to all atomic segments of a MultiReader/DirectoryReader structure. As the subreaders returned by this method do not implement the above (they are passed as-is by the default impl), IndexSearcher will in fact only talk to them and so ignore the above methods on the top-level reader. To do this correctly, do one of the following: - easy: override getSequentialSubReaders() to return null; this will make the filtered IndexReader itself atomic, so IndexSearcher will use it during search. The downside: searches may get significantly slower - override getSequentialSubReaders() and also wrap each subreader returned by the delegate reader with your impl. If you implement the last option (but also the return-null option) you may also override reopen(), to correctly wrap reopened segments - you need to do this if you use reopen. If you are already using Lucene trunk (the coming version 4.0), you can follow this issue: https://issues.apache.org/jira/browse/LUCENE-3212 It will implement exactly the above once I have time to finally do it. I will post a first patch soon. This version will not work with Lucene 3.x, as it is lots of work to get all this running easily with Lucene 3.x (especially the above termPositions, termDocs methods). In Lucene 4.0 the filtering of documents is much easier: you only have to override getDeletedDocs() and numDocs(), everything else is handled automatically! Hope that helps. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: 周洲 [mailto:zhou518z...@gmail.com] Sent: Sunday, June 26, 2011 7:08 AM To: java-user Subject: Question about FilterIndexReader and IndexSearcher Hello, I want the IndexReader to pick up modifications in a timely way, so I use MyFilterIndexReader (which extends FilterIndexReader) to cache the deleted documents in RAM. When this FilterIndexReader is passed as the argument of an IndexSearcher, I found that the IndexSearcher does not filter the deleted documents. So I want to know how IndexSearcher and FilterIndexReader should be used so that deleted documents are filtered? zhouzhou -- 2011-06-26
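A minimal sketch of Uwe's "easy" option for Lucene 3.x; the RAM-cached delete set is the poster's idea, the class name is illustrative, and a complete version would also filter termDocs()/termPositions() as described above:

import java.util.Set;
import org.apache.lucene.index.FilterIndexReader;
import org.apache.lucene.index.IndexReader;

public class RamDeletesReader extends FilterIndexReader {
    private final Set<Integer> ramDeletes; // doc IDs deleted in RAM but not yet committed

    public RamDeletesReader(IndexReader in, Set<Integer> ramDeletes) {
        super(in);
        this.ramDeletes = ramDeletes;
    }

    @Override
    public IndexReader[] getSequentialSubReaders() {
        // Returning null makes this reader atomic, so IndexSearcher consults
        // our overrides instead of the unwrapped subreaders (the easy option).
        return null;
    }

    @Override
    public boolean hasDeletions() {
        return !ramDeletes.isEmpty() || in.hasDeletions();
    }

    @Override
    public boolean isDeleted(int docID) {
        return ramDeletes.contains(docID) || in.isDeleted(docID);
    }

    @Override
    public int numDocs() {
        return in.numDocs() - ramDeletes.size();
    }
}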
Question about FilterIndexReader and IndexSearcher
Hello, I want the IndexReader to pick up modifications in a timely way, so I use MyFilterIndexReader (which extends FilterIndexReader) to cache the deleted documents in RAM. When this FilterIndexReader is passed as the argument of an IndexSearcher, I found that the IndexSearcher does not filter the deleted documents. So I want to know how IndexSearcher and FilterIndexReader should be used so that deleted documents are filtered? zhouzhou -- 2011-06-26
Re: Lucene: Indexsearcher: java.lang.UnsupportedOperationException
java.lang.UnsupportedOperationException at org.apache.lucene.search.Query.createWeight(Query.java:88) at org.apache.lucene.search.BooleanQuery$BooleanWeight.<init>(BooleanQuery.java:185) at org.apache.lucene.search.BooleanQuery.createWeight(BooleanQuery.java:360) at org.apache.lucene.search.Query.weight(Query.java:95) at org.apache.lucene.search.Searcher.createWeight(Searcher.java:185) at org.apache.lucene.search.Searcher.search(Searcher.java:136) at NVoting.<init>(NVoting.java:159) at Main.main(Main.java:8) On 20 April 2011 05:25, Anshum ansh...@gmail.com wrote: Could you also print and send the entire stack-trace? Also, the query.toString() -- Anshum Gupta http://ai-cafe.blogspot.com On Tue, Apr 19, 2011 at 7:40 PM, Patrick Diviacco patrick.divia...@gmail.com wrote: I get the following error message: java.lang.UnsupportedOperationException with the Lucene search method: topDocs = searcher.search(booleanQuery, null, 100); I'm using an old version of Lucene: Lucene 2.4.1 (I cannot upgrade!) Can you help me understand why I get such an error? Thanks. This is the complete code: http://pastie.org/1811677
Lucene: Indexsearcher: java.lang.UnsupportedOperationException
I get the following error message: java.lang.UnsupportedOperationException with the Lucene search method: topDocs = searcher.search(booleanQuery, null, 100); I'm using an old version of Lucene: Lucene 2.4.1 (I cannot upgrade!) Can you help me understand why I get such an error? Thanks. This is the complete code: http://pastie.org/1811677
Re: Lucene: Indexsearcher: java.lang.UnsupportedOperationException
Could you also print and send the entire stack-trace? Also, the query.toString() -- Anshum Gupta http://ai-cafe.blogspot.com On Tue, Apr 19, 2011 at 7:40 PM, Patrick Diviacco patrick.divia...@gmail.com wrote: I get the following error message: java.lang.UnsupportedOperationException with Lucene search method: topDocs = searcher.search(booleanQuery, null, 100); I'm using an old version of Lucene: Lucene 2.4.1 (I cannot upgrade!) Can you help me to understand why I get such error ? thanks This is the complete code: http://pastie.org/1811677
IndexSearcher Single Instance Bottleneck?
I currently have two types of searches on my website that are using the same index and the same instance of IndexSearcher. One of the searches usually takes only 50-100 milliseconds, but the second usually takes 2 seconds. It seems as though when someone does the second search and another user does the first search immediately after, the first search will wait for the second to complete. Is that how Lucene works, or am I just looking at my test wrong? If so, how should I solve this issue? Two indexes or two index searchers?
Re: IndexSearcher Single Instance Bottleneck?
No, Lucene itself shouldn't be doing this, the recommendation is for multiple threads to share a single searcher. I'd first look upstream, are your requests being processed serially? I.e. is there a single thread that's handling requests? Best Erick On Thu, Mar 10, 2011 at 4:25 PM, RobM rmcclana...@databanq.com wrote: I currently have two types of searches on my website that are using the same index and same instance of index searcher. One of the searches usually only takes 50 - 100 milliseconds but the second usually takes 2 seconds. It seems as though when someone does the second search and another user does the first search immediately after the first search will wait for the second to complete. Is that how Lucene works or am I just looking at my test wrong. If so how should i solve this issue? Two indexes or two index searchers? -- View this message in context: http://lucene.472066.n3.nabble.com/IndexSearcher-Single-Instance-Bottleneck-tp2662376p2662376.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org
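Erick's point, that a single shared IndexSearcher is safe for concurrent use, can be sanity-checked with a sketch like this (Lucene 3.x; the queries are placeholders supplied by the caller). If requests really are handled concurrently, the fast query should not wait for the slow one:

import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.search.Query;
import org.apache.lucene.search.TopDocs;

public final class ConcurrentSearchCheck {
    public static void run(final IndexSearcher shared, final Query fast, final Query slow)
            throws InterruptedException {
        ExecutorService requests = Executors.newFixedThreadPool(2);
        requests.submit(new Runnable() {
            public void run() {
                try {
                    long t0 = System.currentTimeMillis();
                    TopDocs hits = shared.search(slow, 100); // safe from many threads
                    System.out.println("slow: " + hits.totalHits + " hits in "
                            + (System.currentTimeMillis() - t0) + " ms");
                } catch (Exception e) { e.printStackTrace(); }
            }
        });
        requests.submit(new Runnable() {
            public void run() {
                try {
                    long t0 = System.currentTimeMillis();
                    TopDocs hits = shared.search(fast, 100);
                    System.out.println("fast: " + hits.totalHits + " hits in "
                            + (System.currentTimeMillis() - t0) + " ms");
                } catch (Exception e) { e.printStackTrace(); }
            }
        });
        requests.shutdown();
        requests.awaitTermination(60, TimeUnit.SECONDS);
    }
}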
shared IndexSearcher (lucene 3.0.3)
Hi all, in our project we're using Lucene in Tomcat. To avoid some overhead we have a shared IndexSearcher instance. In the past we hit "too many open files" errors many times. To prevent this, the IndexSearcher is closed and reopened after indexing. The shared instance is not closed anywhere else in the code. Is this the right way of preventing these kinds of errors? Thanks in advance for your answers, Ákos Tajti
Re: shared IndexSearcher (lucene 3.0.3)
Hey, the "too many open files" problem can be prevented by raising the limit of open files ;) there is a nice summary on the FAQ you might wanna look at: http://wiki.apache.org/lucene-java/LuceneFAQ#Why_am_I_getting_an_IOException_that_says_.22Too_many_open_files.22.3F if you have further questions just come back here! Simon On Fri, Feb 25, 2011 at 2:11 PM, Akos Tajti akos.ta...@gmail.com wrote: Hi all, in our project we're using Lucene in Tomcat. To avoid some overhead we have a shared IndexSearcher instance. In the past we hit "too many open files" errors many times. To prevent this, the IndexSearcher is closed and reopened after indexing. The shared instance is not closed anywhere else in the code. Is this the right way of preventing these kinds of errors? Thanks in advance for your answers, Ákos Tajti
Re: Newbie: Life span of IndexWriter / IndexSearcher?
Look at the JavaDoc: http://lucene.apache.org/java/3_0_2/api/core/org/apache/lucene/index/IndexReader.html#reopen() The *reopen* method returns a *new reader* if the index has changed since the original reader was opened. So, you should do something like this: IndexReader newReader = reader.reopen(true); if (newReader != reader) { reader.close(); reader = newReader; searcher = new IndexSearcher(reader); } instead of reader.reopen(true); Bye. *Raf* On Sun, Jan 16, 2011 at 11:06 AM, sol myr solmy...@yahoo.com wrote: Hi, Thank you kindly for replying. Unfortunately, reopen() doesn't help me see the changes. Here's my test: First I write and commit a document, and run a search - which correctly finds this document. Then I write and commit another document, re-open the reader and run another search - this should find 2 documents, but it only finds 1 document (the first one). BTW if instead of 'reader.reopen()' I instantiate a brand-new searcher (and reader), it correctly finds 2 documents...

// Shared objects:
Directory directory = FSDirectory.open(new File("c:/myDir"));
Analyzer analyzer = new StandardAnalyzer(Version.LUCENE_30);
IndexWriter writer = new IndexWriter(directory, analyzer, IndexWriter.MaxFieldLength.LIMITED);
Query query = new TermQuery(new Term("title", "hello"));
// Write document #1:
writer.addDocument(makeDoc("hello world 1")); // Field "title" = "hello world 1"
writer.commit();
// First search (yields document #1 as expected):
IndexReader reader = IndexReader.open(directory, true);
IndexSearcher searcher = new IndexSearcher(reader);
TopDocs results1 = searcher.search(query, 1);
printResults(searcher, results1);
// Write document #2:
writer.addDocument(makeDoc("hello world 2")); // Field "title" = "hello world 2"
writer.commit();
// Reopen reader, and search (should yield 2 documents, but I only see 1):
reader.reopen(true);
TopDocs results2 = searcher.search(query, 1);
printResults(searcher, results2);

--- On Thu, 1/13/11, Uwe Schindler u...@thetaphi.de wrote: From: Uwe Schindler u...@thetaphi.de Subject: RE: Newbie: Life span of IndexWriter / IndexSearcher? To: java-user@lucene.apache.org Date: Thursday, January 13, 2011, 7:40 AM You can leave the IndexWriter and IndexSearcher open all the time. The only important thing: changes made by IndexWriter's commit() method are only seen by IndexSearcher when the underlying IndexReader is reopened (e.g. by using IndexReader.reopen()) - please note that this only works with direct access to the IndexReaders, so I would recommend using the constructors of IndexSearcher that take IndexReaders (the Directory ones are only for easy beginner's use).
Re: Newbie: Life span of IndexWriter / IndexSearcher?
Worked like a charm - thanks a lot. --- On Sun, 1/16/11, Raf r.ventag...@gmail.com wrote: From: Raf r.ventag...@gmail.com Subject: Re: Newbie: Life span of IndexWriter / IndexSearcher? To: java-user@lucene.apache.org Date: Sunday, January 16, 2011, 3:16 AM Look at the JavaDoc: http://lucene.apache.org/java/3_0_2/api/core/org/apache/lucene/index/IndexReader.html#reopen() The *reopen* method returns a *new reader* if the index has changed since the original reader was opened. So, you should do something like this: IndexReader newReader = reader.reopen(true); if (newReader != reader) { reader.close(); reader = newReader; searcher = new IndexSearcher(reader); } instead of reader.reopen(true); Bye. *Raf* On Sun, Jan 16, 2011 at 11:06 AM, sol myr solmy...@yahoo.com wrote: Hi, Thank you kindly for replying. Unfortunately, reopen() doesn't help me see the changes. Here's my test: First I write and commit a document, and run a search - which correctly finds this document. Then I write and commit another document, re-open the reader and run another search - this should find 2 documents, but it only finds 1 document (the first one). BTW if instead of 'reader.reopen()' I instantiate a brand-new searcher (and reader), it correctly finds 2 documents...

// Shared objects:
Directory directory = FSDirectory.open(new File("c:/myDir"));
Analyzer analyzer = new StandardAnalyzer(Version.LUCENE_30);
IndexWriter writer = new IndexWriter(directory, analyzer, IndexWriter.MaxFieldLength.LIMITED);
Query query = new TermQuery(new Term("title", "hello"));
// Write document #1:
writer.addDocument(makeDoc("hello world 1")); // Field "title" = "hello world 1"
writer.commit();
// First search (yields document #1 as expected):
IndexReader reader = IndexReader.open(directory, true);
IndexSearcher searcher = new IndexSearcher(reader);
TopDocs results1 = searcher.search(query, 1);
printResults(searcher, results1);
// Write document #2:
writer.addDocument(makeDoc("hello world 2")); // Field "title" = "hello world 2"
writer.commit();
// Reopen reader, and search (should yield 2 documents, but I only see 1):
reader.reopen(true);
TopDocs results2 = searcher.search(query, 1);
printResults(searcher, results2);

--- On Thu, 1/13/11, Uwe Schindler u...@thetaphi.de wrote: From: Uwe Schindler u...@thetaphi.de Subject: RE: Newbie: Life span of IndexWriter / IndexSearcher? To: java-user@lucene.apache.org Date: Thursday, January 13, 2011, 7:40 AM You can leave the IndexWriter and IndexSearcher open all the time. The only important thing: changes made by IndexWriter's commit() method are only seen by IndexSearcher when the underlying IndexReader is reopened (e.g. by using IndexReader.reopen()) - please note that this only works with direct access to the IndexReaders, so I would recommend using the constructors of IndexSearcher that take IndexReaders (the Directory ones are only for easy beginner's use).
Re: Can not delete index file after close the IndexSearcher
Try adding try { searcher.close(); } catch (Exception e) { } before searcher = new IndexSearcher(dir); at the top of the loop. At the end of a loop searcher is open, and is not closed before being reassigned. There is probably a better solution along the lines of only opening a new searcher if you need to. -- Ian. 2011/1/13 张志田 zhitian.zh...@dianping.com: Hi Yuhan, dir.close() can not solve the problem. The reason I have to close the old searcher is that my program will replace the old index; the code posted here is just a scenario to simplify my question. Thanks, Garry On 13 January 2011 at 10:45 AM, Yuhan Zhang yzh...@onescreen.com wrote: Hi Garry, I am guessing the directory needs to be closed before opening a new one. dir.close(); dir = FSDirectory.open(new File(getIndexPath())); why not open two IndexSearcher objects in an array of two instead of swapping them back and forth? it would be a lot easier. yuhan 2011/1/12 张志田 zhitian.zh...@dianping.com Hi Mike, Sorry to make you confused. "lock" means the file handle is held by some other process; the program can not delete it. There is no exception; I can see that file.delete() returns false. If I delete the cfs file in the OS manually, the warning is "File is in use by another person or program". To simplify my question, I made some more code for testing. You can run it to reproduce: after two loops, you will see a message like "Can not delete file: D:\index\index2\_0.cfs". Thank you very much

public class SearchTest {
    private static final int MAX_RESULT = 1;
    private String indexPath1 = "D:\\index\\index1";
    private String indexPath2 = "D:\\index\\index2";
    private String backupIndexpath = "D:\\index\\index3";
    private String indexPath = indexPath1;
    private Analyzer analyzer = new StandardAnalyzer(Version.LUCENE_30);
    private IndexSearcher searcher;

    public void search() {
        while (true) {
            try {
                String keyword = "test";
                String fieldName = "searchfield";
                Directory dir = FSDirectory.open(new File(indexPath));
                searcher = new IndexSearcher(dir);
                QueryParser queryParse = new QueryParser(Version.LUCENE_30, fieldName, analyzer);
                Query query = queryParse.parse(keyword);
                TopDocs hits = searcher.search(query, MAX_RESULT);
                int size = 5;
                if (hits.scoreDocs.length < size) { size = hits.scoreDocs.length; }
                for (int i = 0; i < size; i++) {
                    Document doc = searcher.doc(hits.scoreDocs[i].doc);
                    String text = doc.get(fieldName);
                    System.out.println("fieldContent is: " + text);
                }
                IndexSearcher oldSearcher = searcher;
                File newFile = new File(getIndexPath());
                for (File file : newFile.listFiles()) {
                    if (!file.delete()) {
                        System.out.println("Can not delete file: " + file.getAbsolutePath());
                    }
                }
                // Copy index files from another folder to this folder
                copyDir(new File(backupIndexpath), newFile);
                Directory newDir = FSDirectory.open(newFile);
                IndexSearcher newSearcher = new IndexSearcher(newDir);
                searcher = newSearcher;
                oldSearcher.close();
                System.out.println("Closed Searcher: " + oldSearcher.getIndexReader().directory().toString());
                System.out.println("input 'Q' to quit testing...");
                BufferedReader br = new BufferedReader(new InputStreamReader(System.in));
                if (br.readLine().trim().equals("Q")) { break; }
            } catch (CorruptIndexException e) { e.printStackTrace(); }
            catch (IOException e) { e.printStackTrace(); }
            catch (ParseException e) { e.printStackTrace(); }
        }
    }

    private String getIndexPath() {
        if (indexPath.equals(indexPath1)) { indexPath = indexPath2; }
        else { indexPath = indexPath1; }
        return indexPath;
    }

    public static void copyDir(File sourceLocation, File targetLocation) throws IOException {
        String[] children =
sourceLocation.list(); for (int i = 0; i < children.length; i++) { InputStream in = null; OutputStream out = null
Re: Can not delete index file after close the IndexSearcher
Ian, thanks for your response. Your suggestion worked for me. What does oldSearcher.close() do in my code? Why do I have to close the searcher and oldSearcher together? In my opinion, oldSearcher held index1 while searcher held index2; they are using different resources, and the resources held by them should be released separately. I have another concern with your solution: searcher is a reference created here for user searching outside of this code snippet. If I close and reopen it here, there may be some service downtime because there is no open searcher to use. In my original code, searcher stayed open all the time, so there is little or no service downtime; this is the reason I did not close it every time. Do you have any suggestion to keep an alive searcher while the program can also switch the index smoothly? Thanks, Garry On 13 January 2011 at 5:47 PM, Ian Lea ian@gmail.com wrote: Try adding try { searcher.close(); } catch (Exception e) { } before searcher = new IndexSearcher(dir); at the top of the loop. At the end of a loop searcher is open, and is not closed before being reassigned. There is probably a better solution along the lines of only opening a new searcher if you need to. -- Ian. 2011/1/13 张志田 zhitian.zh...@dianping.com: Hi Yuhan, dir.close() can not solve the problem. The reason I have to close the old searcher is that my program will replace the old index; the code posted here is just a scenario to simplify my question. Thanks, Garry On 13 January 2011 at 10:45 AM, Yuhan Zhang yzh...@onescreen.com wrote: Hi Garry, I am guessing the directory needs to be closed before opening a new one. dir.close(); dir = FSDirectory.open(new File(getIndexPath())); why not open two IndexSearcher objects in an array of two instead of swapping them back and forth? it would be a lot easier. yuhan 2011/1/12 张志田 zhitian.zh...@dianping.com Hi Mike, Sorry to make you confused. "lock" means the file handle is held by some other process; the program can not delete it. There is no exception; I can see that file.delete() returns false. If I delete the cfs file in the OS manually, the warning is "File is in use by another person or program". To simplify my question, I made some more code for testing. You can run it to reproduce: after two loops, you will see a message like
Can not delete file: D:\index\index2\_0.cfs Thank you very much public class SearchTest { private static final int MAX_RESULT = 1; private String indexPath1 = D:\\index\\index1; private String indexPath2 = D:\\index\\index2; private String backupIndexpath = D:\\index\\index3; private String indexPath = indexPath1; private Analyzer analyzer = new StandardAnalyzer(Version.LUCENE_30); private IndexSearcher searcher; public void search() { while (true) { try { String keyword = test; String fieldName = searchfield; Directory dir = FSDirectory.open(new File(indexPath)); searcher = new IndexSearcher(dir); QueryParser queryParse = new QueryParser(Version.LUCENE_30, fieldName, analyzer); Query query = queryParse.parse(keyword); TopDocs hits = searcher.search(query, MAX_RESULT); int size = 5; if (hits.scoreDocs.length size) { size = hits.scoreDocs.length; } for (int i = 0; i size; i++) { Document doc = searcher.doc(hits.scoreDocs[i].doc); String text = doc.get(fieldName); System.out.println(fieldContent is: + text); } IndexSearcher oldSearcher = searcher; File newFile = new File(getIndexPath()); for (File file : newFile.listFiles()) { if (!file.delete()) { System.out.println(Can not delete file: + file.getAbsolutePath()); } } // Copy index File from another folder to this folder copyDir(new File(backupIndexpath), newFile); Directory newDir = FSDirectory.open(newFile); IndexSearcher newSearcher = new IndexSearcher(newDir); searcher = newSearcher; oldSearcher.close(); System.out.println(Closed Searcher: + oldSearcher.getIndexReader().directory().toString()); System.out.println(input 'Q' to quit testing...); BufferedReader br = new BufferedReader(new InputStreamReader(System.in
Re: Can not delete index file after close the IndexSearcher
As I said, there is probably a better solution. At the moment you are opening searchers at the top and bottom of the loop and on second and subsequent passes you are not closing the bottom one, that you've only just opened, before opening a new one using the same instance variable. The resources of the bottom one would presumably be released eventually by GC, but evidently not soon enough, Replace the top searcher = new IndexSearcher(dir); line with if (needToOpenNewSearcher()) { ... } where the logic in needToOpenNewSearcher() is for you to write. -- Ian. 2011/1/13 张志田 zhitian.zh...@dianping.com: Ian, thanks for your response. Your suggestion worked for me. What does oldSearcher.close() do in my code? why I have to close the searcher and oldSearcher together? In my opinion, oldSearcher held index1 while searcher held index2, they are using different resources, the resources held by them should be released seperately. I have another concern for your solution, searcher is a reference created here for user searching out of this code snippet, if I closed and reopen it here, there may be some service down time because there is no open searcher for using. In my original code, searcher opened all the time, so there is no service down time or little, this is the reason I did not close it every time. Do you have any suggestion to keep an alive searcher and the program can also switch the index smoothly? Thanks, Garry 在 2011年1月13日 下午5:47,Ian Lea ian@gmail.com写道: Try adding try { searcher.close(); } catch (Exception e) { } before searcher = new IndexSearcher(dir); at the top of the loop. At the end of a loop searcher is open, and is not closed before being reassigned. There is probably a better solution along the lines of only opening new searcher if need to. -- Ian. 2011/1/13 张志田 zhitian.zh...@dianping.com: Hi Yuhan, dir.close() can not solve the problem. The reason I have to close the old searcher is my program will replace the old index, the code posted here is just a scenario to simplify my question. Thanks, Garry 在 2011年1月13日 上午10:45,Yuhan Zhang yzh...@onescreen.com写道: Hi Garry, I am guessing the directory needs to be closed before opening a new one. dir.close(); dir = FSDirectory.open(new File(getIndexPath())); why not to open two IndexSearcher objects in an array of two instead of swapping them back and forth? it would be a lot easier. yuhan 2011/1/12 张志田 zhitian.zh...@dianping.com Hi Mike, Sorry to make you confused. lock means the file handle is held by some other progress, the program can not delete it. There is no exception, I can see file.delete() method returns false. If I delete the cfs file in the OS manually, the warning is File was using by another person or program To simplify my question, I made some more code for testing. you can run it for reproducing, after two loops, you will see the message e.g. 
Can not delete file: D:\index\index2\_0.cfs Thank you very much public class SearchTest { private static final int MAX_RESULT = 1; private String indexPath1 = D:\\index\\index1; private String indexPath2 = D:\\index\\index2; private String backupIndexpath = D:\\index\\index3; private String indexPath = indexPath1; private Analyzer analyzer = new StandardAnalyzer(Version.LUCENE_30); private IndexSearcher searcher; public void search() { while (true) { try { String keyword = test; String fieldName = searchfield; Directory dir = FSDirectory.open(new File(indexPath)); searcher = new IndexSearcher(dir); QueryParser queryParse = new QueryParser(Version.LUCENE_30, fieldName, analyzer); Query query = queryParse.parse(keyword); TopDocs hits = searcher.search(query, MAX_RESULT); int size = 5; if (hits.scoreDocs.length size) { size = hits.scoreDocs.length; } for (int i = 0; i size; i++) { Document doc = searcher.doc(hits.scoreDocs[i].doc); String text = doc.get(fieldName); System.out.println(fieldContent is: + text); } IndexSearcher oldSearcher = searcher; File newFile = new File(getIndexPath()); for (File file : newFile.listFiles()) { if (!file.delete()) { System.out.println(Can not delete file: + file.getAbsolutePath
Re: Can not delete index file after close the IndexSearcher
Ian, thank you very much. I will try to change my switch solution. Thanks again Garry 在 2011年1月13日 下午6:41,Ian Lea ian@gmail.com写道: As I said, there is probably a better solution. At the moment you are opening searchers at the top and bottom of the loop and on second and subsequent passes you are not closing the bottom one, that you've only just opened, before opening a new one using the same instance variable. The resources of the bottom one would presumably be released eventually by GC, but evidently not soon enough, Replace the top searcher = new IndexSearcher(dir); line with if (needToOpenNewSearcher()) { ... } where the logic in needToOpenNewSearcher() is for you to write. -- Ian. 2011/1/13 张志田 zhitian.zh...@dianping.com: Ian, thanks for your response. Your suggestion worked for me. What does oldSearcher.close() do in my code? why I have to close the searcher and oldSearcher together? In my opinion, oldSearcher held index1 while searcher held index2, they are using different resources, the resources held by them should be released seperately. I have another concern for your solution, searcher is a reference created here for user searching out of this code snippet, if I closed and reopen it here, there may be some service down time because there is no open searcher for using. In my original code, searcher opened all the time, so there is no service down time or little, this is the reason I did not close it every time. Do you have any suggestion to keep an alive searcher and the program can also switch the index smoothly? Thanks, Garry 在 2011年1月13日 下午5:47,Ian Lea ian@gmail.com写道: Try adding try { searcher.close(); } catch (Exception e) { } before searcher = new IndexSearcher(dir); at the top of the loop. At the end of a loop searcher is open, and is not closed before being reassigned. There is probably a better solution along the lines of only opening new searcher if need to. -- Ian. 2011/1/13 张志田 zhitian.zh...@dianping.com: Hi Yuhan, dir.close() can not solve the problem. The reason I have to close the old searcher is my program will replace the old index, the code posted here is just a scenario to simplify my question. Thanks, Garry 在 2011年1月13日 上午10:45,Yuhan Zhang yzh...@onescreen.com写道: Hi Garry, I am guessing the directory needs to be closed before opening a new one. dir.close(); dir = FSDirectory.open(new File(getIndexPath())); why not to open two IndexSearcher objects in an array of two instead of swapping them back and forth? it would be a lot easier. yuhan 2011/1/12 张志田 zhitian.zh...@dianping.com Hi Mike, Sorry to make you confused. lock means the file handle is held by some other progress, the program can not delete it. There is no exception, I can see file.delete() method returns false. If I delete the cfs file in the OS manually, the warning is File was using by another person or program To simplify my question, I made some more code for testing. you can run it for reproducing, after two loops, you will see the message e.g. 
Re: Can not delete index file after closing the IndexSearcher
In fact it's probably as simple as if (searcher == null) { searcher = new IndexSearcher(dir); } at the top of the loop. -- Ian.

2011/1/13 Ian Lea <ian@gmail.com>: As I said, there is probably a better solution. At the moment you are opening searchers at the top and bottom of the loop, and on second and subsequent passes you are not closing the bottom one, which you have only just opened, before opening a new one using the same instance variable. Replace the top searcher = new IndexSearcher(dir); line with if (needToOpenNewSearcher()) { ... } where the logic in needToOpenNewSearcher() is for you to write. -- Ian.
Newbie: Life span of IndexWriter / IndexSearcher?
Hi, We're writing a web application, which naturally needs:
- an IndexSearcher when users use our search screen
- an IndexWriter in a background process that periodically updates and optimizes our index.
Note our writer is exclusive - no other applications/threads ever write to our index files. What's the common practice in terms of resource creation and sharing? Specifically:
1) Should I have a single IndexSearcher to serve all (concurrent) users? I saw such a recommendation in a tutorial, but discovered that an open IndexSearcher prevents 'optimize' from merging my files... so should I close it just before optimization? Or should I open an individual (short-lived) IndexSearcher for each search request?
2) Our tests also imply that IndexWriter.optimize() takes effect only after you close() that writer - which is a shame, because I hoped to keep using the same writer (I hear it's expensive to instantiate). Am I doing something wrong?
Thanks
RE: Newbie: Life span of IndexWriter / IndexSearcher?
1) Should I have a single IndexSearcher to serve all (concurrent) users? I saw such a recommendation in a tutorial, but discovered that an open IndexSearcher prevents 'optimize' from merging my files... so should I close it just before optimization? Or should I open an individual (short-lived) IndexSearcher for each search request?

You can leave the IndexWriter and IndexSearcher open all the time. The only important thing: changes made by IndexWriter's commit() method are only seen by the IndexSearcher when the underlying IndexReader is reopened (e.g. by using IndexReader.reopen()) - please note that this only works with direct access to the IndexReaders, so I would recommend using the constructors of IndexSearcher that take IndexReaders (the Directory ones are only for easy beginner's use). See Lucene in Action, second edition, for a good example of a searcher manager.

2) Our tests also imply that IndexWriter.optimize() takes effect only after you close() that writer - which is a shame, because I hoped to keep using the same writer (I hear it's expensive to instantiate). Am I doing something wrong?

This is wrong, see above. As the IndexReader/Searcher keeps the segments in use from the time it was opened, they can't go away until that snapshot view of the IndexReader is closed. In general, it's not recommended to optimize indexes since 2.9, unless you are doing things like deleting all documents. Uwe
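For reference, the reopen pattern Uwe describes might look like this - a minimal sketch against the Lucene 3.x API, with illustrative class and method names, not a drop-in implementation:

import java.io.File;
import java.io.IOException;

import org.apache.lucene.index.IndexReader;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.store.Directory;
import org.apache.lucene.store.FSDirectory;

public class ReopeningSearcher {
    private IndexReader reader;
    private IndexSearcher searcher;

    public ReopeningSearcher(File indexDir) throws IOException {
        Directory dir = FSDirectory.open(indexDir);
        reader = IndexReader.open(dir);        // read-only by default in 3.x
        searcher = new IndexSearcher(reader);  // the IndexReader constructor, as recommended
    }

    // Call periodically (e.g. from a background thread) after the writer commits.
    public synchronized void maybeReopen() throws IOException {
        IndexReader newReader = reader.reopen(); // cheap no-op if nothing changed
        if (newReader != reader) {
            reader.close(); // unsafe if searches are still in flight; see the
                            // reference-counting discussion later in this digest
            reader = newReader;
            searcher = new IndexSearcher(newReader);
        }
    }

    public synchronized IndexSearcher getSearcher() {
        return searcher;
    }
}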
Re: Closing indexsearcher, making sure it is in use
Use something with reference counting - Lucene in Action, second edition, has a searcher manager class which I think might be available standalone. Or a couple of low-tech alternatives: instead of closing the old searcher, move it out of the way, keep a reference to it, and close it after n seconds or searches or whatever. Or catch the closed exception and rerun the query with the up-to-date searcher. -- Ian.

On Thu, Jan 13, 2011 at 8:21 PM, Paul Taylor <paul_t...@fastmail.fm> wrote: As recommended, I use just one IndexSearcher in my multithreaded GUI app, using a singleton pattern. If data is modified in the index I then close the reader and searcher, and they will be recreated on the next call to getInstance(). But I've hit a problem whereby one thread was closing a searcher while another thread already had the searcher open, and when it came to use it, it got the exception 'the IndexReader is closed'. I obviously don't want access to the searcher to be synchronized, as it is designed to work multithreaded, so how should I close it safely, i.e. close it only if there are no current references to it? Paul
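The second low-tech alternative Ian mentions - park the old searcher and close it after a grace period - could be sketched like this (the Timer-based approach and the 30-second delay are illustrative, not from Lucene):

import java.io.IOException;
import java.util.Timer;
import java.util.TimerTask;

import org.apache.lucene.search.IndexSearcher;

public class DeferredCloser {
    private final Timer timer = new Timer(true); // daemon thread, won't block JVM exit

    // Instead of closing the retired searcher immediately, give any thread
    // that already holds a reference a grace period to finish its query.
    public void retire(final IndexSearcher oldSearcher) {
        timer.schedule(new TimerTask() {
            @Override
            public void run() {
                try {
                    oldSearcher.close();
                } catch (IOException e) {
                    // log and move on; nothing useful to do here
                }
            }
        }, 30 * 1000L);
    }
}

This is only a heuristic - a query that runs longer than the grace period will still see 'the IndexReader is closed' - which is why the reference-counting approach is the robust one.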
Re: Closing indexsearcher, making sure it is in use
You can use ReadWriteLock <http://download.oracle.com/javase/1.5.0/docs/api/java/util/concurrent/locks/ReentrantReadWriteLock.html> as a low-level technique to manage access. A ReadWriteLock maintains a pair of associated locks, one for read-only operations and one for writing. The read lock may be held simultaneously by multiple reader threads, so long as there are no writers. The write lock is exclusive. Wrap Lucene's searcher in your own SearchManager class, which exposes its own API for search and forwards the requests to the underlying searcher. The search and reopen then sync up via the ReadWriteLock: search takes the read lock and reopen takes the write lock. PS: Use indexreader.reopen() instead of closing it and opening again. It is much faster. (Documented.) Thanks & Regards, Umesh Prasad

On Fri, Jan 14, 2011 at 2:25 AM, Ian Lea <ian@gmail.com> wrote: Use something with reference counting - Lucene in Action, second edition, has a searcher manager class which I think might be available standalone.
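A sketch of what that wrapper might look like (Lucene 3.x API; the class name and structure are illustrative):

import java.io.IOException;
import java.util.concurrent.locks.ReentrantReadWriteLock;

import org.apache.lucene.index.IndexReader;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.search.Query;
import org.apache.lucene.search.TopDocs;

public class LockedSearchManager {
    private final ReentrantReadWriteLock lock = new ReentrantReadWriteLock();
    private IndexReader reader;
    private IndexSearcher searcher;

    public LockedSearchManager(IndexReader reader) {
        this.reader = reader;
        this.searcher = new IndexSearcher(reader);
    }

    // Many threads may search concurrently under the shared read lock.
    public TopDocs search(Query query, int n) throws IOException {
        lock.readLock().lock();
        try {
            return searcher.search(query, n);
        } finally {
            lock.readLock().unlock();
        }
    }

    // Reopen takes the exclusive write lock, so no search can see a closed reader.
    public void reopen() throws IOException {
        lock.writeLock().lock();
        try {
            IndexReader newReader = reader.reopen(); // much faster than a fresh open()
            if (newReader != reader) {
                reader.close(); // safe: no read locks are held right now
                reader = newReader;
                searcher = new IndexSearcher(newReader);
            }
        } finally {
            lock.writeLock().unlock();
        }
    }
}

Holding the read lock for the duration of each search is exactly what makes the write lock safe: reopen() can only run when no query is mid-flight.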
Can not delete index file after closing the IndexSearcher
Dear Luceners, I'm using lucene-3.0.2 in our app. There is some testing code for switching indexes; however, when my code runs a couple of times, I find the index files are locked and I can not delete the old index files. The code looks like:

public class SearchTest {

    private static final int MAX_RESULT = 1;

    private String indexPath1 = "D:\\index\\index1";
    private String indexPath2 = "D:\\index\\index2";
    private String indexPath = indexPath1;

    private Analyzer analyzer = new StandardAnalyzer(Version.LUCENE_30);
    private Directory dir = null;
    private IndexSearcher searcher;

    public void search() {
        while (true) {
            try {
                String keyword = "test";
                String fieldName = "searchfield";
                if (dir == null) {
                    dir = FSDirectory.open(new File(indexPath));
                }
                searcher = new IndexSearcher(dir);
                QueryParser queryParse = new QueryParser(Version.LUCENE_30, fieldName, analyzer);
                Query query = queryParse.parse(keyword);
                TopDocs hits = searcher.search(query, MAX_RESULT);
                int size = 5;
                if (hits.scoreDocs.length < size) {
                    size = hits.scoreDocs.length;
                }
                for (int i = 0; i < size; i++) {
                    Document doc = searcher.doc(hits.scoreDocs[i].doc);
                    String text = doc.get(fieldName);
                    System.out.println("fieldContent is: " + text);
                }
                IndexSearcher oldSearcher = searcher;
                dir = FSDirectory.open(new File(getIndexPath()));
                IndexSearcher newSearcher = new IndexSearcher(dir);
                searcher = newSearcher;
                oldSearcher.close();
                System.out.println("Closed Searcher: " + oldSearcher.getIndexReader().directory().toString());
                System.out.println("input 'Q' to quit testing...");
                BufferedReader br = new BufferedReader(new InputStreamReader(System.in));
                if (br.readLine().trim().equals("Q")) {
                    break;
                }
            } catch (CorruptIndexException e) {
                e.printStackTrace();
            } catch (IOException e) {
                e.printStackTrace();
            } catch (ParseException e) {
                e.printStackTrace();
            }
        }
    }

    private String getIndexPath() {
        if (indexPath.equals(indexPath1)) {
            indexPath = indexPath2;
        } else {
            indexPath = indexPath1;
        }
        return indexPath;
    }

    public static void main(String[] args) {
        SearchTest searchTest = new SearchTest();
        searchTest.search();
    }
}

Can anybody take a look at the above code snippet? I want to search on a different index each time, so I created two different folders and switch between them from time to time. The index files in index1/index2 may be replaced before the search request comes. The problem I found is that after I ran the above code for 2 or more loops, I can not modify/delete the cfs/cfx files in the file system (Windows 2003), although I closed the searcher every time in the code. It seems that the index files are not released. Is the problem caused by the shared reference to the searcher? Or some shared thread in Lucene? Thanks in advance! Garry
Re: Can not delete index file after closing the IndexSearcher
When you break out of the loop (user enters 'Q') you don't close the current searcher. Could that be it? Also you are calling FSDir.open each time but should only do it once (though this should be harmless). Mike

On Wed, Jan 12, 2011 at 5:39 AM, 张志田 <zhitian.zh...@dianping.com> wrote: Dear Luceners, I'm using lucene-3.0.2 in our app. There is some testing code for switching indexes; however, when my code runs a couple of times, I find the index files are locked and I can not delete the old index files.
Re: Can not delete index file after closing the IndexSearcher
Mike, thanks for your feedback. I verified this in debug mode, checking the folder I closed in the last loop. Actually, both folders are locked. I tried a new FSDirectory every loop - no help. Garry

2011/1/12 Michael McCandless <luc...@mikemccandless.com>: When you break out of the loop (user enters 'Q') you don't close the current searcher. Could that be it? Also you are calling FSDir.open each time but should only do it once (though this should be harmless). Mike
Re: Can not delete index file after closing the IndexSearcher
Hmmm. When you say 'locked', what does that actually mean? Can you post the exception? Also, can you whittle down your example even more? E.g. if calling this method twice causes the problem, make a method that calls it twice and hits the exception, and then start simplifying from there... Mike

2011/1/12 张志田 <zhitian.zh...@dianping.com>: Mike, thanks for your feedback. I verified this in debug mode, checking the folder I closed in the last loop. Actually, both folders are locked. I tried a new FSDirectory every loop - no help. Garry
Re: Can not delete index file after closing the IndexSearcher
Hi Mike, Sorry to make you confused. 'Locked' means the file handle is held by some other process and the program can not delete the file. There is no exception; I can see the file.delete() method returns false. If I delete the cfs file in the OS manually, the warning is that the file is in use by another person or program. To simplify my question, I made some more code for testing. You can run it to reproduce; after two loops, you will see a message like:

Can not delete file: D:\index\index2\_0.cfs

Thank you very much

public class SearchTest {

    private static final int MAX_RESULT = 1;

    private String indexPath1 = "D:\\index\\index1";
    private String indexPath2 = "D:\\index\\index2";
    private String backupIndexpath = "D:\\index\\index3";
    private String indexPath = indexPath1;

    private Analyzer analyzer = new StandardAnalyzer(Version.LUCENE_30);
    private IndexSearcher searcher;

    public void search() {
        while (true) {
            try {
                String keyword = "test";
                String fieldName = "searchfield";
                Directory dir = FSDirectory.open(new File(indexPath));
                searcher = new IndexSearcher(dir);
                QueryParser queryParse = new QueryParser(Version.LUCENE_30, fieldName, analyzer);
                Query query = queryParse.parse(keyword);
                TopDocs hits = searcher.search(query, MAX_RESULT);
                int size = 5;
                if (hits.scoreDocs.length < size) {
                    size = hits.scoreDocs.length;
                }
                for (int i = 0; i < size; i++) {
                    Document doc = searcher.doc(hits.scoreDocs[i].doc);
                    String text = doc.get(fieldName);
                    System.out.println("fieldContent is: " + text);
                }
                IndexSearcher oldSearcher = searcher;
                File newFile = new File(getIndexPath());
                for (File file : newFile.listFiles()) {
                    if (!file.delete()) {
                        System.out.println("Can not delete file: " + file.getAbsolutePath());
                    }
                }
                // Copy index files from another folder to this folder
                copyDir(new File(backupIndexpath), newFile);
                Directory newDir = FSDirectory.open(newFile);
                IndexSearcher newSearcher = new IndexSearcher(newDir);
                searcher = newSearcher;
                oldSearcher.close();
                System.out.println("Closed Searcher: " + oldSearcher.getIndexReader().directory().toString());
                System.out.println("input 'Q' to quit testing...");
                BufferedReader br = new BufferedReader(new InputStreamReader(System.in));
                if (br.readLine().trim().equals("Q")) {
                    break;
                }
            } catch (CorruptIndexException e) {
                e.printStackTrace();
            } catch (IOException e) {
                e.printStackTrace();
            } catch (ParseException e) {
                e.printStackTrace();
            }
        }
    }

    private String getIndexPath() {
        if (indexPath.equals(indexPath1)) {
            indexPath = indexPath2;
        } else {
            indexPath = indexPath1;
        }
        return indexPath;
    }

    public static void copyDir(File sourceLocation, File targetLocation) throws IOException {
        String[] children = sourceLocation.list();
        for (int i = 0; i < children.length; i++) {
            InputStream in = null;
            OutputStream out = null;
            try {
                in = new FileInputStream(new File(sourceLocation, children[i]));
                out = new FileOutputStream(new File(targetLocation, children[i]));
                byte[] buf = new byte[1024];
                int len;
                while ((len = in.read(buf)) > 0) {
                    out.write(buf, 0, len);
                }
            } catch (FileNotFoundException e) {
                e.printStackTrace();
            } catch (IOException ioe) {
                ioe.printStackTrace();
            } finally {
                try {
                    if (in != null) {
                        in.close();
                    }
                    if (out != null) {
                        out.close();
                    }
                } catch (IOException e) {
                    e.printStackTrace();
                }
            }
        }
    }

    public static void main(String[] args) {
        SearchTest searchTest = new SearchTest();
        searchTest.search();
    }
}
Re: Can not delete index file after closing the IndexSearcher
Hi Garry, I am guessing the directory needs to be closed before opening a new one: dir.close(); dir = FSDirectory.open(new File(getIndexPath())); Why not open two IndexSearcher objects in an array of two, instead of swapping them back and forth? It would be a lot easier. yuhan

2011/1/12 张志田 <zhitian.zh...@dianping.com>: Hi Mike, Sorry to make you confused. 'Locked' means the file handle is held by some other process and the program can not delete the file. There is no exception; I can see the file.delete() method returns false.
Re: Can not delete index file after closing the IndexSearcher
Hi Yuhan, dir.close() can not solve the problem. The reason I have to close the old searcher is that my program will replace the old index; the code posted here is just a scenario to simplify my question. Thanks, Garry

On 13 January 2011 at 10:45, Yuhan Zhang <yzh...@onescreen.com> wrote: Hi Garry, I am guessing the directory needs to be closed before opening a new one: dir.close(); dir = FSDirectory.open(new File(getIndexPath())); Why not open two IndexSearcher objects in an array of two, instead of swapping them back and forth? It would be a lot easier. yuhan
Weird document equals and hash through IndexReader and IndexSearcher
Hi, I have a weird result: if I access the same document through the IndexReader or the IndexSearcher, the two Document objects are not equal and have different hash values:

Document doc1 = indexSearcher.doc(i);
Document doc2 = indexSearcher.getIndexReader().document(i);
System.out.println("Equal: " + doc1.equals(doc2) + ", Hash: " + doc1.hashCode() + ", " + doc2.hashCode() + ", num: " + i);

I'm using Lucene 3.0.2. (No multithreading; nobody is simultaneously updating the index.) What am I missing? Thanks, Carmit (Could you please forward your answers to my private address as well?)
RE: Weird document equals and hash through IndexReader and IndexSearcher
Hi Carmit, equals and hashCode are not implemented for oal.document.Document, so two instances never compare equal to each other. The same happens if you retrieve the document two times from the same IndexReader. - Uwe Schindler, H.-H.-Meier-Allee 63, D-28213 Bremen, http://www.thetaphi.de, eMail: u...@thetaphi.de
Re: Weird document equals and hash through IndexReader and IndexSearcher
Thanks, Uwe! Indeed you're right! Whenever IndexReader.document() is called, a new Document instance is created. And since the Document class does not override equals and hashCode, I can't know whether the same doc was retrieved. And since Document is final, I can only write a wrapper for it. Is this an oversight, or intentional? In any case, it's not too convenient... Carmit
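Since Document offers no usable equals/hashCode, the wrapper Carmit mentions could simply compare documents by an application-level key - a minimal sketch, assuming every indexed document stores a unique "id" field (the field name is illustrative):

import org.apache.lucene.document.Document;

public final class DocKey {
    private final String id;

    public DocKey(Document doc) {
        this.id = doc.get("id"); // assumes each document stores a unique "id" field
    }

    @Override
    public boolean equals(Object o) {
        return o instanceof DocKey && id != null && id.equals(((DocKey) o).id);
    }

    @Override
    public int hashCode() {
        return id == null ? 0 : id.hashCode();
    }
}

Within a single IndexReader snapshot, comparing the int doc numbers (the i passed to doc(i)) is an even cheaper way to tell whether two retrievals refer to the same document.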
Re: Does an IndexSearcher call incRef on the underlying reader?
On Wed, Oct 27, 2010 at 1:01 PM, Pulkit Singhal <pulkitsing...@gmail.com> wrote: 1st of all, great book. Thank you! @Question3: It sounds like an IndexReader always starts with a count of zero, but that should not be a cause for worry because the value only gets acted upon in a call to decRef() ... am I right?

Actually, the refCount of a new IndexReader starts at 1. Then the caller must call close (which under the hood calls decRef) to drop it to 0.

@Question4: It seems to me that, based on your explanation so far, the IndexReader will end up closing after the very first search. That doesn't sound too efficient, given that keeping it alive and kicking is something that is highly desirable ... no? Am I missing something, or does that responsibility fall elsewhere?

Actually, no -- the SearcherManager also holds a ref. So when there are no queries in flight, the refCount will be 1. It's only when the searcher is swapped out for a new one that we decRef the old one, and its refCount drops to 0 (once all in-flight queries finish).

I hope I haven't hijacked my own thread? I don't think so! Mike
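The lifecycle Mike describes can be watched directly with getRefCount(); here is a small self-contained sketch against the Lucene 3.x API (the RAMDirectory setup is only there to make it runnable):

import org.apache.lucene.analysis.standard.StandardAnalyzer;
import org.apache.lucene.index.IndexReader;
import org.apache.lucene.index.IndexWriter;
import org.apache.lucene.store.RAMDirectory;
import org.apache.lucene.util.Version;

public class RefCountDemo {
    public static void main(String[] args) throws Exception {
        RAMDirectory dir = new RAMDirectory();
        // create an empty index so the reader can open
        new IndexWriter(dir, new StandardAnalyzer(Version.LUCENE_30), true,
                IndexWriter.MaxFieldLength.UNLIMITED).close();

        IndexReader reader = IndexReader.open(dir); // refCount == 1
        reader.incRef();  // e.g. a searcher manager takes a reference -> 2
        reader.incRef();  // an in-flight query pins the reader        -> 3
        System.out.println(reader.getRefCount());   // prints 3
        reader.decRef();  // the query finishes                        -> 2
        reader.decRef();  // the manager swaps the searcher out        -> 1
        reader.close();   // drops the open() reference -> 0, resources released
    }
}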
Re: Blocking on IndexSearcher search
Uwe Schindler wrote: It uses address space for mapping the files into virtual memory (like a swap file) - this is why it only works well for 64-bit VMs. The used physical memory depends on your OS cache configuration; Java heap is not used for that (in contrast to copying a file to a RAMDirectory). Uwe

Went to try NIO and then realised it is no better on Windows; in fact the Sun bug seems to be saying that multiple file channels work better than sharing one, so perhaps that is what was happening when I had multiple IndexReaders, and why performance was actually better on Windows in that circumstance. As customers could be using 32-bit or 64-bit, I fear that MMap is not a very robust solution. Oh well, have to live with it for now I suppose. Paul
Blocking on IndexSearcher search
Hi, My multithreaded code was always creating a new IndexSearcher for every search, but I changed over to the recommendation of creating just one IndexSearcher and keeping it between searches. Now I find that if I have multiple threads trying to search, they block on the search() method - only one can search at any time. Is this expected behaviour? Paul
RE: Blocking on IndexSearcher search
Can you show us where exactly it blocks (e.g. use Ctrl-Break on Windows to print a thread dump)? IndexSearcher's methods are not synchronized and concurrent access is easily possible; all concurrent access is managed by the underlying IndexReader. Maybe you synchronize somewhere in your own code? - Uwe Schindler, H.-H.-Meier-Allee 63, D-28213 Bremen, http://www.thetaphi.de, eMail: u...@thetaphi.de
Re: Blocking on IndexSearcher search
Uwe Schindler wrote: Can you show us where exactly it blocks (e.g. use Ctrl-Break on Windows to print a thread dump)? Maybe you synchronize somewhere in your own code?

I'm picking this up using the YourKit profiler. In the thread view it says: blocked on org.apache.lucene.search.Searcher.search(Query, Filter, int). On the monitor profiling page it says: Blocked thread: was blocked on monitor of class org.apache.lucene.store.SimpleFSDirectory$SimpleFSIndexInput$Descriptor. Is the file system the problem? I'm creating the index using:

Directory directory = FSDirectory.open(new File(INDEX_NAME));
IndexWriter writer = new IndexWriter(directory, analyzer, true, IndexWriter.MaxFieldLength.UNLIMITED);

and my IndexSearcher is created as:

IndexSearcher is = new IndexSearcher(directory, true);

Paul
Re: Blocking on IndexSearcher search
Uwe Schindler wrote: That lock contention is fine there, as this is the central point where all IO is done. This does not mean that only one query is running in parallel; the queries are still running in parallel, but there is one place where all IO is waiting on one file descriptor. This is no different with multiple IndexSearchers; YourKit simply shows this place as it has the most contention. You are using Windows? On Linux it should use NIO automatically (FSDir.open() uses platform-specific defaults). You can also improve speed and play with e.g. MMapDirectory on 64-bit platforms, or try out how NIO works on your platform.

I'm using Windows and I'll try NIO, good idea. My app is already memory hungry in other areas, so I guess MMap is a no-go - does it use heap or perm memory? I understand the lock-on-IO point, but what was concerning me is that in the thread view the threads were blocking for some time, not just a couple of milliseconds. I actually refactored my code to make it multithreaded specifically for this bit of code because a lot of searches were necessary, and the elapsed time is faster than using a single thread, but not as fast as I'd hoped. Paul
RE: Blocking on IndexSearcher search
I'm using Windows and I'll try NIO, good idea. My app is already memory hungry in other areas, so I guess MMap is a no-go - does it use heap or perm memory?

It uses address space for mapping the files into virtual memory (like a swap file) - this is why it only works well for 64-bit VMs. The used physical memory depends on your OS cache configuration; Java heap is not used for that (in contrast to copying a file to a RAMDirectory). Uwe
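Pulling the thread's advice together, an explicit Directory choice might look like this - a sketch only, since FSDirectory.open() already picks a sensible platform default (the is64BitVm/isWindows flags are illustrative stand-ins for whatever detection the application uses):

import java.io.File;
import java.io.IOException;

import org.apache.lucene.store.Directory;
import org.apache.lucene.store.MMapDirectory;
import org.apache.lucene.store.NIOFSDirectory;
import org.apache.lucene.store.SimpleFSDirectory;

public class DirectoryChoice {
    public static Directory open(File path, boolean is64BitVm, boolean isWindows) throws IOException {
        if (is64BitVm) {
            // Plenty of address space: map the index into virtual memory.
            // Uses neither Java heap nor perm-gen memory, only address space.
            return new MMapDirectory(path);
        }
        if (isWindows) {
            // NIO positional reads are synchronized on Windows (Sun bug 6265734),
            // so plain SimpleFSDirectory is usually no worse there.
            return new SimpleFSDirectory(path);
        }
        // On Unix, NIOFSDirectory allows concurrent reads on one descriptor.
        return new NIOFSDirectory(path);
    }
}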
How to close IndexSearcher so that index file gets released?
Hi, I want to be able to regenerate the index from time to time. I'm using an IndexSearcher for search and want to be able to release the current index files so that I can replace them with new ones. But once the IndexSearcher is instantiated, it does not seem to release the index files even if I call close(). I'm running the test on Windows XP. Here is a short test that I use:

String indexDir = "C:/IndexTemp2/index/";
IndexSearcher searcher = new IndexSearcher(new MMapDirectory(new File(indexDir)));
searcher.close();
/* Trying to see if the index file can be modified */
new FileWriter(indexDir + "_0.cfs");
/* java.io.FileNotFoundException: C:\IndexTemp2\index\_0.cfs (The requested operation cannot be performed on a file with a user-mapped section open.) */

After I close the IndexSearcher I try to check whether I can modify the file, but it is in use. Could someone tell me what is the correct way to close the IndexReader? I will try to attach the JUnit test class and index directory as a ZIP archive to this message. Thanks, Sergey
Re: How to close IndexSearcher so that index file gets released?
Read the javadocs for MMapDirectory. -- Ian.

On Mon, Aug 16, 2010 at 2:21 PM, Mylnikov Sergey <semy...@yandex.ru> wrote: Hi, I want to be able to regenerate the index from time to time. I'm using an IndexSearcher for search and want to be able to release the current index files so that I can replace them with new ones.
Re: How to close IndexSearcher so that index file gets released?
Thanks, Ian. Somehow I did not bother to read the MMapDirectory javadoc.

16.08.10, 17:27, Ian Lea <ian@gmail.com>: Read the javadocs for MMapDirectory. -- Ian.
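What those javadocs point at is that memory-mapped buffers are only released when the garbage collector gets to them, so on Windows the mapped files stay undeletable even after close(). A sketch of the documented workaround, assuming the Lucene 3.x MMapDirectory API:

import java.io.File;

import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.store.MMapDirectory;

public class UnmapExample {
    public static void main(String[] args) throws Exception {
        MMapDirectory dir = new MMapDirectory(new File("C:/IndexTemp2/index"));
        // Ask Lucene to forcibly unmap buffers on close, where the JVM supports it.
        if (MMapDirectory.UNMAP_SUPPORTED) {
            dir.setUseUnmap(true);
        }
        IndexSearcher searcher = new IndexSearcher(dir, true); // read-only searcher
        try {
            // ... run searches ...
        } finally {
            searcher.close(); // with unmap enabled, the .cfs files can now be replaced
            dir.close();
        }
    }
}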
Re: IndexSearcher - open file handles by deleted files
Just closing the IndexSearcher should be enough. Are you really sure you're closing all the IndexSearchers you've opened? Hmm, the code looks somewhat dangerous. Why sleep for 10 seconds before closing? Is this to ensure any in-flight queries finish? It's better to track this explicitly (e.g. with IndexReader's incRef/decRef). What if determineIndexDirectories is called again before 10 seconds have passed? Mike

On Wed, May 26, 2010 at 9:44 AM, Thomas Rewig <tre...@mufin.com> wrote: Hello, I use Lucene 2.9.1 with two indices, which alternate each day. One is live; the other is erased and renewed with the latest data. The problem is that the index files get deleted, but the file handles are still open. If the program (JBoss) is not restarted for some time, disk space becomes scarce. With lsof I see e.g.:

java 6054 root  80r REG 8,1 5939406525 84663 /usr/_index/2/item3_index/_2fdtq2.cfs (deleted)
java 6054 root  82r REG 8,1  401785779 78344 /usr/_index/2/item2_index/_5exkf.cfs (deleted)
java 6054 root  84r REG 8,1  106496943 72217 /usr/_index/2/item1_index/_85bld.cfs (deleted)
java 6054 root 147r REG 8,1 5939406525 84663 /usr/_index/2/item3_index/_2fdtq2.cfs (deleted)
java 6054 root 150r REG 8,1  401785779 78344 /usr/_index/2/item2_index/_5exkf.cfs (deleted)

### open a specific searcher: ###

public static Searcher getSearcher(String indexName) throws IOException {
    Searcher searcher = searchersList.get(indexName);
    if (searcher == null) {
        String path = getPath(subDir, indexName);
        Directory directory = new NIOFSDirectory(new File(path));
        searcher = new IndexSearcher(directory, true);
        directoriesList.put(indexName, directory);
        searchersList.put(indexName, searcher);
    }
    return searcher;
}

### switch the searchers: ###

public static void determineIndexDirectories() {
    searchersListOld = searchersList;
    searchersList = new Hashtable<String, Searcher>();
    directoriesListOld = directoriesList;
    directoriesList = new Hashtable<String, Directory>();
    subDir = getLastIndexDir();
    closeOldSearchers();
}

### close the searchers: ###

private static void closeOldSearchers() {
    new Thread() {
        public void run() {
            try {
                sleep(10 * 1000);
            } catch (InterruptedException e) {
                logger.error("IndexManager.closeOldSearchers", e);
            }
            for (Searcher searcher : searchersListOld.values()) {
                try {
                    searcher.close();
                } catch (IOException e) {
                    logger.error("Error closing Searcher.", e);
                }
            }
            searchersListOld.clear();
            searchersListOld = null;
            for (Directory directory : directoriesListOld.values()) {
                try {
                    directory.close();
                } catch (IOException e) {
                    logger.error("Error closing Directory.", e);
                }
            }
            directoriesListOld.clear();
            directoriesListOld = null;
        }
    }.start();
}

I searched for this problem in the mailing list, and there are similar problems with an IndexReader that was not closed correctly. If I create an IndexSearcher from a Directory, could it be that there is a similar problem, e.g. that the underlying IndexReader (if there is one) is not closed automatically when I call searcher.close()? Do I have to close anything other than all the IndexSearchers and Directories? Or am I wrong with my assumption, and the problem is somewhere else? Best, Thomas
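The explicit tracking Mike suggests could replace the sleep entirely. A minimal sketch (Lucene 2.9/3.x API; the acquire/release names are illustrative - Lucene's own SearcherManager later standardised this pattern):

import java.io.IOException;

import org.apache.lucene.index.IndexReader;
import org.apache.lucene.search.IndexSearcher;

public class SwappableSearcher {
    private IndexSearcher current;

    public SwappableSearcher(IndexReader reader) {
        // reader arrives with refCount == 1; this object now owns that reference
        current = new IndexSearcher(reader);
    }

    // Every query must bracket its work with acquire()/release().
    public synchronized IndexSearcher acquire() {
        current.getIndexReader().incRef(); // pin the reader for this query
        return current;
    }

    public void release(IndexSearcher s) throws IOException {
        s.getIndexReader().decRef(); // file handles are freed at refCount == 0
    }

    // Called on the daily index swap; no sleep() needed. The old reader's
    // deleted files are released as soon as the last in-flight query finishes.
    public synchronized void swap(IndexReader newReader) throws IOException {
        IndexReader old = current.getIndexReader();
        current = new IndexSearcher(newReader);
        old.decRef(); // drop the reference this manager was holding
    }
}

Each search thread would then do: IndexSearcher s = manager.acquire(); try { ... search ... } finally { manager.release(s); }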