Re: thread safe shared IndexSearcher

Jay Yu Mon, 24 Sep 2007 15:48:05 -0700

Mark,

Great effort getting the original lucene index accessor package in thisshape. I am sure this will benefit a lot of people using Lucene in amultithread env.

I have a quick question to ask you:
Do you have to use the core Lucene 2.3-dev in order to use the accessor?

I will take a look at your codes to see if I could help. I used aslightly modified version of the original package in my project but itbreaks some of my tests. I hope your version works better.


Thanks a lot!

Jay


Mark Miller wrote:

I have sat down and rewrote IndexAccessor from scratch. I copied in thesame reference counting logic, pruned some things, and tried to make thewhole package a bit simpler to use. I have a few things to do, but itspretty solid already. The only major thing I'd still like to do is addan option to warm searchers before putting them in the Searcher cache.Id like to writer some more tests as well. Any help greatly appreciatedif your interested in using the thing.
http://myhardshadow.com/indexaccessor/trunk/src/test/com/mhs/indexaccessor/SimpleSearchServer.java
Here is a an example of a class that can be instantiated in one ofmultiple threads and read /modify a single index without worrying aboutwhat anyof the other threads are doing to the index at any given time. This is avery simple example of how to use the IndexAccessor and not necessarily anexample of best practices. The main idea is that you get your Writer,Searcher, or Reader, and then be sure to release it as soon as your donewith itin a finally block. For loading, you will want to load many docs with aWriter (batch them) before releasing it, but remember that Readers willnot get a new viewof the index until you release all of the Writers. So beware hogging aWriter unless you thats what your intending.
JavaDoc:
http://myhardshadow.com/indexaccessorapi/

Code:
http://myhardshadow.com/indexaccessor/trunk/

Jar:
http://myhardshadow.com/indexaccessorreleases/indexaccessor.jar


Your synchronized block concerns:
The synchronized blocks that control accesss to the IndexAccessor do nothave a huge impact on performance. Keep in mind that all of the work isnot done in a synchonrized block, just the retrieval of the Searcher,Writer, Reader. Even if the synchronization makes the method twice asexpensive, it is still overpowered by the cost of parsing queries andsearching the index. This applies with or without contention. I wrote asimple test and included the output below. You might use the IBM LockAnalyzer for Java to further analyze these costs. Trust me, this thingis speedy. Its many times better than using IndexModifier.
Without Contention
Just retrieve and release Searcher 100000 times
----
avg time:6.3E-4 ms
total time:63 ms

Parse query and search on 1 doc 100000 times
----
avg time:0.03107 ms
total time:3107 ms


With Contention (40 other threads running 80000 searches)
Just retrieve and release Searcher 100000 times
----
avg time:0.04643 ms
total time:4643 ms

Parse query and search on 1 doc 100000 times
----
avg time:0.64337 ms
total time:64337 ms


- Mark

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: thread safe shared IndexSearcher

Reply via email to