[ 
https://issues.apache.org/jira/browse/OAK-1702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13965128#comment-13965128
 ] 

Chetan Mehrotra commented on OAK-1702:
--------------------------------------

Some more updates. These are with following changes
* Use of OakCodec
* Not storing the text in Index
* Using a lazy cursor
* Shared IndexSearcher
* Optionally using the FSDir by copying the content from OakDirectory to local 
dir

{noformat}
//sahred searcher,fsdir
# FullTextSearchTest               C     min     10%     50%     90%     max    
   N
Oak-Tar                            1       2       2       3       4      38   
21455
Oak-Tar                            5       0       1       1       3     141  
168816
Oak-Tar                           10       0       1       2      13     111  
117977

//sahred searcher
# FullTextSearchTest               C     min     10%     50%     90%     max    
   N
Oak-Tar                            1       2       3       3       4      20   
18148
Oak-Tar                            5       0       1       2       5      57  
113634
Oak-Tar                           10       1       1       5      17      99   
82880

//sahred searcher
# FullTextSearchTest               C     min     10%     50%     90%     max    
   N
Oak-Tar-FDS                        1       5       5       6       7      81   
10028
Oak-Tar-FDS                        5       1       2       6      16      88   
38415
Oak-Tar-FDS                       10       1       3      13      41     210   
32434

//sahred searcher, fsdir
# FullTextSearchTest               C     min     10%     50%     90%     max    
   N
Oak-Tar-FDS                        1       2       2       3       4      37   
21465
Oak-Tar-FDS                        5       0       1       1       3     188  
173179
Oak-Tar-FDS                       10       0       1       2      12     124  
124892
{noformat}

* Oak-Tar - With shared searcher it performs much better than JR2. Using FSDir 
gives a small boost
* Oak-Tar-FDS - With only shared searcher the perf is comparable to JR2. Using 
FSDir brings it at par with default Oak-Tar

Still need to take numbers with Oak-Mongo and Oak-Mongo-FDS where mongo is 
remote. 


> Create a benchmark for Full text search
> ---------------------------------------
>
>                 Key: OAK-1702
>                 URL: https://issues.apache.org/jira/browse/OAK-1702
>             Project: Jackrabbit Oak
>          Issue Type: Task
>          Components: bench
>            Reporter: Chetan Mehrotra
>            Assignee: Chetan Mehrotra
>             Fix For: 1.1
>
>         Attachments: OAK-1702-lazy-cursor.patch, 
> OAK-1702-shared-indexer.patch, OAK-1702.oakcodec.patch, OAK-1702.patch
>
>
> To compare the performance of Full text search between Jackrabbit 2 and Oak a 
> benchmark should be added.
> To start with the benchmark would do following
> * Would be based on WikipediaImport benchmark. So it would import the 
> wikipedia dump and perform full text query on that
> * Should be able to run on both JR2 and Oak. Need to account for maven setup 
> to handle different Lucene version as JR2 uses 3.6.0 and Oak use 4.x
> Later we can add concurrent version



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to