[ https://issues.apache.org/jira/browse/OAK-1702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13965128#comment-13965128 ]
Chetan Mehrotra commented on OAK-1702:
--------------------------------------

Some more updates. These numbers are with the following changes:
* Use of OakCodec
* Not storing the text in the index
* Using a lazy cursor
* Shared IndexSearcher
* Optionally using FSDir by copying the content from OakDirectory to a local dir

{noformat}
// shared searcher, fsdir
# FullTextSearchTest     C     min     10%     50%     90%     max       N
Oak-Tar                  1       2       2       3       4      38   21455
Oak-Tar                  5       0       1       1       3     141  168816
Oak-Tar                 10       0       1       2      13     111  117977

// shared searcher
# FullTextSearchTest     C     min     10%     50%     90%     max       N
Oak-Tar                  1       2       3       3       4      20   18148
Oak-Tar                  5       0       1       2       5      57  113634
Oak-Tar                 10       1       1       5      17      99   82880

// shared searcher
# FullTextSearchTest     C     min     10%     50%     90%     max       N
Oak-Tar-FDS              1       5       5       6       7      81   10028
Oak-Tar-FDS              5       1       2       6      16      88   38415
Oak-Tar-FDS             10       1       3      13      41     210   32434

// shared searcher, fsdir
# FullTextSearchTest     C     min     10%     50%     90%     max       N
Oak-Tar-FDS              1       2       2       3       4      37   21465
Oak-Tar-FDS              5       0       1       1       3     188  173179
Oak-Tar-FDS             10       0       1       2      12     124  124892
{noformat}

* Oak-Tar - With the shared searcher it performs much better than JR2. Using FSDir gives a small boost.
* Oak-Tar-FDS - With only the shared searcher the perf is comparable to JR2. Using FSDir brings it on par with default Oak-Tar.

Still need to take numbers with Oak-Mongo and Oak-Mongo-FDS where Mongo is remote.

> Create a benchmark for Full text search
> ---------------------------------------
>
>                 Key: OAK-1702
>                 URL: https://issues.apache.org/jira/browse/OAK-1702
>             Project: Jackrabbit Oak
>          Issue Type: Task
>          Components: bench
>            Reporter: Chetan Mehrotra
>            Assignee: Chetan Mehrotra
>             Fix For: 1.1
>
>         Attachments: OAK-1702-lazy-cursor.patch, OAK-1702-shared-indexer.patch, OAK-1702.oakcodec.patch, OAK-1702.patch
>
>
> To compare the performance of Full text search between Jackrabbit 2 and Oak a
> benchmark should be added.
> To start with the benchmark would do following
> * Would be based on WikipediaImport benchmark.
> So it would import the wikipedia dump and perform full text queries on that
> * Should be able to run on both JR2 and Oak. Need to account for maven setup
> to handle different Lucene versions as JR2 uses 3.6.0 and Oak uses 4.x
> Later we can add a concurrent version

--
This message was sent by Atlassian JIRA
(v6.2#6252)