[ 
https://issues.apache.org/jira/browse/LUCENE-2232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12805916#action_12805916
 ] 

Paul Elschot commented on LUCENE-2232:
--------------------------------------

Meanwhile I've been looking around some more through contrib/benchmark.
I think I can make an index from the dump of the wiki articles, and the 
articles seem to be large enough.
The only question left is how to get/generate interesting (sloppy) phrase 
queries, any ideas there?

Also, if anyone has a test on real data with fields of at least 128 words and 
test queries including phrase
queries (anything that uses the position data) I'd like to hear about how this 
goes (index size, query performance).
I'm expecting a decent speedup for those cases, but I have no idea how big the 
speedup would be.

> Use VShort to encode positions
> ------------------------------
>
>                 Key: LUCENE-2232
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2232
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Index
>            Reporter: Paul Elschot
>         Attachments: LUCENE-2232-nonbackwards.patch, 
> LUCENE-2232-nonbackwards.patch
>
>
> Improve decoding speed for typical case of two bytes for a delta position at 
> the cost of increasing the size of the proximity file.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Reply via email to