[
https://issues.apache.org/jira/browse/LUCENE-4290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13429916#comment-13429916
]
Robert Muir commented on LUCENE-4290:
-------------------------------------
I get some improvements here in performance (for non-prox queries) by hacking
up luceneutil to
test queries with postingshighlighter+offsets vs fastvectorhighlighter+vectors.
However, I don't think this will be realistically useful until we have the new
block layout from the pfor branch:
prox queries are hurt by the interleaving in the stream (just like if you use
payloads), unrelated to highlighting.
I tried to do more experiments like 'wikibig' in luceneutil but i ran out of
disk space.
Once we have the block layout landed lets revisit this: it gives a much smaller
index, faster indexing,
and I think will work well when thats sorted out.
> basic highlighter that uses postings offsets
> --------------------------------------------
>
> Key: LUCENE-4290
> URL: https://issues.apache.org/jira/browse/LUCENE-4290
> Project: Lucene - Core
> Issue Type: New Feature
> Components: modules/other
> Reporter: Robert Muir
> Attachments: LUCENE-4290.patch
>
>
> We added IndexOptions.DOCS_AND_FREQS_AND_POSITIONS_AND_OFFSETS so you can
> efficiently compress character offsets in the postings list, but nothing yet
> makes use of this.
> Here is a simple highlighter that uses them: it doesn't have many tests or
> fancy features, but I think its ok for the sandbox/ (maybe with a couple more
> tests)
> Additionally I didnt do any benchmarking.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]