[
https://issues.apache.org/jira/browse/LUCENE-5156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14083072#comment-14083072
]
Robert Muir commented on LUCENE-5156:
-------------------------------------
Sorry David, its not about being against speeding something up, its about how
you propose implementing it.
Copying all the data from the entire document into another array on the first
read for the doc, that's a really trashy thing to do here. Instead, we should
just fix it correctly, so that seekCeil() is not linear time.
> CompressingTermVectors termsEnum should probably not support seek-by-ord
> ------------------------------------------------------------------------
>
> Key: LUCENE-5156
> URL: https://issues.apache.org/jira/browse/LUCENE-5156
> Project: Lucene - Core
> Issue Type: Bug
> Reporter: Robert Muir
> Fix For: 4.5, 5.0
>
> Attachments: LUCENE-5156.patch
>
>
> Just like term vectors before it, it has a O(n) seek-by-term.
> But this one also advertises a seek-by-ord, only this is also O(n).
> This could cause e.g. checkindex to be very slow, because if termsenum
> supports ord it does a bunch of seeking tests. (Another solution would be to
> leave it, and add a boolean so checkindex never does seeking tests for term
> vectors, only real fields).
> However, I think its also kinda a trap, in my opinion if seek-by-ord is
> supported anywhere, you kinda expect it to be faster than linear time...?
--
This message was sent by Atlassian JIRA
(v6.2#6252)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]