[ 
https://issues.apache.org/jira/browse/LUCENE-5596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969391#comment-13969391
 ] 

Adrien Grand commented on LUCENE-5596:
--------------------------------------

bq. Longer terms are only prefixed up to a maximum length, the remaining stuff 
is only stored full precision. In fact we only apply prefix terms to the first 
n bytes of the term, everything loger gets stored in full precision only. I 
think, it makes no sense to have longer prefixes than maybe 8 bytes in the 
index.

+1 I think this makes sense to have a maximum prefix length (that would be 
configurable hopefully) and enforce that this maximum prefix length is less 
than 255.

This makes me wonder it would be nice to have something that adapts itself to 
the data, and would only index prefixes that match more than X terms (maybe 
similarly to the way that the block tree terms dict tries to share prefixes). 
But this looks significantly harder to implement!
Maybe LUCENE-5422 could be of interest as well: for example if the set of terms 
that have "XY" and "XYZ" as prefixes are the same, they could point to the same 
postings list. (Just wild ideas, feel free to ignore them :))

> Support for index/search large numeric field
> --------------------------------------------
>
>                 Key: LUCENE-5596
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5596
>             Project: Lucene - Core
>          Issue Type: New Feature
>            Reporter: Kevin Wang
>            Assignee: Uwe Schindler
>         Attachments: LUCENE-5596.patch, LUCENE-5596.patch
>
>
> Currently if an number is larger than Long.MAX_VALUE, we can't index/search 
> that in lucene as a number. For example, IPv6 address is an 128 bit number, 
> so we can't index that as a numeric field and do numeric range query etc.
> It would be good to support BigInteger / BigDecimal
> I've tried use BigInteger for IPv6 in Elasticsearch and that works fine, but 
> there are still lots of things to do
> https://github.com/elasticsearch/elasticsearch/pull/5758



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to