[ 
https://issues.apache.org/jira/browse/PHOENIX-1544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

James Taylor updated PHOENIX-1544:
----------------------------------
    Description: 
Needs to be fleshed out more, but I think we could support indexing array data 
to improve query performance. We could generate an index row per array element, 
tacking on the position of the array element in the row key.

For example, given the array: ARRAY['a','b','c','a'] you could generate the 
following row keys(where the space is a null byte) when an INDEX is created 
over it:
{code}
a 0
b 1
c 2
a 3
{code}

Because the data is immutable, we don't need to worry about keeping it in sync 
with changes to the array (which would be difficult).

  was:
Needs to be fleshed out more, but when data is immutable, I think we have some 
interesting ways we could support indexing array data to improve query 
performance. We could generate an index row per array element, tacking on the 
position of the array element in the row key.

For example, given the array: ARRAY['a','b','c','a'] you could generate the 
following row keys(where the space is a null byte) when an INDEX is created 
over it:
{code}
a 0
b 1
c 2
a 3
{code}

Because the data is immutable, we don't need to worry about keeping it in sync 
with changes to the array (which would be difficult).


> Support indexing of an ARRAY
> ----------------------------
>
>                 Key: PHOENIX-1544
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-1544
>             Project: Phoenix
>          Issue Type: Bug
>            Reporter: James Taylor
>
> Needs to be fleshed out more, but I think we could support indexing array 
> data to improve query performance. We could generate an index row per array 
> element, tacking on the position of the array element in the row key.
> For example, given the array: ARRAY['a','b','c','a'] you could generate the 
> following row keys(where the space is a null byte) when an INDEX is created 
> over it:
> {code}
> a 0
> b 1
> c 2
> a 3
> {code}
> Because the data is immutable, we don't need to worry about keeping it in 
> sync with changes to the array (which would be difficult).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to