Re: Array indexing functions

Petar Zečević Tue, 20 Nov 2018 02:55:08 -0800


Hi,
yes, these are imlemented just like native functions in sql.functions, with 
code generation, so whole-stage codegen should apply.


Regarding plan optimization, I am not sure how these would be taken into 
account in the existing rules, except maybe for filter pushdown.

Petar


Alessandro Solimando <alessandro.solima...@gmail.com> writes:

> Hi Petar,
> I have implemented similar functions a few times through ad-hoc UDFs in the 
> past, so +1 from me.
>
> Can you elaborate a bit more on how you practically implement those 
> functions? Are they UDF or "native" functions like those in sql.functions 
> package?
>
> I am asking because I wonder if/how Catalyst can take those functions into 
> account for producing more optimized plans, maybe you or someone else in the 
> list can clarify this.
>
> Best regards,
> Alessandro
>
> On Tue, 20 Nov 2018 at 11:11, Petar Zečević <petar.zece...@gmail.com> wrote:
>
>  Hi,
>  I implemented two array functions that are useful to us and I wonder if you 
> think it would be useful to add them to the distribution. The functions are 
> used for filtering arrays based on indexes:
>
>  array_allpositions (named after array_position) - takes a column and a value 
> and returns an array of the column's indexes corresponding to elements equal 
> to the provided value
>
>  array_select - takes an array column and an array of indexes and returns a 
> subset of the array based on the provided indexes.
>
>  If you agree with this addition I can create a JIRA ticket and a pull 
> request.
>
>  -- 
>  Petar Zečević
>
>  ---------------------------------------------------------------------
>  To unsubscribe e-mail: dev-unsubscr...@spark.apache.org




---------------------------------------------------------------------
To unsubscribe e-mail: dev-unsubscr...@spark.apache.org

Re: Array indexing functions

Reply via email to