Re: Will higher order functions in spark SQL be pushed upstream?

2018-04-19 Thread Michael Davies
Hi Herman, 

That’s great and thanks for quick reply. The JIRA has an example of transform 
and refers to a presto doc with lots of functions. Do you know which functions 
will be supported. I am interested in using filter for example.

Cheers

Mick


> On 19 Apr 2018, at 10:46, Herman van Hövell tot Westerflier 
>  wrote:
> 
> Hi Mick,
> 
> Higher order functions will be in the next release of Spark (2.4). See 
> SPARK-23908  for more 
> information. The PR should be out in a few weeks.
> 
> - Herman
> 
> On Thu, Apr 19, 2018 at 11:20 AM, Mick Davies  > wrote:
> Hi,
> Regarding higher order functions
> 
> > Yes, we intend to contribute this to open source.
> 
> It doesn't look like this is in 2.3.0, at least I can't find it.
> 
> Do you know when it might reach open source.
> 
> Thanks
> 
> Mick
> 
> 
> 
> --
> Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/ 
> 
> 
> -
> To unsubscribe e-mail: dev-unsubscr...@spark.apache.org 
> 
> 
> 



Re: Will higher order functions in spark SQL be pushed upstream?

2018-04-19 Thread Mick Davies
Hi,
Regarding higher order functions

> Yes, we intend to contribute this to open source.

It doesn't look like this is in 2.3.0, at least I can't find it.

Do you know when it might reach open source.

Thanks

Mick



--
Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/

-
To unsubscribe e-mail: dev-unsubscr...@spark.apache.org



Re: Will higher order functions in spark SQL be pushed upstream?

2017-10-10 Thread DB Tsai
Hello,

At Netflix's algorithm team, we work on ranking problems a lot where
we naturally deal with the dataset with nested list of the structs. We
built Scala APIs like map, filter, drop, withColumn that can work on
the nested list of structs efficiently using SQL expression with
codegen.

Here is what we purpose on how APIs will look like, and we would like
to socialize with community to get more feedback!

https://issues.apache.org/jira/browse/SPARK-22231

It will be cool to share some building blocks with Databricks's higher
order function feature.

Thanks.

On Fri, Jun 9, 2017 at 5:04 PM, Antoine HOM  wrote:
> Good news :) Thx Sameer.
>
>
> On Friday, June 9, 2017, Sameer Agarwal  wrote:
>>>
>>> * As a heavy user of complex data types I was wondering if there was
>>> any plan to push those changes upstream?
>>
>>
>> Yes, we intend to contribute this to open source.
>>
>>>
>>> * In addition, I was wondering if as part of this change it also tries
>>> to solve the column pruning / filter pushdown issues with complex
>>> datatypes?
>>
>>
>> For parquet, this effort is primarily tracked via SPARK-4502 (see
>> https://github.com/apache/spark/pull/16578) and is currently targeted for
>> 2.3.

-- 
Sincerely,

DB Tsai
--
PGP Key ID: 0x5CED8B896A6BDFA0

-
To unsubscribe e-mail: dev-unsubscr...@spark.apache.org



Re: Will higher order functions in spark SQL be pushed upstream?

2017-06-09 Thread Antoine HOM
Good news :) Thx Sameer.


On Friday, June 9, 2017, Sameer Agarwal  wrote:

> * As a heavy user of complex data types I was wondering if there was
>> any plan to push those changes upstream?
>>
>
> Yes, we intend to contribute this to open source.
>
>
>> * In addition, I was wondering if as part of this change it also tries
>> to solve the column pruning / filter pushdown issues with complex
>> datatypes?
>
>
> For parquet, this effort is primarily tracked via SPARK-4502 (see
> https://github.com/apache/spark/pull/16578) and is currently targeted for
> 2.3.
>


Re: Will higher order functions in spark SQL be pushed upstream?

2017-06-09 Thread Sameer Agarwal
>
> * As a heavy user of complex data types I was wondering if there was
> any plan to push those changes upstream?
>

Yes, we intend to contribute this to open source.


> * In addition, I was wondering if as part of this change it also tries
> to solve the column pruning / filter pushdown issues with complex
> datatypes?


For parquet, this effort is primarily tracked via SPARK-4502 (see
https://github.com/apache/spark/pull/16578) and is currently targeted for
2.3.


Re: Will higher order functions in spark SQL be pushed upstream?

2017-06-09 Thread Olivier Girardot
+1 for the question

2017-06-07 19:50 GMT+02:00 Antoine HOM :

> Hey guys,
>
> Databricks released higher order functions as part of their runtime
> 3.0 beta (https://databricks.com/blog/2017/05/24/working-with-
> nested-data-using-higher-order-functions-in-sql-on-databricks.html),
> which helps working with array within SQL statements.
>
> * As a heavy user of complex data types I was wondering if there was
> any plan to push those changes upstream?
> * In addition, I was wondering if as part of this change it also tries
> to solve the column pruning / filter pushdown issues with complex
> datatypes?
>
> Thanks!
>
> -
> To unsubscribe e-mail: dev-unsubscr...@spark.apache.org
>
>


-- 
*Olivier Girardot* | Associé
o.girar...@lateral-thoughts.com
+33 6 24 09 17 94


Will higher order functions in spark SQL be pushed upstream?

2017-06-07 Thread Antoine HOM
Hey guys,

Databricks released higher order functions as part of their runtime
3.0 beta 
(https://databricks.com/blog/2017/05/24/working-with-nested-data-using-higher-order-functions-in-sql-on-databricks.html),
which helps working with array within SQL statements.

* As a heavy user of complex data types I was wondering if there was
any plan to push those changes upstream?
* In addition, I was wondering if as part of this change it also tries
to solve the column pruning / filter pushdown issues with complex
datatypes?

Thanks!

-
To unsubscribe e-mail: dev-unsubscr...@spark.apache.org