Re: [SQL] Unsupported features

2018-06-05 Thread Kai Jiang
FYI, Umbrella JIRA ticket: https://issues.apache.org/jira/browse/BEAM-4476

On Mon, Jun 4, 2018 at 3:08 PM Kai Jiang  wrote:

> Ismaël, I was running this naive code snippet.
> Yes, IT would be interesting. Next step, I was thinking of is making the
> progress automatically and integrating with Nexmark.
> Do you have any ideas about this? Currently, I ingested data by reading
> plain CSV file. Is that possible to run batch job with non-generated data
> in Nexmark?
>
> Best,
> Kai
>
> On Mon, Jun 4, 2018 at 4:41 AM Ismaël Mejía  wrote:
>
>> This is super interesting, great work Kai!
>>
>> Just for curiosity, How are you validating this?
>> It would be really interesting to have this also as part of some kind of
>> IT for the future.
>>
>>
>> On Fri, Jun 1, 2018 at 7:43 PM Kai Jiang  wrote:
>>
>>> Sounds a good idea! I will file the major problems later and use a task
>>> issue to track.
>>>
>>> Best,
>>> Kai
>>>
>>> On Fri, Jun 1, 2018 at 10:10 AM Anton Kedin  wrote:
>>>
>>>> This looks very helpful, thank you.
>>>>
>>>> Can you file Jiras for the major problems? Or maybe a single jira for
>>>> the whole thing with sub-tasks for specific problems.
>>>>
>>>> Regards,
>>>> Anton
>>>>
>>>> On Wed, May 30, 2018 at 9:12 AM Kenneth Knowles wrote:
>>>>
>>>>> This is extremely useful. Thanks for putting so much information
>>>>> together!
>>>>>
>>>>> Kenn
>>>>>
>>>>> On Wed, May 30, 2018 at 8:19 AM Kai Jiang wrote:
>>>>>
>>>>>> Hi all,
>>>>>>
>>>>>> Based on pull/5481, I
>>>>>> manually did a coverage test with TPC-ds queries (65%) and TPC-h queries
>>>>>> (100%) and want to see what features Beam SQL is currently not
>>>>>> supporting.
>>>>>> Test was running on DirectRunner.
>>>>>>
>>>>>> I want to share the result.
>>>>>> TPC-DS queries on Beam
>>>>>>
>>>>>> TL;DR:
>>>>>>
>>>>>>    1. aggregation function (stddev) missing or calculation of
>>>>>>    aggregation functions combination.
>>>>>>    2. nested beamjoinrel(condition=[true], joinType=[inner]) / cross
>>>>>>    join error
>>>>>>    3. date type casting/ calculation and other types casting.
>>>>>>    4. LIKE operator in String / alias for substring function
>>>>>>    5. order by w/o limit clause.
>>>>>>    6. OR operator is not supported in join condition
>>>>>>    7. Syntax: exist/ not exist (errors) .rank() over (partition
>>>>>>    by) / view (unsupported)
>>>>>>
>>>>>>
>>>>>> Best,
>>>>>> Kai
>>>>>>
>>>>>


Re: [SQL] Unsupported features

2018-06-04 Thread Kai Jiang
Ismaël, I was running this naive code snippet.
Yes, an IT would be interesting. As a next step, I am thinking of automating
the process and integrating it with Nexmark.
Do you have any ideas about this? Currently, I ingest the data by reading a
plain CSV file. Is it possible to run a batch job with non-generated data
in Nexmark?

Best,
Kai

On Mon, Jun 4, 2018 at 4:41 AM Ismaël Mejía  wrote:

> This is super interesting, great work Kai!
>
> Just for curiosity, How are you validating this?
> It would be really interesting to have this also as part of some kind of
> IT for the future.
>
>
> On Fri, Jun 1, 2018 at 7:43 PM Kai Jiang  wrote:
>
>> Sounds a good idea! I will file the major problems later and use a task
>> issue to track.
>>
>> Best,
>> Kai
>>
>> On Fri, Jun 1, 2018 at 10:10 AM Anton Kedin  wrote:
>>
>>> This looks very helpful, thank you.
>>>
>>> Can you file Jiras for the major problems? Or maybe a single jira for
>>> the whole thing with sub-tasks for specific problems.
>>>
>>> Regards,
>>> Anton
>>>
>>> On Wed, May 30, 2018 at 9:12 AM Kenneth Knowles  wrote:
>>>
>>>> This is extremely useful. Thanks for putting so much information
>>>> together!
>>>>
>>>> Kenn
>>>>
>>>> On Wed, May 30, 2018 at 8:19 AM Kai Jiang wrote:
>>>>
>>>>> Hi all,
>>>>>
>>>>> Based on pull/5481, I
>>>>> manually did a coverage test with TPC-ds queries (65%) and TPC-h queries
>>>>> (100%) and want to see what features Beam SQL is currently not supporting.
>>>>> Test was running on DirectRunner.
>>>>>
>>>>> I want to share the result.
>>>>> TPC-DS queries on Beam
>>>>>
>>>>> TL;DR:
>>>>>
>>>>>    1. aggregation function (stddev) missing or calculation of
>>>>>    aggregation functions combination.
>>>>>    2. nested beamjoinrel(condition=[true], joinType=[inner]) / cross
>>>>>    join error
>>>>>    3. date type casting/ calculation and other types casting.
>>>>>    4. LIKE operator in String / alias for substring function
>>>>>    5. order by w/o limit clause.
>>>>>    6. OR operator is not supported in join condition
>>>>>    7. Syntax: exist/ not exist (errors) .rank() over (partition
>>>>>    by) / view (unsupported)
>>>>>
>>>>>
>>>>> Best,
>>>>> Kai
>>>>>
>>>>


Re: [SQL] Unsupported features

2018-06-04 Thread Ismaël Mejía
This is super interesting, great work Kai!

Just out of curiosity, how are you validating this?
It would be really interesting to have this as part of some kind of IT
in the future.


On Fri, Jun 1, 2018 at 7:43 PM Kai Jiang  wrote:

> Sounds a good idea! I will file the major problems later and use a task
> issue to track.
>
> Best,
> Kai
>
> On Fri, Jun 1, 2018 at 10:10 AM Anton Kedin  wrote:
>
>> This looks very helpful, thank you.
>>
>> Can you file Jiras for the major problems? Or maybe a single jira for the
>> whole thing with sub-tasks for specific problems.
>>
>> Regards,
>> Anton
>>
>> On Wed, May 30, 2018 at 9:12 AM Kenneth Knowles  wrote:
>>
>>> This is extremely useful. Thanks for putting so much information
>>> together!
>>>
>>> Kenn
>>>
>>> On Wed, May 30, 2018 at 8:19 AM Kai Jiang  wrote:
>>>
>>>> Hi all,
>>>>
>>>> Based on pull/5481, I
>>>> manually did a coverage test with TPC-ds queries (65%) and TPC-h queries
>>>> (100%) and want to see what features Beam SQL is currently not supporting.
>>>> Test was running on DirectRunner.
>>>>
>>>> I want to share the result.
>>>> TPC-DS queries on Beam
>>>>
>>>> TL;DR:
>>>>
>>>>    1. aggregation function (stddev) missing or calculation of
>>>>    aggregation functions combination.
>>>>    2. nested beamjoinrel(condition=[true], joinType=[inner]) / cross
>>>>    join error
>>>>    3. date type casting/ calculation and other types casting.
>>>>    4. LIKE operator in String / alias for substring function
>>>>    5. order by w/o limit clause.
>>>>    6. OR operator is not supported in join condition
>>>>    7. Syntax: exist/ not exist (errors) .rank() over (partition
>>>>    by) / view (unsupported)
>>>>
>>>>
>>>> Best,
>>>> Kai
>>>>
>>>


Re: [SQL] Unsupported features

2018-06-01 Thread Kai Jiang
Sounds like a good idea! I will file the major problems later and use a task
issue to track them.

Best,
Kai

On Fri, Jun 1, 2018 at 10:10 AM Anton Kedin  wrote:

> This looks very helpful, thank you.
>
> Can you file Jiras for the major problems? Or maybe a single jira for the
> whole thing with sub-tasks for specific problems.
>
> Regards,
> Anton
>
> On Wed, May 30, 2018 at 9:12 AM Kenneth Knowles  wrote:
>
>> This is extremely useful. Thanks for putting so much information together!
>>
>> Kenn
>>
>> On Wed, May 30, 2018 at 8:19 AM Kai Jiang  wrote:
>>
>>> Hi all,
>>>
>>> Based on pull/5481, I
>>> manually did a coverage test with TPC-ds queries (65%) and TPC-h queries
>>> (100%) and want to see what features Beam SQL is currently not supporting.
>>> Test was running on DirectRunner.
>>>
>>> I want to share the result.
>>> TPC-DS queries on Beam
>>>
>>> TL;DR:
>>>
>>>    1. aggregation function (stddev) missing or calculation of
>>>    aggregation functions combination.
>>>    2. nested beamjoinrel(condition=[true], joinType=[inner]) / cross
>>>    join error
>>>    3. date type casting/ calculation and other types casting.
>>>    4. LIKE operator in String / alias for substring function
>>>    5. order by w/o limit clause.
>>>    6. OR operator is not supported in join condition
>>>    7. Syntax: exist/ not exist (errors) .rank() over (partition by)
>>>    / view (unsupported)
>>>
>>>
>>> Best,
>>> Kai
>>>
>>


Re: [SQL] Unsupported features

2018-06-01 Thread Anton Kedin
This looks very helpful, thank you.

Can you file Jiras for the major problems? Or maybe a single Jira for the
whole thing, with sub-tasks for specific problems?

Regards,
Anton

On Wed, May 30, 2018 at 9:12 AM Kenneth Knowles  wrote:

> This is extremely useful. Thanks for putting so much information together!
>
> Kenn
>
> On Wed, May 30, 2018 at 8:19 AM Kai Jiang  wrote:
>
>> Hi all,
>>
>> Based on pull/5481, I
>> manually did a coverage test with TPC-ds queries (65%) and TPC-h queries
>> (100%) and want to see what features Beam SQL is currently not supporting.
>> Test was running on DirectRunner.
>>
>> I want to share the result.
>> TPC-DS queries on Beam
>>
>> TL;DR:
>>
>>    1. aggregation function (stddev) missing or calculation of
>>    aggregation functions combination.
>>    2. nested beamjoinrel(condition=[true], joinType=[inner]) / cross
>>    join error
>>    3. date type casting/ calculation and other types casting.
>>    4. LIKE operator in String / alias for substring function
>>    5. order by w/o limit clause.
>>    6. OR operator is not supported in join condition
>>    7. Syntax: exist/ not exist (errors) .rank() over (partition by)
>>    / view (unsupported)
>>
>>
>> Best,
>> Kai
>>
>


Re: [SQL] Unsupported features

2018-05-30 Thread Kenneth Knowles
This is extremely useful. Thanks for putting so much information together!

Kenn

On Wed, May 30, 2018 at 8:19 AM Kai Jiang  wrote:

> Hi all,
>
> Based on pull/5481, I manually
> did a coverage test with TPC-ds queries (65%) and TPC-h queries (100%) and
> want to see what features Beam SQL is currently not supporting. Test was
> running on DirectRunner.
>
> I want to share the result.
> TPC-DS queries on Beam
>
> TL;DR:
>
>    1. aggregation function (stddev) missing or calculation of aggregation
>    functions combination.
>    2. nested beamjoinrel(condition=[true], joinType=[inner]) / cross join
>    error
>    3. date type casting/ calculation and other types casting.
>    4. LIKE operator in String / alias for substring function
>    5. order by w/o limit clause.
>    6. OR operator is not supported in join condition
>    7. Syntax: exist/ not exist (errors) .rank() over (partition by) /
>    view (unsupported)
>
>
> Best,
> Kai
>


[SQL] Unsupported features

2018-05-30 Thread Kai Jiang
Hi all,

Based on pull/5481, I manually ran a coverage test with TPC-DS queries (65%)
and TPC-H queries (100%) to see which features Beam SQL does not currently
support. The test ran on DirectRunner.

I want to share the result:
TPC-DS queries on Beam

TL;DR:

   1. Missing aggregation function (stddev), or calculations that combine
   aggregation functions.
   2. Nested BeamJoinRel(condition=[true], joinType=[inner]) / cross join
   error.
   3. Date type casting/calculation, and casting of other types.
   4. LIKE operator on strings / alias for the substring function.
   5. ORDER BY without a LIMIT clause.
   6. OR operator is not supported in join conditions.
   7. Syntax: EXISTS / NOT EXISTS (errors); rank() over (partition by) /
   view (unsupported).
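
As a rough illustration of the categories above, here is a sketch of one or
two query shapes per item. These are hypothetical examples written against
the public TPC-DS table and column names, not the actual queries from the
test run, and they have not been checked against Beam SQL. The names
EXAMPLES, queries_for and recent_sales are made up for this sketch; the
Python dict is only a convenient container for the SQL text.

```python
# Illustrative query shapes only, keyed by the TL;DR item number above.
# Table and column names follow the public TPC-DS schema; the queries are
# hypothetical examples, NOT the ones from the actual test run.
EXAMPLES = {
    # 1. stddev, and an expression combining two aggregation functions
    1: ["SELECT ss_store_sk, STDDEV_SAMP(ss_quantity) / AVG(ss_quantity) "
        "FROM store_sales GROUP BY ss_store_sk"],
    # 2. cross join (a join whose condition is simply true)
    2: ["SELECT i_item_id, s_store_name FROM item CROSS JOIN store"],
    # 3. casting to DATE and date arithmetic
    3: ["SELECT d_date FROM date_dim "
        "WHERE d_date <= CAST('2000-01-01' AS DATE) + INTERVAL '30' DAY"],
    # 4. LIKE on a string column, and SUBSTR as an alias for SUBSTRING
    4: ["SELECT i_item_id FROM item WHERE i_item_desc LIKE '%blue%'",
        "SELECT SUBSTR(ca_zip, 1, 5) FROM customer_address"],
    # 5. ORDER BY without LIMIT
    5: ["SELECT i_item_id FROM item ORDER BY i_item_id"],
    # 6. OR inside a join condition
    6: ["SELECT ss_item_sk FROM store_sales JOIN item "
        "ON ss_item_sk = i_item_sk OR ss_promo_sk = i_item_sk"],
    # 7. EXISTS, rank() over (partition by), and views
    #    (recent_sales is a made-up view name)
    7: ["SELECT c_customer_id FROM customer c WHERE EXISTS "
        "(SELECT 1 FROM store_sales WHERE ss_customer_sk = c.c_customer_sk)",
        "SELECT i_item_id, RANK() OVER "
        "(PARTITION BY i_category ORDER BY i_current_price) FROM item",
        "CREATE VIEW recent_sales AS "
        "SELECT ss_item_sk, ss_quantity FROM store_sales"],
}


def queries_for(item):
    """Return the example query shapes for one TL;DR item (1-7)."""
    return EXAMPLES[item]


if __name__ == "__main__":
    # Print every example, prefixed with the item number it illustrates.
    for item in sorted(EXAMPLES):
        for sql in queries_for(item):
            print(f"{item}. {sql}")
```

Something like this could serve as a starting point for a small regression
list once the individual problems are filed.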


Best,
Kai