[jira] [Comment Edited] (BEAM-9239) Dependency conflict with Spark using aws io

2020-08-20 Thread Kai Jiang (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17181630#comment-17181630
 ] 

Kai Jiang edited comment on BEAM-9239 at 8/21/20, 5:53 AM:
---

I am not too familiar with Flink. However, is it possible to shade or 
relocate the {{com.amazonaws}} packages in your Flink application jar? You could 
take a look at the Flink docs on how to do this and follow them: 
[https://ci.apache.org/projects/flink/flink-docs-stable/monitoring/debugging_classloading.html#resolving-dependency-conflicts-with-flink-using-the-maven-shade-plugin]
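
Before and after relocating, it can help to confirm which jar a conflicting class is actually loaded from. The following is only a sketch using plain JDK calls; the class name `ClassOrigin` and the default class name passed to it are assumptions for illustration and do not come from this thread.

```java
import java.security.CodeSource;

/** Hypothetical diagnostic helper: reports where a class was loaded from. */
final class ClassOrigin {

    /** Returns the jar or directory a class came from, or a marker if none is recorded. */
    static String locate(Class<?> type) {
        CodeSource source = type.getProtectionDomain().getCodeSource();
        // Classes from the bootstrap loader (e.g. java.lang.String) have no code source.
        if (source == null || source.getLocation() == null) {
            return "<bootstrap or unknown>";
        }
        return source.getLocation().toString();
    }

    /** Same as locate(Class), but resolves the class by name first. */
    static String locateByName(String className) {
        try {
            // initialize=false: resolve the class without running its static initializers.
            Class<?> type = Class.forName(className, false, ClassOrigin.class.getClassLoader());
            return locate(type);
        } catch (ClassNotFoundException | LinkageError e) {
            return "<not on classpath: " + className + ">";
        }
    }

    public static void main(String[] args) {
        // Default is an assumed example of a class under the com.amazonaws packages.
        String name = args.length > 0 ? args[0] : "com.amazonaws.auth.AWSCredentialsProvider";
        System.out.println(name + " -> " + locateByName(name));
    }
}
```

Run inside the job (or with the same classpath), this prints the location of the jar that supplied the class, which shows whether the application's copy or the cluster's copy is being picked up.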


was (Author: vectorijk):
I am not too familiar with Flink. However, is it possible to shade or 
relocate the {{com.amazonaws}} packages in your Flink application jar? You could 
take a look at the Flink docs on how to do this: 
[https://ci.apache.org/projects/flink/flink-docs-stable/monitoring/debugging_classloading.html#resolving-dependency-conflicts-with-flink-using-the-maven-shade-plugin]

> Dependency conflict with Spark using aws io
> ---
>
> Key: BEAM-9239
> URL: https://issues.apache.org/jira/browse/BEAM-9239
> Project: Beam
>  Issue Type: Bug
>  Components: io-java-aws, runner-spark
>Affects Versions: 2.17.0
>Reporter: David McIntosh
>Priority: P1
>
> Starting with beam 2.17.0 I get this error in the Spark 2.4.4 driver when aws 
> io is also used:
> {noformat}
> java.lang.NoSuchMethodError: 
> com.fasterxml.jackson.databind.jsontype.TypeSerializer.typeId(Ljava/lang/Object;Lcom/fasterxml/jackson/core/JsonToken;)Lcom/fasterxml/jackson/core/type/WritableTypeId;
>   at 
> org.apache.beam.sdk.io.aws.options.AwsModule$AWSCredentialsProviderSerializer.serializeWithType(AwsModule.java:163)
>   at 
> org.apache.beam.sdk.io.aws.options.AwsModule$AWSCredentialsProviderSerializer.serializeWithType(AwsModule.java:134)
>   at 
> com.fasterxml.jackson.databind.ser.impl.TypeWrappedSerializer.serialize(TypeWrappedSerializer.java:32)
>   at 
> com.fasterxml.jackson.databind.ser.DefaultSerializerProvider.serializeValue(DefaultSerializerProvider.java:130)
>   at 
> com.fasterxml.jackson.databind.ObjectMapper._configAndWriteValue(ObjectMapper.java:3559)
>   at 
> com.fasterxml.jackson.databind.ObjectMapper.writeValueAsString(ObjectMapper.java:2927)
>   at 
> org.apache.beam.sdk.options.ProxyInvocationHandler$Serializer.ensureSerializable(ProxyInvocationHandler.java:721)
>   at 
> org.apache.beam.sdk.options.ProxyInvocationHandler$Serializer.serialize(ProxyInvocationHandler.java:647)
>   at 
> org.apache.beam.sdk.options.ProxyInvocationHandler$Serializer.serialize(ProxyInvocationHandler.java:635)
>   at 
> com.fasterxml.jackson.databind.ser.DefaultSerializerProvider.serializeValue(DefaultSerializerProvider.java:130)
>   at 
> com.fasterxml.jackson.databind.ObjectMapper._configAndWriteValue(ObjectMapper.java:3559)
>   at 
> com.fasterxml.jackson.databind.ObjectMapper.writeValueAsString(ObjectMapper.java:2927)
>   at 
> org.apache.beam.runners.core.construction.SerializablePipelineOptions.serializeToJson(SerializablePipelineOptions.java:67)
>   at 
> org.apache.beam.runners.core.construction.SerializablePipelineOptions.<init>(SerializablePipelineOptions.java:43)
>   at 
> org.apache.beam.runners.spark.translation.EvaluationContext.<init>(EvaluationContext.java:71)
>   at org.apache.beam.runners.spark.SparkRunner.run(SparkRunner.java:215)
>   at org.apache.beam.runners.spark.SparkRunner.run(SparkRunner.java:90)
> {noformat}
> The cause seems to be that the Spark driver environment uses an older version 
> of Jackson. I tried to update jackson on the Spark cluster but that led to 
> several other errors. 
> The change that started causing this was:
> https://github.com/apache/beam/commit/b68d70a47b68ad84efcd9405c1799002739bd116
> After reverting that change I was able to successfully run my job.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9239) Dependency conflict with Spark using aws io

2020-08-20 Thread Kai Jiang (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17181630#comment-17181630
 ] 

Kai Jiang commented on BEAM-9239:
-

I am not too familiar with Flink. However, is it possible to shade or 
relocate the {{com.amazonaws}} packages in your Flink application jar? You could 
take a look at the Flink docs on how to do this: 
[https://ci.apache.org/projects/flink/flink-docs-stable/monitoring/debugging_classloading.html#resolving-dependency-conflicts-with-flink-using-the-maven-shade-plugin]



[jira] [Comment Edited] (BEAM-9239) Dependency conflict with Spark using aws io

2020-05-29 Thread Kai Jiang (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17120035#comment-17120035
 ] 

Kai Jiang edited comment on BEAM-9239 at 5/30/20, 12:23 AM:


Spark has two experimental configs, `spark.driver.userClassPathFirst` and 
`spark.executor.userClassPathFirst`. If both are set to true, user-added jars 
take precedence over Spark's own jars when loading classes in cluster mode.
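
As a rough illustration of the two settings, the sketch below assembles a `spark-submit` command line that passes both of them through `--conf`. The helper class `SparkSubmitArgs`, the jar name and the main class are placeholders invented for this example; only the two property names come from the comment above.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper: builds spark-submit arguments with both flags enabled. */
final class SparkSubmitArgs {

    static List<String> build(String appJar, String mainClass) {
        List<String> args = new ArrayList<>();
        args.add("spark-submit");
        args.add("--deploy-mode");
        args.add("cluster");
        // Both properties are experimental; they make user-added jars win over Spark's own.
        args.add("--conf");
        args.add("spark.driver.userClassPathFirst=true");
        args.add("--conf");
        args.add("spark.executor.userClassPathFirst=true");
        args.add("--class");
        args.add(mainClass);
        // The application jar goes last, after all options.
        args.add(appJar);
        return args;
    }

    public static void main(String[] unused) {
        // Jar name and main class are placeholders, not taken from the issue.
        List<String> args = build("my-pipeline-bundled.jar", "com.example.MyPipeline");
        System.out.println(String.join(" ", args));
    }
}
```

Whether this resolves the Jackson conflict depends on the cluster, since putting user classes first can surface other incompatibilities with Spark's own dependencies.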


was (Author: vectorijk):
Spark has two experimental configs, `spark.driver.userClassPathFirst` and 
`spark.executor.userClassPathFirst`. They give user-added jars precedence over 
Spark's own jars when loading classes in cluster mode.



[jira] [Commented] (BEAM-9239) Dependency conflict with Spark using aws io

2020-05-29 Thread Kai Jiang (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17120038#comment-17120038
 ] 

Kai Jiang commented on BEAM-9239:
-

Maybe vendoring a relocated com.fasterxml.jackson library could be a long-term 
solution for this.



[jira] [Commented] (BEAM-9239) Dependency conflict with Spark using aws io

2020-05-29 Thread Kai Jiang (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17120035#comment-17120035
 ] 

Kai Jiang commented on BEAM-9239:
-

Spark has two experimental configs, `spark.driver.userClassPathFirst` and 
`spark.executor.userClassPathFirst`. They give user-added jars precedence over 
Spark's own jars when loading classes in cluster mode.



[jira] [Commented] (BEAM-10000) Support BIT_XOR aggregation function in BeamSQL

2020-05-14 Thread Kai Jiang (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-10000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17107787#comment-17107787
 ] 

Kai Jiang commented on BEAM-10000:
--

wow! #10000

> Support BIT_XOR aggregation function in BeamSQL
> ---
>
> Key: BEAM-10000
> URL: https://issues.apache.org/jira/browse/BEAM-10000
> Project: Beam
>  Issue Type: Task
>  Components: dsl-sql
>Reporter: Rui Wang
>Priority: Major
>
> See reference: 
> https://cloud.google.com/bigquery/docs/reference/standard-sql/aggregate_functions#bit_xor





[jira] [Resolved] (BEAM-9121) Bump vendored calcite to 1.21.0

2020-03-04 Thread Kai Jiang (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kai Jiang resolved BEAM-9121.
-
Fix Version/s: 2.20.0
   Resolution: Won't Fix

> Bump vendored calcite to 1.21.0
> ---
>
> Key: BEAM-9121
> URL: https://issues.apache.org/jira/browse/BEAM-9121
> Project: Beam
>  Issue Type: Task
>  Components: dsl-sql
>Reporter: Kai Jiang
>Assignee: Kai Jiang
>Priority: Major
> Fix For: 2.20.0
>
>






[jira] [Updated] (BEAM-9121) Bump vendored calcite to 1.21.0

2020-03-04 Thread Kai Jiang (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kai Jiang updated BEAM-9121:

Summary: Bump vendored calcite to 1.21.0  (was: Bump vendored calcite to 
1.22.0)

> Bump vendored calcite to 1.21.0
> ---
>
> Key: BEAM-9121
> URL: https://issues.apache.org/jira/browse/BEAM-9121
> Project: Beam
>  Issue Type: Task
>  Components: dsl-sql
>Reporter: Kai Jiang
>Assignee: Kai Jiang
>Priority: Major
>






[jira] [Assigned] (BEAM-9121) Bump vendored calcite to 1.22.0

2020-03-04 Thread Kai Jiang (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kai Jiang reassigned BEAM-9121:
---

Assignee: Kai Jiang

> Bump vendored calcite to 1.22.0
> ---
>
> Key: BEAM-9121
> URL: https://issues.apache.org/jira/browse/BEAM-9121
> Project: Beam
>  Issue Type: Task
>  Components: dsl-sql
>Reporter: Kai Jiang
>Assignee: Kai Jiang
>Priority: Major
>






[jira] [Updated] (BEAM-9121) Bump vendored calcite to 1.22.0

2020-03-04 Thread Kai Jiang (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kai Jiang updated BEAM-9121:

Summary: Bump vendored calcite to 1.22.0  (was: Bump vendored calcite to 
1.21.0)

> Bump vendored calcite to 1.22.0
> ---
>
> Key: BEAM-9121
> URL: https://issues.apache.org/jira/browse/BEAM-9121
> Project: Beam
>  Issue Type: Task
>  Components: dsl-sql
>Reporter: Kai Jiang
>Priority: Major
>






[jira] [Assigned] (BEAM-9121) Bump vendored calcite to 1.21.0

2020-03-04 Thread Kai Jiang (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kai Jiang reassigned BEAM-9121:
---

Assignee: (was: Kai Jiang)

> Bump vendored calcite to 1.21.0
> ---
>
> Key: BEAM-9121
> URL: https://issues.apache.org/jira/browse/BEAM-9121
> Project: Beam
>  Issue Type: Task
>  Components: dsl-sql
>Reporter: Kai Jiang
>Priority: Major
>






[jira] [Assigned] (BEAM-9412) Fix linkage errors in vendored calcite

2020-02-28 Thread Kai Jiang (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kai Jiang reassigned BEAM-9412:
---

Assignee: Kai Jiang

> Fix linkage errors in vendored calcite
> --
>
> Key: BEAM-9412
> URL: https://issues.apache.org/jira/browse/BEAM-9412
> Project: Beam
>  Issue Type: Improvement
>  Components: dsl-sql
>Reporter: Luke Cwik
>Assignee: Kai Jiang
>Priority: Minor
>
> As of [https://github.com/apache/beam/pull/10559], the linkage errors are:
> {code:java}
> Class org.slf4j.LoggerFactory is not found;
>   referenced by 29 class files
> 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.sql.pretty.SqlPrettyWriter
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.rel.rules.AbstractMaterializedViewRule
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.util.Benchmark 
> (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.plan.RexImplicationChecker
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.plan.VisitorDataContext
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.sql.SqlDialectFactoryImpl
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.sql.SqlDialect 
> (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.sql.validate.SqlValidatorException
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.runtime.ResultSetEnumerable
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.util.trace.CalciteTrace
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.runtime.CalciteException
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.avatica.remote.AvaticaHttpClientFactoryImpl
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.avatica.remote.RemoteProtobufService
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.avatica.remote.KerberosConnection
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.avatica.remote.AvaticaCommonsHttpClientImpl
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.avatica.remote.ClientKeytabJaasConf
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.avatica.remote.AvaticaCommonsHttpClientSpnegoImpl
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.avatica.remote.Driver
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.avatica.remote.ProtobufTranslationImpl
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.com.jayway.jsonpath.internal.filter.FilterCompiler
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.com.jayway.jsonpath.internal.filter.RelationalExpressionNode
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.com.jayway.jsonpath.internal.filter.ValueNode
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.com.jayway.jsonpath.internal.JsonContext
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.com.jayway.jsonpath.internal.path.ArrayPathToken
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.com.jayway.jsonpath.internal.path.CompiledPath
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.com.jayway.jsonpath.internal.path.PredicateContextImpl
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.com.jayway.jsonpath.spi.json.JsonOrgJsonProvider
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.com.jayway.jsonpath.spi.mapper.GsonMappingProvider
>  (beam-vendor-calcite-1_20_0-0.1-SNAPSHOT.jar)
> 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.avatica.metrics.MetricsSystemLoader
>  (beam-vendor-c

[jira] [Updated] (BEAM-8331) Vendored calcite breaks if another calcite is on the class path

2020-01-14 Thread Kai Jiang (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kai Jiang updated BEAM-8331:

Parent: BEAM-9121
Issue Type: Sub-task  (was: Bug)

> Vendored calcite breaks if another calcite is on the class path
> ---
>
> Key: BEAM-8331
> URL: https://issues.apache.org/jira/browse/BEAM-8331
> Project: Beam
>  Issue Type: Sub-task
>  Components: dsl-sql
>Affects Versions: 2.15.0, 2.16.0
>Reporter: Andrew Pilloud
>Assignee: Kai Jiang
>Priority: Major
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> If the beam vendored calcite and a non-vendored calcite are both on the 
> classpath, neither version works. This is because the non-JDBC calcite path 
> uses JDBC as an easy way to perform reflection. (This affects the non-JDBC 
> version of calcite.) We need to rewrite the calcite JDBC urls as part of our 
> vendoring (for example 'jdbc:calcite:' to 'jdbc:beam-vendor-calcite:'). 
> Example of where this happens: 
> [https://github.com/apache/calcite/blob/0cce229903a845a7b8ed36cf86d6078fd82d73d3/core/src/main/java/org/apache/calcite/tools/Frameworks.java#L175]
>  
> {code:java}
> java.lang.RuntimeException: java.lang.RuntimeException: Property 
> 'org.apache.beam.sdk.extensions.sql.impl.planner.BeamRelDataTypeSystem' not 
> valid for plugin type org.apache.calcite.rel.type.RelDataTypeSystem
>   at 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.tools.Frameworks.withPrepare(Frameworks.java:160)
>   at 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.tools.Frameworks.withPlanner(Frameworks.java:115)
>   at 
> org.apache.beam.sdk.extensions.sql.zetasql.ZetaSQLPlannerImpl.<init>(ZetaSQLPlannerImpl.java:86)
>   at 
> org.apache.beam.sdk.extensions.sql.zetasql.ZetaSQLQueryPlanner.<init>(ZetaSQLQueryPlanner.java:55){code}





[jira] [Updated] (BEAM-9121) Bump vendored calcite to 1.21.0

2020-01-14 Thread Kai Jiang (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kai Jiang updated BEAM-9121:

Status: Open  (was: Triage Needed)

> Bump vendored calcite to 1.21.0
> ---
>
> Key: BEAM-9121
> URL: https://issues.apache.org/jira/browse/BEAM-9121
> Project: Beam
>  Issue Type: Task
>  Components: dsl-sql
>Reporter: Kai Jiang
>Assignee: Kai Jiang
>Priority: Major
>






[jira] [Created] (BEAM-9121) Bump vendored calcite to 1.21.0

2020-01-14 Thread Kai Jiang (Jira)
Kai Jiang created BEAM-9121:
---

 Summary: Bump vendored calcite to 1.21.0
 Key: BEAM-9121
 URL: https://issues.apache.org/jira/browse/BEAM-9121
 Project: Beam
  Issue Type: Task
  Components: dsl-sql
Reporter: Kai Jiang
Assignee: Kai Jiang








[jira] [Comment Edited] (BEAM-8331) Vendored calcite breaks if another calcite is on the class path

2020-01-10 Thread Kai Jiang (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-8331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17012693#comment-17012693
 ] 

Kai Jiang edited comment on BEAM-8331 at 1/10/20 10:41 AM:
---

I have built a vendored Calcite locally that rewrites the JDBC URL, and tested 
it against the issue mentioned in BEAM-3386 by running a Beam SQL job on the 
Spark runner.

However, the Beam SQL job runs successfully on the Spark runner both with and 
without the JDBC URL rewrite.

[~apilloud] Would you mind sharing how to reproduce this issue in your case, so 
that I can validate the vendored Calcite with the rewritten JDBC URL?
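
For context, the clash described in the issue can be illustrated with plain JDBC: when two registered drivers accept the same URL prefix, `DriverManager` hands back whichever matching driver was registered first. The sketch below is a simplified illustration only. `StubDriver` and `JdbcPrefixClash` are hypothetical stand-ins, not Calcite's real driver classes, and only the URL matching is modeled.

```java
import java.sql.Connection;
import java.sql.Driver;
import java.sql.DriverManager;
import java.sql.DriverPropertyInfo;
import java.sql.SQLException;
import java.sql.SQLFeatureNotSupportedException;
import java.util.Properties;
import java.util.logging.Logger;

final class JdbcPrefixClash {

    /** Hypothetical stand-in for a JDBC driver that claims one URL prefix. */
    static final class StubDriver implements Driver {
        final String label;
        final String prefix;

        StubDriver(String label, String prefix) {
            this.label = label;
            this.prefix = prefix;
        }

        @Override public boolean acceptsURL(String url) {
            return url != null && url.startsWith(prefix);
        }
        @Override public Connection connect(String url, Properties info) throws SQLException {
            if (!acceptsURL(url)) {
                return null; // JDBC contract: return null for URLs this driver does not handle
            }
            throw new SQLFeatureNotSupportedException(label + " is only a stub");
        }
        @Override public DriverPropertyInfo[] getPropertyInfo(String url, Properties info) {
            return new DriverPropertyInfo[0];
        }
        @Override public int getMajorVersion() { return 1; }
        @Override public int getMinorVersion() { return 0; }
        @Override public boolean jdbcCompliant() { return false; }
        @Override public Logger getParentLogger() throws SQLFeatureNotSupportedException {
            throw new SQLFeatureNotSupportedException();
        }
    }

    /** Registers the drivers in order, asks DriverManager which one owns the URL, then cleans up. */
    static String resolve(String url, StubDriver... drivers) {
        try {
            for (StubDriver d : drivers) {
                DriverManager.registerDriver(d);
            }
            Driver chosen = DriverManager.getDriver(url);
            return chosen instanceof StubDriver ? ((StubDriver) chosen).label : chosen.getClass().getName();
        } catch (SQLException e) {
            throw new IllegalStateException(e);
        } finally {
            for (StubDriver d : drivers) {
                try {
                    DriverManager.deregisterDriver(d);
                } catch (SQLException ignored) {
                    // Nothing useful to do in this sketch if cleanup fails.
                }
            }
        }
    }

    public static void main(String[] args) {
        // Same prefix on both drivers: the first registered one is returned.
        System.out.println(resolve("jdbc:calcite:",
                new StubDriver("non-vendored", "jdbc:calcite:"),
                new StubDriver("vendored", "jdbc:calcite:")));
        // With the prefix rewritten as proposed in the issue, each URL reaches its own driver.
        System.out.println(resolve("jdbc:beam-vendor-calcite:",
                new StubDriver("non-vendored", "jdbc:calcite:"),
                new StubDriver("vendored", "jdbc:beam-vendor-calcite:")));
    }
}
```

A reproduction along these lines would need both a vendored and a non-vendored Calcite on the classpath at the same time, which may be why a plain Beam SQL job on the Spark runner does not trigger the problem.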


was (Author: vectorijk):
I have built a vendored Calcite locally that rewrites the JDBC URL, and tested 
it against the issue mentioned in BEAM-3386 by running a Beam SQL job on the 
Spark runner.

However, the Beam SQL job works on the Spark runner both with and without the 
JDBC URL rewrite.

[~apilloud] Would you mind sharing how to reproduce this issue in your case, so 
that I can validate the vendored Calcite with the rewritten JDBC URL?

> Vendored calcite breaks if another calcite is on the class path
> ---
>
> Key: BEAM-8331
> URL: https://issues.apache.org/jira/browse/BEAM-8331
> Project: Beam
>  Issue Type: Bug
>  Components: dsl-sql
>Affects Versions: 2.15.0, 2.16.0
>Reporter: Andrew Pilloud
>Assignee: Kai Jiang
>Priority: Major
>
> If the beam vendored calcite and a non-vendored calcite are both on the 
> classpath, neither version works. This is because the non-JDBC calcite path 
> uses JDBC as an easy way to perform reflection. (This affects the non-JDBC 
> version of calcite.) We need to rewrite the calcite JDBC urls as part of our 
> vendoring (for example 'jdbc:calcite:' to 'jdbc:beam-vendor-calcite:'). 
> Example of where this happens: 
> [https://github.com/apache/calcite/blob/0cce229903a845a7b8ed36cf86d6078fd82d73d3/core/src/main/java/org/apache/calcite/tools/Frameworks.java#L175]
>  
> {code:java}
> java.lang.RuntimeException: java.lang.RuntimeException: Property 
> 'org.apache.beam.sdk.extensions.sql.impl.planner.BeamRelDataTypeSystem' not 
> valid for plugin type org.apache.calcite.rel.type.RelDataTypeSystem
>   at 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.tools.Frameworks.withPrepare(Frameworks.java:160)
>   at 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.tools.Frameworks.withPlanner(Frameworks.java:115)
>   at 
> org.apache.beam.sdk.extensions.sql.zetasql.ZetaSQLPlannerImpl.<init>(ZetaSQLPlannerImpl.java:86)
>   at 
> org.apache.beam.sdk.extensions.sql.zetasql.ZetaSQLQueryPlanner.<init>(ZetaSQLQueryPlanner.java:55){code}
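
The reason for rewriting the URL prefix can be illustrated with a simplified sketch, assuming JDBC-style lookup in which each registered driver is asked whether it accepts a URL and the first match is used. `FakeDriver`, `registry`, `resolve`, and the labels are hypothetical stand-ins for illustration only, not Calcite or Beam classes.

```java
import java.util.ArrayList;
import java.util.List;

public class JdbcUrlPrefixSketch {
    /** Hypothetical stand-in for a JDBC driver: a label plus the URL prefix it claims. */
    public static final class FakeDriver {
        final String label;
        final String urlPrefix;

        FakeDriver(String label, String urlPrefix) {
            this.label = label;
            this.urlPrefix = urlPrefix;
        }

        boolean acceptsUrl(String url) {
            return url.startsWith(urlPrefix);
        }
    }

    /** Simplified lookup: the first registered driver that accepts the URL wins. */
    public static String resolve(List<FakeDriver> registry, String url) {
        for (FakeDriver driver : registry) {
            if (driver.acceptsUrl(url)) {
                return driver.label;
            }
        }
        return "none";
    }

    /** Registers a plain driver first, then a vendored one with the given prefix. */
    public static List<FakeDriver> registry(String vendoredPrefix) {
        List<FakeDriver> drivers = new ArrayList<>();
        drivers.add(new FakeDriver("plain-calcite", "jdbc:calcite:"));
        drivers.add(new FakeDriver("vendored-calcite", vendoredPrefix));
        return drivers;
    }

    public static void main(String[] args) {
        // Both drivers claim the same prefix: the vendored code reaches the plain driver.
        System.out.println(resolve(registry("jdbc:calcite:"), "jdbc:calcite:"));

        // With a rewritten prefix, each URL resolves to exactly one driver.
        List<FakeDriver> rewritten = registry("jdbc:beam-vendor-calcite:");
        System.out.println(resolve(rewritten, "jdbc:beam-vendor-calcite:"));
        System.out.println(resolve(rewritten, "jdbc:calcite:"));
    }
}
```

In this model the shared prefix sends a lookup meant for the vendored copy to the other copy, which is consistent with the "not valid for plugin type" error above; a distinct prefix removes the ambiguity.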



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-8331) Vendored calcite breaks if another calcite is on the class path

2020-01-10 Thread Kai Jiang (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-8331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17012693#comment-17012693
 ] 

Kai Jiang commented on BEAM-8331:
-

I have built a vendored Calcite locally with the JDBC URL rewritten. However, I 
have tested the issue mentioned in BEAM-3386 (by running a Beam SQL job on the Spark runner).

Vendored Calcite works for a Beam SQL job on the Spark runner both with and 
without rewriting the JDBC URL.

 

[~apilloud] Would you mind sharing how to reproduce this issue in your case, so 
that I can validate vendored Calcite with the rewritten JDBC URL?

> Vendored calcite breaks if another calcite is on the class path
> ---
>
> Key: BEAM-8331
> URL: https://issues.apache.org/jira/browse/BEAM-8331
> Project: Beam
>  Issue Type: Bug
>  Components: dsl-sql
> Affects Versions: 2.15.0, 2.16.0
> Reporter: Andrew Pilloud
> Assignee: Kai Jiang
> Priority: Major
>
> If the beam vendored calcite and a non-vendored calcite are both on the 
> classpath, neither version works. This is because the non-JDBC calcite path 
> uses JDBC as an easy way to perform reflection. (This affects the non-JDBC 
> version of calcite.) We need to rewrite the calcite JDBC urls as part of our 
> vendoring (for example 'jdbc:calcite:' to 'jdbc:beam-vendor-calcite:'). 
> Example of where this happens: 
> [https://github.com/apache/calcite/blob/0cce229903a845a7b8ed36cf86d6078fd82d73d3/core/src/main/java/org/apache/calcite/tools/Frameworks.java#L175]
>  
> {code:java}
> java.lang.RuntimeException: java.lang.RuntimeException: Property 
> 'org.apache.beam.sdk.extensions.sql.impl.planner.BeamRelDataTypeSystem' not 
> valid for plugin type org.apache.calcite.rel.type.RelDataTypeSystem
>   at 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.tools.Frameworks.withPrepare(Frameworks.java:160)
>   at 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.tools.Frameworks.withPlanner(Frameworks.java:115)
>   at 
> org.apache.beam.sdk.extensions.sql.zetasql.ZetaSQLPlannerImpl.<init>(ZetaSQLPlannerImpl.java:86)
>   at 
> org.apache.beam.sdk.extensions.sql.zetasql.ZetaSQLQueryPlanner.<init>(ZetaSQLQueryPlanner.java:55){code}





[jira] [Commented] (BEAM-8496) remove SDF translators in flink streaming transform translator

2020-01-08 Thread Kai Jiang (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-8496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17011288#comment-17011288
 ] 

Kai Jiang commented on BEAM-8496:
-

The failed Nexmark job on Flink was actually fixed by 
[https://github.com/apache/beam/commit/4af507f567605723af940c2af3b2912d0948adf5].

> remove SDF translators in flink streaming transform translator
> --
>
> Key: BEAM-8496
> URL: https://issues.apache.org/jira/browse/BEAM-8496
> Project: Beam
>  Issue Type: Bug
>  Components: runner-flink
> Reporter: Kai Jiang
> Assignee: Kai Jiang
> Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 7.5h
>  Remaining Estimate: 0h
>
> Since the SDF URN has been moved to runners-core-construction-java, we need to 
> remove the SDF translators from the Flink streaming transform translator.
> Otherwise, as seen in the failed Nexmark Jenkins 
> [job|https://builds.apache.org/job/beam_PostCommit_Java_Nexmark_Flink/4128/console],
>  a duplicate translator is registered in 
> [PTransformTranslation.KnownTransformPayloadTranslator()|https://github.com/apache/beam/blob/c2f0d282337f3ae0196a7717712396a5a41fdde1/runners/core-construction-java/src/main/java/org/apache/beam/runners/core/construction/PTransformTranslation.java#L290].





[jira] [Resolved] (BEAM-4663) Implement Cost calculations for Cost-Based Optimization (CBO)

2019-11-20 Thread Kai Jiang (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-4663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kai Jiang resolved BEAM-4663.
-
Fix Version/s: Not applicable
   Resolution: Invalid

> Implement Cost calculations for Cost-Based Optimization (CBO) 
> --
>
> Key: BEAM-4663
> URL: https://issues.apache.org/jira/browse/BEAM-4663
> Project: Beam
>  Issue Type: Sub-task
>  Components: dsl-sql
> Reporter: Kai Jiang
> Assignee: Kai Jiang
> Priority: Major
> Fix For: Not applicable
>
>  Time Spent: 2h 20m
>  Remaining Estimate: 0h
>
> To support CBO, we should implement cost methods in each Beam*Rel.java, with 
> computeSelfCost(...) as our first step.





[jira] [Created] (BEAM-8496) remove SDF translators in flink streaming transform translator

2019-10-28 Thread Kai Jiang (Jira)
Kai Jiang created BEAM-8496:
---

 Summary: remove SDF translators in flink streaming transform 
translator
 Key: BEAM-8496
 URL: https://issues.apache.org/jira/browse/BEAM-8496
 Project: Beam
  Issue Type: Bug
  Components: runner-flink
Reporter: Kai Jiang
Assignee: Kai Jiang


Since the SDF URN has been moved to runners-core-construction-java, we need to 
remove the SDF translators from the Flink streaming transform translator.
Otherwise, as seen in the failed Nexmark Jenkins 
[job|https://builds.apache.org/job/beam_PostCommit_Java_Nexmark_Flink/4128/console],
 a duplicate translator is registered in 
[PTransformTranslation.KnownTransformPayloadTranslator()|https://github.com/apache/beam/blob/c2f0d282337f3ae0196a7717712396a5a41fdde1/runners/core-construction-java/src/main/java/org/apache/beam/runners/core/construction/PTransformTranslation.java#L290].
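
The failure mode can be illustrated with a minimal sketch, assuming a registry keyed by URN that rejects a second registration for the same key. `TranslatorRegistrySketch`, the URN string, and the translator names are hypothetical and are not Beam's actual code.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class TranslatorRegistrySketch {
    // Maps a transform URN to the name of the translator registered for it.
    private final Map<String, String> translatorsByUrn = new LinkedHashMap<>();

    /** Registers a translator; a second registration for the same URN is an error. */
    public void register(String urn, String translatorName) {
        String existing = translatorsByUrn.putIfAbsent(urn, translatorName);
        if (existing != null) {
            throw new IllegalArgumentException(
                "Duplicate translator for URN " + urn + ": " + existing + " and " + translatorName);
        }
    }

    public int size() {
        return translatorsByUrn.size();
    }

    /** Returns true if registering the same (hypothetical) URN from two modules fails. */
    public static boolean registeringTwiceFails() {
        TranslatorRegistrySketch registry = new TranslatorRegistrySketch();
        registry.register("example:urn:sdf", "CoreConstructionTranslator");
        try {
            registry.register("example:urn:sdf", "FlinkStreamingTranslator");
            return false;
        } catch (IllegalArgumentException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        System.out.println("duplicate rejected: " + registeringTwiceFails());
    }
}
```

Under this assumption, removing the runner-side registration leaves a single translator per URN, which is the change this issue proposes.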





[jira] [Assigned] (BEAM-8331) Vendored calcite breaks if another calcite is on the class path

2019-10-02 Thread Kai Jiang (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kai Jiang reassigned BEAM-8331:
---

Assignee: Kai Jiang

> Vendored calcite breaks if another calcite is on the class path
> ---
>
> Key: BEAM-8331
> URL: https://issues.apache.org/jira/browse/BEAM-8331
> Project: Beam
>  Issue Type: Bug
>  Components: dsl-sql
> Affects Versions: 2.15.0, 2.16.0
> Reporter: Andrew Pilloud
> Assignee: Kai Jiang
> Priority: Major
>
> If the beam vendored calcite and a non-vendored calcite are both on the 
> classpath, neither version works. This is because the non-JDBC calcite path 
> uses JDBC as an easy way to perform reflection. (This affects the non-JDBC 
> version of calcite.) We need to rewrite the calcite JDBC urls as part of our 
> vendoring (for example 'jdbc:calcite:' to 'jdbc:beam-vendor-calcite:'). 
> Example of where this happens: 
> [https://github.com/apache/calcite/blob/0cce229903a845a7b8ed36cf86d6078fd82d73d3/core/src/main/java/org/apache/calcite/tools/Frameworks.java#L175]
>  
> {code:java}
> java.lang.RuntimeException: java.lang.RuntimeException: Property 
> 'org.apache.beam.sdk.extensions.sql.impl.planner.BeamRelDataTypeSystem' not 
> valid for plugin type org.apache.calcite.rel.type.RelDataTypeSystem
>   at 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.tools.Frameworks.withPrepare(Frameworks.java:160)
>   at 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.tools.Frameworks.withPlanner(Frameworks.java:115)
>   at 
> org.apache.beam.sdk.extensions.sql.zetasql.ZetaSQLPlannerImpl.<init>(ZetaSQLPlannerImpl.java:86)
>   at 
> org.apache.beam.sdk.extensions.sql.zetasql.ZetaSQLQueryPlanner.<init>(ZetaSQLQueryPlanner.java:55){code}





[jira] [Commented] (BEAM-8331) Vendored calcite breaks if another calcite is on the class path

2019-10-02 Thread Kai Jiang (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-8331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16943340#comment-16943340
 ] 

Kai Jiang commented on BEAM-8331:
-

Sure. I assigned it to myself.

> Vendored calcite breaks if another calcite is on the class path
> ---
>
> Key: BEAM-8331
> URL: https://issues.apache.org/jira/browse/BEAM-8331
> Project: Beam
>  Issue Type: Bug
>  Components: dsl-sql
> Affects Versions: 2.15.0, 2.16.0
> Reporter: Andrew Pilloud
> Priority: Major
>
> If the beam vendored calcite and a non-vendored calcite are both on the 
> classpath, neither version works. This is because the non-JDBC calcite path 
> uses JDBC as an easy way to perform reflection. (This affects the non-JDBC 
> version of calcite.) We need to rewrite the calcite JDBC urls as part of our 
> vendoring (for example 'jdbc:calcite:' to 'jdbc:beam-vendor-calcite:'). 
> Example of where this happens: 
> [https://github.com/apache/calcite/blob/0cce229903a845a7b8ed36cf86d6078fd82d73d3/core/src/main/java/org/apache/calcite/tools/Frameworks.java#L175]
>  
> {code:java}
> java.lang.RuntimeException: java.lang.RuntimeException: Property 
> 'org.apache.beam.sdk.extensions.sql.impl.planner.BeamRelDataTypeSystem' not 
> valid for plugin type org.apache.calcite.rel.type.RelDataTypeSystem
>   at 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.tools.Frameworks.withPrepare(Frameworks.java:160)
>   at 
> org.apache.beam.vendor.calcite.v1_20_0.org.apache.calcite.tools.Frameworks.withPlanner(Frameworks.java:115)
>   at 
> org.apache.beam.sdk.extensions.sql.zetasql.ZetaSQLPlannerImpl.<init>(ZetaSQLPlannerImpl.java:86)
>   at 
> org.apache.beam.sdk.extensions.sql.zetasql.ZetaSQLQueryPlanner.<init>(ZetaSQLQueryPlanner.java:55){code}





[jira] [Commented] (BEAM-2466) Add Kafka Streams runner

2019-08-28 Thread Kai Jiang (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-2466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16918157#comment-16918157
 ] 

Kai Jiang commented on BEAM-2466:
-

Nice progress. I reassigned the ticket to [~winkelman.kyle]. I am definitely 
very interested in collaborating on more features and testing.

> Add Kafka Streams runner
> 
>
> Key: BEAM-2466
> URL: https://issues.apache.org/jira/browse/BEAM-2466
> Project: Beam
>  Issue Type: Wish
>  Components: runner-ideas
> Reporter: Lorand Peter Kasler
> Assignee: Kyle Winkelman
> Priority: Minor
>
> Kafka Streams (https://kafka.apache.org/documentation/streams)  has more and 
> more features that could make it a viable candidate for a streaming runner. 
> It uses a Dataflow-like model.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)


[jira] [Assigned] (BEAM-2466) Add Kafka Streams runner

2019-08-28 Thread Kai Jiang (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-2466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kai Jiang reassigned BEAM-2466:
---

Assignee: Kyle Winkelman  (was: Kai Jiang)

> Add Kafka Streams runner
> 
>
> Key: BEAM-2466
> URL: https://issues.apache.org/jira/browse/BEAM-2466
> Project: Beam
>  Issue Type: Wish
>  Components: runner-ideas
> Reporter: Lorand Peter Kasler
> Assignee: Kyle Winkelman
> Priority: Minor
>
> Kafka Streams (https://kafka.apache.org/documentation/streams)  has more and 
> more features that could make it a viable candidate for a streaming runner. 
> It uses a Dataflow-like model.





[jira] [Assigned] (BEAM-5820) Vendor Calcite

2019-07-30 Thread Kai Jiang (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kai Jiang reassigned BEAM-5820:
---

Assignee: Kai Jiang

> Vendor Calcite
> --
>
> Key: BEAM-5820
> URL: https://issues.apache.org/jira/browse/BEAM-5820
> Project: Beam
>  Issue Type: Sub-task
>  Components: dsl-sql
> Reporter: Kenneth Knowles
> Assignee: Kai Jiang
> Priority: Major
>




--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Resolved] (BEAM-7728) Support ParquetTable in SQL

2019-07-30 Thread Kai Jiang (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-7728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kai Jiang resolved BEAM-7728.
-
   Resolution: Fixed
Fix Version/s: 2.15.0

> Support ParquetTable in SQL
> ---
>
> Key: BEAM-7728
> URL: https://issues.apache.org/jira/browse/BEAM-7728
> Project: Beam
>  Issue Type: New Feature
>  Components: dsl-sql
> Reporter: Kai Jiang
> Assignee: Kai Jiang
> Priority: Major
> Fix For: 2.15.0
>
>  Time Spent: 2h 10m
>  Remaining Estimate: 0h
>






[jira] [Comment Edited] (BEAM-7728) Support ParquetTable in SQL

2019-07-30 Thread Kai Jiang (JIRA)


[ 
https://issues.apache.org/jira/browse/BEAM-7728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16896822#comment-16896822
 ] 

Kai Jiang edited comment on BEAM-7728 at 7/31/19 6:49 AM:
--

Writer to be implemented.


was (Author: vectorijk):
to be implement writer

> Support ParquetTable in SQL
> ---
>
> Key: BEAM-7728
> URL: https://issues.apache.org/jira/browse/BEAM-7728
> Project: Beam
>  Issue Type: New Feature
>  Components: dsl-sql
> Reporter: Kai Jiang
> Assignee: Kai Jiang
> Priority: Major
>  Time Spent: 2h 10m
>  Remaining Estimate: 0h
>






[jira] [Commented] (BEAM-7728) Support ParquetTable in SQL

2019-07-30 Thread Kai Jiang (JIRA)


[ 
https://issues.apache.org/jira/browse/BEAM-7728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16896822#comment-16896822
 ] 

Kai Jiang commented on BEAM-7728:
-

to be implement writer

> Support ParquetTable in SQL
> ---
>
> Key: BEAM-7728
> URL: https://issues.apache.org/jira/browse/BEAM-7728
> Project: Beam
>  Issue Type: New Feature
>  Components: dsl-sql
> Reporter: Kai Jiang
> Assignee: Kai Jiang
> Priority: Major
>  Time Spent: 2h 10m
>  Remaining Estimate: 0h
>






[jira] [Created] (BEAM-7728) Support ParquetTable in SQL

2019-07-11 Thread Kai Jiang (JIRA)
Kai Jiang created BEAM-7728:
---

 Summary: Support ParquetTable in SQL
 Key: BEAM-7728
 URL: https://issues.apache.org/jira/browse/BEAM-7728
 Project: Beam
  Issue Type: New Feature
  Components: dsl-sql
Reporter: Kai Jiang
Assignee: Kai Jiang








[jira] [Assigned] (BEAM-5822) Vendor bytebuddy

2019-04-19 Thread Kai Jiang (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kai Jiang reassigned BEAM-5822:
---

Assignee: Kai Jiang

> Vendor bytebuddy
> 
>
> Key: BEAM-5822
> URL: https://issues.apache.org/jira/browse/BEAM-5822
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-java-core
> Reporter: Kenneth Knowles
> Assignee: Kai Jiang
> Priority: Major
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (BEAM-2466) Add Kafka Streams runner

2019-02-13 Thread Kai Jiang (JIRA)


[ 
https://issues.apache.org/jira/browse/BEAM-2466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16767394#comment-16767394
 ] 

Kai Jiang commented on BEAM-2466:
-

[~teabot] IMHO, there should not be a 'batch processing' concept for 
KStreams. I think we need to limit Beam to streaming mode only for the 
KafkaStreamsRunner.
[~aromanenko] I think it requires a Kafka topic as streaming input. Internally, 
the KStream source uses Kafka topics as input.

PoC branch: https://github.com/vectorijk/beam/tree/kafka-stream
Any ideas are welcome!



> Add Kafka Streams runner
> 
>
> Key: BEAM-2466
> URL: https://issues.apache.org/jira/browse/BEAM-2466
> Project: Beam
>  Issue Type: Wish
>  Components: runner-ideas
> Reporter: Lorand Peter Kasler
> Assignee: Kai Jiang
> Priority: Minor
>  Labels: triaged
>
> Kafka Streams (https://kafka.apache.org/documentation/streams)  has more and 
> more features that could make it a viable candidate for a streaming runner. 
> It uses a Dataflow-like model.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (BEAM-3386) Dependency conflict when Calcite is included in a project.

2019-01-24 Thread Kai Jiang (JIRA)


[ 
https://issues.apache.org/jira/browse/BEAM-3386?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16751942#comment-16751942
 ] 

Kai Jiang commented on BEAM-3386:
-

[~kenn] This (without relocation) sounds pretty useful. Let me check out the 
thread quickly.

> Dependency conflict when Calcite is included in a project.
> --
>
> Key: BEAM-3386
> URL: https://issues.apache.org/jira/browse/BEAM-3386
> Project: Beam
>  Issue Type: Bug
>  Components: dsl-sql
> Affects Versions: 2.2.0, 2.3.0, 2.4.0, 2.5.0, 2.6.0
> Reporter: Austin Haas
> Assignee: Kai Jiang
> Priority: Critical
>
> When Calcite (v. 1.13.0) is included in a project that also includes Beam and 
> the Beam SQL extension, then the following error is thrown when trying to run 
> Beam code.
> ClassCastException 
> org.apache.beam.sdk.extensions.sql.impl.planner.BeamRelDataTypeSystem cannot 
> be cast to org.apache.calcite.rel.type.RelDataTypeSystem
> org.apache.calcite.jdbc.CalciteConnectionImpl.<init> 
> (CalciteConnectionImpl.java:120)
> 
> org.apache.calcite.jdbc.CalciteJdbc41Factory$CalciteJdbc41Connection.<init> 
> (CalciteJdbc41Factory.java:114)
> org.apache.calcite.jdbc.CalciteJdbc41Factory.newConnection 
> (CalciteJdbc41Factory.java:59)
> org.apache.calcite.jdbc.CalciteJdbc41Factory.newConnection 
> (CalciteJdbc41Factory.java:44)
> org.apache.calcite.jdbc.CalciteFactory.newConnection 
> (CalciteFactory.java:53)
> org.apache.calcite.avatica.UnregisteredDriver.connect 
> (UnregisteredDriver.java:138)
> java.sql.DriverManager.getConnection (DriverManager.java:664)
> java.sql.DriverManager.getConnection (DriverManager.java:208)
> 
> org.apache.beam.sdks.java.extensions.sql.repackaged.org.apache.calcite.tools.Frameworks.withPrepare
>  (Frameworks.java:145)
> 
> org.apache.beam.sdks.java.extensions.sql.repackaged.org.apache.calcite.tools.Frameworks.withPlanner
>  (Frameworks.java:106)
> 
> org.apache.beam.sdks.java.extensions.sql.repackaged.org.apache.calcite.prepare.PlannerImpl.ready
>  (PlannerImpl.java:140)
> 
> org.apache.beam.sdks.java.extensions.sql.repackaged.org.apache.calcite.prepare.PlannerImpl.parse
>  (PlannerImpl.java:170)





[jira] [Commented] (BEAM-3386) Dependency conflict when Calcite is included in a project.

2019-01-24 Thread Kai Jiang (JIRA)


[ 
https://issues.apache.org/jira/browse/BEAM-3386?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16751928#comment-16751928
 ] 

Kai Jiang commented on BEAM-3386:
-

[~iemejia] Thanks! I will take some time to work on vendoring Calcite.

> Dependency conflict when Calcite is included in a project.
> --
>
> Key: BEAM-3386
> URL: https://issues.apache.org/jira/browse/BEAM-3386
> Project: Beam
>  Issue Type: Bug
>  Components: dsl-sql
> Affects Versions: 2.2.0, 2.3.0, 2.4.0, 2.5.0, 2.6.0
> Reporter: Austin Haas
> Assignee: Kai Jiang
> Priority: Critical
>
> When Calcite (v. 1.13.0) is included in a project that also includes Beam and 
> the Beam SQL extension, then the following error is thrown when trying to run 
> Beam code.
> ClassCastException 
> org.apache.beam.sdk.extensions.sql.impl.planner.BeamRelDataTypeSystem cannot 
> be cast to org.apache.calcite.rel.type.RelDataTypeSystem
> org.apache.calcite.jdbc.CalciteConnectionImpl.<init> 
> (CalciteConnectionImpl.java:120)
> 
> org.apache.calcite.jdbc.CalciteJdbc41Factory$CalciteJdbc41Connection.<init> 
> (CalciteJdbc41Factory.java:114)
> org.apache.calcite.jdbc.CalciteJdbc41Factory.newConnection 
> (CalciteJdbc41Factory.java:59)
> org.apache.calcite.jdbc.CalciteJdbc41Factory.newConnection 
> (CalciteJdbc41Factory.java:44)
> org.apache.calcite.jdbc.CalciteFactory.newConnection 
> (CalciteFactory.java:53)
> org.apache.calcite.avatica.UnregisteredDriver.connect 
> (UnregisteredDriver.java:138)
> java.sql.DriverManager.getConnection (DriverManager.java:664)
> java.sql.DriverManager.getConnection (DriverManager.java:208)
> 
> org.apache.beam.sdks.java.extensions.sql.repackaged.org.apache.calcite.tools.Frameworks.withPrepare
>  (Frameworks.java:145)
> 
> org.apache.beam.sdks.java.extensions.sql.repackaged.org.apache.calcite.tools.Frameworks.withPlanner
>  (Frameworks.java:106)
> 
> org.apache.beam.sdks.java.extensions.sql.repackaged.org.apache.calcite.prepare.PlannerImpl.ready
>  (PlannerImpl.java:140)
> 
> org.apache.beam.sdks.java.extensions.sql.repackaged.org.apache.calcite.prepare.PlannerImpl.parse
>  (PlannerImpl.java:170)





[jira] [Assigned] (BEAM-5130) Support ALL (sub query)

2019-01-11 Thread Kai Jiang (JIRA)


 [ 
https://issues.apache.org/jira/browse/BEAM-5130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kai Jiang reassigned BEAM-5130:
---

Assignee: Kai Jiang

> Support ALL (sub query)
> ---
>
> Key: BEAM-5130
> URL: https://issues.apache.org/jira/browse/BEAM-5130
> Project: Beam
>  Issue Type: Sub-task
>  Components: dsl-sql
> Reporter: Rui Wang
> Assignee: Kai Jiang
> Priority: Major
>
> Example:
>  
> SELECT order_id FROM order WHERE item_id = ALL (SELECT item_id FROM 
> special_item_table).
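
The semantics of `= ALL` in the example query can be sketched as follows, assuming the sub query returns a plain list of non-NULL integers. `EqualsAllSketch` and `equalsAll` are hypothetical names for illustration, and SQL's three-valued NULL handling is deliberately not modeled.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

public class EqualsAllSketch {
    /**
     * Models "x = ALL (sub query)": true when every sub query value equals x.
     * An empty sub query result makes the predicate vacuously true.
     * NULL values are not modeled in this sketch.
     */
    public static boolean equalsAll(int x, List<Integer> subQueryResult) {
        for (int value : subQueryResult) {
            if (value != x) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        System.out.println(equalsAll(7, Arrays.asList(7, 7)));
        System.out.println(equalsAll(7, Arrays.asList(7, 8)));
        System.out.println(equalsAll(7, Collections.<Integer>emptyList()));
    }
}
```

In this model a row is kept only when its item_id matches every row returned by the sub query, and every row is kept when the sub query returns nothing.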


