[GitHub] flink issue #6083: [FLINK-8983] End-to-end test: Confluent schema registry

2018-06-27 Thread medcv
Github user medcv commented on the issue:

https://github.com/apache/flink/pull/6083
  
@tillrohrmann PR has been updated!
I had to do some changes:
- map `User` object to `String` in `TestAvroConsumerConfluent` class
- use `read_messages_from_kafka` instead of `read_messages_from_kafka_avro` 
for vaidation as the events are `String` not `Avro` anymore
- removed `kafka-avro` dependency

When we add `AvroSerializationConfluentSchema` to Flink, I will update the 
test as we discussed.


---


[GitHub] flink issue #6083: [FLINK-8983] End-to-end test: Confluent schema registry

2018-06-27 Thread medcv
Github user medcv commented on the issue:

https://github.com/apache/flink/pull/6083
  
@tillrohrmann make sense. 
I created this ticket https://issues.apache.org/jira/browse/FLINK-9679 for 
implementing `AvroSerializationConfluentSchema`. I will make a PR for it 
shortly.
Mean time I will update this PR also and remove 
`AvroSerializationConfluentSchema` and using `SimpleStringSchema` which later 
we can update the test.


---


[GitHub] flink issue #6083: [FLINK-8983] End-to-end test: Confluent schema registry

2018-06-27 Thread tillrohrmann
Github user tillrohrmann commented on the issue:

https://github.com/apache/flink/pull/6083
  
This is a good point @medcv. However, I think we should tackle adding a 
`AvroSerializationConfluentSchema` as an orthogonal step. What about removing 
it from this PR which covers the existing integration with Confluent's schema 
registry. Additionally, we should open a JIRA issue to add a 
`AvroSerializationConfluentSchema` to Flink. Once this has been added, we can 
adapt this end-to-end test.

What do you think @medcv?


---


[GitHub] flink issue #6083: [FLINK-8983] End-to-end test: Confluent schema registry

2018-06-26 Thread medcv
Github user medcv commented on the issue:

https://github.com/apache/flink/pull/6083
  
@tillrohrmann 
here is my two cents:
By converting the events to `String` and sending them to Kafka there is a 
high risk to produce a `bad events` and we will lose the benifit of Schema 
Registry to avoid this malformed events sent to the topic. 

what you think about `AvroSerializationConfluentSchema` to Flink dist? If 
we move this serialization code in Flink dist we can drop `kafka-avro` 
dependencies from here.

but still if you think using `String` is ok I can update the PR and use 
`String` instead of `AvroSerializationConfluentSchema` 


---


[GitHub] flink issue #6083: [FLINK-8983] End-to-end test: Confluent schema registry

2018-06-26 Thread tillrohrmann
Github user tillrohrmann commented on the issue:

https://github.com/apache/flink/pull/6083
  
Why don't we output the events using the `SimpleStringSchema` without 
verification of the schema. That way we would no longer need 
`AvroSerializationConfluentSchema` and could get rid of the `kafka-avro` 
dependency. What do you think @medcv?


---


[GitHub] flink issue #6083: [FLINK-8983] End-to-end test: Confluent schema registry

2018-06-25 Thread medcv
Github user medcv commented on the issue:

https://github.com/apache/flink/pull/6083
  
@tillrohrmann I made the changes and used 
`ConfluentRegistryAvroDeserializationSchema` for the Deserializations.  
I still using `AvroSerializationConfluentSchema` as we need to compare each 
income `Event` with `Schema` before sending the data to Kafka and use the 
schema registry concept to have a full end2end test.
We might need to `AvroSerializationConfluentSchema` also to the Flink dist. 


---


[GitHub] flink issue #6083: [FLINK-8983] End-to-end test: Confluent schema registry

2018-06-25 Thread medcv
Github user medcv commented on the issue:

https://github.com/apache/flink/pull/6083
  
@dawidwys do u have an example using 
`ConfluentRegistryAvroDeserializationSchema`. I have some difficulties to make 
it work 



---


[GitHub] flink issue #6083: [FLINK-8983] End-to-end test: Confluent schema registry

2018-06-21 Thread medcv
Github user medcv commented on the issue:

https://github.com/apache/flink/pull/6083
  
@tillrohrmann PR is updated please review!
I will also look into `ConfluentRegistryAvroDeserializationSchema`  


---


[GitHub] flink issue #6083: [FLINK-8983] End-to-end test: Confluent schema registry

2018-06-20 Thread tillrohrmann
Github user tillrohrmann commented on the issue:

https://github.com/apache/flink/pull/6083
  
I agree with @dawidwys. We should use 
`ConfluentRegistryAvroDeserializationSchema` for the test. It is ok that this 
end-to-end test would then only go to Flink `1.6`. The benefit is that we only 
need to address the test related comments.


---


[GitHub] flink issue #6083: [FLINK-8983] End-to-end test: Confluent schema registry

2018-06-18 Thread medcv
Github user medcv commented on the issue:

https://github.com/apache/flink/pull/6083
  
@dawidwys Thanks for the review! I will look into that PR. This test is for 
Release 1.4 and 1.5 and provide a proof that Flink V4.0 and V5.0 can work with 
ConfluentSchema Registry properly. As PR you mentioned will go to next release 
I think this test still would be useful the way it is and for sure we can 
update it later.

@tillrohrmann Any thoughts?


---


[GitHub] flink issue #6083: [FLINK-8983] End-to-end test: Confluent schema registry

2018-06-18 Thread dawidwys
Github user dawidwys commented on the issue:

https://github.com/apache/flink/pull/6083
  
Just my two cents, we've recently introduced a `DeserializationSchema` that 
uses Confluent Schema Registry to decode data. Could you reuse it rather than 
introduce a new one. The PR in which it was introduced is here: #5995 


---


[GitHub] flink issue #6083: [FLINK-8983] End-to-end test: Confluent schema registry

2018-06-14 Thread medcv
Github user medcv commented on the issue:

https://github.com/apache/flink/pull/6083
  
@tillrohrmann Thanks for the review! I will go through them and will make 
the changes shortly.


---


[GitHub] flink issue #6083: [FLINK-8983] End-to-end test: Confluent schema registry

2018-06-13 Thread dmpour23
Github user dmpour23 commented on the issue:

https://github.com/apache/flink/pull/6083
  
@medcv Regarding KeyedDeserialization/KeyedSerializationSchema. I assume 
that the blocking part is the casting in deserializeKey, serializeKey,


---


[GitHub] flink issue #6083: [FLINK-8983] End-to-end test: Confluent schema registry

2018-06-12 Thread medcv
Github user medcv commented on the issue:

https://github.com/apache/flink/pull/6083
  
@dmpour23 Thanks! I updated the import as`*.util.serialization` was 
Deprecated!



---


[GitHub] flink issue #6083: [FLINK-8983] End-to-end test: Confluent schema registry

2018-06-12 Thread dmpour23
Github user dmpour23 commented on the issue:

https://github.com/apache/flink/pull/6083
  
Hi would this work for KeyedDeserialization/KeyedSerializationSchema.
Should the import be: 

> org.apache.flink.api.common.serialization.DeserializationSchema




---


[GitHub] flink issue #6083: [FLINK-8983] End-to-end test: Confluent schema registry

2018-06-06 Thread medcv
Github user medcv commented on the issue:

https://github.com/apache/flink/pull/6083
  
@tillrohrmann Thanks a lot for doing the review!


---


[GitHub] flink issue #6083: [FLINK-8983] End-to-end test: Confluent schema registry

2018-06-02 Thread tillrohrmann
Github user tillrohrmann commented on the issue:

https://github.com/apache/flink/pull/6083
  
Thanks for opening the PR @medcv. I will try to take a look next week.


---


[GitHub] flink issue #6083: [FLINK-8983] End-to-end test: Confluent schema registry

2018-05-31 Thread medcv
Github user medcv commented on the issue:

https://github.com/apache/flink/pull/6083
  
@tillrohrmann I would appreciate if you review or assign a reviewer to this 
PR.


---


[GitHub] flink issue #6083: [FLINK-8983] End-to-end test: Confluent schema registry

2018-05-26 Thread medcv
Github user medcv commented on the issue:

https://github.com/apache/flink/pull/6083
  
@tillrohrmann please review


---