Hmm, this is strange. Which version of Kafka client do you use while running it 
with Beam?

> On 1 Feb 2022, at 16:56, Utkarsh Parekh <utkarsh.s.par...@gmail.com> wrote:
> 
> Hi Alexey, 
> 
> First of all, thank you for the response! Yes I did have it in Consumer 
> configuration and try to increase "session.timeout".
> 
> From consumer side so far I've following settings:
> props.put("sasl.mechanism", SASL_MECHANISM);
> props.put("security.protocol", SECURITY_PROTOCOL);
> props.put("sasl.jaas.config", saslJaasConfig);
> props.put("request.timeout.ms <http://request.timeout.ms/>", 60000);
> props.put("session.timeout.ms <http://session.timeout.ms/>", 60000);
> props.put(ConsumerConfig.AUTO_OFFSET_RESET_CONFIG, AUTO_OFFSET_RESET_CONFIG);
> props.put(ConsumerConfig.GROUP_ID_CONFIG, consumerGroup);
> 
> It works fine using following code in Databricks Notebook. The problem has 
> been occurring when I run it through Apache beam and KafkaIO (Just providing 
> more context if that may help you to understand problem)
> 
> val df = spark.readStream
>     .format("kafka")
>     .option("subscribe", TOPIC)
>     .option("kafka.bootstrap.servers", BOOTSTRAP_SERVERS)
>     .option("kafka.sasl.mechanism", "PLAIN")
>     .option("kafka.security.protocol", "SASL_SSL")
>     .option("kafka.sasl.jaas.config", EH_SASL)
>     .option("kafka.request.timeout.ms <http://kafka.request.timeout.ms/>", 
> "60000")
>     .option("kafka.session.timeout.ms <http://kafka.session.timeout.ms/>", 
> "60000")
>     .option("failOnDataLoss", "false")
> //.option("kafka.group.id <http://kafka.group.id/>", "testsink")
> .option("startingOffsets", "latest")
>     .load()
> 
> Utkarsh
> 
> On Tue, Feb 1, 2022 at 6:20 AM Alexey Romanenko <aromanenko....@gmail.com 
> <mailto:aromanenko....@gmail.com>> wrote:
> Hi Utkarsh,
> 
> Can it be related to this configuration problem?
> https://docs.microsoft.com/en-us/azure/event-hubs/apache-kafka-troubleshooting-guide#no-records-received
>  
> <https://docs.microsoft.com/en-us/azure/event-hubs/apache-kafka-troubleshooting-guide#no-records-received>
> 
> Did you check timeout settings?
> 
> —
> Alexey        
> 
> 
>> On 1 Feb 2022, at 02:27, Utkarsh Parekh <utkarsh.s.par...@gmail.com 
>> <mailto:utkarsh.s.par...@gmail.com>> wrote:
>> 
>> Hello,
>> 
>> I'm doing POC with KafkaIO and spark runner on Azure Databricks. I'm trying 
>> to create a simple streaming app with Apache Beam, where it reads data from 
>> an Azure event hub and produces messages into another Azure event hub. 
>> 
>> I'm creating and running spark jobs on Azure Databricks.
>> 
>> The problem is the consumer (uses SparkRunner) is not able to read data from 
>> Event hub (queue). There is no activity and no errors on the Spark cluster.
>> 
>> I would appreciate it if anyone could help to fix this issue.
>> 
>> Thank you
>> 
>> Utkarsh
> 

Reply via email to