Pramod Immaneni created APEXMALHAR-2518:
-------------------------------------------
Summary: Kafka input operator stops reading tuples when there is an UNKNOWN_MEMBER_ID error during committed offset processing
Key: APEXMALHAR-2518
URL: https://issues.apache.org/jira/browse/APEXMALHAR-2518
Project: Apache Apex Malhar
Issue Type: Bug
Reporter: Pramod Immaneni
Assignee: Pramod Immaneni

The Kafka 0.9 operator stores offsets that are completely processed and no longer needed (committed offsets) back in Kafka. It does so by making a Kafka API call. If the Kafka server responds to this call with an UNKNOWN_MEMBER_ID error, the Kafka consumer state changes to needing partition re-assignment, and the consumer returns no further messages. A couple of other errors result in the same state, including the one returned when a rebalance is in progress.

The exact cause of this error is not known, but given the conditions surrounding the application, the following is the likely explanation. When the operator stalls temporarily because of back-pressure exerted by the slow downstream, the operator's Kafka consumer thread that reads messages from Kafka eventually stalls as well. The stalled thread makes no Kafka consumer API calls, so no heartbeats are sent to the Kafka server. The server can then evict the consumer after a timeout period, which could have caused the UNKNOWN_MEMBER_ID error.
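
The suspected sequence (a stalled consumer thread, missed heartbeats, eviction, then a failed offset commit) can be illustrated with a small toy model. This is only a sketch under stated assumptions: it does not use the Kafka client library, and ToyCoordinator, StallSimulation, the one-second poll interval and the 30-second session timeout are all hypothetical names and values chosen for illustration. It assumes, as described above, that heartbeats are sent only while the consumer thread keeps making consumer API calls.

```java
import java.util.HashMap;
import java.util.Map;

/** Toy stand-in for a group coordinator with a session timeout. Not Kafka's API. */
final class ToyCoordinator {
    enum CommitResult { OK, UNKNOWN_MEMBER_ID }

    private final long sessionTimeoutMs;
    // Member id -> time of the last heartbeat seen from that member.
    private final Map<String, Long> lastHeartbeatMs = new HashMap<>();

    ToyCoordinator(long sessionTimeoutMs) {
        this.sessionTimeoutMs = sessionTimeoutMs;
    }

    void join(String memberId, long nowMs) {
        lastHeartbeatMs.put(memberId, nowMs);
    }

    /** Records a heartbeat; returns false if the member was already evicted. */
    boolean heartbeat(String memberId, long nowMs) {
        evictExpired(nowMs);
        if (!lastHeartbeatMs.containsKey(memberId)) {
            return false;
        }
        lastHeartbeatMs.put(memberId, nowMs);
        return true;
    }

    /** An offset commit from an evicted member is rejected. */
    CommitResult commit(String memberId, long nowMs) {
        evictExpired(nowMs);
        return lastHeartbeatMs.containsKey(memberId)
                ? CommitResult.OK
                : CommitResult.UNKNOWN_MEMBER_ID;
    }

    // Drop every member whose last heartbeat is older than the session timeout.
    private void evictExpired(long nowMs) {
        lastHeartbeatMs.values().removeIf(last -> nowMs - last > sessionTimeoutMs);
    }
}

final class StallSimulation {
    private static final long POLL_INTERVAL_MS = 1_000; // assumed poll cadence

    /**
     * Runs a consumer normally for five polls, then through a quiet period of
     * quietPeriodMs, and finally attempts an offset commit. If threadStalled is
     * true, no heartbeats are sent during the quiet period.
     */
    static ToyCoordinator.CommitResult run(long sessionTimeoutMs,
                                           long quietPeriodMs,
                                           boolean threadStalled) {
        ToyCoordinator coordinator = new ToyCoordinator(sessionTimeoutMs);
        String member = "consumer-1";
        long now = 0;
        coordinator.join(member, now);

        // Normal operation: each poll also carries a heartbeat.
        for (int i = 0; i < 5; i++) {
            now += POLL_INTERVAL_MS;
            coordinator.heartbeat(member, now);
        }

        // Quiet period: a stalled thread makes no calls, so sends no heartbeats.
        long quietEnd = now + quietPeriodMs;
        while (now + POLL_INTERVAL_MS <= quietEnd) {
            now += POLL_INTERVAL_MS;
            if (!threadStalled) {
                coordinator.heartbeat(member, now);
            }
        }
        now = quietEnd;

        // Back-pressure clears and the operator commits processed offsets.
        return coordinator.commit(member, now);
    }

    public static void main(String[] args) {
        System.out.println(run(30_000, 10_000, true));  // prints OK
        System.out.println(run(30_000, 60_000, true));  // prints UNKNOWN_MEMBER_ID
        System.out.println(run(30_000, 60_000, false)); // prints OK
    }
}
```

In this model a stall shorter than the session timeout is harmless, while a stall longer than the timeout makes the next commit fail with UNKNOWN_MEMBER_ID, which matches the behaviour suspected in this issue. The model says nothing about how the real consumer recovers afterwards.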