[
https://issues.apache.org/jira/browse/METRON-1968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ryan Merriman updated METRON-1968:
----------------------------------
Fix Version/s: Next + 1
> Messages are lost when a parser produces multiple messages and batch size is
> greater than 1
> -------------------------------------------------------------------------------------------
>
> Key: METRON-1968
> URL: https://issues.apache.org/jira/browse/METRON-1968
> Project: Metron
> Issue Type: Bug
> Reporter: Ryan Merriman
> Assignee: Ryan Merriman
> Priority: Major
> Fix For: Next + 1
>
> Time Spent: 15h 20m
> Remaining Estimate: 0h
>
> A bug was discovered where messages are lost when a parser produces multiple
> messages. This happens anytime the batch size for that sensor is set to
> greater than 1. For example, consider a parser that produces 30 messages
> from a single input message. Assume the batch size for this sensor/parser is
> set to 10. The batch is currently flushed only after 10 tuples are received
> and only 10 messages are written out. I think the correct behavior would be
> for 3 batches of 10 messages to be flushed for every tuple and a total of 300
> messages written for every 10 tuples.
> This is happening because the various writer classes/interfaces
> (BulkWriterComponent, BulkMessageWriter, KafkaWriter, etc) assume a 1 to 1
> relationship between messages and tuples. The root cause of this specific
> issue is
> [here|https://github.com/apache/metron/blob/master/metron-platform/metron-writer/src/main/java/org/apache/metron/writer/kafka/KafkaWriter.java#L236].
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)