Does anyone have more information about how Chukwa removes duplicates during demux? How does it decide what counts as a duplicate? There are two cases I am thinking of:
1 - we send the same log file to Chukwa twice
2 - the exact same line appears twice in a log file