Hi, when looking at PreDistinct I notice that calls to emitter.emit(...) are stored in memory until more than 'flushEvery' records are found. How does this batching impact performance, since the calls to emit(...) are not batched in the cleanup method but called in a loop?
Is there an easy way to find the best size for 'flushEvery' other than try and error? Best regards, Leen
