Hi,

When looking at PreDistinct, I noticed that records are held in memory
until more than 'flushEvery' of them have accumulated, and only then
passed to emitter.emit(...). How does this buffering affect performance,
given that the cleanup method does not batch the emit(...) calls but
issues them one at a time in a loop?
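
To make the question concrete, here is a minimal, self-contained sketch of
the pattern as I understand it. It is not the actual library code: the
names FlushEverySketch, BufferingDistinct and countEmits are hypothetical,
a plain Consumer stands in for the emitter, and I am assuming the buffer
is a set, so duplicates seen within one flush window collapse before
anything is emitted.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.function.Consumer;

final class FlushEverySketch {

    /** Hypothetical stand-in for the buffering function; not the real class. */
    static final class BufferingDistinct<T> {
        // Assumption: the buffer is a set, so repeated values are stored once.
        private final Set<T> buffer = new HashSet<>();
        private final int flushEvery;
        private final Consumer<T> emitter; // stand-in for emitter.emit(...)

        BufferingDistinct(int flushEvery, Consumer<T> emitter) {
            this.flushEvery = flushEvery;
            this.emitter = emitter;
        }

        /** Buffers one record and flushes once more than flushEvery are held. */
        void process(T input) {
            buffer.add(input);
            if (buffer.size() > flushEvery) {
                cleanup();
            }
        }

        /** Emits every buffered value one at a time in a loop, then clears. */
        void cleanup() {
            for (T value : buffer) {
                emitter.accept(value);
            }
            buffer.clear();
        }
    }

    /** Runs the inputs through the sketch and counts how many emits happen. */
    static int countEmits(int[] inputs, int flushEvery) {
        List<Integer> emitted = new ArrayList<>();
        BufferingDistinct<Integer> fn =
                new BufferingDistinct<>(flushEvery, emitted::add);
        for (int value : inputs) {
            fn.process(value);
        }
        fn.cleanup(); // final flush at the end of the input
        return emitted.size();
    }

    public static void main(String[] args) {
        int[] inputs = {1, 1, 2, 1, 2, 3, 3, 1};
        System.out.println(countEmits(inputs, 0));  // flush after every record
        System.out.println(countEmits(inputs, 10)); // everything fits in the buffer
    }
}
```

If that reading is right, the number of emit(...) calls depends on how
many duplicates fall inside one flush window, and the loop in cleanup
would not batch anything by itself.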

Is there an easy way to find the best value for 'flushEvery' other than
trial and error?

Best regards,
Leen
