Hi,
How can I implement a custom MultipleOutputFormat and specify it as the
output of my Spark job so that I can ensure that there is a unique output
file per key (instead of a unique output file per reducer)?
Thanks
Arpan
Hi,
Arpan Ghosh wrote:
Hi,
How can I implement a custom MultipleOutputFormat and specify it as
the output of my Spark job so that I can ensure that there is a unique
output file per key (instead of a unique output file per reducer)?
I use something like this:
class KeyBasedOutput[T :
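
For reference, here is a minimal sketch of one way to get one output file per key. It assumes Hadoop's old-API `MultipleTextOutputFormat` (package `org.apache.hadoop.mapred.lib`) and a reasonably recent Spark where `saveAsHadoopFile` is available on pair RDDs without extra imports. The names `PerKeyTextOutput` and `PerKeyOutputExample`, the sample data, and the output path are hypothetical, and this is not necessarily the class from the message above.

```scala
import org.apache.hadoop.io.NullWritable
import org.apache.hadoop.mapred.lib.MultipleTextOutputFormat
import org.apache.spark.{HashPartitioner, SparkConf, SparkContext}

// Hypothetical class name, chosen for illustration only.
class PerKeyTextOutput extends MultipleTextOutputFormat[Any, Any] {
  // Replace the key with NullWritable so only the value is written to each line.
  override def generateActualKey(key: Any, value: Any): Any =
    NullWritable.get()

  // Name the output file after the record's key (relative to the output directory),
  // instead of the default per-task name such as part-00000.
  override def generateFileNameForKeyValue(key: Any, value: Any, name: String): String =
    key.toString
}

object PerKeyOutputExample {
  def main(args: Array[String]): Unit = {
    // Local master and sample data are assumptions for this sketch.
    val conf = new SparkConf().setAppName("per-key-output").setMaster("local[2]")
    val sc = new SparkContext(conf)

    val pairs = sc.parallelize(Seq(("a", "1"), ("b", "2"), ("a", "3")))

    pairs
      // Send all records of a key to the same partition, so that only one
      // task ever writes to a given key's file.
      .partitionBy(new HashPartitioner(2))
      .saveAsHadoopFile(
        "/tmp/per-key-output", // hypothetical output directory
        classOf[String],
        classOf[String],
        classOf[PerKeyTextOutput])

    sc.stop()
  }
}
```

The `partitionBy` step matters here: if records with the same key were spread over several partitions, more than one task would try to write the same file name. Keys are also used directly as file names, so this sketch assumes they contain only characters that are valid in a path.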