Hi all, Hive uses JDBM for the replicate table in map join. When multiple map tasks are running on the same node, will there be multiple copis of JDBM file generated, or will all the map task share the same copy? If it is the later, which mapper generates the file, and how to synchronize other mappers?
Thanks, -Gang