Hi, I want to combine the data that are in different HDFS filesystems, for them to be executed in one job. Is it possible to do this with MR, or there is another Apache tool that allows me to do this?
Eg. Hdfs data in Cluster1 ----v Hdfs data in Cluster2 -> this job reads the data from Cluster1, 2 Thanks, -- Best regards,