Hi all, I use hadoop-0.21.0 distribution. I have a large number of small files (KB). Is there any efficient way of handling it in hadoop?
I have heard that solution for that problem is using:
1. HAR (hadoop archives)
2. cat on files
I would like to know if there are any other solutions for processing large
number of small files.
Regards,
Naveen Mahale
