Handling of small files in hadoop

Naveen Mahale Wed, 14 Sep 2011 00:53:05 -0700

Hi all,

I use hadoop-0.21.0 distribution. I have a large number of small files (KB).
Is there any efficient way of handling it in hadoop?


I have heard that solution for that problem is using:
            1. HAR (hadoop archives)
            2. cat on files

I would like to know if there are any other solutions for processing large
number of small files.

Regards,
Naveen Mahale

Handling of small files in hadoop

Reply via email to