Are .bz2 extensions supported in Hadoop 18.3

2009-06-24 Thread Usman Waheed
Hi All, Can I map/reduce logs that have the .bz2 extension in Hadoop 18.3? I tried, but the output was not what I expected: it differed from what I got when my data was in uncompressed format. Thanks, Usman
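A discrepancy like the one described above can first be narrowed down outside Hadoop. The following is a minimal local sketch, not part of any Hadoop API: it assumes a plain copy and a .bz2 copy of the same log are available on local disk, and the helper names `line_digest` and `same_records` are hypothetical. It only checks that the compressed file decompresses to the same lines as the uncompressed one; it says nothing about whether a given Hadoop build can read .bz2 input.

```python
import bz2
import hashlib


def line_digest(path, opener=open):
    """Return (line_count, sha256 hex digest) of the file's (decompressed) bytes."""
    digest = hashlib.sha256()
    count = 0
    # opener is either the built-in open or bz2.open; both accept (path, "rb")
    with opener(path, "rb") as handle:
        for line in handle:
            digest.update(line)
            count += 1
    return count, digest.hexdigest()


def same_records(plain_path, bz2_path):
    """True if the .bz2 file decompresses to exactly the plain file's content."""
    return line_digest(plain_path) == line_digest(bz2_path, opener=bz2.open)
```

If `same_records("access.log", "access.log.bz2")` (hypothetical file names) returns True, the archive itself is probably fine, and the difference in job output would more likely come from how the job reads its input.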

Re: Are .bz2 extensions supported in Hadoop 18.3

2009-06-24 Thread jason hadoop
I believe the Cloudera 18.3 distribution supports bzip2. On Wed, Jun 24, 2009 at 3:45 AM, Usman Waheed usm...@opera.com wrote: Hi All, Can I map/reduce logs that have the .bz2 extension in Hadoop 18.3? I tried but interestingly the output was not what I expected versus what I got when my data was in

RE: Are .bz2 extensions supported in Hadoop 18.3

2009-06-24 Thread Gross, Danny
24, 2009 10:09 AM To: core-user@hadoop.apache.org Subject: Re: Are .bz2 extensions supported in Hadoop 18.3 The version (18.3) I am running in my cluster is the tarball I got from hadoop.apache.org. So you are suggesting that I use the Cloudera 18.3, which supports bzip2, correct? Thanks, Usman I

Re: Are .bz2 extensions supported in Hadoop 18.3

2009-06-24 Thread Usman Waheed
Gross -----Original Message----- From: Usman Waheed [mailto:usm...@opera.com] Sent: Wednesday, June 24, 2009 10:09 AM To: core-user@hadoop.apache.org Subject: Re: Are .bz2 extensions supported in Hadoop 18.3 The version (18.3) I am running in my cluster is the tarball I got from

Re: Are .bz2 extensions supported in Hadoop 18.3

2009-06-24 Thread John Heidemann
On Wed, 24 Jun 2009 12:45:59 +0200, Usman Waheed wrote: Hi All, Can I map/reduce logs that have the .bz2 extension in Hadoop 18.3? I tried but interestingly the output was not what I expected versus what I got when my data was in uncompressed format. Thanks, Usman Not AFAIK, but we have added

RE: Are .bz2 extensions supported in Hadoop 18.3

2009-06-24 Thread Gross, Danny
platform!) Hope it helps. Best regards, Danny -----Original Message----- From: Usman Waheed [mailto:usm...@opera.com] Sent: Wednesday, June 24, 2009 10:32 AM To: core-user@hadoop.apache.org Subject: Re: Are .bz2 extensions supported in Hadoop 18.3 Hi Danny, Hmmm, makes me wonder whether I might be doing

Re: Are .bz2 extensions supported in Hadoop 18.3

2009-06-24 Thread Usman Waheed
might try Pig (this is such a cool platform!) Hope it helps. Best regards, Danny -----Original Message----- From: Usman Waheed [mailto:usm...@opera.com] Sent: Wednesday, June 24, 2009 10:32 AM To: core-user@hadoop.apache.org Subject: Re: Are .bz2 extensions supported in Hadoop 18.3 Hi Danny

Re: Are .bz2 extensions supported in Hadoop 18.3

2009-06-24 Thread Christophe Bisciglia
This is correct - thanks for the note Jason. You can see the current patch list for Cloudera's Distribution (based on 18.3) at: http://www.cloudera.com/hadoop-manifest In addition to Bzip2, we have patched in: DBInputFormat, the fair scheduler, job level task limiting, soft fd leak fix, a fix for

Re: Are .bz2 extensions supported in Hadoop 18.3

2009-06-24 Thread Usman Waheed
Very cool, we are using Debian and I checked Cloudera's website. You have packages for the Debian platform. Will check it out and install on a test cluster. Thanks much, Usman This is correct - thanks for the note Jason. You can see the current patch list for Cloudera's Distribution (based