Hi All,
Can I map/reduce logs that have the .bz2 extension in Hadoop 18.3?
I tried, but the output was not what I expected: it differed from what
I got when my data was in uncompressed format.
Thanks,
Usman
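A minimal sketch of one possible cause, assuming the job reads the .bz2 file without a bzip2 codec available (an assumption, not something confirmed in this thread): the line reader would then see raw compressed bytes instead of log lines. The Python below is not Hadoop code; it only illustrates the difference with the standard bz2 module, and the sample log lines are made up.

```python
import bz2

# Hypothetical sample log lines, made up for illustration only.
log_lines = [
    "2009-06-24 10:09:00 GET /index.html 200",
    "2009-06-24 10:09:01 GET /about.html 404",
]
raw = ("\n".join(log_lines) + "\n").encode("utf-8")

# The bytes that would be stored on disk in a .bz2 file.
compressed = bz2.compress(raw)


def split_lines(data):
    """Split a byte buffer on line breaks, as a plain-text line reader would."""
    return data.splitlines()


# With bzip2 decoding, the original records come back unchanged.
decoded_records = split_lines(bz2.decompress(compressed))

# Without decoding, the same file is split into binary fragments.
undecoded_records = split_lines(compressed)

print(decoded_records == [line.encode("utf-8") for line in log_lines])  # True
print(compressed[:3])  # b'BZh', the bzip2 magic header
print(undecoded_records == decoded_records)  # False
```

If the records reaching the mapper start with bytes like BZh rather than a log line, the input is most likely not being decompressed.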
I believe the Cloudera 18.3 distribution supports bzip2.
On Wed, Jun 24, 2009 at 3:45 AM, Usman Waheed usm...@opera.com wrote:
Hi All,
Can I map/reduce logs that have the .bz2 extension in Hadoop 18.3?
I tried but interestingly the output was not what I expected versus what I
got when my data was in uncompressed format.
-Original Message-
From: Usman Waheed [mailto:usm...@opera.com]
Sent: Wednesday, June 24, 2009 10:09 AM
To: core-user@hadoop.apache.org
Subject: Re: Are .bz2 extensions supported in Hadoop 18.3
The version (18.3) I am running in my cluster is the tarball I got from
hadoop.apache.org.
So you are suggesting that I use the Cloudera 18.3 distribution, which
supports bzip2, correct?
Thanks,
Usman
Not AFAIK, but we have added [...] might try Pig (this is such a cool
platform!)
Hope it helps.
Best regards,
Danny
-Original Message-
From: Usman Waheed [mailto:usm...@opera.com]
Sent: Wednesday, June 24, 2009 10:32 AM
To: core-user@hadoop.apache.org
Subject: Re: Are .bz2 extensions supported in Hadoop 18.3
Hi Danny,
Hmmm makes me wonder that I might be doing [...]
This is correct - thanks for the note Jason. You can see the current
patch list for Cloudera's Distribution (based on 18.3) at:
http://www.cloudera.com/hadoop-manifest
In addition to Bzip2, we have patched in: DBInputFormat, the fair
scheduler, job level task limiting, soft fd leak fix, a fix for
Very cool, we are using Debian and I checked Cloudera's website. You have
packages for the Debian platform.
Will check it out and install on a test cluster.
Thanks much,
Usman