Good catch, Pedro!

Russell: Not sure how you can be using Hive 0.9 on EMR, since EMR only supports 
up to Hive 0.7.1.

Check this for details: http://aws.amazon.com/elasticmapreduce/faqs/#hive-9

Mark

----- Original Message -----
From: "Russell Jurney" <[email protected]>
To: [email protected]
Sent: Wednesday, May 30, 2012 3:52:27 PM
Subject: Re: HIVE and S3 via EMR?


You = Excellent 


Russell Jurney 
twitter.com/rjurney 
[email protected] 
datasyndrome.com 

On May 29, 2012, at 11:06 PM, Pedro Figueiredo < [email protected] > wrote: 








On 30 May 2012, at 02:17, Russell Jurney wrote: 


I've made the bucket - which is derived from the Enron emails - available at 
s3://rjurney_public_web/from_to_date and a sample is available at 
http://s3.amazonaws.com/rjurney_public_web/from_to_date/part-m-00004 



The problem is that your bucket name contains the '_' character. When Hive (or 
any other client, really) tries to resolve the hostname 
rjurney_public_web.s3.amazonaws.com it fails, because '_' is not a legal 
character in a DNS hostname. It's got nothing to do with Hive, or your table 
definition. 


You can have a look at "Rules for bucket naming" in 
http://docs.amazonwebservices.com/AmazonS3/latest/dev/BucketRestrictions.html 
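
If you want to check a bucket name before pointing Hive at it, something along 
these lines might help. It's only a rough sketch in Python: the function names 
are made up for illustration, the check covers just the hostname-label part of 
the naming rules (lowercase letters, digits and hyphens, 3 to 63 characters 
overall) and not the full S3 rule set, and "rjurney-public-web" is a 
hypothetical replacement name, not an existing bucket.

```python
import re

# One hostname label: lowercase letters, digits and hyphens only,
# starting and ending with a letter or digit, at most 63 characters.
_LABEL = re.compile(r"[a-z0-9]([a-z0-9-]{0,61}[a-z0-9])?")


def is_dns_safe_bucket(name: str) -> bool:
    """Return True if `name` could be used as a hostname prefix.

    Simplified check (an assumption, not the complete S3 rules):
    overall length 3-63, and every dot-separated label is a valid
    lowercase hostname label.
    """
    if not 3 <= len(name) <= 63:
        return False
    return all(_LABEL.fullmatch(label) is not None for label in name.split("."))


def virtual_host(name: str) -> str:
    """Build the bucket-as-hostname form that the client tries to resolve."""
    return f"{name}.s3.amazonaws.com"


if __name__ == "__main__":
    for bucket in ("rjurney_public_web", "rjurney-public-web"):
        print(bucket, virtual_host(bucket), is_dns_safe_bucket(bucket))
```

The first name is rejected because of the underscores; the hyphenated variant 
passes the check. 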


Cheers, 


Pedro 

Pedro Figueiredo 
Skype: pfig.89clouds 
http://89clouds.com/ - Big Data Consulting 


