Prash,

1.      You probably want to use the s3n filesystem, not the s3 one.  If you 
use s3 you need to manage your file blocks manually.  Swap it over to s3n, way 
easier.

2.      This could be hive failing to read the files.  Hive is probably 
assuming that there are no readable files in 'processed' so its saying you have 
no data.  Is the data compressed?  If so, s3 file names need to end in 
gz/bzip/etc

Pat

From: Prashanth R [mailto:[email protected]]
Sent: Monday, April 11, 2011 2:10 PM
To: [email protected]
Subject: External table creation question

Hi,

The Hive documentation describes keyword "external" as following:

The EXTERNAL keyword lets you create a table and provide a LOCATION so that 
Hive does not use a default location for this table. This comes in handy if you 
already have data generated.

I have my data available in a directory in a bucket in s3. I am trying to 
create a table like

CREATE EXTERNAL TABLE IF NOT EXISTS mslog ( TIME_STAMP STRING, SEQ STRING) 
LOCATION 's3:// <bucket name>/processed/'

But the table isnt' populated with the data available at the s3 location. Am i 
missing something here?


--
- Prash

Reply via email to