Prash, 1. You probably want to use the s3n filesystem, not the s3 one. If you use s3 you need to manage your file blocks manually. Swap it over to s3n, way easier.
2. This could be hive failing to read the files. Hive is probably assuming that there are no readable files in 'processed' so its saying you have no data. Is the data compressed? If so, s3 file names need to end in gz/bzip/etc Pat From: Prashanth R [mailto:[email protected]] Sent: Monday, April 11, 2011 2:10 PM To: [email protected] Subject: External table creation question Hi, The Hive documentation describes keyword "external" as following: The EXTERNAL keyword lets you create a table and provide a LOCATION so that Hive does not use a default location for this table. This comes in handy if you already have data generated. I have my data available in a directory in a bucket in s3. I am trying to create a table like CREATE EXTERNAL TABLE IF NOT EXISTS mslog ( TIME_STAMP STRING, SEQ STRING) LOCATION 's3:// <bucket name>/processed/' But the table isnt' populated with the data available at the s3 location. Am i missing something here? -- - Prash
