Hey list,

I've got some CSV data I'm importing from S3. I can create the external table well enough, and I can also do a CREATE TABLE ... AS SELECT ... from it to pull the data internal to Spark.
Here's the HQL for my external table: https://gist.github.com/bradhe/11126024

Now I'd like to add partitioning and clustering to my permanent table, so I create a new table and try to do an INSERT ... SELECT into it. Here's the HQL for my internal, partitioned table and the INSERT ... SELECT: https://gist.github.com/bradhe/11126047

Oddly, the query is scheduled... but it never makes any progress! http://i.imgur.com/vXvgpzD.png

Is this a bug? Am I doing something dumb?

Thanks,
Brad Heller