Thanks. Unfortunately I dont have control over how data is inserted and the
table is not partitioned. 

The reason the sub directories are being created is because when Tez does an
INSERT into a table from a UNION query it creates sub directories so that it
can write in parallel. 

I've also realized that its only with Parquet ... csv works fine



--
Sent from: http://apache-spark-user-list.1001560.n3.nabble.com/

---------------------------------------------------------------------
To unsubscribe e-mail: user-unsubscr...@spark.apache.org

Reply via email to