Hi,

Is it possible to run bucketed ORC queries in Spark 1.6?

The folder structure looks like this:
<partition1>/
    bucket1.orc
    bucket2.orc
    bucket3.orc

The Spark SQL query would look like `select * from <table> where partition = 
partition1 and bucket = bucket1`. This query should read only the `bucket1.orc` 
file and skip the other buckets.
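
To make the intent concrete, here is a minimal sketch in Python of the file selection I have in mind. It assumes the layout is exactly `<table_root>/<partition>/<bucket>.orc`; the root `/data/my_table` and the helper name `bucket_file_path` are hypothetical and used only for illustration.

```python
import posixpath


def bucket_file_path(table_root, partition, bucket):
    """Return the path of the one ORC file for a (partition, bucket) pair.

    Assumes the layout <table_root>/<partition>/<bucket>.orc.
    """
    return posixpath.join(table_root, partition, bucket + ".orc")


# Hypothetical table root, used only for illustration.
path = bucket_file_path("/data/my_table", "partition1", "bucket1")
print(path)

# Assumed follow-up in PySpark (not executed here; in Spark 1.6 the ORC
# data source needs a HiveContext):
# df = sqlContext.read.format("orc").load(path)
```

Loading the file by path like this would bypass the query planner entirely, so it is only a workaround. What I am really asking is whether Spark SQL can do this pruning itself from the `where` clause.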

Is this possible with Spark 1.6? If so, please let me know how to achieve it.


Thanks
Manjunath Shetty
