Spark 1.6 and ORC bucketed queries

Manjunath Shetty H Wed, 01 Apr 2020 20:18:30 -0700

Hi,

Is it possible to run bucketed ORC queries in Spark 1.6?


Folder structure is like this:
<partition1>/
                     bucket1.orc
                     bucket2.orc
                     bucket3.orc

The Spark SQL query would look like `select * from <table> where partition = 
partition1 and bucket = bucket1`, and it should read only the `bucket1.orc` 
file.
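
To make the intended behaviour concrete, here is a minimal sketch of the lookup the query is expected to perform: mapping a partition and bucket to the one file that should be read. The helper name `bucket_file_path` and the root `/data/my_table` are hypothetical, chosen only for illustration. Whether Spark 1.6 can do this pruning from the SQL query itself is exactly the open question, so the sketch only builds the path.

```python
import posixpath


def bucket_file_path(table_root, partition, bucket):
    """Return the path of the single ORC file for one partition/bucket pair.

    Assumes the layout <table_root>/<partition>/<bucket>.orc shown above.
    """
    return posixpath.join(table_root, partition, bucket + ".orc")


# Hypothetical table root; the partition and bucket names mirror the example.
path = bucket_file_path("/data/my_table", "partition1", "bucket1")
print(path)  # /data/my_table/partition1/bucket1.orc

# Assumption, not run here: a path like this could then be loaded directly,
# e.g. hiveContext.read.format("orc").load(path), which sidesteps the SQL
# predicate rather than pruning buckets from it.
```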

Is this kind of bucket pruning possible in Spark 1.6? If so, please let me know how to achieve it.


Thanks
Manjunath Shetty
