Use Phoenix hints with Spark Integration [main use case: block cache disable]

Roberto Coluccio Wed, 30 Aug 2017 04:12:08 -0700

Hello folks,

I'm trying to prevent the records I select from my Spark application from being added to the block cache when reading them as a DataFrame (e.g. sqlContext.phoenixTableAsDataFrame(myTable, myColumns, myPredicate, myZkUrl, myConf)).

I know I can force no caching on a per-query basis when issuing SQL queries by using the /*+ NO_CACHE */ hint. I also know I can disable caching on a table-specific or column-family-specific basis through an ALTER TABLE HBase shell command.
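
As an illustration of the per-query approach mentioned above, here is a minimal sketch in Scala that builds a SELECT carrying the NO_CACHE hint and runs it over plain JDBC. It does not go through the Phoenix-Spark API, so it does not answer the question itself. The object name, the helper names and the table and column names are hypothetical, and the JDBC part assumes the Phoenix client jar is on the classpath and the ZooKeeper quorum is reachable.

```scala
import java.sql.DriverManager

object NoCacheQuerySketch {
  // Builds a SELECT with the NO_CACHE hint placed right after the SELECT keyword.
  // Table, column and predicate values are passed in as plain strings (no escaping).
  def noCacheSelect(table: String, columns: Seq[String], predicate: Option[String]): String = {
    val where = predicate.map(p => s" WHERE $p").getOrElse("")
    s"SELECT /*+ NO_CACHE */ ${columns.mkString(", ")} FROM $table$where"
  }

  // Runs the hinted query over JDBC and prints the first column of each row.
  // Assumption: the Phoenix client jar is on the classpath and zkUrl is reachable.
  def printFirstColumn(zkUrl: String, sql: String): Unit = {
    val conn = DriverManager.getConnection(s"jdbc:phoenix:$zkUrl")
    try {
      val stmt = conn.createStatement()
      try {
        val rs = stmt.executeQuery(sql)
        while (rs.next()) println(rs.getObject(1))
      } finally stmt.close()
    } finally conn.close()
  }

  def main(args: Array[String]): Unit = {
    // Only builds and prints the statement; no cluster is needed for this part.
    println(noCacheSelect("MY_TABLE", Seq("ID", "COL1"), Some("COL1 = 'x'")))
  }
}
```

Running main only prints the generated statement; printFirstColumn would have to be called explicitly against a live cluster.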

What I don't know is how to do so when using the Phoenix-Spark APIs. I think my problem can be stated as a more general question: *how can Phoenix hints be specified when using Phoenix-Spark APIs?*

For my specific use case, I tried setting the property /hfile.block.cache.size=0/ in a Configuration object before creating the DataFrame, but I found that the records returned by the underlying scan were still cached.


Thank you in advance for your help.

Best regards,
Roberto
