I am using zeppelin to execute queries against my spark cluster. I have
data in files which can be access via scala to DataFrame and then
registered as a table OR through Hive (as the same file is registered as
hive table)

e.g.

bank sample is loaded as spark DF ves bank table registered as hive table

after both I can write

%sql

"select age, balance from bank"

"select age, balance from bank_hive"

In term of execution of the SQL code, is one faster then the other?

Reply via email to