Hi,
I am trying to run a join on two directories that contain Parquet
files. My query reads:
select * from hdfs.root.`parquet1` as t1 join hdfs.root.`parquet2` as t2
on t1.field1 = t2.field1;
(The parquet1 and parquet2 directories contain Parquet files.)
It fails with the error "Field References Must be Singular Names".
However, when I select only some of the fields it works fine:
select t1.field2, t2.field2 from hdfs.root.`parquet1` as t1 join
hdfs.root.`parquet2` as t2 on t1.field1 = t2.field1;
Surprisingly, when I run the following query (on individual Parquet files
rather than directories), it also works fine:
select * from hdfs.root.`file1.parquet` as t1 join
hdfs.root.`file2.parquet` as t2 on t1.field1 = t2.field1;
Could someone please help me with this?
Regards,
Usman Ali