Parquet is default file format for apache drill
so you do not need to give a parquet file for a drill query. Instead give
the folder path which contains the files.

eg: select * from hdfs_storage>.<workspace>.`folder1` will query all the
parquet files in folder1

On Tue, Feb 4, 2020 at 9:55 PM Vishal Jadhav (BLOOMBERG/ 731 LEX) <
vjad...@bloomberg.net> wrote:

> Hello Drillers,
>
> Need some help with the hdfs + parquet files.
>
> I have configured the HDFS storage with parquet & csv format plugins.
>
> I can query the - <hdfs_storage>.<csv_ws_name>.`*.csv` correctly. Also, I
> have a similar directory structure for the parquet files (in a different
> directory), But, not able to query it.
>
> Show files works fine.
> (1) The following query works fine -
> show files from <hdfs_storage>.<workspace>
>
> (2) select * from <hdfs_storage>.<workspace>.`*.parquet` limit 4
> Fails with -
> org.apache.drill.common.exceptions.UserRemoteException: SYSTEM ERROR:
> NoSuchElementException
>
> (3) select * from hdfs_storage>.<workspace>.`xyz.parquet`;
> fails with -
> org.apache.drill.common.exceptions.UserRemoteException: SYSTEM ERROR:
> RemoteException:/path/xyz.parquet (is not a directory)
>
> Please let me know, if I am doing something wrong here.
>
> Thank you!
> - Vishal



-- 
Nitin Pawar

Reply via email to