Re: Too many S3 API calls for simple queries like select and create external table

2022-02-21 Thread Rajesh Balamohan
If you are using parquet format, HIVE-25827 would be causing additional calls to s3 as the footer is read atleast twice. Add to this atleast 9+ list_status calls being made for split gen. ~Rajesh.B On Mon, Feb 21, 2022 at 10:16 AM Sungwoo Park

Re: Too many S3 API calls for simple queries like select and create external table

2022-02-20 Thread Sungwoo Park
My understanding is that additional calls to S3 APi is the price to pay for using the Hadoop library which only emulates FileSystem on top of S3. S3 is not a distributed file system like HDFS, so some of the API calls cannot be optimized in an ideal way. For (i), a more serious problem is the