Dan Hecht has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/8523 )

Change subject: IMPALA-5931: Generates scan ranges in planner for s3/adls
......................................................................


Patch Set 11:

(2 comments)

Had a question about the thrift, I will look at the backend side too.

http://gerrit.cloudera.org:8080/#/c/8523/11/common/thrift/PlanNodes.thrift
File common/thrift/PlanNodes.thrift:

http://gerrit.cloudera.org:8080/#/c/8523/11/common/thrift/PlanNodes.thrift@206
PS11, Line 206: Hdfs
I wonder if we should just drop Hdfs from this name given that this is used for 
other "filesystem" like storage (i.e. S3, ADLS). Ok to ignore if you prefer to 
keep it for some reason (maybe the reason to keep it is consistency with e.g. 
HdfsScanNode, which also handles S3/ADLS since they go through the same 
FileSystem & libhdfs access path).


http://gerrit.cloudera.org:8080/#/c/8523/11/common/thrift/PlanNodes.thrift@228
PS11, Line 228: 4: optional THdfsFileSplitGeneratorSpec 
hdfs_file_split_generator_spec
Isn't it that the frontend generates either a list of (concrete) scan ranges, 
or it just gives a list of FileSplitGeneratorSpec?  i.e. I would have expected 
this to appear as the alternative to Planner.TScanRangeLocationList since the 
that is the "concrete" scan range list that this thing is going to effectively 
explode into.
i.e. it seems clearer to me to leave TScanRange as the "concrete" range that 
the coordinator will generate using this new "spec".
So what's the reasoning for putting this here instead of higher up in the 
thrift plan tree?



--
To view, visit http://gerrit.cloudera.org:8080/8523
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I326065adbb2f7e632814113aae85cb51ca4779a5
Gerrit-Change-Number: 8523
Gerrit-PatchSet: 11
Gerrit-Owner: Vuk Ercegovac <vercego...@cloudera.com>
Gerrit-Reviewer: Dan Hecht <dhe...@cloudera.com>
Gerrit-Reviewer: Dimitris Tsirogiannis <dtsirogian...@cloudera.com>
Gerrit-Reviewer: Lars Volker <l...@cloudera.com>
Gerrit-Reviewer: Tim Armstrong <tarmstr...@cloudera.com>
Gerrit-Reviewer: Vuk Ercegovac <vercego...@cloudera.com>
Gerrit-Comment-Date: Wed, 07 Mar 2018 22:34:36 +0000
Gerrit-HasComments: Yes

Reply via email to