Csaba Ringhofer has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12985 )

Change subject: IMPALA-8381: WIP - Optimize ParquetPlainEncoder::DecodeBatch() 
for simple types
......................................................................


Patch Set 2:

(3 comments)

http://gerrit.cloudera.org:8080/#/c/12985/2//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/12985/2//COMMIT_MSG@37
PS2, Line 37:  in
nit: long line, wrap at 72 when possible


http://gerrit.cloudera.org:8080/#/c/12985/1/be/src/exec/parquet/parquet-common.h
File be/src/exec/parquet/parquet-common.h:

http://gerrit.cloudera.org:8080/#/c/12985/1/be/src/exec/parquet/parquet-common.h@237
PS1, Line 237:       int fixed_len_size, int64_t num_values, int64_t stride, 
InternalType* v);
> Is it a special case? A 96 bit value takes 12 bytes.
sizeof(TimestampValue) == 16 due to padding (to ensure the 8 byte alignment of 
one of its members), but we only overwrite the first 12 byte of it, that is why 
12 is used instead of sizeof(TimestampValue).

The padding doesn't make sense in reality, as the slots are not aligned, but we 
couldn't #pragma pack TimestampValue due to its boost struct members. 
IMPALA-7367 contains some details about this.


http://gerrit.cloudera.org:8080/#/c/12985/1/be/src/exec/parquet/parquet-common.h@477
PS1, Line 477:
> We never bounds check the output buffer, only the input buffer, and FIXED_L
Sorry, another idea for the name: has_variable_length
I think that this would make the reason behind's BYTE_ARRAY's special handling 
even clearer.



--
To view, visit http://gerrit.cloudera.org:8080/12985
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I57b7d2573bb6dfd038e581acb3bd8ea1565aa20d
Gerrit-Change-Number: 12985
Gerrit-PatchSet: 2
Gerrit-Owner: Daniel Becker <daniel.bec...@cloudera.com>
Gerrit-Reviewer: Csaba Ringhofer <csringho...@cloudera.com>
Gerrit-Reviewer: Daniel Becker <daniel.bec...@cloudera.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Comment-Date: Thu, 11 Apr 2019 15:21:45 +0000
Gerrit-HasComments: Yes

Reply via email to