tao meng created HUDI-2674:
------------------------------
Summary: hudi hive reader should not log read values
Key: HUDI-2674
URL: https://issues.apache.org/jira/browse/HUDI-2674
Project: Apache Hudi
Issue Type: Bug
Components: Hive Integration
Affects Versions: 0.9.0
Environment: hudi 0.9.0
hive 3.1.1
hadoop 3.1.1
Reporter: tao meng
Assignee: tao meng
Fix For: 0.10.0
now when we use hive to query hudi table and set
hive.input.format=org.apache.hudi.hadoop.hive.HoodieCombineHiveInputFormat;
all read values will be print. This can lead to performance problems and data
security problems,
as:
xxxxxxx 20:10:45,045 | INFO | main | Reading from record reader |
HoodieCombineRealtimeRecordReader.java:69
xxxxxx 20:10:45,045 | INFO | main | "values_0.158268513314199_10":
\{"value0":"20211102192749","type0":"Text","value1":"null","type1":"unknown","value2":"null","type2":"unknown","value3":"null","type3":"unknown","value4":"null","type4":"unknown","value5":"16","type5":"IntWritable","value6":"16jack","type6":"Text","value7":"null","type7":"unknown","value8":"null","type8":"unknown","value9":"null","type9":"unknown"}
| HoodieCombineRealtimeRecordReader.java:70
xxxxxxx 20:10:45,045 | INFO | main | Reading from record reader |
HoodieCombineRealtimeRecordReader.java:69
xxxxxxx 20:10:45,045 | INFO | main | "values_0.16924293134429924_10":
\{"value0":"20211102192749","type0":"Text","value1":"null","type1":"unknown","value2":"null","type2":"unknown","value3":"null","type3":"unknown","value4":"null","type4":"unknown","value5":"96","type5":"IntWritable","value6":"96jack","type6":"Text","value7":"null","type7":"unknown","value8":"null","type8":"unknown","value9":"null","type9":"unknown"}
| HoodieCombineRealtimeRecordReader.java:70
2021-11-02 20:10:45,045 | INFO | main | Reading from record reader |
HoodieCombineRealtimeRecordReader.java:69
--
This message was sent by Atlassian Jira
(v8.3.4#803005)