[ 
https://issues.apache.org/jira/browse/HIVE-21328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17107950#comment-17107950
 ] 

Hive QA commented on HIVE-21328:
--------------------------------



Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/13002950/HIVE-21328.1.patch

{color:red}ERROR:{color} -1 due to no test(s) being added or modified.

{color:green}SUCCESS:{color} +1 due to 17269 tests passed

Test results: 
https://builds.apache.org/job/PreCommit-HIVE-Build/22347/testReport
Console output: https://builds.apache.org/job/PreCommit-HIVE-Build/22347/console
Test logs: http://104.198.109.242/logs/PreCommit-HIVE-Build-22347/

Messages:
{noformat}
Executing org.apache.hive.ptest.execution.TestCheckPhase
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.YetusPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
{noformat}

This message is automatically generated.

ATTACHMENT ID: 13002950 - PreCommit-HIVE-Build

> Call To Hadoop Text getBytes() Without Call to getLength()
> ----------------------------------------------------------
>
>                 Key: HIVE-21328
>                 URL: https://issues.apache.org/jira/browse/HIVE-21328
>             Project: Hive
>          Issue Type: Bug
>          Components: Query Planning
>    Affects Versions: 4.0.0, 3.2.0
>            Reporter: David Mollitor
>            Assignee: David Mollitor
>            Priority: Major
>         Attachments: HIVE-21328.1.patch, HIVE-21328.1.patch
>
>
> I'm not sure if there is actually a bug, but this looks highly suspect:
> {code:java}
>   public Object set(final Object o, final Text text) {
>     return new BytesWritable(text == null ? null : text.getBytes());
>   }
> {code}
> https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/io/parquet/serde/primitive/ParquetStringInspector.java#L104-L106
> There are two components to a Text object.  There are the internal bytes and 
> the length of the bytes.  The two are independent.  I.e., a quick "reset" on 
> the Text object simply sets the internal length counter to zero.  This code 
> is potentially looking at obsolete data that it shouldn't be seeing because 
> it is not considering the length of the Text.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to