[
https://issues.apache.org/jira/browse/HIVE-14143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15360049#comment-15360049
]
Nemon Lou commented on HIVE-14143:
----------------------------------
Agreed. As describe in TableScanDesc.java
{code}
// Both neededColumnIDs and neededColumns should never be null.
// When neededColumnIDs is an empty list,
// it means no needed column (e.g. we do not need any column to evaluate
// SELECT count(*) FROM t).
private List<Integer> neededColumnIDs;
{code}
I must has been misleading by the following code in HiveInputFormat.java:
{code}
private void pushProjection(final JobConf newjob, final StringBuilder
readColumnsBuffer,
final StringBuilder readColumnNamesBuffer) {
String readColIds = readColumnsBuffer.toString();
String readColNames = readColumnNamesBuffer.toString();
boolean readAllColumns = readColIds.isEmpty() ? true : false;
newjob.setBoolean(ColumnProjectionUtils.READ_ALL_COLUMNS, readAllColumns);
...
}
{code}
The solution is not clear for me . Any suggestions?
> RawDataSize of RCFile is zero after analyze
> --------------------------------------------
>
> Key: HIVE-14143
> URL: https://issues.apache.org/jira/browse/HIVE-14143
> Project: Hive
> Issue Type: Bug
> Components: Statistics
> Affects Versions: 1.2.1, 2.1.0
> Reporter: Nemon Lou
> Assignee: Nemon Lou
> Priority: Minor
> Attachments: HIVE-14143.patch
>
>
> After running the following analyze command ,rawDataSize becomes zero for
> rcfile tables.
> {noformat}
> analyze table RCFILE_TABLE compute statistics ;
> {noformat}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)