[ https://issues.apache.org/jira/browse/HAWQ-44?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15003176#comment-15003176 ]

ASF GitHub Bot commented on HAWQ-44:
------------------------------------

Github user shivzone commented on a diff in the pull request:

    https://github.com/apache/incubator-hawq/pull/92#discussion_r44730180
  
    --- Diff: pxf/pxf-service/src/main/java/org/apache/hawq/pxf/service/BridgeOutputBuilder.java ---
    @@ -216,10 +245,53 @@ void fillText(List<OneField> recFields) throws BadRecordException {
             int type = fld.type;
             Object val = fld.val;
             if (DataType.get(type) == DataType.BYTEA) {// from LineBreakAccessor
    -            output = new BufferWritable((byte[]) val);
    +            if (samplingEnabled) {
    +                convertTextDataToLines((byte[]) val);
    +            } else {
    +                output = new BufferWritable((byte[]) val);
    +                outputList.add(output); // TODO break output into lines
    +            }
             } else { // from QuotedLineBreakAccessor
                 String textRec = (String) val;
                 output = new Text(textRec + "\n");
    +            outputList.add(output);
    +        }
    +    }
    +
    +    void convertTextDataToLines(byte[] val) {
    --- End diff --
    
    Please add a comment to this function explaining its purpose, either here or at the call site where it is invoked for sampling.
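
    A rough sketch of the kind of documented function meant here. It assumes the purpose is to split a byte buffer from LineBreakAccessor into newline-terminated records so that sampling can keep or drop each one individually; that is a guess from the diff, not confirmed by the patch. The class `LineSplitSketch` and the method `splitIntoLines` are hypothetical names, and the real method presumably adds to `outputList` rather than returning a list.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for the sampling path; not the actual PXF code. */
final class LineSplitSketch {

    /**
     * Breaks a buffer of newline-delimited text into individual lines.
     *
     * Assumed purpose: when sampling is enabled, each record has to be a
     * separate output item so that it can be kept or dropped on its own,
     * instead of the whole buffer being emitted as a single item.
     *
     * @param val raw bytes holding zero or more '\n'-separated records
     * @return one string per record, each ending with '\n'
     */
    static List<String> splitIntoLines(byte[] val) {
        List<String> lines = new ArrayList<>();
        int start = 0;
        for (int i = 0; i < val.length; i++) {
            // 0x0A never occurs inside a multi-byte UTF-8 sequence,
            // so scanning raw bytes for '\n' is safe.
            if (val[i] == '\n') {
                lines.add(new String(val, start, i - start + 1, StandardCharsets.UTF_8));
                start = i + 1;
            }
        }
        if (start < val.length) {
            // The last record has no trailing newline; add one so that
            // every emitted line is terminated the same way.
            lines.add(new String(val, start, val.length - start, StandardCharsets.UTF_8) + "\n");
        }
        return lines;
    }
}
```

    With a Javadoc block like this, the reason for the `samplingEnabled` branch in `fillText` is visible without reading the function body.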


> Advanced statistics for PXF tables
> ----------------------------------
>
>                 Key: HAWQ-44
>                 URL: https://issues.apache.org/jira/browse/HAWQ-44
>             Project: Apache HAWQ
>          Issue Type: New Feature
>          Components: PXF
>            Reporter: Noa Horn
>            Assignee: Noa Horn
>              Labels: Performance
>
> PXF will get full statistics on a table using sampling.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
