[ 
https://issues.apache.org/jira/browse/ORC-8?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15102471#comment-15102471
 ] 

ASF GitHub Bot commented on ORC-8:
----------------------------------

Github user omalley commented on the pull request:

    https://github.com/apache/orc/pull/15#issuecomment-172093974
  
    For comparison, here is a before and after for examples/decimal.orc:
    
    ```
    Structure for ../examples/decimal.orc
    
     === Postscript ===
    footerLength: 89
    compression: NONE
    version: 0
    version: 12
    metadataLength: 44
    magic: "ORC"
    
     === Footer ===
    headerLength: 3
    contentLength: 16186
    stripes {
      offset: 3
      indexLength: 54
      dataLength: 16079
      footerLength: 50
      numberOfRows: 6000
    }
    types {
      kind: STRUCT
      subtypes: 1
      fieldNames: "_col0"
    }
    types {
      kind: DECIMAL
      precision: 10
      scale: 5
    }
    numberOfRows: 6000
    statistics {
      numberOfValues: 6000
    }
    statistics {
      numberOfValues: 4000
      decimalStatistics {
        minimum: "-1000.5"
        maximum: "1999.2"
        sum: "1998301.099"
      }
    }
    rowIndexStride: 10000
    
    === Stripe Statistics ===
    Stripe 1:
    offset: 3
    indexLength: 54
    dataLength: 16079
    footerLength: 50
    numberOfRows: 6000
        Stream: column 0 section 6 start: 3 length 7
        Stream: column 1 section 6 start: 10 length 47
        Stream: column 1 section 0 start: 57 length 12
        Stream: column 1 section 1 start: 69 length 13673
        Stream: column 1 section 5 start: 13742 length 2394
        Encoding column 0: 0
        Encoding column 1: 2
    File length: 16337 bytes
    Padding length: 0 bytes
    Padding ratio: 0.00 %
    ```
    
    and after the change:
    
    ```
    { "name": "../examples/decimal.orc",
      "type": "struct<_col0:decimal(10,5)>",
      "rows": 6000,
      "stripe count": 1,
      "format": "0.12", "writer version": "original",
      "compression": "none",
      "file length": 16337,
      "content": 16186, "stripe stats": 44, "footer": 89, "postscript": 17,
      "row index stride": 10000,
      "user metadata": {
      },
      "stripes": [
        { "stripe": 0, "rows": 6000,
          "offset": 3, "length": 16183,
          "index": 54, "data": 16079, "footer": 50
        }
      ]
    }
    ```
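
As a rough sanity check on the JSON output above, the reported sizes can be added up and compared. The sketch below is not part of the tool. It assumes that "content" counts the 3-byte header plus all stripes, as the headerLength and contentLength values in the old output suggest, and that the file ends with a single byte holding the postscript length, as the ORC format specification describes. The name `check_sizes` is made up for illustration.

```python
import json

# JSON produced by the new tool output, copied from the comment above.
METADATA_JSON = """
{ "name": "../examples/decimal.orc",
  "type": "struct<_col0:decimal(10,5)>",
  "rows": 6000,
  "stripe count": 1,
  "format": "0.12", "writer version": "original",
  "compression": "none",
  "file length": 16337,
  "content": 16186, "stripe stats": 44, "footer": 89, "postscript": 17,
  "row index stride": 10000,
  "user metadata": {
  },
  "stripes": [
    { "stripe": 0, "rows": 6000,
      "offset": 3, "length": 16183,
      "index": 54, "data": 16079, "footer": 50
    }
  ]
}
"""


def check_sizes(meta):
    """Return a dict mapping each consistency check to True or False.

    Hypothetical helper: it only cross-checks the numbers in the JSON
    and never opens the ORC file itself.
    """
    stripes = meta["stripes"]
    last = stripes[-1]
    return {
        # Each stripe is made of its index, data and footer sections.
        "stripe length": all(
            s["index"] + s["data"] + s["footer"] == s["length"] for s in stripes
        ),
        # Assumption: "content" ends where the last stripe ends.
        "content": last["offset"] + last["length"] == meta["content"],
        # Tail of the file: stripe stats, footer, postscript, plus one
        # final byte that stores the postscript length (per the ORC spec).
        "file length": (
            meta["content"]
            + meta["stripe stats"]
            + meta["footer"]
            + meta["postscript"]
            + 1
        ) == meta["file length"],
        "rows": sum(s["rows"] for s in stripes) == meta["rows"],
        "stripe count": len(stripes) == meta["stripe count"],
    }


results = check_sizes(json.loads(METADATA_JSON))
for name, ok in results.items():
    print(f"{name}: {'ok' if ok else 'MISMATCH'}")
```

For decimal.orc the numbers line up: 54 + 16079 + 50 = 16183 for the stripe, 3 + 16183 = 16186 for the content, and 16186 + 44 + 89 + 17 + 1 = 16337 for the file length.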


> Replace tool implementation to use Reader interface
> ---------------------------------------------------
>
>                 Key: ORC-8
>                 URL: https://issues.apache.org/jira/browse/ORC-8
>             Project: Orc
>          Issue Type: Bug
>            Reporter: Owen O'Malley
>            Assignee: Owen O'Malley
>
> Currently FileMetadata is using custom parsing of the ORC footer and 
> postscript. It should use the standard interface from Reader.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
