[
https://issues.apache.org/jira/browse/ORC-8?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15102471#comment-15102471
]
ASF GitHub Bot commented on ORC-8:
----------------------------------
Github user omalley commented on the pull request:
https://github.com/apache/orc/pull/15#issuecomment-172093974
For comparison, here is a before and after for examples/decimal.orc:
```
Structure for ../examples/decimal.orc
=== Postscript ===
footerLength: 89
compression: NONE
version: 0
version: 12
metadataLength: 44
magic: "ORC"
=== Footer ===
headerLength: 3
contentLength: 16186
stripes {
offset: 3
indexLength: 54
dataLength: 16079
footerLength: 50
numberOfRows: 6000
}
types {
kind: STRUCT
subtypes: 1
fieldNames: "_col0"
}
types {
kind: DECIMAL
precision: 10
scale: 5
}
numberOfRows: 6000
statistics {
numberOfValues: 6000
}
statistics {
numberOfValues: 4000
decimalStatistics {
minimum: "-1000.5"
maximum: "1999.2"
sum: "1998301.099"
}
}
rowIndexStride: 10000
=== Stripe Statistics ===
Stripe 1:
offset: 3
indexLength: 54
dataLength: 16079
footerLength: 50
numberOfRows: 6000
Stream: column 0 section 6 start: 3 length 7
Stream: column 1 section 6 start: 10 length 47
Stream: column 1 section 0 start: 57 length 12
Stream: column 1 section 1 start: 69 length 13673
Stream: column 1 section 5 start: 13742 length 2394
Encoding column 0: 0
Encoding column 1: 2
File length: 16337 bytes
Padding length: 0 bytes
Padding ratio: 0.00 %
```
and
```
{ "name": "../examples/decimal.orc",
"type": "struct<_col0:decimal(10,5)>",
"rows": 6000,
"stripe count": 1,
"format": "0.12", "writer version": "original",
"compression": "none",
"file length": 16337,
"content": 16186, "stripe stats": 44, "footer": 89, "postscript": 17,
"row index stride": 10000,
"user metadata": {
},
"stripes": [
{ "stripe": 0, "rows": 6000,
"offset": 3, "length": 16183,
"index": 54, "data": 16079, "footer": 50
}
]
}
```
> Replace tool implementation to use Reader interface
> ---------------------------------------------------
>
> Key: ORC-8
> URL: https://issues.apache.org/jira/browse/ORC-8
> Project: Orc
> Issue Type: Bug
> Reporter: Owen O'Malley
> Assignee: Owen O'Malley
>
> Currently FileMetadata is using custom parsing of the ORC footer and
> postscript. It should use the standard interface from Reader.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)