[jira] [Created] (PARQUET-2213) Add an alternative InputFile.newStream that allow an input range

2022-11-10 Thread Chao Sun (Jira)
Chao Sun created PARQUET-2213: - Summary: Add an alternative InputFile.newStream that allow an input range Key: PARQUET-2213 URL: https://issues.apache.org/jira/browse/PARQUET-2213 Project: Parquet

[jira] [Created] (PARQUET-2203) Make ParquetReadOptions and HadoopReadOptions extendable

2022-10-24 Thread Chao Sun (Jira)
Chao Sun created PARQUET-2203: - Summary: Make ParquetReadOptions and HadoopReadOptions extendable Key: PARQUET-2203 URL: https://issues.apache.org/jira/browse/PARQUET-2203 Project: Parquet

[jira] [Commented] (PARQUET-2160) Close decompression stream to free off-heap memory in time

2022-08-05 Thread Chao Sun (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575952#comment-17575952 ] Chao Sun commented on PARQUET-2160: --- {quote} ... only it happens after the decompress call, may I ask

[jira] [Commented] (PARQUET-2160) Close decompression stream to free off-heap memory in time

2022-08-04 Thread Chao Sun (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575579#comment-17575579 ] Chao Sun commented on PARQUET-2160: --- Hmm it does need to allocate extra heap memory and then read the

[jira] [Commented] (PARQUET-2160) Close decompression stream to free off-heap memory in time

2022-08-04 Thread Chao Sun (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575487#comment-17575487 ] Chao Sun commented on PARQUET-2160: --- {quote} After I made this change to decompress, I found off-heap

[jira] [Updated] (PARQUET-2155) Upgrade protobuf version to 3.17.3

2022-07-20 Thread Chao Sun (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chao Sun updated PARQUET-2155: -- Summary: Upgrade protobuf version to 3.17.3 (was: Upgrade protobuf version to 3.20.1) > Upgrade

[jira] [Created] (PARQUET-2155) Upgrade protobuf version to 3.20.1

2022-06-09 Thread Chao Sun (Jira)
Chao Sun created PARQUET-2155: - Summary: Upgrade protobuf version to 3.20.1 Key: PARQUET-2155 URL: https://issues.apache.org/jira/browse/PARQUET-2155 Project: Parquet Issue Type: Improvement

[jira] [Commented] (PARQUET-2090) [C++] Parquet writes incorrect file_offset

2021-09-13 Thread Chao Sun (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17414737#comment-17414737 ] Chao Sun commented on PARQUET-2090: --- Thanks [~emkornfield], you are right - I missed the comment of

[jira] [Created] (PARQUET-2084) Upgrade Thrift to 0.14.2

2021-09-01 Thread Chao Sun (Jira)
Chao Sun created PARQUET-2084: - Summary: Upgrade Thrift to 0.14.2 Key: PARQUET-2084 URL: https://issues.apache.org/jira/browse/PARQUET-2084 Project: Parquet Issue Type: Improvement

[jira] [Created] (PARQUET-2083) Expose getFieldPath from ColumnIO

2021-08-31 Thread Chao Sun (Jira)
Chao Sun created PARQUET-2083: - Summary: Expose getFieldPath from ColumnIO Key: PARQUET-2083 URL: https://issues.apache.org/jira/browse/PARQUET-2083 Project: Parquet Issue Type: Improvement

[jira] [Created] (PARQUET-2061) Add a new API in `PageReadStore` to return row ranges directly

2021-06-28 Thread Chao Sun (Jira)
Chao Sun created PARQUET-2061: - Summary: Add a new API in `PageReadStore` to return row ranges directly Key: PARQUET-2061 URL: https://issues.apache.org/jira/browse/PARQUET-2061 Project: Parquet

[jira] [Created] (PARQUET-2052) Integer overflow when writing huge binary using dictionary encoding

2021-05-20 Thread Chao Sun (Jira)
Chao Sun created PARQUET-2052: - Summary: Integer overflow when writing huge binary using dictionary encoding Key: PARQUET-2052 URL: https://issues.apache.org/jira/browse/PARQUET-2052 Project: Parquet

[jira] [Created] (PARQUET-2050) Expose repetition & definition level from ColumnIO

2021-05-14 Thread Chao Sun (Jira)
Chao Sun created PARQUET-2050: - Summary: Expose repetition & definition level from ColumnIO Key: PARQUET-2050 URL: https://issues.apache.org/jira/browse/PARQUET-2050 Project: Parquet Issue Type:

[jira] [Commented] (PARQUET-1249) Clarify encoding schemes for boolean types

2018-03-23 Thread Chao Sun (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16411943#comment-16411943 ] Chao Sun commented on PARQUET-1249: --- Thanks! > Clarify encoding schemes for boolean types >

[jira] [Assigned] (PARQUET-1249) Clarify encoding schemes for boolean types

2018-03-23 Thread Chao Sun (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chao Sun reassigned PARQUET-1249: - Assignee: Chao Sun > Clarify encoding schemes for boolean types >

[jira] [Commented] (PARQUET-1249) Clarify encoding schemes for boolean types

2018-03-23 Thread Chao Sun (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16411939#comment-16411939 ] Chao Sun commented on PARQUET-1249: --- Trying to make my first contribution here. Can someone add me as

[jira] [Created] (PARQUET-1249) Clarify encoding schemes for boolean types

2018-03-17 Thread Chao Sun (JIRA)
Chao Sun created PARQUET-1249: - Summary: Clarify encoding schemes for boolean types Key: PARQUET-1249 URL: https://issues.apache.org/jira/browse/PARQUET-1249 Project: Parquet Issue Type: