[jira] [Assigned] (PARQUET-2110) Fix Typos in LogicalTypes.md

2022-01-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-2110: - Assignee: jincongho > Fix Typos in LogicalTypes.md

[jira] [Updated] (PARQUET-2110) Fix Typos in LogicalTypes.md

2022-01-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-2110: -- Fix Version/s: format-2.10.0 > Fix Typos in LogicalTypes.md

[jira] [Resolved] (PARQUET-2110) Fix Typos in LogicalTypes.md

2022-01-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-2110. --- Resolution: Fixed Resolved in PR https://github.com/apache/parquet-format/pull/181 > Fix T

[jira] [Commented] (PARQUET-1345) [C++] It is possible to overflow a TMemoryBuffer when serializing the file metadata

2020-10-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17205494#comment-17205494 ] Wes McKinney commented on PARQUET-1345: --- Can you make a repro? Seems like somethi

[jira] [Assigned] (PARQUET-1878) [C++] lz4 codec is not compatible with Hadoop Lz4Codec

2020-09-22 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1878: - Assignee: Patrick Pai > [C++] lz4 codec is not compatible with Hadoop Lz4Codec > -

[jira] [Resolved] (PARQUET-1878) [C++] lz4 codec is not compatible with Hadoop Lz4Codec

2020-09-22 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1878. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 7789 [http

[jira] [Commented] (PARQUET-1904) [C++] Export file_offset in RowGroupMetaData

2020-08-27 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17186124#comment-17186124 ] Wes McKinney commented on PARQUET-1904: --- Done. I also made you an administrator s

[jira] [Updated] (PARQUET-1904) [C++] Export file_offset in RowGroupMetaData

2020-08-27 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1904: -- Fix Version/s: cpp-1.6.0 > [C++] Export file_offset in RowGroupMetaData > ---

[jira] [Assigned] (PARQUET-1904) [C++] Export file_offset in RowGroupMetaData

2020-08-27 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1904: - Assignee: Simon Bertron > [C++] Export file_offset in RowGroupMetaData > -

[jira] [Resolved] (PARQUET-1845) [C++] Int96 memory images in test cases assume only little-endian

2020-08-03 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1845. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 6981 [http

[jira] [Assigned] (PARQUET-1845) [C++] Int96 memory images in test cases assume only little-endian

2020-08-03 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1845: - Assignee: Kazuaki Ishizaki > [C++] Int96 memory images in test cases assume only littl

[jira] [Resolved] (PARQUET-1882) [C++] Writing an all-null column and then reading it with buffered_stream aborts the process

2020-07-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1882. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 7718 [http

[jira] [Updated] (PARQUET-1882) [C++] Writing an all-null column and then reading it with buffered_stream aborts the process

2020-07-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1882: -- Summary: [C++] Writing an all-null column and then reading it with buffered_stream aborts the

[jira] [Resolved] (PARQUET-1839) [C++] values_read not updated in ReadBatchSpaced

2020-07-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1839. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 7717 [http

[jira] [Updated] (PARQUET-1839) [C++] values_read not updated in ReadBatchSpaced

2020-07-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1839: -- Summary: [C++] values_read not updated in ReadBatchSpaced (was: values_read not updated in

[jira] [Commented] (PARQUET-1882) Writing an all-null column and then reading it with buffered_stream aborts the process

2020-07-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17154724#comment-17154724 ] Wes McKinney commented on PARQUET-1882: --- Can you provide a reproducible code exam

[jira] [Commented] (PARQUET-1878) [C++] lz4 codec is not compatible with Hadoop Lz4Codec

2020-06-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1878?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17139954#comment-17139954 ] Wes McKinney commented on PARQUET-1878: --- [~chairmank] can you also send an e-mail

[jira] [Updated] (PARQUET-1241) [C++] Use LZ4 frame format

2020-06-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1241: -- Fix Version/s: cpp-1.6.0 > [C++] Use LZ4 frame format

[jira] [Resolved] (PARQUET-1877) [C++] Reconcile container size with string size for memory issues

2020-06-17 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1877. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 7465 [http

[jira] [Assigned] (PARQUET-1859) [C++] Require error message when using ParquetException::EofException

2020-06-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1859: - Assignee: (was: Wes McKinney) > [C++] Require error message when using ParquetExce

[jira] [Assigned] (PARQUET-1385) [C++] bloom_filter-test is very slow under valgrind

2020-06-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1385: - Assignee: (was: Wes McKinney) > [C++] bloom_filter-test is very slow under valgrin

[jira] [Assigned] (PARQUET-1352) [CPP] Trying to write an arrow table with structs to a parquet file

2020-06-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1352: - Assignee: (was: Wes McKinney) > [CPP] Trying to write an arrow table with structs

[jira] [Assigned] (PARQUET-838) [CPP] Unable to read files written by parquet-cpp from parquet-tools

2020-06-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-838: Assignee: (was: Wes McKinney) > [CPP] Unable to read files written by parquet-cpp fro

[jira] [Assigned] (PARQUET-1385) [C++] bloom_filter-test is very slow under valgrind

2020-06-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1385: - Assignee: Wes McKinney > [C++] bloom_filter-test is very slow under valgrind > ---

[jira] [Assigned] (PARQUET-443) Schema resolution: map encoding

2020-06-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-443: Assignee: (was: Wes McKinney) > Schema resolution: map encoding > ---

[jira] [Assigned] (PARQUET-441) Schema resolution: one, two, and three-level array encoding

2020-06-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-441: Assignee: (was: Wes McKinney) > Schema resolution: one, two, and three-level array en

[jira] [Commented] (PARQUET-1869) [C++] Large decimal values don't roundtrip correctly

2020-06-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1869?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17123699#comment-17123699 ] Wes McKinney commented on PARQUET-1869: --- I'm pretty sure this is a problem with c

[jira] [Assigned] (PARQUET-1855) [C++] Improve documentation on MetaData ownership

2020-05-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1855: - Assignee: Francois Saint-Jacques > [C++] Improve documentation on MetaData ownership >

[jira] [Resolved] (PARQUET-1855) [C++] Improve documentation on MetaData ownership

2020-05-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1855. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 7244 [http

[jira] [Resolved] (PARQUET-1861) [Documentation][C++] Explain ReaderProperters.buffer_stream*

2020-05-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1861. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 7221 [http

[jira] [Resolved] (PARQUET-1865) [C++] Failure from C++17 feature used in parquet/encoding_benchmark.cc

2020-05-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1865. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 7237 [http

[jira] [Assigned] (PARQUET-1865) [C++] Failure from C++17 feature used in parquet/encoding_benchmark.cc

2020-05-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1865: - Assignee: Wes McKinney > [C++] Failure from C++17 feature used in parquet/encoding_ben

[jira] [Created] (PARQUET-1865) [C++] Failure from C++17 feature used in parquet/encoding_benchmark.cc

2020-05-20 Thread Wes McKinney (Jira)
Wes McKinney created PARQUET-1865: - Summary: [C++] Failure from C++17 feature used in parquet/encoding_benchmark.cc Key: PARQUET-1865 URL: https://issues.apache.org/jira/browse/PARQUET-1865 Project: P

[jira] [Updated] (PARQUET-1861) [Documentation][C++] Explain ReaderProperters.buffer_stream*

2020-05-08 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1861: -- Summary: [Documentation][C++] Explain ReaderProperters.buffer_stream* (was: [Documentation]

[jira] [Resolved] (PARQUET-1857) [C++][Parquet] ParquetFileReader unable to read files with more than 32767 row groups

2020-05-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1857. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 7108 [http

[jira] [Commented] (PARQUET-1858) [Python] [Rust] Parquet read file fails with batch size 1_000_000 and 41 row groups

2020-05-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17100758#comment-17100758 ] Wes McKinney commented on PARQUET-1858: --- Yes it looks like the file written by Ru

[jira] [Assigned] (PARQUET-1859) [C++] Require error message when using ParquetException::EofException

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1859: - Assignee: Wes McKinney > [C++] Require error message when using ParquetException::EofE

[jira] [Commented] (PARQUET-1858) [Python] [Rust] Parquet read file fails with batch size 1_000_000 and 41 row groups

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17100260#comment-17100260 ] Wes McKinney commented on PARQUET-1858: --- The PLAIN encoding for the boolean type

[jira] [Created] (PARQUET-1859) [C++] Require error message when using ParquetException::EofException

2020-05-05 Thread Wes McKinney (Jira)
Wes McKinney created PARQUET-1859: - Summary: [C++] Require error message when using ParquetException::EofException Key: PARQUET-1859 URL: https://issues.apache.org/jira/browse/PARQUET-1859 Project: Pa

[jira] [Commented] (PARQUET-1857) [C++][Parquet] ParquetFileReader unable to read files with more than 32767 row groups

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17100084#comment-17100084 ] Wes McKinney commented on PARQUET-1857: --- I put up a PR for the first problem you

[jira] [Moved] (PARQUET-1857) [C++][Parquet] ParquetFileReader unable to read files with more than 32767 row groups

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney moved ARROW-8677 to PARQUET-1857: -- Component/s: (was: Rust) (was: P

[jira] [Created] (PARQUET-1856) [C++] Test suite assumes that Snappy support is built

2020-05-04 Thread Wes McKinney (Jira)
Wes McKinney created PARQUET-1856: - Summary: [C++] Test suite assumes that Snappy support is built Key: PARQUET-1856 URL: https://issues.apache.org/jira/browse/PARQUET-1856 Project: Parquet I

[jira] [Resolved] (PARQUET-1820) [C++] Use a column filter hint to inform read prefetching in Arrow reads

2020-05-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1820. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 6744 [http

[jira] [Assigned] (PARQUET-1820) [C++] Use a column filter hint to inform read prefetching in Arrow reads

2020-05-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1820: - Assignee: David Li > [C++] Use a column filter hint to inform read prefetching in Arro

[jira] [Updated] (PARQUET-1820) [C++] Use a column filter hint to inform read prefetching in Arrow reads

2020-05-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1820: -- Summary: [C++] Use a column filter hint to inform read prefetching in Arrow reads (was: [C++

[jira] [Commented] (PARQUET-1404) [C++] Add index pages to the format to support efficient page skipping to parquet-cpp

2020-04-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1404?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17090617#comment-17090617 ] Wes McKinney commented on PARQUET-1404: --- Do you want to keep the discussion in on

[jira] [Updated] (PARQUET-1327) [C++] Bloom filter read/write implementation

2020-04-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1327: -- Summary: [C++] Bloom filter read/write implementation (was: [C++]Bloom filter read/write imp

[jira] [Updated] (PARQUET-1828) [C++] Add a SSE2 path for the ByteStreamSplit encoder implementation

2020-04-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1828: -- Summary: [C++] Add a SSE2 path for the ByteStreamSplit encoder implementation (was: Add a SS

[jira] [Created] (PARQUET-1846) [C++] Remove deprecated IO classes and related functions

2020-04-19 Thread Wes McKinney (Jira)
Wes McKinney created PARQUET-1846: - Summary: [C++] Remove deprecated IO classes and related functions Key: PARQUET-1846 URL: https://issues.apache.org/jira/browse/PARQUET-1846 Project: Parquet

[jira] [Resolved] (PARQUET-1835) [C++] Fix crashes on invalid input (OSS-Fuzz)

2020-04-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1835. --- Resolution: Fixed Issue resolved by pull request 6848 [https://github.com/apache/arrow/pull

[jira] [Updated] (PARQUET-1834) Add Apache 2.0 license to README.md files in parquet-testing

2020-04-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1834: -- Fix Version/s: cpp-1.6.0 > Add Apache 2.0 license to README.md files in parquet-testing > ---

[jira] [Resolved] (PARQUET-1834) Add Apache 2.0 license to README.md files in parquet-testing

2020-04-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1834. --- Resolution: Fixed Resolved by PR https://github.com/apache/parquet-testing/commit/bcd9ebcf

[jira] [Resolved] (PARQUET-1829) [C++] Fix crashes on invalid input (OSS-Fuzz)

2020-03-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1829. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 6728 [http

[jira] [Resolved] (PARQUET-458) [C++] Implement support for DataPageV2

2020-03-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-458. -- Resolution: Fixed Issue resolved by pull request 6481 [https://github.com/apache/arrow/pull/64

[jira] [Commented] (PARQUET-1786) [C++] Use simd to improve BYTE_STREAM_SPLIT decoding performance

2020-03-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17066085#comment-17066085 ] Wes McKinney commented on PARQUET-1786: --- Please leave resolved issues in "Resolve

[jira] [Resolved] (PARQUET-1823) [C++] Invalid RowGroup returned when reading with parquet::arrow::FileReader->RowGroup(i)->Column(j)

2020-03-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1823. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 6674 [http

[jira] [Resolved] (PARQUET-1819) [C++] Fix crashes on corrupt IPC input (OSS-Fuzz)

2020-03-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1819. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 6659 [http

[jira] [Updated] (PARQUET-1814) [C++] TestInt96ParquetIO failure on Windows

2020-03-13 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1814: -- Fix Version/s: cpp-1.6.0 > [C++] TestInt96ParquetIO failure on Windows >

[jira] [Updated] (PARQUET-1813) [C++] Remove logging statement in unit test

2020-03-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1813: -- Summary: [C++] Remove logging statement in unit test (was: [C++] Weird error output in tests

[jira] [Assigned] (PARQUET-1813) [C++] Weird error output in tests

2020-03-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1813: - Assignee: Wes McKinney > [C++] Weird error output in tests > -

[jira] [Commented] (PARQUET-1813) [C++] Weird error output in tests

2020-03-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17058286#comment-17058286 ] Wes McKinney commented on PARQUET-1813: --- I missed the debug output in my code rev

[jira] [Resolved] (PARQUET-1663) [C++] Provide API to check the presence of complex data types

2020-03-10 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1663. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 5490 [http

[jira] [Commented] (PARQUET-1300) [C++] Parquet modular encryption

2020-03-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17053455#comment-17053455 ] Wes McKinney commented on PARQUET-1300: --- Anyone interested in looking at packagin

[jira] [Resolved] (PARQUET-1810) [C++] Fix undefined behaviour on invalid enum values (OSS-Fuzz)

2020-03-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1810. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 6537 [http

[jira] [Resolved] (PARQUET-1780) [C++] Set ColumnMetadata.encoding_stats field

2020-03-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1780. --- Resolution: Fixed Issue resolved by pull request 6370 [https://github.com/apache/arrow/pull

[jira] [Assigned] (PARQUET-1780) [C++] Set ColumnMetadata.encoding_stats field

2020-03-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1780: - Assignee: Gamage Omega Ishendra > [C++] Set ColumnMetadata.encoding_stats field >

[jira] [Updated] (PARQUET-1797) [C++] Fix fuzzing errors

2020-02-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1797: -- Summary: [C++] Fix fuzzing errors (was: Fix fuzzing errors) > [C++] Fix fuzzing errors > ---

[jira] [Created] (PARQUET-1798) [C++] Review logic around automatic assignment of field_id's

2020-02-14 Thread Wes McKinney (Jira)
Wes McKinney created PARQUET-1798: - Summary: [C++] Review logic around automatic assignment of field_id's Key: PARQUET-1798 URL: https://issues.apache.org/jira/browse/PARQUET-1798 Project: Parquet

[jira] [Resolved] (PARQUET-1788) [C++] ColumnWriter has undefined behavior when writing arrow chunks

2020-02-10 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1788. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 6378 [http

[jira] [Resolved] (PARQUET-1716) [C++] Add support for BYTE_STREAM_SPLIT encoding

2020-02-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1716. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 6005 [http

[jira] [Assigned] (PARQUET-1716) [C++] Add support for BYTE_STREAM_SPLIT encoding

2020-02-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1716: - Assignee: Martin Radev > [C++] Add support for BYTE_STREAM_SPLIT encoding > --

[jira] [Commented] (PARQUET-1783) [C++] Parquet statistics wrong for dictionary type

2020-02-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17030199#comment-17030199 ] Wes McKinney commented on PARQUET-1783: --- I suppose it's good at least that the mi

[jira] [Commented] (PARQUET-1783) [C++] Parquet statistics wrong for dictionary type

2020-02-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17030200#comment-17030200 ] Wes McKinney commented on PARQUET-1783: --- Do we need to create a corresponding Arr

[jira] [Created] (PARQUET-1780) [C++] Set ColumnMetadata.encoding_stats field

2020-01-28 Thread Wes McKinney (Jira)
Wes McKinney created PARQUET-1780: - Summary: [C++] Set ColumnMetadata.encoding_stats field Key: PARQUET-1780 URL: https://issues.apache.org/jira/browse/PARQUET-1780 Project: Parquet Issue Typ

[jira] [Resolved] (PARQUET-1747) [C++] Access to ColumnChunkMetaData fails when encryption is on

2020-01-22 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1747. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 6150 [http

[jira] [Updated] (PARQUET-1772) [C++] ParquetFileWriter: Data overwritten when output stream opened in append mode

2020-01-22 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1772: -- Component/s: parquet-cpp > [C++] ParquetFileWriter: Data overwritten when output stream opene

[jira] [Resolved] (PARQUET-1766) [C++] parquet NaN/null double statistics can result in endless loop

2020-01-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1766. --- Resolution: Fixed Issue resolved by pull request 6167 [https://github.com/apache/arrow/pull

[jira] [Resolved] (PARQUET-1769) [C++] Update to parquet-format 2.8.0

2020-01-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1769. --- Resolution: Fixed Issue resolved by pull request 6200 [https://github.com/apache/arrow/pull

[jira] [Created] (PARQUET-1769) [C++] Update to parquet-format 2.8.0

2020-01-14 Thread Wes McKinney (Jira)
Wes McKinney created PARQUET-1769: - Summary: [C++] Update to parquet-format 2.8.0 Key: PARQUET-1769 URL: https://issues.apache.org/jira/browse/PARQUET-1769 Project: Parquet Issue Type: Improv

[jira] [Resolved] (PARQUET-1701) [C++] Stream API: Add support for optional fields

2020-01-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved PARQUET-1701. --- Fix Version/s: cpp-1.6.0 Resolution: Fixed Issue resolved by pull request 5928 [http

[jira] [Updated] (PARQUET-1701) [C++] Stream API: Add support for optional fields

2020-01-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1701: -- Component/s: parquet-cpp > [C++] Stream API: Add support for optional fields > --

[jira] [Commented] (PARQUET-1698) [C++] Add reader option to pre-buffer entire serialized row group into memory

2020-01-13 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1698?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17014781#comment-17014781 ] Wes McKinney commented on PARQUET-1698: --- Currently in the C++ library, IO calls a

[jira] [Commented] (PARQUET-1698) [C++] Add reader option to pre-buffer entire serialized row group into memory

2020-01-13 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1698?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17014744#comment-17014744 ] Wes McKinney commented on PARQUET-1698: --- I think the pre-buffering should probabl

[jira] [Moved] (PARQUET-1766) [C++] parquet NaN/null double statistics can result in endless loop

2020-01-13 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney moved ARROW-7376 to PARQUET-1766: -- Component/s: (was: C++) parquet-cpp

[jira] [Commented] (PARQUET-1698) [C++] Add reader option to pre-buffer entire serialized row group into memory

2020-01-13 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1698?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17014637#comment-17014637 ] Wes McKinney commented on PARQUET-1698: --- [~lidavidm] I missed the part about "wid

[jira] [Commented] (PARQUET-1698) [C++] Add reader option to pre-buffer entire serialized row group into memory

2020-01-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1698?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17013982#comment-17013982 ] Wes McKinney commented on PARQUET-1698: --- [~lidavidm] I'm quite interested to comp

[jira] [Commented] (PARQUET-1720) [C++] Parquet JSONPrint not showing version correctly

2019-12-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17000192#comment-17000192 ] Wes McKinney commented on PARQUET-1720: --- Assuming this is a C++ issue. Can you pr

[jira] [Updated] (PARQUET-1720) [C++] Parquet JSONPrint not showing version correctly

2019-12-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1720: -- Summary: [C++] Parquet JSONPrint not showing version correctly (was: Parquet JSONPrint not s

[jira] [Updated] (PARQUET-1720) Parquet JSONPrint not showing version correctly

2019-12-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1720: -- Component/s: parquet-cpp > Parquet JSONPrint not showing version correctly >

[jira] [Commented] (PARQUET-1715) [C++] Add the Parquet code samples to CI + Refactor Parquet Encryption Samples

2019-12-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16995175#comment-16995175 ] Wes McKinney commented on PARQUET-1715: --- Done > [C++] Add the Parquet code sampl

[jira] [Assigned] (PARQUET-1715) [C++] Add the Parquet code samples to CI + Refactor Parquet Encryption Samples

2019-12-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1715: - Assignee: Gal Lushi > [C++] Add the Parquet code samples to CI + Refactor Parquet Encr

[jira] [Closed] (PARQUET-1718) Store int16 as int16

2019-12-11 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed PARQUET-1718. - > Store int16 as int16 > > > Key: PARQUET-1718 >

[jira] [Updated] (PARQUET-1716) [C++] Add support for BYTE_STREAM_SPLIT encoding

2019-12-10 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1716: -- Summary: [C++] Add support for BYTE_STREAM_SPLIT encoding (was: [C++][Parquet] Add support f

[jira] [Moved] (PARQUET-1716) [C++][Parquet] Add support for Parquet's BYTE_STREAM_SPLIT encoding

2019-12-10 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney moved ARROW-5913 to PARQUET-1716: -- Component/s: (was: C++) parquet-cpp

[jira] [Commented] (PARQUET-1712) [C++] Stop using deprecated APIs in examples

2019-12-10 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16992361#comment-16992361 ] Wes McKinney commented on PARQUET-1712: --- Done > [C++] Stop using deprecated APIs

[jira] [Assigned] (PARQUET-1712) [C++] Stop using deprecated APIs in examples

2019-12-10 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned PARQUET-1712: - Assignee: Kenta Murata > [C++] Stop using deprecated APIs in examples > --

[jira] [Updated] (PARQUET-924) [C++] Persist original type metadata from Arrow schemas

2019-12-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-924: - Fix Version/s: cpp-1.6.0 > [C++] Persist original type metadata from Arrow schemas > ---

[jira] [Updated] (PARQUET-1693) [C++] Build examples don't account for CMAKE compression feature flags

2019-12-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated PARQUET-1693: -- Fix Version/s: (was: 1.11.0) cpp-1.6.0 > [C++] Build examples don't ac

[jira] [Closed] (PARQUET-1713) [C++] Refactor Parquet Code Samples to use Result APIs

2019-12-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed PARQUET-1713. - > [C++] Refactor Parquet Code Samples to use Result APIs >
