[jira] [Created] (ARROW-6568) pyarrow.parquet crash writing zero-chunk dictionary-type column

2019-09-15 Thread Adam Hooper (Jira)
Adam Hooper created ARROW-6568: Summary: pyarrow.parquet crash writing zero-chunk dictionary-type column Key: ARROW-6568 URL: https://issues.apache.org/jira/browse/ARROW-6568 Project: Apache Arrow

[jira] [Commented] (ARROW-6568) pyarrow.parquet crash writing zero-chunk dictionary-type column

2019-09-15 Thread Adam Hooper (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16930052#comment-16930052 ] Adam Hooper commented on ARROW-6568: My workaround, in my function that wraps

[jira] [Created] (ARROW-7266) dictionary_encode() of a slice gives wrong result

2019-11-26 Thread Adam Hooper (Jira)
Adam Hooper created ARROW-7266: Summary: dictionary_encode() of a slice gives wrong result Key: ARROW-7266 URL: https://issues.apache.org/jira/browse/ARROW-7266 Project: Apache Arrow Issue

[jira] [Comment Edited] (ARROW-7266) dictionary_encode() of a slice gives wrong result

2019-11-26 Thread Adam Hooper (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16982544#comment-16982544 ] Adam Hooper edited comment on ARROW-7266 at 11/26/19 2:37 PM: Ah, found a

[jira] [Commented] (ARROW-7266) dictionary_encode() of a slice gives wrong result

2019-11-26 Thread Adam Hooper (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16982544#comment-16982544 ] Adam Hooper commented on ARROW-7266: Ah, found a workaround that should be good enough for now:

[jira] [Created] (ARROW-7281) AdaptiveIntBuilder::length() does not consider pending_pos_.

2019-11-29 Thread Adam Hooper (Jira)
Adam Hooper created ARROW-7281: Summary: AdaptiveIntBuilder::length() does not consider pending_pos_. Key: ARROW-7281 URL: https://issues.apache.org/jira/browse/ARROW-7281 Project: Apache Arrow

[jira] [Updated] (ARROW-7281) AdaptiveIntBuilder::length() does not consider pending_pos_.

2019-11-29 Thread Adam Hooper (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Hooper updated ARROW-7281: Description: {code:c++} arrow::AdaptiveIntBuilder builder(arrow::default_memory_pool());

[jira] [Commented] (ARROW-7281) [C++] AdaptiveIntBuilder::length() does not consider pending_pos_.

2019-11-30 Thread Adam Hooper (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16985473#comment-16985473 ] Adam Hooper commented on ARROW-7281: Lots of ways. Documented here:

[jira] [Updated] (ARROW-6861) arrow-0.15.0 reading arrow-0.14.1-output Parquet dictionary column: Failure reading column: IOError: Arrow error: Invalid: Resize cannot downsize

2019-10-11 Thread Adam Hooper (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Hooper updated ARROW-6861: Attachment: parquet-written-by-arrow-0-14-1.7z > arrow-0.15.0 reading arrow-0.14.1-output Parquet

[jira] [Commented] (ARROW-6861) arrow-0.15.0 reading arrow-0.14.1-output Parquet dictionary column: Failure reading column: IOError: Arrow error: Invalid: Resize cannot downsize

2019-10-11 Thread Adam Hooper (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16949738#comment-16949738 ] Adam Hooper commented on ARROW-6861: I've attached a Parquet file, written by Arrow 0.14.1, which

[jira] [Created] (ARROW-6861) With arrow-0.14.1-output Parquet dictionary column: Failure reading column: IOError: Arrow error: Invalid: Resize cannot downsize

2019-10-11 Thread Adam Hooper (Jira)
Adam Hooper created ARROW-6861: Summary: With arrow-0.14.1-output Parquet dictionary column: Failure reading column: IOError: Arrow error: Invalid: Resize cannot downsize Key: ARROW-6861 URL:

[jira] [Updated] (ARROW-6861) arrow-0.15.0 reading arrow-0.14.1-output Parquet dictionary column: Failure reading column: IOError: Arrow error: Invalid: Resize cannot downsize

2019-10-11 Thread Adam Hooper (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Hooper updated ARROW-6861: Summary: arrow-0.15.0 reading arrow-0.14.1-output Parquet dictionary column: Failure reading column:

[jira] [Created] (ARROW-7435) Security issue: ValidateOffsets() does not prevent buffer over-read

2019-12-18 Thread Adam Hooper (Jira)
Adam Hooper created ARROW-7435: Summary: Security issue: ValidateOffsets() does not prevent buffer over-read Key: ARROW-7435 URL: https://issues.apache.org/jira/browse/ARROW-7435 Project: Apache Arrow

[jira] [Reopened] (ARROW-6895) [C++][Parquet] parquet::arrow::ColumnReader: ByteArrayDictionaryRecordReader repeats returned values when calling `NextBatch()`

2020-02-18 Thread Adam Hooper (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Hooper reopened ARROW-6895: The code snippet given in the bug description still fails to read the {{bad.parquet}} file I

[jira] [Updated] (ARROW-6895) [C++][Parquet] parquet::arrow::ColumnReader: ByteArrayDictionaryRecordReader repeats returned values when calling `NextBatch()`

2020-02-18 Thread Adam Hooper (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Hooper updated ARROW-6895: Attachment: 01-fix-arrow-6895.diff > [C++][Parquet] parquet::arrow::ColumnReader:

[jira] [Commented] (ARROW-6895) [C++][Parquet] parquet::arrow::ColumnReader: ByteArrayDictionaryRecordReader repeats returned values when calling `NextBatch()`

2020-02-18 Thread Adam Hooper (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17039184#comment-17039184 ] Adam Hooper commented on ARROW-6895: I just uploaded {{01-fix-arrow-6895.diff}}, which I _imagine_

[jira] [Created] (ARROW-10033) ArrowReaderProperties creates thread pool, even when use_threads=False and pre_buffer=False

2020-09-17 Thread Adam Hooper (Jira)
Adam Hooper created ARROW-10033: Summary: ArrowReaderProperties creates thread pool, even when use_threads=False and pre_buffer=False Key: ARROW-10033 URL: https://issues.apache.org/jira/browse/ARROW-10033

[jira] [Created] (ARROW-10038) SetCpuThreadPoolCapacity(1) spins up nCPUs threads

2020-09-18 Thread Adam Hooper (Jira)
Adam Hooper created ARROW-10038: Summary: SetCpuThreadPoolCapacity(1) spins up nCPUs threads Key: ARROW-10038 URL: https://issues.apache.org/jira/browse/ARROW-10038 Project: Apache Arrow

[jira] [Created] (ARROW-12774) replace_substring_regex() creates invalid arrays => crash

2021-05-13 Thread Adam Hooper (Jira)
Adam Hooper created ARROW-12774: Summary: replace_substring_regex() creates invalid arrays => crash Key: ARROW-12774 URL: https://issues.apache.org/jira/browse/ARROW-12774 Project: Apache Arrow

[jira] [Created] (ARROW-12670) extract_regex gives bizarre behavior after nulls or non-matches

2021-05-06 Thread Adam Hooper (Jira)
Adam Hooper created ARROW-12670: Summary: extract_regex gives bizarre behavior after nulls or non-matches Key: ARROW-12670 URL: https://issues.apache.org/jira/browse/ARROW-12670 Project: Apache Arrow

[jira] [Created] (ARROW-12672) segfault after `pa.Array.fill_null()`

2021-05-06 Thread Adam Hooper (Jira)
Adam Hooper created ARROW-12672: Summary: segfault after `pa.Array.fill_null()` Key: ARROW-12672 URL: https://issues.apache.org/jira/browse/ARROW-12672 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-12911) sum of zero rows gives null; should give 0

2021-05-31 Thread Adam Hooper (Jira)
Adam Hooper created ARROW-12911: Summary: sum of zero rows gives null; should give 0 Key: ARROW-12911 URL: https://issues.apache.org/jira/browse/ARROW-12911 Project: Apache Arrow Issue Type: