[jira] [Created] (ARROW-13546) [Python] Breaking API change in FSSpecHandler, requires metadata argument

2021-08-04 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-13546: Summary: [Python] Breaking API change in FSSpecHandler, requires metadata argument Key: ARROW-13546 URL: https://issues.apache.org/jira/browse/ARROW-13546

[jira] [Created] (ARROW-10959) [C++] Add scalar string join kernel

2020-12-18 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-10959: Summary: [C++] Add scalar string join kernel Key: ARROW-10959 URL: https://issues.apache.org/jira/browse/ARROW-10959 Project: Apache Arrow Issue

[jira] [Created] (ARROW-10799) [C++] Take on string chunked arrays slow and fails

2020-12-03 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-10799: Summary: [C++] Take on string chunked arrays slow and fails Key: ARROW-10799 URL: https://issues.apache.org/jira/browse/ARROW-10799 Project: Apache Arrow

[jira] [Created] (ARROW-10739) [Python] Pickling a sliced array serializes all the buffers

2020-11-25 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-10739: Summary: [Python] Pickling a sliced array serializes all the buffers Key: ARROW-10739 URL: https://issues.apache.org/jira/browse/ARROW-10739 Project: Apache

[jira] [Created] (ARROW-10736) [Python] feather/arrow row splitting and counting (Dataset API)

2020-11-25 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-10736: Summary: [Python] feather/arrow row splitting and counting (Dataset API) Key: ARROW-10736 URL: https://issues.apache.org/jira/browse/ARROW-10736 Project:

[jira] [Created] (ARROW-10709) [Python] Difficult to make an efficient zero-copy file reader in Python

2020-11-24 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-10709: Summary: [Python] Difficult to make an efficient zero-copy file reader in Python Key: ARROW-10709 URL: https://issues.apache.org/jira/browse/ARROW-10709

[jira] [Created] (ARROW-10557) [C++] Add scalar string slicing/substring kernel

2020-11-11 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-10557: Summary: [C++] Add scalar string slicing/substring kernel Key: ARROW-10557 URL: https://issues.apache.org/jira/browse/ARROW-10557 Project: Apache Arrow

[jira] [Created] (ARROW-10556) [C++] Caching pre computed data based on FunctionOptions in the kernel state

2020-11-11 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-10556: Summary: [C++] Caching pre computed data based on FunctionOptions in the kernel state Key: ARROW-10556 URL: https://issues.apache.org/jira/browse/ARROW-10556

[jira] [Created] (ARROW-10541) [C++] Add re2 library to core arrow / ARROW_WITH_RE2

2020-11-10 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-10541: Summary: [C++] Add re2 library to core arrow / ARROW_WITH_RE2 Key: ARROW-10541 URL: https://issues.apache.org/jira/browse/ARROW-10541 Project: Apache Arrow

[jira] [Created] (ARROW-10306) [C++] Add string replacement kernel

2020-10-14 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-10306: Summary: [C++] Add string replacement kernel Key: ARROW-10306 URL: https://issues.apache.org/jira/browse/ARROW-10306 Project: Apache Arrow Issue

[jira] [Created] (ARROW-10209) [Python] support positional arguments for options in compute wrapper

2020-10-07 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-10209: Summary: [Python] support positional arguments for options in compute wrapper Key: ARROW-10209 URL: https://issues.apache.org/jira/browse/ARROW-10209

[jira] [Created] (ARROW-10208) [C++] comparing list arrays with nulls fails in test framework

2020-10-07 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-10208: Summary: [C++] comparing list arrays with nulls fails in test framework Key: ARROW-10208 URL: https://issues.apache.org/jira/browse/ARROW-10208 Project:

[jira] [Created] (ARROW-10207) [C++] Unary kernels that result in a list have no preallocated offset buffer

2020-10-07 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-10207: Summary: [C++] Unary kernels that result in a list have no preallocated offset buffer Key: ARROW-10207 URL: https://issues.apache.org/jira/browse/ARROW-10207

[jira] [Created] (ARROW-10195) [C++] Add string struct extract kernel using re2

2020-10-06 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-10195: Summary: [C++] Add string struct extract kernel using re2 Key: ARROW-10195 URL: https://issues.apache.org/jira/browse/ARROW-10195 Project: Apache Arrow

[jira] [Created] (ARROW-9991) [C++] split kernel for strings/binary

2020-09-14 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-9991: Summary: [C++] split kernel for strings/binary Key: ARROW-9991 URL: https://issues.apache.org/jira/browse/ARROW-9991 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-9471) [C++] Scan Dataset in reverse

2020-07-14 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-9471: Summary: [C++] Scan Dataset in reverse Key: ARROW-9471 URL: https://issues.apache.org/jira/browse/ARROW-9471 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-9458: Summary: [Python] Dataset singlethreaded only Key: ARROW-9458 URL: https://issues.apache.org/jira/browse/ARROW-9458 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-9456) [Python] Dataset segfault when not importing pyarrow.parquet

2020-07-14 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-9456: Summary: [Python] Dataset segfault when not importing pyarrow.parquet Key: ARROW-9456 URL: https://issues.apache.org/jira/browse/ARROW-9456 Project: Apache

[jira] [Created] (ARROW-9403) [Python] add .tolist as alias of to_pylist

2020-07-10 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-9403: Summary: [Python] add .tolist as alias of to_pylist Key: ARROW-9403 URL: https://issues.apache.org/jira/browse/ARROW-9403 Project: Apache Arrow Issue

[jira] [Created] (ARROW-9268) [C++] Add is{alnum,alpha,...} kernels for strings

2020-06-29 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-9268: Summary: [C++] Add is{alnum,alpha,...} kernels for strings Key: ARROW-9268 URL: https://issues.apache.org/jira/browse/ARROW-9268 Project: Apache Arrow

[jira] [Created] (ARROW-9133) [C++] Add utf8_upper and utf8_lower

2020-06-15 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-9133: Summary: [C++] Add utf8_upper and utf8_lower Key: ARROW-9133 URL: https://issues.apache.org/jira/browse/ARROW-9133 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-9131) [C++] Faster ascii_lower and ascii_upper

2020-06-15 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-9131: Summary: [C++] Faster ascii_lower and ascii_upper Key: ARROW-9131 URL: https://issues.apache.org/jira/browse/ARROW-9131 Project: Apache Arrow Issue

[jira] [Created] (ARROW-9100) Add ascii_lower kernel

2020-06-11 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-9100: Summary: Add ascii_lower kernel Key: ARROW-9100 URL: https://issues.apache.org/jira/browse/ARROW-9100 Project: Apache Arrow Issue Type: Task

[jira] [Commented] (ARROW-8990) [C++] Benchmark hash table against thirdparty options, possibly vendor a thirdparty hash table library

2020-06-01 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17121205#comment-17121205 ] Maarten Breddels commented on ARROW-8990: FYI, I've been using that library and

[jira] [Commented] (ARROW-8961) [C++] Vendor utf8proc library

2020-05-28 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17118713#comment-17118713 ] Maarten Breddels commented on ARROW-8961: FWIW, in Vaex I've relied on

[jira] [Commented] (ARROW-555) [C++] String algorithm library for StringArray/BinaryArray

2020-05-22 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17114105#comment-17114105 ] Maarten Breddels commented on ARROW-555: Sounds good. I think it would help me a lot to see

[jira] [Commented] (ARROW-8865) Windows distribution for 0.17.1 seems broken (conda only)

2020-05-20 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17112105#comment-17112105 ] Maarten Breddels commented on ARROW-8865: Thanks Joris, We got CI working by installing from

[jira] [Updated] (ARROW-8865) Windows distribution for 0.17.1 seems broken (conda only)

2020-05-19 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maarten Breddels updated ARROW-8865: Summary: Windows distribution for 0.17.1 seems broken (conda only) (was: windows

[jira] [Created] (ARROW-8865) windows distribution for 0.17.1 seems broken (conda only?

2020-05-19 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-8865: Summary: windows distribution for 0.17.1 seems broken (conda only? Key: ARROW-8865 URL: https://issues.apache.org/jira/browse/ARROW-8865 Project: Apache Arrow

[jira] [Commented] (ARROW-555) [C++] String algorithm library for StringArray/BinaryArray

2020-05-11 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17104666#comment-17104666 ] Maarten Breddels commented on ARROW-555: Something to consider (or should I move this discussion

[jira] [Commented] (ARROW-555) [C++] String algorithm library for StringArray/BinaryArray

2020-05-11 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17104525#comment-17104525 ] Maarten Breddels commented on ARROW-555: I am likely to be able to start working on strings in

[jira] [Commented] (ARROW-555) [C++] String algorithm library for StringArray/BinaryArray

2020-03-04 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17051644#comment-17051644 ] Maarten Breddels commented on ARROW-555: What are the limitations, and is this somewhere

[jira] [Commented] (ARROW-555) [C++] String algorithm library for StringArray/BinaryArray

2020-03-04 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17051264#comment-17051264 ] Maarten Breddels commented on ARROW-555: Related: https://issues.apache.org/jira/browse/ARROW-7083

[jira] [Commented] (ARROW-7396) [Format] Register media types (MIME types) for Apache Arrow formats to IANA

2019-12-17 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16998111#comment-16998111 ] Maarten Breddels commented on ARROW-7396: According to

[jira] [Commented] (ARROW-4810) [Format][C++] Add "LargeList" type with 64-bit offsets

2019-03-08 Thread Maarten Breddels (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16788160#comment-16788160 ] Maarten Breddels commented on ARROW-4810: I see BinaryArray/StringArray classes have similar

[jira] [Commented] (ARROW-4810) [Format][C++] Add "LargeList" type with 64-bit offsets

2019-03-08 Thread Maarten Breddels (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16788150#comment-16788150 ] Maarten Breddels commented on ARROW-4810: > Having arrays with > 2GB elements or binary arrays

[jira] [Commented] (ARROW-3685) [Python] Use fixed size binary for NumPy fixed-size string dtypes

2018-11-01 Thread Maarten Breddels (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16671614#comment-16671614 ] Maarten Breddels commented on ARROW-3685: I tried to make a PR, but it's opening a whole can of

[jira] [Created] (ARROW-3686) Support for masked arrays in to/from numpy

2018-11-01 Thread Maarten Breddels (JIRA)
Maarten Breddels created ARROW-3686: Summary: Support for masked arrays in to/from numpy Key: ARROW-3686 URL: https://issues.apache.org/jira/browse/ARROW-3686 Project: Apache Arrow Issue

[jira] [Commented] (ARROW-3685) [Python] Use fixed size binary for NumPy fixed-size string dtypes

2018-11-01 Thread Maarten Breddels (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16671515#comment-16671515 ] Maarten Breddels commented on ARROW-3685: Would you say this needs a change in to_pandas_dtype,

[jira] [Created] (ARROW-3685) Better roundtrip between numpy and arrow binary array

2018-11-01 Thread Maarten Breddels (JIRA)
Maarten Breddels created ARROW-3685: Summary: Better roundtrip between numpy and arrow binary array Key: ARROW-3685 URL: https://issues.apache.org/jira/browse/ARROW-3685 Project: Apache Arrow

[jira] [Created] (ARROW-3669) pyarrow swallows big endian arrow without converting or error msg

2018-11-01 Thread Maarten Breddels (JIRA)
Maarten Breddels created ARROW-3669: Summary: pyarrow swallows big endian arrow without converting or error msg Key: ARROW-3669 URL: https://issues.apache.org/jira/browse/ARROW-3669 Project: