[jira] [Assigned] (ARROW-18318) [Python] Expose Scalar.validate

2023-01-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-18318: -- Assignee: buaazhwb > [Python] Expose Scalar.validate >

[jira] [Commented] (ARROW-7594) [C++] Implement HTTP and FTP file systems

2023-01-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17654992#comment-17654992 ] Antoine Pitrou commented on ARROW-7594: --- [~icook] We'll need someone or something to allocate the

[jira] [Resolved] (ARROW-18195) [R][C++] Final value returned by case_when is NA when input has 64 or more values and 1 or more NAs

2023-01-04 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-18195. Resolution: Fixed Issue resolved by pull request 15131

[jira] [Resolved] (ARROW-18436) [Python] `FileSystem.from_uri` doesn't decode %-encoded characters in path

2023-01-03 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-18436. Resolution: Fixed Issue resolved by pull request 14974

[jira] [Resolved] (ARROW-18318) [Python] Expose Scalar.validate

2023-01-03 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-18318. Resolution: Fixed Issue resolved by pull request 15149

[jira] [Resolved] (ARROW-18202) [R][C++] Different behaviour of R's base::gsub() binding aka libarrow's replace_string_regex kernel since 10.0.0

2023-01-03 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-18202. Resolution: Fixed Issue resolved by pull request 15132

[jira] [Commented] (ARROW-12938) [C++] Investigate spawning arbitrary callbacks from StopToken

2022-12-20 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17649652#comment-17649652 ] Antoine Pitrou commented on ARROW-12938: Yes, this is possible even without a separate thread.

[jira] [Updated] (ARROW-18436) [Python] `FileSystem.from_uri` doesn't decode %-encoded characters in path

2022-12-15 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18436: --- Summary: [Python] `FileSystem.from_uri` doesn't decode %-encoded characters in path (was:

[jira] [Assigned] (ARROW-18436) `pyarrow.fs.FileSystem.from_uri` crashes when URI has a space

2022-12-15 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-18436: -- Assignee: Antoine Pitrou > `pyarrow.fs.FileSystem.from_uri` crashes when URI has a

[jira] [Updated] (ARROW-18436) [Python] `FileSystem.from_uri` doesn't decode %-encoded characters

2022-12-15 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18436: --- Summary: [Python] `FileSystem.from_uri` doesn't decode %-encoded characters (was:

[jira] [Updated] (ARROW-18436) `pyarrow.fs.FileSystem.from_uri` crashes when URI has a space

2022-12-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18436: --- Component/s: C++ > `pyarrow.fs.FileSystem.from_uri` crashes when URI has a space >

[jira] [Updated] (ARROW-18436) `pyarrow.fs.FileSystem.from_uri` crashes when URI has a space

2022-12-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18436: --- Fix Version/s: 11.0.0 > `pyarrow.fs.FileSystem.from_uri` crashes when URI has a space >

[jira] [Commented] (ARROW-18436) `pyarrow.fs.FileSystem.from_uri` crashes when URI has a space

2022-12-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17647715#comment-17647715 ] Antoine Pitrou commented on ARROW-18436: That's because the space needs to be encoded. However,

[jira] [Resolved] (ARROW-18106) [C++] JSON reader ignores explicit schema with default unexpected_field_behavior="infer"

2022-12-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-18106. Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14741

[jira] [Resolved] (ARROW-18435) [C++][Java] Update ORC to 1.8.1

2022-12-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-18435. Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14942

[jira] [Resolved] (ARROW-17798) [C++][Parquet] Add DELTA_BINARY_PACKED encoder to Parquet writer

2022-12-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-17798. Resolution: Fixed Issue resolved by pull request 14191

[jira] [Assigned] (ARROW-18423) [Python] Expose reading a schema from an IPC message

2022-12-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-18423: -- Assignee: Andre Kohn > [Python] Expose reading a schema from an IPC message >

[jira] [Resolved] (ARROW-18423) [Python] Expose reading a schema from an IPC message

2022-12-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-18423. Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14831

[jira] [Resolved] (ARROW-18420) [C++][Parquet] Introduce ColumnIndex and OffsetIndex

2022-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-18420. Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14803

[jira] [Resolved] (ARROW-17932) [C++] Implement streaming RecordBatchReader for JSON

2022-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-17932. Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14355

[jira] [Resolved] (ARROW-16430) [Python] Read/Write record batch custom metadata API in pyarrow

2022-12-12 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-16430. Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 13041

[jira] [Resolved] (ARROW-18421) [C++][ORC] Add accessor for number of rows by stripe in reader

2022-12-12 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-18421. Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14806

[jira] [Commented] (ARROW-18277) [R] Unable to install R's arrow on RStudio

2022-12-12 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17646121#comment-17646121 ] Antoine Pitrou commented on ARROW-18277: I think we can close it. > [R] Unable to install R's

[jira] [Commented] (ARROW-12264) [C++][Dataset] Handle NaNs correctly in Parquet predicate push-down

2022-12-12 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17645984#comment-17645984 ] Antoine Pitrou commented on ARROW-12264: That's right. > [C++][Dataset] Handle NaNs correctly

[jira] [Resolved] (ARROW-14999) [C++] List types with different field names are not equal

2022-12-08 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-14999. Resolution: Fixed Issue resolved by pull request 14847

[jira] [Updated] (ARROW-12264) [C++][Dataset] Handle NaNs correctly in Parquet predicate push-down

2022-12-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-12264: --- Issue Type: Bug (was: Task) > [C++][Dataset] Handle NaNs correctly in Parquet predicate

[jira] [Comment Edited] (ARROW-12264) [C++][Dataset] Handle NaNs correctly in Parquet predicate push-down

2022-12-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17644354#comment-17644354 ] Antoine Pitrou edited comment on ARROW-12264 at 12/7/22 2:08 PM: - cc

[jira] [Commented] (ARROW-12264) [C++][Dataset] Handle NaNs correctly in Parquet predicate push-down

2022-12-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17644354#comment-17644354 ] Antoine Pitrou commented on ARROW-12264: cc @westonpace > [C++][Dataset] Handle NaNs correctly

[jira] [Commented] (ARROW-13240) [C++][Parquet] Page statistics not written in v2?

2022-12-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17644350#comment-17644350 ] Antoine Pitrou commented on ARROW-13240: [~jorgecarleitao] Could you try to check if that still

[jira] [Commented] (ARROW-13240) [C++][Parquet] Page statistics not written in v2?

2022-12-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17644349#comment-17644349 ] Antoine Pitrou commented on ARROW-13240: [~emkornfield] When would that have happened? >

[jira] [Updated] (ARROW-13240) [C++][Parquet] Page statistics not written in v2?

2022-12-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-13240: --- Priority: Major (was: Minor) > [C++][Parquet] Page statistics not written in v2? >

[jira] [Resolved] (ARROW-18424) [C++] Fix Doxygen error on `arrow::engine::ConversionStrictness`

2022-12-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-18424. Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14845

[jira] [Updated] (ARROW-18424) [C++] Fix Doxygen error on `arrow::engine::ConversionStrictness`

2022-12-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18424: --- Priority: Trivial (was: Major) > [C++] Fix Doxygen error on

[jira] [Resolved] (ARROW-14161) [C++][Parquet][Docs] Reading/Writing Parquet Files

2022-12-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-14161. Resolution: Fixed Issue resolved by pull request 14018

[jira] [Resolved] (ARROW-18269) [C++] Slash character in partition value handling

2022-12-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-18269. Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14646

[jira] [Resolved] (ARROW-18419) [C++] Update vendored fast_float

2022-12-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-18419. Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14817

[jira] [Resolved] (ARROW-18413) [C++][Parquet] FileMetaData exposes page index metadata

2022-12-01 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-18413. Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14742

[jira] [Commented] (ARROW-17538) [C++] Importing an ArrowArrayStream can't handle errors from get_schema

2022-11-30 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17641430#comment-17641430 ] Antoine Pitrou commented on ARROW-17538: [~benpharkins] I don't know if you would like to carve

[jira] [Updated] (ARROW-17538) [C++] Importing an ArrowArrayStream can't handle errors from get_schema

2022-11-30 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-17538: --- Fix Version/s: 11.0.0 > [C++] Importing an ArrowArrayStream can't handle errors from

[jira] [Commented] (ARROW-18375) MIGRATION: Enable GitHub issue type labels

2022-11-30 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17641382#comment-17641382 ] Antoine Pitrou commented on ARROW-18375: I use "Type: enhancement" for user-visible enhancements

[jira] [Commented] (ARROW-18375) MIGRATION: Enable GitHub issue type labels

2022-11-30 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17641379#comment-17641379 ] Antoine Pitrou commented on ARROW-18375: I don't understand why "Type: test" is for either :-)

[jira] [Updated] (ARROW-18373) MIGRATION: Enable multiple component selection in issue templates

2022-11-30 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18373: --- Labels: gh-migration pull-request-available (was: pull-request-available) > MIGRATION:

[jira] [Updated] (ARROW-18378) MIGRATION: Disable issue reporting in ASF Jira

2022-11-30 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18378: --- Labels: gh-migration (was: ) > MIGRATION: Disable issue reporting in ASF Jira >

[jira] [Updated] (ARROW-18381) MIGRATION: Create milestones for every needed fix version

2022-11-30 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18381: --- Labels: gh-migration (was: ) > MIGRATION: Create milestones for every needed fix version >

[jira] [Updated] (ARROW-18364) MIGRATION: Update GitHub issue templates to support bug reports and feature requests

2022-11-30 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18364: --- Labels: gh-migration (was: ) > MIGRATION: Update GitHub issue templates to support bug

[jira] [Updated] (ARROW-18377) MIGRATION: Automate component labels from issue form content

2022-11-30 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18377: --- Labels: gh-migration (was: ) > MIGRATION: Automate component labels from issue form

[jira] [Updated] (ARROW-18376) MIGRATION: Add component labels to GitHub

2022-11-30 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18376: --- Labels: gh-migration (was: ) > MIGRATION: Add component labels to GitHub >

[jira] [Commented] (ARROW-18371) [C++] Expose *FromJSON helpers

2022-11-30 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17641250#comment-17641250 ] Antoine Pitrou commented on ARROW-18371: We would have to prefix those macros with {{ARROW_}}.

[jira] [Commented] (ARROW-18375) MIGRATION: Enable GitHub issue type labels

2022-11-30 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17641234#comment-17641234 ] Antoine Pitrou commented on ARROW-18375: cc [~jorisvandenbossche] [~assignUser] > MIGRATION:

[jira] [Commented] (ARROW-18375) MIGRATION: Enable GitHub issue type labels

2022-11-30 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17641233#comment-17641233 ] Antoine Pitrou commented on ARROW-18375: I renamed the existing "bug", "enhancement" and "usage"

[jira] [Commented] (ARROW-13221) [C++] arrow_reader_writer_test.cc slow to compile

2022-11-30 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17641195#comment-17641195 ] Antoine Pitrou commented on ARROW-13221: bq. It seems that 1) may not reduce total build time.

[jira] [Commented] (ARROW-13221) [C++] arrow_reader_writer_test.cc slow to compile

2022-11-29 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17640576#comment-17640576 ] Antoine Pitrou commented on ARROW-13221: I think there are two aspects to this: 1) split the

[jira] [Commented] (ARROW-18039) [C++][CI] Reduce MinGW build times

2022-11-29 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17640491#comment-17640491 ] Antoine Pitrou commented on ARROW-18039: Yes, it was already reported at

[jira] [Resolved] (ARROW-17836) [C++] Allow specifying of alignment in MemoryPool's allocations

2022-11-25 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-17836. Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14225

[jira] [Updated] (ARROW-18400) [Python] Quadratic memory usage of Table.to_pandas with nested data

2022-11-24 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18400: --- Fix Version/s: 11.0.0 > [Python] Quadratic memory usage of Table.to_pandas with nested data

[jira] [Commented] (ARROW-18400) [Python] Quadratic memory usage of Table.to_pandas with nested data

2022-11-24 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17638234#comment-17638234 ] Antoine Pitrou commented on ARROW-18400: [~alenka] [~milesgranger] This seems like something

[jira] [Updated] (ARROW-18400) Quadratic memory usage of Table.to_pandas with nested data

2022-11-24 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18400: --- Priority: Critical (was: Major) > Quadratic memory usage of Table.to_pandas with nested

[jira] [Updated] (ARROW-18400) [Python] Quadratic memory usage of Table.to_pandas with nested data

2022-11-24 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18400: --- Summary: [Python] Quadratic memory usage of Table.to_pandas with nested data (was:

[jira] [Resolved] (ARROW-17859) [C++] Use self-pipe in signal-receiving StopSource

2022-11-23 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-17859. Resolution: Fixed Issue resolved by pull request 14250

[jira] [Created] (ARROW-18399) [Python] Reduce warnings during tests

2022-11-23 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-18399: -- Summary: [Python] Reduce warnings during tests Key: ARROW-18399 URL: https://issues.apache.org/jira/browse/ARROW-18399 Project: Apache Arrow Issue Type:

[jira] [Commented] (ARROW-18399) [Python] Reduce warnings during tests

2022-11-23 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17637891#comment-17637891 ] Antoine Pitrou commented on ARROW-18399: cc [~milesgranger] > [Python] Reduce warnings during

[jira] [Resolved] (ARROW-18392) [CI][Python] Some nightly python tests fail due to ACCESS DENIED to S3 bucket

2022-11-23 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-18392. Resolution: Fixed Issue resolved by pull request 14716

[jira] [Commented] (ARROW-18398) [C++] Sporadic error in StressSourceGroupedSumStop

2022-11-23 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17637880#comment-17637880 ] Antoine Pitrou commented on ARROW-18398: cc [~westonpace] > [C++] Sporadic error in

[jira] [Created] (ARROW-18398) [C++] Sporadic error in StressSourceGroupedSumStop

2022-11-23 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-18398: -- Summary: [C++] Sporadic error in StressSourceGroupedSumStop Key: ARROW-18398 URL: https://issues.apache.org/jira/browse/ARROW-18398 Project: Apache Arrow

[jira] [Updated] (ARROW-18395) [C++] Move select-k implementation into separate module

2022-11-23 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18395?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18395: --- Labels: good-second-issue (was: ) > [C++] Move select-k implementation into separate

[jira] [Updated] (ARROW-18396) [C++] Move rank implementation into separate module

2022-11-23 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18396: --- Labels: good-second-issue (was: ) > [C++] Move rank implementation into separate module >

[jira] [Commented] (ARROW-18396) [C++] Move rank implementation into separate module

2022-11-23 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17637870#comment-17637870 ] Antoine Pitrou commented on ARROW-18396: cc [~benpharkins] if/when you have time for a

[jira] [Commented] (ARROW-18395) [C++] Move select-k implementation into separate module

2022-11-23 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18395?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17637869#comment-17637869 ] Antoine Pitrou commented on ARROW-18395: cc [~benpharkins] if/when you have time for a

[jira] [Resolved] (ARROW-18383) [C++] Avoid global variables for thread pools and at-fork handlers

2022-11-23 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-18383. Resolution: Fixed Issue resolved by pull request 14704

[jira] [Created] (ARROW-18397) [C++] Clear S3 region resolver client at S3 shutdown

2022-11-23 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-18397: -- Summary: [C++] Clear S3 region resolver client at S3 shutdown Key: ARROW-18397 URL: https://issues.apache.org/jira/browse/ARROW-18397 Project: Apache Arrow

[jira] [Resolved] (ARROW-18350) [C++] Use std::to_chars instead of std::to_string

2022-11-23 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-18350. Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14666

[jira] [Created] (ARROW-18396) [C++] Move rank implementation into separate module

2022-11-23 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-18396: -- Summary: [C++] Move rank implementation into separate module Key: ARROW-18396 URL: https://issues.apache.org/jira/browse/ARROW-18396 Project: Apache Arrow

[jira] [Created] (ARROW-18395) [C++] Move select-k implementation into separate module

2022-11-23 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-18395: -- Summary: [C++] Move select-k implementation into separate module Key: ARROW-18395 URL: https://issues.apache.org/jira/browse/ARROW-18395 Project: Apache Arrow

[jira] [Commented] (ARROW-18381) MIGRATION: Create milestones for every needed fix version

2022-11-23 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17637624#comment-17637624 ] Antoine Pitrou commented on ARROW-18381: bq. This means that 314 legacy issues will lose

[jira] [Created] (ARROW-18383) [C++] Avoid global variables for thread pools and at-fork handlers

2022-11-22 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-18383: -- Summary: [C++] Avoid global variables for thread pools and at-fork handlers Key: ARROW-18383 URL: https://issues.apache.org/jira/browse/ARROW-18383 Project:

[jira] [Created] (ARROW-18382) [C++] "ADDRESS_SANITIZER" not defined in fuzzing builds

2022-11-22 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-18382: -- Summary: [C++] "ADDRESS_SANITIZER" not defined in fuzzing builds Key: ARROW-18382 URL: https://issues.apache.org/jira/browse/ARROW-18382 Project: Apache Arrow

[jira] [Resolved] (ARROW-4709) [C++] Optimize for ordered JSON fields

2022-11-22 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-4709. --- Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14100

[jira] [Commented] (ARROW-13677) [C++] Improve performance of unpack64

2022-11-22 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17637299#comment-17637299 ] Antoine Pitrou commented on ARROW-13677: [~benpharkins] This can be a reasonable undertaking,

[jira] [Resolved] (ARROW-17985) [Python][C++] Opaque error code ([code: 100]), when not setting region

2022-11-22 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-17985. Resolution: Fixed Issue resolved by pull request 14601

[jira] [Resolved] (ARROW-18343) [C++] AllocateBitmap() with out parameter is declared but not defined

2022-11-21 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-18343. Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14657

[jira] [Commented] (ARROW-18371) [C++] Expose *FromJSON helpers

2022-11-21 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17636794#comment-17636794 ] Antoine Pitrou commented on ARROW-18371: > I assume the comment is regarding BatchesWithSchema

[jira] [Commented] (ARROW-18371) [C++] Expose *FromJSON helpers

2022-11-21 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17636747#comment-17636747 ] Antoine Pitrou commented on ARROW-18371: Definitely not. These are functions generating ad hoc

[jira] [Updated] (ARROW-18362) [C++] Accelerate Parquet bit-packing decoding with ICX AVX-512

2022-11-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18362: --- Summary: [C++] Accelerate Parquet bit-packing decoding with ICX AVX-512 (was: Accelerate

[jira] [Updated] (ARROW-18362) [C++] Accelerate Parquet bit-packing decoding with AVX-512

2022-11-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18362: --- Summary: [C++] Accelerate Parquet bit-packing decoding with AVX-512 (was: [C++] Accelerate

[jira] [Updated] (ARROW-18362) [C++] Accelerate Parquet bit-packing decoding with AVX-512

2022-11-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18362: --- Description: Accelerate Parquet bit-packing decoding with AVX-512 instructions? (was: h1.

[jira] [Commented] (ARROW-18362) Accelerate Parquet bit-packing decoding with ICX AVX-512

2022-11-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17635863#comment-17635863 ] Antoine Pitrou commented on ARROW-18362: Are you willing to contribute this? > Accelerate

[jira] [Created] (ARROW-18353) [C++][Flight] Sporadic hang in UCX tests

2022-11-17 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-18353: -- Summary: [C++][Flight] Sporadic hang in UCX tests Key: ARROW-18353 URL: https://issues.apache.org/jira/browse/ARROW-18353 Project: Apache Arrow Issue

[jira] [Commented] (ARROW-18351) [C++][Flight] Crash in UcxErrorHandlingTest.TestDoExchange

2022-11-17 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17635483#comment-17635483 ] Antoine Pitrou commented on ARROW-18351: cc [~lidavidm] [~yibocai] > [C++][Flight] Crash in

[jira] [Created] (ARROW-18351) [C++][Flight] Crash in UcxErrorHandlingTest.TestDoExchange

2022-11-17 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-18351: -- Summary: [C++][Flight] Crash in UcxErrorHandlingTest.TestDoExchange Key: ARROW-18351 URL: https://issues.apache.org/jira/browse/ARROW-18351 Project: Apache Arrow

[jira] [Created] (ARROW-18350) [C++] Use std::to_chars instead of std::to_string

2022-11-17 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-18350: -- Summary: [C++] Use std::to_chars instead of std::to_string Key: ARROW-18350 URL: https://issues.apache.org/jira/browse/ARROW-18350 Project: Apache Arrow

[jira] [Commented] (ARROW-18349) [CI][C++][Flight] Exercise UCX on CI

2022-11-17 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17635471#comment-17635471 ] Antoine Pitrou commented on ARROW-18349: cc [~yibocai] [~lidavidm] [~kou] > [CI][C++][Flight]

[jira] [Created] (ARROW-18349) [CI][C++][Flight] Exercise UCX on CI

2022-11-17 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-18349: -- Summary: [CI][C++][Flight] Exercise UCX on CI Key: ARROW-18349 URL: https://issues.apache.org/jira/browse/ARROW-18349 Project: Apache Arrow Issue Type:

[jira] [Assigned] (ARROW-16817) [C++][Python] Segfaults for unsupported datatypes in the ORC writer

2022-11-17 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-16817: -- Assignee: Will Jones (was: Ian Alexander Joiner) > [C++][Python] Segfaults for

[jira] [Assigned] (ARROW-16817) [C++][Python] Segfaults for unsupported datatypes in the ORC writer

2022-11-17 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-16817: -- Assignee: Ian Alexander Joiner (was: Will Jones) > [C++][Python] Segfaults for

[jira] [Resolved] (ARROW-16817) [C++][Python] Segfaults for unsupported datatypes in the ORC writer

2022-11-17 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-16817. Resolution: Fixed Issue resolved by pull request 14638

[jira] [Resolved] (ARROW-15538) [C++] Create mapping from Substrait "standard functions" to Arrow equivalents

2022-11-17 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-15538. Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14434

[jira] [Commented] (ARROW-18344) [C++] Use input pre-sortedness to create sorted table with ConcatenateTables

2022-11-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17634997#comment-17634997 ] Antoine Pitrou commented on ARROW-18344: That's what we should do indeed. But do we want the

[jira] [Comment Edited] (ARROW-18344) [C++] Use input pre-sortedness to create sorted table with ConcatenateTables

2022-11-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17634877#comment-17634877 ] Antoine Pitrou edited comment on ARROW-18344 at 11/16/22 3:11 PM: -- We

[jira] [Commented] (ARROW-18344) [C++] Use input pre-sortedness to create sorted table with ConcatenateTables

2022-11-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17634877#comment-17634877 ] Antoine Pitrou commented on ARROW-18344: We don't actually sort data in Arrow, we produce

[jira] [Resolved] (ARROW-17825) [C++] Allow to write several tables successively with ORCFileWriter::Write method

2022-11-15 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-17825. Resolution: Fixed Issue resolved by pull request 14219

  1   2   3   4   5   6   7   8   9   10   >