[jira] [Commented] (ARROW-7939) [Python] crashes when reading parquet file compressed with snappy

2020-06-24 Thread Nicolas Elie (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17143853#comment-17143853 ] Nicolas Elie commented on ARROW-7939: - So I created a new virtual environment with just ipython from

[jira] [Commented] (ARROW-7939) [Python] crashes when reading parquet file compressed with snappy

2020-06-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17143823#comment-17143823 ] Wes McKinney commented on ARROW-7939: - [~n-elie] I'm pretty sure this is caused by

[jira] [Updated] (ARROW-9217) [C++] Cover 0.01% null for the plain spaced encoding/decoding benchmark

2020-06-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9217: -- Labels: pull-request-available (was: ) > [C++] Cover 0.01% null for the plain spaced

[jira] [Resolved] (ARROW-9215) pyarrow parquet writer converts uint32 columns to int64

2020-06-24 Thread Uwe Korn (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Korn resolved ARROW-9215. - Assignee: Uwe Korn Resolution: Not A Problem > pyarrow parquet writer converts uint32 columns to

[jira] [Commented] (ARROW-9215) pyarrow parquet writer converts uint32 columns to int64

2020-06-24 Thread Uwe Korn (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17143603#comment-17143603 ] Uwe Korn commented on ARROW-9215: - This is expected behaviour as long as you are writing Parquet files

[jira] [Created] (ARROW-9216) [C++] Use BitBlockCounter for plain spaced encoding/decoding

2020-06-24 Thread Frank Du (Jira)
Frank Du created ARROW-9216: --- Summary: [C++] Use BitBlockCounter for plain spaced encoding/decoding Key: ARROW-9216 URL: https://issues.apache.org/jira/browse/ARROW-9216 Project: Apache Arrow

[jira] [Updated] (ARROW-9216) [C++] Use BitBlockCounter for plain spaced encoding/decoding

2020-06-24 Thread Frank Du (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Frank Du updated ARROW-9216: Description: Speedup the typical use case which most datas are true values > [C++] Use BitBlockCounter for

[jira] [Updated] (ARROW-9216) [C++] Use BitBlockCounter for plain spaced encoding/decoding

2020-06-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9216: -- Labels: pull-request-available (was: ) > [C++] Use BitBlockCounter for plain spaced

[jira] [Commented] (ARROW-7939) [Python] crashes when reading parquet file compressed with snappy

2020-06-24 Thread Nicolas Elie (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17143720#comment-17143720 ] Nicolas Elie commented on ARROW-7939: - Also tried with a nightly build and the same crash happens >

[jira] [Updated] (ARROW-7375) [Python] Expose C++ MakeArrayOfNull

2020-06-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-7375: -- Labels: pull-request-available (was: ) > [Python] Expose C++ MakeArrayOfNull >

[jira] [Commented] (ARROW-7939) [Python] crashes when reading parquet file compressed with snappy

2020-06-24 Thread Nicolas Elie (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17143714#comment-17143714 ] Nicolas Elie commented on ARROW-7939: - Hello, I just faced the same issue with pyarrow 0.17.1

[jira] [Created] (ARROW-9217) [C++] Cover 0.01% null for the plain spaced encoding/decoding benchmark

2020-06-24 Thread Frank Du (Jira)
Frank Du created ARROW-9217: --- Summary: [C++] Cover 0.01% null for the plain spaced encoding/decoding benchmark Key: ARROW-9217 URL: https://issues.apache.org/jira/browse/ARROW-9217 Project: Apache Arrow

[jira] [Resolved] (ARROW-9099) [C++][Gandiva] Add TRIM function for string

2020-06-24 Thread Praveen Kumar (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Praveen Kumar resolved ARROW-9099. -- Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7402

[jira] [Updated] (ARROW-9099) [C++][Gandiva] Add TRIM function for string

2020-06-24 Thread Praveen Kumar (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Praveen Kumar updated ARROW-9099: - Component/s: C++ - Gandiva > [C++][Gandiva] Add TRIM function for string >

[jira] [Reopened] (ARROW-7939) [Python] crashes when reading parquet file compressed with snappy

2020-06-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reopened ARROW-7939: - > [Python] crashes when reading parquet file compressed with snappy >

[jira] [Updated] (ARROW-9218) [R] Numeric columns turn to string when imported in R

2020-06-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9218: Summary: [R] Numeric columns turn to string when imported in R (was: Numeric columns turn to

[jira] [Created] (ARROW-9218) Numeric columns turn to string when imported in R

2020-06-24 Thread David Cortes (Jira)
David Cortes created ARROW-9218: --- Summary: Numeric columns turn to string when imported in R Key: ARROW-9218 URL: https://issues.apache.org/jira/browse/ARROW-9218 Project: Apache Arrow Issue

[jira] [Commented] (ARROW-7939) [Python] crashes when reading parquet file compressed with snappy

2020-06-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17143965#comment-17143965 ] Wes McKinney commented on ARROW-7939: - What processor does your machine have? If you can produce a

[jira] [Comment Edited] (ARROW-8780) [Python] A fsspec-compatible wrapper for pyarrow.fs filesystems

2020-06-24 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-8780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17143898#comment-17143898 ] Fabian Höring edited comment on ARROW-8780 at 6/24/20, 4:57 PM: Thanks

[jira] [Commented] (ARROW-8780) [Python] A fsspec-compatible wrapper for pyarrow.fs filesystems

2020-06-24 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-8780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17143898#comment-17143898 ] Fabian Höring commented on ARROW-8780: -- Thanks for the ticket. I'll close

[jira] [Closed] (ARROW-7584) [Python] Improve ergonomics of new FileSystem API

2020-06-24 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-7584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Fabian Höring closed ARROW-7584. Resolution: Duplicate > [Python] Improve ergonomics of new FileSystem API >

[jira] [Resolved] (ARROW-9217) [C++][Parquet] Cover 0.01% null for the plain spaced encoding/decoding benchmark

2020-06-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-9217. - Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7532

[jira] [Resolved] (ARROW-6945) [Rust] Enable integration tests

2020-06-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6945?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-6945. Resolution: Fixed Issue resolved by pull request 7297

[jira] [Assigned] (ARROW-6945) [Rust] Enable integration tests

2020-06-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6945?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-6945: -- Assignee: Neal Richardson (was: Andy Grove) > [Rust] Enable integration tests >

[jira] [Commented] (ARROW-7875) [Java] Decimal place getting shifted

2020-06-24 Thread Larry Parker (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7875?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17144113#comment-17144113 ] Larry Parker commented on ARROW-7875: - Just wanted to let you know that the problem was in the Denodo

[jira] [Updated] (ARROW-9217) [C++][Parquet] Cover 0.01% null for the plain spaced encoding/decoding benchmark

2020-06-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9217: Summary: [C++][Parquet] Cover 0.01% null for the plain spaced encoding/decoding benchmark (was:

[jira] [Updated] (ARROW-9217) [C++] Cover 0.01% null for the plain spaced encoding/decoding benchmark

2020-06-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9217: Component/s: C++ > [C++] Cover 0.01% null for the plain spaced encoding/decoding benchmark >

[jira] [Commented] (ARROW-9218) Numeric columns turn to string when imported in R

2020-06-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17143911#comment-17143911 ] Neal Richardson commented on ARROW-9218: Could you please share code and/or a sample file that

[jira] [Assigned] (ARROW-9146) [C++][Dataset] Scanning a Fragment with a filter + mismatching schema shouldn't abort

2020-06-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-9146: -- Assignee: Ben Kietzman (was: Neal Richardson) > [C++][Dataset] Scanning a Fragment

[jira] [Assigned] (ARROW-9146) [C++][Dataset] Scanning a Fragment with a filter + mismatching schema shouldn't abort

2020-06-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-9146: -- Assignee: Neal Richardson (was: Ben Kietzman) > [C++][Dataset] Scanning a Fragment

[jira] [Assigned] (ARROW-6235) [R] Conversion from arrow::BinaryArray to R character vector not implemented

2020-06-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-6235: -- Assignee: Romain Francois (was: Neal Richardson) > [R] Conversion from

[jira] [Assigned] (ARROW-6235) [R] Conversion from arrow::BinaryArray to R character vector not implemented

2020-06-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-6235: -- Assignee: Neal Richardson (was: Romain Francois) > [R] Conversion from

[jira] [Assigned] (ARROW-9219) [R] coerce_timestamps in Parquet write options does not work

2020-06-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-9219: -- Assignee: Neal Richardson > [R] coerce_timestamps in Parquet write options does not

[jira] [Commented] (ARROW-9219) [R] coerce_timestamps in Parquet write options does not work

2020-06-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17144369#comment-17144369 ] Neal Richardson commented on ARROW-9219: No worries--the report itself is plenty of help, so

[jira] [Assigned] (ARROW-8563) [Go] Minor change to make newBuilder public

2020-06-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8563: --- Assignee: Amol Umbarkar > [Go] Minor change to make newBuilder public >

[jira] [Updated] (ARROW-9219) [R] coerce_timestamps in Parquet write options does not work

2020-06-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-9219: --- Summary: [R] coerce_timestamps in Parquet write options does not work (was:

[jira] [Updated] (ARROW-9219) [R] coerce_timestamps in Parquet write options does not work

2020-06-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-9219: --- Labels: (was: newbie) > [R] coerce_timestamps in Parquet write options does not work >

[jira] [Updated] (ARROW-9219) [R] coerce_timestamps in Parquet write options does not work

2020-06-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-9219: --- External issue URL:

[jira] [Updated] (ARROW-9219) [R] coerce_timestamps in Parquet write options does not work

2020-06-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-9219: --- Fix Version/s: 1.0.0 > [R] coerce_timestamps in Parquet write options does not work >

[jira] [Updated] (ARROW-6995) [Packaging][Crossbow] The windows conda artifacts are not uploaded to GitHub releases

2020-06-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6995: Fix Version/s: (was: 1.0.0) 2.0.0 > [Packaging][Crossbow] The windows conda

[jira] [Updated] (ARROW-6940) [C++] Expose Message-level IPC metadata in both read and write interfaces

2020-06-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6940: Fix Version/s: (was: 1.0.0) 2.0.0 > [C++] Expose Message-level IPC metadata

[jira] [Resolved] (ARROW-8961) [C++] Add utf8proc library to toolchain

2020-06-24 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou resolved ARROW-8961. - Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7452

[jira] [Resolved] (ARROW-9091) [C++] Utilize function's default options when passing no options to CallFunction for a function that requires them

2020-06-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-9091. - Resolution: Fixed Issue resolved by pull request 7498

[jira] [Updated] (ARROW-8729) [C++][Dataset] Only selecting a partition column results in empty table

2020-06-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8729: -- Labels: dataset dataset-dask-integration pull-request-available (was: dataset

[jira] [Commented] (ARROW-3308) [R] Convert R character vector with data exceeding 2GB to chunked array

2020-06-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17144390#comment-17144390 ] Wes McKinney commented on ARROW-3308: - Always using the chunked builder is actually faster I've found

[jira] [Resolved] (ARROW-7375) [Python] Expose C++ MakeArrayOfNull

2020-06-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-7375. - Resolution: Fixed Issue resolved by pull request 7533

[jira] [Created] (ARROW-9219) coerce_timestamps does not work in write_parquet in Apache arrow R package

2020-06-24 Thread Slim Bentami (Jira)
Slim Bentami created ARROW-9219: --- Summary: coerce_timestamps does not work in write_parquet in Apache arrow R package Key: ARROW-9219 URL: https://issues.apache.org/jira/browse/ARROW-9219 Project:

[jira] [Commented] (ARROW-9064) optimization debian package manager tweaks

2020-06-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17144405#comment-17144405 ] Wes McKinney commented on ARROW-9064: - I closed the PR as there were too many test failures >

[jira] [Commented] (ARROW-3308) [R] Convert R character vector with data exceeding 2GB to chunked array

2020-06-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17144412#comment-17144412 ] Wes McKinney commented on ARROW-3308: - Yeah, either that or error. Does it error now? > [R] Convert

[jira] [Resolved] (ARROW-9183) [C++] Failed to build arrow-cpp with gcc 4.9.2

2020-06-24 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman resolved ARROW-9183. - Resolution: Fixed Issue resolved by pull request 7493

[jira] [Commented] (ARROW-9219) coerce_timestamps does not work in write_parquet in Apache arrow R package

2020-06-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17144256#comment-17144256 ] Neal Richardson commented on ARROW-9219: It's a bug. Only one argument is passed to the function,

[jira] [Comment Edited] (ARROW-9219) [R] coerce_timestamps in Parquet write options does not work

2020-06-24 Thread Slim Bentami (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17144364#comment-17144364 ] Slim Bentami edited comment on ARROW-9219 at 6/24/20, 8:40 PM: --- i would

[jira] [Commented] (ARROW-9219) [R] coerce_timestamps in Parquet write options does not work

2020-06-24 Thread Slim Bentami (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17144364#comment-17144364 ] Slim Bentami commented on ARROW-9219: - i would love to do what i can to help but i'm afraid i am too

[jira] [Updated] (ARROW-8991) [C++][Compute] Add scalar_hash function

2020-06-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8991: Fix Version/s: (was: 1.0.0) 2.0.0 > [C++][Compute] Add scalar_hash function

[jira] [Resolved] (ARROW-7925) [C++][Documentation] Instructions about running IWYU and other tasks in cpp/development.rst have gone stale

2020-06-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-7925. - Resolution: Fixed Issue resolved by pull request 7538

[jira] [Resolved] (ARROW-4309) [Documentation] Add a docker-compose entry which builds the documentation with CUDA enabled

2020-06-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-4309. - Resolution: Fixed Issue resolved by pull request 7439

[jira] [Updated] (ARROW-842) [Python] Handle more kinds of null sentinel objects from pandas 0.x

2020-06-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-842: - Labels: pull-request-available (was: ) > [Python] Handle more kinds of null sentinel objects

[jira] [Commented] (ARROW-9219) [R] coerce_timestamps in Parquet write options does not work

2020-06-24 Thread Slim Bentami (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17144496#comment-17144496 ] Slim Bentami commented on ARROW-9219: - glad it helps. when do you think the fix would be available

[jira] [Updated] (ARROW-8647) [C++][Dataset] Optionally encode partition field values as dictionary type

2020-06-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8647: -- Labels: dataset dataset-dask-integration pull-request-available (was: dataset

[jira] [Commented] (ARROW-9211) [Python] ArrowInvalid error raised when deserialising pandas with pd.NaT values in object column

2020-06-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17144465#comment-17144465 ] Wes McKinney commented on ARROW-9211: - Well, raising an exception is strictly better than returning

[jira] [Created] (ARROW-9221) ArrowBuf#setBytes(int, ByteBuffer) doesn't check the byte buffer's endianness

2020-06-24 Thread David Li (Jira)
David Li created ARROW-9221: --- Summary: ArrowBuf#setBytes(int, ByteBuffer) doesn't check the byte buffer's endianness Key: ARROW-9221 URL: https://issues.apache.org/jira/browse/ARROW-9221 Project: Apache

[jira] [Created] (ARROW-9222) [Format][Proposal] Remove validity bitmap from Union types

2020-06-24 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-9222: --- Summary: [Format][Proposal] Remove validity bitmap from Union types Key: ARROW-9222 URL: https://issues.apache.org/jira/browse/ARROW-9222 Project: Apache Arrow

[jira] [Created] (ARROW-9223) Fix to_pandas() export for timestamps within structs

2020-06-24 Thread Micah Kornfield (Jira)
Micah Kornfield created ARROW-9223: -- Summary: Fix to_pandas() export for timestamps within structs Key: ARROW-9223 URL: https://issues.apache.org/jira/browse/ARROW-9223 Project: Apache Arrow

[jira] [Commented] (ARROW-3308) [R] Convert R character vector with data exceeding 2GB to chunked array

2020-06-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17144395#comment-17144395 ] Neal Richardson commented on ARROW-3308: IIUC that works if you're converting to a Table, but

[jira] [Commented] (ARROW-9215) pyarrow parquet writer converts uint32 columns to int64

2020-06-24 Thread Devavret Makkar (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17144637#comment-17144637 ] Devavret Makkar commented on ARROW-9215: bq. as uint32 columns can be larger than the range of