[jira] [Created] (ARROW-17193) [C++] Building GCS and tests on M1 MacOS 12.05 is failing.

2022-07-23 Thread Rok Mihevc (Jira)
Rok Mihevc created ARROW-17193:
--

 Summary: [C++] Building GCS and tests on M1 MacOS 12.05 is failing.
 Key: ARROW-17193
 URL: https://issues.apache.org/jira/browse/ARROW-17193
 Project: Apache Arrow
  Issue Type: Bug
  Components: C++
Affects Versions: 8.0.0
Reporter: Rok Mihevc


Building GCS and tests on M1 macOS 12.05 with dependencies installed with 
Homebrew is failing.

{code:bash}
cmake \
-GNinja \
-DCMAKE_INSTALL_PREFIX=$ARROW_HOME \
-DCMAKE_INSTALL_LIBDIR=lib \
-DARROW_PYTHON=ON \
-DARROW_COMPUTE=ON \
-DARROW_FILESYSTEM=ON \
-DARROW_CSV=ON \
-DARROW_GCS=ON \
-DARROW_INSTALL_NAME_RPATH=OFF \
-DARROW_BUILD_TESTS=ON \
-DCMAKE_CXX_STANDARD=17 \
..
{code}

The build fails at link time with:

{noformat}
Undefined symbols for architecture arm64:
  "absl::lts_20220623::FormatTime(std::__1::basic_string_view >, absl::lts_20220623::Time, absl::lts_20220623::TimeZone)", referenced from:
      arrow::fs::(anonymous namespace)::GcsIntegrationTest_OpenInputStreamReadMetadata_Test::TestBody() in gcsfs_test.cc.o
  "absl::lts_20220623::FromChrono(std::__1::chrono::time_point > > const&)", referenced from:
      arrow::fs::(anonymous namespace)::GcsIntegrationTest_OpenInputStreamReadMetadata_Test::TestBody() in gcsfs_test.cc.o
  "absl::lts_20220623::RFC3339_full", referenced from:
      arrow::fs::(anonymous namespace)::GcsFileSystem_ObjectMetadataRoundtrip_Test::TestBody() in gcsfs_test.cc.o
      arrow::fs::(anonymous namespace)::GcsIntegrationTest_OpenInputStreamReadMetadata_Test::TestBody() in gcsfs_test.cc.o
  "absl::lts_20220623::time_internal::cctz::utc_time_zone()", referenced from:
      arrow::fs::(anonymous namespace)::GcsIntegrationTest_OpenInputStreamReadMetadata_Test::TestBody() in gcsfs_test.cc.o
  "absl::lts_20220623::ToDoubleSeconds(absl::lts_20220623::Duration)", referenced from:
      arrow::fs::(anonymous namespace)::GcsFileSystem_ObjectMetadataRoundtrip_Test::TestBody() in gcsfs_test.cc.o
  "absl::lts_20220623::Duration::operator-=(absl::lts_20220623::Duration)", referenced from:
      arrow::fs::(anonymous namespace)::GcsFileSystem_ObjectMetadataRoundtrip_Test::TestBody() in gcsfs_test.cc.o
  "absl::lts_20220623::ParseTime(std::__1::basic_string_view >, std::__1::basic_string_view >, absl::lts_20220623::Time*, std::__1::basic_string, std::__1::allocator >*)", referenced from:
      arrow::fs::(anonymous namespace)::GcsFileSystem_ObjectMetadataRoundtrip_Test::TestBody() in gcsfs_test.cc.o
{noformat}


Dependencies were installed with:
{noformat}
brew update && brew bundle --file=cpp/Brewfile
{noformat}

See https://github.com/apache/arrow/pull/13681#issuecomment-1193241547 and  
https://github.com/apache/arrow/pull/13407



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17192) .to_pandas can't read_feather if a date column contains dates before 1677 and after 2262

2022-07-23 Thread Adrien Pacifico (Jira)
Adrien Pacifico created ARROW-17192:
---

 Summary: .to_pandas  can't read_feather if a date column contains 
dates before 1677 and after 2262
 Key: ARROW-17192
 URL: https://issues.apache.org/jira/browse/ARROW-17192
 Project: Apache Arrow
  Issue Type: Bug
  Components: Python
 Environment: Any environment
Reporter: Adrien Pacifico


A Feather file with a column containing dates earlier than 1677 or later than 
2262 cannot be read into pandas, due to the `.to_pandas` method.

To reproduce the issue:

# create feather file
from datetime import datetime

import pandas as pd

df = pd.DataFrame({"date": [
    datetime.fromisoformat("1654-01-01"),
    datetime.fromisoformat("1920-01-01"),
]})
df.to_feather("to_trash.feather")

# read feather file
from pyarrow.feather import read_feather

read_feather("to_trash.feather")



I think that the expected behavior would be to get an object column containing 
datetime objects.
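
For background, the 1677 and 2262 limits in the summary match the range of a signed 64-bit count of nanoseconds since 1970-01-01, which is how pandas stores `datetime64[ns]` values (roughly 1677-09-21 to 2262-04-11). The following is a minimal standard-library sketch of that arithmetic, not code from pyarrow or pandas; `ns_to_datetime` and `fits_in_ns` are hypothetical helper names introduced only for illustration.

```python
from datetime import datetime, timedelta

EPOCH = datetime(1970, 1, 1)
INT64_MAX = 2**63 - 1  # largest nanosecond offset a signed 64-bit integer can hold


def ns_to_datetime(ns):
    """Convert a nanosecond offset from the epoch to a datetime.

    datetime only has microsecond resolution, so the sub-microsecond
    part is dropped by the floor division.
    """
    return EPOCH + timedelta(microseconds=ns // 1000)


# Approximate (microsecond-precision) bounds of a nanosecond timestamp.
upper = ns_to_datetime(INT64_MAX)
lower = ns_to_datetime(-INT64_MAX)


def fits_in_ns(value):
    """Return True if `value` lies inside the nanosecond-representable range."""
    return lower <= value <= upper


print("lower bound:", lower)
print("upper bound:", upper)
for text in ("1654-01-01", "1920-01-01"):
    print(text, "fits:", fits_in_ns(datetime.fromisoformat(text)))
```

With the dates from the reproduction above, 1654-01-01 falls outside the range and 1920-01-01 falls inside it, which is consistent with the conversion failing only when the out-of-range value is present. As a possible workaround (an assumption, not verified against this report), reading with `pyarrow.feather.read_table` and calling `to_pandas(timestamp_as_object=True)` on the result may return datetime objects instead of nanosecond timestamps.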

I think that the problem comes from the `_array_like_to_pandas` method:
[https://github.com/apache/arrow/blob/76f45a6892b13391fdede4c72934f75f6d56143c/python/pyarrow/array.pxi#L1584]

or from `_to_pandas()`:
https://github.com/apache/arrow/blob/76f45a6892b13391fdede4c72934f75f6d56143c/python/pyarrow/array.pxi#L2742

or from `to_pandas`:
https://github.com/apache/arrow/blob/76f45a6892b13391fdede4c72934f75f6d56143c/python/pyarrow/array.pxi#L673





[jira] [Created] (ARROW-17191) [C++] MinGW Flight tests failing

2022-07-23 Thread David Li (Jira)
David Li created ARROW-17191:


 Summary: [C++] MinGW Flight tests failing 
 Key: ARROW-17191
 URL: https://issues.apache.org/jira/browse/ARROW-17191
 Project: Apache Arrow
  Issue Type: Improvement
  Components: C++
Reporter: David Li
Assignee: David Li


Noticed across several PRs:
{noformat}
[ RUN      ] GrpcDataTest.TestDoExchangeError
D:/a/arrow/arrow/cpp/src/arrow/flight/test_definitions.cc:490: Failure
Value of: _st.IsNotImplemented()
  Actual: false
Expected: true
Expected 'writer->Close()' to fail with NotImplemented, but got IOError: Stream 
finished before first message sent. gRPC client debug context: UNKNOWN:Error 
received from peer ipv4:127.0.0.1:52323 
{created_time:"2022-07-23T01:21:23.785644223+00:00", grpc_status:2, 
grpc_message:"Stream finished before first message sent"}. Client context: OK. 
Detail: Failed
D:/a/arrow/arrow/cpp/src/arrow/flight/test_definitions.cc:490: Failure
Value of: _st.ToString()
Expected: has substring "Expected error"
  Actual: "IOError: Stream finished before first message sent. gRPC client 
debug context: UNKNOWN:Error received from peer ipv4:127.0.0.1:52323 
{created_time:\"2022-07-23T01:21:23.785644223+00:00\", grpc_status:2, 
grpc_message:\"Stream finished before first message sent\"}. Client context: 
OK. Detail: Failed"
[  FAILED  ] GrpcDataTest.TestDoExchangeError (5 ms)
[ RUN      ] GrpcDataTest.TestDoExchangeConcurrency
[       OK ] GrpcDataTest.TestDoExchangeConcurrency (5 ms)
[ RUN      ] GrpcDataTest.TestDoExchangeUndrained
[       OK ] GrpcDataTest.TestDoExchangeUndrained (4 ms)
[ RUN      ] GrpcDataTest.TestIssue5095
[       OK ] GrpcDataTest.TestIssue5095 (9 ms)
[----------] 17 tests from GrpcDataTest (891 ms total)
[----------] 7 tests from GrpcDoPutTest
[ RUN      ] GrpcDoPutTest.TestInts
D:/a/arrow/arrow/cpp/src/arrow/flight/test_definitions.cc:690: Failure
Failed
'writer->Close()' failed with Invalid: Expected app_metadata to be foo bar but 
got \0L��. gRPC client debug context: UNKNOWN:Error received from peer 
ipv4:127.0.0.1:52331 {grpc_message:"Expected app_metadata to be foo bar but got 
\x00L\xf4\x86\x02\xe0\xa1", grpc_status:3, 
created_time:"2022-07-23T01:21:23.810734286+00:00"}. Client context: OK
[  FAILED  ] GrpcDoPutTest.TestInts (4 ms)
[ RUN      ] GrpcDoPutTest.TestFloats
D:/a/arrow/arrow/cpp/src/arrow/flight/test_definitions.cc:690: Failure
Failed
'writer->Close()' failed with Invalid: Expected app_metadata to be foo bar but 
got \0<���. gRPC client debug context: UNKNOWN:Error received from peer 
ipv4:127.0.0.1:52333 {grpc_message:"Expected app_metadata to be foo bar but got 
\x00<\xee\xc6\x02\xe0\xa1", grpc_status:3, 
created_time:"2022-07-23T01:21:23.815439591+00:00"}. Client context: OK
[  FAILED  ] GrpcDoPutTest.TestFloats (4 ms)
[ RUN      ] GrpcDoPutTest.TestEmptyBatch
D:/a/arrow/arrow/cpp/src/arrow/flight/test_definitions.cc:690: Failure
Failed
'writer->Close()' failed with Invalid: Expected app_metadata to be foo bar but 
got \0���. gRPC client debug context: UNKNOWN:Error received from peer 
ipv4:127.0.0.1:52335 {grpc_message:"Expected app_metadata to be foo bar but got 
\x00\x9c\xef\xa6\x02\xe0\xa1", grpc_status:3, 
created_time:"2022-07-23T01:21:23.819872813+00:00"}. Client context: OK
[  FAILED  ] GrpcDoPutTest.TestEmptyBatch (4 ms)
[ RUN      ] GrpcDoPutTest.TestDicts
D:/a/arrow/arrow/cpp/src/arrow/flight/test_definitions.cc:690: Failure
Failed
'writer->Close()' failed with Invalid: Expected app_metadata to be foo bar but 
got \0\���. gRPC client debug context: UNKNOWN:Error received from peer 
ipv4:127.0.0.1:52337 {grpc_message:"Expected app_metadata to be foo bar but got 
\x00\\\xf0\xc6\x02\xe0\xa1", grpc_status:3, 
created_time:"2022-07-23T01:21:23.824172893+00:00"}. Client context: OK
[  FAILED  ] GrpcDoPutTest.TestDicts (4 ms)
[ RUN      ] GrpcDoPutTest.TestLargeBatch
D:/a/arrow/arrow/cpp/src/arrow/flight/test_definitions.cc:690: Failure
Failed
'writer->Close()' failed with Invalid: Expected app_metadata to be foo bar but 
got \0|��. gRPC client debug context: UNKNOWN:Error received from peer 
ipv4:127.0.0.1:52339 {created_time:"2022-07-23T01:21:24.001437714+00:00", 
grpc_status:3, grpc_message:"Expected app_metadata to be foo bar but got 
\x00|\xf2\xa6\x02\xe0\xa1"}. Client context: OK
[  FAILED  ] GrpcDoPutTest.TestLargeBatch (185 ms)
[ RUN      ] GrpcDoPutTest.TestSizeLimit
D:/a/arrow/arrow/cpp/src/arrow/flight/test_definitions.cc:802: Failure
Failed
'writer->Close()' failed with Invalid: Expected app_metadata to be foo bar but 
got \0\�,�. gRPC client debug context: UNKNOWN:Error received from peer 
ipv4:127.0.0.1:52341 {grpc_message:"Expected app_metadata to be foo bar but got 
\x00\\\xef,\x07\xe0\xa1", grpc_status:3, 
created_time:"2022-07-23T01:21:24.016917836+00:00"}. Client context: OK
[  FAILED  ] GrpcDoPutTest.TestSizeLimit (8 ms)
[ RUN      ]