[jira] [Created] (ARROW-8343) [GLib] Add GArrowRecordBatchIterator
Kenta Murata created ARROW-8343: --- Summary: [GLib] Add GArrowRecordBatchIterator Key: ARROW-8343 URL: https://issues.apache.org/jira/browse/ARROW-8343 Project: Apache Arrow Issue Type: New Feature Components: GLib Reporter: Kenta Murata Assignee: Kenta Murata -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-8073) [GLib] Add binding of arrow::fs::PathForest
Kenta Murata created ARROW-8073: --- Summary: [GLib] Add binding of arrow::fs::PathForest Key: ARROW-8073 URL: https://issues.apache.org/jira/browse/ARROW-8073 Project: Apache Arrow Issue Type: New Feature Components: GLib Reporter: Kenta Murata Assignee: Kenta Murata -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-7739) [GLib] Use placement new to initialize shared_ptr object in private structs
Kenta Murata created ARROW-7739: --- Summary: [GLib] Use placement new to initialize shared_ptr object in private structs Key: ARROW-7739 URL: https://issues.apache.org/jira/browse/ARROW-7739 Project: Apache Arrow Issue Type: Task Components: GLib Reporter: Kenta Murata -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-7730) [GLib] Add Duration type support
Kenta Murata created ARROW-7730: --- Summary: [GLib] Add Duration type support Key: ARROW-7730 URL: https://issues.apache.org/jira/browse/ARROW-7730 Project: Apache Arrow Issue Type: New Feature Components: GLib Reporter: Kenta Murata -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-7698) [Format][C++] Add tensor and sparse tensor supports in File metadata
Kenta Murata created ARROW-7698: --- Summary: [Format][C++] Add tensor and sparse tensor supports in File metadata Key: ARROW-7698 URL: https://issues.apache.org/jira/browse/ARROW-7698 Project: Apache Arrow Issue Type: New Feature Components: C++, Format Reporter: Kenta Murata -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-7515) [C++] Rename nonexistent and non_existent to not_found
Kenta Murata created ARROW-7515: --- Summary: [C++] Rename nonexistent and non_existent to not_found Key: ARROW-7515 URL: https://issues.apache.org/jira/browse/ARROW-7515 Project: Apache Arrow Issue Type: Task Components: C++ Reporter: Kenta Murata Assignee: Kenta Murata -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-7504) [GLib] Introduce value-returning garrow::check
Kenta Murata created ARROW-7504: --- Summary: [GLib] Introduce value-returning garrow::check Key: ARROW-7504 URL: https://issues.apache.org/jira/browse/ARROW-7504 Project: Apache Arrow Issue Type: Improvement Components: GLib Reporter: Kenta Murata Assignee: Kenta Murata Follow this discussion https://github.com/apache/arrow/pull/6066/files#r363367450 -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-7445) [GLib] Add HadoopFileSystem support
Kenta Murata created ARROW-7445: --- Summary: [GLib] Add HadoopFileSystem support Key: ARROW-7445 URL: https://issues.apache.org/jira/browse/ARROW-7445 Project: Apache Arrow Issue Type: Sub-task Components: GLib Reporter: Kenta Murata -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-7444) [GLib] Add LocalFileSystem support
Kenta Murata created ARROW-7444: --- Summary: [GLib] Add LocalFileSystem support Key: ARROW-7444 URL: https://issues.apache.org/jira/browse/ARROW-7444 Project: Apache Arrow Issue Type: Sub-task Components: GLib Reporter: Kenta Murata -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-7443) [GLib] Add binding of arrow::fs
Kenta Murata created ARROW-7443: --- Summary: [GLib] Add binding of arrow::fs Key: ARROW-7443 URL: https://issues.apache.org/jira/browse/ARROW-7443 Project: Apache Arrow Issue Type: New Feature Components: GLib Reporter: Kenta Murata -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-7421) [C++] Support creating SparseCSRMatrix and SparseCSCMatrix from 0d and 1d Tensors
Kenta Murata created ARROW-7421: --- Summary: [C++] Support creating SparseCSRMatrix and SparseCSCMatrix from 0d and 1d Tensors Key: ARROW-7421 URL: https://issues.apache.org/jira/browse/ARROW-7421 Project: Apache Arrow Issue Type: Improvement Components: C++ Reporter: Kenta Murata Assignee: Kenta Murata -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-7420) [C++] Migrate internal functions of SparseTensor to Result-returning version
Kenta Murata created ARROW-7420: --- Summary: [C++] Migrate internal functions of SparseTensor to Result-returning version Key: ARROW-7420 URL: https://issues.apache.org/jira/browse/ARROW-7420 Project: Apache Arrow Issue Type: Task Components: C++ Reporter: Kenta Murata Assignee: Kenta Murata -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-7419) [Python] Support SparseCSCMatrix
Kenta Murata created ARROW-7419: --- Summary: [Python] Support SparseCSCMatrix Key: ARROW-7419 URL: https://issues.apache.org/jira/browse/ARROW-7419 Project: Apache Arrow Issue Type: New Feature Components: Python Reporter: Kenta Murata -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-7371) [GLib] Add Datasets binding
Kenta Murata created ARROW-7371: --- Summary: [GLib] Add Datasets binding Key: ARROW-7371 URL: https://issues.apache.org/jira/browse/ARROW-7371 Project: Apache Arrow Issue Type: New Feature Components: GLib Reporter: Kenta Murata Assignee: Kenta Murata -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-7369) [GLib] Add garrow_table_combine_chunks
Kenta Murata created ARROW-7369: --- Summary: [GLib] Add garrow_table_combine_chunks Key: ARROW-7369 URL: https://issues.apache.org/jira/browse/ARROW-7369 Project: Apache Arrow Issue Type: New Feature Components: GLib Reporter: Kenta Murata Assignee: Kenta Murata -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-7306) [C++] Add Result-returning version of FileSystemFromUri
Kenta Murata created ARROW-7306: --- Summary: [C++] Add Result-returning version of FileSystemFromUri Key: ARROW-7306 URL: https://issues.apache.org/jira/browse/ARROW-7306 Project: Apache Arrow Issue Type: Task Components: C++ Reporter: Kenta Murata Assignee: Kenta Murata -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-7297) [C++] Add value accessor in sparse tensor class
Kenta Murata created ARROW-7297: --- Summary: [C++] Add value accessor in sparse tensor class Key: ARROW-7297 URL: https://issues.apache.org/jira/browse/ARROW-7297 Project: Apache Arrow Issue Type: Improvement Components: C++ Reporter: Kenta Murata {{SparseTensor}} can have value accessor like {{Tensor::Value}}. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-7291) [Dev]Fix FORMAT_DIR in update-flatbuffers.sh
Kenta Murata created ARROW-7291: --- Summary: [Dev]Fix FORMAT_DIR in update-flatbuffers.sh Key: ARROW-7291 URL: https://issues.apache.org/jira/browse/ARROW-7291 Project: Apache Arrow Issue Type: Bug Components: Developer Tools Reporter: Kenta Murata Assignee: Kenta Murata -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-7037) [C++ ] Compile error on the combination of protobuf >= 3.9 and clang
Kenta Murata created ARROW-7037: --- Summary: [C++ ] Compile error on the combination of protobuf >= 3.9 and clang Key: ARROW-7037 URL: https://issues.apache.org/jira/browse/ARROW-7037 Project: Apache Arrow Issue Type: Task Components: C++ Reporter: Kenta Murata Assignee: Kenta Murata I encountered the following compile error on the combination of protobuf 3.10.0 and clang (Xcode 11). {noformat} [13/26] Building CXX object c++/src/CMakeFiles/orc.dir/wrap/orc-proto-wrapper.cc.o FAILED: c++/src/CMakeFiles/orc.dir/wrap/orc-proto-wrapper.cc.o /Applications/Xcode_11.1.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/c++ -Ic++/include -I/Users/mrkn/src/github.com/apache/arrow/cpp/build.debug/orc_ep-prefix/src/orc_ep/c++/include -I/Users/mrkn/src/github.com/apache/arrow/cpp/build.debug/orc_ep-prefix/src/orc_ep/c++/src -Ic++/src -isystem c++/libs/thirdparty/zlib_ep-install/include -isystem c++/libs/thirdparty/lz4_ep-install/include -Qunused-arguments -fcolor-diagnostics -ggdb -O0 -g -fPIC -Wno-zero-as-null-pointer-constant -Wno-inconsistent-missing-destructor-override -Wno-error=undef -std=c++11 -Weverything -Wno-c++98-compat -Wno-missing-prototypes -Wno-c++98-compat-pedantic -Wno-padded -Wno-covered-switch-default -Wno-missing-noreturn -Wno-unknown-pragmas -Wno-gnu-zero-variadic-macro-arguments -Wconversion -Wno-c++2a-compat -Werror -std=c++11 -Weverything -Wno-c++98-compat -Wno-missing-prototypes -Wno-c++98-compat-pedantic -Wno-padded -Wno-covered-switch-default -Wno-missing-noreturn -Wno-unknown-pragmas -Wno-gnu-zero-variadic-macro-arguments -Wconversion -Wno-c++2a-compat -Werror -O0 -g -MD -MT c++/src/CMakeFiles/orc.dir/wrap/orc-proto-wrapper.cc.o -MF c++/src/CMakeFiles/orc.dir/wrap/orc-proto-wrapper.cc.o.d -o c++/src/CMakeFiles/orc.dir/wrap/orc-proto-wrapper.cc.o -c /Users/mrkn/src/github.com/apache/arrow/cpp/build.debug/orc_ep-prefix/src/orc_ep/c++/src/wrap/orc-proto-wrapper.cc In file included from /Users/mrkn/src/github.com/apache/arrow/cpp/build.debug/orc_ep-prefix/src/orc_ep/c++/src/wrap/orc-proto-wrapper.cc:44: c++/src/orc_proto.pb.cc:959:145: error: possible misuse of comma operator here [-Werror,-Wcomma] static bool dynamic_init_dummy_orc_5fproto_2eproto = ( ::PROTOBUF_NAMESPACE_ID::internal::AddDescriptors(_table_orc_5fproto_2eproto), true); ^ c++/src/orc_proto.pb.cc:959:57: note: cast expression to void to silence warning static bool dynamic_init_dummy_orc_5fproto_2eproto = ( ::PROTOBUF_NAMESPACE_ID::internal::AddDescriptors(_table_orc_5fproto_2eproto), true); ^~~~ static_cast( ) 1 error generated. {noformat} This may be due to a bug of protobuf filed as https://github.com/protocolbuffers/protobuf/issues/6619. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-7036) [C++] Version up ORC to avoid compile errors
Kenta Murata created ARROW-7036: --- Summary: [C++] Version up ORC to avoid compile errors Key: ARROW-7036 URL: https://issues.apache.org/jira/browse/ARROW-7036 Project: Apache Arrow Issue Type: Task Components: C++ Reporter: Kenta Murata Assignee: Kenta Murata I encountered the compile errors due to {{-Wshadow-field}} like below: {noformat} [1/4] Building CXX object c++/src/CMakeFiles/orc.dir/Vector.cc.o FAILED: c++/src/CMakeFiles/orc.dir/Vector.cc.o /Applications/Xcode_11.1.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/c++ -Ic++/include -I/Users/mrkn/src/github.com/apache/arrow/cpp/build.debug/orc_ep-prefix/src/orc_ep/c++/include -I/Users/mrkn/src/github.com/apa che/arrow/cpp/build.debug/orc_ep-prefix/src/orc_ep/c++/src -Ic++/src -isystem c++/libs/thirdparty/zlib_ep-install/include -isystem c++/libs/thirdparty/lz4_ep-install/include -Qunused-arguments -fcolor-diagnostics -ggdb -O0 -g -fPIC -Wno-z ero-as-null-pointer-constant -Wno-inconsistent-missing-destructor-override -Wno-error=undef -std=c++11 -Weverything -Wno-c++98-compat -Wno-missing-prototypes -Wno-c++98-compat-pedantic -Wno-padded -Wno-covered-switch-default -Wno-missing-n oreturn -Wno-unknown-pragmas -Wno-gnu-zero-variadic-macro-arguments -Wconversion -Werror -std=c++11 -Weverything -Wno-c++98-compat -Wno-missing-prototypes -Wno-c++98-compat-pedantic -Wno-padded -Wno-covered-switch-default -Wno-missing-nore turn -Wno-unknown-pragmas -Wno-gnu-zero-variadic-macro-arguments -Wconversion -Werror -O0 -g -MD -MT c++/src/CMakeFiles/orc.dir/Vector.cc.o -MF c++/src/CMakeFiles/orc.dir/Vector.cc.o.d -o c++/src/CMakeFiles/orc.dir/Vector.cc.o -c /Users/mr kn/src/github.com/apache/arrow/cpp/build.debug/orc_ep-prefix/src/orc_ep/c++/src/Vector.cc /Users/mrkn/src/github.com/apache/arrow/cpp/build.debug/orc_ep-prefix/src/orc_ep/c++/src/Vector.cc:59:45: error: parameter 'capacity' shadows member inherited from type 'ColumnVectorBatch' [-Werror,-Wshadow-field] LongVectorBatch::LongVectorBatch(uint64_t capacity, MemoryPool& pool ^ /Users/mrkn/src/github.com/apache/arrow/cpp/build.debug/orc_ep-prefix/src/orc_ep/c++/include/orc/Vector.hh:46:14: note: declared here uint64_t capacity; ^ /Users/mrkn/src/github.com/apache/arrow/cpp/build.debug/orc_ep-prefix/src/orc_ep/c++/src/Vector.cc:87:49: error: parameter 'capacity' shadows member inherited from type 'ColumnVectorBatch' [-Werror,-Wshadow-field] DoubleVectorBatch::DoubleVectorBatch(uint64_t capacity, MemoryPool& pool ^ /Users/mrkn/src/github.com/apache/arrow/cpp/build.debug/orc_ep-prefix/src/orc_ep/c++/include/orc/Vector.hh:46:14: note: declared here uint64_t capacity; ^ /Users/mrkn/src/github.com/apache/arrow/cpp/build.debug/orc_ep-prefix/src/orc_ep/c++/src/Vector.cc:115:49: error: parameter 'capacity' shadows member inherited from type 'ColumnVectorBatch' [-Werror,-Wshadow-field] StringVectorBatch::StringVectorBatch(uint64_t capacity, MemoryPool& pool ^ /Users/mrkn/src/github.com/apache/arrow/cpp/build.debug/orc_ep-prefix/src/orc_ep/c++/include/orc/Vector.hh:46:14: note: declared here uint64_t capacity; ^ /Users/mrkn/src/github.com/apache/arrow/cpp/build.debug/orc_ep-prefix/src/orc_ep/c++/src/Vector.cc:407:55: error: parameter 'capacity' shadows member inherited from type 'ColumnVectorBatch' [-Werror,-Wshadow-field] TimestampVectorBatch::TimestampVectorBatch(uint64_t capacity, ^ /Users/mrkn/src/github.com/apache/arrow/cpp/build.debug/orc_ep-prefix/src/orc_ep/c++/include/orc/Vector.hh:46:14: note: declared here uint64_t capacity; ^ 4 errors generated. {noformat} Upgrading ORC to 1.5.7 will fix this errors. I used Xcode 11.1 on macOS Mojave. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-6814) [C++] Resolve compiler warnings occurred on release build
Kenta Murata created ARROW-6814: --- Summary: [C++] Resolve compiler warnings occurred on release build Key: ARROW-6814 URL: https://issues.apache.org/jira/browse/ARROW-6814 Project: Apache Arrow Issue Type: Task Components: C++, C++ - Gandiva Reporter: Kenta Murata Assignee: Kenta Murata I encountered some compiler warnings on release build when I used gcc version 7.4.0 (Ubuntu 7.4.0-1ubuntu1~18.04.1). [https://gist.github.com/mrkn/f7739edb301988a24e9d6066410b0625] -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-6508) [C++] Add Tensor and SparseTensor factory function with validations
Kenta Murata created ARROW-6508: --- Summary: [C++] Add Tensor and SparseTensor factory function with validations Key: ARROW-6508 URL: https://issues.apache.org/jira/browse/ARROW-6508 Project: Apache Arrow Issue Type: Improvement Components: C++ Reporter: Kenta Murata Now Tensor and SparseTensor only have their constructors, but not factory functions that validate the parameters. We need such factory functions for creating Tensor and SparseTensor from parameters given from the external source. -- This message was sent by Atlassian Jira (v8.3.2#803003)
[jira] [Created] (ARROW-6505) [Website] Add new committers
Kenta Murata created ARROW-6505: --- Summary: [Website] Add new committers Key: ARROW-6505 URL: https://issues.apache.org/jira/browse/ARROW-6505 Project: Apache Arrow Issue Type: Improvement Components: Website Reporter: Kenta Murata Assignee: Kenta Murata I'd like to add new committers on the committer list. -- This message was sent by Atlassian Jira (v8.3.2#803003)
[jira] [Created] (ARROW-6503) [C++] Add an argument of memory pool object to SparseTensorConverter
Kenta Murata created ARROW-6503: --- Summary: [C++] Add an argument of memory pool object to SparseTensorConverter Key: ARROW-6503 URL: https://issues.apache.org/jira/browse/ARROW-6503 Project: Apache Arrow Issue Type: Improvement Components: C++ Reporter: Kenta Murata Assignee: Kenta Murata According to the comment https://github.com/apache/arrow/pull/5290#discussion_r322244745, we need to have variants of some functions for supplying a memory pool object to SparseTensorConverter function. -- This message was sent by Atlassian Jira (v8.3.2#803003)
[jira] [Created] (ARROW-6501) [Format][C++] Remove non_zero_length field from SparseIndex
Kenta Murata created ARROW-6501: --- Summary: [Format][C++] Remove non_zero_length field from SparseIndex Key: ARROW-6501 URL: https://issues.apache.org/jira/browse/ARROW-6501 Project: Apache Arrow Issue Type: Improvement Components: C++, Format Reporter: Kenta Murata Assignee: Kenta Murata We can remove non_zero_length field from SparseIndex because it can be supplied from the shape of the indices tensor. -- This message was sent by Atlassian Jira (v8.3.2#803003)
[jira] [Created] (ARROW-6489) [Developer][Documentation]Fix merge script and readme
Kenta Murata created ARROW-6489: --- Summary: [Developer][Documentation]Fix merge script and readme Key: ARROW-6489 URL: https://issues.apache.org/jira/browse/ARROW-6489 Project: Apache Arrow Issue Type: Improvement Components: Developer Tools Reporter: Kenta Murata Assignee: Kenta Murata The following things should be fixed. - merge_arrow_pr.py shouldn't be affected by git's merge.ff value. - README should describe the information of APACHE_JIRA_USERNAME and APACHE_JIRA_PASSWORD - README should describe the users needs to install requests and jira libraries before running merge_arrow_pr.py -- This message was sent by Atlassian Jira (v8.3.2#803003)
[jira] [Created] (ARROW-6393) [C++]Add EqualOptions support in SparseTensor::Equals
Kenta Murata created ARROW-6393: --- Summary: [C++]Add EqualOptions support in SparseTensor::Equals Key: ARROW-6393 URL: https://issues.apache.org/jira/browse/ARROW-6393 Project: Apache Arrow Issue Type: Improvement Components: C++ Reporter: Kenta Murata Assignee: Kenta Murata SparseTensor::Equals should take EqualOptions argument as Tensor::Equals does. -- This message was sent by Atlassian Jira (v8.3.2#803003)
[jira] [Created] (ARROW-6319) [C++] Extract the core of NumericTensor::Value as Tensor::Value
Kenta Murata created ARROW-6319: --- Summary: [C++] Extract the core of NumericTensor::Value as Tensor::Value Key: ARROW-6319 URL: https://issues.apache.org/jira/browse/ARROW-6319 Project: Apache Arrow Issue Type: Improvement Components: C++ Reporter: Kenta Murata Assignee: Kenta Murata I'd like to enable element-wise access in Tensor class. -- This message was sent by Atlassian Jira (v8.3.2#803003)
[jira] [Created] (ARROW-5830) [C++] Stop using memcmp in TensorEquals
Kenta Murata created ARROW-5830: --- Summary: [C++] Stop using memcmp in TensorEquals Key: ARROW-5830 URL: https://issues.apache.org/jira/browse/ARROW-5830 Project: Apache Arrow Issue Type: Improvement Components: C++ Reporter: Kenta Murata Because memcmp problematic for comparing floating-point values, such as NaNs. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5813) [C++] Support checking the equality of the different contiguous tensors
Kenta Murata created ARROW-5813: --- Summary: [C++] Support checking the equality of the different contiguous tensors Key: ARROW-5813 URL: https://issues.apache.org/jira/browse/ARROW-5813 Project: Apache Arrow Issue Type: Improvement Components: C++ Reporter: Kenta Murata Assignee: Kenta Murata The current TensorEquals function couldn't check the equality of the different contiguous tensors. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5754) [C++]Missing override for ~GrpcStreamWriter?
Kenta Murata created ARROW-5754: --- Summary: [C++]Missing override for ~GrpcStreamWriter? Key: ARROW-5754 URL: https://issues.apache.org/jira/browse/ARROW-5754 Project: Apache Arrow Issue Type: Improvement Components: C++ Reporter: Kenta Murata I encountered the following compile error: {{../src/arrow/flight/client.cc:244:3: error: '~GrpcStreamWriter' overrides a destructor but is not marked 'override' [-Werror,-Winconsistent-missing-destructor-override] ~GrpcStreamWriter() = default; ^ ../src/arrow/flight/client.h:86:27: note: overridden virtual function is here class ARROW_FLIGHT_EXPORT FlightStreamWriter : public ipc::RecordBatchWriter { ^}} Putting override modifier can resolve this problem. I'll make a pull-request for the change. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5736) [Format] Support small bit-width indices of sparse tensor
Kenta Murata created ARROW-5736: --- Summary: [Format] Support small bit-width indices of sparse tensor Key: ARROW-5736 URL: https://issues.apache.org/jira/browse/ARROW-5736 Project: Apache Arrow Issue Type: Improvement Components: Format Reporter: Kenta Murata Assignee: Kenta Murata Adding 32bit sparse index support is necessary to support non-copy data sharing with the existing systems such as SciPy. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5704) [C++] Stop using ARROW_TEMPLATE_EXPORT for SparseTensorImpl class
Kenta Murata created ARROW-5704: --- Summary: [C++] Stop using ARROW_TEMPLATE_EXPORT for SparseTensorImpl class Key: ARROW-5704 URL: https://issues.apache.org/jira/browse/ARROW-5704 Project: Apache Arrow Issue Type: Task Components: C++ Reporter: Kenta Murata Assignee: Kenta Murata I'd like to stop using ARROW_TEMPLATE_EXPORT for SparseTensorImpl class so that it can be wrapped in Arrow GLib library on the mingw platform. This relates to ARROW-4399. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5486) [GLib] Add binding of gandiva::FunctionRegistry and related things
Kenta Murata created ARROW-5486: --- Summary: [GLib] Add binding of gandiva::FunctionRegistry and related things Key: ARROW-5486 URL: https://issues.apache.org/jira/browse/ARROW-5486 Project: Apache Arrow Issue Type: New Feature Components: GLib Reporter: Kenta Murata Assignee: Kenta Murata I'd like to add a support of gandiva::FunctionRegistry and the related things in gandiva-glib. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5320) [C++] Undefined symbol errors are occurred when linking parquet executables
Kenta Murata created ARROW-5320: --- Summary: [C++] Undefined symbol errors are occurred when linking parquet executables Key: ARROW-5320 URL: https://issues.apache.org/jira/browse/ARROW-5320 Project: Apache Arrow Issue Type: Bug Components: C++ Environment: Xcode 10.2 on macOS Mojave 10.14.4 Reporter: Kenta Murata Undefined symbol errors occurred when linking debug/parquet-reader, debug/parquet-file-deserialize-test, and debug/parquet-scan. The unresolvable symbol is of boost regex referred in libparquet.a. I tried to build the commit 608e846a9f825a30a0faa651bc0a3eebba20e7db with Xcode 10.2 on macOS Mojave. I specified -DARROW_BOOST_VENDORED=ON to avoid the problem related to the latest boost in Homebrew (See [https://github.com/boostorg/process/issues/55]). The complete build log is available here: [https://gist.github.com/mrkn/e5489140c9a782ca13a1b4bb8dd33111] -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5155) [GLib][Ruby] MakeDense and MakeSparse in UnionArray should accept a vector of Field
Kenta Murata created ARROW-5155: --- Summary: [GLib][Ruby] MakeDense and MakeSparse in UnionArray should accept a vector of Field Key: ARROW-5155 URL: https://issues.apache.org/jira/browse/ARROW-5155 Project: Apache Arrow Issue Type: New Feature Components: GLib, Ruby Reporter: Kenta Murata Assignee: Kenta Murata This is a derivative issue of https://issues.apache.org/jira/browse/ARROW-4622 -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5150) [Ruby] Add Arrow::Table#raw_records
Kenta Murata created ARROW-5150: --- Summary: [Ruby] Add Arrow::Table#raw_records Key: ARROW-5150 URL: https://issues.apache.org/jira/browse/ARROW-5150 Project: Apache Arrow Issue Type: New Feature Components: Ruby Reporter: Kenta Murata Assignee: Kenta Murata -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5050) [C++] cares_ep should build before grpc_ep
Kenta Murata created ARROW-5050: --- Summary: [C++] cares_ep should build before grpc_ep Key: ARROW-5050 URL: https://issues.apache.org/jira/browse/ARROW-5050 Project: Apache Arrow Issue Type: Bug Components: C++ Reporter: Kenta Murata Assignee: Kenta Murata I found that grpc_ep can fail to find cares_ep because grpc_ep may be built before cares_ep. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5032) [C++] Headers in vendored/datetime directory aren't installed
Kenta Murata created ARROW-5032: --- Summary: [C++] Headers in vendored/datetime directory aren't installed Key: ARROW-5032 URL: https://issues.apache.org/jira/browse/ARROW-5032 Project: Apache Arrow Issue Type: Improvement Components: C++ Reporter: Kenta Murata I found that header files in vendored/datetime directory are not installed even though vendored/datetime.h is installed. vendored/datetime.h is depends on the files in vendored/datetime directory, so they should be installed. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4942) [Ruby] Remove needless omits
Kenta Murata created ARROW-4942: --- Summary: [Ruby] Remove needless omits Key: ARROW-4942 URL: https://issues.apache.org/jira/browse/ARROW-4942 Project: Apache Arrow Issue Type: Test Components: Ruby Reporter: Kenta Murata Assignee: Kenta Murata -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4938) [Glib] Undefined symbols error occurred when GIR file is being generated.
Kenta Murata created ARROW-4938: --- Summary: [Glib] Undefined symbols error occurred when GIR file is being generated. Key: ARROW-4938 URL: https://issues.apache.org/jira/browse/ARROW-4938 Project: Apache Arrow Issue Type: Bug Components: GLib Reporter: Kenta Murata When there are the old arrow-glib.*dylib files in the installation directory, and these libraries doesn't have enough symbols, the "undefined symbols" error is occurred during GIR file is generated. When I encountered this error, removing the old libraries resolves the problem. I extracted the build log related to this problem in this gist: https://gist.github.com/mrkn/6c14d5cae2bebca4609ed9c3ef8e5bbf -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4932) [GLib] Use G_DECLARE_DERIVABLE_TYPE macro
Kenta Murata created ARROW-4932: --- Summary: [GLib] Use G_DECLARE_DERIVABLE_TYPE macro Key: ARROW-4932 URL: https://issues.apache.org/jira/browse/ARROW-4932 Project: Apache Arrow Issue Type: Task Components: GLib Reporter: Kenta Murata Assignee: Kenta Murata -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4906) [Format] Fix document to describe that SparseMatrixIndexCSR assumes indptr is sorted for each row
Kenta Murata created ARROW-4906: --- Summary: [Format] Fix document to describe that SparseMatrixIndexCSR assumes indptr is sorted for each row Key: ARROW-4906 URL: https://issues.apache.org/jira/browse/ARROW-4906 Project: Apache Arrow Issue Type: Bug Reporter: Kenta Murata Assignee: Kenta Murata -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4775) [Website] Site navbar cannot be expanded
Kenta Murata created ARROW-4775: --- Summary: [Website] Site navbar cannot be expanded Key: ARROW-4775 URL: https://issues.apache.org/jira/browse/ARROW-4775 Project: Apache Arrow Issue Type: Bug Components: Website Reporter: Kenta Murata Assignee: Kenta Murata I found that the navbar at the top of the page cannot be expanded when the page is narrow. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4671) [C++] MakBuilder must care Type::DICTIONARY
Kenta Murata created ARROW-4671: --- Summary: [C++] MakBuilder must care Type::DICTIONARY Key: ARROW-4671 URL: https://issues.apache.org/jira/browse/ARROW-4671 Project: Apache Arrow Issue Type: New Feature Components: C++ Reporter: Kenta Murata Now, we cannot create a builder for DictionaryArray by using MakeBuilder. When we pass DictionaryType to MakeBuilder, it says like below: {quote}MakeBuilder: cannot construct builder for type dictionary {quote} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4662) [Python] Add type_codes property in UnionType
Kenta Murata created ARROW-4662: --- Summary: [Python] Add type_codes property in UnionType Key: ARROW-4662 URL: https://issues.apache.org/jira/browse/ARROW-4662 Project: Apache Arrow Issue Type: New Feature Components: Python Reporter: Kenta Murata Assignee: Kenta Murata -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4632) [Ruby] Add BigDecimal#to_arrow
Kenta Murata created ARROW-4632: --- Summary: [Ruby] Add BigDecimal#to_arrow Key: ARROW-4632 URL: https://issues.apache.org/jira/browse/ARROW-4632 Project: Apache Arrow Issue Type: New Feature Components: Ruby Reporter: Kenta Murata Assignee: Kenta Murata It may be better that BigDecimal has to_arrow instance method to convert itself to Arrow::Decimal128. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4622) MakeDense and MakeSparse in UnionArray should accept a vector of Field
Kenta Murata created ARROW-4622: --- Summary: MakeDense and MakeSparse in UnionArray should accept a vector of Field Key: ARROW-4622 URL: https://issues.apache.org/jira/browse/ARROW-4622 Project: Apache Arrow Issue Type: Bug Components: C++, GLib, Python Reporter: Kenta Murata Assignee: Kenta Murata Currently MakeDense and MakeUnion of UnionArray couldn't create a UnionArray with user-specified field names. This is bugs of these functions. To fix them, optional arguments of std::vector should be added. GLib and Python bindings should be fixed, together. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4600) [Ruby] Arrow::DictionaryArray#[] should returns the item in the indices array
Kenta Murata created ARROW-4600: --- Summary: [Ruby] Arrow::DictionaryArray#[] should returns the item in the indices array Key: ARROW-4600 URL: https://issues.apache.org/jira/browse/ARROW-4600 Project: Apache Arrow Issue Type: Bug Components: Ruby Reporter: Kenta Murata Arrow::DictionaryArray#[] should returns the item in the indices array. However, the current behavior is error like below: {{Traceback (most recent call last):}} {{ 5: from test.rb:4:in `'}} {{ 4: from test.rb:4:in `new'}} {{ 3: from /Users/mrkn/src/github.com/apache/arrow/ruby/red-arrow/lib/arrow/dictionary-data-type.rb:103:in `initialize'}} {{ 2: from /Users/mrkn/.rbenv/versions/2.6.0/lib/ruby/gems/2.6.0/gems/gobject-introspection-3.3.1/lib/gobject-introspection/loader.rb:328:in `block in load_constructor_infos'}} {{ 1: from /Users/mrkn/.rbenv/versions/2.6.0/lib/ruby/gems/2.6.0/gems/gobject-introspection-3.3.1/lib/gobject-introspection/loader.rb:317:in `block (2 levels) in load_constructor_infos'}} {{/Users/mrkn/.rbenv/versions/2.6.0/lib/ruby/gems/2.6.0/gems/gobject-introspection-3.3.1/lib/gobject-introspection/loader.rb:317:in `invoke': *invalid argument Array (expect #) (+ArgumentError+)*}} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4537) [CI] Suppress shell warning on travis-ci
Kenta Murata created ARROW-4537: --- Summary: [CI] Suppress shell warning on travis-ci Key: ARROW-4537 URL: https://issues.apache.org/jira/browse/ARROW-4537 Project: Apache Arrow Issue Type: Task Components: Continuous Integration Reporter: Kenta Murata Suppress shell warnings like: {{+'[' == 1 ']'}} {{/home/travis/build/apache/arrow/ci/travis_before_script_cpp.sh: line 81: [: ==: unary operator expected}} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4536) Add data_type argument in garrow_list_array_new
Kenta Murata created ARROW-4536: --- Summary: Add data_type argument in garrow_list_array_new Key: ARROW-4536 URL: https://issues.apache.org/jira/browse/ARROW-4536 Project: Apache Arrow Issue Type: Bug Components: GLib Reporter: Kenta Murata This issue is corresponding to GitHub's https://github.com/apache/arrow/pull/3621 -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4535) [C++] Fix MakeBuilder to preserve ListType's field name
Kenta Murata created ARROW-4535: --- Summary: [C++] Fix MakeBuilder to preserve ListType's field name Key: ARROW-4535 URL: https://issues.apache.org/jira/browse/ARROW-4535 Project: Apache Arrow Issue Type: Bug Components: C++ Reporter: Kenta Murata MakeBuilder doesn't preserve the field name in the given ListType. I think this is a bug. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4506) [Ruby] Add Arrow::RecordBatch#raw_records
Kenta Murata created ARROW-4506: --- Summary: [Ruby] Add Arrow::RecordBatch#raw_records Key: ARROW-4506 URL: https://issues.apache.org/jira/browse/ARROW-4506 Project: Apache Arrow Issue Type: New Feature Components: Ruby Reporter: Kenta Murata Assignee: Kenta Murata I want to add Arrow::RecordBatch#raw_records method to convert a record batch object to a nested array. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4397) [C++] dim_names in Tensor and SparseTensor
Kenta Murata created ARROW-4397: --- Summary: [C++] dim_names in Tensor and SparseTensor Key: ARROW-4397 URL: https://issues.apache.org/jira/browse/ARROW-4397 Project: Apache Arrow Issue Type: New Feature Components: C++ Reporter: Kenta Murata Along with ARROW-4388, it would be useful to introduce dim_names in Tensor and SparseTensor of C++ library. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4320) [C++] Add tests for non-contiguous tensors
Kenta Murata created ARROW-4320: --- Summary: [C++] Add tests for non-contiguous tensors Key: ARROW-4320 URL: https://issues.apache.org/jira/browse/ARROW-4320 Project: Apache Arrow Issue Type: Test Reporter: Kenta Murata Assignee: Kenta Murata I would like to add some test cases for tensors with non-contiguous strides. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4318) [C++] Add Tensor::CountNonZero
Kenta Murata created ARROW-4318: --- Summary: [C++] Add Tensor::CountNonZero Key: ARROW-4318 URL: https://issues.apache.org/jira/browse/ARROW-4318 Project: Apache Arrow Issue Type: New Feature Reporter: Kenta Murata Assignee: Kenta Murata I would like to move CountNonZero defined in SparseTensorConverter into Tensor class, and add tests for this function. The pull-request is [https://github.com/apache/arrow/pull/3452]. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4226) [C++] Add CSF sparse tensor support
Kenta Murata created ARROW-4226: --- Summary: [C++] Add CSF sparse tensor support Key: ARROW-4226 URL: https://issues.apache.org/jira/browse/ARROW-4226 Project: Apache Arrow Issue Type: New Feature Reporter: Kenta Murata [https://github.com/apache/arrow/pull/2546#pullrequestreview-156064172] {quote}Perhaps in the future, if zero-copy and future-proof-ness is really what we want, we might want to add the CSF (compressed sparse fiber) format, a generalisation of CSR/CSC. I'm currently working on adding it to PyData/Sparse, and I plan to make it the preferred format (COO will still be around though). {quote} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4225) [C++] Add CSC sparse matrix support
Kenta Murata created ARROW-4225: --- Summary: [C++] Add CSC sparse matrix support Key: ARROW-4225 URL: https://issues.apache.org/jira/browse/ARROW-4225 Project: Apache Arrow Issue Type: New Feature Reporter: Kenta Murata CSC sparse matrix is necessary for integration with existing sparse matrix libraries (umfpack, superlu). https://github.com/apache/arrow/pull/2546#issuecomment-422135645 -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4224) [Python] Support integration with pydata/sparse library
Kenta Murata created ARROW-4224: --- Summary: [Python] Support integration with pydata/sparse library Key: ARROW-4224 URL: https://issues.apache.org/jira/browse/ARROW-4224 Project: Apache Arrow Issue Type: New Feature Reporter: Kenta Murata It would be great to support integration with pydata/sparse library. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4223) [Python] Support scipy.sparse integration
Kenta Murata created ARROW-4223: --- Summary: [Python] Support scipy.sparse integration Key: ARROW-4223 URL: https://issues.apache.org/jira/browse/ARROW-4223 Project: Apache Arrow Issue Type: Improvement Reporter: Kenta Murata It would be great to support integration with scipy.sparse. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4221) [Format] Add canonical flag in COO sparse index
Kenta Murata created ARROW-4221: --- Summary: [Format] Add canonical flag in COO sparse index Key: ARROW-4221 URL: https://issues.apache.org/jira/browse/ARROW-4221 Project: Apache Arrow Issue Type: Improvement Reporter: Kenta Murata To support the integration with scipy.sparse.coo_matrix, it is necessary to add a flag in SparseCOOIndex. This flag denotes whether elements in COO sparse tensor is sorted lexicographically or not. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4222) [C++] Support equality comparison between COO and CSR sparse tensors in SparseTensorEquals
Kenta Murata created ARROW-4222: --- Summary: [C++] Support equality comparison between COO and CSR sparse tensors in SparseTensorEquals Key: ARROW-4222 URL: https://issues.apache.org/jira/browse/ARROW-4222 Project: Apache Arrow Issue Type: Improvement Reporter: Kenta Murata Currently SparseTensorEquals always returns false when it gets COO and CSR sparse tensors. It should support comparing the items in the sparse tensors. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3518) Detect HOMEBREW_PREFIX automatically
Kenta Murata created ARROW-3518: --- Summary: Detect HOMEBREW_PREFIX automatically Key: ARROW-3518 URL: https://issues.apache.org/jira/browse/ARROW-3518 Project: Apache Arrow Issue Type: Improvement Reporter: Kenta Murata It can be detected by executing {{brew --prefix}} if available. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3515) Introduce NumericTensor class
Kenta Murata created ARROW-3515: --- Summary: Introduce NumericTensor class Key: ARROW-3515 URL: https://issues.apache.org/jira/browse/ARROW-3515 Project: Apache Arrow Issue Type: New Feature Reporter: Kenta Murata [https://github.com/apache/arrow/pull/2759] This commit defines the new NumericTensor class as a subclass of Tensor class. NumericTensor extends Tensor class by adding a member function to access element values in a tensor. I want to use this new feature for writing tests of SparseTensor in [#2546|https://github.com/apache/arrow/pull/2546]. -- This message was sent by Atlassian JIRA (v7.6.3#76005)