[jira] [Created] (ARROW-10882) [Python][Dataset] Writing dataset from python iterator of record batches

2020-12-11 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-10882: - Summary: [Python][Dataset] Writing dataset from python iterator of record batches Key: ARROW-10882 URL: https://issues.apache.org/jira/browse/ARROW-10882

[jira] [Created] (ARROW-10883) [C++][Dataset] Preserve order when writing dataset

2020-12-11 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-10883: - Summary: [C++][Dataset] Preserve order when writing dataset Key: ARROW-10883 URL: https://issues.apache.org/jira/browse/ARROW-10883 Project: Apache A

[jira] [Created] (ARROW-10951) [Python][CI] Nightly pandas builds failing because of pytest monkeypatch issue

2020-12-17 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-10951: - Summary: [Python][CI] Nightly pandas builds failing because of pytest monkeypatch issue Key: ARROW-10951 URL: https://issues.apache.org/jira/browse/ARROW-10951

[jira] [Created] (ARROW-10998) [C++] Filesystems: detect if URI is passed where a file path is required and raise informative error

2020-12-21 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-10998: - Summary: [C++] Filesystems: detect if URI is passed where a file path is required and raise informative error Key: ARROW-10998 URL: https://issues.apache.org/jir

[jira] [Created] (ARROW-11000) [Python] Enable random access reading for Python file objects (if supported)

2020-12-21 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11000: - Summary: [Python] Enable random access reading for Python file objects (if supported) Key: ARROW-11000 URL: https://issues.apache.org/jira/browse/ARROW-11000

[jira] [Created] (ARROW-11001) [C++][Dataset] Enable column renaming (in physical schema -> dataset schema) in Dataset scanning

2020-12-21 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11001: - Summary: [C++][Dataset] Enable column renaming (in physical schema -> dataset schema) in Dataset scanning Key: ARROW-11001 URL: https://issues.apache.org/jira/br

[jira] [Created] (ARROW-11003) [C++][Dataset] Schema evolution in Dataset scanning

2020-12-21 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11003: - Summary: [C++][Dataset] Schema evolution in Dataset scanning Key: ARROW-11003 URL: https://issues.apache.org/jira/browse/ARROW-11003 Project: Apache

[jira] [Created] (ARROW-11142) [C++][Parquet] Inconsistent batch_size usage in parquet GetRecordBatchReader

2021-01-06 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11142: - Summary: [C++][Parquet] Inconsistent batch_size usage in parquet GetRecordBatchReader Key: ARROW-11142 URL: https://issues.apache.org/jira/browse/ARROW-11142

[jira] [Created] (ARROW-11163) [C++][Python] Compressed Feather file written with pyarrow 0.17 not readable in pyarrow 2.0.0+

2021-01-07 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11163: - Summary: [C++][Python] Compressed Feather file written with pyarrow 0.17 not readable in pyarrow 2.0.0+ Key: ARROW-11163 URL: https://issues.apache.org/jira/brow

[jira] [Created] (ARROW-11166) [Python][Compute] Add bindings for ProjectOptions

2021-01-07 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11166: - Summary: [Python][Compute] Add bindings for ProjectOptions Key: ARROW-11166 URL: https://issues.apache.org/jira/browse/ARROW-11166 Project: Apache Ar

[jira] [Created] (ARROW-11167) [Python][Compute] Improve usability for defining sort options

2021-01-07 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11167: - Summary: [Python][Compute] Improve usability for defining sort options Key: ARROW-11167 URL: https://issues.apache.org/jira/browse/ARROW-11167 Proje

[jira] [Created] (ARROW-11226) [Python][CI] Windows tests failing with s3fs 0.5.2

2021-01-12 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11226: - Summary: [Python][CI] Windows tests failing with s3fs 0.5.2 Key: ARROW-11226 URL: https://issues.apache.org/jira/browse/ARROW-11226 Project: Apache A

[jira] [Created] (ARROW-11227) [Python][CI] AMD64 Conda Python 3.7 Pandas 0.24 cron job failing in to_pandas extension dtype test

2021-01-12 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11227: - Summary: [Python][CI] AMD64 Conda Python 3.7 Pandas 0.24 cron job failing in to_pandas extension dtype test Key: ARROW-11227 URL: https://issues.apache.org/jir

[jira] [Created] (ARROW-11259) [Python] Allow to create field reference to nested field

2021-01-15 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11259: - Summary: [Python] Allow to create field reference to nested field Key: ARROW-11259 URL: https://issues.apache.org/jira/browse/ARROW-11259 Project: Ap

[jira] [Created] (ARROW-11260) [C++][Dataset] Don't require dictionaries for reading dataset with schema-based Partitioning

2021-01-15 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11260: - Summary: [C++][Dataset] Don't require dictionaries for reading dataset with schema-based Partitioning Key: ARROW-11260 URL: https://issues.apache.org/jira/browse

[jira] [Created] (ARROW-11334) [Python][CI] Nightly pandas builds failing because of internal pandas change

2021-01-21 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11334: - Summary: [Python][CI] Nightly pandas builds failing because of internal pandas change Key: ARROW-11334 URL: https://issues.apache.org/jira/browse/ARROW-11334

[jira] [Created] (ARROW-11370) [C++] Ability to "re-chunk" Tables or ChunkedArrays

2021-01-25 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11370: - Summary: [C++] Ability to "re-chunk" Tables or ChunkedArrays Key: ARROW-11370 URL: https://issues.apache.org/jira/browse/ARROW-11370 Project: Apache

[jira] [Created] (ARROW-11373) [Python][Docs] Add example of specifying type for a column when reading csv file

2021-01-25 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11373: - Summary: [Python][Docs] Add example of specifying type for a column when reading csv file Key: ARROW-11373 URL: https://issues.apache.org/jira/browse/ARROW-11373

[jira] [Created] (ARROW-11374) [Python] Make legacy pyarrow.filesystem / pyarrow.serialize warnings more visisble

2021-01-25 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11374: - Summary: [Python] Make legacy pyarrow.filesystem / pyarrow.serialize warnings more visisble Key: ARROW-11374 URL: https://issues.apache.org/jira/browse/ARROW-113

[jira] [Created] (ARROW-11378) [C++][Dataset] Writing partitions with timestamp type give mis-formatted (integers) directory names

2021-01-25 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11378: - Summary: [C++][Dataset] Writing partitions with timestamp type give mis-formatted (integers) directory names Key: ARROW-11378 URL: https://issues.apache.org/jira

[jira] [Created] (ARROW-11379) [C++][Dataset] Reading dataset with filtering on timestamp partition field crashes

2021-01-25 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11379: - Summary: [C++][Dataset] Reading dataset with filtering on timestamp partition field crashes Key: ARROW-11379 URL: https://issues.apache.org/jira/browse/ARROW-113

[jira] [Created] (ARROW-11399) [C++][Parquet] Timestamp ColumnDescriptor (from logical type) incorrectly showing ConvertedType as NONE

2021-01-27 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11399: - Summary: [C++][Parquet] Timestamp ColumnDescriptor (from logical type) incorrectly showing ConvertedType as NONE Key: ARROW-11399 URL: https://issues.apache.org/

[jira] [Created] (ARROW-11400) [Python] Pickled ParquetFileFragment has invalid partition_expresion with dictionary type in pyarrow 2.0

2021-01-27 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11400: - Summary: [Python] Pickled ParquetFileFragment has invalid partition_expresion with dictionary type in pyarrow 2.0 Key: ARROW-11400 URL: https://issues.apache.org

[jira] [Created] (ARROW-11472) [Python][CI] Kartothek integrations build is failing with numpy 1.20

2021-02-02 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11472: - Summary: [Python][CI] Kartothek integrations build is failing with numpy 1.20 Key: ARROW-11472 URL: https://issues.apache.org/jira/browse/ARROW-11472

[jira] [Created] (ARROW-11553) [Python] Make Table.cast(schema) more flexible regarding order of fields / missing fields?

2021-02-08 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11553: - Summary: [Python] Make Table.cast(schema) more flexible regarding order of fields / missing fields? Key: ARROW-11553 URL: https://issues.apache.org/jira/browse/A

[jira] [Created] (ARROW-11608) [CI] turbodbc integration tests are failing (build isue)

2021-02-12 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11608: - Summary: [CI] turbodbc integration tests are failing (build isue) Key: ARROW-11608 URL: https://issues.apache.org/jira/browse/ARROW-11608 Project: Ap

[jira] [Created] (ARROW-11673) [C++] Casting dictionary type to use different index type

2021-02-17 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11673: - Summary: [C++] Casting dictionary type to use different index type Key: ARROW-11673 URL: https://issues.apache.org/jira/browse/ARROW-11673 Project: A

[jira] [Created] (ARROW-11759) [C++] Kernel to extract datetime components (year, month, day, etc) from timestamp type

2021-02-24 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11759: - Summary: [C++] Kernel to extract datetime components (year, month, day, etc) from timestamp type Key: ARROW-11759 URL: https://issues.apache.org/jira/browse/ARRO

[jira] [Created] (ARROW-11923) [CI] Update branch name for dask dev integration tests

2021-03-10 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11923: - Summary: [CI] Update branch name for dask dev integration tests Key: ARROW-11923 URL: https://issues.apache.org/jira/browse/ARROW-11923 Project: Apac

[jira] [Created] (ARROW-11980) [Python] Remove "experimental" status from Table.replace_schema_metadata

2021-03-16 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11980: - Summary: [Python] Remove "experimental" status from Table.replace_schema_metadata Key: ARROW-11980 URL: https://issues.apache.org/jira/browse/ARROW-11980

[jira] [Created] (ARROW-11983) [Python] ImportError calling pyarrow from_pandas within ThreadPool

2021-03-16 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11983: - Summary: [Python] ImportError calling pyarrow from_pandas within ThreadPool Key: ARROW-11983 URL: https://issues.apache.org/jira/browse/ARROW-11983

[jira] [Created] (ARROW-12057) [Python] Remove direct usage of pandas' Block subclasses

2021-03-23 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-12057: - Summary: [Python] Remove direct usage of pandas' Block subclasses Key: ARROW-12057 URL: https://issues.apache.org/jira/browse/ARROW-12057 Project: Ap

[jira] [Created] (ARROW-12058) [Python] Enable arithmetic operations on Expressions

2021-03-23 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-12058: - Summary: [Python] Enable arithmetic operations on Expressions Key: ARROW-12058 URL: https://issues.apache.org/jira/browse/ARROW-12058 Project: Apache

[jira] [Created] (ARROW-12060) [Python] Enable calling compute functions on Expressions

2021-03-23 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-12060: - Summary: [Python] Enable calling compute functions on Expressions Key: ARROW-12060 URL: https://issues.apache.org/jira/browse/ARROW-12060 Project: Ap

[jira] [Created] (ARROW-12188) [Docs] Switch to pydata-sphinx-theme for the main sphinx docs

2021-04-02 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-12188: - Summary: [Docs] Switch to pydata-sphinx-theme for the main sphinx docs Key: ARROW-12188 URL: https://issues.apache.org/jira/browse/ARROW-12188 Proje

[jira] [Created] (ARROW-12246) [CI] Synch conda recipes with upstream feedstock

2021-04-07 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-12246: - Summary: [CI] Synch conda recipes with upstream feedstock Key: ARROW-12246 URL: https://issues.apache.org/jira/browse/ARROW-12246 Project: Apache Arr

[jira] [Created] (ARROW-12314) [Python] pq.read_pandas with use_legacy_dataset=False does not accept columns as a set (kartothek integration failure)

2021-04-09 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-12314: - Summary: [Python] pq.read_pandas with use_legacy_dataset=False does not accept columns as a set (kartothek integration failure) Key: ARROW-12314 URL: https://iss

[jira] [Created] (ARROW-12358) [C++][Python][R][Dataset] Control overwriting vs appending when writing to existing dataset

2021-04-13 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-12358: - Summary: [C++][Python][R][Dataset] Control overwriting vs appending when writing to existing dataset Key: ARROW-12358 URL: https://issues.apache.org/jira/browse/

[jira] [Created] (ARROW-12396) [Python][Docs] Clarify serialization docstrings about deprecated status

2021-04-15 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-12396: - Summary: [Python][Docs] Clarify serialization docstrings about deprecated status Key: ARROW-12396 URL: https://issues.apache.org/jira/browse/ARROW-12396

[jira] [Created] (ARROW-12518) [Python] Expose Parquet statistics has_null_count / has_distinct_count

2021-04-23 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-12518: - Summary: [Python] Expose Parquet statistics has_null_count / has_distinct_count Key: ARROW-12518 URL: https://issues.apache.org/jira/browse/ARROW-12518

[jira] [Created] (ARROW-12541) [Docs] Improve styling/readability of tables in the new doc theme

2021-04-26 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-12541: - Summary: [Docs] Improve styling/readability of tables in the new doc theme Key: ARROW-12541 URL: https://issues.apache.org/jira/browse/ARROW-12541 P

[jira] [Created] (ARROW-12545) [Python][Docs] Fill in section about Custom Schema and Field Metadata

2021-04-26 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-12545: - Summary: [Python][Docs] Fill in section about Custom Schema and Field Metadata Key: ARROW-12545 URL: https://issues.apache.org/jira/browse/ARROW-12545

[jira] [Created] (ARROW-12564) [C++] Add compute kernel for extract keys / items from Map type data

2021-04-27 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-12564: - Summary: [C++] Add compute kernel for extract keys / items from Map type data Key: ARROW-12564 URL: https://issues.apache.org/jira/browse/ARROW-12564

[jira] [Created] (ARROW-12611) [CI][Python] Nightly test-conda-python-pandas-0.24 is failing due to numpy compat issue

2021-04-30 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-12611: - Summary: [CI][Python] Nightly test-conda-python-pandas-0.24 is failing due to numpy compat issue Key: ARROW-12611 URL: https://issues.apache.org/jira/browse/ARRO

[jira] [Created] (ARROW-12631) [Python] Should dataset.write_table accept a Scanner?

2021-05-03 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-12631: - Summary: [Python] Should dataset.write_table accept a Scanner? Key: ARROW-12631 URL: https://issues.apache.org/jira/browse/ARROW-12631 Project: Apach

[jira] [Created] (ARROW-12805) [Python] Use consistent memory_pool / pool keyword argument name

2021-05-17 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-12805: - Summary: [Python] Use consistent memory_pool / pool keyword argument name Key: ARROW-12805 URL: https://issues.apache.org/jira/browse/ARROW-12805 Pr

[jira] [Created] (ARROW-12806) [Python] test_write_to_dataset_filesystem missing a dataset mark

2021-05-17 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-12806: - Summary: [Python] test_write_to_dataset_filesystem missing a dataset mark Key: ARROW-12806 URL: https://issues.apache.org/jira/browse/ARROW-12806 Pr

[jira] [Created] (ARROW-12966) [Python] Expose Python binding for ElementWiseAggregateOptions

2021-06-04 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-12966: - Summary: [Python] Expose Python binding for ElementWiseAggregateOptions Key: ARROW-12966 URL: https://issues.apache.org/jira/browse/ARROW-12966 Proj

[jira] [Created] (ARROW-12987) [CI] test-ubuntu-18.04 nightly builds are failing due to Gandiva test failure

2021-06-07 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-12987: - Summary: [CI] test-ubuntu-18.04 nightly builds are failing due to Gandiva test failure Key: ARROW-12987 URL: https://issues.apache.org/jira/browse/ARROW-12987

[jira] [Created] (ARROW-12988) [CI] The kartothek nightly integration build is failing (test_update_dataset_from_ddf_empty)

2021-06-07 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-12988: - Summary: [CI] The kartothek nightly integration build is failing (test_update_dataset_from_ddf_empty) Key: ARROW-12988 URL: https://issues.apache.org/jira/browse

[jira] [Created] (ARROW-13011) [Python] Using fs.HadoopFileSystem in the dask tests crashes

2021-06-08 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13011: - Summary: [Python] Using fs.HadoopFileSystem in the dask tests crashes Key: ARROW-13011 URL: https://issues.apache.org/jira/browse/ARROW-13011 Projec

[jira] [Created] (ARROW-13018) [C++][Docs] Use consistent terminology for nulls (min_count) in scalar aggregate kernels

2021-06-09 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13018: - Summary: [C++][Docs] Use consistent terminology for nulls (min_count) in scalar aggregate kernels Key: ARROW-13018 URL: https://issues.apache.org/jira/browse/ARR

[jira] [Created] (ARROW-13033) [C++] Kernel to localize naive timestamps to a timezone (preserving clock-time)

2021-06-10 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13033: - Summary: [C++] Kernel to localize naive timestamps to a timezone (preserving clock-time) Key: ARROW-13033 URL: https://issues.apache.org/jira/browse/ARROW-13033

[jira] [Created] (ARROW-13034) [Python][Docs] Update outdated examples for hdfs/azure on the Parquet doc page

2021-06-10 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13034: - Summary: [Python][Docs] Update outdated examples for hdfs/azure on the Parquet doc page Key: ARROW-13034 URL: https://issues.apache.org/jira/browse/ARROW-13034

[jira] [Created] (ARROW-13074) [Python] Start with deprecating ParquetDataset custom attributes

2021-06-14 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13074: - Summary: [Python] Start with deprecating ParquetDataset custom attributes Key: ARROW-13074 URL: https://issues.apache.org/jira/browse/ARROW-13074 Pr

[jira] [Created] (ARROW-13081) [C++] Comparison kernels should not allow to compare tz-naive and tz-aware timestamps

2021-06-15 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13081: - Summary: [C++] Comparison kernels should not allow to compare tz-naive and tz-aware timestamps Key: ARROW-13081 URL: https://issues.apache.org/jira/browse/ARROW-

[jira] [Created] (ARROW-13141) [C++][Python] HadoopFileSystem: automatically set CLASSPATH based on HADOOP_HOME env variable?

2021-06-22 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13141: - Summary: [C++][Python] HadoopFileSystem: automatically set CLASSPATH based on HADOOP_HOME env variable? Key: ARROW-13141 URL: https://issues.apache.org/jira/brow

[jira] [Created] (ARROW-13158) [Python] Fix repr and contains of StructScalar with duplicate field names

2021-06-24 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13158: - Summary: [Python] Fix repr and contains of StructScalar with duplicate field names Key: ARROW-13158 URL: https://issues.apache.org/jira/browse/ARROW-13158

[jira] [Created] (ARROW-13159) [Doc][Python] The use of IPython directive or doctest code blocks in the python user guide

2021-06-24 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13159: - Summary: [Doc][Python] The use of IPython directive or doctest code blocks in the python user guide Key: ARROW-13159 URL: https://issues.apache.org/jira/browse/A

[jira] [Created] (ARROW-13236) [Python] Improve repr of pyarrow.compute.FunctionOptions

2021-07-01 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13236: - Summary: [Python] Improve repr of pyarrow.compute.FunctionOptions Key: ARROW-13236 URL: https://issues.apache.org/jira/browse/ARROW-13236 Project: Ap

[jira] [Created] (ARROW-13247) [C++] Kernel to convert timestamp with timezone to another timezone (metadata-only change)

2021-07-02 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13247: - Summary: [C++] Kernel to convert timestamp with timezone to another timezone (metadata-only change) Key: ARROW-13247 URL: https://issues.apache.org/jira/browse/A

[jira] [Created] (ARROW-13258) [Python] Improve the repr of ParquetFileFragment

2021-07-05 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13258: - Summary: [Python] Improve the repr of ParquetFileFragment Key: ARROW-13258 URL: https://issues.apache.org/jira/browse/ARROW-13258 Project: Apache Arr

[jira] [Created] (ARROW-13260) [Doc] Host different released versions of the documentation + version switcher

2021-07-05 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13260: - Summary: [Doc] Host different released versions of the documentation + version switcher Key: ARROW-13260 URL: https://issues.apache.org/jira/browse/ARROW-13260

[jira] [Created] (ARROW-13350) [Python][CI] conda-python-3.7-pandas-0.24 nightly build failing in test_extract_datetime_components

2021-07-16 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13350: - Summary: [Python][CI] conda-python-3.7-pandas-0.24 nightly build failing in test_extract_datetime_components Key: ARROW-13350 URL: https://issues.apache.org/jira

[jira] [Created] (ARROW-13351) [Python] Bump minimum support pandas version to pandas 1.0

2021-07-16 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13351: - Summary: [Python] Bump minimum support pandas version to pandas 1.0 Key: ARROW-13351 URL: https://issues.apache.org/jira/browse/ARROW-13351 Project:

[jira] [Created] (ARROW-13525) [Python] Mention alternatives in deprecation message of ParquetDataset attributes

2021-08-02 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13525: - Summary: [Python] Mention alternatives in deprecation message of ParquetDataset attributes Key: ARROW-13525 URL: https://issues.apache.org/jira/browse/ARROW-1352

[jira] [Created] (ARROW-13594) [CI] Turbodbc integration builds are failing due to use of deprecated/removed APIs

2021-08-10 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13594: - Summary: [CI] Turbodbc integration builds are failing due to use of deprecated/removed APIs Key: ARROW-13594 URL: https://issues.apache.org/jira/browse/ARROW-135

[jira] [Created] (ARROW-13612) [Python] Allow specifying a custom type for converting ExtensionScalar to python object

2021-08-12 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13612: - Summary: [Python] Allow specifying a custom type for converting ExtensionScalar to python object Key: ARROW-13612 URL: https://issues.apache.org/jira/browse/ARRO

[jira] [Created] (ARROW-13652) [Python] Expose the CopyFiles utility in Python

2021-08-17 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13652: - Summary: [Python] Expose the CopyFiles utility in Python Key: ARROW-13652 URL: https://issues.apache.org/jira/browse/ARROW-13652 Project: Apache Arro

[jira] [Created] (ARROW-13654) [C++][Parquet] Appending a FileMetaData object to itselfs explodes memory

2021-08-18 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13654: - Summary: [C++][Parquet] Appending a FileMetaData object to itselfs explodes memory Key: ARROW-13654 URL: https://issues.apache.org/jira/browse/ARROW-13654

[jira] [Created] (ARROW-13655) [C++][Parquet] Reading large Parquet file can give "MaxMessageSize reached" error with Thrift 0.14

2021-08-18 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13655: - Summary: [C++][Parquet] Reading large Parquet file can give "MaxMessageSize reached" error with Thrift 0.14 Key: ARROW-13655 URL: https://issues.apache.org/jira/

[jira] [Created] (ARROW-13662) [CI] Failing test test_extract_datetime_components with pandas 0.24

2021-08-18 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13662: - Summary: [CI] Failing test test_extract_datetime_components with pandas 0.24 Key: ARROW-13662 URL: https://issues.apache.org/jira/browse/ARROW-13662

[jira] [Created] (ARROW-13735) [Python] Creating a Map array with non-default field names segfaults

2021-08-24 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13735: - Summary: [Python] Creating a Map array with non-default field names segfaults Key: ARROW-13735 URL: https://issues.apache.org/jira/browse/ARROW-13735

[jira] [Created] (ARROW-13791) [Python][Docs] Update datasets user guide with more details on Partitioning(Factory)

2021-08-30 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13791: - Summary: [Python][Docs] Update datasets user guide with more details on Partitioning(Factory) Key: ARROW-13791 URL: https://issues.apache.org/jira/browse/ARROW-1

[jira] [Created] (ARROW-13795) [C++] Add async version of the ORC Dataset scanner

2021-08-30 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13795: - Summary: [C++] Add async version of the ORC Dataset scanner Key: ARROW-13795 URL: https://issues.apache.org/jira/browse/ARROW-13795 Project: Apache A

[jira] [Created] (ARROW-13796) [C++] Add write support for ORC in the Datasets API

2021-08-30 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13796: - Summary: [C++] Add write support for ORC in the Datasets API Key: ARROW-13796 URL: https://issues.apache.org/jira/browse/ARROW-13796 Project: Apache

[jira] [Created] (ARROW-13797) [C++] Implement column projection pushdown to ORC reader in Datasets API

2021-08-30 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13797: - Summary: [C++] Implement column projection pushdown to ORC reader in Datasets API Key: ARROW-13797 URL: https://issues.apache.org/jira/browse/ARROW-13797

[jira] [Created] (ARROW-13813) [C++][Dataset] Support URL encoding of partition field values for the file path

2021-08-31 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13813: - Summary: [C++][Dataset] Support URL encoding of partition field values for the file path Key: ARROW-13813 URL: https://issues.apache.org/jira/browse/ARROW-13813

[jira] [Created] (ARROW-13814) [CI] Nightly integration build with spark master failing to compile spark

2021-08-31 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13814: - Summary: [CI] Nightly integration build with spark master failing to compile spark Key: ARROW-13814 URL: https://issues.apache.org/jira/browse/ARROW-13814

[jira] [Created] (ARROW-13958) [Python] Migrate Python ORC bindings to use new Result-based APIs

2021-09-09 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13958: - Summary: [Python] Migrate Python ORC bindings to use new Result-based APIs Key: ARROW-13958 URL: https://issues.apache.org/jira/browse/ARROW-13958 P

[jira] [Created] (ARROW-14003) [Python] Not providing a sort_key in the "select_k_unstable" kernel crashes

2021-09-15 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14003: - Summary: [Python] Not providing a sort_key in the "select_k_unstable" kernel crashes Key: ARROW-14003 URL: https://issues.apache.org/jira/browse/ARROW-14003

[jira] [Created] (ARROW-14055) [Docs] Add canonical url to the docs

2021-09-21 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14055: - Summary: [Docs] Add canonical url to the docs Key: ARROW-14055 URL: https://issues.apache.org/jira/browse/ARROW-14055 Project: Apache Arrow

[jira] [Created] (ARROW-14115) [Python] Remove deprecated pyarrow.serialization functionality

2021-09-24 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14115: - Summary: [Python] Remove deprecated pyarrow.serialization functionality Key: ARROW-14115 URL: https://issues.apache.org/jira/browse/ARROW-14115 Proj

[jira] [Created] (ARROW-14153) [C++] Add support for batch_size in the ORC Scanner (Dataset)

2021-09-28 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14153: - Summary: [C++] Add support for batch_size in the ORC Scanner (Dataset) Key: ARROW-14153 URL: https://issues.apache.org/jira/browse/ARROW-14153 Proje

[jira] [Created] (ARROW-14189) [Docs] Add version dropdown to the sphinx docs

2021-10-01 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14189: - Summary: [Docs] Add version dropdown to the sphinx docs Key: ARROW-14189 URL: https://issues.apache.org/jira/browse/ARROW-14189 Project: Apache Arrow

[jira] [Created] (ARROW-14194) [Docs] Improve vertical spacing in the sphinx API docs

2021-10-01 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14194: - Summary: [Docs] Improve vertical spacing in the sphinx API docs Key: ARROW-14194 URL: https://issues.apache.org/jira/browse/ARROW-14194 Project: Apac

[jira] [Created] (ARROW-14196) [C++][Parquet] Default to compliant nested types in Parquet writer

2021-10-01 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14196: - Summary: [C++][Parquet] Default to compliant nested types in Parquet writer Key: ARROW-14196 URL: https://issues.apache.org/jira/browse/ARROW-14196

[jira] [Created] (ARROW-14241) [C++] Dataset ORC build failing in java-jars nightly build

2021-10-06 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14241: - Summary: [C++] Dataset ORC build failing in java-jars nightly build Key: ARROW-14241 URL: https://issues.apache.org/jira/browse/ARROW-14241 Project:

[jira] [Created] (ARROW-14284) [C++][Python] Improve error message when trying use SyncScanner when requiring async

2021-10-11 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14284: - Summary: [C++][Python] Improve error message when trying use SyncScanner when requiring async Key: ARROW-14284 URL: https://issues.apache.org/jira/browse/ARROW-1

[jira] [Created] (ARROW-14286) [Python][Parquet] Allow to select columns of a list field without requiring the list component names

2021-10-11 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14286: - Summary: [Python][Parquet] Allow to select columns of a list field without requiring the list component names Key: ARROW-14286 URL: https://issues.apache.org/jir

[jira] [Created] (ARROW-14287) [R] Selecting colums while reading Parquet file with nested types can give wrong column

2021-10-11 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14287: - Summary: [R] Selecting colums while reading Parquet file with nested types can give wrong column Key: ARROW-14287 URL: https://issues.apache.org/jira/browse/ARRO

[jira] [Created] (ARROW-14406) [Python][CI] Nightly dask integration jobs fail

2021-10-21 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14406: - Summary: [Python][CI] Nightly dask integration jobs fail Key: ARROW-14406 URL: https://issues.apache.org/jira/browse/ARROW-14406 Project: Apache Arro

[jira] [Created] (ARROW-14447) [Python] Use oldest-supported-numpy for declaring numpy version build dependency

2021-10-22 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14447: - Summary: [Python] Use oldest-supported-numpy for declaring numpy version build dependency Key: ARROW-14447 URL: https://issues.apache.org/jira/browse/ARROW-14447

[jira] [Created] (ARROW-14448) [Python] Update pyarrow.array() docstring note on timestamp (timezone) conversion

2021-10-22 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14448: - Summary: [Python] Update pyarrow.array() docstring note on timestamp (timezone) conversion Key: ARROW-14448 URL: https://issues.apache.org/jira/browse/ARROW-1444

[jira] [Created] (ARROW-14459) [Doc] Update the pinned sphinx version

2021-10-25 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14459: - Summary: [Doc] Update the pinned sphinx version Key: ARROW-14459 URL: https://issues.apache.org/jira/browse/ARROW-14459 Project: Apache Arrow

[jira] [Created] (ARROW-14460) [Doc] Use sphinx-remove-toctrees to generated docstring pages from navigation (and reduce build time)

2021-10-25 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14460: - Summary: [Doc] Use sphinx-remove-toctrees to generated docstring pages from navigation (and reduce build time) Key: ARROW-14460 URL: https://issues.apache.org/ji

[jira] [Created] (ARROW-14470) [Python] Expose the use_threads option in Feather read functions

2021-10-26 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14470: - Summary: [Python] Expose the use_threads option in Feather read functions Key: ARROW-14470 URL: https://issues.apache.org/jira/browse/ARROW-14470 Pr

[jira] [Created] (ARROW-14495) [Python] DictionaryArray.from_buffers should not crash

2021-10-28 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14495: - Summary: [Python] DictionaryArray.from_buffers should not crash Key: ARROW-14495 URL: https://issues.apache.org/jira/browse/ARROW-14495 Project: Apac

[jira] [Created] (ARROW-14496) [Docs] Ensure links to non-sphinx parts of the docs are relative instead of absolute

2021-10-28 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14496: - Summary: [Docs] Ensure links to non-sphinx parts of the docs are relative instead of absolute Key: ARROW-14496 URL: https://issues.apache.org/jira/browse/ARROW-1

[jira] [Created] (ARROW-14500) [C++] Support casting from storage type to extension type

2021-10-28 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14500: - Summary: [C++] Support casting from storage type to extension type Key: ARROW-14500 URL: https://issues.apache.org/jira/browse/ARROW-14500 Project: A

  1   2   3   4   5   6   7   8   9   10   >