Joris Van den Bossche created ARROW-10882:
-
Summary: [Python][Dataset] Writing dataset from python iterator of
record batches
Key: ARROW-10882
URL: https://issues.apache.org/jira/browse/ARROW-10882
Joris Van den Bossche created ARROW-10883:
-
Summary: [C++][Dataset] Preserve order when writing dataset
Key: ARROW-10883
URL: https://issues.apache.org/jira/browse/ARROW-10883
Project: Apache A
Joris Van den Bossche created ARROW-10951:
-
Summary: [Python][CI] Nightly pandas builds failing because of
pytest monkeypatch issue
Key: ARROW-10951
URL: https://issues.apache.org/jira/browse/ARROW-10951
Joris Van den Bossche created ARROW-10998:
-
Summary: [C++] Filesystems: detect if URI is passed where a file
path is required and raise informative error
Key: ARROW-10998
URL: https://issues.apache.org/jir
Joris Van den Bossche created ARROW-11000:
-
Summary: [Python] Enable random access reading for Python file
objects (if supported)
Key: ARROW-11000
URL: https://issues.apache.org/jira/browse/ARROW-11000
Joris Van den Bossche created ARROW-11001:
-
Summary: [C++][Dataset] Enable column renaming (in physical schema
-> dataset schema) in Dataset scanning
Key: ARROW-11001
URL: https://issues.apache.org/jira/br
Joris Van den Bossche created ARROW-11003:
-
Summary: [C++][Dataset] Schema evolution in Dataset scanning
Key: ARROW-11003
URL: https://issues.apache.org/jira/browse/ARROW-11003
Project: Apache
Joris Van den Bossche created ARROW-11142:
-
Summary: [C++][Parquet] Inconsistent batch_size usage in parquet
GetRecordBatchReader
Key: ARROW-11142
URL: https://issues.apache.org/jira/browse/ARROW-11142
Joris Van den Bossche created ARROW-11163:
-
Summary: [C++][Python] Compressed Feather file written with
pyarrow 0.17 not readable in pyarrow 2.0.0+
Key: ARROW-11163
URL: https://issues.apache.org/jira/brow
Joris Van den Bossche created ARROW-11166:
-
Summary: [Python][Compute] Add bindings for ProjectOptions
Key: ARROW-11166
URL: https://issues.apache.org/jira/browse/ARROW-11166
Project: Apache Ar
Joris Van den Bossche created ARROW-11167:
-
Summary: [Python][Compute] Improve usability for defining sort
options
Key: ARROW-11167
URL: https://issues.apache.org/jira/browse/ARROW-11167
Proje
Joris Van den Bossche created ARROW-11226:
-
Summary: [Python][CI] Windows tests failing with s3fs 0.5.2
Key: ARROW-11226
URL: https://issues.apache.org/jira/browse/ARROW-11226
Project: Apache A
Joris Van den Bossche created ARROW-11227:
-
Summary: [Python][CI] AMD64 Conda Python 3.7 Pandas 0.24 cron
job failing in to_pandas extension dtype test
Key: ARROW-11227
URL: https://issues.apache.org/jir
Joris Van den Bossche created ARROW-11259:
-
Summary: [Python] Allow to create field reference to nested field
Key: ARROW-11259
URL: https://issues.apache.org/jira/browse/ARROW-11259
Project: Ap
Joris Van den Bossche created ARROW-11260:
-
Summary: [C++][Dataset] Don't require dictionaries for reading
dataset with schema-based Partitioning
Key: ARROW-11260
URL: https://issues.apache.org/jira/browse
Joris Van den Bossche created ARROW-11334:
-
Summary: [Python][CI] Nightly pandas builds failing because of
internal pandas change
Key: ARROW-11334
URL: https://issues.apache.org/jira/browse/ARROW-11334
Joris Van den Bossche created ARROW-11370:
-
Summary: [C++] Ability to "re-chunk" Tables or ChunkedArrays
Key: ARROW-11370
URL: https://issues.apache.org/jira/browse/ARROW-11370
Project: Apache
Joris Van den Bossche created ARROW-11373:
-
Summary: [Python][Docs] Add example of specifying type for a
column when reading csv file
Key: ARROW-11373
URL: https://issues.apache.org/jira/browse/ARROW-11373
Joris Van den Bossche created ARROW-11374:
-
Summary: [Python] Make legacy pyarrow.filesystem /
pyarrow.serialize warnings more visisble
Key: ARROW-11374
URL: https://issues.apache.org/jira/browse/ARROW-113
Joris Van den Bossche created ARROW-11378:
-
Summary: [C++][Dataset] Writing partitions with timestamp type
give mis-formatted (integers) directory names
Key: ARROW-11378
URL: https://issues.apache.org/jira
Joris Van den Bossche created ARROW-11379:
-
Summary: [C++][Dataset] Reading dataset with filtering on
timestamp partition field crashes
Key: ARROW-11379
URL: https://issues.apache.org/jira/browse/ARROW-113
Joris Van den Bossche created ARROW-11399:
-
Summary: [C++][Parquet] Timestamp ColumnDescriptor (from logical
type) incorrectly showing ConvertedType as NONE
Key: ARROW-11399
URL: https://issues.apache.org/
Joris Van den Bossche created ARROW-11400:
-
Summary: [Python] Pickled ParquetFileFragment has invalid
partition_expresion with dictionary type in pyarrow 2.0
Key: ARROW-11400
URL: https://issues.apache.org
Joris Van den Bossche created ARROW-11472:
-
Summary: [Python][CI] Kartothek integrations build is failing with
numpy 1.20
Key: ARROW-11472
URL: https://issues.apache.org/jira/browse/ARROW-11472
Joris Van den Bossche created ARROW-11553:
-
Summary: [Python] Make Table.cast(schema) more flexible regarding
order of fields / missing fields?
Key: ARROW-11553
URL: https://issues.apache.org/jira/browse/A
Joris Van den Bossche created ARROW-11608:
-
Summary: [CI] turbodbc integration tests are failing (build isue)
Key: ARROW-11608
URL: https://issues.apache.org/jira/browse/ARROW-11608
Project: Ap
Joris Van den Bossche created ARROW-11673:
-
Summary: [C++] Casting dictionary type to use different index type
Key: ARROW-11673
URL: https://issues.apache.org/jira/browse/ARROW-11673
Project: A
Joris Van den Bossche created ARROW-11759:
-
Summary: [C++] Kernel to extract datetime components (year, month,
day, etc) from timestamp type
Key: ARROW-11759
URL: https://issues.apache.org/jira/browse/ARRO
Joris Van den Bossche created ARROW-11923:
-
Summary: [CI] Update branch name for dask dev integration tests
Key: ARROW-11923
URL: https://issues.apache.org/jira/browse/ARROW-11923
Project: Apac
Joris Van den Bossche created ARROW-11980:
-
Summary: [Python] Remove "experimental" status from
Table.replace_schema_metadata
Key: ARROW-11980
URL: https://issues.apache.org/jira/browse/ARROW-11980
Joris Van den Bossche created ARROW-11983:
-
Summary: [Python] ImportError calling pyarrow from_pandas within
ThreadPool
Key: ARROW-11983
URL: https://issues.apache.org/jira/browse/ARROW-11983
Joris Van den Bossche created ARROW-12057:
-
Summary: [Python] Remove direct usage of pandas' Block subclasses
Key: ARROW-12057
URL: https://issues.apache.org/jira/browse/ARROW-12057
Project: Ap
Joris Van den Bossche created ARROW-12058:
-
Summary: [Python] Enable arithmetic operations on Expressions
Key: ARROW-12058
URL: https://issues.apache.org/jira/browse/ARROW-12058
Project: Apache
Joris Van den Bossche created ARROW-12060:
-
Summary: [Python] Enable calling compute functions on Expressions
Key: ARROW-12060
URL: https://issues.apache.org/jira/browse/ARROW-12060
Project: Ap
Joris Van den Bossche created ARROW-12188:
-
Summary: [Docs] Switch to pydata-sphinx-theme for the main sphinx
docs
Key: ARROW-12188
URL: https://issues.apache.org/jira/browse/ARROW-12188
Proje
Joris Van den Bossche created ARROW-12246:
-
Summary: [CI] Synch conda recipes with upstream feedstock
Key: ARROW-12246
URL: https://issues.apache.org/jira/browse/ARROW-12246
Project: Apache Arr
Joris Van den Bossche created ARROW-12314:
-
Summary: [Python] pq.read_pandas with use_legacy_dataset=False
does not accept columns as a set (kartothek integration failure)
Key: ARROW-12314
URL: https://iss
Joris Van den Bossche created ARROW-12358:
-
Summary: [C++][Python][R][Dataset] Control overwriting vs
appending when writing to existing dataset
Key: ARROW-12358
URL: https://issues.apache.org/jira/browse/
Joris Van den Bossche created ARROW-12396:
-
Summary: [Python][Docs] Clarify serialization docstrings about
deprecated status
Key: ARROW-12396
URL: https://issues.apache.org/jira/browse/ARROW-12396
Joris Van den Bossche created ARROW-12518:
-
Summary: [Python] Expose Parquet statistics has_null_count /
has_distinct_count
Key: ARROW-12518
URL: https://issues.apache.org/jira/browse/ARROW-12518
Joris Van den Bossche created ARROW-12541:
-
Summary: [Docs] Improve styling/readability of tables in the new
doc theme
Key: ARROW-12541
URL: https://issues.apache.org/jira/browse/ARROW-12541
P
Joris Van den Bossche created ARROW-12545:
-
Summary: [Python][Docs] Fill in section about Custom Schema and
Field Metadata
Key: ARROW-12545
URL: https://issues.apache.org/jira/browse/ARROW-12545
Joris Van den Bossche created ARROW-12564:
-
Summary: [C++] Add compute kernel for extract keys / items from
Map type data
Key: ARROW-12564
URL: https://issues.apache.org/jira/browse/ARROW-12564
Joris Van den Bossche created ARROW-12611:
-
Summary: [CI][Python] Nightly test-conda-python-pandas-0.24 is
failing due to numpy compat issue
Key: ARROW-12611
URL: https://issues.apache.org/jira/browse/ARRO
Joris Van den Bossche created ARROW-12631:
-
Summary: [Python] Should dataset.write_table accept a Scanner?
Key: ARROW-12631
URL: https://issues.apache.org/jira/browse/ARROW-12631
Project: Apach
Joris Van den Bossche created ARROW-12805:
-
Summary: [Python] Use consistent memory_pool / pool keyword
argument name
Key: ARROW-12805
URL: https://issues.apache.org/jira/browse/ARROW-12805
Pr
Joris Van den Bossche created ARROW-12806:
-
Summary: [Python] test_write_to_dataset_filesystem missing a
dataset mark
Key: ARROW-12806
URL: https://issues.apache.org/jira/browse/ARROW-12806
Pr
Joris Van den Bossche created ARROW-12966:
-
Summary: [Python] Expose Python binding for
ElementWiseAggregateOptions
Key: ARROW-12966
URL: https://issues.apache.org/jira/browse/ARROW-12966
Proj
Joris Van den Bossche created ARROW-12987:
-
Summary: [CI] test-ubuntu-18.04 nightly builds are failing due to
Gandiva test failure
Key: ARROW-12987
URL: https://issues.apache.org/jira/browse/ARROW-12987
Joris Van den Bossche created ARROW-12988:
-
Summary: [CI] The kartothek nightly integration build is failing
(test_update_dataset_from_ddf_empty)
Key: ARROW-12988
URL: https://issues.apache.org/jira/browse
Joris Van den Bossche created ARROW-13011:
-
Summary: [Python] Using fs.HadoopFileSystem in the dask tests
crashes
Key: ARROW-13011
URL: https://issues.apache.org/jira/browse/ARROW-13011
Projec
Joris Van den Bossche created ARROW-13018:
-
Summary: [C++][Docs] Use consistent terminology for nulls
(min_count) in scalar aggregate kernels
Key: ARROW-13018
URL: https://issues.apache.org/jira/browse/ARR
Joris Van den Bossche created ARROW-13033:
-
Summary: [C++] Kernel to localize naive timestamps to a timezone
(preserving clock-time)
Key: ARROW-13033
URL: https://issues.apache.org/jira/browse/ARROW-13033
Joris Van den Bossche created ARROW-13034:
-
Summary: [Python][Docs] Update outdated examples for hdfs/azure on
the Parquet doc page
Key: ARROW-13034
URL: https://issues.apache.org/jira/browse/ARROW-13034
Joris Van den Bossche created ARROW-13074:
-
Summary: [Python] Start with deprecating ParquetDataset custom
attributes
Key: ARROW-13074
URL: https://issues.apache.org/jira/browse/ARROW-13074
Pr
Joris Van den Bossche created ARROW-13081:
-
Summary: [C++] Comparison kernels should not allow to compare
tz-naive and tz-aware timestamps
Key: ARROW-13081
URL: https://issues.apache.org/jira/browse/ARROW-
Joris Van den Bossche created ARROW-13141:
-
Summary: [C++][Python] HadoopFileSystem: automatically set
CLASSPATH based on HADOOP_HOME env variable?
Key: ARROW-13141
URL: https://issues.apache.org/jira/brow
Joris Van den Bossche created ARROW-13158:
-
Summary: [Python] Fix repr and contains of StructScalar with
duplicate field names
Key: ARROW-13158
URL: https://issues.apache.org/jira/browse/ARROW-13158
Joris Van den Bossche created ARROW-13159:
-
Summary: [Doc][Python] The use of IPython directive or doctest
code blocks in the python user guide
Key: ARROW-13159
URL: https://issues.apache.org/jira/browse/A
Joris Van den Bossche created ARROW-13236:
-
Summary: [Python] Improve repr of pyarrow.compute.FunctionOptions
Key: ARROW-13236
URL: https://issues.apache.org/jira/browse/ARROW-13236
Project: Ap
Joris Van den Bossche created ARROW-13247:
-
Summary: [C++] Kernel to convert timestamp with timezone to
another timezone (metadata-only change)
Key: ARROW-13247
URL: https://issues.apache.org/jira/browse/A
Joris Van den Bossche created ARROW-13258:
-
Summary: [Python] Improve the repr of ParquetFileFragment
Key: ARROW-13258
URL: https://issues.apache.org/jira/browse/ARROW-13258
Project: Apache Arr
Joris Van den Bossche created ARROW-13260:
-
Summary: [Doc] Host different released versions of the
documentation + version switcher
Key: ARROW-13260
URL: https://issues.apache.org/jira/browse/ARROW-13260
Joris Van den Bossche created ARROW-13350:
-
Summary: [Python][CI] conda-python-3.7-pandas-0.24 nightly build
failing in test_extract_datetime_components
Key: ARROW-13350
URL: https://issues.apache.org/jira
Joris Van den Bossche created ARROW-13351:
-
Summary: [Python] Bump minimum support pandas version to pandas 1.0
Key: ARROW-13351
URL: https://issues.apache.org/jira/browse/ARROW-13351
Project:
Joris Van den Bossche created ARROW-13525:
-
Summary: [Python] Mention alternatives in deprecation message of
ParquetDataset attributes
Key: ARROW-13525
URL: https://issues.apache.org/jira/browse/ARROW-1352
Joris Van den Bossche created ARROW-13594:
-
Summary: [CI] Turbodbc integration builds are failing due to use
of deprecated/removed APIs
Key: ARROW-13594
URL: https://issues.apache.org/jira/browse/ARROW-135
Joris Van den Bossche created ARROW-13612:
-
Summary: [Python] Allow specifying a custom type for converting
ExtensionScalar to python object
Key: ARROW-13612
URL: https://issues.apache.org/jira/browse/ARRO
Joris Van den Bossche created ARROW-13652:
-
Summary: [Python] Expose the CopyFiles utility in Python
Key: ARROW-13652
URL: https://issues.apache.org/jira/browse/ARROW-13652
Project: Apache Arro
Joris Van den Bossche created ARROW-13654:
-
Summary: [C++][Parquet] Appending a FileMetaData object to itselfs
explodes memory
Key: ARROW-13654
URL: https://issues.apache.org/jira/browse/ARROW-13654
Joris Van den Bossche created ARROW-13655:
-
Summary: [C++][Parquet] Reading large Parquet file can give
"MaxMessageSize reached" error with Thrift 0.14
Key: ARROW-13655
URL: https://issues.apache.org/jira/
Joris Van den Bossche created ARROW-13662:
-
Summary: [CI] Failing test test_extract_datetime_components with
pandas 0.24
Key: ARROW-13662
URL: https://issues.apache.org/jira/browse/ARROW-13662
Joris Van den Bossche created ARROW-13735:
-
Summary: [Python] Creating a Map array with non-default field
names segfaults
Key: ARROW-13735
URL: https://issues.apache.org/jira/browse/ARROW-13735
Joris Van den Bossche created ARROW-13791:
-
Summary: [Python][Docs] Update datasets user guide with more
details on Partitioning(Factory)
Key: ARROW-13791
URL: https://issues.apache.org/jira/browse/ARROW-1
Joris Van den Bossche created ARROW-13795:
-
Summary: [C++] Add async version of the ORC Dataset scanner
Key: ARROW-13795
URL: https://issues.apache.org/jira/browse/ARROW-13795
Project: Apache A
Joris Van den Bossche created ARROW-13796:
-
Summary: [C++] Add write support for ORC in the Datasets API
Key: ARROW-13796
URL: https://issues.apache.org/jira/browse/ARROW-13796
Project: Apache
Joris Van den Bossche created ARROW-13797:
-
Summary: [C++] Implement column projection pushdown to ORC reader
in Datasets API
Key: ARROW-13797
URL: https://issues.apache.org/jira/browse/ARROW-13797
Joris Van den Bossche created ARROW-13813:
-
Summary: [C++][Dataset] Support URL encoding of partition field
values for the file path
Key: ARROW-13813
URL: https://issues.apache.org/jira/browse/ARROW-13813
Joris Van den Bossche created ARROW-13814:
-
Summary: [CI] Nightly integration build with spark master failing
to compile spark
Key: ARROW-13814
URL: https://issues.apache.org/jira/browse/ARROW-13814
Joris Van den Bossche created ARROW-13958:
-
Summary: [Python] Migrate Python ORC bindings to use new
Result-based APIs
Key: ARROW-13958
URL: https://issues.apache.org/jira/browse/ARROW-13958
P
Joris Van den Bossche created ARROW-14003:
-
Summary: [Python] Not providing a sort_key in the
"select_k_unstable" kernel crashes
Key: ARROW-14003
URL: https://issues.apache.org/jira/browse/ARROW-14003
Joris Van den Bossche created ARROW-14055:
-
Summary: [Docs] Add canonical url to the docs
Key: ARROW-14055
URL: https://issues.apache.org/jira/browse/ARROW-14055
Project: Apache Arrow
Joris Van den Bossche created ARROW-14115:
-
Summary: [Python] Remove deprecated pyarrow.serialization
functionality
Key: ARROW-14115
URL: https://issues.apache.org/jira/browse/ARROW-14115
Proj
Joris Van den Bossche created ARROW-14153:
-
Summary: [C++] Add support for batch_size in the ORC Scanner
(Dataset)
Key: ARROW-14153
URL: https://issues.apache.org/jira/browse/ARROW-14153
Proje
Joris Van den Bossche created ARROW-14189:
-
Summary: [Docs] Add version dropdown to the sphinx docs
Key: ARROW-14189
URL: https://issues.apache.org/jira/browse/ARROW-14189
Project: Apache Arrow
Joris Van den Bossche created ARROW-14194:
-
Summary: [Docs] Improve vertical spacing in the sphinx API docs
Key: ARROW-14194
URL: https://issues.apache.org/jira/browse/ARROW-14194
Project: Apac
Joris Van den Bossche created ARROW-14196:
-
Summary: [C++][Parquet] Default to compliant nested types in
Parquet writer
Key: ARROW-14196
URL: https://issues.apache.org/jira/browse/ARROW-14196
Joris Van den Bossche created ARROW-14241:
-
Summary: [C++] Dataset ORC build failing in java-jars nightly build
Key: ARROW-14241
URL: https://issues.apache.org/jira/browse/ARROW-14241
Project:
Joris Van den Bossche created ARROW-14284:
-
Summary: [C++][Python] Improve error message when trying use
SyncScanner when requiring async
Key: ARROW-14284
URL: https://issues.apache.org/jira/browse/ARROW-1
Joris Van den Bossche created ARROW-14286:
-
Summary: [Python][Parquet] Allow to select columns of a list field
without requiring the list component names
Key: ARROW-14286
URL: https://issues.apache.org/jir
Joris Van den Bossche created ARROW-14287:
-
Summary: [R] Selecting colums while reading Parquet file with
nested types can give wrong column
Key: ARROW-14287
URL: https://issues.apache.org/jira/browse/ARRO
Joris Van den Bossche created ARROW-14406:
-
Summary: [Python][CI] Nightly dask integration jobs fail
Key: ARROW-14406
URL: https://issues.apache.org/jira/browse/ARROW-14406
Project: Apache Arro
Joris Van den Bossche created ARROW-14447:
-
Summary: [Python] Use oldest-supported-numpy for declaring numpy
version build dependency
Key: ARROW-14447
URL: https://issues.apache.org/jira/browse/ARROW-14447
Joris Van den Bossche created ARROW-14448:
-
Summary: [Python] Update pyarrow.array() docstring note on
timestamp (timezone) conversion
Key: ARROW-14448
URL: https://issues.apache.org/jira/browse/ARROW-1444
Joris Van den Bossche created ARROW-14459:
-
Summary: [Doc] Update the pinned sphinx version
Key: ARROW-14459
URL: https://issues.apache.org/jira/browse/ARROW-14459
Project: Apache Arrow
Joris Van den Bossche created ARROW-14460:
-
Summary: [Doc] Use sphinx-remove-toctrees to generated docstring
pages from navigation (and reduce build time)
Key: ARROW-14460
URL: https://issues.apache.org/ji
Joris Van den Bossche created ARROW-14470:
-
Summary: [Python] Expose the use_threads option in Feather read
functions
Key: ARROW-14470
URL: https://issues.apache.org/jira/browse/ARROW-14470
Pr
Joris Van den Bossche created ARROW-14495:
-
Summary: [Python] DictionaryArray.from_buffers should not crash
Key: ARROW-14495
URL: https://issues.apache.org/jira/browse/ARROW-14495
Project: Apac
Joris Van den Bossche created ARROW-14496:
-
Summary: [Docs] Ensure links to non-sphinx parts of the docs are
relative instead of absolute
Key: ARROW-14496
URL: https://issues.apache.org/jira/browse/ARROW-1
Joris Van den Bossche created ARROW-14500:
-
Summary: [C++] Support casting from storage type to extension type
Key: ARROW-14500
URL: https://issues.apache.org/jira/browse/ARROW-14500
Project: A
1 - 100 of 1560 matches
Mail list logo