This is an automated email from the ASF dual-hosted git repository.
bkietz pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/arrow.git.
from f03c844 ARROW-7608: [C++][Dataset] Add the ability to list files in FileSystemSource
add efbc047 ARROW-7547: [C++][Dataset][Python] Add ParquetFileFormat options
No new revisions were added by this update.
Summary of changes:
cpp/examples/arrow/dataset-parquet-scan-example.cc | 3 +-
cpp/src/arrow/dataset/dataset.cc | 60 ++++-
cpp/src/arrow/dataset/dataset.h | 28 ++-
cpp/src/arrow/dataset/dataset_test.cc | 29 +--
cpp/src/arrow/dataset/discovery_test.cc | 2 +-
cpp/src/arrow/dataset/file_base.cc | 2 +-
cpp/src/arrow/dataset/file_base.h | 2 +-
cpp/src/arrow/dataset/file_ipc.cc | 4 +-
cpp/src/arrow/dataset/file_ipc.h | 7 +-
cpp/src/arrow/dataset/file_parquet.cc | 242 +++++++++++++--------
cpp/src/arrow/dataset/file_parquet.h | 55 ++++-
cpp/src/arrow/dataset/file_parquet_test.cc | 91 +++++---
cpp/src/arrow/dataset/scanner.cc | 8 +
cpp/src/arrow/dataset/scanner.h | 11 +
cpp/src/arrow/dataset/scanner_test.cc | 25 ++-
cpp/src/arrow/dataset/test_util.h | 8 +-
cpp/src/arrow/status.cc | 5 +-
cpp/src/arrow/status.h | 1 +
cpp/src/arrow/testing/generator.cc | 9 +-
cpp/src/arrow/testing/generator.h | 11 +
cpp/src/arrow/type.h | 5 +-
cpp/src/parquet/arrow/arrow_reader_writer_test.cc | 6 +-
cpp/src/parquet/arrow/reader.h | 5 +-
cpp/src/parquet/file_reader.cc | 4 +-
cpp/src/parquet/file_reader.h | 5 +-
cpp/src/parquet/metadata.h | 5 +-
cpp/src/parquet/properties.h | 9 +-
python/pyarrow/_dataset.pyx | 78 ++++++-
python/pyarrow/dataset.py | 2 +-
python/pyarrow/includes/libarrow_dataset.pxd | 108 +++++----
python/pyarrow/tests/test_dataset.py | 23 +-
r/R/arrowExports.R | 8 +-
r/R/dataset.R | 29 ++-
r/src/arrowExports.cpp | 31 ++-
r/src/dataset.cpp | 24 +-
r/tests/testthat/test-dataset.R | 6 +
36 files changed, 655 insertions(+), 296 deletions(-)