Hello,

After upgrading arrow from 5.0.0.2 to 6.0.0.2 in a Bioconductor 3.13 Docker container, I started seeing new errors when reading Parquet files that use snappy compression. I'm installing the prebuilt Linux binary by setting LIBARROW_BINARY=true during installation. Building arrow from the latest nightly source fixes the issue. Is it possible that the 6.0.0.2 prebuilt Linux binary does not have snappy compression support enabled? The error is copied below.
Error: NotImplemented: Support for codec 'snappy' not built
In order to read this file, you will need to reinstall arrow with additional features enabled.
Set one of these environment variables before installing:

 * LIBARROW_MINIMAL=false (for all optional features, including 'snappy')
 * ARROW_WITH_SNAPPY=ON (for just 'snappy')

See https://arrow.apache.org/docs/r/articles/install.html for details
Backtrace:
 1. popcycle::get.vct.by.file(db, vct_dir, "2018_176/2018-06-25T20-03-48+00-00") test_files.R:210:2
 4. arrow::read_parquet(...)
 5. base::tryCatch(reader$ReadTable(), error = read_compressed_error)
 6. base:::tryCatchList(expr, classes, parentenv, handlers)
 7. base:::tryCatchOne(expr, names, parentenv, handlers[[1L]])
 8. value[[3L]](cond)

Thanks,
Chris Berthiaume
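
P.S. In case it helps with reproducing this, the check below is a minimal sketch of how one might confirm whether a given arrow build includes snappy. It assumes the arrow R package is installed and loadable in the container. It only reports what the installed build supports and does not reinstall anything.

```r
# Sketch: report whether the installed arrow build supports the snappy codec.
library(arrow)

# TRUE when the underlying C++ library was built with snappy support.
has_snappy <- codec_is_available("snappy")
print(has_snappy)

# Named logical vector of optional features compiled into this build.
print(arrow_info()$capabilities)

if (!has_snappy) {
  # The two variables below are the ones named in the error message itself.
  message(
    "snappy is not available in this build; the error message suggests ",
    "setting LIBARROW_MINIMAL=false or ARROW_WITH_SNAPPY=ON before reinstalling."
  )
}
```

Running this in both the 5.0.0.2 and 6.0.0.2 containers should show whether the two prebuilt binaries differ in snappy support.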
