Neal Richardson created ARROW-9609: -------------------------------------- Summary: [C++] CSV datasets don't materialize virtual columns Key: ARROW-9609 URL: https://issues.apache.org/jira/browse/ARROW-9609 Project: Apache Arrow Issue Type: Bug Components: C++ Reporter: Neal Richardson Assignee: Ben Kietzman Fix For: 2.0.0, 1.0.1
They're included but the values are all null. Minimal reproducer in R: {code:r} library(arrow) dir.create("testds/c=5", recursive = TRUE) cat("a,b\n1,2\n", file = "testds/c=5/test.csv") ds <- open_dataset("testds", format = "csv") ds ## FileSystemDataset with 1 csv file ## a: int64 ## b: int64 ## c: int32 dplyr::collect(ds) ## # A tibble: 1 x 3 ## a b c ## <int> <int> <int> ## 1 1 2 NA {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)