mrverhoeven commented on issue #29896: URL: https://github.com/apache/arrow/issues/29896#issuecomment-2721999332
Still reproducible on 17.0.0.1. A useful workaround for anyone who finds themselves here is to either mutate that column type or specify it in the filter call: ``` open_dataset(tf, schema = tf_reg) %>% + mutate(stations = as.character(stations)) %>% + filter(stations == "41") %>% + collect() # A tibble: 12 × 5 lat long depth mag stations <dbl> <dbl> <int> <dbl> <chr> 1 -20.4 182. 562 4.8 41 2 -23.4 180 553 4.7 41 3 -25.7 180. 434 4.6 41 4 -32.2 180. 422 4.6 41 5 -18.1 182. 618 4.6 41 6 -25.0 180. 470 4.8 41 7 -34.4 180. 201 4.4 41 8 -34.1 182. 75 4.7 41 9 -23.5 180. 539 5 41 10 -16.0 186. 143 4.6 41 11 -19.0 169. 242 5 41 12 -13.8 167. 68 4.7 41 open_dataset(tf, schema = tf_reg) %>% + filter(as.character(stations) == "41") %>% + collect() # A tibble: 12 × 5 lat long depth mag stations <dbl> <dbl> <int> <dbl> <chr> 1 -20.4 182. 562 4.8 41 2 -23.4 180 553 4.7 41 3 -25.7 180. 434 4.6 41 4 -32.2 180. 422 4.6 41 5 -18.1 182. 618 4.6 41 6 -25.0 180. 470 4.8 41 7 -34.4 180. 201 4.4 41 8 -34.1 182. 75 4.7 41 9 -23.5 180. 539 5 41 10 -16.0 186. 143 4.6 41 11 -19.0 169. 242 5 41 12 -13.8 167. 68 4.7 41 > > packageVersion("arrow") [1] ‘17.0.0.1’ ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org