boshek commented on issue #15056: URL: https://github.com/apache/arrow/issues/15056#issuecomment-1376532373
I can reproduce this issue in RStudio Connect. This is the error that I see (which is different that what you are reporting): <details> <summary>log</summary> ``` 2023/01/09 23:47:44.920714517 [1] 3785 2023/01/09 23:47:45.194163992 [1] 3785 2023/01/09 23:47:45.197808737 [1] "pinged" 2023/01/09 23:48:15.466341119 [1] 3785 2023/01/09 23:48:45.750669396 [1] 3785 2023/01/09 23:49:16.046789291 [1] 3785 2023/01/09 23:49:46.366873116 [1] 3785 2023/01/09 23:50:16.639291193 [1] 3785 2023/01/09 23:50:46.902676880 [1] 3785 2023/01/09 23:51:17.211774091 [1] 3785 2023/01/09 23:51:20.593104281 Warning: Error in collect: ignoring SIGPIPE signal 2023/01/09 23:51:20.606683408 212: <Anonymous> 2023/01/09 23:51:20.606694399 211: signalCondition 2023/01/09 23:51:20.606724639 210: signal_abort 2023/01/09 23:51:20.606743769 209: abort 2023/01/09 23:51:20.606782270 208: augment_io_error_msg 2023/01/09 23:51:20.606783790 207: value[[3L]] 2023/01/09 23:51:20.606797460 206: tryCatchOne 2023/01/09 23:51:20.606798670 205: tryCatchList 2023/01/09 23:51:20.606891601 204: tryCatch 2023/01/09 23:51:20.606893811 203: collect.arrow_dplyr_query 2023/01/09 23:51:20.606907001 202: collect 2023/01/09 23:51:20.606907791 201: %>% 2023/01/09 23:51:20.606914991 200: <reactive:plot_data> [/opt/rstudio-connect/mnt/app/app.R#50] 2023/01/09 23:51:20.606927181 198: .func 2023/01/09 23:51:20.606945612 195: contextFunc 2023/01/09 23:51:20.606947502 194: env$runWith 2023/01/09 23:51:20.606972372 187: ctx$run 2023/01/09 23:51:20.606973622 186: self$.updateValue 2023/01/09 23:51:20.606983322 184: plot_data 2023/01/09 23:51:20.606984412 182: renderPlot [/opt/rstudio-connect/mnt/app/app.R#62] 2023/01/09 23:51:20.606993812 180: func 2023/01/09 23:51:20.606995122 140: drawPlot 2023/01/09 23:51:20.607005132 126: <reactive:plotObj> 2023/01/09 23:51:20.607006152 110: drawReactive 2023/01/09 23:51:20.607019292 97: renderFunc 2023/01/09 23:51:20.607022683 96: output$TS 2023/01/09 23:51:20.607032143 15: <Anonymous> 2023/01/09 23:51:20.607033293 13: fn 2023/01/09 23:51:20.607046283 8: retry 2023/01/09 23:51:20.607047413 7: connect$retryingStartServer 2023/01/09 23:51:20.607056303 6: eval 2023/01/09 23:51:20.607058203 5: eval 2023/01/09 23:51:20.607072323 4: eval 2023/01/09 23:51:20.607073053 3: eval 2023/01/09 23:51:20.607080343 2: eval.parent 2023/01/09 23:51:20.607081113 1: local ``` </details> I tried a few troubleshooting steps including: - moving the `collect` call around to see where exactly the issue was. the only thing that "worked" was to pull all the data in right away which obviously isn't helpful - tried using parquet and csv. - Using csv seems to resolve this issue though that does not solve the actual thing you wanted. Two additional questions: - does write the partitioned data as csv and using them as csvs work? - have you tried this using a linux machine? I don't have access to the linux machine where I can deploy this. it is very strange that this works locally but not in RStudio Connect. Trying this on linux would be a useful point on information. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
