sonomatechDS commented on issue #15056: URL: https://github.com/apache/arrow/issues/15056#issuecomment-1378040428
@boshek I re-wrote my full dataset to .csv and am still getting the same `stack imbalance` then `segfault` error as before. **It seems that there's a size limitation when combined with idle time.** The full version of this app displays data from 2016-2022 (300-400k rows per site), compared to only 2022 in the reprex (5-10k rows). I am able to recreate the error by altering the reprex app.R code to simulate a larger dataset (see below). As before, when updating the app constantly, there is no issue. When left idle for 1+ minutes, the app crashes as before. **Note: This is after adjusting the dataset to .csv format from .arrow format** Reprex updates: - Remove the filter on aqs_sitecode in the `plot_data` reactive so that each time a new site is chosen, the whole dataset is loaded (300k rows) - Add back the data subset in the call to `ggplot()` ``` plot_data <- reactive({ req(selected_site$sitecode) s <- as.integer(selected_site$sitecode) ds %>% filter(#aqs_sitecode == s, #### Remove this filter ... load all 300k rows on each update (simulates larger data load) parameter == 'Ozone', sample_duration == '1 HOUR', poc == 1) %>% select(aqs_sitecode, date_time2, sample_measurement) %>% collect() }) output$TS <- renderPlot({ req(selected_site$sitecode, is.data.frame(plot_data())) #### Add call to subset() in ggplot() ggplot(subset(plot_data(), aqs_sitecode == selected_site$sitecode)) + geom_line(aes(date_time2, sample_measurement)) + scale_x_datetime() + labs(x = 'DateTime', y = 'Ozone in ppb', main = selected_site$sitecode) }) ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org