sonomatechDS commented on issue #15056:
URL: https://github.com/apache/arrow/issues/15056#issuecomment-1378040428

   @boshek I re-wrote my full dataset to .csv and am still getting the same 
`stack imbalance`  then `segfault` error as before. 
   
   **It seems that there's a size limitation when combined with idle time.** 
The full version of this app displays data from 2016-2022 (300-400k rows per 
site), compared to only 2022 in the reprex (5-10k rows).
   
   I am able to recreate the error by altering the reprex app.R code to 
simulate a larger dataset (see below). As before, when updating the app 
constantly, there is no issue. When left idle for 1+ minutes, the app crashes 
as before.
   
   **Note: This is after adjusting the dataset to .csv format from .arrow 
format**
   
   Reprex updates:
   
   - Remove the filter on aqs_sitecode in the `plot_data` reactive so that each 
time a new site is chosen, the whole dataset is loaded (300k rows)
   - Add back the data subset in the call to `ggplot()`
   
   ```
     plot_data <- reactive({
       req(selected_site$sitecode)
       
       s <- as.integer(selected_site$sitecode)
       
       ds %>%
          filter(#aqs_sitecode == s, #### Remove this filter ... load all 300k 
rows on each update (simulates larger data load)
                parameter == 'Ozone',
                sample_duration == '1 HOUR',
                poc == 1) %>%
         select(aqs_sitecode, date_time2, sample_measurement) %>%
         collect()
     })
     
     output$TS <- renderPlot({
       req(selected_site$sitecode, 
           is.data.frame(plot_data()))
       
      #### Add call to subset() in ggplot()
       ggplot(subset(plot_data(), aqs_sitecode == selected_site$sitecode)) +
         geom_line(aes(date_time2, sample_measurement)) +
         scale_x_datetime() +
         labs(x = 'DateTime', y = 'Ozone in ppb', main = selected_site$sitecode)
     })
     ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to