Can you please open a JIRA issue?
On Wed, Jan 8, 2020 at 12:37 PM Christian Klar <[email protected]>
wrote:
> Hi,
>
>
>
> At the bottom please find the session_info.
>
>
>
> When creating north of 200-300 batches, the writing to the arrow file
> crashes R – it doesn’t even show an error message. Rstudio just aborts.
>
>
>
> I have the feeling that maybe each batch becomes a stream and R has issues
> with the connections, but that’s a total guess.
>
>
>
> Any help would be appreciated.
>
>
>
> ##
>
>
>
> Here is the function. When running it with 3000 it crashes immediately.
>
>
>
> Before that I ran it with 100, and then increased it slowly, and then it
> randomly crashed again.
>
>
>
>
> write_arrow_custom(data.frame(A=c(1:100000),B=c(1:100000)),'C:/Temp/test.arrow',3000)
>
>
>
> write_arrow_custom <- function(df,targetarrow,nrbatches) {
>
> ct <- nrbatches
>
> idxs <- c(0:ct)/ct*nrow(df)
>
> idxs <- round(idxs,0) %>% as.integer()
>
> idxs[length(idxs)] <- nrow(df)
>
> df_nav <- idxs %>% as.data.frame() %>% rename(colfrom=1) %>%
> mutate(colto=lead(colfrom)) %>% mutate(colfrom=colfrom+1) %>% filter(!
> is.na(colto)) %>% mutate(R=row_number())
>
> stopifnot(df_nav %>% mutate(chk=colto-colfrom+1) %>% '$'('chk') %>%
> sum()==nrow(df))
>
> table_df <- Table$create(name=rownames(df[1,]),df[1,])
>
> writer <- RecordBatchFileWriter$create(targetarrow,table_df$schema)
>
> df_nav %>% dlply(c('R'),function(df_nav){
>
> catl(glue('{df_nav$colfrom[1]}:{df_nav$colto[1]} / {df_nav$R[1]}...'))
>
> tmp <- df[df_nav$colfrom[1]:df_nav$colto[1],]
>
> writer$write_batch(record_batch(name = rownames(tmp), tmp))
>
> NULL
>
> }) -> batch_lst
>
> writer$close()
>
> rm(batch_lst)
>
> gc()
>
> }
>
>
>
>
>
> ##
>
>
>
>
>
>
>
> - Session info
> -----------------------------------------------------------------------------------------------------------------------------------------------------------
>
> setting value
>
> version R version 3.6.1 (2019-07-05)
>
> os Windows 10 x64
>
> system x86_64, mingw32
>
> ui RStudio
>
> language (EN)
>
> collate English_United States.1252
>
> ctype English_United States.1252
>
> tz America/New_York
>
> date 2020-01-08
>
>
>
> - Packages
> ---------------------------------------------------------------------------------------------------------------------------------------------------------------
>
> ! package * version date lib source
>
>
> acepack 1.4.1 2016-10-29 [1] CRAN (R
> 3.6.1)
>
> arrow * 0.15.1.1 2019-11-05 [1] CRAN (R 3.6.2)
>
>
> askpass 1.1 2019-01-13 [1] CRAN (R 3.6.1)
>
>
> assertthat 0.2.1 2019-03-21 [1] CRAN (R
> 3.6.1)
>
> backports 1.1.5 2019-10-02 [1] CRAN (R
> 3.6.1)
>
> base64enc 0.1-3 2015-07-28 [1] CRAN (R 3.6.0)
>
>
> bit 1.1-14 2018-05-29 [1] CRAN (R
> 3.6.0)
>
> bit64 0.9-7 2017-05-08 [1] CRAN (R
> 3.6.0)
>
> blob 1.2.0 2019-07-09 [1] CRAN (R 3.6.1)
>
>
> callr 3.3.1 2019-07-18 [1] CRAN (R
> 3.6.1)
>
> cellranger 1.1.0 2016-07-27 [1] CRAN (R
> 3.6.1)
>
> checkmate 1.9.4 2019-07-04 [1] CRAN (R 3.6.1)
>
>
> cli 1.1.0 2019-03-19 [1] CRAN (R
> 3.6.1)
>
> cluster 2.1.0 2019-06-19 [2] CRAN (R 3.6.1)
>
>
> codetools 0.2-16 2018-12-24 [2] CRAN (R
> 3.6.1)
>
> colorspace 1.4-1 2019-03-18 [1] CRAN (R
> 3.6.1)
>
> commonmark 1.7 2018-12-01 [1] CRAN (R 3.6.1)
>
>
> crayon 1.3.4 2017-09-16 [1] CRAN (R
> 3.6.1)
>
> credentials 1.1 2019-03-12 [1] CRAN (R
> 3.6.2)
>
> curl * 4.2 2019-09-24 [1] CRAN (R 3.6.1)
>
>
> data.table 1.12.2 2019-04-07 [1] CRAN (R
> 3.6.1)
>
> DBI * 1.0.0 2018-05-02 [1] CRAN (R
> 3.6.1)
>
> desc 1.2.0 2018-05-01 [1] CRAN (R 3.6.1)
>
>
> devtools * 2.2.0 2019-09-07 [1] CRAN (R
> 3.6.1)
>
> digest 0.6.23 2019-11-23 [1] CRAN (R
> 3.6.1)
>
> dplyr * 0.8.3 2019-07-04 [1] CRAN (R 3.6.1)
>
>
> DT 0.9 2019-09-17 [1] CRAN (R
> 3.6.1)
>
> ellipsis 0.3.0 2019-09-20 [1] CRAN (R
> 3.6.1)
>
> evaluate 0.14 2019-05-28 [1] CRAN (R 3.6.1)
>
>
> foreign 0.8-71 2018-07-20 [2] CRAN (R
> 3.6.1)
>
> Formula * 1.2-3 2018-05-03 [1] CRAN (R
> 3.6.0)
>
> fs 1.3.1 2019-05-06 [1] CRAN (R 3.6.1)
>
>
> fst * 0.9.0 2019-04-09 [1] CRAN (R
> 3.6.1)
>
> future * 1.15.0-9000 2019-11-19 [1] Github
> (HenrikBengtsson/future@bc241c7)
>
> ggplot2 * 3.2.1 2019-08-10 [1] CRAN (R 3.6.1)
>
>
> globals 0.12.4 2018-10-11 [1] CRAN (R
> 3.6.0)
>
> glue * 1.3.1 2019-03-12 [1] CRAN (R
> 3.6.1)
>
> gridExtra 2.3 2017-09-09 [1] CRAN (R 3.6.1)
>
>
> gt * 0.1.0 2019-11-27 [1] Github (rstudio/gt@284bbe5)
>
>
> gtable 0.3.0 2019-03-25 [1] CRAN (R
> 3.6.1)
>
> Hmisc * 4.3-0 2019-11-07 [1] CRAN (R 3.6.1)
>
>
> htmlTable 1.13.2 2019-09-22 [1] CRAN (R
> 3.6.1)
>
> D htmltools 0.3.6.9004 2019-09-20 [1] Github
> (rstudio/htmltools@c49b29c)
>
> htmlwidgets 1.3 2018-09-30 [1] CRAN (R
> 3.6.1)
>
> jsonlite * 1.6 2018-12-07 [1] CRAN (R
> 3.6.1)
>
> knitr 1.25 2019-09-18 [1] CRAN (R
> 3.6.1)
>
> lattice * 0.20-38 2018-11-04 [2] CRAN (R
> 3.6.1)
>
> latticeExtra 0.6-28 2016-02-09 [1] CRAN (R
> 3.6.1)
>
> lazyeval 0.2.2 2019-03-15 [1] CRAN (R
> 3.6.1)
>
> lifecycle 0.1.0 2019-08-01 [1] CRAN (R
> 3.6.1)
>
> listenv 0.7.0 2018-01-21 [1] CRAN (R
> 3.6.1)
>
> lubridate * 1.7.4 2018-04-11 [1] CRAN (R
> 3.6.1)
>
> magrittr * 1.5 2014-11-22 [1] CRAN (R
> 3.6.1)
>
> Matrix 1.2-17 2019-03-22 [2] CRAN (R
> 3.6.1)
>
> memoise 1.1.0 2017-04-21 [1] CRAN (R
> 3.6.1)
>
> munsell 0.5.0 2018-06-12 [1] CRAN (R
> 3.6.1)
>
> nnet 7.3-12 2016-02-02 [2] CRAN (R
> 3.6.1)
>
> openssl 1.4.1 2019-07-18 [1] CRAN (R
> 3.6.1)
>
> outliers * 0.14 2011-01-24 [1] CRAN (R
> 3.6.0)
>
> pillar 1.4.2 2019-06-29 [1] CRAN (R
> 3.6.1)
>
> pkgbuild 1.0.5 2019-08-26 [1] CRAN (R
> 3.6.1)
>
> pkgconfig 2.0.2 2018-08-16 [1] CRAN (R
> 3.6.1)
>
> pkgload 1.0.2 2018-10-29 [1] CRAN (R
> 3.6.1)
>
> plyr * 1.8.4 2016-06-08 [1] CRAN (R
> 3.6.1)
>
> prettyunits 1.0.2 2015-07-13 [1] CRAN (R
> 3.6.1)
>
> processx 3.4.1 2019-07-18 [1] CRAN (R
> 3.6.1)
>
> pryr * 0.1.4 2018-02-18 [1] CRAN (R
> 3.6.1)
>
> ps 1.3.0 2018-12-21 [1] CRAN (R
> 3.6.1)
>
> purrr * 0.3.2 2019-03-15 [1] CRAN (R
> 3.6.1)
>
> R6 * 2.4.1 2019-11-12 [1] CRAN (R
> 3.6.1)
>
> RColorBrewer 1.1-2 2014-12-07 [1] CRAN (R
> 3.6.0)
>
> Rcpp 1.0.3 2019-11-08 [1] CRAN (R
> 3.6.1)
>
> readxl * 1.3.1 2019-03-13 [1] CRAN (R
> 3.6.1)
>
> remotes 2.1.0 2019-06-24 [1] CRAN (R
> 3.6.1)
>
> rlang * 0.4.2 2019-11-23 [1] CRAN (R
> 3.6.1)
>
> rmarkdown * 2.0.3 2019-12-19 [1] Github
> (rstudio/rmarkdown@26cc3b1)
>
> RODBC * 1.3-16 2019-09-03 [1] CRAN (R
> 3.6.1)
>
> roxygen2 * 6.1.1 2018-11-07 [1] CRAN (R
> 3.6.1)
>
> rpart 4.1-15 2019-04-12 [2] CRAN (R
> 3.6.1)
>
> rprojroot 1.3-2 2018-01-03 [1] CRAN (R
> 3.6.1)
>
> RSQLite * 2.1.2 2019-07-24 [1] CRAN (R
> 3.6.1)
>
> rstudioapi 0.10 2019-03-19 [1] CRAN (R
> 3.6.1)
>
> scales 1.0.0 2018-08-09 [1] CRAN (R
> 3.6.1)
>
> sessioninfo 1.1.1 2018-11-05 [1] CRAN (R
> 3.6.1)
>
> slide * 0.0.0.9002 2019-11-27 [1] Github
> (DavisVaughan/slide@92e8e02)
>
> ssh 0.6 2019-04-09 [1] CRAN (R
> 3.6.2)
>
> stringi 1.4.3 2019-03-12 [1] CRAN (R
> 3.6.0)
>
> stringr * 1.4.0 2019-02-10 [1] CRAN (R
> 3.6.1)
>
> survival * 2.44-1.1 2019-04-01 [2] CRAN (R
> 3.6.1)
>
> testthat 2.2.1 2019-07-25 [1] CRAN (R
> 3.6.1)
>
> tibble 2.1.3 2019-06-06 [1] CRAN (R
> 3.6.1)
>
> tidyr * 1.0.0 2019-09-11 [1] CRAN (R
> 3.6.1)
>
> tidyselect 0.2.5 2018-10-11 [1] CRAN (R
> 3.6.1)
>
> usethis * 1.5.1 2019-07-04 [1] CRAN (R
> 3.6.1)
>
> varhandle * 2.0.3 2018-07-04 [1] CRAN (R
> 3.6.0)
>
> vctrs 0.2.0.9007 2019-11-27 [1] Github (r-lib/vctrs@945809e)
>
>
> withr 2.1.2 2018-03-15 [1] CRAN (R
> 3.6.1)
>
> xfun 0.9 2019-08-21 [1] CRAN (R
> 3.6.1)
>
> xml2 * 1.2.2 2019-08-09 [1] CRAN (R
> 3.6.1)
>
> xts * 0.11-2 2018-11-05 [1] CRAN (R
> 3.6.1)
>
> zoo * 1.8-6 2019-05-28 [1] CRAN (R
> 3.6.1)
>
>
>
> [1] C:/Users/cklar/Desktop/R packages
>
> [2] C:/Program Files/R/R-3.6.1/library
>
>
>
> P -- Loaded and on-disk path mismatch.
>
> D -- DLL MD5 mismatch, broken installation.
>
>
>
> Christian Klar
> TFG Asset Management
> Tetragon Financial Management
> 399 Park Avenue, 22nd Floor | New York, NY 10022 | United States
> Direct: +1 212 359 7369 | Main: +1 212 359 7300 | Mobile: +1 607 216 5045
> [email protected]
> www.tetragoninv.com
>
> This communication and all or some of the information contained therein
> may be confidential. If you have received this communication in error,
> please destroy all electronic and paper copies and notify the sender
> immediately. Unless specifically indicated, this communication is not a
> confirmation, an offer to sell or solicitation of any offer to buy any
> financial product, or an official statement of Tetragon Financial Group or
> its affiliates. TFG Asset Management L.P. and Tetragon Financial Management
> LP are registered as investment advisers under the U.S. Investment Advisers
> Act of 1940.
>
>
>
>