Re: [datatable-help] "ungrouping" a data.table

Bacou, Melanie Mon, 11 Jan 2016 19:22:43 -0800

Assuming all list columns are of the same lengths, then this might be alittle faster?


dtUngroup <- function(dt) {
  l <- names(dt)[dt[, lapply(.SD, class)]=="list"]
  if (length(l)>0) {
    nl <- setdiff(names(dt), l)
    t <- sapply(dt[[l[1]]], length)
    tmp <- dt[, .SD, .SDcols=nl]
    tmp <- tmp[, lapply(.SD, rep, times=t)]
    tmp <- cbind(tmp, dt[, lapply(.SD, unlist), .SDcols=l])
  } else tmp <- dt
  return(tmp)
}


# If you need to cast columns back to POSIXct, might be easier as follows:
res <- dtUngroup(DT)
res[, c := as.POSIXct(c, origin="1970-01-01")]

On 1/8/2016 9:23 AM, statquant3 wrote:

dtUngroup <- function(DT){
        colClasses <- lapply(DT,FUN=class)
        listCols <- which(colClasses=='list')
        if(length(listCols)>0){
                nonListCols <- setdiff(colnames(DT),listCols)
                nbListElem <- 
unlist(DT[,lapply(.SD,FUN=lengths),.SDcols=(listCols[1L])])
                DT1 <- 
DT[,lapply(.SD,FUN=rep,times=(nbListElem)),.SDcols=(nonListCols)]
                DT1[,(listCols):=DT[,lapply(.SD,FUN=function(x)
do.call('c',x)),.SDcols=(listCols)]]
                return(DT1)
        }
        return(DT)
}

This works... still 20x slower than the equivalent in kdb



--
View this message in context: 
http://r.789695.n4.nabble.com/ungrouping-a-data-table-tp4716265p4716269.html
Sent from the datatable-help mailing list archive at Nabble.com.
_______________________________________________
datatable-help mailing list
[email protected]
https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/datatable-help


_______________________________________________
datatable-help mailing list
[email protected]
https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/datatable-help

Re: [datatable-help] "ungrouping" a data.table

Reply via email to