I want to call a function inside data.table that calculates a set of summary statistics as follows:
summ.stats <- function(vec) { list( Min = min(vec), Mean = mean(vec), SD = sd(vec), Median = median(vec), Max = max(vec)) }
and I want to name it in j data.table :
DT <- data.table(a=c(1,2,3,1,2,3),b=c(1,4,3,2,1,4),c=c(2,3,4,5,2,1)) DT[, summ.stats(b), by=a]
This is normal and I get:
a Min Mean SD Median Max 1: 1 1 1.5 0.7071068 1.5 2 2: 2 1 2.5 2.1213203 2.5 4 3: 3 3 3.5 0.7071068 3.5 4
But I'm interested in passing a few variables to summ.stats. For instance:
DT[, summ.stats(b, c), by=a]
I want to get something like:
a Var Min Mean SD Median Max 1: 1 b 1 1.5 0.7071068 1.5 2 2: 2 b 1 2.5 2.1213203 2.5 4 3: 3 b 3 3.5 0.7071068 3.5 4 4: 1 c 2 3.5 2.1213203 3.5 5 5: 2 c 2 2.5 0.7071068 2.5 3 6: 3 c 1 2.5 2.1213203 2.5 4
What is the best way to do this?