Let the big data race with microbenchmark , just for fun!
create large data frames
dat1 <- data.frame(x1 = rep(c("A","B","C","A"), 1000), stringsAsFactors = FALSE) dat2 <- data.frame(x1 = rep(c("A","B","C", "D"), 1000), x2 = runif(1,0), stringsAsFactors = FALSE)
on your stamps, set set, GO!
library(microbenchmark) mbm <- microbenchmark( left_join = left_join(dat1, dat2, by="x1"), merge = merge(dat1, dat2, by = "x1"), times = 20 )
Many, many seconds later .... left_join is faster than MUCH for large data frames.

source share