Overlap joins in R: a speed comparison with packages sqldf
and data.table
In a perfect data world the tables you need to join have common IDs. In this setting, in R, you might use the merge function from the base package or the speedy and useful join functions from the dplyr package. But sometimes your join is a little more complicated. Say, for example, that you have […]
Continue reading →