I am using RStudio 0.97.320 (R 2.15.3) on Amazon EC2. My data frame has 200k rows and 12 columns.
I am trying to reconcile logistic regression with approximately 1,500 parameters.
R uses 7% of the CPU and has 60 + GB of memory and still takes a lot of time.
Here is the code:
glm.1.2 <- glm(formula = Y ~ factor(X1) * log(X2) * (X3 + X4 * (X5 + I(X5^2)) * (X8 + I(X8^2)) + ((X6 + I(X6^2)) * factor(X7))), family = binomial(logit), data = df[1:150000,])
Any suggestions to expedite this by a significant amount?
source share