Initially, I had a data frame consisting of 12 columns in N rows. The last column is my class (0 or 1). I had to convert the entire entire data frame to a numeric number using
training <- sapply(training.temp,as.numeric)
But then I thought it was necessary for the class column to be a factor column in order to use the randomforest () tool as a classifier, so I did
training[,"Class"] <- factor(training[,ncol(training)])
I move on to creating a tree with
training_rf <- randomForest(Class ~., data = trainData, importance = TRUE, do.trace = 100)
But I get two errors:
1: In Ops.factor(training[, "Status"], factor(training[, ncol(training)])) : <= this is not relevant for factors (roughly translated) 2: In randomForest.default(m, y, ...) : The response has five or fewer unique values. Are you sure you want to do regression?
I would appreciate if someone could point out the formatting error I am making.
Thanks!
source share