I am trying to use a random forest package for classification in R.
Variable values of variables are listed below:
- means the value of the raw value of the variable x for class 0
- means the value of the important value of the variable x for class 1
MeanDecreaseAccuracyMeanDecreaseGini
Now I know what they mean, since I know their definitions. I want to know how to use them.
What I really want to know is what these values mean only in the context of how accurate they are, what is a good value, what is a bad value, what are the maximum and minimum values, etc.
If a variable has a high MeanDecreaseAccuracy or MeanDecreaseGini , then is this important or unimportant? Any raw grade information may also be helpful. I want to know everything there is to know about these numbers that are relevant to their application.
An explanation that uses the words error, summation, or permutation would be less useful than a simpler explanation, in which there would be no discussion of how random forests work.
As if I wanted someone to explain to me how to use the radio, I would not expect the explanation to be related to how the radio converts radio waves into sound.
r statistics data-mining random-forest
thirsty93 Apr 10 '09 at 2:18 2009-04-10 02:18
source share