People visit my site and I have an algorithm that gives a rating from 1 to 0. The higher the rating, the more likely it is that this person will buy something, but the rating is not a probability, and this may not be a linear dependence on probability of purchase.
I have a lot of data on what ratings I gave people in the past and whether these people really make a purchase.
Using this data about what happened to the estimates in the past, I want to be able to take the estimate and translate it into an appropriate probability based on this past data.
Any ideas?
edit: Several people offer bucketing, and I should have mentioned that I was considering this approach, but I'm sure there should be a way to do this "seamlessly." Some time ago I asked a question about another, but perhaps related issue here , I have a feeling that something like this might be applicable, but I'm not sure.
edit2: Let's say I told you that out of 100 customers with an account above 0.5, 12 of them were bought, and out of 25 customers with an account below 0.5, 2 of them were bought. What can I conclude, if at all, about the estimated probability of buying from someone with a score of 0.5?
source share