Suppose a user votes for some films on a scale of 1 to 5. These films have genre information, and there can be more than one genre in a film. Like this:
Movie A Rating 4 Action/Sci-Fi Movie B Rating 5 Comedy/Action Movie C Rating 4 Comedy/Drama
We want to know what genre our user likes. Here we have our result set:
Genre Movie_Count Average_Rating ---------- Action 2 5 Comedy 2 4.5 SciFi 1 4 Drama 1 4
Obviously, we cannot predict anything with such a small set of results, but suppose we have a larger data set.
Using this data, how can we sort the most popular genres of this user? Just calculate the weighted average or something more complicated?
source share