I have users and resources. Each resource is described by a set of functions, and each user is associated with a different set of resources. In my particular case, the resources are web pages, as well as information about the functions of the location of the visit, time of visit, number of visits, etc., which each time are associated with a specific user.
I want to get an assessment of the similarity between my users regarding these features, but I cannot find a way to combine the components of the resource. I did this with text functions, since you can add documents together and then extract functions (say TF-IDF), but I don't know how to continue this configuration.
To be as clear as possible, here is what I have:
>>> len(user_features) 13
I can get a document similarity matrix using, for example, Euclidean distances:
>>> euclidean_distance(user_features[0], user_features[0])
But I do not know how to compare users with each other. I have to somehow combine the functions together to get the N_Users X N_Features , but I don't know how to do this.
Any clues on how to proceed?
Additional information about the functions that I use:
The functions that I have are not fully fixed. What I have so far is 13 different functions that are already combined from the “views”. I have standard deviation, mean, etc. For each species, in order to have something “flat” in order to be able to compare them. One of the features that I have is: has the location changed since the last time it was viewed? But what about an hour ago? Two hours ago?