Collaborative filtering: non-personalized item-to-item similarity

Question

I am trying to compute item-to-item similarity along the lines of Amazon's "Customers who viewed/purchased X also viewed/purchased Y and Z". All the examples and references I have seen deal with computing item similarity for ranked items, finding user-user similarity, or finding recommended items based on the current user's history. I would like to start with a non-targeted approach before factoring in the current user's preferences.

Looking at Amazon.com's recommendation white paper, they use the following logic for offline item-to-item similarity:

For each item in product catalog, I1 
  For each customer C who purchased I1
    For each item I2 purchased by customer C
       Record that a customer purchased I1 and I2
  For each item I2 
    Compute the similarity between I1 and I2
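
The recording loops above can be sketched in Python roughly as follows. This is only an illustration under assumptions: the `purchases` mapping, the customer IDs, and the function name `co_purchases` are hypothetical and not taken from the paper, and the similarity step itself is left out.

```python
from collections import defaultdict

# Hypothetical sample data: customer -> set of purchased items.
purchases = {
    "c1": {"book", "lamp"},
    "c2": {"book", "lamp", "desk"},
    "c3": {"book", "desk"},
    "c4": {"lamp"},
}

def co_purchases(purchases):
    """For each item I1, count how many customers bought it together with each other item I2."""
    # Invert the data: item -> customers who bought it.
    buyers = defaultdict(set)
    for customer, items in purchases.items():
        for item in items:
            buyers[item].add(customer)

    co = defaultdict(lambda: defaultdict(int))
    for i1, customers in buyers.items():      # for each item I1 in the catalog
        for customer in customers:            # for each customer C who purchased I1
            for i2 in purchases[customer]:    # for each item I2 purchased by C
                if i2 != i1:
                    co[i1][i2] += 1           # record that a customer purchased I1 and I2
    return {i1: dict(row) for i1, row in co.items()}

co = co_purchases(purchases)
```

After this runs, `co[i1]` holds the list of I2 items bought together with that I1, along with how many customers bought both.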

If I understand correctly, by the time I reach "Compute the similarity between I1 and I2", I have a list of items (I2) purchased in combination with a single value of I1 (the outer loop).

How is this calculation done?

Another thought is that I am overthinking this and making it harder than it needs to be. Would it be enough to run a top-n query on the count of I2 purchased in combination with I1?
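
A minimal sketch of that top-n idea, assuming the purchase history fits in memory as a customer-to-items mapping (the data and the name `top_n_copurchased` are hypothetical). Note that ranking by raw counts tends to favor items that are popular overall, which is one reason normalized similarity measures are often used instead.

```python
from collections import Counter

# Hypothetical sample data: customer -> set of purchased items.
purchases = {
    "c1": {"book", "lamp"},
    "c2": {"book", "lamp", "desk"},
    "c3": {"book", "desk"},
    "c4": {"lamp"},
}

def top_n_copurchased(purchases, item, n=2):
    """Return the n items most often bought by customers who also bought `item`."""
    counts = Counter()
    for items in purchases.values():
        if item in items:
            counts.update(items - {item})  # count every other item in this basket
    return counts.most_common(n)

top_for_desk = top_n_copurchased(purchases, "desk", n=2)
```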

I would also appreciate suggestions as to whether this approach is correct. My product database contains about 150,000 products at any time. Since the bulk of the reading material I have seen covers user-item or even user-user similarity, should I be going that route instead?

edit: Python, db, Oracle PL/SQL.

+3

python algorithm recommendation-engine collaborative-filtering similarity

Neil Kodner 05 . '10 22:14

3

User-item purchase matrix (M users, N items):

        Item1  Item2 ... ItemN
 User1  0        1   ...  0
 User2  1        1   ...  0 
  .
  .
  .
 UserM  1        0   ...  0

Item-item matrix:

        Item1  Item2 ... ItemN
 Item1  1       1/M  ...  0
 Item2  1/M     1    ...  0 
  .
  .
  .
 ItemN  0       0    ...  1
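
One way to get from a 0/1 user-item matrix to an item-item matrix like the one above is to count, for every pair of items, the users who bought both, and divide by the number of users M. The sketch below assumes that reading of the table (off-diagonal entries as co-purchase count over M, diagonal fixed at 1); the three-user matrix `A` is hypothetical sample data.

```python
# Hypothetical 0/1 user-item matrix: rows are users, columns are items.
A = [
    [0, 1, 0],  # User1
    [1, 1, 0],  # User2
    [1, 0, 0],  # User3
]

def item_cooccurrence(A):
    """Return C = A^T A, where C[i][j] is the number of users who bought both item i and item j."""
    n_items = len(A[0])
    C = [[0] * n_items for _ in range(n_items)]
    for row in A:
        bought = [j for j, value in enumerate(row) if value]
        for i in bought:
            for j in bought:
                C[i][j] += 1
    return C

C = item_cooccurrence(A)
M = len(A)

# Assumed normalization: share of all users who bought both items,
# with the diagonal set to 1 as in the table.
S = [
    [1.0 if i == j else C[i][j] / M for j in range(len(C))]
    for i in range(len(C))
]
```

With 150,000 products the full N x N matrix is mostly zeros, so a sparse representation (for example a dict keyed by item pairs) would probably be more practical than nested lists.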

"Customers who viewed/bought X also viewed/bought Y, Z, ..." (Collaborative Filtering).


+4

leef 12 '12 6:11

@Neil: the similarity between I1 and I2 can be computed with a measure such as Jaccard or cos(I1, I2).
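
For 0/1 purchase data, both Jaccard and cos(I1, I2) can be written in terms of the sets of customers who bought each item. A minimal sketch, with hypothetical customer IDs and function names:

```python
import math

def jaccard(buyers1, buyers2):
    """|B1 ∩ B2| / |B1 ∪ B2| for two sets of customers."""
    union = buyers1 | buyers2
    return len(buyers1 & buyers2) / len(union) if union else 0.0

def cosine(buyers1, buyers2):
    """Cosine of two 0/1 purchase vectors: |B1 ∩ B2| / sqrt(|B1| * |B2|)."""
    if not buyers1 or not buyers2:
        return 0.0
    return len(buyers1 & buyers2) / math.sqrt(len(buyers1) * len(buyers2))

# Hypothetical buyer sets for two items.
book_buyers = {"c1", "c2", "c3"}
lamp_buyers = {"c1", "c2", "c4"}

jaccard_sim = jaccard(book_buyers, lamp_buyers)  # 2 shared buyers out of 4 distinct
cosine_sim = cosine(book_buyers, lamp_buyers)    # 2 / sqrt(3 * 3)
```

Both measures divide the co-purchase count by something that grows with the items' overall popularity, so a best-seller does not automatically look similar to everything.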

+2

isomorphismes 14 . '11 5:41

Tom · Accepted Answer · 2010-03-05T22:32:00+0000

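There is an O'Reilly book on this subject. The underlying question has the form: "If customer A bought X, how likely is it that they also bought Z?"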
