I have a list of ordered lists, most of which contain the same elements. I want to find the most likely order of items from lists (samples).
Example:
l1={ a, b, f, h, z } l2={ c, e, h, x, z } l3={ a, e, y, z } l4={ b, e, f, z }
The result should be:
R={a, b, c, e, f, h, x, y, z}; or R={ a,b,c,e,f,h,y,x,z }
Elements have no information regarding their natural order. The order must be learned from the lists, and in some cases the order in the list may contradict other lists, so I need the most likely order. I have about 175,000 lists, about 1.8 million items (total, 260 thousand unique), the number of items in the list varies.
I already tried to build a directed graph, where the edges have the number of lists that connect the vertices in that order, and then went through all the paths to find the most likely sequence. This approach is well suited for small problems, but it is too complex to solve this problem.
Any pointers please be greatly appreciated.
Thanks.
Juan