Ugh. This algorithm is incorrect; there is no proof because it is wrong, and therefore it is impossible to prove it right. ;) I'm leaving it here because I'm too attached to it to delete it completely, and it is a good demonstration of why you should formally prove your algorithms instead of saying "it looks right, there's no way it could fail!"
I'm posting this solution without a proof, for now. I have a sketch of one, but I'm having trouble proving the optimal substructure for this problem. Anyway...
Problem
Given a square array of numbers, select as many "non-overlapping" numbers as possible such that the sum of the selected numbers is maximal. "Non-overlapping" means that no two selected numbers can come from the same row or the same column.
Algorithm
- Let A be a square array of n by n numbers.
- Let Aij denote the element of A in the i-th row and j-th column.
- Let S( i1:i2 , j1:j2 ) denote the optimal sum of non-overlapping numbers for the square subarray of A formed by the intersection of rows i1 through i2 and columns j1 through j2.
Then the optimal sum of non-overlapping numbers for the whole array is denoted by S( 1:n , 1:n ) and is defined as follows:
    S( 1:n , 1:n ) = max of:
        S( 2:n   , 2:n   ) + A11
        S( 2:n   , 1:n-1 ) + A1n
        S( 1:n-1 , 2:n   ) + An1
        S( 1:n-1 , 1:n-1 ) + Ann

(applied recursively). Note that S( i:i , j:j ) is simply Aij.
That is, the optimal sum for a square array of size n can be determined by computing the optimal sum for each of the four corner subarrays of size n-1, and then taking the maximum, over those four choices, of the subarray's sum plus the single corner element that was left out.
    S for   |# # # #|
            |# # # #|
            |# # # #|
            |# # # #|

    is the best of the sums S for:

    |#      |    |      #|    |# # #  |    |  # # #|
    |  # # #|    |# # #  |    |# # #  |    |  # # #|
    |  # # #|    |# # #  |    |# # #  |    |  # # #|
    |  # # #|    |# # #  |    |      #|    |#      |
Implementation
The recurrence above translates directly into a recursive solution:
    def S(A, i1, i2, j1, j2):
        # Base case: a 1x1 subarray is just its single element.
        if (i1 == i2) and (j1 == j2):
            return A[i1][j1]
        else:
            return max(
                S(A, i1+1, i2, j1+1, j2) + A[i1][j1],
                S(A, i1+1, i2, j1, j2-1) + A[i1][j2],
                S(A, i1, i2-1, j1+1, j2) + A[i2][j1],
                S(A, i1, i2-1, j1, j2-1) + A[i2][j2],
            )
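With 0-based indices, the whole array is S(A, 0, n-1, 0, n-1). A tiny example of the kind of call involved (the numbers here are just ones I made up):

```python
A = [[1, 2],
     [4, 8]]

# Best non-overlapping pair is 1 + 8 = 9 (beats 2 + 4 = 6).
print(S(A, 0, 1, 0, 1))  # -> 9
```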
Note that this makes O(4^n) calls to S()!! That is much better than the O(n!) time complexity of the brute-force solution, but it is still terrible performance.
It is important to note that many of those calls repeat the same parameters. For example, when solving a 3x3 array, each 2x2 subarray is solved many times over.
This suggests two possible ways to speed things up:

- Make the recursive function S() cache its results, so that for each (i1, i2, j1, j2) it only has to compute S(A, i1, i2, j1, j2) once. S() then only needs to calculate O(n^3) results; all other calls are answered from the cache. (This is called memoising; see the sketch just after this list.)
- Instead of starting at the top with the full n*n array and working down to smaller and smaller subproblems, start at the bottom with the smallest possible subproblems and build up to the n*n problem. This is called dynamic programming. It is also O(n^3), but it is noticeably faster than the memoised version because you do not pay the overhead of recursion and cache lookups on every call.
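Here is roughly what the memoised version might look like in Python (my own sketch, not code from the original; the names S_memo and go are invented, and it assumes 0-based indexing):

```python
from functools import lru_cache

def S_memo(A):
    # Top-down recursion with a cache keyed by (i1, i2, j1, j2),
    # so each square subarray is only solved once.
    @lru_cache(maxsize=None)
    def go(i1, i2, j1, j2):
        if i1 == i2 and j1 == j2:
            return A[i1][j1]
        return max(
            go(i1+1, i2, j1+1, j2) + A[i1][j1],
            go(i1+1, i2, j1, j2-1) + A[i1][j2],
            go(i1, i2-1, j1+1, j2) + A[i2][j1],
            go(i1, i2-1, j1, j2-1) + A[i2][j2],
        )

    n = len(A)
    return go(0, n-1, 0, n-1)
```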
The dynamic programming solution has the following form:
- Find optimal solutions for all 1x1 subarrays. (Trivial.)
- Find optimal solutions for all 2x2 subarrays.
- Find optimal solutions for all 3x3 subarrays.
- ...
- Find optimal solutions for all (n-1)x(n-1) subarrays.
- Find the optimal solution for the complete n x n array.
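A bottom-up version along those lines might look something like this (again just a sketch under the same assumptions; the name S_dp is invented):

```python
def S_dp(A):
    n = len(A)
    # best[i][j] holds the optimal sum for the current k x k subarray
    # whose top-left corner is at (i, j); start with k = 1.
    best = [[A[i][j] for j in range(n)] for i in range(n)]
    for k in range(2, n + 1):
        new = [[0] * (n - k + 1) for _ in range(n - k + 1)]
        for i in range(n - k + 1):
            for j in range(n - k + 1):
                new[i][j] = max(
                    best[i+1][j+1] + A[i][j],          # keep the top-left element
                    best[i+1][j]   + A[i][j+k-1],      # keep the top-right element
                    best[i][j+1]   + A[i+k-1][j],      # keep the bottom-left element
                    best[i][j]     + A[i+k-1][j+k-1],  # keep the bottom-right element
                )
        best = new
    return best[0][0]
```

Each pass fills in all the k x k subarrays from the (k-1) x (k-1) results, which is the same O(n^3) amount of work as the memoised version but with plain loops instead of recursive calls.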
Notes on this solution:
- There is no proof yet. I'm working on it.
- You will notice that the algorithm above only gives you S(), the optimal sum. It does not tell you which numbers actually make up that sum. You can add your own bookkeeping to trace the path back to the chosen numbers (one possible way is sketched after these notes).
- The algorithm above does not guarantee that a tie such as 2,2 vs. 1,3 will be broken in favor of making all the individual numbers as large as possible (so that 2,2 wins). You can define max() to break ties in favor of the largest possible numbers, and I believe that will do what you want, but I cannot prove it.
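For the second note, here is one rough way the bookkeeping could be done (entirely my own sketch, not part of the original answer; it mirrors the naive recursion, so in practice it would need the same memoising or bottom-up treatment):

```python
def S_with_cells(A, i1, i2, j1, j2):
    # Returns (best_sum, list_of_chosen_cells) for the given square subarray.
    if i1 == i2 and j1 == j2:
        return A[i1][j1], [(i1, j1)]
    options = [
        (S_with_cells(A, i1+1, i2, j1+1, j2), (i1, j1)),  # keep top-left
        (S_with_cells(A, i1+1, i2, j1, j2-1), (i1, j2)),  # keep top-right
        (S_with_cells(A, i1, i2-1, j1+1, j2), (i2, j1)),  # keep bottom-left
        (S_with_cells(A, i1, i2-1, j1, j2-1), (i2, j2)),  # keep bottom-right
    ]
    (sub_sum, cells), (r, c) = max(options, key=lambda o: o[0][0] + A[o[1][0]][o[1][1]])
    return sub_sum + A[r][c], cells + [(r, c)]
```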
General notes:
Dynamic programming is a powerful method for developing fast algorithms for any problem that exhibits two properties:
- Optimal substructure: the problem can be broken into smaller subproblems, and the optimal solutions to those subproblems can be combined into an optimal solution to the original problem.
- Overlapping subproblems: there are relatively few distinct subproblems that actually need to be solved, and the solutions to those subproblems are reused many times.
If the problem has optimal substructure and breaks into only slightly smaller pieces, say a problem of size n broken into subproblems of size n-1, then the problem is a good candidate for dynamic programming.
If, on the other hand, the problem breaks into much smaller pieces, say a problem of size n chopped into two halves of size n/2, then that is divide and conquer, not dynamic programming. Divide-and-conquer solutions are generally very fast; for example, binary search will find an element in a sorted array in O(log n) time.
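As a point of contrast with the table-filling above, a divide-and-conquer routine like binary search throws away half the problem at every step (a standard sketch, not something from the original answer):

```python
def binary_search(sorted_list, target):
    # Each step halves the range being searched, giving O(log n) steps.
    lo, hi = 0, len(sorted_list) - 1
    while lo <= hi:
        mid = (lo + hi) // 2
        if sorted_list[mid] == target:
            return mid
        elif sorted_list[mid] < target:
            lo = mid + 1
        else:
            hi = mid - 1
    return -1  # not found
```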