Why does Python / Numpy require a row vector for the matrix / vector dot product?

Question


Suppose we want to compute the dot product of a matrix and a column vector.

In Python / NumPy we write:

    import numpy

    a = numpy.asarray([[1,2,3], [4,5,6], [7,8,9]])
    b = numpy.asarray([[2],[1],[3]])
    a.dot(b)

Results in:

    array([[13],
           [31],
           [49]])

So far, so good, but why does this also work?

    b = numpy.asarray([2,1,3])
    a.dot(b)

Results in:

    array([13, 31, 49])

I would expect [2,1,3] to be a row vector (which would need to be transposed before applying the dot product), but NumPy seems to treat plain arrays as column vectors, at least in the case of matrix multiplication.

How does this work?

EDIT:

And why:

    b = numpy.asarray([2,1,3])
    b.transpose() == b

So the matrix-vector product works (which suggests the array is treated as a column vector), yet other operations such as transposition have no effect. This is not a consistent design, is it?


python numpy numpy-broadcasting

robert Jan 05 '16 at 8:41


1 answer

shx2 · Answer 1 · 2016-01-05T08:59:33+0000

Let's first look at how the dot operation is defined in numpy.

(Leaving broadcasting rules out of the discussion, for simplicity.) You can perform dot(A,B) if the last dimension of A (i.e. A.shape[-1]) matches the next-to-last dimension of B (i.e. B.shape[-2]) when B.ndim >= 2, or simply the only dimension of B when B.ndim == 1.

In other words, if A.shape=(N1,...,Nk,X) and B.shape=(M1,...,M(j-1),X,Mj) (note the common X), the resulting array will have the shape (N1,...,Nk,M1,...,Mj) (note that X has been removed).

Or, if A.shape=(N1,...,Nk,X) and B.shape=(X,), the resulting array will have the shape (N1,...,Nk) (note that X has been removed).

Your examples work because they satisfy the rules (the first example satisfies the first, the second satisfies the second):

    a = numpy.asarray([[1,2,3], [4,5,6], [7,8,9]])
    b = numpy.asarray([[2],[1],[3]])
    a.shape, b.shape, '->', a.dot(b).shape  # X=3
    # => ((3, 3), (3, 1), '->', (3, 1))

    b = numpy.asarray([2,1,3])
    a.shape, b.shape, '->', a.dot(b).shape  # X=3
    # => ((3, 3), (3,), '->', (3,))
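The same two rules can also be checked on higher-dimensional arrays. The following is only a sketch: the shapes (2, 3, 4), (5, 4, 6) and (4,) are arbitrary values picked for illustration, with X = 4 as the shared dimension, and the names rule1_shape, rule2_shape and mismatch_raises are made up for this example.

```python
import numpy as np

# Illustrative shapes only; X = 4 is the dimension that must match.
A = np.ones((2, 3, 4))   # shape (N1, N2, X)
B = np.ones((5, 4, 6))   # shape (M1, X, M2), X is next-to-last
v = np.ones(4)           # shape (X,), a 1-dimensional array

# First rule: X disappears, the remaining dimensions are concatenated.
rule1_shape = np.dot(A, B).shape   # (2, 3, 5, 6)

# Second rule: B is 1-dimensional, so only X is removed from A's shape.
rule2_shape = np.dot(A, v).shape   # (2, 3)

print(rule1_shape, rule2_shape)

# If the shared dimension does not match, numpy raises ValueError.
try:
    np.dot(A, np.ones(5))
    mismatch_raises = False
except ValueError:
    mismatch_raises = True
```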

My recommendation is that when using numpy, don’t think in terms of “row / column vectors” and, if possible, don’t think in terms of “vectors” at all, but in terms of “an array with shape S”. This means that both row vectors and column vectors are simply “1-dimensional arrays”. As far as numpy is concerned, they are one and the same.

This should also make it clear why in your case b.transpose() is the same as b. b is a 1-dimensional array, and when transposed it remains a 1-dimensional array. Transpose does not affect 1-dimensional arrays.
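To make this concrete, here is a small sketch using the arrays from the question. The names col and row are introduced here for illustration only; reshape is one of several ways to get an explicit 2-dimensional column or row when you really need one.

```python
import numpy as np

a = np.asarray([[1, 2, 3], [4, 5, 6], [7, 8, 9]])
b = np.asarray([2, 1, 3])

# Transposing a 1-dimensional array leaves its shape unchanged.
print(b.shape, b.T.shape)    # (3,) (3,)

# The same 1-dimensional b is accepted on either side of the product,
# so it is neither a "row" nor a "column".
right = a.dot(b)             # array([13, 31, 49])
left = b.dot(a)              # array([27, 33, 39])
print(right, left)

# An explicit 2-dimensional column or row does respond to transposition.
col = b.reshape(-1, 1)       # shape (3, 1)
row = b.reshape(1, -1)       # shape (1, 3)
print(col.T.shape, row.T.shape)   # (1, 3) (3, 1)
print(a.dot(col).shape)           # (3, 1)
```

With the 2-dimensional versions, a.dot(row) would raise a ValueError because the shapes (3, 3) and (1, 3) do not line up, which is the behaviour the question expected from a "row vector".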
