LAPACK does not perform matrix multiplication. He is BLAS, which provides matrix multiplication.
If you have a 64-bit operating system, I recommend that you try the 64-bit version of BLAS first. This will give you an instant doubling of performance.
Secondly, look at a high-performance BLAS implementation such as OpenBLAS . OpenBLAS uses both vectography and parallelization (i.e., multi-core). This is a free (no cost) open source project.
Matlab internally uses the Intel MKL library, which you can also use with the Armadillo library. Intel MKL is a closed source, but is free for non-commercial use. Please note that OpenBLAS can get multiplication performance by a matrix that is on the same level or better than Intel MKL.
Note that high-performance linear algebra is easier to perform on Linux and Mac OS X than on Windows.
mtall source share