* fix all numerical instabilies in the unit tests, now all tests can be run 2000 times with almost zero failures.
* faster matrix-matrix and matrix-vector products (especially for not aligned cases) * faster tridiagonalization (make it using our matrix-vector impl.) Others: * fix Flags of Map * split the test_product to two smaller ones