eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2024-12-27 07:29:52 +08:00

Author	SHA1	Message	Date
Antonio Sanchez	1e6c6c1576	Replace memset with fill to work for non-trivial scalars. For custom scalars, zero is not necessarily represented by a zeroed-out memory block (e.g. gnu MPFR). We therefore cannot rely on `memset` if we want to fill a matrix or tensor with zeroes. Instead, we should rely on `fill`, which for trivial types does end up getting converted to a `memset` under-the-hood (at least with gcc/clang). Requires adding a `fill(begin, end, v)` to `TensorDevice`. Replaced all potentially bad instances of memset with fill. Fixes #2245.	2021-07-08 18:34:41 +00:00
Jakub Lichman	d72c794ccd	Compilation of basicbenchmark fixed	2021-04-21 06:53:32 +00:00
Antonio Sanchez	69adf26aa3	Modify googlehash use to account for namespace issues. The namespace declaration for googlehash is a configurable macro that can be disabled. In particular, it is disabled within google, causing compile errors since `dense_hash_map`/`sparse_hash_map` are then in the global namespace instead of in `::google`. Here we play a bit of gynastics to allow for both `google::_hash_map` and `_hash_map`, while limiting namespace polution. Symbols within the `::google` namespace are imported into `Eigen::google`. We also remove checks based on `_SPARSE_HASH_MAP_H_`, as this is fragile, and instead require `EIGEN_GOOGLEHASH_SUPPORT` to be defined.	2021-04-12 19:00:39 -07:00
Antonio Sanchez	8dfe1029a5	Augment NumTraits with min/max_exponent() again. Replace usage of `std::numeric_limits<...>::min/max_exponent` in codebase where possible. Also replaced some other `numeric_limits` usages in affected tests with the `NumTraits` equivalent. The previous MR !443 failed for c++03 due to lack of `constexpr`. Because of this, we need to keep around the `std::numeric_limits` version in enum expressions until the switch to c++11. Fixes #2148	2021-03-16 20:12:46 -07:00
David Tellenbach	df4bc2731c	Revert "Augment NumTraits with min/max_exponent()." This reverts commit `75ce9cd2a7`.	2021-03-17 03:06:08 +01:00
Antonio Sanchez	75ce9cd2a7	Augment NumTraits with min/max_exponent(). Replace usage of `std::numeric_limits<...>::min/max_exponent` in codebase. Also replaced some other `numeric_limits` usages in affected tests with the `NumTraits` equivalent. Fixes #2148	2021-03-17 01:00:41 +00:00
Antonio Sanchez	119763cf38	Eliminate CMake FindPackageHandleStandardArgs warnings. CMake complains that the package name does not match when the case differs, e.g.: ``` CMake Warning (dev) at /usr/share/cmake-3.18/Modules/FindPackageHandleStandardArgs.cmake:273 (message): The package name passed to `find_package_handle_standard_args` (UMFPACK) does not match the name of the calling package (Umfpack). This can lead to problems in calling code that expects `find_package` result variables (e.g., `_FOUND`) to follow a certain pattern. Call Stack (most recent call first): cmake/FindUmfpack.cmake:50 (find_package_handle_standard_args) bench/spbench/CMakeLists.txt:24 (find_package) This warning is for project developers. Use -Wno-dev to suppress it. ``` Here we rename the libraries to match their true cases.	2021-02-24 09:52:05 +00:00
Sebastien Boisvert	39cbd6578f	Fix #1911 : add benchmark for move semantics with fixed-size matrix $ clang++ -O3 bench/bench_move_semantics.cpp -I. -std=c++11 \ -o bench_move_semantics $ ./bench_move_semantics float copy semantics: 1755.97 ms float move semantics: 55.063 ms double copy semantics: 2457.65 ms double move semantics: 55.034 ms	2020-06-11 23:43:25 +00:00
n0mend	6d2a9a524b	Update run instructions for benchCholesky	2020-06-01 18:31:46 +00:00
Mark Eberlein	ba9d18b938	Add KLU support to spbenchsolver	2020-05-11 21:50:27 +00:00
mehdi-goli	d3e81db6c5	Eigen moved the `scanLauncehr` function inside the internal namespace. This commit applies the following changes: - Moving the `scamLauncher` specialization inside internal namespace to fix compiler crash on TensorScan for SYCL backend. - Replacing `SYCL/sycl.hpp` to `CL/sycl.hpp` in order to follow SYCL 1.2.1 standard. - minor fixes: commenting out an unused variable to avoid compiler warnings.	2020-05-11 16:10:33 +01:00
Clément Grégoire	82f54ad144	Fix perf monitoring merge function	2020-04-28 17:02:59 +00:00
Aaron Franke	5c22c7a7de	Make file formatting comply with POSIX and Unix standards UTF-8, LF, no BOM, and newlines at the end of files	2020-03-23 18:09:02 +00:00
Gael Guennebaud	08eeb648ea	update hg to git hashes	2019-12-05 16:33:24 +01:00
Gael Guennebaud	c488b8b32f	Replace calls to "hg" by calls to "git"	2019-12-04 11:24:06 +01:00
Mehdi Goli	00f32752f7	[SYCL] Rebasing the SYCL support branch on top of the Einge upstream master branch. * Unifying all loadLocalTile from lhs and rhs to an extract_block function. * Adding get_tensor operation which was missing in TensorContractionMapper. * Adding the -D method missing from cmake for Disable_Skinny Contraction operation. * Wrapping all the indices in TensorScanSycl into Scan parameter struct. * Fixing typo in Device SYCL * Unifying load to private register for tall/skinny no shared * Unifying load to vector tile for tensor-vector/vector-tensor operation * Removing all the LHS/RHS class for extracting data from global * Removing Outputfunction from TensorContractionSkinnyNoshared. * Combining the local memory version of tall/skinny and normal tensor contraction into one kernel. * Combining the no-local memory version of tall/skinny and normal tensor contraction into one kernel. * Combining General Tensor-Vector and VectorTensor contraction into one kernel. * Making double buffering optional for Tensor contraction when local memory is version is used. * Modifying benchmark to accept custom Reduction Sizes * Disabling AVX optimization for SYCL backend on the host to allow SSE optimization to the host * Adding Test for SYCL * Modifying SYCL CMake	2019-11-28 10:08:54 +00:00
Hans Johnson	8c8cab1afd	STYLE: Convert CMake-language commands to lower case Ancient CMake versions required upper-case commands. Later command names became case-insensitive. Now the preferred style is lower-case.	2019-10-31 11:36:37 -05:00
Hans Johnson	6fb3e5f176	STYLE: Remove CMake-language block-end command arguments Ancient versions of CMake required else(), endif(), and similar block termination commands to have arguments matching the command starting the block. This is no longer the preferred style.	2019-10-31 11:36:27 -05:00
Gael Guennebaud	0cb4ba98e7	update wrt recent changes	2019-02-21 17:19:36 +01:00
Gael Guennebaud	902a7793f7	Add possibility to bench row-major lhs and rhs	2019-02-15 16:52:34 +01:00
Gael Guennebaud	b3c4344a68	bug #1676 : workaround GCC's bug in c++17 mode.	2019-02-07 15:21:35 +01:00
Gael Guennebaud	efe02292a6	Add recent gemm related changesets and various cleanups in perf-monitoring	2019-01-29 11:53:47 +01:00
Gael Guennebaud	c64d5d3827	Bypass inline asm for non compatible compilers.	2019-01-23 23:43:13 +01:00
Gael Guennebaud	f20c991679	add changesets related to matrix product perf.	2018-12-13 10:33:29 +01:00
luz.paz"	f67b19a884	[PATCH 1/2] Misc. typos From 68d431b4c14ad60a778ee93c1f59ecc4b931950e Mon Sep 17 00:00:00 2001 Found via `codespell -q 3 -I ../eigen-word-whitelist.txt` where the whitelists consists of: ``` als ans cas dum lastr lowd nd overfl pres preverse substraction te uint whch ``` --- CMakeLists.txt \| 26 +++++++++---------- Eigen/src/Core/GenericPacketMath.h \| 2 +- Eigen/src/SparseLU/SparseLU.h \| 2 +- bench/bench_norm.cpp \| 2 +- doc/HiPerformance.dox \| 2 +- doc/QuickStartGuide.dox \| 2 +- .../Eigen/CXX11/src/Tensor/TensorChipping.h \| 6 ++--- .../Eigen/CXX11/src/Tensor/TensorDeviceGpu.h \| 2 +- .../src/Tensor/TensorForwardDeclarations.h \| 4 +-- .../src/Tensor/TensorGpuHipCudaDefines.h \| 2 +- .../Eigen/CXX11/src/Tensor/TensorReduction.h \| 2 +- .../CXX11/src/Tensor/TensorReductionGpu.h \| 2 +- .../test/cxx11_tensor_concatenation.cpp \| 2 +- unsupported/test/cxx11_tensor_executor.cpp \| 2 +- 14 files changed, 29 insertions(+), 29 deletions(-)	2018-09-18 04:15:01 -04:00
Gael Guennebaud	995730fc6c	Add option to disable plot generation	2018-11-07 00:41:16 +01:00
Gael Guennebaud	8a5955a052	Optimize the product of a householder-sequence with the identity, and optimize the evaluation of a HouseholderSequence to a dense matrix using faster blocked product.	2018-07-11 17:16:50 +02:00
luz.paz	e3912f5e63	MIsc. source and comment typos Found using `codespell` and `grep` from downstream FreeCAD	2018-03-11 10:01:44 -04:00
Gael Guennebaud	31e0bda2e3	Fix cmake warning	2017-12-14 15:48:27 +01:00
Gael Guennebaud	0f83aeb6b2	Improve cmake scripts for Pastix and BLAS detection.	2017-04-14 10:22:12 +02:00
Mehdi Goli	f499fe9496	Adding synchronisation to convolution kernel for sycl backend.	2017-03-13 09:18:37 +00:00
Mehdi Goli	aadb7405a7	Fixing typo in sycl Benchmark.	2017-03-08 18:20:06 +00:00
Mehdi Goli	5e9a1e7a7a	Adding sycl Benchmarks.	2017-03-08 14:17:48 +00:00
Benoit Steiner	fbc39fd02c	Merge latest changes from upstream	2017-01-30 15:25:57 -08:00
Gael Guennebaud	45b289505c	Add debug output	2017-01-03 11:31:02 +01:00
Gael Guennebaud	5838f078a7	Fix inclusion	2017-01-03 11:30:27 +01:00
Benoit Steiner	3eda02d78d	Fixed the sycl benchmarking code	2016-12-22 10:37:05 -08:00
Gael Guennebaud	747202d338	typo	2016-12-08 12:48:15 +01:00
Gael Guennebaud	bb297abb9e	make sure we use the right eigen version	2016-12-08 12:00:11 +01:00
Gael Guennebaud	8b4b00d277	fix usage of custom compiler	2016-12-08 11:59:39 +01:00
Gael Guennebaud	7105596899	Add missing include and use -O3	2016-12-07 16:56:08 +01:00
Gael Guennebaud	780f3c1adf	Fix call to convert on linux	2016-12-07 16:30:11 +01:00
Gael Guennebaud	3855ab472f	Cleanup file structure	2016-12-07 14:23:49 +01:00
Gael Guennebaud	59a59fa8e7	Update perf monitoring scripts to generate html/svg outputs	2016-12-07 13:36:56 +01:00
Gael Guennebaud	1b4e085a7f	generate png file for web upload	2016-12-06 16:46:22 +01:00
Gael Guennebaud	f90c4aebc5	Update monitored changeset lists	2016-12-06 15:07:46 +01:00
Gael Guennebaud	c68c8631e7	fix compilation of BTL's blaze interface	2016-12-05 23:02:16 +01:00
Gael Guennebaud	1ff1d4a124	Add performance monitoring for LLT	2016-12-05 23:01:52 +01:00
Gael Guennebaud	445c015751	extend monitoring benchmarks with transpose matrix-vector and triangular matrix-vectors.	2016-12-05 13:36:26 +01:00
Gael Guennebaud	4c0d5f3c01	Add perf monitoring for gemv	2016-12-02 11:34:12 +01:00

1 2 3 4 5 ...

390 Commits