eigen/unsupported/Eigen/CXX11
Rasmus Munk Larsen e2999d4c38 Fix performance regressions due to https://bitbucket.org/eigen/eigen/pull-requests/662.
The change caused the device struct to be copied for each expression evaluation, and caused, e.g., a 10% regression in the TensorFlow multinomial op on GPU:


Benchmark                       Time(ns)        CPU(ns)     Iterations
----------------------------------------------------------------------
BM_Multinomial_gpu_1_100000_4     128173         231326           2922  1.610G items/s

VS

Benchmark                       Time(ns)        CPU(ns)     Iterations
----------------------------------------------------------------------
BM_Multinomial_gpu_1_100000_4     146683         246914           2719  1.509G items/s
2019-08-02 11:18:13 -07:00
..
src Fix performance regressions due to https://bitbucket.org/eigen/eigen/pull-requests/662. 2019-08-02 11:18:13 -07:00
CMakeLists.txt
Tensor Fix GPU build due to gpu_assert not always being defined. 2018-10-18 16:29:29 -07:00
TensorSymmetry
ThreadPool