Benoit Steiner
|
dd2b45feed
|
Removed extraneous 'explicit' keywords
|
2016-05-04 16:57:52 -07:00 |
|
Benoit Steiner
|
968ec1c2ae
|
Use numext::isfinite instead of std::isfinite
|
2016-05-03 19:56:40 -07:00 |
|
Benoit Steiner
|
aad9a04da4
|
Deleted superfluous explicit keyword.
|
2016-05-03 09:37:19 -07:00 |
|
Benoit Steiner
|
8a9228ed9b
|
Fixed compilation error
|
2016-05-01 14:48:01 -07:00 |
|
Benoit Steiner
|
d6c9596fd8
|
Added missing accessors to fixed sized tensors
|
2016-04-29 18:51:33 -07:00 |
|
Benoit Steiner
|
17fe7f354e
|
Deleted trailing commas
|
2016-04-29 18:39:01 -07:00 |
|
Benoit Steiner
|
e5f71aa6b2
|
Deleted useless trailing commas
|
2016-04-29 18:36:10 -07:00 |
|
Benoit Steiner
|
44f592dceb
|
Deleted unnecessary trailing commas.
|
2016-04-29 18:33:46 -07:00 |
|
Benoit Steiner
|
f100d1494c
|
Return the proper size (i.e. 1) for tensors of rank 0
|
2016-04-29 18:14:33 -07:00 |
|
Benoit Steiner
|
a8c0405cf5
|
Deleted unused default values for template parameters
|
2016-04-29 16:34:43 -07:00 |
|
Benoit Steiner
|
c07404f6a1
|
Restore Tensor support for non c++11 compilers
|
2016-04-29 15:19:19 -07:00 |
|
Benoit Steiner
|
ba32ded021
|
Fixed include path
|
2016-04-29 15:11:09 -07:00 |
|
Gael Guennebaud
|
318e65e0ae
|
Fix missing inclusion of Eigen/Core
|
2016-04-27 23:05:40 +02:00 |
|
Rasmus Munk Larsen
|
463738ccbe
|
Use computeProductBlockingSizes to compute blocking for both ShardByCol and ShardByRow cases.
|
2016-04-27 12:26:18 -07:00 |
|
Gael Guennebaud
|
3dddd34133
|
Refactor the unsupported CXX11/Core module to internal headers only.
|
2016-04-26 11:20:25 +02:00 |
|
Benoit Steiner
|
4a164d2c46
|
Fixed the partial evaluation of non vectorizable tensor subexpressions
|
2016-04-25 10:43:03 -07:00 |
|
Benoit Steiner
|
fd9401f260
|
Refined the cost of the striding operation.
|
2016-04-25 09:16:08 -07:00 |
|
Benoit Steiner
|
4bbc97be5e
|
Provide access to the base threadpool classes
|
2016-04-21 17:59:33 -07:00 |
|
Benoit Steiner
|
33adce5c3a
|
Added the ability to switch to the new thread pool with a #define
|
2016-04-21 11:59:58 -07:00 |
|
Benoit Steiner
|
f670613e4b
|
Fixed several compilation warnings
|
2016-04-21 11:03:02 -07:00 |
|
Benoit Steiner
|
2dde1b1028
|
Don't crash when attempting to reduce empty tensors.
|
2016-04-20 18:08:20 -07:00 |
|
Benoit Steiner
|
c7c2054bb5
|
Started to implement a portable way to yield.
|
2016-04-19 17:59:58 -07:00 |
|
Benoit Steiner
|
2b72163028
|
Implemented a more portable version of thread local variables
|
2016-04-19 15:56:02 -07:00 |
|
Benoit Steiner
|
5b1106c56b
|
Fixed a compilation error with nvcc 7.
|
2016-04-19 14:57:57 -07:00 |
|
Benoit Steiner
|
7129d998db
|
Simplified the code that launches cuda kernels.
|
2016-04-19 14:55:21 -07:00 |
|
Benoit Steiner
|
b9ea40c30d
|
Don't take the address of a kernel on CUDA devices that don't support this feature.
|
2016-04-19 14:35:11 -07:00 |
|
Benoit Steiner
|
884c075058
|
Use numext::ceil instead of std::ceil
|
2016-04-19 14:33:30 -07:00 |
|
Benoit Steiner
|
a278414d1b
|
Avoid an unnecessary copy of the evaluator.
|
2016-04-19 13:54:28 -07:00 |
|
Benoit Steiner
|
50968a0a3e
|
Use DenseIndex in the MeanReducer to avoid overflows when processing very large tensors.
|
2016-04-19 11:53:58 -07:00 |
|
Benoit Steiner
|
c8e8f93d6c
|
Move the evalGemm method into the TensorContractionEvaluatorBase class to make it accessible from both the single and multithreaded contraction evaluators.
|
2016-04-15 16:48:10 -07:00 |
|
Benoit Steiner
|
7cff898e0a
|
Deleted unnecessary variable
|
2016-04-15 15:46:14 -07:00 |
|
Benoit Steiner
|
6c43c49e4a
|
Fixed a few compilation warnings
|
2016-04-15 15:34:34 -07:00 |
|
Benoit Steiner
|
eb669f989f
|
Merged in rmlarsen/eigen (pull request PR-178)
Eigen Tensor cost model part 2: Thread scheduling for standard evaluators and reductions.
|
2016-04-15 14:53:15 -07:00 |
|
Rasmus Munk Larsen
|
3718bf654b
|
Get rid of void* casting when calling EvalRange::run.
|
2016-04-15 12:51:33 -07:00 |
|
Benoit Steiner
|
a62e924656
|
Added ability to access the cache sizes from the tensor devices
|
2016-04-14 21:25:06 -07:00 |
|
Benoit Steiner
|
18e6f67426
|
Added support for exclusive or
|
2016-04-14 20:37:46 -07:00 |
|
Rasmus Munk Larsen
|
07ac4f7e02
|
Eigen Tensor cost model part 2: Thread scheduling for standard evaluators and reductions. The cost model is turned off by default.
|
2016-04-14 18:28:23 -07:00 |
|
Benoit Steiner
|
9624a1ea3d
|
Added missing definition of PacketSize in the gpu evaluator of convolution
|
2016-04-14 17:16:58 -07:00 |
|
Benoit Steiner
|
6fbedf5a4e
|
Merged in rmlarsen/eigen (pull request PR-177)
Eigen Tensor cost model part 1.
|
2016-04-14 17:13:19 -07:00 |
|
Benoit Steiner
|
9c064b5a97
|
Cleanup
|
2016-04-14 16:41:31 -07:00 |
|
Benoit Steiner
|
1372156c41
|
Prepared the migration to the new non blocking thread pool
|
2016-04-14 16:16:42 -07:00 |
|
Rasmus Munk Larsen
|
aeb5494a0b
|
Improvements to cost model.
|
2016-04-14 15:52:58 -07:00 |
|
Benoit Steiner
|
78a51abc12
|
Added a more scalable non blocking thread pool
|
2016-04-14 15:23:10 -07:00 |
|
Rasmus Munk Larsen
|
d2e95492e7
|
Merge upstream updates.
|
2016-04-14 13:59:50 -07:00 |
|
Rasmus Munk Larsen
|
235e83aba6
|
Eigen cost model part 1. This implements a basic recursive framework to estimate the cost of evaluating tensor expressions.
|
2016-04-14 13:57:35 -07:00 |
|
Benoit Steiner
|
5912ad877c
|
Silenced a compilation warning
|
2016-04-14 11:40:14 -07:00 |
|
Benoit Steiner
|
c7167fee0e
|
Added support for fp16 to the sigmoid function
|
2016-04-14 10:08:33 -07:00 |
|
Benoit Steiner
|
3b76df64fc
|
Defer the decision to vectorize tensor CUDA code to the meta kernel. This makes it possible to decide whether to vectorize depending on the capability of the target CUDA architecture. In particular, this enables us to vectorize the processing of fp16 when running on devices of capability >= 5.3.
|
2016-04-12 10:58:51 -07:00 |
|
Benoit Steiner
|
7d5b17087f
|
Added missing EIGEN_DEVICE_FUNC to the tensor conversion code.
|
2016-04-07 20:01:19 -07:00 |
|
Benoit Steiner
|
48308ed801
|
Added support for isinf, isnan, and isfinite checks to the tensor api
|
2016-04-07 09:48:36 -07:00 |
|
Benoit Steiner
|
cfb34d808b
|
Fixed a possible integer overflow.
|
2016-04-07 08:46:52 -07:00 |
|
Benoit Steiner
|
7be1eaad1e
|
Fixed typos in the implementation of the zeta and polygamma ops.
|
2016-04-06 14:15:37 -07:00 |
|
tillahoffmann
|
726bd5f077
|
Merged eigen/eigen into default
|
2016-04-05 18:21:05 +01:00 |
|
Gael Guennebaud
|
4d7e230d2f
|
bug #1189: fix pow/atan2 compilation for AutoDiffScalar
|
2016-04-05 14:49:41 +02:00 |
|
Till Hoffmann
|
80eba21ad0
|
Merge upstream.
|
2016-04-01 18:18:49 +01:00 |
|
Till Hoffmann
|
ffd770ce94
|
Fixed CUDA signature.
|
2016-04-01 17:58:24 +01:00 |
|
tillahoffmann
|
49960adbdd
|
Merged eigen/eigen into default
|
2016-04-01 14:36:15 +01:00 |
|
Till Hoffmann
|
57239f4a81
|
Added polygamma function.
|
2016-04-01 14:35:21 +01:00 |
|
Till Hoffmann
|
dd5d390daf
|
Added zeta function.
|
2016-04-01 13:32:29 +01:00 |
|
Benoit Steiner
|
3da495e6b9
|
Relaxed the condition used to gate the fft code.
|
2016-03-31 18:11:51 -07:00 |
|
Benoit Steiner
|
0f5cc504fe
|
Properly gate the fft code
|
2016-03-31 12:59:39 -07:00 |
|
Benoit Steiner
|
af4ef540bf
|
Fixed an off-by-one bug in a debug assertion
|
2016-03-30 18:37:19 -07:00 |
|
Benoit Steiner
|
791e5cfb69
|
Added NumTraits for type2index.
|
2016-03-30 18:36:36 -07:00 |
|
Benoit Steiner
|
483aaad10a
|
Fixed compilation warning
|
2016-03-30 17:08:13 -07:00 |
|
Benoit Steiner
|
1b40abbf99
|
Added missing assignment operator to the TensorUInt128 class, and made misc small improvements
|
2016-03-30 13:17:03 -07:00 |
|
Benoit Steiner
|
aa45ad2aac
|
Fixed the formatting of the README.
|
2016-03-29 15:06:13 -07:00 |
|
Benoit Steiner
|
56df5ef1d7
|
Attempt to fix the formatting of the README
|
2016-03-29 15:03:38 -07:00 |
|
Benoit Steiner
|
c38295f0a0
|
Added support for fmod
|
2016-03-28 15:53:02 -07:00 |
|
Benoit Steiner
|
6772f653c3
|
Made it possible to customize the threadpool
|
2016-03-28 10:01:04 -07:00 |
|
Benoit Steiner
|
1bc81f7889
|
Fixed compilation warnings on arm
|
2016-03-28 09:21:04 -07:00 |
|
Benoit Steiner
|
78f83d6f6a
|
Prevent potential overflow.
|
2016-03-28 09:18:04 -07:00 |
|
Benoit Steiner
|
74f91ed06c
|
Improved support for integer modulo
|
2016-03-25 17:21:56 -07:00 |
|
Benoit Steiner
|
41434a8a85
|
Avoid unnecessary conversions
|
2016-03-23 16:52:38 -07:00 |
|
Benoit Steiner
|
92693b50eb
|
Fixed compilation warning
|
2016-03-23 16:40:36 -07:00 |
|
Benoit Steiner
|
393bc3b16b
|
Added comment
|
2016-03-23 16:22:15 -07:00 |
|
Christoph Hertzberg
|
9642fd7a93
|
Replace all M_PI by EIGEN_PI and add a check to the testsuite.
|
2016-03-23 15:37:45 +01:00 |
|
Benoit Steiner
|
3d1e857327
|
Fixed compilation error
|
2016-03-22 15:48:28 -07:00 |
|
Benoit Steiner
|
de7d92c259
|
Pulled latest updates from trunk
|
2016-03-22 15:24:49 -07:00 |
|
Benoit Steiner
|
002cf0d1c9
|
Use a single Barrier instead of a collection of Notifications to reduce the thread synchronization overhead
|
2016-03-22 15:24:23 -07:00 |
|
Benoit Steiner
|
bc2b802751
|
Fixed a couple of typos
|
2016-03-22 14:27:34 -07:00 |
|
Benoit Steiner
|
6a31b7be3e
|
Avoid using std::vector whenever possible
|
2016-03-22 14:02:50 -07:00 |
|
Benoit Steiner
|
65a7113a36
|
Use an enum instead of a static const int to prevent possible link error
|
2016-03-22 09:33:54 -07:00 |
|
Benoit Steiner
|
f9ad25e4d8
|
Fixed contractions of 16 bit floats
|
2016-03-22 09:30:23 -07:00 |
|
Benoit Steiner
|
8ef3181f15
|
Worked around a constness related issue
|
2016-03-21 11:24:05 -07:00 |
|
Benoit Steiner
|
7a07d6aa2b
|
Small cleanup
|
2016-03-21 11:12:17 -07:00 |
|
Benoit Steiner
|
e91f255301
|
Marked variables that are only used in debug mode as such
|
2016-03-21 10:02:00 -07:00 |
|
Benoit Steiner
|
db5c14de42
|
Explicitly cast the default value into the proper scalar type.
|
2016-03-21 09:52:58 -07:00 |
|
Benoit Steiner
|
8e03333f06
|
Renamed some class members to make the code more readable.
|
2016-03-18 15:21:04 -07:00 |
|
Benoit Steiner
|
6c08943d9f
|
Fixed a bug in the padding of extracted image patches.
|
2016-03-18 15:19:10 -07:00 |
|
Benoit Steiner
|
9a7ece9caf
|
Worked around constness issue
|
2016-03-18 10:38:29 -07:00 |
|
Benoit Steiner
|
edc679f6c6
|
Fixed compilation warning
|
2016-03-18 07:12:34 -07:00 |
|
Benoit Steiner
|
70eb70f5f8
|
Avoid mutable class members when possible
|
2016-03-17 21:47:18 -07:00 |
|
Benoit Steiner
|
95b8961a9b
|
Allocate the mersenne twister used by the random number generators on the heap instead of on the stack since they tend to keep a lot of state (i.e. about 5k) around.
|
2016-03-17 15:23:51 -07:00 |
|
Benoit Steiner
|
f7329619da
|
Fix bug in tensor contraction. The code assumes that the contraction axis indices for the LHS (after possibly swapping to ColMajor!) are increasing. Explicitly sort the contraction axis pairs to make it so.
|
2016-03-17 15:08:02 -07:00 |
|
Christoph Hertzberg
|
46aa9772fc
|
Merged in ebrevdo/eigen (pull request PR-169)
Bugfixes to cuda tests, igamma & igammac implemented, & tests for digamma, igamma, igammac on CPU & GPU.
|
2016-03-16 21:59:08 +01:00 |
|
Benoit Steiner
|
b72ffcb05e
|
Made the comparison of Eigen::array GPU friendly
|
2016-03-11 16:37:59 -08:00 |
|
Benoit Steiner
|
25f69cb932
|
Added a comparison operator for Eigen::array
Alias Eigen::array to std::array when compiling with Visual Studio 2015
|
2016-03-11 15:20:37 -08:00 |
|
Benoit Steiner
|
86d45a3c83
|
Worked around visual studio compilation warnings.
|
2016-03-09 21:29:39 -08:00 |
|
Benoit Steiner
|
8fd4241377
|
Fixed a typo.
|
2016-03-10 02:28:46 +00:00 |
|
Benoit Steiner
|
a685a6beed
|
Made the list reductions less ambiguous.
|
2016-03-09 17:41:52 -08:00 |
|