Benoit Steiner
|
9624a1ea3d
|
Added missing definition of PacketSize in the gpu evaluator of convolution
|
2016-04-14 17:16:58 -07:00 |
|
Benoit Steiner
|
6fbedf5a4e
|
Merged in rmlarsen/eigen (pull request PR-177)
Eigen Tensor cost model part 1.
|
2016-04-14 17:13:19 -07:00 |
|
Benoit Steiner
|
9c064b5a97
|
Cleanup
|
2016-04-14 16:41:31 -07:00 |
|
Benoit Steiner
|
1372156c41
|
Prepared the migration to the new non blocking thread pool
|
2016-04-14 16:16:42 -07:00 |
|
Rasmus Munk Larsen
|
aeb5494a0b
|
Improvements to cost model.
|
2016-04-14 15:52:58 -07:00 |
|
Benoit Steiner
|
78a51abc12
|
Added a more scalable non blocking thread pool
|
2016-04-14 15:23:10 -07:00 |
|
Rasmus Munk Larsen
|
d2e95492e7
|
Merge upstream updates.
|
2016-04-14 13:59:50 -07:00 |
|
Rasmus Munk Larsen
|
235e83aba6
|
Eigen cost model part 1. This implements a basic recursive framework to estimate the cost of evaluating tensor expressions.
|
2016-04-14 13:57:35 -07:00 |
|
Benoit Steiner
|
5912ad877c
|
Silenced a compilation warning
|
2016-04-14 11:40:14 -07:00 |
|
Benoit Steiner
|
c7167fee0e
|
Added support for fp16 to the sigmoid function
|
2016-04-14 10:08:33 -07:00 |
|
Benoit Steiner
|
3b76df64fc
|
Defer the decision to vectorize tensor CUDA code to the meta kernel. This makes it possible to decide whether to vectorize depending on the capability of the target CUDA architecture. In particular, this enables us to vectorize the processing of fp16 when running on devices of capability >= 5.3
|
2016-04-12 10:58:51 -07:00 |
|
Benoit Steiner
|
7d5b17087f
|
Added missing EIGEN_DEVICE_FUNC to the tensor conversion code.
|
2016-04-07 20:01:19 -07:00 |
|
Benoit Steiner
|
48308ed801
|
Added support for isinf, isnan, and isfinite checks to the tensor api
|
2016-04-07 09:48:36 -07:00 |
|
Benoit Steiner
|
cfb34d808b
|
Fixed a possible integer overflow.
|
2016-04-07 08:46:52 -07:00 |
|
Benoit Steiner
|
7be1eaad1e
|
Fixed typos in the implementation of the zeta and polygamma ops.
|
2016-04-06 14:15:37 -07:00 |
|
tillahoffmann
|
726bd5f077
|
Merged eigen/eigen into default
|
2016-04-05 18:21:05 +01:00 |
|
Gael Guennebaud
|
4d7e230d2f
|
bug #1189: fix pow/atan2 compilation for AutoDiffScalar
|
2016-04-05 14:49:41 +02:00 |
|
Till Hoffmann
|
80eba21ad0
|
Merge upstream.
|
2016-04-01 18:18:49 +01:00 |
|
Till Hoffmann
|
ffd770ce94
|
Fixed CUDA signature.
|
2016-04-01 17:58:24 +01:00 |
|
tillahoffmann
|
49960adbdd
|
Merged eigen/eigen into default
|
2016-04-01 14:36:15 +01:00 |
|
Till Hoffmann
|
57239f4a81
|
Added polygamma function.
|
2016-04-01 14:35:21 +01:00 |
|
Till Hoffmann
|
dd5d390daf
|
Added zeta function.
|
2016-04-01 13:32:29 +01:00 |
|
Benoit Steiner
|
3da495e6b9
|
Relaxed the condition used to gate the fft code.
|
2016-03-31 18:11:51 -07:00 |
|
Benoit Steiner
|
0f5cc504fe
|
Properly gate the fft code
|
2016-03-31 12:59:39 -07:00 |
|
Benoit Steiner
|
af4ef540bf
|
Fixed an off-by-one bug in a debug assertion
|
2016-03-30 18:37:19 -07:00 |
|
Benoit Steiner
|
791e5cfb69
|
Added NumTraits for type2index.
|
2016-03-30 18:36:36 -07:00 |
|
Benoit Steiner
|
483aaad10a
|
Fixed compilation warning
|
2016-03-30 17:08:13 -07:00 |
|
Benoit Steiner
|
1b40abbf99
|
Added missing assignment operator to the TensorUInt128 class, and made misc small improvements
|
2016-03-30 13:17:03 -07:00 |
|
Benoit Steiner
|
aa45ad2aac
|
Fixed the formatting of the README.
|
2016-03-29 15:06:13 -07:00 |
|
Benoit Steiner
|
56df5ef1d7
|
Attempt to fix the formatting of the README
|
2016-03-29 15:03:38 -07:00 |
|
Benoit Steiner
|
c38295f0a0
|
Added support for fmod
|
2016-03-28 15:53:02 -07:00 |
|
Benoit Steiner
|
6772f653c3
|
Made it possible to customize the threadpool
|
2016-03-28 10:01:04 -07:00 |
|
Benoit Steiner
|
1bc81f7889
|
Fixed compilation warnings on arm
|
2016-03-28 09:21:04 -07:00 |
|
Benoit Steiner
|
78f83d6f6a
|
Prevent potential overflow.
|
2016-03-28 09:18:04 -07:00 |
|
Benoit Steiner
|
74f91ed06c
|
Improved support for integer modulo
|
2016-03-25 17:21:56 -07:00 |
|
Benoit Steiner
|
41434a8a85
|
Avoid unnecessary conversions
|
2016-03-23 16:52:38 -07:00 |
|
Benoit Steiner
|
92693b50eb
|
Fixed compilation warning
|
2016-03-23 16:40:36 -07:00 |
|
Benoit Steiner
|
393bc3b16b
|
Added comment
|
2016-03-23 16:22:15 -07:00 |
|
Christoph Hertzberg
|
9642fd7a93
|
Replace all M_PI by EIGEN_PI and add a check to the testsuite.
|
2016-03-23 15:37:45 +01:00 |
|
Benoit Steiner
|
3d1e857327
|
Fixed compilation error
|
2016-03-22 15:48:28 -07:00 |
|
Benoit Steiner
|
de7d92c259
|
Pulled latest updates from trunk
|
2016-03-22 15:24:49 -07:00 |
|
Benoit Steiner
|
002cf0d1c9
|
Use a single Barrier instead of a collection of Notifications to reduce the thread synchronization overhead
|
2016-03-22 15:24:23 -07:00 |
|
Benoit Steiner
|
bc2b802751
|
Fixed a couple of typos
|
2016-03-22 14:27:34 -07:00 |
|
Benoit Steiner
|
6a31b7be3e
|
Avoid using std::vector whenever possible
|
2016-03-22 14:02:50 -07:00 |
|
Benoit Steiner
|
65a7113a36
|
Use an enum instead of a static const int to prevent possible link error
|
2016-03-22 09:33:54 -07:00 |
|
Benoit Steiner
|
f9ad25e4d8
|
Fixed contractions of 16 bit floats
|
2016-03-22 09:30:23 -07:00 |
|
Benoit Steiner
|
8ef3181f15
|
Worked around a constness related issue
|
2016-03-21 11:24:05 -07:00 |
|
Benoit Steiner
|
7a07d6aa2b
|
Small cleanup
|
2016-03-21 11:12:17 -07:00 |
|
Benoit Steiner
|
e91f255301
|
Marked variables that are only used in debug mode as such
|
2016-03-21 10:02:00 -07:00 |
|
Benoit Steiner
|
db5c14de42
|
Explicitly cast the default value into the proper scalar type.
|
2016-03-21 09:52:58 -07:00 |
|