Mehdi Goli
|
7318daf887
|
Fixing LLVM error on TensorMorphingSycl.h on GPU; fixing int64_t crash for tensor_broadcast_sycl on GPU; adding get_sycl_supported_devices() on syclDevice.h.
|
2016-11-25 16:19:07 +00:00 |
|
Mehdi Goli
|
b8cc5635d5
|
Removing unsupported device from test case; cleaning the tensor device sycl.
|
2016-11-23 16:30:41 +00:00 |
|
Benoit Steiner
|
f11da1d83b
|
Made the QueueInterface thread safe
|
2016-11-20 13:17:08 -08:00 |
|
Benoit Steiner
|
ed839c5851
|
Enable the use of constant expressions with clang >= 3.6
|
2016-11-20 10:34:49 -08:00 |
|
Benoit Steiner
|
6d781e3e52
|
Merged eigen/eigen into default
|
2016-11-20 10:12:54 -08:00 |
|
Benoit Steiner
|
79a07b891b
|
Fixed a typo
|
2016-11-20 07:07:41 -08:00 |
|
Gael Guennebaud
|
465ede0f20
|
Fix compilation issue in mat = permutation (regression introduced in 8193ffb3d3
)
|
2016-11-20 09:41:37 +01:00 |
|
Benoit Steiner
|
81151bd474
|
Fixed merge conflicts
|
2016-11-19 19:12:59 -08:00 |
|
Benoit Steiner
|
9265ca707e
|
Made it possible to check the state of a sycl device without synchronization
|
2016-11-19 10:56:24 -08:00 |
|
Benoit Steiner
|
2d1aec15a7
|
Added missing include
|
2016-11-19 08:09:54 -08:00 |
|
Luke Iwanski
|
af67335e0e
|
Added test for cwiseMin, cwiseMax and operator%.
|
2016-11-19 13:37:27 +00:00 |
|
Benoit Steiner
|
1bdf1b9ce0
|
Merged in benoitsteiner/opencl (pull request PR-253)
OpenCL improvements
|
2016-11-19 04:44:43 +00:00 |
|
Benoit Steiner
|
a357fe1fb9
|
Code cleanup
|
2016-11-18 16:58:09 -08:00 |
|
Benoit Steiner
|
1c6eafb46b
|
Updated cxx11_tensor_device_sycl to run only on the OpenCL devices available on the host
|
2016-11-18 16:43:27 -08:00 |
|
Benoit Steiner
|
ca754caa23
|
Only runs the cxx11_tensor_reduction_sycl on devices that are available.
|
2016-11-18 16:31:14 -08:00 |
|
Benoit Steiner
|
dc601d79d1
|
Added the ability to run test exclusively OpenCL devices that are listed by sycl::device::get_devices().
|
2016-11-18 16:26:50 -08:00 |
|
Benoit Steiner
|
8649e16c2a
|
Enable EIGEN_HAS_C99_MATH when building with the latest version of Visual Studio
|
2016-11-18 14:18:34 -08:00 |
|
Benoit Steiner
|
110b7f8d9f
|
Deleted unnecessary semicolons
|
2016-11-18 14:06:17 -08:00 |
|
Benoit Steiner
|
b5e3285e16
|
Test broadcasting on OpenCL devices with 64 bit indexing
|
2016-11-18 13:44:20 -08:00 |
|
Gael Guennebaud
|
164414c563
|
Merged in ChunW/eigen (pull request PR-252)
Workaround for error in VS2012 with /clr
|
2016-11-18 21:07:29 +00:00 |
|
Benoit Steiner
|
37c2c516a6
|
Cleaned up the sycl device code
|
2016-11-18 12:38:06 -08:00 |
|
Benoit Steiner
|
7335c49204
|
Fixed the cxx11_tensor_device_sycl test
|
2016-11-18 12:37:13 -08:00 |
|
Mehdi Goli
|
15e226d7d3
|
adding Benoit changes on the TensorDeviceSycl.h
|
2016-11-18 16:34:54 +00:00 |
|
Mehdi Goli
|
622805a0c5
|
Modifying TensorDeviceSycl.h to always create buffer of type uint8_t and convert them to the actual type at the execution on the device; adding the queue interface class to separate the lifespan of sycl queue and buffers,created for that queue, from Eigen::SyclDevice; modifying sycl tests to support the evaluation of the results for both row major and column major data layout on all different devices that are supported by Sycl{CPU; GPU; and Host}.
|
2016-11-18 16:20:42 +00:00 |
|
Luke Iwanski
|
5159675c33
|
Added isnan, isfinite and isinf for SYCL device. Plus test for that.
|
2016-11-18 16:01:48 +00:00 |
|
Luke Iwanski
|
927bd62d2a
|
Now testing out (+=, =) in.FUNC() and out (+=, =) out.FUNC()
|
2016-11-18 11:16:42 +00:00 |
|
Gael Guennebaud
|
8193ffb3d3
|
bug #1343: fix compilation regression in mat+=selfadjoint_view.
Generic EigenBase2EigenBase assignment was incomplete.
|
2016-11-18 10:17:34 +01:00 |
|
Gael Guennebaud
|
cebff7e3a2
|
bug #1343: fix compilation regression in array = matrix_product
|
2016-11-18 10:09:33 +01:00 |
|
Benoit Steiner
|
7c30078b9f
|
Merged eigen/eigen into default
|
2016-11-17 22:53:37 -08:00 |
|
Benoit Steiner
|
553f50b246
|
Added a way to detect errors generated by the opencl device from the host
|
2016-11-17 21:51:48 -08:00 |
|
Benoit Steiner
|
72a45d32e9
|
Cleanup
|
2016-11-17 21:29:15 -08:00 |
|
Benoit Steiner
|
4349fc640e
|
Created a test to check that the sycl runtime can successfully report errors (like ivision by 0).
Small cleanup
|
2016-11-17 20:27:54 -08:00 |
|
Benoit Steiner
|
a6a3fd0703
|
Made TensorDeviceCuda.h compile on windows
|
2016-11-17 16:15:27 -08:00 |
|
Chun Wang
|
0d0948c3b9
|
Workaround for error in VS2012 with /clr
|
2016-11-17 17:54:27 -05:00 |
|
Benoit Steiner
|
004344cf54
|
Avoid calling log(0) or 1/0
|
2016-11-17 11:56:44 -08:00 |
|
Konstantinos Margaritis
|
a1d5c503fa
|
replace sizeof(Packet) with PacketSize else it breaks for ZVector.Packet4f
|
2016-11-17 13:27:45 -05:00 |
|
Konstantinos Margaritis
|
672aa97d4d
|
implement float/std::complex<float> for ZVector as well, minor fixes to ZVector
|
2016-11-17 13:27:33 -05:00 |
|
Luke Iwanski
|
7878756dea
|
Fixed existing test.
|
2016-11-17 17:46:55 +00:00 |
|
Luke Iwanski
|
c5130dedbe
|
Specialised basic math functions for SYCL device.
|
2016-11-17 11:47:13 +00:00 |
|
Benoit Steiner
|
f2e8b73256
|
Enable the use of AVX512 instruction by default
|
2016-11-16 21:28:04 -08:00 |
|
Gael Guennebaud
|
7b09e4dd8c
|
bump default branch to 3.3.90
|
2016-11-16 22:20:58 +01:00 |
|
Benoit Steiner
|
dff9a049c4
|
Optimized the computation of exp, sqrt, ceil anf floor for fp16 on Pascal GPUs
|
2016-11-16 09:01:51 -08:00 |
|
Benoit Steiner
|
b5c75351e3
|
Merged eigen/eigen into default
|
2016-11-14 15:54:44 -08:00 |
|
Rasmus Munk Larsen
|
32df1b1046
|
Reduce dispatch overhead in parallelFor by only calling thread_pool.Schedule() for one of the two recursive calls in handleRange. This avoids going through the scedule path to push both recursive calls onto another thread-queue in the binary tree, but instead executes one of them on the main thread. At the leaf level this will still activate a full complement of threads, but will save up to 50% of the overhead in Schedule (random number generation, insertion in queue which includes signaling via atomics).
|
2016-11-14 14:18:16 -08:00 |
|
Mehdi Goli
|
05e8c2a1d9
|
Adding extra test for non-fixed size to broadcast; Replacing stcl with sycl.
|
2016-11-14 18:13:53 +00:00 |
|
Mehdi Goli
|
f8ca893976
|
Adding TensorFixsize; adding sycl device memcpy; adding insial stage of slicing.
|
2016-11-14 17:51:57 +00:00 |
|
Gael Guennebaud
|
0ee92aa38e
|
Optimize sparse<bool> && sparse<bool> to use the same path as for coeff-wise products.
|
2016-11-14 18:47:41 +01:00 |
|
Gael Guennebaud
|
2e334f5da0
|
bug #426: move operator && and || to MatrixBase and SparseMatrixBase.
|
2016-11-14 18:47:02 +01:00 |
|
Gael Guennebaud
|
a048aba14c
|
Merged in olesalscheider/eigen (pull request PR-248)
Make sure not to call numext::maxi on expression templates
|
2016-11-14 13:25:53 +00:00 |
|
Gael Guennebaud
|
eedb87f4ba
|
Fix regression in SparseMatrix::ReverseInnerIterator
|
2016-11-14 14:05:53 +01:00 |
|