Mehdi Goli
|
622805a0c5
|
Modifying TensorDeviceSycl.h to always create buffers of type uint8_t and convert them to the actual type at execution on the device; adding the queue interface class to separate the lifespan of the sycl queue, and of the buffers created for that queue, from Eigen::SyclDevice; modifying sycl tests to support the evaluation of the results for both row-major and column-major data layouts on all devices supported by SYCL (CPU, GPU, and Host).
|
2016-11-18 16:20:42 +00:00 |
|
Luke Iwanski
|
5159675c33
|
Added isnan, isfinite and isinf for SYCL device. Plus test for that.
|
2016-11-18 16:01:48 +00:00 |
|
Luke Iwanski
|
927bd62d2a
|
Now testing out (+=, =) in.FUNC() and out (+=, =) out.FUNC()
|
2016-11-18 11:16:42 +00:00 |
|
Benoit Steiner
|
7c30078b9f
|
Merged eigen/eigen into default
|
2016-11-17 22:53:37 -08:00 |
|
Benoit Steiner
|
553f50b246
|
Added a way to detect errors generated by the opencl device from the host
|
2016-11-17 21:51:48 -08:00 |
|
Benoit Steiner
|
72a45d32e9
|
Cleanup
|
2016-11-17 21:29:15 -08:00 |
|
Benoit Steiner
|
4349fc640e
|
Created a test to check that the sycl runtime can successfully report errors (like division by 0).
Small cleanup
|
2016-11-17 20:27:54 -08:00 |
|
Benoit Steiner
|
a6a3fd0703
|
Made TensorDeviceCuda.h compile on windows
|
2016-11-17 16:15:27 -08:00 |
|
Benoit Steiner
|
004344cf54
|
Avoid calling log(0) or 1/0
|
2016-11-17 11:56:44 -08:00 |
|
Konstantinos Margaritis
|
a1d5c503fa
|
replace sizeof(Packet) with PacketSize else it breaks for ZVector.Packet4f
|
2016-11-17 13:27:45 -05:00 |
|
Konstantinos Margaritis
|
672aa97d4d
|
implement float/std::complex<float> for ZVector as well, minor fixes to ZVector
|
2016-11-17 13:27:33 -05:00 |
|
Luke Iwanski
|
7878756dea
|
Fixed existing test.
|
2016-11-17 17:46:55 +00:00 |
|
Luke Iwanski
|
c5130dedbe
|
Specialised basic math functions for SYCL device.
|
2016-11-17 11:47:13 +00:00 |
|
Benoit Steiner
|
f2e8b73256
|
Enable the use of AVX512 instructions by default
|
2016-11-16 21:28:04 -08:00 |
|
Gael Guennebaud
|
7b09e4dd8c
|
bump default branch to 3.3.90
|
2016-11-16 22:20:58 +01:00 |
|
Benoit Steiner
|
dff9a049c4
|
Optimized the computation of exp, sqrt, ceil and floor for fp16 on Pascal GPUs
|
2016-11-16 09:01:51 -08:00 |
|
Benoit Steiner
|
b5c75351e3
|
Merged eigen/eigen into default
|
2016-11-14 15:54:44 -08:00 |
|
Rasmus Munk Larsen
|
32df1b1046
|
Reduce dispatch overhead in parallelFor by only calling thread_pool.Schedule() for one of the two recursive calls in handleRange. This avoids going through the schedule path to push both recursive calls onto another thread-queue in the binary tree, and instead executes one of them on the main thread. At the leaf level this will still activate a full complement of threads, but will save up to 50% of the overhead in Schedule (random number generation, insertion in queue which includes signaling via atomics).
|
2016-11-14 14:18:16 -08:00 |
|
Mehdi Goli
|
05e8c2a1d9
|
Adding extra test for non-fixed size to broadcast; Replacing stcl with sycl.
|
2016-11-14 18:13:53 +00:00 |
|
Mehdi Goli
|
f8ca893976
|
Adding TensorFixsize; adding sycl device memcpy; adding initial stage of slicing.
|
2016-11-14 17:51:57 +00:00 |
|
Gael Guennebaud
|
0ee92aa38e
|
Optimize sparse<bool> && sparse<bool> to use the same path as for coeff-wise products.
|
2016-11-14 18:47:41 +01:00 |
|
Gael Guennebaud
|
2e334f5da0
|
bug #426: move operator && and || to MatrixBase and SparseMatrixBase.
|
2016-11-14 18:47:02 +01:00 |
|
Gael Guennebaud
|
a048aba14c
|
Merged in olesalscheider/eigen (pull request PR-248)
Make sure not to call numext::maxi on expression templates
|
2016-11-14 13:25:53 +00:00 |
|
Gael Guennebaud
|
eedb87f4ba
|
Fix regression in SparseMatrix::ReverseInnerIterator
|
2016-11-14 14:05:53 +01:00 |
|
Niels Ole Salscheider
|
51fef87408
|
Make sure not to call numext::maxi on expression templates
|
2016-11-12 12:20:57 +01:00 |
|
Mehdi Goli
|
a5c3f15682
|
Adding comment to TensorDeviceSycl.h and cleaning the code.
|
2016-11-11 19:06:34 +00:00 |
|
Benoit Steiner
|
f4722aa479
|
Merged in benoitsteiner/opencl (pull request PR-247)
|
2016-11-11 00:01:28 +00:00 |
|
Mehdi Goli
|
3be3963021
|
Adding EIGEN_STRONG_INLINE back; using size() instead of dimensions.TotalSize() on Tensor.
|
2016-11-10 19:16:31 +00:00 |
|
Mehdi Goli
|
12387abad5
|
adding the missing in eigen_assert!
|
2016-11-10 18:58:08 +00:00 |
|
Mehdi Goli
|
2e704d4257
|
Adding Memset; optimising memcpyDeviceToHost by removing double copying.
|
2016-11-10 18:45:12 +00:00 |
|
Gael Guennebaud
|
eeac81b8c0
|
bump to 3.3.0
|
2016-11-10 13:55:14 +01:00 |
|
Gael Guennebaud
|
e80bc2ddb0
|
Fix printing of sparse expressions
|
2016-11-10 10:35:32 +01:00 |
|
Benoit Steiner
|
75c080b176
|
Added a test to validate memory transfers between host and sycl device
|
2016-11-09 06:23:42 -08:00 |
|
Benoit Steiner
|
db3903498d
|
Merged in benoitsteiner/opencl (pull request PR-246)
Improved support for OpenCL
|
2016-11-08 22:28:44 +00:00 |
|
Benoit Steiner
|
dcc14bee64
|
Fixed the formatting of the code
|
2016-11-08 14:24:46 -08:00 |
|
Benoit Steiner
|
b88c1117d4
|
Fixed the indentation of the cmake file
|
2016-11-08 14:22:36 -08:00 |
|
Luke Iwanski
|
912cb3d660
|
#if EIGEN_EXCEPTION -> #ifdef EIGEN_EXCEPTIONS.
|
2016-11-08 22:01:14 +00:00 |
|
Luke Iwanski
|
1b345b0895
|
Fix for SYCL queue initialisation.
|
2016-11-08 21:56:31 +00:00 |
|
Luke Iwanski
|
1b95717358
|
Use try/catch only when exceptions are enabled.
|
2016-11-08 21:08:53 +00:00 |
|
Mehdi Goli
|
d57430dd73
|
Converting all sycl buffers to uninitialised device-only buffers; adding memcpyHostToDevice and memcpyDeviceToHost on syclDevice; modifying all examples to obey the new rules; moving sycl queue creation to the device based on Benoit's suggestion; removing the sycl-specific condition for returning m_result in TensorReduction.h according to Benoit's suggestion.
|
2016-11-08 17:08:02 +00:00 |
|
Gael Guennebaud
|
73985ead27
|
Extend unit test to check sparse solvers with a SparseVector as the rhs and result.
|
2016-11-06 20:29:57 +01:00 |
|
Gael Guennebaud
|
436a111792
|
Generalize Cholmod support to handle any sparse type as the rhs and result of the solve method
|
2016-11-06 20:29:23 +01:00 |
|
Gael Guennebaud
|
afc55b1885
|
Generalize IterativeSolverBase::solve to handle any sparse type as the result (instead of SparseMatrix only)
|
2016-11-06 20:28:18 +01:00 |
|
Gael Guennebaud
|
a5c2d8a3cc
|
Generalize solve_sparse_through_dense_panels to handle SparseVector.
|
2016-11-06 15:20:58 +01:00 |
|
Gael Guennebaud
|
f8bfe10613
|
Add missing friend declaration
|
2016-11-06 15:20:30 +01:00 |
|
Gael Guennebaud
|
fc7180cda8
|
Add a default ctor to evaluator<SparseVector>.
Needed for evaluator<Solve>.
|
2016-11-06 15:20:00 +01:00 |
|
Gael Guennebaud
|
4d226ab5b5
|
Enable swapping between SparseMatrix and SparseVector
|
2016-11-06 15:15:03 +01:00 |
|
Benoit Steiner
|
ad086b03e4
|
Removed unnecessary statement
|
2016-11-05 12:43:27 -07:00 |
|
Benoit Steiner
|
dad177be01
|
Added missing includes
|
2016-11-05 10:04:42 -07:00 |
|
Gael Guennebaud
|
55b4fd1d40
|
Extend mpreal unit test to check LLT with complexes.
|
2016-11-05 11:28:53 +01:00 |
|