Mehdi Goli
|
a5c3f15682
|
Adding comment to TensorDeviceSycl.h and cleaning the code.
|
2016-11-11 19:06:34 +00:00 |
|
Benoit Steiner
|
f4722aa479
|
Merged in benoitsteiner/opencl (pull request PR-247)
|
2016-11-11 00:01:28 +00:00 |
|
Mehdi Goli
|
3be3963021
|
Adding EIGEN_STRONG_INLINE back; using size() instead of dimensions.TotalSize() on Tensor.
|
2016-11-10 19:16:31 +00:00 |
|
Mehdi Goli
|
12387abad5
|
adding the missing in eigen_assert!
|
2016-11-10 18:58:08 +00:00 |
|
Mehdi Goli
|
2e704d4257
|
Adding Memset; optimising MecopyDeviceToHost by removing double copying;
|
2016-11-10 18:45:12 +00:00 |
|
Gael Guennebaud
|
eeac81b8c0
|
bump to 3.3.0
|
2016-11-10 13:55:14 +01:00 |
|
Gael Guennebaud
|
e80bc2ddb0
|
Fix printing of sparse expressions
|
2016-11-10 10:35:32 +01:00 |
|
Benoit Steiner
|
75c080b176
|
Added a test to validate memory transfers between host and sycl device
|
2016-11-09 06:23:42 -08:00 |
|
Benoit Steiner
|
db3903498d
|
Merged in benoitsteiner/opencl (pull request PR-246)
Improved support for OpenCL
|
2016-11-08 22:28:44 +00:00 |
|
Benoit Steiner
|
dcc14bee64
|
Fixed the formatting of the code
|
2016-11-08 14:24:46 -08:00 |
|
Benoit Steiner
|
b88c1117d4
|
Fixed the indentation of the cmake file
|
2016-11-08 14:22:36 -08:00 |
|
Luke Iwanski
|
912cb3d660
|
#if EIGEN_EXCEPTION -> #ifdef EIGEN_EXCEPTIONS.
|
2016-11-08 22:01:14 +00:00 |
|
Luke Iwanski
|
1b345b0895
|
Fix for SYCL queue initialisation.
|
2016-11-08 21:56:31 +00:00 |
|
Luke Iwanski
|
1b95717358
|
Use try/catch only when exceptions are enabled.
|
2016-11-08 21:08:53 +00:00 |
|
Mehdi Goli
|
d57430dd73
|
Converting all sycl buffers to uninitialised device only buffers; adding memcpyHostToDevice and memcpyDeviceToHost on syclDevice; modifying all examples to obey the new rules; moving sycl queue creating to the device based on Benoit suggestion; removing the sycl specefic condition for returning m_result in TensorReduction.h according to Benoit suggestion.
|
2016-11-08 17:08:02 +00:00 |
|
Gael Guennebaud
|
73985ead27
|
Extend unit test to check sparse solvers with a SparseVector as the rhs and result.
|
2016-11-06 20:29:57 +01:00 |
|
Gael Guennebaud
|
436a111792
|
Generalize Cholmod support to hanlde any sparse type as the rhs and result of the solve method
|
2016-11-06 20:29:23 +01:00 |
|
Gael Guennebaud
|
afc55b1885
|
Generalize IterativeSolverBase::solve to hanlde any sparse type as the results (instead of SparseMatrix only)
|
2016-11-06 20:28:18 +01:00 |
|
Gael Guennebaud
|
a5c2d8a3cc
|
Generalize solve_sparse_through_dense_panels to handle SparseVector.
|
2016-11-06 15:20:58 +01:00 |
|
Gael Guennebaud
|
f8bfe10613
|
Add missing friend declaration
|
2016-11-06 15:20:30 +01:00 |
|
Gael Guennebaud
|
fc7180cda8
|
Add a default ctor to evaluator<SparseVector>.
Needed for evaluator<Solve>.
|
2016-11-06 15:20:00 +01:00 |
|
Gael Guennebaud
|
4d226ab5b5
|
Enable swapping between SparseMatrix and SparseVector
|
2016-11-06 15:15:03 +01:00 |
|
Benoit Steiner
|
ad086b03e4
|
Removed unnecessary statement
|
2016-11-05 12:43:27 -07:00 |
|
Benoit Steiner
|
dad177be01
|
Added missing includes
|
2016-11-05 10:04:42 -07:00 |
|
Gael Guennebaud
|
55b4fd1d40
|
Extend mpreal unit test to check LLT with complexes.
|
2016-11-05 11:28:53 +01:00 |
|
Gael Guennebaud
|
a354c3ca59
|
Fix compilation of LLT with complex<mpreal>.
|
2016-11-05 11:28:29 +01:00 |
|
Benoit Steiner
|
d46a36cc84
|
Merged eigen/eigen into default
|
2016-11-04 18:22:55 -07:00 |
|
Mehdi Goli
|
0ebe3808ca
|
Removed the sycl include from Eigen/Core and moved it to Unsupported/Eigen/CXX11/Tensor; added TensorReduction for sycl (full reduction and partial reduction); added TensorReduction test case for sycl (full reduction and partial reduction); fixed the tile size on TensorSyclRun.h based on the device max work group size;
|
2016-11-04 18:18:19 +00:00 |
|
Gael Guennebaud
|
47d1b4a609
|
Added tag 3.3-rc2 for changeset ba05572dcb
|
2016-11-04 09:09:18 +01:00 |
|
Gael Guennebaud
|
ba05572dcb
|
bump to 3.3-rc2
|
2016-11-04 09:09:06 +01:00 |
|
Benoit Steiner
|
5c3995769c
|
Improved AVX512 configuration
|
2016-11-03 04:50:28 -07:00 |
|
Benoit Steiner
|
fbe672d599
|
Reenable the generation of dynamic blas libraries.
|
2016-11-03 04:08:43 -07:00 |
|
Benoit Steiner
|
ca0ba0d9a4
|
Improved AVX512 support
|
2016-11-03 04:00:49 -07:00 |
|
Benoit Steiner
|
c80587c92b
|
Merged eigen/eigen into default
|
2016-11-03 03:55:11 -07:00 |
|
Gael Guennebaud
|
3f1d0cdc22
|
bug #1337: improve doc of homogeneous() and hnormalized()
|
2016-11-03 11:03:08 +01:00 |
|
Gael Guennebaud
|
78e93ac1ad
|
bug #1330: Cholmod supports double precision only, so let's trigger a static assertion if the scalar type does not match this requirement.
|
2016-11-03 10:21:59 +01:00 |
|
Benoit Steiner
|
3e37166d0b
|
Merged in benoitsteiner/opencl (pull request PR-244)
Disable vectorization on device only when compiling for sycl
|
2016-11-02 22:01:03 +00:00 |
|
Benoit Steiner
|
0585b2965d
|
Disable vectorization on device only when compiling for sycl
|
2016-11-02 11:44:27 -07:00 |
|
Benoit Steiner
|
e6e77ed08b
|
Don't call lgamma_r when compiling for an Apple device, since the function isn't available on MacOS
|
2016-11-02 09:55:39 -07:00 |
|
Benoit Steiner
|
b238f387b4
|
Pulled latest updates from trunk
|
2016-11-02 08:53:13 -07:00 |
|
Benoit Steiner
|
c8db17301e
|
Special functions require math.h: make sure it is included.
|
2016-11-02 08:51:52 -07:00 |
|
Gael Guennebaud
|
a07bb428df
|
bug #1004: improve accuracy of LinSpaced for abs(low) >> abs(high).
|
2016-11-02 11:34:38 +01:00 |
|
Gael Guennebaud
|
598de8b193
|
Add pinsertfirst function and implement pinsertlast for complex on SSE/AVX.
|
2016-11-02 10:38:13 +01:00 |
|
Benoit Steiner
|
e44519744e
|
Merged in benoitsteiner/opencl (pull request PR-243)
Fixed the ambiguity in callig make_tuple for sycl backend.
|
2016-11-02 02:56:58 +00:00 |
|
Rasmus Munk Larsen
|
0a6ae41555
|
Merged eigen/eigen into default
|
2016-11-01 15:37:00 -07:00 |
|
Rasmus Munk Larsen
|
b730952414
|
Don't attempts to use lgamma_r for CUDA devices.
Fix type in lgamma_impl<double>.
|
2016-11-01 15:34:19 -07:00 |
|
Benoit Steiner
|
7a0e96b80d
|
Gate the code that refers to cuda fp16 primitives more thoroughly
|
2016-11-01 12:08:09 -07:00 |
|
Mehdi Goli
|
51af6ae971
|
Fixed the ambiguity in callig make_tuple for sycl backend.
|
2016-10-31 16:35:51 +00:00 |
|
Benoit Steiner
|
0a9ad6fc72
|
Worked around Visual Studio compilation errors
|
2016-10-28 07:54:27 -07:00 |
|
Benoit Steiner
|
d5f88e2357
|
Sharded the tensor_image_patch test to help it run on low power devices
|
2016-10-27 21:48:21 -07:00 |
|