mirror of
https://gitlab.com/libeigen/eigen.git
synced 2024-12-21 07:19:46 +08:00
00f32752f7
* Unifying all loadLocalTile from lhs and rhs to an extract_block function.
* Adding get_tensor operation which was missing in TensorContractionMapper.
* Adding the -D method missing from cmake for Disable_Skinny Contraction operation.
* Wrapping all the indices in TensorScanSycl into Scan parameter struct.
* Fixing typo in Device SYCL
* Unifying load to private register for tall/skinny no shared
* Unifying load to vector tile for tensor-vector/vector-tensor operation
* Removing all the LHS/RHS class for extracting data from global
* Removing Outputfunction from TensorContractionSkinnyNoshared.
* Combining the local memory version of tall/skinny and normal tensor contraction into one kernel.
* Combining the no-local memory version of tall/skinny and normal tensor contraction into one kernel.
* Combining General Tensor-Vector and VectorTensor contraction into one kernel.
* Making double buffering optional for Tensor contraction when the local memory version is used.
* Modifying benchmark to accept custom Reduction Sizes
* Disabling AVX optimization for SYCL backend on the host to allow SSE optimization on the host
* Adding Test for SYCL
* Modifying SYCL CMake
31 lines
729 B
Bash
Executable File
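#!/bin/bash
# Remove any stale binary so that a failed build cannot run an old executable.
rm -f tensor_benchmark_sycl

# Abort with an error message if the ComputeCpp install location is unset or empty.
: "${COMPUTECPP_PACKAGE_ROOT_DIR:?Need to set COMPUTECPP_PACKAGE_ROOT_DIR}"
echo "COMPUTECPP_PACKAGE_ROOT_DIR is set to: ${COMPUTECPP_PACKAGE_ROOT_DIR}"
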
# Compile the benchmark with ComputeCpp. Any extra arguments passed to this
# script are forwarded to compute++ after the output name.
"${COMPUTECPP_PACKAGE_ROOT_DIR}/bin/compute++" \
  tensor_benchmarks_sycl.cc \
  benchmark_main.cc \
  -I ../../ \
  -I "${COMPUTECPP_PACKAGE_ROOT_DIR}/include/" \
  -std=c++11 \
  -march=native \
  -O3 \
  -DNDEBUG \
  -DEIGEN_MPL2_ONLY \
  -DEIGEN_USE_SYCL=1 \
  -DEIGEN_SYCL_LOCAL_MEM=1 \
  -no-serial-memop \
  -mllvm \
  -inline-threshold=10000 \
  -fsycl-ih-last \
  -sycl-driver \
  -Xclang -cl-mad-enable \
  -lOpenCL \
  -lComputeCpp \
  -lpthread \
  -o \
  tensor_benchmark_sycl \
  "$@"

# Make the ComputeCpp runtime library visible to the loader, then run the benchmark.
export LD_LIBRARY_PATH="${COMPUTECPP_PACKAGE_ROOT_DIR}/lib:${LD_LIBRARY_PATH}"
./tensor_benchmark_sycl