cuTransB, cuTransA, N, M, K, &alpha, B, ldb, A, lda, &beta, C, N));
}
There are two problems with the current `DeviceContext` design:

1. `DeviceContext` has an interface called `GetEigenDevice` (Paddle/paddle/platform/device_context.h, lines 38 to 51 in 00b64f6). However, Eigen is not supported on every kind of device, e.g. AMD graphics cards. `GetEigenDevice` should be moved to the derived `DeviceContext` classes.
2. Functors take `Place` as their template parameter (Paddle/paddle/operators/math/math_function.cu, lines 24 to 45 in 00b64f6). We have to cast the `DeviceContext` to `CUDADeviceContext` even though we already know we are implementing the CUDA version of the functor. Instead, we should take `DeviceContext` as the template parameter. At the same time, the template parameter of `OpKernel` should also be `DeviceContext` instead of `Place`.
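
To make the proposal concrete, here is a rough sketch of what both changes might look like together. It is not the actual Paddle code: `FakeEigenCpuDevice`, `OtherDeviceContext`, `QueueId`, `ScaleFunctor` and `ScaleKernel` are hypothetical names made up for illustration, and the functor bodies run on the host so the example compiles without CUDA or Eigen.

```cpp
#include <string>
#include <vector>

// Base class: only what every device can provide. No Eigen accessor here.
class DeviceContext {
 public:
  virtual ~DeviceContext() = default;
  virtual std::string Name() const = 0;
};

// Hypothetical stand-in for an Eigen device handle.
struct FakeEigenCpuDevice {};

// A device that supports Eigen exposes GetEigenDevice on the derived class.
class CPUDeviceContext : public DeviceContext {
 public:
  std::string Name() const override { return "cpu"; }
  const FakeEigenCpuDevice& GetEigenDevice() const { return eigen_device_; }

 private:
  FakeEigenCpuDevice eigen_device_;
};

// Hypothetical device without Eigen support: it simply has no GetEigenDevice.
class OtherDeviceContext : public DeviceContext {
 public:
  std::string Name() const override { return "other"; }
  int QueueId() const { return 7; }  // made-up device-specific handle
};

// The functor is templated on the context type instead of Place.
template <typename DeviceContextT, typename T>
struct ScaleFunctor;

template <typename T>
struct ScaleFunctor<CPUDeviceContext, T> {
  // The argument already has the concrete type, so no cast is needed.
  void operator()(const CPUDeviceContext& ctx, T alpha,
                  std::vector<T>* x) const {
    (void)ctx.GetEigenDevice();  // derived-only interface, used directly
    for (auto& v : *x) v *= alpha;
  }
};

template <typename T>
struct ScaleFunctor<OtherDeviceContext, T> {
  void operator()(const OtherDeviceContext& ctx, T alpha,
                  std::vector<T>* x) const {
    (void)ctx.QueueId();  // device-specific handle, also without a cast
    for (auto& v : *x) v *= alpha;
  }
};

// The kernel takes the same template parameter and forwards the context.
template <typename DeviceContextT, typename T>
struct ScaleKernel {
  void Compute(const DeviceContextT& ctx, T alpha, std::vector<T>* x) const {
    ScaleFunctor<DeviceContextT, T>()(ctx, alpha, x);
  }
};
```

With this layout, `ScaleKernel<CPUDeviceContext, float>().Compute(cpu_ctx, 2.0f, &x)` picks the matching functor specialization at compile time, and calling `GetEigenDevice` on a context that does not define it becomes a compile error rather than a runtime problem.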