[Pten] add cuda implement of cast kernel #37610

MingMingShangTian · 2021-11-26T10:24:13Z

PR types

New features

PR changes

Others

Describe

Add cuda implement of cast kernel

paddle-bot-old · 2021-11-26T10:24:17Z

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

chenwhql · 2021-11-29T07:04:15Z

paddle/pten/kernels/functions/cuda/cast_kernel_impl.h

+using CUDAContext = paddle::platform::CUDADeviceContext;
+
+template <typename InT, typename OutT, int VecSize>
+__global__ void VecCastCUDAKernel(const InT* in, const int64_t N, OutT* out) {


不建议copy代码过来，麻烦确保只有一份代码维护，否则原实现优化后，这里没更新，又会有问题

好的，后续我再重提PR修复一下

* add cuda implement of cast kernel * remove bfloat16 when defined paddle_with_hip

add cuda implement of cast kernel

0c3a892

remove bfloat16 when defined paddle_with_hip

77a8e8b

MingMingShangTian changed the title ~~add cuda implement of cast kernel~~ [Pten] add cuda implement of cast kernel Nov 29, 2021

chenwhql approved these changes Nov 29, 2021

View reviewed changes

chenwhql reviewed Nov 29, 2021

View reviewed changes

MingMingShangTian merged commit 9956763 into PaddlePaddle:develop Nov 29, 2021

MingMingShangTian deleted the cuda_cast_kernel branch November 29, 2021 07:06

Zjq9409 pushed a commit to Zjq9409/Paddle that referenced this pull request Dec 10, 2021

[Pten] add cuda implement of cast kernel (PaddlePaddle#37610)

5174c96

* add cuda implement of cast kernel * remove bfloat16 when defined paddle_with_hip

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Pten] add cuda implement of cast kernel #37610

[Pten] add cuda implement of cast kernel #37610

MingMingShangTian commented Nov 26, 2021

paddle-bot-old bot commented Nov 26, 2021

chenwhql Nov 29, 2021

MingMingShangTian Nov 29, 2021

[Pten] add cuda implement of cast kernel #37610

[Pten] add cuda implement of cast kernel #37610

Conversation

MingMingShangTian commented Nov 26, 2021

PR types

PR changes

Describe

paddle-bot-old bot commented Nov 26, 2021

chenwhql Nov 29, 2021

Choose a reason for hiding this comment

MingMingShangTian Nov 29, 2021

Choose a reason for hiding this comment