
Fix paddle.mode and paddle.bincount API #62995

Closed

Conversation

Contributor

@xingmingyyj xingmingyyj commented Mar 25, 2024

PR Category

Others

PR Types

Others

Description

The paddle.mode and paddle.bincount APIs produce incorrect results when built into and executed as a static graph. Analysis shows the cause is the same as the problem encountered in #62801; this PR fixes the output dtypes to follow the data types actually used in the kernels.
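The dtype dispatch in the bincount kernel (quoted in full further down this thread) amounts to a simple three-way rule. The sketch below is illustrative pseudologic only, not Paddle code; the function name and string dtype tags are assumptions made for the example:

```python
# Illustrative sketch (not Paddle API) of the output-dtype rule the
# bincount CUDA kernel implements, which this PR mirrors in InferMeta.
def bincount_out_dtype(weights_dtype):
    """weights_dtype is None when no weights tensor is passed."""
    if weights_dtype is None:
        return "int64"    # kernel allocates int64 when has_weights is false
    if weights_dtype == "float32":
        return "float32"  # float32 weights -> float32 output
    return "float64"      # any other weights dtype -> double output

print(bincount_out_dtype(None))       # int64
print(bincount_out_dtype("float32"))  # float32
```

InferMeta must declare the same dtype the kernel later allocates, otherwise the executor retrieves the tensor with the wrong type at runtime.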


paddle-bot bot commented Mar 25, 2024

Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

@paddle-bot paddle-bot bot added the contributor External developers label Mar 25, 2024
-  out->set_dtype(weights.dtype());
+  if (weights.dtype() == DataType::FLOAT32) {
+    out->set_dtype(DataType::FLOAT32);
+  } else {
+    out->set_dtype(DataType::FLOAT64);
+  }
Contributor

Is there any difference between this and out->set_dtype(weights.dtype());? The original version actually seems more concise.

Contributor Author

Is there any difference between this and out->set_dtype(weights.dtype());? The original version actually seems more concise.

This change follows this logic in the kernel:

  if (!has_weights) {
    int64_t* output_data = dev_ctx.template Alloc<int64_t>(output);
    phi::funcs::SetConstant<Context, int64_t>()(
        dev_ctx, output, static_cast<int64_t>(0));

    KernelBincount<T, InputT, int64_t>
        <<<GET_BLOCKS(input_numel), PADDLE_CUDA_NUM_THREADS, 0, stream>>>(
            input_data, input_numel, has_weights, weights_data, output_data);
  } else {
    if (weights->dtype() == DataType::FLOAT32) {
      float* output_data = dev_ctx.template Alloc<float>(output);
      phi::funcs::SetConstant<Context, float>()(
          dev_ctx, output, static_cast<float>(0));

      KernelBincount<T, InputT, float>
          <<<GET_BLOCKS(input_numel), PADDLE_CUDA_NUM_THREADS, 0, stream>>>(
              input_data, input_numel, has_weights, weights_data, output_data);
    } else {
      double* output_data = dev_ctx.template Alloc<double>(output);
      phi::funcs::SetConstant<Context, double>()(
          dev_ctx, output, static_cast<double>(0));
      KernelBincount<T, InputT, double>
          <<<GET_BLOCKS(input_numel), PADDLE_CUDA_NUM_THREADS, 0, stream>>>(
              input_data, input_numel, has_weights, weights_data, output_data);
    }
  }
}

The logic here does not match out->set_dtype(weights.dtype());.
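To make the difference concrete, here is a hypothetical side-by-side of the two rules in plain Python (not Paddle code): out->set_dtype(weights.dtype()) propagates whatever dtype weights has, while the kernel only ever produces float32 or double when weights are present, so the two rules disagree for any non-float weights dtype such as int32:

```python
def set_from_weights(weights_dtype):
    # what out->set_dtype(weights.dtype()) would declare
    return weights_dtype

def kernel_allocates(weights_dtype):
    # what the kernel actually allocates when weights are present
    return "float32" if weights_dtype == "float32" else "float64"

for wd in ("float32", "float64", "int32"):
    agree = set_from_weights(wd) == kernel_allocates(wd)
    print(wd, agree)  # float32 True, float64 True, int32 False
```

If weights were guaranteed to be float32 or float64 the two forms would be equivalent, which is presumably why the original one-liner looked sufficient; the explicit branch only matters for other weights dtypes.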

Contributor

Please also add what the dtype of weights is now.

@xingmingyyj xingmingyyj requested a review from kangguangli March 27, 2024 09:08

paddle-ci-bot bot commented Apr 2, 2024

Sorry to inform you that commit 2564443's CIs have been passing for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually.

@xingmingyyj
Contributor Author

Additional notes on the bincount error:
When the following dynamic-to-static code is executed:

......
paddle.seed(33)
obj = naive_func
dy_out = obj(in_tensor, in_params, func)

paddle.seed(33)
jit_obj = paddle.jit.to_static(obj)
st_out = jit_obj(in_tensor, in_params, func)
print("dy_out is: ", dy_out)
print("st_out is: ", st_out)

paddle.jit.save(jit_obj, path="bincount")
print("jit.save is successfully !!!")

paddle.seed(33)
jit = paddle.jit.load("bincount")
print("jit.load is successfully !!!")

paddle.seed(33)
inputs_key = sorted(in_tensor.keys())
inputs_value = []
for k in inputs_key:
    inputs_value.append(in_tensor[k])
# print('inputs_value is: ', inputs_value)
res = jit(*inputs_value)
print('jit.load res: ', res)

compare(dy_out, res, delta=1e-5, rtol=1e-6)

The error is as follows:

Traceback (most recent call last):
  File "/home/aistudio/fix_op/Paddle/tools/fix_bitcount.py", line 106, in <module>
    res = jit(*inputs_value)
  File "/home/aistudio/fix_op/Paddle/build/python/paddle/nn/layer/layers.py", line 1429, in __call__
    return self.forward(*inputs, **kwargs)
  File "/home/aistudio/fix_op/Paddle/build/python/paddle/jit/translated_layer.py", line 1475, in __i_m_p_l__
    return _run_dygraph(self, input, program_holder)
  File "/home/aistudio/fix_op/Paddle/build/python/paddle/jit/translated_layer.py", line 1002, in _run_dygraph
    _legacy_C_ops.run_program(
ValueError: In user code:


    InvalidArgumentError: The type of data we are trying to retrieve (int32) does not match the type of data (int64) currently contained in the container.
      [Hint: Expected dtype() == phi::CppTypeToDataType<T>::Type(), but received dtype():9 != phi::CppTypeToDataType<T>::Type():7.] (at /home/aistudio/fix_op/Paddle/paddle/phi/core/dense_tensor.cc:161)
      [operator < pd_kernel.phi_kernel > error]  [operator < run_program > error]

Here we can see that in the scale operator, the tensor's actual data type does not match the currently expected data type.
The computation graph run by the executor is as follows:

{
    (%0) = "data(phi_kernel)" () {dtype:(pd_op.DataType)bool,is_persistable:[false],kernel_key:<backend:GPU|layout:Undefined(AnyLayout)|dtype:int32>,kernel_name:"data",name:"_jst.0.a.0",op_name:"pd_op.data",place:(pd_op.Place)Place(gpu:0),shape:(pd_op.IntArray)[],stop_gradient:[false]} : () -> gpu_tensor<10xi32>
    (%1) = "full(phi_kernel)" () {dtype:(pd_op.DataType)int32,kernel_key:<backend:CPU|layout:Undefined(AnyLayout)|dtype:int32>,kernel_name:"full",op_name:"pd_op.full",place:(pd_op.Place)Place(cpu),shape:(pd_op.IntArray)[1],stop_gradient:[true],value:(Float)0} : () -> cpu_tensor<1xi32>
    (%2) = "bincount(phi_kernel)" (%0, <<NULL VALUE>>, %1) {is_persistable:[false],kernel_key:<backend:GPU|layout:NCHW|dtype:int32>,kernel_name:"bincount",op_name:"pd_op.bincount",stop_gradient:[false]} : (gpu_tensor<10xi32>, <<NULL TYPE>>, cpu_tensor<1xi32>) -> gpu_tensor<-1xi32>
    (%3) = "full(phi_kernel)" () {dtype:(pd_op.DataType)float32,kernel_key:<backend:CPU|layout:Undefined(AnyLayout)|dtype:float32>,kernel_name:"full",op_name:"pd_op.full",place:(pd_op.Place)Place(cpu),shape:(pd_op.IntArray)[1],stop_gradient:[true],value:(Float)1} : () -> cpu_tensor<1xf32>
    (%4) = "scale(phi_kernel)" (%2, %3) {bias:(Float)0,bias_after_scale:true,is_persistable:[false],kernel_key:<backend:GPU|layout:NCHW|dtype:int32>,kernel_name:"scale",op_name:"pd_op.scale",stop_gradient:[false]} : (gpu_tensor<-1xi32>, cpu_tensor<1xf32>) -> gpu_tensor<-1xi32>
    () = "builtin.shadow_output" (%4) {output_name:"translated_layer/scale_0.tmp_0"} : (gpu_tensor<-1xi32>) -> 
}

The guess is that this is caused by the dtype setting in InferMeta. Here weights is empty and x.dtype is int32, so the output dtype was set to int32, which does not match the following kernel logic.

  if (!has_weights) {
    int64_t* output_data = dev_ctx.template Alloc<int64_t>(output);
    phi::funcs::SetConstant<Context, int64_t>()(
        dev_ctx, output, static_cast<int64_t>(0));

    KernelBincount<T, InputT, int64_t>
        <<<GET_BLOCKS(input_numel), PADDLE_CUDA_NUM_THREADS, 0, stream>>>(
            input_data, input_numel, has_weights, weights_data, output_data);
  }
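The mismatch can be reconstructed in miniature: the graph declares the bincount output as i32 (propagated from x.dtype), while the kernel's no-weights branch allocates int64, which is exactly the int32-vs-int64 complaint in the InvalidArgumentError above. The toy check below is illustrative only; the "old InferMeta rule" is an assumption inferred from the printed graph, not Paddle source:

```python
# Toy reconstruction of the failure (not Paddle code).
def old_infermeta_dtype(x_dtype, weights_dtype=None):
    # presumed old rule: with no weights, the output dtype follows x
    return x_dtype if weights_dtype is None else weights_dtype

def kernel_dtype(weights_dtype=None):
    # the kernel allocates int64 whenever has_weights is false
    if weights_dtype is None:
        return "int64"
    return "float32" if weights_dtype == "float32" else "float64"

declared = old_infermeta_dtype("int32")   # what the graph records: i32
actual = kernel_dtype()                   # what the tensor really holds: int64
print(declared, actual, declared == actual)  # int32 int64 False
```

Aligning InferMeta with kernel_dtype removes the disagreement, so the downstream scale op retrieves the tensor with the type it actually contains.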


paddle-ci-bot bot commented Apr 12, 2024

Sorry to inform you that commit e9d0862's CIs have been passing for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually.
