@enkilee enkilee commented Aug 22, 2025

PR Category

Execute Infrastructure

PR Types

Bug fixes

Description

The following configuration triggers a CUDA illegal memory access:
 paddle.index_add(Tensor([10],"float32"), Tensor([0],"int32"), 0, Tensor([4],"float32"), ) 
 (External) CUDA error(700), an illegal memory access was encountered. 
  [Hint: 'cudaErrorIllegalAddress'. The device encountered a load or store instruction on an invalid memory address. This leaves the process in an inconsistentstate and any further CUDA work will return the same error. To continue using CUDA, the process must be terminated and relaunched. ] (at /home/aistudio/Paddle/paddle/fluid/pybind/eager_functions.cc:1435)

terminate called after throwing an instance of 'common::enforce::EnforceNotMet'
  what():  (External) CUDA error(700), an illegal memory access was encountered. 
  [Hint: 'cudaErrorIllegalAddress'. The device encountered a load or store instruction on an invalid memory address. This leaves the process in an inconsistentstate and any further CUDA work will return the same error. To continue using CUDA, the process must be terminated and relaunched. ] (at /home/aistudio/Paddle/paddle/phi/core/platform/device/gpu/gpu_info.cc:348)



--------------------------------------
C++ Traceback (most recent call last):
--------------------------------------
0   std::_Sp_counted_base<(__gnu_cxx::_Lock_policy)2>::_M_release()
1   phi::DenseTensor::~DenseTensor()
2   std::_Sp_counted_deleter<phi::Allocation*, std::function<void (phi::Allocation*)>, std::allocator<void>, (__gnu_cxx::_Lock_policy)2>::_M_dispose()
3   paddle::memory::allocation::CUDAAllocator::FreeImpl(phi::Allocation*)

----------------------
Error Message Summary:
----------------------
FatalError: `Process abort signal` is detected by the operating system.
  [TimeInfo: *** Aborted at 1755826917 (unix time) try "date -d @1755826917" if you are using GNU date ***]
  [SignalInfo: *** SIGABRT (@0x3e800000e48) received by PID 3656 (TID 0x7f57ee946740) from PID 3656 ***]

Aborted (core dumped)
Changes in this PR:
  1. Check `src_dim_idx` in `index_add_cuda_kernel`.
  2. Add `PADDLE_ENFORCE_NOT_NULL`.
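
For readers unfamiliar with the failure mode, here is a minimal pure-Python sketch of the kind of validation described above. It is not Paddle's implementation: the function name `index_add_1d` and its error messages are hypothetical, it handles only the 1-D case, and it treats negative indices as out of range, which may differ from Paddle's behavior. The point it illustrates is that each source position in `value` needs a matching entry in `index`, so an empty `index` paired with a 4-element `value` must be rejected before any element is read.

```python
def index_add_1d(x, index, value):
    """Hypothetical 1-D reference: return a copy of x with value[i] added at x[index[i]]."""
    # Every source position must have a matching index entry; otherwise a
    # loop over `value` would read past the end of `index`.
    if len(index) != len(value):
        raise ValueError(
            f"index has {len(index)} entries but value has {len(value)}"
        )

    out = list(x)
    for src_idx, addend in enumerate(value):
        dst_idx = index[src_idx]
        # Reject destinations outside x (assumption: no negative indexing).
        if not 0 <= dst_idx < len(out):
            raise IndexError(
                f"index[{src_idx}] = {dst_idx} is out of range for size {len(out)}"
            )
        out[dst_idx] += addend
    return out
```

With the shapes from the report (a 10-element `x`, an empty `index`, and a 4-element `value`), this sketch raises `ValueError` instead of reading an index entry that does not exist.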

paddle-bot bot commented Aug 22, 2025

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

@paddle-bot paddle-bot bot added the contributor External developers label Aug 22, 2025

enkilee commented Aug 25, 2025

/re-run all-failed

DanielSun11 commented Aug 26, 2025

Please add the results of the PaddleAPITest regression run.

#72637
PaddleAPITest repository: https://github.com/PFCCLab/PaddleAPITest
Location of the failing configurations:

codecov-commenter commented Sep 4, 2025

Codecov Report

❌ Patch coverage is 88.88889% with 1 line in your changes missing coverage. Please review.
⚠️ Please upload report for BASE (develop@955e32c). Learn more about missing BASE report.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| paddle/phi/infermeta/binary.cc | 88.88% | 1 Missing ⚠️ |

❌ Your patch status has failed because the patch coverage (88.88%) is below the target coverage (90.00%). You can increase the patch coverage or adjust the target coverage.

Additional details and impacted files
@@            Coverage Diff             @@
##             develop   #74831   +/-   ##
==========================================
  Coverage           ?   88.88%           
==========================================
  Files              ?        1           
  Lines              ?        9           
  Branches           ?        0           
==========================================
  Hits               ?        8           
  Misses             ?        1           
  Partials           ?        0           


enkilee commented Sep 10, 2025

@DanielSun11 please review when you are free. Thanks.

@DanielSun11 DanielSun11 left a comment

LGTM

@luotao1 luotao1 merged commit 2008f33 into PaddlePaddle:develop Sep 10, 2025
58 of 60 checks passed
zrr1999 pushed a commit to zrr1999/Paddle that referenced this pull request Sep 11, 2025
* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix