Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GEMM/IGEMM tests not working as expected for RVV? #8096

Open
ken-unger opened this issue Mar 21, 2025 · 2 comments
Open

GEMM/IGEMM tests not working as expected for RVV? #8096

ken-unger opened this issue Mar 21, 2025 · 2 comments

Comments

@ken-unger
Copy link
Contributor

@dsharlet in PR#7932 in response to my noting that the unit test didn't catch my dumb rvv igemm bug, you said;

Yes it does seem weird the unit tests would not catch this. I am pretty sure the unit tests catch far more subtle bugs than this. I suspect there's a problem specific to RVV, could you please take a look? There's a lot that is unique to the RVV tests due to the non-constant SIMD length.

I started looking into this a bit, and for sure there seems to be some problem. I've noted that qd8-f32-qc8w-igemm-minmax-test and qu8-igemm-minmax-fp32-test (an upcoming RVV PR) do not actually execute any of the tests claimed to pass! In other gemm & igemm tests I see very few RVV tests actually executed, and in some cases, notably qu8-igemm-minmax-fp32-test, even the scalar tests do not appear to be actually executed.

So far just simplistic debugging, with a couple of printfs to see if it ever makes it here https://github.com/google/XNNPACK/blob/master/test/gemm-microkernel-tester.cc#L80

I'll look at this further in the next few days but thought I'd log this issue to track,

@ken-unger
Copy link
Contributor Author

ken-unger commented Mar 22, 2025

I'm probably not understanding something correctly, since this has been here for a while but shouldn't https://github.com/google/XNNPACK/blob/master/test/gemm-microkernel-tester.cc#L70

for (size_t bl = params.loop_bl_.from; bl <= tester.k() / 2; bl = params.loop_bl_.next(bl)) {

be this instead? Or at least this solves the issue.

for (size_t bl = params.loop_bl_.from; bl <= (tester.k() + 1) / 2; bl = params.loop_bl_.next(bl)) {

@CNOCycle
Copy link

This issue has been reported in #7949. Some propoposed solutions were proposed and await further discussion.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants