[PHI] feat: support gqa for blha #64640
Your PR was submitted successfully. Thank you for contributing to this open-source project!
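FYI: I updated the unit tests. RoPE in blha still has some bugs in GQA mode, and I am debugging them. All other tests pass locally.

The GQA and MHA unit tests are currently written separately. I tried putting both in a single test, but a merged test would inevitably have to run the following code twice, which produces wrong results. Working around that would mean setting up two more seq_lens_decoder variables, which is too many variables for one test.

self.seq_lens_decoder[:] = self.seq_lens_encoder
self.seq_lens_encoder[:] = 0
self.seq_lens_this_time[:] = 1

For that reason I chose to keep the GQA unit tests separate from the MHA unit tests.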
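A minimal sketch of one plausible reading of the problem described above: the three assignments mutate shared state in place, so applying them a second time copies the already-zeroed encoder lengths into the decoder lengths. The helper names `make_state` and `switch_to_decode` are hypothetical, and plain Python lists stand in for the arrays used in the real test.

```python
def make_state(prompt_lens):
    """Build a hypothetical per-batch state right after the encoder (prefill) phase."""
    return {
        "seq_lens_encoder": list(prompt_lens),
        "seq_lens_decoder": [0] * len(prompt_lens),
        "seq_lens_this_time": list(prompt_lens),
    }


def switch_to_decode(state):
    """Apply the encoder-to-decoder transition from the comment, in place."""
    n = len(state["seq_lens_encoder"])
    state["seq_lens_decoder"][:] = state["seq_lens_encoder"]
    state["seq_lens_encoder"][:] = [0] * n
    state["seq_lens_this_time"][:] = [1] * n


state = make_state([5, 3])

switch_to_decode(state)
once = list(state["seq_lens_decoder"])   # decoder now holds the prompt lengths

switch_to_decode(state)
twice = list(state["seq_lens_decoder"])  # second run copies the zeroed encoder lengths

print(once, twice)
```

Giving each test its own freshly built state, as the separate GQA and MHA tests do, avoids the second in-place transition altogether.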
FYI: I split the GQA unit tests from the MHA unit tests and ran the tests locally.
Sorry to inform you that the CI runs for 5dd9b13 passed more than 7 days ago. To prevent PR conflicts, you need to re-run all CIs manually.
PR Category
Others
PR Types
Improvements
Description
Enhance blha so that it supports GQA (grouped-query attention).
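For readers unfamiliar with GQA, the sketch below illustrates the general idea for a single decode step: several query heads share one key/value head, and MHA is the special case where the two head counts are equal. This is a pure-Python illustration under assumed tensor layouts, not the kernel changed in this PR; the function name `gqa_attention` and the shapes are hypothetical.

```python
import math


def gqa_attention(q, k, v):
    """Single-token grouped-query attention.

    q: [num_q_heads][head_dim]
    k, v: [num_kv_heads][seq_len][head_dim]
    Returns [num_q_heads][head_dim].
    """
    num_q_heads, num_kv_heads = len(q), len(k)
    assert num_q_heads % num_kv_heads == 0, "query heads must divide evenly into KV heads"
    group_size = num_q_heads // num_kv_heads  # 1 means plain MHA
    scale = 1.0 / math.sqrt(len(q[0]))

    out = []
    for h in range(num_q_heads):
        kv = h // group_size  # query head h reads from this shared KV head
        scores = [scale * sum(a * b for a, b in zip(q[h], key)) for key in k[kv]]
        peak = max(scores)  # subtract the max for a numerically stable softmax
        weights = [math.exp(s - peak) for s in scores]
        total = sum(weights)
        weights = [w / total for w in weights]
        head_dim = len(v[kv][0])
        out.append([sum(w * val[d] for w, val in zip(weights, v[kv]))
                    for d in range(head_dim)])
    return out


# 4 query heads sharing 2 KV heads (group size 2), sequence length 2.
q = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
k = [[[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]]]
v = [[[1.0, 2.0], [3.0, 4.0]], [[5.0, 6.0], [7.0, 8.0]]]
out = gqa_attention(q, k, v)
```

In this example query heads 0 and 1 carry the same query and share KV head 0, so their outputs are identical; the same holds for heads 2 and 3 with KV head 1.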