Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tweak SVE dot kernel #4382

Merged
merged 2 commits into from
Dec 19, 2023
Merged

Tweak SVE dot kernel #4382

merged 2 commits into from
Dec 19, 2023

Conversation

Mousius
Copy link
Contributor

@Mousius Mousius commented Dec 19, 2023

This changes the SVE dot kernel to only predicate when necessary as well as streamlining the assembly a bit. The benchmarks seem to indicate this can improve performance by ~33%.

This changes the SVE dot kernel to only predicate when necessary as well
as streamlining the assembly a bit. The benchmarks seem to indicate this
can improve performance by ~33%.
@martin-frbg martin-frbg added this to the 0.3.26 milestone Dec 19, 2023
@martin-frbg martin-frbg merged commit fa220b2 into OpenMathLib:develop Dec 19, 2023
61 of 63 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants