Optimize attn v2 2.0: use customized kernel for attention score #1199

Closed
xinhaoc wants to merge 45 commits

xinhaoc (Collaborator) commented Oct 19, 2023

Description of changes:

Related Issues:

Linked Issues:

  • Issue #

Issues closed by this PR:

  • Closes #

jiazhihao added the inference label Oct 19, 2023
jiazhihao mentioned this pull request Oct 19, 2023
2 tasks
xinhaoc (Collaborator, Author) commented Oct 23, 2023

#1206

xinhaoc marked this pull request as ready for review November 8, 2023 19:06
xinhaoc requested a review from jiazhihao November 8, 2023 19:06
xinhaoc closed this Nov 11, 2023
Labels
inference: Features and fixes related to the inference project.
Projects
Status: Done
2 participants