Optimize QS8 GIO packing using AVXVNNI instruction #7497

Open: xujuntwt95329 wants to merge 6 commits into master from xujuntwt95329:qs8-gio-avxvnni

xujuntwt95329
Contributor

No description provided.

copybara-service bot pushed a commit that referenced this pull request Nov 22, 2024
--
378c835 by Xu Jun <[email protected]>:

optimize qs8-x8c8-gio packw using avxvnni

--
12915c9 by Xu Jun <[email protected]>:

fix _mm_unpacklo_epi32

--
67c98df by Xu Jun <[email protected]>:

fix w movement in kc remainder

--
a60cf60 by Xu Jun <[email protected]>:

vectorize NC remainder

--
3b3dac2 by Xu Jun <[email protected]>:

fix typo

--
ed80587 by Xu Jun <[email protected]>:

rename safe_load_64bits to safe_load_u64

FUTURE_COPYBARA_INTEGRATE_REVIEW=#7497 from xujuntwt95329:qs8-gio-avxvnni ed80587
PiperOrigin-RevId: 699061552
3 participants