Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adds better Affine support for GPUs when using CUDA 11. Introduces a new bias addition kernel for CUDA < 11 #778

Merged
merged 23 commits into from
Apr 9, 2021

Commits on Dec 8, 2020

  1. Added new inference operator that peforms bias addition and optionall…

    …y RELU for inference. When upgrading to cuda 11, the bias and relu can be fused into the matrix multiply with cublasLt.
    rhenry-nv committed Dec 8, 2020
    Configuration menu
    Copy the full SHA
    66006ec View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    23a3223 View commit details
    Browse the repository at this point in the history

Commits on Dec 9, 2020

  1. Configuration menu
    Copy the full SHA
    dbb1653 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    3e94764 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    4b41b86 View commit details
    Browse the repository at this point in the history

Commits on Dec 15, 2020

  1. Configuration menu
    Copy the full SHA
    cc30a32 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    792669f View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    0c0b8ac View commit details
    Browse the repository at this point in the history
  4. Removes NVIDIA notices

    rhenry-nv committed Dec 15, 2020
    Configuration menu
    Copy the full SHA
    6258fd5 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    764d7b8 View commit details
    Browse the repository at this point in the history
  6. Format changes

    rhenry-nv committed Dec 15, 2020
    Configuration menu
    Copy the full SHA
    95fb427 View commit details
    Browse the repository at this point in the history
  7. updates changelog

    rhenry-nv committed Dec 15, 2020
    Configuration menu
    Copy the full SHA
    4bbf17f View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    256e397 View commit details
    Browse the repository at this point in the history

Commits on Mar 6, 2021

  1. Configuration menu
    Copy the full SHA
    493c213 View commit details
    Browse the repository at this point in the history

Commits on Mar 23, 2021

  1. Configuration menu
    Copy the full SHA
    e5b549d View commit details
    Browse the repository at this point in the history

Commits on Mar 25, 2021

  1. refactor

    emjotde committed Mar 25, 2021
    Configuration menu
    Copy the full SHA
    a484893 View commit details
    Browse the repository at this point in the history
  2. remove previous code

    emjotde committed Mar 25, 2021
    Configuration menu
    Copy the full SHA
    9de84d4 View commit details
    Browse the repository at this point in the history
  3. add unit tests

    emjotde committed Mar 25, 2021
    Configuration menu
    Copy the full SHA
    43c54ce View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    04b0f95 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    5c7b6ee View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    c027c6d View commit details
    Browse the repository at this point in the history

Commits on Apr 2, 2021

  1. Merge pull request #2 from marian-nmt/mjd/refactor_cuda11

    Small refactoring of cuda_11
    rhenry-nv authored Apr 2, 2021
    Configuration menu
    Copy the full SHA
    7ea2d09 View commit details
    Browse the repository at this point in the history

Commits on Apr 9, 2021

  1. Configuration menu
    Copy the full SHA
    36d8f0c View commit details
    Browse the repository at this point in the history