Torch.compile with LLM entrypoint #1055
Labels
enhancement
New feature or request
examples
Anything related to examples
good first issue
Good for newcomers
Is your feature request related to a problem? Please describe.
`torch.compile` allows faster inference with quantized models. Examples are already provided in the imagenet/ptq entrypoint and in the stable_diffusion entrypoint.
Describe the solution you'd like
Investigate whether this can be extended to the LLM entrypoint, in particular its compatibility with accelerate.
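As a starting point for the investigation, here is a minimal sketch of the kind of check that could be run: compile a model with `torch.compile` and confirm its outputs match eager mode. `TinyLM`, `compile_for_inference`, and `check_matches_eager` are hypothetical names used only for illustration. The sketch does not use the actual LLM entrypoint, applies no quantization, and does not involve accelerate; whether the same check passes on a quantized model dispatched through accelerate is exactly the open question.

```python
import torch
import torch.nn as nn


class TinyLM(nn.Module):
    """Hypothetical stand-in for an LLM: embedding followed by a linear head."""

    def __init__(self, vocab_size: int = 32, hidden: int = 16):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)
        self.head = nn.Linear(hidden, vocab_size)

    def forward(self, input_ids: torch.Tensor) -> torch.Tensor:
        return self.head(self.embed(input_ids))


def compile_for_inference(model: nn.Module, backend: str = "inductor") -> nn.Module:
    """Put the model in eval mode and wrap it with torch.compile."""
    model.eval()
    return torch.compile(model, backend=backend)


def check_matches_eager(
    model: nn.Module,
    input_ids: torch.Tensor,
    backend: str = "inductor",
    atol: float = 1e-4,
) -> bool:
    """Return True if the compiled model agrees with eager mode on one input."""
    compiled = compile_for_inference(model, backend=backend)
    with torch.no_grad():
        expected = model(input_ids)
        actual = compiled(input_ids)  # compilation is triggered on this first call
    return torch.allclose(expected, actual, atol=atol)
```

Calling `check_matches_eager(TinyLM(), torch.randint(0, 32, (2, 5)))` exercises the default `inductor` backend, which needs a working compiler toolchain. Passing `backend="eager"` only exercises graph capture, which may be useful for isolating tracing failures from code generation failures.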