Torch.compile with LLM entrypoint #1055

Giuseppe5 · 2024-10-14T09:35:10Z

Is your feature request related to a problem? Please describe.
`torch.compile` allows faster inference with quantization. Examples are provided in the imagenet/ptq entrypoint and in the stable_diffusion entrypoint.

Describe the solution you'd like
Investigate whether this can be extended to the LLM entrypoint, and in particular whether it is compatible with accelerate.
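As a starting point for that investigation, a minimal sketch of the basic check could look like the following: compile a module and confirm that its output matches eager execution. `TinyMLP` is a hypothetical stand-in, not a Brevitas quantized model or an actual LLM, and accelerate is not involved here. `backend="eager"` is an assumption made so the sketch runs without a C++ toolchain; any speedup would come from the default inductor backend.

```python
import torch
import torch.nn as nn


class TinyMLP(nn.Module):
    """Hypothetical stand-in for one LLM feed-forward block (not a Brevitas model)."""

    def __init__(self, dim: int = 16) -> None:
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(dim, 4 * dim),
            nn.GELU(),
            nn.Linear(4 * dim, dim),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x)


torch.manual_seed(0)
model = TinyMLP().eval()

# backend="eager" captures the graph with TorchDynamo but runs it with regular
# PyTorch kernels, so it checks compatibility only and gives no speedup.
compiled_model = torch.compile(model, backend="eager")

x = torch.randn(2, 16)
with torch.no_grad():
    eager_out = model(x)
    compiled_out = compiled_model(x)  # the first call triggers compilation

# The compiled module should reproduce the eager result.
outputs_match = torch.allclose(eager_out, compiled_out, atol=1e-6)
print(outputs_match)
```

The same comparison could then be repeated with the quantized LLM in place of `TinyMLP`, first without and then with accelerate, to see at which step compilation breaks.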

Giuseppe5 added the enhancement (New feature or request) and good first issue (Good for newcomers) labels Oct 14, 2024

Giuseppe5 changed the title ~~Torch.compile with LLM entrypoing~~ Torch.compile with LLM entrypoint Oct 14, 2024

Giuseppe5 added the examples (Anything related to examples) label Oct 14, 2024
