feat: add support for int8 quantization on linear layers #299

pommedeterresautee · 2023-02-27T09:42:43Z

add int8 matmul + scalers support
update PyTorch to last available (post branch cut)
update triton to 2.0 (requires to rework some kernels)

test pass


================================================== warnings summary ==================================================
test/test_model_optimization.py: 1 warning
test/test_torchdynamo.py: 79 warnings
  /home/geantvert/.local/share/virtualenvs/kernl/lib/python3.9/site-packages/torch/cuda/graphs.py:79: UserWarning: The CUDA Graph is empty. This ususally means that the graph was attempted to be captured on wrong device or stream. (Triggered internally at ../aten/src/ATen/cuda/CUDAGraph.cpp:191.)
    super().capture_end()

test/debugger/test_memory.py::test_load_is_in_different_memory
  /mnt/workspace/kernl/test/debugger/test_memory.py:58: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly.  To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage()
    assert t.storage().data_ptr() != a.storage().data_ptr()

-- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html
============================= 2885 passed, 3 skipped, 81 warnings in 10684.37s (2:58:04) =============================

# Conflicts: # .github/workflows/test.yaml # Dockerfile # README.md # docs/how-to-guides/get-started.md # requirements.txt

pommedeterresautee added 4 commits February 26, 2023 14:04

feat: basic scaler implementation + test

35aeabe

feat: add linear layers and related unit tests

f1998f4

feat: rework tests

f760dd0

feat: update PyTorch 2.0

eb3fb8f

pommedeterresautee added the feature label Feb 27, 2023

pommedeterresautee self-assigned this Feb 27, 2023

github-actions bot added feature and removed feature labels Feb 27, 2023

Merge branch 'main' into feat/quantization

13d6ae4

github-actions bot added feature and removed feature labels Feb 28, 2023

Merge branch 'main' into feat/quantization

4eb9fa8

# Conflicts: # .github/workflows/test.yaml # Dockerfile # README.md # docs/how-to-guides/get-started.md # requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add support for int8 quantization on linear layers #299

feat: add support for int8 quantization on linear layers #299

pommedeterresautee commented Feb 27, 2023 •

edited

Loading

feat: add support for int8 quantization on linear layers #299

Are you sure you want to change the base?

feat: add support for int8 quantization on linear layers #299

Conversation

pommedeterresautee commented Feb 27, 2023 • edited Loading

pommedeterresautee commented Feb 27, 2023 •

edited

Loading