Usage of Vector API in Jlama #19
anbusampath started this conversation in General
Replies: 1 comment 1 reply
-
Hi, LLMs perform many matrix multiplications. In fact, that's where roughly 90% of the processing time goes when running inference. You can run a matrix multiplication on a CPU with plain old Java loops, or you can run it faster using the Vector API. You can also run it on a GPU. You can see both implementations in Jlama.
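As a rough sketch (not Jlama's actual code), this is what the plain-loop version looks like. Jlama's faster path replaces the inner dot-product loop with `jdk.incubator.vector.FloatVector` operations so each iteration processes several floats at once:

```java
// Minimal sketch of the plain-loop matrix multiplication that dominates
// inference time. A Vector API version would replace the inner loop with
// FloatVector loads, fused multiply-adds, and a final reduceLanes(ADD),
// letting the CPU process 8 or 16 floats per instruction (SIMD).
public class MatMulSketch {
    // C[m x n] = A[m x k] * B[k x n], all stored row-major in flat arrays
    static float[] matmul(float[] a, float[] b, int m, int k, int n) {
        float[] c = new float[m * n];
        for (int i = 0; i < m; i++) {
            for (int j = 0; j < n; j++) {
                float sum = 0f;
                // This inner dot-product loop is what the Vector API vectorizes.
                for (int p = 0; p < k; p++) {
                    sum += a[i * k + p] * b[p * n + j];
                }
                c[i * n + j] = sum;
            }
        }
        return c;
    }

    public static void main(String[] args) {
        // 2x2 example: [[1,2],[3,4]] * [[5,6],[7,8]] = [[19,22],[43,50]]
        float[] c = matmul(new float[]{1, 2, 3, 4},
                           new float[]{5, 6, 7, 8}, 2, 2, 2);
        System.out.println(c[0] + " " + c[1] + " " + c[2] + " " + c[3]);
    }
}
```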
-
I am new to LLMs. My understanding is that Java's Vector API uses SIMD instructions to vectorize code across different machine architectures. Where is SIMD used in Jlama? Since LLMs already use embedding models to communicate (input/output) as vectors, why do we also need the Vector API?