Is it possible to use open source embeddings for the vector similarity search? #158

dustyatx · 2023-07-22T17:26:36Z

dustyatx
Jul 22, 2023

The default tantivy tokenizers seem like basic NLP tokenization (unless I'm missing something, I'm no expert). I've been using open source embeddings models with other vector databases, is that possible with CozoDB? If so is there any advantage or disadvantage to using them?

I've been mainly using:
e5-large-v2

Also any chance someone has a basic walk through on how to load up tokens/embeddings? Datalog and this flavor of it is a bit of brain bender (I'm slowly getting it).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is it possible to use open source embeddings for the vector similarity search? #158

{{title}}

Replies: 0 comments

Select a reply

Is it possible to use open source embeddings for the vector similarity search? #158

dustyatx Jul 22, 2023

Replies: 0 comments

dustyatx
Jul 22, 2023