Hybrid Search #653

yuhongsun96 · 2023-10-30T04:06:25Z

No description provided.

yuhongsun96 · 2023-10-30T04:07:30Z

backend/danswer/document_index/vespa/index.py

@@ -615,7 +612,7 @@ def keyword_retrieval(
            "query": final_query,
            "input.query(decay_factor)": str(DOC_TIME_DECAY * decay_multiplier),
            "hits": num_to_retrieve,
-            "num_to_rerank": 10 * num_to_retrieve,


This was never doing anything :/

yuhongsun96 · 2023-10-30T04:08:06Z

backend/danswer/document_index/vespa/index.py

@@ -640,7 +637,6 @@ def semantic_retrieval(
            # needed for highlighting while the N-gram highlighting is broken /
            # not working as desired
            + f'or ({{defaultIndex: "{CONTENT_SUMMARY}"}}userInput(@query)))'
-            + _build_vespa_limit(num_to_retrieve)


As far as testing shows, putting the limit in the YQL vs. in the params behaves the same; it feels easier/cleaner to just have it in the params.
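
For anyone reading along, here is a minimal sketch of the two ways to cap the result count. The function names and the base YQL string are hypothetical and not taken from this PR; the sketch relies only on the YQL `limit` clause and the `hits` request parameter. With no offset, both should cap the response at the same number of hits, which is consistent with the testing mentioned above.

```python
from __future__ import annotations


def params_with_yql_limit(
    base_yql: str, query: str, num_to_retrieve: int
) -> dict[str, str | int]:
    """Cap results inside the YQL string itself (the approach being removed)."""
    return {"yql": f"{base_yql} limit {num_to_retrieve}", "query": query}


def params_with_hits(
    base_yql: str, query: str, num_to_retrieve: int
) -> dict[str, str | int]:
    """Cap results through the `hits` request parameter (the approach kept)."""
    return {"yql": base_yql, "query": query, "hits": num_to_retrieve}


# Hypothetical base query, for illustration only.
BASE_YQL = "select * from sources * where userInput(@query)"

old_style = params_with_yql_limit(BASE_YQL, "hybrid search", 10)
new_style = params_with_hits(BASE_YQL, "hybrid search", 10)
```

Keeping the cap in the params leaves the YQL string free of per-request values, so the same string can be reused across calls.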

yuhongsun96 · 2023-10-30T04:08:39Z

backend/danswer/document_index/vespa/index.py

        )

        params: dict[str, str | int] = {
            "yql": yql,
            "query": query,
            "hits": num_to_retrieve,
-            "num_to_rerank": 10 * num_to_retrieve,
+            "offset": 0,


Set all of these to 0 for now, just so we remember how to do it later if we decide to introduce pagination.
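
A rough sketch of how pagination could build on this later, assuming a zero-based `page` argument. Both `page` and the helper name `build_paged_params` are hypothetical and not part of this PR; the only thing taken from the diff is the shape of the `params` dict.

```python
from __future__ import annotations


def build_paged_params(
    yql: str, query: str, num_to_retrieve: int, page: int = 0
) -> dict[str, str | int]:
    """Build query params for one page of results.

    `page` is a hypothetical zero-based page index; page 0 reproduces
    the current behaviour of always sending offset 0.
    """
    if num_to_retrieve <= 0:
        raise ValueError("num_to_retrieve must be positive")
    if page < 0:
        raise ValueError("page must be non-negative")
    return {
        "yql": yql,
        "query": query,
        "hits": num_to_retrieve,
        # Number of leading hits to skip before returning `hits` results.
        "offset": page * num_to_retrieve,
    }


first_page = build_paged_params("select ...", "hybrid search", 10)
third_page = build_paged_params("select ...", "hybrid search", 10, page=2)
```

With 10 hits per page, the third page skips the first 20 results, so its `offset` is 20.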

yuhongsun96 · 2023-10-30T04:14:12Z

backend/danswer/configs/model_configs.py

@@ -21,7 +21,9 @@
 DOC_EMBEDDING_DIM = 384
 # Model should be chosen with 512 context size, ideally don't change this
 DOC_EMBEDDING_CONTEXT_SIZE = 512
-NORMALIZE_EMBEDDINGS = (os.environ.get("SKIP_RERANKING") or "False").lower() == "true"


yuhongsun96 added 2 commits October 29, 2023 14:16

checkpoint

031a179

done

0bbae43

yuhongsun96 commented Oct 30, 2023

vercel bot deployed to Preview October 30, 2023 04:07 View deployment

yuhongsun96 commented Oct 30, 2023

random thing

cc198b0

vercel bot deployed to Preview October 30, 2023 04:30 View deployment

Weves approved these changes Oct 30, 2023

yuhongsun96 merged commit 52c0d6e into main Oct 30, 2023
2 checks passed

yuhongsun96 deleted the hybrid-search branch October 30, 2023 05:18

sidravi1 pushed a commit to IDinsight/danswer that referenced this pull request Nov 20, 2023

Hybrid Search (onyx-dot-app#653)

9b78625
