Elastic Rerank model landing page #2884
Conversation
A documentation preview will be available soon. Request a new doc build by commenting
If your PR continues to fail for an unknown reason, the doc build pipeline may be broken. Elastic employees can check the pipeline status here.
Thank you for writing this up! I left a few suggestions, mostly nits to use attributes and to decrease word count. Please take or leave them.
Co-authored-by: István Zoltán Szabó <[email protected]>
It's important to note that if you rerank to depth `n`, you will need to run `n` inferences per query. Each inference includes the document text, so it is significantly more expensive than inference for query embeddings. Hardware can be scaled to run these inferences in parallel, but for CPU inference we recommend shallow reranking: no more than the top 30 results. You may find the preview version cost-prohibitive for high query rates and low query latency requirements. We plan to address performance for GA.
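For a concrete sense of how rerank depth is capped, here is a sketch of a `text_similarity_reranker` retriever limited to the top 30 first-stage hits via `rank_window_size` (the index name `my-index`, the `text` field, the query text, and the endpoint name `my-elastic-rerank` are placeholders, not part of this page):

```
GET my-index/_search
{
  "retriever": {
    "text_similarity_reranker": {
      "retriever": {
        "standard": {
          "query": { "match": { "text": "what is elastic rerank" } }
        }
      },
      "field": "text",
      "inference_id": "my-elastic-rerank",
      "inference_text": "what is elastic rerank",
      "rank_window_size": 30
    }
  }
}
```

With `rank_window_size: 30`, only 30 inferences run per query, however many documents the first-stage retriever matches.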
// Is air-gapped deployment supported?
Yes, air-gapped download is supported. The same instructions as ELSER apply; just change the model id:
https://www.elastic.co/guide/en/machine-learning/master/ml-nlp-elser.html#air-gapped-install
ah ok cool thanks
@davidkyle are those model artifact files already available?
found 'em :)
Looks like we just have the cross-platform version initially?
Yes, there is no platform-specific model for rerank.
Co-authored-by: David Kyle <[email protected]>
@davidkyle I updated with air-gapped instructions, copying the ELSER instructions but removing the trained model UI instructions and just replacing them with "create inference endpoint".
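For reference, a minimal sketch of that "create inference endpoint" step, assuming the cross-platform model id `.rerank-v1` discussed above (the endpoint name `my-elastic-rerank` and the allocation/thread counts are placeholders):

```
PUT _inference/rerank/my-elastic-rerank
{
  "service": "elasticsearch",
  "service_settings": {
    "model_id": ".rerank-v1",
    "num_allocations": 1,
    "num_threads": 1
  }
}
```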
LGTM
```
xpack.ml.model_repository: file://${path.home}/config/models/`
```
there's an extra backtick at the end of this line
nice catch, fixing!
When using the {ref}/semantic-text.html[`semantic_text` field type], text is divided into chunks. By default, each chunk contains 250 words (approximately 400 tokens). Be cautious when increasing the chunk size: if the combined length of your query and chunk text exceeds 512 tokens, the model won't have access to the full content.
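As an illustration, chunk size is controlled on the inference endpoint rather than on the field itself. This sketch pins the default word-based chunking explicitly via `chunking_settings` (the endpoint name `my-embeddings`, the index name `my-index`, and the choice of `.multilingual-e5-small` are placeholders, not part of this page):

```
PUT _inference/text_embedding/my-embeddings
{
  "service": "elasticsearch",
  "service_settings": {
    "model_id": ".multilingual-e5-small",
    "num_allocations": 1,
    "num_threads": 1
  },
  "chunking_settings": {
    "strategy": "word",
    "max_chunk_size": 250,
    "overlap": 100
  }
}

PUT my-index
{
  "mappings": {
    "properties": {
      "text": { "type": "semantic_text", "inference_id": "my-embeddings" }
    }
  }
}
```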
When the combined inputs exceed the 512 token limit, a balanced truncation strategy is used. If both the query and the input text are longer than 255 tokens each, then both are truncated; otherwise, the longer one is truncated.
This is very clear phrasing!
That's because it's 98% @tveasey and @davidkyle 😉
(cherry picked from commit 65aa83a) Co-authored-by: Liam Thompson <[email protected]>