GenAI: define conventions for embeddings operations #1603

trentm · 2024-11-21T22:58:11Z

Refs: #1174

Many LLMs support an Embeddings API, for example:

This proposal defines OpenTelemetry semantic conventions to use for instrumenting Embeddings API client usage.

Overview

A span will be created for Embeddings API calls. The only differences from existing chat spans are:

A new embeddings value is added for attribute gen_ai.operation.name.
A new gen_ai.request.encoding_formats attribute is defined. It is only relevant for embeddings operations.
Many of the "recommended" span attributes do not apply to embeddings operations. Perhaps that is fine as currently defined. (Open question: should a note: ... be added or if applicable. added to the brief for those span attributes?)

No new events are proposed. See "Notes" below.

For metrics:

The existing gen_ai.client.token.usage metric applies, with the note that output tokens do not apply for embeddings operations, so the metric would only be recorded with the gen_ai.token.type: 'input' attribute (as already specified).
The existing gen_ai.client.operation.duration metric applies as currently specified.

Example span

This an example embeddings operation span using the openai client library and the OTel JS ConsoleSpanExporter:

{
  traceId: '1dea787b9cd7bb895aee5bb74090610d',
  parentId: undefined,
  traceState: undefined,
  name: 'embeddings text-embedding-ada-002',
  id: '65aafaf145cf535a',
  kind: 2,
  timestamp: 1732229356238000,
  duration: 551555.417,
  attributes: {
    'gen_ai.operation.name': 'embeddings',
    'gen_ai.request.model': 'text-embedding-ada-002',
    'gen_ai.system': 'openai',
    'gen_ai.request.encoding_formats': [ 'float' ],
    'gen_ai.response.model': 'text-embedding-ada-002',
    'gen_ai.usage.input_tokens': 9
  },
  status: { code: 0 },
  events: [],
  links: []
}

Notes

This section contains supporting notes and reasoning for some of the proposed values.

A span attribute for the dimensions parameter (in the OpenAI API) was considered, but dropped as likely not being useful. Happy to revisit that if others know of a reasonable use case.
A (log) event to record the input strings to the Embeddings API call is not being proposed. The reasoning is that API calls for embeddings are expected to be higher volume, and the input strings less valuable to application observability than chat content, so the cost-benefit ratio is much less. If a good use case is presented for optionally recording Embeddings input strings, then this can be revisited.
Recording Embeddings API response vectors in telemetry is not being proposed. Vectors are large and would not be useful for observability.
The operation name embeddings was selected. Other possible options considered:
1. embeddings
2. embedding
3. embed
My inclination is embeddings to match the (OpenAI) API name: embeddings.create.
Cohere's API name is embed.
LangTrace uses embed.
Current semconv values are chat (e.g. for openai.chat.completions.create()) and text_completion (for the deprecated openai.chat.create()).
The Cohere and Anthropic APIs have a request attribute input_type, for creating embeddings for inputs other than text. This might be worth considering adding as well. I have not currently proposed this because I have only prototyped with OpenAI.

Refs: open-telemetry#1174

CONTRIBUTING.md

model/gen-ai/registry.yaml

docs/attributes-registry/gen-ai.md

lmolkova

Left a few minor comments, looks great otherwise!

model/gen-ai/registry.yaml

model/gen-ai/spans.yaml

… feedback from luidmilla

…n, per Liudmila suggestion

…ly removed

lmolkova

Looks great!

trentm · 2024-11-22T23:09:07Z

^^ the check failure is:

   ERROR: 1 dead links found in ./docs/system/system-metrics.md !
  [✖] https://blogs.oracle.com/linux/post/understanding-linux-kernel-memory-statistics → Status: 0
make: *** [Makefile:72: markdown-link-check] Error 1

I'm guessing it was a fluke server error response from that site. The URL exists for me. The check passes for me locally as well:

% make markdown-link-check
semantic-conventions2@ /Users/trentm/tm/semantic-conventions2
└── [email protected]

I don't have permissions to re-run that workflow run.

lmolkova · 2024-11-22T23:16:16Z

the link check is not required and oracle blog is the usual offender. We need another approval to merge though. @open-telemetry/semconv-genai-approvers ptal!

karthikscale3

Thanks for starting this

… semconv Update attributes to match open-telemetry/semantic-conventions#1603

axiomofjoy · 2024-11-28T04:27:48Z

Just wanted to add some thoughts on observability workflows for embeddings since they are important to our product at Arize. We use both embedding vectors and their associated text for debugging and troubleshooting workflows. Because embedding vectors map semantically similar content to nearby vectors, you can use them for observability workflows such as:

clustering to find semantically meaningful groups of user queries, and then measuring performance/ computing evaluation metrics on those clusters to gain granular understanding of your application performance on a meaningful subset of data
embedding-based similarity search to find semantically similar examples to a chosen example, e.g., for debugging or curating datasets for fine-tuning
understanding how query and corpus distributions differ (e.g., to identify out-of-distribution queries that likely do not have answers in the embedded corpus)

These workflows require both embedding vectors and associated content (text, image, etc.), which we include as semantic conventions in the OpenInference spec.

I've included a few resources below for additional context.

embeddings.mp4

… semconv (#36) Update attributes to match open-telemetry/semantic-conventions#1603

GenAI: define conventions for embeddings operations

23fa496

Refs: open-telemetry#1174

trentm commented Nov 21, 2024

View reviewed changes

CONTRIBUTING.md Show resolved Hide resolved

model/gen-ai/registry.yaml Outdated Show resolved Hide resolved

trentm added 3 commits November 21, 2024 15:26

add changlog entry

c21ed6e

fix changelog entry (forgot to save before commit)

fa11c05

update generated output for latest changes

879f36e

xrmx approved these changes Nov 22, 2024

View reviewed changes

xrmx reviewed Nov 22, 2024

View reviewed changes

model/gen-ai/registry.yaml Outdated Show resolved Hide resolved

trentm added 2 commits November 22, 2024 09:18

Merge branch 'main' into tm-genai-embeddings

71aa5e9

s/APIs/GenAI systems/ per xrmx feedback

2350ab0

trentm marked this pull request as ready for review November 22, 2024 17:22

trentm requested review from a team as code owners November 22, 2024 17:22

xrmx reviewed Nov 22, 2024

View reviewed changes

docs/attributes-registry/gen-ai.md Outdated Show resolved Hide resolved

lmolkova approved these changes Nov 22, 2024

View reviewed changes

model/gen-ai/registry.yaml Show resolved Hide resolved

model/gen-ai/spans.yaml Show resolved Hide resolved

lmolkova added the area:gen-ai label Nov 22, 2024

trentm added 2 commits November 22, 2024 10:57

moving 'if applicable' note from registry to the span definition, per…

d79bf33

… feedback from luidmilla

typo

b053fa1

lmolkova mentioned this pull request Nov 22, 2024

Remove requirement_level usages from registry and add a policy to check #1606

Merged

trentm added 2 commits November 22, 2024 12:31

mention that many gen_ai span attrs depend on the particular operatio…

cdb9e09

…n, per Liudmila suggestion

restore 'requirement_level: recommended' to output_tokens, accidental…

12f4409

…ly removed

trentm requested a review from lmolkova November 22, 2024 20:36

lmolkova approved these changes Nov 22, 2024

View reviewed changes

Merge branch 'main' into tm-genai-embeddings

5ee32c8

karthikscale3 approved these changes Nov 27, 2024

View reviewed changes

alizenhom approved these changes Nov 27, 2024

View reviewed changes

xrmx added a commit to elastic/elastic-otel-python-instrumentations that referenced this pull request Nov 27, 2024

elastic-opentelemetry-instrumentation-openai: match proposed upstream…

e521443

… semconv Update attributes to match open-telemetry/semantic-conventions#1603

xrmx mentioned this pull request Nov 27, 2024

elastic-opentelemetry-instrumentation-openai: match proposed upstream semconv elastic/elastic-otel-python-instrumentations#36

Merged

AlexanderWert approved these changes Nov 27, 2024

View reviewed changes

Merge branch 'main' into tm-genai-embeddings

f253432

lmolkova merged commit 0c17ad5 into open-telemetry:main Nov 27, 2024
14 checks passed

trentm deleted the tm-genai-embeddings branch November 27, 2024 19:17

xrmx added a commit to elastic/elastic-otel-python-instrumentations that referenced this pull request Nov 28, 2024

elastic-opentelemetry-instrumentation-openai: match proposed upstream…

1720368

… semconv (#36) Update attributes to match open-telemetry/semantic-conventions#1603

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GenAI: define conventions for embeddings operations #1603

GenAI: define conventions for embeddings operations #1603

trentm commented Nov 21, 2024

lmolkova left a comment

lmolkova left a comment

trentm commented Nov 22, 2024

lmolkova commented Nov 22, 2024

karthikscale3 left a comment

axiomofjoy commented Nov 28, 2024

GenAI: define conventions for embeddings operations #1603

GenAI: define conventions for embeddings operations #1603

Conversation

trentm commented Nov 21, 2024

Overview

Example span

Notes

lmolkova left a comment

Choose a reason for hiding this comment

lmolkova left a comment

Choose a reason for hiding this comment

trentm commented Nov 22, 2024

lmolkova commented Nov 22, 2024

karthikscale3 left a comment

Choose a reason for hiding this comment

axiomofjoy commented Nov 28, 2024