docs: rewrite image reasoning example without multimodal LLM #17148

masci · 2024-12-04T18:28:25Z

Description

Rewrite the "image reasoning" multimodal example leveraging the new ChatMessage and using the "plain" OpenAI class instead of the multimodal version.

Part of #15950

Before

openai_mm_llm = OpenAIMultiModal(model="gpt-4o", max_new_tokens=300)

image_documents = load_image_urls(image_urls)
msg = generate_openai_multi_modal_chat_message(
    prompt="Describe the images as an alternative text",
    role="user",
    image_documents=image_documents,
)

response = openai_mm_llm.chat(messages=[msg])

After

openai_llm = OpenAI(model="gpt-4o", max_new_tokens=300)

msg = ChatMessage(
    role=MessageRole.USER,
    blocks=[
        TextBlock(text="Describe the images as an alternative text"),
        ImageBlock(url=image_urls[0]),
        ImageBlock(url=image_urls[1]),
    ],
)
response = openai_llm.chat(messages=[msg])

review-notebook-app · 2024-12-04T18:28:30Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

logan-markewich

Lgtm! I wonder if we want to note in the example that so far only openai multi modal supports this syntax so far?

masci · 2024-12-05T08:05:15Z

Lgtm! I wonder if we want to note in the example that so far only openai multi modal supports this syntax so far?

My only concern would be maintenance, i.e. remembering to update/remove the note as we roll out blocks usage to other providers...

masci added 2 commits December 4, 2024 19:25

bubble up content types import path

1f0b8c9

rewrite image reasoning example without multimodalLLM

54c6206

dosubot bot added the size:XS This PR changes 0-9 lines, ignoring generated files. label Dec 4, 2024

masci changed the title ~~docs: rewrite image reasoning example without multimodalLLM~~ docs: rewrite image reasoning example without multimodal LLM Dec 4, 2024

logan-markewich approved these changes Dec 5, 2024

View reviewed changes

dosubot bot added the lgtm This PR has been approved by a maintainer label Dec 5, 2024

masci merged commit 3c666ea into main Dec 5, 2024
11 checks passed

masci deleted the massi/image-reasoning branch December 5, 2024 08:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: rewrite image reasoning example without multimodal LLM #17148

docs: rewrite image reasoning example without multimodal LLM #17148

masci commented Dec 4, 2024 •

edited

Loading

review-notebook-app bot commented Dec 4, 2024

logan-markewich left a comment

masci commented Dec 5, 2024

docs: rewrite image reasoning example without multimodal LLM #17148

docs: rewrite image reasoning example without multimodal LLM #17148

Conversation

masci commented Dec 4, 2024 • edited Loading

Description

Before

After

review-notebook-app bot commented Dec 4, 2024

logan-markewich left a comment

Choose a reason for hiding this comment

masci commented Dec 5, 2024

masci commented Dec 4, 2024 •

edited

Loading