diff --git a/assets/css/main.css b/assets/css/main.css
index f1eaa75..0cd7eac 100644
--- a/assets/css/main.css
+++ b/assets/css/main.css
@@ -3249,7 +3249,7 @@ input, select, textarea {
  }

  .wrapper.style2 {
-   background-color: #888888;
+   background-color: #525252;
  }

  .wrapper.style2-alt {
diff --git a/index.html b/index.html
index 17e90a3..f444da5 100644
--- a/index.html
+++ b/index.html
@@ -51,7 +51,7 @@

Personalizing VLMs for User-Specific Queries



@@ -59,12 +59,11 @@

Personalizing VLMs for User-Specific Queries

-
+
-

Overview Video

@@ -100,8 +99,8 @@

Introducing MyVLM

-

The Vision Language Models

-

We apply MyVLM to various VLM architectures for personalizing captioning, visual question-answering, and referring expression comprehension.

+

The Vision Language Models

+

We apply MyVLM to various VLM architectures for personalized captioning, visual question-answering, and referring expression comprehension.

@@ -158,12 +157,12 @@

Step 3: Communicating the Concept

Results

  •  For each row, we show sample images of the target concept to the left
- •  Images to the right represent the input images passed to MyVLM
- •  Personalized responses generated by MyVLM can be seen by hovering each image
+ •  The remaining images represent the input images passed to MyVLM
+ •  Personalized responses generated by MyVLM can be seen by hovering over each image
  •  S* represents our concept's name

Personalized Captioning

-

   Hover over the images to see the personalized captions!

+

Hover over the images to see the personalized captions!

@@ -366,7 +365,7 @@

Personalized Captioning

-

Personalized Visual Question-Answering

+

Personalized Visual Question-Answering

@@ -498,7 +497,7 @@

Personalized Visual Question-Answering

Personalized Referring Expression Comprehension

-

Hover over the images to see the personalized captions!

+

Hover over the images to see the personalized captions!

@@ -546,14 +545,14 @@

Personalized Referring Expression Comprehension

-

Acknowledgements

+

Acknowledgements

This research was performed while Yuval Alaluf was at Snap.

We would like to thank Assaf Ben-Kish, Or Patashnik, Moran Yanuka, Morris Alper, Yonatan Biton, and Yuwei Fang for their fruitful discussions and valuable input which helped improve this work.

-

BibTeX

+

BibTeX

@article{}