Add Florence2SAM2 #24

dillonalaird · 2024-08-03T02:10:04Z

Note: had to name the file something other than sam2.py or else I get naming conflicts
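The note does not say exactly how the conflict showed up. One plausible mechanism, sketched here as an assumption, is module shadowing: a local file called `sam2.py` can win over an installed `sam2` package when its directory comes first on `sys.path`. The sketch uses a made-up module name (`demo_sam2`) and temporary directories, so it does not touch any real package.

```python
import importlib
import sys
import tempfile
from pathlib import Path


def import_from(search_dirs: list[str], name: str):
    """Import `name` with `search_dirs` placed first on sys.path, then restore state."""
    saved_path = list(sys.path)
    sys.modules.pop(name, None)
    sys.path[:0] = search_dirs
    try:
        importlib.invalidate_caches()
        return importlib.import_module(name)
    finally:
        sys.path[:] = saved_path
        sys.modules.pop(name, None)


with tempfile.TemporaryDirectory() as tmp:
    # "site" stands in for site-packages; "local" for a folder of wrapper modules.
    site = Path(tmp, "site")
    local = Path(tmp, "local")
    (site / "demo_sam2").mkdir(parents=True)
    (site / "demo_sam2" / "__init__.py").write_text("ORIGIN = 'installed package'\n")
    local.mkdir()
    (local / "demo_sam2.py").write_text("ORIGIN = 'local wrapper file'\n")

    # Whichever directory is searched first decides what `import demo_sam2` means.
    shadowed = import_from([str(local), str(site)], "demo_sam2").ORIGIN
    intended = import_from([str(site), str(local)], "demo_sam2").ORIGIN

print(shadowed)
print(intended)
```

Giving the wrapper a distinct name such as `florence2_sam2.py` avoids this ambiguity regardless of search order.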

camiloaz

Approving in @dillonalaird's name

CamiloInx · 2024-08-13T18:40:58Z

vision_agent_tools/shared_types.py



 class BaseTool:
    pass


+DType = TypeVar("DType", bound=np.generic)
+
+VideoNumpy = Annotated[npt.NDArray[DType], Literal["N", "N", "N", 3]]


@dillonalaird I think this value is already defined in vision_agent_tools/types.py. We should probably merge that file with the shared_types.py file that you are moving out of the tools folder.

I merged them. I moved it to this file.

sorry, I thought I deleted the other file but I did not. I will delete the file, but I migrated all the imports to use this file.

I just pushed the changes
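For reference, a minimal sketch of what a single shared definition could look like, including the imports the diff hunk above does not show. The `is_video_array` helper is hypothetical and not part of the PR; it is there only because the `Annotated` metadata documents the expected shape and is not enforced at runtime.

```python
from typing import Annotated, Literal, TypeVar

import numpy as np
import numpy.typing as npt

DType = TypeVar("DType", bound=np.generic)

# Shape metadata is descriptive only: (frames, height, width, 3 channels).
VideoNumpy = Annotated[npt.NDArray[DType], Literal["N", "N", "N", 3]]


def is_video_array(arr: np.ndarray) -> bool:
    """Hypothetical runtime check matching the shape described by VideoNumpy."""
    return arr.ndim == 4 and arr.shape[-1] == 3
```

Keeping the alias in one module means every tool imports the same definition instead of redefining it.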

CamiloInx · 2024-08-13T18:44:34Z

vision_agent_tools/tools/florence2_sam2.py

+        self.image_predictor.set_image(np.array(image, dtype=np.uint8))
+        annotation_id = 0
+        for prompt in prompts:
+            with torch.autocast(device_type="cuda", dtype=torch.float16):


@dillonalaird I think it would be better to define a self.device value that stores the device type instead of hard-coding it.

thanks! done
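A minimal sketch of the suggested pattern, assuming the device is picked once in the constructor. The class name and the `run` method are hypothetical stand-ins and not the PR's actual code; `bfloat16` is used here because it is accepted by autocast on both CUDA and CPU.

```python
import torch


class DeviceAwareTool:
    """Hypothetical stand-in for a tool that stores its device type once."""

    def __init__(self) -> None:
        # Decide the device a single time instead of hard-coding "cuda" at each call site.
        self.device = "cuda" if torch.cuda.is_available() else "cpu"

    def run(self, x: torch.Tensor) -> torch.Tensor:
        x = x.to(self.device)
        # Every autocast block reuses self.device, so the code also runs on CPU-only machines.
        with torch.autocast(device_type=self.device, dtype=torch.bfloat16):
            return x @ x.T


tool = DeviceAwareTool()
out = tool.run(torch.ones(2, 3))
print(tool.device, tuple(out.shape))
```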

CamiloInx · 2024-08-13T18:45:20Z

vision_agent_tools/tools/florence2_sam2.py

+                    image, PromptTask.CAPTION_TO_PHRASE_GROUNDING, prompt
+                )[PromptTask.CAPTION_TO_PHRASE_GROUNDING]["bboxes"]
+            if return_mask:
+                with torch.autocast(device_type="cuda", dtype=torch.bfloat16):


CamiloInx · 2024-08-13T18:47:09Z

vision_agent_tools/tools/florence2_sam2.py

+        objs = self.get_bbox_and_mask(
+            Image.fromarray(video[0]).convert("RGB"), prompts, return_mask=False
+        )
+        with torch.autocast(device_type="cuda", dtype=torch.bfloat16):


Same here, change to self.device

CamiloInx · 2024-08-13T18:50:44Z

vision_agent_tools/tools/florence2_sam2.py

+    @validate_call(config={"arbitrary_types_allowed": True})
+    @torch.inference_mode()
+    def __call__(
+        self, media: Image.Image | VideoNumpy, prompts: list[str]


@dillonalaird @camiloaz should we handle the input as separate image and video values, as we do here? I think we should be consistent and change one of the two to use the same format.

yeah, agree. will do that.
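One way the separate-argument style could look, sketched under the assumption that exactly one of the two inputs must be given. The function name `resolve_media` and its return convention (a list of frames) are illustrative only and do not come from the PR.

```python
from typing import Optional, Sequence


def resolve_media(
    image: Optional[object] = None,
    video: Optional[Sequence[object]] = None,
) -> list[object]:
    """Return the frames to process, requiring exactly one of image or video."""
    if (image is None) == (video is None):
        raise ValueError("Provide exactly one of `image` or `video`.")
    if image is not None:
        # Treat a single image as a one-frame sequence.
        return [image]
    return list(video)


print(resolve_media(image="frame0"))
print(resolve_media(video=["frame0", "frame1"]))
```

Separate keyword arguments make the caller's intent explicit and avoid having to infer the media type from the value passed in.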

CamiloInx · 2024-08-13T18:52:38Z

tests/tools/test_florence2_sam2.py

+
+def test_successful_florence2_sam2_image():
+    """
+    This test verifies that CLIPMediaSim returns a valid iresponse when passed a target_text


I think you should change CLIPMediaSim to Florence2SAM2

done. thanks.

CamiloInx · 2024-08-13T18:53:32Z

tests/tools/test_florence2_sam2.py

+
+def test_successful_florence2_sam2_video():
+    """
+    This test verifies that CLIPMediaSim returns a valid iresponse when passed a target_text


Same here, please change CLIPMediaSim

CamiloInx · 2024-08-13T18:54:27Z

tests/tools/test_florence2_sam2.py

+
+def test_florence2_sam2_invalid_media():
+    """
+    This test verifies that CLIPMediaSim raises a ValueError if the media is not a valid type.


CamiloInx

Nice!

dillonalaird requested review from CamiloInx and camiloaz August 3, 2024 02:10

dillonalaird changed the title ~~Feat/add sam2~~ Add SAM2 Aug 3, 2024

dillonalaird added 3 commits August 7, 2024 11:02

added sam2

1359d2b

added sam2 predictor

248a797

added sam to dependencies

d07105c

dillonalaird force-pushed the feat/add-sam2 branch from 7cc624c to d07105c Compare August 7, 2024 19:15

dillonalaird and others added 8 commits August 7, 2024 13:04

fixed sam2 naming, added as optional

6ef04a2

updated sam2 with latest changes from sam2 repo

1ab1b16

Merge branch 'main' into feat/add-sam2

ab8976e

typing and improvements

e759c79

tests and fixes

4a1678b

remove viz code

be4ac30

rename class

f718957

remove comment

4d9613b

camiloaz changed the title ~~Add SAM2~~ Add Florence2SAM2 Aug 13, 2024

use context manager

6df24de

camiloaz previously approved these changes Aug 13, 2024

better dependencies

8ea859e

camiloaz dismissed their stale review via 8ea859e August 13, 2024 17:41

CamiloInx requested changes Aug 13, 2024

CamiloInx reviewed Aug 13, 2024

address review comments and docs

d5f3a16

CamiloInx approved these changes Aug 13, 2024

camiloaz merged commit b8aab59 into main Aug 13, 2024
1 check passed

camiloaz deleted the feat/add-sam2 branch August 13, 2024 22:37
