
add CLIP w/ TORCH backend to inference_experimental #1415


Merged
41 commits merged into main on Jul 18, 2025

Conversation

hansent (Contributor) commented on Jul 9, 2025

Description

This adds a torch implementation of CLIP and full test coverage for:

  • shared preprocessing between the onnx and torch implementations
  • e2e tests ensuring matching text and image embeddings, using cosine similarity between the original CLIP, clip_torch, and clip_onnx (a rough sketch of the comparison follows below)
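
A minimal sketch of what such a cosine-similarity check could look like, assuming the embeddings come back as (batch, dim) torch tensors; the helper name and the 0.99 threshold are illustrative assumptions, not the actual test code:

```python
import torch
import torch.nn.functional as F


def assert_embeddings_match(
    reference: torch.Tensor, candidate: torch.Tensor, min_similarity: float = 0.99
) -> None:
    # Both inputs are (batch, dim); compare along the embedding dimension.
    similarity = F.cosine_similarity(reference, candidate, dim=-1)
    assert torch.all(similarity > min_similarity), f"embeddings diverged: {similarity}"
```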

Type of change

Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • This change requires a documentation update

How has this change been tested? Please provide a test case or example of how you tested the change.

Tested locally; torch weights are only registered for RN50 at the moment.

Any specific deployment considerations

n/a

Docs

n/a

@hansent hansent changed the title initial stab at adding CLIP add CLIP to inference_experimental Jul 9, 2025
Base automatically changed from inference-exp/add-perception-encoder to feature/inference-v1-models July 10, 2025 15:31
Base automatically changed from feature/inference-v1-models to main July 11, 2025 16:41
    device: torch.device,
):
    self.model = model
    self.preprocess = preprocess
Collaborator:

I am not sure if I understand the preprocess parameter

Contributor Author:

It is no longer a parameter; the preprocessor is now instantiated in __init__ so the torch implementation uses the preprocessor shared with the onnx implementation.
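
A hypothetical sketch of that pattern; the class name is made up and the normalisation constants are the standard CLIP values assumed here, but the transform chain mirrors the one quoted later in this thread:

```python
import torch
from torchvision.transforms import CenterCrop, Compose, InterpolationMode, Normalize, Resize

# Standard CLIP normalisation constants, assumed here for illustration.
MEAN = (0.48145466, 0.4578275, 0.40821073)
STD = (0.26862954, 0.26130258, 0.27577711)


class ClipTorch:  # hypothetical class name
    def __init__(self, model: torch.nn.Module, device: torch.device, image_size: int = 224):
        self.model = model
        self.device = device
        # Built in __init__ rather than passed in, so the torch and onnx
        # backends share one preprocessing definition.
        self.preprocess = Compose(
            [
                Resize(image_size, interpolation=InterpolationMode.BICUBIC, antialias=True),
                CenterCrop(image_size),
                lambda x: x.to(torch.float32) / 255.0,
                Normalize(MEAN, STD),
            ]
        )
```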

for img in images:
    tensor = _to_tensor(img)
    if tensor.dtype == torch.uint8:
        tensor = tensor.to(torch.float32) / 255.0
Collaborator:

It seems normalisation to 0-1 should be done regardless of data type(?). Hard to say, tbh, as this is probably just a convention. I would keep the assumption that an image usually comes as [0-255]; that is how it is implemented for the other models from what I remember, but you may check.

Contributor Author:

Normalisation is now always applied, based on how you had it in the onnx preprocessing:

```python
transforms = Compose(
    [
        Resize(image_size, interpolation=InterpolationMode.BICUBIC, antialias=True),
        CenterCrop(image_size),
        lambda x: x.to(torch.float32) / 255.0,
        Normalize(MEAN, STD),
    ]
)
```
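
For context, a usage sketch under the assumption that images reach this chain as uint8 CHW tensors; the random input is purely illustrative:

```python
import numpy as np
import torch

# Fake uint8 CHW image; the chain resizes, crops, rescales to [0, 1] and normalises.
image = torch.from_numpy(np.random.randint(0, 256, size=(3, 480, 640), dtype=np.uint8))
pixel_values = transforms(image)  # float32 tensor of shape (3, image_size, image_size)
```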

        images_to_stack.append(cropped)
    tensor_batch = torch.stack(images_to_stack, dim=0)
else:
    # Handle single image or 4D batch for optimized processing
Collaborator:

How about a single ndarray? It seems it will fall into this branch.

(screenshot attached in the original comment)

Contributor Author:

I think this is handled properly by the new shared pre-processor.

The tests here cover (a rough sketch of one such test follows the list):

  • test_embed_single_numpy_image
  • test_embed_single_tensor_image
  • test_embed_list_of_numpy_images
  • test_embed_list_of_tensor_images
  • test_embed_batch_of_tensor_images
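
A rough sketch of what one of these tests could look like; the clip_model fixture, the embed_images method name, and the shape check are assumptions, not the real test code:

```python
import numpy as np


def test_embed_single_numpy_image(clip_model):  # hypothetical pytest fixture
    image = np.random.randint(0, 256, size=(480, 640, 3), dtype=np.uint8)

    embedding = clip_model.embed_images(image)  # hypothetical method name

    # A single image should yield exactly one embedding vector.
    assert embedding.shape[0] == 1
```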

@hansent hansent changed the title add CLIP to inference_experimental add CLIP w/ TORCH backend to inference_experimental Jul 15, 2025
@hansent hansent marked this pull request as ready for review July 15, 2025 21:19
]
)

def _preprocess(
Collaborator:

It seems this function could be extracted from being an inner function and just passed as the first callable in a chain?

Contributor Author:

I extracted it into a standalone function in the module to avoid the nesting/inner function, but I think it is tricky to make it part of the Compose chain.

The torchvision.transforms.Compose pipeline expects each transform to take a single argument. However, our _preprocess function is designed to be the main entry point and does more:

  • handles multiple input types: it accepts a single np.ndarray, a single torch.Tensor, a list of arrays, or a list of tensors.
  • calls the Compose pipeline on the prepared tensor batch; for lists this has to happen in a for loop, because the images may have different sizes, and the batch tensor is only created after each image has been processed.

I might be wrong; I'm not sure I completely understand how torchvision.transforms.Compose works, or whether we could always convert the list to a batch tensor first and then run the transforms on that?
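
To illustrate the point, here is a minimal sketch with hypothetical names: each transform in a Compose chain takes a single input, so a list of differently sized images has to go through the chain one image at a time and only then be stacked into a batch.

```python
from typing import Sequence, Union

import numpy as np
import torch
from torchvision.transforms import Compose


def _to_tensor_sketch(image: Union[np.ndarray, torch.Tensor]) -> torch.Tensor:
    # Hypothetical helper: accept HWC numpy arrays or already-CHW tensors.
    if isinstance(image, np.ndarray):
        return torch.from_numpy(image).permute(2, 0, 1)
    return image


def _preprocess_sketch(
    images: Union[np.ndarray, torch.Tensor, Sequence[Union[np.ndarray, torch.Tensor]]],
    transforms: Compose,  # the chain quoted earlier in the thread
) -> torch.Tensor:
    if isinstance(images, (list, tuple)):
        # Sizes may differ, so each image goes through the chain individually
        # and the batch tensor is created only afterwards.
        processed = [transforms(_to_tensor_sketch(img)) for img in images]
        return torch.stack(processed, dim=0)
    tensor = _to_tensor_sketch(images)
    if tensor.ndim == 3:
        tensor = tensor.unsqueeze(0)  # single image -> batch of one
    return transforms(tensor)
```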

@@ -0,0 +1,203 @@
import os

os.environ["ROBOFLOW_API_HOST"] = "https://api.roboflow.one"
Collaborator:

The models should be registered in the prod API.

Contributor Author:

Removed; the models are registered in prod now.

import os

os.environ["ROBOFLOW_API_HOST"] = "https://api.roboflow.one"

Collaborator:

[1] Functions are missing type annotations on their arguments.

Contributor Author:

Not sure what you mean by this.

@PawelPeczek-Roboflow PawelPeczek-Roboflow self-requested a review July 17, 2025 12:55
@PawelPeczek-Roboflow PawelPeczek-Roboflow self-requested a review July 17, 2025 13:20
@PawelPeczek-Roboflow PawelPeczek-Roboflow self-requested a review July 17, 2025 14:09
@PawelPeczek-Roboflow PawelPeczek-Roboflow self-requested a review July 17, 2025 17:04
@PawelPeczek-Roboflow PawelPeczek-Roboflow self-requested a review July 18, 2025 06:11
@PawelPeczek-Roboflow PawelPeczek-Roboflow merged commit 87abccd into main Jul 18, 2025
40 checks passed
@PawelPeczek-Roboflow PawelPeczek-Roboflow deleted the inference-exp-add-clip branch July 18, 2025 06:12