Hi, thanks for sharing the code!
Small question about extracting the clip embedding. Your evals/models/clip.py has a forward method that outputs a tensor of shape:
torch.Size([1, 768, 32, 32])
Is there a straightforward way of extracting a feature vector from your `forward` function that would return something of shape:
torch.Size([1, 512])
i.e., the feature vector that is commonly used? For example, using open_clip you would do:
image_features = model.encode_image(image)
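For concreteness, here is a minimal sketch of one way to collapse such a dense feature map into a single vector, assuming global average pooling over the spatial dimensions is acceptable. Note this yields a 768-dim vector (the feature map's channel count), not the 512-dim projected embedding that `encode_image` returns, unless a projection head is applied afterwards:

```python
import torch

# Stand-in for the dense CLIP feature map returned by the repo's
# forward(): shape [1, 768, 32, 32] (batch, channels, height, width).
feats = torch.randn(1, 768, 32, 32)

# Global average pooling over the spatial dimensions (H, W) collapses
# the map into one vector per image. The result is 768-dim, matching
# the channel count, not the 512-dim projected CLIP embedding.
pooled = feats.mean(dim=(2, 3))  # shape: [1, 768]
```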
Best,
Matt