How to use LEAF-CLIP/OpenCLIP-ViT-bigG-rho50-k1-constrained with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("feature-extraction", model="LEAF-CLIP/OpenCLIP-ViT-bigG-rho50-k1-constrained")

# Load model directly
from transformers import AutoProcessor, AutoModelForZeroShotImageClassification
processor = AutoProcessor.from_pretrained("LEAF-CLIP/OpenCLIP-ViT-bigG-rho50-k1-constrained")
model = AutoModelForZeroShotImageClassification.from_pretrained("LEAF-CLIP/OpenCLIP-ViT-bigG-rho50-k1-constrained")

Model initialized from laion/CLIP-ViT-bigG-14-laion2B-39B-b160k. The text encoder is finetuned with LEAF at $k=1$ with $\rho=50$ and semantic constraints.
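Since LEAF finetunes only the text encoder, a quick way to use it is to extract text embeddings directly. A minimal sketch, assuming the processor and model loaded above (the prompt strings are placeholders):

import torch

texts = ["a photo of a cat", "a photo of a dog"]
text_inputs = processor(text=texts, return_tensors="pt", padding=True)

with torch.no_grad():
    # Text-tower embeddings from the LEAF-finetuned encoder
    text_embeds = model.get_text_features(**text_inputs)

print(text_embeds.shape)  # (2, embedding_dim)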
To load this model use:
from transformers import CLIPProcessor, CLIPModel
model_name = "LEAF-CLIP/OpenCLIP-ViT-bigG-rho50-k1-constrained"
processor_name = "laion/CLIP-ViT-bigG-14-laion2B-39B-b160k"  # the processor (tokenizer + image transforms) is shared with the base checkpoint
model = CLIPModel.from_pretrained(model_name)
processor = CLIPProcessor.from_pretrained(processor_name)
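Once loaded, the model supports the usual CLIP-style zero-shot image-text scoring. A minimal usage sketch (the image URL and prompts below are placeholders, not part of the model card):

from PIL import Image
import requests
import torch

# Placeholder image; any RGB image works
url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

texts = ["a photo of a cat", "a photo of a dog"]
inputs = processor(text=texts, images=image, return_tensors="pt", padding=True)

with torch.no_grad():
    outputs = model(**inputs)

# Image-text similarity logits, converted to probabilities over the prompts
probs = outputs.logits_per_image.softmax(dim=-1)
print(probs)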
Base model: laion/CLIP-ViT-bigG-14-laion2B-39B-b160k