Image Feature Extraction
Transformers
Safetensors
English
videollama3_vision_encoder
feature-extraction
visual-encoder
multi-modal-large-language-model
custom_code
Instructions to use DAMO-NLP-SG/VL3-SigLIP-NaViT with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use DAMO-NLP-SG/VL3-SigLIP-NaViT with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-feature-extraction", model="DAMO-NLP-SG/VL3-SigLIP-NaViT", trust_remote_code=True)# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("DAMO-NLP-SG/VL3-SigLIP-NaViT", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
what is the difference between this model and "DAMO-NLP-SG/SigLIP-NaViT"?
#5
by hao98 - opened
...