marcusinthesky
's Collections
Multimodal Embeddings
updated
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
Paper
•
2403.19651
•
Published
•
25
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency
Determines Multimodal Model Performance
Paper
•
2404.04125
•
Published
•
29
Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and
Training Strategies
Paper
•
2404.08197
•
Published
•
29
Gecko: Versatile Text Embeddings Distilled from Large Language Models
Paper
•
2403.20327
•
Published
•
48
OpenGVLab/InternVL-14B-224px
Image Feature Extraction
•
14B
•
Updated
•
703
•
35
Alibaba-NLP/gte-large-en-v1.5
Sentence Similarity
•
0.4B
•
Updated
•
5.03M
•
232
jinaai/jina-embeddings-v2-base-en
Feature Extraction
•
0.1B
•
Updated
•
144k
•
731
castorini/repllama-v1.1-mrl-7b-lora-passage
Feature Extraction
•
7B
•
Updated
•
24
•
5
McGill-NLP/LLM2Vec-Sheared-LLaMA-mntp
Sentence Similarity
•
Updated
•
585
•
5
BAAI/bge-visualized
Updated
•
68
royokong/e5-v
Image-to-Text
•
8B
•
Updated
•
18.1k
•
29
TIGER-Lab/VLM2Vec-Full
Text Generation
•
4B
•
Updated
•
49.4k
•
28
openbmb/VisRAG-Ret
Feature Extraction
•
3B
•
Updated
•
2.46k
•
72