Embeddings new_releases v3 release!
Top-performing multimodal multilingual long-context embeddings for search, RAG, agents applications.
v3: Frontier Multilingual Embeddings
jina-embeddings-v3
is a frontier multilingual text embedding model with 570M parameters and 8192 token-length, outperforming the latest proprietary embeddings from OpenAI and Cohere on MTEB. Read our blog post and research paper below.Embedding API
Try our world-class embedding models to improve your search and RAG systems. Start with a free trial!
Example inputs
1
2
3
4
5
upload
Request
Bash
Language
curl https://api.jina.ai/v1/embeddings \
-H "Content-Type: application/json" \
-H "Authorization: Bearer " \
-d '{
"model": "undefined",
"normalized": true,
"embedding_type": "float",
"input": [
"Organic skincare for sensitive skin with aloe vera and chamomile: Imagine the soothing embrace of nature with our organic skincare range, crafted specifically for sensitive skin. Infused with the calming properties of aloe vera and chamomile, each product provides gentle nourishment and protection. Say goodbye to irritation and hello to a glowing, healthy complexion.",
"Bio-Hautpflege für empfindliche Haut mit Aloe Vera und Kamille: Erleben Sie die wohltuende Wirkung unserer Bio-Hautpflege, speziell für empfindliche Haut entwickelt. Mit den beruhigenden Eigenschaften von Aloe Vera und Kamille pflegen und schützen unsere Produkte Ihre Haut auf natürliche Weise. Verabschieden Sie sich von Hautirritationen und genießen Sie einen strahlenden Teint.",
"Cuidado de la piel orgánico para piel sensible con aloe vera y manzanilla: Descubre el poder de la naturaleza con nuestra línea de cuidado de la piel orgánico, diseñada especialmente para pieles sensibles. Enriquecidos con aloe vera y manzanilla, estos productos ofrecen una hidratación y protección suave. Despídete de las irritaciones y saluda a una piel radiante y saludable.",
"针对敏感肌专门设计的天然有机护肤产品:体验由芦荟和洋甘菊提取物带来的自然呵护。我们的护肤产品特别为敏感肌设计,温和滋润,保护您的肌肤不受刺激。让您的肌肤告别不适,迎来健康光彩。",
"新しいメイクのトレンドは鮮やかな色と革新的な技術に焦点を当てています: 今シーズンのメイクアップトレンドは、大胆な色彩と革新的な技術に注目しています。ネオンアイライナーからホログラフィックハイライターまで、クリエイティビティを解き放ち、毎回ユニークなルックを演出しましょう。"
]}'
key
API key
Available tokens
0
API Pricing
Our API pricing is structured around the number of tokens sent in the requests. For Reader API, it is the number of tokens in the responses. This pricing model is applicable to all products in Jina AI's search foundation: Embedding, Reranking, Reader, Auto Fine-Tuning APIs. With the same API key, you have access to all API services.
Enter the API key you wish to recharge
Auto-recharge when tokens are low
≤ 1M Tokens
Recharge threshold
speed
Understand the rate limit
Top up this API key with more tokens
Please input the right API key to top up
Choosing the Right Embeddings
Our embedding models are designed to cover diverse search and GenAI applications.
API Integrations
Our Embedding API is natively integrated with various renowned databases, vector stores, RAG, and LLMOps frameworks. To begin, just copy and paste your API key into any of the listed integrations for a quick and seamless start.
Vector Store
LLMOps
RAG
Observability
MongoDB
DataStax
Qdrant
Pinecone
Chroma
Weaviate
Milvus
Epsilla
MyScale
LlamaIndex
Haystack
Langchain
Dify
SuperDuperDB
DashVector
Portkey
Baseten
TiDB
LanceDB
Carbon
On-premises deployment
Deploy Jina Embeddings models in AWS Sagemaker and Microsoft Azure, and soon in Google Cloud Services, or contact our sales team to get customized Kubernetes deployments for your Virtual Private Cloud and on-premises servers.
Our Publications
Understand how our frontier search models were trained from scratch, check out our latest publications. Meet our team at EMNLP, SIGIR, ICLR, NeurIPS, and ICML!
Learning about Embeddings
Where to start with embeddings? We've got you covered. Learn about embeddings from the ground up with our comprehensive guide.
The Evolution of Embeddings Poster
Discover the ideal poster for your space, featuring captivating infographics or breathtaking visuals tracing the evolution of text embedding models since 1950.
Rate Limit
Columns
Product | API Endpoint | Description | Allowed Request | Without API Key (RPM) | With API Key (RPM) | With Premium API Key (RPM) | Average Latency (s) | Token Usage Counting | |
---|---|---|---|---|---|---|---|---|---|
Embedding API | https://api.jina.ai/v1/embeddings | Convert text/images to fixed-length vectors | POST | block | 60keyboard_double_arrow_up | 300keyboard_double_arrow_up | bolt depends on the input size | Count the number of tokens in the input request. | |
Reader API | https://r.jina.ai | Convert URL to LLM-friendly text | GET/POST | 20 | 200 | 1000 | 1.6 | Count the number of tokens in the output response. | |
Reader API | https://s.jina.ai | Search the web and convert results to LLM-friendly text | GET/POST | 5 | 40 | 100 | 7.7 | Count the number of tokens in the output response. | |
Segmenter API | https://segment.jina.ai | Tokenize and segment long text | GET/POST | 20 | 200 | 1000 | 0.3 | Token is not counted as usage. | |
Reranker API | https://api.jina.ai/v1/rerank | Tokenize and segment long text | POST | block | 60keyboard_double_arrow_up | 300keyboard_double_arrow_up | bolt depends on the input size | Count the number of tokens in the input request. |
Embeddings-related common questions
How were the jina-embeddings-v2 models trained?
What is jina-clip-v1, can I use it for search text and image?
Which languages do your models support?
What is the maximum length for a single sentence input?
What is the maximum number of sentences I can include in a single request?
How do I send images to the jina-clip-v1 model?
How do Jina Embeddings models compare to OpenAI's text-embedding-ada-002 model?
How seamless is the transition from OpenAI's text-embedding-ada-002 to your solution?
How tokens are calculated when using jina-clip-v1?
Do you provide models for embedding images or audio?
Can Jina Embedding models be fine-tuned with private or company data?
Can your endpoints be hosted privately on AWS, Azure, or GCP?
API-related common questions
code
Can I use the same API key for embedding, reranking, reader, fine-tuning APIs?
code
Can I monitor the token usage of my API key?
code
What should I do if I forget my API key?
code
Do API keys expire?
code
Why is the first request for some models slow?
code
Is user input data used for training your models?
Billing-related common questions
attach_money
Is billing based on the number of sentences or requests?
attach_money
Is there a free trial available for new users?
attach_money
Are tokens charged for failed requests?
attach_money
What payment methods are accepted?
attach_money
Is invoicing available for token purchases?