Embeddings

Multimodal, bilingual long-context embeddings for your search and RAG.

Choosing the Right Embeddings

Our embedding models are designed to cover diverse search and GenAI applications.

Multimodal Embeddings

jina-clip-v1

General-Purpose Embeddings

jina-embeddings-v2-base

Bilingual Embeddings

jina-embeddings-v2-base-de

jina-embeddings-v2-base-zh

jina-embeddings-v2-base-es

Code Embeddings

jina-embeddings-v2-base-code

Embedding API

Try our world-class embedding models to improve your search and RAG systems. Start with a free trial!

Auto preview

Read the docs faq

API Status

Returning data type

Besides the float, you can ask it to return as binary for faster vector retrieval, or as base64 encoding for faster transmission.

Default (as float)

Example inputs

Change them and see how the response changes!

Request

curl https://api.jina.ai/v1/embeddings \
	 -H "Content-Type: application/json" \
	 -H "Authorization: Bearer " \
	 -d '{
	"model": "undefined",
	"embedding_type": "float",
	"input": [
		"A blue cat", 
		"A red dog", 
		"btw to represent image u can either use URL or encode image into base64 like below.", 
		"https://i.pinimg.com/600x315/21/48/7e/21487e8e0970dd366dafaed6ab25d8d8.jpg", 
		"R0lGODlhEAAQAMQAAORHHOVSKudfOulrSOp3WOyDZu6QdvCchPGolfO0o/XBs/fNwfjZ0frl3/zy7////wAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACH5BAkAABAALAAAAAAQABAAAAVVICSOZGlCQAosJ6mu7fiyZeKqNKToQGDsM8hBADgUXoGAiqhSvp5QAnQKGIgUhwFUYLCVDFCrKUE1lBavAViFIDlTImbKC5Gm2hB0SlBCBMQiB0UjIQA7"
    ]}'

API key

Available tokens

Make sure to store your API key at a safe place!

Our API pricing is structured around the number of tokens sent in the requests. For Reader API, it is the number of tokens in the responses. This pricing model is applicable to all products in Jina AI's search foundation: Embedding, Reranking, Reader, Auto Fine-Tuning APIs. With the same API key, you have access to all API services.

Enter the API key you wish to recharge

Auto-recharge when tokens are low

Recommended for uninterrupted service in production. When your token balance is below the threshold you set, we will automatically recharge your credit card for the same amount as your last top-up. If you purchased multiple packs in the last top-up, we will recharge only one pack.

≤ 1M Tokens

Recharge threshold

Top up this API key with more tokens

Depending on your location, you may be charged in USD, EUR, or other currencies. Taxes may apply.

Please input the right API key to top up

API Integrations

Our Embedding API is natively integrated with various renowned databases, vector stores, RAG, and LLMOps frameworks. To begin, just copy and paste your API key into any of the listed integrations for a quick and seamless start.

Vector Store

LLMOps

RAG

Observability

MongoDB

DataStax

Qdrant

Pinecone

Chroma

Weaviate

Milvus

Epsilla

MyScale

LlamaIndex

Haystack

Langchain

Dify

SuperDuperDB

DashVector

Portkey

Baseten

TiDB

LanceDB

On-premises deployment

Deploy Jina Embeddings models in AWS Sagemaker and Microsoft Azure, and soon in Google Cloud Services, or contact our sales team to get customized Kubernetes deployments for your Virtual Private Cloud and on-premises servers.

Contact

Our Publications

Understand how our search foundation were trained from scratch, check out our latest publications. Meet our team at EMNLP, SIGIR, ICLR, NeurIPS, and ICML!

arXiv

June 21, 2024

Leveraging Passage Embeddings for Efficient Listwise Reranking with Large Language Models

ICML 2024

May 30, 2024

Jina CLIP: Your CLIP Model Is Also Your Text Retriever

arXiv

February 26, 2024

Multi-Task Contrastive Learning for 8192-Token Bilingual Text Embeddings

arXiv

October 30, 2023

Jina Embeddings 2: 8192-Token General-Purpose Text Embeddings for Long Documents

EMNLP 2023

July 20, 2023

Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models

Learning about Embeddings

Where to start with embeddings? We've got you covered. Learn about embeddings from the ground up with our comprehensive guide.

Comparison of Reranker, Vector Search, and BM25

The table below provides a comprehensive comparison of the Reranker, Vector/Embeddings Search, and BM25, highlighting their strengths and weaknesses across various categories.

	Reranker	Vector Search	BM25
Best For	Enhanced search precision and relevance	Initial, rapid filtering	General text retrieval across wide-ranging queries
Granularity	Detailed: Sub-document and query segment	Broad: Entire documents	Intermediate: Various text segments
Query Time Complexity	High	Medium	Low
Indexing Time Complexity	Not required	High	Low, utilizes pre-built index
Training Time Complexity	High	High	Not required
Search Quality	Superior for nuanced queries	Balanced between efficiency and accuracy	Consistent and reliable for a broad set of queries
Strengths	Highly accurate with deep contextual understanding	Quick and efficient, with moderate accuracy	Highly scalable, with established efficacy
	Try reranker API for free	Try embedding API for free

The Evolution of Embeddings Poster

Discover the ideal poster for your space, featuring captivating infographics or breathtaking visuals tracing the evolution of text embedding models since 1950.

Learn how we made it

Buy a hard copy

FAQ

How were the jina-embeddings-v2 models trained?

What is jina-clip-v1, can I use it for search text and image?

Which languages do your models support?

What is the maximum length for a single sentence input?

What is the maximum number of sentences I can include in a single request?

How do I send images to the jina-clip-v1 model?

How do Jina Embeddings models compare to OpenAI's text-embedding-ada-002 model?

How seamless is the transition from OpenAI's text-embedding-ada-002 to your solution?

How tokens are calculated when using jina-clip-v1?

Do you provide models for embedding images or audio?

Can Jina Embedding models be fine-tuned with private or company data?

Can your endpoints be hosted privately on AWS, Azure, or GCP?

Can I use the same API key for embedding, reranking, reader, fine-tuning APIs?

Can I monitor the token usage of my API key?

What should I do if I forget my API key?

Do API keys expire?

Why is the first request for some models slow?

Is user input data used for training your models?

Is billing based on the number of sentences or requests?

Is there a free trial available for new users?

Are tokens charged for failed requests?

What payment methods are accepted?

Is invoicing available for token purchases?