Text Embedding 005

Text Embedding 005 is an English-language text embedding model with a 66.31% average Massive Text Embedding Benchmark (MTEB) score at 768 dimensions, supporting dynamic embedding sizes down to 256 dimensions to reduce storage and compute costs with minor performance tradeoffs.

index.ts

import { embed } from 'ai';

const result = await embed({
  model: 'google/text-embedding-005',
  value: 'Sunny day at the beach',
})

Overview About Providers Similar FAQ

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider

Context	Latency	Throughput	Input	Output	Cache	Web Search	Per Query	Capabilities	ZDR	No Training	Release Date

Google Vertex AI

$0.03/M

—

08/01/2024

More models by Google

Model

Context	Latency	Throughput	Input	Output	Cache	Web Search	Per Query	Capabilities	Providers	ZDR	No Training	Release Date

google/gemini-3.5-flash

2.1s

186tps

$1.50/M

$9.00/M

Read:$0.15/M

Write:—

$14.00/K

+ input costs

—

05/19/2026

google/gemini-3.1-flash-lite-preview

0.5s

229tps

$0.25/M

$1.50/M

Read:$0.03/M

Write:—

$14.00/K

+ input costs

—

03/03/2026

google/gemini-3.1-pro-preview

3.6s

117tps

$2.00/M

$12.00/M

Read:

$0.2/M

Write:

—

$14.00/K

+ input costs

—

02/19/2026

google/gemini-3-flash

0.7s

167tps

$0.50/M

$3.00/M

Read:

$0.05/M

Write:

—

$14.00/K

+ input costs

—

12/17/2025

google/gemini-2.5-flash-lite

0.3s

254tps

$0.10/M

$0.40/M

Read:$0.01/M

Write:—

$35.00/K

+ input costs

—

06/17/2025

google/gemini-2.5-flash

0.4s

196tps

$0.30/M

$2.50/M

Read:$0.03/M

Write:—

$35.00/K

+ input costs

—

03/20/2025

About Text Embedding 005

Text Embedding 005 is Google's English-language text embedding model built on the Gecko architecture, which uses knowledge distillation from large language models (LLMs) to achieve competitive downstream task performance at a compact embedding size. At its full 768-dimension output, the model scores 66.31% on the MTEB benchmark, a standard evaluation suite covering eight categories including retrieval, reranking, clustering, classification, and semantic similarity.

This means Text Embedding 005 delivers competitive retrieval and similarity quality without requiring high-dimensional vector indices.

Dynamic embedding sizes are supported through Matryoshka Representation Learning (MRL), which trains the model to produce accurate representations at multiple dimension levels from a single pass. At 256 dimensions, it scores 64.37% on MTEB, a two-point reduction that may be an acceptable tradeoff when vector storage costs at scale are significant. This flexibility is built into the model architecture, not post-hoc dimension reduction.

What To Consider When Choosing a Provider

Configuration: When choosing between 256- and 768-dimension outputs, Text Embedding 005 shows a performance difference of approximately 2 percentage points on MTEB (64.37% vs 66.31%). For cost-sensitive applications with high vector storage volume, 256 dimensions can meaningfully reduce infrastructure costs with a modest quality tradeoff.
Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use Text Embedding 005

Best For

English semantic search: Document retrieval over English corpora, with a 66.31% MTEB score at the full 768-dimension output
English clustering and classification: Grouping and labeling tasks over English text corpora at 768 dimensions
768-dimension vector similarity: Applications where 768-dimension indices are the standard infrastructure choice
Cost-sensitive deployments: Use 256-dimension output to reduce storage and compute
Google cloud integrations: Applications on BigQuery or Vertex AI Search that are already in the Gecko model family

Consider Alternatives When

Multilingual content: Your corpus includes non-English text or mixed languages (consider google/text-multilingual-embedding-002)
Higher dimensions needed: You need more than 768 dimensions for maximum theoretical discriminative power

Conclusion

Text Embedding 005 delivers strong English retrieval and semantic similarity quality on MTEB at a compact 768-dimension size, with the flexibility to operate at 256 dimensions for storage-sensitive deployments. For English-language semantic search, clustering, and similarity applications, it provides strong performance per dimension relative to larger models.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Text Embedding 005

Providers

More models by Google

About Text Embedding 005

What To Consider When Choosing a Provider

When to Use Text Embedding 005

Best For

Consider Alternatives When

Conclusion