Skip to content
Dashboard

Text Embedding 005

Text Embedding 005 is an English-language text embedding model with a 66.31% average Massive Text Embedding Benchmark (MTEB) score at 768 dimensions, supporting dynamic embedding sizes down to 256 dimensions to reduce storage and compute costs with minor performance tradeoffs.

index.ts
import { embed } from 'ai';
const result = await embed({
model: 'google/text-embedding-005',
value: 'Sunny day at the beach',
})

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
Google Vertex AI
$0.03/M——
08/01/2024

More models by Google

Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Release Date
1M
2.1s
186tps
$1.50/M$9.00/M
Read:$0.15/M
Write:—
$14.00/K
+ input costs
—
+4
google logo
vertex logo
05/19/2026
1M
0.5s
229tps
$0.25/M$1.50/M
Read:$0.03/M
Write:—
$14.00/K
+ input costs
—
+4
google logo
vertex logo
03/03/2026
1M
3.6s
117tps
$2.00/M
$12.00/M
Read:
$0.2/M
Write:
—
$14.00/K
+ input costs
—
+4
google logo
vertex logo
02/19/2026
1M
0.7s
167tps
$0.50/M
$3.00/M
Read:
$0.05/M
Write:
—
$14.00/K
+ input costs
—
+4
google logo
vertex logo
12/17/2025
1M
0.3s
254tps
$0.10/M$0.40/M
Read:$0.01/M
Write:—
$35.00/K
+ input costs
—
+4
google logo
vertex logo
06/17/2025
1M
0.4s
196tps
$0.30/M$2.50/M
Read:$0.03/M
Write:—
$35.00/K
+ input costs
—
+4
google logo
vertex logo
03/20/2025

About Text Embedding 005

Text Embedding 005 is Google's English-language text embedding model built on the Gecko architecture, which uses knowledge distillation from large language models (LLMs) to achieve competitive downstream task performance at a compact embedding size. At its full 768-dimension output, the model scores 66.31% on the MTEB benchmark, a standard evaluation suite covering eight categories including retrieval, reranking, clustering, classification, and semantic similarity.

This means Text Embedding 005 delivers competitive retrieval and similarity quality without requiring high-dimensional vector indices.

Dynamic embedding sizes are supported through Matryoshka Representation Learning (MRL), which trains the model to produce accurate representations at multiple dimension levels from a single pass. At 256 dimensions, it scores 64.37% on MTEB, a two-point reduction that may be an acceptable tradeoff when vector storage costs at scale are significant. This flexibility is built into the model architecture, not post-hoc dimension reduction.

What To Consider When Choosing a Provider

  • Configuration: When choosing between 256- and 768-dimension outputs, Text Embedding 005 shows a performance difference of approximately 2 percentage points on MTEB (64.37% vs 66.31%). For cost-sensitive applications with high vector storage volume, 256 dimensions can meaningfully reduce infrastructure costs with a modest quality tradeoff.
  • Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use Text Embedding 005

Best For

  • English semantic search: Document retrieval over English corpora, with a 66.31% MTEB score at the full 768-dimension output
  • English clustering and classification: Grouping and labeling tasks over English text corpora at 768 dimensions
  • 768-dimension vector similarity: Applications where 768-dimension indices are the standard infrastructure choice
  • Cost-sensitive deployments: Use 256-dimension output to reduce storage and compute
  • Google cloud integrations: Applications on BigQuery or Vertex AI Search that are already in the Gecko model family

Consider Alternatives When

  • Multilingual content: Your corpus includes non-English text or mixed languages (consider google/text-multilingual-embedding-002)
  • Higher dimensions needed: You need more than 768 dimensions for maximum theoretical discriminative power

Conclusion

Text Embedding 005 delivers strong English retrieval and semantic similarity quality on MTEB at a compact 768-dimension size, with the flexibility to operate at 256 dimensions for storage-sensitive deployments. For English-language semantic search, clustering, and similarity applications, it provides strong performance per dimension relative to larger models.