ScyllaDB VSS Sizing Calculator

Configuration

Cloud Provider

AWS Amazon Web Services GCP Google Cloud Platform

Number of Vectors

10 K4 B

Total vectors to index. Logarithmic scale.

Dimensions

116 000

Embedding dimensionality. Common: 128 (image), 768 (BERT), 1536 (OpenAI), 4096 (LLMs).

Target QPS

101 000 000

Desired queries per second at P99 ≤ 15 ms latency.

Recall Target

70 %99 %

Higher recall dramatically reduces throughput per vCPU.

K (results per query)

11 000

Nearest-neighbour results per query. K=100 halves throughput vs K=10.

Quantization

None float32 — no compression Scalar (uint8) ~3× memory savings Binary (1-bit) ~10× memory savings

Metadata per Vector

4 B1 MiB

Average payload alongside each embedding (ScyllaDB-node storage). Logarithmic scale.

Filtering Columns

020

Each column adds 30 bytes per vector to Vector Store-node RAM.

Adjust parameters on the left to see results.