TU
personOnline vector quantization algorithmWikipedia

TurboQuant

TurboQuant is an online vector quantization algorithm for compressing high-dimensional Euclidean vectors while preserving their geometric structure. It was proposed in 2025 by Amir Zandieh, Majid Daliri, Majid Hadian, and Vahab Mirrokni in the paper TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate. The paper lists Zandieh and Mirrokni as affiliated with Google Research, Daliri with New York University, and Hadian with Google DeepMind. The method was developed for applications including large language model (LLM) inference, key–value (KV) cache compression, vector databases, and nearest neighbor search.

9Mentions1Articles1Stories0Events0.01Salience
30-day activity pulse
Recent
1
Baseline
0
Ratio
new
Peak
1
Peak article volume on 2026-05-07.
Event Timeline
No linked events available for this entity yet.
TurboQuant - GDELT Cloud Entity | GDELT Cloud