


Vespa is an AI search platform for real-time retrieval, ranking, and inference on AWS, powering customer-facing applications such as RAG, search, recommendations, and personalization. It unifies structured, unstructured, vector, and tensor data in a single system to deliver low-latency, high-throughput performance. With hybrid search and machine-learned ranking executed directly in the engine, Vespa enables accurate, scalable AI applications that respond instantly to user interactions.