Kusto as a vector database

May 10, 2023
Open in Github

Azure Data Explorer aka Kusto is a cloud-based data analytics service that enables users to perform advanced analytics on large datasets in real-time. It is particularly well-suited for handling large volumes of data, making it an excellent choice for storing and searching vectors.

Kusto supports a special data type called dynamic, which can store unstructured data such as arrays and properties bag. Dynamic data type is perfect for storing vector values. You can further augment the vector value by storing metadata related to the original object as separate columns in your table.
Kusto also supports in-built function series_cosine_similarity_fl to perform vector similarity searches.

Get started with Kusto for free.

Kusto_Vector

Getting started with Kusto and Open AI embedding

Demo Scenario

Wiki_embeddings

semantic_search_flow

If you’d like to try this demo, please follow the instructions in the Notebook.

It will allow you to -

  1. Use precomputed embeddings created by OpenAI API.

  2. Store the embeddings in Kusto.

  3. Convert raw text query to an embedding with OpenAI API.

  4. Use Kusto to perform cosine similarity search in the stored embeddings.