RedisVectorStore#

class langchain_redis.vectorstores.RedisVectorStore(embeddings: Embeddings, config: RedisConfig | None = None, **kwargs: Any)[source]#

Redis vector store integration.

Setup:

Install langchain-redis and running the Redis docker container.

pip install -qU langchain-redis
docker run -p 6379:6379 redis/redis-stack-server:latest
Key init args — indexing params:
index_name: str

Name of the index to create.

embedding: Embeddings

Embedding function to use.

distance_metric: str

Distance metric to use for similarity search. Default is “COSINE”.

indexing_algorithm: str

Indexing algorithm to use. Default is “FLAT”.

vector_datatype: str

Data type of the vector. Default is “FLOAT32”.

Key init args — client params:
redis_url: Optional[str]

URL of the Redis instance to connect to.

redis_client: Optional[Redis]

Pre-existing Redis connection.

Instantiate:
from langchain_redis import RedisVectorStore
from langchain_openai import OpenAIEmbeddings

vector_store = RedisVectorStore(
    index_name="langchain-demo",
    embedding=OpenAIEmbeddings(),
    redis_url="redis://localhost:6379",
)

You can also connect to an existing Redis instance by passing in a pre-existing Redis connection via the redis_client argument.

Instantiate from existing connection:
from langchain_redis import RedisVectorStore
from langchain_openai import OpenAIEmbeddings
from redis import Redis

redis_client = Redis.from_url("redis://localhost:6379")

store = RedisVectorStore(
    embedding=OpenAIEmbeddings(),
    index_name="langchain-demo",
    redis_client=redis_client
)
Add Documents:
from langchain_core.documents import Document

document_1 = Document(page_content="foo", metadata={"baz": "bar"})
document_2 = Document(page_content="bar", metadata={"foo": "baz"})
document_3 = Document(page_content="to be deleted")

documents = [document_1, document_2, document_3]
ids = ["1", "2", "3"]
vector_store.add_documents(documents=documents, ids=ids)
Delete Documents:
vector_store.delete(ids=["3"])
Search:
results = vector_store.similarity_search(query="foo", k=1)
for doc in results:
    print(f"* {doc.page_content} [{doc.metadata}]")
* foo [{'baz': 'bar'}]
Search with filter:
from redisvl.query.filter import Tag

results = vector_store.similarity_search(
    query="foo",
    k=1,
    filter=Tag("baz") == "bar"
)
for doc in results:
    print(f"* {doc.page_content} [{doc.metadata}]")
* foo [{'baz': 'bar'}]
Search with score:
results = vector_store.similarity_search_with_score(query="foo", k=1)
for doc, score in results:
    print(f"* [SIM={score:.3f}] {doc.page_content} [{doc.metadata}]")
* [SIM=0.916] foo [{'baz': 'bar'}]
Use as Retriever:
retriever = vector_store.as_retriever(
    search_type="mmr",
    search_kwargs={"k": 1, "fetch_k": 2, "lambda_mult": 0.5},
)
retriever.get_relevant_documents("foo")
[Document(page_content='foo', metadata={'baz': 'bar'})]

Attributes

embeddings

Access the query embedding object if available.

index

key_prefix

Methods

__init__(embeddings[, config])

aadd_documents(documents, **kwargs)

Async run more documents through the embeddings and add to the vectorstore.

aadd_texts(texts[, metadatas, ids])

Async run more texts through the embeddings and add to the vectorstore.

add_documents(documents, **kwargs)

Add or update documents in the vectorstore.

add_texts(texts[, metadatas, keys])

Add text documents to the vector store.

adelete([ids])

Async delete by vector ID or other criteria.

afrom_documents(documents, embedding, **kwargs)

Async return VectorStore initialized from documents and embeddings.

afrom_texts(texts, embedding[, metadatas, ids])

Async return VectorStore initialized from texts and embeddings.

aget_by_ids(ids, /)

Async get documents by their IDs.

amax_marginal_relevance_search(query[, k, ...])

Async return docs selected using the maximal marginal relevance.

amax_marginal_relevance_search_by_vector(...)

Async return docs selected using the maximal marginal relevance.

as_retriever(**kwargs)

Return VectorStoreRetriever initialized from this VectorStore.

asearch(query, search_type, **kwargs)

Async return docs most similar to query using a specified search type.

asimilarity_search(query[, k])

Async return docs most similar to query.

asimilarity_search_by_vector(embedding[, k])

Async return docs most similar to embedding vector.

asimilarity_search_with_relevance_scores(query)

Async return docs and relevance scores in the range [0, 1].

asimilarity_search_with_score(*args, **kwargs)

Async run similarity search with distance.

convert_vector(obj)

delete([ids])

Delete ids from the vector store.

from_documents(documents, embedding[, ...])

Create a RedisVectorStore from a list of Documents.

from_existing_index(index_name, embedding, ...)

Create a RedisVectorStore from an existing Redis Search Index.

from_texts(texts, embedding[, metadatas, ...])

Create a RedisVectorStore from a list of texts.

get_by_ids(ids)

Get documents by their IDs.

max_marginal_relevance_search(query[, k, ...])

Return docs selected using the maximal marginal relevance.

max_marginal_relevance_search_by_vector(...)

Return docs selected using the maximal marginal relevance.

search(query, search_type, **kwargs)

Return docs most similar to query using a specified search type.

similarity_search(query[, k, filter, sort_by])

Return docs most similar to query.

similarity_search_by_vector(embedding[, k, ...])

Return docs most similar to embedding vector.

similarity_search_with_relevance_scores(query)

Return docs and relevance scores in the range [0, 1].

similarity_search_with_score(query[, k, ...])

Return documents most similar to query string, along with scores.

similarity_search_with_score_by_vector(embedding)

Return docs most similar to embedding vector.

Parameters:
__init__(embeddings: Embeddings, config: RedisConfig | None = None, **kwargs: Any)[source]#
Parameters:
async aadd_documents(documents: list[Document], **kwargs: Any) list[str]#

Async run more documents through the embeddings and add to the vectorstore.

Parameters:
  • documents (list[Document]) – Documents to add to the vectorstore.

  • kwargs (Any) – Additional keyword arguments.

Returns:

List of IDs of the added texts.

Raises:

ValueError – If the number of IDs does not match the number of documents.

Return type:

list[str]

async aadd_texts(texts: Iterable[str], metadatas: list[dict] | None = None, *, ids: list[str] | None = None, **kwargs: Any) list[str]#

Async run more texts through the embeddings and add to the vectorstore.

Parameters:
  • texts (Iterable[str]) – Iterable of strings to add to the vectorstore.

  • metadatas (list[dict] | None) – Optional list of metadatas associated with the texts. Default is None.

  • ids (list[str] | None) – Optional list

  • **kwargs (Any) – vectorstore specific parameters.

Returns:

List of ids from adding the texts into the vectorstore.

Raises:
  • ValueError – If the number of metadatas does not match the number of texts.

  • ValueError – If the number of ids does not match the number of texts.

Return type:

list[str]

add_documents(documents: list[Document], **kwargs: Any) list[str]#

Add or update documents in the vectorstore.

Parameters:
  • documents (list[Document]) – Documents to add to the vectorstore.

  • kwargs (Any) – Additional keyword arguments. if kwargs contains ids and documents contain ids, the ids in the kwargs will receive precedence.

Returns:

List of IDs of the added texts.

Raises:

ValueError – If the number of ids does not match the number of documents.

Return type:

list[str]

add_texts(texts: Iterable[str], metadatas: List[dict] | None = None, keys: List[dict] | None = None, **kwargs: Any) List[str][source]#

Add text documents to the vector store.

Parameters:
  • texts (Iterable[str]) – Iterable of strings to add to the vector store.

  • metadatas (List[dict] | None) – Optional list of metadata dicts associated with the texts.

  • keys (List[dict] | None) – Optional list of keys to associate with the documents.

  • **kwargs (Any) – Additional keyword arguments: - ids: Optional list of ids to associate with the documents. - refresh_indices: Whether to refresh the Redis indices after adding the texts. Defaults to True. - create_index_if_not_exists: Whether to create the Redis index if it doesn’t already exist. Defaults to True. - batch_size: Optional. Number of texts to add to the index at a time. Defaults to 1000.

Returns:

List of ids from adding the texts into the vector store.

Return type:

List[str]

Example

from langchain_redis import RedisVectorStore
from langchain_openai import OpenAIEmbeddings

vector_store = RedisVectorStore(
    index_name="langchain-demo",
    embedding=OpenAIEmbeddings(),
    redis_url="redis://localhost:6379",
)

texts = [
    "The quick brown fox jumps over the lazy dog",
    "Hello world",
    "Machine learning is fascinating"
]
metadatas = [
    {"source": "book", "page": 1},
    {"source": "greeting", "language": "english"},
    {"source": "article", "topic": "AI"}
]

ids = vector_store.add_texts(
    texts=texts,
    metadatas=metadatas,
    batch_size=2
)

print(f"Added documents with ids: {ids}")

Note

  • If metadatas is provided, it must have the same length as texts.

  • If keys is provided, it must have the same length as texts.

  • The batch_size parameter can be used to control the number of

documents added in each batch, which can be useful for managing memory usage when adding a large number of documents.

async adelete(ids: list[str] | None = None, **kwargs: Any) bool | None#

Async delete by vector ID or other criteria.

Parameters:
  • ids (list[str] | None) – List of ids to delete. If None, delete all. Default is None.

  • **kwargs (Any) – Other keyword arguments that subclasses might use.

Returns:

True if deletion is successful, False otherwise, None if not implemented.

Return type:

Optional[bool]

async classmethod afrom_documents(documents: list[Document], embedding: Embeddings, **kwargs: Any) VST#

Async return VectorStore initialized from documents and embeddings.

Parameters:
  • documents (list[Document]) – List of Documents to add to the vectorstore.

  • embedding (Embeddings) – Embedding function to use.

  • kwargs (Any) – Additional keyword arguments.

Returns:

VectorStore initialized from documents and embeddings.

Return type:

VectorStore

async classmethod afrom_texts(texts: list[str], embedding: Embeddings, metadatas: list[dict] | None = None, *, ids: list[str] | None = None, **kwargs: Any) VST#

Async return VectorStore initialized from texts and embeddings.

Parameters:
  • texts (list[str]) – Texts to add to the vectorstore.

  • embedding (Embeddings) – Embedding function to use.

  • metadatas (list[dict] | None) – Optional list of metadatas associated with the texts. Default is None.

  • ids (list[str] | None) – Optional list of IDs associated with the texts.

  • kwargs (Any) – Additional keyword arguments.

Returns:

VectorStore initialized from texts and embeddings.

Return type:

VectorStore

async aget_by_ids(ids: Sequence[str], /) list[Document]#

Async get documents by their IDs.

The returned documents are expected to have the ID field set to the ID of the document in the vector store.

Fewer documents may be returned than requested if some IDs are not found or if there are duplicated IDs.

Users should not assume that the order of the returned documents matches the order of the input IDs. Instead, users should rely on the ID field of the returned documents.

This method should NOT raise exceptions if no documents are found for some IDs.

Parameters:

ids (Sequence[str]) – List of ids to retrieve.

Returns:

List of Documents.

Return type:

list[Document]

Added in version 0.2.11.

Async return docs selected using the maximal marginal relevance.

Maximal marginal relevance optimizes for similarity to query AND diversity among selected documents.

Parameters:
  • query (str) – Text to look up documents similar to.

  • k (int) – Number of Documents to return. Defaults to 4.

  • fetch_k (int) – Number of Documents to fetch to pass to MMR algorithm. Default is 20.

  • lambda_mult (float) – Number between 0 and 1 that determines the degree of diversity among the results with 0 corresponding to maximum diversity and 1 to minimum diversity. Defaults to 0.5.

  • kwargs (Any)

Returns:

List of Documents selected by maximal marginal relevance.

Return type:

list[Document]

async amax_marginal_relevance_search_by_vector(embedding: list[float], k: int = 4, fetch_k: int = 20, lambda_mult: float = 0.5, **kwargs: Any) list[Document]#

Async return docs selected using the maximal marginal relevance.

Maximal marginal relevance optimizes for similarity to query AND diversity among selected documents.

Parameters:
  • embedding (list[float]) – Embedding to look up documents similar to.

  • k (int) – Number of Documents to return. Defaults to 4.

  • fetch_k (int) – Number of Documents to fetch to pass to MMR algorithm. Default is 20.

  • lambda_mult (float) – Number between 0 and 1 that determines the degree of diversity among the results with 0 corresponding to maximum diversity and 1 to minimum diversity. Defaults to 0.5.

  • **kwargs (Any) – Arguments to pass to the search method.

Returns:

List of Documents selected by maximal marginal relevance.

Return type:

list[Document]

as_retriever(**kwargs: Any) VectorStoreRetriever#

Return VectorStoreRetriever initialized from this VectorStore.

Parameters:

**kwargs (Any) –

Keyword arguments to pass to the search function. Can include: search_type (Optional[str]): Defines the type of search that

the Retriever should perform. Can be “similarity” (default), “mmr”, or “similarity_score_threshold”.

search_kwargs (Optional[Dict]): Keyword arguments to pass to the
search function. Can include things like:

k: Amount of documents to return (Default: 4) score_threshold: Minimum relevance threshold

for similarity_score_threshold

fetch_k: Amount of documents to pass to MMR algorithm

(Default: 20)

lambda_mult: Diversity of results returned by MMR;

1 for minimum diversity and 0 for maximum. (Default: 0.5)

filter: Filter by document metadata

Returns:

Retriever class for VectorStore.

Return type:

VectorStoreRetriever

Examples:

# Retrieve more documents with higher diversity
# Useful if your dataset has many similar documents
docsearch.as_retriever(
    search_type="mmr",
    search_kwargs={'k': 6, 'lambda_mult': 0.25}
)

# Fetch more documents for the MMR algorithm to consider
# But only return the top 5
docsearch.as_retriever(
    search_type="mmr",
    search_kwargs={'k': 5, 'fetch_k': 50}
)

# Only retrieve documents that have a relevance score
# Above a certain threshold
docsearch.as_retriever(
    search_type="similarity_score_threshold",
    search_kwargs={'score_threshold': 0.8}
)

# Only get the single most similar document from the dataset
docsearch.as_retriever(search_kwargs={'k': 1})

# Use a filter to only retrieve documents from a specific paper
docsearch.as_retriever(
    search_kwargs={'filter': {'paper_title':'GPT-4 Technical Report'}}
)
async asearch(query: str, search_type: str, **kwargs: Any) list[Document]#

Async return docs most similar to query using a specified search type.

Parameters:
  • query (str) – Input text.

  • search_type (str) – Type of search to perform. Can be “similarity”, “mmr”, or “similarity_score_threshold”.

  • **kwargs (Any) – Arguments to pass to the search method.

Returns:

List of Documents most similar to the query.

Raises:

ValueError – If search_type is not one of “similarity”, “mmr”, or “similarity_score_threshold”.

Return type:

list[Document]

Async return docs most similar to query.

Parameters:
  • query (str) – Input text.

  • k (int) – Number of Documents to return. Defaults to 4.

  • **kwargs (Any) – Arguments to pass to the search method.

Returns:

List of Documents most similar to the query.

Return type:

list[Document]

async asimilarity_search_by_vector(embedding: list[float], k: int = 4, **kwargs: Any) list[Document]#

Async return docs most similar to embedding vector.

Parameters:
  • embedding (list[float]) – Embedding to look up documents similar to.

  • k (int) – Number of Documents to return. Defaults to 4.

  • **kwargs (Any) – Arguments to pass to the search method.

Returns:

List of Documents most similar to the query vector.

Return type:

list[Document]

async asimilarity_search_with_relevance_scores(query: str, k: int = 4, **kwargs: Any) list[tuple[Document, float]]#

Async return docs and relevance scores in the range [0, 1].

0 is dissimilar, 1 is most similar.

Parameters:
  • query (str) – Input text.

  • k (int) – Number of Documents to return. Defaults to 4.

  • **kwargs (Any) –

    kwargs to be passed to similarity search. Should include: score_threshold: Optional, a floating point value between 0 to 1 to

    filter the resulting set of retrieved docs

Returns:

List of Tuples of (doc, similarity_score)

Return type:

list[tuple[Document, float]]

async asimilarity_search_with_score(*args: Any, **kwargs: Any) list[tuple[Document, float]]#

Async run similarity search with distance.

Parameters:
  • *args (Any) – Arguments to pass to the search method.

  • **kwargs (Any) – Arguments to pass to the search method.

Returns:

List of Tuples of (doc, similarity_score).

Return type:

list[tuple[Document, float]]

convert_vector(obj: dict) List[float][source]#
Parameters:

obj (dict)

Return type:

List[float]

delete(ids: List[str] | None = None, **kwargs: Any) bool | None[source]#

Delete ids from the vector store.

Parameters:
  • ids (List[str] | None) – Optional list of ids of the documents to delete.

  • **kwargs (Any) – Additional keyword arguments (not used in the current implementation).

Returns:

True if one or more keys are deleted, False otherwise

Return type:

Optional[bool]

Example

from langchain_redis import RedisVectorStore
from langchain_openai import OpenAIEmbeddings

vector_store = RedisVectorStore(
    index_name="langchain-demo",
    embedding=OpenAIEmbeddings(),
    redis_url="redis://localhost:6379",
)

# Assuming documents with these ids exist in the store
ids_to_delete = ["doc1", "doc2", "doc3"]

result = vector_store.delete(ids=ids_to_delete)
if result:
  print("Documents were succesfully deleted")
else:
  print("No Documents were deleted")

Note

  • If ids is None or an empty list, the method returns False.

  • If the number of actually deleted keys differs from the number of keys submitted for deletion the method returns False

  • The method uses the drop_keys functionality from RedisVL to delete the keys from Redis.

  • Keys are constructed by prefixing each id with the key_prefix specified in the configuration.

classmethod from_documents(documents: List[Document], embedding: Embeddings, config: RedisConfig | None = None, return_keys: bool = False, **kwargs: Any) RedisVectorStore[source]#

Create a RedisVectorStore from a list of Documents.

Parameters:
  • documents (List[Document]) – List of Document objects to add to the vector store.

  • embedding (Embeddings) – Embeddings object to use for encoding the documents.

  • config (RedisConfig | None) – Optional RedisConfig object. If not provided, one will be created from kwargs.

  • return_keys (bool) – Whether to return the keys of the added documents.

  • **kwargs (Any) –

    Additional keyword arguments to pass to RedisConfig if config

    is not provided.

    Common kwargs include: - index_name: Name of the Redis index to create. - redis_url: URL of the Redis instance to connect to. - distance_metric: Distance metric to use for similarity search.

    Default is “COSINE”.

    • indexing_algorithm: Indexing algorithm to use. Default is “FLAT”.

Returns:

A new RedisVectorStore instance with the documents added.

Return type:

RedisVectorStore

Example

from langchain_redis import RedisVectorStore
from langchain_openai import OpenAIEmbeddings
from langchain_core.documents import Document

documents = [
    Document(
      page_content="The quick brown fox",
      metadata={"animal": "fox"}
    ),
    Document(
      page_content="jumps over the lazy dog",
      metadata={"animal": "dog"}
    )
]

embeddings = OpenAIEmbeddings()

vector_store = RedisVectorStore.from_documents(
    documents=documents,
    embedding=embeddings,
    index_name="animal-docs",
    redis_url="redis://localhost:6379"
)

# Now you can use the vector_store for similarity search
results = vector_store.similarity_search("quick animal", k=1)
print(results[0].page_content)

Note

  • This method creates a new RedisVectorStore instance and adds the provided documents to it.

  • The method extracts the text content and metadata from each Document object.

  • If a RedisConfig object is not provided, one will be created using the additional kwargs passed to this method.

  • The embedding function is used to convert the document text into vector representations for efficient similarity search.

classmethod from_existing_index(index_name: str, embedding: Embeddings, **kwargs: Any) RedisVectorStore[source]#

Create a RedisVectorStore from an existing Redis Search Index.

This method allows you to connect to an already existing index in Redis, which can be useful for continuing work with previously created indexes or for connecting to indexes created outside of this client.

Parameters:
  • index_name (str) – Name of the existing index to use.

  • embedding (Embeddings) – Embedding function to use for encoding queries.

  • **kwargs (Any) –

    Additional keyword arguments to pass to RedisConfig. Common kwargs include: - redis_url: URL of the Redis instance to connect to. - redis_client: Pre-existing Redis client to use. - vector_query_field: Name of the field containing the vector

    representations.

    • content_field: Name of the field containing the document content.

Returns:

A new RedisVectorStore instance connected to the

existing index.

Return type:

RedisVectorStore

Example

from langchain_redis import RedisVectorStore
from langchain_openai import OpenAIEmbeddings
from redis import Redis

embeddings = OpenAIEmbeddings()

# Connect to an existing index
vector_store = RedisVectorStore.from_existing_index(
    index_name="my-existing-index",
    embedding=embeddings,
    redis_url="redis://localhost:6379",
    vector_query_field="embedding",
    content_field="text"
)

# Now you can use the vector_store for similarity search
results = vector_store.similarity_search("AI and machine learning", k=1)
print(results[0].page_content)

Note

  • This method assumes that the index already exists in Redis.

  • The embedding function provided should be compatible with the embeddings

stored in the existing index. - If you’re using custom field names for vectors or content in your

existing index, make sure to specify them using vector_query_field and content_field respectively.

  • This method is useful for scenarios where you want to reuse an

    existing index, such as when the index was created by another process or when you want to use the same index across different sessions or applications.

classmethod from_texts(texts: List[str], embedding: Embeddings, metadatas: List[dict] | None = None, config: RedisConfig | None = None, keys: List[str] | None = None, return_keys: bool = False, **kwargs: Any) RedisVectorStore[source]#

Create a RedisVectorStore from a list of texts.

Parameters:
  • texts (List[str]) – List of texts to add to the vector store.

  • embedding (Embeddings) – Embedding function to use for encoding the texts.

  • metadatas (List[dict] | None) – Optional list of metadata dicts associated with the texts.

  • config (RedisConfig | None) – Optional RedisConfig object. If not provided, one will be created from kwargs.

  • keys (List[str] | None) – Optional list of keys to associate with the documents.

  • return_keys (bool) – Whether to return the keys of the added documents.

  • **kwargs (Any) –

    Additional keyword arguments to pass to RedisConfig if config is not provided. Commonly used kwargs include: - index_name: Name of the Redis index to create. - redis_url: URL of the Redis instance to connect to. - distance_metric: Distance metric to use for similarity search.

    Default is “COSINE”.

    • indexing_algorithm: Indexing algorithm to use. Default is “FLAT”.

Returns:

A new RedisVectorStore instance with the texts added.

Return type:

RedisVectorStore

Example

from langchain_redis import RedisVectorStore
from langchain_openai import OpenAIEmbeddings

texts = [
    "The quick brown fox jumps over the lazy dog",
    "Hello world",
    "Machine learning is fascinating"
]
metadatas = [
    {"source": "book", "page": 1},
    {"source": "greeting", "language": "english"},
    {"source": "article", "topic": "AI"}
]

embeddings = OpenAIEmbeddings()

vector_store = RedisVectorStore.from_texts(
    texts=texts,
    embedding=embeddings,
    metadatas=metadatas,
    index_name="langchain-demo",
    redis_url="redis://localhost:6379",
    distance_metric="COSINE"
)

# Now you can use the vector_store for similarity search
results = vector_store.similarity_search("AI and machine learning", k=1)
print(results[0].page_content)

Note

  • This method creates a new RedisVectorStore instance and adds the

    provided texts to it.

  • If metadatas is provided, it must have the same length as texts.

  • If keys is provided, it must have the same length as texts.

  • The return_keys parameter determines whether the method returns just the

RedisVectorStore instance or a tuple of (RedisVectorStore, List[str]) where the second element is the list of keys for the added documents.

get_by_ids(ids: Sequence[str]) List[Document][source]#

Get documents by their IDs.

The returned documents are expected to have the ID field set to the ID of the document in the vector store.

Fewer documents may be returned than requested if some IDs are not found or if there are duplicated IDs.

Users should not assume that the order of the returned documents matches the order of the input IDs. Instead, users should rely on the ID field of the returned documents.

This method should NOT raise exceptions if no documents are found for some IDs.

Parameters:

ids (Sequence[str]) – List of ids to retrieve.

Returns:

List of Documents.

Return type:

List[Document]

Added in version 0.1.2.

Return docs selected using the maximal marginal relevance.

Maximal marginal relevance optimizes for similarity to query AND diversity among selected documents.

Parameters:
  • query (str) – Text to look up documents similar to.

  • k (int) – Number of Documents to return. Defaults to 4.

  • fetch_k (int) – Number of Documents to fetch to pass to MMR algorithm. Defaults to 20.

  • lambda_mult (float) – Number between 0 and 1 that determines the degree of diversity among the results with 0 corresponding to maximum diversity and 1 to minimum diversity. Defaults to 0.5.

  • **kwargs (Any) – Other keyword arguments to pass to the search function.

Returns:

List of Documents selected by maximal marginal relevance.

Return type:

List[Document]

max_marginal_relevance_search_by_vector(embedding: List[float], k: int = 4, fetch_k: int = 20, lambda_mult: float = 0.5, **kwargs: Any) List[Document][source]#

Return docs selected using the maximal marginal relevance.

Maximal marginal relevance optimizes for similarity to query AND diversity among selected documents.

Parameters:
  • embedding (List[float]) – Embedding to look up documents similar to.

  • k (int) – Number of Documents to return. Defaults to 4.

  • fetch_k (int) – Number of Documents to fetch to pass to MMR algorithm. Defaults to 20.

  • lambda_mult (float) – Number between 0 and 1 that determines the degree of diversity among the results with 0 corresponding to maximum diversity and 1 to minimum diversity. Defaults to 0.5.

  • **kwargs (Any) – Other keyword arguments to pass to the search function.

Returns:

List of Documents selected by maximal marginal relevance.

Return type:

List[Document]

search(query: str, search_type: str, **kwargs: Any) list[Document]#

Return docs most similar to query using a specified search type.

Parameters:
  • query (str) – Input text

  • search_type (str) – Type of search to perform. Can be “similarity”, “mmr”, or “similarity_score_threshold”.

  • **kwargs (Any) – Arguments to pass to the search method.

Returns:

List of Documents most similar to the query.

Raises:

ValueError – If search_type is not one of “similarity”, “mmr”, or “similarity_score_threshold”.

Return type:

list[Document]

Return docs most similar to query.

Parameters:
  • query (str) – Text to look up documents similar to.

  • k (int) – Number of Documents to return. Defaults to 4.

  • filter (FilterExpression | None) – Optional filter expression to apply.

  • sort_by (str | None) – Optional sort_by expression to apply.

  • **kwargs (Any) – Other keyword arguments to pass to the search function.

Returns:

List of Documents most similar to the query.

Return type:

List[Document]

similarity_search_by_vector(embedding: List[float], k: int = 4, filter: FilterExpression | None = None, sort_by: str | None = None, **kwargs: Any) List[Document][source]#

Return docs most similar to embedding vector.

Parameters:
  • embedding (List[float]) – Embedding to look up documents similar to.

  • k (int) – Number of Documents to return. Defaults to 4.

  • filter (FilterExpression | None) – Optional filter expression to apply.

  • sort_by (str | None) – Optional sort_by expression to apply.

  • **kwargs (Any) –

    Other keyword arguments: - return_metadata: Whether to return metadata. Defaults to True. - distance_threshold: Optional distance threshold for filtering results. - return_all: Whether to return all data in the Hash/JSON including

    non-indexed fields

Returns:

List of Documents most similar to the query vector.

Return type:

List[Document]

similarity_search_with_relevance_scores(query: str, k: int = 4, **kwargs: Any) list[tuple[Document, float]]#

Return docs and relevance scores in the range [0, 1].

0 is dissimilar, 1 is most similar.

Parameters:
  • query (str) – Input text.

  • k (int) – Number of Documents to return. Defaults to 4.

  • **kwargs (Any) –

    kwargs to be passed to similarity search. Should include: score_threshold: Optional, a floating point value between 0 to 1 to

    filter the resulting set of retrieved docs.

Returns:

List of Tuples of (doc, similarity_score).

Return type:

list[tuple[Document, float]]

similarity_search_with_score(query: str, k: int = 4, filter: FilterExpression | None = None, sort_by: str | None = None, **kwargs: Any) Sequence[Any][source]#

Return documents most similar to query string, along with scores.

Args:

query: Text to look up documents similar to. k: Number of Documents to return. Defaults to 4. filter: Optional filter expression to apply to the query. sort_by: Optional sort_by expression to apply to the query. **kwargs: Other keyword arguments to pass to the search function:

  • custom_query: Optional callable that can be used

    to customize the query.

  • doc_builder: Optional callable to customize Document creation.

  • return_metadata: Whether to return metadata. Defaults to True.

  • distance_threshold: Optional distance threshold for filtering results.

  • return_all: Whether to return all data in the Hash/JSON including

non-indexed fields. Defaults to False.

Returns:

List of tuples of (Document, score) most similar to the query.

Example:
from langchain_redis import RedisVectorStore
from langchain_openai import OpenAIEmbeddings

vector_store = RedisVectorStore(
    index_name="langchain-demo",
    embedding=OpenAIEmbeddings(),
    redis_url="redis://localhost:6379",
)

results = vector_store.similarity_search_with_score(
    "What is machine learning?",
    k=2,
    filter=None
)

for doc, score in results:
    print(f"Score: {score}")
    print(f"Content: {doc.page_content}")
    print(f"Metadata: {doc.metadata}

“)

Note:
  • The method returns scores along with documents. Lower scores indicate

higher similarity. - The actual search is performed using the vector representation of the

query, which is why an embedding function must be provided during initialization.

  • The filter parameter allows for additional filtering of results based on metadata.

  • If return_all is set to True, all fields stored in Redis will be returned, which may include non-indexed fields.

Parameters:
  • query (str)

  • k (int)

  • filter (FilterExpression | None)

  • sort_by (str | None)

  • kwargs (Any)

Return type:

Sequence[Any]

similarity_search_with_score_by_vector(embedding: List[float], k: int = 4, filter: FilterExpression | None = None, sort_by: str | None = None, **kwargs: Any) Sequence[Any][source]#

Return docs most similar to embedding vector.

Parameters:
  • embedding (List[float]) – Embedding to look up documents similar to.

  • k (int) – Number of Documents to return. Defaults to 4.

  • filter (FilterExpression | None) – Optional filter expression to apply.

  • sort_by (str | None) – Optional sort_by expression to apply.

  • **kwargs (Any) – Other keyword arguments: with_vectors: Whether to return document vectors. Defaults to False. return_metadata: Whether to return metadata. Defaults to True. distance_threshold: Optional distance threshold for filtering results.

Returns:

List of tuples of Documents most similar to the query vector, score, and optionally the document vector.

Return type:

Sequence[Any]