TruLens
TruLens is an open-source package that provides instrumentation and evaluation tools for large language model (LLM) based applications.
This page covers how to use TruLens to evaluate and track LLM apps built on langchain.
Installation and Setup
Install the trulens-eval
python package.
pip install trulens-eval
Quickstart
See the integration details in the TruLens documentation.
Tracking
Once you've created your LLM chain, you can use TruLens for evaluation and tracking. TruLens has a number of out-of-the-box Feedback Functions, and is also an extensible framework for LLM evaluation.
Create the feedback functions:
from trulens_eval.feedback import Feedback, Huggingface,
# Initialize HuggingFace-based feedback function collection class:
hugs = Huggingface()
openai = OpenAI()
# Define a language match feedback function using HuggingFace.
lang_match = Feedback(hugs.language_match).on_input_output()
# By default this will check language match on the main app input and main app
# output.
# Question/answer relevance between overall question and answer.
qa_relevance = Feedback(openai.relevance).on_input_output()
# By default this will evaluate feedback on main app input and main app output.
# Toxicity of input
toxicity = Feedback(openai.toxicity).on_input()