Want to objectively measure the quality and effectiveness of your LLM-based applications? In this post I discuss Trulens, the perfect tool for the job.

Engineering Leadership with a side of Quality Evangelism
Engineering manager with a passion for delivering high quality software at pace, for solving the impossible problems, and for helping individuals be the best version of themselves that they can be.
Want to objectively measure the quality and effectiveness of your LLM-based applications? In this post I discuss Trulens, the perfect tool for the job.
AI is here to stay, and if you work in Quality, you might want to learn how to test it. In this post I discuss how to leverage Playwright to test an LLM.
First in a series of posts about testing AI / Large Language models. In this first post learn how to run Llama2 locally so that you can begin your testing.