Cookbook
This cookbook, inspired by OpenAI's cookbook, is a collection of recipes for common use cases of Braintrust. Each recipe is an open source self-contained example, hosted on GitHub. We welcome community contributions and aspire for the cookbook to be a collaborative, living, breathing collection of best practices for building high quality AI products.
typescript
Evaluating a chat assistant

Tara Nagar
Jul 16, 2024evalschat
python
LLM Eval For Text2SQL

Ankur Goyal
May 29, 2024evalsdatasetstext2sql
python
Optimizing Ragas to evaluate a RAG pipeline

Ankur Goyal, Nelson Auner
May 27, 2024evalsrag
typescript
Comparing evals across multiple AI models

John Huang
May 22, 2024evalscharts
python
Detecting Prompt Injections
Nelson Auner
May 20, 2024evalsclassification
python
AI Search Bar

Austin Moehle
Mar 4, 2024evalssql
typescript
How Zapier uses assertions to evaluate tool usage in chatbots

Vítor Balocco
Feb 13, 2024evalsassertionstools
typescript
Generating release notes and hill-climbing to improve them

Ankur Goyal
Feb 2, 2024evalshill-climbing
typescript
Generating beautiful HTML components

Ankur Goyal
Jan 29, 2024loggingdatasetsevals
python
Coda's Help Desk with and without RAG


Austin Moehle, Kenny Wong
Dec 21, 2023evalsrag
typescript
Improving Github issue titles using their contents

Ankur Goyal
Oct 29, 2023evalssummarization
python
Classifying news articles

David Song
Sep 1, 2023evalsclassification
python
Text-to-SQL

Ankur Goyal
Aug 12, 2023evalssql