Skip to content

yjching/LLM-Evaluator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LLM-Evaluator


This is a pipeline to run different LLM evaluators from DeepEval on a set of customer service enquiries to see how well a RAG pipeline performs on them.

The model used as the judge LLM is Azure OpenAI GPT4o and the RAG pipeline to generate answers is from: https://github.com/yjching/Banking-RAG-Chatbot-Demo

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages