LLM-Evaluator

This is a pipeline to run different LLM evaluators from DeepEval on a set of customer service enquiries to see how well a RAG pipeline performs on them.

The model used as the judge LLM is Azure OpenAI GPT4o and the RAG pipeline to generate answers is from: https://github.com/yjching/Banking-RAG-Chatbot-Demo

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
.gitignore		.gitignore
README.md		README.md
_process_output_csv_from_id.py		_process_output_csv_from_id.py
evaluate_evalset.py		evaluate_evalset.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM-Evaluator

About

Releases

Packages

Contributors 2

Languages

yjching/LLM-Evaluator

Folders and files

Latest commit

History

Repository files navigation

LLM-Evaluator

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages