Chatbot safety: designing safe tutoring systems

This repository contains analysis code and figures for the research paper "Safe Generative Chats in a WhatsApp Intelligent Tutoring System".

In addition, we include a reference moderation system using OpenAI's Moderation API in src/student_guardrails. This system is usable as-is, but should mostly be useful as a reference. Features:

OpenAI Moderation API interface appropriate for both human-written and LLM-generated messages, including custom, per-category moderation thresholds.
Banned word lists that supercede OpenAI moderation scores.
Email alerting system using SendGrid for messages in particular categories.
Pre-written moderation responses designed for users of the WhatsApp-based chatbot Rori.
Unit tests that provide mocked interfaces to the OpenAI Moderation API and the SendGrid API.

If this work is useful to you in any way, please cite the corresponding paper:

Zachary Levonian, Owen Henkel. Safe Generative Chats in a WhatsApp Intelligent Tutoring System. Educational Data Mining (EDM) Workshop: Leveraging Large Language Models for Next Generation Educational Technologies. 2024.

Development

Primary code contributor:

Zachary Levonian ([email protected])

Local development setup

This project uses make and Poetry to manage and install dependencies.

On Windows, you'll need to use WSL and maybe make some other changes.

Python development

Use make install to install all needed dependencies (including the pre-commit hooks and Poetry).

You'll probably need to manually add Poetry to your PATH, e.g. by updating your .bashrc (or relevant equivalent):

export PATH="$HOME/.local/bin:$PATH"

Run tests

make test

Run Jupyter Lab

make jupyter

Which really just runs poetry run jupyter lab, so feel free to customize your Jupyter experience.

Other useful commands

poetry run <command> - Run the given command, e.g. poetry run pytest invokes the tests.
poetry add <package> - Add the given package as a dependency. Use flag -G dev to add it as a development dependency.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
data		data
notebooks		notebooks
src/student_guardrails		src/student_guardrails
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Chatbot safety: designing safe tutoring systems

Development

Local development setup

Python development

Run tests

Run Jupyter Lab

Other useful commands

About

Languages

License

DigitalHarborFoundation/chatbot-safety

Folders and files

Latest commit

History

Repository files navigation

Chatbot safety: designing safe tutoring systems

Development

Local development setup

Python development

Run tests

Run Jupyter Lab

Other useful commands

About

Topics

Resources

License

Stars

Watchers

Forks

Languages