Skip to content

Commit

Permalink
Cleaned env.sh
Browse files Browse the repository at this point in the history
  • Loading branch information
Adyasha Maharana committed Aug 12, 2024
1 parent a9f0a46 commit 425a79a
Show file tree
Hide file tree
Showing 18 changed files with 108,905 additions and 102 deletions.
31 changes: 24 additions & 7 deletions README.MD
Original file line number Diff line number Diff line change
Expand Up @@ -30,7 +30,27 @@ The dataset can be found in the ```./data/locomo10.json``` file in this reposito

Configuration variables like API keys, output directories etc. are set in `scripts/env.sh` and run at the beginning of all other scripts.

### 1. Evaluate open-source and closed-source LLMs on the Question Answering Task with the (truncated) conversation as context
### Generate *very* long-term conversations between two LLM-agents with pre-assigned personalities using our LLM-based generative framework
The code to generate conversations is available in `scripts/generate_conversations.sh` and be run as follows:
```
bash scripts/generate_conversations.sh
```

This code can be run under two settings:
1. Generate conversations between agents assigned with custom personas. To enable this setting, point `--out-dir` to a directory containing the files `agent_a.json` and `agent_b.json`. These files should contain the `name` and `persona_summary` of the speaker represented by the agent. See an example at `data/multimodal_dialog/example`.

```
{
"name": "Angela",
"persona_summary": "Angela is a 31 year old woman who works as the manager of a gift shop in Chapel Hill. She curates interesting pieces from local artists and has maintained a beautiful gallery in the form of the gift shop. She also makes her own art sometimes, in the form of oil paintings."
}
```

2. Create personalities using prompts from the MSC dataset. To enable this setting, point `--out-dir` to an empty directory. This will make the script sample a pair of personalities from `data/msc_personas_all.json`.

See `scripts/generate_conversations.py` for details on the various parameters that can be tweaked for generating the conversations.

### Evaluate open-source and closed-source LLMs on the LoCoMo Question Answering Task with the (truncated) conversation as context

* Evaluate OpenAI models
```
Expand All @@ -52,7 +72,7 @@ bash scripts/evaluate_gemini.sh
bash scripts/evaluate_hf_llm.sh
```

### 2. Generate observations and session summaries from LoCoMo conversations using `gpt-3.5-turbo` for evaluating RAG-based models
### Generate observations and session summaries from LoCoMo conversations using `gpt-3.5-turbo` for evaluating RAG-based models
We provide the observations and summaries with our release of the LoCoMo dataset. Follow these instructions to re-generate the same or for a different set of conversations.

* Generate observations from all sessions:
Expand All @@ -67,23 +87,20 @@ bash scripts/generate_session_summaries.sh
**Note 3**: Session-summaries are different from the event summaries of the event summarization task. The former summairze only a single session whereas event summaries are specific to each speaker and contain causal, temporal connections across sessions.


### 3. Evaluate retrieval-augmented `gpt-3.5-turbo` on the question-answering task using (a) dialogs, (b) observations and (c) session summaries as databases.
### Evaluate retrieval-augmented `gpt-3.5-turbo` on the LoCoMo question-answering task using (a) dialogs, (b) observations and (c) session summaries as databases.
* Evaluate `gpt-3.5-turbo` using retrieval-based augmentation
```
bash scripts/evaluate_rag_gpts.sh
```

### 4. Evaluate models on the event summarization task
### Evaluate models on the event summarization task

Coming soon!

### 5. Train and evaluate models on the multimodal dialog generation task

Coming soon!

### 6. Generate synthetic conversations using our LLM-based agent pipeline

Coming soon!

### Reference
Please cite our paper if you use LoCoMo in your works:
Expand Down
Binary file modified __pycache__/global_methods.cpython-39.pyc
Binary file not shown.
Loading

0 comments on commit 425a79a

Please sign in to comment.