add graphs and update readme
davidzhu27 committed Jun 12, 2024
1 parent c52e3fa commit 03549f1
Showing 5 changed files with 26 additions and 2 deletions.
28 changes: 26 additions & 2 deletions README.md
@@ -1,6 +1,6 @@
# Optimal Reward Labeling

-This repository is based heavily on the [CORL library](https://github.com/tinkoff-ai/CORL).
+This repository is based on the [CORL library](https://github.com/tinkoff-ai/CORL).

## Installation

@@ -14,6 +14,7 @@ pip install -r requirements/requirements_dev.txt
```

## Initialize Wandb

Initialize Wandb by running the following command inside the folder:

```bash
@@ -24,7 +25,8 @@ Follow the prompts to create a new project or connect to an existing one. Make s

For more information on how to use Wandb, refer to the [Wandb documentation](https://docs.wandb.ai/).

-## Run example scripts
+## Run Example

Run the sample Python command. Make sure you have the necessary dependencies installed and the Python environment properly configured.

```bash
@@ -49,3 +51,25 @@ Execute these scripts in your terminal:
```bash
. abl.sh
```

## Experiment Results

### Main Experiments

This graph compares four reward labeling methods: Oracle True Reward, ORL, Latent Reward Model, and IPL with True Reward.

![Graph 1](results/graphs/main_exp.png)

### Ablation Studies

This graph shows how dataset size affects the performance of the reward labeling method.

![Graph 2](results/graphs/size.png)

This graph shows the performance of the reward labeling method when it is paired with different offline RL algorithms.

![Graph 3](results/graphs/algo.png)

This graph shows how drawing multiple Bernoulli samples to generate preference labels affects the performance of the reward labeling method.

![Graph 4](results/graphs/bernoulli.png)
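
For readers unfamiliar with the idea, the snippet below is a minimal sketch of what drawing several Bernoulli preference labels for one pair of trajectory segments could look like. It is not taken from this repository: the function name `sample_preference_labels`, its parameters, and the logistic (Bradley-Terry style) preference probability are all assumptions made for illustration.

```python
import math
import random


def sample_preference_labels(return_a, return_b, num_samples, rng):
    """Draw `num_samples` Bernoulli preference labels for one segment pair.

    Assumption (hypothetical, not from this repository): the probability that
    segment A is preferred is a logistic function of the return difference.
    """
    p_a = 1.0 / (1.0 + math.exp(-(return_a - return_b)))
    # Each label is 1 if segment A is preferred in that draw, otherwise 0.
    return [1 if rng.random() < p_a else 0 for _ in range(num_samples)]


rng = random.Random(0)  # fixed seed so repeated runs give the same labels
labels = sample_preference_labels(return_a=1.5, return_b=1.0, num_samples=5, rng=rng)
print(labels)
```

Under these assumptions, increasing `num_samples` yields more labels per pair, so their average tracks the underlying preference probability more closely.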
Binary file added results/graphs/algo.png
Binary file added results/graphs/bernoulli.png
Binary file added results/graphs/main_exp.png
Binary file added results/graphs/size.png
