Skip to content

Commit

Permalink
Work on comparison with zeroshot
Browse files Browse the repository at this point in the history
  • Loading branch information
wilkox committed Aug 29, 2023
1 parent 96b2054 commit 61a38be
Showing 1 changed file with 23 additions and 2 deletions.
25 changes: 23 additions & 2 deletions validation/summary/compare_zeroshot.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -64,15 +64,36 @@ results %>%
mutate(percentage = n / sum(n))
```

Select cases where zero-shot got it wrong, but chain-of-thoughts got it right.
Select cases where zero-shot got it right, but chain-of-thoughts got it wrong.

```{r}
cot_failures <- results %>%
filter(! GPT_includes_cot == human_includes) %>%
filter(GPT_includes_zs == human_includes)
```

Generate contingency table for chain-of-thoughts failures.

```{r}
cot_failures %>%
mutate(result = case_when(
human_includes & GPT_includes_cot ~ "True positive",
human_includes & ! GPT_includes_cot ~ "False negative",
! human_includes & GPT_includes_cot ~ "False positive",
! human_includes & ! GPT_includes_cot ~ "True negative")) %>%
count(result) %>%
mutate(percentage = n / sum(n))
```

Select cases where chain-of-thoughts got it right, but zero-shot got it wrong.

```{r}
zs_failures <- results %>%
filter(GPT_includes_cot == human_includes) %>%
filter(! GPT_includes_zs == human_includes)
```

Generate contingency table for zero-shot in these cases.
Generate contingency table for zero-shot failures.

```{r}
zs_failures %>%
Expand Down

0 comments on commit 61a38be

Please sign in to comment.