diff --git a/freebase/SimpleQuestions.md b/freebase/SimpleQuestions.md index f06616f8..6394881e 100644 --- a/freebase/SimpleQuestions.md +++ b/freebase/SimpleQuestions.md @@ -27,12 +27,11 @@ datasetUrl: https://github.com/davidgolub/SimpleQA/tree/master/datasets/SimpleQu | SYGMA | 2023 | - | 42.00 | 55.00 | 44.00 | - | EN | [Badenes-Olmedo and Corcho](https://www.semantic-web-journal.net/system/files/swj3379.pdf) | | Falcon 2.0 | 2023 | - | 34.00 | 41.10 | 36.30 | - | EN | [Badenes-Olmedo and Corcho](https://www.semantic-web-journal.net/system/files/swj3379.pdf) | | KGQA-RR(GPT2) | 2023 | - | - | - | - | 23.18 | EN | [Hu et al.](https://arxiv.org/pdf/2303.10368.pdf) | -| EffiQA w/Deepseek-V2 | 2024 | 69.4 | - | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) | -| EffiQA w/ChatGPT | 2024 | 65.7 | - | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) | -| EffiQA w/GPT-4 | 2024 | 76.5 | - | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) | | Prior FT SOTA | 2024 | 85.8 | - | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) | -| Prior Prompting SOTA | 2024 | - | - | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) | +| EffiQA w/GPT-4 | 2024 | 76.5 | - | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) | +| EffiQA w/Deepseek-V2 | 2024 | 69.4 | - | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) | | Prior tight-coupling SOTA | 2024 | 66.7 | - | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) | -| SC w/ChatGPT | 2024 | 18.9 | - | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) | +| EffiQA w/ChatGPT | 2024 | 65.7 | - | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) | | CoT w/ChatGPT | 2024 | 20.3 | - | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) | -| IO prompt w/ChatGPT | 2024 | 20.0 | - | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) | \ No newline at end of file +| IO prompt w/ChatGPT | 2024 | 20.0 | - | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) | +| SC w/ChatGPT | 2024 | 18.9 | - | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) | \ No newline at end of file diff --git a/wikidata/QALD-10.md b/wikidata/QALD-10.md index e54f1fa3..ee3e6bc7 100644 --- a/wikidata/QALD-10.md +++ b/wikidata/QALD-10.md @@ -14,7 +14,6 @@ | EffiQA w/ChatGPT | 2024 | 46.2 | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) | | EffiQA w/GPT-4 | 2024 | 51.4 | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) | | Prior FT SOTA | 2024 | 45.4 | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) | -| Prior Prompting SOTA | 2024 | - | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) | | Prior tight-coupling SOTA | 2024 | 54.7 | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) | | SC w/ChatGPT | 2024 | 45.3 | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) | | CoT w/ChatGPT | 2024 | 42.9 | - | - | - | EN | [Dong et al.](https://arxiv.org/pdf/2406.01238) |