Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add PTC data #158

Merged
merged 12 commits into from
Dec 3, 2024
2 changes: 1 addition & 1 deletion .github/workflows/black_check.yml
Original file line number Diff line number Diff line change
Expand Up @@ -23,4 +23,4 @@ jobs:
pip install black==24.10.0
- name: Run Black
run: |
black --check .
black --check --target-version py39 .
4 changes: 2 additions & 2 deletions .github/workflows/isort_check.yml
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
name: Black
name: Isort

on:
push:
Expand All @@ -24,4 +24,4 @@ jobs:
- name: Isort check
run: |
isort --check --diff factgenie || echo "We know it fails ATM"
isort --check --diff tests
isort --check --diff tests
18 changes: 18 additions & 0 deletions factgenie/config/resources.yml
Original file line number Diff line number Diff line change
Expand Up @@ -122,3 +122,21 @@ xsum-debug:
source: https://github.com/EdinburghNLP/XSum
splits:
- test
propaganda-techniques:
name: propaganda-techniques
class: propaganda_techniques.PropagandaTechniques
annotations:
- PTC
description: 'The Propaganda Techniques Corpus (PTC) is a corpus of propagandistic techniques at annotated by 6 pro-annotators in spans.
The corpus includes 451 articles (350k tokens) from 48 news outlets.
The corpus accompanied paper [Fine-Grained Analysis of Propaganda in News Article](https://aclanthology.org/D19-1565/).
The labels are:
Loaded Language, Name Calling&Labeling, Repetition, Exaggeration&Minimization, Doubt, Appeal to fear-prejudice, Flag-Waving, Causal Oversimplification, Slogans, Appeal to Authority, Black-and-White Fallacy, Thought-terminating Cliches, Whataboutism, Reductio ad Hitlerum, Red Herring, Bandwagon, Obfuscation&Intentional&Vagueness&Confusion, Straw Men'
data-link: https://owncloud.cesnet.cz/index.php/s/e5OKV8XF9LaQATj/download
source: https://web.archive.org/web/20210922205158/https://propaganda.qcri.org/data/fine_grained_propaganda_code.zip
outputs:
- identical
splits:
- train
- dev
- test
1 change: 1 addition & 0 deletions factgenie/data/datasets_TEMPLATE.yml
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
# ---

# If you want to start adding local datasets manually, rename this template file to `datasets.yml` and modify the example record.
# See the factgenie/data/resources.yml or the /manage/ page after you started factgenie server in order to see out-of-box supported datasets.

# A dataset record looks like the following:
example-dataset-id:
Expand Down
Loading
Loading