As a data scientist you are required to analyse the political landscape of Europe using the Chapel Hill Expert Survery dataset. The dataset provides insights into the positioning of 277 political parties in Europe based on 55 different attributes. The dataset can be downloaded here and the codebook provides further information on the survey attributes.
This repository contains the necessary setup and codebase to help guide you in performing an analysis using different statistical methods.
Please make sure you have the following software installed:
- Python (3.9 - 3.12)
- Poetry
Poetry is used for Python dependency management. To install the necessary Python dependencies run the following command:
poetry install
Alternatively, the make
command defined in the Makefile can also be used:
make install
Similarly to add and install new Python packages to your Poetry virtual environment, use:
poetry add <python-package-name>
The unit tests, linting checks, and type checks can be run either by using the make
commands (given in the Makefile) or by using the corresponding commands for the respective packages. For example, unit tests can be executed using:
make test
or
poetry run pytest tests
For running linting checks using flake8, use:
poetry run flake8 src tests
or
make lint-check
Please be sure to complete the below tasks before the pairing session:
- Get a high-level understanding of the dataset by looking into the codebook and if necessary, downloading and inspecting the dataset.
- Have your preferred text editor or IDE setup and ready to go.
- Have your coding environment ready by installing Python and Poetry.
- Ensure that you are able to run all commands mentioned in this README (note that a failing test from pytest is expected).
Please note that you DO NOT have to complete the code/tasks inside the src/
folder. These are intended to be completed collaboratively during the pairing session.