Outline

Problem Statement

Gathering thermal comfort related data (or in general user-generated data) is costly and takes time.
Hard to generalize over a big population of people, as well as obtain a similar number of responses for each label/category (e.g. Thermal comfort labels, always a predominance in the 'comfort' class and not the rest)

Try different generative models for augment datasets
- GAN (and its variations: CGAN, WGAN, WCGAN, TGAN, TableGAN)
- Autoencoders (and the variations: Adversarial, variational)

Baseline: Original train and test set
Train set and synthetic data as training set: Should increase the performance since classes would be more balanced
Synthetic data as training set: Performance should be comparable with the baseline showing the synthetic set captures the same characteristics as the real train set

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
__pycache__		__pycache__
data		data
notebooks		notebooks
src		src
.DS_Store		.DS_Store
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md