Title: Gender Inequality in China
Team Member: Yunyi Zhu ([email protected])
Abstract: In this project, we develop an interactive website that introduces and visualizes the gender inequality in China. It includes visualizations using multiple datasets that focus on discrimination faced by Chinese women in different life stages. For each life stage, it uses scrollytelling to display the data together with paragraphs that provides the contexts of the data. We hope that this website brings awareness to different parties on the gender inequalities faced by women living in China.
Link to Paper: https://github.com/6859-sp21/final-project-yunyi/blob/main/FinalPaper.pdf
Link to Video: https://www.youtube.com/watch?v=0M9cH8rjX0w
Link to Visualization Github Pages: https://6859-sp21.github.io/final-project-yunyi/#/
Split Work: this is a one-person project so all parts are implemented by myself.
During the data search and implementation, I made several compromises on the interactivity of the visualization. In the first sketch, there will be five subcategories: birth, education, work, marriage and inheritance. Each of the subcategories have enough non-numerical data that supports them, but they really lack numerical data.
Crawling and processing the data was harder than I estimated so I ended up using aggregated data from documents such as governmental and industrial reports. For example, I originally planned to crawl all the court pages that are related to domestic violence, classify them and create an interactive system where people can hover and see each individual case. The challenges of this plans include (1) crawling all the data, (2) classify the data and (3) translate the data to English. I crawled the data but they were all textual data, so I ended up falling back to my plan B, which is to use aggregated data.
For education and work inequalities, it was even harder because the data were nowhere to be found. Originally, I wanted to find different cut offs of the college entrance exam for men and women, but those cut offs turn out to be private. I could potentially get in touch with feminist groups that collect such info, but in that case the data can be skewed, so I put this part to future work since I will need more time to find places to get data.