Leader: Dr Wei Shao
Associated Leader: Yufan Kang, Dr Nan Gao
Team Member: Zhiqiu Chen, Ying Guo, Lanzhou Jiang, Junwei Ye, Haoran Ma, Zhengyuan Xu, Ziyan Peng
These visualisation codes show visualization modules by Seaborn and Matplotlib of Python. The codes were written by Cruise Group of RMIT University. Please refer to the following contents for details. If you're not into reading lenghty explanations at all, feel free to jump right into the commented code.
If you want to run the codes, you need to install some other Python libraries in addition to Seaborn and Matplotlib, which have been marked in the codes section.
The latest stable release (and required dependencies) can be installed from PyPI:
pip install paperviz
You may instead want to use the development version from Github:
pip install git+https://github.com/swsamleo/Paper-Viz.git
Paper-Viz is also available from Anaconda and can be installed with conda:
conda install paperviz
Paper-Viz development takes place on Github: https://github.com/swsamleo/Paper-Viz
By coloring the area between the axis and the line, the area map not only emphasizes the peaks and valleys, but also emphasizes the duration of the high and low points. The longer the high point lasts, the larger the area under the line. The stacked area plot can show the total value and item values, the percentage area chart can show the percentage of each item.
- Multi-item area chart
- Stacked area chart
- Percentange area chart
import paperviz
model=paperviz.Area(path=paperviz.__path__[0])
model.plot()
Configuration:
- file: file name of your data source
- x_col_name: ['x_column_name_a','x_column_name_b'...]
- paper_type : 'single' or 'double'
A bar chart is a classic way to visualise items based on counting or any given indicator.
- Single item bar chart
- Multi-item grouped bar chart
- Error bar chart with two groups
- Stacked bar chart with two groups
A bubble chart is a variation of a scatter chart in which the data points are replaced with bubbles, and an additional dimension of the data is represented in the size of the bubbles. Just like a scatter chart, a bubble chart does not use a category axis — both horizontal and vertical axes are value axes.
- Bubble chart with each bubble name and sizes
- Multi-item bubble plot with categorical colors
- Bubble plot with a color map
import paperviz
model=paperviz.Bubble(path=paperviz.__path__[0])
model.bubble_plot()
Configuration:
- file: file name of your data source
- x_col_name: The X column name
- y_col_name: The Y column name
- x_label: the content of x label
- y_label: the content of y label
A scatter plot is a basic chart used to study the relationship between two variables. If you have multiple groups in your data, you might want to visualize each group in a different color.
- Single item scatter plot with trend line
- Multi-item scatter plot
- Scatter plot with multi-mark
import paperviz
model=paperviz.Scat(path=paperviz.path[0])
model.plot()
Configuration:
- file: file name of your data source
- x_col_name: The X column name
- y_col_name: The Y column name
- x_label: x axis label
- y_label: y axis label
Maps have been used for a long time to help people navigate on road journeys or comprehend the closeness of one site to another. The material on maps, as well as the maps themselves, have become digitized, interactive, and more attractive as they have been integrated into data analysis and reporting. Seeing location data mapped and integrated in visualizations has increased comprehension and provided a useful, fresh context for more people.
- Point Map
- KDE(Kernel density estimation) Map
- Point and KDE(Kernel density estimation) Map
- Sankey Map
- Choropleth Map
- Route Map
A histogram is a graphical display of data using bars of different heights. In a histogram, each bar groups numbers into ranges. Taller bars show that more data falls in that range. A histogram displays the shape and spread of continuous sample data. The height can mean density or actual value.
- Single item histogram with a density curve
- Multi-item histogram which height means actual value
- Stacked histogram
import paperviz
model=paperviz.Hist(path=paperviz.path[0])
model.plot()
Configuration:
- file: file name of your data source
- x_col_name: The X column name
- paper_type : 'single' or 'double'
A heatmap is a graphical representation of data where the individual values contained in a matrix are represented as colors.
- Heatmap with values in blue to yellow colours
import paperviz
model=paperviz.Heat(path=paperviz.path[0])
model.plot()
Configuration:
- file: file name of your data source
- paper_type : 'single' or 'double'
Time series plots are used to show how a given metric changes over time.
- Time series plot with maximum, minimum values
- Time series plot with peaks and troughs
- Time series plot with multivariables
- Time series plot with multiple plots
- Time series plot with regional zoom-in
import paperviz
model=paperviz.Time(path=paperviz.path[0])
model.plot()
Configuration:
- file: file name of your data source
- paper_type : 'single' or 'double'
- x_col_name: name of the date variable
- y_col_name: list of names for y variable
- start_time: the start date on the timeline
- end_time: the end date on the timeline
A line chart or line plot or line graph or curve chart is a type of chart which displays information as a series of data points called 'markers' connected by straight line segments. It is a basic type of chart common in many fields.
- Multi-line chart
- Multi-item line chart with categorical colours and marks
- Double axis line chart
- Insetplot line chart
import paperviz
model=paperviz.Line(path=paperviz.__path__[0])
model.plot()
Configuration:
- file: file name of your data source
- x_col_name: ['index'] or ['x_column_name_a','x_column_name_b'...]
- y_col_name: ['y_column_name_a','y_column_name_b'...]
- x_label: x axis label
- y_label: y axis label
- legend_label: legend labels names
- paper_type: 'single' or 'double'
A pie chart is the classic way of displaying composition. However, it is generally not recommended now because sometimes the area of the pie would be misleading. Therefore, if you want to use a pie chart, it is highly recommended to explicitly note the percentage or number of each part in the pie chart.
- Pie chart with outside annotation
- Pie chart with percentage marked on and changable radius and font size
- Pie chart with some labels exploration
- Multi-layer pie chart
import paperviz
model=paperviz.Time(path=paperviz.path[0])
model.plot()
Configuration:
- file: file name of your data source
- paper_type: 'single' or 'double'
The pyramid can be used to show the distribution of groups sorted by number, or it can be used to show the step-by-step filtering of the population.
- Pyramid with actual value annotation
- Pyramid with sort functions
import paperviz
model=paperviz.Pie(path=paperviz.path[0])
model.plot()
Configuration:
- file: file name of your data source
- x_col_name: ['index'] or ['x_column_name_a','x_column_name_b'...]
- y_col_name: ['y_column_name_a','y_column_name_b'...]
- paper_type: 'single' or 'double'
A box plot or boxplot is a method for graphically depicting groups of numerical data through their quartiles. Displays the five-number summary of a set of data. The five-number summary is the minimum, first quartile, median, third quartile, and maximum.
- Multi-item box plot
- Multi-item box plot with annotation
import paperviz
model=paperviz.Bar(path=paperviz.__path__[0])
model.plot()
Configuration:
- file: file name of your data source
- x_col_name: ['x_column_name_a','x_column_name_b'...]
- paper_type : 'single' or 'double'
- x_label: the content of x label
- y_label: the content of y label
A radar chart is a graphical method of displaying multivariate data in the form of a two-dimensional chart of three or more quantitative variables represented on axes starting from the same point. The relative position and angle of the axes is typically uninformative.
- Radar chart with single groups
- Radar chart with multiple groups
- Spider chart with multiple groups
import paperviz
model=paperviz.Radar(path=paperviz.path[0])
model.plot()
Configuration:
- file: file name of your data source, it support csv/excel,text.for example:'new_file.csv'
- index_col: The index column. Normally it should be the name of the content
- category_col: input list of attributes for the radar plot
- y_col_name: input list of items
- y_lim: The Y lim value
A violin plot is a method of plotting numeric data. It is similar to a box plot, with the addition of a rotated kernel density plot on each side.
- Violin plot
- category violin plot
import paperviz
model=paperviz.Radar(path=paperviz.path[0])
model.plot()
Configuration:
- file: file name of your data source
- x_col_name: ['x_column_name_a','x_column_name_b'...]
- paper_type : 'single' or 'double'