Data Visualization
Data Visualization
Q1
a) explain the consideration, remember about the chart format for webpage,
paper publication,
b) Inappropriate colour, Potential impact
c) Implement text level Lie factor
2. EV Electrical Vehicle
a) Gender age
b) Challenge
c) How moziac plot and relationship
d) Create simple moziac plot using empirical data
e) Explain different section/perception about the mosaic plot
f) Relationship between variables
g) Why use mosaic
h) What you want from mosaic
o Webpage:
▪ Use a color palette that maintains high contrast for accessibility in on-screen
viewing【18†source】.
o Paper Publication:
▪ Charts for publication should be static but of high resolution with annotations,
labels, and a clear title.
▪ Ensure that the colors are distinguishable in grayscale for compatibility with
black-and-white prints【18†source】.
• Avoid colors that lack contrast or may be interpreted differently by people with visual
impairments, such as red and green combinations. These colors can make it difficult for users
with color blindness to differentiate sections of the chart.
• Impact: Using poor color combinations could lead to misinterpretation of the data, rendering
the visualization ineffective or misleading【18†source】.
o Include descriptive labels for axes and data points to help the audience interpret the
chart more easily. Adding meaningful titles and annotations helps guide the audience
to understand the data quickly【18†source】.
2. Lie Factor:
o The Lie Factor should be close to 1 to ensure the accuracy of visual representation:
𝑆𝑖𝑧𝑒 𝑜𝑓 𝐸𝑓𝑓𝑒𝑐𝑡 𝑖𝑛 𝑉𝑖𝑠𝑢𝑎𝑙𝑖𝑧𝑎𝑡𝑖𝑜𝑛
𝐿𝑖𝑒 𝐹𝑎𝑐𝑡𝑜𝑟 = 𝑆𝑖𝑧𝑒 𝑜𝑓 𝐸𝑓𝑓𝑒𝑐𝑡 𝑖𝑛 𝐷𝑎𝑡𝑎
o For instance, using a zero baseline on a bar chart helps to avoid exaggerating the
differences between categories. A high lie factor misrepresents the data and may lead
to incorrect conclusions【18†source】.
• Pre-Attentive Attributes:
o Use visual features like color and size to draw attention to key points. Highlight major
milestones (e.g., the year with the highest EV sales) using a distinct color.
• Minimize Complexity:
o Present a simplified version of the data, focusing on key aspects such as trends or
peaks in EV sales, while avoiding excessive information that could lead to cognitive
overload【17†source】.
o X-axis: Represents the time period (e.g., years from 2015 to 2023).
o Level of Detail: Show yearly data points to avoid clutter. Highlight significant
milestones (e.g., new incentives or policies) using colored markers.
• Linked Views are a technique that links multiple charts or visualizations such that selecting
data in one view dynamically highlights corresponding data in other views【16†source】.
o Linked views help users see relationships between different attributes, such as
comparing scores in multiple subjects.
2. Interactive Exploration:
o Users can interactively filter or select students to explore their performance across
different subjects, which aids in identifying individual strengths or weaknesses.
• Chart Type: Use three bar charts, each representing a different subject (e.g., Math, Science,
English).
• Interaction Insight:
o If a student is selected in the Math chart, their scores in Science and English are
highlighted, allowing a comparison of that student’s overall performance.
• Each chart is linked, so selecting a student or filtering by a range of scores in one chart updates
the others, providing insights into performance relationships.
• Scenario: Visualize relationships between Gender, Age Group, and Income Level.
b) Challenges in Mosaic Plots:
1. Overplotting:
o Too many categories can make the mosaic plot hard to interpret.
2. Interpretability:
o Users may struggle to interpret complex mosaic plots if they are not labeled clearly.
• Mosaic plots divide data hierarchically into segments. They provide an overview of how
different categories (e.g., gender, age group) are distributed across another attribute, such as
income level【15†source】.
• Third Split: Income level (Low, Medium, High) within each age group.
• Each section of the plot represents a combination of categories (e.g., males in the 21–30 age
group with a high income), making it easier to visually compare group distributions.
• The mosaic plot helps in determining if income level distribution patterns differ significantly
between genders or age groups.
• Mosaic plots are useful for visualizing multivariate relationships and hierarchical data
distributions effectively.
• To uncover hidden patterns or outliers within the dataset by comparing distributions across
various categories.
• Defining the task aligns the visualization design with the intended goal and audience.
b) Create Abstract:
o Task Abstraction: Define general user tasks, such as comparing or filtering data.
1. Effectiveness of Encoding:
o Evaluate if the visual encodings (bar height, line slope) communicate the data
accurately without misrepresentation.
2. Ease of Interpretation:
o Determine how easy it is for users to understand the visualization, often through user
testing and feedback【20†source】.
o Assess whether the visualization is easy to interact with and maintains user
engagement.
o Maintain the Lie Factor close to 1 to ensure truthful visual representation without
distortions【18†source】.
o Ensure that the visualization design matches the audience's needs and data literacy
level, using appropriate colors, annotations, and interaction capabilities【20†source】
.
These answers have been compiled based on the content in your uploaded document and previously
provided slides to address each question comprehensively. Let me know if you need further clarification
or any specific elaboration!
d)