0% found this document useful (0 votes)
8 views

Data Visualization notes

This document introduces data visualization, defining it as the graphical representation of information to enhance understanding of trends and patterns. It covers the history of visualization, its importance, the visualization process, and its relationship with computer graphics and other fields. Additionally, it outlines the visualization pipeline, knowledge discovery pipeline, and conventions for writing pseudocode in data visualization.

Uploaded by

saradha.r
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
8 views

Data Visualization notes

This document introduces data visualization, defining it as the graphical representation of information to enhance understanding of trends and patterns. It covers the history of visualization, its importance, the visualization process, and its relationship with computer graphics and other fields. Additionally, it outlines the visualization pipeline, knowledge discovery pipeline, and conventions for writing pseudocode in data visualization.

Uploaded by

saradha.r
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 8

Data Visualization

UNIT – I

1.1 INTRODUCTION

This chapter provides a high – level introduction to data and information visualization. What
visualizations are, why imagery is so important. It shows how visualizations are applied to
problem solving and discusses the process of visualization.

What is Visualization?

Visualization is defined as the communication of information using graphical


representations. Pictures have been used as a mechanism for communication since before
the formalization of written language. A picture speaks thousand words. Pictures are
independent of local language, as a graph or map may be understood by group of people with
no common tongue.

Definition: Data visualization is the graphical representation of information and data. By


using visual elements like charts, graphs, and maps, data visualization tools provide an
accessible way to see and understand trends, outliers, and patterns in data. Additionally, it
provides an excellent way for employees or business owners to present data to non-technical
audiences without confusion.

Examples of Visualization in Everyday Life

 Table in newspaper
 Train and subway map with arrival timings
 Map of a region
 Weather chart
 Graph of stock market etc.,

Why is Visualization important?

There are many reasons why visualization is important. Perhaps the most obvious reason is
that

 human beings use eyes as one of our key senses for information understanding.
 Visualization is real and highlights the needs for testing user interpretation of
visualizations in specific decision-making processes.
 In larger applications, visualization provide alternative data views and help describe
the data structure, pattern or anomaly in data.
1.2 HISTORY OF VISUALIZATION

The history of visualization is explained below.

1.2.1 Early Visualizations:


 First technique for graphically recording and presenting information used by early
man.
 Example: Chauvet-Pont’dArc Cave in Southern France contains 250 paintings
created 30,000 years ago.
 Oldest writing systems used pictures to encode symbols and whole words. Such
systems are called logograms.
 The Kish Limestone tablet is considered the earliest written document from
Mesopotamia.
 Another early writing system which came from ancient Egyptians, is called
Hieroglyphics.
 Hieroglyphs are divided in to three categories- Logograms, Phonograms and
Determinatives.
 Hieroglyphics Logograms are signs that represent morphemes, the smallest
language unit that carries semantic interpretation. Phonograms are signs that
represent one or more sounds. Determinatives are signs that help to join logograms
and phonograms together to disambiguate the meaning of sequence of glyphs.
 Early visualizations came out of necessity to provide support for travellers planning
or survival.
 The Peutinger map or Cartogram was an early road map of 70,000 miles of
imperial highways of Roman world.
 The Hereford map is approximately 4 and a half feet calf skin map of the world that
can be seen in Cathedral at Hereford Wales. It depicts the land masses of Asia,
Europe and Africa.
 John Snow’s map of deaths resulting from cholera in London in 1854. Each bar
within the houses represents one deceased individual.
 Minard’s Napoleonic march representation was a brilliant tour-de -force, presenting
linked geographic and time-series data on static representation. This is one of the last
maps to appeal the destruction of France through war.
1.2.2 Visualization Today
 Visualization provides different levels of both qualitative and quantitative views of
information being communicated.
 Visualizations provide precise views of the data.
 Modern visualizations harness digital media.
 Visualizations provide visual representation of objects that may include data,
algorithms, results of computations, processes, user controls and numerous other
components of an applications.
 These visual representations provide information through the use of computer-
generated graphics.
 It is even possible for an application to be totally driven through its visualization.
 Examples of modern visualization application: medical reconstruction, aerospace
simulation, bioinformatics, airflow generated by a jet during take-off etc.,
 Visualizations are also used in network representations for traffic patterns, e-mail
exchanges, Facebook friend relationships and other numerous relationships.
1.3 RELATIONSHIP BETWEEN VISUALIZATION AND OTHER FIELDS.

 In all visualizations, one can clearly see the use of the graphics primitives (points,
lines, areas, and volumes). Beyond the use of graphics, the most important aspect
of all visualizations is their connection to data.
 However, visualization is more than simply computer graphics. The fieldof
visualization encompasses aspects from numerous other disciplines, including
human-computer interaction, perceptual psychology, databases, statistics, and data
mining, to name a few. While computer graphics can be used to define and
generate the displays that are used to communicate the information, the sources of
data and the way users interact and perceive the data are all important components
to understand when presenting information

 Aspect Visualization Computer Graphics


Visual representation of data to Creation and manipulation of
Definition make it easily interpretable and images and animations for realistic
meaningful for users or artistic rendering
Enhancing data comprehension and Producing visually appealing and
Primary Goal communication through visual realistic images, models, or
formats animations
Generating visual content for
Conveying insights, patterns, and
Focus simulations, entertainment, virtual
trends in data
environments
Charts, graphs, maps, dashboards, 3D modeling, rendering, texturing,
Methods
heatmaps shading, animation
Tableau, Power BI, D3.js, Blender, Maya, 3ds Max, Unity,
Tools/Software
Matplotlib OpenGL
Business intelligence, data
Video games, movies, simulations,
Applications dashboards, scientific data
virtual reality, augmented reality
representation
Interactive dashboards, infographics, Images, animations, 3D models,
Output
visual charts virtual environments

In summary, Visualization focuses on data representation for insights, while Computer


Graphics centers on creating immersive or realistic visual scenes, models, and animations.

1.4 VISUALIZATION PROCESS

Data Collection and Preparation: The process begins with gathering data from relevant
sources and preparing it for visualization. This involves handling data quality issues, such as
missing values, inconsistencies, or redundancies, through cleaning and preprocessing.
Transformations like normalization or filtering may be applied to make the data suitable for
visual representation, ensuring it is accurate and ready for analysis. Properly prepared data
forms the foundation for effective visualization.

Data Analysis and Selection: Once prepared, the data is analyzed to identify the key
variables, trends, and relationships that best address the visualization's goals. Through
statistical summaries and exploratory analysis, the data is reduced or refined to focus only on
the most relevant parts. This stage emphasizes selecting data elements that will effectively
convey insights without overwhelming the user, especially if dealing with high-dimensional
datasets.

Mapping Data to Visual Structures: Here, data is mapped to visual elements, determining
how features like position, colour, size, or shape will represent different data attributes. This
mapping depends on the data type (e.g., categorical or quantitative) and the message intended
to be conveyed. The visual encodings chosen should capitalize on human perceptual abilities
to ensure clarity and enhance users’ understanding of the data through the visualization.

Design of Interaction Techniques: This step adds interactivity to allow users to engage
directly with the visualization. Interactive features like filtering, zooming, panning, and
highlighting empower users to explore data dynamically, focusing on specific areas or
discovering relationships. Well-designed interaction supports analytical tasks and should be
intuitive, allowing users to easily manipulate the visualization to deepen their insights and
exploration.

Rendering the Visualization: This stage involves the actual implementation of the
visualization using software tools and libraries. Data transformations must be handled
efficiently, especially for real-time or large-scale visualizations, to ensure smooth rendering.
Attention to rendering performance is key to maintaining a responsive experience, with visual
elements presented clearly and in a way that enhances legibility, thus allowing the
visualization to serve as an effective communication tool.

Evaluation and Refinement: Evaluation assesses the visualization’s clarity, usability, and
effectiveness. By gathering qualitative feedback from users and quantitative measures such as
task completion time and error rates, developers can pinpoint areas for improvement. This
feedback guides iterative refinement, ensuring the visualization achieves its goals and is
intuitive for users, making it a continuous process to adapt the visualization based on user
experience.

Deployment and Maintenance: The final step is to deploy the visualization, making it
accessible to its target audience through integration in applications, websites, or dashboards.
Long-term usability requires regular updates to ensure data accuracy, compatibility with
technology, and responsiveness to user feedback. Maintenance is crucial to keep the
visualization relevant and effective, ensuring scalability and accessibility as needs evolve,
thus sustaining its impact over time.
1.4.1 The Computer Graphics Pipeline

For computer graphics the stages are as follows.

Modeling: A 3D model consisting of planar polygons defined by vertices and surface


properties, is generated using a world co-ordinate system.

Viewing: A virtual camera is defined at a location in world coordinates, along with a


direction and orientation. All vertices are transformed into viewing coordinate system based
on camera parameters.

Clipping: Clipping involves discarding any part of an object or scene that falls outside the
viewing frustum (the visible region that the camera can "see"). Objects may be transformed
into normalized viewing coordinates to simplify the clipping process. Clipping can actually
be performed at many different stages of pipeline.

Hidden Surface Removal: Polygons facing away from the camera, or those obscured by
others are removed or clipped. This process may be integrated into the projection process.

Projection: 3D polygons are projected onto the 2D plane of projection, usually using a
perspective transformation. The results may be in a normalized 2D coordinate system or
device/ screen coordinates.

Rendering: Rendering is the final step in the computer graphics pipeline, where the 3D scene
is converted into a 2D image. This process takes all elements (geometry, textures, lighting,
and shading) and calculates the colour, brightness, and effects for each pixel in the final
image. Rendering can include various techniques to simulate realistic effects, such as
shadows, reflections, and textures.

Ray tracing: A variant on this pipeline involves casting rays from the camera through the
plane of projection to ascertain what polygons are hit. Secondary rays can also be generated
upon intersection with the surface and the results accumulated. The key algorithms include
mechanism for combining the secondary ray’s effects.

1.4.2 The Visualization Pipeline


The visualization pipeline is similar to the graphics pipeline at least on abstract level. The
stages of this pipeline are as follows.

Data Modeling: The data to be visualized, whether from a file or database, has to be
structured to facilitate its visualization. The name, type, range and semantics of each attribute
or field of a data record must be available in a format that ensures rapid access and easy
modification.

Data Selection: It is similar to clipping and it involves identifying the subset of the data that
will be potentially visualized. This occurs totally under algorithmic methods, such as cycling
through time slices or automatically detecting features of potential interest to the user.

Data to visual Mappings: The heart of visualization pipeline is performing the mapping of
data values to graphical entities or their attributes. This mapping often involves processing
the data prior to mapping, such as scaling, shifting, filtering, interpolating or subsampling.

Scene parameter setting (View transformations).: Like traditional graphics, the user must
specify several attributes of visualization that are relatively independent of the data. This
involves colour map selection, sound map selection and lighting specifications.

Rendering or Generation of the Visualization: is a key stage in the visualization pipeline,


where the processed and mapped data is transformed into a visual format that users can
interpret. In this stage, all the mappings created in the previous steps (such as position, color,
size, and shape of data elements) are converted into graphical forms that can be displayed on
the screen. The goal is to generate a coherent, visually clear representation of the data that
accurately communicates the intended insights.

1.4.3 Knowledge discovery Pipeline

It is also called Data Mining. It has its own pipeline. Here also we start with data, and we
process it with goal of rendering a model rather than graphics display. The process structure
is as follows.

Data: In the KD pipeline there is more focus on data, as the graphics and visualization
process assume that data is already structured to facilitate display.
Data Integration, Cleaning, warehousing & Selection: This involves identifying the
various data sets that will be potentially analysed. Involves filtering, sampling and other
techniques that help curate the data.

Data Mining: The Heart of KD pipeline is algorithmically analysing the data to produce a
model.

Pattern Evaluation: The resulting model must be evaluated to determine their robustness,
stability, precision and accuracy.

Rendering or Visualization: The specific results must be produced to the user. Interactive
visualization can be used at every step of KD pipeline.

1.5 PSEUDO CODE CONVENTIONS

When writing pseudocode for data visualization, it is essential to follow conventions that
ensure clarity, readability, and maintainability. Here’s a list of common conventions:

 Data: The working data table is assumed to contain only numeric values. Original data
table that contains non- numeric values will be converted to numeric values. When
visualizing the working data table is assumed to be the subset.
 m: The number of dimensions in the working data table. Dimensions are typically
iterated over using j as the running dimension index.
 n- The number od records (rows) in the working table. Records are typically iterated
over using i as the running record index.
 NORMALIZE: (Record, dimension), A function that maps the value for given record
and dimension in the working data table to a value between min and max, or between
zero and one if min and max are not specified.
 COLOR: A function that sets the color state of the graphics environment to the
specified color.
 MAPCOLOR (record, dimension)- A function that sets the color state of the graphics
environment to be the color derived from applying the global color map to the
normalized value in working table.
 CIRCLE (x, y, radius): A function that fills a circle centered at the location (x,y) with
given radius.
 POLYLINE (xs, ys): A function that draws the polyline (many connected segments)
from the given rays of a and y co-ordinates.
 POLYGON (xs, ys): A function that fills the polygon defined by arrays of x and y co-
ordinates with the current color state.
For geographic visualizations, the following functions are assumed to exist.

 GETLATITUDES (record), GETLONGITUDES (record)- retrieves the arrays of


longitudes and latitudes for the record.
 PROJECTLATITUDES (lats, scale), PROJECTLONGITUDES (longs, scal)-
Functions that project arrays of latitude values to y values and longitude values to x
values.

For graph and 3D surface data sets, the following is provided.

 GETCONNECTIONS (record) – A function that retrieves an array of record


indices to which the given record is connected.
1.6 SCATTRR PLOT

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy