Visualization 2 Data Representation 1
Visualization 2 Data Representation 1
Data Visualization
Data Representation
Y. Raymond Fu
Professor
Electrical and Computer Engineering (ECE), COE
College of Computer and Information Science (CCIS)
Northeastern University
Data Visualization
exists everywhere!
http://benfry.com/zipdecode/
0****
02***
021**
Example 2: Visualizing a Music
Viva la Vida
(MUSIC VIDEO)
Mp3 wma rm …
https://www.youtube.com/watch?v=-bdKMT1znJ0
Viva la Vida (MUSIC VIDEO)
https://www.youtube.com/watch?v=LSNyWGc0gsU
Example 3: Visualizing 200 countries, 200 years
https://www.youtube.com/watch?v=jbkSRLYSojo
Example 4: Visualizing TV Series
https://www.youtube.com/watch?v=7DemM7UGmIg
Visualization Pipeline
Collection Data table can be Rendering components
• a table of figures, network, graphs draw the contents of the
• a social network graph, and trees visual abstraction into any
• a file directory structure, number of interactive
• or any other data set. views
mouse and
keyboard
feedback
varying
• a data model that
Internal representations perspectives
includes visual
• reading in the data from a panning and
features such as
formatted file or database, zooming,
spatial layout, color,
• could potentially involve snapshots
size, and shape
transformations.
• Information to
draw
http://prefuse.org/doc/manual/introduction/structure/
Seven Stages of Visualizing Data
1. Input data
• your primary “raw” source of information
• can be anything (measurements, simulations, databases, …)
2. Formatted data
• converted to points, cells, attributes (discussed next in this module)
• Ready to use for visualization algorithms
3. Filtered data
• eliminates the unneeded data, adds the needed information
• read and written by visualization algorithms
4. Spatial (mapped) data
• has spatial embedding can be drawn
5. 2D Image
• final image you look at to get your answers
http://www.mbta.com/schedules_and_maps/subway/
Two Fundamental Forms of Data
--Bertin (1977)
Information Visualization (perception for design) (2nd Edition), Colin Ware, Elsevier Press.
50 Great Examples of Data Visualization
http://www.webdesignerdepot.com/2009/06/50-great-examples-of-data-visualization/
50 Great Examples of Data Visualization
http://www.webdesignerdepot.com/2009/06/50-great-examples-of-data-visualization/
Entity–relationship model ---software engineering
Wikipedia
http://en.wikipedia.org/wiki/Entity%E2%80%93relationship_model
Attribute Quality - Four Measurements
--S.S. Stevens (1946)
Information Visualization (perception for design) (2nd Edition), Colin Ware, Elsevier Press.
Typical Data Classes in Visualization
Information Visualization (perception for design) (2nd Edition), Colin Ware, Elsevier Press.
Attribute Dimensions and Orders
• Dimensions
– 1D: scalar
– 2D: two-dimensional vector
– 3D: three-dimensional vector
– >3D: multi-dimensional vector
• Orders
– scalars
– vectors
– tensors (high-order) 1-st order 2-nd order
An attribute of an entity can have multiple dimensions.
1D--We can have a single scalar quantity, such as the weight of a
person.
2D/3D-- We can have a vector quantity, such as the direction in which
that person is traveling. vector matrix
Information Visualization (perception for design) (2nd Edition), Colin Ware, Elsevier Press.
Earth's gravity field as captured by the GOCE satellite gives an unprecedented view of how the force acts on our planet. The
differences in gravitational force are represented using colours that show -100 metres up to 100 metres.
Tensors are higher-order quantities that describe both direction and shear forces.
The gravitational field of the Earth is a three-dimensional attribute of the Earth. In fact, it is a three-dimensional
vector field attribute. If we are interested only in the strength of gravity at the Earth’s surface, it is a two-
dimensional scalar field attribute.
Extraordinary map reveals Earth‘s gravity field for the first time, By DAILY MAIL REPORTER
Data Table
www.many-eyes.com
Courtesy of Prof. Hanspeter Pfister, Harvard University.
Common Data Operations
• Mathematical operations (multiplication, division, etc.)
• Merging two lists to create a longer list
• Inverting a value to create its opposite
• Bringing an entity or relationship into existence (such as the
mean of a set of numbers)
• Deleting an entity or relationship (a marriage breaks up)
• Transforming an entity in some way (the chrysalis turns into a
butterfly)
• Forming a new object out of other objects (a pie is baked from
apples and pastry)
• Splitting a single entity into its component parts (a machine is
disassembled)
Information Visualization (perception for design) (2nd Edition), Colin Ware, Elsevier Press.
Metadata
• Metadata is "data about other data", of any sort in any media.
• Describe an individual datum, or content item, or a collection
of data with multiple content items and hierarchical levels.
• About the context, quality and condition, or characteristics
– Data elements or attributes, (name, size, data type, etc.)
– Records or data structures (length, fields, columns, etc.)
– About data (location, how it is associated, ownership, etc.)
• Example: A digital audio file, such as an MP3 of a song, might
include the album name, song title, genre, year, composer,
contributing artist, track number and album art.
http://en.wikipedia.org/wiki/Metadata
Quantities of Data
• Intrinsically Continuous and Intrinsically Discrete
• Continuous: pressure, temperature, position, speed, density,
force, color, light intensity, and electromagnetic radiation.
• Discrete: hypermedia, e.g. text and image, contents of web
pages, software source code, plain text, database records.
• Continuous data are often manipulated in finite approximated
form— Sampled Data.
• Sampled data are also discrete with finite set of elements.
• Difference
– Sampled data have corresponding continuous
approximation of the original intrinsically continuous data
– Intrinsically discrete data has no counterpart in the
continuous world
Sampling
(data importing) Reconstruction
Sampling Reconstruction
Basic idea
•Map each scalar value f ÎR at a point to a color via a function c : [0,1] [0,1]3
Color tables
•precompute (sample) c and save results into a table {ci }i=1.. N
•index table by normalized scalar values
f - fmin
normalize input to [0,N] i=N
fmax - fmin
æiö
Color mapping ci = c ç ÷ c :[0,1] ®[0,1]3
èNø desired color
color table transfer function
f is color-mapped to ci
blue=0 red=100
p answer: s = 90
Simple to implement
(see Sec. 5.2)
Explanation
•per-vertex: f c( f ) interpolation(c( f )) color interpolation can fall outside colormap!
•per-pixel: f interpolation( f ) c(interpolation( f )) colors always stay in colormap
color banding
Question
•can we see sharp color banding
with per-vertex colormapping?
Why (not)?
57
Courtesy of Zhiqiang Tao
Representation learning by neural networks and labels
recurrent neural
Encoder
Time-series network (RNN)
graph convolutional
Graph-structured network (GCN)
Representation Kinetics (650K of 700 classes) Visual Genome (relationship)
Different Data
s
Parameterize encoder with different Large-scale dataset with human annotations
neural networks to handle data variety
recurrent neural
Encoder Decoder recurrent neural
Time-series network (RNN) network (RNN)
Time-series
59
Courtesy of Zhiqiang Tao
Reconstruction
60
Generation
Translation
Projection
Approximation
high-jump
basketball
volleyball
3. Visual Understanding
fake
Inner Product decoder
convolutional neural
recurrent neural
network (RNN)
network (CNN)
multi-layer
perceptron
(MLP)
Higher-level information
with different higher-level guidance
Layer.
couple.
…
Double-click the black color stop below the
gradient ramp, set the color to #ab82bc, and
click OK to close the Color Picker.
…
Then, with your mouse button still held down,
drag straight up to the very top: Click and drag
out the transition area for the gradient.
For our example, we are using a color gradient
Φ ℎ
partitions
Encoder
→
Regularizations
raw features
convolutional neural
graph convolutional
recurrent neural
network (RNN)
network (CNN)
network (GCN)
multi-layer
adversarial training
perceptron
(MLP)
KL-divergence
Different Data
Images / Videos
Raw features
nuclear norm
Time-series
ℓ0 , ℓ1 norm
low-rank
sparse