9512.net

# Example Parallel coordinate plot of Iris data

High Dimensional Data Visualisation: the Textile Plot
Natsuhiko KUMASAKA
PhD Student School of Fundamental Science and Technology Keio University

Ritei SHIBATA
Department of Mathematics

Building good models from data
Exploring data through visualisation
Finding outliers Clustering observations Investigating relationships between variables
Har d to

exp lo

re h

igh d

ime nsio n

al d ata

Parallel Coordinate Plots
(Inselberg 1985, Wegman 1990)

Visualising a set of points in high dimensional space Axes are placed in parallel (not right angle) Coordinates of each point are connected by segments

Example: Parallel coordinate plot of Iris data

Iris

Example: Parallel coordinate plot of Iris data

Iris

Sepal.Length

Example: Parallel coordinate plot of Iris data

Iris

Sepal.Width

Example: Parallel coordinate plot of Iris data

Iris
Petal.Length

Example: Parallel coordinate plot of Iris data

Iris
Petal.Width

One polygonal line indicates one observation

Difficult to understand any mechanism behind the data

The number of the intersections increases

Location and scale of each axis are independently chosen

All coordinate points fill up the range of the axis.

Choosing appropriate locations and scales and the order of the axes

Textile plot
(Kumasaka and Shibata, submitted) A parallel coordinate plot
Locations and scales are simultaneously chosen
All polygonal lines are aligned as horizontally as possible

Order of axes is carefully chosen
To provide a clear image of the data to the user

Any kind of data can be displayed
Numerical data Unordered categorical data Ordered categorical data Missing values

Named by analogy to a fabric
Warp and Weft

Go od

Tex tile!

Choice of locations and scales for numerical data
Data (p-dimensional n observations)

Data vector

Coordinate vector (for numerical data)

Choice of locations and scales for numerical data
Data (p-dimensional n observations)

Data vector

Coordinate vector (for numerical data)

Choice of locations and scales for numerical data
Data (p-dimensional n observations)

Data vector

Coordinate vector (for numerical data)

Choice of locations and scales for numerical data
Data (p-dimensional n observations)

Data vector

Coordinate vector (for numerical data)

Criterion
Coordinate vector

Location parameter vector Scale parameter vector Ideal coordinate vector

The sum of squared deviations is minimised

Solution of the ideal coordinate vector

Constraint

Solution of location parameter

Solution of scale parameter

Optimal choice of locations and scales

Order of axes
According to the squared distance
The further left axis is closer to the mean vector

Categorical data vector
To determine a coordinate of each level
Encoding the categorical data vector
Example Using a treatment contrast

by a set of contrasts

Coordinate vector

Choice of locations and scales for numerical and categorical data
Data Matrix
Encoded matrix for a categorical data vector with Original data vector for a numerical data vector levels

Coordinate vector Location parameter vector Scale parameter vector

Sum of squared deviations is minimised

Solution of location and scale

Categorical data on parallel coordinate plot

versicolor

virginica

setosa

Ordered categorical data
Using the specific contrast matrix

Example

Missing values
Indicator matrix reflecting missing information

Sum of squared deviations

Constraint

Design of display
Textile plot
Understanding various aspect of data Points displayed on a axis are carefully chosen Further classification of data types

Way of displaying points on a axis
Numerical data
Continuous data
Continuous line

Discrete data
Tick marks

Arrow head to show the orientation Possible minimum and maximum Ｎon-numerical data
Possible levels Ordered categorical data

Arrows
Logical

Coloured

All data
Multiplicity on the coordinate is represented by the area of the circle Missing value Label (with unit or numeral)

Textile plot of Iris data

TOPIX (Tokyo Stock Price Index) from Jan 1991 to Oct 2002

TOPIX (Tokyo Stock Price Index) from Jan 1991 to Oct 2002

Two significant features
Knot A point on a axis, where all polygonal lines are pass through Isolated data vector Parallel wefts Segments horizontally aligned between two axes Perfect linear relationship or mapping between two data vectors

Preparation
Assumption
No missing values and no ordered categorical data Normalisation

Matrix notations

Knot

Simplified condition for a knot to occur

Parallel wefts

TOPIX (Tokyo Stock Price Index) from Jan 1991 to Oct 2002

Textile plot
Visualisation for understanding data
Polygonal lines are aligned as horizontally as possible Any kind of data can be displayed Symbols for points displayed are carefully chosen Knot and Parallel wefts

Implemented on R DandDR (http://www.stat.math.keio.ac.jp/DandDIV/)
Add-on package for R Interface between DandD and R Receiving data and necessary information Creating a dad object on R
List object which consists of data and attributes Own plot method producing the textile plot

Further developments
Non-linear transformations Design enhancements
Using colour Line width and thickness

Dynamic or interactive display
Improving user interface Java Language

Reference
A. Inselberg, The plane with parallel coordinates, The Visual Computer 1 (1985) 69-91. E. Wegman, Hyperdimensional data analysis using parallel coordinates. Journal of The American Statistical Association 85 (1990) 664--675.

Example Parallel coordinate plot of Iris data_图文.pdf
Example: Parallel coordinate plot of Iris data Iris Example: Parallel coordinate plot of Iris data Iris Sepal.Length Example: Parallel coordinate plot of ...
Day1.pdf
the continuation prompt is + >plot(cars,xlab=...2 cos(4θ) + (sin(π/12))5 X-coordinate:...Example of Data frame: Get to know iris data ...
chap3_data_exploration_图文.ppt
how easy it is to understand the data Example:...#? Scatter Plot Array of Iris Attributes ? Tan...coordinate axis and the points are connected by ...
Iris recognition border-crossing system in the UAE_图文.pdf
because the polar coordinate system is ...plot showing the trade-off between False Reject ...An unlimited number of parallel IrisEngines ensure...
R An Open Source Statistical Environment_图文.ppt
of Iris versicolor Sepal.Width 2.0 2.5 3.0 3.5 ... parallel plot and coplot Given : depth Petal ... from the available data (for the example we ...
Detection of eye locations in unconstrained visual images_....pdf
appear in a gradient direction plot of the image...ow ?eld around the iris. to 0, which serve ...be spatially close in terms of the y coordinate...
An anatomy of iriscode for precise phase representation.pdf
applied to an iris image in dimensionless polar coordinate system, I(ρ, ...applications can verify, 1-to-1 match, or identify from a small database...
Starting GNUPLOT GNUPLOT Quick Reference_图文.pdf
iris4d [8 24] kc_tek40xx km_tek40xx next ...example of plotting a 3-d data example of ...The x coordinate must be speci?ed. The order ...
Data Set Editing by Ordered Projection.pdf
those based on axis-parallel classifiers are the...upper values for each coordinate of this rectangle...Results. DataBase Iris Breast Cancer Pima Diabetes...
Visualization of hidden node activity in neural networks I. ....pdf
parallel coordinate representation for the two ...For example, the plots for the ?rst class may...2 for the Iris data and the XOR data. If ...
An Extension of Parallel Coordinate Plot for Visualizing ....unkown
(Session CPS109) p.4811 An Extension of Parallel Coordinate Plot for Visualizing Aggregated Symbolic Data Yoshikazu YAMAMOTO 1,3 and Junji NAKANO 2 1 ...
Devaney Abstract.unkown
3D Glyph-based display Parallel coordinate plot PDF / PMF...This is an attribute rank plot of iris data based on the information gain...
...Context Switching and Cognition in Dual-View Coordinated ....unkown
{gconvert, jichen8, beyost, yryu, north}@vt.edu Abstract Multiple-view visualizations are useful for finding patterns in complex data sets, but little...
Construction of Line Densities for Parallel Coordinate Plots1.unkown
University Fairfax, VA 22030 ABSTRACT The graphical representation of high dimensional data may be accomplished by using the parallel coordinate plotting system...
Application Of The Parallel Coordinate Plot For Ranking Com....unkown
[1] showed that a parallel coordinate plot [2] can be used for data ...example of actual POS data analysis, which is a visualization method for ...
dissertation: Magneto-optical.unkown
plane) to p (parallel to the incidence plane)....position coordinate, each Kerr transient data point...(7 MHz), a time delay Figure 4.2: Plots ...
Iris Surface Deformation and Normalization.unkown
Iris Surface Deformation and Normalization Somying ...extension and compression parallel to the iris ...[16] where θ is a polar angle coordinate ...
Models Federico.unkown
page 49 Coordinate value for the j-th land...Fingerprints, iris and hand geometry, for example...3.1. The typical plot of the labels as a ...
Parallel Coordinate Plot for.unkown
Parallel Coordinate Plot for Original At
Categorical parallel coordinate plot.unkown
Categorical parallel coordinate plot Ret