9512.net

甜梦文库

甜梦文库

当前位置：首页 >> >> # Example Parallel coordinate plot of Iris data

High Dimensional Data Visualisation: the Textile Plot

Natsuhiko KUMASAKA

PhD Student School of Fundamental Science and Technology Keio University

Ritei SHIBATA

Department of Mathematics

Building good models from data

Exploring data through visualisation

Finding outliers Clustering observations Investigating relationships between variables

Har d to

exp lo

re h

igh d

ime nsio n

al d ata

Parallel Coordinate Plots

(Inselberg 1985, Wegman 1990)

Visualising a set of points in high dimensional space Axes are placed in parallel (not right angle) Coordinates of each point are connected by segments

Example: Parallel coordinate plot of Iris data

Iris

Example: Parallel coordinate plot of Iris data

Iris

Sepal.Length

Example: Parallel coordinate plot of Iris data

Iris

Sepal.Width

Example: Parallel coordinate plot of Iris data

Iris

Petal.Length

Example: Parallel coordinate plot of Iris data

Iris

Petal.Width

One polygonal line indicates one observation

Difficult to understand any mechanism behind the data

The number of the intersections increases

Location and scale of each axis are independently chosen

All coordinate points fill up the range of the axis.

Choosing appropriate locations and scales and the order of the axes

Textile plot

(Kumasaka and Shibata, submitted) A parallel coordinate plot

Locations and scales are simultaneously chosen

All polygonal lines are aligned as horizontally as possible

Order of axes is carefully chosen

To provide a clear image of the data to the user

Any kind of data can be displayed

Numerical data Unordered categorical data Ordered categorical data Missing values

Named by analogy to a fabric

Warp and Weft

Go od

Tex tile!

Choice of locations and scales for numerical data

Data (p-dimensional n observations)

Data vector

Coordinate vector (for numerical data)

Choice of locations and scales for numerical data

Data (p-dimensional n observations)

Data vector

Coordinate vector (for numerical data)

Choice of locations and scales for numerical data

Data (p-dimensional n observations)

Data vector

Coordinate vector (for numerical data)

Choice of locations and scales for numerical data

Data (p-dimensional n observations)

Data vector

Coordinate vector (for numerical data)

Criterion

Coordinate vector

Location parameter vector Scale parameter vector Ideal coordinate vector

The sum of squared deviations is minimised

Solution of the ideal coordinate vector

Constraint

Solution of location parameter

Solution of scale parameter

Optimal choice of locations and scales

Order of axes

According to the squared distance

The further left axis is closer to the mean vector

Categorical data vector

To determine a coordinate of each level

Encoding the categorical data vector

Example Using a treatment contrast

by a set of contrasts

Coordinate vector

Choice of locations and scales for numerical and categorical data

Data Matrix

Encoded matrix for a categorical data vector with Original data vector for a numerical data vector levels

Coordinate vector Location parameter vector Scale parameter vector

Sum of squared deviations is minimised

Solution of location and scale

Categorical data on parallel coordinate plot

versicolor

virginica

setosa

Ordered categorical data

Using the specific contrast matrix

Additional constraints

Example

Missing values

Indicator matrix reflecting missing information

Sum of squared deviations

Constraint

Design of display

Textile plot

Understanding various aspect of data Points displayed on a axis are carefully chosen Further classification of data types

Way of displaying points on a axis

Numerical data

Continuous data

Continuous line

Discrete data

Tick marks

Arrow head to show the orientation Possible minimum and maximum Ｎon-numerical data

Possible levels Ordered categorical data

Arrows

Logical

Coloured

All data

Multiplicity on the coordinate is represented by the area of the circle Missing value Label (with unit or numeral)

Textile plot of Iris data

TOPIX (Tokyo Stock Price Index) from Jan 1991 to Oct 2002

TOPIX (Tokyo Stock Price Index) from Jan 1991 to Oct 2002

Two significant features

Knot A point on a axis, where all polygonal lines are pass through Isolated data vector Parallel wefts Segments horizontally aligned between two axes Perfect linear relationship or mapping between two data vectors

Preparation

Assumption

No missing values and no ordered categorical data Normalisation

Matrix notations

Knot

Simplified condition for a knot to occur

Parallel wefts

TOPIX (Tokyo Stock Price Index) from Jan 1991 to Oct 2002

Textile plot

Visualisation for understanding data

Polygonal lines are aligned as horizontally as possible Any kind of data can be displayed Symbols for points displayed are carefully chosen Knot and Parallel wefts

Implemented on R DandDR (http://www.stat.math.keio.ac.jp/DandDIV/)

Add-on package for R Interface between DandD and R Receiving data and necessary information Creating a dad object on R

List object which consists of data and attributes Own plot method producing the textile plot

Further developments

Non-linear transformations Design enhancements

Using colour Line width and thickness

Dynamic or interactive display

Improving user interface Java Language

Thank you for your attention.

Reference

A. Inselberg, The plane with parallel coordinates, The Visual Computer 1 (1985) 69-91. E. Wegman, Hyperdimensional data analysis using parallel coordinates. Journal of The American Statistical Association 85 (1990) 664--675.

- Development of parallel algorithms in Data Field Haskell
- Tiled Parallel Coordinates for the Visualization of Time-Varying Multichannel EEG Data
- Algorithm Engineering of Parallel Algorithms and Parallel Data Structures
- Design, Implementation and Evaluation of ParaDict, a Data Parallel Library for Dictionaries
- Automatic optimization of parallel data
- Parallel coordinate descent methods for big data optimization
- Effect of Data Distribution in Parallel Mining of Associations 1999
- Abstract Parallel bulk-loading of spatial data
- Refinement of data parallel programs in Pei
- Efficient Data Parallel Implementations of Highly Irregular Problems

更多相关文章：
**
***Example* *Parallel* *coordinate* *plot* *of* *Iris* *data*_图文.pdf

*Example*: *Parallel* *coordinate* *plot* *of* *Iris* *data* *Iris* *Example*: *Parallel* *coordinate* *plot* *of* *Iris* *data* *Iris* Sepal.Length *Example*: *Parallel* *coordinate* *plot* *of* ...**
Day1.pdf
**

the continuation prompt is + >*plot*(cars,xlab=...2 cos(4θ) + (sin(π/12))5 X-*coordinate*:...*Example* *of* *Data* frame: Get to know *iris* *data* ...**
chap3_***data*_exploration_图文.ppt

how easy it is to understand the*data* *Example*:...#? Scatter *Plot* Array *of* *Iris* Attributes ? Tan...*coordinate* axis and the points are connected by ...**
***Iris* recognition border-crossing system in the UAE_图文.pdf

because the polar*coordinate* system is ...*plot* showing the trade-off between False Reject ...An unlimited number *of* *parallel* *Iris*Engines ensure...**
R An Open Source Statistical Environment_图文.ppt
**

*of* *Iris* versicolor Sepal.Width 2.0 2.5 3.0 3.5 ... *parallel* *plot* and co*plot* Given : depth Petal ... from the available *data* (for the *example* we ...**
Detection ***of* eye locations in unconstrained visual images_....pdf

appear in a gradient direction*plot* *of* the image...ow ?eld around the *iris*. to 0, which serve ...be spatially close in terms *of* the y *coordinate*...**
An anatomy ***of* *iris*code for precise phase representation.pdf

applied to an*iris* image in dimensionless polar *coordinate* system, I(ρ, ...applications can verify, 1-to-1 match, or identify from a small *data*base...**
Starting GNUPLOT GNUPLOT Quick Reference_图文.pdf
**

*iris*4d [8 24] kc_tek40xx km_tek40xx next ...*example* *of* *plotting* a 3-d *data* *example* *of* ...The x *coordinate* must be speci?ed. The order ...**
***Data* Set Editing by Ordered Projection.pdf

those based on axis-*parallel* classifiers are the...upper values for each *coordinate* *of* this rectangle...Results. *Data*Base *Iris* Breast Cancer Pima Diabetes...**
Visualization ***of* hidden node activity in neural networks I. ....pdf

*parallel* *coordinate* representation for the two ...For *example*, the *plots* for the ?rst class may...2 for the *Iris* *data* and the XOR *data*. If ...**
An Extension ***of* *Parallel* *Coordinate* *Plot* for Visualizing ....unkown

(Session CPS109) p.4811 An Extension*of* *Parallel* *Coordinate* *Plot* for Visualizing Aggregated Symbolic *Data* Yoshikazu YAMAMOTO 1,3 and Junji NAKANO 2 1 ...**
Devaney
Abstract.unkown
**

3D Glyph-based display*Parallel* *coordinate* *plot* PDF / PMF...This is an attribute rank *plot* *of* *iris* *data* based on the information gain...**
...Context Switching and Cognition in Dual-View ***Coordinated* ....unkown

{gconvert, jichen8, beyost, yryu, north}@vt.edu Abstract Multiple-view visualizations are useful for finding patterns in complex*data* sets, but little...**
Construction ***of* Line Densities for *Parallel* *Coordinate* *Plot*s1.unkown

University Fairfax, VA 22030 ABSTRACT The graphical representation*of* high dimensional *data* may be accomplished by using the *parallel* *coordinate* *plotting* system...**
Application ***Of* The *Parallel* *Coordinate* *Plot* For Ranking Com....unkown

[1] showed that a*parallel* *coordinate* *plot* [2] can be used for *data* ...*example* *of* actual POS *data* analysis, which is a visualization method for ...**
dissertation:
Magneto-optical.unkown
**

plane) to p (*parallel* to the incidence plane)....position *coordinate*, each Kerr transient *data* point...(7 MHz), a time delay Figure 4.2: *Plots* ...**
***Iris* Surface Deformation and Normalization.unkown

*Iris* Surface Deformation and Normalization Somying ...extension and compression *parallel* to the *iris* ...[16] where θ is a polar angle *coordinate* ...**
Models
Federico.unkown
**

page 49*Coordinate* value for the j-th land...Fingerprints, *iris* and hand geometry, for *example*...3.1. The typical *plot* *of* the labels as a ...**
***Parallel* *Coordinate* *Plot* for.unkown

*Parallel* *Coordinate* *Plot* for Original At**
Categorical ***parallel* *coordinate* *plot*.unkown

Categorical*parallel* *coordinate* *plot* Ret 更多相关标签：

the continuation prompt is + >

how easy it is to understand the

because the polar

appear in a gradient direction

applied to an

those based on axis-

(Session CPS109) p.4811 An Extension

3D Glyph-based display

{gconvert, jichen8, beyost, yryu, north}@vt.edu Abstract Multiple-view visualizations are useful for finding patterns in complex

University Fairfax, VA 22030 ABSTRACT The graphical representation

[1] showed that a

plane) to p (

page 49

Categorical