Visualization of high dimensional data using an automated 3D star co-ordinate system

Abstract

This paper presents a 3D star coordinate-based visualization technique for exploratory data analysis. To improve the data visualization and reveal the hidden patterns in complex high dimensional data sets, first the 2D star coordinate system is extended to the 3D star coordinate system. An autonomous procedure is defined to find the best configuration for the 3D star coordinate system based on cluster validation measures. To illustrate the efficacy of the proposed techniques, empirical analysis were conducted on a number of synthetic (Five dimensional Gaussian distribution with three classes) and real (Fisher's IRIS, Leukemia, Gastric cancer and Petroleum datasets) databases. Empirical analyses shows that automated 3D star coordinate system helps in better visualization of the complex high dimensional data when compared to 2D star coordinate system and also other projection-based visualization techniques. Also the automated configuration for 3D star coordinate system reveals the hidden patterns in the complex datasets without human intervention. © 2006 IEEE.

Publication Title

IEEE International Conference on Neural Networks - Conference Proceedings

Share

COinS