This paper introduces an automatic procedure to assist on the interpretation of a large dataset when a similarity metric is available.
We propose a visualization approach based on a graph layout method- ology that uses a Quadratic Assignment Problem (QAP) formulation.
The methodology is presented using as testbed a time series dataset of the Standard & Poor’s 100, one the leading stock market indicators in the United States. A weighted graph is created with the stocks repre- sented by the nodes and the edges’ weights are related to the correlation between the stocks’ time series. A heuristic for clustering is then pro- posed; it is based on the graph partition into disconnected subgraphs allowing the identification of clusters of highly-correlated stocks. The final layout corresponds well with the perceived market notion of the different industrial sectors. We compare the output of this procedure with a traditional dendogram approach of hierarchical clustering
Notas
IFIP International Conference on Artificial Intelligence in Theory and Practice - Knowledge Acquisition and Data Mining
Información general
Fecha de exposición:agosto 2006
Fecha de publicación:agosto 2006
Idioma del documento:Inglés
Evento:19 th IFIP World Computer Congress - WCC 2006
Institución de origen:Red de Universidades con Carreras en Informática (RedUNCI)
Excepto donde se diga explícitamente, este item se publica bajo la siguiente licencia Creative Commons Attribution-NonCommercial-ShareAlike 2.5 Argentina (CC BY-NC-SA 2.5)