Application of LSA space's dimension character in document multi-hierarchy clustering
Abstract
In LSA space, dimensions corresponding to bigger singular values reflect the general concept of language elements, while dimensions corresponding to smaller singular values reflect particular concept of language elements. On this basis, different dimensions of LSA space are adopted for document clustering under various concept granularities. In addition, in the LSA-based algorithm of document clustering, better clustering results are obtained by taking the row vectors of document self-indexing matrix as the objects to be clustered, instead of the document vectors with low dimensionality. © 2005 IEEE.
Publication Title
2005 International Conference on Machine Learning and Cybernetics, ICMLC 2005
Recommended Citation
Liu, Y., Qi, H., Hu, X., Cai, Z., Dai, J., & Zhu, L. (2005). Application of LSA space's dimension character in document multi-hierarchy clustering. 2005 International Conference on Machine Learning and Cybernetics, ICMLC 2005, 2384-2389. Retrieved from https://digitalcommons.memphis.edu/facpubs/7398