Abstract
In this paper, we present a sparse linear discriminant analysis (LDA) algorithm for high-dimensional objects in subspaces. In high dimensional data, groups of objects often exist in subspaces rather than in the entire space. For example, in text data classification, groups of documents of different types are categorized by different subsets of terms. The terms for one group may not occur in the samples of other groups. In the new algorithm, we consider a LDA to calculate a weight for each dimension and use the weight values to identify the subsets of important dimensions in the discriminant vectors that categorize different groups. This is achieved by including the weight sparsity term in the objective function that is minimized in the LDA. We develop an iterative algorithm for computing such sparse and orthogonal vectors in the LDA. Experiments on real data sets have shown that the new algorithm can generate better classification results and identify relevant dimensions.
Original language | English |
---|---|
Pages (from-to) | 223-235 |
Number of pages | 13 |
Journal | Numerical Linear Algebra with Applications |
Volume | 18 |
Issue number | 2 |
DOIs | |
Publication status | Published - Mar 2011 |
Scopus Subject Areas
- Algebra and Number Theory
- Applied Mathematics
User-Defined Keywords
- High-dimensional data
- Linear discriminant analysis
- Sparsity
- Weighting