Abstract
Modeling the second-order statistics of articulatory trajectories is likely to improve the performance in classifying phone segments compared to using only linear combinations of MFCCs. Nevertheless, the extremely high dimensionality of the feature space spanned by a combination of monomials of degree-1 and degree-2 makes it difficult to effectively exploit the discriminative information in the full covariance matrix. This paper proposes a novel algorithm, dubbed Knowledge-based Quadratic Discriminant Analysis (KnQDA), for reducing the number of dimensions of the space spanned by degree-1 and degree-2 monomials by using phonetic knowledge for selecting the set of degree-2 monomials that are most likely to improve classification. KnQDA seeks a trade-off between overfitting and undertraining, which further improves the learnability. Binary classifications on all pairs of phones in TIMIT show the effectiveness of the proposed method, especially on those phone pairs that overlap strongly in the linear feature space.
| Original language | English |
|---|---|
| Title of host publication | 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012 - Proceedings |
| Publisher | IEEE |
| Pages | 4145-4148 |
| Number of pages | 4 |
| ISBN (Electronic) | 9781467300445 |
| ISBN (Print) | 9781467300469 |
| DOIs | |
| Publication status | Published - 25 Mar 2012 |
| Event | 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012 - Kyoto, Japan Duration: 25 Mar 2012 → 30 Mar 2012 https://doi.org/10.1109/ICASSP15465.2012 (Conference proceeding) |
Publication series
| Name | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings |
|---|---|
| ISSN (Print) | 1520-6149 |
Conference
| Conference | 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012 |
|---|---|
| Abbreviated title | ICASSP 2012 |
| Country/Territory | Japan |
| City | Kyoto |
| Period | 25/03/12 → 30/03/12 |
| Internet address |
|
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 9 Industry, Innovation, and Infrastructure
User-Defined Keywords
- Dimensionality Reduction
- Knowledge-Based Quadratic Discriminant Analysis
- Phone Classification
- TIMIT
Fingerprint
Dive into the research topics of 'Knowledge-based quadratic discriminant analysis for phonetic classification'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver