Skip to main navigation Skip to search Skip to main content

Knowledge-based quadratic discriminant analysis for phonetic classification

  • Heyun Huang*
  • , Yang Liu
  • , Louis Ten Bosch
  • , Bert Cranen
  • , Lou Boves
  • *Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

4 Citations (Scopus)

Abstract

Modeling the second-order statistics of articulatory trajectories is likely to improve the performance in classifying phone segments compared to using only linear combinations of MFCCs. Nevertheless, the extremely high dimensionality of the feature space spanned by a combination of monomials of degree-1 and degree-2 makes it difficult to effectively exploit the discriminative information in the full covariance matrix. This paper proposes a novel algorithm, dubbed Knowledge-based Quadratic Discriminant Analysis (KnQDA), for reducing the number of dimensions of the space spanned by degree-1 and degree-2 monomials by using phonetic knowledge for selecting the set of degree-2 monomials that are most likely to improve classification. KnQDA seeks a trade-off between overfitting and undertraining, which further improves the learnability. Binary classifications on all pairs of phones in TIMIT show the effectiveness of the proposed method, especially on those phone pairs that overlap strongly in the linear feature space.

Original languageEnglish
Title of host publication2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012 - Proceedings
PublisherIEEE
Pages4145-4148
Number of pages4
ISBN (Electronic)9781467300445
ISBN (Print)9781467300469
DOIs
Publication statusPublished - 25 Mar 2012
Event2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012 - Kyoto, Japan
Duration: 25 Mar 201230 Mar 2012
https://doi.org/10.1109/ICASSP15465.2012 (Conference proceeding)

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (Print)1520-6149

Conference

Conference2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012
Abbreviated titleICASSP 2012
Country/TerritoryJapan
CityKyoto
Period25/03/1230/03/12
Internet address

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 9 - Industry, Innovation, and Infrastructure
    SDG 9 Industry, Innovation, and Infrastructure

User-Defined Keywords

  • Dimensionality Reduction
  • Knowledge-Based Quadratic Discriminant Analysis
  • Phone Classification
  • TIMIT

Fingerprint

Dive into the research topics of 'Knowledge-based quadratic discriminant analysis for phonetic classification'. Together they form a unique fingerprint.

Cite this