Abstract

We perform discriminant analysis together with principal component analysis on dialect and accent recognition. Since the data matrix exhibits high dimension low sample size feature, we calculate the principal components and the score matrix based on the dual space. Given the transformed score matrix, linear discriminant model does not fit the data well, while quadratic discriminant model, the superior model comparing to LDA, may fail sometimes when large number of principal components are required. Using the Gaussian radial basis function kernel, we calculate the kernel matrix and perform LDA directly on it. Comparing the LDA-PCA method, the in-sample prediction error rate of LDA reduces by more than 20% on average.

Publication Date

2013

Comments

This is a technical report

Note: imported from RIT's Digital Media Library running on DSpace to RIT Scholar Works on April 2014.

Document Type

Technical Report

Department, Program, or Center

The John D. Hromi Center for Quality and Applied Statistics (KGCOE)

Recommended Citation

Fokoue, Ernest and Ma, Zichen, "Modern Multivariate Methods for Accurate Dialect Classification" (2013). Technical Report,Accessed from
https://repository.rit.edu/article/1748

Campus

RIT – Main Campus

Download

COinS

Articles

Modern Multivariate Methods for Accurate Dialect Classification

Abstract

Publication Date

Comments

Document Type

Department, Program, or Center

Recommended Citation

Campus

Search

Browse

Author Corner

RIT Links

Articles

Modern Multivariate Methods for Accurate Dialect Classification

Authors

Abstract

Publication Date

Comments

Document Type

Department, Program, or Center

Recommended Citation

Campus

Share

Search

Browse

Author Corner

RIT Links