Cepstral analysis of speech signals in the process of automatic pathological voice assessment

Anna Samborska-Owczarek

doi:10.17951/ai.2007.6.1.117-126

Cepstral analysis of speech signals in the process of automatic pathological voice assessment

Anna Samborska-Owczarek

Abstract

The paper describes the problem of cepstral speech analysis in the process of automated voicedisorder probability estimation. The author proposes to derive two of the most diagnosticallysignificant voice features: quality of harmonic structure and degree of subharmonic from cepstrumof speech signal. Traditionally, these attributes are estimated auricularly or by spectrum (orspectrogram) observation, hence this analysis often lacks accuracy and objectivity. The introducedparameters were calculated for the recordings from Disordered Voice Database (Kay, model 4337version 2.7.0) which consists of 710 voice samples (657 pathological, 53 healthy) recorded in thelaboratory environment and described with diagnosis and a number of additional attributes (suchas age, sex, nationality).The proposed cepstral voice features were compared to similar voice parameters derived fromMultidimensional Voice Program (Kay, model 5105 version 2.7.0) in respect to their diagnosticsignificance and presented graphically. The results show that cepstral features are more correlatedwith decision and better discriminate clusters of healthy and disordered voices. Additionally, bothparameters are obtained by single cepstral transform and do not require to perform F0 trackingearlier as it is derived simultaneously.

Full Text:

PDF

DOI: http://dx.doi.org/10.17951/ai.2007.6.1.117-126
Date of publication: 2015-01-04 00:00:00
Date of submission: 2016-04-27 10:20:03

Statistics

Total abstract view - 1182

Downloads (from 2020-06-17) - PDF - 0

Indicators

Refbacks

There are currently no refbacks.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Username
Password
Remember me