声带息肉患者持续元音及连贯言语声的倒频谱声学分析

A Cepstral Analysis of Sustained Vowels and Continuous Speech in Patients with Vocal Polyps

余明强;周莉;徐新林;潘晗;庄佩耘;

1:厦门大学附属中山医院耳鼻咽喉科

2:厦门大学

摘要
目的探讨倒频谱声学分析法与连贯言语声学信号用于鉴别病理性声信号的价值。方法分别采集26例成人声带息肉患者(息肉组,男10例,女16例)及27例正常嗓音者(正常组,男13例,女14例)持续元音和连贯言语声信号,采用MDVP(multi dimensional voice program)软件分析各组持续元音频率微扰(jitter)和振幅微扰(shimmer),采用ADSV(analysis of dysphonia in speech and voice)软件分析各组持续元音和连贯言语的倒频谱参数:倒频谱峰值突出(cepstral peak prominence,CPP)、低高频谱能量比(the mean ratio of singnal energy below 4000Hz to the energy above 4 000Hz,L/HSR)、CPP的标准差(STD CPP)、L/HSR的标准差(STD L/HSR)及发音障碍倒频谱指数(the cepstral/spectral index of dysphonia,CSID),分析扰动参数和倒频谱参数对鉴别病理声学信号的敏感性。结果正常组持续元音的jitter和shimmer值均小于声带息肉组(P<0.05);除STD L/HSR外,正常组持续元音的倒频谱参数值均高于息肉组(P<0.05);连贯言语的倒频谱参数中,男性声带息肉组的CPP、L/HSR均低于男性正常组(P<0.05),女性声带息肉组CPP值明显低于女性正常组(P<0.05)。男女性持续元音声信号的倒频谱参数CPP和CSID在ROC曲线下的面积与参考值0.5相比,差异有统计学意义(P<0.05);男性连贯言语声的CPP及L/HSR、女性CPP ROC曲线下的面积与参考值0.5的差异有统计学意义(P<0.05)。结论连贯言语声和持续元音的扰动参数和倒频谱参数均可用于区别正常与声带息肉患者的噪音声学信号,倒频谱参数CPP对区别正常和声带息肉患者嗓音信号有较好的特异度和灵敏度。
关键词
倒频谱峰值突出;持续元音;连贯言语;声带息肉
基金项目(Foundation):
国家自然科学基金(NSFC81371080);; 福建省卫生系统中青年骨干人才培养项目(2013-ZQN-JC-35)联合资助
作者
余明强;周莉;徐新林;潘晗;庄佩耘;
参考文献

1 Titze IR,Liang H.Comparison of F0extraction method for high-precision voice perturbation measurements[J].J Speech Hear Res,1993,36:1120.

2 Packard NH,Crutchfield JP,Farmer JD,et al.Geometry from a time series[J].Phys Rev Lett,1980,45:712.

3 Awan SN,Roy N.Toward the development of an objective index of dysphonia sverith:a four-factor acoustic model[J].Clin linguist phon,2006,20:35.

4 Awan SN,Roy N,Jette ME,et al.Quantifying dysphonia severity using a spectral/cepstral-based acoustic index:comparisons with auditory-perceptual judgements from the CAPEV[J].Clin Linguist Phon,2010,24:742.

5 Awan SN,Roy N.Outcomes measurement in voice disorders:application of an acoustic index of dysphonia severity[J].J Speech Lang Hear Res,2009,52:482.

6 Mors C.Vowel-and text-based cepstral analysis of chronic hoarness[J].Journal of Voice,2012,26:416.

7 Lowell SY.The acoustic cssessment of voice in continuous speech[J].Perspectives on Voice and Voice Disorders,2012,22:57.

8 王刚,于萍,徐文,等.嗓音主观听感知评估稳定性的研究[J].中华耳鼻咽喉头颈外科杂志,2011,46:485.

9 李进让,孙雁雁,徐文,等.嗓音障碍主观听感知评估中标准化朗读文本的设计[J].中华耳鼻咽喉头颈外科杂志,2010,45:719.

10 赵逸,王伟,郑宏良,等,嗓音障碍听感知评估汉语普通话朗读文本的设计[J],听力学及言语疾病杂志,2014,22:130.

11 Lowell SY,Colton RH,Kelley RT,et al.Spectral-and cepstral-based measures during continuous speech:capacity to distinguish dysphonia and consistency within a speaker[J].Journal of Voice,2011,25:223.

12 韩德民,Sataloff RT.嗓音医学[M].北京:人民卫生出版社,2007.132~136.

13 Hillenbrand JM.A methodological study of perturbation and additive noise in synthetically generated voice signals[J].J Speech Hear Res,1987,112:324.

14 Watts CR,Awan SN.Use of spectral/cepstral analyses for differentiating normal from hypofunctional voices in sustained vowel and continuous speech contexts[J].Journal of Speech,Language,and Hearing Research,2011,54:1523.

15 Adrian F,张家騄.嗓音质量评价与测量(2)[J].听力学及言语疾病杂志,2008,16:439.

16 Heman-Acka YD,Michael DD,Goding GS.The relationship between cepstral peak prominence and selected parameters of dysphonia[J].Journal of Voice,2002,16:20.

17 Zhang Y,Jiang JJ.Nonlinear dynamic analysis in signal typing of pathological human voices[J].Electronics Letters,2003,39:1021.

18 Balasubramanium RK,Bhat JS,Fahim S,et al.Cepstral analysis of voice in unilateral adductor vocal fold palsy[J].J Voice,2011,25:326.

19 余明强,徐新林,张赛,等.非线性动力学方法在分析声带息肉、囊肿患者嗓音信号中的应用[J].听力学及言语疾病杂志,2013,21:244.