首页 > 过刊浏览 > 文章详情

时域精细结构与包络信息在噪声下声调识别中的作用

The Contributions of Temporal Fine Structure and Envelope Information for Lexical Tone Identification in Noise

亓贝尔;刘佳星;古鑫;刘博;

1:首都医科大学附属北京同仁医院北京市耳鼻咽喉科研究所耳鼻咽喉头颈科学教育部重点实验室(首都医科大学)

2:首都医科大学附属北京朝阳医院

在线阅读

摘要

目的探讨时域精细结构信息(temporal fine structure, TFS)和时域包络信息(envelope, Env)在噪声下声调识别中的作用。方法自行编制噪声下声调识别能力测试材料,对20例年龄19～30岁、母语为汉语普通话的听力正常人进行五种信噪比(SNR)条件下(SNR分别为-18、-12、-6、0、+6 dB)的声调识别能力测试,采用广义线性模型(generalized linear model, GLM)对所得数据进行统计分析。结果 (1)噪声环境下的声调识别与TFS信息和Env信息均相关,两者协同作用更有助于提高噪声下的声调识别能力,在言语谱噪声(speech spectrum-shaped noise, SSN)条件下Env、TFS以及二者协同作用与声调识别成绩的回归系数分别为0.095(P<0.000 1)、0.070(P<0.000 1)和-0.002(P<0.000 1);在两人谈话噪声(two-talker babble, TTB)条件下Env、TFS以及二者协同作用与声调识别成绩回归系数分别为0.052(P<0.000 1)、0.073(P<0.000 1)和-0.000 3(P=0.13)。(2)当TFS信息和Env信息量相等时,改善信噪比有利于提高声调识别成绩;SNR_(TFS)和SNR_(Env)相等时,SSN噪声下五种信噪比时声调识别平均正确率分别为27.6%、60.2%、82.1%、 93.9%和94.7%;TTB噪声下五种信噪比时声调识别平均正确率分别为53.5%、72.0%、86.4%、92.7%和95.0%。结论时域精细结构信息和时域包络信息对于听力正常人进行噪声下声调识别具有同等作用,两者协同作用更有助于提高噪声下的声调识别能力。

关键词

声调;时域精细结构;时域包络

基金项目(Foundation):

北京市卫生系统高层次卫生技术人才培养计划（2015-3-019）

作者

亓贝尔;刘佳星;古鑫;刘博;

参考文献

1 Qi B,Liu P,Gu X,et al .Characterization of lexical tone perception in native Mandarin speakers with sensorineural hearing loss[J].Acta Otolaryngol,2018,138(9):801-806.

2 亓贝尔，刘鹏，傅新星，等.听力正常人汉语普通话声调知觉特征研究[J].临床耳鼻咽喉头颈外科杂志，2016,30(19):1507-1511.

3 Smith ZM,Delgutte B,Oxenham AJ.Chimaeric sounds reveal dichotomies in auditory perception[J].Nature,2002,416(6876):87-90.

4 Xu L,Pfingst BE.Relative importance of temporal envelope and fine structure in lexical-tone perception[J].J Acoust Soc Am,2003,114(6 Pt 1):3024-3027.

5 Kong YY,Zeng FG.Temporal and spectral cues in Mandarin tone recognition[J].J Acoust Soc Am,2006,120(5 Pt 1):2830-2840.

6 Apoux F,Yoho SE,Youngdahl CL,et al.Role and relative contribution of temporal envelope and fine structure cues in sentence recognition by normal-hearing listeners[J].J Acoust Soc Am,2013,134(3):2205-2012.

7 Rosen S.Temporal information in speech:Acoustic,auditory and linguistic aspects[J].Philos Trans R Soc Lond B Biol Sci,1992,336(1278):367-373.

8 Füllgrabe C,Berthommier F,Lorenzi C.Masking release for consonant features in temporally fluctuating background noise[J].Hear Res,2006,211(1-2):74-84.

9 Moore BC.The role of temporal fine structure processing in pitch perception,masking,and speech perception for normal-hearing and hearing-impaired people[J].J Assoc Res Otolaryngol,2008,9(4):399-406.

10 Rosen S,Souza P,Ekelund C,et al.Listening to speech in a background of other talkers:effects of talker number and noise vocoding[J].J Acoust Soc Am,2013,133(4):2431-2443.

11 Moon IJ,Won JH,Park MH,et al.Optimal combination of neural temporal envelope and fine structure cues to explain speech identification in background noise[J].J Neurosci,2014,34(36),12145-12154.

12 Hopkins K,Moore BCJ.The contribution of temporal fine structure to the intelligibility of speech in steady and modulated noise[J].J Acoust Soc Am,2009,125(1):442-446.

13 Apoux F,Healy EW.A glimpsing account of the role of temporal fine structure information in speech recognition[J].Adv Exp Med Biol,2013,787:119-126.

14 Miller RE,Gibbs BE,Fogerty D.Glimpsing speech interrupted by speech-modulated noise[J].J Acoust Soc Am,2018,143(5):3058-3067.

15 Fogerty D,Xu J,Gibbs BE 2nd.Modulation masking and glimpsing of natural and vocoded speech during single-talker modulated noise:effect of the modulation spectrumBE[J].J Acoust Soc Am,2016,140(3):1800-1816.

16 Wang S,Xu L,Mannell R.Relative contributions of temporal fine structure and envelope cues to lexical tone recognition in hearing-Impaired listeners[J].J Assoc Res Otolaryngol,2011,12(6):783-794.

17 Johannesen PT,Pérez-González P,Kalluri S,et al.The influence of cochlear mechanical dysfunction,Temporal processing deficits,and age on the intelligibility of audible speech in noise for hearing-impaired listeners[J].Trends Hear,2016,20:2331216516641055.

本文信息

PDF(510K)

本文关键词相关文章

声调时域精细结构时域包络

本文作者相关文章

亓贝尔刘佳星古鑫刘博