GRBAS和CAPE-V量表中文听感知评估一致性分析

The Intra-rater and Inter-rater Consistency of GRBAS and CAPE-V in Chinese Context

刘阳;刘恒鑫;屈歌;李瑞香;黄冬雁

1:中国人民解放军总医院第六医学中心/耳鼻咽喉头颈外科医学部/国家耳鼻咽喉疾病临床医学研究中心

2:国家儿童医学中心/首都医科大学附属北京儿童医院耳鼻咽喉头颈外科/儿童耳鼻咽喉头颈外科疾病北京市重点实验室

3:西安音乐学院嗓音研究中心

4:北京语言大学语言康复学院

摘要
目的 评估在中文语境下GRBAS和CAPE-V的评估者内和评估者间一致性。方法 5名嗓音相关专业人员使用《发音评估助手》APP对抽取自解放军总医院第六医学中心病理嗓音数据库V1.0的52例语音样本进行GRBAS和CAPE-V评估,52例用于评估者间一致性分析,其中的38例用于评估者内部一致性评估。使用ICC对CAPE-V进行评估者内和评估者间一致性分析,采用Cohen's Kappa方法对GRBAS的5个特征进行评估者内一致性分析,采用Fleiss Kappa方法对GRBAS评估者间一致性进行分析,并对两个量表中相同的特征进行Spearman相关性分析。结果 评估者内一致性分析结果示,CAPE-V 6个特征均呈现出高的一致性,各特征的ICC系数分别为总体音质(OS)=0.80,粗糙度(R)=0.69,气息音(B)=0.77,紧张度(S)=0.75,音调异常程度(P)=0.74,响度异常程度(L)=0.78;GRBAS除特征G(0.48)和S(0.45)存在较弱的评估者内一致性外,其余特征的评估者内一致性均较差。评估者间一致性分析结果示,CAPE-V各特征的ICC相关系数均大于0.85,提示不同评估者对各特征的评分均具有很高的一致性;GRBAS除G特征(相关系数=0.48)外,其余特征的相关系数均小于0.40,说明除特征G外,其余特征的评估者间一致性均较差。在同时使用GRBAS和CAPE-V对同一样本进行评估时,总体音质(OS/G)、粗糙度(R)、气息音(B)、紧张度(S)的Spearman相关系数分别为0.89、0.85、0.91、0.91,提示两个量表中的这4个特征的评分结果具有高度相关性。结论 在中文语音样本的听感知评估中,CAPE-V量表的评估者内和评估者间一致性更高,较GRBAS量表更适用于临床。
关键词
GRBAS评估;共识听觉-知觉嗓音评估;评估者内部一致性;评估者间一致性;中文语境
基金项目(Foundation):
北京市自然科学基金(7232170)
作者
刘阳;刘恒鑫;屈歌;李瑞香;黄冬雁
参考文献

[1] ?ZCEBE E,AYDINLI F E,TI■,et al.Reliability and validity of the turkish version of the consensus auditory-perceptual evaluation of voice (CAPE-V)[J].Journal of Voice,2019,33(3):382.e1-382.e10.

[2] KEMPSTER G B,GERRATT B R,VERDOLINI ABBOTT K,et al.Consensus auditory-perceptual evaluation of voice:development of a standardized clinical protocol[J].American Journal of Speech-Language Pathology,2009,18(2):124-132.

[3] CARDING P N,WILSON J A,MACKENZIE K,et al.Measuring voice outcomes:state of the science review[J].The Journal of Laryngology and Otology,2009,123(8):823-829.

[4] BARSTIES B,BODT M D.Assessment of voice quality:current state-of-the-art[J].Auris Nasus Larynx,2015,42(3):183-188.

[5] NEMR K,SIM?ES-ZENARI M,CORDEIRO G F,et al.GRBAS and CAPE-V scales:high reliability and consensus when applied at different times[J].Journal of Voice,2012,26(6):812.e17-812.e22.

[6] EADIE T L,BAYLOR C R.The effect of perceptual training on inexperienced listeners' judgments of dysphonic voice[J].Journal of Voice,2006,20(4):527-544.

[7] WALDEN P R.Perceptual voice qualities database (PVQD):database characteristics[J].Journal of Voice,2020:S0892199720303751.DOI:10.1016/j.jvoice.2020.10.001.

[8] KREIMAN J,GERRATT B R.Sources of listener disagreement in voice quality assessment[J].The Journal of the Acoustical Society of America,2000,108(4):1867-1876.

[9] OATES J.Auditory-perceptual evaluation of disordered voice quality:pros,cons and future directions[J].Folia Phoniatrica et Logopaedica,2009,61(1):49-56.

[10] WEBB A L,CARDING P N,DEARY I J,et al.The reliability of three perceptual evaluation scales for dysphonia[J].Eur Arch Otorhinolaryngol,2004,261:429-434.

[11] KARNEL M P,MELTON S D,CHILDES J M,et al.Reliability of clinician-based (GRBAS and CAPE-V) and patient-based (V-RQOL and IPVI) documentation of voice disorders[J].Journal of Voice,2007,21(5):576-590.

[12] VAZ FREITAS S,PESTANA P M,ALMEIDA V,et al.Audio-perceptual evaluation of portuguese voice disorders—an inter- and intrajudge reliability study[J].Journal of Voice,2014,28(2):210-215.

[13] 章薇,屈季宁,崔前波.病态嗓音主观评价与声学分析的相关性研究[J].听力学及言语疾病杂志,2012,20(6):544-546.

[14] 梁万顺,王文华,王园园,等.音乐疗法在发声困难儿童康复过程中的应用研究[J].现代医药卫生,2022,38(23):4029-4032.

[15] SCHOBER P,MASCHA E J,VETTER T R.Statistics from A (agreement) to Z (z score):a guide to interpreting common measures of association,agreement,diagnostic accuracy,effect size,heterogeneity,and reliability in medical research[J].Anesthesia and Analgesia,2021,133(6):1633-1641.

[16] CHEN Z,FANG R,ZHANG Y,et al.The Mandarin version of the consensus auditory-perceptual evaluation of voice (CAPE-V) and its reliability[J].Journal of Speech,Language,and Hearing Research,2018,61(10):2451-2457.