當(dāng)前位置：主頁 > 科技論文 > 網(wǎng)絡(luò)通信論文 >

基于性別預(yù)分類的年齡自動估計(jì)研究

發(fā)布時間：2018-01-05 12:00

本文關(guān)鍵詞：基于性別預(yù)分類的年齡自動估計(jì)研究　出處：《江蘇師范大學(xué)》2014年碩士論文　論文類型：學(xué)位論文

【摘要】：年齡估計(jì)技術(shù)是以計(jì)算機(jī)作為輔助工具,根據(jù)說話人語音,利用已設(shè)計(jì)好的年齡估計(jì)系統(tǒng)自動判別說話人所屬年齡段。該技術(shù)在信息檢索、人機(jī)通信、刑事偵查等領(lǐng)域都有重要應(yīng)用價值和廣泛的應(yīng)用前景。目前,在研究基于語音的年齡估計(jì)時主要采用單一語音特征或者單一分類器構(gòu)成的系統(tǒng)來處理多個年齡段分類任務(wù);雖然相關(guān)學(xué)者在特征提取和分類算法方面做了大量卓有成效的工作,但是對于語音年齡估計(jì)技術(shù)特征不穩(wěn)定、單層系統(tǒng)分類準(zhǔn)確率低等問題還沒有較好的解決方案,同時也缺乏標(biāo)準(zhǔn)評價平臺即公認(rèn)的年齡語音數(shù)據(jù)庫。針對這些問題,論文從建立年齡語音數(shù)據(jù)庫、提取特征到分類識別進(jìn)行了系統(tǒng)研究,取得以下成果。1、建立年齡語音數(shù)據(jù)庫以國際上語音語料庫設(shè)計(jì)標(biāo)準(zhǔn)為參照,考慮話者年齡、性別分布選擇。最終建立起一個包含三個年齡段、男女分布較均勻的年齡語音數(shù)據(jù)庫。另外,對每段語音標(biāo)注說話人的相關(guān)信息,如年齡、性別、錄制時間。此工作有利于擴(kuò)展語音數(shù)據(jù)庫功能,例如年齡標(biāo)注可用于年齡估計(jì),性別標(biāo)注可使數(shù)據(jù)庫用于性別分類。2、建立融合性別預(yù)分類的年齡估計(jì)系統(tǒng)目前年齡估計(jì)系統(tǒng)大多使用單一特征、單一分類器進(jìn)行分類,分類準(zhǔn)確率普遍較低。論文先進(jìn)行性別預(yù)分類,根據(jù)分層分類思想優(yōu)先判斷是否為兒童;然后在特定性別下進(jìn)行青壯年、老年的估計(jì)。結(jié)合每個子任務(wù)的特點(diǎn)選用多種特征和分類器,以提高系統(tǒng)最終的分類效果。3、提出基于改進(jìn)Citation-kNN算法的成人性別分類方法Citation-kNN算法多用于圖像處理,對其改進(jìn)并首先引入到成人性別分類研究中。提出了基于GMM的語音多示例包生成方法;對Citation-kNN算法的距離測度改進(jìn)進(jìn)行模式分類,簡化了系統(tǒng)訓(xùn)練方法。實(shí)驗(yàn)結(jié)果表明,改進(jìn)后的Citation-kNN算法應(yīng)用到說話人性別分類是可行的,系統(tǒng)的平均分類準(zhǔn)確率與傳統(tǒng)的算法相比略有提高。4、提出基于頻帶加權(quán)MFCC的年齡子類別估計(jì)系統(tǒng)語音信號經(jīng)離散傅里葉變換后得到的各個頻帶信息對年齡估計(jì)任務(wù)有不同的貢獻(xiàn)度,以頻帶能量為參數(shù),依據(jù)F-ratio準(zhǔn)則設(shè)計(jì)區(qū)分度函數(shù)來計(jì)算各個頻帶的整體貢獻(xiàn)度。計(jì)算MFCC特征時,在Mel濾波之后對各個濾波器輸出的頻帶能量按貢獻(xiàn)度不同進(jìn)行加權(quán),以強(qiáng)化或削弱相應(yīng)頻帶。基于性別信息的年齡子類別估計(jì)實(shí)驗(yàn)結(jié)果表明,改進(jìn)后的MFCC特征比傳統(tǒng)MFCC更能體現(xiàn)語音年齡信息。
[Abstract]:Age estimation technique is based on the computer as a tool, according to the speaker, the speaker is estimated the system automatically determine the age by age. The design has good technology in information retrieval, human-computer communication, the field of criminal investigation have important application value and broad application prospect. At present, in the study of age estimation based on speech time the system mainly adopts a single speech feature or a single classifier to deal with multiple age classification tasks; although some scholars have done a lot of very fruitful work in feature extraction and classification algorithm, but for the voice of age estimation of technical characteristics is not stable, single system low classification accuracy is not a better solution, but also the lack of the standard evaluation platform known as the age of speech database. To solve these problems, this paper from the establishment of age speech database, feature extraction To sign recognition system research, obtains the following results.1, establish the age speech database based on international standard design of speech corpus for reference, then consider the age and gender distribution. Finally set up a three age, men and women in uniform distribution age speech database. In addition, the relevant information. Each speech tagging speaker such as age, gender, recording time. This work is conducive to the expansion of speech database functions, such as tagging can be used for age estimation of age, sex can make the annotation database for gender classification.2, establish the integration of the gender age estimation of pre classification system at present age estimation systems mostly use single feature single classifier., the classification accuracy rate is generally low. The first sex pre classification, according to the classification of priority to determine whether the idea of children; then in the specific nature of don't The young, elderly estimation. According to the characteristics of each sub task feature and classifier selection, in order to improve the effect of the final.3 classification system, put forward the adult gender classification method improved Citation-kNN algorithm based on Citation-kNN algorithm for image processing, to improve and first introduced to study the classification of adult sex. The GMM voice the multi instance bag generation method based on distance measure; on the improvement of Citation-kNN algorithm for pattern classification, simplify the system training methods. The experimental results show that the improved Citation-kNN algorithm is applied to speaker gender classification is feasible, the average classification accuracy of system and the traditional algorithm is compared to a slight increase of.4, the age estimation task different age weighted MFCC tribute band sub categories of speech signal estimation system by discrete Fourier transform obtained after each frequency band based on information In order to offer degrees, frequency band energy parameters, according to the F-ratio criteria for the design of the discrimination function to calculate the overall contribution of each band. In the calculation of MFCC features, Mel filter after the band energy of the output of each filter according to the contribution of different weights to strengthen or weaken the corresponding frequency band. The gender information age estimation of the sub categories the results show that MFCC based on improved feature can reflect the information age speech more than traditional MFCC.

【學(xué)位授予單位】：江蘇師范大學(xué)
【學(xué)位級別】：碩士
【學(xué)位授予年份】：2014
【分類號】：TN912.3

【相似文獻(xiàn)】

相關(guān)期刊論文前10條

1 謝貴武;楊繼紅;肖勇;閔剛;;基于語音分段的自適應(yīng)時長調(diào)整算法[J];軍事通信技術(shù);2008年02期

2 樊建中;孫晴;楊永杰;;一種智能盲文學(xué)習(xí)機(jī)設(shè)計(jì)[J];現(xiàn)代電子技術(shù);2010年05期

3 溫洪昌;黃應(yīng)強(qiáng);傅貴興;;單片機(jī)的多段語音組合錄放系統(tǒng)設(shè)計(jì)[J];單片機(jī)與嵌入式系統(tǒng)應(yīng)用;2011年10期

4 張劍;袁華強(qiáng);;Rhetorical-State SVM在抽取式語音摘要中的應(yīng)用[J];科學(xué)技術(shù)與工程;2013年21期

5 盧堅(jiān) ,毛兵 ,孫正興 ,張福炎;一種改進(jìn)的基于說話者的語音分割算法[J];軟件學(xué)報(bào);2002年02期

6 章文義,朱杰;幾種無語音檢測噪音估計(jì)方法的比較研究[J];計(jì)算機(jī)工程與設(shè)計(jì);2003年10期

7 林鑫;陳樺;王開志;王繼成;;語音驅(qū)動唇形自動合成算法[J];計(jì)算機(jī)工程;2007年17期

8 蔡鐵;;基于在線單類支持向量機(jī)的自適應(yīng)語音活動檢測[J];深圳信息職業(yè)技術(shù)學(xué)院學(xué)報(bào);2008年02期

9 章釗;郭武;;話者識別中結(jié)合模型和能量的語音激活檢測算法[J];小型微型計(jì)算機(jī)系統(tǒng);2010年09期

10 朱淑琴,裘雪紅;一種精確檢測語音端點(diǎn)的方法[J];計(jì)算機(jī)仿真;2005年03期

相關(guān)會議論文前9條

1 田野;王作英;陸大金;;基于韻律結(jié)構(gòu)信息的非語音拒識[A];第六屆全國人機(jī)語音通訊學(xué)術(shù)會議論文集[C];2001年

2 徐明;胡瑞敏;黃云森;;基于音素識別的語音評價方法[A];第二屆和諧人機(jī)環(huán)境聯(lián)合學(xué)術(shù)會議(HHME2006)——第15屆中國多媒體學(xué)術(shù)會議(NCMT'06)論文集[C];2006年

3 王歡良;韓紀(jì)慶;李海峰;王承發(fā);;面向嵌入式應(yīng)用的小詞匯量語音串識別系統(tǒng)[A];第七屆全國人機(jī)語音通訊學(xué)術(shù)會議（NCMMSC7）論文集[C];2003年

4 那斯?fàn)柦ね聽栠d;吾守爾·斯拉木;麥麥提艾力;;維吾爾語大詞匯量連續(xù)語音識別研究——語音語料庫的建立[A];民族語言文字信息技術(shù)研究——第十一屆全國民族語言文字信息學(xué)術(shù)研討會論文集[C];2007年

5 簡志華;王向文;;考慮幀間信息的語音轉(zhuǎn)換算法[A];浙江省信號處理學(xué)會2012學(xué)術(shù)年會論文集[C];2012年

6 魏維;馬海燕;;一種丟失語音信包重建的新算法[A];通信理論與信號處理新進(jìn)展——2005年通信理論與信號處理年會論文集[C];2005年

7 陳凡;羅四維;;一個實(shí)用語音開發(fā)應(yīng)用系統(tǒng)的設(shè)計(jì)與實(shí)現(xiàn)[A];第二屆全國人機(jī)語音通訊學(xué)術(shù)會議論文集[C];1992年

8 劉紅星;戴蓓劏;陸偉;;基于圖像增強(qiáng)方法的共振峰諧波能量參數(shù)的語音和端點(diǎn)檢測[A];第九屆全國人機(jī)語音通訊學(xué)術(shù)會議論文集[C];2007年

9 林愛華;張文俊;王毅敏;;基于肌肉模型的語音驅(qū)動唇形動畫[A];第十三屆全國圖象圖形學(xué)學(xué)術(shù)會議論文集[C];2006年

相關(guān)重要報(bào)紙文章前5條

1 atvoc;數(shù)碼語音電路產(chǎn)品概述[N];電子資訊時報(bào);2008年

2 記者李山;德用雙音素改進(jìn)人工語音表達(dá)[N];科技日報(bào);2012年

3 中國科學(xué)院自動化研究所模式識別國家重點(diǎn)實(shí)驗(yàn)室于劍邋陶建華;個性化語音生成技術(shù)面面觀[N];計(jì)算機(jī)世界;2007年

4 江西林慧勇;語音合成芯片MSM6295及其應(yīng)用[N];電子報(bào);2006年

5 ;與“小超人”對話[N];中國計(jì)算機(jī)報(bào);2001年

相關(guān)博士學(xué)位論文前9條

1 陶冶;文本語音匹配的研究和應(yīng)用[D];山東大學(xué);2009年

2 何俊;聲紋身份識別中非常態(tài)語音應(yīng)對方法研究[D];華南理工大學(xué);2012年

3 李冬冬;基于拓展和聚類的情感魯棒說話人識別研究[D];浙江大學(xué);2008年

4 雙志偉;個性化語音生成研究[D];中國科學(xué)技術(shù)大學(xué);2011年

5 古今;語音感知認(rèn)證的關(guān)鍵技術(shù)研究[D];中國科學(xué)技術(shù)大學(xué);2009年

6 彭波;Internet上語音的魯棒性傳輸研究[D];華南理工大學(xué);2001年

7 黃湘松;基于混淆網(wǎng)絡(luò)的漢語語音檢索技術(shù)研究[D];哈爾濱工程大學(xué);2010年

8 應(yīng)娜;基于正弦語音模型的低比特率寬帶語音編碼算法的研究[D];吉林大學(xué);2006年

9 田立斌;語音通信質(zhì)量客觀評價、有效接收及錯誤恢復(fù)算法研究[D];華南理工大學(xué);2004年

相關(guān)碩士學(xué)位論文前10條

1 王明明;基于GMM和碼本映射相結(jié)合的語音轉(zhuǎn)換方法研究[D];西安建筑科技大學(xué);2015年

2 印雪晨;宋詞朗讀呼吸信號和韻律時長研究[D];西北民族大學(xué);2015年

3 邱一良;噪聲環(huán)境下的語音檢測方法研究[D];電子科技大學(xué);2015年

4 朱俊梅;基于性別預(yù)分類的年齡自動估計(jì)研究[D];江蘇師范大學(xué);2014年

5 周慧;基于PAD三維情緒模型的情感語音轉(zhuǎn)換與識別[D];西北師范大學(xué);2009年

6 李塵一;基于聯(lián)合得分的語音置信度評估系統(tǒng)的研究與設(shè)計(jì)[D];內(nèi)蒙古大學(xué);2006年

7 朱君波;PCA在語音檢測中的應(yīng)用研究[D];浙江工業(yè)大學(xué);2004年

8 陳宇超;廣播語音的分割與分類研究[D];北京郵電大學(xué);2009年

9 何明哲;語音片段檢索算法的研究與應(yīng)用[D];華南理工大學(xué);2012年

10 邸燕君;基于感知哈希的語音內(nèi)容認(rèn)證方法研究[D];蘭州理工大學(xué);2013年

，

本文編號：1383011

資料下載

論文發(fā)表

本文鏈接：http://sikaile.net/kejilunwen/wltx/1383011.html

上一篇：適用于超寬帶信號的多核稀疏字典構(gòu)造及應(yīng)用
下一篇：一種高動態(tài)衛(wèi)星網(wǎng)絡(luò)的擁塞控制算法

論文發(fā)表

·知網(wǎng)|萬方|維普|龍?jiān)磡省級|國家級|科技核心|北大核心|南大核心CSSCI|EI|SCI|SSCI|

天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

基于性別預(yù)分類的年齡自動估計(jì)研究