天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當前位置:主頁 > 科技論文 > 信息工程論文 >

語音驅(qū)動三維唇形動畫算法研究

發(fā)布時間:2018-04-05 22:15

  本文選題:語音驅(qū)動 切入點:三維動畫 出處:《北京理工大學(xué)》2016年碩士論文


【摘要】:語音驅(qū)動三維唇形動畫算法屬于語音信號處理與三維動畫技術(shù)交叉范疇,可應(yīng)用于各種需要語音與唇形同步的三維動畫領(lǐng)域,如三維動畫電影或視頻、3D游戲、虛擬主播、教學(xué)視頻等。目前國內(nèi)外關(guān)于語音驅(qū)動唇形動畫的研究較少,進行唇形動畫制作時多以人工制作為主,費時費力,因此研究語音驅(qū)動三維唇形動畫算法具有一定的社會意義與應(yīng)用價值。在語音驅(qū)動三維唇形動畫算法中,語音到唇形的映射直接影響到唇形動畫的真實感。在現(xiàn)有的語音驅(qū)動唇形動畫算法中,主要存在以下難點和問題:(1)不同語言間音素的發(fā)音規(guī)律有所不同,難以與唇形形成統(tǒng)一的映射關(guān)系;(2)使用BP神經(jīng)網(wǎng)絡(luò)進行語音特征參數(shù)到唇形的映射,通常速度和精度高度受限于訓(xùn)練樣本數(shù)量和網(wǎng)絡(luò)結(jié)構(gòu);(3)三維人臉模型的格式多種多樣,沒有統(tǒng)一的唇形動畫標準,通用性存在不足。本文針對上述問題,在現(xiàn)有的語音驅(qū)動唇形動畫算法基礎(chǔ)上,做了如下改進工作:首先,分析了漢語普通話和英語的發(fā)音規(guī)律,嘗試用國際音標將兩種語言的發(fā)音規(guī)律統(tǒng)一起來,并以此為依據(jù)錄制了訓(xùn)練語音庫。其次,嘗試適用高斯混合模型算法和基于有向無環(huán)圖的支持向量機多分類算法(DAG-SVM)代替神經(jīng)網(wǎng)絡(luò)進行音素分類,并對DAG-SVM進行了改進。最后,利用DirectX中的三維網(wǎng)格漸變動畫技術(shù)實現(xiàn)了通用性強且具有真實感的三維人臉唇形動畫,并與分類算法相結(jié)合,編寫了圖形界面。實驗結(jié)果表明本文提出的算法性能較好,能達到預(yù)期要求。
[Abstract]:Speech driven 3D lip animation algorithm belongs to the cross category of speech signal processing and 3D animation technology. It can be used in various 3D animation fields, such as 3D animation movies or video games, virtual anchors, etc.Teaching videos, etc.At present, there are few researches on speech driven lip animation at home and abroad. Most of the lip animation is made manually, which is time-consuming and laborious. Therefore, the study of speech driven three-dimensional lip animation algorithm has certain social significance and application value.In the speech driven 3D lip animation algorithm, the mapping of speech to lip shape directly affects the reality of lip animation.In the existing speech driven lip animation algorithms, there are mainly the following difficulties and problems: 1) the phoneme sounds differently among different languages.It is difficult to form a unified mapping relationship with lip shape.) BP neural network is used to map speech feature parameters to lip shape. Usually, the speed and accuracy are highly limited by the number of training samples and network structure.There is no uniform standard for lip animation, and there is a lack of generality.In order to solve the above problems, based on the existing speech driven lip animation algorithms, this paper makes the following improvements: firstly, it analyzes the pronunciation rules of Mandarin and English.This paper attempts to unify the pronunciation rules of the two languages with the International phonetic Alphabet and record the training corpus on the basis of it.Secondly, we try to use Gao Si hybrid model algorithm and support vector machine multi-classification algorithm based on directed acyclic graph (SVM) instead of neural network to classify phoneme, and improve DAG-SVM.Finally, the 3D facial lip animation with strong generality and realistic sense is realized by using the technology of 3D mesh gradual animation in DirectX, and the graphical interface is compiled by combining with the classification algorithm.The experimental results show that the proposed algorithm has good performance and can meet the expected requirements.
【學(xué)位授予單位】:北京理工大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2016
【分類號】:TP391.41;TN912.3
,

本文編號:1716712

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/kejilunwen/xinxigongchenglunwen/1716712.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶319e3***提供,本站僅收錄摘要或目錄,作者需要刪除請E-mail郵箱bigeng88@qq.com