Nagoya Institute of Technology Repository System

HOME        Japanese    library    university    Feedback

Nagoya Institute of Technology Repository System >
Nagare College >
Academic Paper - Nagare College >

 
Title :FULL COVARIANCE STATE DURATION MODELING FOR HMM-BASED SPEECH SYNTHESIS
Authors :Lu, Heng
Wu, Yi Jian
Dai, Li Rong
Wang, Ren hua
Tokuda, Keiichi
Authors alternative :徳田, 恵一
Issue Date :24-Apr-2009
Abstract :This paper proposes a state duration modeling method using full covariance matrix for HMM-based speech synthesis. In this method, a full covariance matrix instead of the conventional diagonal covariance matrix is adopted in the multi-dimensional Gaussian distribution to model the state duration of each context-dependent phoneme. At synthesis stage, the state durations are predicted using the clustered context-dependent distributions with full covariance matrices. Experimental results show that the synthesized speech using full-covariance state duration models is more natural than the conventional method when we change the speaking rate of synthesized speech.
Type Local :会議発表論文
ISBN :9781424423545
DOI :10.1109/ICASSP.2009.4960513
Publisher :Institute of Electrical and Electronics Engineers
URI :http://repo.lib.nitech.ac.jp/handle/123456789/2205
Citation :ICASSP 2009. IEEE International Conference on Acoustics, Speech and Signal Processing, 2009. p.4033 -4036
Appears in Collections:Academic Paper - Nagare College

Files in This Item:

File Description SizeFormat
ICASSP2009_lu_heng.pdf202KbAdobe PDFView/Open