|
Nagoya Institute of Technology Repository System >
Nagare College >
Academic Paper - Nagare College >
| |
| Title | : | FULL COVARIANCE STATE DURATION MODELING FOR HMM-BASED SPEECH SYNTHESIS |
| Authors | : | Lu, Heng Wu, Yi Jian Dai, Li Rong Wang, Ren hua Tokuda, Keiichi |
| Authors alternative | : | 徳田, 恵一 |
| Issue Date | : | 24-Apr-2009 |
| Abstract | : | This paper proposes a state duration modeling method using full covariance matrix for HMM-based speech synthesis. In this method, a full covariance matrix instead of the conventional diagonal covariance matrix is adopted in the multi-dimensional Gaussian distribution to model the state duration of each context-dependent phoneme. At synthesis stage, the state durations are predicted using the clustered context-dependent distributions with full covariance matrices. Experimental results show that the synthesized speech using full-covariance state duration models is more natural than the conventional method when we change the speaking rate of synthesized speech. |
| Type Local | : | 会議発表論文 |
| ISBN | : | 9781424423545 |
| DOI | : | 10.1109/ICASSP.2009.4960513 |
| Publisher | : | Institute of Electrical and Electronics Engineers |
| URI | : | http://repo.lib.nitech.ac.jp/handle/123456789/2205 |
| Citation | : | ICASSP 2009. IEEE International Conference on Acoustics, Speech and Signal Processing, 2009. p.4033 -4036 |
| Appears in Collections | : | Academic Paper - Nagare College
|
Files in This Item:
| File |
Description |
Size | Format |
| ICASSP2009_lu_heng.pdf | | 202Kb | Adobe PDF | View/Open |
|
|
|