 |
 |
|
|
|
|
Title: |
US6202049:
Identification of unit overlap regions for concatenative speech synthesis system
[ Derwent Title ]

|
Country: |
US United States of America

|
| |
Inventor: |
Kibre, Nicholas; Lompoc, CA
Pearson, Steve; Santa Barbara, CA

|
Assignee: |
Matsushita Electric Industrial Co., Ltd., Osaka, Japan
other patents from MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD. (358975) (approx. 19,828)
News, Profiles, Stocks and More about this company

|
Published / Filed: |
2001-03-13
/ 1999-03-09

|
Application Number: |
US1999000264981

|
IPC Code: |
Advanced:
G06F 15/18;
G06N 3/04;
G10L 13/06;
G10L 13/08;
G10L 15/14;
G10L 15/16;
Core:
G06N 3/00;
G10L 13/00;
G10L 15/00;
more...
IPC-7:
G10L 13/06;

|
ECLA Code: |
G10L13/06C;

|
U.S. Class: |
Current:
704/267;
704/254;
704/E13.01;
Original:
704/267;
704/254;

|
Field of Search: |
704/265,266,267,249,254,258

|
Priority Number: |
| 1999-03-09 |
US1999000264981 |

|
Abstract: |
Speech signal parameters are extracted from time-series data corresponding to different sound units containing the same vowel. The extracted parameters are used to train a statistical model, such as a Hidden Markov-based Model, that has a data structure for separately modeling the nuclear trajectory region of the vowel and its surrounding transition elements. The model is trained as through embedded re-estimation to automatically determine optimally aligned models that identify the nuclear trajectory region. The boundaries of the nuclear trajectory region serve to delimit the overlap region for subsequent sound unit concatenation.

|
Attorney, Agent or Firm: |
Harness, Dickey & Pierce, P.L.C. ;

|
Primary / Asst. Examiners: |
Hudspeth, David; Storm, Donald L.

|
INPADOC Legal Status: |
Show legal status actions
Family Legal Status Report

|
Designated Country: |
AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

|
Family: |
Show 13 known family members

|
First Claim:
Show all 15 claims |
What is claimed is:
1. A method for identifying a unit overlap region for concatenative speech synthesis, comprising:
- defining a statistical model for representing time-varying properties of speech;
- providing a plurality of time-series data corresponding to different sound units containing the same vowel;
- extracting speech signal parameters from said time-series data and using said parameters to train said statistical model;
- using said trained statistical model to identify a recurring sequence in said time-series data and associating said recurring sequence with a nuclear trajectory region of said vowel;
- using said recurring sequence to delimit the unit overlap region for concatenative speech synthesis.

|
Background / Summary: |
Show background / summary

|
Drawing Descriptions: |
Show drawing descriptions

|
Description: |
Show description

|
Forward References: |
Show 2 U.S. patent(s) that reference this one

|
 |
 |
|
|
|
|
Foreign References: |

|
Other Abstract Info: |
DERABS G2000-566952

|
Other References: |
Mercier, G., D. Bigorgne, L. Miclet, L. LeGuenne, and M. Querre, "Recognition of Speaker-dependent Continuous Speech with KEAL," IEE Proceedings-Communications, Speech, and Vision, Part I, vol. 136, iss. 2, Apr. 1989, pp. 145-154.
(10 pages)
Cited by 3 patents
Weigel, Walter, "Continuous Speech-Recognition with Vowel-Context-Independent Hidden Markov Models for Demisyllables," Proc. ICSLP, Kobe Japan, Nov. 1990, pp. 701-704.
Matsui, K., S. D. Pearson, K. Hata, and T. Kamai, "Improving Naturalness in Text-to-Speech Synthesis Using Natural Glottal Source," 1991 Int. Conf. Acoust., Speech, Sig. Proc., 1991, ICASSP-91, vol. 2, Apr. 14-17 1991, pp. 769-772.
Boeffard, O., L. Miclet, and S. White, "Automatic Generation of Optimized Unit Dictionaries for text to Speech Synthesis," Int. Conf. Spoken Language Proc., Banff, Alberta, Canada, vol. 2, Oct. 12-16, 1992, pp. 1211-1241.
Acero, H. Hon, A., Huang, X., Liu, J., and Plumpe, M.; "Automatic Generation Of Synthesis Units For Trainable Text-To-Speech Systems"; Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No. 98CH36181) Part vol. 1; pp. 293-296 vol. 1; May 1998.
Boeffard, O., Miclet, L., and White, S.; "Automatic Generation Of Optimized Unit Dictionaries For Text To Speech Synthesis"; In Proceedings ICSLP 92, Baraff, Alberta, Canada; pp. 1211-1214.; 1992.
Conkie, Alistair D., and Isard, Stephen; "Optimal Coupling of Diphones"; Text-To-Speech Synthesis: Progress In Speech Synthesis Workshop; 2nd ; pp. 293-304; Spring 1996.

|


|
Nominate this for the Gallery...

|
|