Work Files Saved Searches
   My Account                                                  Search:   Quick/Number   Boolean   Advanced       Help   


 The Delphion Integrated View

  Buy Now:   Buy PDF- 15pp  PDF  |   File History  |   Other choices   
  Tools:  Citation Link  |  Add to Work File:    
  View:  Expand Details   |  INPADOC   |  Jump to: 
 
 Email this to a friend  Email this to a friend 
       
Title: US6101470: Methods for generating pitch and duration contours in a text to speech system
[ Derwent Title ]


Country: US United States of America

View Images High
Resolution

 Low
 Resolution

 
15 pages

 
Inventor: Eide, Ellen M.; Mount Kisco, NY
Donovan, Robert E.; Mount Kisco, NY

Assignee: International Business Machines Corporation, Armonk, NY
other patents from INTERNATIONAL BUSINESS MACHINES CORPORATION (280070) (approx. 44,393)
 News, Profiles, Stocks and More about this company

Published / Filed: 2000-08-08 / 1998-05-26

Application Number: US1998000084679

IPC Code: Advanced: G10L 13/08;
IPC-7: G10L 13/08;

ECLA Code: G10L13/10;

U.S. Class: Current: 704/260; 704/268; 704/E13.013;
Original: 704/260; 704/268;

Field of Search: 704/260,266,267,268

Priority Number:
1998-05-26  US1998000084679

Abstract:     A method for automatically generating pitch contours in a text to speech (TtS) system, the system converting input text into an output acoustic signal simulating natural speech, the method comprising the steps of: storing a plurality of associated stress and pitch level pairs, each of the plurality of pairs including a lexical stress level and a pitch level; calculating lexical stress levels of the input text; comparing the stress levels of the input text to the stored stress levels of the plurality of associated stress and pitch level pairs to find the stored stress levels closest to the stress levels of the input text; and copying the pitch levels associated with the closest stored stress levels of the stress and pitch level pairs to generate the pitch contours of the input text. Features illustrative of various modes of the invention include stress and pitch level pairs that correspond with the end of vowels, use of a phonetic dictionary to expand words to phonemes and concatenate stress levels, blocking sentences and the stress contours into constant or variable lengths by segmenting from the ends toward the beginnings, and averaging at the block boundary. The method may distinguish among declarations, questions, and exclamations. Training text may be collected from more than one speaker and scaled; the speaker(s) may wear a laryngograph to provide vocal cord activity.

Attorney, Agent or Firm: F. Chau & Associates, LLP ;

Primary / Asst. Examiners: Hudspeth, David R.; Storm, Donald L.

INPADOC Legal Status: Show legal status actions

Family: None

First Claim:
Show all 41 claims
What is claimed is:     1. A method for generating pitch contours in a text to speech (TtS) system, the system converting input text into an output acoustic signal simulating natural speech, the method comprising the steps of:
  • (a) storing a plurality of associated stress and pitch level pairs, each of the plurality of pairs including a lexical stress level and a pitch level;
  • (b) determining lexical stress levels of the input text;
  • (c) comparing the stress levels of the input text to the stored stress levels of the plurality of associated stress and pitch levels pairs to find the stored stress levels closest to the stress levels of the input text; and
  • (d) copying the pitch levels associated with the closest stress levels of the stress and pitch level pairs to generate the pitch contours of the input text.


Background / Summary: Show background / summary

Drawing Descriptions: Show drawing descriptions

Description: Show description

Forward References: Show 50 U.S. patent(s) that reference this one

       
U.S. References: Go to Result Set: All U.S. references   |  Forward references (50)   |   Backward references (7)   |   Citation Link

Buy
PDF
Patent  Pub.Date  Inventor Assignee   Title
Get PDF - 17pp US3704345  1972-11 Coker et al.  Bell Telephone Laboratories, Incorporated CONVERSION OF PRINTED TEXT INTO SYNTHETIC SPEECH
Get PDF - 13pp US4278838  1981-07 Antonov  Edinen Centar Po Physika Method of and device for synthesis of speech from printed text
Get PDF - 12pp US4908867  1990-03 Silverman  British Telecommunications public limited company Speech synthesis
Get PDF - 28pp US5384893  1995-01 Hutchins  Emerson & Stern Associates, Inc. Method and apparatus for speech synthesis based on prosodic analysis
Get PDF - 11pp US5536171  1996-07 Javkin et al.  Panasonic Technologies, Inc. Synthesis-based speech training system and method
Get PDF - 11pp US5758320  1998-05 Asano  Sony Corporation Method and apparatus for text-to-voice audio output with accent control and improved phrase control
Get PDF - 17pp US5913193  1999-06 Huang et al.  Microsoft Corporation Method and system of runtime acoustic unit selection for speech synthesis
       
Foreign References: None

Other References:
  • Xuedong Huang, A. Acero, J. Adcock, Hsiao-Wuen Hon, J. Goldsmith, Jingsong Liu, and M. Plumpe, "Whistler: A Trainable Text-to-Speech System," Proc. Fourth Int. Conf. Spoken Language, 1996. ICSLP 96, vol. 4, pp. 2387-2390, Oct.3-6, 1996.
  • Campbell et al., Stress, Prominence, and Spectral Tilt, ESCA Workshop on Intonation: Theory, Models and Applications, Athens Greece, Sep. 18-20, 1997, pp. 67-70.
  • Huang et al. Recent Improvements on Microsoft's Trainable Text-to-Speech System-Whistler, 1997 IEEE, pp. 959-962; ICASSP-97, Apr. 21-24.
  • Donovan et al., Improvements in an HMM-Based Synthesizer, ESCA Eurospeech '95.4th European Conference on Speech Communication and Technology, Madrid, Sep. 1995, pp. 573-576.
  • G. David Forney, Jr.; The Viterbi Algorithm, Proceedings of the IEEE, vol. 61, No. 3, Mar. 1973, pp. 268-278. (11 pages) Cited by 159 patents


  • Inquire Regarding Licensing

    Powered by Verity


    Plaques from Patent Awards      Gallery of Obscure PatentsNominate this for the Gallery...

    Thomson Reuters Copyright © 1997-2014 Thomson Reuters 
    Subscriptions  |  Web Seminars  |  Privacy  |  Terms & Conditions  |  Site Map  |  Contact Us  |  Help