Work Files Saved Searches
   My Account                                                  Search:   Quick/Number   Boolean   Advanced       Help   


 The Delphion Integrated View

  Buy Now:   Buy PDF- 14pp  PDF  |   File History  |   Other choices   
  Tools:  Citation Link  |  Add to Work File:    
  View:  Expand Details   |  INPADOC   |  Jump to: 
 
 Email this to a friend  Email this to a friend 
       
Title: US5970453: Method and system for synthesizing speech

Country: US United States of America

View Images High
Resolution

 Low
 Resolution

 
14 pages

 
Inventor: Sharman, Richard Anthony; Highfield, United Kingdom

Assignee: International Business Machines Corporation, Armonk, NY
other patents from INTERNATIONAL BUSINESS MACHINES CORPORATION (280070) (approx. 44,393)
 News, Profiles, Stocks and More about this company

Published / Filed: 1999-10-19 / 1995-06-09

Application Number: US1995000489179

IPC Code: Advanced: G10L 13/07; G10L 13/04; G10L 13/08; G10L 15/14; G10L 25/27;
IPC-7: G10L 5/02; G10L 9/00;

ECLA Code: G10L13/07; S10L13/04; S10L15/14M; S10L25/27; S10L13/08;

U.S. Class: Current: 704/260; 704/258; 704/E13.01;
Original: 704/260; 704/258;

Field of Search: 395/2.67,2.64,2.65,2.63,2.54,2.69

Priority Number:
1995-01-07  GB1995000000284

Abstract:     A method and system for synthesizing acoustic waveforms in, for example, a text-to-speech system is disclosed which employs the concatenation of a very large number of very small, sub-phoneme, acoustic units. Such sub-phoneme sized audio segments, called wavelets, can be individually spectrally analyzed and labelled as fenones. Fenones are clustered into logically related groups called fenemes. Sequences of fenemes can be matched with individual phonemes, and hence words. In the case of a text-to-speech system, the required phonemes are determined from prior linguistic analysis of the input words in the text. Suitable sequences of fenemes are predicted for each phoneme in its own context using hidden markov modelling techniques. A complete output waveform is constructed by concatenating wavelets to produce a very long sequence thereof, each wavelet corresponding to its respective feneme. The advantages of using a feneme set extracted from a training script read by a single human speaker is that it is possible to generate natural sounding speech, using a finite sized codebook.

Primary / Asst. Examiners: Isen, Forester W.; Edouard, Patrick N.

Maintenance Status: E1 Expired  Check current status

INPADOC Legal Status: Show legal status actions          Buy Now: Family Legal Status Report

Family: Show 3 known family members

First Claim:
Show all 25 claims
I claim:     1. A method for synthesizing speech from text, comprising the steps of:
  • generating a sequence of sub-phoneme elements from text, each sub-phoneme element representing a corresponding acoustic waveform; and
  • concatenating said sub-phoneme elements to produce an output waveform, wherein said generating step comprises the steps of:
    • generating from said text corresponding speech elements; and
    • mapping each speech element to one or more of a plurality of sub-phoneme elements to produce said sequence.


Background / Summary: Show background / summary

Drawing Descriptions: Show drawing descriptions

Description: Show description

Forward References: Show 46 U.S. patent(s) that reference this one

       
U.S. References: Go to Result Set: All U.S. references   |  Forward references (46)   |   Backward references (11)   |   Citation Link

Buy
PDF
Patent  Pub.Date  Inventor Assignee   Title
Get PDF - 17pp US4521907  1985-06 Amir et al.  American Microsystems, Incorporated Multiplier/adder circuit
Get PDF - 18pp US4692941  1987-09 Jacks et al.  First Byte Real-time text-to-speech conversion system
Get PDF - 53pp US4833712  1989-05 Bahl et al.  International Business Machines Corporation Automatic generation of simple Markov model stunted baseforms for words in a vocabulary
Get PDF - 23pp US4882759  1989-11 Bahl et al.  International Business Machines Corporation Synthesizing word baseforms used in speech recognition
Get PDF - 13pp US5031217  1991-07 Nishimura  International Business Machines Corporation Speech recognition system using Markov models having independent label output sets
Get PDF - 31pp US5033087  1991-07 Bahl et al.  International Business Machines Corp. Method and apparatus for the automatic determination of phonological rules as for a continuous speech recognition system
Get PDF - 9pp US5165007  1992-11 Bahl et al.  International Business Machines Corporation Feneme-based Markov models for words
Get PDF - 16pp US5199077  1993-03 Wilcox et al.  Xerox Corporation Wordspotting for voice editing and indexing
Get PDF - 17pp US5230037  1993-07 Giustiniani et al.  International Business Machines Corporation Phonetic Hidden Markov model speech synthesizer
Get PDF - 10pp US5353377  1994-10 Kuroda et al.  International Business Machines Corporation Speech recognition system having an interface to a host computer bus for direct access to the host memory
Get PDF - 13pp US5502791  1996-03 Nishimura et al.  International Business Machines Corporation Speech recognition by concatenating fenonic allophone hidden Markov models in parallel among subwords
       
Foreign References: None

Other Abstract Info: DERABS G1996-303133

Inquire Regarding Licensing

Powered by Verity


Plaques from Patent Awards      Gallery of Obscure PatentsNominate this for the Gallery...

Thomson Reuters Copyright © 1997-2014 Thomson Reuters 
Subscriptions  |  Web Seminars  |  Privacy  |  Terms & Conditions  |  Site Map  |  Contact Us  |  Help