Work Files Saved Searches
   My Account                                                  Search:   Quick/Number   Boolean   Advanced       Help   


 The Delphion Integrated View

  Buy Now:   Buy PDF- 14pp  PDF  |   File History  |   Other choices   
  Tools:  Citation Link  |  Add to Work File:    
  View:  Expand Details   |  INPADOC   |  Jump to: 
 
 Email this to a friend  Email this to a friend 
       
Title: US6119086: Speech coding via speech recognition and synthesis based on pre-enrolled phonetic tokens
[ Derwent Title ]


Country: US United States of America

View Images High
Resolution

 Low
 Resolution

 
14 pages

 
Inventor: Ittycheriah, Abraham; Danbury, CT
Maes, Stephane H.; Danbury, CT
Nahamoo, David; White Plains, NY

Assignee: International Business Machines Corporation, Armonk, NY
other patents from INTERNATIONAL BUSINESS MACHINES CORPORATION (280070) (approx. 44,393)
 News, Profiles, Stocks and More about this company

Published / Filed: 2000-09-12 / 1998-04-28

Application Number: US1998000067863

IPC Code: Advanced: G10L 19/00; G10L 15/02;
IPC-7: G10L 13/06; G10L 13/08; G10L 15/26;

ECLA Code: G10L19/00S;

U.S. Class: Current: 704/267; 704/235; 704/249; 704/260; 704/E19.007;
Original: 704/267; 704/235; 704/249; 704/260;

Field of Search: 704/235,249,260,267

Priority Number:
1998-04-28  US1998000067863

Abstract:     A speech coding system, responsive to an input speech signal provided by a system user, comprises: a speech coding portion including a speech recognition system responsive to the input speech signal and having a word vocabulary associated therewith, the speech recognition system recognizing the input speech signal in accordance with the vocabulary and generating phonetic tokens, such as at least one sequence of lefemes, representative of the input speech signal; a channel, responsive to the at least one sequence of lefemes, for transmitting and/or storing the at least one sequence of lefemes; and a speech synthesizing portion, responsive to the transmitted/stored sequence of lefemes, for generating a synthesized speech signal which is representative of the input speech signal provided by the system user using the at least one sequence of lefemes. The speech recognition system preferably generates acoustic parameters from the input speech signal which include voice characteristics of the system user. The speech coding system also preferably comprises a labeler which processes the input speech signal including words uttered by the system user which are not in the word vocabulary associated with the speech recognition system, the labeler generating phonetic tokens, such as at least one sequence of lefemes, optimally representative of the input speech signal. The sequence of lefemes from the labeler and the speech recognition portion are compared, for each speech segment, and the sequence most similar to the input speech is selected for transmission/storage. The speech synthesizing portion of the system preferably performs speech synthesis using pre-enrolled phonetic sub-units or tokens.

Attorney, Agent or Firm: F. Chau & Associates, LLP ;

Primary / Asst. Examiners: Hudspeth, David R.; Smits, Talivaldis Ivars

Maintenance Status: E1 Expired  Check current status

INPADOC Legal Status: Show legal status actions

Family: None

First Claim:
Show all 38 claims
What is claimed is:     1. A speech coding system responsive to an input speech signal provided by a system user, the system comprising:
  • a first speech transcribing means comprising a speech recognition means having a word vocabulary associated therewith, the speech recognition means recognizing words in the input speech signal in accordance with the vocabulary and generating at least one phonetic token representative of the input speech signal;
  • a second speech transcribing means for generating at least one phonetic token representative of a word in the input speech signal which is not in the word vocabulary;
  • channel means, responsive to at least one of the phonetic tokens, for handling at least one of the phonetic tokens in accordance with an application of the speech coding system; and
  • speech synthesizing means, responsive to the channel means, for generating a synthesized speech signal using at least one of a plurality of pre-enrolled phonetic tokens that substantially matches at least one of the phonetic tokens which is representative of the input speech signal provided by the system user.


Background / Summary: Show background / summary

Drawing Descriptions: Show drawing descriptions

Description: Show description

Forward References: Show 14 U.S. patent(s) that reference this one

       
U.S. References: Go to Result Set: All U.S. references   |  Forward references (14)   |   Backward references (8)   |   Citation Link

Buy
PDF
Patent  Pub.Date  Inventor Assignee   Title
Get PDF - 17pp US4424415  1984-01 Lin  Texas Instruments Incorporated Formant tracker
Get PDF - 14pp US4473904  1984-09 Suehiro et al.  Hitachi, Ltd. Speech information transmission method and system
Get PDF - 17pp US4661915  1987-04 Ott  Texas Instruments Incorporated Allophone vocoder
Get PDF - 14pp US4707858  1987-11 Fette  Motorola, Inc. Utilizing word-to-digital conversion
Get PDF - 13pp US5305421  1994-04 Li  ITT Corporation Low bit rate speech coding system and compression
Get PDF - 11pp US5524051  1996-06 Ryan  Command Audio Corporation Method and system for audio information dissemination using various modes of transmission
Get PDF - 6pp US5696879  1997-12 Cline et al.  International Business Machines Corporation Method and apparatus for improved voice transmission
Get PDF - 12pp US5832425  1998-11 Mead  Hughes Electronics Corporation Phoneme recognition and difference signal for speech coding/decoding
       
Foreign References: None

Other References:
  • D. A. Reynolds and L. P. Heck, "Integration of Speaker and Speech Recognition Systems," Proc. IEEE ICASSP 91, p. 869-872, Apr. 1991.


  • Inquire Regarding Licensing

    Powered by Verity


    Plaques from Patent Awards      Gallery of Obscure PatentsNominate this for the Gallery...

    Thomson Reuters Copyright © 1997-2014 Thomson Reuters 
    Subscriptions  |  Web Seminars  |  Privacy  |  Terms & Conditions  |  Site Map  |  Contact Us  |  Help