Title: US4741036: Determination of phone weights for Markov models in a speech recognition system

Country: US United States of America

Pages: 13

Inventor: Bahl, Lalit R.; Amawalk, NY
DeSouza, Peter V.; Yorktown Heights, NY
Mercer, Robert L.; Yorktown Heights, NY

Assignee: International Business Machines Corporation, Armonk, NY

Published / Filed: 1988-04-26 / 1985-01-31

Application Number: US1985000696976

IPC Code: Advanced: G10L 11/00; G10L 15/14; G10L 15/02;
IPC-7: G10L 5/00;

ECLA Code: G10L15/14M1;

U.S. Class: Current: 704/256; 704/E15.029;
Original: 381/043;

Field of Search: 381/041-47 364/513.5

Priority Number:
1985-01-31  US1985000696976

Abstract: In a speech recognition system, discrimination between similar-sounding uttered words is improved by weighting the probability vector data stored for the Markov model representing the reference word sequence of phones. The weighting vector is derived for each reference word by comparing similar-sounding utterances using Viterbi alignment and multivariate analysis which maximizes the differences between correct and incorrect recognition multivariate distributions.
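
The weighting described in the abstract can be illustrated with a small sketch. It assumes that a word's match score is the sum of per-phone log-probabilities produced by Viterbi alignment, and that weighting means scaling each phone's term by its own factor. The summation rule, the function name `word_score`, and all numbers are illustrative assumptions, not details taken from this record.

```python
def word_score(phone_log_probs, weights=None):
    """Score one word model against an utterance.

    phone_log_probs: one log-probability per phone of the word model
                     (assumed to come from a Viterbi alignment).
    weights:         one weight per phone; None means equal weights of 1.0,
                     which reduces to a plain sum (the unweighted case).
    """
    if weights is None:
        weights = [1.0] * len(phone_log_probs)
    if len(weights) != len(phone_log_probs):
        raise ValueError("need exactly one weight per phone")
    return sum(w * lp for w, lp in zip(weights, phone_log_probs))


# Hypothetical per-phone log-probabilities for one three-phone word model.
log_probs = [-1.0, -2.0, -0.5]

unweighted = word_score(log_probs)
# Hypothetical weights that emphasise the second phone, e.g. because that
# phone is what separates this word from a similar-sounding one.
weighted = word_score(log_probs, [0.5, 2.0, 1.0])
```

With equal weights every phone contributes equally. A per-word weight vector lets the phones that actually distinguish two confusable words dominate the score.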

Attorney, Agent or Firm: Block, Marc A.

Primary / Asst. Examiners: Kemeny, Emanuel S.


Designated Country: DE FR GB 

Family: 7 known family members

First Claim (of 17 claims):
We claim:
1. In a speech recognition system having
    • (a) a speech processor which converts input word utterances into coded label strings, and
    • (b) a stored vocabulary comprising for each word a model comprising
      • (i) a plurality of phones representation, and
      • (ii) statistical data including label probabilities,
wherein the probabilities that any label string represents the phones of a given word is indicated by corresponding probability vectors, and in which the label string of each word utterance to be recognized is matched in a Viterbi alignment procedure against word models in the vocabulary, whereby the word having the highest probability for the respective label string is selected as output word,
  • a speech recognition method for improving the capability of discriminating between similar utterances corresponding to different words, the method comprising the steps of:
    • (a) identifying for each label string of a plurality of utterances, in a fast match procedure, a subset of coarsely matching candidate words and indicating which of these represented the correct word and which not,
    • (b) generating for each word an inverted list of label strings for which it was selected in the fast match procedure, and indicating whether the selection was correct or not,
    • (c) generating for each word, using the label strings identified in the inverted fast match output list and using the statistical data of the respective word model, a set of probability vectors in a Viterbi alignment procedure, each for one label string and carrying a designation whether the initial fast match selection was correct or wrong,
    • (d) generating for each word, from the sets of probability vectors, in a linear discriminant analysis procedure, a weighting vector, and
    • (e) weighting, during an actual speech recognition process, the probability vector elements by the associated weighting vector elements.
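
Steps (b) and (d) of the claim can be sketched in code. The sketch below assumes a two-class Fisher linear discriminant as the "linear discriminant analysis procedure"; the claim does not spell out the exact formulation, so this is one standard instance rather than the patented procedure. All names (`invert_fast_match`, `lda_weights`), the small ridge term, and the sample data are hypothetical.

```python
from collections import defaultdict


def invert_fast_match(fast_match):
    """Step (b): turn {utterance: (true_word, candidates)} into
    {word: [(utterance, was_selection_correct), ...]}."""
    inverted = defaultdict(list)
    for utt_id, (true_word, candidates) in fast_match.items():
        for word in candidates:
            inverted[word].append((utt_id, word == true_word))
    return dict(inverted)


def mean(vectors):
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def scatter(vectors, mu):
    """Sum of outer products of deviations from the mean."""
    d = len(mu)
    s = [[0.0] * d for _ in range(d)]
    for v in vectors:
        diff = [x - m for x, m in zip(v, mu)]
        for i in range(d):
            for j in range(d):
                s[i][j] += diff[i] * diff[j]
    return s


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular scatter matrix")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def lda_weights(correct, wrong, ridge=1e-6):
    """Step (d), assumed Fisher form: w = S^-1 (mean_correct - mean_wrong),
    where S is the pooled within-class scatter plus a small ridge
    (the ridge is an added assumption for numerical stability)."""
    mu_c, mu_w = mean(correct), mean(wrong)
    sc, sw = scatter(correct, mu_c), scatter(wrong, mu_w)
    d = len(mu_c)
    pooled = [[sc[i][j] + sw[i][j] + (ridge if i == j else 0.0)
               for j in range(d)] for i in range(d)]
    return solve(pooled, [c - w for c, w in zip(mu_c, mu_w)])


# Hypothetical fast-match output for two confusable words.
fast_match = {
    "u1": ("concern", ["concern", "concerned"]),
    "u2": ("concerned", ["concern", "concerned"]),
}
inverted = invert_fast_match(fast_match)

# Hypothetical per-phone log-probability vectors for one word model,
# split by whether the fast-match selection was correct (step (c) output).
correct = [[-0.8, -1.0], [-1.2, -1.0], [-1.0, -0.8], [-1.0, -1.2]]
wrong = [[-0.8, -3.0], [-1.2, -3.0], [-1.0, -2.8], [-1.0, -3.2]]
weights = lda_weights(correct, wrong)
```

In this made-up data only the second phone differs between correct and wrong selections, so the resulting weight vector puts essentially all of its weight on that phone. In step (e) such a vector would scale the per-phone probability elements during recognition.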

Forward References: 40 U.S. patents reference this one

U.S. References (2 backward references):
  • US4099257 (1978-06, 12 pp.); Arnold et al.; International Business Machines Corporation: "Markov processor for context encoding from given characters and for character decoding from given contexts".
  • US4100370 (1978-07, 9 pp.); Suzuki et al.; Fuji Xerox Co., Ltd.: "Voice verification system based on word pronunciation".

Foreign References: None listed

Other Abstract Info: DERABS G86-219881

Other References:
  • Language and Speech, vol. 4, No. 4, Oct./Dec. 1961, pp. 200-219; G. E. Peterson: "Automatic Speech Recognition Procedures".
  • The Bell System Technical Journal, vol. 60, No. 5, May-Jun. 1981, pp. 739-766, American Telephone and Telegraph Co., Murray Hill, NJ, US; L. R. Rabiner et al.: "A Two-Pass Pattern-Recognition Approach to Isolated Word Recognition". (28 pages)
