Title: US5046099: Adaptation of acoustic prototype vectors in a speech recognition system

Country: US United States of America

Pages: 10

 
Inventor: Nishimura, Masafumi; Yokohama, Japan

Assignee: International Business Machines Corporation, Armonk, NY

Published / Filed: 1991-09-03 / 1990-02-27

Application Number: US1990000485402

IPC Code: Advanced: G10L 11/00; G10L 15/02; G10L 15/06; G10L 15/14;
IPC-7: G10L 5/00;

ECLA Code: G10L15/07; S10L15/14M1;

U.S. Class: Current: 704/256.4; 704/222; 704/244; 704/E15.011;
Original: 381/043; 364/513.5;

Field of Search: 381/043 364/513.5

Priority Number:
1989-03-13  JP1989000057760

Abstract: In a speech recognition system, the prior parameters of acoustic prototype vectors are adapted to a new speaker to obtain posterior parameters by having the speaker utter a set of adaptation words. The prior parameters of an acoustic prototype vector are adapted by a weighted sum of displacement vectors obtained from the adaptation utterances. Each displacement vector is associated with one segment of an uttered adaptation word. Each displacement vector represents the distance between the associated segment of the adaptation utterance and the model corresponding to that segment. Each displacement vector is weighted by the strength of the relationship of the acoustic prototype vector to the word segment model corresponding to the displacement vector.
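
The weighted-sum update described in the abstract can be sketched as follows. This is a minimal illustration, not the patent's actual formula: the function name `adapt_prototype`, the use of plain Python lists, and the normalization of the weights so they sum to 1 are all assumptions made for the example.

```python
def adapt_prototype(prior, displacements, weights):
    """Shift a prior prototype vector by a weighted sum of displacement vectors.

    prior         -- prototype vector before adaptation (list of floats)
    displacements -- one displacement vector per adaptation-word segment
    weights       -- strength of the relationship between this prototype and
                     each segment model (non-negative floats)

    Assumption: weights are normalized to sum to 1; the abstract only says
    "weighted sum" and does not state the normalization.
    """
    total = sum(weights)
    if total == 0:
        # No segment is related to this prototype, so leave it unchanged.
        return list(prior)
    adapted = list(prior)
    for disp, w in zip(displacements, weights):
        for i, component in enumerate(disp):
            adapted[i] += (w / total) * component
    return adapted


# Hypothetical two-dimensional example with two adaptation segments.
posterior = adapt_prototype(
    prior=[1.0, 2.0],
    displacements=[[1.0, 0.0], [0.0, 2.0]],
    weights=[1.0, 3.0],
)
print(posterior)
```

A prototype that is strongly related to a segment moves most of the way along that segment's displacement, while a prototype with no related segments stays at its prior value.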

Attorney, Agent or Firm: Schechter, Marc D. ;

Primary / Asst. Examiners: Kemeny, Emanuel S.;

Maintenance Status: E3 Expired

Designated Country: DE FR GB 

Family: 9 known family members

First Claim (1 of 9):
I claim: 1. A speech recognition system performing a frequency analysis of an input speech for each period to obtain feature vectors, producing the corresponding label train using a vector quantization code book, matching a plurality of word baseforms expressed by a train of Markov models each corresponding to labels, with said label train, and recognizing the input speech on the basis of the matching result, and comprising:
  • a means for dividing each of a plurality of word input speeches into N segments (N is an integer number more than 1) and producing a representative value of the feature vector of each segment of each of said word input speeches;
  • a means for dividing word baseforms each corresponding to said word input speeches and producing a representative value of each segment feature vector of each word baseform on the basis of prototype vectors of said vector quantization code book;
  • a means for producing displacement vectors indicating the displacements between the representative values of the segments of the word input speeches and the representative values of the corresponding segments of the corresponding word baseforms;
  • a means for storing the degree of relation between each segment of said each word input speech and each label in a label group of the vector quantization code book; and
  • a prototype adaptation means for correcting a prototype vector of each label of said vector quantization code book by said each displacement vector in accordance with the degree of relation between the label and the segment.
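
The segmenting and displacement steps of the claim can be sketched roughly as below. The claim does not say how segments are bounded or what the "representative value" is, so this example assumes equal-length contiguous segments and a per-segment mean. It also assumes the word baseform is already expanded into a sequence of prototype vectors. The names `segment_means` and `displacement_vectors` are hypothetical.

```python
def segment_means(vectors, n_segments):
    """Split a vector sequence into n_segments contiguous parts and average each.

    Assumptions: near-equal-length segments, and the mean as the
    "representative value"; the claim leaves both unspecified.
    """
    length = len(vectors)
    if n_segments < 2:
        raise ValueError("the claim requires N to be more than 1")
    if length < n_segments:
        raise ValueError("need at least one vector per segment")
    means = []
    for s in range(n_segments):
        start = s * length // n_segments
        end = (s + 1) * length // n_segments
        chunk = vectors[start:end]
        dim = len(chunk[0])
        means.append([sum(v[i] for v in chunk) / len(chunk) for i in range(dim)])
    return means


def displacement_vectors(utterance_frames, baseform_prototypes, n_segments):
    """Per-segment displacement: utterance representative minus baseform representative."""
    spoken = segment_means(utterance_frames, n_segments)
    model = segment_means(baseform_prototypes, n_segments)
    return [[u - m for u, m in zip(us, ms)] for us, ms in zip(spoken, model)]


# Hypothetical one-dimensional features, four frames, N = 2 segments.
frames = [[1.0], [3.0], [5.0], [7.0]]
baseform = [[0.0], [0.0], [4.0], [4.0]]
print(displacement_vectors(frames, baseform, 2))
```

Each resulting displacement vector belongs to one segment of one adaptation word, which is what the prototype adaptation means then weights by the stored degree of relation between the segment and each label.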


Forward References: 41 U.S. patents reference this one

       
U.S. References (2 backward references):
  • US4718094, 1988-01, Bahl et al., International Business Machines Corp., "Speech recognition system"
  • US4977599, 1990-12, Bahl et al., International Business Machines Corporation, "Speech recognition employing a set of Markov models that includes Markov models representing transitions to and from silence"
       
Foreign References: None

Other Abstract Info: DERABS G90-284267 JAPABS 140558P000077

Other References:
  • Bahl, L. R., et al. "Acoustic Markov Models Used in the Tangora Speech Recognition System", Proc. ICASSP '88, S11-3, pp. 497-500, Apr. 1988.
  • Shikano, K. "Speaker Adaptation by Vector Quantization", Electronics and Communication Institute Technical Research Report, SP-86-65, pp. 33-40, Dec. 1986.
  • Furui, S. "Speaker Adaptation Method without a Teacher Based upon Clustering of Spectrum Space", Japanese Acoustic Institute, Proceeding of Spring National Meeting of Showa 63, 2-2-16, Mar. 1988.
  • Nishimura, M., et al. "Speaker Adaptation Method for HMM-Based Speech Recognition", Proc. ICASSP '88, S5-7, Apr. 1988.

