

 The Delphion Integrated View

       
Title: US6092039: Symbiotic automatic speech recognition and vocoder


Country: US United States of America

Pages: 30

 
Inventor: Zingher, Arthur Richard; White Plains, NY

Assignee: International Business Machines Corporation, Armonk, NY

Published / Filed: 2000-07-18 / 1997-10-31

Application Number: US1997000960535

IPC Code: Advanced: G10L 15/02; G10L 19/00;
Core: G10L 15/00;
IPC-7: G10L 19/02;

ECLA Code: G10L15/02; G10L19/00U;

U.S. Class: Current: 704/221; 704/203; 704/222; 704/231; 704/E15.004; 704/E19.008;
Original: 704/221; 704/222; 704/231; 704/203;

Field of Search: 704/203,204,231-235,239,246,221,222,223

Priority Number:
1997-10-31  US1997000960535

Abstract: The device and method of the invention receives a digital speech signal, which is processed by an Acoustic Processor to produce a Mel-Cepstrum Vector and Pitch. This is recalibrated and encoded. The encoded signal is transmitted over a narrow-band Channel, then decoded, split and recalibrated. From the split signals, one signal feeds a Statistical Processor which produces Recognized Text. Another signal feeds a Regenerator, which produces Regenerated Speech. The device and method according to the invention achieve simultaneously very perceptive Automatic Speech Recognition and high quality VoCoding, using Speech communicated or stored via a Channel with narrow-bandwidth; very perceptive Automatic Speech Recognition on a Client & Server system without a need to store or to communicate wide-bandwidth Speech signals; very perceptive Automatic Speech Recognition with Deferred Review and Editing without storage of wide-bandwidth Speech signals; better feedback in a system for Automatic Speech Recognition particularly for Deferred Automatic Speech Recognition; and good usability for unified Automatic Speech Recognition and VoCoding.
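
The receiving side described in the abstract (decode, split, then feed two consumers) can be illustrated with a minimal Python sketch. Everything named here is hypothetical: `decode_frame`, `split_frame`, `receive`, the toy codebook, and the assumption that pitch is the last vector component. The logarithmic pitch encoding is taken from the first claim; undoing it with `math.exp` is an assumption, not something the record states.

```python
import math

def decode_frame(index, codebook):
    """Look up the transmitted index in a codebook shared by both ends."""
    return codebook[index]

def split_frame(vector):
    """Split a decoded vector into cepstral part and pitch part.

    Assumes (hypothetically) that pitch is the last component.
    """
    return vector[:-1], vector[-1]

def receive(index, codebook):
    """Decode one frame and prepare the two branch inputs."""
    mcv, log_pitch = split_frame(decode_frame(index, codebook))
    # Branch 1: features for the statistical processor (recognition).
    asr_features = list(mcv)
    # Branch 2: parameters for the regenerator (speech synthesis).
    # Undoing a natural-log pitch encoding is an assumption.
    regen_params = {"mcv": list(mcv), "pitch_hz": math.exp(log_pitch)}
    return asr_features, regen_params

# Toy two-entry codebook: [mcv_0, mcv_1, log(pitch in Hz)]
codebook = [
    [0.0, 0.0, math.log(100.0)],
    [2.0, 1.0, math.log(200.0)],
]
asr_features, regen_params = receive(1, codebook)
```

Only the codebook index crosses the channel in this sketch, which is what keeps the bandwidth low; both branches are fed from the same decoded vector.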

Attorney, Agent or Firm: F. Chau & Associates, LLP

Primary / Asst. Examiners: Hudspeth, David R.; Azad, Abul K.

Maintenance Status: E1 Expired


Family: None

First Claim (1 of 13):
What is claimed is:
1. A method for automatic speech recognition (ASR) and vocoding (VC), comprising the steps of:
  • converting a first signal representing speech to a second signal having raw mel cepstrum vector (MCV) and a third signal having raw pitch;
  • subtracting a calibration vector from said MCV to form a difference vector;
  • multiplying a calibration matrix with said difference vector to produce a recalibrated MCV;
  • recalibrating said raw pitch with a logarithmic function;
  • concatenating said recalibrated MCV with said recalibrated pitch to form a recalibrated vector;
  • compressing and quantizing said recalibrated vector to form a vector quantized signal; and
  • forwarding said vector quantized signal to a remote receiver for decoding said vector quantized signal received by the remote receiver to recover said speech.
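
The sending-side steps of this claim can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the patented implementation: the function names, the toy calibration values, the two-entry codebook, the natural logarithm, and nearest-neighbour search as the quantizer are all hypothetical choices. The claim's separate "compressing" step is not modelled.

```python
import math

def recalibrate_mcv(raw_mcv, calib_vector, calib_matrix):
    """Subtract the calibration vector, then apply the calibration matrix."""
    diff = [x - c for x, c in zip(raw_mcv, calib_vector)]
    return [sum(m * d for m, d in zip(row, diff)) for row in calib_matrix]

def recalibrate_pitch(raw_pitch_hz):
    """Logarithmic recalibration; the natural log is an assumption."""
    return math.log(raw_pitch_hz)

def quantize(vector, codebook):
    """Index of the nearest codebook entry (squared Euclidean distance)."""
    def dist(entry):
        return sum((v - e) ** 2 for v, e in zip(vector, entry))
    return min(range(len(codebook)), key=lambda i: dist(codebook[i]))

def encode_frame(raw_mcv, raw_pitch_hz, calib_vector, calib_matrix, codebook):
    """Recalibrate, concatenate MCV with pitch, and vector-quantize."""
    mcv = recalibrate_mcv(raw_mcv, calib_vector, calib_matrix)
    recalibrated = mcv + [recalibrate_pitch(raw_pitch_hz)]
    return quantize(recalibrated, codebook)

# Toy values, chosen only so the arithmetic is easy to follow.
calib_vector = [1.0, 2.0]
calib_matrix = [[2.0, 0.0], [0.0, 0.5]]
codebook = [
    [0.0, 0.0, math.log(100.0)],
    [2.0, 1.0, math.log(200.0)],
]
index = encode_frame([2.0, 4.0], 200.0, calib_vector, calib_matrix, codebook)
```

With these toy values the difference vector is `[1.0, 2.0]`, the recalibrated MCV is `[2.0, 1.0]`, and the frame matches the second codebook entry, so `index` is `1`. That index is the only value that would be forwarded to the remote receiver.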



Forward References: 40 U.S. patents reference this one

       
U.S. References (6 backward references):

Patent | Pub. Date | Inventor | Assignee | Title | Pages
US4707858 | 1987-11 | Fette | Motorola, Inc. | Utilizing word-to-digital conversion | 14
US5485543 | 1996-01 | Aso | Canon Kabushiki Kaisha | Method and apparatus for speech analysis and synthesis by sampling a power spectrum of input speech | 10
US5590242 | 1996-12 | Juang et al. | Lucent Technologies Inc. | Signal bias removal for robust telephone speech recognition | 17
US5606645 | 1997-02 | Matsuura | Kabushiki Kaisha Toshiba | Speech pattern recognition apparatus utilizing multiple independent sequences of phonetic segments | 11
US5649056 | 1997-07 | Nitta | Kabushiki Kaisha Toshiba | Speech recognition system and method which permits a speaker's utterance to be recognized using a hidden markov model with subsequent calculation reduction | 11
US5729694 | 1998-03 | Holzrichter et al. | The Regents of the University of California | Speech coding, reconstruction and recognition using acoustics and electromagnetic waves | 57
       
Foreign References: None

Other Abstract Info: DERABS G2000-593327

Other References:
  • Rabiner, "Applications of Voice Processing to Telecommunications," Proceedings of the IEEE, vol. 82, No. 2, Feb. 1994.
  • Imai et al., "A Direct Approximation Technique of Log Magnitude Response for Digital Filters," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-25, No. 2, Apr. 1977.
  • Imai, "Low Bit Rate Cepstral Vocoder Using the Log Magnitude Approximation Filter," IEEE 1978, pp. 441-444.
  • Koishida et al., "CELP Coding Based on Mel-Cepstral Analysis," IEEE 1995, pp. 33-36.

