

 The Delphion Integrated View

Title: US6199041: System and method for sampling rate transformation in speech recognition


Country: US United States of America

Pages: 9
 
Inventor: Liu, Fu-Hua; Scarsdale, NY
Picheny, Michael A.; White Plains, NY

Assignee: International Business Machines Corporation, Armonk, NY

Published / Filed: 2001-03-06 / 1998-11-20

Application Number: US1998000197024

IPC Code: Advanced: G10L 15/06; G10L 21/00;
IPC-7: G10L 15/00;

ECLA Code: G10L15/065; G10L21/00;

U.S. Class: Current: 704/231; 704/203; 704/204; 704/234; 704/237; 704/E15.01; 704/E21.001;
Original: 704/231; 704/204; 704/203; 704/234; 704/237;

Field of Search: 704/200,202,211,219,231,236,241,246,247,265,267,258,252,205,234,237,203,204

Priority Number:
1998-11-20  US1998000197024

Abstract:     A method and system for transforming a sampling rate in speech recognition systems, in accordance with the present invention, includes the steps of providing cepstral based data including utterances comprised of segments at a reference frequency, the segments being represented by cepstral vector coefficients, converting the cepstral vector coefficients to energy bands in logarithmic spectra, filtering the energy bands of the logarithmic spectra to remove energy bands having a frequency above a predetermined portion of a target frequency and converting the filtered logarithmic spectra to modified cepstral vector coefficients at the target frequency. Another method and system convert system prototypes for speech recognition systems from a reference frequency to a target frequency.

Attorney, Agent or Firm: F. Chau & Associates, LLP

Primary / Asst. Examiners: Dorvil, Richemond; Nolan, Daniel A.


Family: None

First Claim (1 of 30):
What is claimed is:
1. A method for transforming a sampling rate in speech recognition systems comprising the steps of:
  • providing cepstral based data including utterances comprised of segments at a reference frequency, the segments being represented by cepstral vector coefficients;
  • converting the cepstral vector coefficients to energy bands in logarithmic spectra;
  • filtering the energy bands of the logarithmic spectra to remove energy bands having a frequency above a predetermined portion of a target frequency; and
  • converting the filtered logarithmic spectra to modified cepstral vector coefficients at the target frequency, the target frequency being different than the reference frequency.
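
The four claimed steps can be illustrated with a short sketch. This is not the patent's implementation, and it rests on several assumptions that the record above does not state: that the cepstral vectors were produced by an orthonormal DCT-II over log band energies, that the center frequency of each band is known, and that "a predetermined portion of a target frequency" means a fraction (here 0.5, i.e. the Nyquist limit) of the target sampling rate. All function and parameter names are hypothetical.

```python
import math


def dct_matrix(num_coeffs, num_bands):
    """Rows of an orthonormal DCT-II basis, shape (num_coeffs, num_bands)."""
    rows = []
    for k in range(num_coeffs):
        scale = math.sqrt((1.0 if k == 0 else 2.0) / num_bands)
        rows.append([
            scale * math.cos(math.pi * k * (2 * n + 1) / (2 * num_bands))
            for n in range(num_bands)
        ])
    return rows


def cepstra_to_log_bands(cepstrum, num_bands):
    """Step 2: map a (possibly truncated) cepstral vector back to log band energies."""
    basis = dct_matrix(len(cepstrum), num_bands)
    return [
        sum(cepstrum[k] * basis[k][n] for k in range(len(cepstrum)))
        for n in range(num_bands)
    ]


def log_bands_to_cepstra(log_bands, num_coeffs):
    """Step 4: map log band energies to cepstral coefficients."""
    basis = dct_matrix(num_coeffs, len(log_bands))
    return [
        sum(basis[k][n] * log_bands[n] for n in range(len(log_bands)))
        for k in range(num_coeffs)
    ]


def transform_sampling_rate(cepstrum, band_centers_hz, target_rate_hz, portion=0.5):
    """Convert one reference-rate cepstral vector to a target-rate cepstral vector.

    band_centers_hz: assumed center frequency of each reference-rate band.
    portion: assumed fraction of the target rate above which bands are removed.
    """
    cutoff_hz = portion * target_rate_hz
    log_bands = cepstra_to_log_bands(cepstrum, len(band_centers_hz))
    # Step 3: discard energy bands that lie above the cutoff.
    kept = [e for e, f in zip(log_bands, band_centers_hz) if f <= cutoff_hz]
    if not kept:
        raise ValueError("no energy bands at or below the cutoff frequency")
    num_coeffs = min(len(cepstrum), len(kept))
    return log_bands_to_cepstra(kept, num_coeffs)


if __name__ == "__main__":
    # Toy example: four bands at the reference rate, target rate of 8 kHz.
    reference_cepstrum = log_bands_to_cepstra([1.0, 2.0, 3.0, 4.0], 4)
    centers = [500.0, 1500.0, 3000.0, 6000.0]
    print(transform_sampling_rate(reference_cepstrum, centers, 8000.0))
```

When the cepstral vector keeps fewer coefficients than there are bands, the reconstruction in step 2 is only a smoothed approximation of the original log spectrum, so the output approximates what target-rate feature extraction would have produced.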



       
U.S. References: 6 backward references (listed below); 22 forward references

Patent | Pub. Date | Inventor | Assignee | Title | PDF Pages
US5165008 | 1992-11 | Hermansky et al. | U S West Advanced Technologies, Inc. | Speech synthesis using perceptual linear prediction parameters | 20
US5581653 | 1996-12 | Todd | Dolby Laboratories Licensing Corporation | Low bit-rate high-resolution spectral envelope coding for audio encoder and decoder | 25
US5732394 | 1998-03 | Nakadai et al. | Nippon Telegraph and Telephone Corporation | Method and apparatus for word speech recognition by pattern matching | 25
US5809459 | 1998-09 | Bergstrom et al. | Motorola, Inc. | Method and apparatus for speech excitation waveform coding using multiple error waveforms | 41
US5857000 | 1999-01 | Jar-Ferr et al. | National Science Council | Time domain aliasing cancellation apparatus and signal processing method thereof | 22
US5913188 | 1999-06 | Tzirkel-Hancock | Canon Kabushiki Kaisha | Apparatus and method for determining articulatory-orperation speech parameters | 32
       
Foreign References: None

Other References:
  • Haeb-Umbach et al (R. Haeb-Umbach, X. Aubert, P. Beyerlein, D. Klakow, M. Ullrich, A. Wendemuth, P. Wilcox, "Acoustic Modeling in the Philips Hub-4 Continuous-Speech Recognition System," DARPA Broadcast News, Transcription & Understanding Workshop, Feb. 1998).
  • Parrott (Parrott Systems, Inc., Internet web page "http://www.say-parrot.com/us/technology/algorithms/recognition/index.html," Feb. 2000).
  • Padmanabhan et al (M. Padmanabhan, L.R. Bahl, D. Nahamoo, M. Picheny, "Speaker Clustering and Transformation for Speaker Adaptation in Speech Recognition Systems", IEEE Transactions on Speech and Audio Processing, Jan. 1998).
  • Bahl et al., "Performance of the IBM Large Vocabulary Continuous Speech Recognition System on the ARPA Wall Street Journal Task," ICASSP-95, 1995.
  • Davis et al., "Comparison of Parametric Representation for Monosyllabic Word Recognition in Continuously Spoken Sentences", IEEE Trans. on ASSP, vol. 28, pp. 357-366, 1980.



Copyright © 1997-2014 Thomson Reuters