Work Files Saved Searches
   My Account                                                  Search:   Quick/Number   Boolean   Advanced       Help   


 The Delphion Integrated View

  Buy Now:   Buy PDF- 10pp  PDF  |   File History  |   Other choices   
  Tools:  Citation Link  |  Add to Work File:    
  View:  Expand Details   |  INPADOC   |  Jump to: 
 
 Email this to a friend  Email this to a friend 
       
Title: US5313531: Method and apparatus for speech analysis and speech recognition
[ Derwent Title ]


Country: US United States of America

View Images High
Resolution

 Low
 Resolution

 
10 pages

 
Inventor: Jackson, John W.; Southlake, TX

Assignee: International Business Machines Corporation, Armonk, NY
other patents from INTERNATIONAL BUSINESS MACHINES CORPORATION (280070) (approx. 44,393)
 News, Profiles, Stocks and More about this company

Published / Filed: 1994-05-17 / 1990-11-05

Application Number: US1990000610888

IPC Code: Advanced: G10L 11/00; G10L 15/00; G10L 15/02; G10L 15/10; G10L 21/06;
Core: G10L 21/00; more...
IPC-7: G10L 5/00;

ECLA Code: G10L21/06;

U.S. Class: Current: 704/243; 704/231; 704/276; 704/E21.019;
Original: 381/041; 395/002.4;

Field of Search: 381/051,43,42,41,48,49 395/002,2.2,2.4,2.29,2.67,2.76,2.77,2.6,2.16

Priority Number:
1990-11-05  US1990000610888

Abstract:     A method and apparatus are disclosed for speech analysis and speech recognition. Each speech utterance under examination in accordance with the method of the present invention is digitally sampled and represented as a temporal sequence of data frames. Each data frame is then analyzed by the application of a Fast Fourier Transform (FFT) to obtain an indication of the energy content of each data frame in a plurality of frequency bands or bins. An indication of each of the most significant frequency bands, in terms of energy content, are then plotted by bin number for all data frames and graphically combined to create a power content signature for the speech utterance which is indicative of the movement of audio power through the audio spectrum over time for that utterance. By comparing the power content signature of an unknown speech utterance to a number of previously stored power content signatures, each associated with a known utterance, it is possible to identify an unknown speech utterance with a high degree of accuracy. In one preferred embodiment of the present invention, comparisons of power content signatures from unknown speech utterances are made with stored power content signatures utilizing a least squares fit or other suitable technique.

Attorney, Agent or Firm: Dillon, Andrew J. ;

Primary / Asst. Examiners: Shaw, Dale M.; Tung, Kee M.

Maintenance Status: E2 Expired  Check current status

INPADOC Legal Status: Show legal status actions          Buy Now: Family Legal Status Report

Designated Country: DE FR GB 

Family: Show 5 known family members

First Claim:
Show all 19 claims
I claim:     1. A method for analyzing human speech, said method comprising the steps of:
  • representing a speech utterance as a temporal sequence of frames, each frame representing acoustic parameters at one of a succession of brief time periods;
  • analyzing each frame of acoustic parameters to obtain a plurality of spectral parameters, each of said plurality of spectral parameters representing an energy level at one of a series of different frequency bins;
  • identifying a spectral parameter within each frame having the highest energy level within that frame; and
  • plotting an indication of said spectral parameters having the highest energy level for each frame in said temporal sequence to form a first continuous signature representative of said speech utterance.


Background / Summary: Show background / summary

Drawing Descriptions: Show drawing descriptions

Description: Show description

Forward References: Show 16 U.S. patent(s) that reference this one

       
U.S. References: Go to Result Set: All U.S. references   |  Forward references (16)   |   Backward references (8)   |   Citation Link

Buy
PDF
Patent  Pub.Date  Inventor Assignee   Title
Get PDF - 7pp US3588353  1971-06 Martin   SPEECH SYNTHESIZER UTILIZING TIMEWISE TRUNCATION OF ADJACENT PHONEMES TO PROVIDE SMOOTH FORMANT TRANSITION
Get PDF - 11pp US3603738  1971-09 Focht  Philco-Ford Corporation TIME-DOMAIN PITCH DETECTOR AND CIRCUITS FOR EXTRACTING A SIGNAL REPRESENTATIVE OF PITCH-PULSE SPACING REGULARITY IN A SPEECH WAVE
Get PDF - 54pp US4748670  1988-05 Bahl et al.  International Business Machines Corporation Apparatus and method for determining a likely word sequence from labels generated by an acoustic processor
Get PDF - 9pp US4776017  1988-10 Fujimoto  Ricoh Company, Ltd. Dual-step sound pattern matching
Get PDF - 26pp US4809332  1989-02 Jongman et al.  Central Institute for the Deaf Speech processing apparatus and methods for processing burst-friction sounds
Get PDF - 19pp US4829574  1989-05 Dewhurst et al.  The University of Melbourne Signal processing
Get PDF - 19pp US4852170  1989-07 Boedeaux  R & D Associates Real time computer speech recognition system
Get PDF - 18pp US4933973  1990-06 Porter  ITT Corporation Apparatus and methods for the selective addition of noise to templates employed in automatic speech recognition systems
       
Foreign References: None

Other Abstract Info: DERABS G92-161058

Other References:
  • Flanagan "Speech Analysis Synthesis and Perception", Springer-Verlag 1972 pp. 141-147, 150-155.


  • Inquire Regarding Licensing

    Powered by Verity


    Plaques from Patent Awards      Gallery of Obscure PatentsNominate this for the Gallery...

    Thomson Reuters Copyright © 1997-2014 Thomson Reuters 
    Subscriptions  |  Web Seminars  |  Privacy  |  Terms & Conditions  |  Site Map  |  Contact Us  |  Help