Work Files Saved Searches
   My Account                                                  Search:   Quick/Number   Boolean   Advanced       Help   


 The Delphion Integrated View

  Buy Now:   Buy PDF- 9pp  PDF  |   File History  |   Other choices   
  Tools:  Citation Link  |  Add to Work File:    
  View:  Expand Details   |  INPADOC   |  Jump to: 
 
 Email this to a friend  Email this to a friend 
       
Title: US6182037: Speaker recognition over large population with fast and detailed matches
[ Derwent Title ]


Country: US United States of America

View Images High
Resolution

 Low
 Resolution

 
9 pages

 
Inventor: Maes, Stephane Herman; Danbury, CT

Assignee: International Business Machines Corporation
other patents from INTERNATIONAL BUSINESS MACHINES CORPORATION (280070) (approx. 44,393)
 News, Profiles, Stocks and More about this company

Published / Filed: 2001-01-30 / 1997-05-06

Application Number: US1997000851982

IPC Code: Advanced: G10L 15/02; G10L 15/10; G10L 15/14; G10L 17/00;
IPC-7: G10L 17/00;

ECLA Code: G10L17/06;

U.S. Class: Current: 704/247; 704/245; 704/E17.007;
Original: 704/247; 704/245;

Field of Search: 704/246,247,250,249,243,245,244

Priority Number:
1997-05-06  US1997000851982

Abstract:     Fast and detailed match techniques for speaker recognition are combined into a hybrid system in which speakers are associated in groups when potential confusion is detected between a speaker being enrolled and a previously enrolled speaker. Thus the detailed match techniques are invoked only at the potential onset of saturation of the fast match technique while the detailed match is facilitated by limitation of comparisons to the group and the development of speaker-dependent models which principally function to distinguish between members of a group rather than to more fully characterize each speaker. Thus storage and computational requirements are limited and fast and accurate speaker recognition can be extended over populations of speakers which would degrade or saturate fast match systems and degrade performance of detailed match systems.

Attorney, Agent or Firm: McGuireWoods, LLP ; Otterstedt, Paul J. ;

Primary / Asst. Examiners: Hudspeth, David R.; Zintel, Harold

INPADOC Legal Status: Show legal status actions          Buy Now: Family Legal Status Report

Family: Show 6 known family members

First Claim:
Show all 23 claims
Having thus described my invention, what I claim as new and desire to secure by Letters Patent is as follows:     1. A method for performing speaker recognition in accordance with a computer system, said computer system including a storage unit for storing a plurality of codebooks each of which corresponds to one of a plurality of speakers, said method comprising:
  • allocating said plurality of speakers into groups of speakers;
  • performing a text-independent, fast-match speaker recognition process in accordance with steps that include:
    • (a) comparing an input speech signal with a portion of said plurality of codebooks stored in said storage unit, wherein step (a) includes:
      • (1) splitting said input speech signal into a plurality of frames,
      • (2) for each frame:
        • (i) deriving at least one feature vector from said input speech signal,
        • (ii) comparing the at least one feature vector in each frame with codewords in said portion of said plurality of codebooks, the codewords in each codebook being formed from feature vectors derived from test data previously input for a corresponding one of said speakers,
      • (3) counting frames which correspond to each of said plurality of codebooks, and
      • (4) providing an indication of how closely each of said codebooks match said input speech signal based on said frame counting step, and
    • (b) identifying a predetermined number of said portion of said plurality of codebooks which best match said input speech signal; and
  • performing a detailed-match speaker recognition process in accordance with steps that include:
    • (c) identifying groups into which the speakers identified in step (b) belong;
    • (d) comparing said input speech signal only with models corresponding to speakers in said groups identified in step (c); and
    • (e) identifying the speaker of said input speech signal based on an outcome of comparing step (d).


Background / Summary: Show background / summary

Drawing Descriptions: Show drawing descriptions

Description: Show description

Forward References: Show 28 U.S. patent(s) that reference this one

       
U.S. References: Go to Result Set: All U.S. references   |  Forward references (28)   |   Backward references (27)   |   Citation Link

Buy
PDF
Patent  Pub.Date  Inventor Assignee   Title
Get PDF - 14pp US3673331  1972-06 Hair et al.  Texas Instruments Incorporated IDENTITY VERIFICATION BY VOICE SIGNALS IN THE FREQUENCY DOMAIN
Get PDF - 22pp US4363102  1982-12 Holmgren et al.  Bell Telephone Laboratories, Incorporated Speaker identification system using word recognition templates
Get PDF - 10pp US4716593  1987-12 Hirai et al.  Tokyo Shibaura Denki Kabushiki Kaisha Identity verification system
Get PDF - 42pp US4720863  1988-01 Li et al.  ITT Defense Communications Method and apparatus for text-independent speaker recognition
Get PDF - 9pp US4827518  1989-05 Feustel et al.  Bell Communications Research, Inc. Speaker verification system using integrated circuit cards
Get PDF - 7pp US4947436  1990-08 Greaves et al.  British Telecommunications public limited company Speaker verification using memory address
Get PDF - 26pp US5073939  1991-12 Vensko et al.  ITT Corporation Dynamic time warping (DTW) apparatus for use in speech recognition systems
Get PDF - 59pp US5121428  1992-06 Uchiyama et al.  Ricoh Company, Ltd. Speaker verification system
Get PDF - 11pp US5167004  1992-11 Netsch et al.  Texas Instruments Incorporated Temporal decorrelation method for robust speaker verification
Get PDF - 19pp US5189727  1993-02 Guerreri  Electronic Warfare Associates, Inc. Method and apparatus for language and speaker recognition
Get PDF - 7pp US5216720  1993-06 Naik et al.  Texas Instruments Incorporated Voice verification circuit for validating the identity of telephone calling card customers
Get PDF - 21pp US5241649  1993-08 Niyada  Matsushita Electric Industrial Co., Ltd. Voice recognition method
Get PDF - 14pp US5271088  1993-12 Bahler  ITT Corporation Automated sorting of voice messages through speaker spotting
Get PDF - 10pp US5274695  1993-12 Green  U.S. Sprint Communications Company Limited Partnership System for verifying the identity of a caller in a telecommunications network
Get PDF - 12pp US5339385  1994-08 Higgins  ITT Corporation Speaker verifier using nearest-neighbor distance measure
Get PDF - 59pp US5347595  1994-09 Bokser  Palantir Corporation (Calera Recognition Systems) Preprocessing means for use in a pattern classification system
Get PDF - 6pp US5384833  1995-01 Cameron  British Telecommunications public limited company Voice-operated service
Get PDF - 12pp US5412738  1995-05 Brunelli et al.  Istituto Trentino Di Cultura Recognition system, particularly for recognising people
Get PDF - 10pp US5414755  1995-05 Bahler et al.  ITT Corporation System and method for passive voice verification in a telephone network
Get PDF - 13pp US5522012  1996-05 Mammone et al.  Rutgers University Speaker identification and verification system
Get PDF - 30pp US5537488  1996-07 Menon et al.  Massachusetts Institute of Technology Pattern recognition system with statistical classification
Get PDF - 23pp US5608840  1997-03 Tsuboka  Matsushita Electric Industrial Co., Ltd. Method and apparatus for pattern recognition employing the hidden markov model
Get PDF - 10pp US5666466  1997-09 Lin et al.  Rutgers, The State University of New Jersey Method and apparatus for speaker recognition using selected spectral information
Get PDF - 28pp US5675704  1997-10 Juang et al.  Lucent Technologies Inc. Speaker verification with cohort normalized scoring
Get PDF - 18pp US5682464  1997-10 Sejnoha  Kurzweil Applied Intelligence, Inc. Word model candidate preselection for speech recognition using precomputed matrix of thresholded distance values
Get PDF - 15pp US5689616  1997-11 Li  ITT Corporation Automatic language identification/verification system
Get PDF - 8pp US5895447  1999-04 Ittycheriah et al.  International Business Machines Corporation Speech recognition using thresholded speaker class model selection or model adaptation
       
Foreign References:
Buy
PDF
Publication Date IPC Code Assignee   Title
  JP59111699 1984-06       
  JP61002599 1986-01       
  JP04015700 1992-01       


Other Abstract Info: DERABS G1999-266603

Other References:
  • T. Matsui et al.; "A Study of Model and a Priori Threshold Updating in Speaker Verification"; Technical Report of the Institute of Electronics, Information & Communications Engineers; SP95-120(1996-01); pp 21-26.
  • Parsons "Voice and Speech Processing" 1987, McGraw-Hill, pp. 332-336.
  • Bahl et al "A fast approximate acoustic match for large vocabulary speech recognition" IEEE Transactions, Jan. 1993, pp. 59-67.
  • Rudasi, Text-independent talker identification using recurrent neural networks: J Acoust Soc Am Supp 1 v 87, pg s104, 1990.
  • "Merriam-Webster collegiate dictionary" pp. 211 and 550, 1993.
  • Rabiner "Digital processing of speech signals" p. 478, 1978.
  • Parsons "Voice and Speech Processing" 1987, McGraw-Hill, p. 175.
  • Yu et al "Speakerrecognition using hidden Markov models, dynamic time warping and vector quantisation", Oct. 1995, IEEE, 313-318. (6 pages) [ISI abstract]
  • Rosenberg, Lee, and Soone, Sub-Word Unit Talker Verification Using Hidden Markov Models, 1990, AT&T Bell Laboratories, pp 269-272.
  • Herbert Gish, Robust Discrimination in Automatic Speaker Identification, BBN Systems and Technologies Corporation, pp 289-292.
  • Naik, Netsch and Doddington, Speaker Verification Over Long Distance Telephone Lines, Texas Instruments Inc., pp 524-527.


  • Inquire Regarding Licensing

    Powered by Verity


    Plaques from Patent Awards      Gallery of Obscure PatentsNominate this for the Gallery...

    Thomson Reuters Copyright © 1997-2014 Thomson Reuters 
    Subscriptions  |  Web Seminars  |  Privacy  |  Terms & Conditions  |  Site Map  |  Contact Us  |  Help