Title: US6434520: System and method for indexing and querying audio archives

Country: US United States of America

14 pages

Inventor: Kanevsky, Dimitri; Ossining, NY
Maes, Stephane H.; Danbury, CT

Assignee: International Business Machines Corporation, Armonk, NY

Published / Filed: 2002-08-13 / 1999-04-16

Application Number: US1999000294214

IPC Code: Advanced: G10L 15/26; G10L 17/00;
IPC-7: G10L 15/06;

ECLA Code: G06F17/30U1T; G10L15/26A; G10L17/00U;

U.S. Class: 704/243; 704/246; 704/245; 704/251;

Field of Search: 704/245,246,255,231,270-275,247,243,233,251,257,250 379/067

Priority Number:
1999-04-16  US1999000294214

Abstract:     A system and method for indexing segments of audio/multimedia files and data streams for storage in a database according to audio information such as speaker identity, the background environment and channel (music, street noise, car noise, telephone, studio noise, speech plus music, speech plus noise, speech over speech), and/or the transcription of the spoken utterances. The content or topic of the transcribed text can also be determined using natural language understanding to index based on the context of the transcription. A user can then retrieve desired segments of the audio file from the database by generating a query having one or more desired parameters based on the indexed information.
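
The retrieval step described in the abstract can be illustrated with a minimal sketch. It assumes each indexed segment is a flat record and that a query is a set of optional parameters that must all match. The names Segment, query, and INDEX, and the sample data, are hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """One indexed segment of an audio file (hypothetical record layout)."""
    start: float        # seconds from the start of the file
    end: float
    speaker: str        # speaker identity tag
    environment: str    # background/channel label, e.g. "telephone"
    transcript: str     # transcription of the spoken utterance
    topic: str = ""     # topic label, assumed to come from an NLU step


def query(index, speaker=None, environment=None, topic=None, keyword=None):
    """Return the segments that match every parameter that is not None."""
    results = []
    for seg in index:
        if speaker is not None and seg.speaker != speaker:
            continue
        if environment is not None and seg.environment != environment:
            continue
        if topic is not None and seg.topic != topic:
            continue
        if keyword is not None and keyword.lower() not in seg.transcript.lower():
            continue
        results.append(seg)
    return results


# Invented sample index, for illustration only.
INDEX = [
    Segment(0.0, 12.5, "alice", "studio noise", "Welcome to the evening news", "news"),
    Segment(12.5, 30.0, "bob", "telephone", "The market closed higher today", "finance"),
    Segment(30.0, 41.0, "alice", "speech plus music", "Stay tuned for the weather", "weather"),
]

if __name__ == "__main__":
    for seg in query(INDEX, speaker="alice"):
        print(seg.start, seg.end, seg.transcript)
```

A linear scan keeps the sketch short. A real database would more plausibly keep one lookup structure per indexed attribute.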

Attorney, Agent or Firm: F. Chau & Associates, LLP

Primary / Asst. Examiners: Chawan, Vijay B

Family: None

First Claim (of 34 claims):
What is claimed is:
1. A method for processing an audio data file, comprising the steps of:
  • segmenting the audio data file into segments based on detected speaker changes;
  • performing speaker identification for each segment and assigning at least one speaker identification tag to each segment based on an identified speaker;
  • verifying the identity of the speaker associated with the at least one identification tag for each segment; and
  • indexing the segments of the audio data file for storage in a database in accordance with the identification tags of verified speakers.
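
The four steps of claim 1 can be sketched as a small pipeline. This is a toy illustration under loud assumptions: audio is reduced to one numeric feature per frame, a speaker change is a large jump in that feature, identification picks the nearest enrolled model, and verification is a distance threshold. None of these techniques, names, or values come from the patent. Skipping segments that fail verification is also an assumption, since the claim only says that indexing follows the tags of verified speakers.

```python
from collections import defaultdict

# Hypothetical input: (time_in_seconds, voice_feature) per frame.
FRAMES = [(0, 1.0), (1, 1.1), (2, 0.9), (3, 5.0), (4, 5.2), (5, 9.0), (6, 9.1)]
# Hypothetical enrolled speaker models: one reference feature per speaker.
MODELS = {"alice": 1.0, "bob": 5.0}


def segment_on_speaker_change(frames, jump=2.0):
    """Step 1: start a new segment when the feature jumps by more than `jump`."""
    segments = [[frames[0]]]
    for prev, cur in zip(frames, frames[1:]):
        if abs(cur[1] - prev[1]) > jump:
            segments.append([])
        segments[-1].append(cur)
    return segments


def mean_feature(segment):
    return sum(feature for _, feature in segment) / len(segment)


def identify(segment, models):
    """Step 2: tag the segment with the closest enrolled speaker."""
    m = mean_feature(segment)
    return min(models, key=lambda name: abs(models[name] - m))


def verify(segment, speaker, models, tolerance=0.5):
    """Step 3: accept the tag only if the segment is close to that model."""
    return abs(models[speaker] - mean_feature(segment)) <= tolerance


def build_index(frames, models):
    """Step 4: index (start, end) times under the tags of verified speakers."""
    index = defaultdict(list)
    for seg in segment_on_speaker_change(frames):
        tag = identify(seg, models)
        if verify(seg, tag, models):
            index[tag].append((seg[0][0], seg[-1][0]))
        # Assumption: segments that fail verification are left unindexed.
    return dict(index)


if __name__ == "__main__":
    print(build_index(FRAMES, MODELS))
```

With the sample data, the last segment is nearest to the "bob" model but fails the verification threshold, so it does not appear in the index.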

Forward References: 105 U.S. patents reference this one

U.S. References (16 backward references, 105 forward references):

Patent | Pub. Date | Inventor | Assignee | Title | Pages
US3936805 | 1976-02 | Bringol et al. | International Business Machines Corporation | Dictation system for storing and retrieving audio information | 19 pp.
US5465290 | 1995-11 | Hampton et al. | Litle & Co. | Confirming identity of telephone caller | 25 pp.
US5550966 | 1996-08 | Drake et al. | International Business Machines Corporation | Automated presentation capture, storage and playback system | 9 pp.
US5598507 | 1997-01 | Kimber et al. | Xerox Corporation | Method of speaker clustering for unknown speakers in conversational audio data | 16 pp.
US5606643 | 1997-02 | Balasubramanian et al. | Xerox Corporation | Real-time audio recording system for automatic speaker indexing | 18 pp.
US5649060 | 1997-07 | Ellozy et al. | International Business Machines Corporation | Automatic indexing and aligning of audio and text using speech recognition | 16 pp.
US5655058 | 1997-08 | Balasubramanian et al. | Xerox Corporation | Segmentation of audio data for indexing of conversational speech for real-time or postprocessing applications | 21 pp.
US5659662 | 1997-08 | Wilcox et al. | Xerox Corporation | Unsupervised speaker clustering for automatic speaker indexing of recorded audio data | 16 pp.
US5737532 | 1998-04 | Delair et al. | Hughes Missile Systems Company | System and technique for accessing stored audio and visual information from a database | 8 pp.
US5774841 | 1998-06 | Salazar et al. | The United States of America as represented by the Administrator of the National Aeronautics and Space Administration | Real-time reconfigurable adaptive speech recognition command and control apparatus and method | 25 pp.
US5897616 | 1999-04 | Kanevsky et al. | International Business Machines Corporation | Apparatus and methods for speaker verification/identification/classification employing non-acoustic and/or acoustic models and databases | 15 pp.
US5918223 | 1999-06 | Blum et al. | Muscle Fish | Method and article of manufacture for content-based analysis, storage, retrieval, and segmentation of audio information | 38 pp.
US5937383 | 1999-08 | Ittycheriah et al. | International Business Machines Corporation | Apparatus and methods for speech recognition including individual or speaker class dependent decoding history caches for fast word acceptance or rejection | 6 pp.
US5960399 | 1999-09 | Barclay et al. | GTE Internetworking Incorporated | Client/server speech processor/recognizer | 14 pp.
US6161090 | 2000-12 | Kanevsky et al. | International Business Machines Corporation | Apparatus and methods for speaker verification/identification/classification employing non-acoustic and/or acoustic models and databases | 15 pp.
US6185527 | 2001-02 | Petkovic et al. | International Business Machines Corporation | System and method for automatic audio content analysis for word spotting, indexing, classification and retrieval | 21 pp.

Foreign References:
Publication | Date | IPC Code | Assignee | Title | Pages
EP0507743 | 1992-10 | G06F 17/30 | STENOGRAPH CORP | Information storage and retrieval systems | 15 pp.

Other References:
  • Wilcox et al., "HMM-Based Wordspotting for Voice Editing and Indexing", 2nd European Conference on Speech Communication and Technology, Genova, Italy, Sep. 24-26, 1991, pp. 25-28.*
  • "Automatic Content-Based Retrieval of Broadcast News", ACM Multimedia 95--Electronic Proceedings, Nov. 5-9, 1995, San Francisco, California.
  • Sugiyama, et al., "Speech Segmentation and Clustering Based on Speaker Features", 1993 IEEE, pp. II-395-II-398.
  • Wilcox, et al., "Segmentation of Speech Using Speaker Identification", 1994 IEEE, pp. I-161-I-164.
  • Cohen, et al., "Data Retrieval through a Compact Disk Drive having a Speech-Driven Interface", IBM Technical Disclosure Bulletin, vol. 38, No. 01, Jan. 1995.
