Work Files Saved Searches
   My Account                                                  Search:   Quick/Number   Boolean   Advanced       Help   


 The Delphion Integrated View

  Buy Now:   Buy PDF- 10pp  PDF  |   File History  |   Other choices   
  Tools:  Citation Link  |  Add to Work File:    
  View:  Expand Details   |  INPADOC   |  Jump to: 
 
 Email this to a friend  Email this to a friend 
       
Title: US6345253: Method and apparatus for retrieving audio information using primary and supplemental indexes
[ Derwent Title ]


Country: US United States of America

View Images High
Resolution

 Low
 Resolution

 
10 pages

 
Inventor: Viswanathan, Mahesh; Yorktown Heights, NY

Assignee: International Business Machines Corporation, Armonk, NY
other patents from INTERNATIONAL BUSINESS MACHINES CORPORATION (280070) (approx. 44,393)
 News, Profiles, Stocks and More about this company

Published / Filed: 2002-02-05 / 1999-06-18

Application Number: US1999000335627

IPC Code: Advanced: G06F 3/16; G06F 17/30; G10L 15/00; G10L 15/08; G10L 15/10; G10L 15/26; G10L 15/28; G10L 17/00;
IPC-7: G10L 15/22;

ECLA Code: G10L15/26A; G06F17/30U1T; G10L17/00U;

U.S. Class: Current: 704/272; 704/251; 704/275; 704/500; 704/E15.045; 704/E17.003; 707/E17.101;
Original: 704/272; 704/275; 704/251; 704/500;

Field of Search: 704/231,236,238,250,251,255,260,200,270,275,272

Priority Number:
1999-06-18  US1999000335627
1999-04-09  US1999000288724

Abstract:     An audio retrieval system and method are provided for augmenting the transcription of an audio file with one or more alternate word or phrase choices, such as next-best guesses for each word or phrase, in addition to the best word sequence identified by the transcription process. The audio retrieval system can utilize a primary index file containing the best identified words and/or phrases for each portion of the input audio stream and a supplemental index file containing alternative choices for each word or phrase in the transcript. The present invention allows words that are incorrectly transcribed during speech recognition to be identified in response to a textual query by searching the supplemental index files. During an indexing process, the list of alternative word or phrase choices provided by the speech recognition system are collected to produce a set of supplemental index files. During a retrieval process, the user-specified textual query is matched against the primary and supplemental indexes derived from the transcribed audio to identify relevant documents. An objective ranking function scales matches found in the supplemental index file(s) using a predefined scaling factor, or a value reflecting the confidence value of the corresponding alternative choice as identified by the speech recognition system.

Attorney, Agent or Firm: Ryan, Mason & Lewis, LLP ; Otterstedt, Esq., Paul J. ;

Primary / Asst. Examiners: Dorvil, Richemond;

INPADOC Legal Status: Show legal status actions          Buy Now: Family Legal Status Report

       
Related Applications:
Application Number Filed Patent Pub. Date  Title
US1999000288724 1999-04-09       


       
Parent Case:     This application is a continuation-in-part of U.S. patent application Ser. No. 09/288,724, filed Apr. 9, 1999, which is assigned to the assignee of the present invention and incorporated by reference herein.

Designated Country: AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE 

Family: Show 15 known family members

First Claim:
Show all 28 claims
What is claimed is:     1. A method for retrieving audio information from one or more audio sources, said method comprising the steps of:
  • receiving a user query specifying at least one key word; and
  • comparing said user query with a primary index of best choices for words in said audio source and a supplemental index of at least one alternate choice for said words in said audio source to identify audio information satisfying said user query.


Background / Summary: Show background / summary

Drawing Descriptions: Show drawing descriptions

Description: Show description

Forward References: Show 39 U.S. patent(s) that reference this one

       
U.S. References: Go to Result Set: All U.S. references   |  Forward references (39)   |   Backward references (2)   |   Citation Link

Buy
PDF
Patent  Pub.Date  Inventor Assignee   Title
Get PDF - 16pp US5659662  1997-08 Wilcox et al.  Xerox Corporation Unsupervised speaker clustering for automatic speaker indexing of recorded audio data
Get PDF - 21pp US6185527  2001-02 Petkovic et al.  International Business Machines Corporation System and method for automatic audio content analysis for word spotting, indexing, classification and retrieval
       
Foreign References: None

Other Abstract Info: DERABS G2001-149202

Other References:
  • Proceedings of the Speech Recognition Workshop. Neti et al., "Audio Visual Speaker Recognition for Video Broadcast News" 1999.*
  • ICASSP-97. 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing. Roy et al., "Speaker Identification based Text to Audio Alignment of Audi Retrieval System" Apr. 1997.*
  • S. Dharanipragada et al., "Experimental Results in Audio Indexing," Proc. ARPA SLT Workshop, (Feb. 1996).
  • L. Polymenakos et al., "Transcription of Broadcast News--Some Recent Improvements to IBM's LVCSR System," Proc. ARPA SLT Workshop, (Feb. 1996).
  • R. Bakis, "Transcription of Broadcast News Shows with the IBM Large Vocabulary Speech Recognition System," Proc. ICASSP98, Seattle, WA (1998).
  • H. Beigi et al., "A Distance Measure Between Collections of Distributions and its Application to Speaker Recognition," Proc. ICASSP98, Seattle, WA (1998).
  • S. Chen, "Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion," Proceedings of the Speech Recognition Workshop (1998).
  • S. Chen et al., "Clustering via the Bayesian Information Criterion with Applications in Speech Recognition," Proc. ICASSP98, Seattle, WA (1998).
  • S. Chen et al., "IBM's LVCSR System for Transcription of Broadcast News Used in the 1997 Hub4 English Evaluation," Proceedings of the Speech Recognition Workshop (1998).
  • S. Dharanipragada et al., "A Fast Vocabulary Independent Algorithm for Spotting Words in Speech," Proc. ICASSP98, Seattle, WA (1998).
  • J. Navratil et al., "An Efficient Phonotactic-Acoustic system for Language Identification," Proc. ICASSP98, Seattle, WA (1998).
  • G. N. Ramaswamy et al., "Compression of Acoustic Features for Speech Recognition in Network Environments," Proc. ICASSP98, Seattle, WA (1998).
  • S. Chen et al., "Recent Improvements to IBM's Speech Recognition System for Automatic Transcription of Broadcast News," Proceedings of the Speech Recognition Workshop (1999).
  • S. Dharanipragada et al., "Story Segmentation and Topic Detection in the Broadcast News Domain," Proceedings of the Speech Recognition Workshop (1999).
  • C. Neti et al., "Audio-Visual Speaker Recognition for Video Broadcast News," Proceedings of the Speech Recognition Workshop (1999).


  • Inquire Regarding Licensing

    Powered by Verity


    Plaques from Patent Awards      Gallery of Obscure PatentsNominate this for the Gallery...

    Thomson Reuters Copyright © 1997-2014 Thomson Reuters 
    Subscriptions  |  Web Seminars  |  Privacy  |  Terms & Conditions  |  Site Map  |  Contact Us  |  Help