Work Files Saved Searches
   My Account                                                  Search:   Quick/Number   Boolean   Advanced       Help   

 The Delphion Integrated View

  Buy Now:   Buy PDF- 14pp  PDF  |   File History  |   Other choices   
  Tools:  Citation Link  |  Add to Work File:    
  View:  Expand Details   |  INPADOC   |  Jump to: 
 Email this to a friend  Email this to a friend 
Title: US6345252: Methods and apparatus for retrieving audio information using content and speaker information
[ Derwent Title ]
>> View Certificate of Correction for this publication

Country: US United States of America

View Images High


14 pages

Inventor: Beigi, Homayoon Sadr Mohammad; Yorktown Heights, NY
Tritschler, Alain Charles Louis; New York, NY
Viswanathan, Mahesh; Yorktown Heights, NY

Assignee: International Business Machines Corporation, Armonk, NY
other patents from INTERNATIONAL BUSINESS MACHINES CORPORATION (280070) (approx. 44,393)
 News, Profiles, Stocks and More about this company

Published / Filed: 2002-02-05 / 1999-04-09

Application Number: US1999000288724

IPC Code: Advanced: G06F 3/16; G06F 17/30; G10L 15/00; G10L 15/08; G10L 15/10; G10L 15/26; G10L 15/28; G10L 17/00;
IPC-7: G10L 15/22;

ECLA Code: G10L17/00U; G06F17/30U1T; G10L15/26A;

U.S. Class: Current: 704/272; 704/251; 704/275; 704/500; 704/E15.045; 704/E17.003; 707/E17.101;
Original: 704/272; 704/275; 704/500; 704/251;

Field of Search: 704/231,250,238,236,251,255,260,200,270,272,275

Priority Number:
1999-04-09  US1999000288724

Abstract:     Methods and apparatus are provided for retrieving audio information based on the audio content as well as the identity of the speaker. The results of content and speaker-based audio information retrieval methods are combined to provide references to audio information (and indirectly to video). A query search system retrieves information responsive to a textual query containing a text string (one or more key words), and the identity of a given speaker. An indexing system transcribes and indexes the audio information to create time-stamped content index file(s) and speaker index file(s). An audio retrieval system uses the generated content and speaker indexes to perform query-document matching based on the audio content and the speaker identity. Documents satisfying the user-specified content and speaker constraints are identified by comparing the start and end times of the document segments in both the content and speaker domains. Documents satisfying the user-specified content and speaker constraints are assigned a combined score that can be used in accordance with the present invention to rank-order the identified documents returned to the user, with the best-matched segments at the top of the list.

Attorney, Agent or Firm: Ryan, Mason & Lewis, LLP ; Otterstedt, Esq., Paul J. ;

Primary / Asst. Examiners: Dorvil, Richemond;

Maintenance Status: CC Certificate of Correction issued
View Certificate of Correction

INPADOC Legal Status: Show legal status actions          Buy Now: Family Legal Status Report


Family: Show 15 known family members

First Claim:
Show all 33 claims
What is claimed is:     1. A method for retrieving audio information from one or more audio sources, said method comprising the steps of:
  • receiving a user query specifying at least one content and one speaker constraint; and
  • comparing said user query with a content index and a speaker index of said audio source to identify audio information satisfying said user query.

Background / Summary: Show background / summary

Drawing Descriptions: Show drawing descriptions

Description: Show description

Forward References: Show 58 U.S. patent(s) that reference this one

U.S. References: Go to Result Set: All U.S. references   |  Forward references (58)   |   Backward references (1)   |   Citation Link

Patent  Pub.Date  Inventor Assignee   Title
Get PDF - 21pp US6185527  2001-02 Petkovic et al.  International Business Machines Corporation System and method for automatic audio content analysis for word spotting, indexing, classification and retrieval
Foreign References: None

Other Abstract Info: DERABS G2001-149202

Other References:
  • Proceedings of the Speech Recognition Worshop. C. Neti et al., "Audio Visual Speaker Recognition for video Broadcast News" 1999.*
  • ICASSP-97. 1997 IEEE International Conference on Acoustics, Speech and Signal Processing. Roy et al., Speaker Identification based Text to Audio Alignment for audio Retrieval System, Apr. 1997.*
  • ICIP 98. Proceedings. Iternational Conference on Image Processing, 1998, Tsekeridou et al. "Speaker dependent videi indexing based on audio-visual interaction". Pp. 358-362 vol. 1. Oct. 1998.*
  • 1996 IEEE Multimedia. Wold et al. "Content based classification, search, and retrieval of audio" pp. 27-36. Fall 1996.* (10 pages) Cited by 27 patents [ISI abstract]
  • S. Dharanipragada et al., "Experimental Results in Audio Indexing," Proc. ARPA SLT Workshop, (Feb. 1996).
  • L. Polymenakos et al., "Transcription of Braodcast News--Some Recent Inprovements to IBM's LVCSR System," Proc. ARPA SLT Workshop, (Feb. 1996).
  • R. Bakis, "Transcription of Broadcast News Shows with the IBM Large Vocabulary Speech Recognition System," Proc. ICASSP98, Seattle, WA (1998).
  • H. Beigi et al., "A Distance Measure Between Collections of Distributions and its Application to Speaker Recognition," Proc. ICASSP98, Seattle, WA (1998).
  • S. Chen, "Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion," Proceedings of the Speech Recognition Workshop (1998).
  • S. Chen et al., "Clustering via the Bayesian Information Criterion with Applications in Speech Recognition," Proc. ICASSP98, Seattle, WA (1998).
  • S. Chen et al., "IBM's LVCSR System for Transcription of Broadcast News Used in the 1997 Hub4 English Evaluation," Proceedings of the Speech Recognition Workshop (1998).
  • S. Dharanipragada et al., "A Fast Vocabulary Independent Algorithm for Spotting Words in Speech," Proc. ICASSP98, Seattle, WA (1998).
  • J. Navratil et al., "An Efficient Phonotactic-Acoustic system for Language Identification," Proc. ICASSP98, Seattle, WA (1998).
  • G. N. Ramaswamy et al., "Compression of Acoustic Features for Speech Recognition in Network Environments," Proc. ICASSP98, Seattle, WA (1998).
  • S. Chen et al., "Recent Improvements to IBM's Speech Recognition System for Automatic Transcription of Broadcast News," Proceedings of the Speech Recognition Workshop (1999).
  • S. Dharanipragada et al., "Story Segmentation and Topic Detection in the Broadcast News Domain," Proceedings of the Speech Recognition Workshop (1999).
  • C. Neti et al., "Audio-Visual Speaker Recognition for Video Broadcast News," Proceedings of the Speech Recognition Workshop (1999).

  • Inquire Regarding Licensing

    Powered by Verity

    Plaques from Patent Awards      Gallery of Obscure PatentsNominate this for the Gallery...

    Thomson Reuters Copyright © 1997-2014 Thomson Reuters 
    Subscriptions  |  Web Seminars  |  Privacy  |  Terms & Conditions  |  Site Map  |  Contact Us  |  Help