 |
 |
|
|
|
|
Title: |
US6092034:
Statistical translation system and method for fast sense disambiguation and translation of large corpora using fertility models and sense models
[ Derwent Title ]

|
Country: |
US United States of America

|
| |
Inventor: |
McCarley, Jeffrey Scott; Bedford Hills, NY
Roukos, Salim; Scarsdale, NY

|
Assignee: |
International Business Machines Corporation, Armonk, NY
other patents from INTERNATIONAL BUSINESS MACHINES CORPORATION (280070) (approx. 44,393)
News, Profiles, Stocks and More about this company

|
Published / Filed: |
2000-07-18
/ 1998-07-27

|
Application Number: |
US1998000123166

|
IPC Code: |
Advanced:
G06F 17/27;
G06F 17/28;
Core:
more...
IPC-7:
G06F 17/28;

|
ECLA Code: |
G06F17/27T;

|
U.S. Class: |
Current:
704/002;
704/004;
704/009;
704/277;
Original:
704/002;
704/004;
704/009;
704/277;

|
Field of Search: |
704/009,2,3,4,7,1,8,10,277,251,255,257
711/001
707/530,531,532

|
Government Interest: |
The U.S. Government has a paid-up license in this invention and the right in limited circumstances to require the patent owner to license others on reasonable terms as provided for by the terms of Grant No. 70NANB5H1174 awarded by the National Institute of Standards and Technology (NIST).

|
Priority Number: |
| 1998-07-27 |
US1998000123166 |

|
Abstract: |
A system and method for translating a series of source words in a first language to a series of target words in a second language is provided. The system includes an input device for inputting the series of source words. A fertility hypothesis generator operatively coupled to the input device generates at least one fertility hypotheses for a fertility of a source word, based on the source word and a context of the source word. A sense hypothesis generator operatively coupled to the input device generates sense hypotheses for a translation of the source word, based on the source word and the context of the source word. A fertility model operatively coupled to the fertility hypothesis generator determines a probability of the fertility of the source word, based on the source word and the context of the source word. A sense model operatively coupled to the sense hypothesis generator determines a probability of a target word being a correct translation of the source word, based on the source word and the context of the source word. A decoder operatively coupled to the fertility and sense models for generating a list of target words for the translation of the source word, based on the probability calculated by the fertility model and the probability calculated by the sense model.

|
Attorney, Agent or Firm: |
F. Chau & Associates, LLP ;

|
Primary / Asst. Examiners: |
Thomas, Joseph;

|
Maintenance Status: |
E1 Expired Check current status

|
INPADOC Legal Status: |
Show legal status actions

|
Family: |
None

|
First Claim:
Show all 37 claims |
What is claimed is:
1. A system for translating a series of source words in a first language to a series of target words in a second language, comprising:
- input means for inputting the series of source words;
- a fertility hypothesis generator operatively coupled to said input means for generating at least one fertility hypotheses for a fertility of a source word, based on the source word and a context of the source word;
- a sense hypothesis generator operatively coupled to said input means for generating sense hypotheses for a translation of the source word, based on the source word and the context of the source word;
- a fertility model operatively coupled to said fertility hypothesis generator for determining a probability of the fertility of the source word, based on the source word and the context of the source word;
- a sense model operatively coupled to said sense hypothesis generator for determining a probability of a target word being a correct translation of the source word, based on the source word and the context of the source word; and
- a decoder operatively coupled to said fertility and sense models for generating a list of target words for the translation of the source word, based on the probability calculated by said fertility model and the probability calculated by said sense model.

|
Background / Summary: |
Show background / summary

|
Drawing Descriptions: |
Show drawing descriptions

|
Description: |
Show description

|
Forward References: |
Show 23 U.S. patent(s) that reference this one

|
 |
 |
|
|
|
|
Foreign References: |

|
Other Abstract Info: |
DERABS G2000-637168
DERABS G2000-637168

|
Other References: |
Brown et al., "The Mathematics of Statistical Machine Translation: Parameter Estimation", Computational Linguistics, vol. 19, No. 2, pp. 263-311, 1993.
Brown et al., "Aligning Sentences in Parallel Corpora", Proceedings, 29th Annual Meeting of the Association for Computational Linguistics, pp. 169-176, Jun. 1991.
Bahl et al., "A Maximum Likelihood Approach to Continuous Speech Recognition", IEEE Transaction on Pattern Analysis and Machine Intelligence, vol. PAMI-5, No. 2, pp. 179-187, Mar. 1983.
Berger et al., "A Maximum Entropy Approach to Natural Language Processing", Computational Linguistics, vol. 22, No. 1, pp. 39-71, 1996.
Della Pietra et al., "Inducing Features of Random Fields", IEEE Transactions Pattern Analysis and Machine Intelligence, vol. 19, No. 4, Apr. 1997.
Hackett et al., "Document Translation for Cross-Language Text Retrieval at the University of Maryland", The 6th Text REtrieval Conference (TREC-6), ed. by E.M. Voorhees and D.K. Harman.

|


|
Nominate this for the Gallery...

|
|