Scoring and Optimization
Revision as of 14:53, 24 August 2015
Lecture video:
{{#ev:youtube|https://www.youtube.com/watch?v=rDkZOINdPhw&index=11&list=PLpiLOsNLsfmbeH-b865BwfH15W0sat02V%7C800%7Ccenter}}
Features of MT Models
Phrase Translation Probabilities
Lexical Weights
Lexical weights are a method for smoothing the phrase table. Infrequent phrases have unreliable probability estimates; for instance, many long phrases occur together only once in the corpus, resulting in <math>P(\mathbf{e}|\mathbf{f}) = P(\mathbf{f}|\mathbf{e}) = 1</math>. Several methods exist for computing lexical weights. The most common one is based on word alignment inside the phrase. The probability of each ''foreign'' word <math>f_j</math> is estimated as the average of lexical translation probabilities <math>w(f_j, e_i)</math> over the English words aligned to it. Thus for the phrase <math>(\mathbf{e},\mathbf{f})</math> with the set of alignment points <math>a</math>, the lexical weight is:

<math>\mathrm{lex}(\mathbf{f}|\mathbf{e},a) = \prod_{j=1}^{|\mathbf{f}|} \frac{1}{|\{i : (i,j) \in a\}|} \sum_{(i,j) \in a} w(f_j, e_i)</math>
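The averaging-then-multiplying computation above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the function name, the alignment representation (a list of English-index/foreign-index pairs), and the dictionary <code>w</code> of lexical translation probabilities are all assumptions made for the example, and it assumes every foreign word has at least one alignment link (the standard formulation handles unaligned words by linking them to a special NULL word).

```python
from collections import defaultdict

def lexical_weight(f_words, e_words, alignment, w):
    """Lexical weight lex(f|e, a) of a phrase pair.

    For each foreign word f_j, average the lexical translation
    probabilities w(f_j, e_i) over the English words aligned to it,
    then take the product over all foreign positions.

    alignment -- list of (i, j) pairs: English index i, foreign index j
    w         -- dict mapping (foreign_word, english_word) -> probability
    """
    # Group English positions by the foreign position they align to.
    links = defaultdict(list)
    for i, j in alignment:
        links[j].append(i)

    weight = 1.0
    for j, f in enumerate(f_words):
        aligned = links[j]  # assumed non-empty; NULL alignment not modeled
        weight *= sum(w[(f, e_words[i])] for i in aligned) / len(aligned)
    return weight
```

For example, with <code>w = {("ne", "not"): 0.6, ("ne", "do"): 0.2}</code>, a single foreign word aligned to both English words gets the average <code>(0.6 + 0.2) / 2 = 0.4</code>, so <code>lexical_weight(["ne"], ["not", "do"], [(0, 0), (1, 0)], w)</code> returns 0.4.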