Phrase-based Model: Difference between revisions

Revision as of 14:38, 7 April 2015

Phrase-based machine translation (PBMT) is probably the most widely used approach to MT today. It is relatively simple and easy to adapt to new languages.

Phrase Extraction

PBMT uses phrases as the basic unit of translation. Phrases are simply sequences of words which have been observed in the training data, they don't correspond to any linguistic notion of phrases.

In order to obtain a phrase table (a probabilistic dictionary of phrases), we need word-aligned parallel data. A heuristic is used to

Revision as of 14:36, 7 April 2015 (view source) Tamchyna (talk \| contribs) No edit summary ← Older edit		Revision as of 14:38, 7 April 2015 (view source) Tamchyna (talk \| contribs) No edit summary Newer edit →
Line 19:		Line 19:

	* [http://www.statmt.org/book/slides/05-phrase-based-models.pdf Philipp Koehn's slides on PBMT]		* [http://www.statmt.org/book/slides/05-phrase-based-models.pdf Philipp Koehn's slides on PBMT]
			* [http://www.statmt.org/book/slides/06-decoding.pdf Decoding in PBMT]

Phrase-based Model: Difference between revisions

Revision as of 14:38, 7 April 2015

Phrase Extraction

See Also

Navigation menu

Lecture 8: Phrase-based model

Lecture video:	web TODO Youtube

Phrase-based Model: Difference between revisions

Revision as of 14:38, 7 April 2015

Phrase Extraction

See Also

Navigation menu

Search