Rich Vocabulary: Difference between revisions

{{#ev:youtube|https://www.youtube.com/watch?v=eSIbNT-yjdg|800|center}}


== Examples of Languages with a Rich Vocabulary ==
 
=== German -- compounding ===
 
While German has some degree of inflection, it is the Germans' fondness for complex compound words that causes the large-vocabulary problem for MT. Consider the following compound:
 
[[File:rindfleish-prezi.png|300px]]
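
To get a feel for why compounding inflates the vocabulary, here is a minimal sketch that counts how many distinct word types can be formed from a handful of stems. The stem list and the function names `compounds` and `vocabulary_size` are assumptions made for this illustration; they are not taken from the figure or the lecture. The sketch simply concatenates stems and ignores German linking elements, so most of its outputs are not real words. It shows only the combinatorial growth in possible types.

```python
from itertools import permutations

# Hypothetical toy lexicon of German noun stems (illustrative only).
STEMS = ["rind", "fleisch", "etikett", "gesetz", "aufgabe"]


def compounds(stems, max_parts):
    """Yield every concatenation of 1..max_parts distinct stems."""
    for n in range(1, max_parts + 1):
        for combo in permutations(stems, n):
            yield "".join(combo)


def vocabulary_size(stems, max_parts):
    """Count the distinct word types obtainable with up to max_parts stems."""
    return len(set(compounds(stems, max_parts)))


if __name__ == "__main__":
    for max_parts in (1, 2, 3):
        print(max_parts, vocabulary_size(STEMS, max_parts))
```

With five stems, allowing compounds of up to three parts already yields 85 possible types instead of 5. An MT system that treats each compound as an atomic token will see most of them rarely or never in its training data.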
 
=== Finnish -- agglutination ===
 
[[File:finnish-prezi.png|300px]]
 
=== Czech -- fusional inflection ===
 
[[File:czech-inflection-prezi.png|300px]]


== Large Vocabulary Sizes in MT Pipeline ==

Revision as of 13:16, 12 August 2015

Lecture 12: Rich Vocabulary
Lecture video: web TODO
Youtube

{{#ev:youtube|https://www.youtube.com/watch?v=eSIbNT-yjdg|800|center}}

Examples of Languages with a Rich Vocabulary

German -- compounding

While German has some degree of inflection, it is the Germans' fondness for complex compound words that causes the large-vocabulary problem for MT. Consider the following compound:

Finnish -- agglutination

Czech -- fusional inflection

Large Vocabulary Sizes in MT Pipeline

Word Alignment

Phrase Extraction

Decoding

Evaluation

Possible Solutions