MT Talks

MT Talks is a series of mini-lectures on machine translation.

Our goal is to hit just the right level of detail and technicality: enough to make the talks interesting and attractive to people who are not yet familiar with the field, while mixing in new observations and insights so that even old pals will have a reason to watch us.

MT Talks and the expanded notes on this wiki will never be the ultimate resource for MT, but we would be very happy to serve as an ultimate commented directory of good pointers.

By the way, this is indeed a Wiki, so your contributions are very welcome! Please register and feel free to add comments, corrections or links to useful resources.

Our Talks

01 Intro: Why MT is difficult, approaches to MT.

02 MT that Deceives: Serious translation errors even for short and simple inputs.

03 Pre-processing: Normalization and other technical tricks bound to help your MT system.

04 MT Evaluation in General: Techniques of judging MT quality, dimensions of translation quality, number of possible translations.

05 Automatic MT Evaluation: Two common automatic MT evaluation methods, PER and BLEU.

06 Data Acquisition: The need for training data for MT, its possible sources, and the diminishing utility of new data additions due to Zipf's law.

07 Sentence Alignment: An introduction to the Gale & Church sentence alignment algorithm.

08 Word Alignment: Cutting the chicken-and-egg problem.

CodEx – Coding Exercises

Log in to CodEx and solve programming exercises that complement our talks.
Brief description of CodEx: how to get an account and submit a solution.

Acknowledgement

The work on this project has been supported by the grant FP7-ICT-2011-7-288487 (MosesCore).
