Data Acquisition

From MT Talks
Revision as of 16:01, 23 February 2015 by Tamchyna (talk | contribs)
Jump to navigation Jump to search

There seems to be a universal rule for (not only) statistical methods in NLP: More data is better data.

Translation systems have at their disposal (order of magnitude) more data than a person reads in a lifetime[1].

References

  1. Phillip Koehn. [Inaugural lecture. https://www.youtube.com/watch?v=6UVgFjJeFGY]