Data Acquisition
There seems to be a universal rule for (not only) statistical methods in NLP: More data is better data.
Translation systems have at their disposal (order of magnitude) more data than a person reads in a lifetime[1].
References
- ↑ Phillip Koehn. [Inaugural lecture. https://www.youtube.com/watch?v=6UVgFjJeFGY]