Jim's
Tutorials

Spring 2012
course
navigation

2012-02-08

The Rosetta Stone

Challenges

1. Lexical ambiguity
2. Differing word orders
3. Unpreserved syntax
4. Syntactic ambiguity
5. Idiosyncracies

Classical Machine Translation: Dictionaries

Translate 'much' or 'many' into Russian if preceding word is how return skol’ko else if preceding word is as return stol’ko zhe else if word is much if preceding word is very return nil else if following word is a noun return mnogo else (word is many) if preceding word is a preposition and next word is a noun return mnogii else return mnogo

Classical Machine Translation: Transfer-based approach

Source

Jim says

I suggest that to really understand this stuff - and explain it to me - you try to code (likely in python) the simplest toy model you can invent to implement this (model1 -> model2 -> ...) chain of counting/probability calculations, as described in the "Knight: Statistical MT Workbook". (Note that there's an FAQ related document which looks like its actually a correction.)
http://cs.marlboro.edu/ courses/ spring2012/jims_tutorials/ elias/ 2012-02-08
last modified Wednesday February 8 2012 11:01 am EST