Jim's
Tutorials

Spring 2012
course
navigation

2012-04-11

Maximum Entropy (Maxent) Models

http://spark-public.s3.amazonaws.com/nlp/slides/Maximum_Entropy_Classifiers_v2.pdf
Training Set Test Set Objective Accuracy Objective Accuracy Joint Like. 86.8 Joint Like. 73.6 Cond. Like. 98.5 Cond. Like. 76.1
What do these tables mean? All features, smoothing, ... unchanged, conditional probabilities are more accurate – higher performance.

Only ACTIVE features matter for a decision about a data point. BUSINESS: Stocks hit a yearly low … Label: BUSINESS Features {…, stocks, hit, a, yearly, low, …}
Building Maxent:

Info. Extraction/Named Entity Recognition.

http://spark-public.s3.amazonaws.com/nlp/slides/Information_Extraction_and_Named_Entity_Recognition_v2.pdf

Assignment

Submitted to Coursera.
I would like to keep playing with this. I probably would adapt it a little because it's currently written to run over their submission server – I just need it to run locally.

Updated Code from Last Time

compare.py
http://cs.marlboro.edu/ courses/ spring2012/jims_tutorials/ elias/ 2012-04-11
last modified Wednesday April 11 2012 12:30 am EDT