This is an old revision of the document!
Table of Contents
Wishlist
Parsing
- Keith Hall: k-best Spanning Tree Parsing ACL 2007
- Introduction to MALT Parser (one of the many papers by Joakim Nivre) + one advance technique, e.g. An Improved Oracle for Dependency Parsing with Online Reordering
- Koo et al.: Dual Decomposition for Parsing with Non-Projective Head Automata EMNLP 2010.
- Eugene Charniak: A maximum-entropy-inspired parser (Zdeněk Žabokrtský)
Machine Learning
- Andrew McCallum, Dayne Freitag, Fernando Pereira: Maximum Entropy Markov Models for Information Extraction and Segmentation, Conference on Machine Learning 2000, slides
- John Lafferty, Andrew McCallum, Fernando Pereira: Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data, 2001
- Yoav Goldberg, Michael Elhadad: splitSVM: Fast, Space-Efficient, non-Heuristic, Polynomial Kernel Computation for NLP Applications ACL 2008
- Ryan McDonald, Keith Hall, Gideon Mann: Distributed Training Strategies for the Structured Perceptron
Machine Translation
- Ann Clifton, Anoop Sarkar: Combining Morpheme-based Machine Translation with Post-processing Morpheme Prediction ACL 2011
- Taro Watanabe, Eiichiro Sumit: Machine Translation System Combination by Confusion Forest ACL 2011
- Nan Duan, Mu Li, Ming Zhou: Hypothesis Mixture Decoding for Statistical Machine Translation ACL 2011
- Abhishek Arun, Chris Dyer, Barry Haddow, Phil Blunsom, Adam Lopez and Philipp Koehn: Monte Carlo Inference and Maximization for Phrase-based Translation. Conference on Computational Natural Language Learning, 2009.
- Phil Blunsom, Trevor Cohn, Chris Dyer and Miles Osborne: A Gibbs Sampler for Phrasal Synchronous Grammar Induction. ACL-IJCNLP 2009
- Trevor Cohn and Phil Blunsom: A Bayesian Model of Syntax-Directed Tree to String Grammar Induction. EMNLP 2009.
- Chi-kiu LO and Dekai WU: MEANT: An inexpensive, high-accuracy, semi-automatic metric for evaluating translation utility via semantic frames. ACL HLT 2011 (or other MEANT or HMEANT paper, but this one seems to be THE main one)
Other
Jakob Uszkoreit, Thorsten Brants: Distributed Word Clustering for Large Scale Class-Based Language Modeling in Machine Translation ACL 2008- Helmut Schmid, Florian Laws: Estimation of Conditional Probabilities With Decision Trees and an Application to Fine-Grained POS Tagging Coling 2008
- Mark Johnson: Why Doesn't EM Find Good HMM POS-Taggers? (Ondřej Bojar)