This is an old revision of the document!
Reading Group
| Contact | popel at ufal.mff.cuni.cz | 
|---|---|
| Mailing list | rg at ufal.mff.cuni.cz | 
| List Archive | http://ufal.mff.cuni.cz/mailman/listinfo/rg | 
| Meetings | Mondays 15:00, in front of 422 | 
Wishlist
- Mark Johnson: Why Doesn't EM Find Good HMM POS-Taggers? (Ondřej)
- Eugene Charniak: A maximum-entropy-inspired parser (Zdeněk)
A source of inspiration: Edinburgh Reading Group
Summer 2010
| date | speaker | paper | 
|---|---|---|
| Apr 26, May 5 | Martin Popel | Ronald Rosenfeld: A Maximum Entropy Approach to Adaptive Statistical Language Modeling | 
| Apr 12 | David Mareček | Jens Nillson, Joakim Nivre, Johan Hall: Graph Transformations in Data-Driven Dependency Parsing | 
| Mar 29 | Martin Popel | Jeff Bilmes and Katrin Kirchhoff: Factored Language Models and Generalized Parallel Backoff (plus tutorial) | 
| Mar 22 | Zdeněk Žabokrtský | Deniz Yuret and Mehmet Ali Yatbaz: The Noisy Channel Model for Unsupervised Word Sense Disambiguation, Computational Linguistics 2010 | 
| Mar 15 | Ondřej Bojar | Philipp Koehn and Barry Haddow: Interactive Assistance to Human Translators using Statistical Machine Translation Methods, MT Summit XII, 2009. | 
| Mar 8 | startup meeting | 
Winter 2009/2010
| date | speaker | paper | přečíst? | 
|---|---|---|---|
| Dec 14 | Martin Popel | Wei Lu, Hwee Tou Ng, Wee Sun Lee: Natural Language Generation with Tree Conditional Random Fields | |
| Dec 7 | David Mareček | Nivre, J.: Non-Projective Dependency Parsing in Expected Linear Time. ACL 2009 | |
| Nov 23, Nov 30 | Jana Straková | John Lafferty, Andrew McCallum and Fernando Pereira: Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data, Hanna M. Wallach. Conditional Random Fields: An Introduction | |
| Nov 9 | Zdeněk Žabokrtský | Eugene Charniak and Micha Elsner: EM Works for Pronoun Anaphora Resolution. EACL 2009 | |
| Oct 26, Nov 2 | Ondřej Bojar | Kevin Knight: Bayesian Inference with Tears. September 2009. článek byl delší, ale čtivý a přesně takové jsou i Ondrovy poznámky k němu | Ano, i s úkoly | 
| Oct 19 | Eda Bejček | Josh Schroeder, Trevor Cohn, and Philipp Koehn: Word Lattices for Multi-Source Translation. EACL 2009 přehledné rozdělení metod; dobře, že udělali tolik experimentů, škoda některých nepodložených interpretací; potřebná znalost Mojžíše; omezili množství trénovacích dat – chybí test, zda více dat není účinnější než více jazyků; klesá skutečně MAX s množstvím jazyků? (to je divné, neměly by se tedy vážit výsledky jednotlivých systémů (viz věta s “little benefit”, 2.1)? A nebo není příčinou spíš než přidání šestého jazyka přidání špatného jazyka (Table 6)? Jak by dopadly testy po dvojicích?); v závěru 2.3 vynechávají reordering s odkazem na diversitu zdrojových jazyků – to nemusí platit | Ano | 
| Oct 15 | Pavel Pecina | Daniel David Walker, Eric K. Ringger: Model-based document clustering with a collapsed gibbs sampler, Fotky tabule | |
| Oct 12 | Pavel Schlesinger | Gibbsův sampling (http://en.wikipedia.org/wiki/Gibbs_sampling) | 
Summer 2009
| date | speaker | paper | 
|---|---|---|
| May 25 | David Mareček | Christian Hanig, Stefan Bordag, Uwe Quasthoff: UnsuParse: Unsupervised Parsing with unsupervised Part of Speech tagging. LREC 2009 | 
| May 11 | Václav Novák | Julien Ah-Pine, Guillaume Jacquet: Clique-Based Clustering for improving Named Entity Recognition systems. EACL 2009 | 
| May 4 | Dan Zeman | Kristina Toutanova, Hisami Suzuki, Achim Ruopp: Applying Morphology Generation Models to Machine Translation. ACL 2008, Columbus, Ohio | 
| Apr 6 | Zdeněk Žabokrtský | Pascal Denis and Jason Baldridge, Specialized models and ranking for coreference resolution | 
| Mar 30 | Pavel Schlesinger | |
| Mar 9 | Pavel Pecina | 
Winter 2008/2009
| date | speaker | paper | 
|---|---|---|
| Mon Jan 5 | Martin Popel | Thorsten Brants, Ashok C. Popat, Peng Xu, Franz J. Och, Jeffrey Dean: Large Language Models in Machine Translation, 2007 | 
| Mon Dec 1 | Jan Ptáček | Improving Statistical MT through Morphological Analysis | 
| Mon Nov 24 | Pavel Češka | A TAG-based noisy channel model of speech repairs | 
| Wed Nov 19 | Ondřej Bojar | Forest Reranking: Discriminative Parsing with Non-Local Features by Liang Huang (see Google Tech Talks), Forest-Based Translation by Haitao Mi and Liang Huang and Qun Liu | 
| Mon Nov 10 | Zdeněk Žabokrtský | Katja Filippova and Michael Strube: Sentence Fusion via Dependency Graph Compression, 2008 | 
| Mon Nov 3 | Jiří Mírovský | Alexander E. Richman and Patrick Schone: Mining Wiki Resources for Multilingual Named Entity Recognition, ACL, Columbus, 2008. | 
| Wed Oct 20 | Jan Raab | Libin Shen, Giorgio Satta, and Aravind K. Joshi: Guided Learning for Bidirectional Sequence Classification, ACL, Prague, 2007. | 
Summer 2008
| date | speaker | paper | 
|---|---|---|
| Mar 17 | Pavel Schlesinger | Aria Haghighi and Dan Klein: Unsupervised Coreference Resolution in a Nonparametric Bayesian Model, ACL, Prague, 2007. | 
| Mar 31 | Pavel Schlesinger | Unsupervised Coreference Resolution in a Nonparametric Bayesian Model, 2nd part  | 
| Apr 28 | Pavel Straňák | Diana McCarthy, Rob Koeling, Julie Weeds and John Carroll: Unsupervised Acquisition of Predominant Word Senses in Computational Linguistics 33 (4), 2007 | 
Winter 2007/2008
| Nov 26 | Markéta Lopatková | Friedrich Otto: Restarting Automata (Notes for a Course), Technical Report, Universitat kassel, 2004. | 
|---|---|---|
| Nov 19 | Dan Zeman | Anil Kumar Singh (अनिल कुमार सिंह), Jagadeesh Gorla: Identification of Languages and Encodings in a Multilingual Document Identification of Languages and Encodings in a Multilingual Document. In: Proceedings of the 3rd ACL SIGWAC Workshop on Web as Corpus, pp. 95-108. Louvain-la-Neuve, Belgium, 2007. | 
| Nov 12 | Otakar Smrž | M. Nowak, N. Komarova, P. Niyogi. Computational and Evolutionary Aspects of Language. Nature, Vol. 417, pp. 611-617, 2002. | 
| Nov 5 | Jan Ptáček | Philipp Koehn; Hieu Hoang: Factored Translation Models, EMNLP & CoNLL, Prague, 2007. | 
| Oct 29 | Miroslav Spousta | David Talbot; Miles Osborne Smoothed Bloom Filter Language Models: Tera-Scale LMs on the Cheap http://www.aclweb.org/anthology-new/D/D07/D07-1049.pdf | 
| Oct 17 | Zdeněk Žabokrtský | Mary Hearne, John Tinsley, Ventsislav Zhechev, Andy Way (2007): Capturing Translational Divergences with a Statistical Tree-to-Tree Aligner HearneEtAl_TMI_07.pdf | 
Summer 2007
Winter 2006/2007
| Nov 15 | Glöckner, Ingo; Sven Hartrumpf; and Hermann Helbig (2006): Automatic knowledge acquisition by semantic analysis and assimilation of textual information gloeckner.ps - plná verze | 
|---|
Summer 2006
| Mar 1 | Ondřej Bojar | D. Chiang: A Hierarchical Phrase-Based Model for Statistical Machine Translation, ACL, Ann Arbor, 2005. | |
|---|---|---|---|
| Mar 8 | Zdeněk Žabokrtský | S. Kahan: The Meaning-Text Theory. | |
| Mar 15 | Pavel Schlesinger | N. A. Smith and J. Eisner: Contrastive Estimation: Training Log-Linear Models on Unlabeled Data. ACL, Ann Arbor, 2005. | |
| Mar 22 | Pavel Pecina | D. Ravichandran, P. Pantel and E.Hovy: Randomized Algorithms and NLP: Using Locality Sensitive Hash Functions for High Speed Noun Clustering, ACL, Ann Arbor, 2005. | |
| Apr 5 | Pavel Straňák | Keh-Jiann Chen, Chi-Ching Luo, Ming-Chung Chang, Feng-Yi Chen, Chao-Jan Chen, Chu-Ren Huang: Sinica Treebank: Design criteria, representational issues and implementation. | |
| Apr 12 | Zdeněk Žabokrtský | P. Sgall: Prague School Typology. In Masayoshi Shibatani and Theodora Bynon (eds), Approaches to Language Typology. Clarendon Press, Oxford, United Kingdom, 1995. | |
| Apr 19 | Barbora Hladká | B. Scholkopf and A.J. Smola: A Short Introduction to Learning Method with Kernels. | |
| May 3 | Otakar Smrž | ||
| May 10 | Barbora Hladká | B. Scholkopf and A.J. Smola: A Short Introduction to Learning Method with Kernels. | |
Winter 2005/2006
| Oct 19 | Kiril Ribarov | Non-projective Dependency Parsing using Spanning Tree Algorithms. | |
|---|---|---|---|
| Oct 26 | Petr Podveský | K. Crammer and Y. Singer: Ultraconservative on-line algorithms for multiclass problems, JMLR, 2003. | |
| Nov 2 | Jiří Havelka | L. Georgiadis: Arborescence optimization problems solvable by Edmonds’ algorithm. | |
| Nov 9 | Barbora Hladká | ||
| Nov 16 | Pavel Pecina | B. Moore: Discriminative Framework for Bilingual Word Alignment, HLT/EMNLP, Vancouver, 2005. | |
| Nov 23 | Otakar Smrž | Noah A. Smith, David A. Smith, Roy W. Tromble: Context-Based Morphological Disambiguation with Random Fields. | |
| Nov 30 | Václav Novák | J. Eisner and D. Karakos: Bootstrapping Without the Boot, HLT/EMNLP, Vancouver, 2005. | |
| Dec 7 | Pavel Schlesinger | B. Taskar, D. Klein, M. Collins, D. Koller and C. Manning: Max-Margin Parsing, EMNLP, Barcelona, 2004. | |
| Dec 14 | Daniel Zeman | D. Zeman, Z. Zabokrstky: Improving Parsing Accuracy by Combining Diverse Dependecy Parsers, IWPT, Vancouver, 2005. | |
| Jan 4 | Ondřej Bojar | Franz Och: Tutorial, MT Summit, 2005. | |
| Jan 11 | Jiří Semecký | M. Carpuat and D. Wu: Word Sense Disambiguation vs. Statistical Machine Translation. | |
