courses:rg [ufal wiki]

This is an old revision of the document!

Reading Group

Contact	popel at ufal.mff.cuni.cz
Mailing list	rg at ufal.mff.cuni.cz
List Archive	http://ufal.mff.cuni.cz/mailman/listinfo/rg
Meetings	Mondays 15:10, room S1

From autumn 2010, there are new pages

Wishlist

Mark Johnson: Why Doesn't EM Find Good HMM POS-Taggers? (Ondřej)
Eugene Charniak: A maximum-entropy-inspired parser (Zdeněk)
Abhishek Arun, Chris Dyer, Barry Haddow, Phil Blunsom, Adam Lopez and Philipp Koehn: Monte Carlo Inference and Maximization for Phrase-based Translation. Conference on Computational Natural Language Learning, 2009.
Phil Blunsom, Trevor Cohn, Chris Dyer and Miles Osborne: A Gibbs Sampler for Phrasal Synchronous Grammar Induction. ACL-IJCNLP 2009
Trevor Cohn and Phil Blunsom: A Bayesian Model of Syntax-Directed Tree to String Grammar Induction. EMNLP 2009.

A source of inspiration: Edinburgh Reading Group

Summer 2010

date	speaker	paper
May 24	Ondřej Bojar	Kevin Gimpel and Noah A. Smith: Cube Summing, Approximate Inference with Non-Local Features, and Dynamic Programming without Semirings. In Proceedings of EACL, Athens, Greece, March/April 2009 slides
May 10	Martin Popel	Ronald Rosenfeld: A Maximum Entropy Approach to Adaptive Statistical Language Modeling (cont.) Martinovy poznámky
May 3	Elizabeth Shriberg	Automatic speaker recognition, feature selection, linear interpolation and its connotations, demography and stock market
Apr 26	Martin Popel	Ronald Rosenfeld: A Maximum Entropy Approach to Adaptive Statistical Language Modeling
Apr 12	David Mareček	Jens Nillson, Joakim Nivre, Johan Hall: Graph Transformations in Data-Driven Dependency Parsing
Mar 29	Martin Popel	Jeff Bilmes and Katrin Kirchhoff: Factored Language Models and Generalized Parallel Backoff (plus tutorial)
Mar 22	Zdeněk Žabokrtský	Deniz Yuret and Mehmet Ali Yatbaz: The Noisy Channel Model for Unsupervised Word Sense Disambiguation, Computational Linguistics 2010
Mar 15	Ondřej Bojar	Philipp Koehn and Barry Haddow: Interactive Assistance to Human Translators using Statistical Machine Translation Methods, MT Summit XII, 2009.
Mar 8		startup meeting

Winter 2009/2010

date	speaker	paper	přečíst?
Dec 14	Martin Popel	Wei Lu, Hwee Tou Ng, Wee Sun Lee: Natural Language Generation with Tree Conditional Random Fields
Dec 7	David Mareček	Nivre, J.: Non-Projective Dependency Parsing in Expected Linear Time. ACL 2009
Nov 23, Nov 30	Jana Straková	John Lafferty, Andrew McCallum and Fernando Pereira: Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data, Hanna M. Wallach. Conditional Random Fields: An Introduction
Nov 9	Zdeněk Žabokrtský	Eugene Charniak and Micha Elsner: EM Works for Pronoun Anaphora Resolution. EACL 2009
Oct 26, Nov 2	Ondřej Bojar	Kevin Knight: Bayesian Inference with Tears. September 2009. článek byl delší, ale čtivý a přesně takové jsou i Ondrovy poznámky k němu	Ano, i s úkoly
Oct 19	Eda Bejček	Josh Schroeder, Trevor Cohn, and Philipp Koehn: Word Lattices for Multi-Source Translation. EACL 2009 přehledné rozdělení metod; dobře, že udělali tolik experimentů, škoda některých nepodložených interpretací; potřebná znalost Mojžíše; omezili množství trénovacích dat – chybí test, zda více dat není účinnější než více jazyků; klesá skutečně MAX s množstvím jazyků? (to je divné, neměly by se tedy vážit výsledky jednotlivých systémů (viz věta s “little benefit”, 2.1)? A nebo není příčinou spíš než přidání šestého jazyka přidání špatného jazyka (Table 6)? Jak by dopadly testy po dvojicích?); v závěru 2.3 vynechávají reordering s odkazem na diversitu zdrojových jazyků – to nemusí platit	Ano
Oct 15	Pavel Pecina	Daniel David Walker, Eric K. Ringger: Model-based document clustering with a collapsed gibbs sampler, Fotky tabule
Oct 12	Pavel Schlesinger	Gibbsův sampling (http://en.wikipedia.org/wiki/Gibbs_sampling)

Summer 2009

date	speaker	paper
May 25	David Mareček	Christian Hanig, Stefan Bordag, Uwe Quasthoff: UnsuParse: Unsupervised Parsing with unsupervised Part of Speech tagging. LREC 2009
May 11	Václav Novák	Julien Ah-Pine, Guillaume Jacquet: Clique-Based Clustering for improving Named Entity Recognition systems. EACL 2009
May 4	Dan Zeman	Kristina Toutanova, Hisami Suzuki, Achim Ruopp: Applying Morphology Generation Models to Machine Translation. ACL 2008, Columbus, Ohio
Apr 6	Zdeněk Žabokrtský	Pascal Denis and Jason Baldridge, Specialized models and ranking for coreference resolution
Mar 30	Pavel Schlesinger
Mar 9	Pavel Pecina

Winter 2008/2009

date	speaker	paper
Mon Jan 5	Martin Popel	Thorsten Brants, Ashok C. Popat, Peng Xu, Franz J. Och, Jeffrey Dean: Large Language Models in Machine Translation, 2007
Mon Dec 1	Jan Ptáček	Improving Statistical MT through Morphological Analysis
Mon Nov 24	Pavel Češka	A TAG-based noisy channel model of speech repairs
Wed Nov 19	Ondřej Bojar	Forest Reranking: Discriminative Parsing with Non-Local Features by Liang Huang (see Google Tech Talks), Forest-Based Translation by Haitao Mi and Liang Huang and Qun Liu
Mon Nov 10	Zdeněk Žabokrtský	Katja Filippova and Michael Strube: Sentence Fusion via Dependency Graph Compression, 2008
Mon Nov 3	Jiří Mírovský	Alexander E. Richman and Patrick Schone: Mining Wiki Resources for Multilingual Named Entity Recognition, ACL, Columbus, 2008.
Wed Oct 20	Jan Raab	Libin Shen, Giorgio Satta, and Aravind K. Joshi: Guided Learning for Bidirectional Sequence Classification, ACL, Prague, 2007.

Summer 2008

date	speaker	paper
Mar 17	Pavel Schlesinger	Aria Haghighi and Dan Klein: Unsupervised Coreference Resolution in a Nonparametric Bayesian Model, ACL, Prague, 2007.
Mar 31	Pavel Schlesinger	Unsupervised Coreference Resolution in a Nonparametric Bayesian Model, 2nd part
Apr 28	Pavel Straňák	Diana McCarthy, Rob Koeling, Julie Weeds and John Carroll: Unsupervised Acquisition of Predominant Word Senses in Computational Linguistics 33 (4), 2007

Winter 2007/2008

Nov 26	Markéta Lopatková	Friedrich Otto: Restarting Automata (Notes for a Course), Technical Report, Universitat kassel, 2004.
Nov 19	Dan Zeman	Anil Kumar Singh (अनिल कुमार सिंह), Jagadeesh Gorla: Identification of Languages and Encodings in a Multilingual Document Identification of Languages and Encodings in a Multilingual Document. In: Proceedings of the 3rd ACL SIGWAC Workshop on Web as Corpus, pp. 95-108. Louvain-la-Neuve, Belgium, 2007.
Nov 12	Otakar Smrž	M. Nowak, N. Komarova, P. Niyogi. Computational and Evolutionary Aspects of Language. Nature, Vol. 417, pp. 611-617, 2002.
Nov 5	Jan Ptáček	Philipp Koehn; Hieu Hoang: Factored Translation Models, EMNLP & CoNLL, Prague, 2007.
Oct 29	Miroslav Spousta	David Talbot; Miles Osborne Smoothed Bloom Filter Language Models: Tera-Scale LMs on the Cheap http://www.aclweb.org/anthology-new/D/D07/D07-1049.pdf
Oct 17	Zdeněk Žabokrtský	Mary Hearne, John Tinsley, Ventsislav Zhechev, Andy Way (2007): Capturing Translational Divergences with a Statistical Tree-to-Tree Aligner HearneEtAl_TMI_07.pdf

Summer 2007

Winter 2006/2007

Nov 15		Glöckner, Ingo; Sven Hartrumpf; and Hermann Helbig (2006): Automatic knowledge acquisition by semantic analysis and assimilation of textual information gloeckner.ps - plná verze

Summer 2006

Mar 1	Ondřej Bojar	D. Chiang: A Hierarchical Phrase-Based Model for Statistical Machine Translation, ACL, Ann Arbor, 2005.
Mar 8	Zdeněk Žabokrtský	S. Kahan: The Meaning-Text Theory.
Mar 15	Pavel Schlesinger	N. A. Smith and J. Eisner: Contrastive Estimation: Training Log-Linear Models on Unlabeled Data. ACL, Ann Arbor, 2005.
Mar 22	Pavel Pecina	D. Ravichandran, P. Pantel and E.Hovy: Randomized Algorithms and NLP: Using Locality Sensitive Hash Functions for High Speed Noun Clustering, ACL, Ann Arbor, 2005.
Apr 5	Pavel Straňák	Keh-Jiann Chen, Chi-Ching Luo, Ming-Chung Chang, Feng-Yi Chen, Chao-Jan Chen, Chu-Ren Huang: Sinica Treebank: Design criteria, representational issues and implementation.
Apr 12	Zdeněk Žabokrtský	P. Sgall: Prague School Typology. In Masayoshi Shibatani and Theodora Bynon (eds), Approaches to Language Typology. Clarendon Press, Oxford, United Kingdom, 1995.
Apr 19	Barbora Hladká	B. Scholkopf and A.J. Smola: A Short Introduction to Learning Method with Kernels.
May 3	Otakar Smrž
May 10	Barbora Hladká	B. Scholkopf and A.J. Smola: A Short Introduction to Learning Method with Kernels.

Winter 2005/2006

Oct 19	Kiril Ribarov	Non-projective Dependency Parsing using Spanning Tree Algorithms.
Oct 26	Petr Podveský	K. Crammer and Y. Singer: Ultraconservative on-line algorithms for multiclass problems, JMLR, 2003.
Nov 2	Jiří Havelka	L. Georgiadis: Arborescence optimization problems solvable by Edmonds’ algorithm.
Nov 9	Barbora Hladká
Nov 16	Pavel Pecina	B. Moore: Discriminative Framework for Bilingual Word Alignment, HLT/EMNLP, Vancouver, 2005.
Nov 23	Otakar Smrž	Noah A. Smith, David A. Smith, Roy W. Tromble: Context-Based Morphological Disambiguation with Random Fields.
Nov 30	Václav Novák	J. Eisner and D. Karakos: Bootstrapping Without the Boot, HLT/EMNLP, Vancouver, 2005.
Dec 7	Pavel Schlesinger	B. Taskar, D. Klein, M. Collins, D. Koller and C. Manning: Max-Margin Parsing, EMNLP, Barcelona, 2004.
Dec 14	Daniel Zeman	D. Zeman, Z. Zabokrstky: Improving Parsing Accuracy by Combining Diverse Dependecy Parsers, IWPT, Vancouver, 2005.
Jan 4	Ondřej Bojar	Franz Och: Tutorial, MT Summit, 2005.
Jan 11	Jiří Semecký	M. Carpuat and D. Wu: Word Sense Disambiguation vs. Statistical Machine Translation.

[ Back to the navigation ] [ Back to the content ]

Institute of Formal and Applied Linguistics Wiki

Reading Group

Wishlist

Summer 2010

Winter 2009/2010

Summer 2009

Winter 2008/2009

Summer 2008

Winter 2007/2008

Summer 2007

Winter 2006/2007

Summer 2006

Winter 2005/2006