                | courses:rg [2010/10/11 22:01] popel Paper for Oct 18 & formating issues
 | courses:rg [2022/02/08 00:40] (current) rosa +phd rg
 | 
        
| ~~NOTOC~~ | ~~NOTOC~~ | 
| ===== Reading Group ===== |  | 
|  | ===== Reading Group for Master students ===== | 
| The official name of this course is [[https://is.cuni.cz/studium/predmety/index.php?do=predmet&kod=NPFL095|NPFL095]] **Modern Methods in Computational Linguistics**. It continues the informal Reading Group (RG) meetings. | The official name of this course is [[https://is.cuni.cz/studium/predmety/index.php?do=predmet&kod=NPFL095|NPFL095]] **Modern Methods in Computational Linguistics**. It continues the informal Reading Group (RG) meetings. | 
|  |  | 
| ^ Contact      | popel at ufal.mff.cuni.cz | | Since 2016, the wiki has been hosted at https://github.com/ufal/NPFL095/wiki and the mailing list at [[https://groups.google.com/forum/#!forum/npfl095|npfl095@googlegroups.com]]. | 
| ^ Mailing list | rg at ufal.mff.cuni.cz     | | See also [[courses:rg:past|an overview of past meetings]], [[courses:rg:wishlist|an outdated wishlist]] and [[https://github.com/ufal/rg/wiki|Machine Learning RG (active in 2014)]]. | 
| ^ List Archive | [[http://ufal.mff.cuni.cz/mailman/listinfo/rg]] | |  | 
| ^ Meetings     | Mondays 15:10, room S1 | |  | 
|  |  | 
| === Wishlist === |  | 
|  |  | 
| * Mark Johnson: [[http://acl.ldc.upenn.edu/D/D07/D07-1031.pdf|Why Doesn't EM Find Good HMM POS-Taggers?]] (Ondřej) |  | 
| * Eugene Charniak: [[http://acl.ldc.upenn.edu/A/A00/A00-2018.pdf|A maximum-entropy-inspired parser]] (Zdeněk) |  | 
| * [[http://www.aclweb.org/anthology/W/W09/W09-1114.pdf |Abhishek Arun, Chris Dyer, Barry Haddow, Phil Blunsom, Adam Lopez and Philipp Koehn: Monte Carlo Inference and Maximization for Phrase-based Translation. Conference on Computational Natural Language Learning, 2009. ]] |  | 
| * [[http://homepages.inf.ed.ac.uk/pblunsom/pubs/blunsom-acl09.pdf|Phil Blunsom, Trevor Cohn, Chris Dyer and Miles Osborne: A Gibbs Sampler for Phrasal Synchronous Grammar Induction. ACL-IJCNLP 2009]] |  | 
| * [[http://homepages.inf.ed.ac.uk/pblunsom/pubs/cohn-blunsom-emnlp09.pdf|Trevor Cohn and Phil Blunsom: A Bayesian Model of Syntax-Directed Tree to String Grammar Induction. EMNLP 2009.]] |  | 
|  |  | 
| A source of inspiration: [[http://www.statmt.org/ued/?n=Public.WeeklyMeeting|Edinburgh Reading Group]] |  | 
|  |  | 
| === Winter 2010/2011 === |  | 
| ^ date | **speaker** | **paper** | |  | 
| ^ Jan 10 |  |  | |  | 
| ^ Jan  3 |  |  | |  | 
| ^ Dec 20 | Lasha Abzianidze |  | |  | 
| ^ Dec 13 | Srdjan Prodanovic |  | |  | 
| ^ Dec  6 | Karel Vandas |  | |  | 
| ^ Nov 29 | Bushra Jawaid |  | |  | 
| ^ Nov 22 | Septina Larasati |  | |  | 
| ^ Nov 15 | Angelina Ivanova |  | |  | 
| ^ Nov  8 | Michal Novák |  | |  | 
| ^ Nov  1 | Martin Kirschner |  | |  | 
| ^ Oct 25 | Loganathan Ramasamy |  | |  | 
| ^ Oct 18 | Martin Popel | Kevin Duh et al.: [[http://www.aclweb.org/anthology/W/W10/W10-1757.pdf|N-Best Reranking by Multitask Learning]]  | |  | 
| ^ Oct 11 | Eda Bejček | Matthew Gerber and Joyce Y. Chai: [[http://aclweb.org/anthology-new/P/P10/P10-1160.pdf| Beyond NomBank: A Study of Implicit Arguments for Nominal Predicates]]| |  | 
| ^ Oct  4 |  | startup meeting | |  | 
|  |  | 
|  |  | 
|  |  | 
| === Summer 2010 === |  | 
| ^ date | **speaker** | **paper** | |  | 
| ^ May 24 | Ondřej Bojar | Kevin Gimpel and Noah A. Smith: [[http://www.cs.cmu.edu/~kgimpel/papers/gimpel+smith.eacl09.pdf|Cube Summing, Approximate Inference with Non-Local Features, and Dynamic Programming without Semirings. In Proceedings of EACL, Athens, Greece, March/April 2009]] [[http://www.cs.cmu.edu/~kgimpel/talks/gimpel+smith.eacl09.slides.pdf|slides]] | |  | 
| ^ May 10 | Martin Popel | Ronald Rosenfeld: [[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.61.1472&rep=rep1&type=pdf|A Maximum Entropy Approach to Adaptive Statistical Language Modeling]] (cont.) [[courses::rg:maxent-lm|Martin's notes]]| |  | 
| ^ May 3 | Elizabeth Shriberg | Automatic speaker recognition, feature selection, linear interpolation and its connotations, demography and stock market | |  | 
| ^ Apr 26 | Martin Popel | Ronald Rosenfeld: [[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.61.1472&rep=rep1&type=pdf|A Maximum Entropy Approach to Adaptive Statistical Language Modeling]] | |  | 
| ^ Apr 12 | David Mareček | Jens Nilsson, Joakim Nivre, Johan Hall: [[http://acl.ldc.upenn.edu/P/P06/P06-1033.pdf|Graph Transformations in Data-Driven Dependency Parsing]] | |  | 
| ^ Mar 29 | Martin Popel | Jeff Bilmes and Katrin Kirchhoff: [[http://acl.ldc.upenn.edu/N/N03/N03-2002.pdf|Factored Language Models and Generalized Parallel Backoff]] (plus [[https://www.ee.washington.edu/techsite/papers/documents/UWEETR-2008-0004.pdf|tutorial]]) | |  | 
| ^ Mar 22 | Zdeněk Žabokrtský | Deniz Yuret and Mehmet Ali Yatbaz: [[http://www.mitpressjournals.org/doi/pdf/10.1162/coli.2010.36.1.36103|The Noisy Channel Model for Unsupervised Word Sense Disambiguation, Computational Linguistics 2010]] | |  | 
| ^ Mar 15 | Ondřej Bojar | Philipp Koehn and Barry Haddow: [[http://www.mt-archive.info/MTS-2009-Koehn-2.pdf|Interactive Assistance to Human Translators using Statistical Machine Translation Methods, MT Summit XII, 2009.]] | |  | 
| ^ Mar 8 |  | startup meeting | |  | 
|  |  | 
| === Winter 2009/2010 === |  | 
|  |  | 
| ^ date | **speaker** | **paper** | **worth reading?** | |  | 
| ^ Dec 14 | Martin Popel | [[http://nlp.csie.ncnu.edu.tw/~shin/acl-ijcnlp2009/proceedings/CDROM/EMNLP/pdf/EMNLP042.pdf|Wei Lu, Hwee Tou Ng, Wee Sun Lee: Natural Language Generation with Tree Conditional Random Fields]] | | |  | 
| ^ Dec 7 | David Mareček | [[http://www.aclweb.org/anthology/P/P09/P09-1040.pdf|Nivre, J.: Non-Projective Dependency Parsing in Expected Linear Time. ACL 2009]] | | |  | 
| ^ Nov 23, Nov 30 | Jana Straková | [[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.23.9849&rep=rep1&type=pdf|John Lafferty, Andrew McCallum and Fernando Pereira: Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data]],\\  [[http://www.inference.phy.cam.ac.uk/hmw26/papers/crf_intro.pdf|Hanna M. Wallach. Conditional Random Fields: An Introduction]] | | |  | 
| ^ Nov 9 | Zdeněk Žabokrtský | [[http://www.aclweb.org/anthology/E/E09/E09-1018.pdf|Eugene Charniak and Micha Elsner: EM Works for Pronoun Anaphora Resolution. EACL 2009]]  | | |  | 
| ^ Oct 26, Nov 2 | Ondřej Bojar | [[http://www.isi.edu/natural-language/people/bayes-with-tears.pdf|Kevin Knight: Bayesian Inference with Tears. September 2009.]]\\ the paper was on the long side but readable, and the same goes for [[courses:rg:bayes-with-tears|Ondřej's notes]] on it | Yes, including the exercises | |  | 
| ^ Oct 19 | Eda Bejček | [[http://www.aclweb.org/anthology/E/E09/E09-1082.pdf|Josh Schroeder, Trevor Cohn, and Philipp Koehn: Word Lattices for Multi-Source Translation. EACL 2009]]\\ a clear classification of the methods; good that they ran so many experiments, a pity about some unsupported interpretations; knowledge of Moses is required; they limited the amount of training data -- a test is missing of whether more data would be more effective than more languages; does MAX really decrease with the number of languages? (that is strange; shouldn't the outputs of the individual systems then be weighted (see the sentence with "little benefit", 2.1)? Or is the cause the addition of a bad language rather than the addition of a sixth language (Table 6)? How would pairwise tests turn out?); at the end of 2.3 they omit reordering, citing the diversity of the source languages -- this need not hold |  Yes  | |  | 
| ^ Oct 15 | Pavel Pecina | [[http://portal.acm.org/citation.cfm?id=1401975|Daniel David Walker, Eric K. Ringger: Model-based document clustering with a collapsed Gibbs sampler]], [[courses:rg:2009-10-15-tabule|Photos of the board]] | | |  | 
| ^ Oct 12 | Pavel Schlesinger | Gibbs sampling (http://en.wikipedia.org/wiki/Gibbs_sampling)| | |  | 
|  |  | 
|  |  | 
|  |  | 
| === Summer 2009 === |  | 
|  |  | 
| ^ date | **speaker** | **paper** | |  | 
| ^ May 25 | David Mareček | [[http://www.lrec-conf.org/proceedings/lrec2008/pdf/286_paper.pdf|Christian Hänig, Stefan Bordag, Uwe Quasthoff: UnsuParse: Unsupervised Parsing with unsupervised Part of Speech tagging. LREC 2008]] | |  | 
| ^ May 11 | Václav Novák | [[http://newdesign.aclweb.org/anthology-new/E/E09/E09-1007.pdf|Julien Ah-Pine, Guillaume Jacquet: Clique-Based Clustering for improving Named Entity Recognition systems. EACL 2009]] | |  | 
| ^ May 4 | Dan Zeman | [[http://www.aclweb.org/anthology/P/P08/P08-1059|Kristina Toutanova, Hisami Suzuki, Achim Ruopp: Applying Morphology Generation Models to Machine Translation. ACL 2008, Columbus, Ohio]] | |  | 
| ^ Apr 6 | Zdeněk Žabokrtský | [[http://pauillac.inria.fr/~pdenis/papers/emnlp08.pdf|Pascal Denis and Jason Baldridge, Specialized models and ranking for coreference resolution]] | |  | 
| ^ Mar 30 | Pavel Schlesinger |  | |  | 
| ^ Mar 9 | Pavel Pecina |  | |  | 
|  |  | 
| === Winter 2008/2009 === |  | 
|  |  | 
| ^ date | **speaker** | **paper** | |  | 
| ^ Mon Jan 5 | Martin Popel | Thorsten Brants, Ashok C. Popat, Peng Xu, Franz J. Och, Jeffrey Dean: {{courses:rg:2007brants-large_language_models_in_mt.pdf|Large Language Models in Machine Translation}}, 2007 | |  | 
| ^ Mon Dec 1 | Jan Ptáček | {{courses:rg:goldwater-mcclosky-czech-english-mt-0-33.pdf|Improving Statistical MT through Morphological Analysis}} | |  | 
| ^ Mon Nov 24 | Pavel Češka | [[http://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=656CAA9B55D39BC763A26946D91E7FBD?doi=10.1.1.61.3358&rep=rep1&type=pdf|A TAG-based noisy channel model of speech repairs]] | |  | 
| ^ Wed Nov 19 | Ondřej Bojar | [[http://aclweb.org/anthology-new/P/P08/P08-1023.pdf|Forest Reranking: Discriminative Parsing with Non-Local Features by Liang Huang]] (see Google Tech Talks), [[http://aclweb.org/anthology-new/P/P08/P08-1023.pdf|Forest-Based Translation]] by Haitao Mi and Liang Huang and Qun Liu | |  | 
| ^ Mon Nov 10 | Zdeněk Žabokrtský | Katja Filippova and Michael Strube: [[http://www.eml-research.de/nlp/papers/filippova.emnlp08.pdf|Sentence Fusion via Dependency Graph Compression]], 2008 | |  | 
| ^ Mon Nov 3 | Jiří Mírovský | Alexander E. Richman and Patrick Schone: Mining Wiki Resources for Multilingual Named Entity Recognition, ACL, Columbus, 2008. | |  | 
| ^ Wed Oct 20 | Jan Raab | Libin Shen, Giorgio Satta, and Aravind K. Joshi: Guided Learning for Bidirectional Sequence Classification, ACL, Prague, 2007. | |  | 
|  |  | 
| === Summer 2008 === |  | 
|  |  | 
| ^ date | **speaker** | **paper** | |  | 
| ^ Mar 17 | Pavel Schlesinger | Aria Haghighi and Dan Klein: //[[http://www.eecs.berkeley.edu/~aria42/pubs/acl07-hdp-coref.pdf|Unsupervised Coreference Resolution in a Nonparametric Bayesian Model]]//, ACL, Prague, 2007.   | |  | 
| ^ Mar 31 | Pavel Schlesinger | //Unsupervised Coreference Resolution in a Nonparametric Bayesian Model//, 2nd part :-)| |  | 
| ^ Apr 28 | Pavel Straňák | Diana McCarthy, Rob Koeling, Julie Weeds and John Carroll: //{{courses:unsupervised_acquisition_of_predominant_word_senses.pdf|Unsupervised Acquisition of Predominant Word Senses}}// in Computational Linguistics 33 (4), 2007 | |  | 
|  |  | 
| === Winter 2007/2008 === |  | 
| ^ Nov 26 | Markéta Lopatková | Friedrich Otto: {{courses:rg:tr-ra-tutorial.ps|Restarting Automata (Notes for a Course)}}, Technical Report, Universität Kassel, 2004. | |  | 
| ^ Nov 19 | Dan Zeman | Anil Kumar Singh (अनिल कुमार सिंह), Jagadeesh Gorla: {{courses:rg:kumar-gorla-identification-of-languages.pdf|Identification of Languages and Encodings in a Multilingual Document}} {{courses:kumar-gorla-identification-of-languages.zip|Identification of Languages and Encodings in a Multilingual Document}}. In: Proceedings of the 3rd ACL SIGWAC Workshop on Web as Corpus, pp. 95-108. Louvain-la-Neuve, Belgium, 2007. | |  | 
| ^ Nov 12 | Otakar Smrž | M. Nowak, N. Komarova, P. Niyogi. [[http://people.cs.uchicago.edu/~niyogi/papersps/NKNnature.pdf|Computational and Evolutionary Aspects of Language]]. Nature, Vol. 417, pp. 611-617, 2002. | |  | 
| ^ Nov 5 | Jan Ptáček | Philipp Koehn; Hieu Hoang: **{{courses:factored-translation-models.pdf|Factored Translation Models}}**, EMNLP & CoNLL, Prague, 2007. | |  | 
| ^ Oct 29 | Miroslav Spousta | David Talbot; Miles Osborne: Smoothed Bloom Filter Language Models: Tera-Scale LMs on the Cheap, http://www.aclweb.org/anthology-new/D/D07/D07-1049.pdf | |  | 
| ^ Oct 17 | Zdeněk Žabokrtský | **Mary Hearne, John Tinsley, Ventsislav Zhechev, Andy Way (2007)**: Capturing Translational Divergences with a Statistical Tree-to-Tree Aligner {{seminare:hearneetal_tmi_07.pdf|HearneEtAl_TMI_07.pdf}} | |  | 
|  |  | 
| === Summer 2007 === |  | 
|  |  | 
| === Winter 2006/2007 === |  | 
|  |  | 
| ^ Nov 15 | | Glöckner, Ingo; Sven Hartrumpf; and Hermann Helbig (2006): Automatic knowledge acquisition by semantic analysis and assimilation of textual information {{:seminare:reading-group:gloeckner.ps|gloeckner.ps - full version}} | |  | 
|  |  | 
| === Summer 2006 === |  | 
|  |  | 
| ^ Mar 1  | Ondřej Bojar | D. Chiang: **A Hierarchical Phrase-Based Model for Statistical Machine Translation**, ACL, Ann Arbor, 2005. | | ===== Reading Group for PhD students ===== | 
| ^ Mar 8  | Zdeněk Žabokrtský | S. Kahane: **The Meaning-Text Theory**. | |  | 
| ^ Mar 15 | Pavel Schlesinger | N. A. Smith and J. Eisner: **Contrastive Estimation: Training Log-Linear Models on Unlabeled Data**. ACL, Ann Arbor, 2005. | |  | 
| ^ Mar 22 | Pavel Pecina | D. Ravichandran, P. Pantel and E.Hovy: **Randomized Algorithms and NLP: Using Locality Sensitive Hash Functions for High Speed Noun Clustering**, ACL, Ann Arbor, 2005. | |  | 
| ^ Apr 5  | Pavel Straňák | Keh-Jiann Chen, Chi-Ching Luo, Ming-Chung Chang, Feng-Yi Chen, Chao-Jan Chen, Chu-Ren Huang: **Sinica Treebank: Design criteria, representational issues and implementation**. | |  | 
| ^ Apr 12 | Zdeněk Žabokrtský | P. Sgall: **Prague School Typology**. In Masayoshi Shibatani and Theodora Bynon (eds), Approaches to Language Typology. Clarendon Press,  Oxford, United Kingdom, 1995. | |  | 
| ^ Apr 19 | Barbora Hladká | B. Schölkopf and A.J. Smola: **A Short Introduction to Learning with Kernels**. | |  | 
| ^ May 3  | Otakar Smrž | || |  | 
| ^ May 10 | Barbora Hladká | B. Schölkopf and A.J. Smola: **A Short Introduction to Learning with Kernels**. | |  | 
|  |  | 
| === Winter 2005/2006 === | See the [[https://ufal.mff.cuni.cz/courses/rg/|website of PhD reading group]] (related also to a previous reading group called Deep Learning Seminar originally led by Milan Straka). | 
|  |  | 
| ^ Oct 19 | Kiril Ribarov    | **Non-projective Dependency Parsing using Spanning Tree Algorithms**. | |  | 
| ^ Oct 26 | Petr Podveský    | K. Crammer and Y. Singer: **Ultraconservative on-line algorithms for multiclass problems**, JMLR, 2003. | |  | 
| ^ Nov 2  | Jiří Havelka     | L. Georgiadis: **Arborescence optimization problems solvable by Edmonds’ algorithm**. | |  | 
| ^ Nov 9  | Barbora Hladká   | | |  | 
| ^ Nov 16 | Pavel Pecina     | B. Moore: **Discriminative Framework for Bilingual Word Alignment**, HLT/EMNLP, Vancouver, 2005. | |  | 
| ^ Nov 23 | Otakar Smrž         | Noah A. Smith, David A. Smith, Roy W. Tromble: **Context-Based Morphological Disambiguation with Random Fields**. || |  | 
| ^ Nov 30 | Václav Novák     | J. Eisner and D. Karakos: **Bootstrapping Without the Boot**, HLT/EMNLP, Vancouver, 2005. | |  | 
| ^ Dec 7  | Pavel Schlesinger| B. Taskar, D. Klein, M. Collins, D. Koller and C. Manning: **Max-Margin Parsing**, EMNLP, Barcelona, 2004. | |  | 
| ^ Dec 14 | Daniel Zeman     | D. Zeman, Z. Žabokrtský: **Improving Parsing Accuracy by Combining Diverse Dependency Parsers**, IWPT, Vancouver, 2005. | |  | 
| ^ Jan 4  | Ondřej Bojar     | Franz Och: **Tutorial**, MT Summit, 2005. | |  | 
| ^ Jan 11 | Jiří Semecký     | M. Carpuat and D. Wu: **Word Sense Disambiguation vs. Statistical Machine Translation**. | |  |