[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
courses:rg [2010/11/02 15:53]
septina.larasati
courses:rg [2022/02/08 00:40] (current)
rosa +phd rg
Line 1: Line 1:
 ~~NOTOC~~ ~~NOTOC~~
-===== Reading Group =====+ 
 +===== Reading Group for Master students =====  
 Official name of this course is [[https://is.cuni.cz/studium/predmety/index.php?do=predmet&kod=NPFL095|NPFL095]] **Modern Methods in Computational Linguistics**. It is a continuation of informal Reading Group (RG) meetings. Official name of this course is [[https://is.cuni.cz/studium/predmety/index.php?do=predmet&kod=NPFL095|NPFL095]] **Modern Methods in Computational Linguistics**. It is a continuation of informal Reading Group (RG) meetings.
  
-^ Contact      | popel at ufal.mff.cuni.cz | +Since 2016the wiki is moved to https://github.com/ufal/NPFL095/wiki and the mailing list to [[https://groups.google.com/forum/#!forum/npfl095|npfl095@googlegroups.com]]. 
-^ Mailing list | rg at ufal.mff.cuni.cz     | +See also [[courses:rg:past|an overview of past meetings]], [[courses:rg:wishlist|an outdated wishlist]] and [[https://github.com/ufal/rg/wiki|Machine Learning RG (active in 2014)]].
-^ List Archive | [[http://ufal.mff.cuni.cz/mailman/listinfo/rg]] | +
-^ Meetings     | Mondays 15:10room S1 |  +
- +
-=== Wishlist === +
- +
-  * Mark Johnson: [[http://acl.ldc.upenn.edu/D/D07/D07-1031.pdf|Why Doesn't EM Find Good HMM POS-Taggers?]] (Ondřej) +
-  * Eugene Charniak: [[http://acl.ldc.upenn.edu/A/A00/A00-2018.pdf|A maximum-entropy-inspired parser]] (Zdeněk) +
-  * Abhishek Arun, Chris Dyer, Barry Haddow, Phil Blunsom, Adam Lopez and Philipp Koehn: [[http://www.aclweb.org/anthology/W/W09/W09-1114.pdf |Monte Carlo Inference and Maximization for Phrase-based Translation. Conference on Computational Natural Language Learning, 2009. ]] +
-  * Phil Blunsom, Trevor Cohn, Chris Dyer and Miles Osborne: [[http://homepages.inf.ed.ac.uk/pblunsom/pubs/blunsom-acl09.pdf|A Gibbs Sampler for Phrasal Synchronous Grammar Induction. ACL-IJCNLP 2009]] +
-  * Trevor Cohn and Phil Blunsom: [[http://homepages.inf.ed.ac.uk/pblunsom/pubs/cohn-blunsom-emnlp09.pdf|A Bayesian Model of Syntax-Directed Tree to String Grammar Induction. EMNLP 2009.]] +
- +
-A source of inspiration: [[http://www.statmt.org/ued/?n=Public.WeeklyMeeting|Edinburgh Reading Group]], [[http://www.aclweb.org/anthology-new/|ACL archive]], [[http://scholar.google.com]] +
- +
-=== Winter 2010/2011 === +
-^ date | **speaker** | **paper** | +
-^ Jan 10 |  |  | +
-^ Jan  3 | Radoslav Klíč | Richard Wicentowski (2004): {{:courses:rg:wicentowski_2004.pdf|Multilingual Noise-Robust Supervised Morphological Analysis using the WordFrame Model}} | +
-^ Dec 20 | Lasha Abzianidze |  | +
-^ Dec 13 | Srdjan Prodanovic |  | +
-^ Dec  6 | Karel Vandas |  | +
-^ Nov 29 | Bushra Jawaid |  | +
-^ Nov 22 | Angelina Ivanova | Gosse Bouma: {{:courses:rg:bouma_lrec10.pdf|Cross-lingual Ontology Alignment using EuroWordNet and Wikipedia}}, LREC 2010 | +
-^ Nov 15 | Septina Larasati | Linda Wiechetek, Francis M. Tyers, and Thomas Omma: {{:courses:rg:2010-shooting_at_flies_in_the_dark.pdf|Shooting at Flies in the Dark: Rule-Based Lexical Selection for a Minority Language Pair}}, IceTAL 2010 | +
-^ Nov  8 | Michal Novák | Aria Haghighi and Dan Klein: [[http://www.aclweb.org/anthology/N/N10/N10-1061.pdf|Coreference Resolution in a Modular, Entity-Centered Model]], HLT 2010 | +
-^ Nov  1 | Martin Kirschner | Matteo Negri and Milen Kouylekov: [[http://www.aclweb.org/anthology/S/S10/S10-1044.pdf|A WordNet-based system for multi-way classification of semantic relations]] [[courses:rg:a-wordnet-based-system|comments by Septina Larasati]] | +
-^ Oct 25 | Loganathan Ramasamy | Kuzman Ganchev, Jennifer Gillenwater and Ben Taskar: {{:courses:rg:2010_dep_grammar_induction_acl_ijcnlp_09.pdf|Dependency Grammar Induction via Bitext Projection Constraints}}, ACL 2009, [[courses:rg:dependency_grammar_via_bitext_projection|comments by David Mareček]] | +
-^ Oct 18 | Martin Popel | Kevin Duh et al.: [[http://www.aclweb.org/anthology/W/W10/W10-1757.pdf|N-Best Reranking by Multitask Learning]], ACL WMT 2010, [[courses:rg:reranking-by-multitask-learning|comments by Karel Vandas]] | +
-^ Oct 11 | Eda Bejček | Matthew Gerber and Joyce Y. Chai: [[http://aclweb.org/anthology-new/P/P10/P10-1160.pdf| Beyond NomBank: A Study of Implicit Arguments for Nominal Predicates]], ACL 2010, [[courses:rg:beyond-nombank|comments by Martin Popel]]+
-^ Oct  4 |  | startup meeting | +
- +
- +
- +
-=== Summer 2010 === +
-^ date | **speaker** | **paper** | +
-^ May 24 | Ondřej Bojar | Kevin Gimpel and Noah A. Smith: [[http://www.cs.cmu.edu/~kgimpel/papers/gimpel+smith.eacl09.pdf|Cube Summing, Approximate Inference with Non-Local Features, and Dynamic Programming without Semirings. In Proceedings of EACL, Athens, Greece, March/April 2009]] [[http://www.cs.cmu.edu/~kgimpel/talks/gimpel+smith.eacl09.slides.pdf|slides]] | +
-^ May 10 | Martin Popel | Ronald Rosenfeld: [[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.61.1472&rep=rep1&type=pdf|A Maximum Entropy Approach to Adaptive Statistical Language Modeling]] (cont.) [[courses::rg:maxent-lm|Martinovy poznámky]]| +
-^ May 3 | Elizabeth Shriberg | Automatic speaker recognition, feature selection, linear interpolation and its connotations, demography and stock market | +
-^ Apr 26 | Martin Popel | Ronald Rosenfeld: [[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.61.1472&rep=rep1&type=pdf|A Maximum Entropy Approach to Adaptive Statistical Language Modeling]] | +
-^ Apr 12 | David Mareček | Jens Nillson, Joakim Nivre, Johan Hall: [[http://acl.ldc.upenn.edu/P/P06/P06-1033.pdf|Graph Transformations in Data-Driven Dependency Parsing]] | +
-^ Mar 29 | Martin Popel | Jeff Bilmes and Katrin Kirchhoff: [[http://acl.ldc.upenn.edu/N/N03/N03-2002.pdf|Factored Language Models and Generalized Parallel Backoff]] (plus [[https://www.ee.washington.edu/techsite/papers/documents/UWEETR-2008-0004.pdf|tutorial]]) | +
-^ Mar 22 | Zdeněk Žabokrtský | Deniz Yuret and Mehmet Ali Yatbaz: [[http://www.mitpressjournals.org/doi/pdf/10.1162/coli.2010.36.1.36103|The Noisy Channel Model for Unsupervised Word Sense Disambiguation, Computational Linguistics 2010]] | +
-^ Mar 15 | Ondřej Bojar | Philipp Koehn and Barry Haddow: [[ +
-http://www.mt-archive.info/MTS-2009-Koehn-2.pdf|Interactive Assistance to Human Translators using Statistical Machine Translation Methods, MT Summit XII, 2009.]] | +
-^ Mar 8 |  | startup meeting | +
- +
-=== Winter 2009/2010 === +
- +
-^ date | **speaker** | **paper** | **přečíst?** | +
-^ Dec 14 | Martin Popel | [[http://nlp.csie.ncnu.edu.tw/~shin/acl-ijcnlp2009/proceedings/CDROM/EMNLP/pdf/EMNLP042.pdf|Wei Lu, Hwee Tou Ng, Wee Sun Lee: Natural Language Generation with Tree Conditional Random Fields]] | | +
-^ Dec 7 | David Mareček | [[http://www.aclweb.org/anthology/P/P09/P09-1040.pdf|Nivre, J.: Non-Projective Dependency Parsing in Expected Linear Time. ACL 2009]] | | +
-^ Nov 23, Nov 30 | Jana Straková | [[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.23.9849&rep=rep1&type=pdf|John Lafferty, Andrew McCallum and Fernando Pereira: Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data]],\\  [[http://www.inference.phy.cam.ac.uk/hmw26/papers/crf_intro.pdf|Hanna M. Wallach. Conditional Random Fields: An Introduction]] | | +
-^ Nov 9 | Zdeněk Žabokrtský | [[http://www.aclweb.org/anthology/E/E09/E09-1018.pdf|Eugene Charniak and Micha Elsner: EM Works for Pronoun Anaphora Resolution. EACL 2009]]  | | +
-^ Oct 26, Nov 2 | Ondřej Bojar | [[http://www.isi.edu/natural-language/people/bayes-with-tears.pdf|Kevin Knight: Bayesian Inference with Tears. September 2009.]]\\ článek byl delší, ale čtivý a přesně takové jsou i [[courses:rg:bayes-with-tears|Ondrovy poznámky]] k němu | Ano, i s úkoly | +
-^ Oct 19 | Eda Bejček | [[http://www.aclweb.org/anthology/E/E09/E09-1082.pdf|Josh Schroeder, Trevor Cohn, and Philipp Koehn: Word Lattices for Multi-Source Translation. EACL 2009]]\\ přehledné rozdělení metod; dobře, že udělali tolik experimentů, škoda některých nepodložených interpretací; potřebná znalost Mojžíše; omezili množství trénovacích dat -- chybí test, zda více dat není účinnější než více jazyků; klesá skutečně MAX s množstvím jazyků? (to je divné, neměly by se tedy vážit výsledky jednotlivých systémů (viz věta s "little benefit", 2.1)? A nebo není příčinou spíš než přidání šestého jazyka přidání špatného jazyka (Table 6)? Jak by dopadly testy po dvojicích?); v závěru 2.3 vynechávají reordering s odkazem na diversitu zdrojových jazyků -- to nemusí platit |  Ano  | +
-^ Oct 15 | Pavel Pecina | [[http://portal.acm.org/citation.cfm?id=1401975|Daniel David Walker, Eric K. Ringger: Model-based document clustering with a collapsed gibbs sampler]], [[courses:rg:2009-10-15-tabule|Fotky tabule]] | | +
-^ Oct 12 | Pavel Schlesinger | Gibbsův sampling (http://en.wikipedia.org/wiki/Gibbs_sampling)+
- +
- +
- +
-=== Summer 2009 === +
- +
-^ date | **speaker** | **paper** | +
-^ May 25 | David Mareček | [[http://www.lrec-conf.org/proceedings/lrec2008/pdf/286_paper.pdf|Christian Hanig, Stefan Bordag, Uwe Quasthoff: UnsuParse: Unsupervised Parsing with unsupervised Part of Speech tagging. LREC 2009]] | +
-^ May 11 | Václav Novák | [[http://newdesign.aclweb.org/anthology-new/E/E09/E09-1007.pdf|Julien Ah-Pine, Guillaume Jacquet: Clique-Based Clustering for improving Named Entity Recognition systems. EACL 2009]] | +
-^ May 4 | Dan Zeman | [[http://www.aclweb.org/anthology/P/P08/P08-1059|Kristina Toutanova, Hisami Suzuki, Achim Ruopp: Applying Morphology Generation Models to Machine Translation. ACL 2008, Columbus, Ohio]] | +
-^ Apr 6 | Zdeněk Žabokrtský | [[http://pauillac.inria.fr/~pdenis/papers/emnlp08.pdf|Pascal Denis and Jason Baldridge, Specialized models and ranking for coreference resolution]] | +
-^ Mar 30 | Pavel Schlesinger |  | +
-^ Mar 9 | Pavel Pecina |  | +
- +
-=== Winter 2008/2009 === +
- +
-^ date | **speaker** | **paper** | +
-^ Mon Jan 5 | Martin Popel | Thorsten Brants, Ashok C. Popat, Peng Xu, Franz J. Och, Jeffrey Dean: {{courses:rg:2007brants-large_language_models_in_mt.pdf|Large Language Models in Machine Translation}}, 2007 | +
-^ Mon Dec 1 | Jan Ptáček | {{courses:rg:goldwater-mcclosky-czech-english-mt-0-33.pdf|Improving Statistical MT through Morphological Analysis}} | +
-^ Mon Nov 24 | Pavel Češka | [[http://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=656CAA9B55D39BC763A26946D91E7FBD?doi=10.1.1.61.3358&rep=rep1&type=pdf|A TAG-based noisy channel model of speech repairs]] | +
-^ Wed Nov 19 | Ondřej Bojar | [[http://aclweb.org/anthology-new/P/P08/P08-1023.pdf|Forest Reranking: Discriminative Parsing with Non-Local Features by Liang Huang]] (see Google Tech Talks), [[http://aclweb.org/anthology-new/P/P08/P08-1023.pdf|Forest-Based Translation]] by Haitao Mi and Liang Huang and Qun Liu | +
-^ Mon Nov 10 | Zdeněk Žabokrtský | Katja Filippova and Michael Strube: [[http://www.eml-research.de/nlp/papers/filippova.emnlp08.pdf|Sentence Fusion via Dependency Graph Compression]], 2008 | +
-^ Mon Nov 3 | Jiří Mírovský | Alexander E. Richman and Patrick Schone: Mining Wiki Resources for Multilingual Named Entity Recognition, ACL, Columbus, 2008. | +
-^ Wed Oct 20 | Jan Raab | Libin Shen, Giorgio Satta, and Aravind K. Joshi: Guided Learning for Bidirectional Sequence Classification, ACL, Prague, 2007. | +
- +
-=== Summer 2008 === +
- +
-^ date | **speaker** | **paper** | +
-^ Mar 17 | Pavel Schlesinger | Aria Haghighi and Dan Klein: //[[http://www.eecs.berkeley.edu/~aria42/pubs/acl07-hdp-coref.pdf|Unsupervised Coreference Resolution in a Nonparametric Bayesian Model]]//, ACL, Prague, 2007.   |  +
-^ Mar 31 | Pavel Schlesinger | //Unsupervised Coreference Resolution in a Nonparametric Bayesian Model//, 2nd part :-)+
-^ Apr 28 | Pavel Straňák | Diana McCarthy, Rob Koeling, Julie Weeds and John Carroll: //{{courses:unsupervised_acquisition_of_predominant_word_senses.pdf|Unsupervised Acquisition of Predominant Word Senses}}// in Computational Linguistics 33 (4), 2007 | +
- +
-=== Winter 2007/2008 === +
-^ Nov 26 | Markéta  Lopatková | Friedrich Otto: {{courses:rg:tr-ra-tutorial.ps|Restarting Automata (Notes for a Course)}}, Technical Report, Universitat kassel, 2004. | +
-^ Nov 19 | Dan Zeman | Anil Kumar Singh (अनिल कुमार सिंह), Jagadeesh Gorla: {{courses:rg:kumar-gorla-identification-of-languages.pdf|Identification of Languages and Encodings in a Multilingual Document}} {{courses:kumar-gorla-identification-of-languages.zip|Identification of Languages and Encodings in a Multilingual Document}}. In: Proceedings of the 3rd ACL SIGWAC Workshop on Web as Corpus, pp. 95-108. Louvain-la-Neuve, Belgium, 2007. | +
-^ Nov 12 | Otakar Smrž | M. Nowak, N. Komarova, P. Niyogi. [[http://people.cs.uchicago.edu/~niyogi/papersps/NKNnature.pdf|Computational and Evolutionary Aspects of Language]]. Nature, Vol. 417, pp. 611-617, 2002. | +
-^ Nov 5 | Jan Ptáček | Philipp Koehn; Hieu Hoang: **{{courses:factored-translation-models.pdf|Factored Translation Models}}**, EMNLP & CoNLL, Prague, 2007. | +
-^ Oct 29 | Miroslav Spousta | David Talbot; Miles Osborne Smoothed Bloom Filter Language Models: Tera-Scale LMs on the Cheap http://www.aclweb.org/anthology-new/D/D07/D07-1049.pdf | +
-^ Oct 17 | Zdeněk Žabokrtský | **Mary Hearne, John Tinsley, Ventsislav Zhechev, Andy Way (2007)**: Capturing Translational Divergences with a Statistical Tree-to-Tree Aligner {{seminare:hearneetal_tmi_07.pdf|HearneEtAl_TMI_07.pdf}} | +
- +
-=== Summer 2007 === +
- +
-=== Winter 2006/2007 === +
- +
-^ Nov 15 | | Glöckner, Ingo; Sven Hartrumpf; and Hermann Helbig (2006): Automatic knowledge acquisition by semantic analysis and assimilation of textual information {{:seminare:reading-group:gloeckner.ps|gloeckner.ps - plná verze}} | +
- +
-=== Summer 2006 ===+
  
-^ Mar 1  | Ondřej Bojar | D. Chiang: **A Hierarchical Phrase-Based Model for Statistical Machine Translation**, ACL, Ann Arbor, 2005. | +===== Reading Group for PhD students =====
-^ Mar 8  | Zdeněk Žabokrtský | S. Kahan: **The Meaning-Text Theory**. | +
-^ Mar 15 | Pavel Schlesinger | N. A. Smith and J. Eisner: **Contrastive Estimation: Training Log-Linear Models on Unlabeled Data**. ACL, Ann Arbor, 2005. | +
-^ Mar 22 | Pavel Pecina | D. Ravichandran, P. Pantel and E.Hovy: **Randomized Algorithms and NLP: Using Locality Sensitive Hash Functions for High Speed Noun Clustering**, ACL, Ann Arbor, 2005. | +
-^ Apr 5  | Pavel Straňák | Keh-Jiann Chen, Chi-Ching Luo, Ming-Chung Chang, Feng-Yi Chen, Chao-Jan Chen, Chu-Ren Huang: **Sinica Treebank: Design criteria, representational issues and implementation**. | +
-^ Apr 12 | Zdeněk Žabokrtský | P. Sgall: **Prague School Typology**. In Masayoshi Shibatani and Theodora Bynon (eds), Approaches to Language Typology. Clarendon Press,  Oxford, United Kingdom, 1995. | +
-^ Apr 19 | Barbora Hladká | B. Scholkopf and A.J. Smola:  **A Short Introduction to Learning Method with  Kernels**. | +
-^ May 3  | Otakar Smrž | || +
-^ May 10 | Barbora Hladká | B. Scholkopf and A.J. Smola:  **A Short Introduction to Learning Method with Kernels**.  |+
  
-=== Winter 2005/2006 ===+See the [[https://ufal.mff.cuni.cz/courses/rg/|website of PhD reading group]] (related also to a previous reading group called Deep Learning Seminar originally led by Milan Straka).
  
-^ Oct 19 | Kiril Ribarov    | **Non-projective Dependency Parsing using Spanning Tree Algorithms**. | 
-^ Oct 26 | Petr Podveský    | K. Crammer and Y. Singer: **Ultraconservative on-line algorithms for multiclass problems**, JMLR, 2003. | 
-^ Nov 2  | Jiří Havelka     | L. Georgiadis: **Arborescence optimization problems solvable by Edmonds’ algorithm**. | 
-^ Nov 9  | Barbora Hladká   | | 
-^ Nov 16 | Pavel Pecina     | B. Moore: **Discriminative Framework for Bilingual Word Alignment**, HLT/EMNLP, Vancouver, 2005. | 
-^ Nov 23 | Otakar Smrž         | Noah A. Smith, David A. Smith, Roy W. Tromble: **Context-Based Morphological Disambiguation with Random Fields**. || 
-^ Nov 30 | Václav Novák     | J. Eisner and D. Karakos: **Bootstrapping Without the Boot**, HLT/EMNLP, Vancouver, 2005. | 
-^ Dec 7  | Pavel Schlesinger| B. Taskar, D. Klein, M. Collins, D. Koller and C. Manning: **Max-Margin Parsing**, EMNLP, Barcelona, 2004. | 
-^ Dec 14 | Daniel Zeman     | D. Zeman, Z. Zabokrstky: **Improving Parsing Accuracy by Combining Diverse Dependecy Parsers**, IWPT, Vancouver, 2005. | 
-^ Jan 4  | Ondřej Bojar     | Franz Och: **Tutorial**, MT Summit, 2005. | 
-^ Jan 11 | Jiří Semecký     | M. Carpuat and D. Wu: **Word Sense Disambiguation vs. Statistical Machine Translation**. | 

[ Back to the navigation ] [ Back to the content ]