[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
courses:rg [2010/04/07 17:20]
marecek
courses:rg [2017/10/12 15:44] (current)
popel
Line 1: Line 1:
 ~~NOTOC~~ ~~NOTOC~~
  
 +===== Reading Group =====  ​
 +Official name of this course is [[https://​is.cuni.cz/​studium/​predmety/​index.php?​do=predmet&​kod=NPFL095|NPFL095]] **Modern Methods in Computational Linguistics**. It is a continuation of informal Reading Group (RG) meetings.
  
 +Since 2016, the wiki is moved to https://​github.com/​ufal/​NPFL095/​wiki and the mailing list to [[https://​groups.google.com/​forum/#​!forum/​npfl095|npfl095@googlegroups.com]].
 +See also [[courses:​rg:​past|an overview of past meetings]], [[courses:​rg:​wishlist|an outdated wishlist]] and [[https://​github.com/​ufal/​rg/​wiki|Machine Learning RG (active in 2014)]].
  
  
  
  
- 
- 
- 
- 
- 
- 
- 
- 
- 
- 
- 
- 
- 
- 
- 
- 
-===== Reading Group ===== 
- 
-^ Contact ​     | popel at ufal.mff.cuni.cz | 
-^ Mailing list | rg at ufal.mff.cuni.cz ​    | 
-^ List Archive | [[http://​ufal.mff.cuni.cz/​mailman/​listinfo/​rg]] | 
-^ Meetings ​    | Mondays 15:00, in front of 422 | 
- 
-=== Wishlist === 
- 
- 
-  * Mark Johnson: [[http://​acl.ldc.upenn.edu/​D/​D07/​D07-1031.pdf|Why Doesn'​t EM Find Good HMM POS-Taggers?​]] (Ondřej) 
- 
-Další návrhy (konkrétní aplikace Gibbsova samplingu): 
-  * [[http://​www.aclweb.org/​anthology/​W/​W09/​W09-1114.pdf |Abhishek Arun, Chris Dyer, Barry Haddow, Phil Blunsom, Adam Lopez and Philipp Koehn: Monte Carlo Inference and Maximization for Phrase-based Translation. Conference on Computational Natural Language Learning, 2009. ]] 
-  * [[http://​homepages.inf.ed.ac.uk/​pblunsom/​pubs/​blunsom-acl09.pdf|Phil Blunsom, Trevor Cohn, Chris Dyer and Miles Osborne: A Gibbs Sampler for Phrasal Synchronous Grammar Induction. ACL-IJCNLP 2009]] 
-  * [[http://​homepages.inf.ed.ac.uk/​pblunsom/​pubs/​cohn-blunsom-emnlp09.pdf|Trevor Cohn and Phil Blunsom: A Bayesian Model of Syntax-Directed Tree to String Grammar Induction. EMNLP 2009.]] 
- 
-=== Summer 2010 === 
-^ date | **speaker** | **paper** | 
-^ Apr 12 | David Mareček | [[http://​acl.ldc.upenn.edu/​P/​P06/​P06-1033.pdf|Jens Nillson, Joakim Nivre, and Johan Hall: Graph Transformations in Data-Driven Dependency Parsing]] | 
-^ Mar 29 | Martin Popel | [[http://​acl.ldc.upenn.edu/​N/​N03/​N03-2002.pdf|Jeff Bilmes and Katrin Kirchhoff: Factored Language Models and Generalized Parallel Backoff]] (plus [[https://​www.ee.washington.edu/​techsite/​papers/​documents/​UWEETR-2008-0004.pdf|tutorial]]) | 
-^ Mar 22 | Zdeněk Žabokrtský | [[http://​www.mitpressjournals.org/​doi/​pdf/​10.1162/​coli.2010.36.1.36103|Deniz Yuret and Mehmet Ali Yatbaz: The Noisy Channel Model for Unsupervised Word Sense Disambiguation,​ Computational Linguistics 2010]] | 
-^ Mar 15 | Ondřej Bojar | [[ 
-http://​www.mt-archive.info/​MTS-2009-Koehn-2.pdf|Philipp Koehn and Barry Haddow: Interactive Assistance to Human Translators using Statistical Machine Translation Methods, MT Summit XII, 2009.]] | 
-^ Mar 8 |  | startup meeting | 
- 
-=== Winter 2009/2010 === 
- 
-^ date | **speaker** | **paper** | **přečíst?​** | 
-^ Dec 14 | Martin Popel | [[http://​nlp.csie.ncnu.edu.tw/​~shin/​acl-ijcnlp2009/​proceedings/​CDROM/​EMNLP/​pdf/​EMNLP042.pdf|Wei Lu, Hwee Tou Ng, Wee Sun Lee: Natural Language Generation with Tree Conditional Random Fields]] | | 
-^ Dec 7 | David Mareček | [[http://​www.aclweb.org/​anthology/​P/​P09/​P09-1040.pdf|Nivre,​ J.: Non-Projective Dependency Parsing in Expected Linear Time. ACL 2009]] | | 
-^ Nov 23, Nov 30 | Jana Straková | [[http://​citeseerx.ist.psu.edu/​viewdoc/​download?​doi=10.1.1.23.9849&​rep=rep1&​type=pdf|John Lafferty, Andrew McCallum and Fernando Pereira: Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data]], [[http://​www.inference.phy.cam.ac.uk/​hmw26/​papers/​crf_intro.pdf|Hanna M. Wallach. Conditional Random Fields: An Introduction]] | | 
-^ Nov 9 | Zdeněk Žabokrtský | [[http://​www.aclweb.org/​anthology/​E/​E09/​E09-1018.pdf|Eugene Charniak and Micha Elsner: EM Works for Pronoun Anaphora Resolution. EACL 2009]] ​ | | 
-^ Oct 26, Nov 2 | Ondřej Bojar | [[http://​www.isi.edu/​natural-language/​people/​bayes-with-tears.pdf|Kevin Knight: Bayesian Inference with Tears. September 2009.]]\\ článek byl delší, ale čtivý a přesně takové jsou i [[courses:​rg:​bayes-with-tears|Ondrovy poznámky]] k němu | Ano, i s úkoly | 
-^ Oct 19 | Eda Bejček | [[http://​www.aclweb.org/​anthology/​E/​E09/​E09-1082.pdf|Josh Schroeder, Trevor Cohn, and Philipp Koehn: Word Lattices for Multi-Source Translation. EACL 2009]]\\ přehledné rozdělení metod; dobře, že udělali tolik experimentů,​ škoda některých nepodložených interpretací;​ potřebná znalost Mojžíše; omezili množství trénovacích dat -- chybí test, zda více dat není účinnější než více jazyků; klesá skutečně MAX s množstvím jazyků? (to je divné, neměly by se tedy vážit výsledky jednotlivých systémů (viz věta s "​little benefit",​ 2.1)? A nebo není příčinou spíš než přidání šestého jazyka přidání špatného jazyka (Table 6)? Jak by dopadly testy po dvojicích?​);​ v závěru 2.3 vynechávají reordering s odkazem na diversitu zdrojových jazyků -- to nemusí platit |  Ano  | 
-^ Oct 15 | Pavel Pecina | [[http://​portal.acm.org/​citation.cfm?​id=1401975|Daniel David Walker, Eric K. Ringger: Model-based document clustering with a collapsed gibbs sampler]], [[courses:​rg:​2009-10-15-tabule|Fotky tabule]] | | 
-^ Oct 12 | Pavel Schlesinger | Gibbsův sampling (http://​en.wikipedia.org/​wiki/​Gibbs_sampling)| | 
- 
- 
- 
-=== Summer 2009 === 
- 
-^ date | **speaker** | **paper** | 
-^ May 25 | David Mareček | [[http://​www.lrec-conf.org/​proceedings/​lrec2008/​pdf/​286_paper.pdf|Christian Hanig, Stefan Bordag, Uwe Quasthoff: UnsuParse: Unsupervised Parsing with unsupervised Part of Speech tagging. LREC 2009]] | 
-^ May 11 | Václav Novák | [[http://​newdesign.aclweb.org/​anthology-new/​E/​E09/​E09-1007.pdf|Julien Ah-Pine, Guillaume Jacquet: Clique-Based Clustering for improving Named Entity Recognition systems. EACL 2009]] | 
-^ May 4 | Dan Zeman | [[http://​www.aclweb.org/​anthology/​P/​P08/​P08-1059|Kristina Toutanova, Hisami Suzuki, Achim Ruopp: Applying Morphology Generation Models to Machine Translation. ACL 2008, Columbus, Ohio]] | 
-^ Apr 6 | Zdeněk Žabokrtský | [[http://​pauillac.inria.fr/​~pdenis/​papers/​emnlp08.pdf|Pascal Denis and Jason Baldridge, Specialized models and ranking for coreference resolution]] | 
-^ Mar 30 | Pavel Schlesinger |  | 
-^ Mar 9 | Pavel Pecina |  | 
- 
-=== Winter 2008/2009 === 
- 
-^ date | **speaker** | **paper** | 
-^ Mon Jan 5 | Martin Popel | Thorsten Brants, Ashok C. Popat, Peng Xu, Franz J. Och, Jeffrey Dean: {{courses:​rg:​2007brants-large_language_models_in_mt.pdf|Large Language Models in Machine Translation}},​ 2007 | 
-^ Mon Dec 1 | Jan Ptáček | {{courses:​rg:​goldwater-mcclosky-czech-english-mt-0-33.pdf|Improving Statistical MT through Morphological Analysis}} | 
-^ Mon Nov 24 | Pavel Češka | [[http://​citeseerx.ist.psu.edu/​viewdoc/​download;​jsessionid=656CAA9B55D39BC763A26946D91E7FBD?​doi=10.1.1.61.3358&​rep=rep1&​type=pdf|A TAG-based noisy channel model of speech repairs]] | 
-^ Wed Nov 19 | Ondřej Bojar | [[http://​aclweb.org/​anthology-new/​P/​P08/​P08-1023.pdf|Forest Reranking: Discriminative Parsing with Non-Local Features by Liang Huang]] (see Google Tech Talks), [[http://​aclweb.org/​anthology-new/​P/​P08/​P08-1023.pdf|Forest-Based Translation]] by Haitao Mi and Liang Huang and Qun Liu | 
-^ Mon Nov 10 | Zdeněk Žabokrtský | Katja Filippova and Michael Strube: [[http://​www.eml-research.de/​nlp/​papers/​filippova.emnlp08.pdf|Sentence Fusion via Dependency Graph Compression]],​ 2008 | 
-^ Mon Nov 3 | Jiří Mírovský | Alexander E. Richman and Patrick Schone: Mining Wiki Resources for Multilingual Named Entity Recognition,​ ACL, Columbus, 2008. | 
-^ Wed Oct 20 | Jan Raab | Libin Shen, Giorgio Satta, and Aravind K. Joshi: Guided Learning for Bidirectional Sequence Classification,​ ACL, Prague, 2007. | 
- 
-A source of inspiration:​ [[http://​www.statmt.org/​ued/?​n=Public.WeeklyMeeting Edinburgh Reading Group]] 
- 
-=== Summer 2008 === 
- 
-^ date | **speaker** | **paper** | 
-^ Mar 17 | Pavel Schlesinger | Aria Haghighi and Dan Klein: //​[[http://​www.eecs.berkeley.edu/​~aria42/​pubs/​acl07-hdp-coref.pdf|Unsupervised Coreference Resolution in a Nonparametric Bayesian Model]]//, ACL, Prague, 2007.   ​| ​ 
-^ Mar 31 | Pavel Schlesinger | //​Unsupervised Coreference Resolution in a Nonparametric Bayesian Model//, 2nd part :-)| 
-^ Apr 28 | Pavel Straňák | Diana McCarthy, Rob Koeling, Julie Weeds and John Carroll: //​{{courses:​unsupervised_acquisition_of_predominant_word_senses.pdf|Unsupervised Acquisition of Predominant Word Senses}}// in Computational Linguistics 33 (4), 2007 | 
- 
-=== Winter 2007/2008 === 
-^ Nov 26 | Markéta ​ Lopatková | Friedrich Otto: {{courses:​rg:​tr-ra-tutorial.ps|Restarting Automata (Notes for a Course)}}, Technical Report, Universitat kassel, 2004. | 
-^ Nov 19 | Dan Zeman | Anil Kumar Singh (अनिल कुमार सिंह),​ Jagadeesh Gorla: {{courses:​rg:​kumar-gorla-identification-of-languages.pdf|Identification of Languages and Encodings in a Multilingual Document}} {{courses:​kumar-gorla-identification-of-languages.zip|Identification of Languages and Encodings in a Multilingual Document}}. In: Proceedings of the 3rd ACL SIGWAC Workshop on Web as Corpus, pp. 95-108. Louvain-la-Neuve,​ Belgium, 2007. | 
-^ Nov 12 | Otakar Smrž | M. Nowak, N. Komarova, P. Niyogi. [[http://​people.cs.uchicago.edu/​~niyogi/​papersps/​NKNnature.pdf|Computational and Evolutionary Aspects of Language]]. Nature, Vol. 417, pp. 611-617, 2002. | 
-^ Nov 5 | Jan Ptáček | Philipp Koehn; Hieu Hoang: **{{courses:​factored-translation-models.pdf|Factored Translation Models}}**, EMNLP & CoNLL, Prague, 2007. | 
-^ Oct 29 | Miroslav Spousta | David Talbot; Miles Osborne Smoothed Bloom Filter Language Models: Tera-Scale LMs on the Cheap http://​www.aclweb.org/​anthology-new/​D/​D07/​D07-1049.pdf | 
-^ Oct 17 | Zdeněk Žabokrtský | **Mary Hearne, John Tinsley, Ventsislav Zhechev, Andy Way (2007)**: Capturing Translational Divergences with a Statistical Tree-to-Tree Aligner {{seminare:​hearneetal_tmi_07.pdf|HearneEtAl_TMI_07.pdf}} | 
- 
-=== Summer 2007 === 
- 
-=== Winter 2006/2007 === 
- 
-^ Nov 15 | | Glöckner, Ingo; Sven Hartrumpf; and Hermann Helbig (2006): Automatic knowledge acquisition by semantic analysis and assimilation of textual information {{:​seminare:​reading-group:​gloeckner.ps|gloeckner.ps - plná verze}} | 
- 
-=== Summer 2006 === 
- 
-^ Mar 1  | Ondřej Bojar | D. Chiang: **A Hierarchical Phrase-Based Model for Statistical Machine Translation**,​ ACL, Ann Arbor, 2005. | 
-^ Mar 8  | Zdeněk&​nbsp;​Žabokrtský | S. Kahan: **The Meaning-Text Theory**. | 
-^ Mar 15 | Pavel&​nbsp;​Schlesinger | N. A. Smith and J. Eisner: **Contrastive Estimation: Training Log-Linear Models on Unlabeled Data**. ACL, Ann Arbor, 2005. | 
-^ Mar&​nbsp;​22 | Pavel Pecina | D. Ravichandran,​ P. Pantel and E.Hovy: **Randomized Algorithms and NLP: Using Locality Sensitive Hash Functions for High Speed Noun Clustering**,​ ACL, Ann Arbor, 2005. | 
-^ Apr 5  | Pavel Straňák | Keh-Jiann Chen, Chi-Ching Luo, Ming-Chung Chang, Feng-Yi Chen, Chao-Jan Chen, Chu-Ren Huang: **Sinica Treebank: Design criteria, representational issues and implementation**. | 
-^ Apr 12 | Zdeněk Žabokrtský | P. Sgall: **Prague School Typology**. In Masayoshi Shibatani and Theodora Bynon (eds), Approaches to Language Typology. Clarendon Press, ​ Oxford, United Kingdom, 1995. | 
-^ Apr 19 | Barbora Hladká | B. Scholkopf and A.J. Smola: ​ **A Short Introduction to Learning Method with  Kernels**. | 
-^ May 3  | Otakar Smrž | || 
-^ May 10 | Barbora Hladká | B. Scholkopf and A.J. Smola: ​ **A Short Introduction to Learning Method with Kernels**. ​ | 
- 
-=== Winter 2005/2006 === 
- 
-^ Oct 19 | Kiril Ribarov ​   | **Non-projective Dependency Parsing using Spanning Tree Algorithms**. | 
-^ Oct 26 | Petr Podveský ​   | K. Crammer and Y. Singer: **Ultraconservative on-line algorithms for multiclass problems**, JMLR, 2003. | 
-^ Nov 2  | Jiří Havelka ​    | L. Georgiadis: **Arborescence optimization problems solvable by Edmonds’ algorithm**. | 
-^ Nov 9  | Barbora Hladká ​  | | 
-^ Nov 16 | Pavel Pecina ​    | B. Moore: **Discriminative Framework for Bilingual Word Alignment**,​ HLT/EMNLP, Vancouver, 2005. | 
-^ Nov 23 | Otakar Smrž         | Noah A. Smith, David A. Smith, Roy W. Tromble: **Context-Based Morphological Disambiguation with Random Fields**. || 
-^ Nov&​nbsp;​30 | Václav Novák ​    | J. Eisner and D. Karakos: **Bootstrapping Without the Boot**, HLT/EMNLP, Vancouver, 2005. | 
-^ Dec 7  | Pavel&​nbsp;​Schlesinger| B. Taskar, D. Klein, M. Collins, D. Koller and C. Manning: **Max-Margin Parsing**, EMNLP, Barcelona, 2004. | 
-^ Dec 14 | Daniel Zeman     | D. Zeman, Z. Zabokrstky: **Improving Parsing Accuracy by Combining Diverse Dependecy Parsers**, IWPT, Vancouver, 2005. | 
-^ Jan 4  | Ondřej Bojar     | Franz Och: **Tutorial**,​ MT Summit, 2005. | 
-^ Jan 11 | Jiří Semecký ​    | M. Carpuat and D. Wu: **Word Sense Disambiguation vs. Statistical Machine Translation**. | 

[ Back to the navigation ] [ Back to the content ]