
Institute of Formal and Applied Linguistics Wiki


Differences

This shows you the differences between two versions of the page.

courses:rg [2009/06/03 10:38]
marecek
courses:rg [2016/10/06 08:54]
popel
Line 1: Line 1:
 ~~NOTOC~~ ~~NOTOC~~
  
 +===== Reading Group =====
 +The official name of this course is [[https://is.cuni.cz/studium/predmety/index.php?do=predmet&kod=NPFL095|NPFL095]] **Modern Methods in Computational Linguistics**. It is a continuation of the informal Reading Group (RG) meetings. Requirements for getting credits:
 +  * presenting one paper,
 +    * Select a date (write your name in the schedule below) before October 13.
 +    * If no paper is assigned to the date, suggest 2--3 papers you would like to present to [[mailto:popel@ufal.mff.cuni.cz|me]] (with pdf links and your preferences) before October 20. Ideally, form a group of 2--4 students presenting papers on a common topic (progressing from the basics to more advanced papers).
 +    * Prepare your presentation and 3--5 quiz questions. At least 3 of the questions should ask for a specific answer, e.g. "write an equation for...", "given training set X=([dog,N],[cat,Y]), what is the number..." (not "what do you think about..."). The first question should be quite easy to answer for those who have read the whole paper. The last question may be a tricky one. Send me the questions two weeks before your presentation. We may discuss the paper and refine the questions.
 +    * One week before the presentation, write the questions on a dedicated wiki page here. Send a reminder (the questions and a link to the pdf of the paper) to rg@ufal.mff.cuni.cz by Monday 15:45.
  
-===== Reading Group =====
 +  * active participation in the discussions, which requires reading the papers in advance and attending the meetings,
 +  * sending your answers to me and the presenter by Saturday 23:59 (so the presenter can go through all answers before the presentation and focus more on problematic parts).
 +  * In case of more than three missed meetings or deadlines, additional work (e.g. reports or answers to tricky questions) will be required.
  
-^ Contact      | pecina at ufal.mff.cuni.cz / bojar at ... |
-^ Mailing list | rg at ufal.mff.cuni.cz |
-^ List Archive | [[http://ufal.mff.cuni.cz/mailman/listinfo/rg]] |
-^ Meetings     | Mondays 15:00 and bi-weekly Wednesday mornings |
 +All questions, reports and presented papers must be in English. The presentations are in English by default, but if everyone present agrees, they may be held in Czech.
  
-=== Summer 2009 ===
 +^ Contact      | popel@ufal.mff.cuni.cz |
 +^ Mailing list | rg@ufal.mff.cuni.cz |
 +^ Meetings     | Tuesdays 15:40, S1 |
 +^ Past meetings | [[courses:rg:past|courses:rg:past]] |
 +^ Inspiration  | [[courses:rg:wishlist|courses:rg:wishlist]] |
 +^ Other reading groups | [[https://github.com/ufal/rg/wiki|Machine Learning RG]] |
  
-^ date | **speaker** | **paper** | **expected audience** |
-^ May 25 | David Mareček | [[http://www.lrec-conf.org/proceedings/lrec2008/pdf/286_paper.pdf|Christian Hanig, Stefan Bordag, Uwe Quasthoff: UnsuParse: Unsupervised Parsing with unsupervised Part of Speech tagging, LREC 2009]] | |
-^ May 11 | V.N. | [[http://newdesign.aclweb.org/anthology-new/E/E09/E09-1007.pdf|Julien Ah-Pine, Guillaume Jacquet: Clique-Based Clustering for improving Named Entity Recognition systems, EACL 2009]] | |
-^ May 4 | Dan Zeman | [[http://www.aclweb.org/anthology/P/P08/P08-1059|Kristina Toutanova, Hisami Suzuki, Achim Ruopp: Applying Morphology Generation Models to Machine Translation. ACL 2008, Columbus, Ohio]] | |
-^ Apr 6 | Zdeněk Žabokrtský | [[http://pauillac.inria.fr/~pdenis/papers/emnlp08.pdf|Pascal Denis and Jason Baldridge: Specialized models and ranking for coreference resolution]] | |
-^ Mar 30 | Pavel Schlesinger | | |
-^ Mar 9 | Pavel Pecina | | |
 +=== Autumn&Winter 2016/2017 ===
 +^ date   | **speaker** | **paper** |
 +^ Oct  4 | | startup meeting |
 +^ Oct 11 | Martin Popel | Kevin Knight, Beáta Megyesi and Christiane Schaefer: [[http://www.isi.edu/natural-language/people/copiale-11.pdf|The Copiale Cipher]], BUCC 2011, you can watch a [[https://www.youtube.com/watch?v=Eam0Tk-1FyI|trailer]] |
 +^ Oct 18 | Daniela Bodanská | Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu: [[http://aclweb.org/anthology-new/P/P02/P02-1040.pdf|BLEU: a Method for Automatic Evaluation of Machine Translation]], ACL 2002. [[courses:rg:2014:bleu|Questions]] |
 +^ Oct 25 | | Michael Collins: [[http://ucrel.lancs.ac.uk/acl/W/W02/W02-1001.pdf|Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms]], EMNLP 2002. [[courses:rg:2014:perceptron|Questions]] |
 +^ Nov 1  | | Andrew McCallum, Dayne Freitag, Fernando Pereira: [[http://www.ai.mit.edu/courses/6.891-nlp/READINGS/maxent.pdf|Maximum Entropy Markov Models for Information Extraction and Segmentation]], Conference on Machine Learning 2000. [[http://courses.ischool.berkeley.edu/i290-dm/s11/SECURE/gidofalvi.pdf|slides]] [[courses:rg:2014:memm|Question]] |
 +^ <del>Nov 8</del> | --- | no RG (Dean's day) |
 +^ Nov 15 | | John Lafferty, Andrew McCallum, Fernando Pereira: [[http://www.cs.utah.edu/~piyush/teaching/crf.pdf|Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data]], 2001. [[courses:rg:2014:crf|Questions]] |
 +^ Nov 22 | | Tomas Mikolov, Kai Chen, Greg Corrado, Jeffrey Dean: [[http://arxiv.org/pdf/1301.3781.pdf|Efficient Estimation of Word Representations in Vector Space]], ICLR 2013 |
 +^ Nov 29 | | |
 +^ Dec 6  | | |
 +^ Dec 13 | | |
  
-=== Winter 2008/2009 === 
  
-^ date | **speaker** | **paper** | **expected audience** | 
-^ Mon Jan 5 | Martin Popel | Thorsten Brants, Ashok C. Popat, Peng Xu, Franz J. Och, Jeffrey Dean: {{courses:​rg:​2007brants-large_language_models_in_mt.pdf|Large Language Models in Machine Translation}},​ 2007 | | 
-^ Mon Dec 1 | Jan Ptáček | {{courses:​rg:​goldwater-mcclosky-czech-english-mt-0-33.pdf|Improving Statistical MT through Morphological Analysis}} | | 
-^ Mon Nov 24 | Pavel Češka | [[http://​citeseerx.ist.psu.edu/​viewdoc/​download;​jsessionid=656CAA9B55D39BC763A26946D91E7FBD?​doi=10.1.1.61.3358&​rep=rep1&​type=pdf|A TAG-based noisy channel model of speech repairs]] | | 
-^ Wed Nov 19 | Ondřej Bojar | [[http://​aclweb.org/​anthology-new/​P/​P08/​P08-1023.pdf|Forest Reranking: Discriminative Parsing with Non-Local Features by Liang Huang]] (see Google Tech Talks), [[http://​aclweb.org/​anthology-new/​P/​P08/​P08-1023.pdf|Forest-Based Translation]] by Haitao Mi and Liang Huang and Qun Liu | 4-5 | 
-^ Mon Nov 10 | Zdeněk Žabokrtský | Katja Filippova and Michael Strube: [[http://​www.eml-research.de/​nlp/​papers/​filippova.emnlp08.pdf|Sentence Fusion via Dependency Graph Compression]],​ 2008 | | 
-^ Mon Nov 3 | Jiří Mírovský | Alexander E. Richman and Patrick Schone: Mining Wiki Resources for Multilingual Named Entity Recognition,​ ACL, Columbus, 2008. | | 
-^ Wed Oct 20 | Jan Raab | Libin Shen, Giorgio Satta, and Aravind K. Joshi: Guided Learning for Bidirectional Sequence Classification,​ ACL, Prague, 2007. | | 
  
-A source of inspiration: [[http://www.statmt.org/ued/?n=Public.WeeklyMeeting|Edinburgh Reading Group]]
  
-=== Summer 2008 === 
- 
-^ date | **speaker** | **paper** | **expected audience** | 
-^ Mar 17 | Pavel Schlesinger | Aria Haghighi and Dan Klein: //​[[http://​www.eecs.berkeley.edu/​~aria42/​pubs/​acl07-hdp-coref.pdf|Unsupervised Coreference Resolution in a Nonparametric Bayesian Model]]//, ACL, Prague, 2007.   | 4 | 
-^ Mar 31 | Pavel Schlesinger | //​Unsupervised Coreference Resolution in a Nonparametric Bayesian Model//, 2nd part :-)| | 
-^ Apr 14 | | | | 
-^ Apr 28 | Pavel Straňák | Diana McCarthy, Rob Koeling, Julie Weeds and John Carroll: //​{{courses:​unsupervised_acquisition_of_predominant_word_senses.pdf|Unsupervised Acquisition of Predominant Word Senses}}// in Computational Linguistics 33 (4), 2007 | 3 | 
-^ May 5 | | | | 
-^ May 19 | | | | 
-^ Jun 2 | | | | 
- 
-=== Winter 2007/2008 === 
-^ Nov 26 | Markéta Lopatková | Friedrich Otto: {{courses:rg:tr-ra-tutorial.ps|Restarting Automata (Notes for a Course)}}, Technical Report, Universität Kassel, 2004. |
-^ Nov 19 | Dan Zeman | Anil Kumar Singh (अनिल कुमार सिंह),​ Jagadeesh Gorla: {{courses:​rg:​kumar-gorla-identification-of-languages.pdf|Identification of Languages and Encodings in a Multilingual Document}} {{courses:​kumar-gorla-identification-of-languages.zip|Identification of Languages and Encodings in a Multilingual Document}}. In: Proceedings of the 3rd ACL SIGWAC Workshop on Web as Corpus, pp. 95-108. Louvain-la-Neuve,​ Belgium, 2007. | 
-^ Nov 12 | Otakar Smrž | M. Nowak, N. Komarova, P. Niyogi. [[http://​people.cs.uchicago.edu/​~niyogi/​papersps/​NKNnature.pdf|Computational and Evolutionary Aspects of Language]]. Nature, Vol. 417, pp. 611-617, 2002. | 
-^ Nov 5 | Jan Ptáček | Philipp Koehn; Hieu Hoang: **{{courses:​factored-translation-models.pdf|Factored Translation Models}}**, EMNLP & CoNLL, Prague, 2007. | 
-^ Oct 29 | Miroslav Spousta | David Talbot; Miles Osborne: Smoothed Bloom Filter Language Models: Tera-Scale LMs on the Cheap, http://www.aclweb.org/anthology-new/D/D07/D07-1049.pdf |
-^ Oct 17 | Zdeněk Žabokrtský | **Mary Hearne, John Tinsley, Ventsislav Zhechev, Andy Way (2007)**: Capturing Translational Divergences with a Statistical Tree-to-Tree Aligner {{seminare:​hearneetal_tmi_07.pdf|HearneEtAl_TMI_07.pdf}} | 
- 
-=== Summer 2007 === 
- 
-^ Feb 21 | | | 
-^ Feb 28 | | |
-^ Mar 7  | | | 
-^ Mar 14 | | | 
-^ Mar 21 | | | 
-^ Mar 28 | | | 
-^ Apr 4  | | | 
-^ Apr 11 | | | 
-^ Apr 18 | | | 
- 
- 
- 
-^ Apr 25 | | | 
-^ May 2  | | | 
-^ May 9  | | | 
-^ May 16 | | | 
- 
-=== Winter 2006/2007 === 
- 
-^ Oct 4  | | | 
-^ Oct 11 | | | 
-^ Oct 18 | | | 
-^ Oct 25 | | | 
-^ Nov 1  | | | 
-^ Nov 8  | | | 
-^ Nov 15 | | Glöckner, Ingo; Sven Hartrumpf; and Hermann Helbig (2006): Automatic knowledge acquisition by semantic analysis and assimilation of textual information {{:seminare:reading-group:gloeckner.ps|gloeckner.ps - full version}} |
-^ Nov 22 | | | 
-^ Nov 29 | | | 
-^ Dec 6  | | | 
-^ Dec 13 | | | 
-^ Dec 20 | | |
- 
-=== Summer 2006 === 
- 
-^ Mar 1  | Ondřej Bojar | D. Chiang: **A Hierarchical Phrase-Based Model for Statistical Machine Translation**,​ ACL, Ann Arbor, 2005. | 
-^ Mar 8  | Zdeněk Žabokrtský | S. Kahan: **The Meaning-Text Theory**. |
-^ Mar 15 | Pavel Schlesinger | N. A. Smith and J. Eisner: **Contrastive Estimation: Training Log-Linear Models on Unlabeled Data**. ACL, Ann Arbor, 2005. |
-^ Mar 22 | Pavel Pecina | D. Ravichandran, P. Pantel and E. Hovy: **Randomized Algorithms and NLP: Using Locality Sensitive Hash Functions for High Speed Noun Clustering**, ACL, Ann Arbor, 2005. |
-^ Apr 5  | Pavel Straňák | Keh-Jiann Chen, Chi-Ching Luo, Ming-Chung Chang, Feng-Yi Chen, Chao-Jan Chen, Chu-Ren Huang: **Sinica Treebank: Design criteria, representational issues and implementation**. | 
-^ Apr 12 | Zdeněk Žabokrtský | P. Sgall: **Prague School Typology**. In Masayoshi Shibatani and Theodora Bynon (eds), Approaches to Language Typology. Clarendon Press, ​ Oxford, United Kingdom, 1995. | 
-^ Apr 19 | Barbora Hladká | B. Scholkopf and A.J. Smola: ​ **A Short Introduction to Learning Method with  Kernels**. | 
-^ Apr 26 | | || 
-^ May 3  | Otakar Smrž | || 
-^ May 10 | Barbora Hladká | B. Scholkopf and A.J. Smola: ​ **A Short Introduction to Learning Method with Kernels**. ​ | 
- 
-=== Winter 2005/2006 === 
- 
-^ Oct 19 | Kiril Ribarov ​   | **Non-projective Dependency Parsing using Spanning Tree Algorithms**. | 
-^ Oct 26 | Petr Podveský ​   | K. Crammer and Y. Singer: **Ultraconservative on-line algorithms for multiclass problems**, JMLR, 2003. | 
-^ Nov 2  | Jiří Havelka ​    | L. Georgiadis: **Arborescence optimization problems solvable by Edmonds’ algorithm**. | 
-^ Nov 9  | Barbora Hladká ​  | | 
-^ Nov 16 | Pavel Pecina ​    | B. Moore: **Discriminative Framework for Bilingual Word Alignment**,​ HLT/EMNLP, Vancouver, 2005. | 
-^ Nov 23 | Otakar Smrž         | Noah A. Smith, David A. Smith, Roy W. Tromble: **Context-Based Morphological Disambiguation with Random Fields**. || 
-^ Nov 30 | Václav Novák     | J. Eisner and D. Karakos: **Bootstrapping Without the Boot**, HLT/EMNLP, Vancouver, 2005. |
-^ Dec 7  | Pavel Schlesinger | B. Taskar, D. Klein, M. Collins, D. Koller and C. Manning: **Max-Margin Parsing**, EMNLP, Barcelona, 2004. |
-^ Dec 14 | Daniel Zeman     | D. Zeman, Z. Žabokrtský: **Improving Parsing Accuracy by Combining Diverse Dependency Parsers**, IWPT, Vancouver, 2005. |
-^ Jan 4  | Ondřej Bojar     | Franz Och: **Tutorial**,​ MT Summit, 2005. | 
-^ Jan 11 | Jiří Semecký ​    | M. Carpuat and D. Wu: **Word Sense Disambiguation vs. Statistical Machine Translation**. | 
