[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
native-language-identification-shared-task-2013 [2013/01/13 15:04]
ufal
native-language-identification-shared-task-2013 [2013/01/16 07:51]
ufal
Line 1: Line 1:
 ====== Native Language Identification Shared Task 2013 ====== ====== Native Language Identification Shared Task 2013 ======
  
-//A shared task in Native Language Identification (NLI) to identify the native language of a writer based solely on a sample of their writing.//  +//A shared task in Native Language Identification to identify the native language of a writer based solely on a sample of their writing.// 
  
   * **Home page:** [[https://sites.google.com/site/nlisharedtask2013/home]]   * **Home page:** [[https://sites.google.com/site/nlisharedtask2013/home]]
-  * **Team:** Barbora Hladka (contact person, related projects, data, ML), Martin Holub(algorithms, ML), Silvie Cinkova(features), ... +  * **Team:** Barbora Hladka (contact person, related projects, data, ML), Martin Holub (algorithms, ML), Silvie Cinkova (features), ... 
   * **Important Dates:**    * **Important Dates:** 
      * January 14 Training Data Release      * January 14 Training Data Release
Line 16: Line 15:
      * June 13 or 14 NLI Shared Task Presentations @ [[http://www.cs.rochester.edu/~tetreaul/naacl-bea8.html|BEA8 Workshop, Atlanta, GA, USA]]      * June 13 or 14 NLI Shared Task Presentations @ [[http://www.cs.rochester.edu/~tetreaul/naacl-bea8.html|BEA8 Workshop, Atlanta, GA, USA]]
    * **Data:** TBA    * **Data:** TBA
- +   * **References** 
 +       - Brooke, Julian, Greme Hirst. Native language detectin with 'cheap' learner corpora. In P//roceedings of the Conference on Learner Corpus Research//, Louvain-la-Neuve. 2011. 
 +          * learner corpora review;  
 +          * they discuss topic bias - Do we have to care about it in the NLI task? 
 +          * Feature set: [character|POS|word] n-grams, function words, features from machine translation, features from L1 corpora. How many features: ??? 
 +          * Machine learning algorithm: ??  
 +       - Wong, Sze-Meng Jojo, Dras Mark, Johnson, Mark. Topic Modeling for Native Language Identification. In Proceedings of Australasian Language Technology Association Workshop, pp. 115-124 ([[http://aclweb.org/anthology-new/U/U11/U11-1015.pdf|pdf]]).

[ Back to the navigation ] [ Back to the content ]