ADACA Discourse Workshop at COLING 2012
W1: Advances in discourse analysis and its computational aspects
Workshop general chair: Eva Hajičová, Charles University in Prague, Czech Republic
e-mail: hajicova at ufal.mff.cuni.cz
Technical co-organizers: Jiří Mírovský and Lucie Poláková
Workshop date: December 15th, 2012 (Saturday)
Brief description of the workshop
The aim of the workshop is twofold:
- to bring together and to spread an up-to-date information on advanced computationally oriented studies in discourse analysis, and
- to provoke a discussion on hot issues in the domain of the study in discourse, especially with respect to modern methodology and to computationally and corpus oriented research and its possible applications. Thus, the workshop may attract a rather broad (and cross-section) audience: those who are just starting their research in the given area will get enough stuff for thought how to proceed, and those who are in an advanced stage of their research will get a stimulating feedback from the floor and the discussion will make it possible for them to sharpen their ideas and plans.
To fulfil the workshop aims specified above, the workshop will attempt to provide a forum free of the conventional shape of unrelated presentations on the topic.
Please do not forget to submit the final non-anonymous version of the paper by November 15th!
September 30th, 2012 (11:59pm Cetral European time, GMT+2): Paper submission deadline October 31st, 2012: Paper accept/reject notification November 15th, 2012: Camera ready paper due
Call for papers
Both long (up to 14 A5 pages + references) and short papers (up to 8 A5 pages + references) are possible, please follow the instructions for the formats of submissions for papers for the main conference (see COLING website).
Submission and reviewing will be online, managed by the START system. The only accepted format of submitted papers is PDF. Submissions must be uploaded on the START system before the submission deadline, which is September 30th, 2012 (11:59pm Central European time, GTM+2). To submit a paper, go to the paper submission page. Do not forget to anonymize your submissions, the reviewing process will be double-blind.
The programme of the workshop will contain
- brief position papers given by prominent researchers who have had significant contributions to the field (see the list of invited speakers below), and
- contributions by workshop participants submitted and accepted (on the basis of a review procedure)
We believe that this procedure will help to concentrate on an intensive interaction and discussion of all the participants of the workshop.
The invitation to present a position paper has already been accepted by
Prof. Kristiina Jokinen, University of Helsinki, Finland, University of Tartu, Estonia
New Information in Wikitalk - story telling for information presentation (abstract, presentation)
Prof. Aravind Joshi, University of Pennsylvania, Philadelphia, USA
Remarks on some not so closed issues concerning discourse connectives (abstract, presentation)
Prof. Katheleen McKeown, Columbia University, New York, USA
What is needed and what can help for people who want to use discourse relations in tasks such as NL generation or text summarization. (abstract, presentation)
Massimo Poesio, University of Essex, Great Britain
Empirical methods in the study of anaphora: lessons learned, remaining problems (abstract)
10.00 – 10.20: Welcome, Introduction (Eva Hajičová)
10.20 – 11.00: Aravind Joshi: Remarks on some not so closed issues concerning discourse connectives (invited position paper)
11.00 – 11.25: Amba Kulkarni and Monali Das: Discourse Analysis of Sanskrit texts
11.25 – 12.00: TEA BREAK
12.00 – 12.40: Kathleen McKeown: Penn Discourse Treebank Relations and their Potential for Language Generation (invited position paper)
12.40 – 13.05: Nik Adilah Hanin Binti Zahri, Fumiyo Fukumoto and Suguru Matsuyoshi: Exploiting Discourse Relations between Sentences for Text Clustering
13.05 – 13.30: Fatemeh Torabi Asr and Vera Demberg: Measuring the Strength of Linguistic Cues for Discourse Relations
13.30 – 14.30: LUNCH
14.30 – 15.10: Kristiina Jokinen: New Information in Wikitalk - story telling for information presentation (invited position paper)
15.10 – 15.35: Pavlína Jínová, Jiří Mírovský and Lucie Poláková: Semi-Automatic Annotation of Intra-Sentential Discourse Relations in PDT
15.35 – 16.00: Andreas Peldszus and David Schlangen: Incremental Construction of Robust but Deep Semantic Representations for Use in Responsive Dialogue Systems
16.00 – 16.30: TEA BREAK
16.30 – 17.10: Massimo Poesio: Empirical methods in the study of anaphora: lessons learned, remaining problems (invited position paper)
17.10 – 18.00: General discussion: Where we are and where to go
Instructions for Presenters
The timeslot for the position papers is 40 minutes (30 minutes + 10 minutes for discussion), the timeslot for the accepted papers is 25 minutes (15 minutes + 10 minutes for discussion). Please prepare your presentations accordingly.
As we are not sure about the software available at the equipment provided by the local organizers, please prepare your presentations in PDF format or bring your own laptop.
Among the issues proposed to be discussed there are
- Intra-sentential and inter-sentential relations: commonalities and differences
- Explicit and implicit relations of coherence of discourse; means of implicit relations.
- What corpora annnotation of discourse relations and related phenomena can reveal?
- Annotation efforts undertaken in languages other than English, and their contribution to advances in Language Technologies and to a greater cross-linguistic understanding of coherence relations, their complexity and their lexicalization.
- Advances in empirically-driven discourse-level methods of language processing (discourse parsing, sense detection) and their impact on theoretical understanding of discourse structure
- Discourse and dialogue, commonalities and differences (e.g. dialogue act standardisation)
- Text segmentation and modelling of coherence in texts, twits, dialogues, monologues etc.
- Structures other than coherence relations that discourse manifests (e.g. layout or “document structure”), or structures specific to particular genres (news report, scientific papers, errata, etc.)