Differences
This shows you the differences between two versions of the page.
draft [2009/07/15 14:12] ptacek |
draft [2009/07/15 23:56] ptacek |
||
---|---|---|---|
Line 6: | Line 6: | ||
+ | ====== Description of Czech Companion November Demonstrator ====== | ||
- | ====== Description of Czech Companion | + | The Czech version of the Companion |
- | The Czech version of Companion deals with the Reminiscing about User's Photos scenario, taking advantage of data recorded in the first phase of the project. The basic architecture | + | The dialog |
- | however | + | |
- | photopal domain, recorded corpus, that DAFs (reusing SHEFF DM integrated through Inamode Relayer (TID)) are suitable for it, moreover reusable for expected POMDP DM from UOX (reuse states, let pomdp' | + | DAFs covering selected topics contain not only Companion replies mined from the corpora, but also new human-authored assessments, remarks and glosses to provide longer system utterances in order to encourage the user to tell more. |
- | reply types and the way they are implemented, | + |
- | NLP server with TectoMT, ASR/TTS/SR client, connected over network | + |
- | XXX JPta | + | |
- | advances in Czech NLU (on reconstructed spoken data): 300-500 sentences(? | + | {{user:ptacek: |
- | POS ? analyze, generate and check ' | + |
+ | ===== Automatic Speech Recognition (WP 5.1)===== | ||
+ | features: improved language models, real-time speaker adaptation | ||
+ | performance indicator: WER | ||
- | ===== Automatic Speech Recognition (WP 5.1)===== | ||
- | features: improved language models, real-time speaker adaptation | ||
- | performance indicator: WER | ||
- | ===== Speech Reconstruction (WP 5.1 ???) ===== | + | ===== Speech Reconstruction (WP 5.2) ===== |
- | features: omit filler phrases, irrelevant speech events, false starts, repetitions, | + | features: omit filler phrases, |
- | implementation (include this info?): Moses trained on the corpus | + | performance indicator: BLEU score between actual output and manually reconstructed sentences from corpora |
- | performance indicator: BLEU score (overall scoring of all features) to annotated corpora from T5.2.1., some | + |
- | XXX Mirek | + | |
+ | |||
+ | |||
===== Morphology Analyzer and POS tagging (WP 5.2) ===== | ===== Morphology Analyzer and POS tagging (WP 5.2) ===== | ||
- | features: XXX Mirek/Johanka | + | features: |
- | performance indicator: accuracy | + | performance indicator: |
+ | |||
+ | |||
===== Syntactic Parsing (WP 5.2) ===== | ===== Syntactic Parsing (WP 5.2) ===== | ||
features: induce dependencies and labels | features: induce dependencies and labels | ||
- | performance indicator: | + | performance indicator: |
- | the idea is to train McDonald's parser on dialog data; that task runs until M42, not now. | + |
===== Semantic Parsing (WP 5.2) ===== | ===== Semantic Parsing (WP 5.2) ===== | ||
- | features: | + | features: |
- | performance indicator: | + | performance indicator: |
===== Information Extraction (WP 5.2) ===== | ===== Information Extraction (WP 5.2) ===== | ||
Line 56: | Line 59: | ||
covering predicates from before-mentioned set of DAFs. | covering predicates from before-mentioned set of DAFs. | ||
performance indicator: accuracy | performance indicator: accuracy | ||
+ | |||
===== Named Entities Recognition (WP 5.2) ===== | ===== Named Entities Recognition (WP 5.2) ===== | ||
- | features: detect person names, geographical locations | + | features: detect person names, geographical locations, organization names |
performance indicator: f-measure | performance indicator: f-measure | ||
+ | |||
+ | |||
+ | |||
===== Dialog Act Tagging (WP 5.2) ===== | ===== Dialog Act Tagging (WP 5.2) ===== | ||
- | features: | + | features: |
performance indicator: accuracy | performance indicator: accuracy | ||
- | ===== Sentiment Analysis (WP 5.2) ===== | ||
- | features: I would pass a classifier off as this, | ||
- | performance indicator: f-measure | ||
Line 76: | Line 80: | ||
- | ===== Complete System Evaluation | + | |
- | T5.2.7 mentions this; Nick Webb probably won't do it for us | + | ===== Dialog Manager (WP 5.3) ===== |
- | performance indicator: number | + | features: integrated DAF-based dialog manager from previous English prototype, |
+ | manual creation | ||
+ | performance indicator: acceptability | ||
- | ===== Dialog Manager (WP 5.3) ===== | ||
- | features: reply types, using (language-independent) predicates (in practice this means that I will name the tests on the DAF transitions in English) | ||
- | performance indicator: manual assessment of the acceptability of the selected action | ||
===== Natural Language Generation (WP 5.4) ===== | ===== Natural Language Generation (WP 5.4) ===== | ||
- | features: | + | features: |
performance indicator: BLEU score | performance indicator: BLEU score | ||
+ | |||
+ | |||
+ | |||
+ | ===== Emotional TTS (WP 5.5) ===== | ||
+ | features: emotions will be expressed implicitly, through the usage of communicative functions; new female voice database was recorded for this purpose | ||
+ | performance indicator: listening tests | ||
+ | |||
+ | |||
+ | ===== Emotional Avatar Integration (WP 5.5) ===== | ||
+ | features: new Czech female voice with affective features will be integrated with the TID avatar | ||
+ | performance indicator: subjective evaluation of the naturalness and the ability to convey emotions (small-scale, | ||
+ | |||
+ | |||
+ | ====== ONLY AFTER NOVEMBER ====== | ||
+ | |||
+ | ===== Syntactic Parsing (WP 5.2) ===== | ||
+ | features: adapted to domain (McD trained on manual PDTSC trees) | ||
+ | performance indicator: accuracy (correctly induced edges, labels) | ||
+ | |||
+ | |||
+ | ===== Sentiment Analysis (WP 5.2) ===== | ||
+ | features: I would pass a classifier off as this, | ||
+ | performance indicator: f-measure | ||
+ | |||
+ | ===== Complete System Evaluation ===== | ||
+ | T5.2.7 mentions this; Nick Webb probably won't do it for us | ||
+ | performance indicator: number of tokens in user reply utterances, post-session questionnaire | ||
+ | |||
+ | |||
+ | ===== advances ===== | ||
+ | |||
+ | advances in Czech NLU (on reconstructed spoken data): 300-500 sentences(? | ||
+ | POS ? analyze, generate and check ' |