Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
user:majlis:czeng:mining [2011/09/20 10:29] majlis |
user:majlis:czeng:mining [2011/09/30 16:11] (current) majlis |
||
---|---|---|---|
Line 5: | Line 5: | ||
===== Navod ===== | ===== Navod ===== | ||
- | svn --username $USER co https:// | + | svn --username $USER co https:// |
+ | ===== Pouziti ===== | ||
+ | cd mining; | ||
+ | make check | ||
+ | make get | ||
+ | make prepare | ||
+ | # copy pastovani tasku | ||
+ | make clean-all | ||
+ | # copy pastovani tasku | ||
+ | make train.parallel | ||
+ | make segment.parallel | ||
+ | make segmented.hunalign.gz | ||
+ | |||
+ | ===== Ukoly ===== | ||
+ | |||
+ | ==== TODO ==== | ||
+ | * zkustil zprovoznit u sebe celou pipelinu CzEngu az (vcetne) Alesova hrubeho filtrovani | ||
+ | * pridal " | ||
+ | * '' | ||
+ | http:// | ||
+ | -> zwc je trosinku chytrejsi pocitadlo nez wc | ||
+ | Ondreje zajima co nejdrive ta prehledna tabulka, kolik dat z jakeho zdroje mame (a jak s tim zacvicily hrube filtry). | ||
+ | |||
+ | Inspirace, jak jsem pocital statistiky minule, je zde: | ||
+ | |||
+ | svn cat https:// | ||
+ | |||
+ | Konkretne jde o cile: | ||
+ | %.stat | ||
+ | | ||
+ | | ||
+ | %.freqerrs ... ten po pouziti na filtrovane ukaze, jake chyby jsou | ||
+ | nejcetnejsi, | ||
+ | vyplati takove chyby radeji resit nez data zahodit. | ||
+ | '' | ||
+ | * Dalsi navazujici ukol, nez na to zapomenu, bude ozivit v nove pipeline ty automaticke opravy pripadu 2-1 a 1-2 na 1-1. Ondrej pak dohleda, kde to byvalo. | ||
+ | * predelat checkouty na export - ale jasne odlisit | ||
+ | * prozatim neni priorita, protoze neni jiste, kde je to vsude natvrdo nastavene | ||
+ | * opravit zpracovani titulku podle ostatnich ukolu | ||
+ | * czeng09 - clean-navajo, | ||
+ | |||
+ | ==== DONE ==== | ||
+ | * potrebne nastroje se automaticky stahnou + zkompiluji | ||
+ | * mining pipelina jde spustit | ||
+ | * pridana kontrola na existenci potrebnych dat | ||
+ | * pridana kontrola na existenci potrebnych nastroju | ||
+ | |||
+ | ===== Veci, co se mi nelibi ===== | ||
+ | |||
+ | * readers-digest-2 vs rd2 - podle mne by se to melo jmenovat stejne | ||