Úprava jmen v BibTeXu ze Zotera
1,$s/{\([a-z-]\+\)_[a-z:-]\+_\([0-9]\{4\}\),/{\u\1-\2,/
Zkrácení autorů
g/\(author\|editor\)/s/, \([A-ZÁČĎĚÉÍŇŘŠŤŮÚÝŽ]\)[a-záčďěéíňřšťůúýžäöüß]*\(\([- ][A-ZÁČĎĚÉÍŇŘŠŤŮÚÝŽ]\)[a-záčďěéíňřšťůúýžäöüß]*\)\?\( and\|}\)/, \1.\3\4/g
Vyházení zbytečností
g/\(abstract\|url\|urldate\|language\|keywords\|file\|shorttitle\|month\|lccn\) =/d
Oprava divných závorek, které ničí syntax highlighting
1,$s/({\([^({]*\))}/{(\1)}/g
treex --qsub "-hard -l mem_free=3g -l act_mem_free=3g" -p -j 30 Util::SetGlobal language=en selector=autom A2T::EN::FixTlemmas -- pcedt-formemes2a/*.treex.gz 2>&1 | tee log.txt
Česky:
treex --qsub "-hard -l mem_free=3g -l act_mem_free=3g" -p -j 30 Util::SetGlobal language=cs selector=autom Util::Eval tnode='$tnode->wild->{orig_formeme} = $tnode->formeme;' A2T::CS::SetFormeme use_version=2 Util::Eval tnode='print $tnode->get_address, " ", $tnode->wild->{orig_formeme}, " " , $tnode->formeme, "\n" if ($tnode->wild->{orig_formeme} ne $tnode->formeme);' -- pcedt-formemes2a/*.treex.gz | tee log1.txt
Anglicky:
treex --qsub "-hard -l mem_free=3g -l act_mem_free=3g" -p -j 30 Util::SetGlobal language=en selector=autom Util::Eval tnode='$tnode->wild->{orig_formeme} = $tnode->formeme;' A2T::EN::SetFormeme2 Util::Eval tnode='print $tnode->get_address, " ", $tnode->wild->{orig_formeme}, " " , $tnode->formeme, "\n" if ($tnode->wild->{orig_formeme} ne $tnode->formeme);' -- pcedt-formemes2a/*.treex.gz | tee log4.txt
treex --qsub "-hard -l mem_free=3g -l act_mem_free=3g" -p -j 30 Util::SetGlobal language=en selector=autom Util::Eval anode='$anode->wild->{orig_afun} = $anode->afun;' W2A::EN::SetAfun Util::Eval anode='print $anode->get_address, " ", $anode->wild->{orig_afun}, " " , $anode->afun, "\n" if ($anode->wild->{orig_afun} ne $anode->afun);' -- pcedt-formemes2a/*.treex.gz | tee log3.txt
t-node [ gram/sempos ~ '^n', a/lex.rf $a1, child $b ]; t-node $b := [ gram/sempos ~ '^n', a/lex.rf $a2 ]; a-node $a1 := [ order-follows $a2 ]; a-node $a2 := [ m/tag ~ '^N' ];
t-node [ gram/sempos = 'v', 1+x a/aux.rf a-node [ m/tag ~ '^V', afun != 'AuxV' ], a/lex.rf a-node [] ];