[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
user:zeman:treebanks:hi [2011/12/06 16:32]
zeman Sample training Shakti.
user:zeman:treebanks:hi [2011/12/06 17:39]
zeman Hindi development data sample.
Line 64: Line 64:
  
 ==== Inside ==== ==== Inside ====
 +
 +  * Broken characters (''\x{FFFD} REPLACEMENT CHARACTER'') in the WX encoding.
 +
 +--
  
 The text uses the [[http://ltrc.iiit.ac.in/nlptools2010/files/documents/map.pdf|WX encoding]] of Indian letters. If we know what the original script is (Bengali in this case) we can map the WX encoding to the original characters in UTF-8. WX uses English letters so if there was embedded English (or other string using Latin letters) it will probably get lost during the conversion. The text uses the [[http://ltrc.iiit.ac.in/nlptools2010/files/documents/map.pdf|WX encoding]] of Indian letters. If we know what the original script is (Bengali in this case) we can map the WX encoding to the original characters in UTF-8. WX uses English letters so if there was embedded English (or other string using Latin letters) it will probably get lost during the conversion.
Line 204: Line 208:
 </Sentence></code> </Sentence></code>
  
-And in the CoNLL format:+The same two sentences converted to the CoNLL format, WX characters decoded back to Devanagari in UTF-8:
  
-| 1 | Agei Age | NP | NST | lex-Age<nowiki>|</nowiki>cat-adv<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-<nowiki>|</nowiki>pers-<nowiki>|</nowiki>case-<nowiki>|</nowiki>vib-<nowiki>|</nowiki>tam-<nowiki>|</nowiki>head-Agei<nowiki>|</nowiki>name-NP | 3 | k7t | _ | _ | +<nowiki>1</nowiki> <nowiki>बात</nowiki> <nowiki>बात</nowiki> <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-bAwa|cat-n|gend-f|num-sg|pers-3|case-d|vib-0|tam-0|posn-10|name-bAwa|chunkId-NP|chunkType-head:NP</nowiki> <nowiki>3</nowiki> | <nowiki>k1</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
-| 2 | cA cA NP NN | lex-cA<nowiki>|</nowiki>cat-n<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-sg<nowiki>|</nowiki>pers-<nowiki>|</nowiki>case-d<nowiki>|</nowiki>vib-0<nowiki>|</nowiki>tam-0<nowiki>|</nowiki>head-cA<nowiki>|</nowiki>name-NP2 k1 | _ | _ | +| <nowiki>2</nowiki> | <nowiki>गलत</nowiki> | <nowiki>गलत</nowiki> | <nowiki>JJ</nowiki> | <nowiki>adj</nowiki> | <nowiki>lex-galawa|cat-adj|gend-any|num-any|pers-|case-any|vib-|tam-|posn-20|name-galawa|chunkId-JJP|chunkType-head:JJP</nowiki> | <nowiki>3</nowiki> | <nowiki>k1s</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
-ese As VGF VM | lex-As<nowiki>|</nowiki>cat-v<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-<nowiki>|</nowiki>pers-5<nowiki>|</nowiki>case-<nowiki>|</nowiki>vib-A_yA+Ce<nowiki>|</nowiki>tam-A<nowiki>|</nowiki>head-ese<nowiki>|</nowiki>name-VGF | 0 | main | _ | _ | +| <nowiki>3</nowiki> | <nowiki>हो</nowiki> | <nowiki>हो</nowiki> | <nowiki>VM</nowiki> | <nowiki>v</nowiki> | <nowiki>lex-ho|cat-v|gend-any|num-any|pers-any|case-|vib-0|tam-0|stype-declarative|posn-30|voicetype-active|name-ho|chunkId-VGF|chunkType-head:VGF</nowiki> | <nowiki>11</nowiki> | <nowiki>vmod</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
- +| <nowiki>4</nowiki> | <nowiki>तो</nowiki> | <nowiki>तो</nowiki> | <nowiki>CC</nowiki> | <nowiki>avy</nowiki> | <nowiki>lex-wo|cat-avy|gend-|num-|pers-|case-|vib-|tam-|posn-40|name-wo|chunkId-CCP|chunkType-head:CCP</nowiki> | <nowiki>0</nowiki> | <nowiki>main</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
-And after conversion of the WX encoding to the Bengali script in UTF-8: +| <nowiki>5</nowiki> | <nowiki>गुस्सा</nowiki> | <nowiki>गुस्सा</nowiki> | <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-gussA|cat-n|gend-m|num-sg|pers-3|case-d|vib-0|tam-0|posn-50|name-gussA|chunkId-NP2|chunkType-head:NP2</nowiki> | <nowiki>9</nowiki> | <nowiki>pof</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
- +| <nowiki>6</nowiki> | <nowiki>सेलेब्रिटिज</nowiki> | <nowiki>सेलेब्रिटिज</nowiki> | <nowiki>NN</nowiki> | <nowiki>unk</nowiki> | <nowiki>lex-selebritija|cat-unk|gend-|num-|pers-|case-|vib-0_ko|tam-|posn-60|vpos-vib_2_RP|name-selebritija|chunkId-NP3|chunkType-head:NP3</nowiki> | <nowiki>9</nowiki> | <nowiki>k4a</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
-আগেই আগে NP NST | lex-Age<nowiki>|</nowiki>cat-adv<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-<nowiki>|</nowiki>pers-<nowiki>|</nowiki>case-<nowiki>|</nowiki>vib-<nowiki>|</nowiki>tam-<nowiki>|</nowiki>head-Agei<nowiki>|</nowiki>name-NP | 3 | k7t | _ | _ | +| <nowiki>7</nowiki> | <nowiki>को</nowiki> | <nowiki>को</nowiki> | <nowiki>PSP</nowiki> | <nowiki>psp</nowiki> | <nowiki>lex-ko|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-70|chunkType-child:NP3|name-ko</nowiki> | <nowiki>6</nowiki> | <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
-চা চা NP NN | lex-cA<nowiki>|</nowiki>cat-n<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-sg<nowiki>|</nowiki>pers-<nowiki>|</nowiki>case-d<nowiki>|</nowiki>vib-0<nowiki>|</nowiki>tam-0<nowiki>|</nowiki>head-cA<nowiki>|</nowiki>name-NP2 | 3 | k1 | _ | _ | +| <nowiki>8</nowiki> | <nowiki>भी</nowiki> | <nowiki>भी</nowiki> | <nowiki>RP</nowiki> | <nowiki>avy</nowiki> | <nowiki>lex-BI|cat-avy|gend-|num-|pers-|case-|vib-|tam-|posn-80|chunkType-child:NP3|name-BI</nowiki> | <nowiki>6</nowiki> | <nowiki>lwg__rp</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
-এসে আস্ VGF VM | lex-As<nowiki>|</nowiki>cat-v<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-<nowiki>|</nowiki>pers-5<nowiki>|</nowiki>case-<nowiki>|</nowiki>vib-A_yA+Ce<nowiki>|</nowiki>tam-A<nowiki>|</nowiki>head-ese<nowiki>|</nowiki>name-VGF main | _ | _ |+| <nowiki>9</nowiki> | <nowiki>आना</nowiki> | <nowiki>आ</nowiki> | <nowiki>VM</nowiki> | <nowiki>v</nowiki> | <nowiki>lex-A|cat-v|gend-any|num-any|pers-any|case-d|vib-nA|tam-nA|posn-90|name-AnA|chunkId-VGNN|chunkType-head:VGNN</nowiki> | <nowiki>11</nowiki> | <nowiki>k1</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>10</nowiki> | <nowiki>लाजमी</nowiki> | <nowiki>लाजमी</nowiki> | <nowiki>JJ</nowiki> | <nowiki>adj</nowiki> | <nowiki>lex-lAjamI|cat-adj|gend-any|num-any|pers-|case-|vib-|tam-|posn-100|name-lAjamI|chunkId-JJP2|chunkType-head:JJP2</nowiki> | <nowiki>11</nowiki> | <nowiki>pof</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>11</nowiki> | <nowiki>है</nowiki> | <nowiki>है</nowiki> | <nowiki>VM</nowiki> | <nowiki>v</nowiki> | <nowiki>lex-hE|cat-v|gend-any|num-sg|pers-3|case-|vib-hE|tam-hE|stype-declarative|posn-110|voicetype-active|name-hE|chunkId-VGF2|chunkType-head:VGF2</nowiki> | <nowiki>4</nowiki> | <nowiki>ccof</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>12</nowiki> | <nowiki>.</nowiki> | <nowiki>.</nowiki> | <nowiki>SYM</nowiki> | <nowiki>punc</nowiki> | <nowiki>lex-.|cat-punc|gend-|num-|pers-|case-|vib-|tam-|posn-120|chunkType-child:VGF2|name-.</nowiki> <nowiki>11</nowiki> | <nowiki>rsym</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| |||||||||| 
 +| <nowiki>1</nowiki> | <nowiki>बृहस्पतिवार</nowiki> | <nowiki>बृहस्पतिवार</nowiki> | <nowiki>NNP</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-bqhaspawivAra|cat-n|gend-m|num-sg|pers-3|case-o|vib-0_ko|tam-0|posn-10|vpos-vib_2|name-bqhaspawivAra|chunkId-NP|chunkType-head:NP</nowiki> | <nowiki>6</nowiki> | <nowiki>k7t</nowiki> <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +<nowiki>2</nowiki> <nowiki>को</nowiki> <nowiki>को</nowiki> <nowiki>PSP</nowiki> <nowiki>psp</nowiki> <nowiki>lex-ko|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-20|chunkType-child:NP|name-ko</nowiki> | <nowiki>1</nowiki> | <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>3</nowiki> | <nowiki>ज़ी</nowiki> | <nowiki>जी</nowiki> | <nowiki>NNP</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-jI|cat-n|gend-m|num-sg|pers-3|case-o|vib-0_meM|tam-0|posn-30|vpos-vib_2|name-jZI|chunkId-NP2|chunkType-head:NP2</nowiki> | <nowiki>6</nowiki> | <nowiki>k7</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>4</nowiki> | <nowiki>में</nowiki> | <nowiki>में</nowiki> | <nowiki>PSP</nowiki> | <nowiki>psp</nowiki> | <nowiki>lex-meM|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-40|chunkType-child:NP2|name-meM</nowiki> | <nowiki>3</nowiki> | <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>5</nowiki> | <nowiki>शुरू</nowiki> | <nowiki>शुरू</nowiki> | <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-SurU|cat-n|gend-m|num-sg|pers-3|case-d|vib-0|tam-0|posn-50|name-SurU|chunkId-NP3|chunkType-head:NP3</nowiki> | <nowiki>6</nowiki> | <nowiki>pof</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>6</nowiki> | <nowiki>हुए</nowiki> | <nowiki>हो</nowiki> | <nowiki>VM</nowiki> | <nowiki>v</nowiki> | <nowiki>lex-ho|cat-v|gend-m|num-sg|pers-any|case-|vib-eM|tam-eM|posn-60|name-hue|chunkId-VGNF|chunkType-head:VGNF</nowiki> | <nowiki>10</nowiki> | <nowiki>nmod__k1inv</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>7</nowiki> | <nowiki>��वें</nowiki> | <nowiki>��वें</nowiki> | <nowiki>XC</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-��veM|cat-n|gend-m|num-sg|pers-3|case-d|vib-0|tam-0|posn-70|chunkType-child:NP4|name-��veM</nowiki> | <nowiki>10</nowiki> | <nowiki>mod</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>8</nowiki> | <nowiki>अंतर्राष्ट्रीय</nowiki> | <nowiki>अंतर्राष्ट्रीय</nowiki> | <nowiki>XC</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-aMwarrARtrIya|cat-n|gend-m|num-sg|pers-3|case-d|vib-0|tam-0|posn-80|chunkType-child:NP4|name-aMwarrARtrIya</nowiki> | <nowiki>10</nowiki> | <nowiki>mod</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>9</nowiki> | <nowiki>फिल्म</nowiki> | <nowiki>फिल्म</nowiki> | <nowiki>XC</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-Pilma|cat-n|gend-f|num-sg|pers-3|case-d|vib-0|tam-0|posn-90|chunkType-child:NP4|name-Pilma</nowiki> | <nowiki>10</nowiki> <nowiki>mod</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>10</nowiki> | <nowiki>महोत्सव</nowiki> | <nowiki>महोत्सव</nowiki> | <nowiki>NNP</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-mahowsava|cat-n|gend-m|num-sg|pers-|case-o|vib-0_kA|tam-0|posn-100|vpos-vib_5|name-mahowsava|chunkId-NP4|chunkType-head:NP4</nowiki> <nowiki>12</nowiki> | <nowiki>r6</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +<nowiki>11</nowiki> <nowiki>के</nowiki> <nowiki>का</nowiki> <nowiki>PSP</nowiki> <nowiki>psp</nowiki> <nowiki>lex-kA|cat-psp|gend-m|num-sg|pers-|case-o|vib-|tam-|posn-110|chunkType-child:NP4|name-ke</nowiki> | <nowiki>10</nowiki> <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +<nowiki>12</nowiki> | <nowiki>रंग</nowiki> | <nowiki>रंग</nowiki> | <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-raMga|cat-n|gend-m|num-sg|pers-3|case-o|vib-0_meM|tam-0|posn-120|vpos-vib_2|name-raMga|chunkId-NP5|chunkType-head:NP5</nowiki> | <nowiki>17</nowiki> | <nowiki>k7</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>13</nowiki> | <nowiki>में</nowiki> | <nowiki>में</nowiki> | <nowiki>PSP</nowiki> | <nowiki>psp</nowiki> | <nowiki>lex-meM|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-130|chunkType-child:NP5|name-meM2</nowiki> | <nowiki>12</nowiki> | <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>14</nowiki> | <nowiki>भंग</nowiki> | <nowiki>भंग</nowiki> | <nowiki>JJ</nowiki> | <nowiki>adj</nowiki> | <nowiki>lex-BaMga|cat-adj|gend-any|num-any|pers-|case-any|vib-|tam-|posn-140|name-BaMga|chunkId-JJP|chunkType-head:JJP</nowiki> | <nowiki>17</nowiki> | <nowiki>pof</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>15</nowiki> | <nowiki>उस</nowiki> | <nowiki>वह</nowiki> | <nowiki>DEM</nowiki> | <nowiki>pn</nowiki> | <nowiki>lex-vaha|cat-pn|gend-any|num-sg|pers-3|case-o|vib-|tam-|posn-150|chunkType-child:NP6|name-usa</nowiki> | <nowiki>16</nowiki> | <nowiki>nmod__adj</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>16</nowiki> | <nowiki>समय</nowiki> | <nowiki>समय</nowiki> | <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-samaya|cat-n|gend-any|num-sg|pers-3|case-d|vib-0|tam-0|posn-160|name-samaya|chunkId-NP6|chunkType-head:NP6</nowiki> | <nowiki>17</nowiki> <nowiki>k7t</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>17</nowiki> | <nowiki>पड़ा</nowiki> | <nowiki>पड</nowiki> | <nowiki>VM</nowiki> | <nowiki>v</nowiki> | <nowiki>lex-pada|cat-v|gend-any|num-any|pers-any|case-|vib-yA|tam-yA|stype-declarative|posn-170|voicetype-active|name-padZA|chunkId-VGF|chunkType-head:VGF</nowiki> | <nowiki>0</nowiki> <nowiki>main</nowiki> <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +| <nowiki>18</nowiki> | <nowiki>जब</nowiki> | <nowiki>जब</nowiki> | <nowiki>PRP</nowiki> | <nowiki>pn</nowiki> | <nowiki>lex-jaba|cat-pn|gend-|num-|pers-|case-|vib-|tam-|posn-180|coref-samaya|name-jaba|chunkId-NP7|chunkType-head:NP7</nowiki> | <nowiki>32</nowiki> | <nowiki>k7t</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki> | 
 +<nowiki>19</nowiki> <nowiki>वहां</nowiki> <nowiki>वहाँ</nowiki> <nowiki>PRP</nowiki> <nowiki>pn</nowiki> <nowiki>lex-vahAz|cat-pn|gend-|num-|pers-|case-|vib-0_para|tam-|posn-190|vpos-vib_2|name-vahAM|chunkId-NP8|chunkType-head:NP8</nowiki> | <nowiki>21</nowiki> <nowiki>jjmod</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +<nowiki>20</nowiki> <nowiki>पर</nowiki> | <nowiki>पर</nowiki> <nowiki>PSP</nowiki> | <nowiki>psp</nowiki> | <nowiki>lex-para|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-200|chunkType-child:NP8|name-para</nowiki> | <nowiki>19</nowiki> <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>21</nowiki> | <nowiki>तैनात</nowiki> | <nowiki>तैनात</nowiki> | <nowiki>JJ</nowiki> | <nowiki>adj</nowiki> | <nowiki>lex-wEnAwa|cat-adj|gend-any|num-any|pers-|case-o|vib-|tam-|posn-210|name-wEnAwa|chunkId-JJP2|chunkType-head:JJP2</nowiki> | <nowiki>22</nowiki> <nowiki>nmod</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>22</nowiki> | <nowiki>सुरक्षाकर्मियों</nowiki> | <nowiki>सुरक्षाकर्मी</nowiki> | <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-surakRAkarmI|cat-n|gend-m|num-pl|pers-3|case-o|vib-0_ne|tam-0|posn-220|vpos-vib_2|name-surakRAkarmiyoM|chunkId-NP9|chunkType-head:NP9</nowiki> | <nowiki>32</nowiki> | <nowiki>k1</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +<nowiki>23</nowiki> <nowiki>ने</nowiki> <nowiki>ने</nowiki> <nowiki>PSP</nowiki> <nowiki>psp</nowiki> <nowiki>lex-ne|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-230|chunkType-child:NP9|name-ne</nowiki> | <nowiki>22</nowiki> | <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>24</nowiki> | <nowiki>बॉलीवुड</nowiki> | <nowiki>बॉलीवुड</nowiki> | <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-bOYlIvuda|cat-n|gend-m|num-sg|pers-3|case-o|vib-0_kA|tam-0|posn-240|vpos-vib_2|name-bOYlIvuda|chunkId-NP10|chunkType-head:NP10</nowiki> | <nowiki>28</nowiki> <nowiki>r6</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>25</nowiki> | <nowiki>की</nowiki> | <nowiki>का</nowiki> | <nowiki>PSP</nowiki> | <nowiki>psp</nowiki> | <nowiki>lex-kA|cat-psp|gend-f|num-sg|pers-|case-o|vib-|tam-|posn-250|chunkType-child:NP10|name-kI</nowiki> | <nowiki>24</nowiki> <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki>
 +<nowiki>26</nowiki> | <nowiki>अभिनेत्री</nowiki> | <nowiki>अभिनेत्री</nowiki> | <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-aBinewrI|cat-n|gend-f|num-sg|pers-3|case-o|vib-0|tam-0|posn-260|chunkType-child:NP11|name-aBinewrI</nowiki> | <nowiki>27</nowiki> | <nowiki>nmod</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>27</nowiki> | <nowiki>बिपाशा</nowiki> | <nowiki>बिपाशा</nowiki> | <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-bipASA|cat-n|gend-f|num-sg|pers-3|case-d|vib-0|tam-0|posn-270|chunkType-child:NP11|name-bipASA</nowiki> | <nowiki>28</nowiki> <nowiki>nmod</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>28</nowiki> | <nowiki>बसु</nowiki> | <nowiki>बसु</nowiki> | <nowiki>NNP</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-basu|cat-n|gend-f|num-sg|pers-3|case-o|vib-0_ke_sAWa|tam-0|posn-280|vpos-vib_vib_vib_4_5|name-basu|chunkId-NP11|chunkType-head:NP11</nowiki> | <nowiki>32</nowiki> | <nowiki>k2</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +<nowiki>29</nowiki> <nowiki>के</nowiki> <nowiki>के</nowiki> <nowiki>PSP</nowiki> <nowiki>psp</nowiki> <nowiki>lex-ke|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-290|chunkType-child:NP11|name-ke2</nowiki> | <nowiki>28</nowiki> <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> | 
 +| <nowiki>30</nowiki> <nowiki>साथ</nowiki> | <nowiki>साथ</nowiki> <nowiki>NST</nowiki> | <nowiki>nst</nowiki> | <nowiki>lex-sAWa|cat-nst|gend-m|num-sg|pers-3|case-d|vib-|tam-|posn-300|chunkType-child:NP11|name-sAWa</nowiki> | <nowiki>28</nowiki> | <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>31</nowiki> | <nowiki>दुव्यर्वहार</nowiki> | <nowiki>दुव्यर्वहार</nowiki> | <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-xuvyarvahAra|cat-n|gend-m|num-sg|pers-3|case-d|vib-0|tam-0|posn-310|name-xuvyarvahAra|chunkId-NP12|chunkType-head:NP12</nowiki> | <nowiki>32</nowiki> | <nowiki>pof</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>32</nowiki> | <nowiki>किया</nowiki> | <nowiki>कर</nowiki> | <nowiki>VM</nowiki> | <nowiki>v</nowiki> | <nowiki>lex-kara|cat-v|gend-m|num-sg|pers-any|case-|vib-yA|tam-yA|stype-declarative|posn-320|voicetype-active|name-kiyA|chunkId-VGF2|chunkType-head:VGF2</nowiki> | <nowiki>16</nowiki> <nowiki>nmod__relc</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>33</nowiki> | <nowiki>.</nowiki> | <nowiki>.</nowiki> | <nowiki>SYM</nowiki> | <nowiki>punc</nowiki> | <nowiki>lex-.|cat-punc|gend-|num-|pers-|case-|vib-|tam-|posn-330|chunkType-child:VGF2|name-.</nowiki> <nowiki>32</nowiki> <nowiki>rsym</nowiki> <nowiki>_</nowiki> <nowiki>_</nowiki> |
  
 The first sentence of the ICON 2010 development data (with fine-grained syntactic tags) in the Shakti format: The first sentence of the ICON 2010 development data (with fine-grained syntactic tags) in the Shakti format:
  
-<code xml><document id="">+<code xml><document docid="fullnews_id_2489467">
 <head> <head>
-<annotated-resource name="HyDT-Bangla" version="0.5" type="dep-interchunk-only" layers="morph,pos,chunk,dep-interchunk-only" language="ben" date-of-release="20100831">+ <caption>jela meM svasWa hE sarabajIwa xo BArawIya aXikAriyoM ne mulAkAwa kI pre isalAmAbAxa.</caption> 
 + <language>Hindi </language> 
 + <domain_name>News Articles </domain_name> 
 + <word_count>524</word_count> 
 + <byte_count>64554</byte_count> 
 + <availability> 
 + <format>CML/SSF</format> 
 + <sentence_marker>.</sentence_marker> 
 + <normalization>No</normalization> 
 + </availability> 
 + <encoding_description> 
 + <original_encoding>ISO 8859</format> 
 + <new_encoding>Unicode UTF8</new_encoding> 
 + </encoding_description> 
 + <distributor>LTRC, IIIT Hyderabad</distributor> 
 + <project_description>NSF Hindi/Urdu Dependency Treebanking Project</place> 
 + <creation> 
 + </raw_corpus creation_date="" institute_name="IIIT Hyderabad"> 
 + </annotated_corpus creation_date="06/01/2009" institute_name="IIIT Hyderabad"> 
 + <edition_number>1.0</edition_number> 
 + </creation> 
 + <publication> 
 + <place>New Delhi</place> 
 + <date>30/5/2004</date> 
 + <type>Newspaper</type> 
 + <publisher> 
 + <name>Amar Ujala</name> 
 + <url>http://www.amarujala.com</url> 
 + </publisher> 
 + </publication> 
 + 
 +<annotated-resource name="HyDT-Hindi" version="2.0" type="dep-words" layers="morph,pos,chunk,dep-word" language="hin" date-of-release="20100831">
     <annotation-standard>     <annotation-standard>
         <morph-standard name="Anncorra-morph" version="1.31" date="20080920" />         <morph-standard name="Anncorra-morph" version="1.31" date="20080920" />
         <pos-standard name="Anncorra-pos" version="" date="20061215" />         <pos-standard name="Anncorra-pos" version="" date="20061215" />
         <chunk-standard name="Anncorra-chunk" version="" date="20061215" />         <chunk-standard name="Anncorra-chunk" version="" date="20061215" />
 +        <intrachunk-dependency-standard name="Anncorra-intrachunk-dep" version="1.0" date="" dep-tagset-granularity="5" />
         <dependency-standard name="Anncorra-dep" version="2.0" date="" dep-tagset-granularity="6" />         <dependency-standard name="Anncorra-dep" version="2.0" date="" dep-tagset-granularity="6" />
     </annotation-standard>     </annotation-standard>
 </annotated-resource> </annotated-resource>
 </head> </head>
 +<body>
 +<tb number="1" segment="no" bullet="no">
 +<foreign language="select" writingsystem="LTR"></foreign>
 +<text>
 <Sentence id="1"> <Sentence id="1">
-1 (( NP <fs af='parabarwIkAle,adv,,,,,,' head="parabarwIkAle" drel=k7t:VGF name=NP> +1 kota XC <fs af='kota,n,m,sg,3,d,0,0posn='10' drel='mod:lAhOra' chunkType='child:NP' name='kota'
-1.1 parabarwIkAle NN <fs af='parabarwIkAle,adv,,,,,,' name="parabarwIkAle"+2 laKapawa XC <fs af='laKapawa,n,m,sg,3,d,0,0' posn='20' drel='mod:lAhOra' chunkType='child:NP' name='laKapawa'
- ))  +3 jela XC <fs af='jela,n,m,sg,3,d,0,0' posn='30' drel='mod:lAhOra' chunkType='child:NP' name='jela'> 
-2 (( NP <fs af='aPisa-biyArAraxera,unk,,,,,,' head="aPisa-biyArAraxera" drel=r6:NP3 name=NP2+4 lAhOra NNP <fs af='lAhOra,n,m,sg,3,o,0_meM,0' drel='jjmod:baMxa' posn='40' vpos='vib_5' name='lAhOra' chunkId='NP' chunkType='head:NP'> 
-2.1 aPisa-biyArAraxera NN <fs af='aPisa-biyArAraxera,unk,,,,,,' name="aPisa-biyArAraxera"+5 meM PSP <fs af='meM,psp,,,,,,' posn='50' drel='lwg__psp:lAhOra' chunkType='child:NP' name='meM'
- ))  +6 baMxa JJ <fs af='baMxa,adj,any,any,,o,,' drel='nmod:siMha' posn='60' name='baMxa' chunkId='JJP' chunkType='head:JJP'
-3 (( NP <fs af='nAma,n,,sg,,d,0,0' head="nAma" drel=k2:VGNN name=NP3+7 sarabajIwa XC <fs af='sarabajIwa,n,m,sg,3,d,0,0' posn='70' drel='mod:siMha' chunkType='child:NP2' name='sarabajIwa'
-3.1 nAma NN <fs af='nAma,n,,sg,,d,0,0' name="nAma"+8 siMha NNP <fs af='siMha,n,m,sg,3,o,0_ne,0' drel='k1:xIM' posn='80' vpos='vib_3' name='siMha' chunkId='NP2' chunkType='head:NP2'
- ))  +9 ne PSP <fs af='ne,psp,,,,,,' posn='90' drel='lwg__psp:siMha' chunkType='child:NP2' name='ne'> 
-4 (( NP <fs af='GoRaNA,unk,,,,,,' head="GoRaNA" drel=pof:VGNN name=NP4> +10 maMgalavAra NNP <fs af='maMgalavAra,n,m,sg,3,o,0_ko,0' drel='k7t:xIM' posn='100' vpos='vib_2' name='maMgalavAra' chunkId='NP3' chunkType='head:NP3'> 
-4.1 GoRaNA NN <fs af='GoRaNA,unk,,,,,,' name="GoRaNA"+11 ko PSP <fs af='ko,psp,,,,,,' posn='110' drel='lwg__psp:maMgalavAra' chunkType='child:NP3' name='ko'> 
- ))  +12 BArawIya JJ <fs af='BArawIya,adj,any,any,,o,,' posn='120' drel='nmod__adj:xUwAvAsa' chunkType='child:NP4' name='BArawIya'
-5 (( VGNN <fs af='kar,n,,,any,,,' head="karAra" drel=r6:NP5 name=VGNN+13 xUwAvAsa NN <fs af='xUwAvAsa,n,m,sg,3,o,0_kA,0' drel='r6:aXikAriyoM' posn='130' vpos='vib_3' name='xUwAvAsa' chunkId='NP4' chunkType='head:NP4'
-5.1 karAra VM <fs af='kar,n,,,any,,,' name="karAra"+14 ke PSP <fs af='kA,psp,m,pl,,o,,' posn='140' drel='lwg__psp:xUwAvAsa' chunkType='child:NP4' name='ke'> 
- ))  +15 xo QC <fs af='xo,num,any,pl,,o,,' posn='150' drel='nmod__adj:aXikAriyoM' chunkType='child:NP5' name='xo'> 
-6 (( NP <fs af='samay,unk,,,,,,' head="samay" drel=k7t:VGF name=NP5+16 aXikAriyoM NN <fs af='aXikArI,n,m,pl,3,o,0_ko,0' drel='k4:xIM' posn='160' vpos='vib_3' name='aXikAriyoM' chunkId='NP5' chunkType='head:NP5'> 
-6.1 samay NN <fs af='samay,unk,,,,,,' name="samay"+17 ko PSP <fs af='ko,psp,,,,,,' posn='170' drel='lwg__psp:aXikAriyoM' chunkType='child:NP5name='ko2'
- ))  +18 apane PRP <fs af='apanA,pn,any,sg,1,o,0_bAre_meM,0' drel='k7:xIM' posn='180' vpos='vib_2_3' name='apane' chunkId='NP6' chunkType='head:NP6'> 
-7 (( NP <fs af='animeRake,unk,,,,,,' head="animeRake" drel=k2:VGF name=NP6+19 bAre PSP <fs af='bAre,psp,,,,,,' posn='190' drel='lwg__psp:apane' chunkType='child:NP6' name='bAre'> 
-7.1 animeRake NNP <fs af='animeRake,unk,,,,,,' name="animeRake"+20 meM PSP <fs af='meM,psp,,,,,,' posn='200' drel='lwg__psp:apane' chunkType='child:NP6' name='meM2'> 
- ))  +21 wamAma JJ <fs af='wamAma,adj,any,any,,d,,' posn='210' drel='nmod__adj:jAnakAriyAM' chunkType='child:NP7' name='wamAma'
-8 (( VGF <fs af='sariye,unk,,,5,,0_rAKA+ka_ha+la,' head="sariye" name=VGF+22 vyakwigawa JJ <fs af='vyakwigawa,adj,any,any,,d,,' posn='220' drel='nmod__adj:jAnakAriyAM' chunkType='child:NP7' name='vyakwigawa'> 
-8.1 sariye VM <fs af='sariye,unk,,,,,,' name="sariye"+23 jAnakAriyAM NN <fs af='jAnakAriyAM,n,f,pl,3,d,0,0' drel='k2:xIM' posn='230' name='jAnakAriyAM' chunkId='NP7' chunkType='head:NP7'> 
-8.2 . SYM <fs af='.,punc,,,,,,'> +24 xIM VM <fs af='xe,v,f,pl,3,,yA,yA' stype='declarative' posn='240' voicetype='active' name='xIM' chunkId='VGF' chunkType='head:VGF'> 
- )) +25 ki CC <fs af='ki,avy,,,,,,' drel='rs:jAnakAriyAM' posn='250' name='ki' chunkId='CCP' chunkType='head:CCP'
 +26 kina WQ <fs af='kOna,pn,any,pl,3,o,,' posn='260' drel='mod__wq:parisWiwiyoM' chunkType='child:NP8' name='kina'> 
 +27 parisWiwiyoM NN <fs af='parisWiwi,n,f,pl,3,o,0_meM,0' drel='k7:kiyA' posn='270' vpos='vib_3' name='parisWiwiyoM' chunkId='NP8' chunkType='head:NP8'
 +28 meM PSP <fs af='meM,psp,,,,,,' posn='280' drel='lwg__psp:parisWiwiyoM' chunkType='child:NP8' name='meM3'> 
 +29 use PRP <fs af='vaha,pn,any,sg,3,o,ko,ko' drel='k2:kiyA' posn='290' name='use' chunkId='NP9' chunkType='head:NP9'> 
 +30 giraPwAra JJ <fs af='giraPwAra,adj,any,any,,,,' drel='pof:kiyA' posn='300' name='giraPwAra' chunkId='JJP2' chunkType='head:JJP2'
 +31 kiyA VM <fs af='kara,v,m,sg,3,,yA_jA+yA�,yA' drel='ccof:Ora' stype='declarative' posn='310' voicetype='passive' vpos='tam_2' name='kiyA' chunkId='VGF2' chunkType='head:VGF2'
 +32 gayA VAUX <fs af='jA,v,m,sg,3,,yA�,yA1' posn='320' drel='lwg__vaux:kiyA' chunkType='child:VGF2' name='gayA'> 
 +33 , SYM <fs af=',s,punc,,,,,' posn='330' drel='rsym:kiyA' chunkType='child:VGF2' name=','> 
 +34 mukaxamA NN <fs af='mukaxamA,n,m,sg,3,d,0,0' drel='k1:calA' posn='340' name='mukaxamA' chunkId='NP10' chunkType='head:NP10'> 
 +35 calA VM <fs af='cala,v,m,sg,3,,yA,yA' hlt='true' drel='ccof:Ora' stype='declarative' posn='350' voicetype='active' name='calA' chunkId='VGF3' chunkType='head:VGF3'
 +36 Ora CC <fs af='Ora,avy,,,,,,' drel='ccof:ki' posn='360' name='Ora' chunkId='CCP2' chunkType='head:CCP2'> 
 +37 sajA NN <fs af='sajA,n,f,sg,3,d,0,0' drel='k1:huI' posn='370' name='sajA' chunkId='NP11' chunkType='head:NP11'> 
 +38 huI VM <fs af='ho,v,f,sg,3,,yA,yA' drel='ccof:Ora' stype='declarative' posn='380' voicetype='active' name='huI' chunkId='VGF4' chunkType='head:VGF4'
 +39 . SYM <fs af='.,punc,,,,,,' posn='390' drel='rsym:huI' chunkType='child:VGF4' name='.'>
 </Sentence></code> </Sentence></code>
  
 And in the CoNLL format: And in the CoNLL format:
  
-| 1 | parabarwIkAle parabarwIkAle NP NN | lex-parabarwIkAle<nowiki>|</nowiki>cat-adv<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-<nowiki>|</nowiki>pers-<nowiki>|</nowiki>case-<nowiki>|</nowiki>vib-<nowiki>|</nowiki>tam-<nowiki>|</nowiki>head-parabarwIkAle<nowiki>|</nowiki>name-NP | k7t | _ | _ | +<nowiki>1</nowiki> <nowiki>kota</nowiki> <nowiki>kota</nowiki> <nowiki>XC</nowiki> <nowiki>n</nowiki> <nowiki>lex-kota|cat-n|gend-m|num-sg|pers-3|case-d|vib-0|tam-0|posn-10|chunkType-child:NP|name-kota</nowiki> | <nowiki>4</nowiki> <nowiki>mod</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> 
-aPisa-biyArAraxera aPisa-biyArAraxera NP NN lex-aPisa-biyArAraxera<nowiki>|</nowiki>cat-unk<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-<nowiki>|</nowiki>pers-<nowiki>|</nowiki>case-<nowiki>|</nowiki>vib-<nowiki>|</nowiki>tam-<nowiki>|</nowiki>head-aPisa-biyArAraxera<nowiki>|</nowiki>name-NP2 | r6 | _ | _ | +<nowiki>2</nowiki> <nowiki>laKapawa</nowiki> | <nowiki>laKapawa</nowiki> <nowiki>XC</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-laKapawa|cat-n|gend-m|num-sg|pers-3|case-d|vib-0|tam-0|posn-20|chunkType-child:NP|name-laKapawa</nowiki> | <nowiki>4</nowiki> <nowiki>mod</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
-nAma nAma NP NN | lex-nAma<nowiki>|</nowiki>cat-n<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-sg<nowiki>|</nowiki>pers-<nowiki>|</nowiki>case-d<nowiki>|</nowiki>vib-0<nowiki>|</nowiki>tam-0<nowiki>|</nowiki>head-nAma<nowiki>|</nowiki>name-NP3 k2 | _ | _ | +| <nowiki>3</nowiki> | <nowiki>jela</nowiki> | <nowiki>jela</nowiki> | <nowiki>XC</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-jela|cat-n|gend-m|num-sg|pers-3|case-d|vib-0|tam-0|posn-30|chunkType-child:NP|name-jela</nowiki> | <nowiki>4</nowiki> <nowiki>mod</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
-GoRaNA GoRaNA NP NN | lex-GoRaNA<nowiki>|</nowiki>cat-unk<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-<nowiki>|</nowiki>pers-<nowiki>|</nowiki>case-<nowiki>|</nowiki>vib-<nowiki>|</nowiki>tam-<nowiki>|</nowiki>head-GoRaNA<nowiki>|</nowiki>name-NP4 pof | _ | _ | +| <nowiki>4</nowiki> | <nowiki>lAhOra</nowiki> | <nowiki>lAhOra</nowiki> | <nowiki>NNP</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-lAhOra|cat-n|gend-m|num-sg|pers-3|case-o|vib-0_meM|tam-0|posn-40|vpos-vib_5|name-lAhOra|chunkId-NP|chunkType-head:NP</nowiki> <nowiki>6</nowiki> <nowiki>jjmod</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> 
-karAra kar VGNN VM | lex-kar<nowiki>|</nowiki>cat-n<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-<nowiki>|</nowiki>pers-any<nowiki>|</nowiki>case-<nowiki>|</nowiki>vib-<nowiki>|</nowiki>tam-<nowiki>|</nowiki>head-karAra<nowiki>|</nowiki>name-VGNN r6 | _ | _ | +<nowiki>5</nowiki> <nowiki>meM</nowiki> | <nowiki>meM</nowiki> | <nowiki>PSP</nowiki> | <nowiki>psp</nowiki> | <nowiki>lex-meM|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-50|chunkType-child:NP|name-meM</nowiki> | <nowiki>4</nowiki> <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> | 
-samay samay NP NN | lex-samay<nowiki>|</nowiki>cat-unk<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-<nowiki>|</nowiki>pers-<nowiki>|</nowiki>case-<nowiki>|</nowiki>vib-<nowiki>|</nowiki>tam-<nowiki>|</nowiki>head-samay<nowiki>|</nowiki>name-NP5 k7t | _ | _ | +| <nowiki>6</nowiki> <nowiki>baMxa</nowiki> | <nowiki>baMxa</nowiki> <nowiki>JJ</nowiki> | <nowiki>adj</nowiki> | <nowiki>lex-baMxa|cat-adj|gend-any|num-any|pers-|case-o|vib-|tam-|posn-60|name-baMxa|chunkId-JJP|chunkType-head:JJP</nowiki> | <nowiki>8</nowiki> <nowiki>nmod</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki>
-animeRake animeRake NP NNP | lex-animeRake<nowiki>|</nowiki>cat-unk<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-<nowiki>|</nowiki>pers-<nowiki>|</nowiki>case-<nowiki>|</nowiki>vib-<nowiki>|</nowiki>tam-<nowiki>|</nowiki>head-animeRake<nowiki>|</nowiki>name-NP6 k2 | _ | _ | +<nowiki>7</nowiki> | <nowiki>sarabajIwa</nowiki> | <nowiki>sarabajIwa</nowiki> | <nowiki>XC</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-sarabajIwa|cat-n|gend-m|num-sg|pers-3|case-d|vib-0|tam-0|posn-70|chunkType-child:NP2|name-sarabajIwa</nowiki> | <nowiki>8</nowiki> | <nowiki>mod</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
-sariye sariye VGF VM | lex-sariye<nowiki>|</nowiki>cat-unk<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-<nowiki>|</nowiki>pers-5<nowiki>|</nowiki>case-<nowiki>|</nowiki>vib-0_rAKA+ka_ha+la<nowiki>|</nowiki>tam-<nowiki>|</nowiki>head-sariye<nowiki>|</nowiki>name-VGF main | _ | _ |+| <nowiki>8</nowiki> | <nowiki>siMha</nowiki> | <nowiki>siMha</nowiki> | <nowiki>NNP</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-siMha|cat-n|gend-m|num-sg|pers-3|case-o|vib-0_ne|tam-0|posn-80|vpos-vib_3|name-siMha|chunkId-NP2|chunkType-head:NP2</nowiki> <nowiki>24</nowiki> <nowiki>k1</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +<nowiki>9</nowiki> <nowiki>ne</nowiki> <nowiki>ne</nowiki> <nowiki>PSP</nowiki> <nowiki>psp</nowiki> <nowiki>lex-ne|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-90|chunkType-child:NP2|name-ne</nowiki> | <nowiki>8</nowiki> <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki>
 +<nowiki>10</nowiki> | <nowiki>maMgalavAra</nowiki> | <nowiki>maMgalavAra</nowiki> | <nowiki>NNP</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-maMgalavAra|cat-n|gend-m|num-sg|pers-3|case-o|vib-0_ko|tam-0|posn-100|vpos-vib_2|name-maMgalavAra|chunkId-NP3|chunkType-head:NP3</nowiki> | <nowiki>24</nowiki> <nowiki>k7t</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>11</nowiki> | <nowiki>ko</nowiki> | <nowiki>ko</nowiki> | <nowiki>PSP</nowiki> | <nowiki>psp</nowiki> | <nowiki>lex-ko|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-110|chunkType-child:NP3|name-ko</nowiki> | <nowiki>10</nowiki> | <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>12</nowiki> | <nowiki>BArawIya</nowiki> | <nowiki>BArawIya</nowiki> | <nowiki>JJ</nowiki> | <nowiki>adj</nowiki> | <nowiki>lex-BArawIya|cat-adj|gend-any|num-any|pers-|case-o|vib-|tam-|posn-120|chunkType-child:NP4|name-BArawIya</nowiki> | <nowiki>13</nowiki> | <nowiki>nmod__adj</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>13</nowiki> | <nowiki>xUwAvAsa</nowiki> | <nowiki>xUwAvAsa</nowiki> | <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-xUwAvAsa|cat-n|gend-m|num-sg|pers-3|case-o|vib-0_kA|tam-0|posn-130|vpos-vib_3|name-xUwAvAsa|chunkId-NP4|chunkType-head:NP4</nowiki> | <nowiki>16</nowiki> <nowiki>r6</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>14</nowiki> | <nowiki>ke</nowiki> | <nowiki>kA</nowiki> | <nowiki>PSP</nowiki> | <nowiki>psp</nowiki> | <nowiki>lex-kA|cat-psp|gend-m|num-pl|pers-|case-o|vib-|tam-|posn-140|chunkType-child:NP4|name-ke</nowiki> <nowiki>13</nowiki> <nowiki>lwg__psp</nowiki> <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +<nowiki>15</nowiki> <nowiki>xo</nowiki> <nowiki>xo</nowiki> <nowiki>QC</nowiki> <nowiki>num</nowiki> <nowiki>lex-xo|cat-num|gend-any|num-pl|pers-|case-o|vib-|tam-|posn-150|chunkType-child:NP5|name-xo</nowiki> | <nowiki>16</nowiki> <nowiki>nmod__adj</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> | 
 +| <nowiki>16</nowiki> | <nowiki>aXikAriyoM</nowiki> | <nowiki>aXikArI</nowiki> | <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-aXikArI|cat-n|gend-m|num-pl|pers-3|case-o|vib-0_ko|tam-0|posn-160|vpos-vib_3|name-aXikAriyoM|chunkId-NP5|chunkType-head:NP5</nowiki> | <nowiki>24</nowiki> | <nowiki>k4</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>17</nowiki> | <nowiki>ko</nowiki> | <nowiki>ko</nowiki> | <nowiki>PSP</nowiki> | <nowiki>psp</nowiki> | <nowiki>lex-ko|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-170|chunkType-child:NP5|name-ko2</nowiki> | <nowiki>16</nowiki> | <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>18</nowiki> | <nowiki>apane</nowiki> | <nowiki>apanA</nowiki> | <nowiki>PRP</nowiki> | <nowiki>pn</nowiki> | <nowiki>lex-apanA|cat-pn|gend-any|num-sg|pers-1|case-o|vib-0_bAre_meM|tam-0|posn-180|vpos-vib_2_3|name-apane|chunkId-NP6|chunkType-head:NP6</nowiki> | <nowiki>24</nowiki> | <nowiki>k7</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>19</nowiki> | <nowiki>bAre</nowiki> | <nowiki>bAre</nowiki> | <nowiki>PSP</nowiki> | <nowiki>psp</nowiki> | <nowiki>lex-bAre|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-190|chunkType-child:NP6|name-bAre</nowiki> | <nowiki>18</nowiki> | <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>20</nowiki> | <nowiki>meM</nowiki> | <nowiki>meM</nowiki> | <nowiki>PSP</nowiki> | <nowiki>psp</nowiki> | <nowiki>lex-meM|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-200|chunkType-child:NP6|name-meM2</nowiki> | <nowiki>18</nowiki> <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>21</nowiki> | <nowiki>wamAma</nowiki> | <nowiki>wamAma</nowiki> | <nowiki>JJ</nowiki> | <nowiki>adj</nowiki> | <nowiki>lex-wamAma|cat-adj|gend-any|num-any|pers-|case-d|vib-|tam-|posn-210|chunkType-child:NP7|name-wamAma</nowiki> <nowiki>23</nowiki> <nowiki>nmod__adj</nowiki> <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +<nowiki>22</nowiki> <nowiki>vyakwigawa</nowiki> <nowiki>vyakwigawa</nowiki> <nowiki>JJ</nowiki> <nowiki>adj</nowiki> <nowiki>lex-vyakwigawa|cat-adj|gend-any|num-any|pers-|case-d|vib-|tam-|posn-220|chunkType-child:NP7|name-vyakwigawa</nowiki> | <nowiki>23</nowiki> | <nowiki>nmod__adj</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>23</nowiki> | <nowiki>jAnakAriyAM</nowiki> | <nowiki>jAnakAriyAM</nowiki> | <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-jAnakAriyAM|cat-n|gend-f|num-pl|pers-3|case-d|vib-0|tam-0|posn-230|name-jAnakAriyAM|chunkId-NP7|chunkType-head:NP7</nowiki> | <nowiki>24</nowiki> <nowiki>k2</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki>
 +<nowiki>24</nowiki> <nowiki>xIM</nowiki> | <nowiki>xe</nowiki> <nowiki>VM</nowiki> | <nowiki>v</nowiki> | <nowiki>lex-xe|cat-v|gend-f|num-pl|pers-3|case-|vib-yA|tam-yA|stype-declarative|posn-240|voicetype-active|name-xIM|chunkId-VGF|chunkType-head:VGF</nowiki> | <nowiki>0</nowiki> <nowiki>main</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki>
 +<nowiki>25</nowiki> | <nowiki>ki</nowiki> | <nowiki>ki</nowiki> | <nowiki>CC</nowiki> | <nowiki>avy</nowiki> | <nowiki>lex-ki|cat-avy|gend-|num-|pers-|case-|vib-|tam-|posn-250|name-ki|chunkId-CCP|chunkType-head:CCP</nowiki> <nowiki>23</nowiki> | <nowiki>rs</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +<nowiki>26</nowiki> <nowiki>kina</nowiki> <nowiki>kOna</nowiki> <nowiki>WQ</nowiki> <nowiki>pn</nowiki> <nowiki>lex-kOna|cat-pn|gend-any|num-pl|pers-3|case-o|vib-|tam-|posn-260|chunkType-child:NP8|name-kina</nowiki> | <nowiki>27</nowiki> <nowiki>mod__wq</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> | 
 +| <nowiki>27</nowiki> <nowiki>parisWiwiyoM</nowiki> | <nowiki>parisWiwi</nowiki> <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-parisWiwi|cat-n|gend-f|num-pl|pers-3|case-o|vib-0_meM|tam-0|posn-270|vpos-vib_3|name-parisWiwiyoM|chunkId-NP8|chunkType-head:NP8</nowiki> | <nowiki>31</nowiki> <nowiki>k7</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +<nowiki>28</nowiki> <nowiki>meM</nowiki> | <nowiki>meM</nowiki> | <nowiki>PSP</nowiki> | <nowiki>psp</nowiki> | <nowiki>lex-meM|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-280|chunkType-child:NP8|name-meM3</nowiki> <nowiki>27</nowiki> <nowiki>lwg__psp</nowiki> <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +<nowiki>29</nowiki> <nowiki>use</nowiki> <nowiki>vaha</nowiki> <nowiki>PRP</nowiki> <nowiki>pn</nowiki> <nowiki>lex-vaha|cat-pn|gend-any|num-sg|pers-3|case-o|vib-ko|tam-ko|posn-290|name-use|chunkId-NP9|chunkType-head:NP9</nowiki> | <nowiki>31</nowiki> <nowiki>k2</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> | 
 +| <nowiki>30</nowiki> <nowiki>giraPwAra</nowiki> | <nowiki>giraPwAra</nowiki> <nowiki>JJ</nowiki> | <nowiki>adj</nowiki> | <nowiki>lex-giraPwAra|cat-adj|gend-any|num-any|pers-|case-|vib-|tam-|posn-300|name-giraPwAra|chunkId-JJP2|chunkType-head:JJP2</nowiki> | <nowiki>31</nowiki> <nowiki>pof</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki>
 +<nowiki>31</nowiki> <nowiki>kiyA</nowiki> | <nowiki>kara</nowiki> | <nowiki>VM</nowiki> | <nowiki>v</nowiki> | <nowiki>lex-kara|cat-v|gend-m|num-sg|pers-3|case-|vib-yA_jA+yA�|tam-yA|stype-declarative|posn-310|voicetype-passive|vpos-tam_2|name-kiyA|chunkId-VGF2|chunkType-head:VGF2</nowiki> <nowiki>36</nowiki> | <nowiki>ccof</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +<nowiki>32</nowiki> <nowiki>gayA</nowiki> <nowiki>jA</nowiki> <nowiki>VAUX</nowiki> <nowiki>v</nowiki> <nowiki>lex-jA|cat-v|gend-m|num-sg|pers-3|case-|vib-yA�|tam-yA1|posn-320|chunkType-child:VGF2|name-gayA</nowiki> | <nowiki>31</nowiki> <nowiki>lwg__vaux</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>33</nowiki> | <nowiki>,</nowiki> | <nowiki>,</nowiki> | <nowiki>SYM</nowiki> | <nowiki>s</nowiki> | <nowiki>lex-|cat-s|gend-punc|num-|pers-|case-|vib-|tam-|posn-330|chunkType-child:VGF2|name-,</nowiki> | <nowiki>31</nowiki> | <nowiki>rsym</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>34</nowiki> | <nowiki>mukaxamA</nowiki> | <nowiki>mukaxamA</nowiki> | <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-mukaxamA|cat-n|gend-m|num-sg|pers-3|case-d|vib-0|tam-0|posn-340|name-mukaxamA|chunkId-NP10|chunkType-head:NP10</nowiki> | <nowiki>35</nowiki> | <nowiki>k1</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>35</nowiki> | <nowiki>calA</nowiki> | <nowiki>cala</nowiki> | <nowiki>VM</nowiki> | <nowiki>v</nowiki> | <nowiki>lex-cala|cat-v|gend-m|num-sg|pers-3|case-|vib-yA|tam-yA|hlt-true|stype-declarative|posn-350|voicetype-active|name-calA|chunkId-VGF3|chunkType-head:VGF3</nowiki> | <nowiki>36</nowiki> | <nowiki>ccof</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>36</nowiki> | <nowiki>Ora</nowiki> | <nowiki>Ora</nowiki> | <nowiki>CC</nowiki> | <nowiki>avy</nowiki> | <nowiki>lex-Ora|cat-avy|gend-|num-|pers-|case-|vib-|tam-|posn-360|name-Ora|chunkId-CCP2|chunkType-head:CCP2</nowiki> | <nowiki>25</nowiki> | <nowiki>ccof</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>37</nowiki> | <nowiki>sajA</nowiki> | <nowiki>sajA</nowiki> | <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-sajA|cat-n|gend-f|num-sg|pers-3|case-d|vib-0|tam-0|posn-370|name-sajA|chunkId-NP11|chunkType-head:NP11</nowiki> | <nowiki>38</nowiki> | <nowiki>k1</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>38</nowiki> | <nowiki>huI</nowiki> | <nowiki>ho</nowiki> | <nowiki>VM</nowiki> | <nowiki>v</nowiki> | <nowiki>lex-ho|cat-v|gend-f|num-sg|pers-3|case-|vib-yA|tam-yA|stype-declarative|posn-380|voicetype-active|name-huI|chunkId-VGF4|chunkType-head:VGF4</nowiki> | <nowiki>36</nowiki> <nowiki>ccof</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>39</nowiki> | <nowiki>.</nowiki> | <nowiki>.</nowiki> | <nowiki>SYM</nowiki> | <nowiki>punc</nowiki> | <nowiki>lex-.|cat-punc|gend-|num-|pers-|case-|vib-|tam-|posn-390|chunkType-child:VGF4|name-.</nowiki> <nowiki>38</nowiki> <nowiki>rsym</nowiki> <nowiki>_</nowiki> <nowiki>_</nowiki> |
  
-And after conversion of the WX encoding to the Bengali script in UTF-8:+And after conversion of the WX encoding to the Devanagari script in UTF-8:
  
-| 1 | পরবর্তীকালে পরবর্তীকালে NP NN | lex-parabarwIkAle<nowiki>|</nowiki>cat-adv<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-<nowiki>|</nowiki>pers-<nowiki>|</nowiki>case-<nowiki>|</nowiki>vib-<nowiki>|</nowiki>tam-<nowiki>|</nowiki>head-parabarwIkAle<nowiki>|</nowiki>name-NP | k7t | _ | _ | +<nowiki>1</nowiki> <nowiki>कोट</nowiki> <nowiki>कोट</nowiki> <nowiki>XC</nowiki> <nowiki>n</nowiki> <nowiki>lex-kota|cat-n|gend-m|num-sg|pers-3|case-d|vib-0|tam-0|posn-10|chunkType-child:NP|name-kota</nowiki> | <nowiki>4</nowiki> <nowiki>mod</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> 
-অফিস-বিযারারদের অফিস-বিযারারদের NP NN lex-aPisa-biyArAraxera<nowiki>|</nowiki>cat-unk<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-<nowiki>|</nowiki>pers-<nowiki>|</nowiki>case-<nowiki>|</nowiki>vib-<nowiki>|</nowiki>tam-<nowiki>|</nowiki>head-aPisa-biyArAraxera<nowiki>|</nowiki>name-NP2 | r6 | _ | _ | +<nowiki>2</nowiki> <nowiki>लखपत</nowiki> | <nowiki>लखपत</nowiki> <nowiki>XC</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-laKapawa|cat-n|gend-m|num-sg|pers-3|case-d|vib-0|tam-0|posn-20|chunkType-child:NP|name-laKapawa</nowiki> | <nowiki>4</nowiki> <nowiki>mod</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
-নাম নাম NP NN | lex-nAma<nowiki>|</nowiki>cat-n<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-sg<nowiki>|</nowiki>pers-<nowiki>|</nowiki>case-d<nowiki>|</nowiki>vib-0<nowiki>|</nowiki>tam-0<nowiki>|</nowiki>head-nAma<nowiki>|</nowiki>name-NP3 k2 | _ | _ | +| <nowiki>3</nowiki> | <nowiki>जेल</nowiki> | <nowiki>जेल</nowiki> | <nowiki>XC</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-jela|cat-n|gend-m|num-sg|pers-3|case-d|vib-0|tam-0|posn-30|chunkType-child:NP|name-jela</nowiki> | <nowiki>4</nowiki> <nowiki>mod</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
-ঘোষণা ঘোষণা NP NN | lex-GoRaNA<nowiki>|</nowiki>cat-unk<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-<nowiki>|</nowiki>pers-<nowiki>|</nowiki>case-<nowiki>|</nowiki>vib-<nowiki>|</nowiki>tam-<nowiki>|</nowiki>head-GoRaNA<nowiki>|</nowiki>name-NP4 pof | _ | _ | +| <nowiki>4</nowiki> | <nowiki>लाहौर</nowiki> | <nowiki>लाहौर</nowiki> | <nowiki>NNP</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-lAhOra|cat-n|gend-m|num-sg|pers-3|case-o|vib-0_meM|tam-0|posn-40|vpos-vib_5|name-lAhOra|chunkId-NP|chunkType-head:NP</nowiki> <nowiki>6</nowiki> <nowiki>jjmod</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> 
-করার কর্ VGNN VM | lex-kar<nowiki>|</nowiki>cat-n<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-<nowiki>|</nowiki>pers-any<nowiki>|</nowiki>case-<nowiki>|</nowiki>vib-<nowiki>|</nowiki>tam-<nowiki>|</nowiki>head-karAra<nowiki>|</nowiki>name-VGNN r6 | _ | _ | +<nowiki>5</nowiki> <nowiki>में</nowiki> | <nowiki>में</nowiki> | <nowiki>PSP</nowiki> | <nowiki>psp</nowiki> | <nowiki>lex-meM|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-50|chunkType-child:NP|name-meM</nowiki> | <nowiki>4</nowiki> <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> | 
-সময্ সময্ NP NN | lex-samay<nowiki>|</nowiki>cat-unk<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-<nowiki>|</nowiki>pers-<nowiki>|</nowiki>case-<nowiki>|</nowiki>vib-<nowiki>|</nowiki>tam-<nowiki>|</nowiki>head-samay<nowiki>|</nowiki>name-NP5 k7t | _ | _ | +| <nowiki>6</nowiki> <nowiki>बंद</nowiki> | <nowiki>बंद</nowiki> <nowiki>JJ</nowiki> | <nowiki>adj</nowiki> | <nowiki>lex-baMxa|cat-adj|gend-any|num-any|pers-|case-o|vib-|tam-|posn-60|name-baMxa|chunkId-JJP|chunkType-head:JJP</nowiki> | <nowiki>8</nowiki> <nowiki>nmod</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki>
-অনিমেষকে অনিমেষকে NP NNP | lex-animeRake<nowiki>|</nowiki>cat-unk<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-<nowiki>|</nowiki>pers-<nowiki>|</nowiki>case-<nowiki>|</nowiki>vib-<nowiki>|</nowiki>tam-<nowiki>|</nowiki>head-animeRake<nowiki>|</nowiki>name-NP6 k2 | _ | _ | +<nowiki>7</nowiki> | <nowiki>सरबजीत</nowiki> | <nowiki>सरबजीत</nowiki> | <nowiki>XC</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-sarabajIwa|cat-n|gend-m|num-sg|pers-3|case-d|vib-0|tam-0|posn-70|chunkType-child:NP2|name-sarabajIwa</nowiki> | <nowiki>8</nowiki> | <nowiki>mod</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
-সরিযে সরিযে VGF VM | lex-sariye<nowiki>|</nowiki>cat-unk<nowiki>|</nowiki>gend-<nowiki>|</nowiki>num-<nowiki>|</nowiki>pers-5<nowiki>|</nowiki>case-<nowiki>|</nowiki>vib-0_rAKA+ka_ha+la<nowiki>|</nowiki>tam-<nowiki>|</nowiki>head-sariye<nowiki>|</nowiki>name-VGF main | _ | _ |+| <nowiki>8</nowiki> | <nowiki>सिंह</nowiki> | <nowiki>सिंह</nowiki> | <nowiki>NNP</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-siMha|cat-n|gend-m|num-sg|pers-3|case-o|vib-0_ne|tam-0|posn-80|vpos-vib_3|name-siMha|chunkId-NP2|chunkType-head:NP2</nowiki> <nowiki>24</nowiki> <nowiki>k1</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +<nowiki>9</nowiki> <nowiki>ने</nowiki> <nowiki>ने</nowiki> <nowiki>PSP</nowiki> <nowiki>psp</nowiki> <nowiki>lex-ne|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-90|chunkType-child:NP2|name-ne</nowiki> | <nowiki>8</nowiki> <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki>
 +<nowiki>10</nowiki> | <nowiki>मंगलवार</nowiki> | <nowiki>मंगलवार</nowiki> | <nowiki>NNP</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-maMgalavAra|cat-n|gend-m|num-sg|pers-3|case-o|vib-0_ko|tam-0|posn-100|vpos-vib_2|name-maMgalavAra|chunkId-NP3|chunkType-head:NP3</nowiki> | <nowiki>24</nowiki> <nowiki>k7t</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>11</nowiki> | <nowiki>को</nowiki> | <nowiki>को</nowiki> | <nowiki>PSP</nowiki> | <nowiki>psp</nowiki> | <nowiki>lex-ko|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-110|chunkType-child:NP3|name-ko</nowiki> | <nowiki>10</nowiki> | <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>12</nowiki> | <nowiki>भारतीय</nowiki> | <nowiki>भारतीय</nowiki> | <nowiki>JJ</nowiki> | <nowiki>adj</nowiki> | <nowiki>lex-BArawIya|cat-adj|gend-any|num-any|pers-|case-o|vib-|tam-|posn-120|chunkType-child:NP4|name-BArawIya</nowiki> | <nowiki>13</nowiki> | <nowiki>nmod__adj</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>13</nowiki> | <nowiki>दूतावास</nowiki> | <nowiki>दूतावास</nowiki> | <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-xUwAvAsa|cat-n|gend-m|num-sg|pers-3|case-o|vib-0_kA|tam-0|posn-130|vpos-vib_3|name-xUwAvAsa|chunkId-NP4|chunkType-head:NP4</nowiki> | <nowiki>16</nowiki> <nowiki>r6</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>14</nowiki> | <nowiki>के</nowiki> | <nowiki>का</nowiki> | <nowiki>PSP</nowiki> | <nowiki>psp</nowiki> | <nowiki>lex-kA|cat-psp|gend-m|num-pl|pers-|case-o|vib-|tam-|posn-140|chunkType-child:NP4|name-ke</nowiki> <nowiki>13</nowiki> <nowiki>lwg__psp</nowiki> <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +<nowiki>15</nowiki> <nowiki>दो</nowiki> <nowiki>दो</nowiki> <nowiki>QC</nowiki> <nowiki>num</nowiki> <nowiki>lex-xo|cat-num|gend-any|num-pl|pers-|case-o|vib-|tam-|posn-150|chunkType-child:NP5|name-xo</nowiki> | <nowiki>16</nowiki> <nowiki>nmod__adj</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> | 
 +| <nowiki>16</nowiki> | <nowiki>अधिकारियों</nowiki> | <nowiki>अधिकारी</nowiki> | <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-aXikArI|cat-n|gend-m|num-pl|pers-3|case-o|vib-0_ko|tam-0|posn-160|vpos-vib_3|name-aXikAriyoM|chunkId-NP5|chunkType-head:NP5</nowiki> | <nowiki>24</nowiki> | <nowiki>k4</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>17</nowiki> | <nowiki>को</nowiki> | <nowiki>को</nowiki> | <nowiki>PSP</nowiki> | <nowiki>psp</nowiki> | <nowiki>lex-ko|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-170|chunkType-child:NP5|name-ko2</nowiki> | <nowiki>16</nowiki> | <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>18</nowiki> | <nowiki>अपने</nowiki> | <nowiki>अपना</nowiki> | <nowiki>PRP</nowiki> | <nowiki>pn</nowiki> | <nowiki>lex-apanA|cat-pn|gend-any|num-sg|pers-1|case-o|vib-0_bAre_meM|tam-0|posn-180|vpos-vib_2_3|name-apane|chunkId-NP6|chunkType-head:NP6</nowiki> | <nowiki>24</nowiki> | <nowiki>k7</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>19</nowiki> | <nowiki>बारे</nowiki> | <nowiki>बारे</nowiki> | <nowiki>PSP</nowiki> | <nowiki>psp</nowiki> | <nowiki>lex-bAre|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-190|chunkType-child:NP6|name-bAre</nowiki> | <nowiki>18</nowiki> | <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>20</nowiki> | <nowiki>में</nowiki> | <nowiki>में</nowiki> | <nowiki>PSP</nowiki> | <nowiki>psp</nowiki> | <nowiki>lex-meM|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-200|chunkType-child:NP6|name-meM2</nowiki> | <nowiki>18</nowiki> <nowiki>lwg__psp</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>21</nowiki> | <nowiki>तमाम</nowiki> | <nowiki>तमाम</nowiki> | <nowiki>JJ</nowiki> | <nowiki>adj</nowiki> | <nowiki>lex-wamAma|cat-adj|gend-any|num-any|pers-|case-d|vib-|tam-|posn-210|chunkType-child:NP7|name-wamAma</nowiki> <nowiki>23</nowiki> <nowiki>nmod__adj</nowiki> <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +<nowiki>22</nowiki> <nowiki>व्यक्तिगत</nowiki> <nowiki>व्यक्तिगत</nowiki> <nowiki>JJ</nowiki> <nowiki>adj</nowiki> <nowiki>lex-vyakwigawa|cat-adj|gend-any|num-any|pers-|case-d|vib-|tam-|posn-220|chunkType-child:NP7|name-vyakwigawa</nowiki> | <nowiki>23</nowiki> | <nowiki>nmod__adj</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>23</nowiki> | <nowiki>जानकारियां</nowiki> | <nowiki>जानकारियां</nowiki> | <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-jAnakAriyAM|cat-n|gend-f|num-pl|pers-3|case-d|vib-0|tam-0|posn-230|name-jAnakAriyAM|chunkId-NP7|chunkType-head:NP7</nowiki> | <nowiki>24</nowiki> <nowiki>k2</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki>
 +<nowiki>24</nowiki> <nowiki>दीं</nowiki> | <nowiki>दे</nowiki> <nowiki>VM</nowiki> | <nowiki>v</nowiki> | <nowiki>lex-xe|cat-v|gend-f|num-pl|pers-3|case-|vib-yA|tam-yA|stype-declarative|posn-240|voicetype-active|name-xIM|chunkId-VGF|chunkType-head:VGF</nowiki> | <nowiki>0</nowiki> <nowiki>main</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki>
 +<nowiki>25</nowiki> | <nowiki>कि</nowiki> | <nowiki>कि</nowiki> | <nowiki>CC</nowiki> | <nowiki>avy</nowiki> | <nowiki>lex-ki|cat-avy|gend-|num-|pers-|case-|vib-|tam-|posn-250|name-ki|chunkId-CCP|chunkType-head:CCP</nowiki> <nowiki>23</nowiki> | <nowiki>rs</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +<nowiki>26</nowiki> <nowiki>किन</nowiki> <nowiki>कौन</nowiki> <nowiki>WQ</nowiki> <nowiki>pn</nowiki> <nowiki>lex-kOna|cat-pn|gend-any|num-pl|pers-3|case-o|vib-|tam-|posn-260|chunkType-child:NP8|name-kina</nowiki> | <nowiki>27</nowiki> <nowiki>mod__wq</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> | 
 +| <nowiki>27</nowiki> <nowiki>परिस्थितियों</nowiki> | <nowiki>परिस्थिति</nowiki> <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-parisWiwi|cat-n|gend-f|num-pl|pers-3|case-o|vib-0_meM|tam-0|posn-270|vpos-vib_3|name-parisWiwiyoM|chunkId-NP8|chunkType-head:NP8</nowiki> | <nowiki>31</nowiki> <nowiki>k7</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +<nowiki>28</nowiki> <nowiki>में</nowiki> | <nowiki>में</nowiki> | <nowiki>PSP</nowiki> | <nowiki>psp</nowiki> | <nowiki>lex-meM|cat-psp|gend-|num-|pers-|case-|vib-|tam-|posn-280|chunkType-child:NP8|name-meM3</nowiki> <nowiki>27</nowiki> <nowiki>lwg__psp</nowiki> <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +<nowiki>29</nowiki> <nowiki>उसे</nowiki> <nowiki>वह</nowiki> <nowiki>PRP</nowiki> <nowiki>pn</nowiki> <nowiki>lex-vaha|cat-pn|gend-any|num-sg|pers-3|case-o|vib-ko|tam-ko|posn-290|name-use|chunkId-NP9|chunkType-head:NP9</nowiki> | <nowiki>31</nowiki> <nowiki>k2</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> | 
 +| <nowiki>30</nowiki> <nowiki>गिरफ्तार</nowiki> | <nowiki>गिरफ्तार</nowiki> <nowiki>JJ</nowiki> | <nowiki>adj</nowiki> | <nowiki>lex-giraPwAra|cat-adj|gend-any|num-any|pers-|case-|vib-|tam-|posn-300|name-giraPwAra|chunkId-JJP2|chunkType-head:JJP2</nowiki> | <nowiki>31</nowiki> <nowiki>pof</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki>
 +<nowiki>31</nowiki> <nowiki>किया</nowiki> | <nowiki>कर</nowiki> | <nowiki>VM</nowiki> | <nowiki>v</nowiki> | <nowiki>lex-kara|cat-v|gend-m|num-sg|pers-3|case-|vib-yA_jA+yA�|tam-yA|stype-declarative|posn-310|voicetype-passive|vpos-tam_2|name-kiyA|chunkId-VGF2|chunkType-head:VGF2</nowiki> <nowiki>36</nowiki> | <nowiki>ccof</nowiki> | <nowiki>_</nowiki> <nowiki>_</nowiki> 
 +<nowiki>32</nowiki> <nowiki>गया</nowiki> <nowiki>जा</nowiki> <nowiki>VAUX</nowiki> <nowiki>v</nowiki> <nowiki>lex-jA|cat-v|gend-m|num-sg|pers-3|case-|vib-yA�|tam-yA1|posn-320|chunkType-child:VGF2|name-gayA</nowiki> | <nowiki>31</nowiki> <nowiki>lwg__vaux</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>33</nowiki> | <nowiki>,</nowiki> | <nowiki>,</nowiki> | <nowiki>SYM</nowiki> | <nowiki>s</nowiki> | <nowiki>lex-|cat-s|gend-punc|num-|pers-|case-|vib-|tam-|posn-330|chunkType-child:VGF2|name-,</nowiki> | <nowiki>31</nowiki> | <nowiki>rsym</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>34</nowiki> | <nowiki>मुकदमा</nowiki> | <nowiki>मुकदमा</nowiki> | <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-mukaxamA|cat-n|gend-m|num-sg|pers-3|case-d|vib-0|tam-0|posn-340|name-mukaxamA|chunkId-NP10|chunkType-head:NP10</nowiki> | <nowiki>35</nowiki> | <nowiki>k1</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>35</nowiki> | <nowiki>चला</nowiki> | <nowiki>चल</nowiki> | <nowiki>VM</nowiki> | <nowiki>v</nowiki> | <nowiki>lex-cala|cat-v|gend-m|num-sg|pers-3|case-|vib-yA|tam-yA|hlt-true|stype-declarative|posn-350|voicetype-active|name-calA|chunkId-VGF3|chunkType-head:VGF3</nowiki> | <nowiki>36</nowiki> | <nowiki>ccof</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>36</nowiki> | <nowiki>और</nowiki> | <nowiki>और</nowiki> | <nowiki>CC</nowiki> | <nowiki>avy</nowiki> | <nowiki>lex-Ora|cat-avy|gend-|num-|pers-|case-|vib-|tam-|posn-360|name-Ora|chunkId-CCP2|chunkType-head:CCP2</nowiki> | <nowiki>25</nowiki> | <nowiki>ccof</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>37</nowiki> | <nowiki>सजा</nowiki> | <nowiki>सजा</nowiki> | <nowiki>NN</nowiki> | <nowiki>n</nowiki> | <nowiki>lex-sajA|cat-n|gend-f|num-sg|pers-3|case-d|vib-0|tam-0|posn-370|name-sajA|chunkId-NP11|chunkType-head:NP11</nowiki> | <nowiki>38</nowiki> | <nowiki>k1</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>38</nowiki> | <nowiki>हुई</nowiki> | <nowiki>हो</nowiki> | <nowiki>VM</nowiki> | <nowiki>v</nowiki> | <nowiki>lex-ho|cat-v|gend-f|num-sg|pers-3|case-|vib-yA|tam-yA|stype-declarative|posn-380|voicetype-active|name-huI|chunkId-VGF4|chunkType-head:VGF4</nowiki> | <nowiki>36</nowiki> <nowiki>ccof</nowiki> | <nowiki>_</nowiki> | <nowiki>_</nowiki>
 +| <nowiki>39</nowiki> | <nowiki>.</nowiki> | <nowiki>.</nowiki> | <nowiki>SYM</nowiki> | <nowiki>punc</nowiki> | <nowiki>lex-.|cat-punc|gend-|num-|pers-|case-|vib-|tam-|posn-390|chunkType-child:VGF4|name-.</nowiki> <nowiki>38</nowiki> <nowiki>rsym</nowiki> <nowiki>_</nowiki> <nowiki>_</nowiki> |
  
 The first sentence of the ICON 2010 test data (with fine-grained syntactic tags) in the Shakti format: The first sentence of the ICON 2010 test data (with fine-grained syntactic tags) in the Shakti format:

[ Back to the navigation ] [ Back to the content ]