[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
courses:mapreduce-tutorial:step-14 [2012/01/26 23:18]
straka
courses:mapreduce-tutorial:step-14 [2012/01/31 16:08] (current)
dusek
Line 1: Line 1:
 ====== MapReduce Tutorial : Exercise - N-gram language model ====== ====== MapReduce Tutorial : Exercise - N-gram language model ======
  
-For a given //N// create a simple N-gram language model. You can experimenting on the following data:+For a given //N// create a simple N-gram language model. You can start experimenting on the following data:
 ^ Path ^ Size ^ ^ Path ^ Size ^
 | /home/straka/wiki/cs-seq-medium | 8MB | | /home/straka/wiki/cs-seq-medium | 8MB |
Line 18: Line 18:
 Try creating such index. Ideally, the sizes of resulting data files should be as equal as possible. Try creating such index. Ideally, the sizes of resulting data files should be as equal as possible.
  
 +----
 +
 +<html>
 +<table style="width:100%">
 +<tr>
 +<td style="text-align:left; width: 33%; "></html>[[step-13|Step 13]]: Sorting.<html></td>
 +<td style="text-align:center; width: 33%; "></html>[[.|Overview]]<html></td>
 +<td style="text-align:right; width: 33%; "></html>[[step-15|Step 15]]: K-means clustering.<html></td>
 +</tr>
 +</table>
 +</html>

[ Back to the navigation ] [ Back to the content ]