[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Last revision Both sides next revision
courses:mapreduce-tutorial:step-2 [2012/01/25 21:49]
straka
courses:mapreduce-tutorial:step-2 [2012/01/28 10:27]
majlis
Line 1: Line 1:
 ====== MapReduce tutorial : Input and output format, testing data. ====== ====== MapReduce tutorial : Input and output format, testing data. ======
  
-The MapReduce framework is frequently using (key, value) pairs. These +The MapReduce framework is frequently using (key, value) pairs. These pairs can be read from a file and written to a file and there are several formats available.
-pairs can be read from a file and written to a file and there are several formats available.+
  
 ===== Input formats ===== ===== Input formats =====
Line 24: Line 23:
   * ''/home/straka/wiki/cs-text-small'' -- uncompressed plain text files of Czech Wikipedia, 70kB.   * ''/home/straka/wiki/cs-text-small'' -- uncompressed plain text files of Czech Wikipedia, 70kB.
   * ''/home/straka/wiki/en-seq'' -- compressed SequenceFile of English Wikipedia, 1.9GB.   * ''/home/straka/wiki/en-seq'' -- compressed SequenceFile of English Wikipedia, 1.9GB.
 +
 +----
 +
 +<html>
 +<table style="width:100%">
 +<tr>
 +<td style="text-align:left; width: 33%; "></html>[[step-1|Step 1]]: Setting the environment.<html></td>
 +<td style="text-align:center; width: 33%; "></html>[[.|Overview]]<html></td>
 +<td style="text-align:right; width: 33%; "></html>[[step-3|Step 3]]: Basic mapper.<html></td>
 +</tr>
 +</table>
 +</html>
  

[ Back to the navigation ] [ Back to the content ]