Differences
This shows you the differences between two versions of the page.
spark [2014/11/03 17:18] straka |
spark [2014/11/04 09:28] straka |
| |
[[http://spark.apache.org|Spark]] is a framework for distributed computations. It natively supports Python, Scala and Java, and offers limited support for Perl via pipes. | [[http://spark.apache.org|Spark]] is a framework for distributed computations. It natively supports Python, Scala and Java, and offers limited support for Perl via pipes. |
| |
| {{ :spark:spark-logo.png?150}} |
| |
Apart from embarrassingly parallel computations, the Spark framework is well suited to //in-memory// and/or //iterative// computations, making it suitable even for machine learning and complex data processing. (The Spark framework shares some underlying implementation with [[http://hadoop.apache.org/|Hadoop]], but it is quite different -- the Hadoop framework does not offer in-memory computations and has only limited support for iterative computations.) | Apart from embarrassingly parallel computations, the Spark framework is well suited to //in-memory// and/or //iterative// computations, making it suitable even for machine learning and complex data processing. (The Spark framework shares some underlying implementation with [[http://hadoop.apache.org/|Hadoop]], but it is quite different -- the Hadoop framework does not offer in-memory computations and has only limited support for iterative computations.) |
===== Recipes ===== | ===== Recipes ===== |
| |
* [[spark:recipes:Reading Input and Writing Output]] | * [[spark:recipes:Reading Text Files]] |
| * [[spark:recipes:Writing Text Files]] |
| * [[spark:recipes:Storing Data in Binary Format]] |
* [[spark:recipes:Using Perl via Pipes]] | * [[spark:recipes:Using Perl via Pipes]] |