Both sides previous revision
Previous revision
Next revision
|
Previous revision
|
user:majlis:hadoop [2011/11/06 14:29] majlis |
user:majlis:hadoop [2012/01/21 17:29] (current) majlis |
| [[courses:mapreduce-tutorial|MapReduce Tutorial - official course]] |
| |
====== Hadoop & MapReduce ====== | ====== Hadoop & MapReduce ====== |
* [[http://en.wikipedia.org/wiki/Hadoop|Hadoop - Wikipedii]] | * [[http://en.wikipedia.org/wiki/Hadoop|Hadoop - Wikipedii]] |
* [[http://en.wikipedia.org/wiki/MapReduce|MapReduce - Wikipedia]] | * [[http://en.wikipedia.org/wiki/MapReduce|MapReduce - Wikipedia]] |
* [[http://labs.google.com/papers/mapreduce.html|MapReduce: Simplified Data Processing on Large Clusters (original paper)]] + [[http://scholar.google.cz/scholar?cites=10940266603640308767&as_sdt=2005&sciodt=0,5|3650 citations]] | * [[http://labs.google.com/papers/mapreduce.html|MapReduce: Simplified Data Processing on Large Clusters (original paper)]] + [[http://scholar.google.cz/scholar?cites=10940266603640308767&as_sdt=2005&sciodt=0,5|3650 citations]] |
| |
===== Tasks ===== | |
^ Status ^ Task ^ Info ^ | |
| 1 / 10 | [[.:hadoop:sge]] | Hadoop on SGE cluster | | |
| 1 / 10 | [[.:hadoop:perl]] | MapReduce tasks in Perl | | |
| 0 / 10 | [[.:hadoop:hdfs]] | User friendly access to HDFS | | |
| |
===== Books ===== | ===== Books ===== |
* [[http://www.umiacs.umd.edu/~jimmylin/book.html|Data-Intensive Text Processing with MapReduce]] | * [[http://www.umiacs.umd.edu/~jimmylin/book.html|Data-Intensive Text Processing with MapReduce]] - contains links to courses that are using this book |
| * [[http://i.stanford.edu/~ullman/mmds.html|Mining of Massive Datasets]] |
| |
===== Tutorials ===== | ===== Tutorials ===== |
* [[http://www.mendeley.com/groups/1058401/mapreduce-applications/papers/|Mendeley - MapReduce]] | * [[http://www.mendeley.com/groups/1058401/mapreduce-applications/papers/|Mendeley - MapReduce]] |
* [[http://www.columbia.edu/~ak2834/mapreduce.html|List of papers]] | * [[http://www.columbia.edu/~ak2834/mapreduce.html|List of papers]] |
| |
| ===== Courses ===== |
| * [[http://lintool.github.com/Cloud9/]] |
| * [[http://dicta-f11.utcompling.com/schedule|Data-Intensive Computing for Text Analysis]] - contains slides + homeworks |
| * [[http://courses.cs.tamu.edu/caverlee/csce689/|Internet-Scale Data Management]] - each class covers one general topic |
| * [[http://www.eurecom.fr/~michiard/CCSS.html|Summer School on Cloud Computing: Challenges and opportunities]] - 220 slides |
| * [[http://www.stanford.edu/class/cs341/cs341-10-proj/index.html|Project in Mining Massive Data Sets]] |
| |
===== Related projects ===== | ===== Related projects ===== |
* [[http://mahout.apache.org/|Scalable machine learning and data mining]] | * [[http://mahout.apache.org/|Scalable machine learning and data mining]] |
| |
| |
| |
| |