CS262a: Reading Assignment #16
Due Wednesday, March 16th
For Monday, read the following two papers:
A Comparison of Approaches to Large-Scale Data Analysis
Andrew Pavlo, Erik Paulson, Alexander Rasin, Daniel J. Abadi, David J. DeWitt, Samuel Madden, and Michael Stonebraker. Appears in Proceedings of the ACM SIGMOD International Conference on Management of Data, 2009
Jockey: Guaranteed Job Latency in Data Parallel Clusters
Andrew D. Ferguson, Peter Bodik, Srikanth Kandula, Eric Boutin, and Rodrigo Fonseca. Appears in Proceedings of the European Conference on Computer Systems (EuroSys), 2012
You must also:
Submit a summary for each paper.
Optional Reading (direct consequence of paper #1):
MapReduce and Parallel DBMSs: Friends or Foes?
Michael Stonebraker, Daniel Abadi, David J. DeWitt, Sam Madden, Erik Paulson, Andrew Pavlo, and Alexander Rasin. Appears in Communications of the ACM (CACM), Vol. 53, No. 1, pp. 64-71, January 2010
MapReduce: A Flexible Data Processing Tool
Jeffrey Dean and Sanjay Ghemawat. Appears in Communications of the ACM (CACM), Vol. 53, No. 1, pp. 72-77, January 2010