CS262a: Reading Assignment #15
Due Monday, March 14th
For Wednesday, read the following two papers:
Parallel Database Systems: The Future of High Performance Database Systems
Dave DeWitt and Jim Gray. Appears in
Communications of the ACM,
Vol. 32, No. 6, June 1992
Spark: Cluster Computing with Working Sets
M. Zaharia, M. Chowdhury, M.J. Franklin, S. Shenker and I. Stoica. Appears in Proceedings of
HotCloud
2010, June 2010.
You must also:
Submit a summary for each paper.
Optional Readings for Paper #2:
MapReduce: simplified data processing on large clusters
Jeffrey Dean and Sanjay Ghemawat. Appears in
Communications of the ACM,
Vol. 51, No. 1, pp 107-113. January 2008
Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing
. Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave, Justin Ma, Murphy McCauley, Michael J. Franklin, Scott Shenker, Ion Stoica. Appears in
Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation
(NSDI), 2012
Back to CS262a page
Maintained by John Kubatowicz (
kubitron@cs.berkeley.edu
).
Last modified 9/2/2014