Tentative Lecture Schedule


Lecture Date Topic  Reading Comments
1 Jan 21 Overview, Logistics, Goals
2 Jan 26 Datacenter Architecture Examples (I)
  1. An Architecture for Modular Data Centers, James Hamilton
  2. MySpace Architecture, Dan Farino
  3. Web Search for a Planet: The Google Cluster Architecture, L. A. Barroso, J. Dean, U. Holzle
  4. eBay Architecture (lessons)
  5. Facebook Architecture Presentation, Jeff Hammerbacher
3 Jan 28 Datacenter Architecture Examples (II)
4 Feb 2 Failures in Datacenters
  1. Crash: Data Center Horror Stories
  2. Data Center Failure As A Learning Experience
  3. Generator Failures Caused 365 Main Outage
  4. How to Improve your Data Center Operations
5 Feb 4 Failure Diagnosis and High Availability
  1. Designing a highly availabile directory service
  2. An Empirical Study on Data Center System Failure Diagnosis
6 Feb 9 Datacenter Networking
  1. A Scalable, Commodity Data Center Network Architecture
  2. DCell: A Scalable and Fault-Tolerant Network Structure for Data
7 Feb 11 Google Storage Stack
  1. The Google File System
  2. The Chubby Lock Service for Loosely-Coupled Distributed Systems
  3. Bigtable: A Distributed Storage System for Structured Data
Feb 16 Presidents' Day Holiday
8 Feb 18 Dynamo [Peter Vosshall]
    Dynamo: Amazon's Highly Available Key-Value Store
9 Feb 23 Initial Project Proposal Presentations
10 Feb 25 Other Structured and Relational Storage
  1. HIVE: Data Warehousing & Analytics on Hadoop
  2. Cassandra
  3. SCADS
11 March 2 Pig Latin Presentation [Chris Olston]
    Pig Latin: A Not-So-Foreign Language for Data Processing
12 March 4 Computation Models [Mihai Budiu, MSR, presentation on DryadLINQ]
  1. MapReduce: Simplified Data Processing on Large Clusters
  2. Dryad: Distributed Data-Parallel Programs from Sequential Building Blocks
  3. DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language
13 March 9 Resource Management and Scheduling
  1. LATE
  2. ...
14 March 11 Software Platforms
  1. Azures (I), Azures (II)
  2. Comprehensive review of security and vulnerability protections
  3. AWS (1), AWS (II)
15 March 16 Tracing and Monitoring (I)
  1. Artemis
  2. Scribe
  3. Chaqwua
  4. XTrace
16 March 18 Tracing and Monitoring (II)
  1. DTrace: Dynamic Instrumentation of Production Systems
  2. Liblog
March 23 Spring break
March 25 Spring break
17 March 30 Language Support: Erlang
  1. Open-source Erlang - White Paper
  2. Erlang - A survey of the language and its industrial applications
  3. Erlang, or Utility-computing vs. appliance computing
18 April 1 Language Support: Ruby on Rail
    TBA
19 April 6 Development Environments
  1. AJAX + LAMP
  2. Google Geers
20 April 8 Cloud Economics
    TBA
21 April 13 Cloud Interoperability
  1. Portable computing
  2. Cloud: commodity or proprietary?
  3. AppDrop
22 April 15 From Datacenter to DatacenterS
    Scalling Out
23 April 20 CDNs as Cloud Computing Applications
    TBA
24 April 22 P2P: Pushing the Cloud to the Edge
    TBA
25 April 27 Energy Efficiency in Datacenters
  1. Why is Energy efficiency Important?
  2. Reducing Data Center Energy Consumption
  3. Data Center Energy Efficiency
26 April 29 TBA
27 May 4 TBA
28 May 6 Poster Session (6th floor alcove)
29 May 11 Course Summary / Course Evaluation