CS294 - The Mathematics of Information and Data
CS294 - The Mathematics of Information and Data

Instructor:Ben Recht
Time:  Fri 12:00-3:00 PM
Location: 373 Soda Hall


Description: This course will explore the foundations of an emerging discipline: the mathematics of information and data. Through recent and classic texts in mathematical statistics, optimization, and computer science, we will find unifying themes in these three disciplinary approaches. We will draw connections between how we analyze running time, statistical accuracy, and implementation of data-driven computations. We will focus in particular on large deviation inequalities, convex analysis and their applications in minimax statistics; sparse and stochastic optimization; and discrete and convex geometry. This course is ideal for advanced graduate students who would like to apply these theoretical and algorithmic developments to their own research.

The current list of topics (which will change depending on the course we chart) is:

  1. Stochastic Optimization
    • stochastic gradients, online learning, and the Kaczmarz algorithm
    • core sets and importance sampling
    • randomized algorithms for linear systems
  2. Random Matrices
    • Elementary analysis of random matrices
    • Graph sparsification, frames, and matrix approximation
    • Noncommutative Chernoff Bounds
  3. Average Case Analysis of Optimization Problems
    • covering numbers, VC dimension, rademacher complexity
    • metric embedding and restricted isometries
    • compressed sensing and all that it has wrought

Grading: Each student will be required to attend class regularly and either lead the discussion scribe notes for at least one class.

Prerequisites: Consent of the instructor is required. Graduate level courses in probability and optimization will be necessary.


Lecture notes template


Session 1 (08/30): Introduction.

Session 2 (09/13): Stochastic gradient, online learning, and the Kaczmarz algorithm.
Discussion Leader: Ben Recht
Scribe: Jonathan Terhorst [notes]
   Readings:

Session 3 (09/20): Core-sets and importance sampling.
Discussion Leader: Nick Alteri and Nick Boyd
Scribe: Lisa Anne Hendricks [notes]
   Readings:

Session 4 (09/27): Noncommutative Chernoff bounds.
Discussion Leader: Ashia Wilson and John Duchi
Scribe: Miles Lopes
   Readings:

Session 5 (10/04): Graph Sparsificiation.
Discussion Leader:TBA
Scribe: TBA
   Readings:

Session 6 (10/11): Diagonally dominant systems.
Discussion Leader:TBA
Scribe: TBA
   Readings:

Session 7 (10/18): Statistical complexity.
Discussion Leader:TBA
Scribe: TBA
   Readings:

Session 8 (10/25): Inverse Problems 1.
Discussion Leader:TBA
Scribe: TBA
   Readings:

Session 9 (11/1): Inverse Problems 2.
Discussion Leader:TBA
Scribe: TBA
   Readings:

Session 10 (11/8): Learning representations.
Discussion Leader:TBA
Scribe: TBA
   Readings: