16-721 is a graduate seminar devoted to recent research on computer vision. We will be reading an eclectic mix of vision papers on topics such as perception, object and scene recognition, segmentation, tracking, as well as "best papers of all time".
We will meet on Mondays and Wednesdays from 10:30am-11:50am in NSH 3002. The first meeting will be on Monday January 16th, and the final meeting will be on Wednesday May 3, 2006.
Instructor: Alexei (Alyosha) Efros, Assistant Professor, 4207 Newell-Simon Hall.
Office Hours: Monday 12:-12:30 p.m.
Friday 2:30-3:30 p.m.
TA: David Bradley, 2216 Newell-Simon Hall.
Office Hours: Tuesday 1:00-2:00 p.m. or by appointment.
Feel free to send email to efros (at) cs or dbradley (at) cs with any questions.
Check out this list of data sources for some ideas on where to get images to work with.
Time
|
Monday (A)
|
Wednesday (A)
|
Monday (B)
|
Wednesday (B)
|
12:10
- 12:30
|
N/A
|
Zickler
|
N/A
|
Vallespi
|
12:30
- 12:50
|
Thompson & Dunlop
|
|
Batra & Kim
|
|
12:50
- 1:10
|
Ramnath
|
Chan & Barnum
|
||
1:10
- 1:30
|
Djugash
|
|
||
1:30
- 1:50
|
Melchior
|
|
A list of suggested papers to present is available here.
For some journal-length papers, shorter conference versions have been posted. Feel free to read either paper.
The discussion board for signing up for papers is now available here
Sign up for at least 2 papers, demo 1 and oppose 1.
If you want to change your presentation date, please arrange a swap with another student and notify the instructor at least two weeks in advance.
date |
Presenter |
paper title |
author(s) |
discussion |
slides |
Jan. 16 |
Alyosha Efros |
Introduction,
Vision: Measurement vs. Perception Administrative
stuff, overview of the course, datasets |
|
|
|
Jan. 18 |
Alyosha Efros |
Overview lecture on the physiology of vision Suggested reading: The Plenoptic
Function and the Elements of Early Vision (1991) |
Adelson & Bergen |
||
Jan. 23 |
Alyosha Efros Dave Thompson |
Overview lecture on theories of Visual Perception Vision
is getting easier every day (1995) What's up
in top-down processing? (1991) Pictorial art and vision (1991) |
Patrick Cavanagh |
||
Part I:
Low-level Vision (images as texture) |
|||||
Jan. 25 |
Peter Barnum Heather Dunlop |
Presenting: The
Earth Mover's Distance as a Metric for Image Retrieval. (conference version) Optional Presenting: Learning
to Detect Natural Image Boundaries Using Local Brightness, Color, and Texture
Cues (conference version) |
Rubner , Tomasi, & Guibas Rubner, Puzicha, Tomasi, & Buhmann Martin, Fowlkes, Malik |
||
Jan. 30 |
Jonathan Huang |
Statistics
of Natural Image Categories Optional Optional |
Torralba & Oliva Torralba, & Oliva Oliva & Torralba |
||
Feb. 1 |
Alyosha Efros |
Presenting an overview
of bag-of-words
appraoches:
Optional: When is scene recognition
just texture recognition?
Optional: Visual categorization with bags of keypoints
Optional: Object Categorization by
Learned Universal Visual Dictionary |
Renninger, L.W. & Malik, J G. Csurka, C. Bray, C.
Dance, and L. Fan Winn, A. Criminisi and T. Minka |
||
Feb. 6 |
David
Bradley |
Presenting: Object
Recognition with Informative Features and Linear Classification Optional |
Ullman, S., Vidal-Naquet, M. , and Sali, E Michel Vidal-Naquet, Shimon Ullman |
||
Feb. 8 |
Tomasz Malisiewicz (P) Alyosha Effros (O) |
A Bayesian
hierarchical model for learning natural scene categories. |
Fei-Fei and P. Perona Josef Sivic, Bryan
Russell, Alexei A. Efros, Andrew Zisserman, Bill Freeman |
||
Part II: Mid-level Vision (Image Segmentation) |
|||||
Feb. 13-15 |
Carlos Vallespi (P) Joseph Djugash (D) Gunhee Kim (O) |
Jianbo Shi; Malik, J. Weiss, Y. |
|||
Feb. 20 |
Mohit Gupta (P) |
Interactive Graph
Cuts for Optimal Boundary & Region Segmentation of Objects in ND Images Optional: Lazy
Snapping Optional: Video Object Cut and Paste (cool SIGGRAPH video) |
Boykov & Jolly Yin Li, Jian Sun, Chi-Keung Tang, Heung-Yeung Shum Yin Li, Jian Sun, Heung-Yeung Shum |
||
Feb. 22 |
|
Project Proposals |
|
|
|
Feb. 27 |
Derek Hoiem |
Derek Hoiem, Alexei Efros, Martial Hebert |
|
||
Mar. 1 |
Tomasz Malisiewicz (P) |
Tu and Zhu |
|||
Part
III: 2D Recognition |
|||||
Mar. 6 (A) |
Nicolas Chan
(P) Tomasz Malisiewicz (O) Pete Barnum (D) |
H. Schneiderman and T.
Kanade Viola, Jones |
|||
Mar. 8 (A) |
Pete Barnum (P) |
Dalal, Triggs |
|||
Mar.
13 |
|
Spring Break |
|
|
|
Mar.
15 |
|
Spring Break |
|
|
|
Mar. 20 (B) |
David Lee (P) Heather Dunlop (D) David Thompson (O) |
David G. Lowe |
|
||
Mar. 22 (B) |
Stephan Zickler (P) |
Real-time
Object Detection for Smart Vehicles Optional: Automatic Target Recognition by Matching Oriented Edge Pixels |
Gavrila & Philomin Olson & Huttenlocher |
|
|
Mar. 27 (A) |
Gunhee Kim (P) Joseph Djugash (O) ? (D) |
Shape Matching
and Object Recognition Using Shape Contexts Shape Matching and Object Recognition using Low Distortion Correspondences |
Belongie, Malik, and Puzicha A Berg, T Berg, J Malik |
|
|
Mar. 29 (A) |
Dhruv Batra (P) Krishnan Ramnath (D) |
T. F. Cootes, G. J. Edwards, C. J. Taylor |
|
||
Recognition with Segmentation |
|||||
Apr. 3 (B) |
Joseph Djugash (P) Heather Dunlop (O) |
Eran Borenstein, Shimon Ullman Eran Borenstein, Shimon Ullman E. Borenstein, |
|
|
|
Apr. 5 (B) |
Dhruv Batra (P) |
B Leibe, |
|
|
|
Apr. 10 (A) |
Nik Melchior (P) David Lee (O) |
LOCUS: Learning
Object Classes with Unsupervised Segmentation |
J. Winn and N. Jojic |
|
|
Apr. 12 (A) |
David Lee (P) Gunhee Kim (D) |
Context-based
vision system for place and object recognition Optional:
Contextual
Models for Object Detection using Boosted Random Fields |
A. Torralba, K. P. Murphy, W. T. Freeman and M. A. Rubin |
|
|
Apr. 17 (B) |
Heather Dunlop (P) |
Object
recognition as machine translation: Learning a lexicon for a fixed image
vocabulary |
Pinar Duygulu, Kobus Barnard, Nando de Freitas, and David Forsyth Kobus Barnard, Pinar Duygulu, Nando de Freitas, David Forsyth, David Blei, and Michael I. Jordan |
|
|
Apr. 19 (B) |
Krishnan Ramnath(P) |
Tamara L. Berg, Alexander C. Berg, Jaety Edwards, Michael Maire, Ryan White, Yee Whye Teh, Erik Learned-Miller, David A. Forsyth |
|
|
|
Apr. 24 (A) |
Stephan Zickler (O) Mohit Gupta (P) Mohit Gupta (D) |
The
Perception of Shading and Reflectance |
Adelson & Pentland Yair Weiss |
|
|
Apr. 26 (A) |
Malola Prasath (P) |
Marshall F Tappen, William T Freeman, Edward H Adelson |
|
|
|
May. 1 |
Dave Thompson (P) Jonathan
Huang (O) |
A global
geometric framework for nonlinear dimensionality reduction Nonlinear
dimensionality reduction by locally linear embedding |
J. B. Tenenbaum, V. De
Silva, and J. C. Langford Sam Roweis & Lawrence Saul |
|
|
May. 3 |
|
Project Presentations |
|
|
|
Vision Science:
Photons to Phenomenology by Stephen E. Palmer
Computer Vision: A Modern
Approach, Forsyth and Ponce
Introductory
Techniques for 3-D Computer Vision Trucco and Verri
An Invitation to 3D Vision:
From Images to Geometric Models, Y. Ma, S. Soatto,
J. Kosecka, S. Sastry
Multiple View Geometry in
Computer Vision by Hartley & Zisserman
The Geometry of
Multiple Images by Faugeras, Luong,
and Papadopoulo
Neural
Networks for Pattern Recognition, Bishop.
Most recently updated on January. 27, 2006 by David Bradley
Site design courtesy of Serge Belongie.