CS 288: Statistical Natural Language Processing, Fall 2014 |
|
Instructor:
Dan Klein Lecture: Tuesday and Thursday 11:00am-12:30pm, 320 Soda Hall Office Hours: Tuesday 12:30pm-2:00pm 730 SDH |
|
GSI:
Greg Durrett Office Hours: Thursday 3:00pm-5:00pm 751 Soda (alcove) |
|
Forum: Piazza |
Announcements
11/6/14: Project 5 has been released. It is due November 26 at 5pm.
10/18/14: Project 4 has been released. It is due November 7 at 5pm.
9/30/14: Project 3 has been released. It is due October 17 at 5pm.
9/16/14: Project 2 is now due September 29 at 5pm.
9/15/14: Project 2 has been released. It is due September 26 at 5pm.
8/30/14: Project 1 has been released. It is due September 12 at 5pm.
8/28/14: The previous
website has been archived.
Description
This course will explore current statistical techniques for the automatic
analysis of natural (human) language data. The dominant modeling paradigm is
corpus-driven statistical learning, with a split focus between supervised and
unsupervised methods. This term, we are introducing a few new projects
to give increased hands-on experience with a greater variety of NLP tasks
and commonly used techniques.
This course assumes a good background in basic probability and a strong ability
to program in Java. Prior experience with linguistics or natural languages is
helpful, but not required. There will be a lot of statistics, algorithms,
and coding in this class. The recommended background is CS 188 (or CS 281A)
and CS 170 (or CS 270). An A in CS 188 (or CS 281A) is required. This
course will be more work-intensive than most graduate or undergraduate courses.
Readings
The primary recommended texts for this course are:
Note that M&S is free online. Also, make sure you get the purple 2nd edition of J+M, not the white 1st edition.
Syllabus [subject to substantial change!]