HCC lecture 14

Clustering and Summarization:

Readings:

"Document clustering based on non-negative matrix factorization," Wei Xu, Xin Liu, Yihong Gong, in ACM Conference on Progress in Information Retrieval (SIGIR) 2003. (You need to access this from campus, our through the library proxy: http://proxy.lib.berkeley.edu )

"Experiments in MultiDocument Summarization," Barry Schiffman, Ani Nenkova, Kathleen McKeown in Human Language Technology Conference (HLT), 2002.

Recommended:
TextRank: Bringing Order into Texts
Rada Mihalcea and Paul Tarau, Conference on Empirical Methods in Natural Language Processing.

JFC's notes for lecture 14