University of Washington Computer Science & Engineering
 Grouper

Project faculty
 Oren Etzioni
 Dan Weld
Project students
 Cody Kwok
 Erik Selberg
 Oren Zamir
   

Grouper home site

Overview

Clustering is an effective way of organizing documents into collections for ease of browsing. With the recent growth of the WWW, clustering has become a paradigm for organizing search results. Online systems face many new challenges, including the need for fast response times, the need to generate high-quality clusters with simple descriptions for novice users, and the need to work with document distributions that violate many traditional assumptions. How do different clustering algorithms trade off cluster quality against speed? What modifications are necessary to adapt traditional clustering algorithms to the WWW? How do these systems scale to larger document collections? How do they evaluate the quality of the clusters they generate? How are the clusters generated in each case, and is there any processing after cluster generation to improve cluster quality?

Publications


Computer Science & Engineering
University of Washington
Box 352350
Seattle, WA  98195-2350
(206) 543-1695 voice, (206) 543-2969 FAX
[comments to Oren Etzioni]