Friday, April 22, 2011

Muddiest Point: 4/18/11

How have classification models used in libraries influenced text classification on the web?

Thursday, April 14, 2011

Unit 13: Text Classification and Clustering

The goal of clustering algorithms is to cluster documents into clusters that are internally coherent. Documents in a cluster should be as similar as possible to each other and as dissimilar as possible from documents in other clusters. The distribution and makeup of the data will determine what documents belong to their appropriate clusters.

Muddiest Point: 4/11/11

What are the storage and performance issues of modeling users for the purpose of personalized search?

Thursday, April 7, 2011

Unit 12: Intelligent Information Retrieval

Cannot access articles for unit.

Muddiest Point: 4/4/11

Since the articles are not exactly the same when using comparable corpora is it likely that words could get translated into incorrect words?

Friday, April 1, 2011

Muddiest Point: 3/28/11

Once PageRank scores for a collection are calculated how are they used to influence the ranklist for a specific query?