Friday, January 28, 2011

Unit 3: Index Construction and Compression

Index Compression is crucial to IR systems. Even a modestly sized collection of documents can take up a lot of memory.

Various compression techniques offer different benefits and disadvantages. One of the biggest factors is whether the compression techniques is lossy or not. Non-lossy compression preserves every term in a document. Lossy compression methods permanently remove terms that may or may not be crucial to the search process. Lossy methods will use less memory than non-lossy.

No comments:

Post a Comment