IR-chapter5: Index compression

2017-04-24  woodsouthmmm

statistical properties of terms in information retrieval

The effects of preprocessing for Reuters-RCV1

estimating the number of terms: Heaps' law
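
Heaps' law estimates the vocabulary size M from the number of tokens T as M = k * T^b. A minimal sketch follows; the defaults k = 44 and b = 0.49 are assumed here to be the values the textbook fits to Reuters-RCV1, and the function name is made up for illustration.

```python
def heaps_vocabulary_size(num_tokens: int, k: float = 44.0, b: float = 0.49) -> float:
    """Estimate the number of distinct terms M = k * T**b (Heaps' law)."""
    # k and b are collection-dependent; the defaults are assumed RCV1 values.
    return k * num_tokens ** b


if __name__ == "__main__":
    # Estimated vocabulary after roughly the first million tokens.
    print(round(heaps_vocabulary_size(1_000_020)))
```

Because b is below 1, the vocabulary keeps growing as the collection grows, but more and more slowly.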

modeling the distribution of terms: Zipf's law
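
Zipf's law says the collection frequency of the i-th most common term is proportional to 1/i. The sketch below spreads a token count over ranks under that assumption; the function name and the toy numbers are illustrative only.

```python
def zipf_frequencies(num_terms: int, total_tokens: int) -> list[float]:
    """Expected collection frequency per rank if cf_i is proportional to 1/i."""
    # Normalise so that the frequencies over all ranks sum to total_tokens.
    harmonic = sum(1.0 / rank for rank in range(1, num_terms + 1))
    c = total_tokens / harmonic
    return [c / rank for rank in range(1, num_terms + 1)]


if __name__ == "__main__":
    for rank, cf in enumerate(zipf_frequencies(num_terms=5, total_tokens=1000), start=1):
        print(rank, round(cf, 1))
```

Under this model the most frequent term occurs twice as often as the second, three times as often as the third, and so on.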

dictionary compression

dictionary as a string
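
The idea of storing the dictionary as one long string can be sketched as follows: the sorted terms are concatenated, and each dictionary entry keeps only an offset into that string. The next entry's offset marks where a term ends. All names here are hypothetical, and a Python str stands in for the raw byte array.

```python
def build_string_dictionary(terms: list[str]) -> tuple[str, list[int]]:
    """Concatenate the sorted terms and record the start offset of each one."""
    offsets, position = [], 0
    sorted_terms = sorted(terms)
    for term in sorted_terms:
        offsets.append(position)
        position += len(term)
    return "".join(sorted_terms), offsets


def term_at(big_string: str, offsets: list[int], index: int) -> str:
    """Recover the index-th term; the next offset marks where it ends."""
    start = offsets[index]
    end = offsets[index + 1] if index + 1 < len(offsets) else len(big_string)
    return big_string[start:end]


def lookup(big_string: str, offsets: list[int], term: str) -> int:
    """Binary search over the offsets; returns the term's index or -1."""
    lo, hi = 0, len(offsets) - 1
    while lo <= hi:
        mid = (lo + hi) // 2
        candidate = term_at(big_string, offsets, mid)
        if candidate == term:
            return mid
        if candidate < term:
            lo = mid + 1
        else:
            hi = mid - 1
    return -1
```

Compared with fixed-width term slots, no space is wasted on padding short terms, and long terms are not truncated.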

blocked storage
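
Blocked storage keeps one term pointer per block of k terms and stores each term's length in front of it. A rough sketch under simplifying assumptions: `chr(len(term))` stands in for the one-byte length field, and the helper names are invented for this example.

```python
def build_blocked_dictionary(terms: list[str], block_size: int = 4) -> tuple[str, list[int]]:
    """Store length-prefixed terms; keep one offset per block of block_size terms."""
    chunks, block_offsets, position = [], [], 0
    for i, term in enumerate(sorted(terms)):
        if i % block_size == 0:
            block_offsets.append(position)
        chunks.append(chr(len(term)) + term)  # one length character, then the term
        position += 1 + len(term)
    return "".join(chunks), block_offsets


def read_block(data: str, block_offsets: list[int], block_index: int) -> list[str]:
    """Decode every term in one block by walking the length prefixes."""
    position = block_offsets[block_index]
    end = block_offsets[block_index + 1] if block_index + 1 < len(block_offsets) else len(data)
    terms = []
    while position < end:
        length = ord(data[position])
        terms.append(data[position + 1:position + 1 + length])
        position += 1 + length
    return terms


def contains(data: str, block_offsets: list[int], term: str) -> bool:
    """Binary search for the right block, then scan linearly inside it."""
    if not block_offsets:
        return False
    lo, hi = 0, len(block_offsets) - 1
    while lo < hi:
        mid = (lo + hi + 1) // 2
        start = block_offsets[mid]
        first_term = data[start + 1:start + 1 + ord(data[start])]
        if first_term <= term:
            lo = mid
        else:
            hi = mid - 1
    return term in read_block(data, block_offsets, lo)
```

Larger blocks save more pointer space but make the linear scan inside a block longer, which is the lookup-time trade-off.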

blocked storage: term lookup time

front coding

dictionary compression with different data structures
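
Front coding exploits the fact that consecutive sorted terms often share a prefix. The sketch below is a simplified variant that stores one shared prefix per block plus the remaining suffixes; the function names are hypothetical, and the on-disk layout with length fields and marker characters is left out.

```python
import os


def front_encode(block: list[str]) -> tuple[str, list[str]]:
    """Split a block of sorted terms into a shared prefix and per-term suffixes."""
    # os.path.commonprefix compares character by character, so it works on any strings.
    prefix = os.path.commonprefix(block)
    return prefix, [term[len(prefix):] for term in block]


def front_decode(prefix: str, suffixes: list[str]) -> list[str]:
    """Rebuild the original terms from the prefix and the suffixes."""
    return [prefix + suffix for suffix in suffixes]


if __name__ == "__main__":
    print(front_encode(["automata", "automate", "automatic", "automation"]))
```

The shared prefix is stored once per block instead of once per term, which is where the saving comes from.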

posting file compression

Variable byte encoding

VB encoding pseudocode
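
A Python sketch of variable byte encoding, assuming the usual scheme in which each byte carries 7 payload bits and the high bit is set only on the last byte of a number. The function names are chosen for this example.

```python
def vb_encode_number(n: int) -> bytes:
    """Encode one non-negative integer as a variable byte sequence."""
    chunks = []
    while True:
        chunks.insert(0, n % 128)  # prepend the low 7 bits
        if n < 128:
            break
        n //= 128
    chunks[-1] += 128  # continuation bit marks the final byte
    return bytes(chunks)


def vb_encode(numbers: list[int]) -> bytes:
    """Encode a list of gaps by concatenating their VB codes."""
    return b"".join(vb_encode_number(n) for n in numbers)


def vb_decode(stream: bytes) -> list[int]:
    """Decode a concatenation of VB codes back into integers."""
    numbers, n = [], 0
    for byte in stream:
        if byte < 128:
            n = 128 * n + byte
        else:
            numbers.append(128 * n + (byte - 128))
            n = 0
    return numbers
```

For example, `vb_decode(vb_encode([824, 5, 214577]))` gives back the original gap list, and small gaps such as 5 take a single byte.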

γ Codes

γ codes: E(L) is the expected length of a code L, and H(P) is the entropy of the distribution P
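
A γ code represents a gap G >= 1 as the length of its offset in unary, followed by the offset itself, where the offset is G in binary with the leading 1 removed. A small sketch that uses strings of "0" and "1" in place of real bits; the function names are made up for illustration.

```python
def gamma_encode(gap: int) -> str:
    """Return the gamma code of a positive integer as a bit string."""
    if gap < 1:
        raise ValueError("gamma codes are defined for integers >= 1")
    offset = bin(gap)[3:]             # binary form without '0b' and the leading 1
    length = "1" * len(offset) + "0"  # unary code for the offset length
    return length + offset


def gamma_decode(bits: str) -> list[int]:
    """Decode a concatenation of gamma codes back into the gap sequence."""
    gaps, position = [], 0
    while position < len(bits):
        offset_len = 0
        while bits[position] == "1":  # read the unary length
            offset_len += 1
            position += 1
        position += 1                 # skip the terminating 0
        offset = bits[position:position + offset_len]
        position += offset_len
        gaps.append(int("1" + offset, 2))  # put the leading 1 back
    return gaps
```

A γ code for G takes 2 * floor(log2 G) + 1 bits, so small gaps are cheap and no byte alignment is wasted.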