Sciweavers

11 search results - page 1 / 3
» Compressing Trigram Language Models With Golomb Coding
Sort
View
EMNLP
2007
13 years 6 months ago
Compressing Trigram Language Models With Golomb Coding
Trigram language models are compressed using a Golomb coding method inspired by the original Unix spell program. Compression methods trade off space, time and accuracy (loss). The...
Kenneth Church, Ted Hart, Jianfeng Gao
DCC
2006
IEEE
14 years 4 months ago
Adaptive Run-Length / Golomb-Rice Encoding of Quantized Generalized Gaussian Sources with Unknown Statistics
We present a simple and efficient entropy coder that combines run-length and Golomb-Rice encoders. The encoder automatically switches between the two modes according to simple rul...
Henrique S. Malvar
TALIP
2002
108views more  TALIP 2002»
13 years 4 months ago
Toward a unified approach to statistical language modeling for Chinese
This paper presents a unified approach to Chinese statistical language modeling (SLM). Applying SLM techniques like trigram language models to Chinese is challenging because (1) t...
Jianfeng Gao, Joshua Goodman, Mingjing Li, Kai-Fu ...
ICIP
2005
IEEE
14 years 6 months ago
A BPGC-based scalable image entropy coder resilient to errors
In this paper, we present a new entropy coder, Contextbased Bit Plane Golomb Coder (CB-BPGC) for scalable image coding, which achieves better coding performance with lower complex...
Rong Zhang, Qibin Sun, Wai-Choong Wong
ACL
2009
13 years 2 months ago
A Succinct N-gram Language Model
Efficient processing of tera-scale text data is an important research topic. This paper proposes lossless compression of Ngram language models based on LOUDS, a succinct data stru...
Taro Watanabe, Hajime Tsukada, Hideki Isozaki