کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
9657929 690117 2005 19 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A lower bound on compression of unknown alphabets
کلمات کلیدی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
پیش نمایش صفحه اول مقاله
A lower bound on compression of unknown alphabets
چکیده انگلیسی
Many applications call for universal compression of strings over large, possibly infinite, alphabets. However, it has long been known that the resulting redundancy is infinite even for i.i.d. distributions. It was recently shown that the redundancy of the strings' patterns, which abstract the values of the symbols, retaining only their relative precedence, is sublinear in the blocklength n, hence the per-symbol redundancy diminishes to zero. In this paper we show that pattern redundancy is at least (1.5log2e)n1/3 bits. To do so, we construct a generating function whose coefficients lower bound the redundancy, and use Hayman's saddle-point approximation technique to determine the coefficients' asymptotic behavior.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Theoretical Computer Science - Volume 332, Issues 1–3, 28 February 2005, Pages 293-311
نویسندگان
, , ,