id author title date pages extension mime words sentence flesch summary cache txt ital-3222 Skibiński, Przemyslaw; Swacha, Jakub The Efficient Storage of Text Documents in Digital Libraries 2009-09-01 11 .pdf application/pdf 6968 256 54 Although text documents are often compressed with general-purpose methods such as Deflate, much better compression can be obtained with a scheme specialized for text, and even better if the scheme is additionally specialized for individual document formats. Table 1 shows the assignment of the mentioned sub- schemes to document formats, with “+” denoting that a given subscheme should be applied when processing a given document format. cache/ital-3222.pdf txt/ital-3222.txt