                                The Art
                              of Lossless
                           Data Compression
                                vol. 24t

Here are the results of tests performed in August 2002 to compare
lossless compression of "plain" texts by all known good enough programs
developed for such purpose, including RK, DC, YBS, Bzip2, RAR, ACE and 7-zip.

See Archive Comparison Test by J.Gilchrist for more details:
http://compression.ca

If anybody wants to start or continue such tests,
or can suggest some other sets of texts, or other compression programs,
 (not sources or algorithm descriptions, executable programs only)
or knows we have missed something important,
 (some new fantastic technology, an algorithm or even a program capable
 of lossless compression of up to 1000:1 etc.)
please let us know immediately: artest@inbox.ru   Thank you!


[[1]] COMPRESSION QUALITY
=========================
             (see also
             [[2]] Speed
             [[3]] Details
             [[4]] Comments)

Last seventh line shows results for the sum of all 1231 texts in six sets.

Original Entropy PPMonstr PPMN   Slim Compressia PPMd     RK     PPMy     BEE
553.31%  100%   101.76  105.51  104.60  105.43  107.64  105.64  108.51  110.23
543.05%  100%   108.50  110.03  111.28  110.13  113.63  112.26  115.04  113.97
435.56%  100%   107.96  107.05  109.84  108.56  111.94  110.78  110.84  112.84
492.76%  100%   110.61  110.13  113.47  112.24  115.48  114.58  115.34  116.03
799.24% 101.80   100%   115.93  115.32  117.76  109.06  110.51  126.27  114.11
353.09%  100%   102.50  101.06  104.12  108.60  105.61  109.98  105.80  106.40
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
468.44%  100%   107.25  107.48  110.27  110.72  111.76  112.23  112.56  112.71

 RAR     DC      SBC     BOA     YBS     UHARC   ERI    7-zip   BZip2   Pkzip
109.41  108.66  108.98  110.03  108.95  107.33  109.53  116.80  124.06  159.23
114.56  114.28  115.55  117.47  117.38  115.17  119.44  120.55  130.92  178.03
112.57  111.19  112.57  113.19  113.95  112.43  115.21  115.85  124.48  163.20
116.21  115.77  116.82  117.62  118.62  117.78  120.29  121.73  131.87  174.67
121.49  121.44  118.06  122.31  120.56  122.76  125.99  138.39  149.67  205.34
106.15  111.66  110.30  107.42  109.44  113.54  111.44  106.72  116.85  148.43
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
113.08  114.02  114.27  114.34  115.32  115.72  117.20  117.69  127.59  167.57



[[2]] Speed
===========
Canterbury Corpus Large Set http://corpus.canterbury.ac.nz/resources/large.zip
was used for this test, and a 970MHz PC with 256Mb RAM and Windows98.

Programs,             Compression/    Overall    Average Users'   Compressed
options               Extraction,      Score         Score           Size
                        seconds     seconds, %    seconds, %       bytes ,  %
no compression            0     0     4446  559     4446  577    16005619  600
777 a -mg               140    84     1209  152     1070  139     3544038  133
777 a -mg -mu32         157   102     1234  155     1079  140     3513162  132
7za a -t7z               82     2     1110  140     1028  134     3692764  138
7za a -t7z -mx          127     2     1089  137      963  125     3454527  129
7za a -tzip              24     1     1245  157     1222  159     4393637  165
7za a -tzip -mx          45     1     1269  160     1224  159     4401174  165
acb B                   395   396     1722  217     1331  173     3352388  126
acb b                   514   516     1939  244     1430  186     3272388  123
acb u                   710   711     2317  291     1614  210     3225662  121
ace32 a -d4096           66     2     1124  141     1058  137     3801917  142
ace32 a -d4096 -m1       31     2     1134  143     1104  143     3965841  149
ace32 a -d4096 -m5      206     2     1249  157     1045  136     3746553  140
arh a                    38    40     1091  137     1053  137     3647067  137
arh a -2 -1              68    40     1121  141     1054  137     3647067  137
ba -k -50                35    12      964  121      929  121     3298943  124
bee a -m2               123   125     1134  143     1012  131     3188565  120
bee a -m3               162   164     1203  151     1043  135     3156801  118
bee a -m3 -s            204   206     1294  163     1092  142     3185012  119
bix a -mdg -s            92     1     1069  134      978  127     3514944  132
boa -m1                  86    88     1253  158     1168  152     3886863  146
boa -m15                139   141     1165  146     1027  133     3182739  119
boa -m15 -s             138   140     1148  144     1011  131     3132810  117
bzip2 -k                 21     6     1032  130     1011  131     3616113  136
bzip2 -k -9              20     6     1031  130     1011  131     3616113  136
ery-e.bat t o12          94    95     1003  126      910  118     2932445  110
ery-e.bat t o16          98    99     1001  126      904  117     2892711  108
ery-e.bat t o32         105   106     1009  127      905  118     2873677  108
ery-e.bat t o64         112   111     1022  128      911  118     2873318  108
compcl c -b15            37    20      904  114      868  113     3049569  114
compcl c -b15 -s         38    29      808  102      770  100     2668128  100
dc e                     13     7      903  114      890  116     3179173  119
dc e -b16300 -mt5        17     7      795  100      778  101     2773427  104
eri a                    39    17      936  118      897  116     3168414  119
eri a -m3                59    21      996  125      937  122     3295385  124
eri a -m6                59    21      989  124      931  121     3272926  123
gcac a                   26    12      980  123      954  124     3390603  127
gcac s                   26    12      981  123      955  124     3395064  127
imp98 a -mm              31     1     1175  148     1143  148     4112387  154
imp98 a -mm -2           13     5      999  126      986  128     3533761  132
imp98 a -2 -s4           13     5      999  126      986  128     3533693  132
pkzip -es                 1     1     1654  208     1652  215     5945622  223
pkzip -a                  4     1     1308  164     1304  169     4691491  176
pkzip -exx               16     1     1296  163     1280  166     4605942  173
ppmdi e -o7 -m232        11    12      904  114      893  116     3169000  119
ppmdi e -o12 -m232       25    26      915  115      891  116     3113630  117
ppmdi e -o16 -m232       27    28      916  115      890  116     3100943  116
ppmn_km e -o6 -MT1       30    30      931  117      901  117     3132278  117
ppmn_km e -o8 -MT1       64    65      993  125      929  121     3107654  116
ppmn_km e -o9            62    63      990  125      929  121     3115560  117
ppmn_km e -o9 -M:50      49    50      949  119      900  117     3058436  115
ppmonstr e -o7 -m232     64    67      974  123      911  118     3035498  114
ppmonstr e -o8 -m232     71    74      980  123      910  118     3007964  113
ppmonstr e -o64 -m232   101   103     1020  128      920  119     2937387  110
ppmy /o6 /m220          240   248     1430  180     1193  155     3391576  127
ppmy /o7 /m220          273   281     1456  183     1186  154     3251362  122
ppmy /o9 /m220          323   333     1514  190     1194  155     3088212  116
qlfc a                   22    11      973  122      952  124     3385084  127
rar a -m4                20    11      914  115      895  116     3178768  119
rar a -m5                25    16      920  116      896  116     3164821  119
rar a -m5 -s             25    17      927  117      902  117     3185542  119
rar a -mc16:128t -s      41    31      927  117      886  115     3077033  115
rk -mf2                  50    20     1108  139     1058  137     3735704  140
rk -mx1                 144   143     1147  144     1004  130     3093640  116
rk -mx2                 173   173     1203  151     1032  134     3086312  116
sbc c -b63               29     9      914  115      885  115     3151930  118
sbc c -os -b63           29     9      810  102      782  101     2779632  104
slim a -d32 -w22        524   522     1862  234     1343  174     2935640  110
slim a -d16 -w22        507   512     1835  231     1333  173     2935647  110
slim a -d8 -w22         477   485     1778  224     1306  170     2935701  110
slim a -d4 -w22         483   414     1713  215     1235  160     2936380  110
szip -o4                  4    10     1027  129     1023  133     3647445  137
szip -o6                 17    14      996  125      979  127     3475264  130
szip -o8 -b41            27    17      973  122      947  123     3348344  125
ufa a -mg                93   104     1138  143     1046  136     3387626  127
ufa a -mg -mu32         118   129     1176  148     1059  137     3344003  125
ufa a -mg -s            247    11     1339  168     1094  142     3890454  146
uharc a -m3 -md32768    112     5      983  124      872  113     3115786  117
uharc a -mz -md32768      8    10     1093  137     1084  141     3868781  145
uharc a -mx -md32768     51    47      934  117      883  115     3009015  113
ybs -m16mu               34     9      837  105      803  104     2857446  107
ybs -m16mu -r            34     9      843  106      809  105     2878433  108
zzip a                   21    11      977  123      956  124     3400243  127
zzip a -mx               22    12      973  122      952  124     3383060  127
zzip a -mx -30m          30    12      940  118      910  118     3233147  121

Overall score is calculated by adding compression time, extraction time, and
time it would take to transfer the compressed file over a 28,800bps network:
(compressed_size)/3600

Average Users' score is calculated by adding (compress_time/10)+ extract_time +
time it would take to transfer the compressed file over a 28,800bps network.
Compression time is divided by 10 here, because more than 90% of people would
never compress anything during their life (with compression programs), but they
use compressed data almost _every_ time they use computers and/or Internet.
That's why compression time is not so actual for them.


[[3]] Details
=============
are no longer put to this main text
(1513 lines reporting 73042 results on 1231 files in 6 sets),
but can be found in FULL version with TEXTS.DAT and *.BAT
at http://geocities.com/SiliconValley/Bay/1995/artest24.zip
or http://artest1.tripod.com/artest24.zip


[[4]] Comments
==============
Links to download programs:
~~~~~~~~~~~~~~~~~~~~~~~~~~~
ACE 2.04      :W http://winace.host.sk/ace204.exe                                 593K
ERI 5.1fre    :e http://geocities.com/eri32/eri51fre.zip                           89K
GCac 0.9k     :W http://www.emit.jp/gca/gca_v09k.exe                              295K
UHArc 0.4b    :eW ftp://ftp.elf.stuba.sk/pub/pc/pack/uharc04.zip                  220K
QLFC 6.6W     :W http://ghido.shelter.ro/DownloadQLFC.php                          49K
ZZip 0.36c    :W http://www.via.ecp.fr/~damien/downloads/zzip-win32.zip            35K
PPMD var.I,
PPmonstr v.I  :W ftp://ftp.elf.stuba.sk/pub/pc/pack/ppmdi1.rar                     73K
PkzipC  4.00  :W ftp://ftp.pkware.com/pkzc400s.exe                               3470K
PkZip   2.50  :a ftp://ftp.simtel.net/pub/simtelnet/msdos/arcers/pk250dos.exe     202K
RK-dos 1.04.1 :e http://rksoft.virtualave.net/downloads/rk104a1d.exe              461K
RK     1.04.1 :W http://rksoft.virtualave.net/downloads/rk104a1w.exe              380K
BA 1.01b5     :e http://hem.spray.se/mikael.lundqvist/ba101br5.zip                 61K
BIX 1.00b7    :W http://www.7-zip.org/dl/ufa/bix100b7.zip                          89K
777 0.04b1    :W http://www.7-zip.org/dl/ufa/777004b1.zip                          72K
UFA 0.04b1    :W http://www.7-zip.org/dl/ufa/ufa004b1.zip                          64K
ArHanGeL 1.40 :a http://geocities.com/SiliconValley/Lab/6606/arh140.zip            50K
Imp     1.1   :e http://www.technelysium.com.au/imp110d.zip                       266K
Imp-win 1.12  :W http://www.technelysium.com.au/imp112.exe                        122K
ACB 2.00c     :e ftp://ftp.simtel.net/pub/simtelnet/msdos/compress/acb_200c.zip    42K
BOA 0.58b     :e ftp://ftp.elf.stuba.sk/pub/pc/pack/boa058.zip                     74K
DC 0.98b      :W ftp://ftp.elf.stuba.sk/pub/pc/pack/dc124.zip                      55K
SZip 1.12a    :W http://www.compressconsult.com/szip/szip_112a_win32.zip           71K
YBS 0.03e     :e http://compression.graphicon.ru/ybs/ybs003ed.zip                  56K
YBS 0.03e     :W http://compression.graphicon.ru/ybs/ybs003ew.zip                  44K
Bzip2 1.0.2   :W ftp://sources.redhat.com/pub/bzip2/v102/bzip2-102-x86-win32.exe
Compressia 0.98b :W http://www.compressia.com/compcl.exe
SBC_d 0.969br1 :e http://personal.inet.fi/musiikki/sjm/sbc0969b_dos.zip
SBC   0.969br1 :W http://personal.inet.fi/musiikki/sjm/sbc0969b_win32.zip
BEE 0.6.3     :W http://compression.graphicon.ru/fa/files2/bee063.rar
PPMN_km b4    :W http://compression.graphicon.ru/ms/ppmn_km.rar
PPMY 0.3c     :W http://compression.graphicon.ru/sh/ppmy_3c.rar
7-Zip 2.30b23 :W http://belnet.dl.sourceforge.net/sevenzip/7z230b23.exe
WinRAR 3.00   :W http://www.rarlab.com/rar/wrar300.exe
RAR32  3.00   :e http://www.rarlab.com/rar/rarx300.exe

:a - any DOS  - DOS programs, will run under pure DOS or in a DOS box
:e - extender - DOS programs using DOS extenders like DOS/4GW or CWSDPMI
:W - windows  - Windows95/98/NT/etc programs

If direct link doesn't work-most probably newer version of the program appeared
at the same site: visit web page, or read the whole directory from ftp server
(i.e. try the same URL, but without filename).


Homepages:
~~~~~~~~~~
Arhangel   : http://geocities.com/SiliconValley/Lab/6606
BA         : http://hem.spray.se/mikael.lundqvist
Eri32      : http://geocities.com/eri32
  mirror   : http://artest1.tripod.com
RK         : http://rksoft.virtualave.net
Imp,WinImp : http://www.technelysium.com.au/winimp.html
ACE,WinACE : http://www.winace.com
PkZip      : http://www.pkware.com
RAR,WinRAR : http://www.rarlab.com
BZip2      : http://sources.redhat.com/bzip2
SZip       : http://www.compressconsult.com/szip
PPMN       : http://compression.graphicon.ru/ms
YBS        : http://compression.graphicon.ru/ybs
SBC        : http://sbcarchiver.netfirms.com
QLFC       : http://ghido.shelter.ro
GCAC       : http://emit.jp/gca/gca.html
PPMy       : http://www.pilabs.org.ua/sh
ZZip       : ? http://www.zzip.f2s.com
Compressia : http://www.compressia.com/
Ufa,777,
 BIX,7-Zip : http://www.7-zip.org


What's new:
~~~~~~~~~~~
12 new programs were tested:

PPMd v.I
PPMonstr v.I
7-zip 2.30b22
RAR 3.00
BZip2 1.0.2
PPMN_km b4
SBC 0.969b r1
Entropy 0.7
Compressia 0.98b
Bee 0.6.3 u1
PPMy 0.3c
Slim aug17

7-zip, RAR and BZip2 perform exactly as their previous versions
tested in March or earlier.

Latest beta versions of Entropy, Slim, DC were available
from authors by e-mail request:
Entropy: artest@inbox.ru
Slim: grayw@secom.lg.ua
DC: EdgarBinder@t-online.de

 Results of BA, ZZip, ACB, 777, GCAC, SZip, QLFC, ArHanGeL, IMP, BICOM,
 BIX, ACE are in full version only, TEXTS.DAT file.

The set of Russian texts is at http://arte.nm.ru/m120

Some old or less efficient programs will not be included next time:
777, UFA, BIX, ACB, BA, PPMy.


WARNINGS:
~~~~~~~~~
BA 1.00beta5 can't correctly decompress shaks12.txt and set used for speed
measurements.

DC 0.99.158b failed to decompress 1DFRE10.dc , ANDES10.dc , and BTI0110.dc,
saying "Corrupted block" (while t(est) command writes "Test successful").

ERI 5.1fre can't compress files larger than (free DPMI memory)/6, i.e.
about 10Mb on a PC with 64Mb RAM. The largest 44Mb file was split to 5 chunks
9000000 bytes long (last chunk was 8894190 bytes).

Problems in all other compressors were not found.


The LATEST RELEASE, and all previous versions of these tests can be found
at http://geocities.com/SiliconValley/Bay/1995/ and http://artest1.tripod.com/


Send your suggestions, comments to artest@inbox.ru
With best kind regards,
A.Ratushnyak
