                                The Art
                              of Lossless
                           Data Compression
                                vol. 23t

Here are the results of tests performed in March 2002 to compare
lossless compression of "plain" texts by all known good enough programs
developed for such purpose, including RK, DC, YBS, Bzip2, RAR, ACE and 7-zip.

See Archive Comparison Test by J.Gilchrist for more details:
http://compression.ca

If anybody wants to start or continue such tests,
or can suggest some other sets of texts, or other compression programs,
 (not sources or algorithm descriptions, executable programs only)
or knows we have missed something important,
 (some new fantastic technology, an algorithm or even a program capable
 of lossless compression of up to 1000:1 etc.)
please let us know immediately: artest@inbox.ru   Thank you!


[[1]] COMPRESSION QUALITY
=========================
             (see also
             [[2]] Speed
             [[3]] Details
             [[4]] Comments)

Last seventh line shows results for the sum of all 1231 texts in six sets.

Original Entropy PPMonstr PPMN   PPMd     RK     PPMY     RAR     DC      BOA
552.36%  100%   102.24  105.09  109.82  105.46  108.37  109.22  108.47  109.83
529.19%  100%   106.13  107.42  111.43  109.39  112.25  111.63  111.36  114.46
427.31%  100%   105.95  105.08  109.88  108.68  108.87  110.43  109.08  111.04
472.03%  100%   106.02  105.55  110.70  109.76  110.60  111.31  110.89  112.67
790.80% 103.71   100%   114.70  108.72  109.33  125.47  120.20  120.16  121.01
344.47% 100.01   100%   100.24  103.16  107.29  103.27  103.56  108.93  104.79
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
453.41%  100%   103.97  104.47  108.38  108.63  109.07  109.45  110.36  110.67

  SBC     BEE     YBS    UHARC    ERI    7-zip    BA     ZZip    BZip2   pkzip
109.26  112.09  108.76  107.15  109.34  116.60  111.75  111.57  123.85  158.95
113.27  115.15  114.38  112.22  116.39  117.47  116.50  117.28  127.57  173.48
110.95  112.84  111.79  110.30  113.02  113.65  113.98  114.71  122.12  160.11
112.41  113.44  113.63  112.82  115.22  116.60  115.96  116.78  126.31  167.32
117.55  122.56  119.29  121.46  124.65  136.93  129.23  123.54  148.08  203.17
107.55  105.28  106.76  110.77  108.72  104.11  109.78  110.87  113.99  144.81
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
110.98  111.59  111.62  112.00  113.43  113.91  114.50  115.01  123.49  162.19



[[2]] Speed
===========
Canterbury Corpus Large Set http://corpus.canterbury.ac.nz/resources/large.zip
was used for this test, and a 400MHz PC with 256Mb RAM and Windows98.

Programs,             Compression/   Overall   Average Users'   Compressed
options               Extraction,     Score        Score           Size
                        seconds    seconds, %   seconds, %       bytes ,  %
no compression            0     0    4446  537    4446  562    16005619  628
777 a -mg               251   162    1397  169    1171  148     3544031  139
7za a -t7z              178     6    1213  146    1053  133     3707526  145
7za a -t7z -mx          269     6    1289  156    1046  132     3649093  143
7za a -tzip              62     6    1287  155    1232  156     4393623  172
7za a -tzip -mx         102     5    1329  161    1238  157     4401160  173
acb B                   718   672    2319  280    1673  212     3346915  131
acb u                  1277  1276    3448  416    2298  291     3221349  126
ace32 a                 100     6    1216  147    1126  142     3998222  157
ace32 a -d4096          106     6    1212  146    1117  141     3962314  155
ace32 a -d4096 -m1       57     6    1197  145    1146  145     4086782  160
ace32 a -d4096 -m5      328     6    1423  172    1128  143     3923686  154
ace a -d4096 -m5        311     4    1405  170    1125  142     3923686  154
arh a -2 -mm             68    68    1148  139    1087  137     3647060  143
arh a -1 -mm            212     8    1499  181    1308  165     4605600  181
arh a -2 -1             280    68    1360  164    1108  140     3647060  143
ba -k -1                 44    23    1154  139    1114  141     3915288  154
ba -k -10                62    23    1031  125     976  123     3409364  134
ba -k -50                82    26    1021  123     947  120     3288361  129
bee a -m1 -d3           124   152    1207  146    1096  139     3353767  132
bee a -m2 -d3           161   186    1260  152    1115  141     3289365  129
bee a -m3 -d3           255   297    1454  176    1225  155     3248025  127
bix a -mdg              156     5    1185  143    1045  132     3690815  145
bix a -mdg -s           162     4    1191  144    1045  132     3690984  145
boa -m15                254   274    1411  170    1183  150     3182732  125
boa -m15 -s             243   262    1389  168    1170  148     3182732  125
boa -m7                 231   240    1364  165    1156  146     3217347  126
bzip2 -k -1              39    13    1192  144    1157  146     4107255  161
bzip2 -k -9              44    14    1061  128    1021  129     3613587  142
dc e                     29    17     929  112     903  114     3179166  125
dc e -b16300 -mt5        41    17     828  100     791  100     2773420  109
Entropy o12             163   165    1065  129     918  116     2652071  104
Entropy o16             219   208    1135  137     938  119     2550196  100
Entropy o32             399   355    1541  186    1182  149     2837256  111
eri a                    67    32     979  118     919  116     3168414  124
eri a -m6                81    37    1027  124     954  121     3272926  128
gcac a                   62    27    1031  125     975  123     3390596  133
gcac s                   58    26    1031  125     978  124     3408061  134
imp98 a -mm -2           37    10    1029  124     996  126     3535344  139
imp98 a -mm -s4          57     4    1202  145    1151  146     4112870  161
imp98 a -2 -s4           34    10    1032  125    1001  127     3557637  140
pkzip -es                 4     3    1658  200    1655  209     5945608  233
pkzip -a                 14     1    1318  159    1305  165     4691477  184
pkzip -exx              128     1    1408  170    1293  163     4605928  181
ppmd e -o3 -m250         12    14    1096  132    1085  137     3852169  151
ppmd e -o11 -m250        43    45     883  107     844  107     2863807  112
ppmd e -o12 -m250        53    56     890  107     843  107     2813928  110
ppmd e -o16 -m250        90    72     918  111     836  106     2719635  107
ppmn e -o6 -MT1          73    76     994  120     928  117     3044236  119
ppmn e -o8 -MT1         120   122    1084  131     976  123     3035542  119
ppmn e -o9 -MT1         128   135    1107  134     991  125     3040103  119
ppmonstr e -o3 -m250     97   103    1234  149    1147  145     3723983  146
ppmonstr e -o7 -m250    138   139    1097  132     973  123     2951420  116
ppmonstr e -o12 -m250   176   185    1104  133     946  120     2675291  105
ppmonstr e -o16 -m250   239   219    1173  142     957  121     2572106  101
ppmy /o6 /m220         1040  1040    3021  365    2085  264     3390712  133
ppmy /o9 /m220         1142  1180    3177  384    2150  272     3082736  121
qlfc a                   64    23    1027  124     969  123     3385077  133
rar a -m3                70     5    1193  144    1130  143     4026937  158
rar a -m4                42    25     950  115     912  115     3178761  125
rar a -m5                47    33     958  116     916  116     3164814  124
rk -mf1                  34    25    1121  135    1091  138     3826096  150
rk -mf2                 105    62    1181  143    1087  137     3652520  143
rk -mx1                 284   287    1428  172    1172  148     3083632  121
rk -mx3                 318   339    1507  182    1222  154     3064076  120
sbc c -m3 -b63           74    24     951  115     885  112     3072619  120
sbc c -of -b63           78    22     887  107     817  103     2835293  111
sbc c -os -b63           76    22     885  107     817  103     2835282  111
szip -o8                 45    31    1027  124     987  125     3427399  134
szip -o8 -b41            42    31    1000  121     962  122     3339124  131
ufa a -mg               165   181    1287  155    1138  144     3387619  133
ufa a -mg -mu32         201   214    1343  162    1163  147     3343996  131
uharc a -m1 -md32768    137    19    1112  134     989  125     3446073  135
uharc a -m2 -md32768    193    17    1086  131     912  115     3151576  124
uharc a -m3 -md32768    226    18    1101  133     897  113     3087253  121
uharc a -mz -md32768     29    30    1129  136    1103  139     3851126  151
uharc a -mx -md32768    106    90    1033  125     937  118     3013102  118
ybs -m16mu               55    16     864  104     814  103     2857431  112
ybs -m16mu -r            58    16     872  105     820  104     2878427  113
zzip a                   52    25    1020  123     974  123     3400236  133
zzip a -mx               55    25    1019  123     970  123     3383053  133
zzip a -mx -30m          68    25     990  120     929  117     3233140  127

Overall score is calculated by adding compression time, extraction time, and
time it would take to transfer the compressed file over a 28,800bps network:
(compressed_size)/3600

Average Users' score is calculated by adding (compress_time/10)+ extract_time +
time it would take to transfer the compressed file over a 28,800bps network.
Compression time is divided by 10 here, because more than 90% of people would
never compress anything during their life (with compression programs), but they
use compressed data almost _every_ time they use computers and/or Internet.
That's why compression time is not so actual for them.


[[3]] Details
=============
are no longer put to this main text
(1514 lines reporting 69328 results on 1231 files in 6 sets),
but can be found in FULL version with TEXTS.DAT and *.BAT
at http://geocities.com/SiliconValley/Bay/1995/artest23.zip
or http://artest1.tripod.com/artest23.zip


[[4]] Comments
==============
Links to download programs:
~~~~~~~~~~~~~~~~~~~~~~~~~~~
7-Zip 2.30b17 :W http://www.7-zip.org/dl/7zip230b17.exe                           891K
ACE 2.04      :W http://winace.host.sk/ace204.exe                                 593K
ERI 5.1fre    :e http://geocities.com/eri32/eri51fre.zip                           89K
GCac 0.9k     :W http://www.emit.jp/gca/gca_v09k.exe                              295K
PPMN 1.00b1+  :W http://compression.graphicon.ru/ms/ppmnb1+.rar                    40K
PPMY 0.3b     :W http://www.pilabs.org.ua/sh/ppmy_3b.zip                           15K
RAR32  3.00b4 :e ftp://pubftp.slovenska.sk/pub/rarsoft/rar/rarx30b4.exe           415K
WinRAR 3.00b4 :W ftp://pubftp.slovenska.sk/pub/rarsoft/rar/wrar30b4.exe           942K
SBC_d 0.910b  :e http://personal.inet.fi/musiikki/sjm/sbc0910b_dos.zip            195K
SBC   0.910b  :W http://personal.inet.fi/musiikki/sjm/sbc0910b_win32.zip          154K
UHArc 0.4b    :eW ftp://ftp.elf.stuba.sk/pub/pc/pack/uharc04.zip                  220K
QLFC 6.6W     :W http://ghido.shelter.ro/DownloadQLFC.php                          49K
ZZip 0.36c    :W http://www.via.ecp.fr/~damien/downloads/zzip-win32.zip            35K
PPMD var.H,
PPmonstr v.H  :W ftp://ftp.elf.stuba.sk/pub/pc/pack/ppmdh.rar                      57K
PkzipC  4.00  :W ftp://ftp.pkware.com/pkzc400s.exe                               3470K
PkZip   2.50  :a ftp://ftp.simtel.net/pub/simtelnet/msdos/arcers/pk250dos.exe     202K
RK-dos 1.04.1 :e http://rksoft.virtualave.net/downloads/rk104a1d.exe              461K
RK     1.04.1 :W http://rksoft.virtualave.net/downloads/rk104a1w.exe              380K
BA 1.01b5     :e http://hem.spray.se/mikael.lundqvist/ba101br5.zip                 61K
BIX 1.00b7    :W http://www.7-zip.org/dl/ufa/bix100b7.zip                          89K
777 0.04b1    :W http://www.7-zip.org/dl/ufa/777004b1.zip                          72K
UFA 0.04b1    :W http://www.7-zip.org/dl/ufa/ufa004b1.zip                          64K
ArHanGeL 1.40 :a http://geocities.com/SiliconValley/Lab/6606/arh140.zip            50K
Imp     1.1   :e http://www.technelysium.com.au/imp110d.zip                       266K
Imp-win 1.12  :W http://www.technelysium.com.au/imp112.exe                        122K
ACB 2.00c     :e ftp://ftp.simtel.net/pub/simtelnet/msdos/compress/acb_200c.zip    42K
BOA 0.58b     :e ftp://ftp.elf.stuba.sk/pub/pc/pack/boa058.zip                     74K
DC 0.98b      :W ftp://ftp.elf.stuba.sk/pub/pc/pack/dc124.zip                      55K
Bzip2 1.0.2   :W ftp://sourceware.cygnus.com/pub/bzip2/v102/bzip2-102-x86-win32.exe 72K
SZip 1.12a    :W http://www.compressconsult.com/szip/szip_112a_win32.zip           71K
YBS 0.03e     :e http://compression.graphicon.ru/ybs/ybs003ed.zip                  56K
YBS 0.03e     :W http://compression.graphicon.ru/ybs/ybs003ew.zip                  44K
BEE 0.4.8     :W mailto:Andrew.Filinsky@p11.f4.n452.z2.fidonet.org

:a - any DOS  - DOS programs, will run under pure DOS or in a DOS box
:e - extender - DOS programs using DOS extenders like DOS/4GW or CWSDPMI
:W - windows  - Windows95/98/NT/etc programs

If direct link doesn't work-most probably newer version of the program appeared
at the same site: visit web page, or read the whole directory from ftp server
(i.e. try the same URL, but without filename).


Homepages:
~~~~~~~~~~
Arhangel   : http://geocities.com/SiliconValley/Lab/6606
BA         : http://hem.spray.se/mikael.lundqvist
Eri32      : http://geocities.com/eri32
  mirror   : http://artest1.tripod.com
RK         : http://rksoft.virtualave.net
Imp,WinImp : http://www.technelysium.com.au/winimp.html
ACE,WinACE : http://www.winace.com
PkZip      : http://www.pkware.com
RAR,WinRAR : http://www.rarsoft.com
BZip2      : http://sources.redhat.com/bzip2
SZip       : http://www.compressconsult.com/szip
PPMN       : http://compression.graphicon.ru/ms
YBS        : http://compression.graphicon.ru/ybs
SBC        : http://sbcarchiver.netfirms.com
QLFC       : http://ghido.shelter.ro
GCAC       : http://emit.jp/gca/gca.html
PPMy       : http://www.pilabs.org.ua/sh
ZZip       : ? http://www.zzip.f2s.com
Ufa,777,
 BIX,7-Zip : http://www.7-zip.org


What's new:
~~~~~~~~~~~
14 new programs were tested:

UHArc 0.4 beta
PPMd v.Ipre
PPMonstr v.Ipre
7-zip 2.30b17
RAR 3.00b4
PPMN 1.00b1
ACE 2.04
SBC 0.910 beta
ERI 5.1fre
ZZip 0.36c
GCAC 0.9k
QLFC 6.6W
PPMy 0.3b
Entropy 0.5

Latest beta versions of PPMonstr, Entropy, BEE, DC, are available
from authors by e-mail request:
PPMonstr: shkarin@arstel.ru
Entropy: artest@inbox.ru
BEE: Andrew.Filinsky@p11.f4.n452.z2.fidonet.org
DC: EdgarBinder@t-online.de

 Results of ACB, 777, GCAC, SZip, QLFC, ArHanGeL, IMP, BICOM, BIX, ACE
 are in full version only, TEXTS.DAT file.

The set of Russian texts has moved to http://arte.nm.ru/m120

Some old or less efficient programs will not be tested next time:
777, UFA, BIX, ACB, BA, BEE, PPMy.


WARNINGS:
~~~~~~~~~
BA 1.00beta5 can't correctly decompress shaks12.txt .

DC 0.99.158b failed to decompress 1DFRE10.dc , ANDES10.dc , and BTI0110.dc,
saying "Corrupted block" (while t(est) command writes "Test successful").

ERI 5.1fre can't compress files larger than (free DPMI memory)/6, i.e.
about 10Mb on a PC with 64Mb RAM. The largest 44Mb file was split to 5 chunks
9000000 bytes long (last chunk was 8894190 bytes).

Problems in all other compressors were not found.


The LATEST RELEASE, and all previous versions of these tests can be found
at http://geocities.com/SiliconValley/Bay/1995/ and http://artest1.tripod.com/


Send your suggestions, comments to artest@inbox.ru
With best kind regards,
A.Ratushnyak,
RAO Inc.
