
The Art of Lossless Data Compression vol. 23t

Here are the results of tests performed in March 2002 to compare lossless compression of "plain" texts by all known programs that are good enough for this purpose, including RK, DC, YBS, Bzip2, RAR, ACE and 7-zip. See the Archive Comparison Test by J.Gilchrist for more details: http://compression.ca

If anybody wants to start or continue such tests, can suggest other sets of texts or other compression programs (executable programs only, not sources or algorithm descriptions), or knows that we have missed something important (some fantastic new technology, an algorithm, or even a program capable of lossless compression of up to 1000:1, etc.), please let us know immediately: artest@inbox.ru

Thank you!

[[1]] COMPRESSION QUALITY

(see also [[2]] Speed [[3]] Details [[4]] Comments)

Last (seventh) line shows results for the sum of all 1231 texts in six sets.

 Original  Entropy PPMonstr     PPMN     PPMd       RK     PPMY      RAR       DC      BOA

  552.36%     100%   102.24   105.09   109.82   105.46   108.37   109.22   108.47   109.83
  529.19%     100%   106.13   107.42   111.43   109.39   112.25   111.63   111.36   114.46
  427.31%     100%   105.95   105.08   109.88   108.68   108.87   110.43   109.08   111.04
  472.03%     100%   106.02   105.55   110.70   109.76   110.60   111.31   110.89   112.67
  790.80%   103.71     100%   114.70   108.72   109.33   125.47   120.20   120.16   121.01
  344.47%   100.01     100%   100.24   103.16   107.29   103.27   103.56   108.93   104.79
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  453.41%     100%   103.97   104.47   108.38   108.63   109.07   109.45   110.36   110.67

      SBC      BEE      YBS    UHARC      ERI    7-zip       BA     ZZip    BZip2    pkzip

   109.26   112.09   108.76   107.15   109.34   116.60   111.75   111.57   123.85   158.95
   113.27   115.15   114.38   112.22   116.39   117.47   116.50   117.28   127.57   173.48
   110.95   112.84   111.79   110.30   113.02   113.65   113.98   114.71   122.12   160.11
   112.41   113.44   113.63   112.82   115.22   116.60   115.96   116.78   126.31   167.32
   117.55   122.56   119.29   121.46   124.65   136.93   129.23   123.54   148.08   203.17
   107.55   105.28   106.76   110.77   108.72   104.11   109.78   110.87   113.99   144.81
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   110.98   111.59   111.62   112.00   113.43   113.91   114.50   115.01   123.49   162.19
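The percentages above appear to be normalized so that the best (smallest) result on each line is 100% and every other entry, including the original uncompressed size, is expressed relative to it. The text does not state this explicitly, so the following Python sketch is only an illustration under that assumption; the function name and the byte counts are hypothetical and are not taken from the tests.

```python
def relative_percent(sizes):
    """Express each size as a percentage of the smallest size in the mapping.

    Assumption: the best (smallest) result is taken as 100%.
    """
    best = min(sizes.values())
    return {name: round(100.0 * size / best, 2) for name, size in sizes.items()}


# Hypothetical byte counts, for illustration only (not measured data).
example = {"original": 5000, "prog_a": 1000, "prog_b": 1100}
print(relative_percent(example))
```

With this reading, a value such as 110.00 means the output is 10% larger than the best result on that line.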

[[2]] Speed

Canterbury Corpus Large Set http://corpus.canterbury.ac.nz/resources/large.zip was used for this test, on a 400MHz PC with 256Mb RAM and Windows98.

Programs,              Comp. Extr.       Overall   Avg. Users'      Compressed
options                 sec.  sec.         Score         Score            Size
                                       sec.    %     sec.    %      bytes    %

no compression             0     0     4446  537     4446  562   16005619  628
777 a -mg                251   162     1397  169     1171  148    3544031  139
7za a -t7z               178     6     1213  146     1053  133    3707526  145
7za a -t7z -mx           269     6     1289  156     1046  132    3649093  143
7za a -tzip               62     6     1287  155     1232  156    4393623  172
7za a -tzip -mx          102     5     1329  161     1238  157    4401160  173
acb B                    718   672     2319  280     1673  212    3346915  131
acb u                   1277  1276     3448  416     2298  291    3221349  126
ace32 a                  100     6     1216  147     1126  142    3998222  157
ace32 a -d4096           106     6     1212  146     1117  141    3962314  155
ace32 a -d4096 -m1        57     6     1197  145     1146  145    4086782  160
ace32 a -d4096 -m5       328     6     1423  172     1128  143    3923686  154
ace a -d4096 -m5         311     4     1405  170     1125  142    3923686  154
arh a -2 -mm              68    68     1148  139     1087  137    3647060  143
arh a -1 -mm             212     8     1499  181     1308  165    4605600  181
arh a -2 -1              280    68     1360  164     1108  140    3647060  143
ba -k -1                  44    23     1154  139     1114  141    3915288  154
ba -k -10                 62    23     1031  125      976  123    3409364  134
ba -k -50                 82    26     1021  123      947  120    3288361  129
bee a -m1 -d3            124   152     1207  146     1096  139    3353767  132
bee a -m2 -d3            161   186     1260  152     1115  141    3289365  129
bee a -m3 -d3            255   297     1454  176     1225  155    3248025  127
bix a -mdg               156     5     1185  143     1045  132    3690815  145
bix a -mdg -s            162     4     1191  144     1045  132    3690984  145
boa -m15                 254   274     1411  170     1183  150    3182732  125
boa -m15 -s              243   262     1389  168     1170  148    3182732  125
boa -m7                  231   240     1364  165     1156  146    3217347  126
bzip2 -k -1               39    13     1192  144     1157  146    4107255  161
bzip2 -k -9               44    14     1061  128     1021  129    3613587  142
dc e                      29    17      929  112      903  114    3179166  125
dc e -b16300 -mt5         41    17      828  100      791  100    2773420  109
Entropy o12              163   165     1065  129      918  116    2652071  104
Entropy o16              219   208     1135  137      938  119    2550196  100
Entropy o32              399   355     1541  186     1182  149    2837256  111
eri a                     67    32      979  118      919  116    3168414  124
eri a -m6                 81    37     1027  124      954  121    3272926  128
gcac a                    62    27     1031  125      975  123    3390596  133
gcac s                    58    26     1031  125      978  124    3408061  134
imp98 a -mm -2            37    10     1029  124      996  126    3535344  139
imp98 a -mm -s4           57     4     1202  145     1151  146    4112870  161
imp98 a -2 -s4            34    10     1032  125     1001  127    3557637  140
pkzip -es                  4     3     1658  200     1655  209    5945608  233
pkzip -a                  14     1     1318  159     1305  165    4691477  184
pkzip -exx               128     1     1408  170     1293  163    4605928  181
ppmd e -o3 -m250          12    14     1096  132     1085  137    3852169  151
ppmd e -o11 -m250         43    45      883  107      844  107    2863807  112
ppmd e -o12 -m250         53    56      890  107      843  107    2813928  110
ppmd e -o16 -m250         90    72      918  111      836  106    2719635  107
ppmn e -o6 -MT1           73    76      994  120      928  117    3044236  119
ppmn e -o8 -MT1          120   122     1084  131      976  123    3035542  119
ppmn e -o9 -MT1          128   135     1107  134      991  125    3040103  119
ppmonstr e -o3 -m250      97   103     1234  149     1147  145    3723983  146
ppmonstr e -o7 -m250     138   139     1097  132      973  123    2951420  116
ppmonstr e -o12 -m250    176   185     1104  133      946  120    2675291  105
ppmonstr e -o16 -m250    239   219     1173  142      957  121    2572106  101
ppmy /o6 /m220          1040  1040     3021  365     2085  264    3390712  133
ppmy /o9 /m220          1142  1180     3177  384     2150  272    3082736  121
qlfc a                    64    23     1027  124      969  123    3385077  133
rar a -m3                 70     5     1193  144     1130  143    4026937  158
rar a -m4                 42    25      950  115      912  115    3178761  125
rar a -m5                 47    33      958  116      916  116    3164814  124
rk -mf1                   34    25     1121  135     1091  138    3826096  150
rk -mf2                  105    62     1181  143     1087  137    3652520  143
rk -mx1                  284   287     1428  172     1172  148    3083632  121
rk -mx3                  318   339     1507  182     1222  154    3064076  120
sbc c -m3 -b63            74    24      951  115      885  112    3072619  120
sbc c -of -b63            78    22      887  107      817  103    2835293  111
sbc c -os -b63            76    22      885  107      817  103    2835282  111
szip -o8                  45    31     1027  124      987  125    3427399  134
szip -o8 -b41             42    31     1000  121      962  122    3339124  131
ufa a -mg                165   181     1287  155     1138  144    3387619  133
ufa a -mg -mu32          201   214     1343  162     1163  147    3343996  131
uharc a -m1 -md32768     137    19     1112  134      989  125    3446073  135
uharc a -m2 -md32768     193    17     1086  131      912  115    3151576  124
uharc a -m3 -md32768     226    18     1101  133      897  113    3087253  121
uharc a -mz -md32768      29    30     1129  136     1103  139    3851126  151
uharc a -mx -md32768     106    90     1033  125      937  118    3013102  118
ybs -m16mu                55    16      864  104      814  103    2857431  112
ybs -m16mu -r             58    16      872  105      820  104    2878427  113
zzip a                    52    25     1020  123      974  123    3400236  133
zzip a -mx                55    25     1019  123      970  123    3383053  133
zzip a -mx -30m           68    25      990  120      929  117    3233140  127

In each % column, the best result in the preceding column is taken as 100%.

Overall score is calculated by adding compression time, extraction time, and the time it would take to transfer the compressed file over a 28,800bps network: (compressed_size)/3600 seconds.

Average Users' score is calculated by adding (compress_time/10) + extract_time + the time it would take to transfer the compressed file over a 28,800bps network. Compression time is divided by 10 here because more than 90% of people would never compress anything during their life (with compression programs), but they use compressed data almost _every_ time they use computers and/or the Internet. That's why compression time matters less to them.
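The two score definitions above can be sketched in Python as follows. This is a minimal illustration, not the authors' actual script: the function names are hypothetical, and truncating to whole seconds is an assumption that reproduces the rows checked here. Since the published times are whole seconds, some rows may differ from this sketch by a second.

```python
import math

# A 28,800 bps link carries 28800 / 8 = 3600 bytes per second.
BYTES_PER_SECOND = 28800 // 8


def transfer_seconds(compressed_size):
    """Time to send the compressed file over the 28,800 bps link."""
    return compressed_size / BYTES_PER_SECOND


def overall_score(compress_s, extract_s, compressed_size):
    """Compression time + extraction time + transfer time, in whole seconds."""
    total = compress_s + extract_s + transfer_seconds(compressed_size)
    return math.floor(total)  # assumption: fractions of a second are dropped


def average_user_score(compress_s, extract_s, compressed_size):
    """Same as overall_score, but compression time counts only one tenth."""
    total = compress_s / 10 + extract_s + transfer_seconds(compressed_size)
    return math.floor(total)  # assumption: fractions of a second are dropped


# Row "dc e -b16300 -mt5": 41 s to compress, 17 s to extract, 2773420 bytes.
print(overall_score(41, 17, 2773420))       # 828
print(average_user_score(41, 17, 2773420))  # 791
```

Because the transfer term dominates on such a slow link, a slow but strong compressor can still score well; the Average Users' score shifts the balance further toward compressed size and extraction speed.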

[[3]] Details

Details are no longer included in this main text (1514 lines reporting 69328 results on 1231 files in 6 sets), but can be found in the FULL version, with TEXTS.DAT and *.BAT, at http://geocities.com/SiliconValley/Bay/1995/artest23.zip or http://artest1.tripod.com/artest23.zip

[[4]] Comments

Links to download programs:

7-Zip 2.30b17             :W   http://www.7-zip.org/dl/7zip230b17.exe  891K
ACE 2.04                  :W   http://winace.host.sk/ace204.exe  593K
ERI 5.1fre                :e   http://geocities.com/eri32/eri51fre.zip  89K
GCac 0.9k                 :W   http://www.emit.jp/gca/gca_v09k.exe  295K
PPMN 1.00b1+              :W   http://compression.graphicon.ru/ms/ppmnb1+.rar  40K
PPMY 0.3b                 :W   http://www.pilabs.org.ua/sh/ppmy_3b.zip  15K
RAR32 3.00b4              :e   ftp://pubftp.slovenska.sk/pub/rarsoft/rar/rarx30b4.exe  415K
WinRAR 3.00b4             :W   ftp://pubftp.slovenska.sk/pub/rarsoft/rar/wrar30b4.exe  942K
SBC_d 0.910b              :e   http://personal.inet.fi/musiikki/sjm/sbc0910b_dos.zip  195K
SBC 0.910b                :W   http://personal.inet.fi/musiikki/sjm/sbc0910b_win32.zip  154K
UHArc 0.4b                :eW  ftp://ftp.elf.stuba.sk/pub/pc/pack/uharc04.zip  220K
QLFC 6.6W                 :W   http://ghido.shelter.ro/DownloadQLFC.php  49K
ZZip 0.36c                :W   http://www.via.ecp.fr/~damien/downloads/zzip-win32.zip  35K
PPMD var.H, PPmonstr v.H  :W   ftp://ftp.elf.stuba.sk/pub/pc/pack/ppmdh.rar  57K
PkzipC 4.00               :W   ftp://ftp.pkware.com/pkzc400s.exe  3470K
PkZip 2.50                :a   ftp://ftp.simtel.net/pub/simtelnet/msdos/arcers/pk250dos.exe  202K
RK-dos 1.04.1             :e   http://rksoft.virtualave.net/downloads/rk104a1d.exe  461K
RK 1.04.1                 :W   http://rksoft.virtualave.net/downloads/rk104a1w.exe  380K
BA 1.01b5                 :e   http://hem.spray.se/mikael.lundqvist/ba101br5.zip  61K
BIX 1.00b7                :W   http://www.7-zip.org/dl/ufa/bix100b7.zip  89K
777 0.04b1                :W   http://www.7-zip.org/dl/ufa/777004b1.zip  72K
UFA 0.04b1                :W   http://www.7-zip.org/dl/ufa/ufa004b1.zip  64K
ArHanGeL 1.40             :a   http://geocities.com/SiliconValley/Lab/6606/arh140.zip  50K
Imp 1.1                   :e   http://www.technelysium.com.au/imp110d.zip  266K
Imp-win 1.12              :W   http://www.technelysium.com.au/imp112.exe  122K
ACB 2.00c                 :e   ftp://ftp.simtel.net/pub/simtelnet/msdos/compress/acb_200c.zip  42K
BOA 0.58b                 :e   ftp://ftp.elf.stuba.sk/pub/pc/pack/boa058.zip  74K
DC 0.98b                  :W   ftp://ftp.elf.stuba.sk/pub/pc/pack/dc124.zip  55K
Bzip2 1.0.2               :W   ftp://sourceware.cygnus.com/pub/bzip2/v102/bzip2-102-x86-win32.exe  72K
SZip 1.12a                :W   http://www.compressconsult.com/szip/szip_112a_win32.zip  71K
YBS 0.03e                 :e   http://compression.graphicon.ru/ybs/ybs003ed.zip  56K
YBS 0.03e                 :W   http://compression.graphicon.ru/ybs/ybs003ew.zip  44K
BEE 0.4.8                 :W   mailto:Andrew.Filinsky@p11.f4.n452.z2.fidonet.org

:a - any DOS  - DOS programs, will run under pure DOS or in a DOS box
:e - extender - DOS programs using DOS extenders like DOS/4GW or CWSDPMI
:W - windows  - Windows95/98/NT/etc programs

If a direct link doesn't work, most probably a newer version of the program has appeared at the same site: visit the web page, or read the whole directory from the ftp server (i.e. try the same URL, but without the filename).

Homepages:

Arhangel           : http://geocities.com/SiliconValley/Lab/6606
BA                 : http://hem.spray.se/mikael.lundqvist
Eri32              : http://geocities.com/eri32
  mirror           : http://artest1.tripod.com
RK                 : http://rksoft.virtualave.net
Imp,WinImp         : http://www.technelysium.com.au/winimp.html
ACE,WinACE         : http://www.winace.com
PkZip              : http://www.pkware.com
RAR,WinRAR         : http://www.rarsoft.com
BZip2              : http://sources.redhat.com/bzip2
SZip               : http://www.compressconsult.com/szip
PPMN               : http://compression.graphicon.ru/ms
YBS                : http://compression.graphicon.ru/ybs
SBC                : http://sbcarchiver.netfirms.com
QLFC               : http://ghido.shelter.ro
GCAC               : http://emit.jp/gca/gca.html
PPMy               : http://www.pilabs.org.ua/sh
ZZip               : ? http://www.zzip.f2s.com
Ufa,777,BIX,7-Zip  : http://www.7-zip.org

What's new:

14 new programs were tested:

  UHArc 0.4 beta
  PPMd v.Ipre
  PPMonstr v.Ipre
  7-zip 2.30b17
  RAR 3.00b4
  PPMN 1.00b1
  ACE 2.04
  SBC 0.910 beta
  ERI 5.1fre
  ZZip 0.36c
  GCAC 0.9k
  QLFC 6.6W
  PPMy 0.3b
  Entropy 0.5

The latest beta versions of PPMonstr, Entropy, BEE and DC are available from the authors by e-mail request:

  PPMonstr : shkarin@arstel.ru
  Entropy  : artest@inbox.ru
  BEE      : Andrew.Filinsky@p11.f4.n452.z2.fidonet.org
  DC       : EdgarBinder@t-online.de

Results of ACB, 777, GCAC, SZip, QLFC, ArHanGeL, IMP, BICOM, BIX and ACE are in the full version only (TEXTS.DAT file).

The set of Russian texts has moved to http://arte.nm.ru/m120

Some old or less efficient programs will not be tested next time: 777, UFA, BIX, ACB, BA, BEE, PPMy.

WARNINGS:

BA 1.00beta5 can't correctly decompress shaks12.txt.

DC 0.99.158b failed to decompress 1DFRE10.dc, ANDES10.dc, and BTI0110.dc, saying "Corrupted block" (while the t(est) command writes "Test successful").

ERI 5.1fre can't compress files larger than (free DPMI memory)/6, i.e. about 10Mb on a PC with 64Mb RAM. The largest 44Mb file was split into 5 chunks of 9000000 bytes each (the last chunk was 8894190 bytes).

No problems were found in any of the other compressors.

The LATEST RELEASE and all previous versions of these tests can be found at http://geocities.com/SiliconValley/Bay/1995/ and http://artest1.tripod.com/

Send your suggestions and comments to artest@inbox.ru

With best kind regards,
A.Ratushnyak, RAO Inc.