annotate src/zlib-1.2.7/contrib/asm686/README.686 @ 148:b4bfdf10c4b3

Update Win64 capnp builds to v0.6
author Chris Cannam <cannam@all-day-breakfast.com>
date Mon, 22 May 2017 18:56:49 +0100
parents 8a15ff55d9af
children
rev   line source
cannam@89 1 This is a patched version of zlib, modified to use
cannam@89 2 Pentium-Pro-optimized assembly code in the deflation algorithm. The
cannam@89 3 files changed/added by this patch are:
cannam@89 4
cannam@89 5 README.686
cannam@89 6 match.S
cannam@89 7
cannam@89 8 The speedup that this patch provides varies, depending on whether the
cannam@89 9 compiler used to build the original version of zlib falls afoul of the
cannam@89 10 PPro's speed traps. My own tests show a speedup of around 10-20% at
cannam@89 11 the default compression level, and 20-30% using -9, against a version
cannam@89 12 compiled using gcc 2.7.2.3. Your mileage may vary.
cannam@89 13
cannam@89 14 Note that this code has been tailored for the PPro/PII in particular,
cannam@89 15 and will not perform particuarly well on a Pentium.
cannam@89 16
cannam@89 17 If you are using an assembler other than GNU as, you will have to
cannam@89 18 translate match.S to use your assembler's syntax. (Have fun.)
cannam@89 19
cannam@89 20 Brian Raiter
cannam@89 21 breadbox@muppetlabs.com
cannam@89 22 April, 1998
cannam@89 23
cannam@89 24
cannam@89 25 Added for zlib 1.1.3:
cannam@89 26
cannam@89 27 The patches come from
cannam@89 28 http://www.muppetlabs.com/~breadbox/software/assembly.html
cannam@89 29
cannam@89 30 To compile zlib with this asm file, copy match.S to the zlib directory
cannam@89 31 then do:
cannam@89 32
cannam@89 33 CFLAGS="-O3 -DASMV" ./configure
cannam@89 34 make OBJA=match.o
cannam@89 35
cannam@89 36
cannam@89 37 Update:
cannam@89 38
cannam@89 39 I've been ignoring these assembly routines for years, believing that
cannam@89 40 gcc's generated code had caught up with it sometime around gcc 2.95
cannam@89 41 and the major rearchitecting of the Pentium 4. However, I recently
cannam@89 42 learned that, despite what I believed, this code still has some life
cannam@89 43 in it. On the Pentium 4 and AMD64 chips, it continues to run about 8%
cannam@89 44 faster than the code produced by gcc 4.1.
cannam@89 45
cannam@89 46 In acknowledgement of its continuing usefulness, I've altered the
cannam@89 47 license to match that of the rest of zlib. Share and Enjoy!
cannam@89 48
cannam@89 49 Brian Raiter
cannam@89 50 breadbox@muppetlabs.com
cannam@89 51 April, 2007