annotate src/zlib-1.2.8/contrib/asm686/README.686 @ 48:9530b331f8c1

Add Cap'n Proto source
author Chris Cannam <cannam@all-day-breakfast.com>
date Tue, 25 Oct 2016 11:17:01 +0100
parents 5ea0608b923f
children
rev   line source
Chris@43 1 This is a patched version of zlib, modified to use
Chris@43 2 Pentium-Pro-optimized assembly code in the deflation algorithm. The
Chris@43 3 files changed/added by this patch are:
Chris@43 4
Chris@43 5 README.686
Chris@43 6 match.S
Chris@43 7
Chris@43 8 The speedup that this patch provides varies, depending on whether the
Chris@43 9 compiler used to build the original version of zlib falls afoul of the
Chris@43 10 PPro's speed traps. My own tests show a speedup of around 10-20% at
Chris@43 11 the default compression level, and 20-30% using -9, against a version
Chris@43 12 compiled using gcc 2.7.2.3. Your mileage may vary.
Chris@43 13
Chris@43 14 Note that this code has been tailored for the PPro/PII in particular,
Chris@43 15 and will not perform particuarly well on a Pentium.
Chris@43 16
Chris@43 17 If you are using an assembler other than GNU as, you will have to
Chris@43 18 translate match.S to use your assembler's syntax. (Have fun.)
Chris@43 19
Chris@43 20 Brian Raiter
Chris@43 21 breadbox@muppetlabs.com
Chris@43 22 April, 1998
Chris@43 23
Chris@43 24
Chris@43 25 Added for zlib 1.1.3:
Chris@43 26
Chris@43 27 The patches come from
Chris@43 28 http://www.muppetlabs.com/~breadbox/software/assembly.html
Chris@43 29
Chris@43 30 To compile zlib with this asm file, copy match.S to the zlib directory
Chris@43 31 then do:
Chris@43 32
Chris@43 33 CFLAGS="-O3 -DASMV" ./configure
Chris@43 34 make OBJA=match.o
Chris@43 35
Chris@43 36
Chris@43 37 Update:
Chris@43 38
Chris@43 39 I've been ignoring these assembly routines for years, believing that
Chris@43 40 gcc's generated code had caught up with it sometime around gcc 2.95
Chris@43 41 and the major rearchitecting of the Pentium 4. However, I recently
Chris@43 42 learned that, despite what I believed, this code still has some life
Chris@43 43 in it. On the Pentium 4 and AMD64 chips, it continues to run about 8%
Chris@43 44 faster than the code produced by gcc 4.1.
Chris@43 45
Chris@43 46 In acknowledgement of its continuing usefulness, I've altered the
Chris@43 47 license to match that of the rest of zlib. Share and Enjoy!
Chris@43 48
Chris@43 49 Brian Raiter
Chris@43 50 breadbox@muppetlabs.com
Chris@43 51 April, 2007