Mail Archives: pgcc/2000/01/30/16:57:28
On Sun, Jan 30, 2000 at 10:20:36AM +0100, Wolfgang Formann <w DOT formann AT netsurf213 DOT neuss DOT netsurf DOT de> wrote:
> For whatever reason, I added three nop's in from of the assembler stuff
> in libdes's implementation of fcrypt, I got a speedup of 10%.
> The funny thing is, that now the main loop does start ad some odd
> address.
I did the same in my zip-cracker ;) 3% speedup (and no jump anywhere near
that place ;)
> Well, my processor is an AMD-K6-I/233, so maybe another issue for
> aligning could be to prevent multi-byte instructions not to go over some
> cache (or whatever) boundary, which is the only explanation I curently
> have.
The pentium suffers from this a lot, but the individual speed improvement
is far from your 10%.
> 10% is really a lot, inside a loop, which takes (about) 25 * 35 cycles.
That's very much. I doubt it really is the three nops, but...
--
-----==- |
----==-- _ |
---==---(_)__ __ ____ __ Marc Lehmann +--
--==---/ / _ \/ // /\ \/ / pcg AT opengroup DOT org |e|
-=====/_/_//_/\_,_/ /_/\_\ XX11-RIPE --+
The choice of a GNU generation |
|
- Raw text -