delorie.com/archives/browse.cgi   search  
Mail Archives: pgcc/2000/01/30/09:24:03

Sender: wolfi AT netsurf213 DOT neuss DOT netsurf DOT de
Message-ID: <38940264.30C35BDD@neuss.netsurf.de>
Date: Sun, 30 Jan 2000 10:20:36 +0100
From: Wolfgang Formann <w DOT formann AT netsurf213 DOT neuss DOT netsurf DOT de>
X-Mailer: Mozilla 4.6 [en] (X11; I; Linux 2.2.8 i586)
X-Accept-Language: German, de, en
MIME-Version: 1.0
To: pgcc AT delorie DOT com
Subject: Re: pgcc and egcs alignment -- function, basic block and string
References: <38921CD6 DOT 2A725779 AT ix DOT netcom DOT com> <20000129032101 DOT A25630 AT atrey DOT karlin DOT mff DOT cuni DOT cz>
Reply-To: pgcc AT delorie DOT com

Jan Hubicka wrote:
> 
[... deleted ...]
> 
> Honza
> >
> > Chris Sears
> > cbsears AT ix DOT netcom DOT com

Please before you make any changes in the code, test it again and again.
For whatever reason, I added three nop's in from of the assembler stuff
in libdes's implementation of fcrypt, I got a speedup of 10%.
The funny thing is, that now the main loop does start ad some odd
address.
Well, my processor is an AMD-K6-I/233, so maybe another issue for
aligning could be to prevent multi-byte instructions not to go over some
cache (or whatever) boundary, which is the only explanation I curently
have.

10% is really a lot, inside a loop, which takes (about) 25 * 35 cycles.

Any other ideas?

Wolfgang

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019