delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2019/08/18/02:05:04

X-Recipient: archive-cygwin AT delorie DOT com
DomainKey-Signature: a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id
:list-unsubscribe:list-subscribe:list-archive:list-post
:list-help:sender:subject:to:references:from:message-id:date
:mime-version:in-reply-to:content-type
:content-transfer-encoding; q=dns; s=default; b=XFWi8r6Q8JETxCTK
iwIrXyoWGPCJojCleSzZ8Mt+KQhDBBCPWs7mHRS5sMfN7wh1z68RilBaUz6LIL0q
FxtODiekjytuOeKBGuznJL+DK8H5UWoqz9cjvSJBOGLBVCNKN2Zd33+Ai1GUEJ+g
yzcW22yJCkiUv48qcV9mCPcwE14=
DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=sourceware.org; h=list-id
:list-unsubscribe:list-subscribe:list-archive:list-post
:list-help:sender:subject:to:references:from:message-id:date
:mime-version:in-reply-to:content-type
:content-transfer-encoding; s=default; bh=2c3v8WTfF6hsDnrhk+ePzz
ZV8rE=; b=UoBMkkOylFlm12BqIvBrEITdW1BCyXTdX0MMIZxiwt2Ak1iKJFL8/z
u8ZdmJXPQommZIvOXlhGIETk/e6iPfq8afXdySmCXDfhVqmD7JWvWtdVkEURl7eU
/SezqIdmJxQqP9t3dxlV/LVTBvdagd82rfMr+Kc3ECYDIm4vE/SPo=
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com
Authentication-Results: sourceware.org; auth=none
X-Spam-SWARE-Status: No, score=-1.4 required=5.0 tests=AWL,BAYES_00,RCVD_IN_DNSWL_NONE,SPF_HELO_PASS,SPF_PASS autolearn=ham version=3.3.1 spammy=HX-HELO:sk:gateway, offending, H*RU:sk:gateway, initializers
X-HELO: gateway30.websitewelcome.com
DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=agner.org; s=default; h=Content-Transfer-Encoding:Content-Type:In-Reply-To:MIME-Version :Date:Message-ID:From:References:To:Subject:Sender:Reply-To:Cc:Content-ID: Content-Description:Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc :Resent-Message-ID:List-Id:List-Help:List-Unsubscribe:List-Subscribe: List-Post:List-Owner:List-Archive; bh=dJjm+08k+e/fuUDYFwPZW0QRzofJto8hKeKAYKuBuFw=; b=JxIC5dOjxUxSJDh/OiMN9upOQd 5iuBsKuJW1YIe2qXEveQiwiCNgR6ZgokYDw2xWAKEGkaf6kikMqCe6v5zrtT7t+OAHB1Uygx3W6lt R8SpmTwMhCVTYKffpsrYyLiGloZajieKCJiI59VYAg+FVdYOoRZnywgFwr8PTTR/8U+xYWqBhGoxe yPieCw4c3IpriAsUbr0mTBFe8/Wj7kXbpCudbFSHXBkhTbFtvrk+juyZ8Sd0JcE8XUX5ze6/kvLiL bqJnFYbz5v/rTBO05jub7jiyK47S9UVJdZotsUDH318PT1LYSLLd6ng1rwWATp+xIGKx3s/NbU2LU WR0KNc1w==;
Subject: Re: Clang is using the wrong memory model
To: cygwin AT cygwin DOT com
References: <732305300 DOT 268416262 DOT 1565603129541 DOT JavaMail DOT root AT zimbra54-e10 DOT priv DOT proxad DOT net> <024ce5b5-6024-b371-e382-5eef5d1d4a90 AT agner DOT org> <075477ca-e031-39ea-3dfe-bc8bc2fc8fa6 AT agner DOT org> <20190816082603 DOT GR11632 AT calimero DOT vinschen DOT de> <8d594f4d-17e0-447a-8b18-1ed620815e0f AT agner DOT org> <20190816095200 DOT GU11632 AT calimero DOT vinschen DOT de> <2024f551-66e6-47c5-a0b6-2dfa735a690d AT agner DOT org> <20190816111133 DOT GV11632 AT calimero DOT vinschen DOT de> <6008d30a-cd6c-5449-2933-a041e804856c AT agner DOT org> <20190817081605 DOT GX11632 AT calimero DOT vinschen DOT de>
From: Agner Fog <agner AT agner DOT org>
Message-ID: <17081147-f46a-51bf-65e9-3df7748b1780@agner.org>
Date: Sun, 18 Aug 2019 08:04:09 +0200
User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0
MIME-Version: 1.0
In-Reply-To: <20190817081605.GX11632@calimero.vinschen.de>
X-IsSubscribed: yes

Thanks a lot for your help in clarifying this.

When I complained here about the wasteful 64-bit addresses you said that 
it was an LLVM issue. When I complained to LLVM they said it was a 
Cygwin issue, and that you were using the wrong memory model.

All this confusion is due to a terrible lack of documentation of everything.
I had to do a lot of reverse engineering to figure out what is 
happening. What I have found out so far is listed below. Much of this is 
undocumented. Obviously, I would like to know if any or this is wrong or 
if specific documentation is available other than the SysV ABI and 
Windows ABI:

* Cygwin is using its own loader which is different from the Windows 
loader.
* The Cygwin loader emulates the behavior of Linux shared objects. This 
includes the ability to directly access a variable inside a DLL
* Access to a variable in a different DLL requires a 64-bit address. 
This is obtained by using the medium memory model with a gcc or Clang 
compiler.
* The small memory model works differently on different targets. A 
-mcmodel=small with a Linux target puts everything below 2GB addresses. 
32-bit absolute addresses are allowed. -mcmodel=small with a Windows or 
Mac target allows addresses above 2GB, but limits the distance between 
code and data in the same executable to 2GB. 32-bit absolute addresses 
are not allowed. 32-bit relative addresses are used instead.
* The memory models work differently in gcc an Clang. Gcc with a medium 
or large memory model is using 64-bit address tables to access a 
variable in a different C/CPP file. Clang with a medium or large memory 
model is using 64-bit addresses not only for external variables, but 
also for local static data. This includes floating point constants, 
string constants, array initializers, jump tables, global variables, and 
more.
* Cygwin uses a medium memory model by default. The medium memory model 
is necessary only for a program that makes direct access to a variable 
in a different DLL. The medium memory model is wasteful, and more so 
with Clang than with gcc.

Now I am speculating what we can do to avoid the wasteful 64-bit 
address-load instructions to improve the performance of Cygwin programs.

We can improve performance by using the small memory model when 
possible. The medium memory model is needed only for programs that link 
to a variable in a different DLL. The DLL that contains the link target 
does not need the medium memory model.

Direct access to a variable in a different DLL is considered bad 
programming practice by modern standards. This should occur only in old 
Linux code.

A link to a variable in a different DLL may be replaced by function 
calls (this is done with errno). In some cases, static linking can be an 
efficient alternative.

It would be helpful if the Cygwin loader could print the name of the 
offending variable when relocation fails with the small memory model. 
This could help programmers remove any obstacles to using the more 
efficient small memory model.


Agner


On 17/08/2019 10.16, Corinna Vinschen wrote:
> Oe Aug 17 07:31, Agner Fog wrote:
>>> So errno was a bad example but you can try accessing e.g. __ctype_ptr__,
>>> __progname, optarg, h_errno, or use FE_DFL_ENV from another DLL, just
>>> for kicks.
>> __ctype_ptr__ is a function
>>
>> h_errno works like errno with an imported function
>>
>> FE_DFL_ENV is a macro
>>
>> __progname and optarg are local variables to each exe or dll
> That would contradict what, e.g., __progname is for.  Here's a test:
>
> $ cat > dll.c <<EOF
> #include <stdio.h>
>
> extern char *__progname;
>
> void
> printprog ()
> {
>    printf ("progname: %s\n", __progname);
> }
> EOF
> $ cat > main.c <<EOF
> extern void printprog();
>
> int
> main ()
> {
>    printprog ();
> }
> EOF
> $ uname -a
> CYGWIN_NT-10.0 vmbert10 3.1.0(0.340/5/3) 2019-08-16 14:36 x86_64 Cygwin
>
> Lets try the medium model first:
>
>    $ gcc -g -shared -mcmodel=medium -o dll.dll dll.c
>    $ gcc -g -mcmodel=medium -o main main.c dll.dll
>    $ ./main
>    progname: main
>
> Now let's try the small model:
>
>    $ gcc -g -shared -mcmodel=small -o dll.dll dll.c
>    $ gcc -g -mcmodel=small -o main main.c dll.dll
>    $ ./main
>    Cygwin runtime failure: /home/corinna/main.exe: Invalid relocation.  Offset
>    0xfffffffd80348989 at address 0x40000103b doesn't fit into 32 bits
>
> Now let's try without explicit mcmodel on the CLI:
>
>    $ gcc -g -shared -o dll.dll dll.c
>    $ gcc -g -o main main.c dll.dll
>    $ ./main
>    progname: main
>
>> gcc is using the small memory model by default in Cygwin64, and it works.
> No, it's not, see above.
>
>> clang is using the small memory by default when cross-compiling for a Cygwin64 target from Linux, and it works.
> ...in *your* example code.
>
>
> Corinna
>

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019