delorie.com/archives/browse.cgi | search |
X-Recipient: | archive-cygwin AT delorie DOT com |
DomainKey-Signature: | a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id |
:list-unsubscribe:list-subscribe:list-archive:list-post | |
:list-help:sender:subject:to:references:from:message-id:date | |
:mime-version:in-reply-to:content-type | |
:content-transfer-encoding; q=dns; s=default; b=XFWi8r6Q8JETxCTK | |
iwIrXyoWGPCJojCleSzZ8Mt+KQhDBBCPWs7mHRS5sMfN7wh1z68RilBaUz6LIL0q | |
FxtODiekjytuOeKBGuznJL+DK8H5UWoqz9cjvSJBOGLBVCNKN2Zd33+Ai1GUEJ+g | |
yzcW22yJCkiUv48qcV9mCPcwE14= | |
DKIM-Signature: | v=1; a=rsa-sha1; c=relaxed; d=sourceware.org; h=list-id |
:list-unsubscribe:list-subscribe:list-archive:list-post | |
:list-help:sender:subject:to:references:from:message-id:date | |
:mime-version:in-reply-to:content-type | |
:content-transfer-encoding; s=default; bh=2c3v8WTfF6hsDnrhk+ePzz | |
ZV8rE=; b=UoBMkkOylFlm12BqIvBrEITdW1BCyXTdX0MMIZxiwt2Ak1iKJFL8/z | |
u8ZdmJXPQommZIvOXlhGIETk/e6iPfq8afXdySmCXDfhVqmD7JWvWtdVkEURl7eU | |
/SezqIdmJxQqP9t3dxlV/LVTBvdagd82rfMr+Kc3ECYDIm4vE/SPo= | |
Mailing-List: | contact cygwin-help AT cygwin DOT com; run by ezmlm |
List-Id: | <cygwin.cygwin.com> |
List-Subscribe: | <mailto:cygwin-subscribe AT cygwin DOT com> |
List-Archive: | <http://sourceware.org/ml/cygwin/> |
List-Post: | <mailto:cygwin AT cygwin DOT com> |
List-Help: | <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs> |
Sender: | cygwin-owner AT cygwin DOT com |
Mail-Followup-To: | cygwin AT cygwin DOT com |
Delivered-To: | mailing list cygwin AT cygwin DOT com |
Authentication-Results: | sourceware.org; auth=none |
X-Spam-SWARE-Status: | No, score=-1.4 required=5.0 tests=AWL,BAYES_00,RCVD_IN_DNSWL_NONE,SPF_HELO_PASS,SPF_PASS autolearn=ham version=3.3.1 spammy=HX-HELO:sk:gateway, offending, H*RU:sk:gateway, initializers |
X-HELO: | gateway30.websitewelcome.com |
DKIM-Signature: | v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=agner.org; s=default; h=Content-Transfer-Encoding:Content-Type:In-Reply-To:MIME-Version :Date:Message-ID:From:References:To:Subject:Sender:Reply-To:Cc:Content-ID: Content-Description:Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc :Resent-Message-ID:List-Id:List-Help:List-Unsubscribe:List-Subscribe: List-Post:List-Owner:List-Archive; bh=dJjm+08k+e/fuUDYFwPZW0QRzofJto8hKeKAYKuBuFw=; b=JxIC5dOjxUxSJDh/OiMN9upOQd 5iuBsKuJW1YIe2qXEveQiwiCNgR6ZgokYDw2xWAKEGkaf6kikMqCe6v5zrtT7t+OAHB1Uygx3W6lt R8SpmTwMhCVTYKffpsrYyLiGloZajieKCJiI59VYAg+FVdYOoRZnywgFwr8PTTR/8U+xYWqBhGoxe yPieCw4c3IpriAsUbr0mTBFe8/Wj7kXbpCudbFSHXBkhTbFtvrk+juyZ8Sd0JcE8XUX5ze6/kvLiL bqJnFYbz5v/rTBO05jub7jiyK47S9UVJdZotsUDH318PT1LYSLLd6ng1rwWATp+xIGKx3s/NbU2LU WR0KNc1w==; |
Subject: | Re: Clang is using the wrong memory model |
To: | cygwin AT cygwin DOT com |
References: | <732305300 DOT 268416262 DOT 1565603129541 DOT JavaMail DOT root AT zimbra54-e10 DOT priv DOT proxad DOT net> <024ce5b5-6024-b371-e382-5eef5d1d4a90 AT agner DOT org> <075477ca-e031-39ea-3dfe-bc8bc2fc8fa6 AT agner DOT org> <20190816082603 DOT GR11632 AT calimero DOT vinschen DOT de> <8d594f4d-17e0-447a-8b18-1ed620815e0f AT agner DOT org> <20190816095200 DOT GU11632 AT calimero DOT vinschen DOT de> <2024f551-66e6-47c5-a0b6-2dfa735a690d AT agner DOT org> <20190816111133 DOT GV11632 AT calimero DOT vinschen DOT de> <6008d30a-cd6c-5449-2933-a041e804856c AT agner DOT org> <20190817081605 DOT GX11632 AT calimero DOT vinschen DOT de> |
From: | Agner Fog <agner AT agner DOT org> |
Message-ID: | <17081147-f46a-51bf-65e9-3df7748b1780@agner.org> |
Date: | Sun, 18 Aug 2019 08:04:09 +0200 |
User-Agent: | Mozilla/5.0 (Windows NT 10.0; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0 |
MIME-Version: | 1.0 |
In-Reply-To: | <20190817081605.GX11632@calimero.vinschen.de> |
X-IsSubscribed: | yes |
Thanks a lot for your help in clarifying this. When I complained here about the wasteful 64-bit addresses you said that it was an LLVM issue. When I complained to LLVM they said it was a Cygwin issue, and that you were using the wrong memory model. All this confusion is due to a terrible lack of documentation of everything. I had to do a lot of reverse engineering to figure out what is happening. What I have found out so far is listed below. Much of this is undocumented. Obviously, I would like to know if any or this is wrong or if specific documentation is available other than the SysV ABI and Windows ABI: * Cygwin is using its own loader which is different from the Windows loader. * The Cygwin loader emulates the behavior of Linux shared objects. This includes the ability to directly access a variable inside a DLL * Access to a variable in a different DLL requires a 64-bit address. This is obtained by using the medium memory model with a gcc or Clang compiler. * The small memory model works differently on different targets. A -mcmodel=small with a Linux target puts everything below 2GB addresses. 32-bit absolute addresses are allowed. -mcmodel=small with a Windows or Mac target allows addresses above 2GB, but limits the distance between code and data in the same executable to 2GB. 32-bit absolute addresses are not allowed. 32-bit relative addresses are used instead. * The memory models work differently in gcc an Clang. Gcc with a medium or large memory model is using 64-bit address tables to access a variable in a different C/CPP file. Clang with a medium or large memory model is using 64-bit addresses not only for external variables, but also for local static data. This includes floating point constants, string constants, array initializers, jump tables, global variables, and more. * Cygwin uses a medium memory model by default. The medium memory model is necessary only for a program that makes direct access to a variable in a different DLL. The medium memory model is wasteful, and more so with Clang than with gcc. Now I am speculating what we can do to avoid the wasteful 64-bit address-load instructions to improve the performance of Cygwin programs. We can improve performance by using the small memory model when possible. The medium memory model is needed only for programs that link to a variable in a different DLL. The DLL that contains the link target does not need the medium memory model. Direct access to a variable in a different DLL is considered bad programming practice by modern standards. This should occur only in old Linux code. A link to a variable in a different DLL may be replaced by function calls (this is done with errno). In some cases, static linking can be an efficient alternative. It would be helpful if the Cygwin loader could print the name of the offending variable when relocation fails with the small memory model. This could help programmers remove any obstacles to using the more efficient small memory model. Agner On 17/08/2019 10.16, Corinna Vinschen wrote: > Oe Aug 17 07:31, Agner Fog wrote: >>> So errno was a bad example but you can try accessing e.g. __ctype_ptr__, >>> __progname, optarg, h_errno, or use FE_DFL_ENV from another DLL, just >>> for kicks. >> __ctype_ptr__ is a function >> >> h_errno works like errno with an imported function >> >> FE_DFL_ENV is a macro >> >> __progname and optarg are local variables to each exe or dll > That would contradict what, e.g., __progname is for. Here's a test: > > $ cat > dll.c <<EOF > #include <stdio.h> > > extern char *__progname; > > void > printprog () > { > printf ("progname: %s\n", __progname); > } > EOF > $ cat > main.c <<EOF > extern void printprog(); > > int > main () > { > printprog (); > } > EOF > $ uname -a > CYGWIN_NT-10.0 vmbert10 3.1.0(0.340/5/3) 2019-08-16 14:36 x86_64 Cygwin > > Lets try the medium model first: > > $ gcc -g -shared -mcmodel=medium -o dll.dll dll.c > $ gcc -g -mcmodel=medium -o main main.c dll.dll > $ ./main > progname: main > > Now let's try the small model: > > $ gcc -g -shared -mcmodel=small -o dll.dll dll.c > $ gcc -g -mcmodel=small -o main main.c dll.dll > $ ./main > Cygwin runtime failure: /home/corinna/main.exe: Invalid relocation. Offset > 0xfffffffd80348989 at address 0x40000103b doesn't fit into 32 bits > > Now let's try without explicit mcmodel on the CLI: > > $ gcc -g -shared -o dll.dll dll.c > $ gcc -g -o main main.c dll.dll > $ ./main > progname: main > >> gcc is using the small memory model by default in Cygwin64, and it works. > No, it's not, see above. > >> clang is using the small memory by default when cross-compiling for a Cygwin64 target from Linux, and it works. > ...in *your* example code. > > > Corinna > -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple
webmaster | delorie software privacy |
Copyright © 2019 by DJ Delorie | Updated Jul 2019 |