X-Recipient: archive-cygwin@delorie.com
DomainKey-Signature: a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id
	:list-unsubscribe:list-subscribe:list-archive:list-post
	:list-help:sender:subject:to:references:from:message-id:date
	:mime-version:in-reply-to:content-type
	:content-transfer-encoding; q=dns; s=default; b=RXOSU5UXT1yLqPC0
	hpK2Au5vcb8CITzqz/ey+m6ULb9gN0fAvgj3pIh5dzusVFmc6KtcK+kmSEXjTtLt
	mmcyuxLS8n4VeLNxYV+rU57tLi/llpga0Z257Z0DQQmVU+YDbO31Or0UyQFq4Qgi
	Fa7PixoyAUBizZx1O4aGobn5TZM=
DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=sourceware.org; h=list-id
	:list-unsubscribe:list-subscribe:list-archive:list-post
	:list-help:sender:subject:to:references:from:message-id:date
	:mime-version:in-reply-to:content-type
	:content-transfer-encoding; s=default; bh=TqfXht1clVXkwOvpuVXuRO
	7Ip4g=; b=hovkJ1QxY+rM2tF81SOPKKfVqyXQLok8NHB3Q1TFSGcnFy4mHad+Vg
	FeFUXDNbSXxFpd5DKPT8fpXi5sQOaCKvnpx9//juuRJxhO4VZsGLSUt+355cxqf+
	ZrbzHCQBNY6tJt1z52uk7ljZDehCSaM1czyPBCQA6I4h3w4/wrPig=
Mailing-List: contact cygwin-help@cygwin.com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Subscribe: <mailto:cygwin-subscribe@cygwin.com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin@cygwin.com>
List-Help: <mailto:cygwin-help@cygwin.com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner@cygwin.com
Mail-Followup-To: cygwin@cygwin.com
Delivered-To: mailing list cygwin@cygwin.com
Authentication-Results: sourceware.org; auth=none
X-Virus-Found: No
X-Spam-SWARE-Status: No, score=0.2 required=5.0 tests=AWL,BAYES_50,RP_MATCHES_RCVD,SPF_PASS autolearn=ham version=3.3.2 spammy=kylheku, kaz, Kylheku, Kaz
X-HELO: mail.hepe.com
Subject: Re: Lengthy "xmlto" build step in Cygwin.
To: cygwin@cygwin.com
References: <bfbac5abef5b3937edd1f9814c5f4bc8@mail.kylheku.com>
From: Aaron Digulla <digulla@hepe.com>
Message-ID: <57758E8A.6030701@hepe.com>
Date: Thu, 30 Jun 2016 23:26:34 +0200
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Thunderbird/38.7.0
MIME-Version: 1.0
In-Reply-To: <bfbac5abef5b3937edd1f9814c5f4bc8@mail.kylheku.com>
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit
X-IsSubscribed: yes

On 30.06.2016 19:19, Kaz Kylheku wrote:
> Hi All,
>
> What's this doing?
>
>   Writing faq.html for article(faq)
>   sed -i 's;<a name="id[mp][0-9]*"></a>;;g' faq/faq.html
>   ../../.././winsup/doc/bodysnatcher.pl faq/faq.html
>   xmlto --skip-validation --with-dblatex pdf -o cygwin-ug-net/ -m
> ../../.././winsup/doc/fo.xsl ../../..  /./winsup/doc/cygwin-ug-net.xml
>   Build the book set list...
>
>
>   Build the listings...
>
> It sat for a long time in "book set list ..." with the CPU idle.
>
> Same thing for "Build the listings...". It's just sitting there and
> sitting there.
>
> Task Manager shows an xsltproc.exe that isn't doing anything.
>
> Is this making network calls to an unresponsive server in Cameroon or
> something?

No, in the US. Most likely something from w3c.org or a similar standard
site.

It's an old problem with XML files which reference external sources like
XML Schemas or DTDs. The URL points to a standard place, so the tools
try to download the file. Since there is no accepted standard for
caching such files, each tool does it for itself or not at all. The W3C
doesn't seem to be swimming in money, so their site is slow at times. I
had 2-3 minute download times for some XML schema just two weeks ago.

I even have the problem with Eclipse which has built-in caches for XML
but for some reason, it's still trying to download some files when it's
validating XML files in projects.

Solutions:

- Force the tool to download via a proxy and make sure it keeps those files.
- Try to enable caching in the tool.
- Disable network access for XML tools. See your OS and tool options.
- Enable logging. If you have a firewall, tell it to deny access to the
sites which show up in the log. Make sure the firewall rejects the
access instead of simply swalling the request.
- Download the referenced files yourself and replace the URLs in the
sources with relative or local ones.
- Clench your teeth and wait.

Regards,

-- 
Aaron "Optimizer" Digulla a.k.a. Philmann Dark
"It's not the universe that's limited, it's our imagination.
Follow me and I'll show you something beyond the limits." 
http://blog.pdark.de/


--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

