delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2001/09/16/18:21:22

Mailing-List: contact cygwin-help AT sourceware DOT cygnus DOT com; run by ezmlm
List-Subscribe: <mailto:cygwin-subscribe AT sources DOT redhat DOT com>
List-Archive: <http://sources.redhat.com/ml/cygwin/>
List-Post: <mailto:cygwin AT sources DOT redhat DOT com>
List-Help: <mailto:cygwin-help AT sources DOT redhat DOT com>, <http://sources.redhat.com/ml/#faqs>
Sender: cygwin-owner AT sources DOT redhat DOT com
Delivered-To: mailing list cygwin AT sources DOT redhat DOT com
Message-ID: <3BA52566.72FBBB3E@hackdata.com>
Date: Mon, 17 Sep 2001 00:19:18 +0200
From: Hack =?iso-8859-1?Q?Kampbj=F8rn?= <hack AT hackdata DOT com>
X-Mailer: Mozilla 4.77 [en] (Windows NT 5.0; U)
X-Accept-Language: en,da,es,ca
MIME-Version: 1.0
To: "C. Porter Bassett" <porter AT et DOT byu DOT edu>
CC: cygwin AT cygwin DOT com
Subject: Re: wget not behaving correctly
References: <999982012 DOT 13196 DOT ezmlm AT sources DOT redhat DOT com>
X-MIME-Autoconverted: from quoted-printable to 8bit by delorie.com id SAA09039

Sorry about the delay, have been too busy this week 8-(


cygwin-digest-help AT sources DOT redhat DOT com wrote:
> Subject: Re: wget not behaving correctly
> Date: Sat, 8 Sep 2001 14:34:59 -0600
> From: "C. Porter Bassett" <porter AT et DOT byu DOT edu>
> Reply-To: "C. Porter Bassett" <cporter AT byu DOT edu>
> To: <cygwin AT cygwin DOT com>
> 
> ----- Original Message -----
> From: "Hack Kampbjørn" <hack AT hackdata DOT com>
> 
> >Yes this is the '?' in the URL. There can be two problems here depending
> >on which shell you use. First your shell may not send the '?' to wget
> >but directly complain that it cannot expand the globbing character '?'
> >to a filename, I cannot tell if this is the case as you did not include
> >full output of running the wget command.
> 
> OK, here it is:
> 1427:PWORK:~$ wget.exe -rd -A gif,jpg,png,bmp
> http://fan.theonering.net/rolozo/galleries.php?
> mode=2
> DEBUG output created by Wget 1.6 on cygwin32.
> 
> parseurl ("http://fan.theonering.net/rolozo/galleries.php?mode=2") -> host
> fan.theonering.net -> opath rolozo/galleries.php?mode=2 -> dir rolozo ->
> file galleries.php?mode=2 -> ndir rolozo
> newpath: /rolozo/galleries.php?mode=2
> Checking for fan.theonering.net.
> This is the first time I hear about host fan.theonering.net by that name.
> --14:27:54--  http://fan.theonering.net/rolozo/galleries.php?mode=2
>            => `fan.theonering.net/rolozo/galleries.php?mode=2'
> Connecting to fan.theonering.net:80... Created fd 3.
> connected!
> ---request begin---
> GET /rolozo/galleries.php?mode=2 HTTP/1.0
> User-Agent: Wget/1.6
> Host: fan.theonering.net
> Accept: */*
> 
> ---request end---
> HTTP request sent, awaiting response... HTTP/1.1 200 OK
> Date: Sat, 08 Sep 2001 20:35:39 GMT
> Server: Apache/1.3.19 (Unix) PHP/4.0.4pl1 mod_perl/1.25
> X-Powered-By: PHP/4.0.4pl1
> Connection: close
> Content-Type: text/html
> 
> Length: unspecified [text/html]
> fan.theonering.net/rolozo/galleries.php?mode=2: No such file or directory
> Closing fd 3
> 
> Cannot write to `fan.theonering.net/rolozo/galleries.php?mode=2' (No such
> file or directory).

OK, it's the problem of '?' being an illegal character on Windows file
systems

> 
> FINISHED --14:27:54--
> Downloaded: 0 bytes in 0 files
> 1427:PWORK:~$
> 

You can work around it with the --output-document option but then you
loose the recursion. 

What I use this is patch (should apply clean on wget-1.6 too). Yes, it's
a crude hack, and doesn't address all the problems with illegal
characters. And some of the code in wget doesn't expect the file to be
saved with a different name than on the webserver so you loose some
options too (IIRC --convert-links).

I keep forgetting that the wget-patch list isn't archive. And not every
patch is sent to the wget list. Sorry for sending you on a fruitless
search there 8-(

Index: src/url.c
===================================================================
RCS file: /pack/anoncvs/wget/src/url.c,v
retrieving revision 1.21.2.1
diff -u -r1.21.2.1 url.c
--- src/url.c   2000/12/17 19:28:20     1.21.2.1
+++ src/url.c   2001/02/03 15:53:24
@@ -1272,16 +1272,17 @@
          file = nfile;
        }
     }
-  /* DOS-ish file systems don't like `%' signs in them; we change it
-     to `@'.  */
-#ifdef WINDOWS
+  /* Windows file systems don't like `?' signs in them; we change it
+     to `@'. 
+     #### Note: nor are \ / : * " < > | allowed */
+#if defined(WINDOWS) || defined (__CYGWIN__)
   {
     char *p = file;
     for (p = file; *p; p++)
-      if (*p == '%')
+      if (*p == '?')
        *p = '@';
   }
-#endif /* WINDOWS */
+#endif /* WINDOWS or CYGWIN */
 
   /* Check the cases in which the unique extensions are not used:
      1) Clobbering is turned off (-nc).


-- 
Med venlig hilsen / Kind regards

Hack Kampbjørn               hack AT hackdata DOT com
HackLine                     +45 2031 7799

--
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple
Bug reporting:         http://cygwin.com/bugs.html
Documentation:         http://cygwin.com/docs.html
FAQ:                   http://cygwin.com/faq/

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019