delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2016/11/28/09:34:23

X-Recipient: archive-cygwin AT delorie DOT com
DomainKey-Signature: a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id
:list-unsubscribe:list-subscribe:list-archive:list-post
:list-help:sender:from:subject:reply-to:references:to:message-id
:date:mime-version:in-reply-to:content-type
:content-transfer-encoding; q=dns; s=default; b=g1MLLYNv0MQZ+8tY
Kpk719DsnrraVORc7GR+jouXD1OcU7Z/0MJZoZa/NVxFiJMIyGyvaGEUN0Mw/r7U
0CMoQ+Vc2jWuPt3S/N1wEbkjopt+RkuEjzzuPUfC44kUKcKz9W79QQ5K9IjWrR5e
QPxiCB1ZDuMXGWvzgjhi205eL48=
DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=sourceware.org; h=list-id
:list-unsubscribe:list-subscribe:list-archive:list-post
:list-help:sender:from:subject:reply-to:references:to:message-id
:date:mime-version:in-reply-to:content-type
:content-transfer-encoding; s=default; bh=F0S9yg758AypuDFJdQIFcc
WJCUc=; b=mOavhPJke2ZaXwu33MinqRQROIVNxjKJ4+IMl/BSX8E2pcINI5bFB1
nRvaSGS87YIbkE6sHG7vydF0il9be4gkNZj5CzkxOHIw3wR1gasq1fRbCM4dVV2A
ISD44twt6UTIiOFSFxnguzUbrdx46I+2hlT4KQRojf4vA0uCE7yIc=
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com
Authentication-Results: sourceware.org; auth=none
X-Virus-Found: No
X-Spam-SWARE-Status: No, score=1.1 required=5.0 tests=AWL,BAYES_50,KAM_LAZY_DOMAIN_SECURITY,RCVD_IN_DNSWL_LOW autolearn=no version=3.3.2 spammy=8:15, 8:-1, 8:01, 8:17
X-HELO: smtp-out-so.shaw.ca
X-Authority-Analysis: v=2.2 cv=cNuQihWN c=1 sm=1 tr=0 a=WqCeCkldcEjBO3QZneQsCg==:117 a=WqCeCkldcEjBO3QZneQsCg==:17 a=IkcTkHD0fZMA:10 a=wn2tWnfOKRrLvmu7J_sA:9 a=1-885NVyOodx8ZRR:21 a=Ch_wTLCTNhlbGX0g:21 a=QEXdDO2ut3YA:10
From: Brian Inglis <Brian DOT Inglis AT SystematicSw DOT ab DOT ca>
Subject: Re: Cygwin: 2.6.0, rsync 3.1.2, "file has vanished"
Reply-To: Brian DOT Inglis AT SystematicSw DOT ab DOT ca
References: <cba315e68e3e97d97e54c33070fefd9e AT 192 DOT 168 DOT 11 DOT 198>
To: cygwin AT cygwin DOT com
Message-ID: <68b37996-00bc-023b-b5ad-b3209cd2f0b7@SystematicSw.ab.ca>
Date: Mon, 28 Nov 2016 07:33:50 -0700
User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:45.0) Gecko/20100101 Thunderbird/45.5.0
MIME-Version: 1.0
In-Reply-To: <cba315e68e3e97d97e54c33070fefd9e@192.168.11.198>
X-CMAE-Envelope: MS4wfH4ZLCWeEswtOJ87M4w0hjQxMkRXXMUIquQYc9NJ/uIrFq09k9wr1pl/KZ7FW6dXJpbcVGux1A688eGn8EVDUTU2LAPs2VEhvlwFEYnUihJWpztamUdw tVVOfiY9oxCctjYB1lbelMw3Q0Jg7NHs/agAUC6oy+RBSzH4PGGzSWHhhIs3deaxCMju969bzTyHIw==
X-IsSubscribed: yes

On 2016-11-28 00:07, Shaun Maher wrote:
> I'm trying to use rsync on Windows Server 2012R2 to sync files to a
> *nix system (Ubuntu 14.04 or FreeNAS 9.10). The file set I'm trying
> to sync contains files with unusual characters in the names. Most
> files sync without issue but those files with unusual characters do
> not sync and rsync logs a message like the following:
>   file has vanished: "BackUps(old)/Backup Pre-Rolled Payroll
> 2015"16-MBTT2016.zip" (in VSSG)
>   file has vanished: "Corrupted files/Payment Summaries/Payment
> Summaries 2015"2016" (in VSSG)
>   file has vanished: "Restore/MBTT2017"2016-07-12 MBTT2016.zip" (in
> VSSG)
> The " character mid file name is where the unusual character is. The
> character in question in all my test cases is (on Windows) a dot
> vertically centered. I've only a minimal understanding of character
> encoding but it seems to me that it is a 0x2D followed by a 0x31
> which makes it a type of dash I guess.
> I've given this a good Googling but haven't been able to work it out.
> I've tried every combination of --iconv and "charset = " that has
> been suggested.
> I'm a bit stuck on how to move forward from here. Any help or
> suggestions would be greatly appreciated. I can provide an archive
> with some example files is it will help.

utf-16 0x2d31 would be "â´±" Tifinagh letter Yab utf-8 0xe2b4b1 which 
seems unlikely 
"⸱" word separator middle dot would be utf-16 0x2e31 utf-8 0xe2b8b1
you can type this in any console window as 
	<alt...><num-+><2><e><3><1><...alt>
meaning you hold alt down while typing the other characters.

Your file names should look like: 
"BackUps(old)/Backup Pre-Rolled Payroll 2015⸱16-MBTT2016.zip"
"Corrupted files/Payment Summaries/Payment Summaries 2015⸱2016"
"Restore/MBTT2017⸱2016-07-12 MBTT2016.zip"

If you're not using a utf-8 character set, Cygwin may not be encoding 
this utf-16 character, or using the correct utf-8 equivalent for Linux, 
so it can be reversibly decoded.

I don't know if rsync --iconv utf-16,utf-8 can bypass Cygwin to get the 
original names, or if it will internally encode, then rsync decode and 
encode, and produce a mess.

You might want to get the creator to change the punctuation to just 
a normal minus and not get too creative, or skip those directories.

-- 
Take care. Thanks, Brian Inglis, Calgary, Alberta, Canada


--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019