X-Recipient: archive-cygwin AT delorie DOT com DomainKey-Signature: a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id :list-unsubscribe:list-subscribe:list-archive:list-post :list-help:sender:message-id:date:from:mime-version:to:subject :references:in-reply-to:content-type; q=dns; s=default; b=eyLn2t x8J+SGCBtHuZlYCmJ8hVYn87CMgfEBdRHGiyP0l/dQFseQOzWm/hbnud3AOEUnhc qY4GbHwZiivTsOdgVGkSZq1dZ6J2pieRHKNePOoo0UBsF6SfpHIkp6+jvYSUqH0k 41VBQmGhyVrDQZdLQYZjvQX4JRc61uvpA3B/Q= DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=sourceware.org; h=list-id :list-unsubscribe:list-subscribe:list-archive:list-post :list-help:sender:message-id:date:from:mime-version:to:subject :references:in-reply-to:content-type; s=default; bh=scDhhnbez4Zc MrnBlxWQYdAot9Q=; b=Kpb6AZILkvaVziQ/3TUiBcynomQmgOZv0lslCxh8vFVz 81bdgNAe9CPsWe9g+g44dMYM4h1VMjdV773OcDedy3VGxHAIGYDTXaTNcn0WboL2 jM29dWjlMrnltr5LJCI6qB3cxSkymlc0QIjgIoFIXuo1gSnITGTLQCVq7vQNXms= Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=-0.5 required=5.0 tests=AWL,BAYES_00,FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,SPF_PASS autolearn=ham version=3.3.2 X-HELO: mail-wi0-f174.google.com X-Received: by 10.194.10.72 with SMTP id g8mr9671680wjb.28.1431619405248; Thu, 14 May 2015 09:03:25 -0700 (PDT) Message-ID: <5554C74A.7070901@gmail.com> Date: Thu, 14 May 2015 18:03:22 +0200 From: =?UTF-8?B?VsOhY2xhdiBIYWlzbWFu?= User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.6.0 MIME-Version: 1.0 To: cygwin AT cygwin DOT com Subject: Re: Grepping Unicode files? References: <3C280897-291A-4A8C-8C3F-46D1D9BEFCFE AT solidrocksystems DOT com> In-Reply-To: <3C280897-291A-4A8C-8C3F-46D1D9BEFCFE@solidrocksystems.com> Content-Type: multipart/signed; micalg=pgp-sha512; protocol="application/pgp-signature"; boundary="45tNaDDEsb8pMXNET3jde0djK8tMGnff9" X-IsSubscribed: yes --45tNaDDEsb8pMXNET3jde0djK8tMGnff9 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable On 14.5.2015 17:42, Vince Rice wrote: > uname says "CYGWIN_NT-6.1 machinename 1.7.35(0.287/5/3) 2015-03-04 > 12:07 i686 Cygwin=E2=80=9D. I=E2=80=99m running grep 2.21.2, which cygche= ck -c says > is OK. >=20 > Does Cygwin=E2=80=99s grep support Unicode files? The output from a SQL > Server SQL Agent job is a Unicode file, i.e. if you look at it in a > hex editor every other character is 00 because each character is > taking up two bytes. The filename itself is fine, it=E2=80=99s the conten= ts > that is Unicode. I can=E2=80=99t get grep to work on it, either with or > without -a. That sounds like UTF-16. Have you tried funneling it through `iconv` first? >=20 > This may not be a Cygwin-specific question, but I haven=E2=80=99t been ab= le > to find anything after several Google searches, including the > archives, and neither --help nor the man page for grep references > Unicode. >=20 > By default I have neither LC_ALL nor LC_COLLATE set. >=20 > A pointer to a better search or a website that explains this would be > great, or if it can=E2=80=99t currently be done, that=E2=80=99s OK, too. >=20 --=20 VH --45tNaDDEsb8pMXNET3jde0djK8tMGnff9 Content-Type: application/pgp-signature; name="signature.asc" Content-Description: OpenPGP digital signature Content-Disposition: attachment; filename="signature.asc" -----BEGIN PGP SIGNATURE----- Version: GnuPG v2 iF0EAREKAAYFAlVUx0oACgkQonnuNA9W3VLK0QD+Krtz0TPcvwIj/T6aMjOYjiO/ XPp0GJ/NtaWoKcQFzCMA+IJFVGyKbLXtQqx6mPuobKabIPXv7NKLsSHeCeiLawo= =pFY2 -----END PGP SIGNATURE----- --45tNaDDEsb8pMXNET3jde0djK8tMGnff9--