X-Recipient: archive-cygwin@delorie.com
X-SWARE-Spam-Status: No, hits=-0.3 required=5.0 	tests=AWL,BAYES_00,SARE_MSGID_LONG40,SPF_PASS
X-Spam-Check-By: sourceware.org
MIME-Version: 1.0
In-Reply-To: <814i7o$49eric@dmzms99802.na.baesystems.com>
References: <20091106135152.GK26344@calimero.vinschen.de> 	 <814i7o$49eric@dmzms99802.na.baesystems.com>
Date: Fri, 6 Nov 2009 15:22:47 -0700
Message-ID: <806a89db0911061422l290ff84u3d58cbbe1d3eface@mail.gmail.com>
Subject: Re: 1.7] BUG - GREP slows to a crawl with large number of matches on  	a single file
From: Jim Reisert AD1C <jjreisert@alum.mit.edu>
To: cygwin@cygwin.com
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable
X-IsSubscribed: yes
Mailing-List: contact cygwin-help@cygwin.com; run by ezmlm
Precedence: bulk
List-Id: <cygwin.cygwin.com>
List-Unsubscribe: <mailto:cygwin-unsubscribe-archive-cygwin=delorie.com@cygwin.com>
List-Subscribe: <mailto:cygwin-subscribe@cygwin.com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin@cygwin.com>
List-Help: <mailto:cygwin-help@cygwin.com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner@cygwin.com
Mail-Followup-To: cygwin@cygwin.com
Delivered-To: mailing list cygwin@cygwin.com

On Fri, Nov 6, 2009 at 7:12 AM, Cooper, Karl (US SSA)
<karl.cooper@baesystems.com> wrote:

> Corinna Vinschen wrote:
>> Or try LANG=3DC.ASCII since LANG=3DC will still return UTF-8 as charset
>> when calling nl_langinfo(CHARSET).
>
> Yes, this solves it:
>
> $ time LC_ALL=3DC.ASCII grep dog testfile | wc
> =A0100000 =A0900000 4500000
>
> real =A0 =A00m0.359s
> user =A0 =A00m0.279s
> sys =A0 =A0 0m0.232s


I just tried this on my system, I routinely grep groups of files
containing 100K lines.  I was *astounded* how fast "grep" is after
setting LC_ALL=3DC.ASCII !

--=20
Jim Reisert AD1C, <jjreisert@alum.mit.edu>, http://www.ad1c.us

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

