Mail Archives: cygwin/2009/11/06/09:13:14

From: "Cooper, Karl (US SSA)" <karl DOT cooper AT baesystems DOT com>
To: "cygwin AT cygwin DOT com" <cygwin AT cygwin DOT com>
Date: Fri, 6 Nov 2009 09:12:54 -0500
Subject: RE: [1.7] BUG - GREP slows to a crawl with large number of matches on a single file
In-Reply-To: <20091106135152.GK26344@calimero.vinschen.de>

Corinna Vinschen wrote:
> Or try LANG=C.ASCII since LANG=C will still return UTF-8 as charset
> when calling nl_langinfo(CHARSET).

Yes, this solves it:

$ time LC_ALL=C.ASCII grep dog testfile | wc
 100000  900000 4500000

real    0m0.359s
user    0m0.279s
sys     0m0.232s
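
For anyone who wants to try the comparison, here is a minimal sketch of a reproduction. The contents of testfile were not posted in this message, so the sentence used below is only an assumption chosen to match the wc counts above (100000 lines, 9 words and 45 bytes per line). The variable names are made up for illustration. On systems that do not know the C.ASCII locale name, grep will most likely just fall back to the plain C locale.

```shell
# Assumed test data: 100000 identical lines, each containing "dog".
testfile=$(mktemp)
yes 'The quick brown fox jumps over the lazy dog.' | head -n 100000 > "$testfile"

# Run under whatever locale the environment provides
# (the slow case in this thread was a UTF-8 charset).
matches_default=$(grep dog "$testfile" | wc -l)

# Run under the single-byte locale suggested by Corinna Vinschen.
matches_ascii=$(LC_ALL=C.ASCII grep dog "$testfile" | wc -l)

echo "default locale: $matches_default matching lines"
echo "C.ASCII locale: $matches_ascii matching lines"

rm -f "$testfile"
```

Both runs should report the same number of matching lines; only the run time should differ. Prefixing each grep pipeline with `time`, as in the transcript above, shows the difference.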
