delorie.com/archives/browse.cgi | search |
X-Recipient: | archive-cygwin AT delorie DOT com |
X-SWARE-Spam-Status: | No, hits=0.4 required=5.0 tests=AWL,BAYES_20,RCVD_NUMERIC_HELO,RP_MATCHES_RCVD,SPF_HELO_PASS |
X-Spam-Check-By: | sourceware.org |
To: | cygwin AT cygwin DOT com |
From: | Oleksandr Gavenko <gavenko AT bifit DOT com DOT ua> |
Subject: | Re: 'cygcheck -f' pattern syntax. |
Date: | Thu, 22 Sep 2011 17:32:38 +0300 |
Lines: | 102 |
Message-ID: | <j5fgu0$2on$1@dough.gmane.org> |
References: | <j5f8io$2uq$1 AT dough DOT gmane DOT org> <4E7B2D8F DOT 409 AT gmail DOT com> |
Mime-Version: | 1.0 |
User-Agent: | Mozilla/5.0 (Windows NT 5.1; rv:6.0.2) Gecko/20110902 Thunderbird/6.0.2 |
In-Reply-To: | <4E7B2D8F.409@gmail.com> |
X-IsSubscribed: | yes |
Mailing-List: | contact cygwin-help AT cygwin DOT com; run by ezmlm |
List-Id: | <cygwin.cygwin.com> |
List-Unsubscribe: | <mailto:cygwin-unsubscribe-archive-cygwin=delorie DOT com AT cygwin DOT com> |
List-Subscribe: | <mailto:cygwin-subscribe AT cygwin DOT com> |
List-Archive: | <http://sourceware.org/ml/cygwin/> |
List-Post: | <mailto:cygwin AT cygwin DOT com> |
List-Help: | <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs> |
Sender: | cygwin-owner AT cygwin DOT com |
Mail-Followup-To: | cygwin AT cygwin DOT com |
Delivered-To: | mailing list cygwin AT cygwin DOT com |
Note-from-DJ: | This may be spam |
22.09.2011 15:43, Marco atzeri пишет: > On 9/22/2011 2:10 PM, Oleksandr Gavenko wrote: >> So 'cygcheck -f' does not allow 'glob' and 'regex'. >> >> I write simple script that allow use regex: >> >> #!/bin/sh >> >> regex=$(echo "$1" | sed -e 's|\\|&&|g' -e 's|=|\\=|g') >> >> for file in /etc/setup/*.lst.gz; do >> name=${file#/etc/setup/} >> name=${name%.lst.gz} >> gzip -d -c $file | sed -n "\=$regex={s=^=$name: /=;p;}" >> done >> >> But this script is extremely slow: >> >> $ time ./cygsearch.sh 'bin/emacs' >> emacs: /usr/bin/emacs-nox.exe >> emacs: /usr/bin/emacs.ico >> emacs: /usr/bin/emacsclient.exe >> emacs-X11: /usr/bin/emacs-X11.exe >> >> real 0m38.797s >> user 0m44.620s >> sys 0m25.574s >> > > much faster > > $ time cp /etc/setup/*.gz . > > real 0m1.450s > user 0m0.061s > sys 0m1.202s > > $ time gunzip *.gz > > real 0m1.937s > user 0m0.170s > sys 0m1.764s > > $ time grep -H "stdio.h" *.lst > cygwin.lst:usr/include/stdio.h > cygwin.lst:usr/include/sys/stdio.h > > real 0m0.816s > user 0m0.046s > sys 0m0.781s > I try eliminate sed call by using internal bash capabilities but unsuccessfully: for file in /etc/setup/*.lst.gz; do name=${file#/etc/setup/} name=${name%.lst.gz} gzip -d -c $file | while read line; do if [[ /"$line" =~ $regex ]]; then echo $name: "$line" fi done done $ time ./cygsearch 'bin/emacs' time ./cygsearch 'bin/emacs' emacs: usr/bin/emacs-nox.exe emacs: usr/bin/emacs.ico real 0m57.955s user 0m42.105s sys 0m34.710s Next I try eliminate spawning any process at all with success: #!/usr/bin/python import sys import glob import re import gzip r = re.compile(sys.argv[1]) n = re.compile(r'/etc/setup/(.*)\.lst\.gz') for f in glob.glob(r'/etc/setup/*.lst.gz'): name = n.match(f).group(1) plain = gzip.open(f, "rb") for line in plain: line = line.rstrip('\n\r') if r.search(line) != None: print '%s: %s' % (name, line) plain.close() $ time ./cygsearch.py 'bin/emacs' emacs-X11: usr/bin/emacs-X11.exe emacs: usr/bin/emacs-nox.exe real 0m3.152s user 0m2.421s sys 0m0.171s -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple
webmaster | delorie software privacy |
Copyright © 2019 by DJ Delorie | Updated Jul 2019 |