delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2006/09/26/06:39:43

X-Spam-Check-By: sourceware.org
From: "Dave Korn" <dave DOT korn AT artimi DOT com>
To: <cygwin AT cygwin DOT com>
Subject: RE: grep weirdness - matching space character
Date: Tue, 26 Sep 2006 11:39:27 +0100
Message-ID: <078e01c6e158$0c036ac0$a501a8c0@CAM.ARTIMI.COM>
MIME-Version: 1.0
X-Mailer: Microsoft Office Outlook 11
In-Reply-To: <11329.194.203.201.98.1159265802.squirrel@www.yankeeboysoftware.com>
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Unsubscribe: <mailto:cygwin-unsubscribe-archive-cygwin=delorie DOT com AT cygwin DOT com>
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com

On 26 September 2006 11:17, The Blog User wrote:

> I am really struggling to understand what I am doing wrong here.
> 
> I have a log file with a line that looks like this:
> 
> ++ 04:51:32 All 94 items succeeded
> 
> The binary data for that line is this:
> 
> 2B 2B 20 30 34 3A 35 31 3A 33 32 20 41 6C 6C 20 39 34 20 69 74 65 6D 73 20
> 73 75 63 63 65 65 64 65 64 0A
> 
> using grep and tail (versions below) I am failing to match that line
> 
> $ tail -1 /path/to/file/the.log | grep -a "All \d*.items succeeded"

  There's no such thing as \d.
 
> however if I insert 3 (why three?) dots (or a .*) between 'All' and '\d' I
> get a match, what is happening ?

  The dots are eating the '94' as well as the space.

> This seems wrong to me, since - from my knowledge of regex's - that is
> saying there must be three characters between the 'All' and the first
> digit, yet I can see there is only a single space character.

  Escaping a d just matches a literal 'd'.  So the expression '\d*' matches
zero or more of the letter d.  If you use the three dots to eat the two digits
as well as the space, the optional any-number-of-d's is matched by the zero
d's following, and then the trailing 'items succeeded' matches.

  Whereas with only the one dot, the dot matches the space, then there's
zero-optional-'d's, then the '9' fails to match against '.items succeeded'.



    cheers,
      DaveK
-- 
Can't think of a witty .sigline today....


--
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple
Problem reports:       http://cygwin.com/problems.html
Documentation:         http://cygwin.com/docs.html
FAQ:                   http://cygwin.com/faq/

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019