delorie.com/archives/browse.cgi | search |
X-Spam-Check-By: | sourceware.org |
From: | "Dave Korn" <dave DOT korn AT artimi DOT com> |
To: | <cygwin AT cygwin DOT com> |
Subject: | RE: grep weirdness - matching space character |
Date: | Tue, 26 Sep 2006 11:39:27 +0100 |
Message-ID: | <078e01c6e158$0c036ac0$a501a8c0@CAM.ARTIMI.COM> |
MIME-Version: | 1.0 |
X-Mailer: | Microsoft Office Outlook 11 |
In-Reply-To: | <11329.194.203.201.98.1159265802.squirrel@www.yankeeboysoftware.com> |
Mailing-List: | contact cygwin-help AT cygwin DOT com; run by ezmlm |
List-Unsubscribe: | <mailto:cygwin-unsubscribe-archive-cygwin=delorie DOT com AT cygwin DOT com> |
List-Subscribe: | <mailto:cygwin-subscribe AT cygwin DOT com> |
List-Archive: | <http://sourceware.org/ml/cygwin/> |
List-Post: | <mailto:cygwin AT cygwin DOT com> |
List-Help: | <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs> |
Sender: | cygwin-owner AT cygwin DOT com |
Mail-Followup-To: | cygwin AT cygwin DOT com |
Delivered-To: | mailing list cygwin AT cygwin DOT com |
On 26 September 2006 11:17, The Blog User wrote: > I am really struggling to understand what I am doing wrong here. > > I have a log file with a line that looks like this: > > ++ 04:51:32 All 94 items succeeded > > The binary data for that line is this: > > 2B 2B 20 30 34 3A 35 31 3A 33 32 20 41 6C 6C 20 39 34 20 69 74 65 6D 73 20 > 73 75 63 63 65 65 64 65 64 0A > > using grep and tail (versions below) I am failing to match that line > > $ tail -1 /path/to/file/the.log | grep -a "All \d*.items succeeded" There's no such thing as \d. > however if I insert 3 (why three?) dots (or a .*) between 'All' and '\d' I > get a match, what is happening ? The dots are eating the '94' as well as the space. > This seems wrong to me, since - from my knowledge of regex's - that is > saying there must be three characters between the 'All' and the first > digit, yet I can see there is only a single space character. Escaping a d just matches a literal 'd'. So the expression '\d*' matches zero or more of the letter d. If you use the three dots to eat the two digits as well as the space, the optional any-number-of-d's is matched by the zero d's following, and then the trailing 'items succeeded' matches. Whereas with only the one dot, the dot matches the space, then there's zero-optional-'d's, then the '9' fails to match against '.items succeeded'. cheers, DaveK -- Can't think of a witty .sigline today.... -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
webmaster | delorie software privacy |
Copyright © 2019 by DJ Delorie | Updated Jul 2019 |