| delorie.com/archives/browse.cgi | search |
| X-Spam-Check-By: | sourceware.org |
| From: | "Dave Korn" <dave DOT korn AT artimi DOT com> |
| To: | <cygwin AT cygwin DOT com> |
| Subject: | RE: grep weirdness - matching space character |
| Date: | Tue, 26 Sep 2006 11:39:27 +0100 |
| Message-ID: | <078e01c6e158$0c036ac0$a501a8c0@CAM.ARTIMI.COM> |
| MIME-Version: | 1.0 |
| X-Mailer: | Microsoft Office Outlook 11 |
| In-Reply-To: | <11329.194.203.201.98.1159265802.squirrel@www.yankeeboysoftware.com> |
| Mailing-List: | contact cygwin-help AT cygwin DOT com; run by ezmlm |
| List-Unsubscribe: | <mailto:cygwin-unsubscribe-archive-cygwin=delorie DOT com AT cygwin DOT com> |
| List-Subscribe: | <mailto:cygwin-subscribe AT cygwin DOT com> |
| List-Archive: | <http://sourceware.org/ml/cygwin/> |
| List-Post: | <mailto:cygwin AT cygwin DOT com> |
| List-Help: | <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs> |
| Sender: | cygwin-owner AT cygwin DOT com |
| Mail-Followup-To: | cygwin AT cygwin DOT com |
| Delivered-To: | mailing list cygwin AT cygwin DOT com |
On 26 September 2006 11:17, The Blog User wrote:
> I am really struggling to understand what I am doing wrong here.
>
> I have a log file with a line that looks like this:
>
> ++ 04:51:32 All 94 items succeeded
>
> The binary data for that line is this:
>
> 2B 2B 20 30 34 3A 35 31 3A 33 32 20 41 6C 6C 20 39 34 20 69 74 65 6D 73 20
> 73 75 63 63 65 65 64 65 64 0A
>
> using grep and tail (versions below) I am failing to match that line
>
> $ tail -1 /path/to/file/the.log | grep -a "All \d*.items succeeded"
There's no such thing as \d.
> however if I insert 3 (why three?) dots (or a .*) between 'All' and '\d' I
> get a match, what is happening ?
The dots are eating the '94' as well as the space.
> This seems wrong to me, since - from my knowledge of regex's - that is
> saying there must be three characters between the 'All' and the first
> digit, yet I can see there is only a single space character.
Escaping a d just matches a literal 'd'. So the expression '\d*' matches
zero or more of the letter d. If you use the three dots to eat the two digits
as well as the space, the optional any-number-of-d's is matched by the zero
d's following, and then the trailing 'items succeeded' matches.
Whereas with only the one dot, the dot matches the space, then there's
zero-optional-'d's, then the '9' fails to match against '.items succeeded'.
cheers,
DaveK
--
Can't think of a witty .sigline today....
--
Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple
Problem reports: http://cygwin.com/problems.html
Documentation: http://cygwin.com/docs.html
FAQ: http://cygwin.com/faq/
| webmaster | delorie software privacy |
| Copyright © 2019 by DJ Delorie | Updated Jul 2019 |