--- code/trunk/doc/pcretest.1 2007/04/17 15:55:53 152 +++ code/trunk/doc/pcretest.1 2007/08/09 09:52:43 211 @@ -411,7 +411,10 @@ The use of \ex{hh...} to represent UTF-8 characters is not dependent on the use of the \fB/8\fP modifier on the pattern. It is recognized always. There may be any number of hexadecimal digits inside the braces. The result is from one to -six bytes, encoded according to the UTF-8 rules. +six bytes, encoded according to the original UTF-8 rules of RFC 2279. This +allows for values in the range 0 to 0x7FFFFFFF. Note that not all of those are +valid Unicode code points, or indeed valid UTF-8 characters according to the +later rules in RFC 3629. . . .SH "THE ALTERNATIVE MATCHING FUNCTION" @@ -537,7 +540,7 @@ match with additional subject data by means of the \eR escape sequence. For example: .sp - re> /^\d?\d(jan|feb|mar|apr|may|jun|jul|aug|sep|oct|nov|dec)\d\d$/ + re> /^\ed?\ed(jan|feb|mar|apr|may|jun|jul|aug|sep|oct|nov|dec)\ed\ed$/ data> 23ja\eP\eD Partial match: 23ja data> n05\eR\eD @@ -690,6 +693,6 @@ .rs .sp .nf -Last updated: 16 April 2007 +Last updated: 09 August 2007 Copyright (c) 1997-2007 University of Cambridge. .fi