--- code/trunk/doc/html/pcretest.html 2007/08/08 14:24:50 210
+++ code/trunk/doc/html/pcretest.html 2007/08/09 09:52:43 211
@@ -406,7 +406,10 @@
 The use of \x{hh...} to represent UTF-8 characters is not dependent on the use
 of the /8 modifier on the pattern. It is recognized always. There may be any
 number of hexadecimal digits inside the braces. The result is from one to
-six bytes, encoded according to the UTF-8 rules.
+six bytes, encoded according to the original UTF-8 rules of RFC 2279. This
+allows for values in the range 0 to 0x7FFFFFFF. Note that not all of those are
+valid Unicode code points, or indeed valid UTF-8 characters according to the
+later rules in RFC 3629.


THE ALTERNATIVE MATCHING FUNCTION

@@ -668,7 +671,7 @@


REVISION

-Last updated: 24 April 2007
+Last updated: 09 August 2007
Copyright © 1997-2007 University of Cambridge.
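
Reviewer note on the first hunk: the new wording says that \x{hh...} values are encoded under the original RFC 2279 rules (one to six bytes, range 0 to 0x7FFFFFFF), and that some of those values are not valid under RFC 3629. The following is a minimal sketch of that scheme for illustration only. It is not taken from the pcretest sources, and the function names encode_rfc2279 and is_rfc3629_scalar are hypothetical.

```c
#include <stdint.h>

/* Hypothetical helper, not pcretest code: encode cp using the RFC 2279
 * byte layout. Returns the number of bytes written (1 to 6), or 0 if cp
 * is above 0x7FFFFFFF and so cannot be represented. */
static int encode_rfc2279(uint32_t cp, unsigned char out[6])
{
    /* Largest value that fits in a sequence of 1, 2, ... 6 bytes. */
    static const uint32_t limit[6] = {
        0x7F, 0x7FF, 0xFFFF, 0x1FFFFF, 0x3FFFFFF, 0x7FFFFFFF
    };
    /* Fixed high bits of the lead byte for each sequence length. */
    static const unsigned char lead[6] = {
        0x00, 0xC0, 0xE0, 0xF0, 0xF8, 0xFC
    };
    int n, i;

    if (cp > 0x7FFFFFFF)
        return 0;

    /* n becomes the number of continuation bytes (0 to 5). */
    for (n = 0; n < 6; n++)
        if (cp <= limit[n])
            break;

    /* Fill continuation bytes from the end, six payload bits each. */
    for (i = n; i > 0; i--) {
        out[i] = (unsigned char)(0x80 | (cp & 0x3F));
        cp >>= 6;
    }
    out[0] = (unsigned char)(lead[n] | cp);
    return n + 1;
}

/* Hypothetical helper: nonzero if cp may also be encoded under RFC 3629,
 * which caps values at 0x10FFFF and excludes the surrogate range. */
static int is_rfc3629_scalar(uint32_t cp)
{
    return cp <= 0x10FFFF && !(cp >= 0xD800 && cp <= 0xDFFF);
}
```

With this sketch, encode_rfc2279(0x20AC, buf) writes the three bytes E2 82 AC, while encode_rfc2279(0x7FFFFFFF, buf) writes the six bytes FD BF BF BF BF BF, a sequence that is_rfc3629_scalar reports as outside the later RFC 3629 rules.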