--- code/trunk/doc/html/pcrepattern.html 2007/06/19 13:26:46 184
+++ code/trunk/doc/html/pcrepattern.html 2007/06/19 13:39:46 185
@@ -384,8 +384,10 @@
When PCRE is built with Unicode character property support, three additional -escape sequences to match character properties are available when UTF-8 mode -is selected. They are: +escape sequences that match characters with specific properties are available. +When not in UTF-8 mode, these sequences are of course limited to testing +characters whose codepoints are less than 256, but they do work in this mode. +The extra escape sequences are:
\p{xx} a character with the xx property
\P{xx} a character without the xx property
@@ -566,7 +568,8 @@
atomic group
(see below).
Characters with the "mark" property are typically accents that affect the
-preceding character.
+preceding character. None of them have codepoints less than 256, so in
+non-UTF-8 mode \X matches any one character.
Matching characters by Unicode property is not fast, because PCRE has to search
@@ -1987,7 +1990,7 @@
REVISION
-Last updated: 13 June 2007
+Last updated: 19 June 2007
Copyright © 1997-2007 University of Cambridge.