--- code/trunk/doc/html/pcrepattern.html 2007/06/19 13:26:46 184 +++ code/trunk/doc/html/pcrepattern.html 2007/06/19 13:39:46 185 @@ -384,8 +384,10 @@

When PCRE is built with Unicode character property support, three additional -escape sequences to match character properties are available when UTF-8 mode -is selected. They are: +escape sequences that match characters with specific properties are available. +When not in UTF-8 mode, these sequences are of course limited to testing +characters whose codepoints are less than 256, but they do work in this mode. +The extra escape sequences are:

   \p{xx}   a character with the xx property
   \P{xx}   a character without the xx property
@@ -566,7 +568,8 @@
 atomic group
 (see below).
 Characters with the "mark" property are typically accents that affect the
-preceding character.
+preceding character. None of them have codepoints less than 256, so in
+non-UTF-8 mode \X matches any one character.
 

Matching characters by Unicode property is not fast, because PCRE has to search @@ -1987,7 +1990,7 @@


REVISION

-Last updated: 13 June 2007 +Last updated: 19 June 2007
Copyright © 1997-2007 University of Cambridge.