--- code/trunk/ChangeLog	2010/06/09 19:30:57	538
+++ code/trunk/ChangeLog	2010/06/13 21:35:04	539
@@ -81,9 +81,8 @@
     used to create a list of bytes that can start a match. For \s, it was
     including 0x85 and 0xa0, which of course cannot start UTF-8 characters. I
     have changed the code so that only real ASCII characters (less than 128)
-    are set in this case because the \s etc escapes are documented as
-    recognizing only ASCII characters. (When PCRE_UCP is set - see 9 above -
-    the code is different altogether.)
+    and the correct starting bytes for UTF-8 encodings are set in this case.
+    (When PCRE_UCP is set - see 9 above - the code is different altogether.)
 
 
 Version 8.02 19-Mar-2010
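
The entry above concerns the table of bytes that can start a match. As a rough illustration of the idea only, the sketch below derives such a table from a character-class bitmap covering code points 0 to 255. It is not PCRE's actual code: the names set_bit, test_bit, build_start_bits and demo_whitespace_start_bits are hypothetical, and the 32-byte bitmap layout is an assumption made for the example. It relies on one fact about UTF-8: code points 128 to 255 are encoded as two bytes, the first of which is 0xC0 | (c >> 6), so 0x85 and 0xa0 both start with 0xC2.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helpers (not PCRE functions): a 256-bit map stored in 32 bytes. */
static void set_bit(uint8_t map[32], unsigned c)
{
    map[c >> 3] |= (uint8_t)(1u << (c & 7));
}

static int test_bit(const uint8_t map[32], unsigned c)
{
    return (map[c >> 3] >> (c & 7)) & 1;
}

/* Build a start-byte map from a class map of code points 0..255.
 * Assumption for this sketch: in UTF-8 mode a code point >= 128 cannot
 * appear as a single byte, so the bit set is the first byte of its
 * two-byte encoding, 0xC0 | (c >> 6), i.e. 0xC2 or 0xC3. */
static void build_start_bits(const uint8_t class_map[32],
                             uint8_t start_map[32], int utf8)
{
    memset(start_map, 0, 32);
    for (unsigned c = 0; c < 256; c++) {
        if (!test_bit(class_map, c))
            continue;
        if (utf8 && c >= 128)
            set_bit(start_map, 0xC0u | (c >> 6));
        else
            set_bit(start_map, c);
    }
}

/* Example: a whitespace-like class containing space, tab, 0x85 and 0xa0. */
static void demo_whitespace_start_bits(uint8_t start_map[32])
{
    uint8_t class_map[32] = {0};
    set_bit(class_map, ' ');
    set_bit(class_map, '\t');
    set_bit(class_map, 0x85);
    set_bit(class_map, 0xA0);
    build_start_bits(class_map, start_map, 1);

    /* The raw bytes 0x85 and 0xa0 are continuation bytes, never start bytes. */
    assert(!test_bit(start_map, 0x85));
    assert(!test_bit(start_map, 0xA0));
}
```

With utf8 set to 0 the same function copies the class bits unchanged, which is the behaviour one would expect for single-byte matching in this sketch.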