[pcre-dev] [Bug 712] \w class not matched few hungarian char…

Top Page
Delete this message
Author: Philip Hazel
Date:  
To: pcre-dev
Subject: [pcre-dev] [Bug 712] \w class not matched few hungarian characters
------- You are receiving this mail because: -------
You are on the CC list for the bug.

http://bugs.exim.org/show_bug.cgi?id=712

Philip Hazel <ph10@???> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|                            |INVALID





--- Comment #1 from Philip Hazel <ph10@???> 2008-05-19 17:21:34 ---
This is a quote from the pcrepattern man page:

"In UTF-8 mode, characters with values greater than 128 never match \d, \s, or
\w, and always match \D, \S, and \W. This is true even when Unicode
character property support is available. These sequences retain their original
meanings from before UTF-8 support was available, mainly for efficiency
reasons."

In other words, this is not a bug. If you want to test for properties of such
characters, you must use the \p escapes. Note that a bug concerning character
properties was fixed in release 7.5. The current release of PCRE is 7.7.


--
Configure bugmail: http://bugs.exim.org/userprefs.cgi?tab=email