| 30 |
for requesting some minor changes that give better JavaScript compatibility. |
for requesting some minor changes that give better JavaScript compatibility. |
| 31 |
</P> |
</P> |
| 32 |
<P> |
<P> |
| 33 |
The current implementation of PCRE corresponds approximately with Perl |
The current implementation of PCRE corresponds approximately with Perl 5.12, |
| 34 |
5.10/5.11, including support for UTF-8 encoded strings and Unicode general |
including support for UTF-8 encoded strings and Unicode general category |
| 35 |
category properties. However, UTF-8 and Unicode support has to be explicitly |
properties. However, UTF-8 and Unicode support has to be explicitly enabled; it |
| 36 |
enabled; it is not the default. The Unicode tables correspond to Unicode |
is not the default. The Unicode tables correspond to Unicode release 5.2.0. |
|
release 5.2.0. |
|
| 37 |
</P> |
</P> |
| 38 |
<P> |
<P> |
| 39 |
In addition to the Perl-compatible matching function, PCRE contains an |
In addition to the Perl-compatible matching function, PCRE contains an |
| 275 |
low-valued characters, unless the PCRE_UCP option is set. |
low-valued characters, unless the PCRE_UCP option is set. |
| 276 |
</P> |
</P> |
| 277 |
<P> |
<P> |
| 278 |
8. However, the Perl 5.10 horizontal and vertical whitespace matching escapes |
8. However, the horizontal and vertical whitespace matching escapes (\h, \H, |
| 279 |
(\h, \H, \v, and \V) do match all the appropriate Unicode characters, |
\v, and \V) do match all the appropriate Unicode characters, whether or not |
| 280 |
whether or not PCRE_UCP is set. |
PCRE_UCP is set. |
| 281 |
</P> |
</P> |
| 282 |
<P> |
<P> |
| 283 |
9. Case-insensitive matching applies only to characters whose values are less |
9. Case-insensitive matching applies only to characters whose values are less |
| 285 |
property support is available, PCRE still uses its own character tables when |
property support is available, PCRE still uses its own character tables when |
| 286 |
checking the case of low-valued characters, so as not to degrade performance. |
checking the case of low-valued characters, so as not to degrade performance. |
| 287 |
The Unicode property information is used only for characters with higher |
The Unicode property information is used only for characters with higher |
| 288 |
values. Even when Unicode property support is available, PCRE supports |
values. Furthermore, PCRE supports case-insensitive matching only when there is |
| 289 |
case-insensitive matching only when there is a one-to-one mapping between a |
a one-to-one mapping between a letter's cases. There are a small number of |
| 290 |
letter's cases. There are a small number of many-to-one mappings in Unicode; |
many-to-one mappings in Unicode; these are not supported by PCRE. |
|
these are not supported by PCRE. |
|
| 291 |
</P> |
</P> |
| 292 |
<br><a name="SEC5" href="#TOC1">AUTHOR</a><br> |
<br><a name="SEC5" href="#TOC1">AUTHOR</a><br> |
| 293 |
<P> |
<P> |
| 305 |
</P> |
</P> |
| 306 |
<br><a name="SEC6" href="#TOC1">REVISION</a><br> |
<br><a name="SEC6" href="#TOC1">REVISION</a><br> |
| 307 |
<P> |
<P> |
| 308 |
Last updated: 22 October 2010 |
Last updated: 13 November 2010 |
| 309 |
<br> |
<br> |
| 310 |
Copyright © 1997-2010 University of Cambridge. |
Copyright © 1997-2010 University of Cambridge. |
| 311 |
<br> |
<br> |