| 376 |
.rs |
.rs |
| 377 |
.sp |
.sp |
| 378 |
When PCRE is built with Unicode character property support, three additional |
When PCRE is built with Unicode character property support, three additional |
| 379 |
escape sequences to match character properties are available when UTF-8 mode |
escape sequences that match characters with specific properties are available. |
| 380 |
is selected. They are: |
When not in UTF-8 mode, these sequences are of course limited to testing |
| 381 |
|
characters whose codepoints are less than 256, but they do work in this mode. |
| 382 |
|
The extra escape sequences are: |
| 383 |
.sp |
.sp |
| 384 |
\ep{\fIxx\fP} a character with the \fIxx\fP property |
\ep{\fIxx\fP} a character with the \fIxx\fP property |
| 385 |
\eP{\fIxx\fP} a character without the \fIxx\fP property |
\eP{\fIxx\fP} a character without the \fIxx\fP property |
| 555 |
(see below). |
(see below). |
| 556 |
.\" |
.\" |
| 557 |
Characters with the "mark" property are typically accents that affect the |
Characters with the "mark" property are typically accents that affect the |
| 558 |
preceding character. |
preceding character. None of them have codepoints less than 256, so in |
| 559 |
|
non-UTF-8 mode \eX matches any one character. |
| 560 |
.P |
.P |
| 561 |
Matching characters by Unicode property is not fast, because PCRE has to search |
Matching characters by Unicode property is not fast, because PCRE has to search |
| 562 |
a structure that contains data for over fifteen thousand characters. That is |
a structure that contains data for over fifteen thousand characters. That is |
| 1976 |
.rs |
.rs |
| 1977 |
.sp |
.sp |
| 1978 |
.nf |
.nf |
| 1979 |
Last updated: 13 June 2007 |
Last updated: 19 June 2007 |
| 1980 |
Copyright (c) 1997-2007 University of Cambridge. |
Copyright (c) 1997-2007 University of Cambridge. |
| 1981 |
.fi |
.fi |