with the PCRE_UTF8 option flag, or the pattern must start with the sequence
(*UTF8). When either of these is the case, both the pattern and any subject
strings that are matched against it are treated as UTF-8 strings instead of
strings of 1-byte characters. PCRE does not support any other formats (in
particular, it does not support UTF-16).
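.P
The difference between the two interpretations can be illustrated with a
minimal sketch. The helper \fButf8_char_count()\fP is hypothetical and is not
part of PCRE; it assumes its input is already valid UTF-8 and simply counts
the bytes that start a character.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper, not a PCRE function.
   Counts characters in a buffer that is assumed to be valid UTF-8.
   Every byte that is not a continuation byte (bit pattern 10xxxxxx)
   starts a new character. */
size_t utf8_char_count(const char *buf, size_t len)
{
    const unsigned char *s = (const unsigned char *)buf;
    size_t count = 0;

    assert(buf != NULL);
    for (size_t i = 0; i < len; i++) {
        if ((s[i] & 0xC0) != 0x80)
            count++;
    }
    return count;
}
```

.P
For a buffer holding the three bytes 0xE2 0x82 0xAC (the UTF-8 encoding of
U+20AC), this function returns 1, whereas an interpretation as 1-byte
characters would see three characters.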
.P
If you compile PCRE with UTF-8 support, but do not use it at run time, the
If you want to process strings of values in the full range 0 to 0x7FFFFFFF,
encoded in a UTF-8-like manner as per the old RFC, you can set
PCRE_NO_UTF8_CHECK to bypass the more restrictive test. However, in this
situation, you will have to apply your own validity check, and avoid the use of
JIT optimization.
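.P
What such a validity check might look like is sketched below. The function
\fButf8_like_is_valid()\fP is hypothetical and is not part of PCRE. The rules
it applies are assumptions chosen for illustration: sequences of 1 to 6 bytes
are accepted, and stray continuation bytes, the bytes 0xFE and 0xFF,
truncated sequences, and overlong encodings are rejected. It deliberately does
not reject surrogates or values above 0x10FFFF, because the aim here is to
allow the full range up to 0x7FFFFFFF.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper, not a PCRE function.
   Returns 1 if buf[0..len) is well formed under the assumed rules
   described above, and 0 otherwise. */
int utf8_like_is_valid(const char *buf, size_t len)
{
    /* min_value[n] is the smallest value that genuinely needs n bytes;
       anything smaller encoded in n bytes is an overlong encoding. */
    static const unsigned long min_value[7] = {
        0UL, 0x0UL, 0x80UL, 0x800UL, 0x10000UL, 0x200000UL, 0x4000000UL
    };
    const unsigned char *s = (const unsigned char *)buf;
    size_t i = 0;

    assert(buf != NULL);
    while (i < len) {
        unsigned char c = s[i];
        unsigned long value;
        int n;                      /* total bytes in this sequence */

        if (c < 0x80) {
            n = 1; value = c;
        } else if (c < 0xC0) {
            return 0;               /* stray continuation byte */
        } else if (c < 0xE0) {
            n = 2; value = c & 0x1F;
        } else if (c < 0xF0) {
            n = 3; value = c & 0x0F;
        } else if (c < 0xF8) {
            n = 4; value = c & 0x07;
        } else if (c < 0xFC) {
            n = 5; value = c & 0x03;
        } else if (c < 0xFE) {
            n = 6; value = c & 0x01;
        } else {
            return 0;               /* 0xFE and 0xFF are never valid */
        }

        if ((size_t)n > len - i)
            return 0;               /* sequence is truncated */

        for (int k = 1; k < n; k++) {
            if ((s[i + k] & 0xC0) != 0x80)
                return 0;           /* expected a continuation byte */
            value = (value << 6) | (unsigned long)(s[i + k] & 0x3F);
        }

        if (value < min_value[n])
            return 0;               /* overlong encoding */

        i += (size_t)n;
    }
    return 1;
}
```

.P
Under these assumptions, the 6-byte sequence 0xFD 0xBF 0xBF 0xBF 0xBF 0xBF,
which encodes 0x7FFFFFFF, is accepted, while the overlong 2-byte sequence
0xC0 0x80 is rejected.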
.
.
.P
5. The escape sequence \eC can be used to match a single byte in UTF-8 mode,
but its use can lead to some strange effects. This facility is not available in
the alternative matching function, \fBpcre_dfa_exec()\fP, nor is it supported
by the JIT optimization of \fBpcre_exec()\fP. If JIT optimization is requested
for a pattern that contains \eC, it will not succeed, and so the matching will
be carried out by the normal interpretive function.
.P
6. The character escapes \eb, \eB, \ed, \eD, \es, \eS, \ew, and \eW correctly