| 215 |
<b>/U</b> PCRE_UNGREEDY |
<b>/U</b> PCRE_UNGREEDY |
| 216 |
<b>/W</b> PCRE_UCP |
<b>/W</b> PCRE_UCP |
| 217 |
<b>/X</b> PCRE_EXTRA |
<b>/X</b> PCRE_EXTRA |
| 218 |
|
<b>/Y</b> PCRE_NO_START_OPTIMIZE |
| 219 |
<b>/<JS></b> PCRE_JAVASCRIPT_COMPAT |
<b>/<JS></b> PCRE_JAVASCRIPT_COMPAT |
| 220 |
<b>/<cr></b> PCRE_NEWLINE_CR |
<b>/<cr></b> PCRE_NEWLINE_CR |
| 221 |
<b>/<lf></b> PCRE_NEWLINE_LF |
<b>/<lf></b> PCRE_NEWLINE_LF |
| 257 |
PCRE_ANCHORED flags set in order to search for another, non-empty, match at the |
PCRE_ANCHORED flags set in order to search for another, non-empty, match at the |
| 258 |
same point. If this second match fails, the start offset is advanced, and the |
same point. If this second match fails, the start offset is advanced, and the |
| 259 |
normal match is retried. This imitates the way Perl handles such cases when |
normal match is retried. This imitates the way Perl handles such cases when |
| 260 |
using the <b>/g</b> modifier or the <b>split()</b> function. Normally, the start |
using the <b>/g</b> modifier or the <b>split()</b> function. Normally, the start |
| 261 |
offset is advanced by one character, but if the newline convention recognizes |
offset is advanced by one character, but if the newline convention recognizes |
| 262 |
CRLF as a newline, and the current character is CR followed by LF, an advance |
CRLF as a newline, and the current character is CR followed by LF, an advance |
| 263 |
of two is used. |
of two is used. |
| 264 |
</P> |
</P> |
| 265 |
<br><b> |
<br><b> |
| 386 |
\t tab (\x09) |
\t tab (\x09) |
| 387 |
\v vertical tab (\x0b) |
\v vertical tab (\x0b) |
| 388 |
\nnn octal character (up to 3 octal digits) |
\nnn octal character (up to 3 octal digits) |
| 389 |
always a byte unless > 255 in UTF-8 mode |
always a byte unless > 255 in UTF-8 mode |
| 390 |
\xhh hexadecimal byte (up to 2 hex digits) |
\xhh hexadecimal byte (up to 2 hex digits) |
| 391 |
\x{hh...} hexadecimal character, any number of digits in UTF-8 mode |
\x{hh...} hexadecimal character, any number of digits in UTF-8 mode |
| 392 |
\A pass the PCRE_ANCHORED option to <b>pcre_exec()</b> or <b>pcre_dfa_exec()</b> |
\A pass the PCRE_ANCHORED option to <b>pcre_exec()</b> or <b>pcre_dfa_exec()</b> |
| 417 |
\Y pass the PCRE_NO_START_OPTIMIZE option to <b>pcre_exec()</b> or <b>pcre_dfa_exec()</b> |
\Y pass the PCRE_NO_START_OPTIMIZE option to <b>pcre_exec()</b> or <b>pcre_dfa_exec()</b> |
| 418 |
\Z pass the PCRE_NOTEOL option to <b>pcre_exec()</b> or <b>pcre_dfa_exec()</b> |
\Z pass the PCRE_NOTEOL option to <b>pcre_exec()</b> or <b>pcre_dfa_exec()</b> |
| 419 |
\? pass the PCRE_NO_UTF8_CHECK option to <b>pcre_exec()</b> or <b>pcre_dfa_exec()</b> |
\? pass the PCRE_NO_UTF8_CHECK option to <b>pcre_exec()</b> or <b>pcre_dfa_exec()</b> |
| 420 |
\>dd start the match at offset dd (optional "-"; then any number of digits); this sets the <i>startoffset</i> |
\>dd start the match at offset dd (optional "-"; then any number of digits); this sets the <i>startoffset</i> |
| 421 |
argument for <b>pcre_exec()</b> or <b>pcre_dfa_exec()</b> |
argument for <b>pcre_exec()</b> or <b>pcre_dfa_exec()</b> |
| 422 |
\<cr> pass the PCRE_NEWLINE_CR option to <b>pcre_exec()</b> or <b>pcre_dfa_exec()</b> |
\<cr> pass the PCRE_NEWLINE_CR option to <b>pcre_exec()</b> or <b>pcre_dfa_exec()</b> |
| 423 |
\<lf> pass the PCRE_NEWLINE_LF option to <b>pcre_exec()</b> or <b>pcre_dfa_exec()</b> |
\<lf> pass the PCRE_NEWLINE_LF option to <b>pcre_exec()</b> or <b>pcre_dfa_exec()</b> |
| 425 |
\<anycrlf> pass the PCRE_NEWLINE_ANYCRLF option to <b>pcre_exec()</b> or <b>pcre_dfa_exec()</b> |
\<anycrlf> pass the PCRE_NEWLINE_ANYCRLF option to <b>pcre_exec()</b> or <b>pcre_dfa_exec()</b> |
| 426 |
\<any> pass the PCRE_NEWLINE_ANY option to <b>pcre_exec()</b> or <b>pcre_dfa_exec()</b> |
\<any> pass the PCRE_NEWLINE_ANY option to <b>pcre_exec()</b> or <b>pcre_dfa_exec()</b> |
| 427 |
</pre> |
</pre> |
| 428 |
Note that \xhh always specifies one byte, even in UTF-8 mode; this makes it |
Note that \xhh always specifies one byte, even in UTF-8 mode; this makes it |
| 429 |
possible to construct invalid UTF-8 sequences for testing purposes. On the |
possible to construct invalid UTF-8 sequences for testing purposes. On the |
| 430 |
other hand, \x{hh} is interpreted as a UTF-8 character in UTF-8 mode, |
other hand, \x{hh} is interpreted as a UTF-8 character in UTF-8 mode, |
| 431 |
generating more than one byte if the value is greater than 127. When not in |
generating more than one byte if the value is greater than 127. When not in |
| 432 |
UTF-8 mode, it generates one byte for values less than 256, and causes an error |
UTF-8 mode, it generates one byte for values less than 256, and causes an error |
| 433 |
for greater values. |
for greater values. |
| 434 |
</P> |
</P> |
| 435 |
<P> |
<P> |
| 757 |
</P> |
</P> |
| 758 |
<br><a name="SEC15" href="#TOC1">REVISION</a><br> |
<br><a name="SEC15" href="#TOC1">REVISION</a><br> |
| 759 |
<P> |
<P> |
| 760 |
Last updated: 07 November 2010 |
Last updated: 21 November 2010 |
| 761 |
<br> |
<br> |
| 762 |
Copyright © 1997-2010 University of Cambridge. |
Copyright © 1997-2010 University of Cambridge. |
| 763 |
<br> |
<br> |