| 302 |
strings: a single CR (carriage return) character, a single LF (linefeed) |
strings: a single CR (carriage return) character, a single LF (linefeed) |
| 303 |
character, the two-character sequence CRLF, any of the three preceding, or any |
character, the two-character sequence CRLF, any of the three preceding, or any |
| 304 |
Unicode newline sequence. The Unicode newline sequences are the three just |
Unicode newline sequence. The Unicode newline sequences are the three just |
| 305 |
mentioned, plus the single characters VT (vertical tab, U+000B), FF (formfeed, |
mentioned, plus the single characters VT (vertical tab, U+000B), FF (form feed, |
| 306 |
U+000C), NEL (next line, U+0085), LS (line separator, U+2028), and PS |
U+000C), NEL (next line, U+0085), LS (line separator, U+2028), and PS |
| 307 |
(paragraph separator, U+2029). |
(paragraph separator, U+2029). |
| 308 |
.P |
.P |
| 642 |
.sp |
.sp |
| 643 |
PCRE_EXTENDED |
PCRE_EXTENDED |
| 644 |
.sp |
.sp |
| 645 |
If this bit is set, whitespace data characters in the pattern are totally |
If this bit is set, white space data characters in the pattern are totally |
| 646 |
ignored except when escaped or inside a character class. Whitespace does not |
ignored except when escaped or inside a character class. White space does not |
| 647 |
include the VT character (code 11). In addition, characters between an |
include the VT character (code 11). In addition, characters between an |
| 648 |
unescaped # outside a character class and the next newline, inclusive, are also |
unescaped # outside a character class and the next newline, inclusive, are also |
| 649 |
ignored. This is equivalent to Perl's /x option, and it can be changed within a |
ignored. This is equivalent to Perl's /x option, and it can be changed within a |
| 661 |
happen to represent a newline do not count. |
happen to represent a newline do not count. |
| 662 |
.P |
.P |
| 663 |
This option makes it possible to include comments inside complicated patterns. |
This option makes it possible to include comments inside complicated patterns. |
| 664 |
Note, however, that this applies only to data characters. Whitespace characters |
Note, however, that this applies only to data characters. White space characters |
| 665 |
may never appear within special character sequences in a pattern, for example |
may never appear within special character sequences in a pattern, for example |
| 666 |
within the sequence (?( that introduces a conditional subpattern. |
within the sequence (?( that introduces a conditional subpattern. |
| 667 |
.sp |
.sp |
| 741 |
preceding sequences should be recognized. Setting PCRE_NEWLINE_ANY specifies |
preceding sequences should be recognized. Setting PCRE_NEWLINE_ANY specifies |
| 742 |
that any Unicode newline sequence should be recognized. The Unicode newline |
that any Unicode newline sequence should be recognized. The Unicode newline |
| 743 |
sequences are the three just mentioned, plus the single characters VT (vertical |
sequences are the three just mentioned, plus the single characters VT (vertical |
| 744 |
tab, U+000B), FF (formfeed, U+000C), NEL (next line, U+0085), LS (line |
tab, U+000B), FF (form feed, U+000C), NEL (next line, U+0085), LS (line |
| 745 |
separator, U+2028), and PS (paragraph separator, U+2029). For the 8-bit |
separator, U+2028), and PS (paragraph separator, U+2029). For the 8-bit |
| 746 |
library, the last two are recognized only in UTF-8 mode. |
library, the last two are recognized only in UTF-8 mode. |
| 747 |
.P |
.P |
| 753 |
other combinations may yield unused numbers and cause an error. |
other combinations may yield unused numbers and cause an error. |
| 754 |
.P |
.P |
| 755 |
The only time that a line break in a pattern is specially recognized when |
The only time that a line break in a pattern is specially recognized when |
| 756 |
compiling is when PCRE_EXTENDED is set. CR and LF are whitespace characters, |
compiling is when PCRE_EXTENDED is set. CR and LF are white space characters, |
| 757 |
and so are ignored in this mode. Also, an unescaped # outside a character class |
and so are ignored in this mode. Also, an unescaped # outside a character class |
| 758 |
indicates a comment that lasts until after the next line break sequence. In |
indicates a comment that lasts until after the next line break sequence. In |
| 759 |
other circumstances, line break sequences in patterns are treated as literal |
other circumstances, line break sequences in patterns are treated as literal |