
Diff of /code/trunk/doc/pcreapi.3


revision 967 by ph10, Fri May 4 13:03:39 2012 UTC
revision 968 by ph10, Sat May 26 14:22:52 2012 UTC
# Line 302  PCRE supports five different conventions Line 302  PCRE supports five different conventions
302  strings: a single CR (carriage return) character, a single LF (linefeed)  strings: a single CR (carriage return) character, a single LF (linefeed)
303  character, the two-character sequence CRLF, any of the three preceding, or any  character, the two-character sequence CRLF, any of the three preceding, or any
304  Unicode newline sequence. The Unicode newline sequences are the three just  Unicode newline sequence. The Unicode newline sequences are the three just
305  mentioned, plus the single characters VT (vertical tab, U+000B), FF (formfeed,  mentioned, plus the single characters VT (vertical tab, U+000B), FF (form feed,
306  U+000C), NEL (next line, U+0085), LS (line separator, U+2028), and PS  U+000C), NEL (next line, U+0085), LS (line separator, U+2028), and PS
307  (paragraph separator, U+2029).  (paragraph separator, U+2029).
308  .P  .P
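As an illustration of the "any Unicode newline sequence" convention described in this hunk, the following is a minimal sketch in C. It is not PCRE code: the function name unicode_newline_len is hypothetical, and it assumes the subject has already been decoded into an array of code points.

```c
#include <stddef.h>

/* Hypothetical helper, not part of the PCRE API.
   Returns the length, in code points, of the Unicode newline sequence that
   starts at s[i], or 0 if no newline sequence starts there. */
size_t unicode_newline_len(const unsigned int *s, size_t n, size_t i)
{
    if (i >= n)
        return 0;
    switch (s[i]) {
    case 0x000D:  /* CR: counts as CRLF when immediately followed by LF */
        return (i + 1 < n && s[i + 1] == 0x000A) ? 2 : 1;
    case 0x000A:  /* LF  (linefeed) */
    case 0x000B:  /* VT  (vertical tab) */
    case 0x000C:  /* FF  (form feed) */
    case 0x0085:  /* NEL (next line) */
    case 0x2028:  /* LS  (line separator) */
    case 0x2029:  /* PS  (paragraph separator) */
        return 1;
    default:
        return 0;
    }
}
```

A caller scanning a subject would advance by the returned length, so that a CRLF pair is treated as one newline rather than two.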
# Line 642  documentation. Line 642  documentation.
642  .sp  .sp
643    PCRE_EXTENDED    PCRE_EXTENDED
644  .sp  .sp
645  If this bit is set, whitespace data characters in the pattern are totally  If this bit is set, white space data characters in the pattern are totally
646  ignored except when escaped or inside a character class. Whitespace does not  ignored except when escaped or inside a character class. White space does not
647  include the VT character (code 11). In addition, characters between an  include the VT character (code 11). In addition, characters between an
648  unescaped # outside a character class and the next newline, inclusive, are also  unescaped # outside a character class and the next newline, inclusive, are also
649  ignored. This is equivalent to Perl's /x option, and it can be changed within a  ignored. This is equivalent to Perl's /x option, and it can be changed within a
# Line 661  comment is a literal newline sequence in Line 661  comment is a literal newline sequence in
661  happen to represent a newline do not count.  happen to represent a newline do not count.
662  .P  .P
663  This option makes it possible to include comments inside complicated patterns.  This option makes it possible to include comments inside complicated patterns.
664  Note, however, that this applies only to data characters. Whitespace characters  Note, however, that this applies only to data characters. White space characters
665  may never appear within special character sequences in a pattern, for example  may never appear within special character sequences in a pattern, for example
666  within the sequence (?( that introduces a conditional subpattern.  within the sequence (?( that introduces a conditional subpattern.
667  .sp  .sp
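The PCRE_EXTENDED rules quoted in the hunks above can be sketched as a small C function that drops what the documentation says is ignored. This is only an approximation under stated assumptions, not how PCRE itself processes patterns: the name strip_extended is hypothetical, the newline convention is assumed to be a single LF, and character class detection is simplified (a leading ] in a class, POSIX classes such as [:alpha:], and \Q...\E quoting are not handled).

```c
#include <stddef.h>

/* Hypothetical helper, not part of the PCRE API.
   Copies pattern into out, dropping the characters that extended mode
   ignores. out must have room for strlen(pattern) + 1 bytes.
   Returns the length of the result. */
size_t strip_extended(const char *pattern, char *out)
{
    size_t o = 0;
    int in_class = 0;   /* nonzero while inside [...] */

    for (size_t i = 0; pattern[i] != '\0'; i++) {
        char c = pattern[i];

        /* An escaped character is always kept, together with its backslash. */
        if (c == '\\' && pattern[i + 1] != '\0') {
            out[o++] = c;
            out[o++] = pattern[++i];
            continue;
        }
        /* Inside a character class nothing is ignored. */
        if (in_class) {
            if (c == ']')
                in_class = 0;
            out[o++] = c;
            continue;
        }
        if (c == '[') {
            in_class = 1;
            out[o++] = c;
            continue;
        }
        /* White space is dropped; VT (code 11) is deliberately not listed. */
        if (c == ' ' || c == '\t' || c == '\n' || c == '\f' || c == '\r')
            continue;
        /* A comment runs from # to the next newline, inclusive. */
        if (c == '#') {
            while (pattern[i] != '\0' && pattern[i] != '\n')
                i++;
            if (pattern[i] == '\0')
                break;
            continue;   /* the loop increment steps past the newline */
        }
        out[o++] = c;
    }
    out[o] = '\0';
    return o;
}
```

Under these assumptions, a pattern such as `\d+ # digits` followed by a newline and ` [ #]x\ y` reduces to `\d+[ #]x\ y`: the comment and unescaped spaces disappear, while the space and # inside the class and the escaped space survive.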
# Line 741  CRLF sequence. Setting PCRE_NEWLINE_ANYC Line 741  CRLF sequence. Setting PCRE_NEWLINE_ANYC
741  preceding sequences should be recognized. Setting PCRE_NEWLINE_ANY specifies  preceding sequences should be recognized. Setting PCRE_NEWLINE_ANY specifies
742  that any Unicode newline sequence should be recognized. The Unicode newline  that any Unicode newline sequence should be recognized. The Unicode newline
743  sequences are the three just mentioned, plus the single characters VT (vertical  sequences are the three just mentioned, plus the single characters VT (vertical
744  tab, U+000B), FF (formfeed, U+000C), NEL (next line, U+0085), LS (line  tab, U+000B), FF (form feed, U+000C), NEL (next line, U+0085), LS (line
745  separator, U+2028), and PS (paragraph separator, U+2029). For the 8-bit  separator, U+2028), and PS (paragraph separator, U+2029). For the 8-bit
746  library, the last two are recognized only in UTF-8 mode.  library, the last two are recognized only in UTF-8 mode.
747  .P  .P
# Line 753  PCRE_NEWLINE_CR with PCRE_NEWLINE_LF is Line 753  PCRE_NEWLINE_CR with PCRE_NEWLINE_LF is
753  other combinations may yield unused numbers and cause an error.  other combinations may yield unused numbers and cause an error.
754  .P  .P
755  The only time that a line break in a pattern is specially recognized when  The only time that a line break in a pattern is specially recognized when
756  compiling is when PCRE_EXTENDED is set. CR and LF are whitespace characters,  compiling is when PCRE_EXTENDED is set. CR and LF are white space characters,
757  and so are ignored in this mode. Also, an unescaped # outside a character class  and so are ignored in this mode. Also, an unescaped # outside a character class
758  indicates a comment that lasts until after the next line break sequence. In  indicates a comment that lasts until after the next line break sequence. In
759  other circumstances, line break sequences in patterns are treated as literal  other circumstances, line break sequences in patterns are treated as literal

