Diff of /code/trunk/doc/pcresyntax.3

--- /code/trunk/doc/pcresyntax.3	revision 517 by ph10, Wed May 5 10:44:20 2010 UTC
+++ /code/trunk/doc/pcresyntax.3	revision 518 by ph10, Tue May 18 15:47:01 2010 UTC
@@ -57,7 +57,9 @@ syntax.
   \eW         a "non-word" character
   \eX         an extended Unicode sequence
 .sp
-In PCRE, \ed, \eD, \es, \eS, \ew, and \eW recognize only ASCII characters.
+In PCRE, by default, \ed, \eD, \es, \eS, \ew, and \eW recognize only ASCII
+characters, even in UTF-8 mode. However, this can be changed by setting the
+PCRE_UCP option.
 .
 .
 .SH "GENERAL CATEGORY PROPERTIES FOR \ep and \eP"
@@ -239,7 +241,8 @@ Yi.
   word        same as \ew
   xdigit      hexadecimal digit
 .sp
-In PCRE, POSIX character set names recognize only ASCII characters. You can use
+In PCRE, POSIX character set names recognize only ASCII characters by default,
+but some of them use Unicode properties if PCRE_UCP is set. You can use
 \eQ...\eE inside a character class.
 .
 .
@@ -267,7 +270,7 @@ In PCRE, POSIX character set names recog
 .SH "ANCHORS AND SIMPLE ASSERTIONS"
 .rs
 .sp
-  \eb          word boundary (only ASCII letters recognized)
+  \eb          word boundary
   \eB          not a word boundary
   ^           start of subject
                also after internal newline in multiline mode
@@ -330,10 +333,11 @@ In PCRE, POSIX character set names recog
   (?x)            extended (ignore white space)
   (?-...)         unset option(s)
 .sp
-The following is recognized only at the start of a pattern or after one of the
+The following are recognized only at the start of a pattern or after one of the
 newline-setting options with similar syntax:
 .sp
-  (*UTF8)         set UTF-8 mode
+  (*UTF8)         set UTF-8 mode (PCRE_UTF8)
+  (*UCP)          set PCRE_UCP (use Unicode properties for \ed etc)
 .
 .
 .SH "LOOKAHEAD AND LOOKBEHIND ASSERTIONS"
@@ -422,7 +426,7 @@ pattern is not anchored.
 .rs
 .sp
 These are recognized only at the very start of the pattern or after a
-(*BSR_...) or (*UTF8) option.
+(*BSR_...) or (*UTF8) or (*UCP) option.
 .sp
   (*CR)           carriage return only
   (*LF)           linefeed only
@@ -435,7 +439,7 @@ These are recognized only at the very st
 .rs
 .sp
 These are recognized only at the very start of the pattern or after a
-(*...) option that sets the newline convention or UTF-8 mode.
+(*...) option that sets the newline convention or UTF-8 or UCP mode.
 .sp
   (*BSR_ANYCRLF)  CR, LF, or CRLF
   (*BSR_UNICODE)  any Unicode newline sequence
@@ -469,6 +473,6 @@ Cambridge CB2 3QH, England.
 .rs
 .sp
 .nf
-Last updated: 05 May 2010
+Last updated: 12 May 2010
 Copyright (c) 1997-2010 University of Cambridge.
 .fi
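
Note on the change above: revision 518 documents that \d, \w, \s and \b are ASCII-only by default and become Unicode-aware when PCRE_UCP (or a leading (*UCP)) is set. The sketch below illustrates that distinction by analogy only. It uses Python's standard re module, which is not PCRE and has no PCRE_UCP option; the re.ASCII flag is assumed here to stand in for PCRE's default behaviour, and its absence for the PCRE_UCP behaviour. The variable and function names are hypothetical.

```python
import re

# Analogy only: Python's re is not PCRE. With re.ASCII, \d, \w and \b
# consider only ASCII characters (similar to PCRE's default). Without it,
# they use Unicode properties (similar to PCRE with PCRE_UCP set).
ascii_digit = re.compile(r"\d", re.ASCII)
unicode_digit = re.compile(r"\d")

arabic_indic_three = "\u0663"  # ARABIC-INDIC DIGIT THREE, a non-ASCII digit


def fully_matches(pattern, text):
    """Return True if the whole of text matches pattern."""
    return pattern.fullmatch(text) is not None


ascii_digit_result = fully_matches(ascii_digit, arabic_indic_three)
unicode_digit_result = fully_matches(unicode_digit, arabic_indic_three)

# Word boundaries depend on the same definition of a "word" character.
# In ASCII mode the accented letter is a non-word character, so a boundary
# exists between "f" and it. In Unicode mode it is a word character, so
# there is no boundary at that position.
subject = "caf\u00e9"  # "cafe" with an acute accent on the final letter
ascii_boundary_result = re.search(r"\bcaf\b", subject, re.ASCII) is not None
unicode_boundary_result = re.search(r"\bcaf\b", subject) is not None
```

In this sketch the ASCII-mode digit test fails and the Unicode-mode one succeeds, while the word-boundary test behaves the other way round, which mirrors why the "(only ASCII letters recognized)" remark was dropped from the \b entry once the behaviour became option-dependent.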