| 6 |
.sp |
.sp |
| 7 |
The PCRE library is a set of functions that implement regular expression |
The PCRE library is a set of functions that implement regular expression |
| 8 |
pattern matching using the same syntax and semantics as Perl, with just a few |
pattern matching using the same syntax and semantics as Perl, with just a few |
| 9 |
differences. Certain features that appeared in Python and PCRE before they |
differences. Some features that appeared in Python and PCRE before they |
| 10 |
appeared in Perl are also available using the Python syntax. There is also some |
appeared in Perl are also available using the Python syntax, there is some |
| 11 |
support for certain .NET and Oniguruma syntax items, and there is an option for |
support for one or two .NET and Oniguruma syntax items, and there is an option |
| 12 |
requesting some minor changes that give better JavaScript compatibility. |
for requesting some minor changes that give better JavaScript compatibility. |
| 13 |
.P |
.P |
| 14 |
The current implementation of PCRE (release 8.xx) corresponds approximately |
The current implementation of PCRE corresponds approximately with Perl 5.10, |
| 15 |
with Perl 5.10, including support for UTF-8 encoded strings and Unicode general |
including support for UTF-8 encoded strings and Unicode general category |
| 16 |
category properties. However, UTF-8 and Unicode support has to be explicitly |
properties. However, UTF-8 and Unicode support has to be explicitly enabled; it |
| 17 |
enabled; it is not the default. The Unicode tables correspond to Unicode |
is not the default. The Unicode tables correspond to Unicode release 5.1. |
|
release 5.1. |
|
| 18 |
.P |
.P |
| 19 |
In addition to the Perl-compatible matching function, PCRE contains an |
In addition to the Perl-compatible matching function, PCRE contains an |
| 20 |
alternative matching function that matches the same compiled patterns in a |
alternative function that matches the same compiled patterns in a different |
| 21 |
different way. In certain circumstances, the alternative function has some |
way. In certain circumstances, the alternative function has some advantages. |
| 22 |
advantages. For a discussion of the two matching algorithms, see the |
For a discussion of the two matching algorithms, see the |
| 23 |
.\" HREF |
.\" HREF |
| 24 |
\fBpcrematching\fP |
\fBpcrematching\fP |
| 25 |
.\" |
.\" |
| 65 |
\fBpcrebuild\fP |
\fBpcrebuild\fP |
| 66 |
.\" |
.\" |
| 67 |
page. Documentation about building PCRE for various operating systems can be |
page. Documentation about building PCRE for various operating systems can be |
| 68 |
found in the \fBREADME\fP file in the source distribution. |
found in the \fBREADME\fP and \fBNON-UNIX-USE\fP files in the source |
| 69 |
|
distribution. |
| 70 |
.P |
.P |
| 71 |
The library contains a number of undocumented internal functions and data |
The library contains a number of undocumented internal functions and data |
| 72 |
tables that are used by more than one of the exported external functions, but |
tables that are used by more than one of the exported external functions, but |
| 100 |
.\" JOIN |
.\" JOIN |
| 101 |
pcrepattern syntax and semantics of supported |
pcrepattern syntax and semantics of supported |
| 102 |
regular expressions |
regular expressions |
|
pcresyntax quick syntax reference |
|
| 103 |
pcreperform discussion of performance issues |
pcreperform discussion of performance issues |
| 104 |
pcreposix the POSIX-compatible C API |
pcreposix the POSIX-compatible C API |
| 105 |
pcreprecompile details of saving and re-using precompiled patterns |
pcreprecompile details of saving and re-using precompiled patterns |
| 106 |
pcresample discussion of the pcredemo program |
pcresample discussion of the pcredemo program |
| 107 |
pcrestack discussion of stack usage |
pcrestack discussion of stack usage |
| 108 |
|
pcresyntax quick syntax reference |
| 109 |
pcretest description of the \fBpcretest\fP testing command |
pcretest description of the \fBpcretest\fP testing command |
| 110 |
.sp |
.sp |
| 111 |
In addition, in the "man" and HTML formats, there is a short page for each |
In addition, in the "man" and HTML formats, there is a short page for each |
| 148 |
.\" |
.\" |
| 149 |
documentation. |
documentation. |
| 150 |
. |
. |
|
.\" HTML <a name="utf8support"></a> |
|
| 151 |
. |
. |
| 152 |
|
.\" HTML <a name="utf8support"></a> |
| 153 |
. |
. |
| 154 |
.SH "UTF-8 AND UNICODE PROPERTY SUPPORT" |
.SH "UTF-8 AND UNICODE PROPERTY SUPPORT" |
| 155 |
.rs |
.rs |
| 167 |
with the PCRE_UTF8 option flag, or the pattern must start with the sequence |
with the PCRE_UTF8 option flag, or the pattern must start with the sequence |
| 168 |
(*UTF8). When either of these is the case, both the pattern and any subject |
(*UTF8). When either of these is the case, both the pattern and any subject |
| 169 |
strings that are matched against it are treated as UTF-8 strings instead of |
strings that are matched against it are treated as UTF-8 strings instead of |
| 170 |
just strings of bytes. |
strings of 1-byte characters. |
| 171 |
.P |
.P |
| 172 |
If you compile PCRE with UTF-8 support, but do not use it at run time, the |
If you compile PCRE with UTF-8 support, but do not use it at run time, the |
| 173 |
library will be a bit bigger, but the additional run time overhead is limited |
library will be a bit bigger, but the additional run time overhead is limited |
| 187 |
Furthermore, in Perl, many properties may optionally be prefixed by "Is", for |
Furthermore, in Perl, many properties may optionally be prefixed by "Is", for |
| 188 |
compatibility with Perl 5.6. PCRE does not support this. |
compatibility with Perl 5.6. PCRE does not support this. |
| 189 |
. |
. |
| 190 |
|
. |
| 191 |
.\" HTML <a name="utf8strings"></a> |
.\" HTML <a name="utf8strings"></a> |
| 192 |
. |
. |
| 193 |
.SS "Validity of UTF-8 strings" |
.SS "Validity of UTF-8 strings" |
| 293 |
.rs |
.rs |
| 294 |
.sp |
.sp |
| 295 |
.nf |
.nf |
| 296 |
Last updated: 01 September 2009 |
Last updated: 28 September 2009 |
| 297 |
Copyright (c) 1997-2009 University of Cambridge. |
Copyright (c) 1997-2009 University of Cambridge. |
| 298 |
.fi |
.fi |