| 1 |
News about PCRE releases |
News about PCRE releases |
| 2 |
------------------------ |
------------------------ |
| 3 |
|
|
| 4 |
|
Release 8.12 15-Jan-2011 |
| 5 |
|
------------------------ |
| 6 |
|
|
| 7 |
|
This release fixes some bugs in pcregrep, one of which caused the tests to fail |
| 8 |
|
on 64-bit big-endian systems. There are no changes to the code of the library. |
| 9 |
|
|
| 10 |
|
|
| 11 |
|
Release 8.11 10-Dec-2010 |
| 12 |
|
------------------------ |
| 13 |
|
|
| 14 |
|
A number of bugs in the library and in pcregrep have been fixed. As always, see |
| 15 |
|
ChangeLog for details. The following are the non-bug-fix changes: |
| 16 |
|
|
| 17 |
|
. Added --match-limit and --recursion-limit to pcregrep. |
| 18 |
|
|
| 19 |
|
. Added an optional parentheses number to the -o and --only-matching options |
| 20 |
|
of pcregrep. |
| 21 |
|
|
| 22 |
|
. Changed the way PCRE_PARTIAL_HARD affects the matching of $, \z, \Z, \b, and |
| 23 |
|
\B. |
| 24 |
|
|
| 25 |
|
. Added PCRE_ERROR_SHORTUTF8 to make it possible to distinguish between a |
| 26 |
|
bad UTF-8 sequence and one that is incomplete when using PCRE_PARTIAL_HARD. |
| 27 |
|
|
| 28 |
|
. Recognize (*NO_START_OPT) at the start of a pattern to set the PCRE_NO_ |
| 29 |
|
START_OPTIMIZE option, which is now allowed at compile time |
| 30 |
|
|
| 31 |
|
|
| 32 |
|
Release 8.10 25-Jun-2010 |
| 33 |
|
------------------------ |
| 34 |
|
|
| 35 |
|
There are two major additions: support for (*MARK) and friends, and the option |
| 36 |
|
PCRE_UCP, which changes the behaviour of \b, \d, \s, and \w (and their |
| 37 |
|
opposites) so that they make use of Unicode properties. There are also a number |
| 38 |
|
of lesser new features, and several bugs have been fixed. A new option, |
| 39 |
|
--line-buffered, has been added to pcregrep, for use when it is connected to |
| 40 |
|
pipes. |
| 41 |
|
|
| 42 |
|
|
| 43 |
|
Release 8.02 19-Mar-2010 |
| 44 |
|
------------------------ |
| 45 |
|
|
| 46 |
|
Another bug-fix release. |
| 47 |
|
|
| 48 |
|
|
| 49 |
|
Release 8.01 19-Jan-2010 |
| 50 |
|
------------------------ |
| 51 |
|
|
| 52 |
|
This is a bug-fix release. Several bugs in the code itself and some bugs and |
| 53 |
|
infelicities in the build system have been fixed. |
| 54 |
|
|
| 55 |
|
|
| 56 |
|
Release 8.00 19-Oct-09 |
| 57 |
|
---------------------- |
| 58 |
|
|
| 59 |
|
Bugs have been fixed in the library and in pcregrep. There are also some |
| 60 |
|
enhancements. Restrictions on patterns used for partial matching have been |
| 61 |
|
removed, extra information is given for partial matches, the partial matching |
| 62 |
|
process has been improved, and an option to make a partial match override a |
| 63 |
|
full match is available. The "study" process has been enhanced by finding a |
| 64 |
|
lower bound matching length. Groups with duplicate numbers may now have |
| 65 |
|
duplicated names without the use of PCRE_DUPNAMES. However, they may not have |
| 66 |
|
different names. The documentation has been revised to reflect these changes. |
| 67 |
|
The version number has been expanded to 3 digits as it is clear that the rate |
| 68 |
|
of change is not slowing down. |
| 69 |
|
|
| 70 |
|
|
| 71 |
|
Release 7.9 11-Apr-09 |
| 72 |
|
--------------------- |
| 73 |
|
|
| 74 |
|
Mostly bugfixes and tidies with just a couple of minor functional additions. |
| 75 |
|
|
| 76 |
|
|
| 77 |
|
Release 7.8 05-Sep-08 |
| 78 |
|
--------------------- |
| 79 |
|
|
| 80 |
|
More bug fixes, plus a performance improvement in Unicode character property |
| 81 |
|
lookup. |
| 82 |
|
|
| 83 |
|
|
| 84 |
|
Release 7.7 07-May-08 |
| 85 |
|
--------------------- |
| 86 |
|
|
| 87 |
|
This is once again mainly a bug-fix release, but there are a couple of new |
| 88 |
|
features. |
| 89 |
|
|
| 90 |
|
|
| 91 |
|
Release 7.6 28-Jan-08 |
| 92 |
|
--------------------- |
| 93 |
|
|
| 94 |
|
The main reason for having this release so soon after 7.5 is because it fixes a |
| 95 |
|
potential buffer overflow problem in pcre_compile() when run in UTF-8 mode. In |
| 96 |
|
addition, the CMake configuration files have been brought up to date. |
| 97 |
|
|
| 98 |
|
|
| 99 |
|
Release 7.5 10-Jan-08 |
| 100 |
|
--------------------- |
| 101 |
|
|
| 102 |
|
This is mainly a bug-fix release. However the ability to link pcregrep with |
| 103 |
|
libz or libbz2 and the ability to link pcretest with libreadline have been |
| 104 |
|
added. Also the --line-offsets and --file-offsets options were added to |
| 105 |
|
pcregrep. |
| 106 |
|
|
| 107 |
|
|
| 108 |
|
Release 7.4 21-Sep-07 |
| 109 |
|
--------------------- |
| 110 |
|
|
| 111 |
|
The only change of specification is the addition of options to control whether |
| 112 |
|
\R matches any Unicode line ending (the default) or just CR, LF, and CRLF. |
| 113 |
|
Otherwise, the changes are bug fixes and a refactoring to reduce the number of |
| 114 |
|
relocations needed in a shared library. There have also been some documentation |
| 115 |
|
updates, in particular, some more information about using CMake to build PCRE |
| 116 |
|
has been added to the NON-UNIX-USE file. |
| 117 |
|
|
| 118 |
|
|
| 119 |
|
Release 7.3 28-Aug-07 |
| 120 |
|
--------------------- |
| 121 |
|
|
| 122 |
|
Most changes are bug fixes. Some that are not: |
| 123 |
|
|
| 124 |
|
1. There is some support for Perl 5.10's experimental "backtracking control |
| 125 |
|
verbs" such as (*PRUNE). |
| 126 |
|
|
| 127 |
|
2. UTF-8 checking is now as per RFC 3629 instead of RFC 2279; this is more |
| 128 |
|
restrictive in the strings it accepts. |
| 129 |
|
|
| 130 |
|
3. Checking for potential integer overflow has been made more dynamic, and as a |
| 131 |
|
consequence there is no longer a hard limit on the size of a subpattern that |
| 132 |
|
has a limited repeat count. |
| 133 |
|
|
| 134 |
|
4. When CRLF is a valid line-ending sequence, pcre_exec() and pcre_dfa_exec() |
| 135 |
|
no longer advance by two characters instead of one when an unanchored match |
| 136 |
|
fails at CRLF if there are explicit CR or LF matches within the pattern. |
| 137 |
|
This gets rid of some anomalous effects that previously occurred. |
| 138 |
|
|
| 139 |
|
5. Some PCRE-specific settings for varying the newline options at the start of |
| 140 |
|
a pattern have been added. |
| 141 |
|
|
| 142 |
|
|
| 143 |
|
Release 7.2 19-Jun-07 |
| 144 |
|
--------------------- |
| 145 |
|
|
| 146 |
|
WARNING: saved patterns that were compiled by earlier versions of PCRE must be |
| 147 |
|
recompiled for use with 7.2 (necessitated by the addition of \K, \h, \H, \v, |
| 148 |
|
and \V). |
| 149 |
|
|
| 150 |
|
Correction to the notes for 7.1: the note about shared libraries for Windows is |
| 151 |
|
wrong. Previously, three libraries were built, but each could function |
| 152 |
|
independently. For example, the pcreposix library also included all the |
| 153 |
|
functions from the basic pcre library. The change is that the three libraries |
| 154 |
|
are no longer independent. They are like the Unix libraries. To use the |
| 155 |
|
pcreposix functions, for example, you need to link with both the pcreposix and |
| 156 |
|
the basic pcre library. |
| 157 |
|
|
| 158 |
|
Some more features from Perl 5.10 have been added: |
| 159 |
|
|
| 160 |
|
(?-n) and (?+n) relative references for recursion and subroutines. |
| 161 |
|
|
| 162 |
|
(?(-n) and (?(+n) relative references as conditions. |
| 163 |
|
|
| 164 |
|
\k{name} and \g{name} are synonyms for \k<name>. |
| 165 |
|
|
| 166 |
|
\K to reset the start of the matched string; for example, (foo)\Kbar |
| 167 |
|
matches bar preceded by foo, but only sets bar as the matched string. |
| 168 |
|
|
| 169 |
|
(?| introduces a group where the capturing parentheses in each alternative |
| 170 |
|
start from the same number; for example, (?|(abc)|(xyz)) sets capturing |
| 171 |
|
parentheses number 1 in both cases. |
| 172 |
|
|
| 173 |
|
\h, \H, \v, \V match horizontal and vertical whitespace, respectively. |
| 174 |
|
|
| 175 |
|
|
| 176 |
|
Release 7.1 24-Apr-07 |
| 177 |
|
--------------------- |
| 178 |
|
|
| 179 |
|
There is only one new feature in this release: a linebreak setting of |
| 180 |
|
PCRE_NEWLINE_ANYCRLF. It is a cut-down version of PCRE_NEWLINE_ANY, which |
| 181 |
|
recognizes only CRLF, CR, and LF as linebreaks. |
| 182 |
|
|
| 183 |
|
A few bugs are fixed (see ChangeLog for details), but the major change is a |
| 184 |
|
complete re-implementation of the build system. This now has full Autotools |
| 185 |
|
support and so is now "standard" in some sense. It should help with compiling |
| 186 |
|
PCRE in a wide variety of environments. |
| 187 |
|
|
| 188 |
|
NOTE: when building shared libraries for Windows, three dlls are now built, |
| 189 |
|
called libpcre, libpcreposix, and libpcrecpp. Previously, everything was |
| 190 |
|
included in a single dll. |
| 191 |
|
|
| 192 |
|
Another important change is that the dftables auxiliary program is no longer |
| 193 |
|
compiled and run at "make" time by default. Instead, a default set of character |
| 194 |
|
tables (assuming ASCII coding) is used. If you want to use dftables to generate |
| 195 |
|
the character tables as previously, add --enable-rebuild-chartables to the |
| 196 |
|
"configure" command. You must do this if you are compiling PCRE to run on a |
| 197 |
|
system that uses EBCDIC code. |
| 198 |
|
|
| 199 |
|
There is a discussion about character tables in the README file. The default is |
| 200 |
|
not to use dftables so that that there is no problem when cross-compiling. |
| 201 |
|
|
| 202 |
|
|
| 203 |
|
Release 7.0 19-Dec-06 |
| 204 |
|
--------------------- |
| 205 |
|
|
| 206 |
|
This release has a new major number because there have been some internal |
| 207 |
|
upheavals to facilitate the addition of new optimizations and other facilities, |
| 208 |
|
and to make subsequent maintenance and extension easier. Compilation is likely |
| 209 |
|
to be a bit slower, but there should be no major effect on runtime performance. |
| 210 |
|
Previously compiled patterns are NOT upwards compatible with this release. If |
| 211 |
|
you have saved compiled patterns from a previous release, you will have to |
| 212 |
|
re-compile them. Important changes that are visible to users are: |
| 213 |
|
|
| 214 |
|
1. The Unicode property tables have been updated to Unicode 5.0.0, which adds |
| 215 |
|
some more scripts. |
| 216 |
|
|
| 217 |
|
2. The option PCRE_NEWLINE_ANY causes PCRE to recognize any Unicode newline |
| 218 |
|
sequence as a newline. |
| 219 |
|
|
| 220 |
|
3. The \R escape matches a single Unicode newline sequence as a single unit. |
| 221 |
|
|
| 222 |
|
4. New features that will appear in Perl 5.10 are now in PCRE. These include |
| 223 |
|
alternative Perl syntax for named parentheses, and Perl syntax for |
| 224 |
|
recursion. |
| 225 |
|
|
| 226 |
|
5. The C++ wrapper interface has been extended by the addition of a |
| 227 |
|
QuoteMeta function and the ability to allow copy construction and |
| 228 |
|
assignment. |
| 229 |
|
|
| 230 |
|
For a complete list of changes, see the ChangeLog file. |
| 231 |
|
|
| 232 |
|
|
| 233 |
|
Release 6.7 04-Jul-06 |
| 234 |
|
--------------------- |
| 235 |
|
|
| 236 |
|
The main additions to this release are the ability to use the same name for |
| 237 |
|
multiple sets of parentheses, and support for CRLF line endings in both the |
| 238 |
|
library and pcregrep (and in pcretest for testing). |
| 239 |
|
|
| 240 |
|
Thanks to Ian Taylor, the stack usage for many kinds of pattern has been |
| 241 |
|
significantly reduced for certain subject strings. |
| 242 |
|
|
| 243 |
|
|
| 244 |
Release 6.5 01-Feb-06 |
Release 6.5 01-Feb-06 |
| 245 |
--------------------- |
--------------------- |
| 246 |
|
|