| 36 |
use Unicode properties. (*UCP) at the start of a pattern can be used to set |
use Unicode properties. (*UCP) at the start of a pattern can be used to set |
| 37 |
this option. Modified pcretest to add /W to test this facility. Added |
this option. Modified pcretest to add /W to test this facility. Added |
| 38 |
REG_UCP to make it available via the POSIX interface. |
REG_UCP to make it available via the POSIX interface. |
| 39 |
|
|
| 40 |
|
10. Added --line-buffered to pcregrep. |
| 41 |
|
|
| 42 |
|
11. In UTF-8 mode, if a pattern that was compiled with PCRE_CASELESS was |
| 43 |
|
studied, and the match started with a letter with a code point greater than |
| 44 |
|
127 whose first byte was different to the first byte of the other case of |
| 45 |
|
the letter, the other case of this starting letter was not recognized. |
| 46 |
|
|
| 47 |
|
12. pcreposix.c included pcre.h before including pcre_internal.h. This caused a |
| 48 |
|
conflict in the definition of PCRE_EXP_DECL. I have removed the include of |
| 49 |
|
pcre.h as pcre_internal.h includes pcre.h itself. (This may be a bit of |
| 50 |
|
historical tidying that never got done.) |
| 51 |
|
|
| 52 |
|
13. If a pattern that was studied started with a repeated Unicode property |
| 53 |
|
test, for example, \p{Nd}+, there was the theoretical possibility of |
| 54 |
|
setting up an incorrect bitmap of starting bytes, but fortunately it could |
| 55 |
|
not have actually happened in practice until change 8 above was made (it |
| 56 |
|
added property types that matched character-matching opcodes). |
| 57 |
|
|
| 58 |
|
14. pcre_study() now recognizes \h, \v, and \R when constructing a bit map of |
| 59 |
|
possible starting bytes for non-anchored patterns. |
| 60 |
|
|
| 61 |
|
15. The "auto-possessify" feature of pcre_compile() now recognizes \R. |
| 62 |
|
|
| 63 |
|
16. If a repeated Unicode property match (e.g. \p{Lu}*) was used with non-UTF-8 |
| 64 |
|
input, it could crash or give wrong results if characters with values |
| 65 |
|
greater than 0xc0 were present in the subject string. (Detail: it assumed |
| 66 |
|
UTF-8 input when processing these items.) |
| 67 |
|
|
| 68 |
|
|
| 69 |
|
|
| 70 |
Version 8.02 19-Mar-2010 |
Version 8.02 19-Mar-2010 |