The perltest program
--------------------

The perltest.pl script tests Perl's regular expressions; it has the same
specification as pcretest, and so can be given identical input, except that
input patterns can be followed only by Perl's lower case modifiers and /+ (as
used by pcretest), which is recognized and handled by the script itself.

The data lines are processed as Perl double-quoted strings, so if they contain
" $ or @ characters, these have to be escaped. For this reason, all such
characters in testinput1 and testinput4 are escaped so that they can be used
for perltest as well as for pcretest. The special upper case pattern
modifiers such as /A that pcretest recognizes, and its special data line
escapes, are not used in these files. The output from the two programs should
be identical, apart from the initial identifying banner.
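
The escaping rule for data lines can be illustrated with a small sketch. It
is not part of the PCRE distribution: the function name escape_for_perl_dq is
hypothetical, and the code assumes that a backslash followed by any character
is an existing escape that should be left alone.

```python
def escape_for_perl_dq(line: str) -> str:
    """Backslash-escape the characters ", $ and @ in a data line.

    Hypothetical helper: it assumes that any backslash already present
    starts an escape sequence, which is copied through unchanged.
    """
    out = []
    i = 0
    while i < len(line):
        ch = line[i]
        if ch == "\\" and i + 1 < len(line):
            # Existing escape sequence: copy both characters untouched.
            out.append(line[i:i + 2])
            i += 2
            continue
        if ch in '"$@':
            # These characters are special in a Perl double-quoted string.
            out.append("\\" + ch)
        else:
            out.append(ch)
        i += 1
    return "".join(out)
```

Under these assumptions, a data line such as cost $5 @home would be rewritten
as cost \$5 \@home, while a line that is already escaped is returned as it
was given.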

The perltest.pl script can also test UTF-8 features. It recognizes the special
modifier /8 that pcretest uses to invoke UTF-8 functionality. The testinput4
file can be fed to perltest to run compatible UTF-8 tests.

The other testinput files are not suitable for feeding to perltest.pl, since
they make use of the special upper case modifiers and escapes that pcretest
uses to test some features of PCRE. Some of these files also contain malformed
regular expressions, in order to check that PCRE diagnoses them correctly.

Philip Hazel
September 2009