@@ line 16 @@
 Files in the maint directory
 ----------------------------
 
 Builducptable    A Perl script that creates the contents of the ucptable.h file
                  from two Unicode data files, which themselves are downloaded
                  from the Unicode web site. Run this script in the "maint"
                  directory.
 
-Tech.Notes       Some notes about the internals of the PCRE code.
+ManyConfigTests  A shell script that runs "configure, make, test" a number of
+                 times with different configuration settings.
 
 Unicode.tables   The files in this directory, Scripts.txt and UnicodeData.txt,
                  were downloaded from the Unicode web site. They contain
                  information about Unicode characters and scripts.
 
-ucptest.c        A short C program for testing the Unicode property functions in
-                 pcre_ucp_searchfuncs.c, mainly useful after rebuilding the
+ucptest.c        A short C program for testing the Unicode property functions
+                 in pcre_ucp_searchfuncs.c, mainly useful after rebuilding the
                  Unicode property table. Compile and run this in the "maint"
                  directory.
 
 ucptestdata      A directory containing two files, testinput1 and testoutput1,
                  to use in conjunction with the ucptest program.
 
 utf8.c           A short, freestanding C program for converting a Unicode code
                  point into a sequence of bytes in the UTF-8 encoding, and vice
                  versa. If its argument is a hex number such as 0x1234, it
                  outputs a list of the equivalent UTF-8 bytes. If its argument
-                 is a sequence of concatenated UTF-8 bytes (e.g. e188b4) it treats
-                 them as a UTF-8 character and outputs the equivalent code point
-                 in hex.
-
+                 is a sequence of concatenated UTF-8 bytes (e.g. e188b4) it
+                 treats them as a UTF-8 character and outputs the equivalent
+                 code point in hex.
 
 
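The conversion that the utf8.c entry describes can be sketched in a few lines
of C. This is not the code from utf8.c: the function names encode_utf8 and
decode_utf8 are hypothetical, and the sketch assumes the modern four-byte limit
(code points up to 0x10FFFF, surrogates rejected), which the original program
may not share. It also leaves out the command-line parsing of arguments such as
0x1234 or e188b4.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper, not taken from utf8.c. Encodes one code point as
   UTF-8 into out[] and returns the number of bytes written, or 0 if cp is
   a surrogate or lies above 0x10FFFF (an assumption of this sketch). */
static size_t encode_utf8(uint32_t cp, unsigned char out[4])
{
    if (cp < 0x80) {
        out[0] = (unsigned char)cp;
        return 1;
    }
    if (cp < 0x800) {
        out[0] = (unsigned char)(0xC0 | (cp >> 6));
        out[1] = (unsigned char)(0x80 | (cp & 0x3F));
        return 2;
    }
    if (cp < 0x10000) {
        if (cp >= 0xD800 && cp <= 0xDFFF) return 0;   /* surrogate */
        out[0] = (unsigned char)(0xE0 | (cp >> 12));
        out[1] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
        out[2] = (unsigned char)(0x80 | (cp & 0x3F));
        return 3;
    }
    if (cp <= 0x10FFFF) {
        out[0] = (unsigned char)(0xF0 | (cp >> 18));
        out[1] = (unsigned char)(0x80 | ((cp >> 12) & 0x3F));
        out[2] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
        out[3] = (unsigned char)(0x80 | (cp & 0x3F));
        return 4;
    }
    return 0;
}

/* Hypothetical helper, not taken from utf8.c. Decodes a buffer that must
   hold exactly one UTF-8 character of len bytes. Returns 1 and stores the
   code point in *cp on success, or 0 if the bytes are malformed, overlong,
   a surrogate, or out of range. */
static int decode_utf8(const unsigned char *s, size_t len, uint32_t *cp)
{
    uint32_t c, min;
    size_t need, i;

    if (len == 0) return 0;

    /* The lead byte gives the sequence length and the smallest code point
       that legitimately needs that many bytes. */
    if (s[0] < 0x80)                { c = s[0];        need = 1; min = 0; }
    else if ((s[0] & 0xE0) == 0xC0) { c = s[0] & 0x1F; need = 2; min = 0x80; }
    else if ((s[0] & 0xF0) == 0xE0) { c = s[0] & 0x0F; need = 3; min = 0x800; }
    else if ((s[0] & 0xF8) == 0xF0) { c = s[0] & 0x07; need = 4; min = 0x10000; }
    else return 0;

    if (len != need) return 0;

    /* Each continuation byte has the form 10xxxxxx and adds six bits. */
    for (i = 1; i < need; i++) {
        if ((s[i] & 0xC0) != 0x80) return 0;
        c = (c << 6) | (s[i] & 0x3F);
    }

    if (c < min || c > 0x10FFFF || (c >= 0xD800 && c <= 0xDFFF)) return 0;
    *cp = c;
    return 1;
}
```

Under these assumptions, encode_utf8(0x1234, buf) returns 3 and fills buf with
the bytes e1 88 b4, and decode_utf8 on those three bytes gives back 0x1234,
which matches the example in the utf8.c description.
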
 Updating to a new Unicode release
@@ line 66 @@
 . Run ./autogen.sh to ensure everything is up-to-date.
 
 . Compile and test with many different config options, and combinations of
-  options:
-
-    * Totally standard ./configure with no options
-    * --disable-shared
-    * --disable-static
-    * --enable-utf8
-    * --enable-unicode-properties
-    * --disable-cpp
-    * --with-link-size=3 (occasionally check with 4 as well)
-    * --disable-stack-for-recursion
-    * --enable-newline-is-any
-
-  I've never automated this, but perhaps I should. The newline testing could be
-  enhanced; at present, some tests fail unless plain LF is a newline.
+  options. The maint/ManyConfigTests script now encapsulates this testing.
 
 . Run perltest.pl on the test data for tests 1 and 4. The output should match
   the PCRE test output, apart from the version identification at the top. The
@@ line 252 @@
 . Someone suggested --disable-callout to save code space when callouts are
   never wanted. This seems rather marginal.
 
-. Automate some of the testing before release into a script that compiles with
-  different options and runs the tests in each case.
-
-. How about distributing a fixed pcre_chartables.c file and abandoning the
-  on-the-fly generation using dftables. This will make cross-compiling easier,
-  and in any case, locales are going out of fashion.
-
 Philip Hazel
 Email local part: ph10
 Email domain: cam.ac.uk
-Last updated: 12 March 2007
+Last updated: 20 March 2007