| 406 |
The use of \x{hh...} to represent UTF-8 characters is not dependent on the use |
The use of \x{hh...} to represent UTF-8 characters is not dependent on the use |
| 407 |
of the <b>/8</b> modifier on the pattern. It is recognized always. There may be |
of the <b>/8</b> modifier on the pattern. It is recognized always. There may be |
| 408 |
any number of hexadecimal digits inside the braces. The result is from one to |
any number of hexadecimal digits inside the braces. The result is from one to |
| 409 |
six bytes, encoded according to the UTF-8 rules. |
six bytes, encoded according to the original UTF-8 rules of RFC 2279. This |
| 410 |
|
allows for values in the range 0 to 0x7FFFFFFF. Note that not all of those are |
| 411 |
|
valid Unicode code points, or indeed valid UTF-8 characters according to the |
| 412 |
|
later rules in RFC 3629. |
| 413 |
</P> |
</P> |
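As an illustration of the encoding described above, the following Python sketch converts a value in the range 0 to 0x7FFFFFFF into one to six bytes following the original RFC 2279 layout. It is not taken from the PCRE sources: the names encode_rfc2279 and is_valid_rfc3629 are hypothetical and used only for this example.

```python
def encode_rfc2279(cp: int) -> bytes:
    """Encode cp as 1 to 6 bytes using the original UTF-8 rules (RFC 2279).

    Hypothetical helper for illustration; not part of PCRE or pcretest.
    """
    if not 0 <= cp <= 0x7FFFFFFF:
        raise ValueError("value outside 0..0x7FFFFFFF")
    if cp < 0x80:
        return bytes([cp])  # single byte, same as ASCII

    # (largest value, lead-byte marker) for sequences of 2 to 6 bytes
    limits = [
        (0x7FF, 0xC0),
        (0xFFFF, 0xE0),
        (0x1FFFFF, 0xF0),
        (0x3FFFFFF, 0xF8),
        (0x7FFFFFFF, 0xFC),
    ]
    for extra, (limit, marker) in enumerate(limits, start=1):
        if cp <= limit:
            break

    # Peel off 6 bits at a time for the continuation bytes (10xxxxxx).
    tail = []
    for _ in range(extra):
        tail.append(0x80 | (cp & 0x3F))
        cp >>= 6

    # The remaining high bits go into the lead byte.
    return bytes([marker | cp] + tail[::-1])


def is_valid_rfc3629(cp: int) -> bool:
    """True if cp may be encoded under the later, stricter RFC 3629 rules."""
    return 0 <= cp <= 0x10FFFF and not 0xD800 <= cp <= 0xDFFF


if __name__ == "__main__":
    for value in (0x41, 0x100, 0x20AC, 0x10FFFF, 0x110000, 0x7FFFFFFF):
        encoded = encode_rfc2279(value).hex(" ")
        print(f"\\x{{{value:x}}} -> {encoded}  RFC 3629 valid: {is_valid_rfc3629(value)}")
```

Values above 0x10FFFF and the surrogate range 0xD800 to 0xDFFF still produce a byte sequence under the older rules, but the second helper reports them as invalid under RFC 3629, which is the distinction the paragraph draws.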
| 414 |
<br><a name="SEC6" href="#TOC1">THE ALTERNATIVE MATCHING FUNCTION</a><br> |
<br><a name="SEC6" href="#TOC1">THE ALTERNATIVE MATCHING FUNCTION</a><br> |
| 415 |
<P> |
<P> |
| 671 |
</P> |
</P> |
| 672 |
<br><a name="SEC15" href="#TOC1">REVISION</a><br> |
<br><a name="SEC15" href="#TOC1">REVISION</a><br> |
| 673 |
<P> |
<P> |
| 674 |
Last updated: 24 April 2007 |
Last updated: 09 August 2007 |
| 675 |
<br> |
<br> |
| 676 |
Copyright © 1997-2007 University of Cambridge. |
Copyright © 1997-2007 University of Cambridge. |
| 677 |
<br> |
<br> |