(General Escape Syntax): Update explanation of unicode escape syntax.
This commit is contained in:
parent
d55f6ca59b
commit
fb080b287c
1 changed files with 10 additions and 9 deletions
|
@ -346,18 +346,19 @@ following text.)
|
|||
@subsubsection General Escape Syntax
|
||||
|
||||
In addition to the specific escape sequences for special important
|
||||
control characters, Emacs provides general categories of escape syntax
|
||||
that you can use to specify non-ASCII text characters.
|
||||
control characters, Emacs provides several types of escape syntax that
|
||||
you can use to specify non-ASCII text characters.
|
||||
|
||||
@cindex unicode character escape
|
||||
For instance, you can specify characters by their Unicode values.
|
||||
You can specify characters by their Unicode values.
|
||||
@code{?\u@var{nnnn}} represents a character that maps to the Unicode
|
||||
code point @samp{U+@var{nnnn}}. There is a slightly different syntax
|
||||
for specifying characters with code points above @code{#xFFFF};
|
||||
@code{\U00@var{nnnnnn}} represents the character whose Unicode code
|
||||
point is @samp{U+@var{nnnnnn}}, if such a character is supported by
|
||||
Emacs. If the corresponding character is not supported, Emacs signals
|
||||
an error.
|
||||
code point @samp{U+@var{nnnn}} (by convention, Unicode code points are
|
||||
given in hexadecimal). There is a slightly different syntax for
|
||||
specifying characters with code points higher than
|
||||
@code{U+@var{ffff}}: @code{\U00@var{nnnnnn}} represents the character
|
||||
whose code point is @samp{U+@var{nnnnnn}}. The Unicode standard only
|
||||
defines code points up to @samp{U+@var{10ffff}}, so if you specify a
|
||||
code point higher than that, Emacs signals an error.
|
||||
|
||||
This peculiar and inconvenient syntax was adopted for compatibility
|
||||
with other programming languages. Unlike some other languages, Emacs
|
||||
|
|
Loading…
Add table
Reference in a new issue