Fix styling of Unicode codepoints in manuals
* doc/lispref/nonascii.texi (Character Properties): * doc/lispref/display.texi (Glyphless Chars) (Bidirectional Display): * doc/emacs/search.texi (Lax Search): * doc/emacs/text.texi (Quotation Marks): * doc/emacs/basic.texi (Inserting Text): Canonicalize the style of "U+NNNN CHARACTER NAME". (Bug#35885)
This commit is contained in:
parent
ff7ec6ff33
commit
f68b33f502
5 changed files with 29 additions and 27 deletions
|
@ -113,7 +113,7 @@ just like digits. Case is ignored.
|
|||
@cindex curved quotes, inserting
|
||||
A few common Unicode characters can be inserted via a command
|
||||
starting with @kbd{C-x 8}. For example, @kbd{C-x 8 [} inserts @t{‘}
|
||||
which is Unicode code-point @code{U+2018} LEFT SINGLE QUOTATION MARK,
|
||||
which is Unicode code-point U+2018 @sc{left single quotation mark},
|
||||
sometimes called a left single ``curved quote'' or ``curly quote''.
|
||||
Similarly, @kbd{C-x 8 ]}, @kbd{C-x 8 @{} and @kbd{C-x 8 @}} insert the
|
||||
curved quotes @t{’}, @t{“} and @t{”}, respectively. Also, a working
|
||||
|
|
|
@ -1310,14 +1310,14 @@ of its accented cousins like @code{@"a} and @code{@'a}, i.e., the
|
|||
match disregards the diacritics that distinguish these
|
||||
variants. In addition, @code{a} matches other characters that
|
||||
resemble it, or have it as part of their graphical representation,
|
||||
such as @sc{u+249c parenthesized latin small letter a} and @sc{u+2100
|
||||
account of} (which looks like a small @code{a} over @code{c}).
|
||||
such as U+249C @sc{parenthesized latin small letter a} and U+2100
|
||||
@sc{account of} (which looks like a small @code{a} over @code{c}).
|
||||
Similarly, the @acronym{ASCII} double-quote character @code{"} matches
|
||||
all the other variants of double quotes defined by the Unicode
|
||||
standard. Finally, character folding can make a sequence of one or
|
||||
more characters match another sequence of a different length: for
|
||||
example, the sequence of two characters @code{ff} matches @sc{u+fb00
|
||||
latin small ligature ff}. Character sequences that are not identical,
|
||||
example, the sequence of two characters @code{ff} matches U+FB00
|
||||
@sc{latin small ligature ff}. Character sequences that are not identical,
|
||||
but match under character folding are known as @dfn{equivalent
|
||||
character sequences}.
|
||||
|
||||
|
@ -1642,8 +1642,9 @@ replacement has already been made, @key{DEL} and @key{SPC} are
|
|||
equivalent in this situation; both move to the next occurrence.
|
||||
|
||||
You can type @kbd{C-r} at this point (see below) to alter the replaced
|
||||
text. You can also type @kbd{C-x u} to undo the replacement; this exits
|
||||
the @code{query-replace}, so if you want to do further replacement you
|
||||
text. You can also undo the replacement with the @code{undo} command
|
||||
(e.g., type @kbd{C-x u}; @pxref{Undo}); this exits the
|
||||
@code{query-replace}, so if you want to do further replacement you
|
||||
must use @kbd{C-x @key{ESC} @key{ESC} @key{RET}} to restart
|
||||
(@pxref{Repetition}).
|
||||
|
||||
|
|
|
@ -428,10 +428,10 @@ using straight apostrophes @t{'like this'} or double-quotes @t{"like
|
|||
this"}. Another common way is the curved quote convention, which uses
|
||||
left and right single or double quotation marks `@t{like this}' or
|
||||
``@t{like this}''@footnote{
|
||||
The curved single quote characters are U+2018 LEFT SINGLE QUOTATION
|
||||
MARK and U+2018 RIGHT SINGLE QUOTATION MARK; the curved double quotes
|
||||
are U+201C LEFT DOUBLE QUOTATION MARK and U+201D RIGHT DOUBLE
|
||||
QUOTATION MARK. On text terminals which cannot display these
|
||||
The curved single quote characters are U+2018 @sc{left single quotation
|
||||
mark} and U+2018 @sc{right single quotation mark}; the curved double quotes
|
||||
are U+201C @sc{left double quotation mark} and U+201D @sc{right double
|
||||
quotation mark}. On text terminals which cannot display these
|
||||
characters, the Info reader might show them as the typewriter ASCII
|
||||
quote characters.
|
||||
}. In text files, typewriter quotes are simple and
|
||||
|
|
|
@ -7269,9 +7269,9 @@ Non-@acronym{ASCII}, non-printing characters @code{U+0080} to
|
|||
@samp{\230}).
|
||||
|
||||
@item format-control
|
||||
Characters of Unicode General Category [Cf], such as @samp{U+200E}
|
||||
(Left-to-Right Mark), but excluding characters that have graphic
|
||||
images, such as @samp{U+00AD} (Soft Hyphen).
|
||||
Characters of Unicode General Category [Cf], such as U+200E
|
||||
@sc{left-to-right mark}, but excluding characters that have graphic
|
||||
images, such as U+00AD @sc{soft hyphen}.
|
||||
|
||||
@item no-font
|
||||
Characters for which there is no suitable font, or which cannot be
|
||||
|
@ -7654,12 +7654,12 @@ problem:
|
|||
|
||||
@itemize @minus
|
||||
@item
|
||||
Append the special character @code{U+200E}, LEFT-TO-RIGHT MARK, or
|
||||
Append the special character U+200E @sc{left-to-right mark}, or
|
||||
@acronym{LRM}, to the end of each field that may have bidirectional
|
||||
content, or prepend it to the beginning of the following field. The
|
||||
function @code{bidi-string-mark-left-to-right}, described below, comes
|
||||
in handy for this purpose. (In a right-to-left paragraph, use
|
||||
@code{U+200F}, RIGHT-TO-LEFT MARK, or @acronym{RLM}, instead.) This
|
||||
U+200F @sc{right-to-left mark}, or @acronym{RLM}, instead.) This
|
||||
is one of the solutions recommended by the UBA.
|
||||
|
||||
@item
|
||||
|
|
|
@ -550,8 +550,8 @@ characters whose @code{Numeric_Type} is @samp{Numeric}. The value of
|
|||
this property is a number. Examples of characters that have this
|
||||
property include fractions, subscripts, superscripts, Roman numerals,
|
||||
currency numerators, and encircled numbers. For example, the value of
|
||||
this property for the character @code{U+2155} (@sc{vulgar fraction one
|
||||
fifth}) is @code{0.2}. For characters that don't have any numeric
|
||||
this property for the character U+2155 @sc{vulgar fraction one
|
||||
fifth} is @code{0.2}. For characters that don't have any numeric
|
||||
value, and for unassigned codepoints, the value is @code{nil}, which
|
||||
means @acronym{NaN}.
|
||||
|
||||
|
@ -622,23 +622,24 @@ is @code{nil}, which means the character itself.
|
|||
@item special-uppercase
|
||||
Corresponds to Unicode language- and context-independent special upper-casing
|
||||
rules. The value of this property is a string (which may be empty). For
|
||||
example mapping for @code{U+00DF} (@sc{latin small letter sharp s}) is
|
||||
example mapping for U+00DF @sc{latin small letter sharp s} is
|
||||
@code{"SS"}. For characters with no special mapping, the value is @code{nil}
|
||||
which means @code{uppercase} property needs to be consulted instead.
|
||||
|
||||
@item special-lowercase
|
||||
Corresponds to Unicode language- and context-independent special lower-casing
|
||||
rules. The value of this property is a string (which may be empty). For
|
||||
example mapping for @code{U+0130} (@sc{latin capital letter i with dot above})
|
||||
the value is @code{"i\u0307"} (i.e. 2-character string consisting of @sc{latin
|
||||
small letter i} followed by @sc{combining dot above}). For characters with no
|
||||
special mapping, the value is @code{nil} which means @code{lowercase} property
|
||||
needs to be consulted instead.
|
||||
Corresponds to Unicode language- and context-independent special
|
||||
lower-casing rules. The value of this property is a string (which may
|
||||
be empty). For example mapping for U+0130 @sc{latin capital letter i
|
||||
with dot above} the value is @code{"i\u0307"} (i.e. 2-character string
|
||||
consisting of @sc{latin small letter i} followed by U+0307
|
||||
@sc{combining dot above}). For characters with no special mapping,
|
||||
the value is @code{nil} which means @code{lowercase} property needs to
|
||||
be consulted instead.
|
||||
|
||||
@item special-titlecase
|
||||
Corresponds to Unicode unconditional special title-casing rules. The value of
|
||||
this property is a string (which may be empty). For example mapping for
|
||||
@code{U+FB01} (@sc{latin small ligature fi}) the value is @code{"Fi"}. For
|
||||
U+FB01 @sc{latin small ligature fi} the value is @code{"Fi"}. For
|
||||
characters with no special mapping, the value is @code{nil} which means
|
||||
@code{titlecase} property needs to be consulted instead.
|
||||
@end table
|
||||
|
|
Loading…
Add table
Reference in a new issue